1 2021-11-30 liuhongt <hongtao.liu@intel.com>
5 * config/i386/i386.md (*x86_64_shld_1): Set_attr
8 (*x86_64_shrd_1): Ditto.
11 2021-11-30 Bill Schmidt <wschmidt@linux.ibm.com>
13 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
15 (altivec_resolve_new_overloaded_builtin): Likewise.
17 2021-11-30 Martin Jambor <mjambor@suse.cz>
20 * ipa-sra.c (scan_function): Also check ECF_LOOPING_CONST_OR_PURE flag.
22 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
24 PR tree-optimization/103494
25 * tree-vect-stmts.c (vect_get_gather_scatter_ops): Remove ncopies
26 argument and calculate ncopies from gs_info->offset_vectype
28 (vectorizable_store, vectorizable_load): Update accordingly.
30 2021-11-30 Iain Buclaw <ibuclaw@gdcproject.org>
32 * Makefile.in (GDC): New variable.
33 (GDCFLAGS): New variable.
34 * configure: Regenerate.
35 * configure.ac: Add call to ACX_PROG_GDC. Substitute GDCFLAGS.
37 2021-11-30 Martin Liska <mliska@suse.cz>
38 Martin Jambor <mjambor@suse.cz>
41 * ipa-param-manipulation.c
42 (ipa_param_body_adjustments::prepare_debug_expressions): Be
43 careful about hash_map reallocating itself. Simpify a return
44 which always returns true.
46 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
48 PR tree-optimization/103440
49 * gimple-range-fold.cc (fold_using_range::range_of_phi): Continue
50 normal param processing for equiv params.
52 2021-11-30 Richard Biener <rguenther@suse.de>
54 * config/i386/i386.c (ix86_shift_rotate_cost): Remove
56 * tree-chrec.c (evolution_function_is_invariant_rec_p):
58 * tree-if-conv.c (if_convertible_stmt_p): Likewise.
59 * tree-ssa-pre.c (fully_constant_expression): Likewise.
60 * tree-vrp.c (operand_less_p): Likewise.
61 * reload.c (reg_overlap_mentioned_for_reload_p): Remove
62 unreachable gcc_unreachable ().
63 * sel-sched-ir.h (bb_next_bb): Likewise.
64 * varasm.c (compare_constant): Likewise.
66 2021-11-30 Richard Biener <rguenther@suse.de>
68 PR tree-optimization/103489
69 * tree-vect-loop.c (vectorizable_phi): Verify argument
70 vector type compatibility to mitigate bool pattern recog
73 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
75 * omp-offload.c (oacc_loop_auto_partitions): Remove erroneous
76 "Orphan reductions cannot have gang partitioning" handling.
78 2021-11-30 Thomas Schwinge <thomas@codesourcery.com>
80 * omp-offload.c (oacc_loop_process): Implement "gang reduction on
81 an orphan loop" checking.
83 2021-11-30 Cesar Philippidis <cesar@codesourcery.com>
84 Thomas Schwinge <thomas@codesourcery.com>
86 * omp-general.h (enum oacc_loop_flags): Add OLF_REDUCTION enum.
87 * omp-low.c (lower_oacc_head_mark): Use it to mark OpenACC
89 * omp-offload.c (oacc_loop_auto_partitions): Don't assign gang
90 level parallelism to orphan reductions.
92 2021-11-30 Aldy Hernandez <aldyh@redhat.com>
94 PR tree-optimization/103451
95 * range-op.cc (operator_div::wi_fold): Remove
96 can_throw_non_call_exceptions special case.
97 * tree-ssa-sink.c (sink_code_in_bb): Same.
99 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
101 * tree-vect-slp.c (arg1_arg4_map): New variable.
102 (vect_get_operand_map): Handle IFN_MASK_GATHER_LOAD.
103 (vect_build_slp_tree_1): Likewise.
104 (vect_build_slp_tree_2): Likewise.
105 * tree-vect-stmts.c (vectorizable_load): Expect the mask to be
106 the last SLP child node rather than the first.
108 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
110 * tree-if-conv.c: Include tree-eh.h.
111 (predicate_statements): Remove pe argument. Don't hoist
113 (combine_blocks): Remove pe argument.
114 (ifcvt_available_on_edge_p, ifcvt_can_hoist): New functions.
115 (ifcvt_hoist_invariants): Likewise.
116 (tree_if_conversion): Update call to combine_blocks. Call
117 ifcvt_hoist_invariants after VN.
119 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
121 PR tree-optimization/102467
122 * doc/sourcebuild.texi (vect_gather_load_ifn): Document.
123 * tree-vect-data-refs.c (vect_analyze_data_ref_dependence):
124 Commonize safelen handling. Punt for anything involving
125 gathers and scatters unless safelen says otherwise.
126 * tree-vect-slp.c (arg1_map): New variable.
127 (vect_get_operand_map): Handle IFN_GATHER_LOAD.
128 (vect_build_slp_tree_1): Likewise.
129 (vect_build_slp_tree_2): Likewise.
130 (compatible_calls_p): If vect_get_operand_map returns nonnull,
131 check that any skipped arguments are equal.
132 (vect_slp_analyze_node_operations_1): Tighten reduction check.
133 * tree-vect-stmts.c (check_load_store_for_partial_vectors): Take
135 (vect_get_gather_scatter_ops): Take slp_node and ncopies arguments.
137 (vectorizable_store, vectorizable_load): Adjust accordingly.
139 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
141 * gimple.h (gimple_num_args, gimple_arg, gimple_arg_ptr): New
143 * tree-vect-slp.c (cond_expr_maps, arg2_map): New variables.
144 (vect_get_operand_map): New function.
145 (vect_get_and_check_slp_defs): Fix outdated comment.
146 Use vect_get_operand_map and new gimple argument accessors.
147 (vect_build_slp_tree_2): Likewise.
149 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
151 * tree-vect-slp.c (vect_build_slp_tree_1): Use code_helper
152 to record the operations performed by statements, only using
153 CALL_EXPR for things that don't map to built-in or internal
154 functions. For shifts, require all shift amounts to be equal
155 if optab_vector is not supported but optab_scalar is.
157 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
159 * tree-vect-data-refs.c (vect_check_gather_scatter): Continue
160 processing conversions if the current offset is a pointer.
162 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
164 * tree-vectorizer.h (vect_is_reduction): Use STMT_VINFO_REDUC_IDX.
166 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
168 * genopinit.c (main): Turn supports_vec_gather_load and
169 supports_vec_scatter_store into signed char arrays and remove
170 supports_vec_gather_load_cached and supports_vec_scatter_store_cached.
171 * optabs-query.c (supports_vec_convert_optab_p): Add a mode parameter.
172 If the mode is not VOIDmode, test only for that mode.
173 (supports_vec_gather_load_p): Likewise.
174 (supports_vec_scatter_store_p): Likewise.
175 * optabs-query.h (supports_vec_gather_load_p): Likewise.
176 (supports_vec_scatter_store_p): Likewise.
177 * tree-vect-data-refs.c (vect_check_gather_scatter): Pass the
178 vector mode to supports_vec_gather_load_p and
179 supports_vec_scatter_store_p.
181 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
183 * internal-fn.c (commutative_binary_fn_p): Handle IFN_ADD_OVERFLOW
184 and IFN_MUL_OVERFLOW.
186 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
188 * internal-fn.c (commutative_binary_fn_p): Handle IFN_UBSAN_CHECK_ADD
189 and IFN_UBSAN_CHECK_MUL.
191 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
193 * internal-fn.c (commutative_binary_fn_p): Handle IFN_COMPLEX_MUL.
195 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
197 * gimple-fold.c: Include internal-fn.h.
198 (fold_stmt_1): If a function maps to an internal one, use
199 first_commutative_argument to canonicalize the order of
200 commutative arguments.
201 * gimple-match-head.c (gimple_resimplify2, gimple_resimplify3)
202 (gimple_resimplify4, gimple_resimplify5): Extend commutativity
205 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
207 * doc/md.texi (reduc_fmin_scal_@var{m}): Document.
208 (reduc_fmax_scal_@var{m}): Likewise.
209 * optabs.def (reduc_fmax_scal_optab): New optab.
210 (reduc_fmin_scal_optab): Likewise
211 * internal-fn.def (REDUC_FMAX, REDUC_FMIN): New functions.
212 * tree-vect-loop.c (reduction_fn_for_scalar_code): Handle
213 CASE_CFN_FMAX and CASE_CFN_FMIN.
214 (neutral_op_for_reduction): Likewise.
215 (needs_fold_left_reduction_p): Likewise.
216 * config/aarch64/iterators.md (FMAXMINV): New iterator.
217 (fmaxmin): Handle UNSPEC_FMAXNMV and UNSPEC_FMINNMV.
218 * config/aarch64/aarch64-simd.md (reduc_<optab>_scal_<mode>): Fix
220 (reduc_<fmaxmin>_scal_<mode>): New pattern.
221 * config/aarch64/aarch64-sve.md (reduc_<fmaxmin>_scal_<mode>):
224 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
226 * builtins.h (associated_internal_fn): Declare overload that
227 takes a (combined_cfn, return type) pair.
228 * builtins.c (associated_internal_fn): Split new overload out
229 of original fndecl version. Also provide an overload that takes
230 a (combined_cfn, return type) pair.
231 * internal-fn.h (commutative_binary_fn_p): Declare.
232 (commutative_ternary_fn_p): Likewise.
233 (associative_binary_fn_p): Likewise.
234 * internal-fn.c (commutative_binary_fn_p, commutative_ternary_fn_p):
235 New functions, split out from...
236 (first_commutative_argument): ...here.
237 (associative_binary_fn_p): New function.
238 * gimple-match.h (code_helper): Add a constructor that takes
240 (commutative_binary_op_p): Declare.
241 (commutative_ternary_op_p): Likewise.
242 (first_commutative_argument): Likewise.
243 (associative_binary_op_p): Likewise.
244 (canonicalize_code): Likewise.
245 (directly_supported_p): Likewise.
246 (get_conditional_internal_fn): Likewise.
247 (gimple_build): New overloads that takes a code_helper.
248 * gimple-fold.c (gimple_build): Likewise.
249 * gimple-match-head.c (commutative_binary_op_p): New function.
250 (commutative_ternary_op_p): Likewise.
251 (first_commutative_argument): Likewise.
252 (associative_binary_op_p): Likewise.
253 (canonicalize_code): Likewise.
254 (directly_supported_p): Likewise.
255 (get_conditional_internal_fn): Likewise.
256 * tree-vectorizer.h: Include gimple-match.h.
257 (neutral_op_for_reduction): Take a code_helper instead of a tree_code.
258 (needs_fold_left_reduction_p): Likewise.
259 (reduction_fn_for_scalar_code): Likewise.
260 (vect_can_vectorize_without_simd_p): Declare a nNew overload that takes
262 * tree-vect-loop.c: Include case-cfn-macros.h.
263 (fold_left_reduction_fn): Take a code_helper instead of a tree_code.
264 (reduction_fn_for_scalar_code): Likewise.
265 (neutral_op_for_reduction): Likewise.
266 (needs_fold_left_reduction_p): Likewise.
267 (use_mask_by_cond_expr_p): Likewise.
268 (build_vect_cond_expr): Likewise.
269 (vect_create_partial_epilog): Likewise. Use gimple_build rather
270 than gimple_build_assign.
271 (check_reduction_path): Handle calls and operate on code_helpers
272 rather than tree_codes.
273 (vect_is_simple_reduction): Likewise.
274 (vect_model_reduction_cost): Likewise.
275 (vect_find_reusable_accumulator): Likewise.
276 (vect_create_epilog_for_reduction): Likewise.
277 (vect_transform_cycle_phi): Likewise.
278 (vectorizable_reduction): Likewise. Make more use of
280 (vect_transform_reduction): Use gimple_extract_op but expect
282 (vect_can_vectorize_without_simd_p): New overload that takes
284 * tree-vect-stmts.c (vectorizable_call): Handle reductions in
286 * tree-vect-patterns.c (vect_mark_pattern_stmts): Use
287 gimple_extract_op when updating STMT_VINFO_REDUC_IDX.
289 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
291 * gimple-match.h (code_helper): Provide == and != overloads.
292 (code_helper::operator tree_code): Make explicit.
293 (code_helper::operator combined_fn): Likewise.
294 * gimple-match-head.c (convert_conditional_op): Use explicit
295 conversions where necessary.
296 (gimple_resimplify1, gimple_resimplify2, gimple_resimplify3): Likewise.
297 (maybe_push_res_to_seq, gimple_simplify): Likewise.
298 * gimple-fold.c (replace_stmt_with_simplification): Likewise.
300 2021-11-30 Richard Sandiford <richard.sandiford@arm.com>
302 * gimple-match.h (code_helper): Add functions for querying whether
303 the code represents an internal_fn or a built_in_function.
304 Provide explicit conversion operators for both cases.
305 (gimple_extract_op): Declare.
306 * gimple-match-head.c (gimple_extract): New function, extracted from...
307 (gimple_simplify): ...here.
308 (gimple_extract_op): New function.
310 2021-11-30 Eric Botcazou <ebotcazou@adacore.com>
313 * config/i386/i386.c (ix86_output_call_insn): Beef up comment about
314 nops emitted with SEH.
315 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
316 the cold section, emit a nop before the directive if the previous
317 active instruction is a call.
319 2021-11-30 Roger Sayle <roger@nextmovesoftware.com>
320 Uroš Bizjak <ubizjak@gmail.com>
323 * config/i386/sse.md (define_split): New splitters to simplify
324 not;vec_duplicate;and as vec_duplicate;andn.
326 2021-11-30 Richard Biener <rguenther@suse.de>
329 * match.pd (-((int)x >> (prec - 1)) to (unsigned)x >> (prec - 1)):
330 Use VIEW_CONVERT_EXPR for vectors.
332 2021-11-30 Richard Biener <rguenther@suse.de>
334 * cfgrtl.c (skip_insns_after_block): Refactor code to
335 be more easily readable.
336 * expr.c (op_by_pieces_d::run): Remove unreachable
338 * sched-deps.c (sched_analyze): Remove unreachable
340 * sel-sched-ir.c (in_same_ebb_p): Likewise.
341 * tree-ssa-alias.c (nonoverlapping_refs_since_match_p):
342 Remove unreachable code.
343 * tree-vect-slp.c (vectorize_slp_instance_root_stmt):
344 Refactor to avoid unreachable loop iteration.
345 * tree.c (walk_tree_1): Remove unreachable break.
346 * vec-perm-indices.c (vec_perm_indices::series_p): Remove
349 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
352 * config/rs6000/rs6000-call.c (rs6000_builtin_decl): Remove builtin mask
355 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
357 * config/rs6000/rs6000.c
358 (rs6000_cost_data::update_target_cost_per_stmt): Adjust the way to
359 compute extra penalized cost. Remove useless parameter.
360 (rs6000_cost_data::rs6000_add_stmt_cost): Adjust the call to function
361 update_target_cost_per_stmt.
363 2021-11-30 Kewen Lin <linkw@linux.ibm.com>
366 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
368 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
369 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
370 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
371 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
372 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
373 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
374 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
375 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
377 2021-11-30 Andrew MacLeod <amacleod@redhat.com>
379 PR tree-optimization/103467
380 * gimple-range-gori.cc (range_def_chain::register_dependency): Don't
381 use an object reference after a potential resize.
383 2021-11-29 Uroš Bizjak <ubizjak@gmail.com>
386 * config/i386/i386.md (*movhi_internal): Introduce (*v,C) alternative.
387 Do not allocate non-GPR registers. Optimize xmm->xmm moves when
388 optimizing for size. Fix vpinsrw insn template.
389 (*movhf_internal): Fix pinsrw and pextrw insn templates for
390 AVX targets. Use sselog1 type instead of sselog. Optimize GPR moves.
391 Optimize xmm->xmm moves for TARGET_SSE_PARTIAL_REG_DEPENDENCY
392 and TARGET_SSE_SPLIT_REGS targets.
394 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
396 * configure: Re-regenerate.
398 2021-11-29 Eric Gallager <egallager@gcc.gnu.org>
401 * Makefile.in: Substitute CTAGS, ETAGS, and CSCOPE
402 variables. Use ETAGS variable in TAGS target.
403 * configure: Regenerate.
404 * configure.ac: Allow CTAGS, ETAGS, and CSCOPE
405 variables to be overridden.
407 2021-11-29 Paul A. Clarke <pc@us.ibm.com>
409 * config/rs6000/xmmintrin.h (_mm_movemask_ps): Use vec_extractm
411 * config/rs6000/emmintrin.h (_mm_movemask_pd): Likewise.
412 (_mm_movemask_epi8): Likewise.
414 2021-11-29 Richard Biener <rguenther@suse.de>
416 * read-rtl-function.c (function_reader::read_rtx_operand):
417 Return only after resetting m_in_call_function_usage.
419 2021-11-29 Roger Sayle <roger@nextmovesoftware.com>
421 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti): Perform the
422 conversion via V2DImode using vec_extractv2didi on TARGET_SSE2.
423 * config/i386/sse.md (rotlv1ti3, rotrv1ti3): Change constraint
424 on QImode shift amounts from const_int_operand to general_operand.
426 2021-11-29 Richard Biener <rguenther@suse.de>
428 * tree.h (reverse_storage_order_for_component_p): Remove
429 spurious gcc_unreachable.
430 * cfganal.c (dfs_find_deadend): Likewise.
431 * fold-const-call.c (fold_const_logb): Likewise.
432 (fold_const_significand): Likewise.
433 * gimple-ssa-store-merging.c (lhs_valid_for_store_merging_p):
436 2021-11-29 Richard Biener <rguenther@suse.de>
438 * vec.c (qsort_chk): Do not return the void return value
439 from the noreturn qsort_chk_error.
440 * ccmp.c (expand_ccmp_expr_1): Remove unreachable return.
441 * df-scan.c (df_ref_equal_p): Likewise.
442 * dwarf2out.c (is_base_type): Likewise.
443 (add_const_value_attribute): Likewise.
444 * fixed-value.c (fixed_arithmetic): Likewise.
445 * gimple-fold.c (gimple_fold_builtin_fputs): Likewise.
446 * gimple-ssa-strength-reduction.c (stmt_cost): Likewise.
447 * graphite-isl-ast-to-gimple.c
448 (gcc_expression_from_isl_expr_op): Likewise.
449 (gcc_expression_from_isl_expression): Likewise.
450 * ipa-fnsummary.c (will_be_nonconstant_expr_predicate):
452 * lto-streamer-in.c (lto_input_mode_table): Likewise.
454 2021-11-29 liuhongt <hongtao.liu@intel.com>
457 * config/i386/i386.c (inline_secondary_memory_needed): HImode
458 move between GPR and SSE registers is supported under
459 TARGET_SSE2 and above.
460 * config/i386/i386.md (extendhfsf2): Optimize expander.
462 * config/i386/sse.md (sse2p4_1): Adjust attr for V8HFmode to
465 2021-11-29 liuhongt <hongtao.liu@intel.com>
469 * config/i386/i386.c (ix86_secondary_reload): Without
470 TARGET_SSE4_1, General register is needed to move HImode from
471 sse register to memory.
472 * config/i386/sse.md (*vec_extrachf): Use %vpextrw instead of
473 pextrw in output templates.
474 * config/i386/i386.md (movhi_internal): Ditto, also fix typo of
475 MEM_P (operands[1]) and adjust mode/prefix/type attribute for
476 alternatives related to sse register.
478 2021-11-29 Richard Biener <rguenther@suse.de>
480 PR tree-optimization/103458
481 * tree-ssa-dce.c (make_forwarders_with_degenerate_phis): Do not
482 create forwarders for blocks with abnormal predecessors.
484 2021-11-29 Richard Biener <rguenther@suse.de>
486 * gimple-predicate-analysis.cc (can_be_invalidated_p):
487 Restore semantics to the one before the split from
490 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
492 * profile.c: Include sreal.h
493 (struct bb_stats): New.
494 (cmp_stats): New function.
495 (compute_branch_probabilities): Output bb stats.
497 2021-11-28 Jan Hubicka <hubicka@ucw.cz>
499 * cfghooks.c: Include sreal.h, profile.h.
500 (profile_record_check_consistency): Fix checking of count counsistency;
501 record also dynamic mismatches.
502 * cfgrtl.c (rtl_account_profile_record): Similarly.
503 * tree-cfg.c (gimple_account_profile_record): Likewise.
504 * cfghooks.h (struct profile_record): Remove num_mismatched_freq_in,
505 num_mismatched_freq_out, turn time to double, add
506 dyn_mismatched_prob_out, dyn_mismatched_count_in,
507 num_mismatched_prob_out; remove num_mismatched_count_out.
508 * passes.c (account_profile_1): New function.
509 (account_profile_in_list): New function.
510 (pass_manager::dump_profile_report): Rewrite.
511 (execute_one_ipa_transform_pass): Check profile consistency after
513 (execute_all_ipa_transforms): Remove cfun test; record all transform
515 (execute_one_pass): Fix collecting of profile stats.
517 2021-11-27 Jakub Jelinek <jakub@redhat.com>
520 * config/i386/i386.md (x86_64_shld, x86_shld, x86_64_shrd, x86_shrd):
521 Change insn pattern to accurately describe the instructions.
522 (*x86_64_shld_1, *x86_shld_1, *x86_64_shrd_1, *x86_shrd_1): New
523 define_insn patterns.
524 (*x86_64_shld_2, *x86_shld_2, *x86_64_shrd_2, *x86_shrd_2): New
525 define_insn_and_split patterns.
526 (*ashl<dwi>3_doubleword_mask, *ashl<dwi>3_doubleword_mask_1,
527 *<insn><dwi>3_doubleword_mask, *<insn><dwi>3_doubleword_mask_1,
528 ix86_rotl<dwi>3_doubleword, ix86_rotr<dwi>3_doubleword): Adjust
529 splitters for x86_{,64_}sh{l,r}d pattern changes.
531 2021-11-27 Jakub Jelinek <jakub@redhat.com>
533 PR tree-optimization/103435
534 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Avoid UB if
535 n->range - rsize == 8, just clear both *cmpnop and *cmpxchg in that
538 2021-11-27 Martin Jambor <mjambor@suse.cz>
541 * ipa-prop.c (ipcp_transform_function): Call
542 delete_unreachable_blocks_update_callgraph instead of returning
545 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
547 * tree-object-size.c (object_sizes_grow, object_sizes_release,
548 object_sizes_unknown_p, object_sizes_get, object_size_set_force,
549 object_sizes_set): New functions.
550 (addr_object_size, compute_builtin_object_size,
551 expr_object_size, call_object_size, unknown_object_size,
552 merge_object_sizes, plus_stmt_object_size,
553 cond_expr_object_size, collect_object_sizes_for,
554 check_for_plus_in_loops_1, init_object_sizes,
555 fini_object_sizes): Adjust.
557 2021-11-26 Siddhesh Poyarekar <siddhesh@gotplt.org>
559 * tree-object-size.c: New enum.
560 (object_sizes, computed, addr_object_size,
561 compute_builtin_object_size, expr_object_size, call_object_size,
562 merge_object_sizes, plus_stmt_object_size,
563 collect_object_sizes_for, init_object_sizes, fini_object_sizes,
564 object_sizes_execute): Replace magic numbers with enums.
566 2021-11-26 Roger Sayle <roger@nextmovesoftware.com>
568 * tree-ssa-loop-ivopts.c (cand_value_at): Take a class
569 tree_niter_desc* argument instead of just a tree for NITER.
570 If we require the iv candidate value at the end of the final
571 loop iteration, try using the original loop bound as the
572 NITER for sufficiently simple loops.
573 (may_eliminate_iv): Update (only) call to cand_value_at.
575 2021-11-26 Jan Hubicka <hubicka@ucw.cz>
577 * ipa-modref.c (analyze_function): Drop parameter F and dump
578 cgraph node name rather than cfun name.
579 (modref_generate): Update.
580 (modref_summaries::insert):Update.
581 (modref_summaries_lto::insert):Update.
582 (pass_modref::execute):Update.
583 (ipa_merge_modref_summary_after_inlining): Improve combining of
586 2021-11-26 Jan Hubicka <jh@suse.cz>
589 * ipa-modref.c (update_escape_summary_1): Fix handling of min_flags.
591 2021-11-26 konglin1 <lingling.kong@intel.com>
594 * config/i386/i386.c (ix86_can_change_mode_class): Allow 16 bit data in XMM register
596 * config/i386/i386.md (extendhfsf2): Add extenndhfsf2 for TARGET_F16C.
597 (extendhfdf2): Restrict extendhfdf for TARGET_AVX512FP16 only.
598 (*extendhf<mode>2): Rename from extendhf<mode>2.
599 (truncsfhf2): Likewise.
600 (truncdfhf2): Likewise.
601 (*trunc<mode>2): Likewise.
603 2021-11-26 liuhongt <hongtao.liu@intel.com>
606 * match.pd: Fix typo, use the type of second parameter, not
609 2021-11-25 Jan Hubicka <jh@suse.cz>
611 * ipa-cp.c (ipa_value_range_from_jfunc): Remove forgotten early return.
613 2021-11-25 Roger Sayle <roger@nextmovesoftware.com>
616 * match.pd (minus @0 @0): Check tree_expr_maybe_infinite_p.
618 2021-11-25 Martin Jambor <mjambor@suse.cz>
621 * ipa-prop.h (ipa_get_param): New overload. Move bits of the existing
623 * ipa-param-manipulation.h (ipa_param_adjustments): New member
624 function get_updated_index_or_split.
625 * ipa-param-manipulation.c
626 (ipa_param_adjustments::get_updated_index_or_split): New function.
627 * ipa-prop.c (adjust_agg_replacement_values): Reimplement, add
628 capability to identify scalarized parameters and perform substitution
630 (ipcp_transform_function): Create descriptors earlier, handle new
631 return values of adjust_agg_replacement_values.
633 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
635 * gimple-range-path.cc (path_range_query::compute_ranges_defined): Remove.
636 (path_range_query::compute_ranges_in_block): Revert to bitmap order.
637 * gimple-range-path.h: Remove compute_ranges_defined.
639 2021-11-25 Andrew Stubbs <ams@codesourcery.com>
642 * config/gcn/gcn.c (move_callee_saved_registers): Ensure that the
643 number of spilled registers is counted correctly.
645 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
647 * ipa-prop.h (ipa_node_params::ipa_node_params): Initialize
650 2021-11-25 Andrew MacLeod <amacleod@redhat.com>
652 PR tree-optimization/103359
653 * gimple-range-fold.cc (fold_using_range::range_of_phi): If arg is
654 equivalent to def, don't initially include it's range.
656 2021-11-25 Jan Hubicka <hubicka@ucw.cz>
658 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not check
661 2021-11-25 Richard Biener <rguenther@suse.de>
663 * cgraphunit.c (symbol_table::output_weakrefs): Remove
665 (get_alias_symbol): Remove now unused function.
667 2021-11-25 Richard Biener <rguenther@suse.de>
669 * cfgrtl.c (rtl_verify_fallthru): Do not stop verifying
671 (skip_insns_after_block): Remove unreachable break and continue.
673 2021-11-25 Richard Biener <rguenther@suse.de>
675 * cfgexpand.c (label_rtx_for_bb): Remove dead loop construct.
677 2021-11-25 Richard Biener <rguenther@suse.de>
679 * regset.h (REG_SET_EMPTY_P): New macro.
680 * cfgcleanup.c (thread_jump): Use REG_SET_EMPTY_P.
682 2021-11-25 Martin Liska <mliska@suse.cz>
684 * doc/invoke.texi: Use @option for -Wuninitialized.
686 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
688 PR tree-optimization/103254
689 * gimple-range-path.cc (path_range_query::compute_ranges): Move
690 exported boolean code...
691 (path_range_query::compute_imports): ...here.
693 2021-11-25 Aldy Hernandez <aldyh@redhat.com>
695 PR tree-optimization/103254
696 * gimple-range-path.cc (path_range_query::compute_ranges_defined): New
697 (path_range_query::compute_ranges_in_block): Move to
698 compute_ranges_defined.
699 * gimple-range-path.h (compute_ranges_defined): New.
701 2021-11-25 Jakub Jelinek <jakub@redhat.com>
703 PR tree-optimization/103417
704 * match.pd ((X & Y) CMP 0): Only handle eq and ne. Commonalize
707 2021-11-25 Jakub Jelinek <jakub@redhat.com>
709 PR tree-optimization/103376
710 * gimple-ssa-store-merging.c (perform_symbolic_merge): For
711 BIT_IOR_EXPR, if masked1 && masked2 && masked1 != masked2, don't
712 punt, but set the corresponding result byte to MARKER_BYTE_UNKNOWN.
713 For BIT_XOR_EXPR similarly and if masked1 == masked2 and the
714 byte isn't MARKER_BYTE_UNKNOWN, set the corresponding result byte to
717 2021-11-25 Jakub Jelinek <jakub@redhat.com>
720 * doc/invoke.texi (-Wcomma-subscript): Document that for
721 -std=c++20 the option isn't enabled by default with -Wno-deprecated
722 but for -std=c++23 it is.
724 2021-11-24 Jan Hubicka <hubicka@ucw.cz>
726 * ipa-modref.c (implicit_eaf_flags_for_edge_and_arg): Break out from...
727 (modref_merge_call_site_flags): ... here.
728 (ipa_merge_modref_summary_after_inlining): Use it.
730 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
732 * cfgloop.c (verify_loop_structure): Reduce scope of
733 'class loop *loop' variable.
734 * ipa-fnsummary.c (analyze_function_body): Likewise.
735 * loop-init.c (fix_loop_structure): Likewise.
736 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
737 * predict.c (predict_loops): Likewise.
738 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
739 * tree-vectorizer.c (pass_vectorize::execute): Likewise.
741 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
743 PR tree-optimization/103231
744 * gimple-range.cc (gimple_ranger::gimple_ranger): Create stmt stack.
745 (gimple_ranger::gimple_ranger): Delete stmt stack.
746 (gimple_ranger::range_of_stmt): Process depenedencies if they have no
748 (gimple_ranger::prefill_name): New.
749 (gimple_ranger::prefill_stmt_dependencies): New.
750 * gimple-range.h (class gimple_ranger): Add prototypes.
752 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
754 * gimple-range-cache.cc (ranger_cache::get_global_range): Always
755 return a range, return if it came from the cache or not.
756 (get_non_stale_global_range): Rename to get_global_range, and return
757 the temporal state in a flag.
758 * gimple-range-cache.h (get_non_stale_global_range): Rename and adjust.
759 * gimple-range.cc (gimple_ranger::range_of_expr): No need to query
761 (gimple_ranger::range_of_stmt): Adjust for global cache temporal state
764 2021-11-24 Andrew MacLeod <amacleod@redhat.com>
766 * gimple-range.cc (gimple_ranger::range_on_edge): Call trailer when
767 a constant is encountered to terminate the trace.
769 2021-11-24 Maciej W. Rozycki <macro@embecosm.com>
772 * reload.c (find_reloads_address_1): Also accept the ASHIFT form
773 of indexed addressing.
774 (find_reloads): Adjust accordingly.
776 2021-11-24 Richard Biener <rguenther@suse.de>
777 Jan Hubicka <jh@suse.cz>
779 PR tree-optimization/103168
780 * ipa-modref.h (struct modref_summary): Add load_accesses.
781 * ipa-modref.c (modref_summary::finalize): Initialize load_accesses.
782 * tree-ssa-sccvn.c (visit_reference_op_call): Use modref
783 info to walk the virtual use->def chain to CSE const/pure
784 function calls possibly reading from memory.
786 2021-11-24 Thomas Schwinge <thomas@codesourcery.com>
789 * gimplify.c (gimplify_adjust_omp_clauses): Restore previous
792 2021-11-24 Richard Biener <rguenther@suse.de>
795 * match.pd: Avoid canonicalizing (le/ge @0 @0) to (eq @0 @0)
796 with NaNs and -ftrapping-math.
798 2021-11-24 Jakub Jelinek <jakub@redhat.com>
801 * omp-general.c (omp_context_selector_matches): For ACCEL_COMPILER,
802 return 0 for kind(host) and continue for kind(nohost).
804 2021-11-24 Jakub Jelinek <jakub@redhat.com>
807 * attribs.h (lookup_attribute): Allow attr_name to start with
808 underscore, as long as canonicalize_attr_name returns false.
809 (lookup_attribute_by_prefix): Don't call get_attribute_name twice.
810 * attribs.c (extract_attribute_substring): Reimplement using
811 canonicalize_attr_name.
812 (register_scoped_attribute): Change gcc_assert into
813 gcc_checking_assert, verify !canonicalize_attr_name rather than
814 that str.str doesn't start with '_'.
816 2021-11-24 Jakub Jelinek <jakub@redhat.com>
818 PR tree-optimization/103376
819 * gimple-ssa-store-merging.c (perform_symbolic_merge): Add CODE
820 argument. If CODE is not BIT_IOR_EXPR, ensure that one of masked1
822 (find_bswap_or_nop_1, find_bswap_or_nop,
823 imm_store_chain_info::try_coalesce_bswap): Adjust
824 perform_symbolic_merge callers.
826 2021-11-24 Richard Biener <rguenther@suse.de>
828 * tree-ssa-loop-ivopts.c (find_givs): Take loop body as
829 argument instead of re-computing it.
830 (find_interesting_uses): Likewise.
831 (find_induction_variables): Pass through loop body.
832 (tree_ssa_iv_optimize_loop): Pass down loop body.
834 2021-11-24 Tamar Christina <tamar.christina@arm.com>
836 * tree-ssa-phiopt.c (spaceship_replacement): Handle new canonical
839 2021-11-24 Tamar Christina <tamar.christina@arm.com>
841 * tree.c (bitmask_inv_cst_vector_p): New.
842 * tree.h (bitmask_inv_cst_vector_p): New.
843 * match.pd: Use it in new bitmask compare pattern.
845 2021-11-24 Jason Merrill <jason@redhat.com>
847 * timevar.h (class auto_cond_timevar): New.
849 2021-11-24 Hongtao Liu <hongtao.liu@intel.com>
850 H.J. Lu <hongjiu.lu@intel.com>
852 PR tree-optimization/103194
853 * match.pd (gimple_nop_atomic_bit_test_and_p): Extended to
855 * tree-ssa-ccp.c (gimple_nop_convert): Declare.
856 (optimize_atomic_bit_test_and): Enhance
857 optimize_atomic_bit_test_and to handle truncation.
859 2021-11-23 Martin Sebor <msebor@redhat.com>
862 * Makefile.in (OBJS): Add gimple-warn-recursion.o.
863 * common.opt: Add -Winfinite-recursion.
864 * doc/invoke.texi (-Winfinite-recursion): Document.
865 * passes.def (pass_warn_recursion): Schedule a new pass.
866 * tree-pass.h (make_pass_warn_recursion): Declare.
867 * gimple-warn-recursion.c: New file.
869 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
871 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Change
872 error messages for ENB_P8V and ENB_P9V.
874 2021-11-23 Bill Schmidt <wschmidt@linux.ibm.com>
876 * config/rs6000/rs6000-builtin-new.def: Add power6-64 stanza. Move
877 CMPB to power6-64 stanza.
878 * config/rs6000/rs6000-call.c (rs6000_invalid_new_builtin): Handle
880 (rs6000_new_builtin_is_supported): Likewise.
881 (rs6000_expand_new_builtin): Likewise. Clean up formatting.
882 (rs6000_init_builtins): Handle ENB_P6_64 case.
883 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): Add BSTZ_P6_64.
884 (stanza_map): Add entry mapping power6-64 to BSTZ_P6_64.
885 (enable_string): Add "ENB_P6_64".
886 (write_decls): Add ENB_P6_64 to bif_enable enum.
888 2021-11-23 Jan Hubicka <jh@suse.cz>
890 * ipa-modref-tree.h (struct modref_tree): Remove max_bases, max_refs
892 (modref_tree::modref_tree): Remove parametr.
893 (modref_tree::insert_base): Add max_bases parameter.
894 (modref_tree::insert): Add max_bases, max_refs, max_accesses
896 (modref_tree::insert): New member function.
897 (modref_tree::merge): Add max_bases, max_refs, max_accesses
899 (modref_tree::insert): New member function.
900 * ipa-modref-tree.c (test_insert_search_collapse): Update.
901 (test_merge): Update.
902 * ipa-modref.c (dump_records): Don't dump max_refs and max_bases.
903 (dump_lto_records): Likewise.
904 (modref_summary::finalize): Fix whitespace.
905 (get_modref_function_summary): Likewise.
906 (modref_access_analysis::record_access): Update.
907 (modref_access_analysis::record_access_lto): Update.
908 (modref_access_analysis::process_fnspec): Update.
909 (analyze_function): Update.
910 (modref_summaries::duplicate): Update.
911 (modref_summaries_lto::duplicate): Update.
912 (write_modref_records): Update.
913 (read_modref_records): Update.
914 (read_section): Update.
915 (propagate_unknown_call): Update.
916 (modref_propagate_in_scc): Update.
917 (ipa_merge_modref_summary_after_inlining): Update.
919 2021-11-23 Martin Liska <mliska@suse.cz>
921 * doc/invoke.texi: Remove 2 more duplicite param descriptions.
923 2021-11-23 Richard Biener <rguenther@suse.de>
925 PR tree-optimization/103361
926 * gimple-loop-jam.c (adjust_unroll_factor): Use lambda_int
927 for the dependence distance.
928 * tree-data-ref.c (print_lambda_vector): Properly print a lambda_int.
930 2021-11-23 Jakub Jelinek <jakub@redhat.com>
932 * tree-inline.h (struct copy_body_data): Remove
933 transform_lang_insert_block member.
934 * tree-inline.c (remap_block): Don't call
935 id->transform_lang_insert_block.
936 (optimize_inline_calls, copy_gimple_seq_and_replace_locals,
937 tree_function_versioning, maybe_inline_call_in_expr,
938 copy_fn): Don't initialize id.transform_lang_insert_block.
939 * gimplify.c (gimplify_omp_loop): Likewise.
941 2021-11-23 Jan Hubicka <hubicka@ucw.cz>
943 PR tree-optimization/103335
944 * tree-ssa-dse.c (valid_ao_ref_for_dse): Rename to ...
945 (valid_ao_ref_kill_for_dse): ... this; do not check that boundaries
946 are divisible by BITS_PER_UNIT.
947 (get_byte_aligned_range_containing_ref): New function.
948 (get_byte_aligned_range_contained_in_ref): New function.
949 (normalize_ref): Rename to ...
950 (get_byte_range): ... this one; handle accesses not aligned to byte
951 boundary; return range in bytes rater than updating ao_ref.
952 (clear_live_bytes_for_ref): Take write ref by reference; simplify using
954 (setup_live_bytes_from_ref): Likewise.
955 (clear_bytes_written_by): Update.
956 (live_bytes_read): Update.
957 (dse_classify_store): Simplify tech before live_bytes_read checks.
959 2021-11-23 Andrew Pinski <apinski@marvell.com>
961 PR tree-optimization/102216
962 * gimple-fold.c (fold_stmt_1): Add canonicalization
963 of "&MEM[ssa_n, CST]" to "ssa_n p+ CST", note this
964 can only be done if !in_place.
966 2021-11-23 Jakub Jelinek <jakub@redhat.com>
969 * gimplify.c (replace_reduction_placeholders): Remove.
970 (note_no_context_vars): New function.
971 (gimplify_omp_loop): For OMP_PARALLEL's BIND_EXPR create a new
972 BLOCK. Use copy_tree_body_r with walk_tree instead of unshare_expr
973 and replace_reduction_placeholders for duplication of
974 OMP_CLAUSE_REDUCTION_{INIT,MERGE} expressions. Ensure all mentioned
975 automatic vars have DECL_CONTEXT set to non-NULL before doing so
976 and reset it afterwards for those vars and their corresponding
979 2021-11-23 Haochen Gui <guihaoc@gcc.gnu.org>
982 * config/rs6000/altivec.md (altivec_vreve<mode>2 for VEC_K): Use
983 xxbrq for v16qi, xxbrq + xxbrh for v8hi and xxbrq + xxbrw for v4si
984 or v4sf when p9_vector is set.
985 (altivec_vreve<mode>2 for VEC_64): Defined. Implemented by xxswapd.
987 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
989 PR tree-optimization/102232
990 * match.pd (x * (1 + y / x) - y) -> (x - y % x): New optimization.
992 2021-11-23 Navid Rahimi <navidrahimi@microsoft.com>
994 PR tree-optimization/96779
995 * match.pd (-x == x) -> (x == 0): New optimization.
997 2021-11-22 Roger Sayle <roger@nextmovesoftware.com>
999 PR tree-optimization/98953
1000 PR tree-optimization/103345
1001 * gimple-ssa-store-merging.c (find_bswap_or_nop_1): Handle
1002 BIT_XOR_EXPR and PLUS_EXPR the same as BIT_IOR_EXPR.
1003 (pass_optimize_bswap::execute): Likewise.
1005 2021-11-22 Martin Liska <mliska@suse.cz>
1007 * doc/invoke.texi: Remove duplicate documentation for 3 params.
1009 2021-11-22 Richard Biener <rguenther@suse.de>
1011 PR tree-optimization/103351
1012 * tree-ssa-dce.c (sort_phi_args): Sort after e->dest_idx as
1015 2021-11-22 liuhongt <hongtao.liu@intel.com>
1018 * config/i386/constraints.md (Bk): New
1019 define_memory_constraint.
1020 * config/i386/i386-protos.h (ix86_gpr_tls_address_pattern_p):
1022 * config/i386/i386.c (ix86_gpr_tls_address_pattern_p): New
1024 * config/i386/i386.md (*movsi_internal): Don't allow
1025 mask/sse/mmx move in TLS code sequences.
1026 (*movdi_internal): Ditto.
1028 2021-11-22 Kewen Lin <linkw@linux.ibm.com>
1030 * config/xtensa/xtensa.md (movdi_internal, movdf_internal): Fix split
1033 2021-11-21 Jakub Jelinek <jakub@redhat.com>
1036 * config/i386/i386-options.c (ix86_valid_target_attribute_p): If
1037 fndecl already has DECL_FUNCTION_SPECIFIC_TARGET, use that as base
1038 instead of target_option_default_node.
1040 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1043 * ipa-modref.c (parm_map_for_arg): Rename to ...
1044 (parm_map_for_ptr): .. this one; handle static chain and calls to
1046 (modref_access_analysis::get_access): Use parm_map_for_ptr.
1047 (modref_access_analysis::process_fnspec): Update.
1048 (modref_access_analysis::analyze_load): Update.
1049 (modref_access_analysis::analyze_store): Update.
1051 2021-11-21 Jan Hubicka <hubicka@ucw.cz>
1053 * ipa-modref.c (ignore_nondeterminism_p): Move earlier in source
1055 (ignore_retval_p): Likewise.
1056 (ignore_stores_p): Likewise.
1057 (parm_map_for_arg): Likewise.
1058 (class modref_access_analysis): New class.
1059 (modref_access_analysis::set_side_effects): New member function.
1060 (modref_access_analysis::set_nondeterministic): New member function.
1061 (get_access): Turn to ...
1062 (modref_access_analysis::get_access): ... this one.
1063 (record_access): Turn to ...
1064 (modref_access_analysis::record_access): ... this one.
1065 (record_access_lto): Turn to ...
1066 (modref_access_analysis::record_access_lto): ... This one.
1067 (record_access_p): Turn to ...
1068 (modref_access_analysis::record_access_p): ... This one
1069 (modref_access_analysis::record_unknown_load): New member function.
1070 (modref_access_analysis::record_unknown_store): New member function.
1071 (get_access_for_fnspec): Turn to ...
1072 (modref_access_analysis::get_access_for_fnspec): ... this one.
1073 (merge_call_side_effects): Turn to ...
1074 (moderf_access_analysis::merge_call_side_effects): Turn to ...
1075 (collapse_loads): Move later in source code.
1076 (collapse_stores): Move later in source code.
1077 (process_fnspec): Turn to ...
1078 (modref_access_analysis::process_fnspec): ... this one.
1079 (analyze_call): Turn to ...
1080 (modref_access_analysis::analyze_call): ... this one.
1081 (struct summary_ptrs): Remove.
1082 (analyze_load): Turn to ...
1083 (modref_access_analysis::analyze_load): ... this one.
1084 (analyze_store): Turn to ...
1085 (modref_access_analysis::analyze_store): ... this one.
1086 (analyze_stmt): Turn to ...
1087 (modref_access_analysis::analyze_stmt): ... This one.
1088 (remove_summary): Remove.
1089 (modref_access_analysis::propagate): Break out from ...
1090 (modref_access_analysis::analyze): Break out from ...
1091 (analyze_function): ... here.
1093 2021-11-21 Roger Sayle <roger@nextmovesoftware.com>
1094 Robin Dapp <rdapp@linux.ibm.com>
1097 * tree-ssa-math-opts.c (convert_mult_to_widen): Recognize
1098 signed WIDEN_MULT_EXPR if the target supports umul_widen_optab.
1100 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
1103 * ipa-modref.c (ignore_nondeterminism_p): Allow looping pure/cont.
1104 (merge_call_side_effects): Improve debug output.
1106 2021-11-20 Jan Hubicka <hubicka@ucw.cz>
1109 * ipa-pure-const.c (propagate_pure_const): Fix merging of loping flag.
1111 2021-11-20 Jeff Law <jeffreyalaw@gmail.com>
1113 PR tree-optimization/103226
1114 * config/bfin/bfin.md (doloop pattern, splitter and expander): Clobber
1117 2021-11-20 Andrew Pinski <apinski@marvell.com>
1119 PR tree-optimization/103220
1120 * match.pd ((type) X bitop CST): Don't check if CST
1121 fits into the type if only the sign changes.
1123 2021-11-20 Alexandre Oliva <oliva@adacore.com>
1125 PR tree-optimization/102988
1126 * gimple-harden-conditionals.cc (detach_value): Copy SSA_NAME
1127 without decl sharing.
1129 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
1131 * doc/invoke.texi: Remove whitespace after an @option.
1133 2021-11-19 Paul A. Clarke <pc@us.ibm.com>
1135 * config/rs6000/emmintrin.h (_mm_sad_epu8): Use vec_absd when
1136 _ARCH_PWR9, optimize vec_sum2s when LE.
1138 2021-11-19 Iain Sandoe <iain@sandoe.co.uk>
1141 * config/darwin-driver.c (darwin_driver_init): Handle exported
1142 symbols and symbol lists (suppress automatic export of the TLS
1144 * config/darwin.c (darwin_rename_builtins): Remove workaround.
1145 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): Likewise.
1146 (REAL_LIBGCC_SPEC): Handle revised library uses.
1147 * config/darwin.opt (nodefaultexport): New.
1148 * config/i386/darwin.h (PR80556_WORKAROUND): Remove.
1149 * config/i386/darwin32-biarch.h (PR80556_WORKAROUND): Likewise.
1150 * config/i386/darwin64-biarch.h (PR80556_WORKAROUND): Likewise.
1152 2021-11-19 Martin Jambor <mjambor@suse.cz>
1154 * opts.c (default_options_table): Switch off
1155 flag_semantic_interposition at Ofast.
1156 * doc/invoke.texi (Optimize Options): Document that Ofast switches off
1157 -fsemantic-interposition.
1159 2021-11-19 Jan Hubicka <hubicka@ucw.cz>
1161 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Do not guard modref
1162 by !gimple_call_chain.
1164 2021-11-19 Martin Sebor <msebor@redhat.com>
1168 * doc/invoke.texi (-Waddress): Update.
1170 2021-11-19 Andrew MacLeod <amacleod@redhat.com>
1172 PR tree-optimization/103254
1173 * gimple-range-gori.cc (range_def_chain::get_def_chain): Limit the
1174 depth for all statements with multple ssa names.
1176 2021-11-19 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
1178 * config/s390/s390.md (define_peephole2): Variable insn points
1179 to the first matched insn. Use peep2_next_insn(1) to refer to
1180 the second matched insn.
1182 2021-11-19 Tamar Christina <tamar.christina@arm.com>
1184 PR tree-optimization/103311
1186 * tree-vect-slp-patterns.c (vect_validate_multiplication): Fix CONJ
1187 test to new codegen.
1188 (complex_mul_pattern::matches): Move check downwards.
1190 2021-11-19 Martin Liska <mliska@suse.cz>
1193 2021-11-19 Martin Liska <mliska@suse.cz>
1195 * cfgexpand.c (pass_expand::execute): Use option directly.
1196 * function.c (allocate_struct_function): Likewise.
1197 * gimple-low.c (lower_function_body): Likewise.
1198 (lower_stmt): Likewise.
1199 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1200 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1201 * ipa-split.c (split_function): Likewise.
1202 * lto-streamer-in.c (input_function): Likewise.
1203 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1204 * ssa-iterators.h (num_imm_uses): Likewise.
1205 * tree-cfg.c (make_blocks): Likewise.
1206 (gimple_merge_blocks): Likewise.
1207 * tree-inline.c (tree_function_versioning): Likewise.
1208 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1209 * tree-sra.c (analyze_access_subtree): Likewise.
1210 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1211 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1212 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1213 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1214 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1215 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1216 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1217 (target_for_debug_bind): Likewise.
1218 (insert_debug_temp_for_var_def): Likewise.
1219 (insert_debug_temps_for_defs): Likewise.
1220 (reset_debug_uses): Likewise.
1221 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1222 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1223 (adjust_debug_stmts): Likewise.
1224 (adjust_phi_and_debug_stmts): Likewise.
1225 (vect_do_peeling): Likewise.
1226 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1227 (vect_transform_loop): Likewise.
1228 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1229 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1230 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1232 2021-11-19 Giuliano Belinassi <gbelinassi@suse.de>
1234 * gcc.c (process_command): Skip dumpdir override if file is a
1236 * doc/invoke.texi: Update -dumpdir documentation.
1238 2021-11-19 Andrew Pinski <apinski@marvell.com>
1240 PR tree-optimization/103314
1241 * match.pd ((type) X op CST): Restrict the equal
1242 TYPE_PRECISION case to GIMPLE only.
1244 2021-11-19 Martin Liska <mliska@suse.cz>
1247 * ipa-modref-tree.h (struct modref_parm_map): Add default
1249 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Use it.
1251 2021-11-19 Richard Biener <rguenther@suse.de>
1253 PR middle-end/103248
1254 * tree-eh.c (operation_could_trap_helper_p): Properly handle
1255 fixed-point RDIV_EXPR.
1257 2021-11-19 Richard Biener <rguenther@suse.de>
1259 PR tree-optimization/102436
1260 * tree-ssa-loop-im.c (execute_sm_if_changed): Add mode
1261 to just create the if structure and return the then block.
1262 (execute_sm): Add flag to indicate the var will re-use
1264 (hoist_memory_references): Support a single conditional
1265 block with all stores as special case.
1267 2021-11-19 Andrew Pinski <apinski@marvell.com>
1269 PR tree-optimization/103317
1270 * tree-ssa-phiopt.c (minmax_replacement): For the non empty
1271 middle bb case, check to make sure it has a single predecessor.
1273 2021-11-19 Andrew Pinski <apinski@marvell.com>
1275 PR tree-optimization/103257
1277 ((m1 >/</>=/<= m2) * d -> (m1 >/</>=/<= m2) ? d : 0):
1278 Disable until !canonicalize_math_p ().
1280 2021-11-19 Marek Polacek <polacek@redhat.com>
1284 * doc/invoke.texi: Update documentation for -Wuninitialized.
1285 * tree.c (stabilize_reference): Set location.
1287 2021-11-19 liuhongt <hongtao.liu@intel.com>
1290 * config/i386/x86-tune-costs.h (skylake_cost): Reduce cost of
1291 storing 256/512-bit SSE register to be equal to cost of
1292 unaligned store to avoid odd alignment peeling.
1293 (icelake_cost): Ditto.
1295 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
1297 * config/rs6000/predicates.md (current_file_function_operand):
1298 Add flag_semantic_interposition to call of decl_replaceable_p.
1300 2021-11-18 Martin Liska <mliska@suse.cz>
1302 * ipa-modref.c (analyze_function): Do not execute the code
1303 only if dump_file != NULL.
1305 2021-11-18 Martin Liska <mliska@suse.cz>
1307 * ipa-modref.c (analyze_function): Use fnode instead of repeated
1308 cgraph_node::get (current_function_decl).
1310 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
1312 * cgraph.c (cgraph_node::get_availability): Update call of
1314 (cgraph_node::verify_node): Verify that semantic_interposition flag
1316 * cgraph.h: (symtab_node): Add semantic_interposition flag.
1317 * cgraphclones.c (set_new_clone_decl_and_node_flags): Clear
1318 semantic_interposition flag.
1319 * cgraphunit.c (cgraph_node::finalize_function): Set
1320 semantic_interposition flag.
1321 (cgraph_node::add_new_function): Likewise.
1322 (varpool_node::finalize_decl): Likewise.
1323 (cgraph_node::create_wrapper): Likewise.
1324 * common.opt (fsemantic-interposition): Turn to optimization node.
1325 * lto-cgraph.c (lto_output_node): Stream semantic_interposition.
1326 (lto_output_varpool_node): Likewise.
1327 (input_overwrite_node): Likewise.
1328 (input_varpool_node): Likewise.
1329 * symtab.c (symtab_node::dump_base): Dump new flag.
1330 * varasm.c (decl_replaceable_p): Add semantic_interposition_p
1332 * varasm.h (decl_replaceable_p): Update declaration.
1333 * varpool.c (varpool_node::ctor_useable_for_folding_p):
1334 Use semantic_interposition flag.
1335 (varpool_node::get_availability): Likewise.
1336 (varpool_node::create_alias): Copy semantic_interposition flag.
1338 2021-11-18 Jan Hubicka <hubicka@ucw.cz>
1341 * ipa-modref.c (modref_eaf_analysis::merge_call_lhs_flags): Unused
1342 parameter may still be returned.
1343 (modref_eaf_analysis::analyze_ssa_name): Call merge_call_lhs_flags
1344 even for unused function args.
1346 2021-11-18 Tamar Christina <tamar.christina@arm.com>
1348 PR tree-optimization/103311
1349 * tree-vect-slp-patterns.c (complex_mul_pattern::matches,
1350 complex_fms_pattern::matches): Check for multiplications.
1352 2021-11-18 H.J. Lu <hjl.tools@gmail.com>
1355 * config/i386/i386.c (ix86_output_jmp_thunk_or_indirect): Emit
1356 CS prefix for -mindirect-branch-cs-prefix.
1357 (ix86_output_indirect_branch_via_reg): Likewise.
1358 * config/i386/i386.opt: Add -mindirect-branch-cs-prefix.
1359 * doc/invoke.texi: Document -mindirect-branch-cs-prefix.
1361 2021-11-18 David Edelsohn <dje.gcc@gmail.com>
1363 * config/rs6000/driver-rs6000.c (detect_processor_aix): Add
1366 2021-11-18 Tamar Christina <tamar.christina@arm.com>
1368 PR tree-optimization/103253
1369 * tree-ssa-math-opts.c (convert_mult_to_fma): Check for LHS.
1371 2021-11-18 Matthias Kretz <m.kretz@gsi.de>
1373 * doc/extend.texi: Document __builtin_assoc_barrier.
1375 2021-11-18 Martin Liska <mliska@suse.cz>
1377 * cfgexpand.c (pass_expand::execute): Use option directly.
1378 * function.c (allocate_struct_function): Likewise.
1379 * gimple-low.c (lower_function_body): Likewise.
1380 (lower_stmt): Likewise.
1381 * gimple-ssa-backprop.c (backprop::prepare_change): Likewise.
1382 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call): Likewise.
1383 * ipa-split.c (split_function): Likewise.
1384 * lto-streamer-in.c (input_function): Likewise.
1385 * sese.c (sese_insert_phis_for_liveouts): Likewise.
1386 * ssa-iterators.h (num_imm_uses): Likewise.
1387 * tree-cfg.c (make_blocks): Likewise.
1388 (gimple_merge_blocks): Likewise.
1389 * tree-inline.c (tree_function_versioning): Likewise.
1390 * tree-loop-distribution.c (generate_loops_for_partition): Likewise.
1391 * tree-sra.c (analyze_access_subtree): Likewise.
1392 * tree-ssa-dce.c (remove_dead_stmt): Likewise.
1393 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1394 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
1395 * tree-ssa-reassoc.c (reassoc_remove_stmt): Likewise.
1396 * tree-ssa-tail-merge.c (tail_merge_optimize): Likewise.
1397 * tree-ssa-threadedge.c (propagate_threaded_block_debug_into): Likewise.
1398 * tree-ssa.c (gimple_replace_ssa_lhs): Likewise.
1399 (target_for_debug_bind): Likewise.
1400 (insert_debug_temp_for_var_def): Likewise.
1401 (insert_debug_temps_for_defs): Likewise.
1402 (reset_debug_uses): Likewise.
1403 * tree-ssanames.c (release_ssa_name_fn): Likewise.
1404 * tree-vect-loop-manip.c (adjust_vec_debug_stmts): Likewise.
1405 (adjust_debug_stmts): Likewise.
1406 (adjust_phi_and_debug_stmts): Likewise.
1407 (vect_do_peeling): Likewise.
1408 * tree-vect-loop.c (vect_transform_loop_stmt): Likewise.
1409 (vect_transform_loop): Likewise.
1410 * tree.h (MAY_HAVE_DEBUG_MARKER_STMTS): Remove
1411 (MAY_HAVE_DEBUG_BIND_STMTS): Remove.
1412 (MAY_HAVE_DEBUG_STMTS): Use options directly.
1414 2021-11-18 Richard Biener <rguenther@suse.de>
1416 PR tree-optimization/103277
1417 * tree-ssa-dse.c (need_ab_cleanup): New.
1418 (dse_optimize_redundant_stores): Adjust.
1419 (delete_dead_or_redundant_assignment): Get extra
1420 need_ab_cleanup argument and set when abnormal cleanup is
1422 (dse_optimize_call): Adjust.
1423 (dse_optimize_stmt): Likewise.
1424 (pass_dse::execute): Allocate and deallocate need_ab_cleanup.
1425 Perform abnormal cleanup.
1426 * tree-ssa-dse.h (delete_dead_or_redundant_assignment): Adjust.
1428 2021-11-18 Hongyu Wang <hongyu.wang@intel.com>
1430 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
1431 Adjust generated cfg to avoid infinite loop.
1433 2021-11-18 konglin1 <lingling.kong@intel.com>
1435 * config/i386/avx512fp16intrin.h (_mm512_mul_pch): Add alias for _mm512_fmul_pch.
1436 (_mm512_mask_mul_pch): Likewise.
1437 (_mm512_maskz_mul_pch): Likewise.
1438 (_mm512_mul_round_pch): Likewise.
1439 (_mm512_mask_mul_round_pch): Likewise.
1440 (_mm512_maskz_mul_round_pch): Likewise.
1441 (_mm512_cmul_pch): Likewise.
1442 (_mm512_mask_cmul_pch): Likewise.
1443 (_mm512_maskz_cmul_pch): Likewise.
1444 (_mm512_cmul_round_pch): Likewise.
1445 (_mm512_mask_cmul_round_pch): Likewise.
1446 (_mm512_maskz_cmul_round_pch): Likewise.
1447 (_mm_mul_sch): Likewise.
1448 (_mm_mask_mul_sch): Likewise.
1449 (_mm_maskz_mul_sch): Likewise.
1450 (_mm_mul_round_sch): Likewise.
1451 (_mm_mask_mul_round_sch): Likewise.
1452 (_mm_maskz_mul_round_sch): Likewise.
1453 (_mm_cmul_sch): Likewise.
1454 (_mm_mask_cmul_sch): Likewise.
1455 (_mm_maskz_cmul_sch): Likewise.
1456 (_mm_cmul_round_sch): Likewise.
1457 (_mm_mask_cmul_round_sch): Likewise.
1458 (_mm_maskz_cmul_round_sch): Likewise.
1459 * config/i386/avx512fp16vlintrin.h (_mm_mul_pch): Likewise.
1460 (_mm_mask_mul_pch): Likewise.
1461 (_mm_maskz_mul_pch): Likewise.
1462 (_mm256_mul_pch): Likewise.
1463 (_mm256_mask_mul_pch): Likewise.
1464 (_mm256_maskz_mul_pch): Likewise.
1465 (_mm_cmul_pch): Likewise.
1466 (_mm_mask_cmul_pch): Likewise.
1467 (_mm_maskz_cmul_pch): Likewise.
1468 (_mm256_cmul_pch): Likewise.
1469 (_mm256_mask_cmul_pch): Likewise.
1470 (_mm256_maskz_cmul_pch): Likewise.
1472 2021-11-17 Andrew Pinski <apinski@marvell.com>
1474 PR tree-optimization/103228
1475 PR tree-optimization/55177
1476 * match.pd ((type) X bitop CST): Also do this
1477 transformation for nop conversions.
1479 2021-11-17 Martin Sebor <msebor@redhat.com>
1481 PR tree-optimization/102759
1482 * gimple-array-bounds.cc (build_printable_array_type): Move...
1483 * gimple-ssa-warn-access.cc (build_printable_array_type): Avoid
1484 pathological function redeclarations that remove a previously
1486 Improve formatting of function arguments in informational notes.
1487 * pointer-query.cc (build_printable_array_type): ...to here.
1488 * pointer-query.h (build_printable_array_type): Declared.
1490 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
1493 * config/i386/i386-opts.h (harden_sls): New enum.
1494 * config/i386/i386.c (output_indirect_thunk): Mitigate against
1495 SLS for function return.
1496 (ix86_output_function_return): Likewise.
1497 (ix86_output_jmp_thunk_or_indirect): Mitigate against indirect
1499 (ix86_output_indirect_jmp): Likewise.
1500 (ix86_output_call_insn): Likewise.
1501 * config/i386/i386.opt: Add -mharden-sls=.
1502 * doc/invoke.texi: Document -mharden-sls=.
1504 2021-11-17 H.J. Lu <hjl.tools@gmail.com>
1507 * config/i386/i386.c (ix86_code_end): Remove "%!" before ret.
1508 (ix86_output_function_return): Likewise.
1509 * config/i386/i386.md (simple_return_pop_internal): Likewise.
1511 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1514 * ipa-modref.c (read_modref_records): Fix streaminig in of every_access
1517 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
1519 * config/i386/i386.c (indirect_thunks_used): Redefine as HARD_REG_SET.
1520 (ix86_code_end): Use TEST_HARD_REG_BIT on indirect_thunks_used.
1521 (ix86_output_indirect_branch_via_reg): Use SET_HARD_REG_BIT
1522 on indirect_thunks_used.
1523 (ix86_output_indirect_function_return): Ditto.
1525 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1527 * ipa-modref-tree.c: Include cgraph.h and tree-streamer.h.
1528 (modref_access_node::stream_out): New member function.
1529 (modref_access_node::stream_in): New member function.
1530 * ipa-modref-tree.h (modref_access_node::stream_out,
1531 modref_access_node::stream_in): Declare.
1532 * ipa-modref.c (modref_summary_lto::useful_p): Free useless kills.
1533 (modref_summary_lto::dump): Dump kills.
1534 (analyze_store): Record kills for LTO
1535 (analyze_stmt): Likewise.
1536 (modref_summaries_lto::duplicate): Duplicate kills.
1537 (write_modref_records): Use new stream_out member function.
1538 (read_modref_records): Likewise.
1539 (modref_write): Stream out kills.
1540 (read_section): Stream in kills
1541 (remap_kills): New function.
1542 (update_signature): Use it.
1544 2021-11-17 Uroš Bizjak <ubizjak@gmail.com>
1546 * config/i386/i386.h (LEGACY_SSE_REGNO_P): New predicate.
1547 (SSE_REGNO_P): Use LEGACY_SSE_REGNO_P predicate.
1548 * config/i386/i386.c (zero_all_vector_registers):
1549 Use LEGACY_SSE_REGNO_P predicate.
1550 (ix86_register_priority): Use REX_INT_REGNO_P, REX_SSE_REGNO_P
1551 and EXT_REG_SSE_REGNO_P predicates.
1552 (ix86_hard_regno_call_part_clobbered): Use REX_SSE_REGNO_P
1553 and LEGACY_SSE_REGNO_P predicates.
1555 2021-11-17 Jason Merrill <jason@redhat.com>
1557 * doc/invoke.texi (C++ Dialect Options): Document
1558 -fimplicit-constexpr.
1560 2021-11-17 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
1562 * config/aarch64/aarch64-modes.def (VECTOR_MODE): New V8DI mode.
1563 * config/aarch64/aarch64.c (aarch64_hard_regno_mode_ok): Handle
1565 * config/aarch64/iterators.md (define_mode_attr nunits): Add entry
1568 2021-11-17 Martin Uecker <uecker@gcc.gnu.org>
1572 * gimplify.c (gimplify_var_or_parm_decl): Update comment.
1573 (gimplify_compound_lval): Gimplify base expression first.
1574 (gimplify_target_expr): Add comment.
1576 2021-11-17 Jakub Jelinek <jakub@redhat.com>
1578 PR tree-optimization/103192
1579 * tree-ssa-loop-im.c (move_computations_worker): Use
1580 reset_flow_sensitive_info instead of manually clearing
1581 SSA_NAME_RANGE_INFO and do it for all SSA_NAMEs, not just ones
1582 with integral types.
1584 2021-11-17 Jakub Jelinek <jakub@redhat.com>
1586 PR tree-optimization/103255
1587 * gimple-range-fold.cc (fold_using_range::range_of_address): Return
1588 range_nonzero rather than unadjusted base's range. Formatting fixes.
1590 2021-11-17 Richard Sandiford <richard.sandiford@arm.com>
1592 * doc/md.texi (cond_fmin@var{mode}, cond_fmax@var{mode}): Document.
1593 * optabs.def (cond_fmin_optab, cond_fmax_optab): New optabs.
1594 * internal-fn.def (COND_FMIN, COND_FMAX): New functions.
1595 * internal-fn.c (first_commutative_argument): Handle them.
1596 (FOR_EACH_COND_FN_PAIR): Likewise.
1597 * match.pd (UNCOND_BINARY, COND_BINARY): Likewise.
1598 * config/aarch64/aarch64-sve.md (cond_<fmaxmin><mode>): New
1601 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
1603 * config/i386/i386.md (*add<dwi>3_doubleword, *addv<dwi>4_doubleword,
1604 *addv<dwi>4_doubleword_1, *sub<dwi>3_doubleword,
1605 *subv<dwi>4_doubleword, *subv<dwi>4_doubleword_1,
1606 *add<dwi>3_doubleword_cc_overflow_1, *divmodsi4_const,
1607 *neg<dwi>2_doubleword, *tls_dynamic_gnu2_combine_64_<mode>): Fix split
1610 2021-11-17 Andrew Pinski <apinski@marvell.com>
1612 PR tree-optimization/103288
1613 * tree-ssa-phiopt.c (value_replacement): Return early if middle
1614 block has more than one pred.
1616 2021-11-17 Kewen Lin <linkw@linux.ibm.com>
1618 * config/visium/visium.md (*add<mode>3_insn, *addsi3_insn, *addi3_insn,
1619 *sub<mode>3_insn, *subsi3_insn, *subdi3_insn, *neg<mode>2_insn,
1620 *negdi2_insn, *and<mode>3_insn, *ior<mode>3_insn, *xor<mode>3_insn,
1621 *one_cmpl<mode>2_insn, *ashl<mode>3_insn, *ashr<mode>3_insn,
1622 *lshr<mode>3_insn, *trunchiqi2_insn, *truncsihi2_insn,
1623 *truncdisi2_insn, *extendqihi2_insn, *extendqisi2_insn,
1624 *extendhisi2_insn, *extendsidi2_insn, *zero_extendqihi2_insn,
1625 *zero_extendqisi2_insn, *zero_extendsidi2_insn): Fix split condition.
1627 2021-11-17 Marek Polacek <polacek@redhat.com>
1629 PR preprocessor/103026
1630 * doc/invoke.texi: Document -Wbidi-chars.
1632 2021-11-17 Jan Hubicka <hubicka@ucw.cz>
1635 * ipa-modref.c (ipa_merge_modref_summary_after_inlining): Fix clearing
1638 2021-11-16 Jan Hubicka <jh@suse.cz>
1640 * ipa-modref.c (get_modref_function_summary): Declare.
1641 * ipa-modref.h (get_modref_function_summary): New function.
1642 * tree-ssa-dse.c (clear_live_bytes_for_ref): Break out from ...
1643 (clear_bytes_written_by): ... here; also clear memory killed by
1646 2021-11-16 Iain Sandoe <iain@sandoe.co.uk>
1648 * ggc-common.c (gt_pch_save): If we cannot find a suitable
1649 memory segment for save, then error-out, do not try to
1651 (gt_pch_restore): Save the existing line table, and when
1652 the replacement is being read, use that when constructing
1655 2021-11-16 Peter Bergner <bergner@linux.ibm.com>
1658 * config/rs6000/mma.md (*vsx_assemble_pair): Add early-clobber for
1660 (*mma_assemble_acc): Likewise.
1662 2021-11-16 Martin Sebor <msebor@redhat.com>
1664 PR tree-optimization/102960
1665 * gimple-fold.c (get_range_strlen): Take bitmap as an argument rather
1666 than a pointer to it.
1667 (get_range_strlen_tree): Same. Remove bitmap allocation. Use
1669 (get_maxval_strlen): Use an auto_bitmap.
1670 * tree-ssa-strlen.c (get_range_strlen_dynamic): Factor out PHI
1672 (get_range_strlen_phi): ...into this function.
1673 Avoid assuming maximum string length is constant
1674 (printf_strlen_execute): Dump pointer query cache contents when
1675 details are requisted.
1677 2021-11-16 Jason Merrill <jason@redhat.com>
1679 * langhooks.h (struct lang_hooks): Adjust comment.
1680 * print-tree.c (print_node): Also call print_xnode hook for
1683 2021-11-16 Andrew Pinski <apinski@marvell.com>
1685 PR tree-optimization/103218
1686 * match.pd: New pattern for "((type)(a<0)) << SIGNBITOFA".
1688 2021-11-16 Claudiu Zissulescu <claziss@synopsys.com>
1690 * config/arc/arc.md (maddhisi4): Use a single move to accumulator.
1691 (umaddhisi4): Likewise.
1692 (machi): Update pattern.
1695 2021-11-16 Richard Biener <rguenther@suse.de>
1697 PR tree-optimization/102880
1698 * tree-ssa-dce.c (sort_phi_args): New function.
1699 (make_forwarders_with_degenerate_phis): Likewise.
1700 (perform_tree_ssa_dce): Call
1701 make_forwarders_with_degenerate_phis.
1703 2021-11-16 Richard Biener <rguenther@suse.de>
1705 PR tree-optimization/102880
1706 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Push
1707 single_pred (bb1) condition to places that really need it.
1708 (match_simplify_replacement): Likewise.
1709 (value_replacement): Likewise.
1710 (replace_phi_edge_with_variable): Deal with extra edges
1713 2021-11-16 Martin Jambor <mjambor@suse.cz>
1715 * cfgexpand.c (expand_gimple_basic_block): Use build_debug_expr_decl,
1716 add a fixme note about the mode assignment perhaps being unnecessary.
1717 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
1719 (ipa_param_body_adjustments::mark_dead_statements): Likewise.
1720 (ipa_param_body_adjustments::reset_debug_stmts): Likewise.
1721 * tree-inline.c (remap_ssa_name): Likewise.
1722 (tree_function_versioning): Likewise.
1723 * tree-into-ssa.c (rewrite_debug_stmt_uses): Likewise.
1724 * tree-ssa-loop-ivopts.c (remove_unused_ivs): Likewise.
1725 * tree-ssa.c (insert_debug_temp_for_var_def): Likewise.
1727 2021-11-16 Jakub Jelinek <jakub@redhat.com>
1729 PR tree-optimization/103208
1730 * omp-expand.c (expand_omp_build_cond): New function.
1731 (expand_omp_for_init_counts, expand_omp_for_init_vars,
1732 expand_omp_for_static_nochunk, expand_omp_for_static_chunk): Use it.
1734 2021-11-16 Jakub Jelinek <jakub@redhat.com>
1736 PR tree-optimization/102009
1737 * gimple-ssa-warn-access.cc (pass_waccess::check_alloc_size_call):
1738 Punt if any of alloc_size arguments is out of bounds vs. number of
1741 2021-11-16 Roger Sayle <roger@nextmovesoftware.com>
1743 * config/i386/i386.md (*bmi2_rorx<mode3>_1): Make conditional
1744 on !optimize_function_for_size_p.
1745 (*<any_rotate><mode>3_1): Add preferred_for_size attribute.
1746 (define_splits): Conditionalize on !optimize_function_for_size_p.
1747 (*bmi2_rorxsi3_1_zext): Likewise.
1748 (*<any_rotate>si2_1_zext): Add preferred_for_size attribute.
1749 (define_splits): Conditionalize on !optimize_function_for_size_p.
1751 2021-11-16 Jan Hubicka <jh@suse.cz>
1754 * ipa-modref.c (merge_call_side_effects): Fix uninitialized
1757 2021-11-16 Andrew Pinski <apinski@marvell.com>
1759 PR tree-optimization/103245
1760 * match.pd: Combine the abs pattern matching using multiplication.
1761 Adding optional nop_convert too.
1763 2021-11-16 H.J. Lu <hjl.tools@gmail.com>
1765 PR middle-end/103268
1766 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Add a missing
1769 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
1771 * gimple-fold.c (gimple_fold_builtin_strncat): Use ranges to
1772 determine if it is safe to transform to strcat.
1773 (gimple_fold_builtin_snprintf): Likewise.
1775 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
1777 * gimple-fold.c (known_lower): New function.
1778 (gimple_fold_builtin_strncat_chk,
1779 gimple_fold_builtin_memory_chk, gimple_fold_builtin_stxcpy_chk,
1780 gimple_fold_builtin_stxncpy_chk,
1781 gimple_fold_builtin_snprintf_chk,
1782 gimple_fold_builtin_sprintf_chk): Use it.
1784 2021-11-15 Siddhesh Poyarekar <siddhesh@gotplt.org>
1786 * gimple-fold.c (dump_transformation): New function.
1787 (gimple_fold_builtin_stxcpy_chk,
1788 gimple_fold_builtin_stxncpy_chk): Use it. Simplify to
1789 BUILT_IN_STRNCPY if return value is not used.
1791 2021-11-15 H.J. Lu <hjl.tools@gmail.com>
1793 PR middle-end/103184
1794 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Check optab
1795 before transforming equivalent, but slighly different cases to
1796 their canonical forms.
1798 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
1801 * config/darwin.h (TARGET_DTORS_FROM_CXA_ATEXIT): New.
1802 * doc/tm.texi: Regenerated.
1803 * doc/tm.texi.in: Add TARGET_DTORS_FROM_CXA_ATEXIT hook.
1804 * ipa.c (cgraph_build_static_cdtor_1): Return the built
1806 (build_cxa_atexit_decl): New.
1807 (build_dso_handle_decl): New.
1808 (build_cxa_dtor_registrations): New.
1809 (compare_cdtor_tu_order): New.
1810 (build_cxa_atexit_fns): New.
1811 (ipa_cdtor_merge): If dtors_from_cxa_atexit is set,
1812 process the DTORs/CTORs accordingly.
1813 (pass_ipa_cdtor_merge::gate): Also run if
1814 dtors_from_cxa_atexit is set.
1815 * target.def (dtors_from_cxa_atexit): New hook.
1817 2021-11-15 Iain Sandoe <iain@sandoe.co.uk>
1819 * config.in: Regenerate.
1820 * configure: Regenerate.
1821 * configure.ac: Test ld64 for -platform-version support.
1823 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
1825 * diagnostic-spec.h (typedef xint_hash_t)
1826 (typedef xint_hash_map_t): Replace with...
1827 (typedef nowarn_map_t): ... this.
1828 (nowarn_map): Adjust.
1829 * diagnostic-spec.c (nowarn_map, suppress_warning_at): Likewise.
1831 2021-11-15 Thomas Schwinge <thomas@codesourcery.com>
1833 * profile.c (branch_prob): Use 'location_hash' for
1836 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
1838 PR tree-optimization/103207
1839 * value-range.cc (irange::set): Drop overflow.
1841 2021-11-15 Jakub Jelinek <jakub@redhat.com>
1843 * gimplify.c (optimize_target_teams): Only add OMP_CLAUSE_THREAD_LIMIT
1844 to OMP_TARGET_CLAUSES if it isn't there already.
1846 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
1848 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
1849 Remove useless code.
1850 (path_range_query::ssa_defined_in_bb): New.
1851 (path_range_query::ssa_range_in_phi): Avoid fold_range call that
1852 could trigger additional lookups.
1853 Do not use the cache for ARGs defined in this block.
1854 (path_range_query::compute_ranges_in_block): Use ssa_defined_in_bb.
1855 (path_range_query::maybe_register_phi_relation): Same.
1856 (path_range_query::range_of_stmt): Adjust comment.
1857 * gimple-range-path.h (ssa_defined_in_bb): New.
1859 2021-11-15 Aldy Hernandez <aldyh@redhat.com>
1861 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
1862 Default to global range if nothing found.
1864 2021-11-15 Richard Biener <rguenther@suse.de>
1866 PR tree-optimization/103237
1867 * tree-vect-loop.c (vect_is_simple_reduction): Fail for
1868 double reductions with multiple inner loop LC PHI nodes.
1870 2021-11-15 Hongyu Wang <hongyu.wang@intel.com>
1873 * config/i386/i386-expand.c (ix86_expand_atomic_fetch_op_loop):
1874 New expand function.
1875 * config/i386/i386-options.c (ix86_target_string): Add
1876 -mrelax-cmpxchg-loop flag.
1877 (ix86_valid_target_attribute_inner_p): Likewise.
1878 * config/i386/i386-protos.h (ix86_expand_atomic_fetch_op_loop):
1879 New expand function prototype.
1880 * config/i386/i386.opt: Add -mrelax-cmpxchg-loop.
1881 * config/i386/sync.md (atomic_fetch_<logic><mode>): New expander
1883 (atomic_<logic>_fetch<mode>): Likewise.
1884 (atomic_fetch_nand<mode>): Likewise.
1885 (atomic_nand_fetch<mode>): Likewise.
1886 (atomic_fetch_<logic><mode>): New expander for DI,TI modes.
1887 (atomic_<logic>_fetch<mode>): Likewise.
1888 (atomic_fetch_nand<mode>): Likewise.
1889 (atomic_nand_fetch<mode>): Likewise.
1890 * doc/invoke.texi: Document -mrelax-cmpxchg-loop.
1892 2021-11-15 Richard Biener <rguenther@suse.de>
1894 PR tree-optimization/103219
1895 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Use single_exit
1896 to determine the exit for the VN region.
1898 2021-11-15 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
1900 * tree-ssa-loop.c (pass_vectorize): Move to tree-vectorizer.c.
1901 (pass_data_vectorize): Likewise.
1902 (make_pass_vectorize): Likewise.
1903 * tree-vectorizer.c (vectorize_loops): Merge with
1904 pass_vectorize::execute and replace cfun occurences with fun param.
1905 (adjust_simduid_builtins): Add fun param, replace cfun occurences with
1906 fun, and adjust callers approrpiately.
1907 (note_simd_array_uses): Likewise.
1908 (vect_loop_dist_alias_call): Likewise.
1909 (set_uid_loop_bbs): Likewise.
1910 (vect_transform_loops): Likewise.
1911 (try_vectorize_loop_1): Likewise.
1912 (try_vectorize_loop): Likewise.
1914 2021-11-15 Jakub Jelinek <jakub@redhat.com>
1917 * config/i386/sync.md (atomic_bit_test_and_set<mode>,
1918 atomic_bit_test_and_complement<mode>,
1919 atomic_bit_test_and_reset<mode>): Use OPTAB_WIDEN instead of
1922 2021-11-15 Maciej W. Rozycki <macro@embecosm.com>
1924 * config/vax/vax.c (vax_lra_p): New prototype and function.
1925 (TARGET_LRA_P): Wire it.
1926 * config/vax/vax.opt (mlra): New option.
1927 * doc/invoke.texi (Option Summary, VAX Options): Document the
1930 2021-11-14 Jan Hubicka <jh@suse.cz>
1932 * ipa-modref.h (struct modref_summary): Add nondeterministic
1933 and calls_interposable flags.
1934 * ipa-modref.c (modref_summary::modref_summary): Initialize new flags.
1935 (modref_summary::useful_p): Check new flags.
1936 (struct modref_summary_lto): Add nondeterministic and
1937 calls_interposable flags.
1938 (modref_summary_lto::modref_summary_lto): Initialize new flags.
1939 (modref_summary_lto::useful_p): Check new flags.
1940 (modref_summary::dump): Dump new flags.
1941 (modref_summary_lto::dump): Dump new flags.
1942 (ignore_nondeterminism_p): New function.
1943 (merge_call_side_effects): Merge new flags.
1944 (process_fnspec): Likewise.
1945 (analyze_load): Volatile access is nondeterministic.
1946 (analyze_store): Liekwise.
1947 (analyze_stmt): Volatile ASM is nondeterministic.
1948 (analyze_function): Clear new flags.
1949 (modref_summaries::duplicate): Duplicate new flags.
1950 (modref_summaries_lto::duplicate): Duplicate new flags.
1951 (modref_write): Stream new flags.
1952 (read_section): Stream new flags.
1953 (propagate_unknown_call): Update new flags.
1954 (modref_propagate_in_scc): Propagate new flags.
1955 * tree-ssa-alias.c (ref_maybe_used_by_call_p_1): Check
1957 * tree-ssa-structalias.c (determine_global_memory_access):
1960 2021-11-14 Maciej W. Rozycki <macro@embecosm.com>
1962 * config/vax/vax.h (SET_RATIO): New macro.
1963 * config/vax/vax.md (UNSPEC_SETMEM_FILL): New constant.
1964 (setmemhi): New expander.
1965 (setmemhi1): New insn and splitter.
1966 (*setmemhi1): New insn.
1968 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
1970 * ipa-modref-tree.c (modref_access_node::update_for_kills): New
1972 (modref_access_node::merge_for_kills): Likewise.
1973 (modref_access_node::insert_kill): Likewise.
1974 * ipa-modref-tree.h (modref_access_node::update_for_kills,
1975 modref_access_node::merge_for_kills, modref_access_node::insert_kill):
1977 (modref_access_node::useful_for_kill): New member function.
1978 * ipa-modref.c (modref_summary::useful_p): Release useless kills.
1979 (lto_modref_summary): Add kills.
1980 (modref_summary::dump): Dump kills.
1981 (record_access): Add mdoref_access_node parameter.
1982 (record_access_lto): Likewise.
1983 (merge_call_side_effects): Merge kills.
1984 (analyze_call): Add ALWAYS_EXECUTED param and pass it around.
1985 (struct summary_ptrs): Add always_executed filed.
1986 (analyze_load): Update.
1987 (analyze_store): Update; record kills.
1988 (analyze_stmt): Add always_executed; record kills in clobbers.
1989 (analyze_function): Track always_executed.
1990 (modref_summaries::duplicate): Duplicate kills.
1991 (update_signature): Release kills.
1992 * ipa-modref.h (struct modref_summary): Add kills.
1993 * tree-ssa-alias.c (alias_stats): Add kill stats.
1994 (dump_alias_stats): Dump kill stats.
1995 (store_kills_ref_p): Break out from ...
1996 (stmt_kills_ref_p): Use it; handle modref info based kills.
1998 2021-11-14 Aldy Hernandez <aldyh@redhat.com>
2000 PR tree-optimization/103229
2001 * gimple-range-cache.cc (ssa_global_cache::clear): Do not pass
2002 null value to memset.
2004 2021-11-14 Jan Hubicka <hubicka@ucw.cz>
2006 * ipa-modref-tree.c (modref_access_node::get_call_arg): New member
2008 (modref_access_node::get_ao_ref): Likewise.
2009 * ipa-modref-tree.h (modref_access_node::get_call_arg): Declare.
2010 (modref_access_node::get_ao_ref): Declare.
2011 * tree-ssa-alias.c (modref_may_conflict): Use new accessors.
2012 * tree-ssa-dse.c (dse_optimize_call): Use new accessors.
2014 2021-11-13 Jan Hubicka <jh@suse.cz>
2017 * dbgcnt.def (ipa_attr): New counters.
2018 * ipa-pure-const.c: Include dbgcnt.c
2019 (ipa_make_function_const): Use debug counter.
2020 (ipa_make_function_pure): Likewise.
2021 (propagate_pure_const): Fix bug in my previous change.
2023 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2025 * ipa-modref-tree.c (modref_access_node::range_info_useful_p):
2026 Offline from ipa-modref-tree.h.
2027 (modref_access_node::dump): Move from ipa-modref.c; make member
2029 * ipa-modref-tree.h (modref_access_node::range_info_useful_p.
2030 modref_access_node::dump): Declare.
2031 * ipa-modref.c (dump_access): Remove.
2032 (dump_records): Update.
2033 (dump_lto_records): Update.
2034 (record_access): Update.
2035 (record_access_lto): Update.
2037 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2039 * ipa-modref.c (modref_summary::modref_summary): Clear new flags.
2040 (modref_summary::dump): Dump try_dse.
2041 (modref_summary::finalize): Add FUN attribute; compute try-dse.
2042 (analyze_function): Update.
2043 (read_section): Update.
2044 (update_signature): Update.
2045 (pass_ipa_modref::execute): Update.
2046 * ipa-modref.h (struct modref_summary):
2047 * tree-ssa-alias.c (ao_ref_init_from_ptr_and_range): Export.
2048 * tree-ssa-alias.h (ao_ref_init_from_ptr_and_range): Declare.
2049 * tree-ssa-dse.c (dse_optimize_call): New function.
2050 (dse_optimize_stmt): Use it.
2052 2021-11-13 Jan Hubicka <hubicka@ucw.cz>
2054 * ipa-modref-tree.c: Move #if CHECKING_P to proper place.
2056 2021-11-13 Jan Hubicka <jh@suse.cz>
2059 (struct modref_access_node): Move longer member functions to
2061 (modref_ref_node::try_merge_with): Turn into modreef_acces_node member
2063 * ipa-modref-tree.c (modref_access_node::contains): Move here
2064 from ipa-modref-tree.h.
2065 (modref_access_node::update): Likewise.
2066 (modref_access_node::merge): Likewise.
2067 (modref_access_node::closer_pair_p): Likewise.
2068 (modref_access_node::forced_merge): Likewise.
2069 (modref_access_node::update2): Likewise.
2070 (modref_access_node::combined_offsets): Likewise.
2071 (modref_access_node::try_merge_with): Likewise.
2072 (modref_access_node::insert): Likewise.
2074 2021-11-13 Jan Hubicka <jh@suse.cz>
2076 * ipa-modref.c (modref_summary::global_memory_read_p): Remove.
2077 (modref_summary::global_memory_written_p): Remove.
2078 (modref_summary::dump): Dump new flags.
2079 (modref_summary::finalize): New member function.
2080 (analyze_function): Call it.
2081 (read_section): Call it.
2082 (update_signature): Call it.
2083 (pass_ipa_modref::execute): Call it.
2084 * ipa-modref.h (struct modref_summary): Remove
2085 global_memory_read_p and global_memory_written_p.
2086 Add global_memory_read, global_memory_written.
2087 * tree-ssa-structalias.c (determine_global_memory_access):
2090 2021-11-13 Jan Hubicka <jh@suse.cz>
2092 * ipa-fnsummary.c (compute_fn_summary): Use type_attribut_allowed_p
2093 * ipa-param-manipulation.c
2094 (ipa_param_adjustments::type_attribute_allowed_p):
2095 New member function.
2096 (drop_type_attribute_if_params_changed_p): New function.
2097 (build_adjusted_function_type): Use it.
2098 * ipa-param-manipulation.h: Add type_attribute_allowed_p.
2100 2021-11-13 David Malcolm <dmalcolm@redhat.com>
2102 * doc/invoke.texi (Static Analyzer Options): Add
2103 -Wno-analyzer-tainted-allocation-size,
2104 -Wno-analyzer-tainted-divisor, -Wno-analyzer-tainted-offset, and
2105 -Wno-analyzer-tainted-size to list. Add
2106 -Wanalyzer-tainted-allocation-size, -Wanalyzer-tainted-divisor,
2107 -Wanalyzer-tainted-offset, and -Wanalyzer-tainted-size to list
2108 of options effectively enabled by -fanalyzer.
2109 (-Wanalyzer-tainted-allocation-size): New.
2110 (-Wanalyzer-tainted-array-index): Tweak wording; add link to CWE.
2111 (-Wanalyzer-tainted-divisor): New.
2112 (-Wanalyzer-tainted-offset): New.
2113 (-Wanalyzer-tainted-size): New.
2115 2021-11-13 Jan Hubicka <jh@suse.cz>
2117 * attr-fnspec.h (attr_fnspec::arg_eaf_flags): Break out from ...
2118 * gimple.c (gimple_call_arg_flags): ... here.
2119 * ipa-modref.c (analyze_parms): Record flags known from fnspec.
2120 (modref_merge_call_site_flags): Use arg_eaf_flags.
2122 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
2124 PR tree-optimization/103222
2125 * gimple-range-path.cc (path_range_query::compute_ranges_in_phis):
2127 (path_range_query::compute_ranges_in_block): Call
2128 compute_ranges_in_phis.
2129 * gimple-range-path.h (path_range_query::compute_ranges_in_phis):
2132 2021-11-13 Jan Hubicka <jh@suse.cz>
2134 * ipa-fnsummary.c (compute_fn_summary): Do not give up on signature
2135 changes on "fn spec" attribute; give up on varadic types.
2136 * ipa-param-manipulation.c: Include attribs.h.
2137 (build_adjusted_function_type): New parameter ARG_MODIFIED; if it is
2138 true remove "fn spec" attribute.
2139 (ipa_param_adjustments::build_new_function_type): Update.
2140 (ipa_param_body_adjustments::modify_formal_parameters): update.
2141 * ipa-sra.c: Include attribs.h.
2142 (ipa_sra_preliminary_function_checks): Do not check for TYPE_ATTRIBUTES.
2144 2021-11-13 Aldy Hernandez <aldyh@redhat.com>
2146 * gimple-range-path.cc (path_range_query::path_range_query): Merge
2148 (path_range_query::import_p): Move from header file.
2149 (path_range_query::~path_range_query): Adjust for combined ctors.
2150 * gimple-range-path.h: Merge ctors.
2151 (path_range_query::import_p): Move to .cc file.
2153 2021-11-13 Jan Hubicka <jh@suse.cz>
2155 PR tree-optimization/103182
2156 * ipa-modref.c (callee_to_caller_flags): Fix merging of flags.
2157 (modref_eaf_analysis::analyze_ssa_name): Fix merging of flags.
2159 2021-11-12 Stafford Horne <shorne@gmail.com>
2161 * config/or1k/or1k-protos.h (or1k_profile_hook): New function.
2162 * config/or1k/or1k.h (PROFILE_HOOK): Change macro to reference
2163 new function or1k_profile_hook.
2164 * config/or1k/or1k.c (struct machine_function): Add new field
2165 set_mcount_arg_insn.
2166 (or1k_profile_hook): New function.
2167 (or1k_init_pic_reg): Update to inject pic rtx after _mcount arg
2169 (or1k_frame_pointer_required): Frame pointer no longer needed
2172 2021-11-12 Jan Hubicka <jh@suse.cz>
2174 PR tree-optimization/103209
2175 * tree-ssa-structalias.c (find_func_aliases_for_call): Fix
2176 use of handle_rhs_call
2178 2021-11-12 Aldy Hernandez <aldyh@redhat.com>
2180 PR tree-optimization/103202
2181 * gimple-range-path.cc
2182 (path_range_query::compute_ranges_in_block): Solve PHI imports first.
2184 2021-11-12 Jan Hubicka <jh@suse.cz>
2186 * ipa-pure-const.c (propagate_pure_const): Remove redundant check;
2187 fix call of ipa_make_function_const and ipa_make_function_pure.
2189 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2191 * config/aarch64/aarch64.c (aarch64_vector_op::n_advsimd_ops): Delete.
2192 (aarch64_vector_op::m_seen_loads): Likewise.
2193 (aarch64_vector_costs::aarch64_vector_costs): Don't push to
2195 (aarch64_vector_op::count_ops): Remove vectype and factor parameters.
2196 Remove code that tries to predict different vec_flags from the
2198 (aarch64_vector_costs::add_stmt_cost): Update accordingly.
2199 Remove m_advsimd_ops handling.
2201 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2203 * config/aarch64/aarch64.c (aarch64_vector_costs::m_saw_sve_only_op)
2204 (aarch64_sve_only_stmt_p): Delete.
2205 (aarch64_vector_costs::prefer_unrolled_loop): New function,
2206 extracted from adjust_body_cost.
2207 (aarch64_vector_costs::better_main_loop_than_p): New function,
2208 using heuristics extracted from adjust_body_cost and
2209 adjust_body_cost_sve.
2210 (aarch64_vector_costs::adjust_body_cost_sve): Remove
2211 advsimd_cycles_per_iter and could_use_advsimd parameters.
2212 Update after changes above.
2213 (aarch64_vector_costs::adjust_body_cost): Update after changes above.
2215 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2217 * config/aarch64/aarch64.c (aarch64_vec_op_count::m_vf_factor):
2218 New member variable.
2219 (aarch64_vec_op_count::aarch64_vec_op_count): Add a parameter for it.
2220 (aarch64_vec_op_count::vf_factor): New function.
2221 (aarch64_vector_costs::aarch64_vector_costs): When costing for
2222 neoverse-512tvb, pass a vf_factor of 2 for the Neoverse V1 version
2224 (aarch64_vector_costs::adjust_body_cost): Read the vf factor
2225 instead of hard-coding 2.
2227 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2229 * config/aarch64/aarch64.c
2230 (aarch64_vec_op_count::rename_cycles_per_iter): New function.
2231 (aarch64_vec_op_count::min_nonpred_cycles_per_iter): Likewise.
2232 (aarch64_vec_op_count::min_pred_cycles_per_iter): Likewise.
2233 (aarch64_vec_op_count::min_cycles_per_iter): Likewise.
2234 (aarch64_vec_op_count::dump): Move earlier in file. Dump the
2235 above properties too.
2236 (aarch64_estimate_min_cycles_per_iter): Delete.
2237 (adjust_body_cost): Use aarch64_vec_op_count::min_cycles_per_iter
2238 instead of aarch64_estimate_min_cycles_per_iter. Rely on the dump
2239 routine to print CPI estimates.
2240 (adjust_body_cost_sve): Likewise. Use the other functions above
2241 instead of doing the work inline.
2243 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2245 * config/aarch64/aarch64.c (aarch64_vec_op_count): Allow default
2247 (aarch64_vec_op_count::base_issue_info): Remove handling of null
2249 (aarch64_vec_op_count::simd_issue_info): Likewise.
2250 (aarch64_vec_op_count::sve_issue_info): Likewise.
2251 (aarch64_vector_costs::m_ops): Turn into a vector.
2252 (aarch64_vector_costs::m_advsimd_ops): Likewise.
2253 (aarch64_vector_costs::aarch64_vector_costs): Add entries to
2254 the vectors based on aarch64_tune_params.
2255 (aarch64_vector_costs::analyze_loop_vinfo): Update the pred_ops
2256 of all entries in m_ops.
2257 (aarch64_vector_costs::add_stmt_cost): Call count_ops for all
2259 (aarch64_estimate_min_cycles_per_iter): Remove issue_info
2260 parameter and get the information from the ops instead.
2261 (aarch64_vector_costs::adjust_body_cost_sve): Take a
2262 aarch64_vec_issue_info instead of a aarch64_vec_op_count.
2263 (aarch64_vector_costs::adjust_body_cost): Update call accordingly.
2264 Exit earlier if m_ops is empty for either cost structure.
2266 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2268 * config/aarch64/aarch64.c (aarch64_vector_costs::m_scalar_ops)
2269 (aarch64_vector_costs::m_sve_ops): Replace with...
2270 (aarch64_vector_costs::m_ops): ...this.
2271 (aarch64_vector_costs::analyze_loop_vinfo): Update accordingly.
2272 (aarch64_vector_costs::adjust_body_cost_sve): Likewise.
2273 (aarch64_vector_costs::aarch64_vector_costs): Likewise.
2274 Initialize m_vec_flags here rather than in add_stmt_cost.
2275 (aarch64_vector_costs::count_ops): Test for scalar reductions too.
2276 Allow vectype to be null.
2277 (aarch64_vector_costs::add_stmt_cost): Call count_ops for scalar
2278 code too. Don't require vectype to be nonnull.
2279 (aarch64_vector_costs::adjust_body_cost): Take the loop_vec_info
2280 and scalar costs as parameters. Use the scalar costs to determine
2281 the cycles per iteration of the scalar loop, then multiply it
2282 by the estimated VF.
2283 (aarch64_vector_costs::finish_cost): Update call accordingly.
2285 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2287 * config/aarch64/aarch64.c (aarch64_dr_type): New function.
2288 (aarch64_vector_costs::count_ops): Use it rather than the
2289 vectype to determine floatness.
2291 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2293 * config/aarch64/aarch64.c (aarch64_sve_in_loop_reduction_latency):
2294 Remove vectype parameter and get floatness from the type of the
2296 (arch64_in_loop_reduction_latency): Likewise.
2297 (aarch64_detect_vector_stmt_subtype): Update caller.
2298 (aarch64_vector_costs::count_ops): Likewise.
2300 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2302 * config/aarch64/aarch64.c (aarch64_sve_op_count): Fold into...
2303 (aarch64_vec_op_count): ...this. Add a constructor.
2304 (aarch64_vec_op_count::vec_flags): New function.
2305 (aarch64_vec_op_count::base_issue_info): Likewise.
2306 (aarch64_vec_op_count::simd_issue_info): Likewise.
2307 (aarch64_vec_op_count::sve_issue_info): Likewise.
2308 (aarch64_vec_op_count::m_issue_info): New member variable.
2309 (aarch64_vec_op_count::m_vec_flags): Likewise.
2310 (aarch64_vector_costs): Add a constructor.
2311 (aarch64_vector_costs::m_sve_ops): Change type to aarch64_vec_op_count.
2312 (aarch64_vector_costs::aarch64_vector_costs): New function.
2313 Initialize m_scalar_ops, m_advsimd_ops and m_sve_ops.
2314 (aarch64_vector_costs::count_ops): Remove vec_flags and
2315 issue_info parameters, using the new aarch64_vec_op_count
2317 (aarch64_vector_costs::add_stmt_cost): Update call accordingly.
2318 (aarch64_sve_op_count::dump): Fold into...
2319 (aarch64_vec_op_count::dump): ..here.
2321 2021-11-12 Richard Sandiford <richard.sandiford@arm.com>
2323 * config/aarch64/aarch64.c: Include tree-dfa.h.
2324 (aarch64_check_consecutive_mems): New function that takes MEM_EXPR
2325 and MEM_OFFSET into account.
2326 (aarch64_swap_ldrstr_operands): Use it.
2327 (aarch64_operands_ok_for_ldpstp): Likewise. Check that the
2328 address of the second memory doesn't depend on the result of
2331 2021-11-12 Jan Hubicka <hubicka@ucw.cz>
2333 * ipa-pure-const.c (ipa_make_function_pure): Fix exit condition.
2335 2021-11-12 Jan Hubicka <jh@suse.cz>
2337 PR tree-optimization/103175
2338 * ipa-modref.c (modref_lattice::merge): Add sanity check.
2339 (callee_to_caller_flags): Make flags adjustment sane.
2340 (modref_eaf_analysis::analyze_ssa_name): Likewise.
2342 2021-11-12 Jan Hubicka <jh@suse.cz>
2345 * ipa-modref.c (analyze_function, modref_propagate_in_scc): Do
2346 not mark pure/const function if there are side-effects.
2348 2021-11-12 Chung-Lin Tang <cltang@codesourcery.com>
2350 * tree.h (OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P): New access macro for
2351 'implicit' bit, using 'base.deprecated_flag' field of tree_node.
2352 * tree-pretty-print.c (dump_omp_clause): Add support for printing
2353 implicit attribute in tree dumping.
2354 * gimplify.c (gimplify_adjust_omp_clauses_1):
2355 Set OMP_CLAUSE_MAP_RUNTIME_IMPLICIT_P to 1 if map clause is implicitly
2357 (gimplify_adjust_omp_clauses): Adjust place of adding implicitly created
2358 clauses, from simple append, to starting of list, after non-map clauses.
2359 * omp-low.c (lower_omp_target): Add GOMP_MAP_IMPLICIT bits into kind
2360 values passed to libgomp for implicit maps.
2362 2021-11-12 Jakub Jelinek <jakub@redhat.com>
2364 * omp-builtins.def (BUILT_IN_GOMP_TEAMS): Remove.
2365 (BUILT_IN_GOMP_TEAMS4): New.
2366 * builtin-types.def (BT_FN_VOID_UINT_UINT): Remove.
2367 (BT_FN_BOOL_UINT_UINT_UINT_BOOL): New.
2368 * omp-low.c (lower_omp_teams): Use GOMP_teams4 instead of
2369 GOMP_teams, pass to it also num_teams lower-bound expression
2370 or a dup of upper-bound if it is missing and a flag whether
2371 it is the first call or not.
2373 2021-11-12 Martin Liska <mliska@suse.cz>
2375 PR tree-optimization/102497
2376 * gimple-predicate-analysis.cc (add_pred): Remove unused
2379 2021-11-12 Richard Biener <rguenther@suse.de>
2381 PR tree-optimization/103204
2382 * tree-ssa-sccvn.c (valueize_refs_1): Re-valueize the
2383 top operand after folding in an address.
2385 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2387 * gimple-range-path.cc (path_range_query::path_range_query): New
2388 ctor without a ranger.
2389 (path_range_query::~path_range_query): Free ranger if necessary.
2390 (path_range_query::range_on_path_entry): Adjust m_ranger for pointer.
2391 (path_range_query::ssa_range_in_phi): Same.
2392 (path_range_query::compute_ranges_in_block): Same.
2393 (path_range_query::compute_imports): Same.
2394 (path_range_query::compute_ranges): Same.
2395 (path_range_query::range_of_stmt): Same.
2396 (path_range_query::compute_outgoing_relations): Same.
2397 * gimple-range-path.h (class path_range_query): New ctor.
2398 * tree-ssa-loop-ch.c (ch_base::copy_headers): Remove gimple_ranger
2399 as path_range_query allocates one.
2400 * tree-ssa-threadbackward.c (class back_threader): Remove m_ranger.
2401 (back_threader::~back_threader): Same.
2403 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2405 * tree-ssa-threadbackward.c
2406 (back_threader_profitability::profitable_path_p): Remove loop
2407 crossing restriction.
2409 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2411 * ipa-modref.c (analyze_function): Do pure/const discovery, return
2413 (pass_modref::execute): If pure/const is discovered fixup cfg.
2414 (ignore_edge): Do not ignore pure/const edges.
2415 (modref_propagate_in_scc): Do pure/const discovery, return true if
2416 cdtor was promoted pure/const.
2417 (pass_ipa_modref::execute): If needed remove unreachable functions.
2418 * ipa-pure-const.c (warn_function_noreturn): Fix whitespace.
2419 (warn_function_cold): Likewise.
2420 (skip_function_for_local_pure_const): Move earlier.
2421 (ipa_make_function_const): Break out from ...
2422 (ipa_make_function_pure): Break out from ...
2423 (propagate_pure_const): ... here.
2424 (pass_local_pure_const::execute): Use it.
2425 * ipa-utils.h (ipa_make_function_const): Declare.
2426 (ipa_make_function_pure): Declare.
2427 * passes.def: Move early modref after pure-const.
2429 2021-11-11 David Malcolm <dmalcolm@redhat.com>
2432 * diagnostic-show-locus.c (def_policy): Use def_tabstop.
2434 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2436 * config/aarch64/aarch64-builtins.c (TYPES_COMBINE): Delete.
2437 (TYPES_COMBINEP): Delete.
2438 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2439 qualified builtins for vcombine_* intrinsics.
2440 * config/aarch64/arm_neon.h (vcombine_s8): Remove unnecessary
2442 (vcombine_s16): Likewise.
2443 (vcombine_s32): Likewise.
2444 (vcombine_f32): Likewise.
2445 (vcombine_u8): Use type-qualified builtin and remove casts.
2446 (vcombine_u16): Likewise.
2447 (vcombine_u32): Likewise.
2448 (vcombine_u64): Likewise.
2449 (vcombine_p8): Likewise.
2450 (vcombine_p16): Likewise.
2451 (vcombine_p64): Likewise.
2452 (vcombine_bf16): Remove unnecessary cast.
2453 * config/aarch64/iterators.md (VD_I): New mode iterator.
2454 (VDC_P): New mode iterator.
2456 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2458 * config/aarch64/aarch64-builtins.c (TYPES_LOAD1_U): Define.
2459 (TYPES_LOAD1_P): Define.
2460 (TYPES_STORE1_U): Define.
2461 (TYPES_STORE1P): Rename to...
2462 (TYPES_STORE1_P): This.
2463 (get_mem_type_for_load_store): Add unsigned and poly types.
2464 (aarch64_general_gimple_fold_builtin): Add unsigned and poly
2465 type-qualified builtin declarations.
2466 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2467 qualified builtins for LD1/ST1.
2468 * config/aarch64/arm_neon.h (vld1_p8): Use type-qualified
2469 builtin and remove cast.
2470 (vld1_p16): Likewise.
2471 (vld1_u8): Likewise.
2472 (vld1_u16): Likewise.
2473 (vld1_u32): Likewise.
2474 (vld1q_p8): Likewise.
2475 (vld1q_p16): Likewise.
2476 (vld1q_p64): Likewise.
2477 (vld1q_u8): Likewise.
2478 (vld1q_u16): Likewise.
2479 (vld1q_u32): Likewise.
2480 (vld1q_u64): Likewise.
2481 (vst1_p8): Likewise.
2482 (vst1_p16): Likewise.
2483 (vst1_u8): Likewise.
2484 (vst1_u16): Likewise.
2485 (vst1_u32): Likewise.
2486 (vst1q_p8): Likewise.
2487 (vst1q_p16): Likewise.
2488 (vst1q_p64): Likewise.
2489 (vst1q_u8): Likewise.
2490 (vst1q_u16): Likewise.
2491 (vst1q_u32): Likewise.
2492 (vst1q_u64): Likewise.
2493 * config/aarch64/iterators.md (VALLP_NO_DI): New iterator.
2495 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2497 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2498 builtins for vector reduction.
2499 * config/aarch64/arm_neon.h (vaddv_u8): Use type-qualified
2500 builtin and remove casts.
2501 (vaddv_u16): Likewise.
2502 (vaddv_u32): Likewise.
2503 (vaddvq_u8): Likewise.
2504 (vaddvq_u16): Likewise.
2505 (vaddvq_u32): Likewise.
2506 (vaddvq_u64): Likewise.
2508 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2510 * config/aarch64/aarch64-simd-builtins.def:
2511 * config/aarch64/arm_neon.h (vpaddq_u8): Use type-qualified
2512 builtin and remove casts.
2513 (vpaddq_u16): Likewise.
2514 (vpaddq_u32): Likewise.
2515 (vpaddq_u64): Likewise.
2516 (vpadd_u8): Likewise.
2517 (vpadd_u16): Likewise.
2518 (vpadd_u32): Likewise.
2519 (vpaddd_u64): Likewise.
2521 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2523 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2524 builtins for [r]subhn[2].
2525 * config/aarch64/arm_neon.h (vsubhn_s16): Remove unnecessary
2527 (vsubhn_s32): Likewise.
2528 (vsubhn_s64): Likewise.
2529 (vsubhn_u16): Use type-qualified builtin and remove casts.
2530 (vsubhn_u32): Likewise.
2531 (vsubhn_u64): Likewise.
2532 (vrsubhn_s16): Remove unnecessary cast.
2533 (vrsubhn_s32): Likewise.
2534 (vrsubhn_s64): Likewise.
2535 (vrsubhn_u16): Use type-qualified builtin and remove casts.
2536 (vrsubhn_u32): Likewise.
2537 (vrsubhn_u64): Likewise.
2538 (vrsubhn_high_s16): Remove unnecessary cast.
2539 (vrsubhn_high_s32): Likewise.
2540 (vrsubhn_high_s64): Likewise.
2541 (vrsubhn_high_u16): Use type-qualified builtin and remove
2543 (vrsubhn_high_u32): Likewise.
2544 (vrsubhn_high_u64): Likewise.
2545 (vsubhn_high_s16): Remove unnecessary cast.
2546 (vsubhn_high_s32): Likewise.
2547 (vsubhn_high_s64): Likewise.
2548 (vsubhn_high_u16): Use type-qualified builtin and remove
2550 (vsubhn_high_u32): Likewise.
2551 (vsubhn_high_u64): Likewise.
2553 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2555 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2556 builtins for [r]addhn[2].
2557 * config/aarch64/arm_neon.h (vaddhn_s16): Remove unnecessary
2559 (vaddhn_s32): Likewise.
2560 (vaddhn_s64): Likewise.
2561 (vaddhn_u16): Use type-qualified builtin and remove casts.
2562 (vaddhn_u32): Likewise.
2563 (vaddhn_u64): Likewise.
2564 (vraddhn_s16): Remove unnecessary cast.
2565 (vraddhn_s32): Likewise.
2566 (vraddhn_s64): Likewise.
2567 (vraddhn_u16): Use type-qualified builtin and remove casts.
2568 (vraddhn_u32): Likewise.
2569 (vraddhn_u64): Likewise.
2570 (vaddhn_high_s16): Remove unnecessary cast.
2571 (vaddhn_high_s32): Likewise.
2572 (vaddhn_high_s64): Likewise.
2573 (vaddhn_high_u16): Use type-qualified builtin and remove
2575 (vaddhn_high_u32): Likewise.
2576 (vaddhn_high_u64): Likewise.
2577 (vraddhn_high_s16): Remove unnecessary cast.
2578 (vraddhn_high_s32): Likewise.
2579 (vraddhn_high_s64): Likewise.
2580 (vraddhn_high_u16): Use type-qualified builtin and remove
2582 (vraddhn_high_u32): Likewise.
2583 (vraddhn_high_u64): Likewise.
2585 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2587 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2588 qualifiers in generator macros for uhsub builtins.
2589 * config/aarch64/arm_neon.h (vhsub_s8): Remove unnecessary
2591 (vhsub_s16): Likewise.
2592 (vhsub_s32): Likewise.
2593 (vhsub_u8): Use type-qualified builtin and remove casts.
2594 (vhsub_u16): Likewise.
2595 (vhsub_u32): Likewise.
2596 (vhsubq_s8): Remove unnecessary cast.
2597 (vhsubq_s16): Likewise.
2598 (vhsubq_s32): Likewise.
2599 (vhsubq_u8): Use type-qualified builtin and remove casts.
2600 (vhsubq_u16): Likewise.
2601 (vhsubq_u32): Likewise.
2603 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2605 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2606 qualifiers in generator macros for u[r]hadd builtins.
2607 * config/aarch64/arm_neon.h (vhadd_s8): Remove unnecessary
2609 (vhadd_s16): Likewise.
2610 (vhadd_s32): Likewise.
2611 (vhadd_u8): Use type-qualified builtin and remove casts.
2612 (vhadd_u16): Likewise.
2613 (vhadd_u32): Likewise.
2614 (vhaddq_s8): Remove unnecessary cast.
2615 (vhaddq_s16): Likewise.
2616 (vhaddq_s32): Likewise.
2617 (vhaddq_u8): Use type-qualified builtin and remove casts.
2618 (vhaddq_u16): Likewise.
2619 (vhaddq_u32): Likewise.
2620 (vrhadd_s8): Remove unnecessary cast.
2621 (vrhadd_s16): Likewise.
2622 (vrhadd_s32): Likewise.
2623 (vrhadd_u8): Use type-qualified builtin and remove casts.
2624 (vrhadd_u16): Likewise.
2625 (vrhadd_u32): Likewise.
2626 (vrhaddq_s8): Remove unnecessary cast.
2627 (vrhaddq_s16): Likewise.
2628 (vrhaddq_s32): Likewise.
2629 (vrhaddq_u8): Use type-wualified builtin and remove casts.
2630 (vrhaddq_u16): Likewise.
2631 (vrhaddq_u32): Likewise.
2633 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2635 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2636 qualifiers in generator macros for usub[lw][2] builtins.
2637 * config/aarch64/arm_neon.h (vsubl_s8): Remove unnecessary
2639 (vsubl_s16): Likewise.
2640 (vsubl_s32): Likewise.
2641 (vsubl_u8): Use type-qualified builtin and remove casts.
2642 (vsubl_u16): Likewise.
2643 (vsubl_u32): Likewise.
2644 (vsubl_high_s8): Remove unnecessary cast.
2645 (vsubl_high_s16): Likewise.
2646 (vsubl_high_s32): Likewise.
2647 (vsubl_high_u8): Use type-qualified builtin and remove casts.
2648 (vsubl_high_u16): Likewise.
2649 (vsubl_high_u32): Likewise.
2650 (vsubw_s8): Remove unnecessary casts.
2651 (vsubw_s16): Likewise.
2652 (vsubw_s32): Likewise.
2653 (vsubw_u8): Use type-qualified builtin and remove casts.
2654 (vsubw_u16): Likewise.
2655 (vsubw_u32): Likewise.
2656 (vsubw_high_s8): Remove unnecessary cast.
2657 (vsubw_high_s16): Likewise.
2658 (vsubw_high_s32): Likewise.
2659 (vsubw_high_u8): Use type-qualified builtin and remove casts.
2660 (vsubw_high_u16): Likewise.
2661 (vsubw_high_u32): Likewise.
2663 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2665 * config/aarch64/aarch64-simd-builtins.def: Use BINOPU type
2666 qualifiers in generator macros for uadd[lw][2] builtins.
2667 * config/aarch64/arm_neon.h (vaddl_s8): Remove unnecessary
2669 (vaddl_s16): Likewise.
2670 (vaddl_s32): Likewise.
2671 (vaddl_u8): Use type-qualified builtin and remove casts.
2672 (vaddl_u16): Likewise.
2673 (vaddl_u32): Likewise.
2674 (vaddl_high_s8): Remove unnecessary cast.
2675 (vaddl_high_s16): Likewise.
2676 (vaddl_high_s32): Likewise.
2677 (vaddl_high_u8): Use type-qualified builtin and remove casts.
2678 (vaddl_high_u16): Likewise.
2679 (vaddl_high_u32): Likewise.
2680 (vaddw_s8): Remove unnecessary cast.
2681 (vaddw_s16): Likewise.
2682 (vaddw_s32): Likewise.
2683 (vaddw_u8): Use type-qualified builtin and remove casts.
2684 (vaddw_u16): Likewise.
2685 (vaddw_u32): Likewise.
2686 (vaddw_high_s8): Remove unnecessary cast.
2687 (vaddw_high_s16): Likewise.
2688 (vaddw_high_s32): Likewise.
2689 (vaddw_high_u8): Use type-qualified builtin and remove casts.
2690 (vaddw_high_u16): Likewise.
2691 (vaddw_high_u32): Likewise.
2693 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2695 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2696 qualified builtins for [R]SHRN[2].
2697 * config/aarch64/arm_neon.h (vshrn_n_u16): Use type-qualified
2698 builtin and remove casts.
2699 (vshrn_n_u32): Likewise.
2700 (vshrn_n_u64): Likewise.
2701 (vrshrn_high_n_u16): Likewise.
2702 (vrshrn_high_n_u32): Likewise.
2703 (vrshrn_high_n_u64): Likewise.
2704 (vrshrn_n_u16): Likewise.
2705 (vrshrn_n_u32): Likewise.
2706 (vrshrn_n_u64): Likewise.
2707 (vshrn_high_n_u16): Likewise.
2708 (vshrn_high_n_u32): Likewise.
2709 (vshrn_high_n_u64): Likewise.
2711 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2713 * config/aarch64/aarch64-simd-builtins.def: Declare unsigned
2714 type-qualified builtins for XTN[2].
2715 * config/aarch64/arm_neon.h (vmovn_high_u16): Use type-
2716 qualified builtin and remove casts.
2717 (vmovn_high_u32): Likewise.
2718 (vmovn_high_u64): Likewise.
2719 (vmovn_u16): Likewise.
2720 (vmovn_u32): Likewise.
2721 (vmovn_u64): Likewise.
2723 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2725 * config/aarch64/aarch64-simd-builtins.def: Use poly type
2726 qualifier in builtin generator macros.
2727 * config/aarch64/arm_neon.h (vmul_p8): Use type-qualified
2728 builtin and remove casts.
2729 (vmulq_p8): Likewise.
2730 (vmull_high_p8): Likewise.
2731 (vmull_p8): Likewise.
2733 2021-11-11 Jonathan Wright <jonathan.wright@arm.com>
2735 * config/aarch64/aarch64-simd-builtins.def: Declare type-
2736 qualified builtin generators for unsigned MLA/MLS intrinsics.
2737 * config/aarch64/arm_neon.h (vmla_n_u16): Use type-qualified
2739 (vmla_n_u32): Likewise.
2740 (vmla_u8): Likewise.
2741 (vmla_u16): Likewise.
2742 (vmla_u32): Likewise.
2743 (vmlaq_n_u16): Likewise.
2744 (vmlaq_n_u32): Likewise.
2745 (vmlaq_u8): Likewise.
2746 (vmlaq_u16): Likewise.
2747 (vmlaq_u32): Likewise.
2748 (vmls_n_u16): Likewise.
2749 (vmls_n_u32): Likewise.
2750 (vmls_u8): Likewise.
2751 (vmls_u16): Likewise.
2752 (vmls_u32): Likewise.
2753 (vmlsq_n_u16): Likewise.
2754 (vmlsq_n_u32): Likewise.
2755 (vmlsq_u8): Likewise.
2756 (vmlsq_u16): Likewise.
2757 (vmlsq_u32): Likewise.
2759 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2761 * ipa-modref.c (modref_summary::useful_p): Check also for side-effects
2762 with looping const/pure.
2763 (modref_summary_lto::useful_p): Likewise.
2764 (merge_call_side_effects): Merge side effects before early exit
2766 (process_fnspec): Also handle pure functions.
2767 (analyze_call): Do not early exit on looping pure const.
2768 (propagate_unknown_call): Also handle nontrivial SCC as side-effect.
2769 (modref_propagate_in_scc): Update.
2771 2021-11-11 Richard Biener <rguenther@suse.de>
2773 PR tree-optimization/103190
2774 * tree-ssa-reassoc.c (insert_stmt_after): Only assert on asm goto.
2776 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2778 * gimple-range-path.cc (path_range_query::add_copies_to_imports):
2780 (path_range_query::compute_imports): ...this. Adapt it so it can
2781 be passed the imports bitmap instead of working on m_imports.
2782 (path_range_query::compute_ranges): Call compute_imports in all
2783 cases unless an imports bitmap is passed.
2784 * gimple-range-path.h (path_range_query::compute_imports): New.
2785 (path_range_query::add_copies_to_imports): Remove.
2786 * tree-ssa-threadbackward.c (back_threader::resolve_def): Remove.
2787 (back_threader::find_paths_to_names): Inline resolve_def.
2788 (back_threader::find_paths): Call compute_imports.
2789 (back_threader::resolve_phi): Adjust comment.
2791 2021-11-11 Richard Biener <rguenther@suse.de>
2793 PR tree-optimization/103188
2794 * tree-ssa-loop-ch.c (should_duplicate_loop_header_p):
2795 Remove query parameter, split out check for size
2797 (ch_base::m_ranger, cb_base::m_query): Remove.
2798 (ch_base::copy_headers): Split processing loop into
2799 analysis around which we allocate and use ranger and
2800 transform where we do not.
2801 (pass_ch::execute): Do not allocate/free ranger here.
2802 (pass_ch_vect::execute): Likewise.
2804 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2806 * ipa-pure-const.c (propagate_pure_const): Self recursion is
2809 2021-11-11 Jan Hubicka <hubicka@ucw.cz>
2811 * cgraph.c (set_noreturn_flag_1): New function.
2812 (cgraph_node::set_noreturn_flag): New member function
2813 * cgraph.h (cgraph_node::set_noreturn_flags): Declare.
2814 * ipa-pure-const.c (pass_local_pure_const::execute): Use it.
2816 2021-11-11 Aldy Hernandez <aldyh@redhat.com>
2817 Richard Biener <rguenther@suse.de>
2819 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): Resolve
2820 statically to the edge remaining in the loop.
2822 2021-11-11 Richard Biener <rguenther@suse.de>
2824 PR middle-end/103181
2825 * tree-eh.c (operation_could_trap_helper_p): Properly
2826 check vector constants for a zero element for integer
2827 division. Separate floating point and integer division code.
2829 2021-11-11 Jakub Jelinek <jakub@redhat.com>
2832 * dwarf2out.c (field_byte_offset): Do the PCC_BITFIELD_TYPE_MATTERS
2833 handling only for DECL_BIT_FIELD_TYPE decls.
2835 2021-11-11 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
2838 * config/aarch64/aarch64.c (aarch64_process_target_attr): Check if
2839 token is arch extension without leading '+' and emit appropriate
2840 diagnostic for the same.
2842 2021-11-11 Jakub Jelinek <jakub@redhat.com>
2844 * tree.h (OMP_CLAUSE_NUM_TEAMS_EXPR): Rename to ...
2845 (OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR): ... this.
2846 (OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR): Define.
2847 * tree.c (omp_clause_num_ops): Increase num ops for
2848 OMP_CLAUSE_NUM_TEAMS to 2.
2849 * tree-pretty-print.c (dump_omp_clause): Print optional lower bound
2850 for OMP_CLAUSE_NUM_TEAMS.
2851 * gimplify.c (gimplify_scan_omp_clauses): Gimplify
2852 OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR if non-NULL.
2853 (optimize_target_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR instead
2854 of OMP_CLAUSE_NUM_TEAMS_EXPR. Handle OMP_CLAUSE_NUM_TEAMS_LOWER_EXPR.
2855 * omp-low.c (lower_omp_teams): Use OMP_CLAUSE_NUM_TEAMS_UPPER_EXPR
2856 instead of OMP_CLAUSE_NUM_TEAMS_EXPR.
2857 * omp-expand.c (expand_teams_call, get_target_arguments): Likewise.
2859 2021-11-11 Richard Biener <rguenther@suse.de>
2861 * cfganal.c (find_pdom): Remove.
2862 (control_dependences::find_control_dependence): Remove
2863 special-casing of entry block, call get_immediate_dominator
2865 * gimple-predicate-analysis.cc (find_pdom): Remove.
2866 (find_dom): Likewise.
2867 (find_control_equiv_block): Call get_immediate_dominator
2869 (compute_control_dep_chain): Likewise.
2870 (predicate::init_from_phi_def): Likewise.
2872 2021-11-11 Richard Biener <rguenther@suse.de>
2874 * cfganal.h (control_dependences::control_dependence_map):
2876 (control_dependences::m_bitmaps): New.
2877 * cfganal.c (control_dependences::set_control_dependence_map_bit):
2879 (control_dependences::clear_control_dependence_bitmap):
2881 (control_dependences::find_control_dependence): Do not
2882 find_edge for the abnormal edge test.
2883 (control_dependences::control_dependences): Instead do not
2884 add abnormal edges to the edge list. Adjust.
2885 (control_dependences::~control_dependences): Likewise.
2886 (control_dependences::get_edges_dependent_on): Likewise.
2887 * function-tests.c: Include bitmap.h.
2889 2021-11-11 Kewen Lin <linkw@linux.ibm.com>
2891 * doc/invoke.texi: Change references to "future cpu" to "power10",
2892 "-mcpu=future" to "-mcpu=power10". Adjust words for float128.
2894 2021-11-11 Cui,Lili <lili.cui@intel.com>
2896 * config/i386/i386-options.c (m_CORE_AVX2): Remove Alderlake
2898 (processor_cost_table): Use alderlake_cost for Alderlake.
2899 * config/i386/i386.c (ix86_sched_init_global): Handle Alderlake.
2900 * config/i386/x86-tune-costs.h (struct processor_costs): Add alderlake
2902 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Alderlake
2904 (ix86_adjust_cost): Handle Alderlake.
2905 * config/i386/x86-tune.def (X86_TUNE_SCHEDULE): Enable for Alderlake.
2906 (X86_TUNE_PARTIAL_REG_DEPENDENCY): Likewise.
2907 (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY): Likewise.
2908 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): Likewise.
2909 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
2910 (X86_TUNE_MEMORY_MISMATCH_STALL): Likewise.
2911 (X86_TUNE_USE_LEAVE): Likewise.
2912 (X86_TUNE_PUSH_MEMORY): Likewise.
2913 (X86_TUNE_USE_INCDEC): Likewise.
2914 (X86_TUNE_INTEGER_DFMODE_MOVES): Likewise.
2915 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
2916 (X86_TUNE_USE_SAHF): Likewise.
2917 (X86_TUNE_USE_BT): Likewise.
2918 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
2919 (X86_TUNE_ONE_IF_CONV_INSN): Likewise.
2920 (X86_TUNE_AVOID_MFENCE): Likewise.
2921 (X86_TUNE_USE_SIMODE_FIOP): Likewise.
2922 (X86_TUNE_EXT_80387_CONSTANTS): Likewise.
2923 (X86_TUNE_SSE_UNALIGNED_LOAD_OPTIMAL): Likewise.
2924 (X86_TUNE_SSE_UNALIGNED_STORE_OPTIMAL): Likewise.
2925 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
2926 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
2927 (X86_TUNE_AVOID_4BYTE_PREFIXES): Likewise.
2928 (X86_TUNE_USE_GATHER): Disable for Alderlake.
2929 (X86_TUNE_AVX256_MOVE_BY_PIECES): Likewise.
2930 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
2932 2021-11-11 liuhongt <hongtao.liu@intel.com>
2935 * config/i386/sse.md (V_128_256): Extend to V8HF/V16HF.
2936 (avxsizesuffix): Ditto.
2938 2021-11-11 Kito Cheng <kito.cheng@sifive.com>
2940 * common/config/riscv/riscv-common.c (riscv_subset_list::to_string): Fix
2941 wrong marco checking.
2943 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
2945 PR tree-optimization/102906
2946 * tree-ssa-loop-ch.c (entry_loop_condition_is_static): New.
2947 (should_duplicate_loop_header_p): Call entry_loop_condition_is_static.
2948 (class ch_base): Add m_ranger and m_query.
2949 (ch_base::copy_headers): Pass m_query to
2950 entry_loop_condition_is_static.
2951 (pass_ch::execute): Allocate and deallocate m_ranger and
2953 (pass_ch_vect::execute): Same.
2955 2021-11-10 Andrew Pinski <apinski@marvell.com>
2958 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>):
2959 Use vwcore iterator for the r constraint output string.
2961 2021-11-10 qing zhao <qing.zhao@oracle.com>
2963 * internal-fn.c (expand_DEFERRED_INIT): Apply pattern initialization
2964 only when have_insn_for return true for the mode. Fix a memory leak.
2966 2021-11-10 Christophe Lyon <christophe.lyon@foss.st.com>
2968 * config/arm/arm.c (cortexa9_extra_costs, cortexa8_extra_costs,
2969 cortexa5_extra_costs, cortexa7_extra_costs,
2970 cortexa12_extra_costs, cortexa15_extra_costs, v7m_extra_costs):
2971 Initialize movi, dup and extract costing fields.
2973 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
2975 * gimple-range-path.cc (path_range_query::path_range_query): Do
2977 (path_range_query::dump): Change m_path uses to non-pointer.
2978 (path_range_query::defined_outside_path): Same.
2979 (path_range_query::set_path): Same.
2980 (path_range_query::add_copies_to_imports): Same.
2981 (path_range_query::range_of_stmt): Same.
2982 (path_range_query::compute_outgoing_relations): Same.
2983 (path_range_query::compute_ranges): Imports are now optional.
2984 Implement overload that takes an edge.
2985 * gimple-range-path.h (class path_range_query): Make imports
2986 optional for compute_ranges. Add compute_ranges(edge) overload.
2987 Make m_path an auto_vec instead of a pointer and adjust
2990 2021-11-10 Tamar Christina <tamar.christina@arm.com>
2992 * tree-vectorizer.h (struct scalar_cond_masked_key): Add inverted_p.
2993 (default_hash_traits<scalar_conf_masked_key>): Likewise.
2994 * tree-vect-stmts.c (vectorizable_condition): Check if inverse of mask
2996 * tree-vectorizer.c (scalar_cond_masked_key::get_cond_ops_from_tree):
2997 Register mask inverses.
2999 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3001 * tree-vectorizer.c (vectorize_loops): Do local CSE through RPVN upon
3002 successful vectorization.
3004 2021-11-10 Andrew MacLeod <amacleod@redhat.com>
3006 * gimple-range-cache.cc (sbr_vector::grow): New.
3007 (sbr_vector::set_bb_range): Call grow.
3008 (sbr_vector::get_bb_range): Same.
3009 (sbr_vector::bb_range_p): Remove assert.
3011 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3013 * config/aarch64/aarch64-simd.md (*aarch64_topbits_shuffle<mode>_le
3014 ,*aarch64_topbits_shuffle<mode>_be): Remove.
3016 2021-11-10 Jan Hubicka <jh@suse.cz>
3018 * ipa-modref.c: Include tree-eh.h
3019 (modref_summary::modref_summary): Initialize side_effects.
3020 (struct modref_summary_lto): New bool field side_effects.
3021 (modref_summary_lto::modref_summary_lto): Initialize side_effects.
3022 (modref_summary::dump): Dump side_effects.
3023 (modref_summary_lto::dump): Dump side_effects.
3024 (merge_call_side_effects): Merge side effects.
3025 (process_fnspec): Calls to non-const/pure or looping
3026 function is a side effect.
3027 (analyze_call): Self-recursion is a side-effect; handle
3029 (analyze_load): Watch for volatile and throwing memory.
3030 (analyze_store): Likewise.
3031 (analyze_stmt): Watch for volatitle asm.
3032 (analyze_function): Handle side_effects.
3033 (modref_summaries::duplicate): Duplicate side_effects.
3034 (modref_summaries_lto::duplicate): Likewise.
3035 (modref_write): Stream side_effects.
3036 (read_section): Likewise.
3037 (update_signature): Update.
3038 (propagate_unknown_call): Handle side_effects.
3039 (modref_propagate_in_scc): Likewise.
3040 * ipa-modref.h (struct modref_summary): Add side_effects.
3041 * ipa-pure-const.c (special_builtin_state): Rename to ...
3042 (builtin_safe_for_const_function_p): ... this one.
3043 (check_call): Update.
3044 (finite_function_p): Break out from ...
3045 (propagate_pure_const): ... here
3046 * ipa-utils.h (finite_function): Declare.
3048 2021-11-10 Lucas A. M. Magalhães <lamm@linux.ibm.com>
3050 * config.gcc (powerpc*-*-*): Remove -rpath from
3051 --with-advance-toolchain.
3053 2021-11-10 Marek Polacek <polacek@redhat.com>
3056 * attribs.c (struct scoped_attributes): Add a bool member.
3057 (lookup_scoped_attribute_spec): Forward declare.
3058 (register_scoped_attributes): New bool parameter, defaulted to
3060 (handle_ignored_attributes_option): New function.
3061 (free_attr_data): New function.
3062 (init_attributes): Call handle_ignored_attributes_option.
3063 (attr_namespace_ignored_p): New function.
3064 (decl_attributes): Check attr_namespace_ignored_p before
3066 * attribs.h (free_attr_data): Declare.
3067 (register_scoped_attributes): Adjust declaration.
3068 (handle_ignored_attributes_option): Declare.
3069 (canonicalize_attr_name): New function template.
3070 (canonicalize_attr_name): Use it.
3071 * common.opt (Wattributes=): New option with a variable.
3072 * doc/extend.texi: Document #pragma GCC diagnostic ignored_attributes.
3073 * doc/invoke.texi: Document -Wno-attributes=.
3074 * opts.c (common_handle_option) <case OPT_Wattributes_>: Handle.
3075 * plugin.h (register_scoped_attributes): Adjust declaration.
3076 * toplev.c (compile_file): Call free_attr_data.
3078 2021-11-10 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3080 * config/arm/arm-cpus.in (cortex-a710): New CPU.
3081 * config/arm/arm-tables.opt: Regenerate.
3082 * config/arm/arm-tune.md: Regenerate.
3083 * doc/invoke.texi: Update docs.
3085 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
3087 * config/aarch64/aarch64-builtins.c
3088 (aarch64_general_gimple_fold_builtin): Mark argument as unused.
3090 2021-11-10 Martin Liska <mliska@suse.cz>
3092 * lto-wrapper.c (merge_and_complain): Make the first argument
3095 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3097 * config/aarch64/iterators.md (optab): Use fmax_nan instead of
3098 smax_nan and fmin_nan instead of smin_nan.
3099 (maxmin_uns): Rename to...
3100 (fmaxmin): ...this and make the same changes. Remove entries
3101 unrelated to fmax* and fmin*.
3102 * config/aarch64/aarch64.md (<maxmin_uns><mode>3): Rename to...
3103 (<fmaxmin><mode>3): ...this.
3104 * config/aarch64/aarch64-simd.md (aarch64_<maxmin_uns>p<mode>):
3106 (aarch64_<optab>p<mode>): ...this.
3107 (<maxmin_uns><mode>3): Rename to...
3108 (<fmaxmin><mode>3): ...this.
3109 (reduc_<maxmin_uns>_scal_<mode>): Rename to...
3110 (reduc_<optab>_scal_<mode>): ...this and update gen* call.
3111 (aarch64_reduc_<maxmin_uns>_internal<mode>): Rename to...
3112 (aarch64_reduc_<optab>_internal<mode>): ...this.
3113 (aarch64_reduc_<maxmin_uns>_internalv2si): Rename to...
3114 (aarch64_reduc_<optab>_internalv2si): ...this.
3115 * config/aarch64/aarch64-sve.md (<maxmin_uns><mode>3): Rename to...
3116 (<fmaxmin><mode>3): ...this.
3117 * config/aarch64/aarch64-simd-builtins.def (smax_nan, smin_nan)
3119 (fmax_nan, fmin_nan): ...this.
3120 * config/aarch64/arm_neon.h (vmax_f32, vmax_f64, vmaxq_f32, vmaxq_f64)
3121 (vmin_f32, vmin_f64, vminq_f32, vminq_f64, vmax_f16, vmaxq_f16)
3122 (vmin_f16, vminq_f16): Update accordingly.
3124 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3126 * tree-vectorizer.h (vector_costs::finish_cost): Take the
3127 corresponding scalar costs as a parameter.
3128 (finish_cost): Likewise.
3129 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost)
3130 (vect_estimate_min_profitable_iters): Update accordingly.
3131 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Likewise.
3132 * tree-vectorizer.c (vector_costs::finish_cost): Likewise.
3133 * config/aarch64/aarch64.c (aarch64_vector_costs::finish_cost):
3135 * config/rs6000/rs6000.c (rs6000_cost_data::finish_cost): Likewise.
3137 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3139 * tree-vectorizer.h (_loop_vec_info::scalar_costs): New member
3141 (_loop_vec_info::single_scalar_iteration_cost): Delete.
3142 (LOOP_VINFO_SINGLE_SCALAR_ITERATION_COST): Delete.
3143 (vector_costs::total_cost): New function.
3144 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
3145 after above changes.
3146 (_loop_vec_info::~_loop_vec_info): Delete scalar_costs.
3147 (vect_compute_single_scalar_iteration_cost): Store the costs
3148 in loop_vinfo->scalar_costs.
3149 (vect_estimate_min_profitable_iters): Get the scalar cost from
3150 loop_vinfo->scalar_costs.
3152 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3154 * tree-vectorizer.h (vector_costs::better_main_loop_than_p)
3155 (vector_costs::better_epilogue_loop_than_p)
3156 (vector_costs::compare_inside_loop_cost)
3157 (vector_costs::compare_outside_loop_cost): Likewise.
3158 * tree-vectorizer.c (vector_costs::better_main_loop_than_p)
3159 (vector_costs::better_epilogue_loop_than_p)
3160 (vector_costs::compare_inside_loop_cost)
3161 (vector_costs::compare_outside_loop_cost): New functions,
3162 containing code moved from...
3163 * tree-vect-loop.c (vect_better_loop_vinfo_p): ...here.
3165 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3167 * tree-vectorizer.h (_loop_vec_info): Remove vec_outside_cost
3168 and vec_inside_cost.
3169 (vector_costs::outside_cost): New function.
3170 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
3172 (vect_estimate_min_profitable_iters): Likewise.
3173 (vect_better_loop_vinfo_p): Get the inside and outside costs
3174 from the loop_vec_infos' vector_costs.
3176 2021-11-10 Richard Sandiford <richard.sandiford@arm.com>
3178 * tree-vectorizer.h (vec_info::target_cost_data): Replace with...
3179 (_loop_vec_info::vector_costs): ...this.
3180 (LOOP_VINFO_TARGET_COST_DATA): Delete.
3181 * tree-vectorizer.c (vec_info::vec_info): Remove target_cost_data
3183 (vec_info::~vec_info): Remove corresponding delete.
3184 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Initialize
3185 vector_costs to null.
3186 (_loop_vec_info::~_loop_vec_info): Delete vector_costs.
3187 (vect_analyze_loop_operations): Update after above changes.
3188 (vect_analyze_loop_2): Likewise.
3189 (vect_estimate_min_profitable_iters): Likewise.
3190 * tree-vect-slp.c (vect_slp_analyze_operations): Likewise.
3192 2021-11-10 Jan Hubicka <hubicka@ucw.cz>
3194 * tree-core.h (EAF_DIRECT): Remove.
3195 (EAF_NOCLOBBER): Remove.
3196 (EAF_UNUSED): Remove.
3197 (EAF_NOESCAPE): Remove.
3198 (EAF_NO_DIRECT_CLOBBER): New.
3199 (EAF_NO_INDIRECT_CLOBBER): New.
3200 (EAF_NODIRECTESCAPE): Remove.
3201 (EAF_NO_DIRECT_ESCAPE): New.
3202 (EAF_NO_INDIRECT_ESCAPE): New.
3203 (EAF_NOT_RETURNED): Remove.
3204 (EAF_NOT_RETURNED_INDIRECTLY): New.
3205 (EAF_NOREAD): Remove.
3206 (EAF_NO_DIRECT_READ): New.
3207 (EAF_NO_INDIRECT_READ): New.
3208 * gimple.c (gimple_call_arg_flags): Update for new flags.
3209 (gimple_call_retslot_flags): Update for new flags.
3210 * ipa-modref.c (dump_eaf_flags): Likewise.
3211 (remove_useless_eaf_flags): Likewise.
3212 (deref_flags): Likewise.
3213 (modref_lattice::init): Likewise.
3214 (modref_lattice::merge): Likewise.
3215 (modref_lattice::merge_direct_load): Likewise.
3216 (modref_lattice::merge_direct_store): Likewise.
3217 (modref_eaf_analysis::merge_call_lhs_flags): Likewise.
3218 (callee_to_caller_flags): Likewise.
3219 (modref_eaf_analysis::analyze_ssa_name): Likewise.
3220 (modref_eaf_analysis::propagate): Likewise.
3221 (modref_merge_call_site_flags): Likewise.
3222 * ipa-modref.h (interposable_eaf_flags): Likewise.
3223 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1) Likewise.
3224 * tree-ssa-structalias.c (handle_call_arg): Likewise.
3225 (handle_rhs_call): Likewise.
3226 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Likewise.
3228 2021-11-10 Tamar Christina <tamar.christina@arm.com>
3230 * tree-vect-slp-patterns.c (complex_mul_pattern::matches): Remove l1node.
3232 2021-11-10 Andre Vieira <andre.simoesdiasvieira@arm.com>
3234 * config/aarch64/aarch64-builtins.c
3235 (aarch64_general_gimple_fold_builtin): Change pointer alignment and
3238 2021-11-10 Jan Hubicka <jh@suse.cz>
3240 * ipa-modref-tree.h (modref_tree::remap_params): Fix off-by-one error.
3242 2021-11-10 H.J. Lu <hongjiu.lu@intel.com>
3243 Hongtao Liu <hongtao.liu@intel.com>
3245 PR middle-end/102566
3246 * match.pd (nop_atomic_bit_test_and_p): New match.
3247 * tree-ssa-ccp.c (convert_atomic_bit_not): New function.
3248 (gimple_nop_atomic_bit_test_and_p): New prototype.
3249 (optimize_atomic_bit_test_and): Transform equivalent, but slighly
3250 different cases to their canonical forms.
3252 2021-11-10 liuhongt <hongtao.liu@intel.com>
3254 PR tree-optimization/103126
3255 * tree-vect-loop.c (neutral_op_for_reduction): Remove static.
3256 * tree-vectorizer.h (neutral_op_for_reduction): Declare.
3257 * tree-if-conv.c : Include tree-vectorizer.h.
3258 (is_cond_scalar_reduction): Handle
3259 BIT_XOR_EXPR/BIT_IOR_EXPR/BIT_AND_EXPR.
3260 (convert_scalar_cond_reduction): Ditto.
3262 2021-11-10 konglin1 <lingling.kong@intel.com>
3264 * config/i386/sse.md (cmul<conj_op><mode>3): add new define_expand.
3265 (cmla<conj_op><mode>4): Likewise
3267 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3269 * tree-ssa-threadedge.c: Do not include
3270 gimple-ssa-evrp-analyze.h.
3271 * value-pointer-equiv.cc: Same.
3273 2021-11-10 Aldy Hernandez <aldyh@redhat.com>
3275 * tree-ssa-threadbackward.c
3276 (back_threader::maybe_register_path_dump): Abstract path dumping...
3277 (dump_path): ...here.
3278 (back_threader::resolve_phi): Call dump_path.
3281 2021-11-10 konglin1 <lingling.kong@intel.com>
3283 * config/i386/sse.md (fma_<complexpairopname>_<mode>_pair):
3284 Add new define_insn.
3285 (fma_<mode>_fmaddc_bcst): Add new define_insn_and_split.
3286 (fma_<mode>_fcmaddc_bcst): Likewise
3288 2021-11-10 liuhongt <hongtao.liu@intel.com>
3291 * match.pd: Simplify (trunc)fmax/fmin((extend)a, (extend)b) to
3294 2021-11-10 Andrew Pinski <apinski@marvell.com>
3297 * config/aarch64/aarch64.c (aarch64_evpc_ins): Don't use target
3298 as an input, use original one.
3300 2021-11-10 Sandra Loosemore <sandra@codesourcery.com>
3302 * config/nios2/nios2.c (nios2_can_inline_p): New.
3303 (TARGET_CAN_INLINE_P): Define.
3305 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
3307 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin):
3308 Disable gimple fold for RS6000_BIF_{XVMINDP,XVMINSP,VMINFP} and
3309 RS6000_BIF_{XVMAXDP,XVMAXSP,VMAXFP} when fast-math is not set.
3310 (lxvrse_expand_builtin): Modify the expansion for sign extension.
3311 All extensions are done within VSX registers.
3313 2021-11-09 Andrew Pinski <apinski@marvell.com>
3315 PR tree-optimization/10352
3316 * match.pd: Remove check of TYPE_PRECISION for
3319 2021-11-09 Andrew MacLeod <amacleod@redhat.com>
3321 * function.c (allocate_struct_function): Don't set x_range_query.
3322 * function.h (get_range_query): Move to value-query.h.
3323 * gimple-range.cc (enable_ranger): Check that query is currently NULL.
3324 (disable_ranger): Clear function current query field.
3325 * value-query.cc (get_global_range_query): Relocate to:
3326 * value-query.h (get_global_range_query): Here and inline.
3327 (get_range_query): Relocate here from function.h.
3329 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3331 * tree-ssa-threadbackward.c
3332 (back_threader::maybe_register_path_dump): New.
3333 (back_threader::maybe_register_path): Call maybe_register_path_dump.
3335 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3337 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
3338 Return NULL when unprofitable.
3340 2021-11-09 Martin Jambor <mjambor@suse.cz>
3342 * tree.h (build_debug_expr_decl): Declare.
3343 * tree.c (build_debug_expr_decl): New function.
3344 * cfgexpand.c (avoid_deep_ter_for_debug): Use build_debug_expr_decl
3345 instead of building a DEBUG_EXPR_DECL.
3346 * ipa-param-manipulation.c
3347 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3348 * omp-simd-clone.c (ipa_simd_modify_stmt_ops): Likewise.
3349 * tree-ssa-ccp.c (optimize_atomic_bit_test_and): Likewise.
3350 * tree-ssa-phiopt.c (spaceship_replacement): Likewise.
3351 * tree-ssa-reassoc.c (make_new_ssa_for_def): Likewise.
3353 2021-11-09 Bill Schmidt <wschmidt@linux.ibm.com>
3355 * config/rs6000/rs6000-call.c (rs6000_type_string): New function.
3356 (def_builtin): Change debug formatting for easier parsing and
3357 include more information.
3358 (rs6000_init_builtins): Add dump of autogenerated builtins.
3359 (altivec_init_builtins): Dump __builtin_altivec_mask_for_load for
3362 2021-11-09 Marek Polacek <polacek@redhat.com>
3364 * ipa-modref.c (analyze_parms): Fix a typo.
3366 2021-11-09 Kito Cheng <kito.cheng@sifive.com>
3369 * common/config/riscv/riscv-common.c (multi_letter_subset_rank): Remove
3370 assertion for Z*-ext.
3372 2021-11-09 Jan Hubicka <hubicka@ucw.cz>
3374 * ipa-modref.c (analyze_parms): Add past_flags, past_retslot_flags
3375 and past_static_chain; merge past summary with current one.
3376 (analyze_function): Update.
3378 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3380 * doc/invoke.texi (Invoking GCC): Document --param=threader-debug.
3382 2021-11-09 Martin Liska <mliska@suse.cz>
3384 * print-rtl.c (rtx_writer::rtx_writer): Make the compilation
3385 conditional based on
3386 * print-rtl.h (class rtx_writer): Likewise.
3388 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3390 * input.h (location_hash): Use 'BUILTINS_LOCATION' as spare value
3391 for 'Deleted'. Turn into a '#define'.
3393 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3395 * gimple-range-path.cc (path_range_query::dump): Clean up.
3396 (path_range_query::compute_ranges): Same.
3397 * value-relation.cc (path_oracle::dump): Same.
3399 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3401 * dumpfile.c (dump_options): Remove TDF_THREADING entry.
3402 * dumpfile.h (enum dump_flag): Remove TDF_THREADING and adjust
3404 * flag-types.h (enum threader_debug): New.
3405 * gimple-range-path.cc (DEBUG_SOLVER): Use param_threader_debug.
3406 * params.opt: Add entry for --param=threader-debug=.
3408 2021-11-09 Thomas Schwinge <thomas@codesourcery.com>
3410 PR middle-end/101204
3412 * diagnostic-spec.h (typedef xint_hash_t): Turn into...
3413 (struct xint_hash_t): ... this.
3414 * doc/gty.texi: Update.
3416 2021-11-09 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3418 * config/arm/arm-cpus.in (armv9): New define.
3419 (ARMv9a): New group.
3420 (armv9-a): New arch definition.
3421 * config/arm/arm-tables.opt: Regenerate.
3422 * config/arm/arm.h (BASE_ARCH_9A): New arch enum value.
3423 * config/arm/t-aprofile: Added armv9-a and armv9+simd.
3424 * config/arm/t-arm-elf: Added arm9-a, v9_fps and all_v9_archs
3425 to MULTILIB_MATCHES.
3426 * config/arm/t-multilib: Added v9_a_nosimd_variants and
3427 v9_a_simd_variants to MULTILIB_MATCHES.
3428 * doc/invoke.texi: Update docs.
3430 2021-11-09 Martin Jambor <mjambor@suse.cz>
3433 * ipa-param-manipulation.c (replace_with_mapped_expr): Early
3434 return with error_mark_mode when part of expression is mapped to
3436 (ipa_param_body_adjustments::remap_with_debug_expressions): Set
3437 mapped value to NULL if walk_tree returns error_mark_mode.
3439 2021-11-09 Aldy Hernandez <aldyh@redhat.com>
3441 * tree-ssa-strlen.c (compare_nonzero_chars): Pass statement
3443 (get_addr_stridx): Same.
3445 (get_range_strlen_dynamic): Same.
3446 (handle_builtin_strlen): Same.
3447 (handle_builtin_strchr): Same.
3448 (handle_builtin_strcpy): Same.
3449 (maybe_diag_stxncpy_trunc): Same.
3450 (handle_builtin_stxncpy_strncat): Same.
3451 (handle_builtin_memcpy): Same.
3452 (handle_builtin_strcat): Same.
3453 (handle_alloc_call): Same.
3454 (handle_builtin_memset): Same.
3455 (handle_builtin_string_cmp): Same.
3456 (handle_pointer_plus): Same.
3457 (count_nonzero_bytes_addr): Same.
3458 (count_nonzero_bytes): Same.
3459 (handle_store): Same.
3460 (fold_strstr_to_strncmp): Same.
3461 (handle_integral_assign): Same.
3462 (check_and_optimize_stmt): Same.
3463 (class strlen_dom_walker): Replace evrp with ranger.
3464 (strlen_dom_walker::before_dom_children): Remove evrp.
3465 (strlen_dom_walker::after_dom_children): Remove evrp.
3466 * gimple-ssa-warn-access.cc (maybe_check_access_sizes):
3467 Restrict sprintf output.
3469 2021-11-09 Martin Liska <mliska@suse.cz>
3471 * genconditions.c (write_one_condition): Add const qualifier
3474 2021-11-09 Jeff Law <jeffreyalaw@gmail.com>
3476 * match.pd: New pattern to simplify (1 << n) & M ==/!= 0 for M
3479 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3481 * config/rs6000/rs6000-builtin-new.def (VEC_INIT_V16QI): Use
3482 escape-newline support.
3483 (VEC_INIT_V4SI): Likewise.
3484 (VEC_INIT_V8HI): Likewise.
3485 (PACK_V1TI): Likewise.
3487 (VFIRSTMISMATCHOREOSINDEX_V16QI): Likewise.
3488 (VFIRSTMISMATCHOREOSINDEX_V8HI): Likewise.
3489 (VFIRSTMISMATCHOREOSINDEX_V4SI): Likewise.
3493 (FMAF128_ODD): Likewise.
3494 (VSCEQPUO): Likewise.
3496 (VSIEQPF): Likewise.
3497 (VSTDCQP): Likewise.
3498 (PACK_TD): Likewise.
3499 (TABORTDC): Likewise.
3500 (TABORTDCI): Likewise.
3501 (SE_LXVRBX): Likewise.
3502 (SE_LXVRHX): Likewise.
3503 (SE_LXVRWX): Likewise.
3504 (SE_LXVRDX): Likewise.
3505 (VREPLACE_UN_UV2DI): Likewise.
3506 (VREPLACE_UN_UV4SI): Likewise.
3507 (VREPLACE_UN_V2DI): Likewise.
3508 (VREPLACE_ELT_UV2DI): Likewise.
3509 (VREPLACE_ELT_V2DI): Likewise.
3510 (ZE_LXVRBX): Likewise.
3511 (ZE_LXVRHX): Likewise.
3512 (ZE_LXVRWX): Likewise.
3513 (ZE_LXVRDX): Likewise.
3515 (CNTLZDM): Likewise.
3516 (CNTTZDM): Likewise.
3519 (PMXVBF16GER2): Likewise.
3520 (PMXVBF16GER2_INTERNAL): Likewise.
3521 (PMXVBF16GER2NN): Likewise.
3522 (PMXVBF16GER2NN_INTERNAL): Likewise.
3523 (PMXVBF16GER2NP): Likewise.
3524 (PMXVBF16GER2NP_INTERNAL): Likewise.
3525 (PMXVBF16GER2PN): Likewise.
3526 (PMXVBF16GER2PN_INTERNAL): Likewise.
3527 (PMXVBF16GER2PP): Likewise.
3528 (PMXVBF16GER2PP_INTERNAL): Likewise.
3529 (PMXVF16GER2): Likewise.
3530 (PMXVF16GER2_INTERNAL): Likewise.
3531 (PMXVF16GER2NN): Likewise.
3532 (PMXVF16GER2NN_INTERNAL): Likewise.
3533 (PMXVF16GER2NP): Likewise.
3534 (PMXVF16GER2NP_INTERNAL): Likewise.
3535 (PMXVF16GER2PN): Likewise.
3536 (PMXVF16GER2PN_INTERNAL): Likewise.
3537 (PMXVF16GER2PP): Likewise.
3538 (PMXVF16GER2PP_INTERNAL): Likewise.
3539 (PMXVF32GER_INTERNAL): Likewise.
3540 (PMXVF32GERNN): Likewise.
3541 (PMXVF32GERNN_INTERNAL): Likewise.
3542 (PMXVF32GERNP): Likewise.
3543 (PMXVF32GERNP_INTERNAL): Likewise.
3544 (PMXVF32GERPN): Likewise.
3545 (PMXVF32GERPN_INTERNAL): Likewise.
3546 (PMXVF32GERPP): Likewise.
3547 (PMXVF32GERPP_INTERNAL): Likewise.
3548 (PMXVF64GER): Likewise.
3549 (PMXVF64GER_INTERNAL): Likewise.
3550 (PMXVF64GERNN): Likewise.
3551 (PMXVF64GERNN_INTERNAL): Likewise.
3552 (PMXVF64GERNP): Likewise.
3553 (PMXVF64GERNP_INTERNAL): Likewise.
3554 (PMXVF64GERPN): Likewise.
3555 (PMXVF64GERPN_INTERNAL): Likewise.
3556 (PMXVF64GERPP): Likewise.
3557 (PMXVF64GERPP_INTERNAL): Likewise.
3558 (PMXVI16GER2): Likewise.
3559 (PMXVI16GER2_INTERNAL): Likewise.
3560 (PMXVI16GER2PP): Likewise.
3561 (PMXVI16GER2PP_INTERNAL): Likewise.
3562 (PMXVI16GER2S): Likewise.
3563 (PMXVI16GER2S_INTERNAL): Likewise.
3564 (PMXVI16GER2SPP): Likewise.
3565 (PMXVI16GER2SPP_INTERNAL): Likewise.
3566 (PMXVI4GER8): Likewise.
3567 (PMXVI4GER8_INTERNAL): Likewise.
3568 (PMXVI4GER8PP): Likewise.
3569 (PMXVI4GER8PP_INTERNAL): Likewise.
3570 (PMXVI8GER4): Likewise.
3571 (PMXVI8GER4_INTERNAL): Likewise.
3572 (PMXVI8GER4PP): Likewise.
3573 (PMXVI8GER4PP_INTERNAL): Likewise.
3574 (PMXVI8GER4SPP): Likewise.
3575 (PMXVI8GER4SPP_INTERNAL): Likewise.
3576 * config/rs6000/rs6000-gen-builtins.c (MAXLINES): New macro.
3577 (linebuf): Increase size.
3578 (lines): New variable.
3579 (lastline): Likewise.
3580 (real_line_pos): New function.
3581 (diag): Change signature.
3582 (bif_diag): Change signature; support escape-newline handling.
3583 (ovld_diag): Likewise.
3584 (fatal): Move earlier.
3585 (consume_whitespace): Adjust diag call.
3586 (advance_line): Add escape-newline handling; call fatal.
3587 (safe_inc_pos): Adjust diag call.
3588 (match_identifier): Likewise.
3589 (match_integer): Likewise.
3590 (match_to_right_bracket): Call fatal instead of diag; adjust diag
3592 (match_basetype): Adjust diag calls.
3593 (match_bracketed_pair): Likewise.
3594 (match_const_restriction): Likewise.
3595 (match_type): Likewise.
3596 (parse_args): Likewise.
3597 (parse_bif_attrs): Likewise.
3598 (complete_vector_type): Likewise.
3599 (complete_base_type): Likewise.
3600 (parse_prototype): Likewise.
3601 (parse_bif_entry): Likewise.
3602 (parse_bif_stanza): Likewise.
3603 (parse_ovld_entry): Likewise.
3604 (parse_ovld_stanza): Likewise.
3605 (main): Allocate buffers for lines[].
3607 2021-11-08 Eric Botcazou <ebotcazou@adacore.com>
3609 * config/i386/i386.c (legitimize_pic_address): Adjust comment and
3610 use the REG argument on the CM_LARGE_PIC code path as well.
3611 * config/i386/predicates.md (gotoff_operand): Do not treat VxWorks
3612 specially with the large code models.
3614 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3616 * gimple.c (gimple_call_static_chain_flags): Revert the workaround
3617 allowing interposition since issues with binds_to_local_def were
3620 2021-11-08 Andrew MacLeod <amacleod@redhat.com>
3622 PR tree-optimization/103122
3623 * gimple-range.cc (gimple_ranger::range_of_expr): Request the cache
3624 entry with "calulate new values" set to false.
3626 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3628 * builtins.c (is_simple_builtin): Add builitin_dwarf_cfa
3629 and builtin_return_address.
3630 (builtin_fnspec): Annotate builtin_return,
3631 bulitin_eh_pointer, builtin_eh_filter, builtin_unwind_resume,
3632 builtin_cxa_end_cleanup, builtin_eh_copy_values,
3633 builtin_frame_address, builtin_apply_args,
3634 builtin_asan_before_dynamic_init, builtin_asan_after_dynamic_init,
3635 builtin_prefetch, builtin_dwarf_cfa, builtin_return_addrss
3637 * ipa-pure-const.c (special_builtin_state): Add builtin_dwarf_cfa
3638 and builtin_return_address.
3640 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3642 PR tree-optimization/103177
3643 * passes.def: Move uncprop after pure/const and modref.
3645 2021-11-08 Martin Jambor <mjambor@suse.cz>
3649 * tree-inline.c (remap_gimple_stmt): Unshare the expression without
3650 location before invoking remap_with_debug_expressions on it.
3651 * ipa-param-manipulation.c
3652 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
3654 2021-11-08 David Edelsohn <dje.gcc@gmail.com>
3656 * config/rs6000/vsx.md (vsx_splat_v4si_di): Revert "wa"
3659 2021-11-08 Richard Biener <rguenther@suse.de>
3661 * cfgloop.c (verify_loop_structure): Use a temporary BB flag
3662 instead of an sbitmap to cache irreducible state.
3664 2021-11-08 Aldy Hernandez <aldyh@redhat.com>
3666 PR tree-optimization/103120
3667 * gimple-range-path.cc (path_range_query::range_defined_in_block):
3668 Bail if there's a cache entry.
3670 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3672 * config/rs6000/rs6000.c (rs6000_builtin_reciprocal): Use
3673 rs6000_builtin_decls_x when appropriate.
3674 (add_condition_to_bb): Likewise.
3675 (rs6000_atomic_assign_expand_fenv): Likewise.
3677 2021-11-08 Bill Schmidt <wschmidt@linux.ibm.com>
3679 * config/rs6000/rs6000-call.c (rs6000_new_builtin_decl): New function.
3680 (rs6000_builtin_decl): Call it.
3682 2021-11-08 Martin Liska <mliska@suse.cz>
3684 * cgraph.c (cgraph_node::dump): Dump it from decl.
3686 2021-11-08 Martin Liska <mliska@suse.cz>
3688 PR gcov-profile/100520
3689 * coverage.c (coverage_compute_profile_id): Strip .gk when
3690 compare debug is used.
3691 * system.h (endswith): New function.
3693 2021-11-08 Martin Liska <mliska@suse.cz>
3695 * cgraph.c (cgraph_node::dump): Dump static_chain_decl.
3697 2021-11-08 Thomas Schwinge <thomas@codesourcery.com>
3699 * config/rs6000/rbtree.c: Fix 'Copyright (C) 2020-21' into '2020-2021'
3700 * config/rs6000/rbtree.h: Likewise.
3701 * config/rs6000/rs6000-builtin-new.def: Likewise.
3702 * config/rs6000/rs6000-gen-builtins.c: Likewise.
3703 * config/rs6000/rs6000-overload.def: Likewise.
3705 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
3707 * config/aarch64/aarch64.c (aarch64_ldn_stn_vectors): New function.
3708 (aarch64_address_cost): Use it instead of testing for CImode and
3711 2021-11-08 Richard Sandiford <richard.sandiford@arm.com>
3713 * genmodes.c (emit_insn_modes_h): Define NUM_MODE_* macros.
3714 * expmed.h (NUM_MODE_INT): Delete in favor of genmodes definitions.
3715 (NUM_MODE_PARTIAL_INT, NUM_MODE_VECTOR_INT): Likewise.
3716 * real.h (real_format_for_mode): Use NUM_MODE_FLOAT and
3717 NUM_MODE_DECIMAL_FLOAT.
3718 (REAL_MODE_FORMAT): Likewise.
3720 2021-11-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
3722 * tree-vect-loop.c (vect_better_loop_vinfo_p): Change how epilogue loop
3725 2021-11-08 Richard Biener <rguenther@suse.de>
3727 * tree-vectorizer.h (vect_create_loop_vinfo): Add main_loop_info
3729 * tree-vect-loop.c (vect_create_loop_vinfo): Likewise. Set
3730 LOOP_VINFO_ORIG_LOOP_INFO and conditionalize set of
3731 LOOP_VINFO_NITERS_ASSUMPTIONS.
3732 (vect_analyze_loop_1): Adjust.
3733 (vect_analyze_loop): Move loop constraint setting and
3734 SCEV/niter reset here from vect_create_loop_vinfo to perform
3736 (vect_analyze_loop_form): Move dumping of symbolic niters
3737 here from vect_create_loop_vinfo.
3739 2021-11-08 Jan Hubicka <hubicka@ucw.cz>
3741 * ipa-modref-tree.h (enum modref_special_parms): New enum.
3742 (struct modref_access_node): update for special parms.
3743 (struct modref_ref_node): Likewise.
3744 (struct modref_parm_map): Likewise.
3745 (struct modref_tree): Likewise.
3746 * ipa-modref.c (dump_access): Likewise.
3747 (get_access): Detect static chain.
3748 (parm_map_for_arg): Take tree as arg instead of
3750 (merge_call_side_effects): Compute map for static chain.
3751 (process_fnspec): Update.
3752 (struct escape_point): Remove retslot_arg and static_chain_arg.
3753 (analyze_parms): Update.
3754 (compute_parm_map): Update.
3755 (propagate_unknown_call): Update.
3756 (modref_propagate_in_scc): Update.
3757 (modref_merge_call_site_flags): Update.
3758 (ipa_merge_modref_summary_after_inlining): Update.
3759 * tree-ssa-alias.c (modref_may_conflict): Handle static chain.
3760 * ipa-modref-tree.c (test_merge): Update.
3762 2021-11-08 Haochen Gui <guihaoc@gcc.gnu.org>
3764 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin): Disable
3765 gimple fold for VSX_BUILTIN_XVMINDP, ALTIVEC_BUILTIN_VMINFP,
3766 VSX_BUILTIN_XVMAXDP, ALTIVEC_BUILTIN_VMAXFP when fast-math is not
3769 2021-11-08 liuhongt <hongtao.liu@intel.com>
3771 PR tree-optimization/103077
3772 * doc/invoke.texi (Options That Control Optimization):
3773 Update documentation for -ftree-loop-vectorize and
3774 -ftree-slp-vectorize which are enabled by default at -02.
3776 2021-11-08 liuhongt <hongtao.liu@intel.com>
3779 * match.pd (Simplifcation (trunc)copysign((extend)a, (extend)b)
3780 to .COPYSIGN (a, b)): Add !HONOR_SNANS.
3782 2021-11-08 liuhongt <hongtao.liu@intel.com>
3785 * match.pd: Simplify
3786 (trunc)fma ((extend)a, (extend)b, (extend)c) to IFN_FMA (a, b,
3787 c) under flag_unsafe_math_optimizations.
3789 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
3792 * params.opt (modref-max-depth): Add range.
3793 (modref-max-adjustments): Fix range.
3795 2021-11-07 Aldy Hernandez <aldyh@redhat.com>
3797 * tree-pass.h (make_pass_vrp_threader): Remove.
3798 * tree-ssa-threadbackward.c
3799 (back_threader_profitability::profitable_path_p): Remove
3800 ASSERT_EXPR references.
3801 * tree-ssa-threadedge.c (jt_state::register_equivs_stmt): Same.
3802 * tree-vrp.c (vrp_folder::simplify_casted_conds): Same.
3803 (execute_vrp): Same.
3804 (class hybrid_threader): Remove.
3805 (hybrid_threader::hybrid_threader): Remove.
3806 (hybrid_threader::~hybrid_threader): Remove.
3807 (hybrid_threader::before_dom_children): Remove.
3808 (hybrid_threader::after_dom_children): Remove.
3809 (execute_vrp_threader): Remove.
3810 (class pass_vrp_threader): Remove.
3811 (make_pass_vrp_threader): Remove.
3813 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
3815 * gimple.c (gimple_call_arg_flags): Use interposable_eaf_flags.
3816 (gimple_call_retslot_flags): Likewise.
3817 (gimple_call_static_chain_flags): Likewise.
3818 * ipa-modref.c (remove_useless_eaf_flags): Do not remove everything for
3820 (modref_summary::useful_p): Likewise.
3821 (modref_summary_lto::useful_p): Likewise.
3822 (analyze_parms): Do not give up on NOVOPS.
3823 (analyze_function): When dumping report chnages in EAF flags
3824 between IPA and local pass.
3825 (modref_merge_call_site_flags): Compute implicit eaf flags
3826 based on callee ecf_flags and fnspec; if the function does not
3827 bind to current defs use interposable_eaf_flags.
3828 (modref_propagate_flags_in_scc): Update.
3829 * ipa-modref.h (interposable_eaf_flags): New function.
3831 2021-11-07 Bill Schmidt <wschmidt@linux.ibm.com>
3833 * config/rs6000/rs6000-call.c (rs6000_expand_new_builtin): New
3835 (rs6000_invalid_new_builtin): New function.
3836 (rs6000_expand_builtin): Call rs6000_expand_new_builtin.
3837 (rs6000_expand_ldst_mask): New function.
3838 (new_cpu_expand_builtin): Likewise.
3839 (elemrev_icode): Likewise.
3840 (ldv_expand_builtin): Likewise.
3841 (lxvrse_expand_builtin): Likewise.
3842 (lxvrze_expand_builtin): Likewise.
3843 (stv_expand_builtin): Likewise.
3844 (new_mma_expand_builtin): Likewise.
3845 (new_htm_spr_num): Likewise.
3846 (new_htm_expand_builtin): Likewise.
3847 (rs6000_expand_new_builtin): Likewise.
3848 (rs6000_init_builtins): Initialize altivec_builtin_mask_for_load.
3850 2021-11-07 Jan Hubicka <hubicka@ucw.cz>
3852 * ipa-modref.c (modref_lattice): Add do_dataflow,
3853 changed and propagate_to fields.
3854 (modref_lattice::release): Free propagate_to
3855 (modref_lattice::merge): Do not give up early on unknown
3857 (modref_lattice::merge_deref): Likewise.
3858 (modref_eaf_analysis): Update toplevel comment.
3859 (modref_eaf_analysis::analyze_ssa_name): Record postponned ssa names;
3860 do optimistic dataflow initialization.
3861 (modref_eaf_analysis::merge_with_ssa_name): Build dataflow graph.
3862 (modref_eaf_analysis::propagate): New member function.
3863 (analyze_parms): Update to new API of modref_eaf_analysis.
3865 2021-11-06 Jan Hubicka <hubicka@ucw.cz>
3867 * cgraph.h (cgraph_node::can_be_discarded_p): Do not
3868 return true on functions from other partition.
3870 2021-11-06 David Edelsohn <dje.gcc@gmail.com>
3872 * config/rs6000/vsx.md (vsx_splat_v4si): Change constraints to "wa".
3873 (vsx_splat_v4si_di): Change constraint to "wa".
3875 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
3877 PR tree-optimization/103061
3878 * value-relation.cc (path_oracle::path_oracle): Initialize
3880 (path_oracle::killing_def): Set m_killed_defs.
3881 (path_oracle::query_relation): Do not look at the root oracle for
3883 * value-relation.h (class path_oracle): Add m_killed_defs.
3885 2021-11-06 Aldy Hernandez <aldyh@redhat.com>
3887 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
3888 Remove gotos and other cleanups.
3890 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
3893 * ipa-modref-tree.h (modref_tree::insert): Do nothing for
3894 paradoxical and zero sized accesses.
3896 2021-11-05 Jan Hubicka <hubicka@ucw.cz>
3899 * ipa-modref-tree.h (struct modref_access_node): Avoid left shift
3902 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
3904 * config/darwin.h (ASM_GENERATE_INTERNAL_LABEL): Add LTRAMP
3905 to the list of symbol prefixes that must be made linker-
3908 2021-11-05 Iain Sandoe <iain@sandoe.co.uk>
3910 * config.host: Add support for aarch64-*-darwin.
3911 * config/aarch64/host-aarch64-darwin.c: New file.
3912 * config/aarch64/x-darwin: New file.
3914 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
3916 PR tree-optimization/103093
3917 * gimple-range-gori.cc (range_def_chain::get_imports): Remove assert.
3919 2021-11-05 Andrew MacLeod <amacleod@redhat.com>
3921 PR tree-optimization/102943
3922 * gimple-range-cache.cc (class update_list): New.
3923 (update_list::add): Replace add_to_update.
3924 (update_list::pop): New.
3925 (ranger_cache::ranger_cache): Adjust.
3926 (ranger_cache::~ranger_cache): Adjust.
3927 (ranger_cache::add_to_update): Delete.
3928 (ranger_cache::propagate_cache): Adjust to new class.
3929 (ranger_cache::propagate_updated_value): Ditto.
3930 (ranger_cache::fill_block_cache): Ditto.
3931 * gimple-range-cache.h (class ranger_cache): Adjust to update class.
3933 2021-11-05 Richard Biener <rguenther@suse.de>
3935 * tree-vect-loop.c (vect_analyze_loop): Remove obsolete
3936 comment and expand on another one. Combine nested if.
3938 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
3941 * config/pa/pa-modes.def: Add OImode integer type.
3942 * config/pa/pa.c (pa_scalar_mode_supported_p): Allow TImode
3944 * config/pa/pa.h (MIN_UNITS_PER_WORD) Define to MIN_UNITS_PER_WORD
3945 to UNITS_PER_WORD if IN_LIBGCC2.
3946 * config/pa/pa.md (addti3, addvti3, subti3, subvti3, negti2,
3947 negvti2, ashlti3, shrpd_internal): New patterns.
3948 Change some multi instruction types to multi.
3950 2021-11-05 Jakub Jelinek <jakub@redhat.com>
3953 * config/i386/i386.h
3954 (stringop_algs::stringop_strategy::stringop_strategy): Make the ctor
3957 2021-11-05 Wilco Dijkstra <wdijkstr@arm.com>
3960 * config/aarch64/aarch64.c (aarch64_mov_operand_p): Strip the salt
3962 * config/aarch64/constraints.md: Support const in Usw.
3964 2021-11-05 John David Anglin <danglin@gcc.gnu.org>
3966 * config/pa/pa.h (PREFERRED_DEBUGGING_TYPE): Define to DWARF2_DEBUG.
3967 * config/pa/pa64-hpux.h (PREFERRED_DEBUGGING_TYPE): Remove define.
3969 2021-11-05 Richard Biener <rguenther@suse.de>
3971 * tree-vectorizer.h (vec_info_shared::n_stmts): Add.
3972 (LOOP_VINFO_N_STMTS): Likewise.
3973 (vec_info_for_bb): Remove unused function.
3974 * tree-vectorizer.c (vec_info_shared::vec_info_shared):
3975 Initialize n_stmts member.
3976 * tree-vect-loop.c: Remove INCLUDE_FUNCTIONAL.
3977 (vect_create_loop_vinfo): Do not set loop->aux.
3978 (vect_analyze_loop_2): Do not get n_stmts as argument,
3979 instead use LOOP_VINFO_N_STMTS. Set LOOP_VINFO_VECTORIZABLE_P
3981 (vect_analyze_loop_1): Remove callback, get the mode iterator
3982 and autodetected_vector_mode as argument, advancing the
3983 iterator and initializing autodetected_vector_mode here.
3984 (vect_analyze_loop): Split analysis loop into two, first
3985 processing main loops only and then epilogues.
3987 2021-11-05 Martin Jambor <mjambor@suse.cz>
3989 * ipa-prop.c (compute_complex_assign_jump_func): Remove
3990 unnecessary check for RECORD_TYPE.
3992 2021-11-05 Gerald Pfeifer <gerald@pfeifer.com>
3994 * doc/install.texi2html: Do not generate old.html any longer.
3996 2021-11-05 Martin Liska <mliska@suse.cz>
3999 * opts.c (finish_options): Reset flag_gtoggle when it is used.
4001 2021-11-05 Jakub Jelinek <jakub@redhat.com>
4004 * dwarf2out.c (add_const_value_attribute): Add MODE argument, use it
4005 in CONST_WIDE_INT handling. Adjust recursive calls.
4006 (add_location_or_const_value_attribute): Pass DECL_MODE (decl) to
4007 new add_const_value_attribute argument.
4008 (tree_add_const_value_attribute): Pass TYPE_MODE (type) to new
4009 add_const_value_attribute argument.
4011 2021-11-05 Rasmus Villemoes <rasmus.villemoes@prevas.dk>
4013 * config/vx-common.h: Test value of TARGET_VXWORKS7 rather
4016 2021-11-05 Richard Biener <rguenther@suse.de>
4018 * tree-vectorizer.h (struct vect_loop_form_info): New.
4019 (vect_analyze_loop_form): Adjust.
4020 (vect_create_loop_vinfo): New.
4021 * tree-parloops.c (gather_scalar_reductions): Adjust for
4022 vect_analyze_loop_form API change.
4023 * tree-vect-loop.c: Include <functional>.
4024 (vect_analyze_loop_form_1): Rename to vect_analyze_loop_form,
4025 take struct vect_loop_form_info as output parameter and adjust.
4026 (vect_analyze_loop_form): Rename to vect_create_loop_vinfo and
4027 split out call to the original vect_analyze_loop_form_1.
4028 (vect_reanalyze_as_main_loop): Rename to...
4029 (vect_analyze_loop_1): ... this, factor out the call to
4030 vect_analyze_loop_form and generalize to be able to use it twice ...
4031 (vect_analyze_loop): ... here. Perform vect_analyze_loop_form
4034 2021-11-05 Xionghu Luo <luoxhu@linux.ibm.com>
4037 * config/rs6000/fusion.md: Regenerate.
4038 * config/rs6000/genfusion.pl: Fix incorrect clobber constraint.
4040 2021-11-04 Andreas Krebbel <krebbel@linux.ibm.com>
4042 * config/s390/s390.h (STACK_CHECK_MOVING_SP): New macro
4045 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4047 * config/aarch64/aarch64-builtins.c
4048 (aarch64_general_gimple_fold_builtin): Add ashl, sshl, ushl, ashr,
4049 ashr_simd, lshr, lshr_simd.
4050 * config/aarch64/aarch64-simd-builtins.def (lshr): Use USHIFTIMM.
4051 * config/aarch64/arm_neon.h (vshr_n_u8, vshr_n_u16, vshr_n_u32,
4052 vshrq_n_u8, vshrq_n_u16, vshrq_n_u32, vshrq_n_u64): Fix type hack.
4054 2021-11-04 Tamar Christina <tamar.christina@arm.com>
4056 * match.pd: New negate+shift pattern.
4058 2021-11-04 Andrew MacLeod <amacleod@redhat.com>
4060 PR tree-optimization/103079
4061 * gimple-range-gori.cc (gimple_range_calc_op1): Treat undefined as
4063 (gimple_range_calc_op2): Ditto.
4065 2021-11-04 Martin Jambor <mjambor@suse.cz>
4068 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
4069 members remap_with_debug_expressions, m_dead_ssa_debug_equiv,
4070 m_dead_stmt_debug_equiv and prepare_debug_expressions. Added
4071 parameter to mark_dead_statements.
4072 * ipa-param-manipulation.c: Include tree-phinodes.h and cfgexpand.h.
4073 (ipa_param_body_adjustments::mark_dead_statements): New parameter
4074 debugstack, push into it all SSA names used in debug statements,
4075 produce m_dead_ssa_debug_equiv mapping for the removed param.
4076 (replace_with_mapped_expr): New function.
4077 (ipa_param_body_adjustments::remap_with_debug_expressions): Likewise.
4078 (ipa_param_body_adjustments::prepare_debug_expressions): Likewise.
4079 (ipa_param_body_adjustments::common_initialization): Gather and
4080 procecc SSA which will be removed but are in debug statements. Simplify.
4081 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
4083 * tree-inline.c (remap_gimple_stmt): Create a debug bind when possible
4084 when avoiding a copy of an unnecessary statement. Remap removed SSA
4085 names in existing debug statements.
4086 (tree_function_versioning): Do not create DEBUG_EXPR_DECL for removed
4087 parameters if we have already done so.
4089 2021-11-04 Jan Hubicka <hubicka@ucw.cz>
4092 * gimple.c (gimple_call_static_chain_flags): Handle case when
4093 nested function does not bind locally.
4095 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4097 * config/aarch64/aarch64.c (aarch64_function_value): Generate
4098 a register rtx for Neon vector-tuple modes.
4099 (aarch64_layout_arg): Likewise.
4101 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4103 * lower-subreg.c (simple_move): Prevent decomposition if
4104 modes are not tieable.
4106 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
4107 Richard Sandiford <richard.sandiford@arm.com>
4109 * config/aarch64/aarch64-builtins.c (v2x8qi_UP): Define.
4110 (v2x4hi_UP): Likewise.
4111 (v2x4hf_UP): Likewise.
4112 (v2x4bf_UP): Likewise.
4113 (v2x2si_UP): Likewise.
4114 (v2x2sf_UP): Likewise.
4115 (v2x1di_UP): Likewise.
4116 (v2x1df_UP): Likewise.
4117 (v2x16qi_UP): Likewise.
4118 (v2x8hi_UP): Likewise.
4119 (v2x8hf_UP): Likewise.
4120 (v2x8bf_UP): Likewise.
4121 (v2x4si_UP): Likewise.
4122 (v2x4sf_UP): Likewise.
4123 (v2x2di_UP): Likewise.
4124 (v2x2df_UP): Likewise.
4125 (v3x8qi_UP): Likewise.
4126 (v3x4hi_UP): Likewise.
4127 (v3x4hf_UP): Likewise.
4128 (v3x4bf_UP): Likewise.
4129 (v3x2si_UP): Likewise.
4130 (v3x2sf_UP): Likewise.
4131 (v3x1di_UP): Likewise.
4132 (v3x1df_UP): Likewise.
4133 (v3x16qi_UP): Likewise.
4134 (v3x8hi_UP): Likewise.
4135 (v3x8hf_UP): Likewise.
4136 (v3x8bf_UP): Likewise.
4137 (v3x4si_UP): Likewise.
4138 (v3x4sf_UP): Likewise.
4139 (v3x2di_UP): Likewise.
4140 (v3x2df_UP): Likewise.
4141 (v4x8qi_UP): Likewise.
4142 (v4x4hi_UP): Likewise.
4143 (v4x4hf_UP): Likewise.
4144 (v4x4bf_UP): Likewise.
4145 (v4x2si_UP): Likewise.
4146 (v4x2sf_UP): Likewise.
4147 (v4x1di_UP): Likewise.
4148 (v4x1df_UP): Likewise.
4149 (v4x16qi_UP): Likewise.
4150 (v4x8hi_UP): Likewise.
4151 (v4x8hf_UP): Likewise.
4152 (v4x8bf_UP): Likewise.
4153 (v4x4si_UP): Likewise.
4154 (v4x4sf_UP): Likewise.
4155 (v4x2di_UP): Likewise.
4156 (v4x2df_UP): Likewise.
4157 (TYPES_GETREGP): Delete.
4158 (TYPES_SETREGP): Likewise.
4159 (TYPES_LOADSTRUCT_U): Define.
4160 (TYPES_LOADSTRUCT_P): Likewise.
4161 (TYPES_LOADSTRUCT_LANE_U): Likewise.
4162 (TYPES_LOADSTRUCT_LANE_P): Likewise.
4163 (TYPES_STORE1P): Move for consistency.
4164 (TYPES_STORESTRUCT_U): Define.
4165 (TYPES_STORESTRUCT_P): Likewise.
4166 (TYPES_STORESTRUCT_LANE_U): Likewise.
4167 (TYPES_STORESTRUCT_LANE_P): Likewise.
4168 (aarch64_simd_tuple_types): Define.
4169 (aarch64_lookup_simd_builtin_type): Handle tuple type lookup.
4170 (aarch64_init_simd_builtin_functions): Update frontend lookup
4171 for builtin functions after handling arm_neon.h pragma.
4172 (register_tuple_type): Manually set modes of single-integer
4173 tuple types. Record tuple types.
4174 * config/aarch64/aarch64-modes.def
4175 (ADV_SIMD_D_REG_STRUCT_MODES): Define D-register tuple modes.
4176 (ADV_SIMD_Q_REG_STRUCT_MODES): Define Q-register tuple modes.
4177 (SVE_MODES): Give single-vector modes priority over vector-
4179 (VECTOR_MODES_WITH_PREFIX): Set partial-vector mode order to
4180 be after all single-vector modes.
4181 * config/aarch64/aarch64-simd-builtins.def: Update builtin
4182 generator macros to reflect modifications to the backend
4184 * config/aarch64/aarch64-simd.md (aarch64_simd_ld2<mode>):
4185 Use vector-tuple mode iterator and rename to...
4186 (aarch64_simd_ld2<vstruct_elt>): This.
4187 (aarch64_simd_ld2r<mode>): Use vector-tuple mode iterator and
4189 (aarch64_simd_ld2r<vstruct_elt>): This.
4190 (aarch64_vec_load_lanesoi_lane<mode>): Use vector-tuple mode
4191 iterator and rename to...
4192 (aarch64_vec_load_lanes<mode>_lane<vstruct_elt>): This.
4193 (vec_load_lanesoi<mode>): Use vector-tuple mode iterator and
4195 (vec_load_lanes<mode><vstruct_elt>): This.
4196 (aarch64_simd_st2<mode>): Use vector-tuple mode iterator and
4198 (aarch64_simd_st2<vstruct_elt>): This.
4199 (aarch64_vec_store_lanesoi_lane<mode>): Use vector-tuple mode
4200 iterator and rename to...
4201 (aarch64_vec_store_lanes<mode>_lane<vstruct_elt>): This.
4202 (vec_store_lanesoi<mode>): Use vector-tuple mode iterator and
4204 (vec_store_lanes<mode><vstruct_elt>): This.
4205 (aarch64_simd_ld3<mode>): Use vector-tuple mode iterator and
4207 (aarch64_simd_ld3<vstruct_elt>): This.
4208 (aarch64_simd_ld3r<mode>): Use vector-tuple mode iterator and
4210 (aarch64_simd_ld3r<vstruct_elt>): This.
4211 (aarch64_vec_load_lanesci_lane<mode>): Use vector-tuple mode
4212 iterator and rename to...
4213 (vec_load_lanesci<mode>): This.
4214 (aarch64_simd_st3<mode>): Use vector-tuple mode iterator and
4216 (aarch64_simd_st3<vstruct_elt>): This.
4217 (aarch64_vec_store_lanesci_lane<mode>): Use vector-tuple mode
4218 iterator and rename to...
4219 (vec_store_lanesci<mode>): This.
4220 (aarch64_simd_ld4<mode>): Use vector-tuple mode iterator and
4222 (aarch64_simd_ld4<vstruct_elt>): This.
4223 (aarch64_simd_ld4r<mode>): Use vector-tuple mode iterator and
4225 (aarch64_simd_ld4r<vstruct_elt>): This.
4226 (aarch64_vec_load_lanesxi_lane<mode>): Use vector-tuple mode
4227 iterator and rename to...
4228 (vec_load_lanesxi<mode>): This.
4229 (aarch64_simd_st4<mode>): Use vector-tuple mode iterator and
4231 (aarch64_simd_st4<vstruct_elt>): This.
4232 (aarch64_vec_store_lanesxi_lane<mode>): Use vector-tuple mode
4233 iterator and rename to...
4234 (vec_store_lanesxi<mode>): This.
4235 (mov<mode>): Define for Neon vector-tuple modes.
4236 (aarch64_ld1x3<VALLDIF:mode>): Use vector-tuple mode iterator
4238 (aarch64_ld1x3<vstruct_elt>): This.
4239 (aarch64_ld1_x3_<mode>): Use vector-tuple mode iterator and
4241 (aarch64_ld1_x3_<vstruct_elt>): This.
4242 (aarch64_ld1x4<VALLDIF:mode>): Use vector-tuple mode iterator
4244 (aarch64_ld1x4<vstruct_elt>): This.
4245 (aarch64_ld1_x4_<mode>): Use vector-tuple mode iterator and
4247 (aarch64_ld1_x4_<vstruct_elt>): This.
4248 (aarch64_st1x2<VALLDIF:mode>): Use vector-tuple mode iterator
4250 (aarch64_st1x2<vstruct_elt>): This.
4251 (aarch64_st1_x2_<mode>): Use vector-tuple mode iterator and
4253 (aarch64_st1_x2_<vstruct_elt>): This.
4254 (aarch64_st1x3<VALLDIF:mode>): Use vector-tuple mode iterator
4256 (aarch64_st1x3<vstruct_elt>): This.
4257 (aarch64_st1_x3_<mode>): Use vector-tuple mode iterator and
4259 (aarch64_st1_x3_<vstruct_elt>): This.
4260 (aarch64_st1x4<VALLDIF:mode>): Use vector-tuple mode iterator
4262 (aarch64_st1x4<vstruct_elt>): This.
4263 (aarch64_st1_x4_<mode>): Use vector-tuple mode iterator and
4265 (aarch64_st1_x4_<vstruct_elt>): This.
4266 (*aarch64_mov<mode>): Define for vector-tuple modes.
4267 (*aarch64_be_mov<mode>): Likewise.
4268 (aarch64_ld<VSTRUCT:nregs>r<VALLDIF:mode>): Use vector-tuple
4269 mode iterator and rename to...
4270 (aarch64_ld<nregs>r<vstruct_elt>): This.
4271 (aarch64_ld2<mode>_dreg): Use vector-tuple mode iterator and
4273 (aarch64_ld2<vstruct_elt>_dreg): This.
4274 (aarch64_ld3<mode>_dreg): Use vector-tuple mode iterator and
4276 (aarch64_ld3<vstruct_elt>_dreg): This.
4277 (aarch64_ld4<mode>_dreg): Use vector-tuple mode iterator and
4279 (aarch64_ld4<vstruct_elt>_dreg): This.
4280 (aarch64_ld<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
4281 iterator and rename to...
4282 (aarch64_ld<nregs><vstruct_elt>): Use vector-tuple mode
4283 iterator and rename to...
4284 (aarch64_ld<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
4285 (aarch64_ld1x2<VQ:mode>): Delete.
4286 (aarch64_ld1x2<VDC:mode>): Use vector-tuple mode iterator and
4288 (aarch64_ld1x2<vstruct_elt>): This.
4289 (aarch64_ld<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
4290 tuple mode iterator and rename to...
4291 (aarch64_ld<nregs>_lane<vstruct_elt>): This.
4292 (aarch64_get_dreg<VSTRUCT:mode><VDC:mode>): Delete.
4293 (aarch64_get_qreg<VSTRUCT:mode><VQ:mode>): Likewise.
4294 (aarch64_st2<mode>_dreg): Use vector-tuple mode iterator and
4296 (aarch64_st2<vstruct_elt>_dreg): This.
4297 (aarch64_st3<mode>_dreg): Use vector-tuple mode iterator and
4299 (aarch64_st3<vstruct_elt>_dreg): This.
4300 (aarch64_st4<mode>_dreg): Use vector-tuple mode iterator and
4302 (aarch64_st4<vstruct_elt>_dreg): This.
4303 (aarch64_st<VSTRUCT:nregs><VDC:mode>): Use vector-tuple mode
4304 iterator and rename to...
4305 (aarch64_st<nregs><vstruct_elt>): This.
4306 (aarch64_st<VSTRUCT:nregs><VQ:mode>): Use vector-tuple mode
4307 iterator and rename to aarch64_st<nregs><vstruct_elt>.
4308 (aarch64_st<VSTRUCT:nregs>_lane<VALLDIF:mode>): Use vector-
4309 tuple mode iterator and rename to...
4310 (aarch64_st<nregs>_lane<vstruct_elt>): This.
4311 (aarch64_set_qreg<VSTRUCT:mode><VQ:mode>): Delete.
4312 (aarch64_simd_ld1<mode>_x2): Use vector-tuple mode iterator
4314 (aarch64_simd_ld1<vstruct_elt>_x2): This.
4315 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
4316 Refactor to include new vector-tuple modes.
4317 (aarch64_classify_vector_mode): Add cases for new vector-
4319 (aarch64_advsimd_partial_struct_mode_p): Define.
4320 (aarch64_advsimd_full_struct_mode_p): Likewise.
4321 (aarch64_advsimd_vector_array_mode): Likewise.
4322 (aarch64_sve_data_mode): Change location in file.
4323 (aarch64_array_mode): Handle case of Neon vector-tuple modes.
4324 (aarch64_hard_regno_nregs): Handle case of partial Neon
4326 (aarch64_classify_address): Refactor to include handling of
4327 Neon vector-tuple modes.
4328 (aarch64_print_operand): Print "d" for "%R" for a partial
4329 Neon vector structure.
4330 (aarch64_expand_vec_perm_1): Use new vector-tuple mode.
4331 (aarch64_modes_tieable_p): Prevent tieing Neon partial struct
4332 modes with scalar machines modes larger than 8 bytes.
4333 (aarch64_can_change_mode_class): Don't allow changes between
4334 partial and full Neon vector-structure modes.
4335 * config/aarch64/arm_neon.h (vst2_lane_f16): Use updated
4336 builtin and remove boiler-plate code for opaque mode.
4337 (vst2_lane_f32): Likewise.
4338 (vst2_lane_f64): Likewise.
4339 (vst2_lane_p8): Likewise.
4340 (vst2_lane_p16): Likewise.
4341 (vst2_lane_p64): Likewise.
4342 (vst2_lane_s8): Likewise.
4343 (vst2_lane_s16): Likewise.
4344 (vst2_lane_s32): Likewise.
4345 (vst2_lane_s64): Likewise.
4346 (vst2_lane_u8): Likewise.
4347 (vst2_lane_u16): Likewise.
4348 (vst2_lane_u32): Likewise.
4349 (vst2_lane_u64): Likewise.
4350 (vst2q_lane_f16): Likewise.
4351 (vst2q_lane_f32): Likewise.
4352 (vst2q_lane_f64): Likewise.
4353 (vst2q_lane_p8): Likewise.
4354 (vst2q_lane_p16): Likewise.
4355 (vst2q_lane_p64): Likewise.
4356 (vst2q_lane_s8): Likewise.
4357 (vst2q_lane_s16): Likewise.
4358 (vst2q_lane_s32): Likewise.
4359 (vst2q_lane_s64): Likewise.
4360 (vst2q_lane_u8): Likewise.
4361 (vst2q_lane_u16): Likewise.
4362 (vst2q_lane_u32): Likewise.
4363 (vst2q_lane_u64): Likewise.
4364 (vst3_lane_f16): Likewise.
4365 (vst3_lane_f32): Likewise.
4366 (vst3_lane_f64): Likewise.
4367 (vst3_lane_p8): Likewise.
4368 (vst3_lane_p16): Likewise.
4369 (vst3_lane_p64): Likewise.
4370 (vst3_lane_s8): Likewise.
4371 (vst3_lane_s16): Likewise.
4372 (vst3_lane_s32): Likewise.
4373 (vst3_lane_s64): Likewise.
4374 (vst3_lane_u8): Likewise.
4375 (vst3_lane_u16): Likewise.
4376 (vst3_lane_u32): Likewise.
4377 (vst3_lane_u64): Likewise.
4378 (vst3q_lane_f16): Likewise.
4379 (vst3q_lane_f32): Likewise.
4380 (vst3q_lane_f64): Likewise.
4381 (vst3q_lane_p8): Likewise.
4382 (vst3q_lane_p16): Likewise.
4383 (vst3q_lane_p64): Likewise.
4384 (vst3q_lane_s8): Likewise.
4385 (vst3q_lane_s16): Likewise.
4386 (vst3q_lane_s32): Likewise.
4387 (vst3q_lane_s64): Likewise.
4388 (vst3q_lane_u8): Likewise.
4389 (vst3q_lane_u16): Likewise.
4390 (vst3q_lane_u32): Likewise.
4391 (vst3q_lane_u64): Likewise.
4392 (vst4_lane_f16): Likewise.
4393 (vst4_lane_f32): Likewise.
4394 (vst4_lane_f64): Likewise.
4395 (vst4_lane_p8): Likewise.
4396 (vst4_lane_p16): Likewise.
4397 (vst4_lane_p64): Likewise.
4398 (vst4_lane_s8): Likewise.
4399 (vst4_lane_s16): Likewise.
4400 (vst4_lane_s32): Likewise.
4401 (vst4_lane_s64): Likewise.
4402 (vst4_lane_u8): Likewise.
4403 (vst4_lane_u16): Likewise.
4404 (vst4_lane_u32): Likewise.
4405 (vst4_lane_u64): Likewise.
4406 (vst4q_lane_f16): Likewise.
4407 (vst4q_lane_f32): Likewise.
4408 (vst4q_lane_f64): Likewise.
4409 (vst4q_lane_p8): Likewise.
4410 (vst4q_lane_p16): Likewise.
4411 (vst4q_lane_p64): Likewise.
4412 (vst4q_lane_s8): Likewise.
4413 (vst4q_lane_s16): Likewise.
4414 (vst4q_lane_s32): Likewise.
4415 (vst4q_lane_s64): Likewise.
4416 (vst4q_lane_u8): Likewise.
4417 (vst4q_lane_u16): Likewise.
4418 (vst4q_lane_u32): Likewise.
4419 (vst4q_lane_u64): Likewise.
4420 (vtbl3_s8): Likewise.
4421 (vtbl3_u8): Likewise.
4422 (vtbl3_p8): Likewise.
4423 (vtbl4_s8): Likewise.
4424 (vtbl4_u8): Likewise.
4425 (vtbl4_p8): Likewise.
4426 (vld1_u8_x3): Likewise.
4427 (vld1_s8_x3): Likewise.
4428 (vld1_u16_x3): Likewise.
4429 (vld1_s16_x3): Likewise.
4430 (vld1_u32_x3): Likewise.
4431 (vld1_s32_x3): Likewise.
4432 (vld1_u64_x3): Likewise.
4433 (vld1_s64_x3): Likewise.
4434 (vld1_f16_x3): Likewise.
4435 (vld1_f32_x3): Likewise.
4436 (vld1_f64_x3): Likewise.
4437 (vld1_p8_x3): Likewise.
4438 (vld1_p16_x3): Likewise.
4439 (vld1_p64_x3): Likewise.
4440 (vld1q_u8_x3): Likewise.
4441 (vld1q_s8_x3): Likewise.
4442 (vld1q_u16_x3): Likewise.
4443 (vld1q_s16_x3): Likewise.
4444 (vld1q_u32_x3): Likewise.
4445 (vld1q_s32_x3): Likewise.
4446 (vld1q_u64_x3): Likewise.
4447 (vld1q_s64_x3): Likewise.
4448 (vld1q_f16_x3): Likewise.
4449 (vld1q_f32_x3): Likewise.
4450 (vld1q_f64_x3): Likewise.
4451 (vld1q_p8_x3): Likewise.
4452 (vld1q_p16_x3): Likewise.
4453 (vld1q_p64_x3): Likewise.
4454 (vld1_u8_x2): Likewise.
4455 (vld1_s8_x2): Likewise.
4456 (vld1_u16_x2): Likewise.
4457 (vld1_s16_x2): Likewise.
4458 (vld1_u32_x2): Likewise.
4459 (vld1_s32_x2): Likewise.
4460 (vld1_u64_x2): Likewise.
4461 (vld1_s64_x2): Likewise.
4462 (vld1_f16_x2): Likewise.
4463 (vld1_f32_x2): Likewise.
4464 (vld1_f64_x2): Likewise.
4465 (vld1_p8_x2): Likewise.
4466 (vld1_p16_x2): Likewise.
4467 (vld1_p64_x2): Likewise.
4468 (vld1q_u8_x2): Likewise.
4469 (vld1q_s8_x2): Likewise.
4470 (vld1q_u16_x2): Likewise.
4471 (vld1q_s16_x2): Likewise.
4472 (vld1q_u32_x2): Likewise.
4473 (vld1q_s32_x2): Likewise.
4474 (vld1q_u64_x2): Likewise.
4475 (vld1q_s64_x2): Likewise.
4476 (vld1q_f16_x2): Likewise.
4477 (vld1q_f32_x2): Likewise.
4478 (vld1q_f64_x2): Likewise.
4479 (vld1q_p8_x2): Likewise.
4480 (vld1q_p16_x2): Likewise.
4481 (vld1q_p64_x2): Likewise.
4482 (vld1_s8_x4): Likewise.
4483 (vld1q_s8_x4): Likewise.
4484 (vld1_s16_x4): Likewise.
4485 (vld1q_s16_x4): Likewise.
4486 (vld1_s32_x4): Likewise.
4487 (vld1q_s32_x4): Likewise.
4488 (vld1_u8_x4): Likewise.
4489 (vld1q_u8_x4): Likewise.
4490 (vld1_u16_x4): Likewise.
4491 (vld1q_u16_x4): Likewise.
4492 (vld1_u32_x4): Likewise.
4493 (vld1q_u32_x4): Likewise.
4494 (vld1_f16_x4): Likewise.
4495 (vld1q_f16_x4): Likewise.
4496 (vld1_f32_x4): Likewise.
4497 (vld1q_f32_x4): Likewise.
4498 (vld1_p8_x4): Likewise.
4499 (vld1q_p8_x4): Likewise.
4500 (vld1_p16_x4): Likewise.
4501 (vld1q_p16_x4): Likewise.
4502 (vld1_s64_x4): Likewise.
4503 (vld1_u64_x4): Likewise.
4504 (vld1_p64_x4): Likewise.
4505 (vld1q_s64_x4): Likewise.
4506 (vld1q_u64_x4): Likewise.
4507 (vld1q_p64_x4): Likewise.
4508 (vld1_f64_x4): Likewise.
4509 (vld1q_f64_x4): Likewise.
4510 (vld2_s64): Likewise.
4511 (vld2_u64): Likewise.
4512 (vld2_f64): Likewise.
4513 (vld2_s8): Likewise.
4514 (vld2_p8): Likewise.
4515 (vld2_p64): Likewise.
4516 (vld2_s16): Likewise.
4517 (vld2_p16): Likewise.
4518 (vld2_s32): Likewise.
4519 (vld2_u8): Likewise.
4520 (vld2_u16): Likewise.
4521 (vld2_u32): Likewise.
4522 (vld2_f16): Likewise.
4523 (vld2_f32): Likewise.
4524 (vld2q_s8): Likewise.
4525 (vld2q_p8): Likewise.
4526 (vld2q_s16): Likewise.
4527 (vld2q_p16): Likewise.
4528 (vld2q_p64): Likewise.
4529 (vld2q_s32): Likewise.
4530 (vld2q_s64): Likewise.
4531 (vld2q_u8): Likewise.
4532 (vld2q_u16): Likewise.
4533 (vld2q_u32): Likewise.
4534 (vld2q_u64): Likewise.
4535 (vld2q_f16): Likewise.
4536 (vld2q_f32): Likewise.
4537 (vld2q_f64): Likewise.
4538 (vld3_s64): Likewise.
4539 (vld3_u64): Likewise.
4540 (vld3_f64): Likewise.
4541 (vld3_s8): Likewise.
4542 (vld3_p8): Likewise.
4543 (vld3_s16): Likewise.
4544 (vld3_p16): Likewise.
4545 (vld3_s32): Likewise.
4546 (vld3_u8): Likewise.
4547 (vld3_u16): Likewise.
4548 (vld3_u32): Likewise.
4549 (vld3_f16): Likewise.
4550 (vld3_f32): Likewise.
4551 (vld3_p64): Likewise.
4552 (vld3q_s8): Likewise.
4553 (vld3q_p8): Likewise.
4554 (vld3q_s16): Likewise.
4555 (vld3q_p16): Likewise.
4556 (vld3q_s32): Likewise.
4557 (vld3q_s64): Likewise.
4558 (vld3q_u8): Likewise.
4559 (vld3q_u16): Likewise.
4560 (vld3q_u32): Likewise.
4561 (vld3q_u64): Likewise.
4562 (vld3q_f16): Likewise.
4563 (vld3q_f32): Likewise.
4564 (vld3q_f64): Likewise.
4565 (vld3q_p64): Likewise.
4566 (vld4_s64): Likewise.
4567 (vld4_u64): Likewise.
4568 (vld4_f64): Likewise.
4569 (vld4_s8): Likewise.
4570 (vld4_p8): Likewise.
4571 (vld4_s16): Likewise.
4572 (vld4_p16): Likewise.
4573 (vld4_s32): Likewise.
4574 (vld4_u8): Likewise.
4575 (vld4_u16): Likewise.
4576 (vld4_u32): Likewise.
4577 (vld4_f16): Likewise.
4578 (vld4_f32): Likewise.
4579 (vld4_p64): Likewise.
4580 (vld4q_s8): Likewise.
4581 (vld4q_p8): Likewise.
4582 (vld4q_s16): Likewise.
4583 (vld4q_p16): Likewise.
4584 (vld4q_s32): Likewise.
4585 (vld4q_s64): Likewise.
4586 (vld4q_u8): Likewise.
4587 (vld4q_u16): Likewise.
4588 (vld4q_u32): Likewise.
4589 (vld4q_u64): Likewise.
4590 (vld4q_f16): Likewise.
4591 (vld4q_f32): Likewise.
4592 (vld4q_f64): Likewise.
4593 (vld4q_p64): Likewise.
4594 (vld2_dup_s8): Likewise.
4595 (vld2_dup_s16): Likewise.
4596 (vld2_dup_s32): Likewise.
4597 (vld2_dup_f16): Likewise.
4598 (vld2_dup_f32): Likewise.
4599 (vld2_dup_f64): Likewise.
4600 (vld2_dup_u8): Likewise.
4601 (vld2_dup_u16): Likewise.
4602 (vld2_dup_u32): Likewise.
4603 (vld2_dup_p8): Likewise.
4604 (vld2_dup_p16): Likewise.
4605 (vld2_dup_p64): Likewise.
4606 (vld2_dup_s64): Likewise.
4607 (vld2_dup_u64): Likewise.
4608 (vld2q_dup_s8): Likewise.
4609 (vld2q_dup_p8): Likewise.
4610 (vld2q_dup_s16): Likewise.
4611 (vld2q_dup_p16): Likewise.
4612 (vld2q_dup_s32): Likewise.
4613 (vld2q_dup_s64): Likewise.
4614 (vld2q_dup_u8): Likewise.
4615 (vld2q_dup_u16): Likewise.
4616 (vld2q_dup_u32): Likewise.
4617 (vld2q_dup_u64): Likewise.
4618 (vld2q_dup_f16): Likewise.
4619 (vld2q_dup_f32): Likewise.
4620 (vld2q_dup_f64): Likewise.
4621 (vld2q_dup_p64): Likewise.
4622 (vld3_dup_s64): Likewise.
4623 (vld3_dup_u64): Likewise.
4624 (vld3_dup_f64): Likewise.
4625 (vld3_dup_s8): Likewise.
4626 (vld3_dup_p8): Likewise.
4627 (vld3_dup_s16): Likewise.
4628 (vld3_dup_p16): Likewise.
4629 (vld3_dup_s32): Likewise.
4630 (vld3_dup_u8): Likewise.
4631 (vld3_dup_u16): Likewise.
4632 (vld3_dup_u32): Likewise.
4633 (vld3_dup_f16): Likewise.
4634 (vld3_dup_f32): Likewise.
4635 (vld3_dup_p64): Likewise.
4636 (vld3q_dup_s8): Likewise.
4637 (vld3q_dup_p8): Likewise.
4638 (vld3q_dup_s16): Likewise.
4639 (vld3q_dup_p16): Likewise.
4640 (vld3q_dup_s32): Likewise.
4641 (vld3q_dup_s64): Likewise.
4642 (vld3q_dup_u8): Likewise.
4643 (vld3q_dup_u16): Likewise.
4644 (vld3q_dup_u32): Likewise.
4645 (vld3q_dup_u64): Likewise.
4646 (vld3q_dup_f16): Likewise.
4647 (vld3q_dup_f32): Likewise.
4648 (vld3q_dup_f64): Likewise.
4649 (vld3q_dup_p64): Likewise.
4650 (vld4_dup_s64): Likewise.
4651 (vld4_dup_u64): Likewise.
4652 (vld4_dup_f64): Likewise.
4653 (vld4_dup_s8): Likewise.
4654 (vld4_dup_p8): Likewise.
4655 (vld4_dup_s16): Likewise.
4656 (vld4_dup_p16): Likewise.
4657 (vld4_dup_s32): Likewise.
4658 (vld4_dup_u8): Likewise.
4659 (vld4_dup_u16): Likewise.
4660 (vld4_dup_u32): Likewise.
4661 (vld4_dup_f16): Likewise.
4662 (vld4_dup_f32): Likewise.
4663 (vld4_dup_p64): Likewise.
4664 (vld4q_dup_s8): Likewise.
4665 (vld4q_dup_p8): Likewise.
4666 (vld4q_dup_s16): Likewise.
4667 (vld4q_dup_p16): Likewise.
4668 (vld4q_dup_s32): Likewise.
4669 (vld4q_dup_s64): Likewise.
4670 (vld4q_dup_u8): Likewise.
4671 (vld4q_dup_u16): Likewise.
4672 (vld4q_dup_u32): Likewise.
4673 (vld4q_dup_u64): Likewise.
4674 (vld4q_dup_f16): Likewise.
4675 (vld4q_dup_f32): Likewise.
4676 (vld4q_dup_f64): Likewise.
4677 (vld4q_dup_p64): Likewise.
4678 (vld2_lane_u8): Likewise.
4679 (vld2_lane_u16): Likewise.
4680 (vld2_lane_u32): Likewise.
4681 (vld2_lane_u64): Likewise.
4682 (vld2_lane_s8): Likewise.
4683 (vld2_lane_s16): Likewise.
4684 (vld2_lane_s32): Likewise.
4685 (vld2_lane_s64): Likewise.
4686 (vld2_lane_f16): Likewise.
4687 (vld2_lane_f32): Likewise.
4688 (vld2_lane_f64): Likewise.
4689 (vld2_lane_p8): Likewise.
4690 (vld2_lane_p16): Likewise.
4691 (vld2_lane_p64): Likewise.
4692 (vld2q_lane_u8): Likewise.
4693 (vld2q_lane_u16): Likewise.
4694 (vld2q_lane_u32): Likewise.
4695 (vld2q_lane_u64): Likewise.
4696 (vld2q_lane_s8): Likewise.
4697 (vld2q_lane_s16): Likewise.
4698 (vld2q_lane_s32): Likewise.
4699 (vld2q_lane_s64): Likewise.
4700 (vld2q_lane_f16): Likewise.
4701 (vld2q_lane_f32): Likewise.
4702 (vld2q_lane_f64): Likewise.
4703 (vld2q_lane_p8): Likewise.
4704 (vld2q_lane_p16): Likewise.
4705 (vld2q_lane_p64): Likewise.
4706 (vld3_lane_u8): Likewise.
4707 (vld3_lane_u16): Likewise.
4708 (vld3_lane_u32): Likewise.
4709 (vld3_lane_u64): Likewise.
4710 (vld3_lane_s8): Likewise.
4711 (vld3_lane_s16): Likewise.
4712 (vld3_lane_s32): Likewise.
4713 (vld3_lane_s64): Likewise.
4714 (vld3_lane_f16): Likewise.
4715 (vld3_lane_f32): Likewise.
4716 (vld3_lane_f64): Likewise.
4717 (vld3_lane_p8): Likewise.
4718 (vld3_lane_p16): Likewise.
4719 (vld3_lane_p64): Likewise.
4720 (vld3q_lane_u8): Likewise.
4721 (vld3q_lane_u16): Likewise.
4722 (vld3q_lane_u32): Likewise.
4723 (vld3q_lane_u64): Likewise.
4724 (vld3q_lane_s8): Likewise.
4725 (vld3q_lane_s16): Likewise.
4726 (vld3q_lane_s32): Likewise.
4727 (vld3q_lane_s64): Likewise.
4728 (vld3q_lane_f16): Likewise.
4729 (vld3q_lane_f32): Likewise.
4730 (vld3q_lane_f64): Likewise.
4731 (vld3q_lane_p8): Likewise.
4732 (vld3q_lane_p16): Likewise.
4733 (vld3q_lane_p64): Likewise.
4734 (vld4_lane_u8): Likewise.
4735 (vld4_lane_u16): Likewise.
4736 (vld4_lane_u32): Likewise.
4737 (vld4_lane_u64): Likewise.
4738 (vld4_lane_s8): Likewise.
4739 (vld4_lane_s16): Likewise.
4740 (vld4_lane_s32): Likewise.
4741 (vld4_lane_s64): Likewise.
4742 (vld4_lane_f16): Likewise.
4743 (vld4_lane_f32): Likewise.
4744 (vld4_lane_f64): Likewise.
4745 (vld4_lane_p8): Likewise.
4746 (vld4_lane_p16): Likewise.
4747 (vld4_lane_p64): Likewise.
4748 (vld4q_lane_u8): Likewise.
4749 (vld4q_lane_u16): Likewise.
4750 (vld4q_lane_u32): Likewise.
4751 (vld4q_lane_u64): Likewise.
4752 (vld4q_lane_s8): Likewise.
4753 (vld4q_lane_s16): Likewise.
4754 (vld4q_lane_s32): Likewise.
4755 (vld4q_lane_s64): Likewise.
4756 (vld4q_lane_f16): Likewise.
4757 (vld4q_lane_f32): Likewise.
4758 (vld4q_lane_f64): Likewise.
4759 (vld4q_lane_p8): Likewise.
4760 (vld4q_lane_p16): Likewise.
4761 (vld4q_lane_p64): Likewise.
4762 (vqtbl2_s8): Likewise.
4763 (vqtbl2_u8): Likewise.
4764 (vqtbl2_p8): Likewise.
4765 (vqtbl2q_s8): Likewise.
4766 (vqtbl2q_u8): Likewise.
4767 (vqtbl2q_p8): Likewise.
4768 (vqtbl3_s8): Likewise.
4769 (vqtbl3_u8): Likewise.
4770 (vqtbl3_p8): Likewise.
4771 (vqtbl3q_s8): Likewise.
4772 (vqtbl3q_u8): Likewise.
4773 (vqtbl3q_p8): Likewise.
4774 (vqtbl4_s8): Likewise.
4775 (vqtbl4_u8): Likewise.
4776 (vqtbl4_p8): Likewise.
4777 (vqtbl4q_s8): Likewise.
4778 (vqtbl4q_u8): Likewise.
4779 (vqtbl4q_p8): Likewise.
4780 (vqtbx2_s8): Likewise.
4781 (vqtbx2_u8): Likewise.
4782 (vqtbx2_p8): Likewise.
4783 (vqtbx2q_s8): Likewise.
4784 (vqtbx2q_u8): Likewise.
4785 (vqtbx2q_p8): Likewise.
4786 (vqtbx3_s8): Likewise.
4787 (vqtbx3_u8): Likewise.
4788 (vqtbx3_p8): Likewise.
4789 (vqtbx3q_s8): Likewise.
4790 (vqtbx3q_u8): Likewise.
4791 (vqtbx3q_p8): Likewise.
4792 (vqtbx4_s8): Likewise.
4793 (vqtbx4_u8): Likewise.
4794 (vqtbx4_p8): Likewise.
4795 (vqtbx4q_s8): Likewise.
4796 (vqtbx4q_u8): Likewise.
4797 (vqtbx4q_p8): Likewise.
4798 (vst1_s64_x2): Likewise.
4799 (vst1_u64_x2): Likewise.
4800 (vst1_f64_x2): Likewise.
4801 (vst1_s8_x2): Likewise.
4802 (vst1_p8_x2): Likewise.
4803 (vst1_s16_x2): Likewise.
4804 (vst1_p16_x2): Likewise.
4805 (vst1_s32_x2): Likewise.
4806 (vst1_u8_x2): Likewise.
4807 (vst1_u16_x2): Likewise.
4808 (vst1_u32_x2): Likewise.
4809 (vst1_f16_x2): Likewise.
4810 (vst1_f32_x2): Likewise.
4811 (vst1_p64_x2): Likewise.
4812 (vst1q_s8_x2): Likewise.
4813 (vst1q_p8_x2): Likewise.
4814 (vst1q_s16_x2): Likewise.
4815 (vst1q_p16_x2): Likewise.
4816 (vst1q_s32_x2): Likewise.
4817 (vst1q_s64_x2): Likewise.
4818 (vst1q_u8_x2): Likewise.
4819 (vst1q_u16_x2): Likewise.
4820 (vst1q_u32_x2): Likewise.
4821 (vst1q_u64_x2): Likewise.
4822 (vst1q_f16_x2): Likewise.
4823 (vst1q_f32_x2): Likewise.
4824 (vst1q_f64_x2): Likewise.
4825 (vst1q_p64_x2): Likewise.
4826 (vst1_s64_x3): Likewise.
4827 (vst1_u64_x3): Likewise.
4828 (vst1_f64_x3): Likewise.
4829 (vst1_s8_x3): Likewise.
4830 (vst1_p8_x3): Likewise.
4831 (vst1_s16_x3): Likewise.
4832 (vst1_p16_x3): Likewise.
4833 (vst1_s32_x3): Likewise.
4834 (vst1_u8_x3): Likewise.
4835 (vst1_u16_x3): Likewise.
4836 (vst1_u32_x3): Likewise.
4837 (vst1_f16_x3): Likewise.
4838 (vst1_f32_x3): Likewise.
4839 (vst1_p64_x3): Likewise.
4840 (vst1q_s8_x3): Likewise.
4841 (vst1q_p8_x3): Likewise.
4842 (vst1q_s16_x3): Likewise.
4843 (vst1q_p16_x3): Likewise.
4844 (vst1q_s32_x3): Likewise.
4845 (vst1q_s64_x3): Likewise.
4846 (vst1q_u8_x3): Likewise.
4847 (vst1q_u16_x3): Likewise.
4848 (vst1q_u32_x3): Likewise.
4849 (vst1q_u64_x3): Likewise.
4850 (vst1q_f16_x3): Likewise.
4851 (vst1q_f32_x3): Likewise.
4852 (vst1q_f64_x3): Likewise.
4853 (vst1q_p64_x3): Likewise.
4854 (vst1_s8_x4): Likewise.
4855 (vst1q_s8_x4): Likewise.
4856 (vst1_s16_x4): Likewise.
4857 (vst1q_s16_x4): Likewise.
4858 (vst1_s32_x4): Likewise.
4859 (vst1q_s32_x4): Likewise.
4860 (vst1_u8_x4): Likewise.
4861 (vst1q_u8_x4): Likewise.
4862 (vst1_u16_x4): Likewise.
4863 (vst1q_u16_x4): Likewise.
4864 (vst1_u32_x4): Likewise.
4865 (vst1q_u32_x4): Likewise.
4866 (vst1_f16_x4): Likewise.
4867 (vst1q_f16_x4): Likewise.
4868 (vst1_f32_x4): Likewise.
4869 (vst1q_f32_x4): Likewise.
4870 (vst1_p8_x4): Likewise.
4871 (vst1q_p8_x4): Likewise.
4872 (vst1_p16_x4): Likewise.
4873 (vst1q_p16_x4): Likewise.
4874 (vst1_s64_x4): Likewise.
4875 (vst1_u64_x4): Likewise.
4876 (vst1_p64_x4): Likewise.
4877 (vst1q_s64_x4): Likewise.
4878 (vst1q_u64_x4): Likewise.
4879 (vst1q_p64_x4): Likewise.
4880 (vst1_f64_x4): Likewise.
4881 (vst1q_f64_x4): Likewise.
4882 (vst2_s64): Likewise.
4883 (vst2_u64): Likewise.
4884 (vst2_f64): Likewise.
4885 (vst2_s8): Likewise.
4886 (vst2_p8): Likewise.
4887 (vst2_s16): Likewise.
4888 (vst2_p16): Likewise.
4889 (vst2_s32): Likewise.
4890 (vst2_u8): Likewise.
4891 (vst2_u16): Likewise.
4892 (vst2_u32): Likewise.
4893 (vst2_f16): Likewise.
4894 (vst2_f32): Likewise.
4895 (vst2_p64): Likewise.
4896 (vst2q_s8): Likewise.
4897 (vst2q_p8): Likewise.
4898 (vst2q_s16): Likewise.
4899 (vst2q_p16): Likewise.
4900 (vst2q_s32): Likewise.
4901 (vst2q_s64): Likewise.
4902 (vst2q_u8): Likewise.
4903 (vst2q_u16): Likewise.
4904 (vst2q_u32): Likewise.
4905 (vst2q_u64): Likewise.
4906 (vst2q_f16): Likewise.
4907 (vst2q_f32): Likewise.
4908 (vst2q_f64): Likewise.
4909 (vst2q_p64): Likewise.
4910 (vst3_s64): Likewise.
4911 (vst3_u64): Likewise.
4912 (vst3_f64): Likewise.
4913 (vst3_s8): Likewise.
4914 (vst3_p8): Likewise.
4915 (vst3_s16): Likewise.
4916 (vst3_p16): Likewise.
4917 (vst3_s32): Likewise.
4918 (vst3_u8): Likewise.
4919 (vst3_u16): Likewise.
4920 (vst3_u32): Likewise.
4921 (vst3_f16): Likewise.
4922 (vst3_f32): Likewise.
4923 (vst3_p64): Likewise.
4924 (vst3q_s8): Likewise.
4925 (vst3q_p8): Likewise.
4926 (vst3q_s16): Likewise.
4927 (vst3q_p16): Likewise.
4928 (vst3q_s32): Likewise.
4929 (vst3q_s64): Likewise.
4930 (vst3q_u8): Likewise.
4931 (vst3q_u16): Likewise.
4932 (vst3q_u32): Likewise.
4933 (vst3q_u64): Likewise.
4934 (vst3q_f16): Likewise.
4935 (vst3q_f32): Likewise.
4936 (vst3q_f64): Likewise.
4937 (vst3q_p64): Likewise.
4938 (vst4_s64): Likewise.
4939 (vst4_u64): Likewise.
4940 (vst4_f64): Likewise.
4941 (vst4_s8): Likewise.
4942 (vst4_p8): Likewise.
4943 (vst4_s16): Likewise.
4944 (vst4_p16): Likewise.
4945 (vst4_s32): Likewise.
4946 (vst4_u8): Likewise.
4947 (vst4_u16): Likewise.
4948 (vst4_u32): Likewise.
4949 (vst4_f16): Likewise.
4950 (vst4_f32): Likewise.
4951 (vst4_p64): Likewise.
4952 (vst4q_s8): Likewise.
4953 (vst4q_p8): Likewise.
4954 (vst4q_s16): Likewise.
4955 (vst4q_p16): Likewise.
4956 (vst4q_s32): Likewise.
4957 (vst4q_s64): Likewise.
4958 (vst4q_u8): Likewise.
4959 (vst4q_u16): Likewise.
4960 (vst4q_u32): Likewise.
4961 (vst4q_u64): Likewise.
4962 (vst4q_f16): Likewise.
4963 (vst4q_f32): Likewise.
4964 (vst4q_f64): Likewise.
4965 (vst4q_p64): Likewise.
4966 (vtbx4_s8): Likewise.
4967 (vtbx4_u8): Likewise.
4968 (vtbx4_p8): Likewise.
4969 (vld1_bf16_x2): Likewise.
4970 (vld1q_bf16_x2): Likewise.
4971 (vld1_bf16_x3): Likewise.
4972 (vld1q_bf16_x3): Likewise.
4973 (vld1_bf16_x4): Likewise.
4974 (vld1q_bf16_x4): Likewise.
4975 (vld2_bf16): Likewise.
4976 (vld2q_bf16): Likewise.
4977 (vld2_dup_bf16): Likewise.
4978 (vld2q_dup_bf16): Likewise.
4979 (vld3_bf16): Likewise.
4980 (vld3q_bf16): Likewise.
4981 (vld3_dup_bf16): Likewise.
4982 (vld3q_dup_bf16): Likewise.
4983 (vld4_bf16): Likewise.
4984 (vld4q_bf16): Likewise.
4985 (vld4_dup_bf16): Likewise.
4986 (vld4q_dup_bf16): Likewise.
4987 (vst1_bf16_x2): Likewise.
4988 (vst1q_bf16_x2): Likewise.
4989 (vst1_bf16_x3): Likewise.
4990 (vst1q_bf16_x3): Likewise.
4991 (vst1_bf16_x4): Likewise.
4992 (vst1q_bf16_x4): Likewise.
4993 (vst2_bf16): Likewise.
4994 (vst2q_bf16): Likewise.
4995 (vst3_bf16): Likewise.
4996 (vst3q_bf16): Likewise.
4997 (vst4_bf16): Likewise.
4998 (vst4q_bf16): Likewise.
4999 (vld2_lane_bf16): Likewise.
5000 (vld2q_lane_bf16): Likewise.
5001 (vld3_lane_bf16): Likewise.
5002 (vld3q_lane_bf16): Likewise.
5003 (vld4_lane_bf16): Likewise.
5004 (vld4q_lane_bf16): Likewise.
5005 (vst2_lane_bf16): Likewise.
5006 (vst2q_lane_bf16): Likewise.
5007 (vst3_lane_bf16): Likewise.
5008 (vst3q_lane_bf16): Likewise.
5009 (vst4_lane_bf16): Likewise.
5010 (vst4q_lane_bf16): Likewise.
5011 * config/aarch64/geniterators.sh: Modify iterator regex to
5012 match new vector-tuple modes.
5013 * config/aarch64/iterators.md (insn_count): Extend mode
5014 attribute with vector-tuple type information.
5016 (Vendreg): Likewise.
5019 (VSTRUCT_2D): New mode iterator.
5020 (VSTRUCT_2DNX): Likewise.
5021 (VSTRUCT_2DX): Likewise.
5022 (VSTRUCT_2Q): Likewise.
5023 (VSTRUCT_2QD): Likewise.
5024 (VSTRUCT_3D): Likewise.
5025 (VSTRUCT_3DNX): Likewise.
5026 (VSTRUCT_3DX): Likewise.
5027 (VSTRUCT_3Q): Likewise.
5028 (VSTRUCT_3QD): Likewise.
5029 (VSTRUCT_4D): Likewise.
5030 (VSTRUCT_4DNX): Likewise.
5031 (VSTRUCT_4DX): Likewise.
5032 (VSTRUCT_4Q): Likewise.
5033 (VSTRUCT_4QD): Likewise.
5034 (VSTRUCT_D): Likewise.
5035 (VSTRUCT_Q): Likewise.
5036 (VSTRUCT_QD): Likewise.
5037 (VSTRUCT_ELT): New mode attribute.
5038 (vstruct_elt): Likewise.
5039 * genmodes.c (VECTOR_MODE): Add default prefix and order
5041 (VECTOR_MODE_WITH_PREFIX): Define.
5042 (make_vector_mode): Add mode prefix and order parameters.
5044 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5046 * expmed.c (extract_bit_field_1): Ensure modes are tieable.
5048 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5050 * expr.c (emit_group_load_1): Remove historic workaround.
5052 2021-11-04 Jonathan Wright <jonathan.wright@arm.com>
5054 * config/aarch64/aarch64-builtins.c (aarch64_init_simd_builtins):
5055 Factor out main loop to...
5056 (aarch64_init_simd_builtin_functions): This new function.
5057 (register_tuple_type): Define.
5058 (aarch64_scalar_builtin_type_p): Define.
5059 (handle_arm_neon_h): Define.
5060 * config/aarch64/aarch64-c.c (aarch64_pragma_aarch64): Handle
5061 pragma for arm_neon.h.
5062 * config/aarch64/aarch64-protos.h (aarch64_advsimd_struct_mode_p):
5064 (handle_arm_neon_h): Likewise.
5065 * config/aarch64/aarch64.c (aarch64_advsimd_struct_mode_p):
5066 Remove static modifier.
5067 * config/aarch64/arm_neon.h (target): Remove Neon vector
5068 structure type definitions.
5070 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5072 PR tree-optimization/102943
5073 * gimple-range-path.cc (path_range_query::range_on_path_entry):
5074 Prefer range_of_expr unless there are no statements in the BB.
5076 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5078 PR tree-optimization/102943
5079 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
5080 Avoid duplicate calculation of paths.
5082 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5084 PR tree-optimization/102943
5085 * gimple-range-path.cc (path_range_query::compute_phi_relations):
5086 Only compute relations for SSA names in the import list.
5087 (path_range_query::compute_outgoing_relations): Same.
5088 * gimple-range-path.h (path_range_query::import_p): New.
5090 2021-11-04 Richard Biener <rguenther@suse.de>
5092 PR rtl-optimization/103075
5093 * simplify-rtx.c (exact_int_to_float_conversion_p): Return
5094 false for a VOIDmode operand.
5096 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5098 * config/aarch64/aarch64.c (aarch64_vector_costs): Make member
5099 variables private and add "m_" to their names. Remove is_loop.
5100 (aarch64_record_potential_advsimd_unrolling): Replace with...
5101 (aarch64_vector_costs::record_potential_advsimd_unrolling): ...this.
5102 (aarch64_analyze_loop_vinfo): Replace with...
5103 (aarch64_vector_costs::analyze_loop_vinfo): ...this.
5104 Move initialization of (m_)vec_flags to add_stmt_cost.
5105 (aarch64_analyze_bb_vinfo): Delete.
5106 (aarch64_count_ops): Replace with...
5107 (aarch64_vector_costs::count_ops): ...this.
5108 (aarch64_vector_costs::add_stmt_cost): Set m_vec_flags,
5109 using m_costing_for_scalar to test whether we're costing
5110 scalar or vector code.
5111 (aarch64_adjust_body_cost_sve): Replace with...
5112 (aarch64_vector_costs::adjust_body_cost_sve): ...this.
5113 (aarch64_adjust_body_cost): Replace with...
5114 (aarch64_vector_costs::adjust_body_cost): ...this.
5115 (aarch64_vector_costs::finish_cost): Use m_vinfo instead of is_loop.
5117 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5119 * target.def (targetm.vectorize.init_cost): Replace with...
5120 (targetm.vectorize.create_costs): ...this.
5121 (targetm.vectorize.add_stmt_cost): Delete.
5122 (targetm.vectorize.finish_cost): Likewise.
5123 (targetm.vectorize.destroy_cost_data): Likewise.
5124 * doc/tm.texi.in (TARGET_VECTORIZE_INIT_COST): Replace with...
5125 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5126 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5127 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5128 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5129 * doc/tm.texi: Regenerate.
5130 * tree-vectorizer.h (vec_info::vec_info): Remove target_cost_data
5132 (vec_info::target_cost_data): Change from a void * to a vector_costs *.
5133 (vector_costs): New class.
5134 (init_cost): Take a vec_info and return a vector_costs.
5135 (dump_stmt_cost): Remove data parameter.
5136 (add_stmt_cost): Replace vinfo and data parameters with a vector_costs.
5137 (add_stmt_costs): Likewise.
5138 (finish_cost): Replace data parameter with a vector_costs.
5139 (destroy_cost_data): Delete.
5140 * tree-vectorizer.c (dump_stmt_cost): Remove data argument and
5142 (vec_info::vec_info): Remove the target_cost_data parameter and
5143 initialize the member variable to null instead.
5144 (vec_info::~vec_info): Delete target_cost_data instead of calling
5146 (vector_costs::add_stmt_cost): New function.
5147 (vector_costs::finish_cost): Likewise.
5148 (vector_costs::record_stmt_cost): Likewise.
5149 (vector_costs::adjust_cost_for_freq): Likewise.
5150 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Update
5151 call to vec_info::vec_info.
5152 (vect_compute_single_scalar_iteration_cost): Update after above
5153 changes to costing interface.
5154 (vect_analyze_loop_operations): Likewise.
5155 (vect_estimate_min_profitable_iters): Likewise.
5156 (vect_analyze_loop_2): Initialize LOOP_VINFO_TARGET_COST_DATA
5157 at the start_over point, where it needs to be recreated after
5158 trying without slp. Update retry code accordingly.
5159 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Update call
5160 to vec_info::vec_info.
5161 (vect_slp_analyze_operation): Update after above changes to costing
5163 (vect_bb_vectorization_profitable_p): Likewise.
5164 * targhooks.h (default_init_cost): Replace with...
5165 (default_vectorize_create_costs): ...this.
5166 (default_add_stmt_cost): Delete.
5167 (default_finish_cost, default_destroy_cost_data): Likewise.
5168 * targhooks.c (default_init_cost): Replace with...
5169 (default_vectorize_create_costs): ...this.
5170 (default_add_stmt_cost): Delete, moving logic to vector_costs instead.
5171 (default_finish_cost, default_destroy_cost_data): Delete.
5172 * config/aarch64/aarch64.c (aarch64_vector_costs): Inherit from
5173 vector_costs. Add a constructor.
5174 (aarch64_init_cost): Replace with...
5175 (aarch64_vectorize_create_costs): ...this.
5176 (aarch64_add_stmt_cost): Replace with...
5177 (aarch64_vector_costs::add_stmt_cost): ...this. Use record_stmt_cost
5178 to adjust the cost for inner loops.
5179 (aarch64_finish_cost): Replace with...
5180 (aarch64_vector_costs::finish_cost): ...this.
5181 (aarch64_destroy_cost_data): Delete.
5182 (TARGET_VECTORIZE_INIT_COST): Replace with...
5183 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5184 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5185 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5186 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5187 * config/i386/i386.c (ix86_vector_costs): New structure.
5188 (ix86_init_cost): Replace with...
5189 (ix86_vectorize_create_costs): ...this.
5190 (ix86_add_stmt_cost): Replace with...
5191 (ix86_vector_costs::add_stmt_cost): ...this. Use adjust_cost_for_freq
5192 to adjust the cost for inner loops.
5193 (ix86_finish_cost, ix86_destroy_cost_data): Delete.
5194 (TARGET_VECTORIZE_INIT_COST): Replace with...
5195 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5196 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5197 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5198 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5199 * config/rs6000/rs6000.c (TARGET_VECTORIZE_INIT_COST): Replace with...
5200 (TARGET_VECTORIZE_CREATE_COSTS): ...this.
5201 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
5202 (TARGET_VECTORIZE_FINISH_COST): Likewise.
5203 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
5204 (rs6000_cost_data): Inherit from vector_costs.
5205 Add a constructor. Drop loop_info, cost and costing_for_scalar
5206 in favor of the corresponding vector_costs member variables.
5207 Add "m_" to the names of the remaining member variables and
5209 (rs6000_density_test): Replace with...
5210 (rs6000_cost_data::density_test): ...this.
5211 (rs6000_init_cost): Replace with...
5212 (rs6000_vectorize_create_costs): ...this.
5213 (rs6000_update_target_cost_per_stmt): Replace with...
5214 (rs6000_cost_data::update_target_cost_per_stmt): ...this.
5215 (rs6000_add_stmt_cost): Replace with...
5216 (rs6000_cost_data::add_stmt_cost): ...this. Use adjust_cost_for_freq
5217 to adjust the cost for inner loops.
5218 (rs6000_adjust_vect_cost_per_loop): Replace with...
5219 (rs6000_cost_data::adjust_vect_cost_per_loop): ...this.
5220 (rs6000_finish_cost): Replace with...
5221 (rs6000_cost_data::finish_cost): ...this. Group loop code
5222 into a single if statement and pass the loop_vinfo down to
5224 (rs6000_destroy_cost_data): Delete.
5226 2021-11-04 Aldy Hernandez <aldyh@redhat.com>
5228 PR tree-optimization/103062
5229 PR tree-optimization/103062
5230 * value-pointer-equiv.cc (ssa_equiv_stack::ssa_equiv_stack):
5231 Increase size of allocation by 1.
5232 (ssa_equiv_stack::push_replacement): Grow as needed.
5233 (ssa_equiv_stack::get_replacement): Same.
5234 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
5235 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Remove delete.
5236 (pointer_equiv_analyzer::set_global_equiv): Grow as needed.
5237 (pointer_equiv_analyzer::get_equiv): Same.
5238 (pointer_equiv_analyzer::get_equiv_expr): Remove const.
5239 * value-pointer-equiv.h (class pointer_equiv_analyzer): Remove
5240 const markers. Use auto_vec instead of tree *.
5242 2021-11-04 Richard Biener <rguenther@suse.de>
5244 * tree-ssa-sccvn.c (vn_nary_op_insert_into): Remove always
5245 true parameter and inline valueization.
5246 (vn_nary_op_lookup_1): Inline valueization from ...
5247 (vn_nary_op_compute_hash): ... here and remove it here.
5248 * tree-ssa-pre.c (phi_translate_1): Do not valueize
5249 before vn_nary_lookup_pieces.
5250 (get_representative_for): Mark created SSA representatives
5253 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5255 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): Assert
5256 that the operand has a vector mode. Use subreg_lowpart_offset
5257 to test whether an index corresponds to the low part.
5259 2021-11-04 Richard Sandiford <richard.sandiford@arm.com>
5261 * read-rtl.c: Remove dead !GENERATOR_FILE block.
5262 * read-rtl-function.c (function_reader::consolidate_singletons):
5263 Generate canonical CONST_VECTORs.
5265 2021-11-04 liuhongt <hongtao.liu@intel.com>
5268 * config/i386/predicates.md (reg_or_notreg_operand): Rename to ..
5269 (regmem_or_bitnot_regmem_operand): .. and extend to handle
5271 * config/i386/sse.md (*<avx512>_vpternlog<mode>_1): Force_reg
5272 the operands which are required to be register_operand.
5273 (*<avx512>_vpternlog<mode>_2): Ditto.
5274 (*<avx512>_vpternlog<mode>_3): Ditto.
5275 (*<avx512>_vternlog<mode>_all): Disallow embeded broadcast for
5276 vector HFmodes since it's not a real AVX512FP16 instruction.
5278 2021-11-04 liuhongt <hongtao.liu@intel.com>
5281 * match.pd: simplify (trunc)copysign((extend)a, (extend)b) to
5282 .COPYSIGN (a,b) when a and b are same type as the truncation
5283 type and has less precision than extend type.
5285 2021-11-04 Richard Biener <rguenther@suse.de>
5287 * doc/generic.texi: Update TARGET_MEM_REF and MEM_REF
5290 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
5292 * config/i386/sse.md (VI2_AVX512VNNIBW): New mode iterator.
5293 (VI1_AVX512VNNI): Likewise.
5294 (SDOT_VPDP_SUF): New mode_attr.
5297 (sdot_prod<mode>): Use VI2_AVX512F iterator, expand to
5298 vpdpwssd when VNNI targets available.
5299 (usdot_prod<mode>): New expander for vector QImode.
5301 2021-11-04 Hongyu Wang <hongyu.wang@intel.com>
5303 * config/i386/amxtileintrin.h (_tile_loadd_internal): Add
5304 parentheses to base and stride.
5305 (_tile_stream_loadd_internal): Likewise.
5306 (_tile_stored_internal): Likewise.
5308 2021-11-03 Maciej W. Rozycki <macro@embecosm.com>
5310 * config/riscv/riscv.c (riscv_class_max_nregs): Swap the
5311 arguments to `reg_class_subset_p'.
5313 2021-11-03 Joseph Myers <joseph@codesourcery.com>
5316 * fold-const.c (fold_init): New function.
5317 * fold-const.h (fold_init): New prototype.
5319 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5321 * params.opt (param_vrp2_mode): Make ranger the default for VRP2.
5323 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5325 * gimple-range-fold.cc (fold_using_range::range_of_phi): Don't import
5326 a range from edge if arg == phidef.
5328 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5330 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Test
5331 for constant before any other processing.
5333 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5335 * flag-types.h (RANGER_DEBUG_ALL): Fix values.
5337 2021-11-03 Andrew MacLeod <amacleod@redhat.com>
5339 * gimple-range.cc (gimple_ranger::gimple_ranger): Initialize current_bb.
5340 (gimple_ranger::range_of_expr): Pick up range_on_entry when there is
5341 no explcit context and current_bb is set.
5342 (gimple_ranger::fold_stmt): New.
5343 * gimple-range.h (current_bb, fold_stmt): New.
5344 * tree-vrp.c (rvrp_folder::fold_stmt): Call ranger's fold_stmt.
5346 2021-11-03 Richard Biener <rguenther@suse.de>
5348 PR tree-optimization/102970
5349 * tree-ssa-pre.c (phi_translate_1): Drop clique and base
5350 when translating a MEM_REF over a backedge.
5352 2021-11-03 Philipp Tomsich <philipp.tomsich@vrull.eu>
5354 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New Ampere-1 core.
5355 * config/aarch64/aarch64-tune.md: Regenerate.
5356 * config/aarch64/aarch64-cost-tables.h: Add extra costs for Ampere-1.
5357 * config/aarch64/aarch64.c: Add tuning structures for Ampere-1.
5358 * doc/invoke.texi: Add documentation for Ampere-1 core.
5360 2021-11-03 Wilco Dijkstra <wdijkstr@arm.com>
5362 * config/aarch64/aarch64.md (movsi): Add alternative for GOT accesses.
5364 (ldr_got_small_<mode>): Remove pattern.
5365 (ldr_got_small_sidi): Likewise.
5366 * config/aarch64/aarch64.c (aarch64_load_symref_appropriately): Keep
5367 GOT accesses as moves.
5368 (aarch64_print_operand): Correctly print got_lo12 in L specifier.
5369 (aarch64_mov_operand_p): Make GOT accesses valid move operands.
5370 * config/aarch64/constraints.md: Add new constraint Usw for GOT access.
5372 2021-11-03 Martin Liska <mliska@suse.cz>
5374 * gcov.c (read_line): Remove dead variable.
5376 2021-11-03 Martin Liska <mliska@suse.cz>
5379 * ipa-fnsummary.c (edge_predicate_pool): Rename predicate class to ipa_predicate.
5380 (ipa_fn_summary::account_size_time): Likewise.
5381 (edge_set_predicate): Likewise.
5382 (set_hint_predicate): Likewise.
5383 (add_freqcounting_predicate): Likewise.
5384 (evaluate_conditions_for_known_args): Likewise.
5385 (evaluate_properties_for_edge): Likewise.
5386 (remap_freqcounting_preds_after_dup): Likewise.
5387 (ipa_fn_summary_t::duplicate): Likewise.
5388 (set_cond_stmt_execution_predicate): Likewise.
5389 (set_switch_stmt_execution_predicate): Likewise.
5390 (compute_bb_predicates): Likewise.
5391 (will_be_nonconstant_expr_predicate): Likewise.
5392 (will_be_nonconstant_predicate): Likewise.
5393 (phi_result_unknown_predicate): Likewise.
5394 (predicate_for_phi_result): Likewise.
5395 (analyze_function_body): Likewise.
5396 (compute_fn_summary): Likewise.
5397 (summarize_calls_size_and_time): Likewise.
5398 (estimate_calls_size_and_time): Likewise.
5399 (ipa_call_context::estimate_size_and_time): Likewise.
5400 (remap_edge_summaries): Likewise.
5401 (remap_freqcounting_predicate): Likewise.
5402 (ipa_merge_fn_summary_after_inlining): Likewise.
5403 (ipa_update_overall_fn_summary): Likewise.
5404 (read_ipa_call_summary): Likewise.
5405 (inline_read_section): Likewise.
5406 * ipa-fnsummary.h (struct ipa_freqcounting_predicate): Likewise.
5407 * ipa-predicate.c (predicate::add_clause): Likewise.
5408 (ipa_predicate::add_clause): Likewise.
5409 (predicate::or_with): Likewise.
5410 (ipa_predicate::or_with): Likewise.
5411 (predicate::evaluate): Likewise.
5412 (ipa_predicate::evaluate): Likewise.
5413 (predicate::probability): Likewise.
5414 (ipa_predicate::probability): Likewise.
5415 (dump_condition): Likewise.
5416 (dump_clause): Likewise.
5417 (predicate::dump): Likewise.
5418 (ipa_predicate::dump): Likewise.
5419 (predicate::debug): Likewise.
5420 (ipa_predicate::debug): Likewise.
5421 (predicate::remap_after_duplication): Likewise.
5422 (ipa_predicate::remap_after_duplication): Likewise.
5423 (predicate::remap_after_inlining): Likewise.
5424 (ipa_predicate::remap_after_inlining): Likewise.
5425 (predicate::stream_in): Likewise.
5426 (ipa_predicate::stream_in): Likewise.
5427 (predicate::stream_out): Likewise.
5428 (ipa_predicate::stream_out): Likewise.
5429 (add_condition): Likewise.
5430 * ipa-predicate.h (class predicate): Likewise.
5431 (class ipa_predicate): Likewise.
5432 (add_condition): Likewise.
5434 2021-11-03 Richard Biener <rguenther@suse.de>
5436 * bitmap.h (bitmap_bit_p): Change the return type to bool.
5437 * bitmap.c (bitmap_bit_p): Likewise.
5438 * sbitmap.h (bitmap_bit_p): Likewise.
5439 (bitmap_set_bit): Return whether the bit changed.
5440 (bitmap_clear_bit): Likewise.
5441 * tree-ssa.c (verify_vssa): Make use of the changed state
5442 from bitmap_set_bit.
5444 2021-11-03 Richard Biener <rguenther@suse.de>
5446 PR middle-end/103033
5447 * internal-fn.c (expand_DEFERRED_INIT): Elide the
5448 native_interpret_expr path in favor of folding the
5449 VIEW_CONVERT_EXPR generated when punning the RHS.
5451 2021-11-03 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
5453 * config/s390/s390.c (s390_loop_unroll_adjust): In case of early
5456 2021-11-03 Jan Hubicka <hubicka@ucw.cz>
5459 * ipa-modref.c (callee_to_caller_flags): New function.
5460 (modref_eaf_analysis::analyze_ssa_name): Use it.
5461 (ipa_merge_modref_summary_after_inlining): Fix whitespace.
5463 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5465 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
5468 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
5469 Uroš Bizjak <ubizjak@gmail.com>
5471 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
5472 rotations by non-constant amounts.
5474 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5476 * ipa-modref.c: Fix anonymous namespace placement.
5477 (class modref_eaf_analysis): New class.
5478 (analyze_ssa_name_flags): Turn to ...
5479 (modref_eaf_analysis::analyze_ssa_name): ... this one.
5480 (merge_call_lhs_flags): Turn to ...
5481 (modref_eaf_analysis::merge_call_lhs_flags): .. this one
5482 (modref_eaf_analysis::merge_with_ssa_name): New member function.
5483 (record_escape_points): Turn to ...
5484 (modref_eaf_analysis::record_escape_points): ... this one.
5485 (analyze_parms): Updat
5486 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
5488 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5490 * gimple.c (gimple_call_static_chain_flags): New function.
5491 * gimple.h (gimple_call_static_chain_flags): Declare
5492 * ipa-modref.c (modref_summary::modref_summary): Initialize
5494 (modref_summary_lto::modref_summary_lto): Likewise.
5495 (modref_summary::useful_p): Test static_chain_flags.
5496 (modref_summary_lto::useful_p): Likewise.
5497 (struct modref_summary_lto): Add static_chain_flags.
5498 (modref_summary::dump): Dump static_chain_flags.
5499 (modref_summary_lto::dump): Likewise.
5500 (struct escape_point): Add static_cahin_arg.
5501 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
5502 (analyze_parms): Handle static chains.
5503 (modref_summaries::duplicate): Duplicate static_chain_flags.
5504 (modref_summaries_lto::duplicate): Likewise.
5505 (modref_write): Stream static_chain_flags.
5506 (read_section): Likewise.
5507 (modref_merge_call_site_flags): Handle static_chain_flags.
5508 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
5509 * tree-ssa-structalias.c (handle_rhs_call): Use
5510 gimple_static_chain_flags.
5512 2021-11-02 Richard Biener <rguenther@suse.de>
5514 PR tree-optimization/103029
5515 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
5516 the PHI nodes in the loop maintain their original operand
5519 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
5521 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
5522 (EAF_NOREAD): Renumber.
5523 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
5524 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY
5525 (deref_flags): Likewise.
5526 (modref_lattice::init): Likewise.
5527 (modref_lattice::merge): Likewise.
5528 (merge_call_lhs_flags): Likewise.
5529 (analyze_ssa_name_flags): Likewise.
5530 (modref_merge_call_site_flags): Likewise.
5531 * tree-ssa-structalias.c (handle_call_arg): Likewise.
5533 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
5535 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
5536 check and remove an unused local variable with shNadd/shNadd.uw
5539 2021-11-02 Tamar Christina <tamar.christina@arm.com>
5541 PR tree-optimization/103007
5542 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
5545 2021-11-02 Richard Biener <rguenther@suse.de>
5547 PR middle-end/103038
5548 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
5549 (can_native_interpret_type_p): Likewise.
5550 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
5551 RHS if the LHS is an SSA name.
5553 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
5555 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
5556 * langhooks-def.h (lhd_simulate_record_decl): Declare.
5557 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
5558 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
5559 * langhooks.c (lhd_simulate_record_decl): New function.
5561 2021-11-02 Jakub Jelinek <jakub@redhat.com>
5564 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
5565 modes with TImode inner mode if 32-bit.
5567 2021-11-02 liuhongt <hongtao.liu@intel.com>
5569 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
5570 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
5571 vect_slp_v4si_store_unalign): Document efficient target.
5572 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
5573 vect_slp_v16qi_store_unalign_1): Ditto.
5574 (vect_slp_v2hi_store_align,vect_slp_v2qi_store_align,
5575 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
5576 (struct_4char_block_move, struct_8char_block_move,
5577 struct_16char_block_move): Ditto.
5579 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
5580 Jakub Jelinek <jakub@redhat.com>
5583 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
5584 ix86_expand_ti_to_v1ti): New helper functions.
5585 (ix86_expand_v1ti_shift): Check if the amount operand is an
5586 integer constant, and expand as a TImode shift if it isn't.
5587 (ix86_expand_v1ti_rotate): Check if the amount operand is an
5588 integer constant, and expand as a TImode rotate if it isn't.
5589 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
5590 right shifts of V1TImode quantities.
5591 * config/i386/i386-protos.h (ix86_expand_v1ti_ashift): Prototype.
5592 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
5593 to QImode general_operand, and let the helper functions lower
5594 shifts by non-constant operands, as TImode shifts. Make
5595 conditional on TARGET_64BIT.
5596 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
5597 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
5598 Make conditional on TARGET_64BIT.
5600 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
5602 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
5603 "*tabort_1_plus"): Remove operands non-null check.
5605 2021-11-01 Martin Liska <mliska@suse.cz>
5607 * opt-functions.awk: Add new sanity checking.
5608 * optc-gen.awk: Add new argument to integer_range_info.
5609 * params.opt: Update 2 params which have negative IntegerRange.
5611 2021-11-01 qing zhao <qing.zhao@oracle.com>
5613 * gimplify.c (gimplify_decl_expr): Do not add call to
5614 __builtin_clear_padding when a variable is a gimple register
5615 or it might not have padding.
5616 (gimplify_init_constructor): Likewise.
5618 2021-11-01 Tamar Christina <tamar.christina@arm.com>
5620 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
5621 movi, dup and extract costing fields.
5622 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
5623 thunderx_extra_costs, thunderx2t99_extra_costs,
5624 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
5626 * config/arm/aarch-cost-tables.h (generic_extra_costs,
5627 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
5628 exynosm1_extra_costs, xgene1_extra_costs): Likewise
5629 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
5630 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
5632 2021-11-01 Tamar Christina <tamar.christina@arm.com>
5634 * cse.c (add_to_set): New.
5635 (find_sets_in_insn): Register constants in sets.
5636 (canonicalize_insn): Use auto_vec instead.
5637 (cse_insn): Try materializing using vec_dup.
5638 * rtl.h (simplify_context::simplify_gen_vec_select,
5639 simplify_gen_vec_select): New.
5640 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
5642 2021-11-01 David Malcolm <dmalcolm@redhat.com>
5644 * common.opt (fdiagnostics-escape-format=): New.
5645 (diagnostics_escape_format): New enum.
5646 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
5647 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
5648 * diagnostic-format-json.cc (json_end_diagnostic): Add
5649 "escape-source" attribute.
5650 * diagnostic-show-locus.c
5651 (exploc_with_display_col::exploc_with_display_col): Replace
5652 "tabstop" param with a cpp_char_column_policy and add an "aspect"
5653 param. Use these to compute m_display_col accordingly.
5654 (struct char_display_policy): New struct.
5655 (layout::m_policy): New field.
5656 (layout::m_escape_on_output): New field.
5657 (def_policy): New function.
5658 (make_range): Update for changes to exploc_with_display_col ctor.
5659 (default_print_decoded_ch): New.
5660 (width_per_escaped_byte): New.
5661 (escape_as_bytes_width): New.
5662 (escape_as_bytes_print): New.
5663 (escape_as_unicode_width): New.
5664 (escape_as_unicode_print): New.
5666 (layout::layout): Initialize new fields. Update m_exploc ctor
5667 call for above change to ctor.
5668 (layout::maybe_add_location_range): Update for changes to
5669 exploc_with_display_col ctor.
5670 (layout::calculate_x_offset_display): Update for change to
5672 (layout::print_source_line): Pass policy
5673 to cpp_display_width_computation. Capture cpp_decoded_char when
5674 calling process_next_codepoint. Move printing of source code to
5675 m_policy.m_print_cb.
5676 (line_label::line_label): Pass in policy rather than context.
5677 (layout::print_any_labels): Update for change to line_label ctor.
5678 (get_affected_range): Pass in policy rather than context, updating
5679 calls to location_compute_display_column accordingly.
5680 (get_printed_columns): Likewise, also for cpp_display_width.
5681 (correction::correction): Pass in policy rather than tabstop.
5682 (correction::compute_display_cols): Pass m_policy rather than
5683 m_tabstop to cpp_display_width.
5684 (correction::m_tabstop): Replace with...
5685 (correction::m_policy): ...this.
5686 (line_corrections::line_corrections): Pass in policy rather than
5688 (line_corrections::m_context): Replace with...
5689 (line_corrections::m_policy): ...this.
5690 (line_corrections::add_hint): Update to use m_policy rather than
5692 (line_corrections::add_hint): Likewise.
5693 (layout::print_trailing_fixits): Likewise.
5694 (selftest::test_display_widths): New.
5695 (selftest::test_layout_x_offset_display_utf8): Update to use
5696 policy rather than tabstop.
5697 (selftest::test_one_liner_labels_utf8): Add test of escaping
5699 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
5700 use policy rather than tabstop.
5701 (selftest::test_overlapped_fixit_printing): Likewise.
5702 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
5703 (selftest::test_overlapped_fixit_printing_2): Likewise.
5704 (selftest::test_tab_expansion): Likewise.
5705 (selftest::test_escaping_bytes_1): New.
5706 (selftest::test_escaping_bytes_2): New.
5707 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
5708 * diagnostic.c (diagnostic_initialize): Initialize
5709 context->escape_format.
5710 (convert_column_unit): Update to use default character width policy.
5711 (selftest::test_diagnostic_get_location_text): Likewise.
5712 * diagnostic.h (enum diagnostics_escape_format): New enum.
5713 (diagnostic_context::escape_format): New field.
5714 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
5715 (-fdiagnostics-format=): Add "escape-source" attribute to examples
5716 of JSON output, and document it.
5717 * input.c (location_compute_display_column): Pass in "policy"
5718 rather than "tabstop", passing to
5719 cpp_byte_column_to_display_column.
5720 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
5721 * input.h (class cpp_char_column_policy): New forward decl.
5722 (location_compute_display_column): Pass in "policy" rather than
5724 * opts.c (common_handle_option): Handle
5725 OPT_fdiagnostics_escape_format_.
5726 * selftest.c (temp_source_file::temp_source_file): New ctor
5727 overload taking a size_t.
5728 * selftest.h (temp_source_file::temp_source_file): Likewise.
5730 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
5732 * dbgcnt.def: Add debug counter for back_thread[12] and
5733 back_threadfull[12].
5734 * passes.def: Pass "first" argument to each back threading pass.
5735 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
5737 (back_threader::debug_counter): New.
5738 (back_threader::maybe_register_path): Call debug_counter.
5740 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
5742 * tree-ssa-threadbackward.c (BT_NONE): New.
5745 (back_threader::back_threader): Add flags.
5746 Move loop initialization here.
5747 (back_threader::~back_threader): New.
5748 (back_threader::find_taken_edge_switch): Change solver and ranger
5750 (back_threader::find_taken_edge_cond): Same.
5751 (back_threader::find_paths_to_names): Same.
5752 (back_threader::find_paths): Same.
5753 (back_threader::dump): Same.
5754 (try_thread_blocks): Merge into thread_blocks.
5755 (back_threader::thread_blocks): New.
5756 (do_early_thread_jumps): Merge into thread_blocks.
5757 (do_thread_jumps): Merge into thread_blocks.
5758 (back_threader::thread_through_all_blocks): Remove.
5760 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
5762 PR tree-optimization/103003
5763 * value-relation.cc (dom_oracle::register_relation): If the 2
5764 ssa names are the same, don't register any relation.
5766 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
5768 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
5769 redundant check for calls_eh_return.
5770 * config/aarch64/aarch64.md (*do_return): Likewise.
5772 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
5774 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
5775 duplicate_loop_to_header_edge to
5776 duplicate_loop_body_to_header_edge.
5777 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
5778 * cfghooks.h (struct cfg_hooks): Likewise.
5779 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
5780 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
5781 (clone_loop_to_header_edge): Likewise.
5782 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
5783 * cfgrtl.c (struct cfg_hooks): Likewise.
5784 * doc/loop.texi: Likewise.
5785 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
5786 (unroll_loop_runtime_iterations): Likewise.
5787 (unroll_loop_stupid): Likewise.
5788 (apply_opt_in_copies): Likewise.
5789 * tree-cfg.c (struct cfg_hooks): Likewise.
5790 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
5791 (try_peel_loop): Likewise.
5792 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
5793 (gimple_duplicate_loop_body_to_header_edge): Likewise.
5794 (tree_transform_and_unroll_loop): Likewise.
5795 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
5798 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
5800 * cfgloopmanip.c (loop_version): Refactor loopify to
5801 loop_version. Move condition generation after loopify.
5803 * cfgloopmanip.h (loopify): Delete.
5805 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
5807 * ipa-fnsummary.c: Include tree-dfa.h.
5808 (points_to_local_or_readonly_memory_p): Return true on return
5810 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
5813 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
5815 PR middle-end/102972
5816 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
5817 internal Fortran name; new permit_num_teams arg to permit
5818 omp_get_num_teams and omp_get_team_num.
5819 (scan_omp_1_stmt): Update call to it, add missing call for
5820 reverse offload, and check for strictly nested API calls in teams.
5822 2021-10-30 Jakub Jelinek <jakub@redhat.com>
5824 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
5826 2021-10-29 David Malcolm <dmalcolm@redhat.com>
5828 * selftest.c (assert_streq): Add newlines when emitting non-equal
5831 2021-10-29 David Malcolm <dmalcolm@redhat.com>
5833 * Makefile.in: Fix syntax for reference to LIBDEPS in
5836 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
5838 * doc/install.texi: Bump required minimum DejaGnu version.
5840 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
5841 Andrew MacLeod <amacleod@redhat.com>
5843 * value-relation.cc (path_oracle::killing_def): Add a
5844 self-equivalence so we don't look to the root oracle.
5846 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
5848 * passes.def: Replace the pass_thread_jumps before VRP* with
5849 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
5850 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
5851 Remove hyphen from "thread-full" name.
5853 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
5855 PR tree-optimization/102983
5856 * gimple-range-cache.h (propagate_updated_value): Make public.
5857 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
5858 when processing gcond stmts.
5860 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
5862 * gimple.c (gimple_call_retslot_flags): New function.
5863 * gimple.h (gimple_call_retslot_flags): Declare.
5864 * ipa-modref.c: Include tree-cfg.h.
5865 (struct escape_entry): Turn parm_index to signed.
5866 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
5867 (modref_summary::modref_summary): Initialize retslot_flags.
5868 (struct modref_summary_lto): Likewise.
5869 (modref_summary::useful_p): Check retslot_flags.
5870 (modref_summary_lto::useful_p): Likewise.
5871 (modref_summary::dump): Dump retslot_flags.
5872 (modref_summary_lto::dump): Likewise.
5873 (struct escape_point): Add hidden_args enum.
5874 (analyze_ssa_name_flags): Ignore return slot return;
5875 use gimple_call_retslot_flags.
5876 (record_escape_points): Break out from ...
5877 (analyze_parms): ... here; handle retslot_flags.
5878 (modref_summaries::duplicate): Duplicate retslot_flags.
5879 (modref_summaries_lto::duplicate): Likewise.
5880 (modref_write_escape_summary): Stream parm_index as signed.
5881 (modref_read_escape_summary): Likewise.
5882 (modref_write): Stream retslot_flags.
5883 (read_section): Likewise.
5884 (struct escape_map): Fix typo in comment.
5885 (update_escape_summary_1): Fix whitespace.
5886 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
5887 (modref_merge_call_site_flags): Merge retslot_flags.
5888 * ipa-modref.h (struct modref_summary): Add retslot_flags.
5889 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
5891 2021-10-29 Tamar Christina <tamar.christina@arm.com>
5893 PR tree-optimization/102977
5894 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
5895 (vect_detect_pair_op): Add crosslane check.
5896 (vect_match_call_complex_mla): Remove.
5897 (class complex_mul_pattern): Update comment.
5898 (complex_mul_pattern::matches): Update detection.
5899 (class complex_fma_pattern): Remove.
5900 (complex_fma_pattern::matches): Remove.
5901 (complex_fma_pattern::recognize): Remove.
5902 (complex_fma_pattern::build): Remove.
5903 (class complex_fms_pattern): Update comment.
5904 (complex_fms_pattern::matches): Remove.
5905 (complex_operations_pattern::recognize): Remove complex_fma_pattern
5907 2021-10-29 Jakub Jelinek <jakub@redhat.com>
5909 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
5912 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
5915 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
5916 patterns match and emit for VSX xxpermdi.
5918 2021-10-29 liuhongt <hongtao.liu@intel.com>
5921 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
5923 (V16HF_FTYPE_V16HF): Ditto.
5924 (V32HF_FTYPE_V32HF): Ditto.
5925 (V8HF_FTYPE_V8HF_ROUND): Ditto.
5926 (V16HF_FTYPE_V16HF_ROUND): Ditto.
5927 (V32HF_FTYPE_V32HF_ROUND): Ditto.
5928 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
5929 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
5930 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
5931 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
5932 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
5933 * config/i386/i386-builtins.c
5934 (ix86_builtin_vectorized_function): Enable vectorization for
5935 HFmode FLOOR/CEIL/TRUNC operation.
5936 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
5938 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
5941 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
5942 Andrew MacLeod <amacleod@redhat.com>
5944 * value-relation.cc (path_oracle::killing_def): Walk the
5945 equivalency list and remove SSA from any equivalencies.
5947 2021-10-28 Stafford Horne <shorne@gmail.com>
5949 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
5952 2021-10-28 Jakub Jelinek <jakub@redhat.com>
5954 PR tree-optimization/102951
5955 * fold-const.h (address_compare): Declare.
5956 * fold-const.c (address_compare): New function.
5957 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
5958 address_compare helper.
5959 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
5962 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
5964 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
5966 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
5968 PR tree-optimization/102940
5969 * tree-vrp.c (execute_ranger_vrp): Reset scev.
5971 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
5973 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
5975 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
5977 * configure: Regenerate.
5978 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
5980 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
5982 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
5984 2021-10-28 Richard Biener <rguenther@suse.de>
5987 * fold-const.c (fold_convert_const): Avoid int to float
5988 constant folding with -frounding-math and inexact result.
5989 * simplify-rtx.c (simplify_const_unary_operation): Likewise
5990 for both float and unsigned_float.
5992 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
5994 * tree-ssa-threadbackward.c
5995 (back_threader::find_taken_edge_switch): Use find_case_label_range
5996 instead of find_taken_edge.
5998 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
6000 * tree-ssa-threadbackward.c (class back_threader_registry):
6001 Inherit from back_jt_path_registry.
6002 (back_threader_registry::thread_through_all_blocks): Remove.
6003 (back_threader_registry::register_path): Remove
6004 m_lowlevel_registry prefix.
6006 2021-10-28 Richard Biener <rguenther@suse.de>
6009 * fold-const.c (fold_convert_const_real_from_real): Honor
6010 -frounding-math if the conversion is not exact.
6011 * simplify-rtx.c (simplify_const_unary_operation): Do not
6012 simplify FLOAT_TRUNCATE with sign dependent rounding.
6014 2021-10-28 Richard Biener <rguenther@suse.de>
6016 PR tree-optimization/102949
6017 * tree-vect-stmts.c (ensure_base_align): Look at the
6018 dr_info of a group leader and assert we are looking at
6019 one with analyzed alignment.
6021 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
6024 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
6025 V1T1 mode for unaligned load and store.
6027 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6029 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
6032 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
6034 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
6037 2021-10-28 Alexandre Oliva <oliva@adacore.com>
6039 * common.opt (fharden-compares): New.
6040 (fharden-conditional-branches): New.
6041 * doc/invoke.texi: Document new options.
6042 * gimple-harden-conditionals.cc: New.
6043 * Makefile.in (OBJS): Build it.
6044 * passes.def: Add new passes.
6045 * tree-pass.h (make_pass_harden_compares): Declare.
6046 (make_pass_harden_conditional_branches): Declare.
6048 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
6051 * config/rs6000/altivec.md: Add vsx register constraints.
6052 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
6053 (vsx_xxsel<mode>2): Likewise.
6054 (vsx_xxsel<mode>3): Likewise.
6055 (vsx_xxsel<mode>4): Likewise.
6057 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
6060 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
6061 (altivec_vsel<mode>): ... this and update define.
6062 (*altivec_vsel<mode>_uns): Delete.
6063 (altivec_vsel<mode>2): New define_insn.
6064 (altivec_vsel<mode>3): Likewise.
6065 (altivec_vsel<mode>4): Likewise.
6066 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
6067 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
6069 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
6070 selection instead of per element.
6071 * config/rs6000/vector.md:
6072 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
6073 (vsx_xxsel<mode>): ... this and update define.
6074 (*vsx_xxsel<mode>_uns): Delete.
6075 (vsx_xxsel<mode>2): New define_insn.
6076 (vsx_xxsel<mode>3): Likewise.
6077 (vsx_xxsel<mode>4): Likewise.
6079 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
6081 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
6082 HFmode rsqrt without TARGET_SSE_MATH.
6083 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
6084 btrunc condition to be restricted by -ftrapping-math, adjust
6085 use_rsqrt_p function call.
6086 * config/i386/i386.md (rcphf2): New define_insn.
6087 (rsqrthf2): Likewise.
6088 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
6089 (div<mode>3): New expander for HF mode.
6090 (rsqrt<mode>2): Likewise.
6091 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
6092 (*avx512fp16_vmrsqrtv8hf2): Likewise.
6094 2021-10-27 Saagar Jha <saagar@saagarjha.com>
6096 * config.gcc: Adjust for Darwin21.
6097 * config/darwin-c.c (macosx_version_as_macro): Likewise.
6098 * config/darwin-driver.c (validate_macosx_version_min):
6100 (darwin_find_version_from_kernel): Likewise.
6102 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6104 * gimple-range-path.cc
6105 (path_range_query::range_defined_in_block): Call killing_def.
6107 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6109 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
6110 operands as well as relation.
6111 * gimple-range-path.cc
6112 (path_range_query::compute_ranges_in_block): Compute PHI relations
6113 first. Compute outgoing relations at the end.
6114 (path_range_query::compute_ranges): Remove call to compute_relations.
6115 (path_range_query::compute_relations): Remove.
6116 (path_range_query::maybe_register_phi_relation): New.
6117 (path_range_query::compute_phi_relations): Abstract out
6118 registering one PHI relation to...
6119 (path_range_query::compute_outgoing_relations): ...here.
6120 * gimple-range-path.h (class path_range_query): Remove
6122 Add maybe_register_phi_relation.
6124 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
6125 Andrew MacLeod <amacleod@redhat.com>
6127 * value-relation.cc (path_oracle::killing_def): Kill second
6130 2021-10-27 Martin Jambor <mjambor@suse.cz>
6132 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
6133 profile feedback depending on their local availability.
6135 2021-10-27 Martin Jambor <mjambor@suse.cz>
6137 * params.opt (param_ipa_cp_profile_count_base): New parameter.
6138 * doc/invoke.texi (Optimize Options): Add entry for
6139 ipa-cp-profile-count-base.
6140 * ipa-cp.c (max_count): Replace with base_count, replace all
6141 occurrences too, unless otherwise stated.
6142 (ipcp_cloning_candidate_p): identify mostly-directly called
6143 functions based on their counts, not max_count.
6144 (compare_edge_profile_counts): New function.
6145 (ipcp_propagate_stage): Instead of setting max_count, find the
6146 appropriate edge count in a sorted vector of counts of eligible
6147 edges and make it the base_count.
6149 2021-10-27 Martin Jambor <mjambor@suse.cz>
6151 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
6152 n_nonrec_calls and itself, document all fields.
6153 (init_caller_stats): Initialize the above new fields.
6154 (gather_caller_stats): Gather self-recursive counts and calls number.
6155 (get_info_about_necessary_edges): Gather counts of self-recursive and
6156 other edges bringing in the requested value separately.
6157 (dump_profile_updates): Rework to dump info about a single node only.
6158 (lenient_count_portion_handling): New function.
6159 (struct gather_other_count_struct): New type.
6160 (gather_count_of_non_rec_edges): New function.
6161 (struct desc_incoming_count_struct): New type.
6162 (analyze_clone_icoming_counts): New function.
6163 (adjust_clone_incoming_counts): Likewise.
6164 (update_counts_for_self_gen_clones): Likewise.
6165 (update_profiling_info): Rewritten.
6166 (update_specialized_profile): Adjust call to dump_profile_updates.
6167 (create_specialized_node): Do not update profiling info.
6168 (decide_about_value): New parameter self_gen_clones, either push new
6169 clones into it or updat their profile counts. For self-recursively
6170 generated values, use a portion of the node count instead of count
6171 from self-recursive edges to estimate goodness.
6172 (decide_whether_version_node): Gather clones for self-generated values
6173 in a new vector, update their profiles at once at the end.
6175 2021-10-27 Richard Biener <rguenther@suse.de>
6177 * tree-vectorizer.c (vect_transform_loops): New function,
6179 (try_vectorize_loop_1): ... here. Simplify as epilogues
6180 are now fully handled in the split part.
6182 2021-10-27 Jakub Jelinek <jakub@redhat.com>
6184 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
6185 iterators with pointer types.
6186 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
6188 2021-10-26 Martin Sebor <msebor@redhat.com>
6190 PR tree-optimization/102238
6191 PR tree-optimization/102919
6192 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
6193 (array_elt_at_offset): Move to pointer-query.
6194 (set_aggregate_size_and_offset): New function.
6195 (field_at_offset): Move to pointer-query.
6196 (get_origin_and_offset): Rename...
6197 (get_origin_and_offset_r): this. Add an argument. Make aggregate
6198 handling more robust.
6199 (get_origin_and_offset): New.
6200 (alias_offset): Add an argument.
6201 (format_string): Use subobject size determined by get_origin_and_offset.
6202 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
6203 Improve/correct handling of aggregates.
6204 (array_elt_at_offset): Same.
6205 * pointer-query.h (field_at_offset): Declare.
6206 (array_elt_at_offset): Declare.
6208 2021-10-26 Martin Sebor <msebor@redhat.com>
6210 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
6212 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
6213 (expand_builtin_fork_or_exec): Same.
6214 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
6215 compute_objsize additional arguments.
6216 (inbounds_memaccess_p): Same.
6217 (array_bounds_checker::check_array_bounds): Add an assert. Stash
6218 statement in a member.
6219 (check_array_bounds_dom_walker::before_dom_children): Same.
6220 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
6221 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
6222 (handle_printf_call): Pass a new argument.
6223 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
6224 (check_access): Add an argument and pass it along to callees.
6225 (check_read_access): Make a member function.
6226 (pass_waccess::check_strcat): Pass access_data ctor additional
6228 (pass_waccess::check_strncat): Same.
6229 (pass_waccess::check_stxcpy): Same.
6230 (pass_waccess::check_stxncpy): Same.
6231 (pass_waccess::check_strncmp): Same.
6232 (pass_waccess::check_read_access): Same.
6233 (pass_waccess::check_builtin): Same.
6234 (pass_waccess::maybe_check_access_sizes): Same.
6235 (pass_waccess::maybe_check_dealloc_call): Same.
6236 * gimple-ssa-warn-access.h (check_read_access): Declare a new
6238 * pointer-query.cc (compute_objsize_r): Add an argument.
6239 (gimple_call_return_array): Same.
6240 (gimple_call_alloc_size): Same.
6241 (access_ref::access_ref): Same.
6242 (access_ref::get_ref): Same.
6243 (pointer_query::get_ref): Same.
6244 (handle_min_max_size): Pass an arguments to callees.
6245 (handle_array_ref): Add an argument.
6246 (handle_mem_ref): Same.
6247 (compute_objsize): Same.
6248 * pointer-query.h (struct access_ref): Adjust signatures.
6249 (struct access_data): Same.
6250 (gimple_call_alloc_size): Add an argument.
6251 (gimple_parm_array_size): Same.
6252 (compute_objsize): Same.
6253 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
6254 argument to compute_objsize.
6255 (strlen_pass::maybe_warn_overflow): Same.
6256 (maybe_diag_stxncpy_trunc): Same.
6258 2021-10-26 Martin Sebor <msebor@redhat.com>
6260 PR middle-end/102453
6261 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
6262 (pass_waccess::check_atomic_builtin): Call it.
6264 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
6266 PR rtl-optimization/102842
6267 * lra-constraints.c (match_reload): Ignore out in checking values
6269 (curr_insn_transform): Collect outputs before doing reloads of operands.
6271 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
6274 * config/rs6000/x86intrin.h: Move some included headers to new
6275 headers. Include new immintrin.h instead of those headers.
6276 * config/rs6000/immintrin.h: New.
6277 * config/rs6000/x86gprintrin.h: New.
6278 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
6280 2021-10-26 Richard Biener <rguenther@suse.de>
6282 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
6283 Remove byte_offset parameter.
6284 (vect_create_data_ref_ptr): Likewise.
6285 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6287 (vect_create_data_ref_ptr): Likewise.
6288 * tree-vect-stmts.c (vectorizable_store): Adjust.
6289 (vectorizable_load): Likewise.
6291 2021-10-26 Richard Biener <rguenther@suse.de>
6293 PR tree-optimization/96109
6294 * tree-vectorizer.h (dr_misalignment): Add optional offset
6296 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
6297 offset applied for negative stride accesses.
6298 (vect_enhance_data_refs_alignment): Compute negative stride
6299 access offset and pass it to dr_misalignment.
6300 * tree-vect-stmts.c (get_negative_load_store_type): Pass
6301 negative offset to dr_misalignment.
6302 (get_group_load_store_type): Likewise.
6303 (get_load_store_type): Likewise.
6304 (vectorizable_store): Remove asserts about alignment.
6305 (vectorizable_load): Likewise.
6307 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
6309 PR tree-optimization/102897
6310 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
6312 2021-10-26 Richard Biener <rguenther@suse.de>
6314 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
6315 Take offset in bytes.
6316 (vect_create_data_ref_ptr): Likewise.
6317 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
6318 offset by element size.
6319 (vect_create_cond_for_align_checks): Likewise.
6320 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
6321 (vectorizable_load): Remove duplicate leftover from merge
6324 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
6326 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
6327 function to expand V1TI mode logical shifts by integer constants.
6328 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
6329 mode rotations by integer constants.
6330 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
6331 ix86_expand_v1ti_rotate): Prototype new functions here.
6332 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
6333 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
6335 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6337 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
6338 Avoid threading circular paths.
6340 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6342 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
6343 Attempt to resolve all incoming paths to a PHI.
6344 (back_threader::resolve_def): Always return true for PHIs.
6346 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
6348 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
6349 Always try to resolve path without looking back.
6350 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
6351 edge is a back edge.
6353 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
6355 PR tree-optimization/102789
6356 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
6357 update inits of simd_lane_access.
6359 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6361 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
6362 move to vr-values.c.
6363 (simplify_casted_conds): Move to vrp_folder class.
6364 (execute_vrp): Call via vrp_folder now.
6365 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
6366 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
6367 * vr-values.h (simplify_casted_cond): Add prototype.
6369 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6371 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
6374 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
6376 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
6378 (_mm_testc_si128): Likewise.
6379 (_mm_testnzc_si128): Likewise.
6380 (_mm_blend_ps): Likewise.
6381 (_mm_blendv_ps): Likewise.
6382 (_mm_blend_pd): Likewise.
6383 (_mm_blendv_pd): Likewise.
6384 (_mm_ceil_pd): Likewise.
6385 (_mm_ceil_sd): Likewise.
6386 (_mm_ceil_ps): Likewise.
6387 (_mm_ceil_ss): Likewise.
6388 (_mm_floor_pd): Likewise.
6389 (_mm_floor_sd): Likewise.
6390 (_mm_floor_ps): Likewise.
6391 (_mm_floor_ss): Likewise.
6392 (_mm_minpos_epu16): Likewise.
6393 (_mm_mul_epi32): Likewise.
6394 (_mm_cvtepi8_epi16): Likewise.
6395 (_mm_packus_epi32): Likewise.
6396 (_mm_cmpgt_epi64): Likewise.
6398 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
6400 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
6401 shifts of the mode's smin_value and smax_value when the bit count
6402 operand doesn't have side-effects.
6403 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
6404 when the bit count operand doesn't have side-effects.
6405 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
6406 compile-time evaluation of saturating left shifts with constant
6409 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6411 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
6414 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
6416 * flag-types.h (enum ranger_debug): Adjust values.
6417 * params.opt (ranger_debug): Ditto.
6419 2021-10-25 Martin Jambor <mjambor@suse.cz>
6421 PR tree-optimization/102886
6422 * tree-sra.c (totally_scalarize_subtree): Fix the out of
6425 2021-10-25 Andrew Pinski <apinski@marvell.com>
6427 * tree-ssa-dce.c (simple_dce_from_worklist):
6428 Check stmt_unremovable_because_of_non_call_eh_p also
6429 before removing the statement.
6431 2021-10-25 Richard Biener <rguenther@suse.de>
6433 PR tree-optimization/102905
6434 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
6435 Use vect_supportable_dr_alignment again to determine whether
6436 an access is supported when not aligned.
6438 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6440 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
6443 2021-10-25 Jim Wilson <jimw@sifive.com>
6444 Kito Cheng <kito.cheng@sifive.com>
6445 Jia-Wei Chen <jiawei@iscas.ac.cn>
6446 Shi-Hua Liao <shihua@iscas.ac.cn>
6448 * config/riscv/bitmanip.md (shiftm1): New.
6449 (*bset<mode>): Ditto.
6450 (*bset<mode>_mask): Ditto.
6451 (*bset<mode>_1): Ditto.
6452 (*bset<mode>_1_mask): Ditto.
6453 (*bseti<mode>): Ditto.
6454 (*bclr<mode>): Ditto.
6455 (*bclri<mode>): Ditto.
6456 (*binv<mode>): Ditto.
6457 (*binvi<mode>): Ditto.
6458 (*bext<mode>): Ditto.
6460 * config/riscv/predicates.md (splittable_const_int_operand):
6462 (single_bit_mask_operand): New.
6463 (not_single_bit_mask_operand): Ditto.
6464 (const31_operand): Ditto.
6465 (const63_operand): Ditto.
6466 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
6467 (riscv_output_move): Ditto.
6468 (riscv_print_operand): Handle new operand type: T and S.
6469 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
6471 2021-10-25 Jim Wilson <jimw@sifive.com>
6473 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
6476 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6478 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6480 (riscv_rtx_costs): Ditto.
6482 2021-10-25 Jim Wilson <jimw@sifive.com>
6483 Kito Cheng <kito.cheng@sifive.com>
6484 Jia-Wei Chen <jiawei@iscas.ac.cn>
6486 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
6487 (bitmanip_minmax): New.
6488 (clz_ctz_pcnt): New.
6489 (bitmanip_optab): New.
6490 (bitmanip_insn): New.
6491 (*<optab>_not<mode>): New.
6492 (*xor_not<mode>): New.
6493 (<bitmanip_optab>si2): New.
6494 (*<bitmanip_optab>disi2): New.
6495 (<bitmanip_optab>di2): New.
6496 (*zero_extendhi<GPR:mode>2_bitmanip): New.
6497 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
6498 (*zero_extendhi<GPR:mode>2_zbb): New.
6501 (rotrsi3_sext): New.
6504 (rotlsi3_sext): New.
6505 (bswap<mode>2): New.
6506 (<bitmanip_optab><mode>3): New.
6507 * config/riscv/riscv.md (type): Add rotate.
6508 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
6509 (*zero_extendhi<GPR:mode>2): New.
6510 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
6511 (*extend<SHORT:mode><SUPERQI:mode>2): New.
6513 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6515 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
6517 (riscv_rtx_costs): Ditto.
6519 2021-10-25 Jim Wilson <jimw@sifive.com>
6520 Kito Cheng <kito.cheng@sifive.com>
6521 Jia-Wei Chen <jiawei@iscas.ac.cn>
6523 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
6528 (riscv_rtx_costs): Ditto.
6529 * config/riscv/riscv.md: Include bitmanip.md
6530 (type): Add bitmanip bype.
6531 (zero_extendsidi2): Change to define_expand pattern.
6532 (*zero_extendsidi2_internal): New.
6533 (zero_extendsidi2_shifted): Disable for ZBA.
6535 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
6537 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
6538 Add zba, zbb, zbc and zbs.
6539 (riscv_ext_flag_table): Ditto.
6540 * config/riscv/riscv-opts.h (MASK_ZBA): New.
6544 (TARGET_ZBA): Ditto.
6545 (TARGET_ZBB): Ditto.
6546 (TARGET_ZBC): Ditto.
6547 (TARGET_ZBS): Ditto.
6548 * config/riscv/riscv.opt (riscv_zb_subext): New.
6550 2021-10-25 liuhongt <hongtao.liu@intel.com>
6553 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
6554 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
6556 2021-10-25 Richard Biener <rguenther@suse.de>
6558 PR tree-optimization/102920
6559 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
6560 controlling VN_TOP matching behavior.
6561 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
6562 (vn_phi_eq): Do not optimistically match VN_TOP.
6564 2021-10-25 konglin1 <lingling.kong@intel.com>
6566 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
6567 define_insn_and_split.
6568 (fma_<mode>_fadd_fcmul):Likewise
6569 (fma_<complexopname>_<mode>_fma_zero):Likewise
6571 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
6573 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
6574 TARGET_DISABLE_FPREGS.
6575 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
6576 MASK_DISABLE_FPREGS.
6577 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
6578 cost of hardware integer multiplication.
6579 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
6580 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
6581 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
6582 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
6583 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
6584 SOFT_FLOAT. Revise comment.
6585 (msoft-float): New option.
6587 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
6589 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
6591 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
6593 * doc/install.texi (Specific): Remove obsolete details
6594 around GNU/Linux on Itanium.
6595 (Specific): Remove reference to Windows for Itanium.
6597 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
6599 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
6602 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
6604 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
6606 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
6608 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
6609 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
6610 (one_cmplv1ti2): New define expand.
6612 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
6615 * Makefile.in: Handle dvidir and install-dvi target.
6616 * configure: Regenerate.
6617 * configure.ac: Add install-dvi to target_list.
6619 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
6621 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
6624 2021-10-22 Richard Biener <rguenther@suse.de>
6626 PR tree-optimization/102893
6627 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
6628 test for an exit edge.
6630 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
6631 Andrew MacLeod <amacleod@redhat.com>
6633 * gimple-range-path.cc (path_range_query::compute_phi_relations):
6634 Kill any global relations we may know before registering a new
6636 * value-relation.cc (path_oracle::killing_def): New.
6637 * value-relation.h (path_oracle::killing_def): New.
6639 2021-10-22 Richard Biener <rguenther@suse.de>
6642 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
6644 (vn_phi_lookup): Likewise.
6646 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
6649 * doc/invoke.texi: Document -fcf-protection requires i686 or
6652 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
6654 PR middle-end/102764
6655 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
6657 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
6659 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
6660 macro and all invocations.
6662 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6664 * doc/invoke.texi (ranger-debug): Document.
6665 * flag-types.h (enum ranger_debug): New.
6666 (enum evrp_mode): Remove debug values.
6667 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
6668 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
6669 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
6670 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
6671 (execute_early_vrp): Use evrp-mode directly.
6672 * params.opt (enum evrp_mode): Remove debug values.
6673 (ranger-debug): New.
6674 (ranger-logical-depth): Relocate to be in alphabetical order.
6676 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6678 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
6679 * flag-types.h: (enum vrp_mode): New.
6680 * params.opt: (vrp1-mode, vrp2-mode): New.
6681 * tree-vrp.c (vrp_pass_num): New.
6682 (pass_vrp::pass_vrp): Set pass number.
6683 (pass_vrp::execute): Choose which VRP mode to execute.
6685 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
6687 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
6688 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
6689 * tree-vrp.c (class rvrp_folder): Relocate here.
6690 (execute_ranger_vrp): New.
6691 * tree-vrp.h (execute_ranger_vrp): Export.
6693 2021-10-21 Martin Liska <mliska@suse.cz>
6697 * opts.c (finish_options): Process flag_var_tracking* options
6698 here as they can be adjusted by optimize attribute.
6699 Process also flag_syntax_only and flag_gtoggle.
6700 * toplev.c (process_options): Remove it here.
6701 * common.opt: Make debug_nonbind_markers_p as PerFunction
6702 attribute as it depends on optimization level.
6704 2021-10-21 Martin Jambor <mjambor@suse.cz>
6706 PR tree-optimization/102505
6707 * tree-sra.c (totally_scalarize_subtree): Check that the
6708 encountered field fits within the acces we would like to put it
6711 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
6713 * tree-ssa-threadbackward.c
6714 (back_threader::maybe_register_path): Remove circular paths check.
6716 2021-10-21 Richard Biener <rguenther@suse.de>
6718 * toplev.c (process_options): Move the initial debug_hooks
6720 (toplev::main): ... before the call of the post_options
6723 2021-10-21 Richard Biener <rguenther@suse.de>
6725 PR tree-optimization/102847
6726 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
6727 load cost in the prologue for VMAT_INVARIANT.
6729 2021-10-21 Richard Biener <rguenther@suse.de>
6731 PR tree-optimization/102847
6732 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
6733 handle VMAT_INVARIANT as a splat in the prologue.
6735 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
6738 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
6739 move to use the same logic as HImode.
6741 2021-10-21 Richard Biener <rguenther@suse.de>
6743 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
6744 superfluous gimple_call_nothrow_p check.
6746 2021-10-21 Andrew Pinski <apinski@marvell.com>
6748 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
6749 Mark the ssa-name of the rhs as one to be removed.
6750 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
6751 Call simple_dce_from_worklist at the end to a simple dce.
6753 2021-10-21 Andrew Pinski <apinski@marvell.com>
6755 * tree-cfg.c (maybe_remove_writeonly_store): New function
6756 factored out from ...
6757 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
6759 2021-10-21 Andrew Pinski <apinski@marvell.com>
6761 * tree-cfg.c (execute_fixup_cfg): Remove comment
6762 about standalone pass.
6764 2021-10-21 Andrew Pinski <apinski@marvell.com>
6766 * tree-cfg.c (execute_fixup_cfg): Output when the statement
6767 is removed when it is a write only var.
6769 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
6771 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
6772 Avoid threading circular paths.
6774 2021-10-20 Alex Coplan <alex.coplan@arm.com>
6776 * calls.c (initialize_argument_information): Remove some dead
6777 code, remove handling for function_arg returning const_int.
6778 * doc/tm.texi: Delete documentation for unused target hooks.
6779 * doc/tm.texi.in: Likewise.
6780 * target.def (load_bounds_for_arg): Delete.
6781 (store_bounds_for_arg): Delete.
6782 (load_returned_bounds): Delete.
6783 (store_returned_bounds): Delete.
6784 * targhooks.c (default_load_bounds_for_arg): Delete.
6785 (default_store_bounds_for_arg): Delete.
6786 (default_load_returned_bounds): Delete.
6787 (default_store_returned_bounds): Delete.
6788 * targhooks.h (default_load_bounds_for_arg): Delete.
6789 (default_store_bounds_for_arg): Delete.
6790 (default_load_returned_bounds): Delete.
6791 (default_store_returned_bounds): Delete.
6793 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
6795 * doc/extend.texi (Basic Asm): Clarify that asm is not an
6797 * doc/invoke.texi (-fno-asm): Fix description for C++.
6799 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
6801 * doc/install.texi: Remove link to old.html
6803 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6805 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
6807 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6809 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
6811 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6813 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
6815 * config/aarch64/constraints.md (D1): New.
6817 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6819 * config/aarch64/aarch64-simd.md
6820 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
6821 (*aarch64_topbits_shuffle<mode>_le): New.
6822 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
6823 (*aarch64_topbits_shuffle<mode>_be): New.
6824 * config/aarch64/predicates.md
6825 (aarch64_simd_shift_imm_vec_exact_top): New.
6827 2021-10-20 Tamar Christina <tamar.christina@arm.com>
6829 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
6830 *aarch64_<srn_op>shrn<mode>2_vect_le,
6831 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
6832 * config/aarch64/iterators.md (srn_op): New.
6834 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
6836 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
6838 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
6839 ctx->outer instead of ctx.
6841 2021-10-20 Martin Liska <mliska@suse.cz>
6844 2021-10-19 Martin Liska <mliska@suse.cz>
6847 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
6848 * system.h (strip_whilespaces): New function.
6850 2021-10-20 Martin Liska <mliska@suse.cz>
6853 2021-10-19 Martin Liska <mliska@suse.cz>
6856 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
6859 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
6861 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
6862 Change to 8 with -Os, 11 otherwise.
6864 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
6866 * config/aarch64/aarch64.c (neoversev1_tunings):
6867 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
6868 (neoversen2_tunings): Likewise.
6870 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
6873 * config/aarch64/aarch64.md (frint_pattern): Update comment.
6874 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
6875 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
6876 * config/aarch64/arm_neon.h: Likewise.
6877 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
6879 2021-10-20 Martin Liska <mliska@suse.cz>
6881 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
6882 already declared global variable.
6883 (arm_unwind_emit_set): Use out_file as function argument.
6884 (arm_unwind_emit): Likewise.
6885 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
6886 (machopic_output_stub_indirection): Likewise.
6887 (machopic_output_indirection): Likewise.
6888 (machopic_finish): Likewise.
6889 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
6890 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
6891 * config/ia64/ia64.c (process_epilogue): Likewise.
6892 (process_cfa_adjust_cfa): Likewise.
6893 (process_cfa_register): Likewise.
6894 (process_cfa_offset): Likewise.
6895 (ia64_asm_unwind_emit): Likewise.
6896 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
6898 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
6900 * config/aarch64/aarch64-builtins.c
6901 (aarch64_general_gimple_fold_builtin):
6902 lower vld1 and vst1 variants of the neon builtins
6903 * config/aarch64/aarch64-protos.h:
6904 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
6905 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
6908 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
6910 * match.pd: Generate IFN_TRUNC.
6912 2021-10-20 Richard Biener <rguenther@suse.de>
6914 PR tree-optimization/102853
6915 * tree-data-ref.c (split_constant_offset_1): Bail out
6916 immediately if the expression traps on overflow.
6918 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6920 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
6922 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6924 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
6925 Remove superflous debugging message.
6926 (back_jt_path_registry::duplicate_thread_path): Same.
6928 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6930 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
6932 (back_threader_registry::register_path): Remove m_threaded_paths.
6934 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6936 PR tree-optimization/102814
6937 * doc/invoke.texi: Document --param=max-fsm-thread-length.
6938 * params.opt: Add --param=max-fsm-thread-length.
6939 * tree-ssa-threadbackward.c
6940 (back_threader_profitability::profitable_path_p): Fail on paths
6941 longer than max-fsm-thread-length.
6943 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
6945 PR middle-end/102764
6946 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
6947 statement to reset the current location for the outgoing edges.
6949 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
6950 Richard Biener <rguenther@suse.de>
6952 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
6953 on the same line as the threading cancellation.
6954 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
6955 Avoid threading through loop headers where the path remains in the
6958 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
6960 * tree-object-size.c (unknown): Make into a function. Adjust
6962 (unknown_object_size): Simplify implementation.
6964 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
6965 Kewen Lin <linkw@linux.ibm.com>
6967 * doc/sourcebuild.texi (Effective-Target Keywords): Document
6968 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
6969 vect_slp_v16qi_store, vect_slp_v2hi_store,
6970 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
6972 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
6974 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
6976 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
6980 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
6981 * config/rs6000/pmmintrin.h: Same.
6982 * config/rs6000/smmintrin.h: Same.
6983 * config/rs6000/tmmintrin.h: Same.
6985 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
6987 * config.gcc (extra_headers): Add nmmintrin.h.
6989 2021-10-19 Richard Biener <rguenther@suse.de>
6991 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
6992 misalignment parameter.
6993 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
6994 Do not change DR_MISALIGNMENT in place, instead pass the
6995 adjusted misalignment to vect_supportable_dr_alignment.
6996 (vect_peeling_supportable): Likewise.
6997 (vect_peeling_hash_get_lowest_cost): Adjust.
6998 (vect_enhance_data_refs_alignment): Likewise.
6999 (vect_vfa_access_size): Likewise.
7000 (vect_supportable_dr_alignment): Add misalignment
7001 parameter and simplify.
7002 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
7003 (get_group_load_store_type): Likewise.
7004 (get_load_store_type): Likewise.
7006 2021-10-19 Clément Chigot <clement.chigot@atos.net>
7008 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
7009 __tls_get_addr reference to .text csect.
7011 2021-10-19 Martin Liska <mliska@suse.cz>
7014 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
7017 2021-10-19 Richard Biener <rguenther@suse.de>
7019 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
7020 (vect_get_load_cost): Likewise.
7021 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
7022 alignment support scheme and misalignment as arguments
7024 (vect_get_peeling_costs_all_drs): Compute that info here
7025 and note that we shouldn't need to.
7026 * tree-vect-stmts.c (vect_model_store_cost): Get
7027 alignment support scheme and misalignment as arguments.
7028 (vect_get_store_cost): Likewise.
7029 (vect_model_load_cost): Likewise.
7030 (vect_get_load_cost): Likewise.
7031 (vectorizable_store): Pass down alignment support scheme
7032 and misalignment to costing.
7033 (vectorizable_load): Likewise.
7035 2021-10-19 Richard Biener <rguenther@suse.de>
7037 * tree-vect-stmts.c (get_negative_load_store_type): Add
7038 offset output parameter and initialize it.
7039 (get_group_load_store_type): Likewise.
7040 (get_load_store_type): Likewise.
7041 (vectorizable_store): Use offset as computed by
7042 get_load_store_type.
7043 (vectorizable_load): Likewise.
7045 2021-10-19 Richard Biener <rguenther@suse.de>
7047 PR tree-optimization/102827
7048 * tree-if-conv.c (predicate_statements): Add pe parameter
7049 and use that edge to insert invariant stmts on.
7050 (combine_blocks): Pass through pe.
7051 (tree_if_conversion): Compute the edge to insert invariant
7052 stmts on and pass it along.
7054 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
7057 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
7058 sssubaddv2hi3): Swap the order of operators in vec_concat.
7060 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
7062 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
7063 (altivec_vmrghb_direct): New.
7064 (*altivec_vmrghh_internal): Delete.
7065 (altivec_vmrghh_direct): New.
7066 (*altivec_vmrghw_internal): Delete.
7067 (altivec_vmrghw_direct_<mode>): New.
7068 (altivec_vmrghw_direct): Delete.
7069 (*altivec_vmrglb_internal): Delete.
7070 (altivec_vmrglb_direct): New.
7071 (*altivec_vmrglh_internal): Delete.
7072 (altivec_vmrglh_direct): New.
7073 (*altivec_vmrglw_internal): Delete.
7074 (altivec_vmrglw_direct_<mode>): New.
7075 (altivec_vmrglw_direct): Delete.
7076 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
7077 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
7079 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
7080 (vsx_xxmrglw_<mode>): Adjust.
7082 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
7084 * passes.def: Change threading comment before pass_ccp pass.
7086 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
7088 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
7089 Modify the expansion for sign extension. All extensions are done
7090 within VSX registers.
7092 2021-10-19 Richard Biener <rguenther@suse.de>
7094 * tree-vect-stmts.c (get_group_load_store_type): Add
7095 misalignment output parameter and initialize it.
7096 (get_group_load_store_type): Likewise.
7097 (vectorizable_store): Remove now redundant queries.
7098 (vectorizable_load): Likewise.
7100 2021-10-19 Richard Biener <rguenther@suse.de>
7102 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
7103 check_aligned argument.
7104 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
7106 (vect_peeling_hash_insert): Add supportable_if_not_aligned
7107 argument and do not call vect_supportable_dr_alignment here.
7108 (vect_peeling_supportable): Adjust.
7109 (vect_enhance_data_refs_alignment): Compute whether the
7110 access is supported with different alignment here and
7111 pass that down to vect_peeling_hash_insert.
7112 (vect_vfa_access_size): Adjust.
7113 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
7114 (vect_get_load_cost): Likewise.
7115 (get_negative_load_store_type): Likewise.
7116 (get_group_load_store_type): Likewise.
7117 (get_load_store_type): Likewise.
7119 2021-10-19 Martin Liska <mliska@suse.cz>
7122 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
7123 * system.h (strip_whilespaces): New function.
7125 2021-10-19 dianhong xu <dianhong.xu@intel.com>
7127 * config/i386/avx512fp16intrin.h:
7128 (_mm512_set1_pch): New intrinsic.
7129 * config/i386/avx512fp16vlintrin.h:
7130 (_mm256_set1_pch): New intrinsic.
7131 (_mm_set1_pch): Ditto.
7133 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
7135 PR tree-optimization/102796
7136 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
7137 normally. Return get_tree_range for non gimple_range_ssa_p names.
7138 (gimple_ranger::range_of_stmt): Use get_tree_range for non
7139 gimple_range_ssa_p names.
7141 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
7144 * config/i386/i386.c (ix86_print_operand_address):
7145 Error out for non-address_operand asm operands.
7147 2021-10-18 Richard Biener <rguenther@suse.de>
7149 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
7150 not auto-convert dr_alignment_support to bool.
7151 (vect_peeling_supportable): Likewise.
7152 (vect_enhance_data_refs_alignment): Likewise.
7153 (vect_supportable_dr_alignment): Commonize read/write case.
7154 * tree-vect-stmts.c (vect_get_store_cost): Use
7155 dr_alignment_support, not int, for the vect_supportable_dr_alignment
7157 (vect_get_load_cost): Likewise.
7159 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
7161 * tree-object-size.c (object_sizes_execute): Consolidate LHS
7162 null check and do it early.
7164 2021-10-18 Richard Biener <rguenther@suse.de>
7166 * tree-vect-stmts.c (vectorizable_store): Use the
7167 computed alignment scheme instead of querying
7170 2021-10-18 Richard Biener <rguenther@suse.de>
7172 * tree-vect-stmts.c (vectorizable_store): Do not recompute
7173 alignment scheme already determined by get_load_store_type.
7175 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
7177 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
7178 Clone corresponding pass.
7180 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7182 * combine.c (recog_for_combine): For an unrecognized move/set of
7183 a constant, try force_const_mem to place it in the constant pool.
7184 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
7185 Delete global variables (that are no longer assigned a cost value).
7186 (cse_insn): Simplify logic for deciding whether to place a folded
7187 constant in the constant pool using force_const_mem.
7188 (cse_main): Remove zero initialization of constant_pool_entries_cost
7189 and constant_pool_entries_regcost.
7190 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
7191 fractionally more expensive, when optimizing for speed.
7193 2021-10-18 Martin Liska <mliska@suse.cz>
7195 PR gcov-profile/102746
7196 PR gcov-profile/102747
7197 * gcov.c (main): Return return_code.
7198 (output_gcov_file): Mark return_code when error happens.
7199 (generate_results): Likewise.
7200 (read_graph_file): Likewise.
7201 (read_count_file): Likewise.
7203 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7205 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
7206 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
7207 a popcount, allowing compile-time evaluation/simplification.
7208 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
7210 2021-10-18 Richard Biener <rguenther@suse.de>
7212 PR tree-optimization/102788
7213 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
7215 (vect_pattern_recog_1): Likewise.
7216 (vect_recog_bool_pattern): Continue matching the pattern
7217 even if we do not have a vector type for a conversion
7220 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
7222 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
7223 Evalute SS_NEG and SS_ABS of a constant argument.
7225 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
7228 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
7229 * internal-fn.def: Add entry for COND_NEG internal function.
7230 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
7231 NEGATE_EXPR, COND_NEG mapping.
7232 * optabs.def: Add entry for cond_neg_optab.
7233 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
7234 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
7235 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
7237 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
7239 * tree-ssa-strlen.c (class strlen_pass): Rename from
7241 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
7242 (handle_assign): Same.
7243 (adjust_last_stmt): Same.
7244 (maybe_warn_overflow): Same.
7245 (handle_builtin_strlen): Same.
7246 (handle_builtin_strchr): Same.
7247 (handle_builtin_strcpy): Same.
7248 (handle_builtin_strncat): Same.
7249 (handle_builtin_stxncpy_strncat): Same.
7250 (handle_builtin_memcpy): Same.
7251 (handle_builtin_strcat): Same.
7252 (handle_alloc_call): Same.
7253 (handle_builtin_memset): Same.
7254 (handle_builtin_memcmp): Same.
7255 (get_len_or_size): Same.
7256 (strxcmp_eqz_result): Same.
7257 (handle_builtin_string_cmp): Same.
7258 (handle_pointer_plus): Same.
7259 (count_nonzero_bytes_addr): Same.
7260 (count_nonzero_bytes): Same.
7261 (handle_store): Same.
7262 (strlen_check_and_optimize_call): Same.
7263 (handle_integral_assign): Same.
7264 (check_and_optimize_stmt): Same.
7265 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
7267 2021-10-18 Richard Biener <rguenther@suse.de>
7269 PR tree-optimization/102798
7270 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
7271 Only copy points-to info to newly generated SSA names.
7273 2021-10-18 Martin Liska <mliska@suse.cz>
7275 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
7276 * gcov.c (get_cycles_count): Likewise.
7277 * lto-compress.c (lto_compression_zlib): Likewise.
7278 (lto_uncompression_zlib): Likewise.
7279 * targhooks.c (default_pch_valid_p): Likewise.
7281 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
7283 * tree-pass.h (make_pass_thread_jumps_full): New.
7284 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
7285 (try_thread_blocks): Add resolve and speed arguments.
7286 (pass_thread_jumps::execute): Inline.
7287 (do_early_thread_jumps): New.
7288 (do_thread_jumps): New.
7289 (make_pass_thread_jumps): Move.
7290 (pass_early_thread_jumps::gate): Inline.
7291 (pass_early_thread_jumps::execute): Inline.
7292 (class pass_thread_jumps_full): New.
7294 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
7296 * configure.ac: Treat powerpc64*-*-freebsd* the same as
7297 powerpc64-*-freebsd*.
7298 * configure: Regenerate.
7300 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
7302 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
7303 set_ptr_nonull to set_ptr_nonnull in comments.
7305 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
7307 PR tree-optimization/102720
7308 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
7309 of call used and clobbered sets.
7311 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7313 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
7314 skip gomp and itm when r or nodefaultlibs is given.
7315 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
7316 Update ordering of exclusions, remove duplicate 'v' addition
7317 (collect2 will add this from the main command line).
7319 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7321 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
7322 filelist and framework options in specs instead of code.
7323 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
7324 specs that are really driver ones.
7325 (DARWIN_CC1_SPEC): Likewise.
7326 (CPP_SPEC): Likewise.
7327 (SYSROOT_SPEC): Append space.
7328 (LINK_SYSROOT_SPEC): Remove most driver link specs.
7329 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
7330 (STARTFILE_SPEC): Likewise.
7331 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
7332 (ASM_SPEC): Update driver-related specs.
7333 (ASM_FINAL_SPEC): Likewise.
7334 * config/darwin.opt: Remove now unused option aliases.
7335 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
7336 (ASM_SPEC): Update driver-related specs.
7338 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
7340 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
7341 allow standard_sse_constant_p immediate constants.
7343 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
7345 * config.gcc: Add tm-dwarf2.h to tm_d-file.
7347 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
7349 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
7350 that occur in abnormal phis.
7351 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
7352 abnormal and EH edges.
7353 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
7354 (rvrp_folder::value_on_edge): Ditto.
7355 (rvrp_folder::value_of_stmt): Ditto.
7356 (hybrid_folder::value_of_expr): Ditto for ranger queries.
7357 (hybrid_folder::value_on_edge): Ditto.
7358 (hybrid_folder::value_of_stmt): Ditto.
7359 * value-query.cc (gimple_range_global): Always return a range if
7360 the type is supported.
7362 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
7364 * config/pa/pa.md: Consistently use "rG" constraint for copy
7365 instruction in move patterns.
7367 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
7369 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
7372 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
7373 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
7374 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
7375 (ELFABIVERSION_AMDGPU_HSA_V4): New.
7376 (SET_SRAM_ECC_UNSUPPORTED): New.
7377 (copy_early_debug_info): Create elf flags to match the other objects.
7378 (main): Just let the attribute flags pass through.
7380 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7382 * tree-loop-distribution.c (reduction_var_overflows_first):
7383 Pass the type of reduction_var as first argument as it is also
7384 done for the load type.
7385 (loop_distribution::transform_reduction_loop): Add missing
7386 TREE_TYPE while determining precission of reduction_var.
7388 2021-10-15 Richard Biener <rguenther@suse.de>
7390 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
7392 * toplev.c (process_options): Warn when STABS debugging is
7393 enabled but not the preferred format.
7394 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
7396 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
7398 2021-10-15 Richard Biener <rguenther@suse.de>
7401 * tree-inline.c (copy_bb): Avoid underflowing nargs.
7403 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7405 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
7406 HFmode input operand to HImode.
7407 (ix86_vectorize_vec_perm_const): Likewise.
7408 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
7410 (*avx512f_permvar_truncv8siv8hi_1_hf):
7413 2021-10-15 Richard Biener <rguenther@suse.de>
7415 PR middle-end/102682
7416 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
7417 not create a paradoxical subreg.
7419 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
7421 * config/i386/i386-expand.c (ix86_expand_vector_init):
7422 For half_vector concat for HFmode, handle them like HImode.
7424 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7426 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
7427 (back_threader::back_threader): Same.
7428 (back_threader::resolve_phi): Try to solve without looking back if
7430 (back_threader::find_paths_to_names): Same.
7431 (try_thread_blocks): Pass resolve argument to back threader.
7432 (pass_early_thread_jumps::execute): Same.
7434 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7436 * doc/invoke.texi: Remove max-fsm-thread-length,
7437 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
7439 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
7441 (back_threader_registry::back_threader_registry): Same.
7442 (back_threader_profitability::profitable_path_p): Remove
7443 param_max_fsm_thread-length.
7444 (back_threader_registry::register_path): Remove
7445 m_max_allowable_paths.
7447 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7449 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
7451 (back_threader::~back_threader): Do not release m_path.
7453 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
7455 PR tree-optimization/102738
7456 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
7458 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
7460 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
7461 (omp_mark_declare_variant): Move from c-omp.c.
7462 (omp_context_name_list_prop): Update for Fortran strings.
7463 * omp-general.h (omp_check_context_selector): New prototype.
7464 (omp_mark_declare_variant): New prototype.
7466 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7468 * config/rs6000/rs6000.c (rs6000_density_test): Move early
7469 exit test further up the function.
7471 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7473 * config/arm/arm.c (arm_add_stmt_cost): Delete.
7474 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
7476 2021-10-14 Martin Jambor <mjambor@suse.cz>
7478 * doc/invoke.texi (Optimize Options): Add entry for
7479 ipa-cp-recursive-freq-factor.
7481 2021-10-14 Tamar Christina <tamar.christina@arm.com>
7483 * match.pd: New rule.
7485 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
7488 * ipa-modref-tree.h (modref_access_node::update2):
7489 Also check that parm_offset is unchanged.
7490 (modref_ref_node::insert_access): Fix updating of
7493 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7495 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
7498 2021-10-14 Richard Biener <rguenther@suse.de>
7500 PR tree-optimization/102659
7501 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
7502 rewrite pointer typed undefined overflow operations.
7503 (predicate_statements): Likewise. Make sure to emit invariant
7504 conversions in the preheader.
7505 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
7506 non-empty preheaders.
7507 * tree-data-ref.c (dr_analyze_indices): Strip useless
7508 conversions to the MEM_REF base type.
7510 2021-10-14 Martin Liska <mliska@suse.cz>
7512 * common.opt: Stop using AUTODETECT_VALUE
7513 and use EnabledBy where possible.
7514 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
7515 * toplev.c (AUTODETECT_VALUE): Remove macro.
7516 (process_options): Simplify by using EnabledBy and
7517 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
7520 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
7522 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
7524 (vld1_s32_x3): Likewise.
7526 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7528 PR tree-optimization/102736
7529 PR tree-optimization/102736
7530 * gimple-range-path.cc (path_range_query::range_on_path_entry):
7531 Assert that the requested range is defined outside the path.
7532 (path_range_query::ssa_range_in_phi): Do not call
7533 range_on_path_entry for SSA names that are defined within the
7536 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
7538 * config/darwin-driver.c (darwin_find_version_from_kernel):
7539 Quote internal identifiers and avoid contractions in
7541 (darwin_default_min_version): Likewise.
7542 (darwin_driver_init): Likewise.
7544 2021-10-14 Martin Jambor <mjambor@suse.cz>
7546 * params.opt (ipa-cp-recursive-freq-factor): New.
7547 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
7548 scc_no, self_recursion_generated_level, same_scc and
7549 self_recursion_generated_p.
7550 (ipcp_lattice::add_value): Replaced parameter unlimited with
7551 same_lat_gen_level, usit it determine limit of values and store it to
7553 (ipcp_lattice<valtype>::print): Dump the new fileds.
7554 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
7555 parameter and store it to the new value.
7556 (self_recursively_generated_p): Removed.
7557 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
7558 instead of self_recursively_generated_p, store self generation level
7560 (value_topo_info<valtype>::add_val): Set scc_no.
7561 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
7562 recursively feeding values and self generated values by appropriate
7565 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
7567 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
7568 redundant test for flag_vect_cost_model.
7570 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
7572 * bitmap.c (debug): New overloaded function for auto_bitmaps.
7573 * bitmap.h (debug): Same.
7575 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
7577 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
7578 Adjust builtin call.
7579 (_mm512_mask3_fcmadd_pch): Likewise.
7580 (_mm512_mask_fmadd_pch): Likewise
7581 (_mm512_mask3_fmadd_pch): Likewise
7582 (_mm512_mask_fcmadd_round_pch): Likewise
7583 (_mm512_mask3_fcmadd_round_pch): Likewise
7584 (_mm512_mask_fmadd_round_pch): Likewise
7585 (_mm512_mask3_fmadd_round_pch): Likewise
7586 (_mm_mask_fcmadd_sch): Likewise
7587 (_mm_mask3_fcmadd_sch): Likewise
7588 (_mm_mask_fmadd_sch): Likewise
7589 (_mm_mask3_fmadd_sch): Likewise
7590 (_mm_mask_fcmadd_round_sch): Likewise
7591 (_mm_mask3_fcmadd_round_sch): Likewise
7592 (_mm_mask_fmadd_round_sch): Likewise
7593 (_mm_mask3_fmadd_round_sch): Likewise
7594 (_mm_fcmadd_round_sch): Likewise
7595 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
7596 Adjust builtin call.
7597 (_mm_mask3_fmadd_pch): Likewise
7598 (_mm256_mask_fmadd_pch): Likewise
7599 (_mm256_mask3_fmadd_pch): Likewise
7600 (_mm_mask_fcmadd_pch): Likewise
7601 (_mm_mask3_fcmadd_pch): Likewise
7602 (_mm256_mask_fcmadd_pch): Likewise
7603 (_mm256_mask3_fcmadd_pch): Likewise
7604 * config/i386/i386-builtin.def: Add mask3 builtin for complex
7605 fma, and adjust mask_builtin to corresponding expander.
7606 * config/i386/i386-expand.c (ix86_expand_round_builtin):
7607 Skip eraseing embedded rounding for expanders that emits
7609 * config/i386/sse.md (complexmove): New mode_attr.
7610 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
7611 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
7612 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
7613 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
7614 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
7615 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
7616 * config/i386/subst.md (round_embedded_complex): New subst.
7618 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
7620 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
7621 (cbranchdf4): Likewise.
7622 Add missing move patterns for TARGET_SOFT_FLOAT.
7624 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
7626 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
7627 pseudo intermediate when moving a SUBREG into a hard register,
7628 by checking ix86_hardreg_mov_ok.
7629 (ix86_expand_vector_extract): Store zero-extended SImode
7630 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
7632 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
7633 complex (SUBREG) sets of (vector) hard registers before reload, by
7634 checking ix86_hardreg_mov_ok.
7636 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
7638 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
7640 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
7642 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
7644 2021-10-13 Martin Sebor <msebor@redhat.com>
7646 PR middle-end/102630
7647 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
7649 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
7651 * collect2.c (is_lto_object_file): Release simple-object
7652 resources, close files.
7654 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
7656 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
7657 when checking mve addressing modes.
7658 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
7660 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
7661 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
7662 there is a single Ux constraint.
7664 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
7666 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
7667 multiplication on 32-bit PA 1.1 and 2.0 targets.
7669 2021-10-13 Martin Liska <mliska@suse.cz>
7671 PR gcov-profile/90364
7672 * coverage.c (build_info): Emit checksum to the global variable.
7673 (build_info_type): Add new field for checksum.
7674 (coverage_obj_finish): Pass object_checksum.
7675 (coverage_init): Use 0 as checksum for .gcno files.
7676 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
7677 * gcov.c (read_graph_file): Read also checksum.
7678 * doc/invoke.texi: Document the behaviour change.
7680 2021-10-13 Richard Biener <rguenther@suse.de>
7682 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
7683 start at integer value 2.
7684 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
7685 the iterator for GSI_LAST_NEW_STMT.
7686 (gsi_insert_seq_nodes_after): Likewise.
7687 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
7688 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
7689 arguments to gsi_replace.
7691 2021-10-13 Martin Liska <mliska@suse.cz>
7694 * common.opt: Use EnabledBy instead of detection in
7695 finish_options and process_options.
7696 * opts.c (finish_options): Remove handling of
7697 x_flag_unroll_all_loops.
7698 * toplev.c (process_options): Likewise for flag_web and
7699 flag_rename_registers.
7701 2021-10-13 Richard Biener <rguenther@suse.de>
7703 PR tree-optimization/102659
7704 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
7705 (if_convertible_gimple_assign_stmt_p): Mark the loop for
7706 rewrite when stmts with undefined behavior on integer
7708 (combine_blocks): Predicate also when we need to rewrite stmts.
7709 (predicate_statements): Rewrite affected stmts to something
7710 with well-defined behavior on overflow.
7711 (tree_if_conversion): Initialize need_to_rewrite_undefined.
7713 2021-10-13 Richard Biener <rguenther@suse.de>
7716 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
7719 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
7721 * dwarf2ctf.c: Fix typo in comment.
7723 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
7726 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
7727 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
7728 avoid generating an automatic #define of vec_cpsgn. Use the
7729 correct built-in for V4SFmode that doesn't depend on VSX.
7731 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
7735 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
7736 define_insn_and_split pattern. Add alternative 1 and split it
7737 post reload to insert operand 1 into the low part of operand 0.
7738 (*sub<mode>_1_slp): Ditto.
7739 (*and<mode>_1_slp): Ditto.
7740 (*<any_or:code><mode>_1_slp): Ditto.
7741 (*ashl<mode>3_1_slp): Ditto.
7742 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
7743 (*<any_rotate:insn><mode>3_1_slp): Ditto.
7744 (*neg<mode>_1_slp): New insn_and_split pattern.
7745 (*one_cmpl<mode>_1_slp): Ditto.
7747 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
7749 * doc/install.texi: Update MinGW and mingw-64 Binaries
7752 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
7754 * doc/extend.texi (Common Variable Attributes): Fix typos in
7755 alloc_size documentation.
7757 2021-10-12 Richard Biener <rguenther@suse.de>
7759 PR tree-optimization/102696
7760 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
7761 the tree fatally failed when we reject a BIT_FIELD_REF.
7763 2021-10-12 Richard Biener <rguenther@suse.de>
7765 PR tree-optimization/102572
7766 * tree-vect-stmts.c (vect_build_gather_load_calls): When
7767 gathering the vectorized defs for the mask pass in the
7768 desired mask vector type so invariants will be handled
7771 2021-10-12 Tamar Christina <tamar.christina@arm.com>
7773 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
7774 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
7775 *fcmuo<mode>_nor_combine): New.
7777 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
7780 * config/sparc/sparc-modes.def (OI): New integer mode.
7782 2021-10-12 Jakub Jelinek <jakub@redhat.com>
7784 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
7785 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
7788 2021-10-12 Jakub Jelinek <jakub@redhat.com>
7790 * tree-vectorizer.h (loop_cost_model): New function.
7791 (unlimited_cost_model): Use it.
7792 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
7793 call instead of flag_vect_cost_model.
7794 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
7795 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
7796 of flag_simd_cost_model.
7798 2021-10-12 liuhongt <hongtao.liu@intel.com>
7801 * config/i386/i386-expand.c (emit_reduc_half): Handle
7803 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
7804 (reduc_plus_scal_v4qi): Ditto.
7806 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
7808 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
7809 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
7810 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
7812 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
7814 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
7815 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
7816 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
7817 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
7818 _mm_cvtepu32_epi64): New.
7820 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
7822 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
7823 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
7825 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
7827 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
7828 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
7829 _mm_max_epi32, _mm_max_epu32): New.
7831 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
7833 * ipa-modref-tree.h (struct modref_access_node): Revert
7835 (struct modref_ref_node): Likewise.
7837 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
7839 * ipa-modref-tree.h (modref_tree::global_access_p): New member
7842 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
7843 ignore_stores_eaf_flags): Move to ipa-modref.h
7844 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
7845 (modref_summary::global_memory_read_p): New member function.
7846 (modref_summary::global_memory_written_p): New member function.
7847 * ipa-modref.h (modref_summary::global_memory_read_p,
7848 modref_summary::global_memory_written_p): Declare.
7849 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
7850 ignore_stores_eaf_flags): move here.
7851 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
7853 (handle_rhs_call): Rewrite.
7854 (handle_call_arg): New function.
7855 (determine_global_memory_access): New function.
7856 (handle_const_call): Remove
7857 (handle_pure_call): Remove
7858 (find_func_aliases_for_call): Update use of handle_rhs_call.
7859 (compute_points_to_sets): Handle global memory acccesses
7862 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
7864 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
7865 documentation, mention UBSAN_OPTIONS, similar to what is done
7866 for AddressSanitizer.
7868 2021-10-11 Richard Biener <rguenther@suse.de>
7870 PR middle-end/102683
7871 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
7872 availability before building an integer type for storage
7875 2021-10-11 Richard Biener <rguenther@suse.de>
7877 PR middle-end/101480
7878 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
7881 2021-10-11 Martin Liska <mliska@suse.cz>
7883 * common.opt: Remove Init(2) for some options.
7884 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
7885 use rather OPTION_SET_P.
7887 2021-10-11 Martin Liska <mliska@suse.cz>
7889 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
7890 * flag-types.h (enum ira_region): Likewise.
7891 * toplev.c (process_options): Use OPTION_SET_P instead of
7892 IRA_REGION_AUTODETECT.
7894 2021-10-11 Jakub Jelinek <jakub@redhat.com>
7896 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
7897 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
7899 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7901 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
7902 * config/s390/s390.c (s390_rawmemchr): New function.
7903 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
7904 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
7905 the pattern vfees<mode> from vx-builtins.md.
7906 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
7908 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
7910 * builtins.c (get_memory_rtx): Change to external linkage.
7911 * builtins.h (get_memory_rtx): Add function prototype.
7912 * doc/md.texi (rawmemchr<mode>): Document.
7913 * internal-fn.c (expand_RAWMEMCHR): Define.
7914 * internal-fn.def (RAWMEMCHR): Add.
7915 * optabs.def (rawmemchr_optab): Add.
7916 * tree-loop-distribution.c (find_single_drs): Change return code
7917 behaviour by also returning true if no single store was found
7919 (loop_distribution::classify_partition): Respect the new return
7920 code behaviour of function find_single_drs.
7921 (loop_distribution::execute): Call new function
7922 transform_reduction_loop in order to replace rawmemchr or strlen
7923 like loops by calls into builtins.
7924 (generate_reduction_builtin_1): New function.
7925 (generate_rawmemchr_builtin): New function.
7926 (generate_strlen_builtin_1): New function.
7927 (generate_strlen_builtin): New function.
7928 (generate_strlen_builtin_using_rawmemchr): New function.
7929 (reduction_var_overflows_first): New function.
7930 (determine_reduction_stmt_1): New function.
7931 (determine_reduction_stmt): New function.
7932 (loop_distribution::transform_reduction_loop): New function.
7934 2021-10-11 Martin Liska <mliska@suse.cz>
7936 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
7937 and remove legacy hashing code.
7939 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
7942 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
7943 CONST_INT for BEGIN and END, and use gcc_assert rather than
7946 2021-10-10 Jakub Jelinek <jakub@redhat.com>
7949 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
7950 use MO_VAL_SET if loc is not sp.
7952 2021-10-10 Andrew Pinski <apinski@marvell.com>
7954 PR tree-optimization/102622
7955 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
7956 Swap the order of a?0:pow2cst and a?0:-1 transformations.
7958 2021-10-09 liuhongt <hongtao.liu@intel.com>
7961 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
7963 (ix86_use_mask_cmp_p): Ditto.
7964 (ix86_expand_sse_movcc): Ditto.
7965 * config/i386/i386.md (setcc_hf_mask): New define_insn.
7966 (movhf_mask): Ditto.
7967 (UNSPEC_MOVCC_MASK): New unspec.
7968 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
7970 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
7972 PR rtl-optimization/102627
7973 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
7975 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
7977 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
7980 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
7982 * value-range.cc (irange::debug): New.
7983 * value-range.h (irange::debug): New.
7985 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
7987 PR tree-optimization/102385
7988 * predict.h (change_edge_frequency): Declare.
7989 * predict.c (change_edge_frequency): New function.
7990 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
7992 (tree_unroll_loop): Likewise.
7993 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
7994 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
7996 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
7997 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
7998 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
7999 to retrieve the exit edges. Make all the old profile update code
8000 conditional on !single_loop_p -- the case it was written for --
8001 and use a different approach for the single-loop case.
8003 2021-10-08 Martin Liska <mliska@suse.cz>
8005 * config/alpha/alpha.c (alpha_option_override): Use new macro
8007 * config/arc/arc.c (arc_override_options): Likewise.
8008 * config/arm/arm.c (arm_option_override): Likewise.
8009 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
8010 * config/c6x/c6x.c (c6x_option_override): Likewise.
8011 * config/csky/csky.c: Likewise.
8012 * config/darwin.c (darwin_override_options): Likewise.
8013 * config/frv/frv.c (frv_option_override): Likewise.
8014 * config/i386/djgpp.h: Likewise.
8015 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
8016 (ix86_max_noce_ifcvt_seq_cost): Likewise.
8017 * config/ia64/ia64.c (ia64_option_override): Likewise.
8018 (ia64_override_options_after_change): Likewise.
8019 * config/m32c/m32c.c (m32c_option_override): Likewise.
8020 * config/m32r/m32r.c (m32r_init): Likewise.
8021 * config/m68k/m68k.c (m68k_option_override): Likewise.
8022 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
8023 * config/mips/mips.c (mips_option_override): Likewise.
8024 * config/nios2/nios2.c (nios2_option_override): Likewise.
8025 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
8026 * config/pa/pa.c (pa_option_override): Likewise.
8027 * config/riscv/riscv.c (riscv_option_override): Likewise.
8028 * config/rs6000/aix71.h: Likewise.
8029 * config/rs6000/aix72.h: Likewise.
8030 * config/rs6000/aix73.h: Likewise.
8031 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
8032 (rs6000_override_options_after_change): Likewise.
8033 (rs6000_linux64_override_options): Likewise.
8034 (glibc_supports_ieee_128bit): Likewise.
8035 (rs6000_option_override_internal): Likewise.
8036 (rs6000_file_start): Likewise.
8037 (rs6000_darwin_file_start): Likewise.
8038 * config/rs6000/rtems.h: Likewise.
8039 * config/rs6000/sysv4.h: Likewise.
8040 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
8041 * config/s390/s390.c (s390_option_override): Likewise.
8042 * config/sh/linux.h: Likewise.
8043 * config/sh/netbsd-elf.h (while): Likewise.
8044 * config/sh/sh.c (sh_option_override): Likewise.
8045 * config/sol2.c (solaris_override_options): Likewise.
8046 * config/sparc/sparc.c (sparc_option_override): Likewise.
8047 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
8048 * config/visium/visium.c (visium_option_override): Likewise.
8049 * config/vxworks.c (vxworks_override_options): Likewise.
8050 * lto-opts.c (lto_write_options): Likewise.
8051 * omp-expand.c (expand_omp_simd): Likewise.
8052 * omp-general.c (omp_max_vf): Likewise.
8053 * omp-offload.c (oacc_xform_loop): Likewise.
8054 * opts.h (OPTION_SET_P): Likewise.
8055 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
8056 * toplev.c (process_options): Likewise.
8057 * tree-predcom.c: Likewise.
8058 * tree-sra.c (analyze_all_variable_accesses): Likewise.
8060 2021-10-08 liuhongt <hongtao.liu@intel.com>
8063 * config/i386/i386.c (ix86_optab_supported_p):
8064 Return true for HFmode.
8065 * match.pd: Simplify (_Float16) ceil ((double) x) to
8066 __builtin_ceilf16 (a) when a is _Float16 type and
8067 direct_internal_fn_supported_p.
8069 2021-10-08 liuhongt <hongtao.liu@intel.com>
8072 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
8073 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
8074 (reduc_<code>_scal_v4hi): New.
8076 2021-10-08 liuhongt <hongtao.liu@intel.com>
8078 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
8079 * doc/invoke.texi (Options That Control Optimization): Update
8081 * opts.c (default_options_table): Enable auto-vectorization at
8082 O2 with very-cheap cost model.
8083 (finish_options): Use cheap cost model for
8084 explicit -ftree{,-loop}-vectorize.
8086 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
8088 * ctfc.c (ctfc_delete_container): Free hash table contents.
8090 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
8092 * toplev.c (process_options): Do not warn for GNU GIMPLE.
8094 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
8096 * tree-object-size.c (addr_object_size,
8097 compute_builtin_object_size): Drop PDECL and POFF arguments.
8098 (addr_object_size): Adjust calls.
8099 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
8102 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
8104 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
8105 representing signed and unsigned high-part multiplication resp.
8106 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
8107 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
8108 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
8109 SS_DIV, US_DIV]: Similar simplifications for saturating
8111 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
8112 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
8113 Implement compile-time evaluation for constant operands.
8114 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
8116 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
8117 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
8118 Mention the new smul_highpart and umul_highpart RTX codes.
8119 * doc/invoke.texi: Silence @xref "compilation" warnings.
8121 2021-10-07 Martin Jambor <mjambor@suse.cz>
8124 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
8125 case when the source reference description corresponds to a
8126 referance taken in a function src->caller is inlined to.
8128 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
8131 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
8133 (modref_access_node::try_merge_with): Add sanity check that there
8134 are no redundant entries in the list.
8136 2021-10-07 Richard Biener <rguenther@suse.de>
8138 PR tree-optimization/102608
8139 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
8142 2021-10-07 Martin Liska <mliska@suse.cz>
8144 * toplev.c (toplev::main): Make
8145 save_opt_decoded_options a pointer type
8146 * toplev.h: Likewise.
8148 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8150 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
8151 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
8152 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
8154 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8156 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
8157 Adjust for new -msram-ecc=any behaviour.
8158 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
8159 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
8160 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
8161 (EF_AMDGPU_XNACK_V3): ... this.
8162 (EF_AMDGPU_SRAM_ECC): Rename to ...
8163 (EF_AMDGPU_SRAM_ECC_V3): ... this.
8164 (EF_AMDGPU_FEATURE_XNACK_V4): New.
8165 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
8166 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
8167 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
8168 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
8169 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
8170 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
8171 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
8172 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
8173 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
8174 (SET_XNACK_ON): New.
8175 (SET_XNACK_OFF): New.
8177 (SET_SRAM_ECC_ON): New.
8178 (SET_SRAM_ECC_ANY): New.
8179 (SET_SRAM_ECC_OFF): New.
8180 (TEST_SRAM_ECC_ANY): New.
8181 (TEST_SRAM_ECC_ON): New.
8182 (main): Implement HSACOv4 and -msram-ecc=any.
8184 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
8186 * config.in: Regenerate.
8187 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
8191 (A_FIJI): Rename to ...
8193 (A_900): Rename to ...
8195 (A_906): Rename to ...
8197 (A_908): Rename to ...
8199 (SRAMOPT): New macro.
8200 (ASM_SPEC): Adjust xnack option usage.
8201 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
8202 * configure: Regenerate.
8203 * configure.ac: Detect LLVM assembler dialect.
8205 2021-10-07 Richard Biener <rguenther@suse.de>
8207 * tree-pretty-print.c (dump_generic_node): Do not elide
8208 printing '&' when dumping with -gimple.
8210 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8212 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
8214 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
8215 (adjust_imagpart_expr): Ditto.
8216 * value-range.cc (irange::irange_intersect): Call new routine if
8217 RHS is a single pair.
8218 (irange::intersect): New wide_int version.
8219 * value-range.h (class irange): New prototype.
8221 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8223 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
8224 Add parameter to limit size when recognizing switches.
8225 (gimple_outgoing_range::edge_range_p): Check size limit.
8226 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
8227 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
8228 that exceed the size limit.
8229 (gori_compute::gori_compute): Add initializer.
8230 * params.opt (evrp-switch-limit): New.
8231 * doc/invoke.texi: Update docs.
8233 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8235 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
8236 TYPE_MAX_VALUE instead of creating new trees when possible.
8238 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
8240 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
8241 zero and non-zero more efficently.
8243 2021-10-06 Richard Biener <rguenther@suse.de>
8246 * dumpfile.h (TDF_GIMPLE_VAL): New.
8247 (dump_flag): Re-order and adjust TDF_* flags. Make
8248 the enum uint32_t. Use std::underlying_type in the
8250 (optgroup_flag): Likewise for the operator overloads.
8251 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
8252 in _Literal if TDF_GIMPLE_VAL.
8253 * gimple-pretty-print.c (dump_gimple_assign): Add
8254 TDF_GIMPLE_VAL to flags when dumping operands where only
8255 is_gimple_val are allowed.
8256 (dump_gimple_cond): Likewise.
8258 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
8260 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
8263 2021-10-05 qing zhao <qing.zhao@oracle.com>
8265 PR middle-end/102359
8266 * gimplify.c (gimplify_decl_expr): Not add initialization for an
8267 auto variable when it has been initialized by frontend.
8269 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
8271 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
8274 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
8276 * common/config/avr/avr-common.c (avr_handle_option): Mark
8277 argument as ATTRIBUTE_UNUSED.
8279 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
8281 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
8282 Undefine before redefinition.
8284 2021-10-05 Richard Biener <rguenther@suse.de>
8286 * toplev.c (no_backend): Remove global var.
8287 (process_options): Pass in no_backend, move post_options
8288 langhook call to toplev::main.
8289 (do_compile): Pass in no_backend, move process_options call
8291 (toplev::run_self_tests): Check no_backend at the caller.
8292 (toplev::main): Call post_options and process_options
8293 split out from do_compile, do self-tests only if
8294 no_backend is initialized.
8296 2021-10-05 Richard Biener <rguenther@suse.de>
8298 * tree-cfg.c (dump_function_to_file): Dump the UID of the
8299 function as part of the name when requested.
8300 * tree-pretty-print.c (dump_function_name): Dump the UID when
8301 requested and the langhook produced the actual name.
8303 2021-10-05 Richard Biener <rguenther@suse.de>
8305 PR middle-end/102587
8306 PR middle-end/102285
8307 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
8308 zero-initialization as last resort, use the constant
8309 size as given by the DEFERRED_INIT argument to build
8312 2021-10-04 Marek Polacek <polacek@redhat.com>
8315 * doc/invoke.texi: Document -Warray-compare.
8317 2021-10-04 Richard Biener <rguenther@suse.de>
8319 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
8320 variables are not to be initialized.
8322 2021-10-04 Richard Biener <rguenther@suse.de>
8324 * expr.h (non_mem_decl_p): Declare.
8325 (mem_ref_refers_to_non_mem_p): Likewise.
8326 * expr.c (non_mem_decl_p): Export.
8327 (mem_ref_refers_to_non_mem_p): Likewise.
8328 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
8329 but check the base with mem_ref_refers_to_non_mem_p
8332 2021-10-04 Richard Biener <rguenther@suse.de>
8334 PR tree-optimization/102570
8335 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
8336 we are using clique for the internal function code.
8337 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
8338 internal function code.
8339 (print_vn_reference_ops): Print the internal function code.
8340 (vn_reference_op_compute_hash): Hash it.
8341 (copy_reference_ops_from_call): Record it.
8342 (visit_stmt): Remove the restriction around internal function
8344 (fully_constant_vn_reference_p): Use fold_const_call and handle
8346 (vn_reference_eq): Compare call return types.
8347 * tree-ssa-pre.c (create_expression_by_pieces): Handle
8348 generating calls to internal functions.
8349 (compute_avail): Remove the restriction around internal function
8352 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
8354 PR tree-optimization/102560
8355 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
8356 marker for invalid_range.
8358 2021-10-04 Richard Biener <rguenther@suse.de>
8360 PR middle-end/102587
8361 * internal-fn.c (expand_DEFERRED_INIT): Guard register
8362 initialization path an avoid initializing VLA registers
8365 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
8367 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
8369 2021-10-03 Martin Liska <mliska@suse.cz>
8371 * toplev.c (toplev::main): Check opt_index if it is a part
8374 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
8376 PR tree-optimization/102563
8377 * range-op.cc (operator_lshift::op1_range): Do not clobber
8380 2021-10-02 Martin Liska <mliska@suse.cz>
8382 * toplev.c (toplev::main): save_decoded_options[0] is program
8383 name and so it should be skipped.
8385 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8387 PR tree-optimization/102546
8388 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
8389 X << Y is non-zero implies X is also non-zero.
8391 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8393 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8395 * config/aarch64/aarch64-tune.md: Regenerate.
8396 * doc/invoke.texi: Update docs.
8398 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8400 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8402 * config/aarch64/aarch64-tune.md: Regenerate.
8403 * doc/invoke.texi: Update docs.
8405 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8407 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
8409 * config/aarch64/aarch64-tune.md: Regenerate.
8410 * doc/invoke.texi: Update docs.
8412 2021-10-01 Martin Sebor <msebor@redhat.com>
8415 * doc/invoke.texi (-Waddress): Update.
8416 * gengtype.c (write_types): Avoid -Waddress.
8417 * poly-int.h (POLY_SET_COEFF): Avoid using null.
8419 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
8422 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
8425 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8427 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
8429 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
8431 * gimple-range-path.cc (path_range_query::compute_ranges): Use
8433 * gimple-range-path.h (class path_range_query): Remove shadowed
8435 (path_range_query::get_path_oracle): New.
8437 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8438 Richard Biener <rguenther@suse.de>
8441 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
8442 INT_MIN / -1 division detection from here ...
8443 (-fsanitize=signed-integer-overflow): ... and add it here.
8445 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8447 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
8449 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
8450 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
8451 (AARCH64_ISA_V9): New ISA flag.
8452 * doc/invoke.texi: Update docs.
8454 2021-10-01 Martin Liska <mliska@suse.cz>
8456 * toplev.c (toplev::main): Save decoded optimization options.
8457 * toplev.h (save_opt_decoded_options): New.
8458 * doc/extend.texi: Be more clear about optimize and target
8461 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8463 * explow.c: Include langhooks.h.
8464 (set_stack_check_libfunc): Build a proper function type.
8466 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
8469 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
8470 not return true for external weak function symbols in medium model.
8472 2021-10-01 Jakub Jelinek <jakub@redhat.com>
8474 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
8475 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
8476 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
8477 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
8478 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
8479 OMP_CLAUSE_SCHEDULE_STATIC.
8481 2021-10-01 Richard Biener <rguenther@suse.de>
8483 PR middle-end/102518
8484 * tree-inline.c (setup_one_parameter): Avoid substituting
8485 an invariant into contexts where a GIMPLE register is not valid.
8487 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
8489 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
8490 * config/arm/arm-tables.opt: Regenerate.
8491 * config/arm/arm-tune.md: Regenerate.
8492 * doc/invoke.texi: Update docs.
8494 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
8497 * config/i386/i386.md
8498 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
8501 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
8503 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
8504 omp_{c,re}alloc, fix omp_alloc/omp_free.
8506 2021-09-30 Martin Liska <mliska@suse.cz>
8508 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
8509 asm_out_file and stream directly to MYFILE.
8511 2021-09-30 Richard Biener <rguenther@suse.de>
8513 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
8514 Restore and fix condition under which we apply npeel to
8515 the DRs misalignment value.
8517 2021-09-30 Richard Biener <rguenther@suse.de>
8519 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
8520 Fix npeel check for variable amount of peeling.
8522 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8524 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
8526 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8528 * gimple-range.cc (gimple_ranger::debug): New.
8529 * gimple-range.h (class gimple_ranger): Add debug.
8531 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
8533 PR middle-end/102519
8534 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
8536 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
8539 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
8540 (btf_finalize): Empty the hash_map btf_var_ids.
8542 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
8544 * tree-vrp.c (thread_through_all_blocks): Return bool.
8545 (execute_vrp_threader): Return TODO_* flags.
8546 (pass_data_vrp_threader): Set todo_flags_finish to 0.
8548 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
8550 * timevar.def (TV_TREE_VRP_THREADER): New.
8551 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
8553 2021-09-29 David Faust <david.faust@oracle.com>
8555 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
8557 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
8559 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
8560 new type qualifier enum.
8561 (TYPES_TERNOP_SSSU): Likewise.
8562 (TYPES_TERNOP_PPPU): Likewise.
8563 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
8564 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
8566 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
8567 builtin and remove casts.
8568 (vqtbl1_s8): Likewise.
8569 (vqtbl1q_p8): Likewise.
8570 (vqtbl1q_s8): Likewise.
8571 (vqtbx1_s8): Likewise.
8572 (vqtbx1_p8): Likewise.
8573 (vqtbx1q_s8): Likewise.
8574 (vqtbx1q_p8): Likewise.
8575 (vtbl1_p8): Likewise.
8576 (vtbl2_p8): Likewise.
8577 (vtbx2_p8): Likewise.
8579 2021-09-29 Richard Biener <rguenther@suse.de>
8581 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
8583 (vect_update_misalignment_for_peel): Use it to update
8584 misaligned to the value necessary for an aligned access.
8585 (vect_get_peeling_costs_all_drs): Likewise.
8586 (vect_enhance_data_refs_alignment): Likewise.
8588 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8590 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
8591 emitted operations and adjust heuristic for code size.
8593 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8595 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
8596 emitted operations and adjust heuristic for code size.
8598 2021-09-29 Jakub Jelinek <jakub@redhat.com>
8600 PR middle-end/102504
8601 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
8602 in OMP_SCOPE clauses, not just on worksharing construct clauses.
8604 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
8606 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
8607 (<u>mulditi3): Call <su>muldi3_highpart.
8608 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
8609 (<u>mulsidi3): Call <su>mulsi3_highpart.
8610 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
8612 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
8614 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
8616 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
8618 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
8619 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
8622 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
8624 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
8625 (AARCH64_FL_V8_7): Likewise.
8626 (AARCH64_FL_FOR_ARCH8_7): Likewise.
8627 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
8628 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
8629 * doc/invoke.texi: Document the above.
8631 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8633 * dbgcnt.c (dbg_cnt_counter): New.
8634 * dbgcnt.h (dbg_cnt_counter): New.
8635 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
8636 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
8637 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
8638 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
8641 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8643 * cfgcleanup.c (pass_jump::execute): Check
8644 flag_expensive_optimizations.
8645 (pass_jump_after_combine::gate): Same.
8646 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
8647 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
8648 * tree-ssa-threadupdate.c
8649 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
8652 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
8654 * tree-ssa-reassoc.c (biased_names): New global.
8655 (propagate_bias_p): New function.
8656 (loop_carried_phi): Remove.
8657 (propagate_rank): Propagate bias along single uses.
8658 (get_rank): Update biased_names when needed.
8660 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
8662 * passes.def (pass_reassoc): Rename parameter to early_p.
8663 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
8665 (phi_rank): Don't bias loop-carried phi ranks
8666 before vectorization pass.
8667 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
8668 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
8670 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
8672 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
8674 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
8676 2021-09-28 Jakub Jelinek <jakub@redhat.com>
8679 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
8680 special 80387 instruction XFmode constants if flag_rounding_math.
8682 2021-09-28 Richard Biener <rguenther@suse.de>
8684 PR tree-optimization/100112
8685 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
8686 referece into the hashtable twice in case last_vuse is
8687 different from the original vuse on the stmt.
8689 2021-09-28 Jakub Jelinek <jakub@redhat.com>
8691 PR middle-end/102492
8692 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
8693 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
8694 clauses on SIMD constructs.
8696 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8698 PR tree-optimization/102511
8699 * gimple-range-path.cc (path_range_query::range_on_path_entry):
8700 Return VARYING when nothing found.
8702 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
8705 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
8707 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
8708 (VALID_MMX_REG_MODE): Add V4HFmode.
8709 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
8710 vector mode condition.
8711 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
8713 (function_arg_32): Add V4HFmode.
8714 (function_arg_advance_32): Likewise.
8715 * config/i386/i386.md (mode): Add V4HF/V2HF.
8716 (MODE_SIZE): Likewise.
8717 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
8718 (V_32): Add V2HF mode.
8719 (VHF_32_64): New mode iterator.
8720 (*mov<mode>_internal): Adjust sse alternatives to support
8722 (*mov<mode>_internal): Adjust sse alternatives to support
8724 (<insn><mode>3): New define_insn for add/sub/mul/div.
8726 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
8728 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
8730 (pass_early_thread_jumps::gate): Same.
8731 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
8732 Return if !flag_thread_jumps.
8733 * tree-ssa-threadupdate.c
8734 (jt_path_registry::register_jump_thread): Assert that
8735 flag_thread_jumps is true.
8737 2021-09-28 liuhongt <hongtao.liu@intel.com>
8740 (simplify_context::simplify_binary_operation_1): Relax
8741 condition of simplifying (vec_concat:M (vec_select op0
8742 index0)(vec_select op1 index1)) to allow different modes
8743 between op0 and M, but have same inner mode.
8745 2021-09-28 liuhongt <hongtao.liu@intel.com>
8747 * config/i386/i386-expand.c (emit_reduc_half): Handle
8748 V8HF/V16HF/V32HFmode.
8749 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
8750 (REDUC_SSE_SMINMAX_MODE): Ditto.
8751 (REDUC_PLUS_MODE): Add V16HF and V32HF.
8752 (REDUC_SMINMAX_MODE): Ditto.
8754 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8756 * gimple-range-path.cc
8757 (path_range_query::precompute_ranges_in_block): Rename to...
8758 (path_range_query::compute_ranges_in_block): ...this.
8759 (path_range_query::precompute_ranges): Rename to...
8760 (path_range_query::compute_ranges): ...this.
8761 (path_range_query::precompute_relations): Rename to...
8762 (path_range_query::compute_relations): ...this.
8763 (path_range_query::precompute_phi_relations): Rename to...
8764 (path_range_query::compute_phi_relations): ...this.
8765 * gimple-range-path.h: Rename precompute* to compute*.
8766 * tree-ssa-threadbackward.c
8767 (back_threader::find_taken_edge_switch): Same.
8768 (back_threader::find_taken_edge_cond): Same.
8769 * tree-ssa-threadedge.c
8770 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
8771 (hybrid_jt_state::register_equivs_stmt): Inline...
8772 * tree-ssa-threadedge.h: ...here.
8774 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8776 * tree-vrp.c (lhs_of_dominating_assert): Remove.
8777 (class vrp_jt_state): Remove.
8778 (class vrp_jt_simplifier): Remove.
8779 (vrp_jt_simplifier::simplify): Remove.
8780 (class vrp_jump_threader): Remove.
8781 (vrp_jump_threader::vrp_jump_threader): Remove.
8782 (vrp_jump_threader::~vrp_jump_threader): Remove.
8783 (vrp_jump_threader::before_dom_children): Remove.
8784 (vrp_jump_threader::after_dom_children): Remove.
8786 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8788 * passes.def (pass_vrp_threader): New.
8789 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
8790 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
8791 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
8792 (hybrid_jt_simplifier::simplify): New.
8793 (hybrid_jt_simplifier::compute_ranges_from_state): New.
8794 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
8795 (class hybrid_jt_simplifier): New.
8796 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
8798 (class hybrid_threader): New.
8799 (hybrid_threader::hybrid_threader): New.
8800 (hybrid_threader::~hybrid_threader): New.
8801 (hybrid_threader::before_dom_children): New.
8802 (hybrid_threader::after_dom_children): New.
8803 (execute_vrp_threader): New.
8804 (class pass_vrp_threader): New.
8805 (make_pass_vrp_threader): New.
8807 2021-09-27 Martin Liska <mliska@suse.cz>
8809 * output.h (enum section_flag): New.
8810 (SECTION_FORGET): Remove.
8811 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
8812 (SECTION_STYLE_MASK): Define it based on other enum
8814 * varasm.c (switch_to_section): Remove unused handling of
8817 2021-09-27 Martin Liska <mliska@suse.cz>
8819 * common.opt: Add new variable flag_default_complex_method.
8820 * opts.c (finish_options): Handle flags related to
8821 x_flag_complex_method.
8822 * toplev.c (process_options): Remove option handling related
8823 to flag_complex_method.
8825 2021-09-27 Richard Biener <rguenther@suse.de>
8827 PR middle-end/102450
8828 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
8829 type_for_size, instead use int_mode_for_size.
8831 2021-09-27 Andrew Pinski <apinski@marvell.com>
8834 * gimplify.c (gimplify_save_expr): Return early
8835 if the type of val is error_mark_node.
8837 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8839 * tree-ssanames.c (ssa_name_has_boolean_range): Use
8842 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
8844 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
8845 vrp_visit_cond_stmt.
8846 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
8847 (cprop_into_stmt): Same.
8848 (dom_opt_dom_walker::optimize_stmt): Same.
8850 2021-09-27 Richard Biener <rguenther@suse.de>
8852 PR tree-optimization/97351
8853 PR tree-optimization/97352
8854 PR tree-optimization/82426
8855 * tree-vectorizer.h (dr_misalignment): Add vector type
8857 (aligned_access_p): Likewise.
8858 (known_alignment_for_access_p): Likewise.
8859 (vect_supportable_dr_alignment): Likewise.
8860 (vect_known_alignment_in_bytes): Likewise. Refactor.
8861 (DR_MISALIGNMENT): Remove.
8862 (vect_update_shared_vectype): Likewise.
8863 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
8864 a vector type with larger alignment requirement and apply
8865 the negative step adjustment here.
8866 (vect_calculate_target_alignment): Remove.
8867 (vect_compute_data_ref_alignment): Get explicit vector type
8868 argument, do not apply a negative step alignment adjustment
8870 (vect_slp_analyze_node_alignment): Re-analyze alignment
8871 when we re-visit the DR with a bigger desired alignment but
8872 keep more precise results from smaller alignments.
8873 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
8874 (vect_slp_analyze_node_operations_1): Do not update the
8875 shared vector type on stmts.
8876 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
8877 vector type of an SLP node to the representative stmt-info.
8878 (vect_transform_stmt): Likewise.
8880 2021-09-27 liuhongt <hongtao.liu@intel.com>
8883 2021-09-09 liuhongt <hongtao.liu@intel.com>
8886 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
8887 (reduc_plus_scal_v4sf): .. this, New define_expand.
8888 (reduc_plus_scal_v2df): .. and this, New define_expand.
8890 2021-09-26 liuhongt <hongtao.liu@intel.com>
8892 * doc/extend.texi (Half-Precision): Remove storage only
8893 description for _Float16 w/o avx512fp16.
8895 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
8897 * config/pru/constraints.md (Rrio): New constraint.
8898 * config/pru/predicates.md (regio_operand): New predicate.
8899 * config/pru/pru-pragma.c (pru_register_pragmas): Register
8900 the __regio_symbol address space.
8901 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
8902 * config/pru/pru.c (pru_symref2ioregno): New helper function.
8903 (pru_legitimate_address_p): Remove.
8904 (pru_addr_space_legitimate_address_p): Use the address space
8906 (pru_nongeneric_pointer_addrspace): New helper function.
8907 (pru_insert_attributes): New function to validate __regio_symbol
8909 (TARGET_INSERT_ATTRIBUTES): New macro.
8910 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
8911 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
8912 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
8913 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
8915 (*regio_nozext_writesi): New pattern to write to I/O registers.
8916 (*regio_zext_write_r30<EQS0:mode>): Ditto.
8917 * doc/extend.texi: Document the new PRU Named Address Space.
8919 2021-09-24 Patrick Palka <ppalka@redhat.com>
8923 * real.c (encode_ieee_double): Avoid unwanted sign extension.
8924 (encode_ieee_quad): Likewise.
8926 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
8928 PR rtl-optimization/102147
8929 * ira-build.c (ira_conflict_vector_profitable_p): Make
8930 profitability calculation independent of host compiler pointer and
8933 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
8935 * gimple-range-path.cc (path_range_query::path_range_query):
8936 Move debugging header...
8937 (path_range_query::precompute_ranges): ...here.
8938 (path_range_query::internal_range_of_expr): Do not call
8939 range_on_path_entry if NAME is defined in the current block.
8941 2021-09-24 Richard Biener <rguenther@suse.de>
8943 * cfghooks.c (verify_flow_info): Verify unallocated BB and
8944 edge flags are not set.
8946 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
8948 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
8950 (jt_path_registry::register_jump_thread): Call
8951 cancel_invalid_paths.
8952 * tree-ssa-threadupdate.h (class jt_path_registry): Add
8953 cancel_invalid_paths.
8955 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
8957 PR tree-optimization/102400
8958 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
8959 result_vdef to zero value.
8961 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
8963 PR tree-optimization/102451
8964 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
8967 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
8969 * config/i386/sse.md (cond_<insn><mode>): Extend to support
8971 (cond_mul<mode>): Likewise.
8972 (cond_div<mode>): Likewise.
8973 (cond_<code><mode>): Likewise.
8974 (cond_fma<mode>): Likewise.
8975 (cond_fms<mode>): Likewise.
8976 (cond_fnma<mode>): Likewise.
8977 (cond_fnms<mode>): Likewise.
8979 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
8981 PR tree-optimization/102463
8982 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
8983 there is no range-ops handler, don't look for a relation.
8985 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
8987 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
8988 non-executable_edge flag as parameter.
8989 * gimple-range-cache.h (ranger_cache): Adjust prototype.
8990 * gimple-range-gori.cc (gori_compute::gori_compute): Take
8991 non-executable_edge flag as parameter.
8992 (gori_compute::outgoing_edge_range_p): Check new flag.
8993 * gimple-range-gori.h (gori_compute): Adjust prototype.
8994 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
8995 (gimple_ranger::range_on_edge): Check new flag.
8996 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
8997 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
8998 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
8999 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
9000 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
9001 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
9002 (simplify_using_ranges::simplify_switch_using_ranges): Clear
9003 EDGE_EXECUTABLE like it originally did.
9004 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
9005 NON_EXECUTABLE flags.
9006 (simplify_using_ranges::simplify_using_ranges): Adjust.
9007 * vr-values.h (class simplify_using_ranges): Adjust.
9008 (simplify_using_ranges::set_range_query): Add non-executable flag param.
9010 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
9013 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
9014 zero-width bit fields and return indicator.
9015 (rs6000_discover_homogeneous_aggregate): Diagnose when the
9016 presence of a zero-width bit field changes parameter passing in
9019 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9021 * gimple-range-fold.cc (fold_using_range::range_of_phi):
9022 Remove dominator check.
9024 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9026 * gimple-range-path.cc (path_range_query::precompute_relations):
9027 Hoist edge calculations before using EDGE_SUCC.
9029 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
9031 * configure.ac: Fix --with-multilib-list description.
9032 * configure: Regenerate.
9034 2021-09-23 Richard Biener <rguenther@suse.de>
9036 PR tree-optimization/102448
9037 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
9038 Clear alignment info copied from DR_PTR_INFO.
9040 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9042 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
9044 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
9045 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
9046 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
9047 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
9048 (vcond<mode><mode>): New expander.
9049 (vcond<mode><sseintvecmodelower>): Likewise.
9050 (vcond<sseintvecmodelower><mode>): Likewise.
9051 (vcondu<mode><sseintvecmodelower>): Likewise.
9053 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9055 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
9057 (extendv4hf<mode>2): Likewise.
9058 (extendv2hfv2df2): Likewise.
9059 (trunc<mode><ssePHmodelower>2): Likewise.
9060 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
9061 (trunc<mode>v4hf2): ... this, and drop constraints.
9062 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
9063 (truncv2dfv2hf2): ... this, and likewise.
9065 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9067 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
9069 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
9071 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
9072 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
9073 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
9075 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9077 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
9078 (fixuns_trunchfhi2): Likewise.
9079 (*fixuns_trunchfsi2zext): New define_insn.
9080 * config/i386/sse.md (ssePHmodelower): New mode_attr.
9081 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
9082 New expander for same element vector fix_truncate.
9083 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
9084 Likewise for V4HF to V4SI/V4DI fix_truncate.
9085 (fix<fixunssuffix>_truncv2hfv2di2):
9086 Likeise for V2HF to V2DI fix_truncate.
9088 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
9090 * config/i386/i386.md (<code>hf3): New expander.
9092 2021-09-23 liuhongt <hongtao.liu@intel.com>
9094 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
9095 (VFH_SF_AVX512VL): Extend to handle HFmode.
9096 (VF_SF_AVX512VL): Deleted.
9098 2021-09-23 liuhongt <hongtao.liu@intel.com>
9100 * config/i386/i386.md (rinthf2): New expander.
9101 (nearbyinthf2): New expander.
9103 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
9105 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
9106 (class dom_jt_state): ...this and provide virtual overrides.
9107 (dom_jt_state::register_equiv): New.
9108 (class dom_jt_simplifier): Rename from
9109 dom_jump_threader_simplifier.
9110 (dom_jump_threader_simplifier::simplify): Rename...
9111 (dom_jt_simplifier::simplify): ...to this.
9112 (pass_dominator::execute): Use dom_jt_simplifier and
9114 * tree-ssa-threadedge.c (jump_threader::jump_threader):
9116 (jt_state::register_equivs_stmt): Abstract out...
9117 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
9119 (jump_threader::thread_around_empty_blocks): Update state.
9120 (jump_threader::thread_through_normal_block): Same.
9121 (jt_state::jt_state): Remove.
9122 (jt_state::push): Remove pass specific bits. Keep block vector
9124 (jt_state::append_path): New.
9125 (jt_state::pop): Remove pass specific bits.
9126 (jt_state::register_equiv): Same.
9127 (jt_state::record_ranges_from_stmt): Same.
9128 (jt_state::register_equivs_on_edge): Same. Rename...
9129 (jt_state::register_equivs_edge): ...to this.
9130 (jt_state::dump): New.
9131 (jt_state::debug): New.
9132 (jump_threader_simplifier::simplify): Remove.
9133 (jt_state::get_path): New.
9134 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
9135 class. Expose common functionality as virtual methods.
9136 (class jump_threader_simplifier): Same. Rename...
9137 (class jt_simplifier): ...to this.
9138 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
9139 (class vrp_jt_simplifier): ...to this. Provide pass specific
9141 (class vrp_jt_state): New.
9142 (vrp_jump_threader_simplifier::simplify): Rename...
9143 (vrp_jt_simplifier::simplify): ...to this. Inline code from
9144 what used to be the base class.
9145 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
9148 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
9151 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
9154 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
9155 Richard Biener <rguenther@suse.de>
9157 * match.pd (negation simplifications): Implement some negation
9158 folding transformations from fold-const.c's fold_negate_expr.
9159 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
9160 argument, to control whether the op should be simplified prior
9161 to looking up/assigning a value number.
9162 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
9163 (vn_nary_simplify): Likewise.
9164 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
9165 now call vn_nary_build_or_lookup_1 disabling simplification.
9167 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
9169 PR tree-optimization/102087
9170 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
9171 Update bound/cmp/control for niter.
9173 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
9175 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
9176 Move check for non-empty BB here.
9177 (fur_source::register_outgoing_edges): ...from here.
9179 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
9181 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
9182 Remove call to improve_range_with_equivs.
9183 (path_range_query::improve_range_with_equivs): Remove
9184 * gimple-range-path.h: Remove improve_range_with_equivs.
9186 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9188 * config/i386/avx512fp16intrin.h:
9189 (_mm512_mask_blend_ph): New intrinsic.
9190 (_mm512_permutex2var_ph): Ditto.
9191 (_mm512_permutexvar_ph): Ditto.
9192 * config/i386/avx512fp16vlintrin.h:
9193 (_mm256_mask_blend_ph): New intrinsic.
9194 (_mm256_permutex2var_ph): Ditto.
9195 (_mm256_permutexvar_ph): Ditto.
9196 (_mm_mask_blend_ph): Ditto.
9197 (_mm_permutex2var_ph): Ditto.
9198 (_mm_permutexvar_ph): Ditto.
9200 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9202 * config/i386/avx512fp16intrin.h: Add new intrinsics.
9203 (_mm512_conj_pch): New intrinsic.
9204 (_mm512_mask_conj_pch): Ditto.
9205 (_mm512_maskz_conj_pch): Ditto.
9206 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
9207 (_mm256_conj_pch): New intrinsic.
9208 (_mm256_mask_conj_pch): Ditto.
9209 (_mm256_maskz_conj_pch): Ditto.
9210 (_mm_conj_pch): Ditto.
9211 (_mm_mask_conj_pch): Ditto.
9212 (_mm_maskz_conj_pch): Ditto.
9214 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9216 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
9217 (_mm512_reduce_add_ph): New intrinsic.
9218 (_mm512_reduce_mul_ph): Ditto.
9219 (_mm512_reduce_min_ph): Ditto.
9220 (_mm512_reduce_max_ph): Ditto.
9221 * config/i386/avx512fp16vlintrin.h
9222 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
9223 (_mm256_reduce_add_ph): New intrinsic.
9224 (_mm256_reduce_mul_ph): Ditto.
9225 (_mm256_reduce_min_ph): Ditto.
9226 (_mm256_reduce_max_ph): Ditto.
9227 (_mm_reduce_add_ph): Ditto.
9228 (_mm_reduce_mul_ph): Ditto.
9229 (_mm_reduce_min_ph): Ditto.
9230 (_mm_reduce_max_ph): Ditto.
9232 2021-09-22 dianhong xu <dianhong.xu@intel.com>
9234 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
9235 __m128h_u): New typedef.
9236 (_mm512_load_ph): New intrinsic.
9237 (_mm256_load_ph): Ditto.
9238 (_mm_load_ph): Ditto.
9239 (_mm512_loadu_ph): Ditto.
9240 (_mm256_loadu_ph): Ditto.
9241 (_mm_loadu_ph): Ditto.
9242 (_mm512_store_ph): Ditto.
9243 (_mm256_store_ph): Ditto.
9244 (_mm_store_ph): Ditto.
9245 (_mm512_storeu_ph): Ditto.
9246 (_mm256_storeu_ph): Ditto.
9247 (_mm_storeu_ph): Ditto.
9248 (_mm512_abs_ph): Ditto.
9249 * config/i386/avx512fp16vlintrin.h
9250 (_mm_abs_ph): Ditto.
9251 (_mm256_abs_ph): Ditto.
9253 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
9255 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
9257 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
9260 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
9261 is actually a full copy of the source operand into the target.
9262 Don't emit a strict low part move if source and target mode match.
9264 2021-09-22 Jakub Jelinek <jakub@redhat.com>
9266 PR middle-end/102415
9267 * omp-expand.c (expand_omp_single): If region->exit is NULL,
9268 assert region->entry is GIMPLE_OMP_SCOPE region and return.
9270 2021-09-22 Jakub Jelinek <jakub@redhat.com>
9272 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
9273 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
9274 arguments from 2 to 3.
9275 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
9276 allocate clause allocator and print align if present.
9277 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
9278 for omp_default_mem_alloc if align modifier is present. If align
9279 modifier is present, use TREE_LIST to encode both allocator and
9281 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
9282 Handle align modifier on allocator clause if present.
9284 2021-09-22 liuhongt <hongtao.liu@intel.com>
9286 * config/i386/i386.md (define_attr "isa"): Add
9288 (define_attr "enabled"): Correspond fma_or_avx512vl to
9289 TARGET_FMA || TARGET_AVX512VL.
9290 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
9295 2021-09-22 liuhongt <hongtao.liu@intel.com>
9297 * config/i386/i386.md (cstorehf3): New define_expand.
9299 2021-09-22 liuhongt <hongtao.liu@intel.com>
9301 * config/i386/i386.md (<rounding_insn>hf2): New expander.
9302 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
9303 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
9304 Extend from VF_128 to VFH_128.
9306 2021-09-22 liuhongt <hongtao.liu@intel.com>
9308 * config/i386/i386-features.c (i386-features.c): Handle
9310 * config/i386/i386.md (sqrthf2): New expander.
9311 (*sqrthf2): New define_insn.
9312 * config/i386/sse.md
9313 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
9316 2021-09-22 liuhongt <hongtao.liu@intel.com>
9318 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
9320 (_mm_mask3_fcmadd_sch): Likewise.
9321 (_mm_maskz_fcmadd_sch): Likewise.
9322 (_mm_fcmadd_sch): Likewise.
9323 (_mm_mask_fmadd_sch): Likewise.
9324 (_mm_mask3_fmadd_sch): Likewise.
9325 (_mm_maskz_fmadd_sch): Likewise.
9326 (_mm_fmadd_sch): Likewise.
9327 (_mm_mask_fcmadd_round_sch): Likewise.
9328 (_mm_mask3_fcmadd_round_sch): Likewise.
9329 (_mm_maskz_fcmadd_round_sch): Likewise.
9330 (_mm_fcmadd_round_sch): Likewise.
9331 (_mm_mask_fmadd_round_sch): Likewise.
9332 (_mm_mask3_fmadd_round_sch): Likewise.
9333 (_mm_maskz_fmadd_round_sch): Likewise.
9334 (_mm_fmadd_round_sch): Likewise.
9335 (_mm_fcmul_sch): Likewise.
9336 (_mm_mask_fcmul_sch): Likewise.
9337 (_mm_maskz_fcmul_sch): Likewise.
9338 (_mm_fmul_sch): Likewise.
9339 (_mm_mask_fmul_sch): Likewise.
9340 (_mm_maskz_fmul_sch): Likewise.
9341 (_mm_fcmul_round_sch): Likewise.
9342 (_mm_mask_fcmul_round_sch): Likewise.
9343 (_mm_maskz_fcmul_round_sch): Likewise.
9344 (_mm_fmul_round_sch): Likewise.
9345 (_mm_mask_fmul_round_sch): Likewise.
9346 (_mm_maskz_fmul_round_sch): Likewise.
9347 * config/i386/i386-builtin.def: Add corresponding new builtins.
9348 * config/i386/sse.md
9349 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
9350 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
9351 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
9353 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
9354 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
9356 * config/i386/subst.md (mask_scalarcz_name): New.
9357 (mask_scalarc_name): Ditto.
9358 (mask_scalarc_operand3): Ditto.
9359 (mask_scalarcz_operand4): Ditto.
9360 (round_scalarcz_name): Ditto.
9361 (round_scalarc_mask_operand3): Ditto.
9362 (round_scalarcz_mask_operand4): Ditto.
9363 (round_scalarc_mask_op3): Ditto.
9364 (round_scalarcz_mask_op4): Ditto.
9365 (round_scalarcz_constraint): Ditto.
9366 (round_scalarcz_nimm_predicate): Ditto.
9367 (mask_scalarcz): Ditto.
9368 (mask_scalarc): Ditto.
9369 (round_scalarcz): Ditto.
9371 2021-09-22 liuhongt <hongtao.liu@intel.com>
9373 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
9375 (_mm512_mask_fcmadd_pch): Likewise.
9376 (_mm512_mask3_fcmadd_pch): Likewise.
9377 (_mm512_maskz_fcmadd_pch): Likewise.
9378 (_mm512_fmadd_pch): Likewise.
9379 (_mm512_mask_fmadd_pch): Likewise.
9380 (_mm512_mask3_fmadd_pch): Likewise.
9381 (_mm512_maskz_fmadd_pch): Likewise.
9382 (_mm512_fcmadd_round_pch): Likewise.
9383 (_mm512_mask_fcmadd_round_pch): Likewise.
9384 (_mm512_mask3_fcmadd_round_pch): Likewise.
9385 (_mm512_maskz_fcmadd_round_pch): Likewise.
9386 (_mm512_fmadd_round_pch): Likewise.
9387 (_mm512_mask_fmadd_round_pch): Likewise.
9388 (_mm512_mask3_fmadd_round_pch): Likewise.
9389 (_mm512_maskz_fmadd_round_pch): Likewise.
9390 (_mm512_fcmul_pch): Likewise.
9391 (_mm512_mask_fcmul_pch): Likewise.
9392 (_mm512_maskz_fcmul_pch): Likewise.
9393 (_mm512_fmul_pch): Likewise.
9394 (_mm512_mask_fmul_pch): Likewise.
9395 (_mm512_maskz_fmul_pch): Likewise.
9396 (_mm512_fcmul_round_pch): Likewise.
9397 (_mm512_mask_fcmul_round_pch): Likewise.
9398 (_mm512_maskz_fcmul_round_pch): Likewise.
9399 (_mm512_fmul_round_pch): Likewise.
9400 (_mm512_mask_fmul_round_pch): Likewise.
9401 (_mm512_maskz_fmul_round_pch): Likewise.
9402 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
9404 (_mm_mask_fmadd_pch): Likewise.
9405 (_mm_mask3_fmadd_pch): Likewise.
9406 (_mm_maskz_fmadd_pch): Likewise.
9407 (_mm256_fmadd_pch): Likewise.
9408 (_mm256_mask_fmadd_pch): Likewise.
9409 (_mm256_mask3_fmadd_pch): Likewise.
9410 (_mm256_maskz_fmadd_pch): Likewise.
9411 (_mm_fcmadd_pch): Likewise.
9412 (_mm_mask_fcmadd_pch): Likewise.
9413 (_mm_mask3_fcmadd_pch): Likewise.
9414 (_mm_maskz_fcmadd_pch): Likewise.
9415 (_mm256_fcmadd_pch): Likewise.
9416 (_mm256_mask_fcmadd_pch): Likewise.
9417 (_mm256_mask3_fcmadd_pch): Likewise.
9418 (_mm256_maskz_fcmadd_pch): Likewise.
9419 (_mm_fmul_pch): Likewise.
9420 (_mm_mask_fmul_pch): Likewise.
9421 (_mm_maskz_fmul_pch): Likewise.
9422 (_mm256_fmul_pch): Likewise.
9423 (_mm256_mask_fmul_pch): Likewise.
9424 (_mm256_maskz_fmul_pch): Likewise.
9425 (_mm_fcmul_pch): Likewise.
9426 (_mm_mask_fcmul_pch): Likewise.
9427 (_mm_maskz_fcmul_pch): Likewise.
9428 (_mm256_fcmul_pch): Likewise.
9429 (_mm256_mask_fcmul_pch): Likewise.
9430 (_mm256_maskz_fcmul_pch): Likewise.
9431 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
9432 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
9433 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
9434 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
9435 * config/i386/i386-builtin.def: Add new builtins.
9436 * config/i386/i386-expand.c: Handle new builtin types.
9437 * config/i386/subst.md (SUBST_CV): New.
9438 (maskc_name): Ditto.
9439 (maskc_operand3): Ditto.
9441 (sdc_maskz_name): Ditto.
9442 (sdc_mask_op4): Ditto.
9443 (sdc_mask_op5): Ditto.
9444 (sdc_mask_mode512bit_condition): Ditto.
9446 (round_maskc_operand3): Ditto.
9447 (round_sdc_mask_operand4): Ditto.
9448 (round_maskc_op3): Ditto.
9449 (round_sdc_mask_op4): Ditto.
9450 (round_saeonly_sdc_mask_operand5): Ditto.
9451 * config/i386/sse.md (unspec): Add complex fma unspecs.
9452 (avx512fmaskcmode): New.
9453 (UNSPEC_COMPLEX_F_C_MA): Ditto.
9454 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
9455 (complexopname): Ditto.
9456 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
9457 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
9458 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
9460 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
9461 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
9463 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
9465 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
9466 rs6000-density-size-threshold, rs6000-density-penalty,
9467 rs6000-density-load-pct-threshold,
9468 rs6000-density-load-num-threshold): New parameter.
9469 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
9470 corresponding parameters.
9472 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9474 * gimple-range-path.cc (path_range_query::defined_outside_path):
9476 (path_range_query::range_on_path_entry): New.
9477 (path_range_query::internal_range_of_expr): Resolve unknowns
9479 (path_range_query::improve_range_with_equivs): New.
9480 (path_range_query::ssa_range_in_phi): Resolve unknowns with
9482 * gimple-range-path.h (class path_range_query): Add
9483 defined_outside_path, range_on_path_entry, and
9484 improve_range_with_equivs.
9486 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9488 * gimple-range-path.cc (path_range_query::add_to_imports): New.
9489 (path_range_query::add_copies_to_imports): New.
9490 (path_range_query::precompute_ranges): Call
9491 add_copies_to_imports.
9492 * gimple-range-path.h (class path_range_query): Add prototypes
9493 for add_copies_to_imports and add_to_imports.
9495 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9497 * gimple-range-path.cc (path_range_query::range_defined_in_block):
9498 Remove useless code.
9500 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9502 * gimple-range-fold.h (class fur_source): Make oracle protected.
9503 * gimple-range-path.cc (path_range_query::path_range_query): Add
9504 resolve argument. Initialize oracle.
9505 (path_range_query::~path_range_query): Delete oracle.
9506 (path_range_query::range_of_stmt): Adapt to use relations.
9507 (path_range_query::precompute_ranges): Pre-compute relations.
9508 (class jt_fur_source): New
9509 (jt_fur_source::jt_fur_source): New.
9510 (jt_fur_source::register_relation): New.
9511 (jt_fur_source::query_relation): New.
9512 (path_range_query::precompute_relations): New.
9513 (path_range_query::precompute_phi_relations): New.
9514 * gimple-range-path.h (path_range_query): Add resolve argument.
9515 Add oracle, precompute_relations, precompute_phi_relations.
9516 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
9517 resolve argument to solver.
9519 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9521 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
9522 Rename postfold_gcond_edges to register_outgoing_edges and
9524 (fold_using_range::postfold_gcond_edges): Rename...
9525 (fur_source::register_outgoing_edges): ...to this.
9526 * gimple-range-fold.h (postfold_gcond_edges): Rename to
9527 register_outgoing_edges and move to fur_source.
9529 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9531 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
9532 dom_info_available_p.
9534 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
9536 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
9537 and quick_grow_cleared instead of safe_grow_cleared.
9539 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
9542 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
9545 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
9547 * configure.ac: Detect when the assembler supports new-style
9548 architecture extensions.
9549 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
9550 the full CPU string if the assembler can grok it.
9551 (arm_rewrite_march): Likewise but for the architecture.
9552 * config.in: Regenerate.
9553 * configure: Regenerate.
9555 2021-09-21 Richard Biener <rguenther@suse.de>
9557 PR tree-optimization/102421
9558 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
9559 adjust alignment info.
9561 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
9563 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
9564 bitfield stream out.
9566 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
9568 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
9569 undefined edges, apply an equivalence if appropriate.
9570 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
9571 UNDEFINED if EDGE_EXECUTABLE is not set.
9572 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
9573 as EXECUTABLE upon startup.
9574 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
9575 EDGE_EXECUTABLE set.
9576 * vr-values.c (set_and_propagate_unexecutable): New.
9577 (simplify_using_ranges::fold_cond): Call set_and_propagate.
9578 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
9579 * vr-values.h: Add prototype.
9581 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
9583 * value-relation.cc (equiv_oracle::register_initial_def): New.
9584 (equiv_oracle::register_relation): Call register_initial_def.
9585 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
9586 (relation_oracle::register_stmt): Check def block of PHI arguments.
9587 * value-relation.h (equiv_oracle): Add new prototypes.
9589 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
9591 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
9592 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
9593 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
9594 __ROUNDING_MATH__ according to their corresponding flags.
9595 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
9596 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
9597 and __ROUNDING_MATH__.
9599 2021-09-20 Richard Biener <rguenther@suse.de>
9601 * tree-vect-stmts.c (vectorizable_load): Use the vectype
9604 2021-09-20 Richard Biener <rguenther@suse.de>
9606 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
9607 Do not compute alignment of the vectorized access here.
9609 2021-09-20 Richard Biener <rguenther@suse.de>
9611 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
9612 Store -1 for runtime alias peeling iterations.
9614 2021-09-20 Richard Biener <rguenther@suse.de>
9616 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
9618 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
9620 * input.c (string_concat_db::record_string_concatenation)
9621 (string_concat_db::get_string_concatenation): Skip for
9622 'RESERVED_LOCATION_P'.
9624 2021-09-20 Richard Biener <rguenther@suse.de>
9626 PR tree-optimization/65206
9627 * tree-data-ref.h (struct data_reference): Add alt_indices,
9629 * tree-data-ref.c (free_data_ref): Release alt_indices.
9630 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
9631 (create_data_ref): Adjust.
9632 (initialize_data_dependence_relation): Split into head
9633 and tail. When the base objects fail to match up try
9634 again with pointer-based analysis of indices.
9635 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
9636 not compare the lazily computed alternate set of indices.
9638 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
9640 * gcc.c: Test for execute OK when we find the
9641 programs for assembler linker and dsymutil and those
9642 were specified at configure-time.
9644 2021-09-19 Martin Sebor <msebor@redhat.com>
9646 PR middle-end/102403
9647 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
9648 Correct a function pre/postcondition.
9650 2021-09-19 Martin Sebor <msebor@redhat.com>
9652 PR middle-end/102243
9653 * tree-ssa-strlen.c (get_range): Handle null cfun.
9655 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
9657 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
9658 unwinder shim as a convenience library.
9660 2021-09-19 Andrew Pinski <apinski@marvell.com>
9662 * doc/install.texi: Add note about
9663 binutils 2.35 is required for LTO usage.
9665 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
9667 * tree-ssa-threadbackward.c
9668 (back_threader_registry::register_path): Use push_edge.
9669 * tree-ssa-threadedge.c
9670 (jump_threader::thread_around_empty_blocks): Same.
9671 (jump_threader::thread_through_normal_block): Same.
9672 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
9674 * tree-ssa-threadupdate.c
9675 (jt_path_registry::allocate_thread_edge): Remove.
9676 (jt_path_registry::push_edge): New.
9677 (dump_jump_thread_path): Make static.
9678 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
9681 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
9683 * gimple-range-path.cc (path_range_query::path_range_query): Add
9685 (path_range_query::dump): Remove extern declaration of dump_ranger.
9686 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
9687 * gimple-range-trace.h (dump_ranger): Add prototype.
9689 2021-09-19 John Ericson <git@JohnEricson.me>
9691 * gcc.c (find_a_program): New function, factored out of...
9692 (find_a_file): Here.
9693 (execute): Use find_a_program when looking for programs rather
9696 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
9698 * config/avr/avr-mcus.def: Add atmega324pb.
9699 * doc/avr-mmcu.texi: Corresponding changes.
9701 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
9704 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
9705 the same transformations as to @1. For comparisons against NaN,
9706 don't check HONOR_SNANS but confirm that neither operand is a
9709 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
9711 * attribs.c (make_unique_name): Delete.
9712 * attribs.h (make_unique_name): Delete.
9714 2021-09-19 Andrew Pinski <apinski@marvell.com>
9716 * lra-constraints.c (check_and_process_move): Assert
9717 that dclass and sclass are greater than or equal to NO_REGS.
9719 2021-09-18 Jakub Jelinek <jakub@redhat.com>
9721 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
9722 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
9723 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
9725 2021-09-18 liuhongt <hongtao.liu@intel.com>
9727 * config/i386/i386-features.c (remove_partial_avx_dependency):
9728 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
9729 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
9731 2021-09-18 Jakub Jelinek <jakub@redhat.com>
9733 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
9734 if file/namespace scope variable doesn't have predetermined sharing,
9735 treat it as if there was default(none).
9737 2021-09-18 liuhongt <hongtao.liu@intel.com>
9739 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
9741 (_mm_mask_fmadd_sh): Likewise.
9742 (_mm_mask3_fmadd_sh): Likewise.
9743 (_mm_maskz_fmadd_sh): Likewise.
9744 (_mm_fmadd_round_sh): Likewise.
9745 (_mm_mask_fmadd_round_sh): Likewise.
9746 (_mm_mask3_fmadd_round_sh): Likewise.
9747 (_mm_maskz_fmadd_round_sh): Likewise.
9748 (_mm_fnmadd_sh): Likewise.
9749 (_mm_mask_fnmadd_sh): Likewise.
9750 (_mm_mask3_fnmadd_sh): Likewise.
9751 (_mm_maskz_fnmadd_sh): Likewise.
9752 (_mm_fnmadd_round_sh): Likewise.
9753 (_mm_mask_fnmadd_round_sh): Likewise.
9754 (_mm_mask3_fnmadd_round_sh): Likewise.
9755 (_mm_maskz_fnmadd_round_sh): Likewise.
9756 (_mm_fmsub_sh): Likewise.
9757 (_mm_mask_fmsub_sh): Likewise.
9758 (_mm_mask3_fmsub_sh): Likewise.
9759 (_mm_maskz_fmsub_sh): Likewise.
9760 (_mm_fmsub_round_sh): Likewise.
9761 (_mm_mask_fmsub_round_sh): Likewise.
9762 (_mm_mask3_fmsub_round_sh): Likewise.
9763 (_mm_maskz_fmsub_round_sh): Likewise.
9764 (_mm_fnmsub_sh): Likewise.
9765 (_mm_mask_fnmsub_sh): Likewise.
9766 (_mm_mask3_fnmsub_sh): Likewise.
9767 (_mm_maskz_fnmsub_sh): Likewise.
9768 (_mm_fnmsub_round_sh): Likewise.
9769 (_mm_mask_fnmsub_round_sh): Likewise.
9770 (_mm_mask3_fnmsub_round_sh): Likewise.
9771 (_mm_maskz_fnmsub_round_sh): Likewise.
9772 * config/i386/i386-builtin-types.def
9773 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
9774 * config/i386/i386-builtin.def: Add new builtins.
9775 * config/i386/i386-expand.c: Handle new builtin type.
9776 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
9777 Ajdust to support FP16.
9778 (fmai_vmfmsub_<mode><round_name>): Ditto.
9779 (fmai_vmfnmadd_<mode><round_name>): Ditto.
9780 (fmai_vmfnmsub_<mode><round_name>): Ditto.
9781 (*fmai_fmadd_<mode>): Ditto.
9782 (*fmai_fmsub_<mode>): Ditto.
9783 (*fmai_fnmadd_<mode><round_name>): Ditto.
9784 (*fmai_fnmsub_<mode><round_name>): Ditto.
9785 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
9786 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
9787 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
9788 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
9789 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
9790 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
9791 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
9792 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
9793 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
9794 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
9795 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
9796 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
9797 adjust to support FP16.
9798 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
9799 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
9800 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
9803 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
9805 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
9807 (maskload<mode><avx512fmaskmodelower>): Ditto.
9808 (maskstore<mode><avx512fmaskmodelower>): Ditto.
9810 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
9812 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
9814 (ix86_expand_copysign): Ditto.
9815 (ix86_expand_xorsign): Ditto.
9816 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
9818 (ix86_build_signbit_mask): Ditto.
9819 (ix86_can_change_mode_class): Ditto.
9820 * config/i386/i386.md
9821 (SSEMODEF): Add HFmode.
9822 (ssevecmodef): Ditto.
9823 (<code>hf2): New define_expand.
9824 (*<code>hf2_1): New define_insn_and_split.
9825 (copysign<mode>): Extend to support HFmode under AVX512FP16.
9826 (xorsign<mode>): Ditto.
9827 * config/i386/sse.md (VFB): New mode iterator.
9828 (VFB_128_256): Ditto.
9830 (sseintvecmode2): Support HF vector mode.
9831 (<code><mode>2): Use new mode iterator.
9832 (*<code><mode>2): Ditto.
9833 (copysign<mode>3): Ditto.
9834 (xorsign<mode>3): Ditto.
9835 (<code><mode>3<mask_name>): Ditto.
9836 (<code><mode>3<mask_name>): Ditto.
9837 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
9838 (<sse>_andnot<mode>3<mask_name>): Ditto.
9839 (*<code><mode>3<mask_name>): Ditto.
9840 (*<code><mode>3<mask_name>): Ditto.
9842 2021-09-18 liuhongt <hongtao.liu@intel.com>
9844 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
9846 (_mm512_mask3_fmadd_ph): Likewise.
9847 (_mm512_maskz_fmadd_ph): Likewise.
9848 (_mm512_fmadd_round_ph): Likewise.
9849 (_mm512_mask_fmadd_round_ph): Likewise.
9850 (_mm512_mask3_fmadd_round_ph): Likewise.
9851 (_mm512_maskz_fmadd_round_ph): Likewise.
9852 (_mm512_fnmadd_ph): Likewise.
9853 (_mm512_mask_fnmadd_ph): Likewise.
9854 (_mm512_mask3_fnmadd_ph): Likewise.
9855 (_mm512_maskz_fnmadd_ph): Likewise.
9856 (_mm512_fnmadd_round_ph): Likewise.
9857 (_mm512_mask_fnmadd_round_ph): Likewise.
9858 (_mm512_mask3_fnmadd_round_ph): Likewise.
9859 (_mm512_maskz_fnmadd_round_ph): Likewise.
9860 (_mm512_fmsub_ph): Likewise.
9861 (_mm512_mask_fmsub_ph): Likewise.
9862 (_mm512_mask3_fmsub_ph): Likewise.
9863 (_mm512_maskz_fmsub_ph): Likewise.
9864 (_mm512_fmsub_round_ph): Likewise.
9865 (_mm512_mask_fmsub_round_ph): Likewise.
9866 (_mm512_mask3_fmsub_round_ph): Likewise.
9867 (_mm512_maskz_fmsub_round_ph): Likewise.
9868 (_mm512_fnmsub_ph): Likewise.
9869 (_mm512_mask_fnmsub_ph): Likewise.
9870 (_mm512_mask3_fnmsub_ph): Likewise.
9871 (_mm512_maskz_fnmsub_ph): Likewise.
9872 (_mm512_fnmsub_round_ph): Likewise.
9873 (_mm512_mask_fnmsub_round_ph): Likewise.
9874 (_mm512_mask3_fnmsub_round_ph): Likewise.
9875 (_mm512_maskz_fnmsub_round_ph): Likewise.
9876 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
9878 (_mm256_mask_fmadd_ph): Likewise.
9879 (_mm256_mask3_fmadd_ph): Likewise.
9880 (_mm256_maskz_fmadd_ph): Likewise.
9881 (_mm_fmadd_ph): Likewise.
9882 (_mm_mask_fmadd_ph): Likewise.
9883 (_mm_mask3_fmadd_ph): Likewise.
9884 (_mm_maskz_fmadd_ph): Likewise.
9885 (_mm256_fnmadd_ph): Likewise.
9886 (_mm256_mask_fnmadd_ph): Likewise.
9887 (_mm256_mask3_fnmadd_ph): Likewise.
9888 (_mm256_maskz_fnmadd_ph): Likewise.
9889 (_mm_fnmadd_ph): Likewise.
9890 (_mm_mask_fnmadd_ph): Likewise.
9891 (_mm_mask3_fnmadd_ph): Likewise.
9892 (_mm_maskz_fnmadd_ph): Likewise.
9893 (_mm256_fmsub_ph): Likewise.
9894 (_mm256_mask_fmsub_ph): Likewise.
9895 (_mm256_mask3_fmsub_ph): Likewise.
9896 (_mm256_maskz_fmsub_ph): Likewise.
9897 (_mm_fmsub_ph): Likewise.
9898 (_mm_mask_fmsub_ph): Likewise.
9899 (_mm_mask3_fmsub_ph): Likewise.
9900 (_mm_maskz_fmsub_ph): Likewise.
9901 (_mm256_fnmsub_ph): Likewise.
9902 (_mm256_mask_fnmsub_ph): Likewise.
9903 (_mm256_mask3_fnmsub_ph): Likewise.
9904 (_mm256_maskz_fnmsub_ph): Likewise.
9905 (_mm_fnmsub_ph): Likewise.
9906 (_mm_mask_fnmsub_ph): Likewise.
9907 (_mm_mask3_fnmsub_ph): Likewise.
9908 (_mm_maskz_fnmsub_ph): Likewise.
9909 * config/i386/i386-builtin.def: Add corresponding new builtins.
9910 * config/i386/sse.md
9911 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
9912 support HF vector modes.
9913 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
9915 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
9916 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
9917 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
9918 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
9919 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
9920 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
9921 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
9923 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
9924 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
9925 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
9926 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
9927 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
9928 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
9930 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
9931 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
9932 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
9933 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
9934 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
9935 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
9936 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
9938 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
9939 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
9940 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
9941 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
9942 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
9944 2021-09-18 liuhongt <hongtao.liu@intel.com>
9946 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
9948 (_mm512_mask_fmaddsub_ph): Likewise.
9949 (_mm512_mask3_fmaddsub_ph): Likewise.
9950 (_mm512_maskz_fmaddsub_ph): Likewise.
9951 (_mm512_fmaddsub_round_ph): Likewise.
9952 (_mm512_mask_fmaddsub_round_ph): Likewise.
9953 (_mm512_mask3_fmaddsub_round_ph): Likewise.
9954 (_mm512_maskz_fmaddsub_round_ph): Likewise.
9955 (_mm512_mask_fmsubadd_ph): Likewise.
9956 (_mm512_mask3_fmsubadd_ph): Likewise.
9957 (_mm512_maskz_fmsubadd_ph): Likewise.
9958 (_mm512_fmsubadd_round_ph): Likewise.
9959 (_mm512_mask_fmsubadd_round_ph): Likewise.
9960 (_mm512_mask3_fmsubadd_round_ph): Likewise.
9961 (_mm512_maskz_fmsubadd_round_ph): Likewise.
9962 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
9964 (_mm256_mask_fmaddsub_ph): Likewise.
9965 (_mm256_mask3_fmaddsub_ph): Likewise.
9966 (_mm256_maskz_fmaddsub_ph): Likewise.
9967 (_mm_fmaddsub_ph): Likewise.
9968 (_mm_mask_fmaddsub_ph): Likewise.
9969 (_mm_mask3_fmaddsub_ph): Likewise.
9970 (_mm_maskz_fmaddsub_ph): Likewise.
9971 (_mm256_fmsubadd_ph): Likewise.
9972 (_mm256_mask_fmsubadd_ph): Likewise.
9973 (_mm256_mask3_fmsubadd_ph): Likewise.
9974 (_mm256_maskz_fmsubadd_ph): Likewise.
9975 (_mm_fmsubadd_ph): Likewise.
9976 (_mm_mask_fmsubadd_ph): Likewise.
9977 (_mm_mask3_fmsubadd_ph): Likewise.
9978 (_mm_maskz_fmsubadd_ph): Likewise.
9979 * config/i386/i386-builtin.def: Add corresponding new builtins.
9980 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
9981 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
9982 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
9984 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
9986 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
9987 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
9988 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
9990 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
9991 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
9993 2021-09-18 liuhongt <hongtao.liu@intel.com>
9996 * config/i386/i386.c (ix86_print_operand): Handle
9997 V8HF/V16HF/V32HFmode.
9998 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
9999 * config/i386/sse.md (avx512bcst): Remove.
10001 2021-09-17 Martin Sebor <msebor@redhat.com>
10003 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
10004 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
10005 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
10006 (check_defs): Add comment.
10007 (can_skip_redundant_opnd): Update comment.
10008 (compute_uninit_opnds_pos): Adjust to namespace change.
10009 (find_pdom): Move to gimple-predicate-analysis.cc.
10011 (struct uninit_undef_val_t): New.
10012 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
10013 (find_control_equiv_block): Same.
10014 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
10015 (MAX_SWITCH_CASES): Same.
10016 (compute_control_dep_chain): Same.
10017 (find_uninit_use): Use predicate analyzer.
10018 (struct pred_info): Move to gimple-predicate-analysis.
10019 (convert_control_dep_chain_into_preds): Same.
10020 (find_predicates): Same.
10021 (collect_phi_def_edges): Same.
10022 (warn_uninitialized_phi): Use predicate analyzer.
10023 (find_def_preds): Move to gimple-predicate-analysis.
10024 (dump_pred_info): Same.
10025 (dump_pred_chain): Same.
10026 (dump_predicates): Same.
10027 (destroy_predicate_vecs): Remove.
10028 (execute_late_warn_uninitialized): New.
10029 (get_cmp_code): Move to gimple-predicate-analysis.
10030 (is_value_included_in): Same.
10031 (value_sat_pred_p): Same.
10032 (find_matching_predicate_in_rest_chains): Same.
10033 (is_use_properly_guarded): Same.
10034 (prune_uninit_phi_opnds): Same.
10035 (find_var_cmp_const): Same.
10036 (use_pred_not_overlap_with_undef_path_pred): Same.
10037 (pred_equal_p): Same.
10038 (is_neq_relop_p): Same.
10039 (is_neq_zero_form_p): Same.
10040 (pred_expr_equal_p): Same.
10041 (is_pred_expr_subset_of): Same.
10042 (is_pred_chain_subset_of): Same.
10043 (is_included_in): Same.
10044 (is_superset_of): Same.
10045 (pred_neg_p): Same.
10046 (simplify_pred): Same.
10047 (simplify_preds_2): Same.
10048 (simplify_preds_3): Same.
10049 (simplify_preds_4): Same.
10050 (simplify_preds): Same.
10052 (push_to_worklist): Same.
10053 (get_pred_info_from_cmp): Same.
10054 (is_degenerated_phi): Same.
10055 (normalize_one_pred_1): Same.
10056 (normalize_one_pred): Same.
10057 (normalize_one_pred_chain): Same.
10058 (normalize_preds): Same.
10059 (can_one_predicate_be_invalidated_p): Same.
10060 (can_chain_union_be_invalidated_p): Same.
10061 (uninit_uses_cannot_happen): Same.
10062 (pass_late_warn_uninitialized::execute): Define.
10063 * gimple-predicate-analysis.cc: New file.
10064 * gimple-predicate-analysis.h: New file.
10066 2021-09-17 Julian Brown <julian@codesourcery.com>
10068 * config/gcn/gcn.c (gimple.h): Include.
10069 (gcn_fork_join): Emit barrier for worker-level joins.
10070 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
10071 writes_gang_private bitmap parameter. Set bit for blocks
10072 containing gang-private variable writes.
10073 (worker_single_simple): Don't emit barrier after predicated block.
10074 (worker_single_copy): Don't emit barrier if we're not broadcasting
10075 anything and the block contains no gang-private writes.
10076 (neuter_worker_single): Don't predicate blocks that only contain
10077 NOPs or internal marker functions. Pass has_gang_private_write
10078 argument to worker_single_copy.
10079 (oacc_do_neutering): Add writes_gang_private bitmap handling.
10081 2021-09-17 Julian Brown <julian@codesourcery.com>
10083 * config/gcn/gcn-protos.h
10084 (gcn_goacc_create_worker_broadcast_record): Update prototype.
10085 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
10086 preallocated block of LDS memory. Do not cache/share decls for
10087 reduction temporaries between invocations.
10088 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
10089 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
10090 and return temporary LDS space at that offset. Return pointer in
10092 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
10094 (ACC_LDS_SIZE): Define as acc_lds_size.
10095 (gcn_init_machine_status): Don't initialise lds_allocated,
10096 lds_allocs, reduc_decls fields of machine function struct.
10097 (gcn_option_override): Handle default size for gang-private
10098 variables and -mgang-private-size option.
10099 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
10100 initialising M0_REG.
10101 (gcn_shared_mem_layout): New function.
10102 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
10103 gang_private_hwm variable.
10104 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
10105 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
10106 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
10107 * config/gcn/gcn.opt (gang_private_size_opt): New global.
10108 (mgang-private-size=): New option.
10109 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
10110 documentation hook.
10111 * doc/tm.texi: Regenerate.
10112 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
10114 (build_sender_ref): Handle sender_decl being pointer.
10115 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
10116 parameters. Pass placement argument to
10117 create_worker_broadcast_record hook invocations. Handle
10118 sender_decl being pointer and isolate_broadcasts inserting extra
10120 (blk_offset_map_t): Add typedef.
10121 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
10122 preallocated range to worker_single_copy call.
10123 (dfs_broadcast_reachable_1): New function.
10124 (idx_decl_pair_t, used_range_vec_t): New typedefs.
10125 (sort_size_descending): New function.
10126 (addr_range): New class.
10127 (splay_tree_compare_addr_range, splay_tree_free_key)
10128 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
10129 (execute_omp_oacc_neuter_broadcast): Rename to...
10130 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
10131 parameters. Arrange layout of shared memory for broadcast
10133 (execute_omp_oacc_neuter_broadcast): New function.
10134 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
10135 handling from here. Enable pass for all OpenACC routines in order
10136 to call shared memory-layout hook.
10137 * target.def (create_worker_broadcast_record): Add OFFSET
10139 (shared_mem_layout): New hook.
10141 2021-09-17 Julian Brown <julian@codesourcery.com>
10142 Thomas Schwinge <thomas@codesourcery.com>
10144 * omp-oacc-neuter-broadcast.cc
10145 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
10147 (execute_omp_oacc_neuter_broadcast): Adjust.
10149 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
10151 * value-relation.cc (class equiv_chain): Move to header file.
10152 (path_oracle::path_oracle): New.
10153 (path_oracle::~path_oracle): New.
10154 (path_oracle::register_relation): New.
10155 (path_oracle::query_relation): New.
10156 (path_oracle::reset_path): New.
10157 (path_oracle::dump): New.
10158 * value-relation.h (class equiv_chain): Move to here.
10159 (class path_oracle): New.
10161 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
10163 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
10165 * gimple-range-fold.cc (fur_depend::register_relation): Use
10166 register_stmt/edge routines.
10167 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
10168 (equiv_oracle::equiv_oracle): Create self equivalence cache.
10169 (equiv_oracle::~equiv_oracle): Release same.
10170 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
10171 are no equivalences.
10172 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
10173 (equiv_oracle::register_relation): Rename from register_equiv.
10174 (relation_chain_head::find_relation): Relocate from dom_oracle.
10175 (relation_oracle::register_stmt): New.
10176 (relation_oracle::register_edge): New.
10177 (dom_oracle::*): Rename from relation_oracle.
10178 (dom_oracle::register_relation): Adjust to call equiv_oracle.
10179 (dom_oracle::set_one_relation): Split from register_relation.
10180 (dom_oracle::register_transitives): Consolidate 2 methods.
10181 (dom_oracle::find_relation_block): Move core to relation_chain.
10182 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
10183 * value-relation.h (class relation_oracle): New pure virtual base.
10184 (class equiv_oracle): Inherit from relation_oracle and adjust.
10185 (class dom_oracle): Rename from old relation_oracle and adjust.
10187 2021-09-17 Martin Sebor <msebor@redhat.com>
10189 PR middle-end/102200
10190 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
10191 (handle_min_max_size): Change argument. Store original SSA_NAME for
10192 operands to potentially distinct (sub)objects.
10193 (compute_objsize_r): Adjust call to the above.
10195 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10197 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
10198 (rs6000_new_builtin_vectorized_function): New function.
10199 (rs6000_new_builtin_md_vectorized_function): Likewise.
10200 (rs6000_builtin_vectorized_function): Call
10201 rs6000_new_builtin_vectorized_function.
10202 (rs6000_builtin_md_vectorized_function): Call
10203 rs6000_new_builtin_md_vectorized_function.
10205 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10207 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
10208 (ASSEMBLE_PAIR): Likewise.
10209 (BUILD_ACC): Likewise.
10210 (DISASSEMBLE_ACC): Likewise.
10211 (DISASSEMBLE_PAIR): Likewise.
10212 (PMXVBF16GER2): Likewise.
10213 (PMXVBF16GER2NN): Likewise.
10214 (PMXVBF16GER2NP): Likewise.
10215 (PMXVBF16GER2PN): Likewise.
10216 (PMXVBF16GER2PP): Likewise.
10217 (PMXVF16GER2): Likewise.
10218 (PMXVF16GER2NN): Likewise.
10219 (PMXVF16GER2NP): Likewise.
10220 (PMXVF16GER2PN): Likewise.
10221 (PMXVF16GER2PP): Likewise.
10222 (PMXVF32GER): Likewise.
10223 (PMXVF32GERNN): Likewise.
10224 (PMXVF32GERNP): Likewise.
10225 (PMXVF32GERPN): Likewise.
10226 (PMXVF32GERPP): Likewise.
10227 (PMXVF64GER): Likewise.
10228 (PMXVF64GERNN): Likewise.
10229 (PMXVF64GERNP): Likewise.
10230 (PMXVF64GERPN): Likewise.
10231 (PMXVF64GERPP): Likewise.
10232 (PMXVI16GER2): Likewise.
10233 (PMXVI16GER2PP): Likewise.
10234 (PMXVI16GER2S): Likewise.
10235 (PMXVI16GER2SPP): Likewise.
10236 (PMXVI4GER8): Likewise.
10237 (PMXVI4GER8PP): Likewise.
10238 (PMXVI8GER4): Likewise.
10239 (PMXVI8GER4PP): Likewise.
10240 (PMXVI8GER4SPP): Likewise.
10241 (XVBF16GER2): Likewise.
10242 (XVBF16GER2NN): Likewise.
10243 (XVBF16GER2NP): Likewise.
10244 (XVBF16GER2PN): Likewise.
10245 (XVBF16GER2PP): Likewise.
10246 (XVF16GER2): Likewise.
10247 (XVF16GER2NN): Likewise.
10248 (XVF16GER2NP): Likewise.
10249 (XVF16GER2PN): Likewise.
10250 (XVF16GER2PP): Likewise.
10251 (XVF32GER): Likewise.
10252 (XVF32GERNN): Likewise.
10253 (XVF32GERNP): Likewise.
10254 (XVF32GERPN): Likewise.
10255 (XVF32GERPP): Likewise.
10256 (XVF64GER): Likewise.
10257 (XVF64GERNN): Likewise.
10258 (XVF64GERNP): Likewise.
10259 (XVF64GERPN): Likewise.
10260 (XVF64GERPP): Likewise.
10261 (XVI16GER2): Likewise.
10262 (XVI16GER2PP): Likewise.
10263 (XVI16GER2S): Likewise.
10264 (XVI16GER2SPP): Likewise.
10265 (XVI4GER8): Likewise.
10266 (XVI4GER8PP): Likewise.
10267 (XVI8GER4): Likewise.
10268 (XVI8GER4PP): Likewise.
10269 (XVI8GER4SPP): Likewise.
10270 (XXMFACC): Likewise.
10271 (XXMTACC): Likewise.
10272 (XXSETACCZ): Likewise.
10273 (ASSEMBLE_PAIR_V): Likewise.
10274 (BUILD_PAIR): Likewise.
10275 (DISASSEMBLE_PAIR_V): Likewise.
10278 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
10279 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
10280 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
10281 (parse_bif_attrs): Handle ismmaint.
10282 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
10283 (write_bif_static_init): Handle ismmaint.
10285 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10287 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
10289 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
10290 (rs6000_new_builtin_valid_without_lhs): New function.
10291 (rs6000_gimple_fold_new_mma_builtin): Likewise.
10292 (rs6000_gimple_fold_new_builtin): Likewise.
10294 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
10296 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
10297 Destruct stale Value objects.
10298 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
10301 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
10304 * match.pd (shift optimizations): Disable recent sign-changing
10305 optimization for shifts by zero, these will be folded later.
10307 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
10309 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
10310 [power9] to [always].
10312 2021-09-17 Richard Biener <rguenther@suse.de>
10314 * tree-vect-stmts.c (vectorizable_load): Do not frob
10317 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10319 * config/i386/i386-features.c (remove_partial_avx_dependency):
10320 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
10321 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
10323 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
10325 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
10326 * config/i386/i386.md (SSE FP to FP splitters): Replace
10327 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
10328 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
10329 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
10330 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
10331 * config/i386/x86-tune.def
10332 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
10333 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
10335 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10338 * config/i386/i386-features.c (remove_partial_avx_dependency):
10339 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
10340 before generating vxorps.
10342 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10344 * config/i386/i386-options.c (processor_cost_table): Use
10345 tremont_cost for Tremont.
10346 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
10347 (tremont_memset): Likewise.
10348 (tremont_cost): Likewise.
10349 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
10350 Enable for Tremont.
10352 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
10354 * common/config/i386/i386-common.c: Use Haswell scheduling model
10356 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
10358 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
10360 (ix86_adjust_cost): Handle Tremont.
10361 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
10362 Enable for Tremont.
10363 (X86_TUNE_USE_LEAVE): Likewise.
10364 (X86_TUNE_PUSH_MEMORY): Likewise.
10365 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
10366 (X86_TUNE_USE_CLTD): Likewise.
10367 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
10368 (X86_TUNE_AVOID_MFENCE): Likewise.
10369 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
10370 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
10371 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
10372 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
10373 (X86_TUNE_OPT_AGU): Likewise.
10374 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
10375 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
10376 (X86_TUNE_EXPAND_ABS): Likewise.
10377 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
10378 (X86_TUNE_SLOW_PSHUFB): Likewise.
10380 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
10382 PR rtl-optimization/102306
10383 * combine.c (try_combine): Abort the combination if we are about to
10384 duplicate volatile references.
10386 2021-09-17 liuhongt <hongtao.liu@intel.com>
10388 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
10390 (_mm256_undefined_ph): Likewise.
10391 (_mm512_undefined_ph): Likewise.
10392 (_mm_cvtsh_h): Likewise.
10393 (_mm256_cvtsh_h): Likewise.
10394 (_mm512_cvtsh_h): Likewise.
10395 (_mm512_castph_ps): Likewise.
10396 (_mm512_castph_pd): Likewise.
10397 (_mm512_castph_si512): Likewise.
10398 (_mm512_castph512_ph128): Likewise.
10399 (_mm512_castph512_ph256): Likewise.
10400 (_mm512_castph128_ph512): Likewise.
10401 (_mm512_castph256_ph512): Likewise.
10402 (_mm512_zextph128_ph512): Likewise.
10403 (_mm512_zextph256_ph512): Likewise.
10404 (_mm512_castps_ph): Likewise.
10405 (_mm512_castpd_ph): Likewise.
10406 (_mm512_castsi512_ph): Likewise.
10407 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
10409 (_mm256_castph_ps): Likewise.
10410 (_mm_castph_pd): Likewise.
10411 (_mm256_castph_pd): Likewise.
10412 (_mm_castph_si128): Likewise.
10413 (_mm256_castph_si256): Likewise.
10414 (_mm_castps_ph): Likewise.
10415 (_mm256_castps_ph): Likewise.
10416 (_mm_castpd_ph): Likewise.
10417 (_mm256_castpd_ph): Likewise.
10418 (_mm_castsi128_ph): Likewise.
10419 (_mm256_castsi256_ph): Likewise.
10420 (_mm256_castph256_ph128): Likewise.
10421 (_mm256_castph128_ph256): Likewise.
10422 (_mm256_zextph128_ph256): Likewise.
10424 2021-09-17 liuhongt <hongtao.liu@intel.com>
10426 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
10428 (_mm_mask_cvtsh_ss): Likewise.
10429 (_mm_maskz_cvtsh_ss): Likewise.
10430 (_mm_cvtsh_sd): Likewise.
10431 (_mm_mask_cvtsh_sd): Likewise.
10432 (_mm_maskz_cvtsh_sd): Likewise.
10433 (_mm_cvt_roundsh_ss): Likewise.
10434 (_mm_mask_cvt_roundsh_ss): Likewise.
10435 (_mm_maskz_cvt_roundsh_ss): Likewise.
10436 (_mm_cvt_roundsh_sd): Likewise.
10437 (_mm_mask_cvt_roundsh_sd): Likewise.
10438 (_mm_maskz_cvt_roundsh_sd): Likewise.
10439 (_mm_cvtss_sh): Likewise.
10440 (_mm_mask_cvtss_sh): Likewise.
10441 (_mm_maskz_cvtss_sh): Likewise.
10442 (_mm_cvtsd_sh): Likewise.
10443 (_mm_mask_cvtsd_sh): Likewise.
10444 (_mm_maskz_cvtsd_sh): Likewise.
10445 (_mm_cvt_roundss_sh): Likewise.
10446 (_mm_mask_cvt_roundss_sh): Likewise.
10447 (_mm_maskz_cvt_roundss_sh): Likewise.
10448 (_mm_cvt_roundsd_sh): Likewise.
10449 (_mm_mask_cvt_roundsd_sh): Likewise.
10450 (_mm_maskz_cvt_roundsd_sh): Likewise.
10451 * config/i386/i386-builtin-types.def
10452 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
10453 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
10454 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
10455 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
10456 * config/i386/i386-builtin.def: Add corrresponding new builtins.
10457 * config/i386/i386-expand.c: Handle new builtin types.
10458 * config/i386/sse.md (VF48_128): New mode iterator.
10459 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
10461 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
10464 2021-09-17 liuhongt <hongtao.liu@intel.com>
10466 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
10468 (_mm512_mask_cvtph_pd): Likewise.
10469 (_mm512_maskz_cvtph_pd): Likewise.
10470 (_mm512_cvt_roundph_pd): Likewise.
10471 (_mm512_mask_cvt_roundph_pd): Likewise.
10472 (_mm512_maskz_cvt_roundph_pd): Likewise.
10473 (_mm512_cvtxph_ps): Likewise.
10474 (_mm512_mask_cvtxph_ps): Likewise.
10475 (_mm512_maskz_cvtxph_ps): Likewise.
10476 (_mm512_cvtx_roundph_ps): Likewise.
10477 (_mm512_mask_cvtx_roundph_ps): Likewise.
10478 (_mm512_maskz_cvtx_roundph_ps): Likewise.
10479 (_mm512_cvtxps_ph): Likewise.
10480 (_mm512_mask_cvtxps_ph): Likewise.
10481 (_mm512_maskz_cvtxps_ph): Likewise.
10482 (_mm512_cvtx_roundps_ph): Likewise.
10483 (_mm512_mask_cvtx_roundps_ph): Likewise.
10484 (_mm512_maskz_cvtx_roundps_ph): Likewise.
10485 (_mm512_cvtpd_ph): Likewise.
10486 (_mm512_mask_cvtpd_ph): Likewise.
10487 (_mm512_maskz_cvtpd_ph): Likewise.
10488 (_mm512_cvt_roundpd_ph): Likewise.
10489 (_mm512_mask_cvt_roundpd_ph): Likewise.
10490 (_mm512_maskz_cvt_roundpd_ph): Likewise.
10491 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
10493 (_mm_mask_cvtph_pd): Likewise.
10494 (_mm_maskz_cvtph_pd): Likewise.
10495 (_mm256_cvtph_pd): Likewise.
10496 (_mm256_mask_cvtph_pd): Likewise.
10497 (_mm256_maskz_cvtph_pd): Likewise.
10498 (_mm_cvtxph_ps): Likewise.
10499 (_mm_mask_cvtxph_ps): Likewise.
10500 (_mm_maskz_cvtxph_ps): Likewise.
10501 (_mm256_cvtxph_ps): Likewise.
10502 (_mm256_mask_cvtxph_ps): Likewise.
10503 (_mm256_maskz_cvtxph_ps): Likewise.
10504 (_mm_cvtxps_ph): Likewise.
10505 (_mm_mask_cvtxps_ph): Likewise.
10506 (_mm_maskz_cvtxps_ph): Likewise.
10507 (_mm256_cvtxps_ph): Likewise.
10508 (_mm256_mask_cvtxps_ph): Likewise.
10509 (_mm256_maskz_cvtxps_ph): Likewise.
10510 (_mm_cvtpd_ph): Likewise.
10511 (_mm_mask_cvtpd_ph): Likewise.
10512 (_mm_maskz_cvtpd_ph): Likewise.
10513 (_mm256_cvtpd_ph): Likewise.
10514 (_mm256_mask_cvtpd_ph): Likewise.
10515 (_mm256_maskz_cvtpd_ph): Likewise.
10516 * config/i386/i386-builtin.def: Add corresponding new builtins.
10517 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10518 * config/i386/i386-expand.c: Handle new builtin types.
10519 * config/i386/sse.md
10520 (VF4_128_8_256): New.
10521 (VF48H_AVX512VL): Ditto.
10522 (ssePHmode): Add HF vector modes.
10523 (castmode): Add new convertable modes.
10524 (qq2phsuff): Ditto.
10525 (ph2pssuffix): New.
10526 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
10527 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
10528 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
10529 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
10530 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
10531 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
10532 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
10534 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
10535 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
10536 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
10537 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
10539 2021-09-17 liuhongt <hongtao.liu@intel.com>
10541 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
10543 (_mm_cvttsh_u32): Likewise.
10544 (_mm_cvtt_roundsh_i32): Likewise.
10545 (_mm_cvtt_roundsh_u32): Likewise.
10546 (_mm_cvttsh_i64): Likewise.
10547 (_mm_cvttsh_u64): Likewise.
10548 (_mm_cvtt_roundsh_i64): Likewise.
10549 (_mm_cvtt_roundsh_u64): Likewise.
10550 * config/i386/i386-builtin.def: Add corresponding new builtins.
10551 * config/i386/sse.md
10552 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
10555 2021-09-17 liuhongt <hongtao.liu@intel.com>
10557 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
10559 (_mm512_mask_cvttph_epi32): Likewise.
10560 (_mm512_maskz_cvttph_epi32): Likewise.
10561 (_mm512_cvtt_roundph_epi32): Likewise.
10562 (_mm512_mask_cvtt_roundph_epi32): Likewise.
10563 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
10564 (_mm512_cvttph_epu32): Likewise.
10565 (_mm512_mask_cvttph_epu32): Likewise.
10566 (_mm512_maskz_cvttph_epu32): Likewise.
10567 (_mm512_cvtt_roundph_epu32): Likewise.
10568 (_mm512_mask_cvtt_roundph_epu32): Likewise.
10569 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
10570 (_mm512_cvttph_epi64): Likewise.
10571 (_mm512_mask_cvttph_epi64): Likewise.
10572 (_mm512_maskz_cvttph_epi64): Likewise.
10573 (_mm512_cvtt_roundph_epi64): Likewise.
10574 (_mm512_mask_cvtt_roundph_epi64): Likewise.
10575 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
10576 (_mm512_cvttph_epu64): Likewise.
10577 (_mm512_mask_cvttph_epu64): Likewise.
10578 (_mm512_maskz_cvttph_epu64): Likewise.
10579 (_mm512_cvtt_roundph_epu64): Likewise.
10580 (_mm512_mask_cvtt_roundph_epu64): Likewise.
10581 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
10582 (_mm512_cvttph_epi16): Likewise.
10583 (_mm512_mask_cvttph_epi16): Likewise.
10584 (_mm512_maskz_cvttph_epi16): Likewise.
10585 (_mm512_cvtt_roundph_epi16): Likewise.
10586 (_mm512_mask_cvtt_roundph_epi16): Likewise.
10587 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
10588 (_mm512_cvttph_epu16): Likewise.
10589 (_mm512_mask_cvttph_epu16): Likewise.
10590 (_mm512_maskz_cvttph_epu16): Likewise.
10591 (_mm512_cvtt_roundph_epu16): Likewise.
10592 (_mm512_mask_cvtt_roundph_epu16): Likewise.
10593 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
10594 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
10596 (_mm_mask_cvttph_epi32): Likewise.
10597 (_mm_maskz_cvttph_epi32): Likewise.
10598 (_mm256_cvttph_epi32): Likewise.
10599 (_mm256_mask_cvttph_epi32): Likewise.
10600 (_mm256_maskz_cvttph_epi32): Likewise.
10601 (_mm_cvttph_epu32): Likewise.
10602 (_mm_mask_cvttph_epu32): Likewise.
10603 (_mm_maskz_cvttph_epu32): Likewise.
10604 (_mm256_cvttph_epu32): Likewise.
10605 (_mm256_mask_cvttph_epu32): Likewise.
10606 (_mm256_maskz_cvttph_epu32): Likewise.
10607 (_mm_cvttph_epi64): Likewise.
10608 (_mm_mask_cvttph_epi64): Likewise.
10609 (_mm_maskz_cvttph_epi64): Likewise.
10610 (_mm256_cvttph_epi64): Likewise.
10611 (_mm256_mask_cvttph_epi64): Likewise.
10612 (_mm256_maskz_cvttph_epi64): Likewise.
10613 (_mm_cvttph_epu64): Likewise.
10614 (_mm_mask_cvttph_epu64): Likewise.
10615 (_mm_maskz_cvttph_epu64): Likewise.
10616 (_mm256_cvttph_epu64): Likewise.
10617 (_mm256_mask_cvttph_epu64): Likewise.
10618 (_mm256_maskz_cvttph_epu64): Likewise.
10619 (_mm_cvttph_epi16): Likewise.
10620 (_mm_mask_cvttph_epi16): Likewise.
10621 (_mm_maskz_cvttph_epi16): Likewise.
10622 (_mm256_cvttph_epi16): Likewise.
10623 (_mm256_mask_cvttph_epi16): Likewise.
10624 (_mm256_maskz_cvttph_epi16): Likewise.
10625 (_mm_cvttph_epu16): Likewise.
10626 (_mm_mask_cvttph_epu16): Likewise.
10627 (_mm_maskz_cvttph_epu16): Likewise.
10628 (_mm256_cvttph_epu16): Likewise.
10629 (_mm256_mask_cvttph_epu16): Likewise.
10630 (_mm256_maskz_cvttph_epu16): Likewise.
10631 * config/i386/i386-builtin.def: Add new builtins.
10632 * config/i386/sse.md
10633 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
10635 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
10636 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
10637 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
10638 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
10640 2021-09-17 liuhongt <hongtao.liu@intel.com>
10642 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
10643 (_mm_cvtsh_u32): Likewise.
10644 (_mm_cvt_roundsh_i32): Likewise.
10645 (_mm_cvt_roundsh_u32): Likewise.
10646 (_mm_cvtsh_i64): Likewise.
10647 (_mm_cvtsh_u64): Likewise.
10648 (_mm_cvt_roundsh_i64): Likewise.
10649 (_mm_cvt_roundsh_u64): Likewise.
10650 (_mm_cvti32_sh): Likewise.
10651 (_mm_cvtu32_sh): Likewise.
10652 (_mm_cvt_roundi32_sh): Likewise.
10653 (_mm_cvt_roundu32_sh): Likewise.
10654 (_mm_cvti64_sh): Likewise.
10655 (_mm_cvtu64_sh): Likewise.
10656 (_mm_cvt_roundi64_sh): Likewise.
10657 (_mm_cvt_roundu64_sh): Likewise.
10658 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10659 * config/i386/i386-builtin.def: Add corresponding new builtins.
10660 * config/i386/i386-expand.c (ix86_expand_round_builtin):
10661 Handle new builtin types.
10662 * config/i386/sse.md
10663 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
10665 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
10666 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
10668 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
10670 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
10671 (altivec_resolve_new_overloaded_builtin): New forward decl.
10672 (rs6000_new_builtin_type_compatible): New function.
10673 (altivec_resolve_overloaded_builtin): Call
10674 altivec_resolve_new_overloaded_builtin.
10675 (altivec_build_new_resolved_builtin): New function.
10676 (altivec_resolve_new_overloaded_builtin): Likewise.
10677 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
10679 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
10680 name of rs6000_new_builtin_is_supported.
10682 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
10684 * config/i386/i386-protos.h (ix86_decompose_address):
10685 Change return type to bool.
10686 * config/i386/i386.c (ix86_decompose_address): Ditto.
10688 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
10691 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
10692 Added 'build/' to target, use build/%.o rule.
10693 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
10694 '$(build_exeext)' to target and 'build/' for the *.o files.
10695 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
10698 2021-09-16 Martin Jambor <mjambor@suse.cz>
10700 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
10701 the body has been removed. Remove trailing whitespace.
10703 2021-09-16 Richard Biener <rguenther@suse.de>
10705 PR middle-end/102360
10706 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
10707 of non-memory more robust.
10709 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10711 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
10712 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
10713 (leon5_adjust_cost): Increase cost of store with data dependency
10714 on ALU instruction and FPU anti-dependencies.
10715 (sparc_option_override): Add LEON5 costs
10716 (sparc_adjust_cost): Add LEON5 cost adjustments
10717 * config/sparc/sparc.h: Add LEON5
10718 * config/sparc/sparc.md: Include LEON5 scheduling information
10719 * config/sparc/sparc.opt: Add LEON5
10720 * doc/invoke.texi: Add LEON5
10721 * config/sparc/leon5.md: New file.
10723 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10725 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
10726 sensitive sequence for B2BST errata workaround.
10728 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10730 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
10731 functions with atomic instruction in the UT700 errata workaround.
10733 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10735 * config/sparc/sparc.c (next_active_non_empty_insn): New function
10736 that returns next active non empty assembly instruction.
10737 (sparc_do_work_around_errata): Use new function.
10739 2021-09-16 Daniel Cederman <cederman@gaisler.com>
10741 * config/sparc/sparc.c (store_insn_p): Add predicate for store
10743 (load_insn_p): Add predicate for load attributes.
10744 (sparc_do_work_around_errata): Use new predicates.
10746 2021-09-16 Andreas Larsson <andreas@gaisler.com>
10748 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
10751 2021-09-16 Martin Liska <mliska@suse.cz>
10753 * config/mips/netbsd.h: Fix typo in name of a macro.
10755 2021-09-16 liuhongt <hongtao.liu@intel.com>
10757 PR middle-end/102080
10758 * match.pd: Check mask type when doing cond_op related gimple
10760 * tree.c (is_truth_type_for): New function.
10761 * tree.h (is_truth_type_for): New declaration.
10763 2021-09-16 liuhongt <hongtao.liu@intel.com>
10765 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
10767 (_mm512_mask_cvtepi32_ph): Likewise.
10768 (_mm512_maskz_cvtepi32_ph): Likewise.
10769 (_mm512_cvt_roundepi32_ph): Likewise.
10770 (_mm512_mask_cvt_roundepi32_ph): Likewise.
10771 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
10772 (_mm512_cvtepu32_ph): Likewise.
10773 (_mm512_mask_cvtepu32_ph): Likewise.
10774 (_mm512_maskz_cvtepu32_ph): Likewise.
10775 (_mm512_cvt_roundepu32_ph): Likewise.
10776 (_mm512_mask_cvt_roundepu32_ph): Likewise.
10777 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
10778 (_mm512_cvtepi64_ph): Likewise.
10779 (_mm512_mask_cvtepi64_ph): Likewise.
10780 (_mm512_maskz_cvtepi64_ph): Likewise.
10781 (_mm512_cvt_roundepi64_ph): Likewise.
10782 (_mm512_mask_cvt_roundepi64_ph): Likewise.
10783 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
10784 (_mm512_cvtepu64_ph): Likewise.
10785 (_mm512_mask_cvtepu64_ph): Likewise.
10786 (_mm512_maskz_cvtepu64_ph): Likewise.
10787 (_mm512_cvt_roundepu64_ph): Likewise.
10788 (_mm512_mask_cvt_roundepu64_ph): Likewise.
10789 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
10790 (_mm512_cvtepi16_ph): Likewise.
10791 (_mm512_mask_cvtepi16_ph): Likewise.
10792 (_mm512_maskz_cvtepi16_ph): Likewise.
10793 (_mm512_cvt_roundepi16_ph): Likewise.
10794 (_mm512_mask_cvt_roundepi16_ph): Likewise.
10795 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
10796 (_mm512_cvtepu16_ph): Likewise.
10797 (_mm512_mask_cvtepu16_ph): Likewise.
10798 (_mm512_maskz_cvtepu16_ph): Likewise.
10799 (_mm512_cvt_roundepu16_ph): Likewise.
10800 (_mm512_mask_cvt_roundepu16_ph): Likewise.
10801 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
10802 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
10804 (_mm_mask_cvtepi32_ph): Likewise.
10805 (_mm_maskz_cvtepi32_ph): Likewise.
10806 (_mm256_cvtepi32_ph): Likewise.
10807 (_mm256_mask_cvtepi32_ph): Likewise.
10808 (_mm256_maskz_cvtepi32_ph): Likewise.
10809 (_mm_cvtepu32_ph): Likewise.
10810 (_mm_mask_cvtepu32_ph): Likewise.
10811 (_mm_maskz_cvtepu32_ph): Likewise.
10812 (_mm256_cvtepu32_ph): Likewise.
10813 (_mm256_mask_cvtepu32_ph): Likewise.
10814 (_mm256_maskz_cvtepu32_ph): Likewise.
10815 (_mm_cvtepi64_ph): Likewise.
10816 (_mm_mask_cvtepi64_ph): Likewise.
10817 (_mm_maskz_cvtepi64_ph): Likewise.
10818 (_mm256_cvtepi64_ph): Likewise.
10819 (_mm256_mask_cvtepi64_ph): Likewise.
10820 (_mm256_maskz_cvtepi64_ph): Likewise.
10821 (_mm_cvtepu64_ph): Likewise.
10822 (_mm_mask_cvtepu64_ph): Likewise.
10823 (_mm_maskz_cvtepu64_ph): Likewise.
10824 (_mm256_cvtepu64_ph): Likewise.
10825 (_mm256_mask_cvtepu64_ph): Likewise.
10826 (_mm256_maskz_cvtepu64_ph): Likewise.
10827 (_mm_cvtepi16_ph): Likewise.
10828 (_mm_mask_cvtepi16_ph): Likewise.
10829 (_mm_maskz_cvtepi16_ph): Likewise.
10830 (_mm256_cvtepi16_ph): Likewise.
10831 (_mm256_mask_cvtepi16_ph): Likewise.
10832 (_mm256_maskz_cvtepi16_ph): Likewise.
10833 (_mm_cvtepu16_ph): Likewise.
10834 (_mm_mask_cvtepu16_ph): Likewise.
10835 (_mm_maskz_cvtepu16_ph): Likewise.
10836 (_mm256_cvtepu16_ph): Likewise.
10837 (_mm256_mask_cvtepu16_ph): Likewise.
10838 (_mm256_maskz_cvtepu16_ph): Likewise.
10839 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10840 * config/i386/i386-builtin.def: Add corresponding new builtins.
10841 * config/i386/i386-expand.c
10842 (ix86_expand_args_builtin): Handle new builtin types.
10843 (ix86_expand_round_builtin): Ditto.
10844 * config/i386/i386-modes.def: Declare V2HF and V6HF.
10845 * config/i386/sse.md (VI2H_AVX512VL): New.
10846 (qq2phsuff): Ditto.
10847 (sseintvecmode): Add HF vector modes.
10848 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
10850 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
10851 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
10852 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
10853 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
10854 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
10855 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
10856 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
10857 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
10858 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
10859 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
10860 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
10862 2021-09-16 liuhongt <hongtao.liu@intel.com>
10864 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
10866 (_mm512_mask_cvtph_epi32): Likewise.
10867 (_mm512_maskz_cvtph_epi32): Likewise.
10868 (_mm512_cvt_roundph_epi32): Likewise.
10869 (_mm512_mask_cvt_roundph_epi32): Likewise.
10870 (_mm512_maskz_cvt_roundph_epi32): Likewise.
10871 (_mm512_cvtph_epu32): Likewise.
10872 (_mm512_mask_cvtph_epu32): Likewise.
10873 (_mm512_maskz_cvtph_epu32): Likewise.
10874 (_mm512_cvt_roundph_epu32): Likewise.
10875 (_mm512_mask_cvt_roundph_epu32): Likewise.
10876 (_mm512_maskz_cvt_roundph_epu32): Likewise.
10877 (_mm512_cvtph_epi64): Likewise.
10878 (_mm512_mask_cvtph_epi64): Likewise.
10879 (_mm512_maskz_cvtph_epi64): Likewise.
10880 (_mm512_cvt_roundph_epi64): Likewise.
10881 (_mm512_mask_cvt_roundph_epi64): Likewise.
10882 (_mm512_maskz_cvt_roundph_epi64): Likewise.
10883 (_mm512_cvtph_epu64): Likewise.
10884 (_mm512_mask_cvtph_epu64): Likewise.
10885 (_mm512_maskz_cvtph_epu64): Likewise.
10886 (_mm512_cvt_roundph_epu64): Likewise.
10887 (_mm512_mask_cvt_roundph_epu64): Likewise.
10888 (_mm512_maskz_cvt_roundph_epu64): Likewise.
10889 (_mm512_cvtph_epi16): Likewise.
10890 (_mm512_mask_cvtph_epi16): Likewise.
10891 (_mm512_maskz_cvtph_epi16): Likewise.
10892 (_mm512_cvt_roundph_epi16): Likewise.
10893 (_mm512_mask_cvt_roundph_epi16): Likewise.
10894 (_mm512_maskz_cvt_roundph_epi16): Likewise.
10895 (_mm512_cvtph_epu16): Likewise.
10896 (_mm512_mask_cvtph_epu16): Likewise.
10897 (_mm512_maskz_cvtph_epu16): Likewise.
10898 (_mm512_cvt_roundph_epu16): Likewise.
10899 (_mm512_mask_cvt_roundph_epu16): Likewise.
10900 (_mm512_maskz_cvt_roundph_epu16): Likewise.
10901 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
10903 (_mm_mask_cvtph_epi32): Likewise.
10904 (_mm_maskz_cvtph_epi32): Likewise.
10905 (_mm256_cvtph_epi32): Likewise.
10906 (_mm256_mask_cvtph_epi32): Likewise.
10907 (_mm256_maskz_cvtph_epi32): Likewise.
10908 (_mm_cvtph_epu32): Likewise.
10909 (_mm_mask_cvtph_epu32): Likewise.
10910 (_mm_maskz_cvtph_epu32): Likewise.
10911 (_mm256_cvtph_epu32): Likewise.
10912 (_mm256_mask_cvtph_epu32): Likewise.
10913 (_mm256_maskz_cvtph_epu32): Likewise.
10914 (_mm_cvtph_epi64): Likewise.
10915 (_mm_mask_cvtph_epi64): Likewise.
10916 (_mm_maskz_cvtph_epi64): Likewise.
10917 (_mm256_cvtph_epi64): Likewise.
10918 (_mm256_mask_cvtph_epi64): Likewise.
10919 (_mm256_maskz_cvtph_epi64): Likewise.
10920 (_mm_cvtph_epu64): Likewise.
10921 (_mm_mask_cvtph_epu64): Likewise.
10922 (_mm_maskz_cvtph_epu64): Likewise.
10923 (_mm256_cvtph_epu64): Likewise.
10924 (_mm256_mask_cvtph_epu64): Likewise.
10925 (_mm256_maskz_cvtph_epu64): Likewise.
10926 (_mm_cvtph_epi16): Likewise.
10927 (_mm_mask_cvtph_epi16): Likewise.
10928 (_mm_maskz_cvtph_epi16): Likewise.
10929 (_mm256_cvtph_epi16): Likewise.
10930 (_mm256_mask_cvtph_epi16): Likewise.
10931 (_mm256_maskz_cvtph_epi16): Likewise.
10932 (_mm_cvtph_epu16): Likewise.
10933 (_mm_mask_cvtph_epu16): Likewise.
10934 (_mm_maskz_cvtph_epu16): Likewise.
10935 (_mm256_cvtph_epu16): Likewise.
10936 (_mm256_mask_cvtph_epu16): Likewise.
10937 (_mm256_maskz_cvtph_epu16): Likewise.
10938 * config/i386/i386-builtin-types.def: Add new builtin types.
10939 * config/i386/i386-builtin.def: Add new builtins.
10940 * config/i386/i386-expand.c
10941 (ix86_expand_args_builtin): Handle new builtin types.
10942 (ix86_expand_round_builtin): Ditto.
10943 * config/i386/sse.md (sseintconvert): New.
10944 (ssePHmode): Ditto.
10945 (UNSPEC_US_FIX_NOTRUNC): Ditto.
10946 (sseintconvertsignprefix): Ditto.
10947 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
10950 2021-09-16 liuhongt <hongtao.liu@intel.com>
10952 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
10954 (_mm_cvtsi128_si16): Likewise.
10955 (_mm_mask_load_sh): Likewise.
10956 (_mm_maskz_load_sh): Likewise.
10957 (_mm_mask_store_sh): Likewise.
10958 (_mm_move_sh): Likewise.
10959 (_mm_mask_move_sh): Likewise.
10960 (_mm_maskz_move_sh): Likewise.
10961 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
10962 * config/i386/i386-builtin.def: Add corresponding new builtins.
10963 * config/i386/i386-expand.c
10964 (ix86_expand_special_args_builtin): Handle new builtin types.
10965 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
10966 * config/i386/sse.md (VI2F): New mode iterator.
10967 (vec_set<mode>_0): Use new mode iterator.
10968 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
10969 (avx512f_store<mode>_mask): Ditto.
10971 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
10973 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
10975 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
10977 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
10978 Proceed if no symbol summary or the symbol alias flag is false.
10980 2021-09-15 Jakub Jelinek <jakub@redhat.com>
10984 * varasm.c (output_constructor_regular_field): Instead of assertion
10985 that array_size_for_constructor result is equal to size of
10986 TREE_TYPE (local->val) in bytes, assert that the type size is greater
10987 or equal to array_size_for_constructor result and use type size as
10990 2021-09-15 Martin Liska <mliska@suse.cz>
10993 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
10995 2021-09-15 Martin Liska <mliska@suse.cz>
10998 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
10999 Check that we have a symbol summary for a symbol.
11001 2021-09-15 Richard Biener <rguenther@suse.de>
11004 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11005 to inherit from elfos.h
11007 2021-09-15 liuhongt <hongtao.liu@intel.com>
11010 * config/i386/i386-expand.c
11011 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
11013 (ix86_expand_vector_set): Use blendw instead of pinsrw.
11014 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
11015 AVX512FP16 which supports 16bit vector load.
11016 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
11018 (avx512bw_interleave_high<mode><mask_name>): .. this, and
11019 extend to V32HFmode.
11020 (avx2_interleave_highv16hi<mask_name>): Rename to ..
11021 (avx2_interleave_high<mode><mask_name>): .. this, and extend
11023 (vec_interleave_highv8hi<mask_name>): Rename to ..
11024 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
11025 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
11027 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
11028 this, and extend to V32HFmode.
11029 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
11030 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
11031 (vec_interleave_lowv8hi<mask_name>): Rename to ..
11032 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
11033 (sse4_1_pblendw): Rename to ..
11034 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
11035 (avx2_pblendph): New define_expand.
11036 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
11037 sseintmodesuffix instead of ssemodesuffix.
11038 (blendsuf): New mode attr.
11040 2021-09-15 Richard Biener <rguenther@suse.de>
11042 * tree-vectorizer.h (dr_misalignment): Move out of line.
11043 (dr_target_alignment): New.
11044 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
11045 (set_dr_target_alignment): New.
11046 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
11047 * tree-vect-data-refs.c (dr_misalignment): Compute and
11048 return the group members misalignment.
11049 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
11050 (vect_analyze_data_refs_alignment): Compute alignment only
11051 for the first element of a DR group.
11052 (vect_slp_analyze_node_alignment): Likewise.
11054 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
11056 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
11057 * config/i386/avx512fp16vlintrin.h: Likewise.
11058 * config/i386/i386-builtin.def: Adjust builtin name and
11059 enumeration to match AVX512F style.
11061 2021-09-15 Richard Biener <rguenther@suse.de>
11063 PR tree-optimization/102318
11064 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
11065 previous change and do the mode conversion separately from
11066 the sign conversion.
11068 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
11069 Peter Cordes <peter@cordes.ca>
11072 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
11073 (*vec_extract<mode><ssescalarmodelower>_valign): Output
11074 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
11077 2021-09-15 Richard Biener <rguenther@suse.de>
11079 * config.gcc: Remove vax-*-openbsd* configuration.
11081 2021-09-15 Richard Biener <rguenther@suse.de>
11083 * config.gcc: Remove m68k-openbsd.
11085 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
11088 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
11090 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
11092 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
11093 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
11094 (*mma_xxsetaccz): Delete.
11095 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
11096 Use UNSPECV_MMA_XXSETACCZ. Update comment.
11097 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
11099 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
11101 * Makefile.in: Remove variables related to applying no-PIE
11102 to the exes on $build.
11103 * configure: Regenerate.
11104 * configure.ac: Remove configuration related to applying
11105 no-PIE to the exes on $build.
11107 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
11109 * config/arc/arc.md (doloop_end): Add missing mode.
11110 (loop_end): Likewise.
11112 2021-09-14 Jakub Jelinek <jakub@redhat.com>
11114 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
11115 it to recursive calls, for depth above 7 just gimplify or return.
11116 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
11117 __builtin_clear_padding and TARGET_EXPR.
11118 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
11120 2021-09-14 liuhongt <hongtao.liu@intel.com>
11122 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
11124 (_mm_mask_fpclass_sh_mask): Likewise.
11125 (_mm512_mask_fpclass_ph_mask): Likewise.
11126 (_mm512_fpclass_ph_mask): Likewise.
11127 (_mm_getexp_sh): Likewise.
11128 (_mm_mask_getexp_sh): Likewise.
11129 (_mm_maskz_getexp_sh): Likewise.
11130 (_mm512_getexp_ph): Likewise.
11131 (_mm512_mask_getexp_ph): Likewise.
11132 (_mm512_maskz_getexp_ph): Likewise.
11133 (_mm_getexp_round_sh): Likewise.
11134 (_mm_mask_getexp_round_sh): Likewise.
11135 (_mm_maskz_getexp_round_sh): Likewise.
11136 (_mm512_getexp_round_ph): Likewise.
11137 (_mm512_mask_getexp_round_ph): Likewise.
11138 (_mm512_maskz_getexp_round_ph): Likewise.
11139 (_mm_getmant_sh): Likewise.
11140 (_mm_mask_getmant_sh): Likewise.
11141 (_mm_maskz_getmant_sh): Likewise.
11142 (_mm512_getmant_ph): Likewise.
11143 (_mm512_mask_getmant_ph): Likewise.
11144 (_mm512_maskz_getmant_ph): Likewise.
11145 (_mm_getmant_round_sh): Likewise.
11146 (_mm_mask_getmant_round_sh): Likewise.
11147 (_mm_maskz_getmant_round_sh): Likewise.
11148 (_mm512_getmant_round_ph): Likewise.
11149 (_mm512_mask_getmant_round_ph): Likewise.
11150 (_mm512_maskz_getmant_round_ph): Likewise.
11151 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
11153 (_mm_fpclass_ph_mask): Likewise.
11154 (_mm256_mask_fpclass_ph_mask): Likewise.
11155 (_mm256_fpclass_ph_mask): Likewise.
11156 (_mm256_getexp_ph): Likewise.
11157 (_mm256_mask_getexp_ph): Likewise.
11158 (_mm256_maskz_getexp_ph): Likewise.
11159 (_mm_getexp_ph): Likewise.
11160 (_mm_mask_getexp_ph): Likewise.
11161 (_mm_maskz_getexp_ph): Likewise.
11162 (_mm256_getmant_ph): Likewise.
11163 (_mm256_mask_getmant_ph): Likewise.
11164 (_mm256_maskz_getmant_ph): Likewise.
11165 (_mm_getmant_ph): Likewise.
11166 (_mm_mask_getmant_ph): Likewise.
11167 (_mm_maskz_getmant_ph): Likewise.
11168 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11169 * config/i386/i386-builtin.def: Add corresponding new builtins.
11170 * config/i386/i386-expand.c
11171 (ix86_expand_args_builtin): Handle new builtin types.
11172 (ix86_expand_round_builtin): Ditto.
11173 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
11174 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
11175 to support HF vector modes.
11176 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
11178 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
11179 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
11180 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
11181 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
11184 2021-09-14 liuhongt <hongtao.liu@intel.com>
11186 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
11188 (_mm512_mask_reduce_ph): Likewise.
11189 (_mm512_maskz_reduce_ph): Likewise.
11190 (_mm512_reduce_round_ph): Likewise.
11191 (_mm512_mask_reduce_round_ph): Likewise.
11192 (_mm512_maskz_reduce_round_ph): Likewise.
11193 (_mm_reduce_sh): Likewise.
11194 (_mm_mask_reduce_sh): Likewise.
11195 (_mm_maskz_reduce_sh): Likewise.
11196 (_mm_reduce_round_sh): Likewise.
11197 (_mm_mask_reduce_round_sh): Likewise.
11198 (_mm_maskz_reduce_round_sh): Likewise.
11199 (_mm512_roundscale_ph): Likewise.
11200 (_mm512_mask_roundscale_ph): Likewise.
11201 (_mm512_maskz_roundscale_ph): Likewise.
11202 (_mm512_roundscale_round_ph): Likewise.
11203 (_mm512_mask_roundscale_round_ph): Likewise.
11204 (_mm512_maskz_roundscale_round_ph): Likewise.
11205 (_mm_roundscale_sh): Likewise.
11206 (_mm_mask_roundscale_sh): Likewise.
11207 (_mm_maskz_roundscale_sh): Likewise.
11208 (_mm_roundscale_round_sh): Likewise.
11209 (_mm_mask_roundscale_round_sh): Likewise.
11210 (_mm_maskz_roundscale_round_sh): Likewise.
11211 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
11213 (_mm_mask_reduce_ph): Likewise.
11214 (_mm_maskz_reduce_ph): Likewise.
11215 (_mm256_reduce_ph): Likewise.
11216 (_mm256_mask_reduce_ph): Likewise.
11217 (_mm256_maskz_reduce_ph): Likewise.
11218 (_mm_roundscale_ph): Likewise.
11219 (_mm_mask_roundscale_ph): Likewise.
11220 (_mm_maskz_roundscale_ph): Likewise.
11221 (_mm256_roundscale_ph): Likewise.
11222 (_mm256_mask_roundscale_ph): Likewise.
11223 (_mm256_maskz_roundscale_ph): Likewise.
11224 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11225 * config/i386/i386-builtin.def: Add corresponding new builtins.
11226 * config/i386/i386-expand.c
11227 (ix86_expand_args_builtin): Handle new builtin types.
11228 (ix86_expand_round_builtin): Ditto.
11229 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
11231 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
11232 ... this, and adjust for round operands.
11233 (reduces<mode><mask_scalar_name>): Likewise, with ...
11234 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
11236 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
11237 Adjust for HF vector modes.
11238 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
11240 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
11242 2021-09-14 liuhongt <hongtao.liu@intel.com>
11244 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
11246 (_mm512_mask_rcp_ph): Likewise.
11247 (_mm512_maskz_rcp_ph): Likewise.
11248 (_mm_rcp_sh): Likewise.
11249 (_mm_mask_rcp_sh): Likewise.
11250 (_mm_maskz_rcp_sh): Likewise.
11251 (_mm512_scalef_ph): Likewise.
11252 (_mm512_mask_scalef_ph): Likewise.
11253 (_mm512_maskz_scalef_ph): Likewise.
11254 (_mm512_scalef_round_ph): Likewise.
11255 (_mm512_mask_scalef_round_ph): Likewise.
11256 (_mm512_maskz_scalef_round_ph): Likewise.
11257 (_mm_scalef_sh): Likewise.
11258 (_mm_mask_scalef_sh): Likewise.
11259 (_mm_maskz_scalef_sh): Likewise.
11260 (_mm_scalef_round_sh): Likewise.
11261 (_mm_mask_scalef_round_sh): Likewise.
11262 (_mm_maskz_scalef_round_sh): Likewise.
11263 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
11265 (_mm256_rcp_ph): Likewise.
11266 (_mm_mask_rcp_ph): Likewise.
11267 (_mm256_mask_rcp_ph): Likewise.
11268 (_mm_maskz_rcp_ph): Likewise.
11269 (_mm256_maskz_rcp_ph): Likewise.
11270 (_mm_scalef_ph): Likewise.
11271 (_mm256_scalef_ph): Likewise.
11272 (_mm_mask_scalef_ph): Likewise.
11273 (_mm256_mask_scalef_ph): Likewise.
11274 (_mm_maskz_scalef_ph): Likewise.
11275 (_mm256_maskz_scalef_ph): Likewise.
11276 * config/i386/i386-builtin.def: Add new builtins.
11277 * config/i386/sse.md (VFH_AVX512VL): New.
11278 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
11279 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
11280 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
11281 Adjust to support HF vector modes.
11282 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
11284 2021-09-14 liuhongt <hongtao.liu@intel.com>
11286 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
11288 (_mm512_mask_sqrt_ph): Likewise.
11289 (_mm512_maskz_sqrt_ph): Likewise.
11290 (_mm512_sqrt_round_ph): Likewise.
11291 (_mm512_mask_sqrt_round_ph): Likewise.
11292 (_mm512_maskz_sqrt_round_ph): Likewise.
11293 (_mm512_rsqrt_ph): Likewise.
11294 (_mm512_mask_rsqrt_ph): Likewise.
11295 (_mm512_maskz_rsqrt_ph): Likewise.
11296 (_mm_rsqrt_sh): Likewise.
11297 (_mm_mask_rsqrt_sh): Likewise.
11298 (_mm_maskz_rsqrt_sh): Likewise.
11299 (_mm_sqrt_sh): Likewise.
11300 (_mm_mask_sqrt_sh): Likewise.
11301 (_mm_maskz_sqrt_sh): Likewise.
11302 (_mm_sqrt_round_sh): Likewise.
11303 (_mm_mask_sqrt_round_sh): Likewise.
11304 (_mm_maskz_sqrt_round_sh): Likewise.
11305 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
11306 (_mm256_sqrt_ph): Likewise.
11307 (_mm_mask_sqrt_ph): Likewise.
11308 (_mm256_mask_sqrt_ph): Likewise.
11309 (_mm_maskz_sqrt_ph): Likewise.
11310 (_mm256_maskz_sqrt_ph): Likewise.
11311 (_mm_rsqrt_ph): Likewise.
11312 (_mm256_rsqrt_ph): Likewise.
11313 (_mm_mask_rsqrt_ph): Likewise.
11314 (_mm256_mask_rsqrt_ph): Likewise.
11315 (_mm_maskz_rsqrt_ph): Likewise.
11316 (_mm256_maskz_rsqrt_ph): Likewise.
11317 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11318 * config/i386/i386-builtin.def: Add corresponding new builtins.
11319 * config/i386/i386-expand.c
11320 (ix86_expand_args_builtin): Handle new builtins.
11321 (ix86_expand_round_builtin): Ditto.
11322 * config/i386/sse.md (VF_AVX512FP16VL): New.
11323 (sqrt<mode>2): Adjust for HF vector modes.
11324 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
11325 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
11327 (<sse>_rsqrt<mode>2<mask_name>): New.
11328 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
11330 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11332 PR bootstrap/101574
11333 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
11334 'RESERVED_LOCATION_P' locations.
11335 * warning-control.cc (get_nowarn_spec, suppress_warning)
11336 (copy_warning): Likewise.
11338 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11340 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
11341 (typedef key_type_t): ... this. Remove.
11342 (nowarn_map): Document.
11343 * diagnostic-spec.c (nowarn_map): Likewise.
11344 * warning-control.cc (convert_to_key): Evolve functions into...
11345 (get_location): ... these. Adjust all users.
11347 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
11349 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
11351 2021-09-13 Jason Merrill <jason@redhat.com>
11353 * params.opt: Add destructive-interference-size and
11354 constructive-interference-size.
11355 * doc/invoke.texi: Document them.
11356 * config/aarch64/aarch64.c (aarch64_override_options_internal):
11358 * config/arm/arm.c (arm_option_override): Set them.
11359 * config/i386/i386-options.c (ix86_option_override_internal):
11362 2021-09-13 Martin Liska <mliska@suse.cz>
11363 H.J. Lu <hjl.tools@gmail.com>
11366 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
11367 for x86-64 micro levels for __builtin_cpu_supports.
11368 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
11369 Add priorities for the micro-arch levels.
11370 (enum processor_features): Add new features.
11371 * common/config/i386/i386-isas.h: Add micro-arch features.
11372 * config/i386/i386-builtins.c (get_builtin_code_for_version):
11373 Support the micro-arch levels by callsing
11374 __builtin_cpu_supports.
11375 * doc/extend.texi: Document that the levels are support by
11376 __builtin_cpu_supports.
11378 2021-09-13 Andrew Pinski <apinski@marvell.com>
11381 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
11383 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
11384 (aarch64_general_gimple_fold_builtin): Likewise.
11386 2021-09-13 Andrew Pinski <apinski@marvell.com>
11388 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
11389 to the Unsupported targets list.
11390 Remove support for m32r-*-linux* and m32rle-*-linux*.
11391 * config/m32r/linux.h: Removed.
11392 * config/m32r/t-linux: Removed.
11394 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
11397 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
11398 register index for SVE predicate modes.
11400 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11402 * tree-ssa-threadbackward.c
11403 (back_threader_profitability::profitable_path_p): Remove FSM
11405 (back_threader_registry::register_path): Same.
11406 * tree-ssa-threadedge.c
11407 (jump_threader::simplify_control_stmt_condition): Same.
11408 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
11409 Add backedge_threads argument.
11410 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
11411 backedge_threads argument.
11412 (back_jt_path_registry::back_jt_path_registry): Same.
11413 (dump_jump_thread_path): Adjust for FSM removal.
11414 (back_jt_path_registry::rewire_first_differing_edge): Same.
11415 (back_jt_path_registry::adjust_paths_after_duplication): Same.
11416 (back_jt_path_registry::update_cfg): Same.
11417 (jt_path_registry::register_jump_thread): Same.
11418 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
11420 (class back_jt_path_registry): Add backedge_threads to
11423 2021-09-13 Martin Liska <mliska@suse.cz>
11426 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
11428 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
11431 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
11432 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
11433 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
11434 TARGET_AVX256_STORE_BY_PIECES instead of
11435 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
11436 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11437 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
11438 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
11439 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
11440 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
11442 2021-09-13 liuhongt <hongtao.liu@intel.com>
11444 PR bootstrap/102302
11445 * expmed.c (extract_bit_field_using_extv): Use
11446 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
11448 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
11450 * Makefile.in (OBJS): Add value-pointer-equiv.o.
11451 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
11452 value-pointer-equiv.*.
11453 (ssa_equiv_stack::ssa_equiv_stack): Same.
11454 (ssa_equiv_stack::enter): Same.
11455 (ssa_equiv_stack::leave): Same.
11456 (ssa_equiv_stack::push_replacement): Same.
11457 (ssa_equiv_stack::get_replacement): Same.
11458 (is_pointer_ssa): Same.
11459 (class pointer_equiv_analyzer): Same.
11460 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
11461 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
11462 (pointer_equiv_analyzer::set_global_equiv): Same.
11463 (pointer_equiv_analyzer::set_cond_equiv): Same.
11464 (pointer_equiv_analyzer::get_equiv): Same.
11465 (pointer_equiv_analyzer::enter): Same.
11466 (pointer_equiv_analyzer::leave): Same.
11467 (pointer_equiv_analyzer::get_equiv_expr): Same.
11468 (pta_valueize): Same.
11469 (pointer_equiv_analyzer::visit_stmt): Same.
11470 (pointer_equiv_analyzer::visit_edge): Same.
11471 (hybrid_folder::value_of_expr): Same.
11472 (hybrid_folder::value_on_edge): Same.
11473 * value-pointer-equiv.cc: New file.
11474 * value-pointer-equiv.h: New file.
11476 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11479 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
11480 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
11482 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11485 * config/arm/arm.md (movmisaligndi): New define_expand.
11486 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
11488 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
11491 * emit-rtl.c (gen_highpart): Use adjust_address to handle
11492 MEM rather than calling simplify_gen_subreg.
11494 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
11496 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
11497 define into a block.
11499 2021-09-13 Richard Biener <rguenther@suse.de>
11501 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
11502 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
11503 change based on DARWIN_PREFER_DWARF not being defined.
11505 2021-09-13 Richard Biener <rguenther@suse.de>
11507 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
11508 to inherit from elfos.h
11510 2021-09-13 Richard Biener <rguenther@suse.de>
11512 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
11514 2021-09-13 Richard Biener <rguenther@suse.de>
11516 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
11517 override, pick up DWARF2_DEBUG define from elfos.h
11519 2021-09-13 Richard Biener <rguenther@suse.de>
11521 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
11524 2021-09-13 Richard Biener <rguenther@suse.de>
11526 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
11529 2021-09-13 Richard Biener <rguenther@suse.de>
11531 * config/i386/cygming.h: Always default to DWARF2 debugging.
11532 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
11534 * doc/install.texi: Document binutils 2.16 as minimum
11535 requirement for mingw.
11537 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
11539 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
11540 nstmts, nloads and extra_ctor_cost.
11541 (rs6000_density_test): Add load density related heuristics. Do
11542 extra costing on vector construction statements if need.
11543 (rs6000_init_cost): Init new members.
11544 (rs6000_update_target_cost_per_stmt): New function.
11545 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
11546 rs6000_update_target_cost_per_stmt and call it.
11548 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
11550 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
11551 (rs6000_init_cost): Adjust.
11553 2021-09-13 liuhongt <hongtao.liu@intel.com>
11555 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
11556 (UNSPEC_XORSIGN): Ditto.
11558 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
11560 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
11561 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
11564 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
11566 * tree-ssa-threadbackward.c (class back_threader_registry): Use
11567 back_jt_path_registry.
11568 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
11569 fwd_jt_path_registry.
11570 * tree-ssa-threadedge.h (class jump_threader): Same..
11571 * tree-ssa-threadupdate.c
11572 (jump_thread_path_registry::jump_thread_path_registry): Rename...
11573 (jt_path_registry::jt_path_registry): ...to this.
11574 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
11575 (jt_path_registry::~jt_path_registry): ...this.
11576 (fwd_jt_path_registry::fwd_jt_path_registry): New.
11577 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
11578 (jump_thread_path_registry::allocate_thread_edge): Rename...
11579 (jt_path_registry::allocate_thread_edge): ...to this.
11580 (jump_thread_path_registry::allocate_thread_path): Rename...
11581 (jt_path_registry::allocate_thread_path): ...to this.
11582 (jump_thread_path_registry::lookup_redirection_data): Rename...
11583 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
11584 (jump_thread_path_registry::thread_block_1): Rename...
11585 (fwd_jt_path_registry::thread_block_1): ...to this.
11586 (jump_thread_path_registry::thread_block): Rename...
11587 (fwd_jt_path_registry::thread_block): ...to this.
11588 (jt_path_registry::thread_through_loop_header): Rename...
11589 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
11590 (jump_thread_path_registry::mark_threaded_blocks): Rename...
11591 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
11592 (jump_thread_path_registry::debug_path): Rename...
11593 (jt_path_registry::debug_path): ...to this.
11594 (jump_thread_path_registry::dump): Rename...
11595 (jt_path_registry::debug): ...to this.
11596 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
11597 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
11598 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
11599 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
11600 (jump_thread_path_registry::duplicate_thread_path): Rename...
11601 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
11602 drop ill-formed candidates.
11603 (jump_thread_path_registry::remove_jump_threads_including): Rename...
11604 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
11605 (jt_path_registry::thread_through_all_blocks): New.
11606 (back_jt_path_registry::update_cfg): New.
11607 (fwd_jt_path_registry::update_cfg): New.
11608 (jump_thread_path_registry::register_jump_thread): Rename...
11609 (jt_path_registry::register_jump_thread): ...to this.
11610 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
11612 (class jt_path_registry): ...here.
11613 (class fwd_jt_path_registry): New.
11614 (class back_jt_path_registry): New.
11616 2021-09-10 liuhongt <hongtao.liu@intel.com>
11619 2021-09-01 liuhongt <hongtao.liu@intel.com>
11621 * emit-rtl.c (validate_subreg): Get rid of all float-int
11624 2021-09-10 Jakub Jelinek <jakub@redhat.com>
11626 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
11627 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
11628 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
11629 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
11630 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
11631 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
11632 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
11633 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
11635 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
11636 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
11637 GF_OMP_ATOMIC_WEAK.
11638 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
11640 * tree.h (OMP_ATOMIC_WEAK): Define.
11641 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
11642 fail memory order being encoded in the same enum and also print
11643 fail clause if present.
11644 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
11645 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
11646 handle pre_p == NULL case as a test mode that only returns value
11647 but doesn't change gimplify nor change anything otherwise, adjust
11648 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
11649 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
11650 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
11652 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
11653 COND_EXPR rhs. Set weak flag on gimple load/store for
11655 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
11656 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
11658 (expand_omp_atomic_cas): New function.
11659 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
11661 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
11662 using expand_omp_atomic_cas.
11664 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
11665 Michael Matz <matz@suse.de>
11667 * tree-pass.h (PROP_loop_opts_done): New.
11668 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
11669 Intersect with global range.
11670 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
11671 * tree-ssa-threadbackward.c
11672 (back_threader_profitability::profitable_path_p): Disable
11673 threading through latches until after loop optimizations have run.
11675 2021-09-10 David Faust <david.faust@oracle.com>
11677 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
11680 2021-09-10 David Faust <david.faust@oracle.com>
11682 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
11683 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
11684 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
11685 (bpf_expand_cbranch): New function.
11686 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
11688 (zero_extendsidi2): Only use mov32 instruction if it is available.
11689 (SIM mode iterator): Conditionalize support for SI mode.
11690 (JM mode iterator): New.
11691 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
11692 (*branch_on_di): Update name, use new JM iterator.
11693 * config/bpf/bpf.opt: (mjmpext): New option.
11694 (malu32): Likewise.
11695 (mjmp32): Likewise.
11697 (bpf_isa): New enum.
11699 2021-09-10 David Faust <david.faust@oracle.com>
11701 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
11702 for register-to-register extensions.
11703 (zero_extendqidi2): Likewise.
11705 2021-09-10 Richard Biener <rguenther@suse.de>
11707 PR middle-end/102273
11708 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
11710 2021-09-10 Richard Biener <rguenther@suse.de>
11712 PR middle-end/102269
11713 * gimplify.c (is_var_need_auto_init): Empty types do not need
11716 2021-09-10 Richard Biener <rguenther@suse.de>
11718 * configure.ac (--with-stabs): Remove.
11719 * configure: Regenerate.
11720 * doc/install.texi: Remove --with-stabs documentation.
11722 2021-09-10 liuhongt <hongtao.liu@intel.com>
11724 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
11726 (_mm512_mask_cmp_ph_mask): Likewise.
11727 (_mm512_cmp_round_ph_mask): Likewise.
11728 (_mm512_mask_cmp_round_ph_mask): Likewise.
11729 (_mm_cmp_sh_mask): Likewise.
11730 (_mm_mask_cmp_sh_mask): Likewise.
11731 (_mm_cmp_round_sh_mask): Likewise.
11732 (_mm_mask_cmp_round_sh_mask): Likewise.
11733 (_mm_comieq_sh): Likewise.
11734 (_mm_comilt_sh): Likewise.
11735 (_mm_comile_sh): Likewise.
11736 (_mm_comigt_sh): Likewise.
11737 (_mm_comige_sh): Likewise.
11738 (_mm_comineq_sh): Likewise.
11739 (_mm_ucomieq_sh): Likewise.
11740 (_mm_ucomilt_sh): Likewise.
11741 (_mm_ucomile_sh): Likewise.
11742 (_mm_ucomigt_sh): Likewise.
11743 (_mm_ucomige_sh): Likewise.
11744 (_mm_ucomineq_sh): Likewise.
11745 (_mm_comi_round_sh): Likewise.
11746 (_mm_comi_sh): Likewise.
11747 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
11748 (_mm_mask_cmp_ph_mask): Likewise.
11749 (_mm256_cmp_ph_mask): Likewise.
11750 (_mm256_mask_cmp_ph_mask): Likewise.
11751 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11752 * config/i386/i386-builtin.def: Add corresponding new builtins.
11753 * config/i386/i386-expand.c
11754 (ix86_expand_args_builtin): Handle new builtin types.
11755 (ix86_expand_round_builtin): Ditto.
11756 * config/i386/i386.md (ssevecmode): Add HF mode.
11757 (MODEFH): New mode iterator.
11758 * config/i386/sse.md
11759 (V48H_AVX512VL): New mode iterator to support HF vector modes.
11760 Ajdust corresponding description.
11761 (ssecmpintprefix): New.
11762 (VI12_AVX512VL): Adjust to support HF vector modes.
11763 (cmp_imm_predicate): Likewise.
11764 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
11766 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
11767 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
11768 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
11770 2021-09-10 liuhongt <hongtao.liu@intel.com>
11772 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
11773 (_mm512_mask_max_ph): Likewise.
11774 (_mm512_maskz_max_ph): Likewise.
11775 (_mm512_min_ph): Likewise.
11776 (_mm512_mask_min_ph): Likewise.
11777 (_mm512_maskz_min_ph): Likewise.
11778 (_mm512_max_round_ph): Likewise.
11779 (_mm512_mask_max_round_ph): Likewise.
11780 (_mm512_maskz_max_round_ph): Likewise.
11781 (_mm512_min_round_ph): Likewise.
11782 (_mm512_mask_min_round_ph): Likewise.
11783 (_mm512_maskz_min_round_ph): Likewise.
11784 (_mm_max_sh): Likewise.
11785 (_mm_mask_max_sh): Likewise.
11786 (_mm_maskz_max_sh): Likewise.
11787 (_mm_min_sh): Likewise.
11788 (_mm_mask_min_sh): Likewise.
11789 (_mm_maskz_min_sh): Likewise.
11790 (_mm_max_round_sh): Likewise.
11791 (_mm_mask_max_round_sh): Likewise.
11792 (_mm_maskz_max_round_sh): Likewise.
11793 (_mm_min_round_sh): Likewise.
11794 (_mm_mask_min_round_sh): Likewise.
11795 (_mm_maskz_min_round_sh): Likewise.
11796 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
11797 (_mm256_max_ph): Likewise.
11798 (_mm_mask_max_ph): Likewise.
11799 (_mm256_mask_max_ph): Likewise.
11800 (_mm_maskz_max_ph): Likewise.
11801 (_mm256_maskz_max_ph): Likewise.
11802 (_mm_min_ph): Likewise.
11803 (_mm256_min_ph): Likewise.
11804 (_mm_mask_min_ph): Likewise.
11805 (_mm256_mask_min_ph): Likewise.
11806 (_mm_maskz_min_ph): Likewise.
11807 (_mm256_maskz_min_ph): Likewise.
11808 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11809 * config/i386/i386-builtin.def: Add corresponding new builtins.
11810 * config/i386/i386-expand.c
11811 (ix86_expand_args_builtin): Handle new builtin types.
11812 * config/i386/sse.md
11813 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
11814 support HF vector modes.
11815 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
11816 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
11818 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
11820 * config/i386/subst.md (round_saeonly_mode512bit_condition):
11821 Adjust for HF vector modes.
11823 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
11825 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
11826 (_mm_mask_add_sh): Likewise.
11827 (_mm_maskz_add_sh): Likewise.
11828 (_mm_sub_sh): Likewise.
11829 (_mm_mask_sub_sh): Likewise.
11830 (_mm_maskz_sub_sh): Likewise.
11831 (_mm_mul_sh): Likewise.
11832 (_mm_mask_mul_sh): Likewise.
11833 (_mm_maskz_mul_sh): Likewise.
11834 (_mm_div_sh): Likewise.
11835 (_mm_mask_div_sh): Likewise.
11836 (_mm_maskz_div_sh): Likewise.
11837 (_mm_add_round_sh): Likewise.
11838 (_mm_mask_add_round_sh): Likewise.
11839 (_mm_maskz_add_round_sh): Likewise.
11840 (_mm_sub_round_sh): Likewise.
11841 (_mm_mask_sub_round_sh): Likewise.
11842 (_mm_maskz_sub_round_sh): Likewise.
11843 (_mm_mul_round_sh): Likewise.
11844 (_mm_mask_mul_round_sh): Likewise.
11845 (_mm_maskz_mul_round_sh): Likewise.
11846 (_mm_div_round_sh): Likewise.
11847 (_mm_mask_div_round_sh): Likewise.
11848 (_mm_maskz_div_round_sh): Likewise.
11849 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
11850 * config/i386/i386-builtin.def: Add corresponding new builtins.
11851 * config/i386/i386-expand.c
11852 (ix86_expand_round_builtin): Handle new builtins.
11853 * config/i386/sse.md (VF_128): Change description.
11854 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
11855 Adjust to support HF vector modes.
11856 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
11859 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
11861 * config/i386/i386-expand.c
11862 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
11863 * config/i386/i386.c
11864 (ix86_preferred_simd_mode): Handle HF mode.
11865 * config/i386/sse.md (V_256H): New mode iterator.
11866 (avx_vextractf128<mode>): Use it.
11867 (VEC_INIT_MODE): Align vector HFmode condition to vector
11868 HImodes since there're no real HF instruction used.
11869 (VEC_INIT_HALF_MODE): Ditto.
11871 (VIHF_AVX512BW): Ditto.
11872 (*vec_extracthf): Ditto.
11873 (VEC_EXTRACT_MODE): Ditto.
11875 2021-09-10 Richard Biener <rguenther@suse.de>
11878 * config/dbx.h: Remove.
11879 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
11880 * config/lynx.h: Likewise.
11882 2021-09-10 liuhongt <hongtao.liu@intel.com>
11884 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
11885 right into ANDNOT + AND + IOR, using paradoxical subregs.
11886 (ix86_split_copysign_const): Remove.
11887 (ix86_split_copysign_var): Ditto.
11888 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
11889 (ix86_split_copysign_var): Ditto.
11890 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
11891 (@copysign<mode>3_var): Ditto.
11893 2021-09-09 qing zhao <qing.zhao@oracle.com>
11895 * builtins.c (expand_builtin_memset): Make external visible.
11896 * builtins.h (expand_builtin_memset): Declare extern.
11897 * common.opt (ftrivial-auto-var-init=): New option.
11898 * doc/extend.texi: Document the uninitialized attribute.
11899 * doc/invoke.texi: Document -ftrivial-auto-var-init.
11900 * flag-types.h (enum auto_init_type): New enumerated type
11902 * gimple-fold.c (clear_padding_type): Add one new parameter.
11903 (clear_padding_union): Likewise.
11904 (clear_padding_emit_loop): Likewise.
11905 (clear_type_padding_in_mask): Likewise.
11906 (gimple_fold_builtin_clear_padding): Handle this new parameter.
11907 * gimplify.c (gimple_add_init_for_auto_var): New function.
11908 (gimple_add_padding_init_for_auto_var): New function.
11909 (is_var_need_auto_init): New function.
11910 (gimplify_decl_expr): Add initialization to automatic variables per
11912 (gimplify_call_expr): Add one new parameter for call to
11913 __builtin_clear_padding.
11914 (gimplify_init_constructor): Add padding initialization in the end.
11915 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
11916 (expand_DEFERRED_INIT): New function.
11917 * internal-fn.def (DEFERRED_INIT): New internal function.
11918 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
11919 * tree-sra.c (generate_subtree_deferred_init): New function.
11920 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
11921 calls to .DEFERRED_INIT.
11922 (sra_modify_deferred_init): New function.
11923 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
11924 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
11925 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
11927 (check_defs): Likewise.
11928 (warn_uninitialized_vars): Likewise.
11929 * tree-ssa.c (ssa_undefined_value_p): Likewise.
11930 * tree.c (build_common_builtin_nodes): Build tree node for
11931 BUILT_IN_CLEAR_PADDING when needed.
11933 2021-09-09 Richard Biener <rguenther@suse.de>
11935 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
11938 2021-09-09 Richard Biener <rguenther@suse.de>
11940 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
11941 DOM walk from get_loop_body_in_dom_order using a worklist
11944 2021-09-09 liuhongt <hongtao.liu@intel.com>
11946 * config.gcc: Add avx512fp16vlintrin.h.
11947 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
11948 (_mm512_mask_add_ph): Likewise.
11949 (_mm512_maskz_add_ph): Likewise.
11950 (_mm512_sub_ph): Likewise.
11951 (_mm512_mask_sub_ph): Likewise.
11952 (_mm512_maskz_sub_ph): Likewise.
11953 (_mm512_mul_ph): Likewise.
11954 (_mm512_mask_mul_ph): Likewise.
11955 (_mm512_maskz_mul_ph): Likewise.
11956 (_mm512_div_ph): Likewise.
11957 (_mm512_mask_div_ph): Likewise.
11958 (_mm512_maskz_div_ph): Likewise.
11959 (_mm512_add_round_ph): Likewise.
11960 (_mm512_mask_add_round_ph): Likewise.
11961 (_mm512_maskz_add_round_ph): Likewise.
11962 (_mm512_sub_round_ph): Likewise.
11963 (_mm512_mask_sub_round_ph): Likewise.
11964 (_mm512_maskz_sub_round_ph): Likewise.
11965 (_mm512_mul_round_ph): Likewise.
11966 (_mm512_mask_mul_round_ph): Likewise.
11967 (_mm512_maskz_mul_round_ph): Likewise.
11968 (_mm512_div_round_ph): Likewise.
11969 (_mm512_mask_div_round_ph): Likewise.
11970 (_mm512_maskz_div_round_ph): Likewise.
11971 * config/i386/avx512fp16vlintrin.h: New header.
11972 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
11973 Add new builtin types.
11974 * config/i386/i386-builtin.def: Add corresponding builtins.
11975 * config/i386/i386-expand.c
11976 (ix86_expand_args_builtin): Handle new builtin types.
11977 (ix86_expand_round_builtin): Likewise.
11978 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
11979 * config/i386/sse.md (VFH): New mode_iterator.
11981 (avx512fmaskmode): Add HF vector modes.
11982 (avx512fmaskhalfmode): Likewise.
11983 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
11985 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
11986 (mul<mode>3<mask_name><round_name>): Likewise.
11987 (*mul<mode>3<mask_name><round_name>): Likewise.
11988 (div<mode>3): Likewise.
11989 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
11990 * config/i386/subst.md (SUBST_V): Add HF vector modes.
11991 (SUBST_A): Likewise.
11992 (round_mode512bit_condition): Adjust for V32HFmode.
11994 2021-09-09 liuhongt <hongtao.liu@intel.com>
11997 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
11998 (reduc_plus_scal_v4sf): .. this, New define_expand.
11999 (reduc_plus_scal_v2df): .. and this, New define_expand.
12001 2021-09-09 liuhongt <hongtao.liu@intel.com>
12004 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
12007 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
12010 * doc/trouble.texi (Copy Assignment): Fix description of
12011 behaviour and fix code in example.
12013 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
12016 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
12017 r11 instead of r12 for restoring CR.
12019 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12020 liuhongt <hongtao.liu@intel.com>
12023 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
12024 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
12025 into AND with mask and XOR, using paradoxical subregs.
12026 (ix86_split_xorsign): Remove.
12027 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
12029 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
12031 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
12033 2021-09-08 Jakub Jelinek <jakub@redhat.com>
12036 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
12037 operands[2], emit abs<mode>2 instead.
12038 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
12039 first alternative even for avx, add another alternative with
12040 =&Yv <- 0, Yv, Yvm constraints.
12041 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
12042 to op1, emit vpandn instead.
12044 2021-09-08 liuhongt <hongtao.liu@intel.com>
12046 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
12047 (_mm256_set_ph): Likewise.
12048 (_mm512_set_ph): Likewise.
12049 (_mm_setr_ph): Likewise.
12050 (_mm256_setr_ph): Likewise.
12051 (_mm512_setr_ph): Likewise.
12052 (_mm_set1_ph): Likewise.
12053 (_mm256_set1_ph): Likewise.
12054 (_mm512_set1_ph): Likewise.
12055 (_mm_setzero_ph): Likewise.
12056 (_mm256_setzero_ph): Likewise.
12057 (_mm512_setzero_ph): Likewise.
12058 (_mm_set_sh): Likewise.
12059 (_mm_load_sh): Likewise.
12060 (_mm_store_sh): Likewise.
12061 * config/i386/i386-builtin-types.def (V8HF): New type.
12062 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
12063 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
12064 Support vector HFmodes.
12065 (ix86_expand_vector_init_one_nonzero): Likewise.
12066 (ix86_expand_vector_init_one_var): Likewise.
12067 (ix86_expand_vector_init_interleave): Likewise.
12068 (ix86_expand_vector_init_general): Likewise.
12069 (ix86_expand_vector_set): Likewise.
12070 (ix86_expand_vector_extract): Likewise.
12071 (ix86_expand_vector_init_concat): Likewise.
12072 (ix86_expand_sse_movcc): Handle vector HFmodes.
12073 (ix86_expand_vector_set_var): Ditto.
12074 * config/i386/i386-modes.def: Add HF vector modes in comment.
12075 * config/i386/i386.c (classify_argument): Add HF vector modes.
12076 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
12077 (ix86_vector_mode_supported_p): Likewise.
12078 (ix86_set_reg_reg_cost): Handle vector HFmode.
12079 (ix86_get_ssemov): Handle vector HFmode.
12080 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
12082 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
12084 (function_arg_advance_32): Ditto.
12085 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
12086 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
12087 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
12088 (VALID_SSE2_REG_VHF_MODE): New.
12089 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
12090 (SSE_REG_MODE_P): Add vector HFmode.
12091 * config/i386/i386.md (mode): Add HF vector modes.
12092 (MODE_SIZE): Likewise.
12093 (ssemodesuffix): Add ph suffix for HF vector modes.
12094 * config/i386/sse.md (VFH_128): New mode iterator.
12095 (VMOVE): Adjust for HF vector modes.
12097 (V_256_512): Likewise.
12098 (avx512): Likewise.
12099 (avx512fmaskmode): Likewise.
12100 (shuffletype): Likewise.
12101 (sseinsnmode): Likewise.
12102 (ssedoublevecmode): Likewise.
12103 (ssehalfvecmode): Likewise.
12104 (ssehalfvecmodelower): Likewise.
12105 (ssePScmode): Likewise.
12106 (ssescalarmode): Likewise.
12107 (ssescalarmodelower): Likewise.
12108 (sseintprefix): Likewise.
12110 (bcstscalarsuff): Likewise.
12111 (xtg_mode): Likewise.
12112 (VI12HF_AVX512VL): New mode_iterator.
12113 (VF_AVX512FP16): Likewise.
12115 (VIHF_256): Likewise.
12116 (VIHF_AVX512BW): Likewise.
12117 (V16_256): Likewise.
12118 (V32_512): Likewise.
12119 (sseintmodesuffix): New mode_attr.
12120 (sse): Add scalar and vector HFmodes.
12121 (ssescalarmode): Add vector HFmode mapping.
12122 (ssescalarmodesuffix): Add sh suffix for HFmode.
12123 (*<sse>_vm<insn><mode>3): Use VFH_128.
12124 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
12125 (*ieee_<ieee_maxmin><mode>3): Likewise.
12126 (<avx512>_blendm<mode>): New define_insn.
12127 (vec_setv8hf): New define_expand.
12128 (vec_set<mode>_0): New define_insn for HF vector set.
12129 (*avx512fp16_movsh): Likewise.
12130 (avx512fp16_movsh): Likewise.
12131 (vec_extract_lo_v32hi): Rename to ...
12132 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
12134 (vec_extract_hi_v32hi): Likewise.
12135 (vec_extract_hi_<mode>): Likewise.
12136 (vec_extract_lo_v16hi): Likewise.
12137 (vec_extract_lo_<mode>): Likewise.
12138 (vec_extract_hi_v16hi): Likewise.
12139 (vec_extract_hi_<mode>): Likewise.
12140 (vec_set_hi_v16hi): Likewise.
12141 (vec_set_hi_<mode>): Likewise.
12142 (vec_set_lo_v16hi): Likewise.
12143 (vec_set_lo_<mode>): Likewise.
12144 (*vec_extract<mode>_0): New define_insn_and_split for HF
12146 (*vec_extracthf): New define_insn.
12147 (VEC_EXTRACT_MODE): Add HF vector modes.
12148 (PINSR_MODE): Add V8HF.
12149 (sse2p4_1): Likewise.
12150 (pinsr_evex_isa): Likewise.
12151 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
12152 insert for V8HFmode.
12153 (pbroadcast_evex_isa): Add HF vector modes.
12154 (AVX2_VEC_DUP_MODE): Likewise.
12155 (VEC_INIT_MODE): Likewise.
12156 (VEC_INIT_HALF_MODE): Likewise.
12157 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
12159 (avx2_pbroadcast<mode>_1): Likewise.
12160 (<avx512>_vec_dup<mode>_1): Likewise.
12161 (<avx512>_vec_dup<mode><mask_name>): Likewise.
12162 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
12165 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
12166 H.J. Lu <hongjiu.lu@intel.com>
12167 Liu Hongtao <hongtao.liu@intel.com>
12168 Wang Hongyu <hongyu.wang@intel.com>
12169 Xu Dianhong <dianhong.xu@intel.com>
12171 * common/config/i386/cpuinfo.h (get_available_features):
12172 Detect FEATURE_AVX512FP16.
12173 * common/config/i386/i386-common.c
12174 (OPTION_MASK_ISA_AVX512FP16_SET,
12175 OPTION_MASK_ISA_AVX512FP16_UNSET,
12176 OPTION_MASK_ISA2_AVX512FP16_SET,
12177 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
12178 (OPTION_MASK_ISA2_AVX512BW_UNSET,
12179 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
12180 (ix86_handle_option): Handle -mavx512fp16.
12181 * common/config/i386/i386-cpuinfo.h (enum processor_features):
12182 Add FEATURE_AVX512FP16.
12183 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
12184 * config.gcc: Add avx512fp16intrin.h.
12185 * config/i386/avx512fp16intrin.h: New intrinsic header.
12186 * config/i386/cpuid.h: Add bit_AVX512FP16.
12187 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
12188 * config/i386/i386-builtins.c: Support _Float16 type for i386
12190 (ix86_register_float16_builtin_type): New function.
12191 (ix86_float16_type_node): New.
12192 * config/i386/i386-c.c (ix86_target_macros_internal): Define
12194 * config/i386/i386-expand.c (ix86_expand_branch): Support
12196 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
12197 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
12198 (ix86_expand_fp_movcc): Ditto.
12199 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
12200 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
12201 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
12202 * config/i386/i386.c (ix86_get_ssemov): Use
12203 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
12204 (ix86_get_excess_precision): Use
12205 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
12207 (sse_store_index): Use SFmode cost for HFmode cost.
12208 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
12209 GPR cost for HFmode.
12210 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
12211 (ix86_mangle_type): Add manlging for _Float16 type.
12212 (inline_secondary_memory_needed): No memory is needed for
12213 16bit movement between gpr and sse reg under
12215 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
12216 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
12217 (ix86_division_cost): Ditto.
12218 (ix86_rtx_costs): Ditto.
12219 (ix86_add_stmt_cost): Ditto.
12220 (ix86_optab_supported_p): Ditto.
12221 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
12222 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
12223 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
12224 * config/i386/i386.md (mode): Add HFmode.
12225 (MODE_SIZE): Add HFmode.
12226 (isa): Add avx512fp16.
12227 (enabled): Handle avx512fp16.
12228 (ssemodesuffix): Add sh suffix for HFmode.
12229 (comm): Add mult, div.
12230 (plusminusmultdiv): New code iterator.
12231 (insn): Add mult, div.
12232 (*movhf_internal): Adjust for avx512fp16 instruction.
12233 (*movhi_internal): Ditto.
12234 (*cmpi<unord>hf): New define_insn for HFmode.
12235 (*ieee_s<ieee_maxmin>hf3): Likewise.
12236 (extendhf<mode>2): Likewise.
12237 (trunc<mode>hf2): Likewise.
12238 (float<floatunssuffix><mode>hf2): Likewise.
12239 (*<insn>hf): Likewise.
12240 (cbranchhf4): New expander.
12241 (movhfcc): Likewise.
12242 (<insn>hf3): Likewise.
12243 (mulhf3): Likewise.
12244 (divhf3): Likewise.
12245 * config/i386/i386.opt: Add mavx512fp16.
12246 * config/i386/immintrin.h: Include avx512fp16intrin.h.
12247 * doc/invoke.texi: Add mavx512fp16.
12248 * doc/extend.texi: Add avx512fp16 Usage Notes.
12250 2021-09-08 liuhongt <hongtao.liu@intel.com>
12252 * common.opt: Support -fexcess-precision=16.
12253 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
12254 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
12255 EXCESS_PRECISION_TYPE_FLOAT16.
12256 * config/arm/arm.c (arm_excess_precision): Ditto.
12257 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
12258 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
12259 when EXCESS_PRECISION_TYPE_FLOAT16.
12260 * config/s390/s390.c (s390_excess_precision): Ditto.
12261 * coretypes.h (enum excess_precision_type): Add
12262 EXCESS_PRECISION_TYPE_FLOAT16.
12263 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
12264 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
12265 * doc/extend.texi (Half-Precision): Document
12266 -fexcess-precision=16.
12267 * flag-types.h (enum excess_precision): Add
12268 EXCESS_PRECISION_FLOAT16.
12269 * target.def (excess_precision): Update document.
12270 * tree.c (excess_precision_type): Set excess_precision_type to
12271 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
12273 2021-09-08 liuhongt <hongtao.liu@intel.com>
12275 * doc/extend.texi: (@node Floating Types): Adjust the wording.
12276 (@node Half-Precision): Ditto.
12278 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
12281 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
12282 'CONST_INT_P (src)' to the condition of the block that tries to
12283 eliminate literal when loading integer contant.
12285 2021-09-07 David Faust <david.faust@oracle.com>
12287 * doc/extend.texi (BPF Type Attributes) New node.
12288 Document new preserve_access_index attribute.
12289 Document new preserve_access_index builtin.
12290 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
12292 2021-09-07 David Faust <david.faust@oracle.com>
12294 * config/bpf/bpf.c: Adjust includes.
12295 (bpf_handle_preserve_access_index_attribute): New function.
12296 (bpf_attribute_table): Use it here.
12297 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
12298 (bpf_option_override): Handle "-mco-re" option.
12299 (bpf_asm_init_sections): New.
12300 (TARGET_ASM_INIT_SECTIONS): Redefine.
12301 (bpf_file_end): New.
12302 (TARGET_ASM_FILE_END): Redefine.
12303 (bpf_init_builtins): Add "__builtin_preserve_access_index".
12304 (bpf_core_compute, bpf_core_get_index): New.
12305 (is_attr_preserve_access): New.
12306 (bpf_expand_builtin): Handle new builtins.
12307 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
12308 (bpf_core_walk): New.
12309 (bpf_resolve_overloaded_builtin): New.
12310 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
12311 (handle_attr): New.
12312 (pass_bpf_core_attr): New RTL pass.
12313 * config/bpf/bpf-passes.def: New file.
12314 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
12315 * config/bpf/coreout.c: New file.
12316 * config/bpf/coreout.h: Likewise.
12317 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
12318 (coreout.o): New rule.
12319 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
12320 * config.gcc (bpf): Add coreout.h to extra_headers.
12321 Add coreout.o to extra_objs.
12322 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
12324 2021-09-07 David Faust <david.faust@oracle.com>
12326 * btfout.c (get_btf_id): Function is no longer static.
12327 * ctfc.h: Expose it here.
12329 2021-09-07 David Faust <david.faust@oracle.com>
12331 * ctfc.c (ctf_lookup_tree_type): New function.
12332 * ctfc.h: Likewise.
12334 2021-09-07 David Faust <david.faust@oracle.com>
12336 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
12337 * ctfc.h: Analogous change.
12339 2021-09-07 David Faust <david.faust@oracle.com>
12341 * dwarf2out.c (lookup_type_die): Function is no longer static.
12342 * dwarf2out.h: Expose it here.
12344 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12346 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
12347 (ctf_debug_early_finish): New definition.
12348 (ctf_debug_finish): Likewise.
12349 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
12350 (ctf_debug_early_finish): New declaration.
12351 (ctf_debug_finish): Likewise.
12352 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
12353 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
12355 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12357 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
12358 support when compiling for CO-RE.
12359 * config/bpf/bpf.opt: Add new command line option -mco-re.
12361 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
12363 * flag-types.h (enum debug_info_type): Add new enum
12364 DINFO_TYPE_BTF_WITH_CORE.
12365 (BTF_WITH_CORE_DEBUG): New bitmask.
12366 * flags.h (btf_with_core_debuginfo_p): New declaration.
12367 * opts.c (btf_with_core_debuginfo_p): New definition.
12369 2021-09-07 Jason Merrill <jason@redhat.com>
12371 * tree.h (error_operand_p): Change to inline function.
12373 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
12375 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
12376 (empty_block_with_phis_p): ...this.
12377 (potentially_threadable_block): Same.
12378 (jump_threader::thread_through_normal_block): Same.
12380 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
12383 * dwarf2out.c (mark_base_types): New overloaded function.
12384 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
12385 as the compilation unit, and call move_marked_base_types afterward.
12387 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
12390 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
12392 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
12394 2021-09-07 Richard Biener <rguenther@suse.de>
12396 PR tree-optimization/102226
12397 * tree-vect-loop.c (vect_transform_cycle_phi): Record
12398 the converted value for the epilogue PHI use.
12400 2021-09-07 Martin Liska <mliska@suse.cz>
12402 PR gcov-profile/80223
12403 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
12404 options, do not inline when no_profile_instrument_function
12405 attributes are different in early inliner. It's fine to inline
12406 it after PGO instrumentation.
12408 2021-09-07 Richard Biener <rguenther@suse.de>
12410 PR tree-optimization/101555
12411 * tree-ssa-pre.c (translate_vuse_through_block): Do not
12412 perform an alias walk to determine the validity of the
12413 mem at the start of the block which is already guaranteed
12414 by means of prune_clobbered_mems.
12415 (phi_translate_1): Pass edge to translate_vuse_through_block.
12417 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
12420 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
12421 (remainder<mode>3): Likewise.
12423 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
12425 * config/mips/mips.c (mips_file_start): add .module for
12428 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
12430 * wide-int.cc (wi::clz): Reorder tests to ensure the result
12431 is zero for all negative values.
12433 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
12435 * doc/invoke.texi (-foffload-options): Fix @opindex.
12437 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
12440 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
12441 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
12442 source alternative for AVX.
12444 2021-09-06 liuhongt <hongtao.liu@intel.com>
12446 PR middle-end/102182
12447 * optabs.c (expand_fix): Add from1 to avoid from being
12450 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
12452 * dwarf2out.c (modified_type_die): Deal with all array types earlier
12453 and use local variable consistently throughout the function.
12455 2021-09-06 Jakub Jelinek <jakub@redhat.com>
12457 PR tree-optimization/102207
12458 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
12459 were promoted from signed to wider unsigned type.
12461 2021-09-06 Andrew Pinski <apinski@marvell.com>
12463 PR tree-optimization/63184
12464 * match.pd: Add simplification of pointer_diff of two pointer_plus
12465 with addr_expr in the first operand of each pointer_plus.
12466 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
12467 in the first operand of each pointer_plus.
12469 2021-09-06 Richard Biener <rguenther@suse.de>
12471 PR tree-optimization/102176
12472 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
12474 (vect_bb_slp_scalar_cost): Use the computed set of
12475 vectorized scalar stmts instead of relying on the out-of-date
12476 and not accurate PURE_SLP_STMT.
12477 (vect_bb_vectorization_profitable_p): Compute the set
12478 of vectorized scalar stmts.
12480 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12482 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
12483 GIMPLE_COND special casing.
12484 (path_range_query::range_defined_in_block): Use range_of_stmt
12485 instead of calling fold_range directly.
12487 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12489 * gimple-range-path.cc (path_range_query::range_of_expr): Set
12490 m_undefined_path when appropriate.
12491 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
12492 (path_range_query::unreachable_path_p): New.
12493 (path_range_query::precompute_ranges): Set m_undefined_path.
12494 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
12495 (path_range_query::internal_range_of_expr): New.
12496 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
12497 Use unreachable_path_p.
12499 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
12501 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
12502 Remove argument and call find_taken_edge.
12503 (back_threader::resolve_phi): Do not calculate taken edge before
12504 calling maybe_register_path.
12505 (back_threader::find_paths_to_names): Same.
12507 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
12509 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
12510 * config/h8300/testcompare.md (store_c): Update name, use new
12512 (store_neg_c, store_shifted_c): New patterns.
12514 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
12517 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
12518 instead of r12 for CR save, in all cases.
12520 2021-09-03 Andrew Pinski <apinski@marvell.com>
12522 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
12523 Handle error_mark_node as the type of the type_decl.
12525 2021-09-03 Andrew Pinski <apinski@marvell.com>
12527 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
12529 (aarch64_simd_types): Likewise.
12530 (aarch64_simd_intOI_type_node): Likewise.
12531 (aarch64_simd_intCI_type_node): Likewise.
12532 (aarch64_simd_intXI_type_node): Likewise.
12533 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
12534 (aarch64_fp16_ptr_type_node): Likewise.
12535 (aarch64_bf16_type_node): Likewise.
12536 (aarch64_bf16_ptr_type_node): Likewise.
12538 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12540 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
12542 (minus_op1_op2_relation_effect): ...here.
12543 (class operator_pointer_diff): New.
12544 (operator_pointer_diff::op1_op2_relation_effect): Call
12545 minus_op1_op2_relation_effect.
12546 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
12548 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12550 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
12551 Add may_peel_loop_headers.
12552 (back_threader_registry::thread_through_all_blocks): Same.
12553 (try_thread_blocks): Pass may_peel_loop_headers argument.
12554 (pass_early_thread_jumps::execute): Same.
12556 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12558 * tree-ssa-threadedge.c (has_phis_p): New.
12559 (forwarder_block_p): New.
12560 (potentially_threadable_block): Call forwarder_block_p.
12561 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
12562 (jump_threader::thread_through_normal_block): Call
12565 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12567 * tree-ssa-threadbackward.c (back_threader::dump): New.
12568 (back_threader::debug): New.
12569 (back_threader_profitability::profitable_path_p): Dump blocks
12570 even if we are bailing early.
12572 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12574 * tree-ssa-threadupdate.c (cancel_thread): New.
12575 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
12576 (jump_thread_path_registry::mark_threaded_blocks): Same.
12577 (jump_thread_path_registry::register_jump_thread): Same.
12579 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12581 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
12582 which objects are available.
12583 (jt_state::pop): Same.
12584 (jt_state::register_equiv): Same.
12585 (jt_state::register_equivs_on_edge): Same.
12587 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12589 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
12590 Move pop until after a thread is registered.
12592 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12594 * tree-ssa-threadupdate.c (debug): New.
12596 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12598 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
12599 (push_dump_file::~push_dump_file): New.
12600 (dump_ranger): Change dump_file temporarily while dumping
12602 * gimple-range-trace.h (class push_dump_file): New.
12604 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12606 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
12607 (dump_ranger): Dump function name.
12609 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12611 * gimple-range-path.cc (path_range_query::range_defined_in_block):
12612 Adjust for non-null.
12613 (path_range_query::adjust_for_non_null_uses): New.
12614 (path_range_query::precompute_ranges): Call
12615 adjust_for_non_null_uses.
12616 * gimple-range-path.h: Add m_non_null and
12617 adjust_for_non_null_uses.
12619 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12621 * gimple-range-path.cc (path_range_query::dump): Dump path
12623 (path_range_query::precompute_ranges): Dump entire path.
12625 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12627 * value-relation.cc (relation_oracle::debug): New.
12628 * value-relation.h (relation_oracle::debug): New.
12630 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12632 * tree-ssa-loop-ch.c: Remove unnecessary include file.
12634 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12636 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
12637 Skip statements with no defining BB.
12638 * gimple-range-path.cc (path_range_query::range_defined_in_block):
12639 Do not get confused by statements with no defining BB.
12641 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
12643 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
12644 gimple_range_adjustment. Add support for constants.
12645 (adjust_realpart_expr): New.
12646 (gimple_range_adjustment): Move IMAGPART_EXPR code to
12647 adjust_imagpart_expr.
12648 * range-op.cc (integral_table::integral_table): Add entry for
12651 2021-09-03 Jakub Jelinek <jakub@redhat.com>
12653 * omp-expand.c (expand_omp_atomic_pipeline): Use
12654 IFN_ATOMIC_COMPARE_EXCHANGE instead of
12655 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
12658 2021-09-03 Jakub Jelinek <jakub@redhat.com>
12661 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
12662 that is false if DECL_BIT_FIELD.
12663 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
12664 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
12665 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
12666 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
12667 of DECL_FIELD_ABI_IGNORED.
12668 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
12669 SET_DECL_FIELD_ABI_IGNORED instead of writing to
12670 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
12671 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
12672 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
12673 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
12675 2021-09-03 liuhongt <hongtao.liu@intel.com>
12678 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
12679 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
12680 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
12682 2021-09-02 Martin Sebor <msebor@redhat.com>
12684 PR tree-optimization/17506
12686 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
12688 2021-09-02 Richard Biener <rguenther@suse.de>
12690 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
12691 fix for PR78185 and continue processing when leaving
12692 finite inner loops.
12694 2021-09-02 Jakub Jelinek <jakub@redhat.com>
12696 PR tree-optimization/99591
12697 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
12700 2021-09-02 Richard Biener <rguenther@suse.de>
12703 2021-09-02 Richard Biener <rguenther@suse.de>
12705 PR tree-optimization/102155
12706 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
12707 over a part of the RPO array and do not recurse here.
12708 Dump blocks marked as always executed.
12709 (fill_always_executed_in): Walk over the RPO array and
12710 process loops whose header we run into.
12711 (loop_invariant_motion_in_fun): Compute the first RPO
12712 using rev_post_order_and_mark_dfs_back_seme in iteration
12713 order and pass that to fill_always_executed_in.
12715 2021-09-02 liuhongt <hongtao.liu@intel.com>
12717 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
12718 * config/i386/i386.c (enum x86_64_reg_class): Add
12719 X86_64_SSEHF_CLASS.
12720 (merge_classes): Handle X86_64_SSEHF_CLASS.
12721 (examine_argument): Ditto.
12722 (construct_container): Ditto.
12723 (classify_argument): Ditto, and set HFmode/HCmode to
12724 X86_64_SSEHF_CLASS.
12725 (function_value_32): Return _FLoat16/Complex Float16 by
12727 (function_value_64): Return _Float16/Complex Float16 by SSE
12729 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
12730 (ix86_secondary_reload): Require gpr as intermediate register
12731 to store _Float16 from sse register when sse4 is not
12733 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
12735 (ix86_scalar_mode_supported_p): Ditto.
12736 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
12737 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
12738 (VALID_INT_MODE_P): Add HFmode and HCmode.
12739 * config/i386/i386.md (*pushhf_rex64): New define_insn.
12741 (*movhf_internal): Ditto.
12742 * doc/extend.texi (Half-Precision Floating Point): Documemt
12745 2021-09-02 Richard Biener <rguenther@suse.de>
12747 PR tree-optimization/102155
12748 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
12749 over a part of the RPO array and do not recurse here.
12750 Dump blocks marked as always executed.
12751 (fill_always_executed_in): Walk over the RPO array and
12752 process loops whose header we run into.
12753 (loop_invariant_motion_in_fun): Compute the first RPO
12754 using rev_post_order_and_mark_dfs_back_seme in iteration
12755 order and pass that to fill_always_executed_in.
12757 2021-09-02 YunQiang Su <syq@debian.org>
12760 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
12762 * config/mips/mips.c (mips_module_isa_name): New.
12763 mips_file_start: add .module mipsREV to all asm output
12765 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
12767 PR tree-optimization/102152
12768 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
12769 comparison to a scalar comparison before calling
12770 update_stmt_if_modified.
12772 2021-09-01 Andrew Pinski <apinski@marvell.com>
12775 * config/aarch64/aarch64.c (aarch64_expand_setmem):
12776 Check STRICT_ALIGNMENT before creating an overlapping
12779 2021-09-01 Martin Sebor <msebor@redhat.com>
12781 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
12782 (check_access): Pass additional argument.
12783 (check_memop_access): Remove template and make a member function.
12784 (maybe_check_dealloc_call): Make a pass_waccess member function.
12785 (class pass_waccess): Add, rename, and remove members.
12786 (pass_waccess::pass_waccess): Adjust to name change.
12787 (pass_waccess::~pass_waccess): Same.
12788 (check_alloca): Make a member function.
12789 (check_alloc_size_call): Same.
12790 (check_strcat): Same.
12791 (check_strncat): Same.
12792 (check_stxcpy): Same.
12793 (check_stxncpy): Same.
12794 (check_strncmp): Same.
12795 (maybe_warn_rdwr_sizes): Rename...
12796 (pass_waccess::maybe_check_access_sizes): ...to this.
12797 (pass_waccess::check_call): Adjust to name changes.
12798 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
12800 (pass_waccess::execute): Adjust to name changes.
12801 * gimple-ssa-warn-access.h (check_memop_access): Remove.
12802 * pointer-query.cc (access_ref::phi): Handle null pointer.
12803 (access_ref::inform_access): Same.
12804 (pointer_query::put_ref): Modify a cached value, not a copy of it.
12805 (pointer_query::dump): New function.
12806 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
12808 * pointer-query.h (pointer_query::dump): Declare.
12809 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
12810 (dump_strlen_info): Use function query.
12811 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
12813 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
12815 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
12817 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
12819 * doc/extend.texi: Document unavailable attribute.
12820 * print-tree.c (print_node): Handle unavailable attribute.
12821 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
12822 * tree.c (error_unavailable_use): New.
12823 * tree.h (TREE_UNAVAILABLE): New.
12824 (error_unavailable_use): New.
12826 2021-09-01 Jakub Jelinek <jakub@redhat.com>
12828 PR tree-optimization/102124
12829 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
12830 MINUS_EXPR, if itype is unsigned with smaller precision than type,
12831 add an extra cast to signed variant of itype to ensure sign-extension.
12833 2021-09-01 Martin Liska <mliska@suse.cz>
12835 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
12836 edges and rather use colors for TRUE and FALSE edges.
12838 2021-09-01 Richard Biener <rguenther@suse.de>
12840 PR tree-optimization/93491
12841 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
12842 after processing the stmt itself. Do not consider
12843 pure functions possibly not returning. Properly avoid
12844 adding possibly trapping calls to EXP_GEN when there's
12845 a preceeding possibly not returning call.
12846 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
12849 2021-09-01 Richard Biener <rguenther@suse.de>
12851 PR tree-optimization/102139
12852 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
12853 type to record a std::pair of the stmt-info and the innermost
12855 (dr_vec_info::group): New member.
12856 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
12857 (vect_compute_data_ref_alignment): Verify the recorded
12858 base alignment can be used.
12859 (data_ref_pair): Remove.
12860 (dr_group_sort_cmp): Adjust.
12861 (vect_analyze_data_ref_accesses): Store the group-ID in the
12862 dr_vec_info and operate on a vector of dr_vec_infos.
12864 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
12866 * read-md.c (md_reader::handle_enum): support value assignation.
12867 * doc/md.texi: record define_c_enum value assignation support.
12869 2021-09-01 Jakub Jelinek <jakub@redhat.com>
12871 PR tree-optimization/102141
12872 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
12873 argument. If false, emit stmts after gsi instead of before, and
12875 (bswap_replace): Adjust callers. When converting output of bswap,
12876 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
12879 2021-09-01 liuhongt <hongtao.liu@intel.com>
12881 * emit-rtl.c (validate_subreg): Get rid of all float-int
12884 2021-09-01 liuhongt <hongtao.liu@intel.com>
12887 2021-08-30 liuhongt <hongtao.liu@intel.com>
12889 * expmed.c (extract_bit_field_1): Make sure we're playing with
12890 integral modes before call extract_integral_bit_field.
12891 (extract_integral_bit_field): Add a parameter of type
12892 scalar_int_mode which corresponds to of tmode.
12893 And call extract_and_convert_fixed_bit_field instead of
12894 extract_fixed_bit_field and convert_extracted_bit_field.
12895 (extract_and_convert_fixed_bit_field): New function, it's a
12896 combination of extract_fixed_bit_field and
12897 convert_extracted_bit_field.
12899 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
12901 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
12903 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
12905 * omp-general.h (omp_is_reference): Rename to...
12906 (omp_privatize_by_reference): ... this. Adjust all users...
12907 * omp-general.c: ... here, ...
12908 * gimplify.c: ... here, ...
12909 * omp-expand.c: ... here, ...
12910 * omp-low.c: ... here.
12912 2021-08-31 Martin Sebor <msebor@redhat.com>
12914 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
12915 pointer element for equality to zero, not that of the cotaining
12918 2021-08-31 Martin Sebor <msebor@redhat.com>
12920 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
12922 2021-08-31 Martin Sebor <msebor@redhat.com>
12924 * function.h (function): Add comments.
12925 (get_range_query): Same. Add attribute returns nonnull.
12927 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
12929 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
12930 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
12931 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
12933 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
12935 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
12937 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
12938 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
12939 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
12940 (m32c-*-elf*, m32c-*-rtems*): Likewise.
12941 * config/cris/cris.h (DBX_NO_XREFS): Remove.
12942 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
12943 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
12944 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
12945 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
12946 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
12948 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
12950 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
12951 allowed on target constructs and only with particular other clauses.
12952 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
12953 'ancestor' is used.
12954 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
12955 structs when 'ancestor' is used.
12956 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
12957 'ancestor' is used.
12958 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
12959 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
12961 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
12963 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
12964 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
12966 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
12967 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
12968 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
12969 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
12970 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
12971 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
12972 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
12973 would be paradoxical.
12975 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
12977 * combine.c (combine_simplify_rtx): Avoid converting an explicit
12978 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
12979 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
12981 2021-08-31 Richard Biener <rguenther@suse.de>
12983 PR tree-optimization/102142
12984 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
12985 condition under which to unset the visited flag.
12987 2021-08-31 Richard Biener <rguenther@suse.de>
12989 PR middle-end/102129
12990 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
12991 possibly trapping expressions across calls.
12993 2021-08-31 Jakub Jelinek <jakub@redhat.com>
12995 PR tree-optimization/102134
12996 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
12997 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
13000 2021-08-31 Andrew Pinski <apinski@marvell.com>
13003 * collect-utils.c (setup_signals): New declaration.
13004 * collect-utils.h (setup_signals): New function.
13005 * collect2.c (handler): Delete.
13006 (main): Instead of manually setting up the signals,
13007 just call setup_signals.
13008 * lto-wrapper.c (main): Likewise.
13010 2021-08-31 Andrew Pinski <apinski@marvell.com>
13013 * config/i386/i386-protos.h (x86_output_aligned_bss):
13014 Change align argument to unsigned type.
13015 (x86_elf_aligned_decl_common): Likewise.
13016 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
13017 (x86_output_aligned_bss): Likewise.
13019 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13021 * config/mips/mips.c (mips_module_isa_name): New.
13022 mips_file_start: add .module mipsREV to all asm output
13024 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
13026 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
13027 use enum instead of int for 'isa' member.
13028 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
13029 config/mips/netbsd.h: replace hardcoded numbers with enum.
13031 2021-08-31 liuhongt <hongtao.liu@intel.com>
13033 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
13034 define_split to define_insn_and_split.
13035 (*avx2_eq<mode>3): Removed.
13036 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
13037 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
13038 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
13040 (*avx2_gt<mode>3): Removed.
13041 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
13042 define_insn to define_expand, and adjust pattern.
13043 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
13045 2021-08-30 David Malcolm <dmalcolm@redhat.com>
13048 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
13050 2021-08-30 Jason Merrill <jason@redhat.com>
13052 * doc/invoke.texi: Document -Wmissing-requires.
13054 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13056 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
13057 TARGET_EXTRA_BUILTINS guard.
13059 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13061 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
13062 initialization of V2DI_type_node and unsigned_V2DI_type_node.
13064 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
13066 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
13067 decl when new_builtins_are_live.
13068 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
13071 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
13073 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
13074 OPTION_MASK_P10_FUSION_2STORE.
13075 (POWERPC_MASKS): Likewise.
13076 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
13077 store fusion for Power10.
13078 (is_fusable_store): New.
13079 (power10_sched_reorder): Likewise.
13080 (rs6000_sched_reorder): Do Power10 specific reordering.
13081 (rs6000_sched_reorder2): Likewise.
13082 * config/rs6000/rs6000.opt: Add new option.
13084 2021-08-30 Richard Biener <rguenther@suse.de>
13086 PR tree-optimization/102128
13087 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
13088 Move scanning for if-converted scalar code to the caller
13089 and instead delay clearing the visited flag for profitable
13091 (vect_slp_region): Cost all subgraphs before scheduling.
13092 For if-converted BB vectorization scan for scalar COND_EXPRs
13093 and do not vectorize if any found and the cost model is
13096 2021-08-30 Richard Biener <rguenther@suse.de>
13098 * common.opt (fexceptions): Mark
13099 EnabledBy(fnon-call-exceptions).
13100 * doc/invoke.texi (fnon-call-exceptions): Document this
13101 enables -fexceptions.
13103 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
13105 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
13106 is not already defined.
13108 2021-08-30 liuhongt <hongtao.liu@intel.com>
13110 * expmed.c (extract_bit_field_1): Make sure we're playing with
13111 integral modes before call extract_integral_bit_field.
13112 (extract_integral_bit_field): Add a parameter of type
13113 scalar_int_mode which corresponds to of tmode.
13114 And call extract_and_convert_fixed_bit_field instead of
13115 extract_fixed_bit_field and convert_extracted_bit_field.
13116 (extract_and_convert_fixed_bit_field): New function, it's a
13117 combination of extract_fixed_bit_field and
13118 convert_extracted_bit_field.
13120 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
13122 * config/darwin.c (darwin_libc_has_function): Do not run
13123 the checks for x86 or modern Darwin. Make sure that there
13124 is a value set for darwin_macosx_version_min before testing.
13126 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
13128 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
13130 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
13132 * ipa-modref-tree.h (modref_access_node::merge): Break out
13133 logic combining offsets and logic merging ranges to ...
13134 (modref_access_node::combined_offsets): ... here
13135 (modref_access_node::update2): ... here
13136 (modref_access_node::closer_pair_p): New member function.
13137 (modref_access_node::forced_merge): New member function.
13138 (modre_ref_node::insert): Do merging when table is full.
13140 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
13143 * config.gcc: MIPS: use N64 ABI by default if the triple end
13144 with -gnuabi64, which is used by Debian since 2013.
13146 2021-08-28 Alexandre Oliva <oliva@adacore.com>
13148 * ipa-modref.c (analyze_function): Skip debug stmts.
13149 * tree-inline.c (estimate_num_insn): Consider builtins even
13150 without a cgraph_node.
13152 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
13154 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
13155 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
13156 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
13157 * config/h8300/h8300.md (eqne): New code iterator.
13158 (geultu, geultu_to_c): Similarly.
13159 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
13160 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
13161 (cmp<mode>_c): New pattern
13163 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
13165 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
13167 (dom_opt_dom_walker::optimize_stmt): Use it.
13169 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
13171 * config/darwin.c (finalize_ctors): Add a section-start linker-
13173 (finalize_dtors): Likewise.
13174 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
13176 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
13178 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
13179 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
13180 old initialization logic when new builtins are enabled.
13181 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
13182 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
13183 (write_init_file): Likewise.
13185 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
13187 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
13188 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
13189 (amdgcn-* | gcn-*) Likewise.
13191 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
13193 * symbol-summary.h: Added missing template keyword.
13195 2021-08-27 Richard Biener <rguenther@suse.de>
13197 PR tree-optimization/45178
13198 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
13199 infinite loops without exit do not mark control dependent
13200 edges of the latch necessary.
13202 2021-08-27 konglin1 <lingling.kong@intel.com>
13205 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
13207 (<avx512>scattersi<mode>): Likewise.
13208 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
13209 (*avx512f_scatterdi<VI48F:mode>): Likewise
13211 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
13213 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
13214 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
13215 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
13216 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
13217 P10_BUILTIN_PEXTD on Power10.
13219 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
13221 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
13222 signedness for some Power10 bifs.
13224 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
13227 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
13228 computed alignment if the entire struct has attribute packed.
13230 2021-08-27 liuhongt <hongtao.liu@intel.com>
13234 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
13235 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
13236 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
13237 IX86_BUILTIN_SHUFPS256.
13238 (ix86_masked_all_ones): New function.
13240 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
13242 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
13243 (conditional moves with memory inputs splitters): Ditto.
13244 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
13246 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13248 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
13249 search after merging.
13251 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13253 * config/rs6000/rs6000-overload.def: Add remaining overloads.
13255 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13257 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
13259 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13261 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
13262 crypto, and htm stanzas.
13264 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
13266 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
13268 2021-08-26 Martin Sebor <msebor@redhat.com>
13270 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
13271 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
13272 (warn_uninitialized_vars): Same. Reduce visibility of locals.
13273 (warn_uninitialized_phi): Same.
13275 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
13277 * tree-ssa-ccp.c (get_individual_bits): Helper function to
13278 extract the individual bits from a widest_int constant (mask).
13279 (gray_code_bit_flips): New read-only table for effiently
13280 enumerating permutations/combinations of bits.
13281 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
13282 by unknown counts that are guaranteed less than the target
13283 precision and four or fewer unknown bits by enumeration.
13284 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
13285 enumeration under the same conditions. Handle remaining
13286 shifts as a mask based upon the minimum possible shift value.
13288 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
13289 Richard Biener <rguenther@suse.de>
13291 * match.pd (shift transformations): Remove a redundant
13292 !POINTER_TYPE_P check.
13294 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
13297 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
13298 Set all_regs to true in the call to replace_rtx.
13300 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13302 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
13303 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
13304 hiting --param modref-max-refs limit.
13305 (modref_tree:insert_base): Be smarter when hitting
13306 --param modref-max-bases limit. Add new parameter REF.
13307 (modref_tree:insert): Update.
13308 (modref_tree:merge): Update.
13309 * ipa-modref.c (read_modref_records): Update.
13311 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13313 * params.opt: (modref-max-adjustments): Add full stop.
13315 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
13317 * ipa-modref-tree.h (modref_ref_node::verify): New member
13319 (modref_ref_node::insert): Use it.
13320 (modref_ref_node::try_mere_with): Fix off by one error.
13322 2021-08-26 Martin Liska <mliska@suse.cz>
13323 Stefan Kneifel <stefan.kneifel@bluewin.ch>
13325 * cgraph.h (create_version_clone_with_body): Add new parameter.
13326 * cgraphclones.c: Likewise.
13327 * multiple_target.c (create_dispatcher_calls): Do not use
13329 (create_target_clone): Likewise here.
13331 2021-08-26 Jonathan Yong <10walls@gmail.com>
13333 * doc/extend.texi: Add note about reserved priorities
13334 to the constructor attribute.
13336 2021-08-25 Martin Sebor <msebor@redhat.com>
13338 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
13339 range table header alone.
13340 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
13342 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
13344 * doc/invoke.texi: Document --param modref-max-adjustments.
13345 * ipa-modref-tree.c (test_insert_search_collapse): Update.
13346 (test_merge): Update.
13347 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
13348 (modref_access_node::operator==): Fix handling of access ranges.
13349 (modref_access_node::contains): Constify parameter; handle also
13350 mismatched parm offsets.
13351 (modref_access_node::update): New function.
13352 (modref_access_node::merge): New function.
13353 (unspecified_modref_access_node): Update constructor.
13354 (modref_ref_node::insert_access): Add record_adjustments parameter;
13356 (modref_ref_node::try_merge_with): New private function.
13357 (modref_tree::insert): New record_adjustments parameter.
13358 (modref_tree::merge): New record_adjustments parameter.
13359 (modref_tree::copy_from): Update.
13360 * ipa-modref.c (dump_access): Dump adjustments field.
13361 (get_access): Update constructor.
13362 (record_access): Update call of insert.
13363 (record_access_lto): Update call of insert.
13364 (merge_call_side_effects): Add record_adjustments parameter.
13365 (get_access_for_fnspec): Update.
13366 (process_fnspec): Update.
13367 (analyze_call): Update.
13368 (analyze_function): Update.
13369 (read_modref_records): Update.
13370 (ipa_merge_modref_summary_after_inlining): Update.
13371 (propagate_unknown_call): Update.
13372 (modref_propagate_in_scc): Update.
13373 * params.opt (param-max-modref-adjustments=): New.
13375 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
13377 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
13379 (xxspltiw_v4si): Use vecperm type attribute.
13380 (xxspltiw_v4si_inst): Use vecperm type attribute.
13381 (xxspltiw_v4sf_inst): Likewise.
13382 (xxspltidp_v2df): Use vecperm type attribute. Use
13383 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
13384 (xxspltidp_v2df_inst): Likewise.
13385 (xxsplti32dx_v4si): Use vecperm type attribute.
13386 (xxsplti32dx_v4si_inst): Likewise.
13387 (xxsplti32dx_v4sf_inst): Likewise.
13388 (xxblend_<mode>): Likewise.
13389 (xxpermx): Likewise.
13390 (xxpermx_inst): Likewise.
13391 (xxeval): Likewise.
13393 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
13396 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
13397 * diagnostic.c (diagnostic_initialize_input_context): New function.
13398 * diagnostic.h (diagnostic_initialize_input_context): Declare.
13399 * input.c (default_charset_callback): New function.
13400 (file_cache::initialize_input_context): New function.
13401 (file_cache_slot::create): Added ability to convert the input
13402 according to the input context.
13403 (file_cache::file_cache): Initialize the new input context.
13404 (class file_cache_slot): Added new m_alloc_offset member.
13405 (file_cache_slot::file_cache_slot): Initialize the new member.
13406 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
13407 (file_cache_slot::maybe_grow): Likewise.
13408 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
13409 (file_cache_slot::get_next_line): Likewise.
13410 * input.h (class file_cache): Added input context member.
13412 2021-08-25 Richard Biener <rguenther@suse.de>
13414 PR tree-optimization/102046
13415 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
13416 update ->any_pattern when swapping operands.
13418 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
13421 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
13422 (ix86_decompose_address): Remove retval check for ASHIFT,
13423 allow non-canonical zero extend if AND mask covers ASHIFT
13425 (ix86_legitimate_address_p): Adjust condition for decompose.
13426 (ix86_rtx_costs): Adjust cost for lea with non-canonical
13428 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
13430 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
13432 PR tree-optimization/101145
13433 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
13435 (number_of_iterations_lt): Invoke above function.
13436 (adjust_cond_for_loop_until_wrap):
13437 Merge to number_of_iterations_until_wrap.
13438 (number_of_iterations_cond): Update invokes for
13439 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
13441 2021-08-25 konglin1 <lingling.kong@intel.com>
13444 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
13445 macro define in O0.
13446 (_mm512_mask_fpclass_ps_mask): Ditto.
13448 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
13450 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
13451 (vec_unpacku_hi_v8hi): Likewise.
13452 (vec_unpacku_lo_v16qi): Likewise.
13453 (vec_unpacku_lo_v8hi): Likewise.
13454 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
13455 (vec_unpacku_lo_<VP_small_lc>): Likewise.
13457 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
13459 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
13460 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13461 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
13462 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
13464 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13466 PR middle-end/102031
13467 * simplify-rtx.c (simplify_truncation): When comparing precisions
13468 use "subreg_prec" variable, not "subreg_mode".
13470 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13472 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
13475 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13477 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
13478 various pointer type nodes.
13479 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
13480 values for various pointer types.
13481 (ptr_V16QI_type_node): New macro.
13482 (ptr_V1TI_type_node): New macro.
13483 (ptr_V2DI_type_node): New macro.
13484 (ptr_V2DF_type_node): New macro.
13485 (ptr_V4SI_type_node): New macro.
13486 (ptr_V4SF_type_node): New macro.
13487 (ptr_V8HI_type_node): New macro.
13488 (ptr_unsigned_V16QI_type_node): New macro.
13489 (ptr_unsigned_V1TI_type_node): New macro.
13490 (ptr_unsigned_V8HI_type_node): New macro.
13491 (ptr_unsigned_V4SI_type_node): New macro.
13492 (ptr_unsigned_V2DI_type_node): New macro.
13493 (ptr_bool_V16QI_type_node): New macro.
13494 (ptr_bool_V8HI_type_node): New macro.
13495 (ptr_bool_V4SI_type_node): New macro.
13496 (ptr_bool_V2DI_type_node): New macro.
13497 (ptr_bool_V1TI_type_node): New macro.
13498 (ptr_pixel_type_node): New macro.
13499 (ptr_intQI_type_node): New macro.
13500 (ptr_uintQI_type_node): New macro.
13501 (ptr_intHI_type_node): New macro.
13502 (ptr_uintHI_type_node): New macro.
13503 (ptr_intSI_type_node): New macro.
13504 (ptr_uintSI_type_node): New macro.
13505 (ptr_intDI_type_node): New macro.
13506 (ptr_uintDI_type_node): New macro.
13507 (ptr_intTI_type_node): New macro.
13508 (ptr_uintTI_type_node): New macro.
13509 (ptr_long_integer_type_node): New macro.
13510 (ptr_long_unsigned_type_node): New macro.
13511 (ptr_float_type_node): New macro.
13512 (ptr_double_type_node): New macro.
13513 (ptr_long_double_type_node): New macro.
13514 (ptr_dfloat64_type_node): New macro.
13515 (ptr_dfloat128_type_node): New macro.
13516 (ptr_ieee128_type_node): New macro.
13517 (ptr_ibm128_type_node): New macro.
13518 (ptr_vector_pair_type_node): New macro.
13519 (ptr_vector_quad_type_node): New macro.
13520 (ptr_long_long_integer_type_node): New macro.
13521 (ptr_long_long_unsigned_type_node): New macro.
13523 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13525 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
13526 and power9-64 stanzas.
13528 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13529 Tom de Vries <tdevries@suse.de>
13531 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
13532 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
13533 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
13534 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
13535 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
13536 * config/nvptx/nvptx-c.c: New source file.
13537 (nvptx_cpu_cpp_builtins): Move implementation here.
13539 2021-08-24 Martin Sebor <msebor@redhat.com>
13541 PR middle-end/101600
13542 PR middle-end/101977
13543 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
13544 the phrasing of a warning.
13545 (check_access): Use the remaining size after subtracting any offset
13546 rather than the whole object size.
13547 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
13548 clear for any nonnull PHI argument.
13549 (compute_objsize): Clear argument.
13551 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13553 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
13555 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
13557 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
13560 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
13562 * value-relation.cc (rr_transitive_table): New.
13563 (relation_transitive): New.
13564 (value_relation::swap): Remove.
13565 (value_relation::apply_transitive): New.
13566 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
13567 (relation_oracle::register_relation): Call register_transitives.
13568 (relation_oracle::register_transitives): New.
13569 * value-relation.h (relation_oracle): Add new temporary bitmap and
13572 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
13575 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
13576 from integer to a pseudo vector register.
13578 2021-08-24 Richard Biener <rguenther@suse.de>
13580 PR tree-optimization/100089
13581 * tree-vectorizer.h (vect_slp_bb): Rename to ...
13582 (vect_slp_if_converted_bb): ... this and get the original
13583 loop as new argument.
13584 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
13585 pass original loop to vect_slp_if_converted_bb.
13586 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
13587 If orig_loop was passed scan the not vectorized stmts
13588 for COND_EXPRs and force not profitable if found.
13589 (vect_slp_region): Pass down all SLP instances to costing
13590 if orig_loop was specified.
13591 (vect_slp_bbs): Pass through orig_loop.
13592 (vect_slp_bb): Rename to ...
13593 (vect_slp_if_converted_bb): ... this and get the original
13594 loop as new argument.
13595 (vect_slp_function): Adjust.
13597 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13600 * config/arm/arm.md (attribute arch): Add fix_vlldm.
13601 (arch_enabled): Use it.
13602 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
13603 use when erratum mitigation is needed.
13605 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13608 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
13609 * doc/invoke.texi (Arm Options): Document it.
13610 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
13611 (ALL_QUIRKS): Add quirk_vlldm.
13612 (cortex-m33): Add quirk_vlldm.
13613 (cortex-m35p, cortex-m55): Likewise.
13614 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
13615 targetting an affected CPU and not explicitly controlled on
13618 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
13620 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
13621 (lazy_load_multiple_insn): Likewise.
13623 2021-08-24 liuhongt <hongtao.liu@intel.com>
13626 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
13627 Enable avx512 embedded broadcast.
13628 (*<avx512>_vternlog<mode>_all): Ditto.
13629 (<avx512>_vternlog<mode>_mask): Ditto.
13631 2021-08-24 liuhongt <hongtao.liu@intel.com>
13634 * config/i386/i386.c (ix86_rtx_costs): Define cost for
13636 * config/i386/i386.h (STRIP_UNARY): New macro.
13637 * config/i386/predicates.md (reg_or_notreg_operand): New
13639 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
13640 (*<avx512>_vternlog<mode>_1): New pre_reload
13641 define_insn_and_split.
13642 (*<avx512>_vternlog<mode>_2): Ditto.
13643 (*<avx512>_vternlog<mode>_3): Ditto.
13644 (any_logic1,any_logic2): New code iterator.
13645 (logic_op): New code attribute.
13646 (ternlogsuffix): Extend to VNxDF and VNxSF.
13648 2021-08-24 Richard Biener <rguenther@suse.de>
13650 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
13651 * params.opt (--param vect-inner-loop-cost-factor): Adjust
13653 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
13654 inner_loop_cost_factor to the minimum of the estimated number
13655 of iterations of the inner loop and vect-inner-loop-cost-factor.
13657 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13658 Richard Biener <rguenther@suse.de>
13660 * config/i386/i386-features.c (compute_convert_gain): Provide
13661 more accurate values for CONST_INT, when optimizing for size.
13662 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
13663 * config/i386/i386.h (COSTS_N_BYTES): to here.
13665 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
13666 Jakub Jelinek <jakub@redhat.com>
13668 PR middle-end/102029
13669 * match.pd (shift transformations): Add an additional check for
13670 !POINTER_TYPE_P in the recently added left shift transformation.
13672 2021-08-24 liuhongt <hongtao.liu@intel.com>
13674 PR tree-optimization/100089
13675 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
13676 loop vectorizer when cost model is very-cheap.
13678 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13680 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
13681 asprintf, which is not available on AIX.
13683 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13685 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
13686 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
13688 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13690 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
13691 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
13693 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
13694 (rbtree.o): Likewise.
13695 (rs6000-gen-builtins): Likewise.
13696 (rs6000-builtins.c): Likewise.
13697 (rs6000-builtins.h): Likewise.
13698 (rs6000.o): Add dependency.
13699 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
13700 (rs6000-vecdefines.h): New target.
13701 (rs6000-builtins.o): Likewise.
13702 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
13703 (rs6000-c.o): Likewise.
13705 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
13708 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
13709 Diagnose buffer overrun.
13710 (safe_inc_pos): Fix overrun detection.
13711 (match_identifier): Diagnose buffer overrun.
13712 (match_integer): Likewise.
13713 (match_to_right_bracket): Likewise.
13715 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
13717 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
13718 Improve range compare.
13719 (modref_access_node::contains): New member function.
13720 (modref_access_node::search): Remove.
13721 (modref_access_node::insert): Be smarter about subaccesses.
13723 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
13725 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
13726 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
13727 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
13730 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
13732 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
13734 * config/h8300/jumpcall.md (call, call_value): Restrict to
13735 !SIBLING_CALL_P cases.
13736 (subcall, sibcall_value): New patterns & expanders.
13737 * config/h8300/proepi.md (epilogue): Pass new argument to
13738 h8300_expand_epilogue.
13739 (sibcall_epilogue): New expander.
13740 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
13742 (h8300_ok_for_sibcall_p): New function.
13743 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
13745 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13747 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
13748 Handle case where the operand is already the desired mode.
13750 2021-08-23 Richard Biener <rguenther@suse.de>
13753 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
13754 in addition to has_gimple_body.
13756 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
13758 PR middle-end/101949
13759 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
13762 2021-08-23 Martin Liska <mliska@suse.cz>
13764 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
13767 2021-08-23 Richard Biener <rguenther@suse.de>
13769 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
13770 Properly scale the inner loop cost only once.
13772 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13774 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
13775 Provide bounds for unsigned (and signed with non-negative operands)
13776 division and modulus.
13778 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13780 * simplify-rtx.c (simplify_truncation): Generalize simplification
13781 of (truncate:A (subreg:B X)).
13782 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
13783 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
13784 already has the desired machine mode.
13785 (test_scalar_int_ops): Add tests that useless extensions and
13786 truncations are optimized away.
13787 (test_scalar_int_ext_ops): New self-test function to confirm
13788 that truncations of extensions are correctly simplified.
13789 (test_scalar_int_ext_ops2): New self-test function to check
13790 truncations of truncations, extensions of extensions, and
13791 truncations of extensions.
13792 (test_scalar_ops): Call the above two functions with a
13793 representative sampling of integer machine modes.
13795 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
13797 * match.pd (shift transformations): Change the sign of an
13798 LSHIFT_EXPR if it reduces the number of explicit conversions.
13800 2021-08-23 Jakub Jelinek <jakub@redhat.com>
13802 PR tree-optimization/86723
13803 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
13804 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
13805 non-memory permutation of 64-bit src which only has bytes of
13806 0 or [5..8] and n->range is 4.
13807 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
13808 find_bswap_or_nop_finalize caller, support bswap with some bytes
13809 zeroed, as long as at least two bytes are not zeroed.
13810 (bswap_replace): Add mask argument and handle masking of bswap
13812 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
13813 caller, punt if cast64_to_32 or mask is not all ones.
13814 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
13815 caller, for now punt if cast64_to_32.
13817 2021-08-23 Richard Biener <rguenther@suse.de>
13819 PR tree-optimization/79334
13820 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
13821 a type also for COMPONENT_REFs.
13822 (vn_reference_may_trap): Check ARRAY_REF with constant index
13823 against the array domain.
13825 2021-08-23 liuhongt <hongtao.liu@intel.com>
13828 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
13829 TARGET_AVX512BW to condition.
13831 2021-08-23 Jakub Jelinek <jakub@redhat.com>
13834 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
13835 register variables already during early_dwarf if possible.
13837 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
13839 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
13841 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
13844 * config/arm/arm.opt: Fix typo.
13845 * config/arm/t-rmprofile: Fix typo.
13847 2021-08-23 Jakub Jelinek <jakub@redhat.com>
13849 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
13850 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
13851 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
13852 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
13853 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
13854 if either grainsize or num_tasks clause has the strict modifier.
13856 2021-08-23 Martin Liska <mliska@suse.cz>
13858 * dbgcnt.def (DEBUG_COUNTER): New counter.
13859 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
13861 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
13863 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
13865 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
13868 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
13870 * config/mips/mips.c (mips_msa_output_shift_immediate): New
13872 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
13873 vlshr<mode>3): Call it.
13875 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
13876 Martin Liska <mliska@suse.cz>
13878 PR middle-end/101949
13879 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
13882 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
13884 * config/mips/mips.c (mips_function_rodata_section,
13885 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
13887 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
13889 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
13891 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
13893 * configure.ac (thread-local storage support): Remove tls_first_major
13894 and tls_first_minor. Use "$conftest_s" to check support.
13895 * configure: Regenerate.
13897 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
13899 * configure.ac: Fixup formatting.
13901 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
13903 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
13904 * configure.ac: ... update all callers.
13906 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
13909 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
13910 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
13911 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
13912 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
13913 Remove remaining checks for in-tree assembler.
13914 * configure: Regenerate.
13916 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
13918 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
13919 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
13921 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
13922 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
13923 Improve logical shifts by 27 bits for H8/S.
13924 (get_shift_alg): Corresponding changes.
13925 (h8300_option_override): Revert to loops for -Os when profitable.
13927 2021-08-20 Richard Biener <rguenther@suse.de>
13929 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
13931 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
13932 index as group_id when dataref_groups were not computed.
13933 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
13934 we advace to the next BB.
13936 2021-08-20 Jakub Jelinek <jakub@redhat.com>
13938 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
13941 2021-08-20 Martin Liska <mliska@suse.cz>
13943 PR gcov-profile/89961
13944 * gcov.c (make_gcov_file_name): Rewrite using std::string.
13945 (mangle_name): Simplify, do not used the second argument.
13946 (strip_extention): New function.
13947 (get_md5sum): Likewise.
13948 (get_gcov_intermediate_filename): Handle properly -p and -x
13950 (output_gcov_file): Use string type.
13951 (generate_results): Likewise.
13952 (md5sum_to_hex): Remove.
13954 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
13956 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
13957 (UNSPEC_XXSPLTIW): Move to vsx.md.
13958 (UNSPEC_XXSPLTID): Move to vsx.md.
13959 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
13960 (UNSPEC_XXBLEND): Move to vsx.md.
13961 (UNSPEC_XXPERMX): Move to vsx.md.
13962 (VM3): Move to vsx.md.
13963 (VM3_char): Move to vsx.md.
13964 (xxspltiw_v4si): Move to vsx.md.
13965 (xxspltiw_v4sf): Move to vsx.md.
13966 (xxspltiw_v4sf_inst): Move to vsx.md.
13967 (xxspltidp_v2df): Move to vsx.md.
13968 (xxspltidp_v2df_inst): Move to vsx.md.
13969 (xxsplti32dx_v4si_inst): Move to vsx.md.
13970 (xxsplti32dx_v4sf): Move to vsx.md.
13971 (xxsplti32dx_v4sf_inst): Move to vsx.md.
13972 (xxblend_<mode>): Move to vsx.md.
13973 (xxpermx): Move to vsx.md.
13974 (xxpermx_inst): Move to vsx.md.
13975 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
13976 (UNSPEC_XXSPLTIW): Move from altivec.md.
13977 (UNSPEC_XXSPLTID): Move from altivec.md.
13978 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
13979 (UNSPEC_XXBLEND): Move from altivec.md.
13980 (UNSPEC_XXPERMX): Move from altivec.md.
13981 (VM3): Move from altivec.md.
13982 (VM3_char): Move from altivec.md.
13983 (xxspltiw_v4si): Move from altivec.md.
13984 (xxspltiw_v4sf): Move from altivec.md.
13985 (xxspltiw_v4sf_inst): Move from altivec.md.
13986 (xxspltidp_v2df): Move from altivec.md.
13987 (xxspltidp_v2df_inst): Move from altivec.md.
13988 (xxsplti32dx_v4si_inst): Move from altivec.md.
13989 (xxsplti32dx_v4sf): Move from altivec.md.
13990 (xxsplti32dx_v4sf_inst): Move from altivec.md.
13991 (xxblend_<mode>): Move from altivec.md.
13992 (xxpermx): Move from altivec.md.
13993 (xxpermx_inst): Move from altivec.md.
13995 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
13997 * tree-vect-generic.c (expand_vector_operations_1): Use either
13998 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
13999 when constructing scalar splat expressions.
14001 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
14004 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
14005 pointer to __vector_pair *.
14007 2021-08-19 Martin Sebor <msebor@redhat.com>
14009 * gimple-range.cc: Add comments.
14010 * gimple-range.h: Same.
14012 2021-08-19 Martin Sebor <msebor@redhat.com>
14014 PR middle-end/101984
14015 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
14018 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
14020 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
14021 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
14022 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
14024 2021-08-19 Jakub Jelinek <jakub@redhat.com>
14026 PR middle-end/101950
14027 * optabs.c (expand_clrsb_using_clz): New function.
14028 (expand_unop): Use it as another clrsb expansion fallback.
14030 2021-08-19 liuhongt <hongtao.liu@intel.com>
14033 2021-07-28 liuhongt <hongtao.liu@intel.com>
14036 * config/i386/i386.h (processor_costs): Add new member
14038 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
14039 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
14040 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
14041 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
14042 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
14043 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
14044 generic_cost, core_cost): Initialize integer_to_sse same value
14046 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
14047 * config/i386/i386.c (ix86_builtin_vectorization_cost):
14048 Use integer_to_sse instead of sse_op to calculate the cost of
14051 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
14053 * config.gcc: Include rpath.opt for Darwin.
14054 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
14056 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14058 PR bootstrap/101959
14059 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
14062 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
14064 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
14065 pattern and type cast.
14066 (vld4_lane_f32): Use float RTL pattern.
14067 (vld4q_lane_f64): Use float type cast.
14069 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
14071 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
14074 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14076 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
14077 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
14078 (hash_map_tests_c_tests): Call it.
14080 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
14082 * ggc.h (enum ggc_collect): New.
14083 (ggc_collect): Use it.
14084 * ggc-page.c: Adjust.
14085 * ggc-common.c: Likewise.
14086 * ggc-tests.c: Likewise.
14087 * read-rtl-function.c: Likewise.
14088 * selftest-run-tests.c: Likewise.
14089 * doc/gty.texi (Invoking the garbage collector): Likewise.
14091 2021-08-18 liuhongt <hongtao.liu@intel.com>
14094 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
14096 * config/i386/sse.md (*sse3_haddv2df3_low): Add
14097 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
14098 (*sse3_hsubv2df3_low): Ditto.
14099 * config/i386/x86-tune.def
14100 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
14102 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14104 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
14105 (gori_compute::compute_operand_range): Add tracing.
14106 (gori_compute::logical_combine): Ditto.
14107 (gori_compute::compute_logical_operands): Ditto.
14108 (gori_compute::compute_operand1_range): Ditto.
14109 (gori_compute::compute_operand2_range): Ditto.
14110 (gori_compute::outgoing_edge_range_p): Ditto.
14111 * gimple-range-gori.h (class gori_compute): Add range_tracer.
14113 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14115 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
14116 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
14117 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
14118 * params.opt (--param=evrp-mode): Adjust options.
14120 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14122 * Makefile.in (OBJS): Add gimple-range-trace.o.
14123 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
14124 * gimple-range-fold.cc: Adjust headers.
14125 * gimple-range-trace.cc: New.
14126 * gimple-range-trace.h: New.
14127 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
14128 (gimple_ranger::range_of_expr): Add tracing.
14129 (gimple_ranger::range_on_entry): Ditto.
14130 (gimple_ranger::range_on_exit): Ditto.
14131 (gimple_ranger::range_on_edge): Ditto.
14132 (gimple_ranger::fold_range_internal): Ditto.
14133 (gimple_ranger::dump_bb): Do not calculate edge range twice.
14134 (trace_ranger::*): Remove.
14135 (enable_ranger): Never create a trace_ranger.
14136 (debug_seed_ranger): Move to gimple-range-trace.cc.
14137 (dump_ranger): Ditto.
14138 (debug_ranger): Ditto.
14139 * gimple-range.h: Include gimple-range-trace.h.
14140 (range_on_entry, range_on_exit): No longer virtual.
14141 (class trace_ranger): Remove.
14142 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
14144 2021-08-17 Martin Sebor <msebor@redhat.com>
14146 PR middle-end/101854
14147 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
14148 in gimple-ssa-warn-access.cc.
14149 * calls.c (alloc_max_size): Move code to check_alloca.
14150 (get_size_range): Move to pointer-query.cc.
14151 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
14152 (get_attr_nonstring_decl): Move to tree.c.
14153 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
14154 (append_attrname): Same.
14155 (maybe_warn_rdwr_sizes): Same.
14156 (initialize_argument_information): Move code to
14157 gimple-ssa-warn-access.cc.
14158 * calls.h (maybe_warn_alloc_args_overflow): Move to
14159 gimple-ssa-warn-access.h.
14160 (get_attr_nonstring_decl): Move to tree.h.
14161 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
14162 (enum size_range_flags): Move to pointer-query.h.
14163 (get_size_range): Same.
14164 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
14165 to avoid Clang -Wunused-function.
14166 (get_size_range): Declare static.
14167 (maybe_emit_free_warning): Rename...
14168 (maybe_check_dealloc_call): ...to this for consistency.
14169 (class pass_waccess): Add members.
14170 (pass_waccess::~pass_waccess): Defined.
14171 (alloc_max_size): Move here from calls.c.
14172 (maybe_warn_alloc_args_overflow): Same.
14173 (check_alloca): New function.
14174 (check_alloc_size_call): New function.
14175 (check_strncat): Handle another warning flag.
14176 (pass_waccess::check_builtin): Handle alloca.
14177 (fntype_argno_type): Move here from calls.c.
14178 (append_attrname): Same.
14179 (maybe_warn_rdwr_sizes): Same.
14180 (pass_waccess::check_call): Define.
14181 (check_nonstring_args): New function.
14182 (pass_waccess::check): Call new member functions.
14183 (pass_waccess::execute): Enable ranger.
14184 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
14185 (maybe_warn_nonstring_arg): Same.
14186 * gimple-ssa-warn-restrict.c: Remove #include.
14187 * pointer-query.cc (get_size_range): Move here from calls.c.
14188 * pointer-query.h (enum size_range_flags): Same.
14189 (get_size_range): Same.
14190 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
14191 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
14193 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
14195 * ggc.h (ggc_collect): Add 'force_collect' parameter.
14196 * ggc-page.c (ggc_collect): Use that one instead of global
14197 'ggc_force_collect'. Adjust all users.
14198 * doc/gty.texi (Invoking the garbage collector): Update.
14199 * ggc-internal.h (ggc_force_collect): Remove.
14200 * ggc-common.c (ggc_force_collect): Likewise.
14201 * selftest.h (forcibly_ggc_collect): Remove.
14202 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
14203 * read-rtl-function.c (test_loading_labels): Adjust.
14204 * selftest-run-tests.c (run_tests): Likewise.
14206 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
14208 * config/darwin.c (darwin_file_end): Reset and reclaim the
14209 section names table at the end of compile.
14211 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
14214 * config.in: Regenerate.
14215 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
14216 (ASM_SPEC): Pass options to disable branch shortening where
14218 * configure: Regenerate.
14219 * configure.ac: Detect versions of 'as' that support the
14220 optimisation which has the bug.
14222 2021-08-17 Richard Biener <rguenther@suse.de>
14224 * optabs-query.c (supports_vec_gather_load_p): Also check
14226 (supports_vec_scatter_store_p): Likewise.
14227 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
14228 back to masked variants if non-masked are not supported.
14229 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
14230 When we need to use masked gather/scatter but do not have
14231 a mask set up a constant true one.
14232 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
14233 non-SSA_NAME masks.
14235 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
14237 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
14238 algorithm as PLUS_EXPR to improve subtraction bit bounds.
14239 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
14241 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
14243 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
14244 calculate the mask-value pair result of a multiplication by an
14246 (bit_value_binop) [MULT_EXPR]: Call it from here for
14247 multiplications by (sparse) non-negative constants.
14249 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
14252 * config.gcc (gcc_cv_initfini_array): Leave undefined for
14253 uclinuxfdpiceabi targets.
14255 2021-08-17 Alexandre Oliva <oliva@adacore.com>
14257 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
14258 reverse debug stmts.
14260 2021-08-17 Alexandre Oliva <oliva@adacore.com>
14262 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
14264 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14266 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
14267 (__LD4Q_LANE_FUNC): Likewise.
14268 (vld4_lane_u8): Define without macro.
14269 (vld4_lane_u16): Likewise.
14270 (vld4_lane_u32): Likewise.
14271 (vld4_lane_u64): Likewise.
14272 (vld4_lane_s8): Likewise.
14273 (vld4_lane_s16): Likewise.
14274 (vld4_lane_s32): Likewise.
14275 (vld4_lane_s64): Likewise.
14276 (vld4_lane_f16): Likewise.
14277 (vld4_lane_f32): Likewise.
14278 (vld4_lane_f64): Likewise.
14279 (vld4_lane_p8): Likewise.
14280 (vld4_lane_p16): Likewise.
14281 (vld4_lane_p64): Likewise.
14282 (vld4q_lane_u8): Likewise.
14283 (vld4q_lane_u16): Likewise.
14284 (vld4q_lane_u32): Likewise.
14285 (vld4q_lane_u64): Likewise.
14286 (vld4q_lane_s8): Likewise.
14287 (vld4q_lane_s16): Likewise.
14288 (vld4q_lane_s32): Likewise.
14289 (vld4q_lane_s64): Likewise.
14290 (vld4q_lane_f16): Likewise.
14291 (vld4q_lane_f32): Likewise.
14292 (vld4q_lane_f64): Likewise.
14293 (vld4q_lane_p8): Likewise.
14294 (vld4q_lane_p16): Likewise.
14295 (vld4q_lane_p64): Likewise.
14296 (vld4_lane_bf16): Likewise.
14297 (vld4q_lane_bf16): Likewise.
14299 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14301 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
14302 (__LD3Q_LANE_FUNC): Delete.
14303 (vld3_lane_u8): Define without macro.
14304 (vld3_lane_u16): Likewise.
14305 (vld3_lane_u32): Likewise.
14306 (vld3_lane_u64): Likewise.
14307 (vld3_lane_s8): Likewise.
14308 (vld3_lane_s16): Likewise.
14309 (vld3_lane_s32): Likewise.
14310 (vld3_lane_s64): Likewise.
14311 (vld3_lane_f16): Likewise.
14312 (vld3_lane_f32): Likewise.
14313 (vld3_lane_f64): Likewise.
14314 (vld3_lane_p8): Likewise.
14315 (vld3_lane_p16): Likewise.
14316 (vld3_lane_p64): Likewise.
14317 (vld3q_lane_u8): Likewise.
14318 (vld3q_lane_u16): Likewise.
14319 (vld3q_lane_u32): Likewise.
14320 (vld3q_lane_u64): Likewise.
14321 (vld3q_lane_s8): Likewise.
14322 (vld3q_lane_s16): Likewise.
14323 (vld3q_lane_s32): Likewise.
14324 (vld3q_lane_s64): Likewise.
14325 (vld3q_lane_f16): Likewise.
14326 (vld3q_lane_f32): Likewise.
14327 (vld3q_lane_f64): Likewise.
14328 (vld3q_lane_p8): Likewise.
14329 (vld3q_lane_p16): Likewise.
14330 (vld3q_lane_p64): Likewise.
14331 (vld3_lane_bf16): Likewise.
14332 (vld3q_lane_bf16): Likewise.
14334 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
14336 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
14337 (__LD2Q_LANE_FUNC): Likewise.
14338 (vld2_lane_u8): Define without macro.
14339 (vld2_lane_u16): Likewise.
14340 (vld2_lane_u32): Likewise.
14341 (vld2_lane_u64): Likewise.
14342 (vld2_lane_s8): Likewise.
14343 (vld2_lane_s16): Likewise.
14344 (vld2_lane_s32): Likewise.
14345 (vld2_lane_s64): Likewise.
14346 (vld2_lane_f16): Likewise.
14347 (vld2_lane_f32): Likewise.
14348 (vld2_lane_f64): Likewise.
14349 (vld2_lane_p8): Likewise.
14350 (vld2_lane_p16): Likewise.
14351 (vld2_lane_p64): Likewise.
14352 (vld2q_lane_u8): Likewise.
14353 (vld2q_lane_u16): Likewise.
14354 (vld2q_lane_u32): Likewise.
14355 (vld2q_lane_u64): Likewise.
14356 (vld2q_lane_s8): Likewise.
14357 (vld2q_lane_s16): Likewise.
14358 (vld2q_lane_s32): Likewise.
14359 (vld2q_lane_s64): Likewise.
14360 (vld2q_lane_f16): Likewise.
14361 (vld2q_lane_f32): Likewise.
14362 (vld2q_lane_f64): Likewise.
14363 (vld2q_lane_p8): Likewise.
14364 (vld2q_lane_p16): Likewise.
14365 (vld2q_lane_p64): Likewise.
14366 (vld2_lane_bf16): Likewise.
14367 (vld2q_lane_bf16): Likewise.
14369 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14371 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
14374 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14376 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
14377 (rank_for_schedule): Use it.
14379 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
14381 PR rtl-optimization/91598
14382 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
14383 insns after memory reads and before memory writes.
14385 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
14387 * rtl.h (CONST_VECTOR_P): New macro.
14388 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
14389 code testing macros.
14390 (aarch64_ptrue_all_mode): Likewise.
14391 (aarch64_expand_mov_immediate): Likewise.
14392 (aarch64_const_vec_all_in_range_p): Likewise.
14393 (aarch64_rtx_costs): Likewise.
14394 (aarch64_legitimate_constant_p): Likewise.
14395 (aarch64_simd_valid_immediate): Likewise.
14396 (aarch64_simd_make_constant): Likewise.
14397 (aarch64_convert_mult_to_shift): Likewise.
14398 (aarch64_expand_sve_vec_perm): Likewise.
14399 (aarch64_vec_fpconst_pow_of_2): Likewise.
14401 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
14403 PR tree-optimization/101938
14404 * range-op.cc (operator_abs::op1_range): Special case
14405 -TYPE_MIN_VALUE for flag_wrapv.
14407 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
14409 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
14412 2021-08-17 Jakub Jelinek <jakub@redhat.com>
14414 * tree.def (OMP_SCOPE): New tree code.
14415 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
14416 * tree-nested.c (convert_nonlocal_reference_stmt,
14417 convert_local_reference_stmt, convert_gimple_call): Handle
14419 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
14420 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
14421 * gimple.c (gimple_build_omp_scope): New function.
14422 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
14423 * gimple.h (gimple_build_omp_scope): Declare.
14424 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
14425 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
14426 gimple_omp_scope_set_clauses): New inline functions.
14427 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
14428 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
14429 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
14430 * gimple-walk.c (walk_gimple_stmt): Likewise.
14431 * gimple-low.c (lower_stmt): Likewise.
14432 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
14433 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
14434 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
14435 allowing task reductions. Reject inscan reductions on scope.
14436 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
14437 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
14438 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
14439 (estimate_num_insns): Likewise.
14440 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
14441 contexts if var isn't privatized there.
14442 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
14443 (scan_omp_1_stmt): Likewise.
14444 (maybe_add_implicit_barrier_cancel): Look through outer
14446 (lower_omp_scope): New function.
14447 (lower_omp_task_reductions): Handle OMP_SCOPE.
14448 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
14449 (diagnose_sb_1, diagnose_sb_2): Likewise.
14450 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
14451 (expand_omp): Handle GIMPLE_OMP_SCOPE.
14452 (omp_make_gimple_edges): Likewise.
14453 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
14455 2021-08-17 Richard Biener <rguenther@suse.de>
14457 PR tree-optimization/101925
14458 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
14459 reverse on COMPONENT_REF and ARRAY_REF according to
14460 what reverse_storage_order_for_component_p does.
14461 (vn_reference_eq): Compare reversed on reference ops.
14462 (reverse_storage_order_for_component_p): New overload.
14463 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
14464 on the reference looked up.
14466 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
14468 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
14469 shifts on the H8/S.
14470 (h8300_option_override): Use loops on H8/S more often when optimizing
14472 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
14473 accordingly. Handle various arithmetic right shifts with special
14474 sequences that we couldn't handle before.
14476 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
14478 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
14480 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
14482 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
14483 * config/sparc/sparc.c (sparc_gcov_type_size): New.
14484 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
14485 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
14486 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
14487 * doc/tm.texi.in: Regenerate.
14488 * target.def (gcov_type_size): New target hook.
14489 * targhooks.c (default_gcov_type_size): New.
14490 * targhooks.h (default_gcov_type_size): Declare.
14491 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
14493 (gimple_gen_time_profiler): Likewise.
14495 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
14497 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
14499 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
14501 PR middle-end/101931
14502 * omp-low.c (omp_runtime_api_call): Update for routines
14503 added in the meanwhile.
14505 2021-08-16 Martin Liska <mliska@suse.cz>
14507 PR tree-optimization/100393
14508 * tree-switch-conversion.c (group_cluster::dump): Use
14509 get_comparison_count.
14510 (jump_table_cluster::find_jump_tables): Pre-compute number of
14511 comparisons and then decrement it. Cache also max_ratio.
14512 (jump_table_cluster::can_be_handled): Change signature.
14513 * tree-switch-conversion.h (get_comparison_count): New.
14515 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
14517 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
14518 encodings only when -fgnat-encodings=all is specified.
14519 (add_bound_info): Likewise.
14520 (add_byte_size_attribute): Likewise.
14521 (gen_member_die): Likewise.
14523 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14525 * omp-oacc-neuter-broadcast.cc
14526 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
14528 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14530 * omp-oacc-neuter-broadcast.cc
14531 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
14534 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
14536 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
14537 (execute_omp_oacc_neuter_broadcast): ... here.
14538 (install_var_field, build_receiver_ref, build_sender_ref): Take
14539 'field_map_t *' parameter. Adjust all users.
14540 (worker_single_copy, neuter_worker_single): Take a
14541 'record_field_map_t *' parameter. Adjust all users.
14543 2021-08-16 liuhongt <hongtao.liu@intel.com>
14546 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
14549 2021-08-16 Martin Liska <mliska@suse.cz>
14552 * multiple_target.c (create_dispatcher_calls): Make default
14553 function local only if it is a definition.
14555 2021-08-16 Martin Liska <mliska@suse.cz>
14558 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
14559 consider equal SSA_NAMEs when one is a param.
14561 2021-08-16 liuhongt <hongtao.liu@intel.com>
14564 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
14565 Support vpermi2b for V32QI/V16QImode.
14566 (ix86_extract_perm_from_pool_constant): New function.
14567 (ix86_expand_vec_one_operand_perm_avx512): Support
14568 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
14569 (expand_vec_perm_1): Adjust comments for upper.
14570 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
14572 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
14573 (pshufb_truncv4siv4hi_operand): Ditto.
14574 (pshufb_truncv8hiv8qi_operand): Ditto.
14575 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
14576 New pre_reload define_insn_and_split.
14577 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
14578 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
14579 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
14580 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
14581 (*avx512f_permvar_truncv4div4si_1): Ditto.
14582 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
14583 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
14584 (*avx512f_pshufd_truncv2div2si_1): Ditto.
14586 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
14588 * config/riscv/multilib-generator: Support code model option for
14590 * doc/install.texi: Add document of new option for
14591 --with-multilib-generator.
14593 2021-08-15 Clément Chigot <clement.chigot@atos.net>
14595 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
14596 (rs6000_legitimize_tls_address_aix): Use it.
14597 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
14598 xcoff_tls_exec_model_detected is true.
14600 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
14602 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
14603 to allow a bit more code growth, saving many dozens of cycles.
14604 (h8300_option_override): Adjus shift_alg_si if optimizing for
14606 (get_shift_alg): Use special + inline shifts for residuals
14609 2021-08-14 Stafford Horne <shorne@gmail.com>
14612 * config/or1k/or1k-opts.h: New file.
14613 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
14614 Support generating gotha relocations if -mcmodel=large is
14616 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
14618 * config/or1k/or1k.opt (mcmodel=): New option.
14619 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
14621 2021-08-14 Martin Sebor <msebor@redhat.com>
14623 PR middle-end/101791
14624 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
14625 to valid_new_delete_pair_p.
14626 * tree.c (valid_new_delete_pair_p): Add argument.
14627 * tree.h (valid_new_delete_pair_p): Same.
14629 2021-08-14 Jakub Jelinek <jakub@redhat.com>
14632 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
14633 <case E_V64QImode>: For this mode assert
14634 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
14636 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
14639 * config/rs6000/altivec.md (xxeval): Use register_predicate
14640 instead of altivec_register_predicate.
14642 2021-08-13 Martin Sebor <msebor@redhat.com>
14644 PR middle-end/101734
14645 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
14646 (maybe_warn_operand): Call it.
14648 2021-08-13 Martin Liska <mliska@suse.cz>
14651 * attribs.c (decl_attributes): Make naked functions "noipa"
14654 2021-08-13 Martin Liska <mliska@suse.cz>
14657 * symtab.c (symtab_node::noninterposable_alias): Do not create
14658 local aliases for target_clone functions as the clonning pass
14661 2021-08-13 Martin Liska <mliska@suse.cz>
14663 * opts.c (LIVE_PATCHING_OPTION): Define.
14664 (control_options_for_live_patching): Use it in error messages.
14666 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
14668 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
14669 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
14670 ignore_stores_eaf_flags): New constants.
14671 (remove_useless_eaf_flags): New function.
14672 (eaf_flags_useful_p): Use it.
14673 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
14675 (modref_lattice::init): Add EAF_NOREAD.
14676 (modref_lattice::add_escape_point): Do not reacord escape point if
14678 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
14679 use remove_useless_eaf_flags.
14680 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
14681 (modref_lattice::merge_direct_load): Add EAF_NOREAD
14682 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
14683 (analyze_parms): Use remove_useless_eaf_flags.
14684 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
14685 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
14686 use remove_useless_eaf_flags.
14687 (modref_propagate_flags_in_scc): Update.
14688 * ipa-modref.h: Turn eaf_flags_t back to char.
14689 * tree-core.h (EAF_NOT_RETURNED): Fix.
14690 (EAF_NOREAD): New constant
14691 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
14693 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
14694 (handle_pure_call): Likewise.
14696 2021-08-12 Jakub Jelinek <jakub@redhat.com>
14698 * tree.def (OMP_MASKED): New tree code.
14699 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
14700 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
14701 OMP_CLAUSE_FILTER_EXPR): Define.
14702 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
14703 (omp_clause_code_name): Likewise.
14704 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
14705 * tree-nested.c (convert_nonlocal_omp_clauses,
14706 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
14707 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
14708 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
14709 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
14710 (dump_generic_node): Handle OMP_MASTER.
14711 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
14712 * gimple.c (gimple_build_omp_masked): New function.
14713 (gimple_copy): Handle GIMPLE_OMP_MASKED.
14714 * gimple.h (gimple_build_omp_masked): Declare.
14715 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
14716 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
14717 gimple_omp_masked_set_clauses): New inline functions.
14718 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
14719 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
14720 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
14721 * gimple-walk.c (walk_gimple_stmt): Likewise.
14722 * gimple-low.c (lower_stmt): Likewise.
14723 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
14724 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
14725 that take one expression rather than decl or constant, force
14726 gimplification of that into a SSA_NAME or temporary unless min
14728 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
14729 (gimplify_expr): Handle OMP_MASKED.
14730 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
14731 (estimate_num_insns): Likewise.
14732 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
14733 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
14734 diagnostics for existence of masked construct.
14735 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
14736 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
14737 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
14740 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
14743 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
14744 (ldexp<mode>3): Use avx512f_scalef<mode>2.
14745 (UNSPEC_SCALEF): Move from ...
14746 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
14748 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
14750 * ipa-split.c (consider_split): Fix condition testing void functions.
14752 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
14754 * doc/invoke.texi: Remove docs for threader-mode param.
14755 * flag-types.h (enum threader_mode): Remove.
14756 * params.opt: Remove threader-mode param.
14757 * tree-ssa-threadbackward.c (class back_threader): Remove
14758 path_is_unreachable_p.
14759 Make find_paths private.
14760 Add maybe_thread and thread_through_all_blocks.
14761 Remove reference marker for m_registry.
14762 Remove reference marker for m_profit.
14763 (back_threader::back_threader): Adjust for registry and profit not
14765 (dump_path): Move down.
14766 (debug): Move down.
14767 (class thread_jumps): Remove.
14768 (class back_threader_registry): Remove m_all_paths.
14770 (thread_jumps::thread_through_all_blocks): Move to back_threader
14772 (fsm_find_thread_path): Remove
14773 (back_threader::maybe_thread): New.
14774 (back_threader::thread_through_all_blocks): Move from
14776 (back_threader_registry::back_threader_registry): Remove
14778 (back_threader_registry::~back_threader_registry): Remove.
14779 (thread_jumps::find_taken_edge): Remove.
14780 (thread_jumps::check_subpath_and_update_thread_path): Remove.
14781 (thread_jumps::maybe_register_path): Remove.
14782 (thread_jumps::handle_phi): Remove.
14783 (handle_assignment_p): Remove.
14784 (thread_jumps::handle_assignment): Remove.
14785 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
14786 (thread_jumps::find_jump_threads_backwards): Remove.
14787 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
14788 (try_thread_blocks): Rename find_jump_threads_backwards to
14790 (pass_early_thread_jumps::execute): Same.
14792 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
14794 * tree-core.h (omp_clause_proc_bind_kind): Add
14795 OMP_CLAUSE_PROC_BIND_PRIMARY.
14796 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
14797 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
14799 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
14801 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
14802 fno-common reference.
14803 * config/arc/arc.c (arc_override_options): Remove overriding of
14806 2021-08-12 Jakub Jelinek <jakub@redhat.com>
14809 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
14810 If d->testing_p, return true after performing checks instead of
14811 actually expanding the insn.
14812 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
14813 !TARGET_AVX512BW and return false.
14815 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
14817 * configure.ac (PE linker --disable-dynamicbase support): New check.
14818 * configure: Regenerate.
14819 * config.in: Likewise.
14820 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
14821 (LINK_SPEC): Use it.
14822 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
14823 (LINK_SPEC): Likewise.
14825 2021-08-12 liuhongt <hongtao.liu@intel.com>
14828 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
14829 post_reload define_insn_and_split.
14830 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
14831 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
14832 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
14833 (*avx2_zero_extendv8hiv8si2_2): Ditto.
14834 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
14835 (*avx512f_zero_extendv8siv8di2_2): Ditto.
14836 (*avx2_zero_extendv4siv4di2_2): Ditto.
14837 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
14838 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
14841 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
14843 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
14846 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
14848 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
14850 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
14852 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
14853 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
14854 initialization of pcvoid_type_node here...
14855 (altivec_init_builtins): ...from here.
14856 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
14857 RS6000_BTI_const_ptr_void.
14858 (pcvoid_type_node): New macro.
14860 2021-08-11 Richard Biener <rguenther@suse.de>
14863 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
14864 hard-register accesses.
14866 2021-08-11 Richard Biener <rguenther@suse.de>
14868 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
14869 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
14870 to determine has_volatile_ops.
14872 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
14874 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
14876 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
14877 Alexandre Oliva <oliva@adacore.com>
14879 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
14881 (modref_lattice::merge_deref): Fix handling of indirect scape points.
14882 (update_escape_summary_1): Likewise.
14883 (update_escape_summary): Likewise.
14884 (ipa_merge_modref_summary_after_inlining): Likewise.
14886 2021-08-11 Richard Biener <rguenther@suse.de>
14888 PR middle-end/101858
14889 * fold-const.c (fold_binary_loc): Guard simplification
14890 of X < (cast) (1 << Y) to integer types.
14892 2021-08-11 Richard Biener <rguenther@suse.de>
14894 PR tree-optimization/101861
14895 * tree-vect-stmts.c (vectorizable_load): Fix error in
14896 previous change with regard to gather vectorization.
14898 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
14901 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
14903 (vdup_n_s16): Likewise.
14904 (vdup_n_s32): Likewise.
14905 (vdup_n_s64): Likewise.
14906 (vdup_n_u8): Likewise.
14907 (vdup_n_u16): Likewise.
14908 (vdup_n_u32): Likewise.
14909 (vdup_n_u64): Likewise.
14910 (vdup_n_p8): Likewise.
14911 (vdup_n_p16): Likewise.
14912 (vdup_n_p64): Likewise.
14913 (vdup_n_f16): Likewise.
14914 (vdup_n_f32): Likewise.
14915 (vdupq_n_s8): Likewise.
14916 (vdupq_n_s16): Likewise.
14917 (vdupq_n_s32): Likewise.
14918 (vdupq_n_s64): Likewise.
14919 (vdupq_n_u8): Likewise.
14920 (vdupq_n_u16): Likewise.
14921 (vdupq_n_u32): Likewise.
14922 (vdupq_n_u64): Likewise.
14923 (vdupq_n_p8): Likewise.
14924 (vdupq_n_p16): Likewise.
14925 (vdupq_n_p64): Likewise.
14926 (vdupq_n_f16): Likewise.
14927 (vdupq_n_f32): Likewise.
14928 (vmov_n_s8): Replace call to builtin with call to corresponding
14930 (vmov_n_s16): Likewise.
14931 (vmov_n_s32): Likewise.
14932 (vmov_n_s64): Likewise.
14933 (vmov_n_u8): Likewise.
14934 (vmov_n_u16): Likewise.
14935 (vmov_n_u32): Likewise.
14936 (vmov_n_u64): Likewise.
14937 (vmov_n_p8): Likewise.
14938 (vmov_n_p16): Likewise.
14939 (vmov_n_f16): Likewise.
14940 (vmov_n_f32): Likewise.
14941 (vmovq_n_s8): Likewise.
14942 (vmovq_n_s16): Likewise.
14943 (vmovq_n_s32): Likewise.
14944 (vmovq_n_s64): Likewise.
14945 (vmovq_n_u8): Likewise.
14946 (vmovq_n_u16): Likewise.
14947 (vmovq_n_u32): Likewise.
14948 (vmovq_n_u64): Likewise.
14949 (vmovq_n_p8): Likewise.
14950 (vmovq_n_p16): Likewise.
14951 (vmovq_n_f16): Likewise.
14952 (vmovq_n_f32): Likewise.
14953 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
14955 2021-08-11 liuhongt <hongtao.liu@intel.com>
14958 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
14959 when TARGET_AVX512F and TARGET_SSE_MATH.
14961 2021-08-10 Jakub Jelinek <jakub@redhat.com>
14964 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
14965 for V32HImode if !TARGET_AVX512BW.
14966 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
14967 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
14968 early, but actually check the permutation.
14970 2021-08-10 Richard Biener <rguenther@suse.de>
14972 PR tree-optimization/101809
14973 * tree-vect-stmts.c (get_load_store_type): Allow emulated
14974 gathers with offset vector nunits being a constant multiple
14975 of the data vector nunits.
14976 (vect_get_gather_scatter_ops): Use the appropriate nunits
14977 for the offset vector defs.
14978 (vectorizable_store): Adjust call to
14979 vect_get_gather_scatter_ops.
14980 (vectorizable_load): Likewise. Handle the case of less
14981 offset vectors than data vectors.
14983 2021-08-10 Jakub Jelinek <jakub@redhat.com>
14986 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
14987 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
14990 2021-08-10 Richard Biener <rguenther@suse.de>
14992 PR tree-optimization/101801
14993 PR tree-optimization/101819
14994 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
14995 * tree-vect-loop.c (vect_emulated_vector_p): New function.
14996 (vectorizable_reduction): Re-instantiate a check for emulated
14998 * tree-vect-stmts.c (vectorizable_shift): Likewise.
14999 (vectorizable_operation): Likewise. Cost emulated vector
15000 operations according to the scalar sequence synthesized by
15003 2021-08-10 Richard Biener <rguenther@suse.de>
15005 PR middle-end/101824
15006 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
15007 volatile in case the variable was.
15009 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
15012 * config/i386/constraints.md (BC): Document for integer SSE
15013 constant all bits set operand.
15014 (BF): New constraint for const floating-point all bits set
15016 * config/i386/i386.c (standard_sse_constant_p): Likewise.
15017 (standard_sse_constant_opcode): Likewise.
15018 * config/i386/sse.md (sseconstm1): New mode attribute.
15019 (mov<mode>_internal): Replace BC with <sseconstm1>.
15021 2021-08-10 liuhongt <hongtao.liu@intel.com>
15023 * config/i386/sse.md (cond_<insn><mode>): New expander.
15024 (VI248_AVX512VLBW): New mode iterator.
15025 * config/i386/predicates.md
15026 (nonimmediate_or_const_vec_dup_operand): New predicate.
15028 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
15030 PR tree-optimization/101741
15031 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
15032 type of parameter for toupper/tolower.
15034 2021-08-09 Martin Jambor <mjambor@suse.cz>
15036 PR testsuite/101654
15037 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
15039 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
15041 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
15043 (is_store_insn1): Verify source is a register.
15045 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
15048 * config/i386/mmx.md (<any_logic:code>v2sf3):
15049 Rename from *mmx_<any_logic:code>v2sf3
15051 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15053 * config/nvptx/nvptx.c: Cross-reference parts adapted in
15054 'gcc/omp-oacc-neuter-broadcast.cc'.
15055 * omp-low.c: Likewise.
15056 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
15059 2021-08-09 Julian Brown <julian@codesourcery.com>
15060 Kwok Cheung Yeung <kcy@codesourcery.com>
15061 Thomas Schwinge <thomas@codesourcery.com>
15063 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
15064 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
15065 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
15066 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
15067 (gcn_fork_join): Update comment.
15068 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
15069 (macc_experimental_workers): Remove unused option.
15071 2021-08-09 Julian Brown <julian@codesourcery.com>
15072 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
15073 Kwok Cheung Yeung <kcy@codesourcery.com>
15074 Thomas Schwinge <thomas@codesourcery.com>
15076 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
15077 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
15078 Add documentation hook.
15079 * doc/tm.texi: Regenerate.
15080 * omp-oacc-neuter-broadcast.cc: New file.
15081 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
15082 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
15083 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
15084 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
15085 * target.def (goacc.create_worker_broadcast_record): Add target
15087 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
15089 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
15090 Rename prototype to...
15091 (gcn_goacc_create_worker_broadcast_record): ... this.
15092 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
15094 (gcn_goacc_create_worker_broadcast_record): ... this.
15095 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
15097 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
15099 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
15102 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
15103 the right iterator.
15105 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15107 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
15109 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15111 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
15112 consider '-foffload-abi'.
15113 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
15114 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
15117 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15119 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
15122 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15124 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
15126 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15128 * doc/gty.texi (Files): Update.
15130 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
15132 * doc/gty.texi (Files): Fix GTY header file example.
15134 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
15136 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
15137 determine the upper and lower bounds from a mask-value pair.
15138 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
15139 absolute value and unsigned absolute value expressions.
15140 (bit_value_binop): Initialize *VAL's precision.
15141 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
15142 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
15143 support when the operands are unknown but potentially equal.
15144 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
15146 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
15148 * config/aarch64/aarch64.md
15149 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
15151 2021-08-08 Sergei Trofimovich <siarheit@google.com>
15153 * lra-constraints.c: Fix s/otput/output/ typo.
15155 2021-08-06 Martin Sebor <msebor@redhat.com>
15157 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
15158 (expand_builtin_strcat): Same.
15159 (expand_builtin_stpncpy): Same.
15160 (expand_builtin_strncat): Same.
15161 (check_read_access): Same.
15162 (check_memop_access): Same.
15163 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
15164 (expand_builtin_strnlen): Same.
15165 (expand_builtin_memcpy): Same.
15166 (expand_builtin_memmove): Same.
15167 (expand_builtin_mempcpy): Same.
15168 (expand_builtin_strcpy): Same.
15169 (expand_builtin_strcpy_args): Same.
15170 (expand_builtin_stpcpy_1): Same.
15171 (expand_builtin_strncpy): Same.
15172 (expand_builtin_memset): Same.
15173 (expand_builtin_bzero): Same.
15174 (expand_builtin_strcmp): Same.
15175 (expand_builtin_strncmp): Same.
15176 (expand_builtin): Remove handlers.
15177 (fold_builtin_strlen): Add a comment.
15178 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
15179 * calls.c (maybe_warn_nonstring_arg): Same.
15180 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
15181 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
15182 (gimple_fold_builtin_stpcpy): Same.
15183 * gimple-ssa-warn-access.cc (has_location): New function.
15184 (get_location): Same.
15185 (get_callee_fndecl): Same.
15186 (call_nargs): Same.
15188 (warn_string_no_nul): Define.
15189 (unterminated_array): Same.
15190 (check_nul_terminated_array): Same.
15191 (maybe_warn_nonstring_arg): Same.
15192 (maybe_warn_for_bound): Same.
15193 (warn_for_access): Same.
15194 (check_access): Same.
15195 (check_memop_access): Same.
15196 (check_read_access): Same.
15197 (warn_dealloc_offset): Use helper functions.
15198 (maybe_emit_free_warning): Same.
15199 (class pass_waccess): Add members.
15200 (check_strcat): New function.
15201 (check_strncat): New function.
15202 (check_stxcpy): New function.
15203 (check_stxncpy): New function.
15204 (check_strncmp): New function.
15205 (pass_waccess::check_builtin): New function.
15206 (pass_waccess::check): Call it.
15207 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
15209 (maybe_warn_for_bound): Same.
15210 (check_access): Same.
15211 (check_memop_access): Same.
15212 (check_read_access): Same.
15213 * pointer-query.h (struct access_data): Define a ctor overload.
15215 2021-08-06 Richard Biener <rguenther@suse.de>
15217 PR tree-optimization/101801
15218 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
15219 (vect_can_vectorize_without_simd_p): ... to this.
15220 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
15221 (vect_can_vectorize_without_simd_p): ... to this and fold
15222 in vect_min_worthwhile_factor.
15223 (vect_min_worthwhile_factor): Remove.
15224 (vectorizable_reduction): Adjust and remove the cost part.
15225 * tree-vect-stmts.c (vectorizable_shift): Likewise.
15226 (vectorizable_operation): Likewise.
15228 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
15231 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
15232 Add general_gr_operand predicate to operand 3.
15234 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
15236 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
15237 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
15239 2021-08-06 Tamar Christina <tamar.christina@arm.com>
15241 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
15242 register_svprfop): Pass vec<> by pointer.
15243 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
15244 * langhooks.c (lhd_simulate_enum_decl): Likewise.
15245 * langhooks.h (struct lang_hooks_for_types): Likewise.
15247 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15249 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
15250 __builtin_memcpy instead of constructing an additional
15251 __builtin_aarch64_simd_oi one vector at a time.
15252 (vst1q_bf16_x2): Likewise.
15253 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
15254 an additional __builtin_aarch64_simd_ci one vector at a time.
15255 (vst1q_bf16_x3): Likewise.
15256 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
15257 (vst1q_bf16_x4): Likewise.
15258 (vst2_bf16): Use __builtin_memcpy instead of constructing an
15259 additional __builtin_aarch64_simd_oi one vector at a time.
15260 (vst2q_bf16): Likewise.
15261 (vst3_bf16): Use __builtin_memcpy instead of constructing an
15262 additional __builtin_aarch64_simd_ci mode one vector at a
15264 (vst3q_bf16): Likewise.
15265 (vst4_bf16): Use __builtin_memcpy instead of constructing an
15266 additional __builtin_aarch64_simd_xi one vector at a time.
15267 (vst4q_bf16): Likewise.
15269 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15271 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
15272 (__ST2Q_LANE_FUNC): Delete.
15273 (vst2_lane_f16): Use __builtin_memcpy to copy vector
15274 structure instead of constructing __builtin_aarch64_simd_oi
15275 one vector at a time.
15276 (vst2_lane_f32): Likewise.
15277 (vst2_lane_f64): Likewise.
15278 (vst2_lane_p8): Likewise.
15279 (vst2_lane_p16): Likewise.
15280 (vst2_lane_p64): Likewise.
15281 (vst2_lane_s8): Likewise.
15282 (vst2_lane_s16): Likewise.
15283 (vst2_lane_s32): Likewise.
15284 (vst2_lane_s64): Likewise.
15285 (vst2_lane_u8): Likewise.
15286 (vst2_lane_u16): Likewise.
15287 (vst2_lane_u32): Likewise.
15288 (vst2_lane_u64): Likewise.
15289 (vst2_lane_bf16): Likewise.
15290 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
15291 structure instead of using a union.
15292 (vst2q_lane_f32): Likewise.
15293 (vst2q_lane_f64): Likewise.
15294 (vst2q_lane_p8): Likewise.
15295 (vst2q_lane_p16): Likewise.
15296 (vst2q_lane_p64): Likewise.
15297 (vst2q_lane_s8): Likewise.
15298 (vst2q_lane_s16): Likewise.
15299 (vst2q_lane_s32): Likewise.
15300 (vst2q_lane_s64): Likewise.
15301 (vst2q_lane_u8): Likewise.
15302 (vst2q_lane_u16): Likewise.
15303 (vst2q_lane_u32): Likewise.
15304 (vst2q_lane_u64): Likewise.
15305 (vst2q_lane_bf16): Likewise.
15307 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15309 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
15310 (__ST3Q_LANE_FUNC): Delete.
15311 (vst3_lane_f16): Use __builtin_memcpy to copy vector
15312 structure instead of constructing __builtin_aarch64_simd_ci
15313 one vector at a time.
15314 (vst3_lane_f32): Likewise.
15315 (vst3_lane_f64): Likewise.
15316 (vst3_lane_p8): Likewise.
15317 (vst3_lane_p16): Likewise.
15318 (vst3_lane_p64): Likewise.
15319 (vst3_lane_s8): Likewise.
15320 (vst3_lane_s16): Likewise.
15321 (vst3_lane_s32): Likewise.
15322 (vst3_lane_s64): Likewise.
15323 (vst3_lane_u8): Likewise.
15324 (vst3_lane_u16): Likewise.
15325 (vst3_lane_u32): Likewise.
15326 (vst3_lane_u64): Likewise.
15327 (vst3_lane_bf16): Likewise.
15328 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
15329 structure instead of using a union.
15330 (vst3q_lane_f32): Likewise.
15331 (vst3q_lane_f64): Likewise.
15332 (vst3q_lane_p8): Likewise.
15333 (vst3q_lane_p16): Likewise.
15334 (vst3q_lane_p64): Likewise.
15335 (vst3q_lane_s8): Likewise.
15336 (vst3q_lane_s16): Likewise.
15337 (vst3q_lane_s32): Likewise.
15338 (vst3q_lane_s64): Likewise.
15339 (vst3q_lane_u8): Likewise.
15340 (vst3q_lane_u16): Likewise.
15341 (vst3q_lane_u32): Likewise.
15342 (vst3q_lane_u64): Likewise.
15343 (vst3q_lane_bf16): Likewise.
15345 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
15347 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
15348 (__ST4Q_LANE_FUNC): Delete.
15349 (vst4_lane_f16): Use __builtin_memcpy to copy vector
15350 structure instead of constructing __builtin_aarch64_simd_xi
15351 one vector at a time.
15352 (vst4_lane_f32): Likewise.
15353 (vst4_lane_f64): Likewise.
15354 (vst4_lane_p8): Likewise.
15355 (vst4_lane_p16): Likewise.
15356 (vst4_lane_p64): Likewise.
15357 (vst4_lane_s8): Likewise.
15358 (vst4_lane_s16): Likewise.
15359 (vst4_lane_s32): Likewise.
15360 (vst4_lane_s64): Likewise.
15361 (vst4_lane_u8): Likewise.
15362 (vst4_lane_u16): Likewise.
15363 (vst4_lane_u32): Likewise.
15364 (vst4_lane_u64): Likewise.
15365 (vst4_lane_bf16): Likewise.
15366 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
15367 structure instead of using a union.
15368 (vst4q_lane_f32): Likewise.
15369 (vst4q_lane_f64): Likewise.
15370 (vst4q_lane_p8): Likewise.
15371 (vst4q_lane_p16): Likewise.
15372 (vst4q_lane_p64): Likewise.
15373 (vst4q_lane_s8): Likewise.
15374 (vst4q_lane_s16): Likewise.
15375 (vst4q_lane_s32): Likewise.
15376 (vst4q_lane_s64): Likewise.
15377 (vst4q_lane_u8): Likewise.
15378 (vst4q_lane_u16): Likewise.
15379 (vst4q_lane_u32): Likewise.
15380 (vst4q_lane_u64): Likewise.
15381 (vst4q_lane_bf16): Likewise.
15383 2021-08-06 Martin Liska <mliska@suse.cz>
15385 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
15386 a target option is restored, it can have
15387 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
15388 and error should not be emitted.
15390 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
15392 * gcov-io.h (gcov_write): Declare.
15393 * gcov-io.c (gcov_write): New.
15394 (gcov_write_counter): Remove.
15395 (gcov_write_tag_length): Likewise.
15396 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
15397 gcov_write_unsigned().
15398 * doc/invoke.texi (fprofile-info-section): Mention
15399 __gcov_info_to_gdca().
15401 2021-08-06 Martin Sebor <msebor@redhat.com>
15403 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
15404 arguments to by-reference.
15405 (iterate_fix_dominators): Same.
15406 * dominance.h (iterate_fix_dominators): Same.
15407 * ipa-prop.h: Call auto_vec::to_vec_legacy.
15408 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
15409 arguments to by-reference.
15410 (debug_data_dependence_relation): Same.
15411 (dump_data_dependence_relations): Same.
15412 * tree-data-ref.h (debug_data_dependence_relation): Same.
15413 (dump_data_dependence_relations): Same.
15414 * tree-predcom.c (dump_chains): Same.
15415 (initialize_root_vars_lm): Same.
15416 (determine_unroll_factor): Same.
15417 (replace_phis_by_defined_names): Same.
15418 (insert_init_seqs): Same.
15419 (pcom_worker::tree_predictive_commoning_loop): Call
15420 auto_vec::to_vec_legacy.
15421 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
15422 arguments to by-reference.
15423 * tree-ssa-threadbackward.c (populate_worklist): Same.
15424 (back_threader::resolve_def): Same.
15425 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
15426 (vect_enhance_data_refs_alignment): Same.
15427 (vect_check_lower_bound): Same.
15428 (vect_prune_runtime_alias_test_list): Same.
15429 (vect_permute_store_chain): Same.
15430 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
15431 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
15432 * tree-vectorizer.h (vect_permute_store_chain): Same.
15433 * vec.c (test_init): New function.
15434 (vec_c_tests): Call new function.
15435 * vec.h (vec): Declare ctors, dtor, and assignment.
15436 (auto_vec::vec_to_legacy): New function.
15437 (vec::copy): Adjust initialization.
15439 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
15442 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
15443 callee only uses GPRs.
15444 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
15445 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
15446 * config/i386/x86gprintrin.h: Add
15447 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
15448 to disable non-GPR ISAs.
15450 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15452 PR middle-end/101787
15453 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
15455 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
15457 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
15458 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
15459 (vect_is_extending_load, vect_is_integer_truncation): New functions,
15460 moved from aarch64.c but given different names.
15461 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
15462 (aarch64_is_reduction, aarch64_reduc_type)
15463 (aarch64_embedded_comparison_type, aarch64_comparison_type)
15464 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
15465 in favor of the above. Update callers accordingly.
15467 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15470 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
15471 writing .cpu directive in asm output.
15472 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
15473 (arm_last_printed_arch_string): Delete.
15474 (arm_last-printed_fpu_string): Delete.
15475 (arm_configure_build_target): If use of floating-point/SIMD is
15476 disabled, remove all fp/simd related features from the target ISA.
15477 (last_arm_targ_options): New variable.
15478 (arm_print_asm_arch_directives): Add new parameters. Change order
15479 of emitted directives and handle all cases here.
15480 (arm_file_start): Always call arm_print_asm_arch_directives, move
15481 all generation of .arch/.arch_extension here.
15482 (arm_file_end): Call arm_print_asm_arch.
15483 (arm_declare_function_name): Call arm_print_asm_arch_directives
15484 instead of printing .arch/.fpu directives directly.
15486 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15488 * config/arm/arm.c (arm_configure_build_target): Don't call
15489 arm_option_reconfigure_globals.
15490 (arm_option_restore): Call arm_option_reconfigure_globals after
15491 reconfiguring the target.
15492 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
15494 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
15496 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
15497 arch_name is always set.
15499 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
15501 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
15502 of vec_select high-half from being added into Neon subtract
15505 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
15507 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
15508 of vec_select high-half from being added into Neon add cost.
15510 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
15512 * cfgloop.h (loops_list::loops_list): Add one optional argument
15513 root and adjust accordingly, update loop tree walking and factor
15515 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
15517 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
15519 PR tree-optimization/101626
15520 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
15521 reverse scalar storage order on a pointer or vector component.
15523 2021-08-05 liuhongt <hongtao.liu@intel.com>
15525 * config/i386/sse.md (cond_<code><mode>): New expander.
15527 2021-08-05 liuhongt <hongtao.liu@intel.com>
15529 * config/i386/sse.md (cond_<code><mode>): New expander.
15531 2021-08-05 liuhongt <hongtao.liu@intel.com>
15533 * config/i386/sse.md (cond_<code><mode>): New expander.
15535 2021-08-04 David Malcolm <dmalcolm@redhat.com>
15538 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
15540 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15543 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
15544 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
15546 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15549 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
15550 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
15551 data with SSE register from one memory location to another.
15553 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15555 * config/s390/s390.c (expand_perm_with_vpdi): New function.
15556 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
15557 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
15558 parameterized expander.
15559 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
15561 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15563 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
15564 (struct expand_vec_perm_d): Define struct.
15565 (expand_perm_with_merge): New function.
15566 (vectorize_vec_perm_const_1): New function.
15567 (s390_vectorize_vec_perm_const): New function.
15568 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
15570 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15572 * config/s390/vector.md (V_HW_64): Remove mode iterator.
15573 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
15574 * config/s390/vx-builtins.md
15575 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
15578 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15580 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
15582 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
15584 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
15585 instead of an unspec.
15587 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
15589 * config/s390/s390-modes.def: Add more vector modes to support
15590 concatenation of two vectors.
15591 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
15593 (s390_expand_merge): Likewise.
15594 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
15595 (s390_expand_merge): New function.
15596 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
15597 Remove constant definitions.
15598 * config/s390/vector.md (V_HW_2): Add mode iterators.
15599 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
15600 (vec_2x_nelts, vec_2x_wide): New mode attributes.
15601 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
15602 New pattern definitions.
15603 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
15604 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
15605 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
15606 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
15608 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
15610 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
15611 emit vec merge pattern.
15613 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
15615 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
15617 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
15618 vec_select high-half from being added into Neon multiply
15620 * rtlanal.c (vec_series_highpart_p): Define.
15621 * rtlanal.h (vec_series_highpart_p): Declare.
15623 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
15625 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
15627 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
15628 vec_select cost from being added into Neon multiply cost.
15630 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15632 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
15633 which old_loop_vinfo is an epilogue loop that handles a constant
15634 number of iterations.
15636 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15638 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
15639 when a reanalyzed loop fails to be cheaper than the current
15642 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
15644 * config/aarch64/aarch64.c: Fix a typo.
15646 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
15648 PR gcov-profile/101773
15649 * gcov-io.c (gcov_close): Check return code of a fclose.
15651 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
15654 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
15655 .file statement when needed.
15657 2021-08-04 Richard Biener <rguenther@suse.de>
15659 * tree-vect-data-refs.c (vect_check_gather_scatter):
15660 Include widening conversions only when the result is
15661 still handed by native gather or the current offset
15662 size not already matches the data size.
15663 Also succeed analysis in case there's no native support,
15664 noted by a IFN_LAST ifn and a NULL decl.
15665 (vect_analyze_data_refs): Always consider gathers.
15666 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
15667 Test for no IFN gather rather than decl gather.
15668 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
15669 gather-scatter info and cost emulated gathers accordingly.
15670 (vect_truncate_gather_scatter_offset): Properly test for
15672 (vect_use_strided_gather_scatters_p): Likewise.
15673 (get_load_store_type): Handle emulated gathers and its
15675 (vectorizable_load): Likewise. Emulate them by extracting
15676 scalar offsets, doing scalar loads and a vector construct.
15678 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
15681 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
15682 argument to set m_max_size.
15683 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
15684 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
15685 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
15687 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
15688 Marc Glisse <marc.glisse@inria.fr>
15690 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
15691 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
15692 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
15694 2021-08-04 Richard Biener <rguenther@suse.de>
15696 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
15697 out code to decompose vector loads ...
15698 (optimize_vector_load): ... here. Generalize it to
15699 handle intermediate widening and TARGET_MEM_REF loads
15700 and apply it to loads with a supported vector mode as well.
15702 2021-08-04 Richard Biener <rguenther@suse.de>
15704 PR tree-optimization/101756
15705 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
15706 the result of the reduction epilogue is compatible to the original
15709 2021-08-04 liuhongt <hongtao.liu@intel.com>
15712 * config/i386/i386.md (peephole2): Refine predicate from
15713 register_operand to general_reg_operand.
15715 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
15717 * gimple-range-path.h (path_range_query::dump): Mark override.
15719 2021-08-04 Richard Biener <rguenther@suse.de>
15721 PR tree-optimization/101769
15722 * tree-tailcall.c (eliminate_tail_call): Add the created loop
15723 for the first recursion and return it via the new output parameter.
15724 (optimize_tail_call): Pass through new output param.
15725 (tree_optimize_tail_calls_1): After creating all latches,
15726 add the created loop to the loop tree. Do not mark loops for fixup.
15728 2021-08-04 Martin Liska <mliska@suse.cz>
15730 * doc/invoke.texi: Document threader-mode param.
15732 2021-08-04 liuhongt <hongtao.liu@intel.com>
15734 * config/i386/sse.md (cond_fma<mode>): New expander.
15735 (cond_fms<mode>): Ditto.
15736 (cond_fnma<mode>): Ditto.
15737 (cond_fnms<mode>): Ditto.
15739 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
15741 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
15743 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
15745 * config/rs6000/constraints.md: Remove "e" from the list of available
15746 constraint characters.
15748 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
15750 PR gcov-profile/71672
15751 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
15753 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
15755 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
15757 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
15759 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
15760 try XMM31 to avoid vzeroupper.
15762 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15764 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
15765 -mcpu=neoverse-512tvb.
15766 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
15767 * config/aarch64/aarch64-tune.md: Regenerate.
15768 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
15769 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
15770 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
15771 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
15772 (aarch64_adjust_body_cost): Likewise.
15774 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15776 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
15777 record issue information for operations that occur in the
15780 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15782 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
15783 parameter. Detect cases in which an Advanced SIMD MLA would almost
15784 certainly require a MOV.
15785 (aarch64_count_ops): Update accordingly.
15787 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15789 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
15790 function, split out from...
15791 (aarch64_detect_vector_stmt_subtype): ...here.
15792 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
15794 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15796 * config/aarch64/aarch64-protos.h (sve_vec_cost):
15797 Add gather_load_x32_cost and gather_load_x64_cost.
15798 * config/aarch64/aarch64.c (generic_sve_vector_cost)
15799 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
15800 accordingly, using the values given by the scalar_load * number
15801 of elements calculation that we used previously.
15802 (aarch64_detect_vector_stmt_subtype): Use the new fields.
15804 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15806 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
15807 function, split out from...
15808 (aarch64_adjust_body_cost): ...here.
15810 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15812 * config/aarch64/fractional-cost.h: New file.
15813 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
15814 and cost_fraction.h.
15815 (vec_cost_fraction): New typedef.
15816 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
15817 (aarch64_detect_vector_stmt_subtype): Likewise.
15818 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
15819 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
15821 (aarch64_adjust_body_cost): Likewise.
15822 (aarch64_test_cost_fraction): New function.
15823 (aarch64_run_selftests): Call it.
15825 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
15827 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
15829 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
15831 (aarch64_estimated_poly_value): Likewise. Use the least significant
15832 set bit for the minimum and likely values. Use the most significant
15833 set bit for the maximum value.
15835 2021-08-03 liuhongt <hongtao.liu@intel.com>
15837 * config/i386/sse.md (cond_<insn><mode>): New expander.
15838 (cond_mul<mode>): Ditto.
15840 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
15842 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
15844 2021-08-03 liuhongt <hongtao.liu@intel.com>
15846 * config/i386/sse.md (cond_<insn><mode>):New expander.
15847 (cond_mul<mode>): Ditto.
15848 (cond_div<mode>): Ditto.
15850 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
15852 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
15853 check stack_realign_needed for stack realignment.
15854 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
15855 than the largest integer supported by vector register.
15856 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
15857 (MOVE_MAX): Set to bytes of the largest integer supported by
15859 (STORE_MAX_PIECES): New.
15861 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
15863 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
15864 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
15865 data from one memory location to another.
15867 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
15869 PR middle-end/90773
15870 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
15872 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
15874 PR tree-optimization/101724
15875 * params.opt: Remove --param=threader-iterative.
15876 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
15879 2021-08-02 Tom de Vries <tdevries@suse.de>
15881 PR middle-end/101665
15882 * doc/extend.texi (nonnull attribute): Improve documentation.
15884 2021-08-02 Andrew Pinski <apinski@marvell.com>
15886 PR rtl-optimization/101683
15887 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
15889 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
15891 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
15892 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
15893 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
15894 (tree_ssa_phiop_worker): Update call to function above.
15896 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
15899 * config/i386/i386.md (bsr_rex64_1_zext): New.
15900 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
15901 with gen_bsr_rex64_1_zext.
15903 2021-07-31 Jakub Jelinek <jakub@redhat.com>
15906 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
15907 define_insn patterns.
15908 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
15909 Add combine splitters for constant - clz.
15910 (clz<mode>2): Use a temporary pseudo for bsr result.
15912 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
15914 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
15915 _mm_floor_sd, _mm_floor_ss): New.
15917 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
15919 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
15920 _mm_ceil_sd, _mm_ceil_ss): New.
15922 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
15924 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
15925 _mm_blend_ps, _mm_blendv_ps): New.
15927 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
15928 Uroš Bizjak <ubizjak@gmail.com>
15930 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
15931 to generate a conditional move using the carry flag after sub $1.
15932 (peephole2): Eliminate a register-to-register move by inverting
15933 the condition of a conditional move.
15935 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
15937 * config/mmix/mmix.md ("call", "call_value", "*call_real")
15938 ("*call_value_real"): Don't generate rtx mentioning the generic
15939 operands 1 and 2 to "call", and similarly for "call_value".
15940 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
15941 (mmix_print_operand): Use '!' instead of 'p'.
15943 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
15945 * doc/md.texi (call): Correct information about operand 2.
15946 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
15948 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
15950 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
15952 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
15954 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
15956 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
15958 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
15959 const_basic_block..
15960 (*::get_bb_range): Ditto.
15961 (*::bb_range_p): Ditto.
15962 * gimple-range-cache.h: Change prototypes.
15964 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
15966 PR middle-end/90773
15967 * builtins.c (builtin_memcpy_read_str): Change the mode argument
15968 from scalar_int_mode to fixed_size_mode.
15969 (builtin_strncpy_read_str): Likewise.
15970 (gen_memset_value_from_prev): New function.
15971 (builtin_memset_read_str): Change the mode argument from
15972 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
15973 and support CONST_VECTOR.
15974 (builtin_memset_gen_str): Likewise.
15975 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
15977 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
15978 with fixed_size_mode.
15979 (builtin_memset_read_str): Likewise.
15980 * expr.c (widest_int_mode_for_size): Renamed to ...
15981 (widest_fixed_size_mode_for_size): Add a bool argument to
15982 indicate if QI vector mode can be used.
15983 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
15984 instead of widest_int_mode_for_size.
15985 (pieces_addr::adjust): Change the mode argument from
15986 scalar_int_mode to fixed_size_mode.
15987 (op_by_pieces_d): Make m_len read-only. Add a bool member,
15988 m_qi_vector_mode, to indicate that QI vector mode can be used.
15989 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
15990 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
15991 instead of widest_int_mode_for_size.
15992 (op_by_pieces_d::get_usable_mode): Change the mode argument from
15993 scalar_int_mode to fixed_size_mode. Call
15994 widest_fixed_size_mode_for_size instead of
15995 widest_int_mode_for_size.
15996 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
15997 function to return the smallest integer or QI vector mode.
15998 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
15999 instead of widest_int_mode_for_size. Call
16000 smallest_fixed_size_mode_for_size instead of
16001 smallest_int_mode_for_size.
16002 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
16003 indicate that QI vector mode can be used and pass it to
16004 op_by_pieces_d::op_by_pieces_d.
16005 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
16006 instead of widest_int_mode_for_size. Pass memsetp to
16007 widest_fixed_size_mode_for_size to support QI vector mode.
16008 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
16009 (store_by_pieces): Pass memsetp to
16010 store_by_pieces_d::store_by_pieces_d.
16011 (clear_by_pieces_1): Removed.
16012 (clear_by_pieces): Replace clear_by_pieces_1 with
16013 builtin_memset_read_str and pass true to store_by_pieces_d to
16014 support vector mode broadcast.
16015 (string_cst_read_str): Change the mode argument from
16016 scalar_int_mode to fixed_size_mode.
16017 * expr.h (by_pieces_constfn): Change scalar_int_mode to
16019 (by_pieces_prev): Likewise.
16020 * rtl.h (lowpart_subreg_regno): New.
16021 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
16022 simplify_subreg_regno.
16023 * target.def (gen_memset_scratch_rtx): New hook.
16024 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
16025 * doc/tm.texi: Regenerated.
16027 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16030 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
16031 TARGET_EXPR instead of MODIFY_EXPR.
16033 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16036 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
16037 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
16038 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
16040 (vec_cmpu<IMSA:mode><mode_i>): New expander.
16042 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
16045 * config/i386/i386-options.c (ix86_option_override_internal):
16046 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
16048 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
16051 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
16052 explicitly dereferencing __a.
16053 (vld1_s64): Likewise.
16054 (vld1_u64): Likewise.
16055 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
16056 and change to VAR13.
16058 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
16060 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
16061 use of m_range_analyzer.
16062 (loop_versioning::lv_dom_walker::before_dom_children): Same.
16063 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
16064 (loop_versioning::prune_loop_conditions): Replace vr_values use
16065 with range_query interface.
16066 (pass_loop_versioning::execute): Use ranger.
16068 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
16071 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
16072 enum values, and emit a warning if they mismatch.
16074 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
16076 * cfgloop.h (as_const): New function.
16077 (class loop_iterator): Rename to ...
16078 (class loops_list): ... this.
16079 (loop_iterator::next): Rename to ...
16080 (loops_list::Iter::fill_curr_loop): ... this and adjust.
16081 (loop_iterator::loop_iterator): Rename to ...
16082 (loops_list::loops_list): ... this and adjust.
16083 (loops_list::Iter): New class.
16084 (loops_list::iterator): New type.
16085 (loops_list::const_iterator): New type.
16086 (loops_list::begin): New function.
16087 (loops_list::end): Likewise.
16088 (loops_list::begin const): Likewise.
16089 (loops_list::end const): Likewise.
16090 (FOR_EACH_LOOP): Remove.
16091 (FOR_EACH_LOOP_FN): Remove.
16092 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
16093 for loop with loops_list instance.
16094 (sort_sibling_loops): Likewise.
16095 (disambiguate_loops_with_multiple_latches): Likewise.
16096 (verify_loop_structure): Likewise.
16097 * cfgloopmanip.c (create_preheaders): Likewise.
16098 (force_single_succ_latches): Likewise.
16099 * config/aarch64/falkor-tag-collision-avoidance.c
16100 (execute_tag_collision_avoidance): Likewise.
16101 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
16102 * config/s390/s390.c (s390_adjust_loops): Likewise.
16103 * doc/loop.texi: Likewise.
16104 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
16105 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
16106 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
16107 (loop_versioning::make_versioning_decisions): Likewise.
16108 * gimple-ssa-split-paths.c (split_paths): Likewise.
16109 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
16110 * graphite.c (canonicalize_loop_form): Likewise.
16111 (graphite_transform_loops): Likewise.
16112 * ipa-fnsummary.c (analyze_function_body): Likewise.
16113 * ipa-pure-const.c (analyze_function): Likewise.
16114 * loop-doloop.c (doloop_optimize_loops): Likewise.
16115 * loop-init.c (loop_optimizer_finalize): Likewise.
16116 (fix_loop_structure): Likewise.
16117 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
16118 (move_loop_invariants): Likewise.
16119 * loop-unroll.c (decide_unrolling): Likewise.
16120 (unroll_loops): Likewise.
16121 * modulo-sched.c (sms_schedule): Likewise.
16122 * predict.c (predict_loops): Likewise.
16123 (pass_profile::execute): Likewise.
16124 * profile.c (branch_prob): Likewise.
16125 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
16126 (sel_find_rgns): Likewise.
16127 * tree-cfg.c (replace_loop_annotate): Likewise.
16128 (replace_uses_by): Likewise.
16129 (move_sese_region_to_fn): Likewise.
16130 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
16131 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
16132 * tree-parloops.c (parallelize_loops): Likewise.
16133 * tree-predcom.c (tree_predictive_commoning): Likewise.
16134 * tree-scalar-evolution.c (scev_initialize): Likewise.
16135 (scev_reset): Likewise.
16136 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
16137 * tree-ssa-live.c (remove_unused_locals): Likewise.
16138 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
16139 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
16140 (tree_ssa_lim_initialize): Likewise.
16141 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
16142 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
16143 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
16144 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
16145 (free_numbers_of_iterations_estimates): Likewise.
16146 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
16147 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
16148 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
16149 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
16150 (pass_scev_cprop::execute): Likewise.
16151 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
16152 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
16153 * tree-ssa-threadupdate.c
16154 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
16155 * tree-vectorizer.c (vectorize_loops): Likewise.
16156 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
16158 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
16160 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
16161 generating a VOIDmode register for e.g the
16162 function_arg_info::end_marker.
16164 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
16166 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
16167 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
16168 (cc mode_attr): Similarly.
16169 (ccz subst_attr): Similarly.
16170 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
16171 * config/h8300/testcompare.md: Remove various cc0 based patterns
16172 that had been commented out. Add pattern to set CCZ from a bit
16175 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
16176 Julian Brown <julian@codesourcery.com>
16177 Kwok Cheung Yeung <kcy@codesourcery.com>
16179 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
16180 'update_stmt' after modification.
16181 (pass_oacc_loop_designation): New function, extracted out of...
16182 (pass_oacc_device_lower): ... this.
16183 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
16184 (make_pass_oacc_loop_designation): New
16185 * passes.def: Add it.
16186 * tree-parloops.c (create_parallel_loop): Adjust.
16187 * tree-pass.h (make_pass_oacc_loop_designation): New.
16189 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
16191 * flag-types.h (enum threader_mode): New.
16192 * params.opt: Add entry for --param=threader-mode.
16193 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
16194 (class back_threader): New.
16195 (back_threader::back_threader): New.
16196 (back_threader::~back_threader): New.
16197 (back_threader::maybe_register_path): New.
16198 (back_threader::find_taken_edge): New.
16199 (back_threader::find_taken_edge_switch): New.
16200 (back_threader::find_taken_edge_cond): New.
16201 (back_threader::resolve_def): New.
16202 (back_threader::resolve_phi): New.
16203 (back_threader::find_paths_to_names): New.
16204 (back_threader::find_paths): New.
16207 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
16208 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
16209 (pass_thread_jumps::execute): Abstract out code...
16210 (try_thread_blocks): ...here.
16211 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
16212 Abstract out threading candidate code to...
16213 (single_succ_to_potentially_threadable_block): ...here.
16214 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
16216 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
16217 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
16218 Return bool from register_jump_thread.
16220 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
16222 * target.def: in0 and in1 do not need to be registers.
16223 * doc/tm.texi: Regenerate.
16225 2021-07-29 liuhongt <hongtao.liu@intel.com>
16228 * config/i386/i386.c (ix86_widen_mult_cost): New function.
16229 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
16232 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
16235 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
16236 (rs6000_preferred_doloop_mode): New hook.
16237 * doc/tm.texi: Regenerate.
16238 * doc/tm.texi.in: Add hook preferred_doloop_mode.
16239 * target.def (preferred_doloop_mode): New hook.
16240 * targhooks.c (default_preferred_doloop_mode): New hook.
16241 * targhooks.h (default_preferred_doloop_mode): New hook.
16242 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
16243 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
16244 and compute_doloop_base_on_mode.
16246 2021-07-28 Martin Sebor <msebor@redhat.com>
16248 PR middle-end/101494
16249 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
16250 and size computation.
16252 2021-07-28 Martin Sebor <msebor@redhat.com>
16254 PR middle-end/101601
16255 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
16257 Handle pointers to functions.
16259 2021-07-28 Martin Sebor <msebor@redhat.com>
16261 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
16262 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
16263 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
16264 (access_ref::access_ref): Same.
16265 (access_ref::phi): Same.
16266 (access_ref::get_ref): Same.
16267 (access_ref::size_remaining): Same.
16268 (access_ref::offset_in_range): Same.
16269 (access_ref::add_offset): Same.
16270 (access_ref::inform_access): Same.
16271 (ssa_name_limit_t::visit_phi): Same.
16272 (ssa_name_limit_t::leave_phi): Same.
16273 (ssa_name_limit_t::next): Same.
16274 (ssa_name_limit_t::next_phi): Same.
16275 (ssa_name_limit_t::~ssa_name_limit_t): Same.
16276 (pointer_query::pointer_query): Same.
16277 (pointer_query::get_ref): Same.
16278 (pointer_query::put_ref): Same.
16279 (pointer_query::flush_cache): Same.
16280 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
16281 (check_nul_terminated_array): Same.
16282 (unterminated_array): Same.
16283 (maybe_warn_for_bound): Same.
16284 (check_read_access): Same.
16285 (warn_for_access): Same.
16286 (get_size_range): Same.
16287 (check_access): Same.
16288 (gimple_call_alloc_size): Move to tree.c.
16289 (gimple_parm_array_size): Move to pointer-query.cc.
16290 (get_offset_range): Same.
16291 (gimple_call_return_array): Same.
16292 (handle_min_max_size): Same.
16293 (handle_array_ref): Same.
16294 (handle_mem_ref): Same.
16295 (compute_objsize): Same.
16296 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
16297 (call_dealloc_argno): Same.
16298 (fndecl_dealloc_argno): Same.
16299 (new_delete_mismatch_p): Same.
16300 (matching_alloc_calls_p): Same.
16301 (warn_dealloc_offset): Same.
16302 (maybe_emit_free_warning): Same.
16303 * builtins.h (check_nul_terminated_array): Move to
16304 gimple-ssa-warn-access.h.
16305 (check_nul_terminated_array): Same.
16306 (warn_string_no_nul): Same.
16307 (unterminated_array): Same.
16308 (class ssa_name_limit_t): Same.
16309 (class pointer_query): Same.
16310 (struct access_ref): Same.
16311 (class range_query): Same.
16312 (struct access_data): Same.
16313 (gimple_call_alloc_size): Same.
16314 (gimple_parm_array_size): Same.
16315 (compute_objsize): Same.
16316 (class access_data): Same.
16317 (maybe_emit_free_warning): Same.
16318 * calls.c (initialize_argument_information): Remove call to
16319 maybe_emit_free_warning.
16320 * gimple-array-bounds.cc: Include new header..
16321 * gimple-fold.c: Same.
16322 * gimple-ssa-sprintf.c: Same.
16323 * gimple-ssa-warn-restrict.c: Same.
16324 * passes.def: Add pass_warn_access.
16325 * tree-pass.h (make_pass_warn_access): Declare.
16326 * tree-ssa-strlen.c: Include new headers.
16327 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
16328 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
16329 * gimple-ssa-warn-access.cc: New file.
16330 * gimple-ssa-warn-access.h: New file.
16331 * pointer-query.cc: New file.
16332 * pointer-query.h: New file.
16334 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16336 PR middle-end/101624
16337 * ubsan.c (maybe_instrument_pointer_overflow,
16338 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
16339 PARM_DECLs or RESULT_DECLs.
16340 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
16342 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16344 PR middle-end/101642
16345 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
16346 to type of bswap16 for comparison.
16347 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
16349 2021-07-28 Richard Biener <rguenther@suse.de>
16351 PR tree-optimization/101615
16352 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
16353 at CTOR SLP graph entries.
16355 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
16357 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
16358 Add "r,w" alternative.
16360 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
16363 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
16364 AVX_U128_DIRTY when all bits are zero.
16366 2021-07-28 Richard Biener <rguenther@suse.de>
16368 PR tree-optimization/101615
16369 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
16370 external nodes cannot be permuted so make them perm_out 0.
16372 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
16375 * config.in: Regenerate.
16376 * config/gcn/gcn-hsa.h (A_FIJI): New define.
16377 (A_900): New define.
16378 (A_906): New define.
16379 (A_908): New define.
16380 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
16381 * config/gcn/gcn.c (output_file_start): Adjust attributes according
16382 to the assembler capabilities.
16383 * config/gcn/mkoffload.c (main): Likewise.
16384 * configure: Regenerate.
16385 * configure.ac: Add tests for LLVM assembler attribute features.
16387 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
16389 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
16390 cond_false and cond_true on branches.
16392 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16394 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
16397 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
16399 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
16401 2021-07-28 Jakub Jelinek <jakub@redhat.com>
16404 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
16405 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
16406 and has special !TARGET_AVX512VL expansion.
16407 (vashrv2di3<mask_name>): Rename to ...
16408 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
16409 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
16411 2021-07-28 Martin Uecker <muecker@gwdg.de>
16413 * calls.c (maybe_warn_rdwr_sizes): Correct argument
16414 numbers in warning that were switched.
16416 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
16418 PR tree-optimization/101596
16419 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
16420 by using new_type's precision instead.
16422 2021-07-28 liuhongt <hongtao.liu@intel.com>
16425 * config/i386/i386.h (processor_costs): Add new member
16427 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
16428 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
16429 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
16430 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
16431 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
16432 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
16433 generic_cost, core_cost): Initialize integer_to_sse same value
16435 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
16436 * config/i386/i386.c (ix86_builtin_vectorization_cost):
16437 Use integer_to_sse instead of sse_op to calculate the cost of
16440 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16442 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
16444 (write_init_file): Call write_ovld_static_init.
16446 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16448 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
16450 (write_init_file): Call write_bif_static_init.
16452 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
16454 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
16455 (TYPE_MAP_SIZE): New macro.
16456 (type_map): New initialized variable.
16457 (typemap_cmp): New function.
16458 (write_type_node): Likewise.
16459 (write_fntype_init): Implement.
16461 2021-07-27 Martin Sebor <msebor@redhat.com>
16463 PR tree-optimization/101584
16464 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
16465 (check_defs): Call it.
16467 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16469 * tree-ssa-dom.c (dom_jump_threader_simplifier):
16470 Put avail_exprs_stack in the class, instead of passing it to
16471 jump_threader_simplifier.
16472 (dom_jump_threader_simplifier::simplify): Add state argument.
16473 (dom_opt_dom_walker): Add state.
16474 (pass_dominator::execute): Pass state to threader.
16475 (dom_opt_dom_walker::before_dom_children): Use state.
16476 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
16477 arguments by state.
16478 (jump_threader::record_temporary_equivalences_from_phis):
16479 Register equivalences through the state variable.
16480 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
16481 Record ranges in a statement through the state variable.
16482 (jump_threader::simplify_control_stmt_condition): Pass state to
16484 (jump_threader::simplify_control_stmt_condition_1): Same.
16485 (jump_threader::thread_around_empty_blocks): Remove obsolete
16487 (jump_threader::thread_through_normal_block): Record equivalences
16488 on edge through the state variable.
16489 (jump_threader::thread_across_edge): Abstract state pushing.
16490 (jt_state::jt_state): New.
16491 (jt_state::push): New.
16492 (jt_state::pop): New.
16493 (jt_state::register_equiv): New.
16494 (jt_state::record_ranges_from_stmt): New.
16495 (jt_state::register_equivs_on_edge): New.
16496 (jump_threader_simplifier::jump_threader_simplifier): Move from
16498 (jump_threader_simplifier::simplify): Add state argument.
16499 * tree-ssa-threadedge.h (class jt_state): New.
16500 (class jump_threader): Add state to constructor.
16501 (class jump_threader_simplifier): Add state to simplify. Remove
16502 avail_exprs_stack from class.
16503 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
16505 (vrp_jump_threader::vrp_jump_threader): Add state.
16506 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
16508 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
16510 * Makefile.in (OBJS): Add gimple-range-path.o.
16511 * gimple-range-path.cc: New file.
16512 * gimple-range-path.h: New file.
16514 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
16516 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
16517 inside vec_duplicate for all patterns.
16518 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
16519 Push sign/zero-extension inside vec_duplicate.
16521 2021-07-27 Richard Biener <rguenther@suse.de>
16523 PR tree-optimization/101573
16524 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
16525 looking at uninitialized PHI arg defs in some constrained cases.
16526 (warn_uninitialized_vars): Call it.
16527 (execute_early_warn_uninitialized): Calculate dominators.
16529 2021-07-27 Richard Biener <rguenther@suse.de>
16531 PR tree-optimization/39821
16532 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
16533 vector_stmt for widening arithmetic.
16534 (vectorizable_conversion): Adjust.
16536 2021-07-27 Martin Jambor <mjambor@suse.cz>
16538 * cgraph.h (ipa_replace_map): New field force_load_ref.
16539 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
16540 aded new flag load_dereferenced, adjusted comments.
16541 (ipa_get_param_dereferenced): New function.
16542 (ipa_set_param_dereferenced): Likewise.
16543 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
16544 * ipa-cp.c: Include gimple.h.
16545 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
16546 (get_replacement_map): New parameter force_load_ref, set the
16547 appropriate flag in ipa_replace_map if set.
16548 (struct symbol_and_index_together): New type.
16549 (adjust_refs_in_act_callers): New function.
16550 (adjust_references_in_caller): Likewise.
16551 (create_specialized_node): When appropriate, call
16552 adjust_references_in_caller and force only load references.
16553 * ipa-prop.c (load_from_dereferenced_name): New function.
16554 (ipa_analyze_controlled_uses): Also detect loads from a
16555 dereference, harden testing of call statements.
16556 (ipa_write_node_info): Stream the dereferenced flag.
16557 (ipa_read_node_info): Likewise.
16558 (ipa_set_jf_constant): Also create refdesc when jump function
16559 references a variable.
16560 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
16561 also on references of variables and return a symtab_node. Adjust
16563 (propagate_controlled_uses): Also remove references to VAR_DECLs.
16565 2021-07-27 Jakub Jelinek <jakub@redhat.com>
16567 PR middle-end/101586
16568 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
16569 positions above or equal to sz except for diagnostics of flexible
16572 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
16574 PR tree-optimization/78888
16575 * gimple-range-fold.cc (get_letter_range): New.
16576 (fold_using_range::range_of_builtin_call): Call get_letter_range.
16578 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
16580 PR tree-optimization/78888
16581 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
16582 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
16584 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
16585 Marc Glisse <marc.glisse@inria.fr>
16587 * match.pd (rotate): Simplify equality/inequality of rotations.
16588 (bswap): Simplify equality/inequality tests of byte swapping.
16590 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16592 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
16595 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16597 * range-op.cc (operator_lshift::fold_range): Pass rel to
16598 base class fold_range.
16599 (operator_rshift::fold_range): Same.
16601 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
16604 * toplev.h (min_align_loops_log): Remove declaration.
16605 (min_align_jumps_log, min_align_labels_log): Likewise.
16606 (min_align_functions_log): Likewise.
16608 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16610 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
16612 (execute_vrp): Abstract out simplification of conditionals...
16613 (simplify_casted_conds): ...here.
16615 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
16617 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
16618 Add gimple argument.
16619 (array_bounds_checker::check_array_ref): Same.
16620 (array_bounds_checker::check_addr_expr): Same.
16621 (array_bounds_checker::check_array_bounds): Pass statement to
16622 check_array_bounds and check_addr_expr.
16623 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
16624 (check_addr_expr): Same.
16625 (get_value_range): Same.
16627 2021-07-26 Tamar Christina <tamar.christina@arm.com>
16629 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
16630 (sdot_prod, udot_prod): ... This.
16631 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
16633 (<sur>dot_prod<vsi2qi>): ... this.
16634 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
16635 Change operands order.
16636 (<sur>sadv16qi): Use new operands order.
16637 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
16638 vdotq_s32): Use new RTL ordering.
16640 2021-07-26 Tamar Christina <tamar.christina@arm.com>
16642 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
16643 aarch64_types_ternop_suss_qualifiers): New.
16644 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
16645 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
16646 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
16648 2021-07-23 Jakub Jelinek <jakub@redhat.com>
16650 PR rtl-optimization/101562
16651 * expmed.c (store_integral_bit_field): Only use movstrict_optab
16652 if the operand isn't paradoxical.
16654 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
16656 * gimple-array-bounds.h (class array_bounds_checker): Change
16657 ranges type to range_query.
16659 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16661 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
16662 __builtin_memcpy instead of constructing
16663 __builtin_aarch64_simd_oi one vector at a time.
16664 (vst1_u64_x2): Likewise.
16665 (vst1_f64_x2): Likewise.
16666 (vst1_s8_x2): Likewise.
16667 (vst1_p8_x2): Likewise.
16668 (vst1_s16_x2): Likewise.
16669 (vst1_p16_x2): Likewise.
16670 (vst1_s32_x2): Likewise.
16671 (vst1_u8_x2): Likewise.
16672 (vst1_u16_x2): Likewise.
16673 (vst1_u32_x2): Likewise.
16674 (vst1_f16_x2): Likewise.
16675 (vst1_f32_x2): Likewise.
16676 (vst1_p64_x2): Likewise.
16677 (vst1q_s8_x2): Likewise.
16678 (vst1q_p8_x2): Likewise.
16679 (vst1q_s16_x2): Likewise.
16680 (vst1q_p16_x2): Likewise.
16681 (vst1q_s32_x2): Likewise.
16682 (vst1q_s64_x2): Likewise.
16683 (vst1q_u8_x2): Likewise.
16684 (vst1q_u16_x2): Likewise.
16685 (vst1q_u32_x2): Likewise.
16686 (vst1q_u64_x2): Likewise.
16687 (vst1q_f16_x2): Likewise.
16688 (vst1q_f32_x2): Likewise.
16689 (vst1q_f64_x2): Likewise.
16690 (vst1q_p64_x2): Likewise.
16692 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16694 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
16695 __builtin_memcpy instead of constructing
16696 __builtin_aarch64_simd_ci one vector at a time.
16697 (vst1_u64_x3): Likewise.
16698 (vst1_f64_x3): Likewise.
16699 (vst1_s8_x3): Likewise.
16700 (vst1_p8_x3): Likewise.
16701 (vst1_s16_x3): Likewise.
16702 (vst1_p16_x3): Likewise.
16703 (vst1_s32_x3): Likewise.
16704 (vst1_u8_x3): Likewise.
16705 (vst1_u16_x3): Likewise.
16706 (vst1_u32_x3): Likewise.
16707 (vst1_f16_x3): Likewise.
16708 (vst1_f32_x3): Likewise.
16709 (vst1_p64_x3): Likewise.
16710 (vst1q_s8_x3): Likewise.
16711 (vst1q_p8_x3): Likewise.
16712 (vst1q_s16_x3): Likewise.
16713 (vst1q_p16_x3): Likewise.
16714 (vst1q_s32_x3): Likewise.
16715 (vst1q_s64_x3): Likewise.
16716 (vst1q_u8_x3): Likewise.
16717 (vst1q_u16_x3): Likewise.
16718 (vst1q_u32_x3): Likewise.
16719 (vst1q_u64_x3): Likewise.
16720 (vst1q_f16_x3): Likewise.
16721 (vst1q_f32_x3): Likewise.
16722 (vst1q_f64_x3): Likewise.
16723 (vst1q_p64_x3): Likewise.
16725 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
16728 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
16729 hard register when LRA is in progress.
16731 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16733 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
16734 __builtin_memcpy instead of using a union.
16735 (vst1q_s8_x4): Likewise.
16736 (vst1_s16_x4): Likewise.
16737 (vst1q_s16_x4): Likewise.
16738 (vst1_s32_x4): Likewise.
16739 (vst1q_s32_x4): Likewise.
16740 (vst1_u8_x4): Likewise.
16741 (vst1q_u8_x4): Likewise.
16742 (vst1_u16_x4): Likewise.
16743 (vst1q_u16_x4): Likewise.
16744 (vst1_u32_x4): Likewise.
16745 (vst1q_u32_x4): Likewise.
16746 (vst1_f16_x4): Likewise.
16747 (vst1q_f16_x4): Likewise.
16748 (vst1_f32_x4): Likewise.
16749 (vst1q_f32_x4): Likewise.
16750 (vst1_p8_x4): Likewise.
16751 (vst1q_p8_x4): Likewise.
16752 (vst1_p16_x4): Likewise.
16753 (vst1q_p16_x4): Likewise.
16754 (vst1_s64_x4): Likewise.
16755 (vst1_u64_x4): Likewise.
16756 (vst1_p64_x4): Likewise.
16757 (vst1q_s64_x4): Likewise.
16758 (vst1q_u64_x4): Likewise.
16759 (vst1q_p64_x4): Likewise.
16760 (vst1_f64_x4): Likewise.
16761 (vst1q_f64_x4): Likewise.
16763 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
16765 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
16766 instead of constructing __builtin_aarch64_simd_oi one vector
16768 (vst2_u64): Likewise.
16769 (vst2_f64): Likewise.
16770 (vst2_s8): Likewise.
16771 (vst2_p8): Likewise.
16772 (vst2_s16): Likewise.
16773 (vst2_p16): Likewise.
16774 (vst2_s32): Likewise.
16775 (vst2_u8): Likewise.
16776 (vst2_u16): Likewise.
16777 (vst2_u32): Likewise.
16778 (vst2_f16): Likewise.
16779 (vst2_f32): Likewise.
16780 (vst2_p64): Likewise.
16781 (vst2q_s8): Likewise.
16782 (vst2q_p8): Likewise.
16783 (vst2q_s16): Likewise.
16784 (vst2q_p16): Likewise.
16785 (vst2q_s32): Likewise.
16786 (vst2q_s64): Likewise.
16787 (vst2q_u8): Likewise.
16788 (vst2q_u16): Likewise.
16789 (vst2q_u32): Likewise.
16790 (vst2q_u64): Likewise.
16791 (vst2q_f16): Likewise.
16792 (vst2q_f32): Likewise.
16793 (vst2q_f64): Likewise.
16794 (vst2q_p64): Likewise.
16796 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16798 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
16799 instead of constructing __builtin_aarch64_simd_ci one vector
16801 (vst3_u64): Likewise.
16802 (vst3_f64): Likewise.
16803 (vst3_s8): Likewise.
16804 (vst3_p8): Likewise.
16805 (vst3_s16): Likewise.
16806 (vst3_p16): Likewise.
16807 (vst3_s32): Likewise.
16808 (vst3_u8): Likewise.
16809 (vst3_u16): Likewise.
16810 (vst3_u32): Likewise.
16811 (vst3_f16): Likewise.
16812 (vst3_f32): Likewise.
16813 (vst3_p64): Likewise.
16814 (vst3q_s8): Likewise.
16815 (vst3q_p8): Likewise.
16816 (vst3q_s16): Likewise.
16817 (vst3q_p16): Likewise.
16818 (vst3q_s32): Likewise.
16819 (vst3q_s64): Likewise.
16820 (vst3q_u8): Likewise.
16821 (vst3q_u16): Likewise.
16822 (vst3q_u32): Likewise.
16823 (vst3q_u64): Likewise.
16824 (vst3q_f16): Likewise.
16825 (vst3q_f32): Likewise.
16826 (vst3q_f64): Likewise.
16827 (vst3q_p64): Likewise.
16829 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16831 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
16832 instead of constructing __builtin_aarch64_simd_xi one vector
16834 (vst4_u64): Likewise.
16835 (vst4_f64): Likewise.
16836 (vst4_s8): Likewise.
16837 (vst4_p8): Likewise.
16838 (vst4_s16): Likewise.
16839 (vst4_p16): Likewise.
16840 (vst4_s32): Likewise.
16841 (vst4_u8): Likewise.
16842 (vst4_u16): Likewise.
16843 (vst4_u32): Likewise.
16844 (vst4_f16): Likewise.
16845 (vst4_f32): Likewise.
16846 (vst4_p64): Likewise.
16847 (vst4q_s8): Likewise.
16848 (vst4q_p8): Likewise.
16849 (vst4q_s16): Likewise.
16850 (vst4q_p16): Likewise.
16851 (vst4q_s32): Likewise.
16852 (vst4q_s64): Likewise.
16853 (vst4q_u8): Likewise.
16854 (vst4q_u16): Likewise.
16855 (vst4q_u32): Likewise.
16856 (vst4q_u64): Likewise.
16857 (vst4q_f16): Likewise.
16858 (vst4q_f32): Likewise.
16859 (vst4q_f64): Likewise.
16860 (vst4q_p64): Likewise.
16862 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16864 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
16865 instead of constructing __builtin_aarch64_simd_oi one vector
16867 (vtbx4_u8): Likewise.
16868 (vtbx4_p8): Likewise.
16870 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16872 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
16873 instead of constructing __builtin_aarch64_simd_oi one vector
16875 (vtbl3_u8): Likewise.
16876 (vtbl3_p8): Likewise.
16877 (vtbl4_s8): Likewise.
16878 (vtbl4_u8): Likewise.
16879 (vtbl4_p8): Likewise.
16881 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16883 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
16884 instead of constructing __builtin_aarch64_simd_oi one vector
16886 (vqtbx2_u8): Likewise.
16887 (vqtbx2_p8): Likewise.
16888 (vqtbx2q_s8): Likewise.
16889 (vqtbx2q_u8): Likewise.
16890 (vqtbx2q_p8): Likewise.
16891 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
16892 __builtin_aarch64_simd_ci one vector at a time.
16893 (vqtbx3_u8): Likewise.
16894 (vqtbx3_p8): Likewise.
16895 (vqtbx3q_s8): Likewise.
16896 (vqtbx3q_u8): Likewise.
16897 (vqtbx3q_p8): Likewise.
16898 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
16899 __builtin_aarch64_simd_xi one vector at a time.
16900 (vqtbx4_u8): Likewise.
16901 (vqtbx4_p8): Likewise.
16902 (vqtbx4q_s8): Likewise.
16903 (vqtbx4q_u8): Likewise.
16904 (vqtbx4q_p8): Likewise.
16906 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
16908 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
16909 instead of constructing __builtin_aarch64_simd_oi one vector
16911 (vqtbl2_u8): Likewise.
16912 (vqtbl2_p8): Likewise.
16913 (vqtbl2q_s8): Likewise.
16914 (vqtbl2q_u8): Likewise.
16915 (vqtbl2q_p8): Likewise.
16916 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
16917 __builtin_aarch64_simd_ci one vector at a time.
16918 (vqtbl3_u8): Likewise.
16919 (vqtbl3_p8): Likewise.
16920 (vqtbl3q_s8): Likewise.
16921 (vqtbl3q_u8): Likewise.
16922 (vqtbl3q_p8): Likewise.
16923 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
16924 __builtin_aarch64_simd_xi one vector at a time.
16925 (vqtbl4_u8): Likewise.
16926 (vqtbl4_p8): Likewise.
16927 (vqtbl4q_s8): Likewise.
16928 (vqtbl4q_u8): Likewise.
16929 (vqtbl4q_p8): Likewise.
16931 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
16934 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
16936 2021-07-22 Andrew Pinski <apinski@marvell.com>
16938 PR tree-optimization/10153
16939 * tree-tailcall.c (create_tailcall_accumulator):
16940 Don't call fold_convert as the type should be correct already.
16941 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
16942 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
16944 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
16946 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
16947 varying_p check for null/non-null check.
16949 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
16951 PR tree-optimization/101511
16952 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
16953 is in ssa2's equiv set, and don't trap if so.
16955 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
16957 PR tree-optimization/101497
16958 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
16961 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
16963 PR tree-optimization/101496
16964 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
16965 first, then vrp_visit_cond_Stmt.
16967 2021-07-22 liuhongt <hongtao.liu@intel.com>
16969 * config/i386/i386-expand.c
16970 (ix86_broadcast_from_integer_constant): Rename to ..
16971 (ix86_broadcast_from_constant): .. this, and extend it to
16973 (ix86_expand_vector_move): Extend to float mode.
16974 * config/i386/i386-features.c
16975 (replace_constant_pool_with_broadcast): Remove.
16976 (remove_partial_avx_dependency_gate): Ditto.
16977 (constant_pool_broadcast): Ditto.
16978 (class pass_constant_pool_broadcast): Ditto.
16979 (make_pass_constant_pool_broadcast): Ditto.
16980 (remove_partial_avx_dependency): Adjust gate.
16981 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
16982 * config/i386/i386-protos.h
16983 (make_pass_constant_pool_broadcast): Remove.
16985 2021-07-22 liuhongt <hongtao.liu@intel.com>
16987 * config/i386/constraints.md (Wb): New constraint.
16989 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
16991 (*ashlqi3_1): Ditto.
16992 (*<insn><mode>3_1): Split to ..
16993 (*ashr<mode>3_1): this, ...
16994 (*lshr<mode>3_1): and this, also extend this pattern to avx512
16996 (*<insn><mode>3_1): Split to ..
16997 (*ashr<mode>3_1): this, ...
16998 (*lshrqi3_1): and this, also extend this pattern to avx512
17000 (*lshrhi3_1): And this, also extend this pattern to avx512
17002 * config/i386/sse.md (k<code><mode>): New define_split after
17003 it to convert generic shift pattern to mask shift ones.
17005 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
17006 Joseph Myers <joseph@codesourcery.com>
17007 Cesar Philippidis <cesar@codesourcery.com>
17009 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
17010 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
17012 * tree-pretty-print.c (dump_omp_clause): Likewise.
17013 * omp-general.c (oacc_verify_routine_clauses): Likewise.
17014 * gimplify.c (gimplify_scan_omp_clauses)
17015 (gimplify_adjust_omp_clauses): Likewise.
17016 * tree-nested.c (convert_nonlocal_omp_clauses)
17017 (convert_local_omp_clauses): Likewise.
17018 * omp-low.c (scan_sharing_clauses): Likewise.
17019 * omp-offload.c (execute_oacc_device_lower): Update.
17021 2021-07-21 Martin Sebor <msebor@redhat.com>
17023 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
17025 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17027 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
17030 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17032 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
17034 (write_fntype_init): New stub function.
17035 (write_init_bif_table): Likewise.
17036 (write_init_ovld_table): New function.
17037 (write_init_file): Implement.
17039 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17041 * config/rs6000/rs6000-gen-builtins.c
17042 (write_autogenerated_header): New function.
17043 (write_decls): Likewise.
17044 (write_extern_fntype): New callback function.
17045 (write_header_file): Implement.
17047 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17049 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
17052 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17054 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
17056 (complete_base_type): Likewise.
17057 (construct_fntype_id): Likewise.
17058 (parse_bif_entry): Call contruct_fntype_id.
17059 (parse_ovld_entry): Likewise.
17061 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17063 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
17064 (MAXOVLDSTANZAS): New macro.
17065 (ovld_stanzas): New variable.
17066 (curr_ovld_stanza): Likewise.
17067 (MAXOVLDS): New macro.
17068 (ovlddata): New struct.
17069 (ovlds): New variable.
17070 (curr_ovld): Likewise.
17071 (max_ovld_args): Likewise.
17072 (parse_ovld_entry): New function.
17073 (parse_ovld_stanza): Likewise.
17074 (parse_ovld): Implement.
17076 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17078 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
17081 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17083 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
17084 (parse_prototype): Implement.
17086 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17088 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
17089 (curr_bif_stanza): New variable.
17090 (stanza_entry): New struct.
17091 (stanza_map): New initialized variable.
17092 (enable_string): Likewise.
17093 (fnkinds): New enum.
17094 (typelist): New struct.
17095 (attrinfo): Likewise.
17096 (MAXRESTROPNDS): New macro.
17097 (prototype): New struct.
17098 (MAXBIFS): New macro.
17099 (bifdata): New struct.
17100 (bifs): New variable.
17101 (curr_bif): Likewise.
17102 (bif_order): Likewise.
17103 (bif_index): Likewise.
17104 (fatal): New function.
17105 (stanza_name_to_stanza): Likewise.
17106 (parse_bif_attrs): New stub function.
17107 (parse_prototype): Likewise.
17108 (parse_bif_entry): New function.
17109 (parse_bif_stanza): Likewise.
17110 (parse_bif): Implement.
17111 (set_bif_order): New function.
17112 (create_bif_order): Implement.
17114 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
17116 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
17117 (num_bifs): New variable.
17118 (num_ovld_stanzas): Likewise.
17119 (num_ovlds): Likewise.
17120 (parse_codes): New enum.
17121 (bif_rbt): New variable.
17122 (ovld_rbt): Likewise.
17123 (fntype_rbt): Likewise.
17124 (bifo_rbt): Likewise.
17125 (parse_bif): New stub function.
17126 (create_bif_order): Likewise.
17127 (parse_ovld): Likewise.
17128 (write_header_file): Likewise.
17129 (write_init_file): Likewise.
17130 (write_defines_file): Likewise.
17131 (delete_output_files): New function.
17134 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
17137 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
17138 from CRC32 _builtin functions.
17140 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
17142 * coverage.c (build_gcov_info_var_registration): Mark the object placed
17143 in the linker set as referenced so that it does not get optimized away.
17145 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
17148 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17150 * config.gcc (riscv*-*-*): Detect which python is available.
17152 2021-07-21 Jakub Jelinek <jakub@redhat.com>
17154 PR middle-end/101535
17155 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
17156 contexts in which decl isn't privatized and for ORT_TARGET return
17157 false if decl is mapped.
17159 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
17161 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
17162 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
17163 has been successfully unrolled.
17165 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
17167 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
17168 exit condition if no tail loop is needed, and if the original exit
17169 condition should therefore be kept as-is.
17170 (tree_transform_and_unroll_loop): Handle that case here too.
17172 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
17174 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
17176 (free_data_refs): Likewise.
17177 * tree-data-ref.h (free_dependence_relations): Likewise.
17178 (free_data_refs): Likewise.
17179 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
17181 (struct component): Likewise.
17182 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
17183 (pcom_worker::~pcom_worker): Likewise.
17184 (pcom_worker::release_chain): Adjust as auto_vec changes.
17185 (pcom_worker::loop): Rename to ...
17186 (pcom_worker::m_loop): ... this.
17187 (pcom_worker::datarefs): Rename to ...
17188 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
17189 (pcom_worker::dependences): Rename to ...
17190 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
17191 (pcom_worker::chains): Rename to ...
17192 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
17193 (pcom_worker::looparound_phis): Rename to ...
17194 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
17196 (pcom_worker::cache): Rename to ...
17197 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
17198 (pcom_worker::release_chain): Adjust for auto_vec changes.
17199 (pcom_worker::release_chains): Adjust for auto_vec and renaming
17201 (release_component): Remove.
17202 (release_components): Adjust for release_component removal.
17203 (component_of): Adjust to use vec.
17204 (merge_comps): Likewise.
17205 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
17206 (pcom_worker::determine_offset): Likewise.
17207 (class comp_ptrs): Remove.
17208 (pcom_worker::split_data_refs_to_components): Adjust for renaming
17209 changes, for comp_ptrs removal with auto_vec.
17210 (pcom_worker::suitable_component_p): Adjust for renaming changes.
17211 (pcom_worker::filter_suitable_components): Adjust for release_component
17213 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
17214 (pcom_worker::find_looparound_phi): Likewise.
17215 (pcom_worker::add_looparound_copies): Likewise.
17216 (pcom_worker::determine_roots_comp): Likewise.
17217 (pcom_worker::single_nonlooparound_use): Likewise.
17218 (pcom_worker::execute_pred_commoning_chain): Likewise.
17219 (pcom_worker::execute_pred_commoning): Likewise.
17220 (pcom_worker::try_combine_chains): Likewise.
17221 (pcom_worker::prepare_initializers_chain): Likewise.
17222 (pcom_worker::prepare_initializers): Likewise.
17223 (pcom_worker::prepare_finalizers_chain): Likewise.
17224 (pcom_worker::prepare_finalizers): Likewise.
17225 (pcom_worker::tree_predictive_commoning_loop): Likewise.
17227 2021-07-20 Martin Sebor <msebor@redhat.com>
17229 PR middle-end/101397
17230 * builtins.c (gimple_call_return_array): Add argument. Correct
17231 offsets for memchr, mempcpy, stpcpy, and stpncpy.
17232 (compute_objsize_r): Adjust offset computation for argument returning
17235 2021-07-20 Martin Sebor <msebor@redhat.com>
17237 PR middle-end/101300
17238 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
17240 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
17242 * function.c (assign_parm_setup_block): Use adjust_address instead
17243 of change_address to preserve MEM_EXPR and friends.
17245 2021-07-20 Martin Sebor <msebor@redhat.com>
17247 * cfgloop.h (single_likely_exit): Adjust by-value argument to
17248 by-const-reference.
17249 * cfgloopanal.c (single_likely_exit): Same.
17250 * cgraph.h (struct cgraph_node): Same.
17251 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
17252 * genautomata.c (merge_states): Same.
17253 * genextract.c (VEC_char_to_string): Same.
17254 * genmatch.c (dt_node::gen_kids_1): Same.
17255 (walk_captures): Adjust by-value argument to by-reference.
17256 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
17257 to by-const-reference.
17258 * gimple.c (gimple_build_call_vec): Same.
17259 (gimple_build_call_internal_vec): Same.
17260 (gimple_build_switch): Same.
17261 (sort_case_labels): Same.
17262 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
17264 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
17265 by-const-reference.
17266 (gimple_build_call_internal_vec): Same.
17267 (gimple_build_switch): Same.
17268 (sort_case_labels): Same.
17269 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
17271 * haifa-sched.c (calc_priorities): Adjust by-value argument to
17272 by-const-reference.
17273 (sched_init_luids): Same.
17274 (haifa_init_h_i_d): Same.
17275 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
17276 (adjust_callers_for_value_intersection): Adjust by-value argument to
17278 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
17279 by-const-reference.
17280 (find_more_contexts_for_caller_subset): Same.
17281 (find_aggregate_values_for_callers_subset): Same.
17282 (copy_useful_known_contexts): Same.
17283 * ipa-fnsummary.c (remap_edge_summaries): Same.
17284 (remap_freqcounting_predicate): Same.
17285 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
17287 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
17288 to by-const-reference.
17289 * ipa-predicate.h (predicate::remap_after_inlining): Same.
17290 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
17291 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
17292 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
17293 * read-rtl.c (add_overload_instance): Same.
17294 * rtl.h (native_decode_rtx): Same.
17295 (native_decode_vector_rtx): Same.
17296 * sched-int.h (sched_init_luids): Same.
17297 (haifa_init_h_i_d): Same.
17298 * simplify-rtx.c (native_decode_vector_rtx): Same.
17299 (native_decode_rtx): Same.
17300 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
17301 (shrink_wrap_one_built_in_call_with_conds): Same.
17302 (shrink_wrap_conditional_dead_built_in_calls): Same.
17303 * tree-data-ref.c (create_runtime_alias_checks): Same.
17304 (compute_all_dependences): Same.
17305 * tree-data-ref.h (compute_all_dependences): Same.
17306 (create_runtime_alias_checks): Same.
17307 (index_in_loop_nest): Same.
17308 * tree-if-conv.c (mask_exists): Same.
17309 * tree-loop-distribution.c (class loop_distribution): Same.
17310 (loop_distribution::create_rdg_vertices): Same.
17311 (dump_rdg_partitions): Same.
17312 (debug_rdg_partitions): Same.
17313 (partition_contains_all_rw): Same.
17314 (loop_distribution::distribute_loop): Same.
17315 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
17316 (oacc_entry_exit_single_gang): Same.
17317 * tree-ssa-loop-im.c (hoist_memory_references): Same.
17318 (loop_suitable_for_sm): Same.
17319 * tree-ssa-loop-niter.c (bound_index): Same.
17320 * tree-ssa-reassoc.c (update_ops): Same.
17321 (swap_ops_for_binary_stmt): Same.
17322 (rewrite_expr_tree): Same.
17323 (rewrite_expr_tree_parallel): Same.
17324 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
17325 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
17326 * tree-ssa-structalias.c (process_all_all_constraints): Same.
17327 (make_constraints_to): Same.
17328 (handle_lhs_call): Same.
17329 (find_func_aliases_for_builtin_call): Same.
17330 (sort_fieldstack): Same.
17331 (check_for_overlaps): Same.
17332 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
17333 (vect_create_cond_for_unequal_addrs): Same.
17334 (vect_create_cond_for_lower_bounds): Same.
17335 (vect_create_cond_for_alias_checks): Same.
17336 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
17337 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
17338 (vect_make_slp_decision): Same.
17339 (vect_slp_bbs): Same.
17340 (duplicate_and_interleave): Same.
17341 (vect_transform_slp_perm_load): Same.
17342 (vect_schedule_slp): Same.
17343 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
17344 (vect_schedule_slp): Same.
17345 (duplicate_and_interleave): Same.
17346 * tree.c (build_vector_from_ctor): Same.
17347 (build_vector): Same.
17348 (check_vector_cst): Same.
17349 (check_vector_cst_duplicate): Same.
17350 (check_vector_cst_fill): Same.
17351 (check_vector_cst_stepped): Same.
17352 * tree.h (build_vector_from_ctor): Same.
17354 2021-07-20 Jakub Jelinek <jakub@redhat.com>
17357 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
17358 type from bool to int.
17359 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
17360 EASY_VECTOR_MSB case if either step or copies is not 1.
17361 (vspltis_shifted): Fix comment typo.
17362 (easy_altivec_constant): Change return type from bool to int, instead
17363 of returning true return byte size of the element mode that should be
17364 used to synthetize the constant.
17365 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
17366 that vspltis_shifted is 0, handle the case where easy_altivec_constant
17367 assumes using different vector mode from CONST_VECTOR's mode.
17368 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
17369 easy_altivec_constant to determine mode in which -1 >> -1 should be
17370 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
17372 2021-07-20 Richard Biener <rguenther@suse.de>
17375 * dwarf2out.h (dwarf_file_data): Add key member.
17376 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
17377 (dwarf_file_hasher::hash): Hash key.
17378 (lookup_filename): Remap the filename and store it in the
17379 filename member of dwarf_file_data when creating a new
17381 (file_name_acquire): Do not remap the filename again.
17382 (maybe_emit_file): Likewise.
17384 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
17386 * config/aarch64/aarch64-simd-builtins.def: Use two variant
17387 generators for all TBL/TBX intrinsics and rename to
17388 consistent forms: qtbl[1234] or qtbx[1234].
17389 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
17391 (aarch64_qtbl1<mode>): This.
17392 (aarch64_tbx1<mode>): Rename to...
17393 (aarch64_qtbx1<mode>): This.
17394 (aarch64_tbl2v16qi): Delete.
17395 (aarch64_tbl3<mode>): Rename to...
17396 (aarch64_qtbl2<mode>): This.
17397 (aarch64_tbx4<mode>): Rename to...
17398 (aarch64_qtbx2<mode>): This.
17399 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
17400 renamed qtbl1 and qtbl2 RTL patterns.
17401 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
17403 (vqtbl1_s8): Likewise.
17404 (vqtbl1_u8): Likewise.
17405 (vqtbl1q_p8): Likewise.
17406 (vqtbl1q_s8): Likewise.
17407 (vqtbl1q_u8): Likewise.
17408 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
17409 (vqtbx1_u8): Likewise.
17410 (vqtbx1_p8): Likewise.
17411 (vqtbx1q_s8): Likewise.
17412 (vqtbx1q_u8): Likewise.
17413 (vqtbx1q_p8): Likewise.
17414 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
17415 (vtbl1_u8): Likewise.
17416 (vtbl1_p8): Likewise.
17417 (vtbl2_s8): Likewise
17418 (vtbl2_u8): Likewise.
17419 (vtbl2_p8): Likewise.
17420 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
17421 (vtbl3_u8): Likewise.
17422 (vtbl3_p8): Likewise.
17423 (vtbl4_s8): Likewise.
17424 (vtbl4_u8): Likewise.
17425 (vtbl4_p8): Likewise.
17426 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
17427 (vtbx2_u8): Likewise.
17428 (vtbx2_p8): Likewise.
17429 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
17430 (vqtbl2_u8): Likewise.
17431 (vqtbl2_p8): Likewise.
17432 (vqtbl2q_s8): Likewise.
17433 (vqtbl2q_u8): Likewise.
17434 (vqtbl2q_p8): Likewise.
17435 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
17436 (vqtbx2_u8): Likewise.
17437 (vqtbx2_p8): Likewise.
17438 (vqtbx2q_s8): Likewise.
17439 (vqtbx2q_u8): Likewise.
17440 (vqtbx2q_p8): Likewise.
17441 (vtbx4_s8): Likewise.
17442 (vtbx4_u8): Likewise.
17443 (vtbx4_p8): Likewise.
17445 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
17448 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
17450 (define_peephole2 atomic_loaddi_fpu): Ditto.
17452 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
17454 * config.gcc (riscv*-*-*): Detect which python is available.
17456 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17458 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
17459 (smul<mode>3_highpart): ... this.
17460 (mulhu_<mode>): Rename to...
17461 (umul<mode>3_highpart): ... this.
17462 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
17463 MULHU_V2DI, MULHU_V4SI): Adjust.
17465 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
17467 PR tree-optimization/100696
17468 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
17469 * internal-fn.def (IFN_MULH): New internal function.
17470 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
17471 recog normal multiply highpart as IFN_MULH.
17472 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
17475 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17477 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
17478 (BTF_DEBUGGING_INFO): Likewise.
17479 * doc/tm.texi.in: Document the new macros.
17480 * doc/tm.texi: Regenerated.
17481 * toplev.c: Guard initialization of debug hooks.
17483 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
17485 * flags.h (ctf_debuginfo_p): New function declaration.
17486 * opts.c (ctf_debuginfo_p): New function definition.
17488 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
17491 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
17492 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
17493 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
17494 * config/gcn/gcn-valu.md: Add a warning comment.
17495 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
17496 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
17497 * config/gcn/gcn.md: Add a warning comment.
17498 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
17499 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
17501 (EF_AMDGPU_XNACK): New.
17502 (EF_AMDGPU_SRAM_ECC): New.
17504 (copy_early_debug_info): Use elf_flags.
17505 (main): Handle -mxnack and -msram-ecc options.
17506 * doc/invoke.texi: Document -mxnack and -msram-ecc.
17508 2021-07-19 Andrew Pinski <apinski@marvell.com>
17511 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
17512 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
17514 2021-07-19 Richard Biener <rguenther@suse.de>
17516 PR tree-optimization/101505
17517 * tree-vect-patterns.c (vect_determine_precisions): Walk
17518 PHIs also for loop vectorization.
17520 2021-07-19 Richard Biener <rguenther@suse.de>
17522 * gimple.h (gimple_expr_type): Remove.
17523 * doc/gimple.texi: Remove gimple_expr_type documentation.
17525 2021-07-19 Richard Biener <rguenther@suse.de>
17527 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
17528 (ao_ref_init_from_vn_reference): Likewise.
17529 (fully_constant_reference): Likewise.
17530 (vn_reference_lookup_call): Do not set vr->type to random
17532 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
17534 * tree-vect-generic.c (expand_vector_piecewise): Pass in
17535 whether we expanded parallel.
17536 (expand_vector_parallel): Adjust.
17537 (expand_vector_addition): Likewise.
17538 (expand_vector_comparison): Likewise.
17539 (expand_vector_operation): Likewise.
17540 (expand_vector_scalar_condition): Likewise.
17541 (expand_vector_conversion): Likewise.
17543 2021-07-19 Richard Biener <rguenther@suse.de>
17545 * tree-vrp.c (register_edge_assert_for_2): Use the
17547 (vrp_folder::fold_predicate_in): Likewise.
17548 * vr-values.c (gimple_assign_nonzero_p): Likewise.
17549 (vr_values::extract_range_from_comparison): Likewise.
17550 (vr_values::extract_range_from_ubsan_builtin): Use the
17551 type of the first operand.
17552 (vr_values::extract_range_basic): Push down type
17553 computation, use the appropriate LHS.
17554 (vr_values::extract_range_from_assignment): Use the
17557 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
17560 * common/config/i386/i386-common.c (ix86_handle_option): For
17561 -mgeneral-regs-only, enable the GPR only instructions which are
17562 enabled implicitly by SSE ISAs unless they have been disabled
17565 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
17568 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
17569 ix86_avx_u128_mode_needed.
17570 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
17571 returns AVX register.
17573 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
17575 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
17576 (handle_const_call): Liekise
17577 (handle_pure_call): Liekise
17579 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
17581 PR tree-optimization/96542
17582 * range-op.cc (range_operator::wi_fold_in_parts): New.
17583 (range_operator::fold_range): Call wi_fold_in_parts.
17584 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
17585 * range-op.h (wi_fold_in_parts): Add prototype.
17587 2021-07-16 David Malcolm <dmalcolm@redhat.com>
17589 * doc/analyzer.texi: Add __analyzer_dump_state.
17591 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17593 * config/rs6000/rbtree.c: New file.
17594 * config/rs6000/rbtree.h: New file.
17596 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17598 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
17599 (typeinfo): Add restr field.
17600 (match_bracketed_pair): New function.
17601 (match_const_restriction): Implement.
17603 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17605 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
17607 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17609 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
17610 (basetype): Likewise.
17611 (typeinfo): Likewise.
17612 (handle_pointer): New function.
17613 (match_basetype): New stub function.
17614 (match_const_restriction): Likewise.
17615 (match_type): New function.
17617 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17619 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
17621 (advance_line): Likewise.
17622 (safe_inc_pos): Likewise.
17623 (match_identifier): Likewise.
17624 (match_integer): Likewise.
17625 (match_to_right_bracket): Likewise.
17627 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17629 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
17630 (ovld_file): Likewise.
17631 (header_file): Likewise.
17632 (init_file): Likewise.
17633 (defines_file): Likewise.
17634 (pgm_path): Likewise.
17635 (bif_path): Likewise.
17636 (ovld_path): Likewise.
17637 (header_path): Likewise.
17638 (init_path): Likewise.
17639 (defines_path): Likewise.
17640 (LINELEN): New macro.
17641 (linebuf): New variable.
17645 (bif_diag): New function.
17646 (ovld_diag): Likewise.
17648 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17650 * config/rs6000/rs6000-builtin-new.def: New.
17651 * config/rs6000/rs6000-overload.def: New.
17653 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17655 * config/rs6000/rs6000-gen-builtins.c: New.
17657 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
17659 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
17660 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
17661 * gengtype-state.c (state_writer::write_state_file_list): Add a
17662 parameter to the fileslist expression for the number of build
17664 (read_state_files_list): Detect build headers and strip the
17665 initial "./" or ".\" from their names.
17666 * gengtype.c (build_headers): New global variable.
17667 (num_build_headers): Likewise.
17668 (open_base_files): Emit #include for each build header.
17669 (main): Detect and count build headers.
17670 * gengtype.h (build_headers): New extern variable.
17671 (num_build_headers): Likewise.
17673 2021-07-16 Richard Biener <rguenther@suse.de>
17675 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
17676 the type of the LHS.
17677 (find_bswap_or_nop_1): Likewise.
17678 (find_bswap_or_nop): Likewise.
17679 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
17681 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
17682 Remove unused parameters, pass in the scalar type. Fix
17683 internal store function handling.
17684 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
17685 (vect_get_vector_types_for_stmt): Move down check for
17686 existing vector stmt after we've determined a scalar type.
17687 Pass down the used scalar type to vect_get_smallest_scalar_type.
17688 * tree-vect-generic.c (expand_vector_condition): Use
17689 the type of the LHS.
17690 (expand_vector_scalar_condition): Likewise.
17691 (expand_vector_operations_1): Likewise.
17692 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
17693 (vect_recog_dot_prod_pattern): Likewise.
17694 (vect_recog_sad_pattern): Likewise.
17695 (vect_recog_widen_op_pattern): Likewise.
17696 (vect_recog_widen_sum_pattern): Likewise.
17697 (vect_recog_mixed_size_cond_pattern): Likewise.
17699 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
17701 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
17702 (dump_eaf_flags): Dump EAF_NOT_RETURNED
17703 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
17704 and EAF_NOT_RETURNED.
17705 (modref_summary::useful_p): Likewise.
17706 (modref_summary_lto::useful_p): Likewise.
17707 (struct) modref_summary_lto: Use eaf_fleags_t.
17708 (deref_flags): Handle EAF_NOT_RETURNED.
17709 (struct escape_point): Use min_flags.
17710 (modref_lattice::init): Add EAF_NOT_RETURNED.
17711 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
17712 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
17714 (analyze_parms): Also analyze const functions; update conition on
17716 (modref_write): Update streaming.
17717 (read_section): Update streaming.
17718 (remap_arg_flags): Use eaf_flags_t.
17719 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
17720 * ipa-modref.h: (eaf_flags_t): New typedef.
17721 (struct modref_summary): Use eaf_flags_t.
17722 * tree-core.h (EAF_NOT_RETURNED): New constant.
17724 2021-07-16 Richard Biener <rguenther@suse.de>
17726 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
17727 the type of the LHS.
17728 (gimple_assign_nonnegative_warnv_p): Likewise.
17729 (gimple_call_nonnegative_warnv_p): Likewise. Return false
17730 if the call has no LHS.
17731 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
17732 * tree-eh.c (stmt_could_throw_1_p): Likewise.
17733 * tree-inline.c (insert_init_stmt): Likewise.
17734 * tree-ssa-loop-niter.c (get_val_for): Likewise.
17735 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
17737 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
17738 gassign *. Use the type of the lhs.
17739 (vn_nary_op_lookup_stmt): Adjust.
17740 (vn_nary_op_insert_stmt): Likewise.
17742 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
17744 * config/s390/predicates.md (bras_sym_operand): Accept all
17745 functions in 64-bit mode, use UNSPEC_PLT31.
17746 (larl_operand): Use UNSPEC_PLT31.
17747 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
17748 (legitimize_pic_address): Likewise.
17749 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
17751 (s390_delegitimize_address): Use UNSPEC_PLT31.
17752 (s390_output_addr_const_extra): Likewise.
17753 (print_operand): Add @PLT to TLS calls, handle %K.
17754 (s390_function_profiler): Mark __fentry__/_mcount as function,
17755 use %K, use UNSPEC_PLT31.
17756 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
17757 (s390_emit_call): Use UNSPEC_PLT31.
17758 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
17759 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
17760 (*movdi_64): Use %K.
17761 (reload_base_64): Likewise.
17762 (*sibcall_brc): Likewise.
17763 (*sibcall_brcl): Likewise.
17764 (*sibcall_value_brc): Likewise.
17765 (*sibcall_value_brcl): Likewise.
17767 (*brasl): Likewise.
17768 (*bras_r): Likewise.
17769 (*brasl_r): Likewise.
17770 (*bras_tls): Likewise.
17771 (*brasl_tls): Likewise.
17772 (main_base_64): Likewise.
17773 (reload_base_64): Likewise.
17774 (@split_stack_call<mode>): Likewise.
17776 2021-07-16 Richard Biener <rguenther@suse.de>
17778 PR tree-optimization/101467
17779 * tree-vect-stmts.c (vect_gen_while): Properly guard
17780 make_temp_ssa_name usage.
17782 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
17784 * config.gcc: Don't use forked print-sysroot-suffix.sh and
17785 t-sysroot-suffix for C-SKY.
17786 * config/csky/print-sysroot-suffix.sh: Delete.
17787 * config/csky/t-csky-linux: Delete.
17788 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
17789 instead of CSKY_MULTILIB_DIRNAMES.
17791 2021-07-16 Richard Biener <rguenther@suse.de>
17793 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
17794 conversion issues with the partial reduction of the reused
17795 vector accumulator.
17797 2021-07-16 Richard Biener <rguenther@suse.de>
17799 * config/i386/i386-options.c (ix86_option_override_internal): Set
17800 param_vect_partial_vector_usage to zero if not set.
17802 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
17805 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
17806 (VALID_INT_MODE_P): Add SDmode and DDmode.
17807 Add TDmode for TARGET_64BIT.
17808 (VALID_DFP_MODE_P): Remove.
17809 * config/i386/i386.c (ix86_hard_regno_mode_ok):
17810 Do not use VALID_DFP_MODE_P.
17812 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
17814 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
17816 (fold_using_range::fold_stmt): Ditto.
17817 (fold_using_range::range_of_range_op): Ditto.
17818 (fold_using_range::range_of_phi): Ditto.
17819 (fold_using_range::range_of_call): Ditto.
17820 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
17821 (fold_using_range::range_of_builtin_call): Ditto.
17822 (fold_using_range::range_of_cond_expr): Ditto.
17823 * gimple-range-fold.h (gimple_range_type): New.
17825 2021-07-15 Martin Sebor <msebor@redhat.com>
17827 PR middle-end/97027
17828 * tree-ssa-strlen.c (handle_assign): New function.
17829 (maybe_warn_overflow): Add argument.
17830 (nonzero_bytes_for_type): New function.
17831 (count_nonzero_bytes): Handle more tree types. Call
17832 nonzero_bytes_for_tye.
17833 (count_nonzero_bytes): Handle types.
17834 (handle_store): Handle stores from function calls.
17835 (strlen_check_and_optimize_call): Move code to handle_assign. Call
17836 it for assignments from function calls.
17838 2021-07-15 David Malcolm <dmalcolm@redhat.com>
17843 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
17845 2021-07-15 David Malcolm <dmalcolm@redhat.com>
17847 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
17849 2021-07-15 Martin Sebor <msebor@redhat.com>
17853 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
17854 (operand_compare::verify_hash_value): Same.
17855 * tree-core.h (OEP_DECL_NAME): New.
17857 2021-07-15 Martin Jambor <mjambor@suse.cz>
17859 * profile-count.h (profile_count::value): Change the return type to
17861 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
17863 * tree-cfg.c (dump_function_to_file): Likewise.
17865 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
17868 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
17869 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
17872 2021-07-15 Richard Biener <rguenther@suse.de>
17874 * tree-vectorizer.h (vect_gen_while): Match up with
17875 vect_gen_while_not.
17876 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
17877 of vect_gen_while_not.
17878 (vect_gen_while_not): Adjust.
17879 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
17881 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
17883 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
17884 (ranger_cache::range_of_def): Call adjust_range.
17885 (ranger_cache::entry_range): Same.
17886 * gimple-range-cache.h (non_null_ref::adjust_range): New.
17887 * gimple-range.cc (gimple_ranger::range_of_expr): Call
17889 (gimple_ranger::range_on_entry): Same.
17891 2021-07-15 Tamar Christina <tamar.christina@arm.com>
17894 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17896 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
17898 2021-07-15 Tamar Christina <tamar.christina@arm.com>
17901 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17903 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
17904 (sdot_prod, udot_prod): ...These.
17905 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
17906 (aarch64_<sur>dot<vsi2qi>): Rename to...
17907 (<sur>dot_prod<vsi2qi>): ...This.
17908 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
17911 2021-07-15 Jakub Jelinek <jakub@redhat.com>
17913 PR middle-end/101437
17914 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
17915 types even if they have non-BLKmode TYPE_MODE.
17917 2021-07-15 Richard Biener <rguenther@suse.de>
17920 * gcc.c (process_command): Process -gtoggle like process_options
17921 would after parsing options.
17923 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
17925 * cfgexpand.c (expand_asm_loc): Adjust.
17926 (expand_asm_stmt): Likewise.
17927 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
17928 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
17929 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
17930 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
17931 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
17932 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
17933 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
17934 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
17935 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
17936 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
17937 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
17938 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
17939 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
17940 * doc/tm.texi: Regenerate.
17941 * target.def: Add location argument to md_asm_adjust.
17943 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
17945 * tree-diagnostic.c (diagnostic_report_current_function): Use the
17946 diagnostic's location, not input_location.
17948 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
17950 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
17952 (expand_asm_stmt): Likewise.
17954 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
17956 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
17957 addressed memory rtx, if any.
17958 (rs6000_split_multireg_move): Fix code formatting.
17959 Handle MMA build built-ins with operands in adjacent memory locations.
17961 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
17963 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
17966 2021-07-14 Jason Merrill <jason@redhat.com>
17968 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
17970 2021-07-14 Jason Merrill <jason@redhat.com>
17972 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
17975 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17977 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
17978 Remove erroneous line.
17980 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
17982 * params.opt (param_evrp_mode): Change default.
17984 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17986 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
17987 (sdot_prod, udot_prod): ...These.
17988 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
17989 (aarch64_<sur>dot<vsi2qi>): Rename to...
17990 (<sur>dot_prod<vsi2qi>): ...This.
17991 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
17994 2021-07-14 Tamar Christina <tamar.christina@arm.com>
17996 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
17998 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18000 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
18002 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18004 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
18006 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18008 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
18009 (usdot_prod<vsi2qi>): ... This.
18010 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
18011 (usdot_prod): ...This.
18012 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
18013 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
18015 (@<sur>dot_prod<vsi2qi>): ...This.
18016 * config/aarch64/aarch64-sve-builtins-base.cc
18017 (svusdot_impl::expand): Use it.
18019 2021-07-14 Tamar Christina <tamar.christina@arm.com>
18021 * optabs.def (usdot_prod_optab): New.
18022 * doc/md.texi: Document it and clarify other dot prod optabs.
18023 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
18024 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
18025 * optabs.c (expand_widen_pattern_expr): Likewise.
18026 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
18027 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
18028 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
18030 (vect_widened_op_tree): Optionally ignore
18032 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
18034 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
18037 * config/i386/driver-i386.c (host_detect_local_cpu): Check
18038 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
18039 Enable UINTR only for 64-bit codegen.
18040 * config/i386/i386-options.c
18041 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
18043 * config/i386/i386.h (ARCH_ARG): New.
18044 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
18045 "[arch|tune] 64" for 64-bit codegen.
18047 2021-07-14 Richard Biener <rguenther@suse.de>
18049 PR tree-optimization/101445
18050 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
18051 of the IV in the correct direction for negative stride
18054 2021-07-14 Jakub Jelinek <jakub@redhat.com>
18057 * godump.c (godump_str_hash): New type.
18058 (godump_container::pot_dummy_types): Use string_hash instead of
18059 ptr_hash in the hash_set.
18061 2021-07-14 Richard Biener <rguenther@suse.de>
18063 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
18064 vector types where the old vector type has a multiple of
18065 the new vector type elements.
18066 (vect_create_partial_epilog): New function, split out from...
18067 (vect_create_epilog_for_reduction): ... here.
18068 (vect_transform_cycle_phi): Reduce the re-used accumulator
18069 to the new vector type.
18071 2021-07-14 Alexandre Oliva <oliva@adacore.com>
18073 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
18074 non-'t'-sized arg check.
18076 2021-07-14 Alexandre Oliva <oliva@adacore.com>
18078 * tree-cfg.c (cleanup_dead_labels_eh): Update
18079 post_landing_pad label upon change of landing pad block's
18081 (cleanup_dead_labels): Check that a removed label is not that
18084 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
18086 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
18088 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
18089 Add Neon to general purpose register case for zero-extend
18091 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
18092 case to prevent some cases opting to go through memory.
18093 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
18094 * rtl.c (rtvec_series_p): Define predicate to determine
18095 whether a vector contains a linear series of integers.
18096 * rtl.h (rtvec_series_p): Define.
18097 * rtlanal.c (vec_series_lowpart_p): Define predicate to
18098 determine if a vector selection is equivalent to the low part
18100 * rtlanal.h (vec_series_lowpart_p): Define.
18101 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
18102 Add vec_select -> subreg simplification.
18104 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
18106 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
18107 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
18108 _mm_test_mix_ones_zeros): New.
18110 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
18111 Richard Biener <rguenther@suse.de>
18113 * gimple.c (gimple_could_trap_p_1): Make S argument a
18114 "const gimple*". Preserve constness in call to
18115 gimple_asm_volatile_p.
18116 (gimple_could_trap_p): Make S argument a "const gimple*".
18117 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
18118 Update function prototypes.
18120 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18122 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
18123 (_loop_vec_info::main_loop_edge): New field.
18124 (_loop_vec_info::skip_main_loop_edge): Likewise.
18125 (_loop_vec_info::skip_this_loop_edge): Likewise.
18126 (_loop_vec_info::reusable_accumulators): Likewise.
18127 (_stmt_vec_info::reduc_scalar_results): Likewise.
18128 (_stmt_vec_info::reused_accumulator): Likewise.
18129 (vect_get_main_loop_result): Declare.
18130 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
18131 reduc_scalar_inputs.
18132 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
18133 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
18134 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
18135 skip_main_loop_edge and skip_this_loop_edge fields.
18136 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
18137 (vect_emit_reduction_init_stmts): New function.
18138 (get_initial_def_for_reduction): Use it.
18139 (get_initial_defs_for_reduction): Likewise. Change the vinfo
18140 parameter to a loop_vec_info.
18141 (vect_create_epilog_for_reduction): Store the scalar results
18142 in the reduc_info. If an epilogue loop is reusing an accumulator
18143 from the main loop, and if the epilogue loop can also be skipped,
18144 try to place the reduction code in the join block. Record
18145 accumulators that could potentially be reused by epilogue loops.
18146 (vect_transform_cycle_phi): When vectorizing epilogue loops,
18147 try to reuse accumulators from the main loop. Record the initial
18148 value in reduc_info for non-SLP reductions too.
18150 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18152 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
18153 adjustment handling. Take the neutral value as an argument,
18154 in place of the code argument.
18155 (vect_transform_cycle_phi): Update accordingly. Handle the
18156 initial values of cond reductions separately from code reductions.
18157 Choose the adjustment here rather than in
18158 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
18160 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18162 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
18163 (neutral_op_for_reduction): ...this, providing a more general
18165 (vect_create_epilog_for_reduction): Update accordingly.
18166 (vectorizable_reduction): Likewise.
18167 (vect_transform_cycle_phi): Likewise.
18169 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18171 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
18172 reduc_info instead of the original stmt_vec_info.
18173 (vect_transform_cycle_phi): Update accordingly.
18175 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18177 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
18178 reduc_info as an additional parameter.
18179 (vect_transform_cycle_phi): Update accordingly.
18181 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18183 * tree-vectorizer.h: Include tree-ssa-operands.h.
18184 (vect_phi_initial_value): New function.
18185 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
18186 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
18187 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
18188 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
18190 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18192 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
18193 the phi results to vectype after creating them. Remove later
18194 conversion code that thus becomes redundant.
18196 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18198 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
18199 the new_phis vector with a reduc_inputs vector. Combine handling
18200 of reduction chains and ncopies > 1.
18202 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18204 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
18205 scalar_results to group_size elements after reducing down from
18206 N*group_size elements. Construct an array_slice of the live-out
18207 stmts and assert that there is one stmt per scalar result.
18209 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18211 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
18212 nested_in_vect_loop and use double_reduc everywhere. Remove dead
18213 assignment to "loop".
18215 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
18217 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
18218 vector types first. For scalar types, consider both the preferred
18219 vector mode and the alternative vector modes.
18220 * optabs-query.c (can_vec_mask_load_store_p): Use the same
18221 structure as above, in particular using related_vector_mode
18222 for modes provided by autovectorize_vector_modes.
18224 2021-07-13 Jakub Jelinek <jakub@redhat.com>
18225 Richard Biener <rguenther@suse.de>
18227 PR tree-optimization/101419
18228 * tree-pass.h (PROP_objsz): Define.
18229 (make_pass_early_object_sizes): Declare.
18230 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
18231 there to pass_early_object_sizes, drop parameter.
18232 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
18233 drop parameter, move pass_post_ipa_warn right after that.
18234 * tree-object-size.c (pass_object_sizes::execute): Rename to...
18235 (object_sizes_execute): ... this. Add insert_min_max_p argument.
18236 (pass_data_object_sizes): Move after object_sizes_execute.
18237 (pass_object_sizes): Likewise. In execute method call
18238 object_sizes_execute, drop set_pass_param method and insert_min_max_p
18239 non-static data member and its initializer in the ctor.
18240 (pass_data_early_object_sizes, pass_early_object_sizes,
18241 make_pass_early_object_sizes): New.
18242 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
18243 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
18245 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
18248 * config/riscv/constraints.md ("S"): Update description and remove
18250 * doc/md.texi (Machine Constraints): Document the 'S' constraints
18253 2021-07-13 Richard Biener <rguenther@suse.de>
18256 2021-07-12 Richard Biener <rguenther@suse.de>
18258 * tree-vect-slp.c (vect_slp_region): Show the number of
18259 SLP graph entries in the optimization message.
18261 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
18263 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
18265 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
18266 return type to long.
18267 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
18270 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
18272 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
18273 Query relation between the 2 operands and use it.
18275 2021-07-12 Sergei Trofimovich <siarheit@google.com>
18277 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
18279 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
18282 * config/i386/predicates.md (vec_setm_sse41_operand):
18283 Rename from vec_setm_operand.
18284 (vec_setm_avx2_operand): New predicate.
18285 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
18286 Use vec_setm_sse41_operand as operand 2 predicate.
18287 (vec_set<V_256_512:mode): New expander.
18288 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
18289 as operand 2 predicate.
18291 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
18293 PR tree-optimization/101335
18294 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
18296 2021-07-12 Andrew Pinski <apinski@marvell.com>
18298 * tree-ssa-phiopt.c (match_simplify_replacement): Move
18299 insert of the sequence before the movement of the
18300 statement. Check if to see if the statement is used
18301 outside of the original phi to see if we should move it.
18303 2021-07-12 Richard Biener <rguenther@suse.de>
18305 * dump-context.h (debug_dump_context::debug_dump_context):
18306 Add FILE * parameter defaulted to stderr.
18307 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
18308 * tree-vect-slp.c (dot_slp_tree): New functions.
18310 2021-07-12 Richard Biener <rguenther@suse.de>
18312 PR tree-optimization/101373
18313 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
18314 references when the BB may not return.
18315 (compute_avail): Pass in the function we're working on and
18316 replace cfun references with it. Externally throwing
18317 const calls also possibly terminate the function.
18318 (pass_pre::execute): Pass down the function we're working on.
18319 * gcse.c (compute_hash_table_work): Externally throwing
18320 const/pure calls also need record_last_mem_set_info.
18321 * postreload-gcse.c (record_opr_changes): Looping or externally
18322 throwing const/pure calls also need record_last_mem_set_info.
18324 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
18326 * recog.c (memory_address_addr_space_p): Change the type to bool.
18327 Return true/false instead of 1/0.
18328 (offsettable_memref_p): Ditto.
18329 (offsettable_nonstrict_memref_p): Ditto.
18330 (offsettable_address_addr_space_p): Ditto.
18331 Change the type of addressp indirect function to bool.
18332 * recog.h (memory_address_addr_space_p): Change the type to bool.
18333 (strict_memory_address_addr_space_p): Ditto.
18334 (offsettable_memref_p): Ditto.
18335 (offsettable_nonstrict_memref_p): Ditto.
18336 (offsettable_address_addr_space_p): Ditto.
18337 * reload.c (maybe_memory_address_addr_space_p): Ditto.
18338 (strict_memory_address_addr_space_p): Change the type to bool.
18339 Return true/false instead of 1/0.
18340 (maybe_memory_address_addr_space_p): Change the type to bool.
18342 2021-07-12 Richard Biener <rguenther@suse.de>
18344 * tree-vect-slp.c (vect_slp_region): Show the number of
18345 SLP graph entries in the optimization message.
18347 2021-07-12 Richard Biener <rguenther@suse.de>
18349 PR tree-optimization/101394
18350 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
18351 copies from abnormals for a full redundancy.
18353 2021-07-12 Richard Biener <rguenther@suse.de>
18355 PR middle-end/101423
18356 * gimple.c (gimple_could_trap_p_1): Internal function calls
18358 * tree-eh.c (tree_could_trap_p): Likewise.
18360 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18363 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
18365 (vmulq_n_u32): Likewise.
18366 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
18367 (vmulq_n_f32): Likewise.
18368 (vmul_n_f16): Likewise.
18369 (vmulq_n_f16): Likewise.
18371 2021-07-12 Martin Liska <mliska@suse.cz>
18373 PR sanitizer/101425
18374 * gcc.c (check_offload_target_name): Call
18375 candidates_list_and_hint only if we have a candidate.
18377 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
18380 * config/arm/neon.md (vec_init): Move to ...
18381 * config/arm/vec-common.md (vec_init): ... here.
18382 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
18384 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
18386 PR tree-optimization/101403
18387 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
18388 signedness of the shift is not the same as the signedness of
18389 the type extension.
18391 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
18392 Uroš Bizjak <ubizjak@gmail.com>
18394 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
18395 divmod of a constant numerator with new define_insn_and_split.
18397 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
18400 * config/i386/i386-expand.c (ix86_expand_call): If a call is
18401 to a non-local-binding, or local but to a public symbol, then
18402 assume that it might be indirected via the lazy symbol binder.
18403 Mark R10 and R10 as clobbered in that case.
18405 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
18408 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
18409 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
18410 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
18412 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
18414 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
18415 log2 range of operands[3] to [1,31].
18416 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
18418 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18420 * doc/md.texi: Don't split @smallexample in multiple @groups.
18422 2021-07-09 Sergei Trofimovich <siarheit@google.com>
18424 * doc/md.texi: Add missing 'see' word.
18426 2021-07-09 Andrew Pinski <apinski@marvell.com>
18428 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
18429 to take sequence and gimple_match_op. Accept the case where
18430 op is a SSA_NAME and one statement in the sequence.
18431 Also allow constants.
18432 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
18433 Update call to phiopt_early_allow. Discard the sequence if not
18436 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
18441 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
18442 instead of ISA_HAS_MSA.
18443 (mips_expand_vec_unpack): Likewise.
18444 (mips_expand_vector_init): Likewise.
18446 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
18448 * config/rs6000/vsx.md (mods_<mode>): Rename to...
18449 (mod<mode>3): ... this.
18450 (modu_<mode>): Rename to...
18451 (umod<mode>3): ... this.
18452 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
18453 MODU_V4SI): Adjust.
18455 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
18457 * config/h8300/shiftrotate.md (variable shifts): Expose condition
18458 code handling for the test before the loop.
18460 2021-07-08 Martin Jambor <mjambor@suse.cz>
18463 * ipa-sra.c (class isra_call_summary): New member
18464 m_before_any_store, initialize it in the constructor.
18465 (isra_call_summary::dump): Dump the new field.
18466 (ipa_sra_call_summaries::duplicate): Copy it.
18467 (process_scan_results): Set it.
18468 (isra_write_edge_summary): Stream it.
18469 (isra_read_edge_summary): Likewise.
18470 (param_splitting_across_edge): Only override
18471 safe_to_import_accesses if m_before_any_store is set.
18473 2021-07-08 Martin Sebor <msebor@redhat.com>
18475 PR bootstrap/101374
18476 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
18477 Use Object Size Type 0 instead of 1.
18479 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18481 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
18484 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
18486 * match.pd: Simplify an extend-operate-truncate sequence involving
18489 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
18490 Richard Biener <rguenther@suse.de>
18492 PR tree-optimization/40210
18493 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
18494 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
18495 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
18497 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
18500 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
18502 * config/i386/mmx.md (V_32): New mode iterator.
18503 (mov<V_32:mode>): Use V_32 mode iterator.
18504 (*mov<V_32:mode>_internal): Ditto.
18505 (*push<V_32:mode>2_rex64): Ditto.
18506 (*push<V_32:mode>2): Ditto.
18507 (movmisalign<V_32:mode>): Ditto.
18508 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
18509 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
18510 (vec_unpacks_lo_v4qi): New expander.
18511 (vec_unpacks_hi_v4qi): Ditto.
18512 (vec_unpacku_lo_v4qi): Ditto.
18513 (vec_unpacku_hi_v4qi): Ditto.
18514 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
18515 (VALID_INT_MODE_P): Ditto.
18517 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
18520 * config/rs6000/rs6000.md (udivti3): New insn.
18521 (divti3): New insn.
18522 (umodti3): New insn.
18523 (modti3): New insn.
18525 2021-07-07 Martin Sebor <msebor@redhat.com>
18527 PR tree-optimization/100137
18528 PR tree-optimization/99121
18529 PR tree-optimization/97027
18530 * builtins.c (access_ref::access_ref): Also set offmax.
18531 (access_ref::offset_in_range): Define new function.
18532 (access_ref::add_offset): Set offmax.
18533 (access_ref::inform_access): Handle access_none.
18534 (handle_mem_ref): Clear ostype.
18535 (compute_objsize_r): Handle ASSERT_EXPR.
18536 * builtins.h (struct access_ref): Add offmax member.
18537 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
18538 compute_objsize() and simplify.
18540 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
18542 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
18543 and VSX_BUILTIN_STXVP.
18545 2021-07-07 Martin Sebor <msebor@redhat.com>
18548 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
18549 a stray %K from error_at() missed in r12-2088.
18551 2021-07-07 Richard Biener <rguenther@suse.de>
18553 PR tree-optimization/99728
18554 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
18556 (mem_refs_may_alias_p): Add assert we handled aggregate
18558 (sm_seq_valid_bb): Give up when running into aggregate copies.
18559 (ref_indep_loop_p): Handle aggregate copies as never
18560 being invariant themselves but allow other refs to be
18561 disambiguated against them.
18562 (can_sm_ref_p): Do not try to apply store-motion to aggregate
18565 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
18568 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
18571 2021-07-06 Martin Sebor <msebor@redhat.com>
18573 * gimple-pretty-print.c (percent_G_format): Remove.
18574 * tree-diagnostic.c (default_tree_printer): Remove calls.
18575 * tree-pretty-print.c (percent_K_format): Remove.
18576 * tree-pretty-print.h (percent_K_format): Remove.
18578 2021-07-06 Martin Sebor <msebor@redhat.com>
18580 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
18581 Remove %K and use error_at.
18582 (aarch64_expand_fcmla_builtin): Same.
18583 (aarch64_expand_builtin_tme): Same.
18584 (aarch64_expand_builtin_memtag): Same.
18585 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
18586 (arm_expand_builtin): Same.
18587 * config/arm/arm.c (bounds_check): Same.
18589 2021-07-06 Martin Sebor <msebor@redhat.com>
18591 * builtins.c (warn_string_no_nul): Remove %G.
18592 (maybe_warn_for_bound): Same.
18593 (warn_for_access): Same.
18594 (check_access): Same.
18595 (check_strncat_sizes): Same.
18596 (expand_builtin_strncat): Same.
18597 (expand_builtin_strncmp): Same.
18598 (expand_builtin): Same.
18599 (expand_builtin_object_size): Same.
18600 (warn_dealloc_offset): Same.
18601 (maybe_emit_free_warning): Same.
18602 * calls.c (maybe_warn_alloc_args_overflow): Same.
18603 (maybe_warn_nonstring_arg): Same.
18604 (maybe_warn_rdwr_sizes): Same.
18605 * expr.c (expand_expr_real_1): Remove %K.
18606 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
18607 (gimple_fold_builtin_strncat): Same.
18608 * gimple-ssa-sprintf.c (format_directive): Same.
18609 (handle_printf_call): Same.
18610 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
18611 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
18612 (maybe_diag_access_bounds): Same. Call gimple_location.
18613 (check_bounds_or_overlap): Same.
18614 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
18615 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
18616 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
18617 (maybe_diag_stxncpy_trunc): Same.
18618 (handle_builtin_stxncpy_strncat): Same.
18619 (maybe_warn_pointless_strcmp): Same.
18620 * tree-ssa-uninit.c (maybe_warn_operand): Same.
18622 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
18625 * config/i386/predicates.md (vec_setm_operand): Enable
18626 register_operand for TARGET_SSE4_1.
18627 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
18628 as operand 2 predicate. Call ix86_expand_vector_set_var
18629 for non-constant index operand.
18630 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
18631 Call ix86_expand_vector_set_var for non-constant index operand.
18633 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
18635 * config/h8300/jumpcall.md (*branch): When possible, generate
18636 the comparison in CCZN mode.
18637 * config/h8300/predicates.md (simple_memory_operand): Reject all
18638 auto-increment addressing modes.
18640 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
18642 PR bootstrap/100246
18643 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
18646 2021-07-06 Richard Biener <rguenther@suse.de>
18648 * doc/md.texi (vec_fmaddsub<mode>4): Document.
18649 (vec_fmsubadd<mode>4): Likewise.
18650 * optabs.def (vec_fmaddsub$a4): Add.
18651 (vec_fmsubadd$a4): Likewise.
18652 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
18653 (IFN_VEC_FMSUBADD): Likewise.
18654 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
18655 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
18656 (addsub_pattern::build): Likewise.
18657 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
18658 and CFN_VEC_FMSUBADD are not transparent for permutes.
18659 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
18660 (vec_fmsubadd<mode>4): Likewise.
18662 2021-07-06 Richard Biener <rguenther@suse.de>
18664 * doc/invoke.texi (fmove-loop-stores): Document.
18665 * common.opt (fmove-loop-stores): New option.
18666 * opts.c (default_options_table): Enable -fmove-loop-stores
18667 at -O1 but not -Og.
18668 * tree-ssa-loop-im.c (pass_lim::execute): Pass
18669 flag_move_loop_stores instead of true to
18670 loop_invariant_motion_in_fun.
18672 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
18674 * doc/install.texi: Document --with-dsymutil.
18676 2021-07-06 Andrew Pinski <apinski@marvell.com>
18678 PR tree-optimization/101256
18679 * dbgcnt.def (phiopt_edge_range): New counter.
18680 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
18681 Check to make sure the new name is defined in the same
18682 bb as the conditional before duplicating range info.
18683 Also add debug counter.
18685 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
18687 PR rtl-optimization/100328
18688 * config/i386/i386-options.c (ix86_option_override_internal):
18689 Set param_ira_consider_dup_in_all_alts to 0.
18691 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
18693 PR rtl-optimization/100328
18694 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
18696 * ira.c (ira_get_dup_out_num): Adjust as parameter
18697 param_ira_consider_dup_in_all_alts.
18698 * params.opt (ira-consider-dup-in-all-alts): New.
18699 * ira-conflicts.c (process_regs_for_copy): Add one parameter
18700 single_input_op_has_cstr_p.
18701 (get_freq_for_shuffle_copy): New function.
18702 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
18703 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
18705 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
18707 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
18708 generate condition code aware RTL directly.
18710 2021-07-05 Andrew Pinski <apinski@marvell.com>
18712 PR tree-optimization/101039
18713 * match.pd (A CMP 0 ? A : -A): New patterns.
18714 * tree-ssa-phiopt.c (abs_replacement): Delete function.
18715 (tree_ssa_phiopt_worker): Don't call abs_replacement.
18716 Update comment about abs_replacement.
18718 2021-07-05 Andrew Pinski <apinski@marvell.com>
18720 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
18721 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
18723 2021-07-05 Andrew Pinski <apinski@marvell.com>
18725 * tree-ssa-phiopt.c (match_simplify_replacement):
18726 Add early_p argument. Call gimple_simplify_phiopt
18727 instead of gimple_simplify.
18728 (tree_ssa_phiopt_worker): Update call to
18729 match_simplify_replacement and allow unconditionally.
18730 (phiopt_early_allow): New function.
18731 (gimple_simplify_phiopt): New function.
18733 2021-07-05 Andrew Pinski <apinski@marvell.com>
18735 PR middle-end/101237
18736 * fold-const.c (negate_expr_p): Remove call to element_mode
18737 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
18738 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
18739 (fold_negate_expr_1): Likewise.
18740 (const_unop): Likewise.
18741 (fold_cond_expr_with_comparison): Likewise.
18742 (fold_binary_loc): Likewise.
18743 (fold_ternary_loc): Likewise.
18744 (tree_call_nonnegative_warnv_p): Likewise.
18745 * match.pd (-(A + B) -> (-B) - A): Likewise.
18747 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
18749 * configure.ac: Handle --with-dsymutil in the same way as we
18750 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
18751 Extract the type and version for the dsymutil configured or
18752 found by the default searches.
18753 * config.in: Regenerated.
18754 * configure: Regenerated.
18755 * collect2.c (do_dsymutil): Handle locating dsymutil in the
18756 same way as for the assembler and linker.
18757 * config/darwin.h (DSYMUTIL): Delete.
18758 * gcc.c: Report a configured dsymutil correctly.
18759 * exec-tool.in: Allow for dsymutil.
18761 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
18763 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
18764 Handle V4QI and V2HI modes.
18765 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
18766 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
18767 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
18768 Handle 4-byte dual operands with TARGET_XOP and single operands
18769 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
18770 mmx_pshufbv4qi3 for TARGET_SSSE3.
18771 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
18772 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
18773 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
18774 (expand_vec_perm_even_odd_1): Handle V4QI mode.
18775 (expand_vec_perm_broadcast_1): Handle V4QI mode.
18776 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
18777 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
18778 (mmx_pshufbv4qi3): Ditto.
18779 (*mmx_pblendw32): Ditto.
18780 (*mmx_pblendw64): Rename from *mmx_pblendw.
18781 (mmx_punpckhbw_low): New insn_and_split pattern.
18782 (mmx_punpcklbw_low): Ditto.
18784 2021-07-05 Richard Biener <rguenther@suse.de>
18786 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
18787 set LOOP_C_INFINITE on the vectorized loop.
18789 2021-07-05 Richard Biener <rguenther@suse.de>
18791 PR middle-end/101291
18792 * cfgloopmanip.c (loop_version): Set the loop copy of the
18793 versioned loop to the new loop.
18795 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
18798 * config.gcc: Ensure that Darwin biarch definitions are
18799 added before i386.h.
18800 * config/i386/darwin.h (TARGET_64BIT): Remove.
18801 (PR80556_WORKAROUND): New.
18802 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
18803 (DARWIN_SUBARCH_SPEC): New.
18804 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
18805 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
18806 (REAL_LIBGCC_SPEC): Remove.
18807 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
18808 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
18809 (REAL_LIBGCC_SPEC): Remove.
18811 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
18813 PR middle-end/101294
18814 * expr.c (store_constructor): Don't use vec_duplicate on vector.
18816 2021-07-02 Martin Sebor <msebor@redhat.com>
18818 PR middle-end/98871
18819 PR middle-end/98512
18820 * diagnostic.c (get_any_inlining_info): New.
18821 (update_effective_level_from_pragmas): Handle inlining context.
18822 (diagnostic_enabled): Same.
18823 (diagnostic_report_diagnostic): Same.
18824 * diagnostic.h (struct diagnostic_info): Add ctor.
18825 (struct diagnostic_context): Add new member.
18826 * tree-diagnostic.c (set_inlining_locations): New.
18827 (tree_diagnostics_defaults): Set new callback pointer.
18829 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
18831 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
18833 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
18834 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
18835 lxvp and stxvp built-ins.
18836 (mma_init_builtins): Handle lxvp and stxvp built-ins.
18837 (builtin_function_type): Likewise.
18838 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
18840 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18842 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
18843 additional argument for the code.
18844 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
18845 argument for the code. Just return if the ZN bits are useful or
18846 not rather than the old style CC_* enums.
18847 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
18848 more generic shiftqi patterns.
18849 (shifthi_noscratch, shiftsi_noscratch): Similarly.
18850 (shiftqi_noscratch_set_flags): New pattern.
18851 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
18853 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
18855 PR tree-optimization/101223
18856 * range-op.cc (build_lt): Add -1 for signed values.
18857 (built_gt): Subtract -1 for signed values.
18859 2021-07-02 David Faust <david.faust@oracle.com>
18861 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
18862 (btf_asm_type): Likewise.
18864 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18866 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
18867 an array of rtx rather than a pointer to rtx. Add code argument.
18868 (compute_a_shift_length): Similarly.
18869 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
18870 shift itself isn't an operand. Create dummy operand[0] to carry
18871 a mode and pass a suitable rtx code to compute_a_shift_length.
18872 (get_shift_alg): Adjust operand number of clobber in output templates.
18873 (output_a_shift): Make first argument an array of rtx rather than
18874 a pointer to rtx. Add code argument for the type of shift.
18875 Adjust now that the shift itself is no longer an operand.
18876 (compute_a_shift_length): Similarly.
18877 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
18878 iterator rather than nshift_operator.
18879 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
18880 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
18881 and compute_a_shift_length.
18882 (shiftqi_noscratch_clobber_flags): Likewise.
18883 (shifthi_noscratch_clobber_flags): Likewise.
18884 (shiftsi_noscratch_clobber_flags): Likewise.
18886 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
18889 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
18890 dsymutil for BTF/CTF.
18892 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
18895 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
18896 segment to include BTF.
18897 (BTF_INFO_SECTION_NAME): New.
18899 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18901 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
18902 (small_data_operand, memreg_operand, small_insn_p): Likewise.
18903 * config/m32r/m32r.c (call_operand): Adjust return type.
18904 (small_data_operand, memreg_operand): Likewise.
18906 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18908 * config/frv/frv-protos.h (integer_register_operand): Adjust return
18910 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
18911 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
18912 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
18913 (move_source_operand, move_destination_operand): Likewise.
18914 (condexec_source_operand, condexec_dest_operand): Likewise.
18915 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
18916 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
18917 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
18918 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
18919 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
18920 (even_fpr_operand, odd_fpr_operand): Likewise.
18921 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
18922 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
18923 (uint4_operand, uint1_operand, int_2word_operand): Likewise
18924 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
18925 (relational_operator, float_relational_operator): Likewise.
18926 (ccr_eqne_operator, minmax_operator): Likewise.
18927 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
18928 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
18929 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
18930 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
18931 (quad_acc_operand, accg_operand): Likewise.
18933 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
18935 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
18936 return type to a bool.
18937 (nonimmediate_nonstack_operand): Likewise.
18938 (xstormy16_splittable_below100_operand): Likewise.
18939 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
18941 (xstormy16_splittable_below100_operand): Likewise.
18943 2021-07-02 Richard Biener <rguenther@suse.de>
18945 PR tree-optimization/101293
18946 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
18947 with combined offsets.
18948 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
18949 combined with the rest of the offset.
18951 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
18953 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
18954 PIC encodings for PE-COFF targets.
18956 2021-07-02 Jakub Jelinek <jakub@redhat.com>
18959 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
18960 Return nullptr for TImode inner mode.
18962 2021-07-02 Richard Biener <rguenther@suse.de>
18964 PR tree-optimization/101280
18965 PR tree-optimization/101173
18966 * gimple-loop-interchange.cc
18967 (tree_loop_interchange::valid_data_dependences): Properly
18968 guard all dependence checks with DDR_REVERSED_P or its
18971 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
18973 * config/i386/i386-expand.c (ix86_expand_builtin):
18974 Add branch to clear odata when ZF is set for asedecenc_expand
18975 and wideaesdecenc_expand.
18977 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
18979 * config/i386/gcc-auto-profile: regenerate
18981 2021-07-02 liuhongt <hongtao.liu@intel.com>
18983 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
18984 (trunc<mode><pmov_dst_4_lower>2): this.
18986 2021-07-01 David Malcolm <dmalcolm@redhat.com>
18988 * diagnostic.h (diagnostic_context::m_file_cache): New field.
18989 * input.c (class fcache): Rename to...
18990 (class file_cache_slot): ...this, making most members private and
18991 prefixing fields with "m_".
18992 (file_cache_slot::get_file_path): New accessor.
18993 (file_cache_slot::get_use_count): New accessor.
18994 (file_cache_slot::missing_trailing_newline_p): New accessor.
18995 (file_cache_slot::inc_use_count): New.
18996 (fcache_buffer_size): Move to...
18997 (file_cache_slot::buffer_size): ...here.
18998 (fcache_line_record_size): Move to...
18999 (file_cache_slot::line_record_size): ...here.
19000 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
19001 (fcache_tab_size): Move to file_cache::num_file_slots.
19002 (diagnostic_file_cache_init): Update for move of fcache_tab
19003 to global_dc->m_file_cache.
19004 (diagnostic_file_cache_fini): Likewise.
19005 (lookup_file_in_cache_tab): Convert to...
19006 (file_cache::lookup_file): ...this.
19007 (diagnostics_file_cache_forcibly_evict_file): Update for move of
19008 fcache_tab to global_dc->m_file_cache, moving most of
19009 implementation to...
19010 (file_cache::forcibly_evict_file): ...this new function and...
19011 (file_cache_slot::evict): ...this new function.
19012 (evicted_cache_tab_entry): Convert to...
19013 (file_cache::evicted_cache_tab_entry): ...this.
19014 (add_file_to_cache_tab): Convert to...
19015 (file_cache::add_file): ...this, moving bulk of implementation
19017 (file_cache_slot::create): ..this new function.
19018 (file_cache::file_cache): New.
19019 (file_cache::~file_cache): New.
19020 (lookup_or_add_file_to_cache_tab): Convert to...
19021 (file_cache::lookup_or_add_file): ..this new function.
19022 (fcache::fcache): Rename to...
19023 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
19025 (fcache::~fcache): Rename to...
19026 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
19028 (needs_read): Convert to...
19029 (file_cache_slot::needs_read_p): ...this.
19030 (needs_grow): Convert to...
19031 (file_cache_slot::needs_grow_p): ...this.
19032 (maybe_grow): Convert to...
19033 (file_cache_slot::maybe_grow): ...this.
19034 (read_data): Convert to...
19035 (file_cache_slot::read_data): ...this.
19036 (maybe_read_data): Convert to...
19037 (file_cache_slot::maybe_read_data): ...this.
19038 (get_next_line): Convert to...
19039 (file_cache_slot::get_next_line): ...this.
19040 (goto_next_line): Convert to...
19041 (file_cache_slot::goto_next_line): ...this.
19042 (read_line_num): Convert to...
19043 (file_cache_slot::read_line_num): ...this.
19044 (location_get_source_line): Update for moving of globals to
19045 global_dc->m_file_cache.
19046 (location_missing_trailing_newline): Likewise.
19047 * input.h (class file_cache_slot): New forward decl.
19048 (class file_cache): New.
19050 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
19052 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
19053 128-bit floating point conditional move support.
19054 (have_compare_and_set_mask): Add IEEE 128-bit floating point
19056 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
19057 (mov<mode>cc_p10, IEEE128 iterator): New insn.
19058 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
19059 (fpmask<mode>, IEEE128 iterator): New insn.
19060 (xxsel<mode>, IEEE128 iterator): New insn.
19062 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
19065 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
19067 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
19069 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
19071 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
19073 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
19074 (vec_duplicate<mode>): New expander.
19076 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
19079 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
19081 (ix86_byte_broadcast): New function.
19082 (ix86_convert_const_wide_int_to_broadcast): Likewise.
19083 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
19084 size is 16 bytes or bigger.
19085 (ix86_broadcast_from_integer_constant): New function.
19086 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
19087 to broadcast if mode size is 16 bytes or bigger.
19088 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
19090 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
19092 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19094 * config/i386/predicates.md (ix86_endbr_immediate_operand):
19095 Return true/false instead of 1/0.
19096 (movq_parallel): Ditto.
19098 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19100 * recog.c (general_operand): Return true/false instead of 1/0.
19101 (register_operand): Ditto.
19102 (immediate_operand): Ditto.
19103 (const_int_operand): Ditto.
19104 (const_scalar_int_operand): Ditto.
19105 (const_double_operand): Ditto.
19106 (push_operand): Ditto.
19107 (pop_operand): Ditto.
19108 (memory_operand): Ditto.
19109 (indirect_operand): Ditto.
19111 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19113 * genpreds.c (write_predicate_subfunction):
19114 Change the type of written subfunction to bool.
19115 (write_one_predicate_function):
19116 Change the type of written function to bool.
19117 (write_tm_preds_h): Ditto.
19118 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
19119 * recog.c (general_operand): Change the type to bool.
19120 (address_operand): Ditto.
19121 (register_operand): Ditto.
19122 (pmode_register_operand): Ditto.
19123 (scratch_operand): Ditto.
19124 (immediate_operand): Ditto.
19125 (const_int_operand): Ditto.
19126 (const_scalar_int_operand): Ditto.
19127 (const_double_operand): Ditto.
19128 (nonimmediate_operand): Ditto.
19129 (nonmemory_operand): Ditto.
19130 (push_operand): Ditto.
19131 (pop_operand): Ditto.
19132 (memory_operand): Ditto.
19133 (indirect_operand): Ditto.
19134 (ordered_comparison_operator): Ditto.
19135 (comparison_operator): Ditto.
19136 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
19137 Change the type of indirect predicate function to bool.
19138 * config/rs6000/rs6000.c (easy_vector_constant):
19139 Change the type to bool.
19140 * config/mips/mips-protos.h (m16_based_address_p):
19141 Change the type of operand 3 to bool.
19143 2021-07-01 Richard Biener <rguenther@suse.de>
19145 PR tree-optimization/101280
19146 PR tree-optimization/101173
19147 * gimple-loop-interchange.cc
19148 (tree_loop_interchange::valid_data_dependences): Revert
19149 previous change and instead correctly handle DDR_REVERSED_P
19152 2021-07-01 Richard Biener <rguenther@suse.de>
19154 PR tree-optimization/101278
19155 * tree-ssa-dse.c (dse_classify_store): First check for
19156 uses, then ignore stmt for chaining purposes.
19158 2021-07-01 Richard Biener <rguenther@suse.de>
19160 PR tree-optimization/100778
19161 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
19162 vectorized ops ahead of their scalar BB.
19164 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
19167 * config/i386/i386.md (*nabs<dwi>2_doubleword):
19168 New insn_and_split pattern.
19169 (*nabs<dwi>2_1): Ditto.
19170 * config/i386/i386-features.c
19171 (general_scalar_chain::compute_convert_gain):
19172 Handle (NEG (ABS (...))) RTX. Rewrite src code
19173 scanner as switch statement.
19174 (general_scalar_chain::convert_insn):
19175 Handle (NEG (ABS (...))) RTX.
19176 (general_scalar_to_vector_candidate_p):
19177 Detect (NEG (ABS (...))) RTX. Reorder case statements
19178 for (AND (NOT (...) ...)) fallthrough.
19180 2021-07-01 Richard Biener <rguenther@suse.de>
19182 PR tree-optimization/101178
19183 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
19184 (slpg::perm_in): Add.
19185 (slpg::get_perm_in): Remove.
19186 (slpg::get_perm_materialized): Add.
19187 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
19188 during permute propagation and materialization.
19190 2021-07-01 Jakub Jelinek <jakub@redhat.com>
19193 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
19195 2021-07-01 Jakub Jelinek <jakub@redhat.com>
19197 PR middle-end/94366
19198 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
19199 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
19200 use boolean_type_node instead of integer_type_node as NE_EXPR type.
19201 (lower_reduction_clauses): Likewise.
19203 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19205 * config/gcn/gcn.c: Include dwarf2.h.
19206 (gcn_addr_space_debug): New function.
19207 (TARGET_ADDR_SPACE_DEBUG): New hook.
19209 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19211 * common/config/gcn/gcn-common.c
19212 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
19213 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
19215 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
19216 (gcn_frame_pointer_rqd): New function.
19217 (TARGET_FRAME_POINTER_REQUIRED): New hook.
19219 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
19221 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
19222 prologue register saves.
19223 (gcn_debug_unwind_info): Use UI_DWARF2.
19224 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
19225 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
19226 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
19227 (DWARF_LINK_REGISTER): New define.
19228 (FIRST_PSEUDO_REGISTER): Increment.
19229 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
19230 (CALL_USED_REGISTERS): Likewise.
19231 (REGISTER_NAMES): Likewise.
19233 2021-06-30 Richard Biener <rguenther@suse.de>
19235 PR tree-optimization/101267
19236 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
19237 API and use SLP compatible interface of vect_is_simple_use.
19238 Reject not vectorized SLP defs for callers that do not support
19240 (vect_check_store_rhs): Handle masked stores and pass down
19241 the appropriate operator index.
19242 (vectorizable_call): Adjust.
19243 (vectorizable_store): Likewise.
19244 (vectorizable_load): Likewise. Handle SLP pecularity of
19246 (vect_is_simple_use): Remove special-casing of masked stores.
19248 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
19250 * common.opt (foffload): Remove help as Driver only.
19251 * gcc.c (display_help): Add -foffload.
19253 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
19255 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
19256 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
19257 a candidate list; better inform no offload target is configured
19258 and fix hint extraction when passed target is not '\0' at [len].
19259 * common.opt (foffload): Add tailing '.'.
19260 (foffload-options): Likewise; fix flag name in the help string.
19262 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
19265 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
19266 (vcage_f32): Gate comparison on __FAST_MATH__.
19267 (vcageq_f32): Likewise.
19268 (vcale_f32): Likewise.
19269 (vcaleq_f32): Likewise.
19270 (vcagt_f32): Likewise.
19271 (vcagtq_f32): Likewise.
19272 (vcalt_f32): Likewise.
19273 (vcaltq_f32): Likewise.
19274 (vcage_f16): Likewise.
19275 (vcageq_f16): Likewise.
19276 (vcale_f16): Likewise.
19277 (vcaleq_f16): Likewise.
19278 (vcagt_f16): Likewise.
19279 (vcagtq_f16): Likewise.
19280 (vcalt_f16): Likewise.
19281 (vcaltq_f16): Likewise.
19283 2021-06-30 Richard Biener <rguenther@suse.de>
19285 PR tree-optimization/101264
19286 * tree-vect-slp.c (vect_optimize_slp): Propagate the
19287 computed perm_in to all "any" permute successors
19288 we cannot de-duplicate immediately.
19290 2021-06-30 liuhongt <hongtao.liu@intel.com>
19293 * config/i386/sse.md
19294 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
19296 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
19298 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
19299 * config/i386/subst.md (maskz_scalar): New define_subst.
19300 (maskz_scalar_name): New subst_attr.
19301 (maskz_scalar_op5): Ditto.
19302 (round_saeonly_maskz_scalar_op5): Ditto.
19303 (round_saeonly_maskz_scalar_operand5): Ditto.
19305 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
19307 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
19308 Increase code CSECT alignment to at least 32 bytes.
19309 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
19310 alignment designation.
19312 2021-06-29 Sergei Trofimovich <siarheit@google.com>
19314 * doc/generic.texi: Fix s/net yet/not yet/ typo.
19316 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19318 PR tree-optimization/101254
19319 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
19320 wrapping/non-wrapping when setting the result range.
19322 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19324 * value-query.cc (gimple_range_global): Allow phis.
19326 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
19328 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
19329 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
19330 statement for location context.
19331 (check_for_binary_op_overflow): Ditto.
19332 (simplify_using_ranges::get_vr_for_comparison): Ditto.
19333 (simplify_using_ranges::compare_name_with_value): Ditto.
19334 (simplify_using_ranges::compare_names): Ditto.
19335 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
19336 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
19337 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
19338 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
19339 (simplify_using_ranges::two_valued_val_range_p): Ditto.
19340 (simplify_using_ranges::simplify): Ditto.
19341 * vr-values.h: Adjust prototypes.
19343 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
19346 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
19348 2021-06-29 Julian Brown <julian@codesourcery.com>
19350 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
19351 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
19352 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
19354 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
19355 (MAX_FIXED_MODE_SIZE): Change to 128.
19357 2021-06-29 Julian Brown <julian@codesourcery.com>
19359 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
19360 (s_mnemonic): Add clrsb.
19361 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
19362 (clrsb<mode>2): Add expander for SImode/DImode.
19364 2021-06-29 Julian Brown <julian@codesourcery.com>
19366 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
19367 muldi3): Add patterns.
19369 2021-06-29 Julian Brown <julian@codesourcery.com>
19371 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
19372 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
19374 2021-06-29 Julian Brown <julian@codesourcery.com>
19376 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
19378 2021-06-29 Joseph Myers <joseph@codesourcery.com>
19380 * btfout.c, ctfout.c: Include "memmodel.h".
19382 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19384 * gcc.c (check_offload_target_name): Cast len argument to
19385 %q.*s to 'int'; avoid -Wstringop-truncation warning.
19387 2021-06-29 Richard Biener <rguenther@suse.de>
19389 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
19390 to "any" permute nodes and relax "any" permute proapgation
19391 during iterative backward propagation.
19393 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19396 * common.opt (-foffload=): Update description.
19397 (-foffload-options=): New.
19398 * doc/invoke.texi (C Language Options): Document
19399 -foffload and -foffload-options.
19400 * gcc.c (check_offload_target_name): New, split off from
19401 handle_foffload_option.
19402 (check_foffload_target_names): New.
19403 (handle_foffload_option): Handle -foffload=default.
19404 (driver_handle_option): Update for -foffload-options.
19405 * lto-opts.c (lto_write_options): Use -foffload-options
19406 instead of -foffload.
19407 * lto-wrapper.c (merge_and_complain, append_offload_options):
19409 * opts.c (common_handle_option): Likewise.
19411 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
19413 * doc/invoke.texi (C Language Options): Sort options
19414 alphabetically in optlist and also the description itself.
19415 Remove leftover -fallow-single-precision from and add missing
19416 -fgnu-tm to the optlist.
19418 2021-06-29 Richard Biener <rguenther@suse.de>
19420 * tree-vect-slp.c (slpg_vertex::visited): Remove.
19421 (vect_slp_perms_eq): Handle -1 permutes.
19422 (vect_optimize_slp): Rewrite permute propagation.
19424 2021-06-29 Jakub Jelinek <jakub@redhat.com>
19427 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
19428 perform the optimization in GENERIC when sanitizing and x has a
19431 2021-06-29 Richard Biener <rguenther@suse.de>
19433 PR tree-optimization/101242
19434 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
19435 PHIs with not represented initial values as leafs.
19437 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
19439 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
19440 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
19441 "register" keyword.
19442 (pdp11_initial_elimination_offset) Remove unused variable.
19443 (pdp11_cmp_length) Ditto.
19444 (pdp11_insn_cost): Ditto, and fix signedness warning.
19446 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
19448 * btfout.c: Include tm_p.h.
19451 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19453 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
19455 (bpf_expand_epilogue): Likewise.
19456 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
19457 Do not define DBX_DEBUGGING_INFO.
19459 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19461 * doc/invoke.texi: Document the CTF and BTF debug info options.
19463 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
19464 David Faust <david.faust@oracle.com>
19465 Jose E. Marchesi <jose.marchesi@oracle.com>
19466 Weimin Pan <weimin.pan@oracle.com>
19468 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
19469 GTFILES. Add new object files.
19470 * common.opt: Add CTF and BTF debug info options.
19471 * btfout.c: New file.
19472 * ctfc.c: Likewise.
19473 * ctfc.h: Likewise.
19474 * ctfout.c: Likewise.
19475 * dwarf2ctf.c: Likewise.
19476 * dwarf2ctf.h: Likewise.
19477 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
19479 * dwarf2out.c (dwarf2out_source_line): Likewise.
19480 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
19482 (debug_format_do_cu): New function.
19483 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
19485 Include dwarf2ctf.c.
19486 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
19488 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
19489 (CTF_DEBUG): New bitmask.
19490 (BTF_DEBUG): Likewise.
19491 (enum ctf_debug_info_levels): New enum.
19492 * gengtype.c (open_base_files): Handle ctfc.h.
19493 (main): Handle uint32_t type.
19494 * flags.h (btf_debuginfo_p): New definition.
19495 (dwarf_based_debuginfo_p): Likewise.
19496 * opts.c (debug_type_names): Add entries for CTF and BTF.
19497 (btf_debuginfo_p): New function.
19498 (dwarf_based_debuginfo_p): Likewise.
19499 (common_handle_option): Handle -gctfN and -gbtf options.
19500 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
19501 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
19504 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
19506 * dwarf2out.c (AT_class): Function is no longer static.
19507 (AT_int): Likewise.
19508 (AT_unsigned): Likewise.
19509 (AT_loc): Likewise.
19510 (get_AT): Likewise.
19511 (get_AT_string): Likewise.
19512 (get_AT_flag): Likewise.
19513 (get_AT_unsigned): Likewise.
19514 (get_AT_ref): Likewise.
19515 (new_die_raw): Likewise.
19516 (lookup_decl_die): Likewise.
19517 (base_type_die): Likewise.
19518 (add_name_attribute): Likewise.
19519 (add_AT_int): Likewise.
19520 (add_AT_unsigned): Likewise.
19521 (add_AT_loc): Likewise.
19522 (dw_get_die_tag): New function.
19523 (dw_get_die_child): Likewise.
19524 (dw_get_die_sib): Likewise.
19525 (struct dwarf_file_data): Move from here to dwarf2out.h
19526 (struct dw_attr_struct): Likewise.
19527 * dwarf2out.h: Analogous changes.
19529 2021-06-28 Martin Jambor <mjambor@suse.cz>
19532 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
19533 members m_dead_stmts and m_dead_ssas.
19534 * ipa-param-manipulation.c
19535 (ipa_param_body_adjustments::mark_dead_statements): New function.
19536 (ipa_param_body_adjustments::common_initialization): Call it on
19537 all removed but not split parameters.
19538 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
19540 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
19542 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
19543 dead debug statements.
19544 (copy_phis_for_bb): Do not copy dead PHI nodes.
19546 2021-06-28 Martin Jambor <mjambor@suse.cz>
19549 * symtab-clones.h (clone_info): Removed member param_adjustments.
19550 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
19551 deal with pass-through splits now.
19552 (ipa_param_performed_split): Removed.
19553 (ipa_param_adjustments::modify_call): Adjusted parameters.
19554 (class ipa_param_body_adjustments): Adjusted parameters of
19555 register_replacement, modify_gimple_stmt and modify_call_stmt.
19556 (ipa_verify_edge_has_no_modifications): Declare.
19557 (ipa_edge_modifications_finalize): Declare.
19558 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
19559 performed_splits processing, pas only edge to padjs->modify_call,
19560 check that call arguments were not modified if they should not have
19562 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
19564 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
19565 (ipa_edge_modification_info): Likewise.
19566 (ipa_edge_modification_sum): Likewise.
19567 (ipa_edge_modifications): New edge summary.
19568 (ipa_verify_edge_has_no_modifications): New function.
19569 (transitive_split_p): Removed.
19570 (transitive_split_map): Likewise.
19571 (init_transitive_splits): Likewise.
19572 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
19573 summary instead of performed_splits.
19574 (ipa_param_body_adjustments::register_replacement): Drop dummy
19575 parameter, set base_index of the created ipa_param_body_replacement.
19576 (phi_arg_will_live_p): New function.
19577 (ipa_param_body_adjustments::common_initialization): Do not create
19578 IPA_SRA dummy decls.
19579 (simple_tree_swap_info): Removed.
19580 (remap_split_decl_to_dummy): Likewise.
19581 (record_argument_state_1): New function.
19582 (record_argument_state): Likewise.
19583 (ipa_param_body_adjustments::modify_call_stmt): New parameter
19584 orig_stmt. Do not work with dummy decls, save necessary info about
19585 changes to ipa_edge_modifications.
19586 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
19587 orig_stmt, pass it to modify_call_stmt.
19588 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
19589 modify_gimple_stmt.
19590 (ipa_edge_modifications_finalize): New function.
19591 * tree-inline.c (remap_gimple_stmt): Pass original statement to
19592 modify_gimple_stmt.
19593 (copy_phis_for_bb): Do not copy dead PHI nodes.
19594 (expand_call_inline): Do not remap performed_splits.
19595 (update_clone_info): Likewise.
19596 * toplev.c: Include ipa-param-manipulation.h.
19597 (toplev::finalize): Call ipa_edge_modifications_finalize.
19599 2021-06-28 Andrew Pinski <apinski@marvell.com>
19601 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
19602 info if we're the only things setting the target PHI.
19603 (value_replacement): Don't duplicate range here.
19604 (minmax_replacement): Likewise.
19606 2021-06-28 Richard Biener <rguenther@suse.de>
19608 PR tree-optimization/101229
19609 * gimple-walk.c (gimple_walk_op): Handle PHIs.
19611 2021-06-28 Martin Liska <mliska@suse.cz>
19613 * config/v850/v850.c (construct_dispose_instruction): Allocate
19615 (construct_prepare_instruction): Likewise.
19617 2021-06-28 Martin Liska <mliska@suse.cz>
19619 * config/v850/v850.c (v850_option_override): Build default
19621 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
19622 ignored for inlining.
19623 (TARGET_CAN_INLINE_P): New.
19625 2021-06-28 Richard Biener <rguenther@suse.de>
19627 PR tree-optimization/101207
19628 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
19629 permute eliding for load permutations properly.
19631 2021-06-28 Richard Biener <rguenther@suse.de>
19633 PR tree-optimization/101173
19634 * gimple-loop-interchange.cc
19635 (tree_loop_interchange::valid_data_dependences): Disallow outer
19636 loop dependence distance of zero.
19638 2021-06-28 liuhongt <hongtao.liu@intel.com>
19641 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
19642 define_insn_and_split.
19643 (*avx_cmp<mode>3_ltint): Ditto.
19644 (*avx2_pcmp<mode>3_3): Ditto.
19645 (*avx2_pcmp<mode>3_4): Ditto.
19646 (*avx2_pcmp<mode>3_5): Ditto.
19648 2021-06-28 liuhongt <hongtao.liu@intel.com>
19650 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
19651 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
19652 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
19653 IX86_BUILTIN_PBLENDVB128): Replace icode with
19655 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
19657 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
19658 New pre_reload splitter.
19660 2021-06-27 Andrew Pinski <apinski@marvell.com>
19662 PR middle-end/101230
19663 * fold-const.c (fold_ternary_loc): Check
19664 the return value of invert_tree_comparison.
19666 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
19668 * config.gcc: Add SPDX License Identifier.
19669 (powerpc-ibm-aix789): Default to aix73.h.
19670 (powerpc-ibm-aix7.2.*.*): New stanza.
19671 * config/rs6000/aix72.h: Add SPDX License Identifier.
19672 * config/rs6000/aix73.h: New file.
19674 2021-06-26 Jason Merrill <jason@redhat.com>
19676 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
19678 2021-06-26 Andrew Pinski <apinski@marvell.com>
19680 * genmatch.c (lower_cond): Copy for_subst_vec
19681 for the simplify also.
19682 (lower): Swap the order for lower_for and lower_cond.
19684 2021-06-26 Andrew Pinski <apinski@marvell.com>
19686 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
19687 flow senatitive info on the moved ssa set.
19689 2021-06-26 Andrew Pinski <apinski@marvell.com>
19691 * fold-const.c (fold_cond_expr_with_comparison):
19692 Exand arg0 into comp_code, arg00, and arg01.
19693 (fold_ternary_loc): Use invert_tree_comparison
19694 instead of fold_invert_truthvalue for the case
19695 where we have A CMP B ? C : A.
19697 2021-06-25 Martin Sebor <msebor@redhat.com>
19699 PR middle-end/101216
19700 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
19702 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
19704 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
19706 2021-06-25 Richard Biener <rguenther@suse.de>
19708 PR tree-optimization/101202
19709 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
19712 2021-06-25 Richard Biener <rguenther@suse.de>
19714 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
19715 STMT_VINFO_REDUC_DEF from the original representative.
19717 2021-06-25 Martin Sebor <msebor@redhat.com>
19719 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
19720 gimple_no_warning_p and gimple_set_no_warning with
19721 warning_suppressed_p, and suppress_warning.
19723 (maybe_warn_for_bound): Same.
19724 (warn_for_access): Same.
19725 (check_access): Same.
19726 (expand_builtin_strncmp): Same.
19727 (fold_builtin_varargs): Same.
19728 * calls.c (maybe_warn_nonstring_arg): Same.
19729 (maybe_warn_rdwr_sizes): Same.
19730 * cfgexpand.c (expand_call_stmt): Same.
19731 * cgraphunit.c (check_global_declaration): Same.
19732 * fold-const.c (fold_undefer_overflow_warnings): Same.
19733 (fold_truth_not_expr): Same.
19734 (fold_unary_loc): Same.
19735 (fold_checksum_tree): Same.
19736 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
19737 (array_bounds_checker::check_mem_ref): Same.
19738 (array_bounds_checker::check_addr_expr): Same.
19739 (array_bounds_checker::check_array_bounds): Same.
19740 * gimple-expr.c (copy_var_decl): Same.
19741 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
19742 (gimple_fold_builtin_strncat): Same.
19743 (gimple_fold_builtin_stxcpy_chk): Same.
19744 (gimple_fold_builtin_stpcpy): Same.
19745 (gimple_fold_builtin_sprintf): Same.
19746 (fold_stmt_1): Same.
19747 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
19748 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
19749 * gimple-ssa-sprintf.c (handle_printf_call): Same.
19750 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
19751 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
19752 * gimple-ssa-warn-restrict.h: Adjust declarations.
19753 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
19754 gimple_no_warning_p and gimple_set_no_warning with
19755 warning_suppressed_p, and suppress_warning.
19756 (check_call): Same.
19757 (check_bounds_or_overlap): Same.
19758 * gimple.c (gimple_build_call_from_tree): Same.
19759 * gimplify.c (gimplify_return_expr): Same.
19760 (gimplify_cond_expr): Same.
19761 (gimplify_modify_expr_complex_part): Same.
19762 (gimplify_modify_expr): Same.
19763 (gimple_push_cleanup): Same.
19764 (gimplify_expr): Same.
19765 * omp-expand.c (expand_omp_for_generic): Same.
19766 (expand_omp_taskloop_for_outer): Same.
19767 * omp-low.c (lower_rec_input_clauses): Same.
19768 (lower_lastprivate_clauses): Same.
19769 (lower_send_clauses): Same.
19770 (lower_omp_target): Same.
19771 * tree-cfg.c (pass_warn_function_return::execute): Same.
19772 * tree-complex.c (create_one_component_var): Same.
19773 * tree-inline.c (remap_gimple_op_r): Same.
19774 (copy_tree_body_r): Same.
19775 (declare_return_variable): Same.
19776 (expand_call_inline): Same.
19777 * tree-nested.c (lookup_field_for_decl): Same.
19778 * tree-sra.c (create_access_replacement): Same.
19779 (generate_subtree_copies): Same.
19780 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
19781 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
19782 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
19783 * tree-ssa-loop-im.c (execute_sm): Same.
19784 * tree-ssa-phiopt.c (cond_store_replacement): Same.
19785 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
19786 (handle_builtin_strcpy): Same.
19787 (maybe_diag_stxncpy_trunc): Same.
19788 (handle_builtin_stxncpy_strncat): Same.
19789 (handle_builtin_strcat): Same.
19790 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
19791 (set_no_uninit_warning): Same.
19792 (uninit_undefined_value_p): Same.
19793 (warn_uninit): Same.
19794 (maybe_warn_operand): Same.
19795 * tree-vrp.c (compare_values_warnv): Same.
19796 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
19797 (test_for_singularity): Same.
19798 * gimple.h (warning_suppressed_p): New function.
19799 (suppress_warning): Same.
19800 (copy_no_warning): Same.
19801 (gimple_set_block): Call gimple_set_location.
19802 (gimple_set_location): Call copy_warning.
19804 2021-06-25 Martin Sebor <msebor@redhat.com>
19806 * tree.h (warning_suppressed_at, copy_warning,
19807 warning_suppressed_p, suppress_warning): New functions.
19809 2021-06-25 Martin Sebor <msebor@redhat.com>
19811 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
19812 * gengtype.c (open_base_files): Add diagnostic-spec.h.
19813 * diagnostic-spec.c: New file.
19814 * diagnostic-spec.h: New file.
19815 * tree.h (no_warning, all_warnings, suppress_warning_at): New
19817 * warning-control.cc: New file.
19819 2021-06-25 liuhongt <hongtao.liu@intel.com>
19822 * config/i386/i386.c (x86_order_regs_for_local_alloc):
19825 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
19827 PR tree-optimization/101189
19828 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
19829 LHS range of condition to postfold routine.
19830 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
19831 FALSE edge if the LHS range supports it being taken.
19832 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
19834 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
19836 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
19837 (relation_oracle::find_relation_block): Check correct bitmap.
19838 (relation_oracle::dump): Do not dump NULL blocks.
19840 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
19842 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
19843 range_on_edge instead of manually calculating.
19845 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
19847 * range-op.cc: Fix comment.
19849 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
19852 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
19853 Handle V8QI and V4HI modes.
19854 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
19856 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
19857 (mmxpackmode): New mode attribute.
19858 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
19859 (mmxunpackmode): New mode attribute.
19860 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
19861 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
19862 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
19863 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
19864 * config/i386/i386.md (extsuffix): Move from ...
19865 * config/i386/sse.md: ... here.
19867 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
19869 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
19870 (dwarf2out_finish): ...instead of here.
19872 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
19874 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
19875 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
19876 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
19877 * configure: Regenerate.
19879 2021-06-24 Richard Biener <rguenther@suse.de>
19881 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
19882 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
19883 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
19885 2021-06-24 Richard Biener <rguenther@suse.de>
19887 * config/i386/sse.md (avx_addsubv4df3): Rename to
19889 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
19890 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
19891 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
19892 * config/i386/i386-builtin.def: Adjust.
19893 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
19894 * optabs.def (vec_addsub_optab): New optab.
19895 * tree-vect-slp-patterns.c (class addsub_pattern): New.
19896 (slp_patterns): Add addsub_pattern.
19897 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
19898 across CFN_VEC_ADDSUB.
19899 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
19901 * doc/md.texi (vec_addsub<mode>3): Document.
19903 2021-06-24 Jakub Jelinek <jakub@redhat.com>
19905 PR middle-end/101170
19906 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
19907 where regno + subreg_regno_offset wraps around use 0 as starting
19910 2021-06-24 Jakub Jelinek <jakub@redhat.com>
19912 PR middle-end/101172
19913 * stor-layout.c (finish_bitfield_representative): If nextf has
19914 error_mark_node type, set repr type to error_mark_node too.
19916 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
19918 * config/s390/s390.c (s390_function_profiler): Ignore labelno
19920 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
19922 2021-06-24 Richard Biener <rguenther@suse.de>
19924 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
19925 across operations that have different semantics on different
19928 2021-06-24 Jakub Jelinek <jakub@redhat.com>
19930 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
19931 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
19932 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
19933 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
19934 OMP_TARGET user outer_ctx instead of ctx for placeholders and
19935 initializer/combiner gimplification.
19936 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
19937 on target constructs.
19938 (lower_rec_input_clauses): Likewise.
19939 (lower_omp_target): Likewise.
19940 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
19941 on target if in_reduction is present.
19943 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
19945 * tree-predcom.c (class pcom_worker): New class.
19946 (release_chain): Renamed to...
19947 (pcom_worker::release_chain): ...this.
19948 (release_chains): Renamed to...
19949 (pcom_worker::release_chains): ...this.
19950 (aff_combination_dr_offset): Renamed to...
19951 (pcom_worker::aff_combination_dr_offset): ...this.
19952 (determine_offset): Renamed to...
19953 (pcom_worker::determine_offset): ...this.
19954 (class comp_ptrs): New class.
19955 (split_data_refs_to_components): Renamed to...
19956 (pcom_worker::split_data_refs_to_components): ...this,
19957 and update with class comp_ptrs.
19958 (suitable_component_p): Renamed to...
19959 (pcom_worker::suitable_component_p): ...this.
19960 (filter_suitable_components): Renamed to...
19961 (pcom_worker::filter_suitable_components): ...this.
19962 (valid_initializer_p): Renamed to...
19963 (pcom_worker::valid_initializer_p): ...this.
19964 (find_looparound_phi): Renamed to...
19965 (pcom_worker::find_looparound_phi): ...this.
19966 (add_looparound_copies): Renamed to...
19967 (pcom_worker::add_looparound_copies): ...this.
19968 (determine_roots_comp): Renamed to...
19969 (pcom_worker::determine_roots_comp): ...this.
19970 (determine_roots): Renamed to...
19971 (pcom_worker::determine_roots): ...this.
19972 (single_nonlooparound_use): Renamed to...
19973 (pcom_worker::single_nonlooparound_use): ...this.
19974 (remove_stmt): Renamed to...
19975 (pcom_worker::remove_stmt): ...this.
19976 (execute_pred_commoning_chain): Renamed to...
19977 (pcom_worker::execute_pred_commoning_chain): ...this.
19978 (execute_pred_commoning): Renamed to...
19979 (pcom_worker::execute_pred_commoning): ...this.
19980 (struct epcc_data): New member worker.
19981 (execute_pred_commoning_cbck): Call execute_pred_commoning
19982 with pcom_worker pointer.
19983 (find_use_stmt): Renamed to...
19984 (pcom_worker::find_use_stmt): ...this.
19985 (find_associative_operation_root): Renamed to...
19986 (pcom_worker::find_associative_operation_root): ...this.
19987 (find_common_use_stmt): Renamed to...
19988 (pcom_worker::find_common_use_stmt): ...this.
19989 (combinable_refs_p): Renamed to...
19990 (pcom_worker::combinable_refs_p): ...this.
19991 (reassociate_to_the_same_stmt): Renamed to...
19992 (pcom_worker::reassociate_to_the_same_stmt): ...this.
19993 (stmt_combining_refs): Renamed to...
19994 (pcom_worker::stmt_combining_refs): ...this.
19995 (combine_chains): Renamed to...
19996 (pcom_worker::combine_chains): ...this.
19997 (try_combine_chains): Renamed to...
19998 (pcom_worker::try_combine_chains): ...this.
19999 (prepare_initializers_chain): Renamed to...
20000 (pcom_worker::prepare_initializers_chain): ...this.
20001 (prepare_initializers): Renamed to...
20002 (pcom_worker::prepare_initializers): ...this.
20003 (prepare_finalizers_chain): Renamed to...
20004 (pcom_worker::prepare_finalizers_chain): ...this.
20005 (prepare_finalizers): Renamed to...
20006 (pcom_worker::prepare_finalizers): ...this.
20007 (tree_predictive_commoning_loop): Renamed to...
20008 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
20009 some calls and remove some cleanup code.
20010 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
20011 (static variable looparound_phis): Remove.
20012 (static variable name_expansions): Remove.
20014 2021-06-24 Richard Biener <rguenther@suse.de>
20016 * tree-vect-slp.c (slpg_vertex): New struct.
20017 (vect_slp_build_vertices): Adjust.
20018 (vect_optimize_slp): Likewise. Maintain an outgoing permute
20019 and a materialized one.
20021 2021-06-24 Richard Biener <rguenther@suse.de>
20023 PR tree-optimization/101105
20024 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
20025 Only ignore steps when they are equal or scalar order is preserved.
20027 2021-06-24 liuhongt <hongtao.liu@intel.com>
20030 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
20031 Adjust comments for ix86_expand_vecop_qihi2.
20032 (ix86_expand_vecmul_qihi): Renamed to ..
20033 (ix86_expand_vecop_qihi2): Adjust function prototype to
20034 support shift operation, add static to definition.
20035 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
20036 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
20037 ix86_expand_vec_shift_qihi_constant.
20038 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
20039 (ix86_expand_vec_shift_qihi_constant): Deleted.
20040 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
20042 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
20043 condition TARGET_64BIT.
20044 (mul<mode>3): Ditto.
20045 (<insn><mode>3): Ditto.
20046 (vlshr<mode>3): Extend to support avx512 vlshr.
20047 (v<insn><mode>3): New expander for
20049 (v<insn>v8qi3): Ditto.
20050 (vashrv8hi3<mask_name>): Renamed to ..
20051 (vashr<mode>3): And extend to support V16QImode for avx512.
20052 (vashrv16qi3): Deleted.
20053 (vashrv2di3<mask_name>): Extend expander to support avx512
20056 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
20058 * doc/lto.texi (Design Overview): Update that slim objects are
20061 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
20063 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
20064 of OTHER_POWER10_MASKS so it will not be enabled by default.
20066 2021-06-23 Richard Biener <rguenther@suse.de>
20067 Martin Jambor <mjambor@suse.cz>
20069 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
20070 param replacement unconditionally. Adjust comment.
20072 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20074 * Makefile.in (OBJS): Add gimple-range-fold.o
20075 * gimple-range-fold.cc: New.
20076 * gimple-range-fold.h: New.
20077 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
20078 (gimple_range_calc_op2): Ditto.
20079 * gimple-range-gori.h: Move prototypes to here.
20080 * gimple-range.cc: Adjust include files.
20081 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
20082 (fur_source::get_operand): Ditto.
20083 (fur_source::get_phi_operand): Ditto.
20084 (fur_source::query_relation): Ditto.
20085 (fur_source::register_relation): Ditto.
20086 (class fur_edge): Ditto.
20087 (fur_edge::fur_edge): Ditto.
20088 (fur_edge::get_operand): Ditto.
20089 (fur_edge::get_phi_operand): Ditto.
20090 (fur_stmt::fur_stmt): Ditto.
20091 (fur_stmt::get_operand): Ditto.
20092 (fur_stmt::get_phi_operand): Ditto.
20093 (fur_stmt::query_relation): Ditto.
20094 (class fur_depend): Relocate to gimple-range-fold.h.
20095 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
20096 (fur_depend::register_relation): Ditto.
20097 (fur_depend::register_relation): Ditto.
20098 (class fur_list): Ditto.
20099 (fur_list::fur_list): Ditto.
20100 (fur_list::get_operand): Ditto.
20101 (fur_list::get_phi_operand): Ditto.
20102 (fold_range): Ditto.
20103 (adjust_pointer_diff_expr): Ditto.
20104 (gimple_range_adjustment): Ditto.
20105 (gimple_range_base_of_assignment): Ditto.
20106 (gimple_range_operand1): Ditto.
20107 (gimple_range_operand2): Ditto.
20108 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
20109 (gimple_range_calc_op2): Ditto.
20110 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
20111 (fold_using_range::range_of_range_op): Ditto.
20112 (fold_using_range::range_of_address): Ditto.
20113 (fold_using_range::range_of_phi): Ditto.
20114 (fold_using_range::range_of_call): Ditto.
20115 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
20116 (fold_using_range::range_of_builtin_call): Ditto.
20117 (fold_using_range::range_of_cond_expr): Ditto.
20118 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
20119 (fold_using_range::relation_fold_and_or): Ditto.
20120 (fold_using_range::postfold_gcond_edges): Ditto.
20121 * gimple-range.h: Add gimple-range-fold.h to include files. Change
20122 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
20123 (gimple_range_handler): Relocate to gimple-range-fold.h.
20124 (gimple_range_ssa_p): Ditto.
20125 (range_compatible_p): Ditto.
20126 (class fur_source): Ditto.
20127 (class fur_stmt): Ditto.
20128 (class fold_using_range): Ditto.
20129 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
20130 (gimple_range_calc_op2): Ditto.
20132 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20134 PR tree-optimization/101148
20135 PR tree-optimization/101014
20136 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
20137 (ranger_cache::~ranger_cache): Adjust.
20138 (ranger_cache::block_range): Check if propagation disallowed.
20139 (ranger_cache::propagate_cache): Disallow propagation if new value
20140 can't be stored properly.
20141 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
20143 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20145 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
20146 (sbr_vector::set_bb_range): Return true.
20147 (class sbr_sparse_bitmap): Adjust.
20148 (sbr_sparse_bitmap::set_bb_range): Return value.
20149 (block_range_cache::set_bb_range): Return value.
20150 (ranger_cache::propagate_cache): Use return value to print msg.
20151 * gimple-range-cache.h (class block_range_cache): Adjust.
20153 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
20155 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
20157 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
20159 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
20160 so this pattern can be used for test/compare removal. Pass
20161 current insn to compute_logical_op_length and output_logical_op.
20162 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
20163 (h8300_and_costs): Add argument to compute_logical_op_length.
20164 (output_logical_op): Add new argument. Use it to determine if the
20165 condition codes are used and adjust the output accordingly.
20166 (compute_logical_op_length): Add new argument and update length
20167 computations when condition codes are used.
20168 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
20170 (output_logical_op): Likewise.
20172 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
20175 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
20176 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
20177 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
20178 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
20179 * config/i386/sse.md (unspec): ... here.
20181 2021-06-23 Martin Liska <mliska@suse.cz>
20184 * optc-save-gen.awk: Put back arm_fp16_format to
20187 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
20190 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
20193 (clz<mode>2): Update RTX pattern for additions.
20195 2021-06-23 Jakub Jelinek <jakub@redhat.com>
20197 PR middle-end/101167
20198 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
20199 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
20201 2021-06-22 Sergei Trofimovich <siarheit@google.com>
20203 * doc/rtl.texi: drop unbalanced parenthesis.
20205 2021-06-22 Richard Biener <rguenther@suse.de>
20207 PR middle-end/101156
20208 * gimplify.c (gimplify_expr): Remove premature incorrect
20211 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20213 PR tree-optimization/101159
20214 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
20217 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20219 PR middle-end/101160
20220 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
20221 clear crtl->return_rtx instead of keeping it referencing a pseudo.
20223 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20224 Andrew Pinski <apinski@marvell.com>
20226 PR tree-optimization/101162
20227 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
20230 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20232 * range-op.cc (range_relational_tests): New.
20233 (range_op_tests): Call range_relational_tests.
20235 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20237 * range-op.cc (operator_cast::lhs_op1_relation): New.
20238 (operator_identity::lhs_op1_relation): Mew.
20240 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20242 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
20244 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20246 * range-op.cc (operator_plus::lhs_op1_relation): New.
20247 (operator_plus::lhs_op2_relation): New.
20249 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20251 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
20252 relation_oracle if dominators exist.
20253 (ranger_cache::~ranger_cache): Dispose of oracle.
20254 (ranger_cache::dump_bb): Dump oracle.
20255 * gimple-range.cc (fur_source::fur_source): New.
20256 (fur_source::get_operand): Use mmeber query.
20257 (fur_source::get_phi_operand): Use member_query.
20258 (fur_source::query_relation): New.
20259 (fur_source::register_dependency): Delete.
20260 (fur_source::register_relation): New.
20261 (fur_edge::fur_edge): Adjust.
20262 (fur_edge::get_phi_operand): Fix comment.
20263 (fur_edge::query): Delete.
20264 (fur_stmt::fur_stmt): Adjust.
20265 (fur_stmt::query): Delete.
20266 (fur_depend::fur_depend): Adjust.
20267 (fur_depend::register_relation): New.
20268 (fur_depend::register_relation): New.
20269 (fur_list::fur_list): Adjust.
20270 (fur_list::get_operand): Use member query.
20271 (fold_using_range::range_of_range_op): Process and query relations.
20272 (fold_using_range::range_of_address): Adjust dependency call.
20273 (fold_using_range::range_of_phi): Ditto.
20274 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
20275 (fold_using_range::relation_fold_and_or): New.
20276 (fold_using_range::postfold_gcond_edges): New.
20277 * gimple-range.h (class gimple_ranger): Adjust.
20278 (class fur_source): Adjust members.
20279 (class fur_stmt): Ditto.
20280 (class fold_using_range): Ditto.
20282 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20284 * range-op.cc (range_operator::wi_fold): Apply relation effect.
20285 (range_operator::fold_range): Adjust and apply relation effect.
20286 (*::fold_range): Add relation parameters.
20287 (*::op1_range): Ditto.
20288 (*::op2_range): Ditto.
20289 (range_operator::lhs_op1_relation): New.
20290 (range_operator::lhs_op2_relation): New.
20291 (range_operator::op1_op2_relation): New.
20292 (range_operator::op1_op2_relation_effect): New.
20293 (relop_early_resolve): New.
20294 (operator_equal::op1_op2_relation): New.
20295 (operator_equal::fold_range): Call relop_early_resolve.
20296 (operator_not_equal::op1_op2_relation): New.
20297 (operator_not_equal::fold_range): Call relop_early_resolve.
20298 (operator_lt::op1_op2_relation): New.
20299 (operator_lt::fold_range): Call relop_early_resolve.
20300 (operator_le::op1_op2_relation): New.
20301 (operator_le::fold_range): Call relop_early_resolve.
20302 (operator_gt::op1_op2_relation): New.
20303 (operator_gt::fold_range): Call relop_early_resolve.
20304 (operator_ge::op1_op2_relation): New.
20305 (operator_ge::fold_range): Call relop_early_resolve.
20306 * range-op.h (class range_operator): Adjust parameters and methods.
20308 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
20310 * Makefile.in (OBJS): Add value-relation.o.
20311 * gimple-range.h: Adjust include files.
20312 * tree-data-ref.c: Adjust include file order.
20313 * value-query.cc (range_query::get_value_range): Default to no oracle.
20314 (range_query::query_relation): New.
20315 (range_query::query_relation): New.
20316 * value-query.h (class range_query): Adjust.
20317 * value-relation.cc: New.
20318 * value-relation.h: New.
20320 2021-06-22 Richard Biener <rguenther@suse.de>
20322 PR tree-optimization/101151
20323 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
20326 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
20328 * config/riscv/riscv.c (thead_c906_tune_info): New.
20329 (riscv_tune_info_table): Use new tune.
20331 2021-06-22 Richard Biener <rguenther@suse.de>
20333 PR tree-optimization/101158
20334 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
20335 checking after checking for matching operation.
20337 2021-06-22 Richard Biener <rguenther@suse.de>
20339 PR tree-optimization/101159
20340 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
20341 missing NULL vectype check.
20343 2021-06-22 Richard Biener <rguenther@suse.de>
20345 PR tree-optimization/101154
20346 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
20348 2021-06-22 Jakub Jelinek <jakub@redhat.com>
20351 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
20352 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
20353 * config/i386/i386.c (ix86_expand_prologue): Clear it.
20354 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
20355 Remove "" from match_operand. Emit new insns using emit_move_insn and
20356 set ix86_last_zero_store_uid to INSN_UID of the last store.
20357 Add peephole2s for 1/2/4 stores of const0_rtx following previous
20360 2021-06-22 Martin Liska <mliska@suse.cz>
20362 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
20365 2021-06-22 Martin Liska <mliska@suse.cz>
20367 * gcov-io.h: Remove padding entries.
20369 2021-06-22 liuhongt <hongtao.liu@intel.com>
20371 PR tree-optimization/97770
20372 * tree-vect-patterns.c (vect_recog_popcount_pattern):
20374 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
20376 2021-06-22 liuhongt <hongtao.liu@intel.com>
20379 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
20380 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
20381 (expand<mode>_mask): this ..
20382 (*expand<mode>_mask): New pre_reload splitter to transform
20383 v{,p}expand* to vmov* when mask is zero, all ones, or has all
20384 ones in it's lower part, otherwise still generate
20387 2021-06-22 liuhongt <hongtao.liu@intel.com>
20390 * config/i386/i386-expand.c
20391 (ix86_expand_special_args_builtin): Keep constm1_operand only
20392 if it satisfies insn's operand predicate.
20394 2021-06-21 Jason Merrill <jason@redhat.com>
20397 * df-scan.c (df_ref_record): Check that regno < endregno.
20398 * function.c (assign_parms, expand_function_end): Do nothing with a
20399 TYPE_EMPTY_P result.
20401 2021-06-21 Richard Biener <rguenther@suse.de>
20403 PR tree-optimization/101120
20404 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
20406 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
20407 DR chain DCE capability.
20408 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
20409 * tree-vect-stmts.c (vectorizable_load): Remove unused
20410 loads in the DR chain for SLP.
20412 2021-06-21 Jakub Jelinek <jakub@redhat.com>
20414 PR inline-asm/100785
20415 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
20416 output or input operands were already error_mark_node.
20417 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
20418 remove all inputs, outputs and clobbers from the asm and
20419 set template to "".
20421 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20423 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
20424 (vceq_s16): Likewise.
20425 (vceq_s32): Likewise.
20426 (vceq_u8): Likewise.
20427 (vceq_u16): Likewise.
20428 (vceq_u32): Likewise.
20429 (vceq_p8): Likewise.
20430 (vceqq_s8): Likewise.
20431 (vceqq_s16): Likewise.
20432 (vceqq_s32): Likewise.
20433 (vceqq_u8): Likewise.
20434 (vceqq_u16): Likewise.
20435 (vceqq_u32): Likewise.
20436 (vceqq_p8): Likewise.
20437 (vceq_f32): Gate __a == __b on __FAST_MATH__.
20438 (vceqq_f32): Likewise.
20439 (vceq_f16): Likewise.
20440 (vceqq_f16): Likewise.
20442 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
20445 * config/arm/iterators.md (NEON_VACMP): Remove.
20446 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
20448 (neon_vca<cmp_op><mode>_insn): Likewise.
20449 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
20452 2021-06-21 Richard Biener <rguenther@suse.de>
20454 PR tree-optimization/101121
20455 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
20456 when we just lack a stmt with the desired op when doing permutation.
20457 (vect_build_slp_tree): When caching a failed SLP build attempt
20458 assert that at least one lane is marked as not matching.
20460 2021-06-21 liuhongt <hongtao.liu@intel.com>
20463 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
20464 register alternative.
20465 (*and<mode>_1): Ditto.
20467 (*andn<mode>_1): Ditto.
20468 (*<code><mode>_1): Ditto.
20469 (*<code>qi_1): Ditto.
20470 (*one_cmpl<mode>2_1): Ditto.
20471 (*one_cmplsi2_1_zext): Ditto.
20472 (*one_cmplqi2_1): Ditto.
20473 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
20474 the order of mask registers to be before general registers.
20476 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
20479 * config/i386/i386.md: New define_peephole2s to shrink writing
20480 1, 2 or 4 consecutive zeros to memory when optimizing for size.
20482 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
20484 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
20485 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
20486 more efficient code when the source can be trivially simplified.
20488 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
20490 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
20491 a range if global is not available.
20492 (ranger_cache::entry_range): Fallback to range_of_def.
20493 * gimple-range-cache.h (range_of_def): Adjust prototype.
20495 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
20497 PR tree-optimization/101014
20498 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
20500 (ranger_cache::~ranger_cache): Ditto.
20501 (ranger_cache::enable_new_values): Delete.
20502 (ranger_cache::push_poor_value): Delete.
20503 (ranger_cache::range_of_def): Remove poor value processing.
20504 (ranger_cache::entry_range): Ditto.
20505 (ranger_cache::fill_block_cache): Ditto.
20506 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
20507 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
20508 * gimple-range.h (class gimple_ranger): Adjust.
20510 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
20513 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
20514 derived from arm_canon_arch.
20515 (arm_canon_arch_option): Call it.
20516 (arm_canon_arch_multilib_option): New function.
20517 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
20518 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
20519 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
20520 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
20521 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
20522 * config/arm/arm.opt (mlibarch): New option.
20523 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
20524 of march on RHS with mlibarch.
20526 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
20528 * config.in: Regenerate.
20529 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
20531 * configure: Regenerate.
20532 * configure.ac: Fix for global_load assembler functions.
20534 2021-06-18 Richard Biener <rguenther@suse.de>
20536 PR tree-optimization/101112
20537 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
20538 to lookup a pattern stmt def.
20540 2021-06-18 Jakub Jelinek <jakub@redhat.com>
20542 PR middle-end/101062
20543 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
20544 representatives in QUAL_UNION_TYPE.
20546 2021-06-18 Andrew Pinski <apinski@marvell.com>
20548 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
20549 Add counting of how many times it is done.
20550 (factor_out_conditional_conversion): Likewise.
20551 (match_simplify_replacement): Likewise.
20552 (value_replacement): Likewise.
20553 (spaceship_replacement): Likewise.
20554 (cond_store_replacement): Likewise.
20555 (cond_if_else_store_replacement_1): Likewise.
20556 (hoist_adjacent_loads): Likewise.
20558 2021-06-18 Andrew Pinski <apinski@marvell.com>
20560 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
20561 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
20562 (verify_gimple_assign_binary): Reject point and offset types on
20563 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
20564 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
20565 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
20567 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
20569 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
20570 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
20571 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
20574 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
20576 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
20577 earlyclobber to alts 0/1.
20578 (gen_addadd): Add earlyclobber to alts 0/1.
20579 * config/rs6000/fusion.md: Regenerate file.
20581 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20583 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
20585 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20587 * gimple-range-cache.cc: Comment cleanups.
20588 * gimple-range-gori.cc: Comment cleanups.
20589 * gimple-range.cc: Comment/spacing cleanups
20590 * value-range.h: Comment cleanups.
20592 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
20595 * calls.c (expand_call): Replace PUSH_ARGS with
20596 targetm.calls.push_argument (0).
20597 (emit_library_call_value_1): Likewise.
20598 * defaults.h (PUSH_ARGS): Removed.
20599 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
20600 targetm.calls.push_argument (0).
20601 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
20602 (emit_push_insn): Pass the number bytes to push to
20603 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
20604 * hooks.c (hook_bool_uint_true): New.
20605 * hooks.h (hook_bool_uint_true): Likewise.
20606 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
20607 targetm.calls.push_argument (0).
20608 * target.def (push_argument): Add a targetm.calls hook.
20609 * targhooks.c (default_push_argument): New.
20610 * targhooks.h (default_push_argument): Likewise.
20611 * config/bpf/bpf.h (PUSH_ARGS): Removed.
20612 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
20613 * config/cr16/cr16.h (PUSH_ARGS): Removed.
20614 * config/i386/i386.c (ix86_push_argument): New.
20615 (TARGET_PUSH_ARGUMENT): Likewise.
20616 * config/i386/i386.h (PUSH_ARGS): Removed.
20617 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
20618 * config/m32c/m32c.h (PUSH_ARGS): Removed.
20619 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
20620 * config/pru/pru.h (PUSH_ARGS): Likewise.
20621 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
20622 TARGET_PUSH_ARGUMENT hook.
20623 * doc/tm.texi: Regenerated.
20625 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
20628 * config/i386/i386-expand.c (expand_vector_set_var):
20629 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
20630 ix86_expand_vector_init_duplicate.
20631 (ix86_expand_vector_init_duplicate): Emit insv_1 for
20632 QImode for !TARGET_PARTIAL_REG_STALL.
20633 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
20634 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
20635 as operand 2 predicate. Call ix86_expand_vector_set_var
20636 for non-constant index operand.
20637 (vec_setv2si): Ditto.
20638 (vec_setv4hi): Ditto.
20639 (vec_setv8qi): ditto.
20641 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
20643 PR tree-optimization/100790
20644 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
20647 2021-06-17 Martin Liska <mliska@suse.cz>
20649 * doc/invoke.texi: Use consistently -O1 instead of -O.
20651 2021-06-17 Martin Liska <mliska@suse.cz>
20653 * gcov-io.h: Update documentation entry about string format.
20655 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
20658 * config/s390/vecintrin.h (vec_doublee): Fix to use
20659 __builtin_s390_vflls.
20660 (vec_floate): Fix to use __builtin_s390_vflrd.
20662 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20664 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
20665 * dominance.h (get_dominated_to_depth): Likewise.
20666 (get_all_dominated_blocks): Likewise.
20667 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
20668 * gcse.c (hoist_code): Likewise.
20669 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
20670 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
20671 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
20672 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
20674 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20676 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
20677 * dominance.h (get_dominated_by_region): Likewise.
20678 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
20679 (gimple_duplicate_sese_tail): Likewise.
20680 (move_sese_region_to_fn): Likewise.
20682 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20684 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
20685 * dominance.h (get_dominated_by): Likewise.
20686 * auto-profile.c (afdo_find_equiv_class): Adjust.
20687 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
20688 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
20689 * tree-cfg.c (test_linear_chain): Likewise.
20690 (test_diamond): Likewise.
20692 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20694 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
20695 * cfgloopanal.c (get_loop_hot_path): Likewise.
20696 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
20698 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20700 * cgraph.c (cgraph_node::collect_callers): Return
20701 auto_vec<cgraph_edge *>.
20702 * cgraph.h (cgraph_node::collect_callers): Likewise.
20703 * ipa-cp.c (create_specialized_node): Adjust.
20704 (decide_about_value): Likewise.
20705 (decide_whether_version_node): Likewise.
20706 * ipa-sra.c (process_isra_node_results): Likewise.
20708 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
20710 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
20711 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
20713 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
20716 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
20718 * gimple-range.cc (debug_seed_ranger): New.
20719 (dump_ranger): New.
20720 (debug_ranger): New.
20722 2021-06-17 Richard Biener <rguenther@suse.de>
20724 PR tree-optimization/54400
20725 * tree-vectorizer.h (enum slp_instance_kind): Add
20726 slp_inst_kind_bb_reduc.
20727 (reduction_fn_for_scalar_code): Declare.
20728 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
20729 Check SLP_INSTANCE_KIND instead of looking at the
20731 (vect_slp_analyze_instance_alignment): Likewise.
20732 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
20733 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
20734 chain linearization from vect_build_slp_tree_2 and generalize
20735 for the use of BB reduction vectorization.
20736 (vect_build_slp_tree_2): Adjust accordingly.
20737 (vect_optimize_slp): Elide permutes at the root of BB reduction
20739 (vectorizable_bb_reduc_epilogue): New function.
20740 (vect_slp_prune_covered_roots): Likewise.
20741 (vect_slp_analyze_operations): Use them.
20742 (vect_slp_check_for_constructors): Recognize associatable
20743 chains for BB reduction vectorization.
20744 (vectorize_slp_instance_root_stmt): Generate code for the
20745 BB reduction epilogue.
20747 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20749 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
20751 (gori_compute::may_recompute_p): New.
20752 (gori_compute::outgoing_edge_range_p): Perform recomputations.
20753 * gimple-range-gori.h (class gori_compute): Add prototype.
20755 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
20757 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
20758 true when a range can be calculated.
20759 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
20761 2021-06-16 Martin Sebor <msebor@redhat.com>
20763 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
20764 Correct documented defaults.
20766 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
20768 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
20769 m_new_value_p directly.
20771 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
20774 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
20775 Handle 64bit modes for TARGET_SSE4_1.
20776 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
20777 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
20778 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
20779 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
20780 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
20781 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
20783 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
20785 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
20786 Change to an expander that emits the correct instruction
20787 depending on endianness.
20788 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
20789 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
20791 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
20793 * config/aarch64/aarch64-simd-builtins.def: Split generator
20794 for aarch64_<su>qmovn builtins into scalar and vector
20796 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
20798 (aarch64_<su>qmovn<mode>_insn_be): Define.
20799 (aarch64_<su>qmovn<mode>): Split into scalar and vector
20800 variants. Change vector variant to an expander that emits the
20801 correct instruction depending on endianness.
20803 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
20805 * config/aarch64/aarch64-simd-builtins.def: Split generator
20806 for aarch64_sqmovun builtins into scalar and vector variants.
20807 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
20808 Split into scalar and vector variants. Change vector variant
20809 to an expander that emits the correct instruction depending
20811 (aarch64_sqmovun<mode>_insn_le): Define.
20812 (aarch64_sqmovun<mode>_insn_be): Define.
20814 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
20816 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
20817 Define - modeling zero-high-half semantics.
20818 (aarch64_xtn<mode>): Change to an expander that emits the
20819 appropriate instruction depending on endianness.
20820 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
20822 (aarch64_xtn2<mode>_le): Rename to...
20823 (aarch64_xtn2<mode>_insn_le): This.
20824 (aarch64_xtn2<mode>_be): Rename to...
20825 (aarch64_xtn2<mode>_insn_be): This.
20826 (vec_pack_trunc_<mode>): Emit truncation instruction instead
20828 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
20829 attribute iterator.
20831 2021-06-16 Martin Jambor <mjambor@suse.cz>
20833 PR tree-optimization/100453
20834 * tree-sra.c (create_access): Disqualify any const candidates
20835 which are written to.
20836 (sra_modify_expr): Do not store sub-replacements back to a const base.
20837 (handle_unscalarized_data_in_subtree): Likewise.
20838 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
20839 instead of constant_decl_p.
20841 2021-06-16 Jakub Jelinek <jakub@redhat.com>
20843 PR middle-end/101062
20844 * stor-layout.c (finish_bitfield_representative): For fields in unions
20845 assume nextf is always NULL.
20846 (finish_bitfield_layout): Compute bit field representatives also in
20847 unions, but handle it as if each bitfield was the only field in the
20850 2021-06-16 Richard Biener <rguenther@suse.de>
20852 PR tree-optimization/101088
20853 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
20854 supported refs on edges. Do not assert same ref but
20855 different kind stores are unsuported but mark them so.
20856 (hoist_memory_references): Only look for supported refs
20859 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
20861 PR rtl-optimization/46235
20862 * config/i386/i386.md: New define_split for bt followed by cmov.
20863 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
20864 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
20865 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
20866 by setnc with zero extension.
20868 2021-06-16 Richard Biener <rguenther@suse.de>
20870 PR tree-optimization/101083
20871 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
20872 vectype as argument.
20873 (vect_build_slp_tree_2): Adjust.
20875 2021-06-15 Martin Sebor <msebor@redhat.com>
20877 PR middle-end/100876
20878 * builtins.c: (gimple_call_return_array): Account for size_t
20879 mangling as either unsigned int or unsigned long
20881 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
20883 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
20884 up before eliminating comparisons.
20886 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
20888 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
20889 nonzero|X is nonzero.
20890 (range_op_bitwise_and_tests): Add tests for above.
20892 2021-06-15 Carl Love <cel@us.ibm.com>
20895 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
20897 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
20900 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
20903 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
20904 (struct gimplify_omp_ctx): Extend defaultmap array by one.
20905 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
20906 (omp_notice_variable): Update type classification for Fortran.
20907 (gimplify_scan_omp_clauses): Update calls for new argument; handle
20908 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
20909 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
20910 * langhooks.c (lhd_omp_scalar_p): Likewise.
20911 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
20912 (LANG_HOOKS_DECLS): Add them.
20913 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
20914 omp_scalar_p pointer type to include the new bool argument.
20916 2021-06-15 David Malcolm <dmalcolm@redhat.com>
20918 * doc/analyzer.texi
20919 (Special Functions for Debugging the Analyzer): Add
20920 __analyzer_dump_capacity.
20922 2021-06-15 Jakub Jelinek <jakub@redhat.com>
20925 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
20926 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
20928 2021-06-15 Richard Biener <rguenther@suse.de>
20930 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
20931 check to identify loop latches.
20932 * cfgloop.c (verify_loop_structure): Likewise.
20933 * loop-init.c (apply_loop_flags): Allow marked irreducible
20934 regions even with multiple latches.
20935 * predict.c (rebuild_frequencies): Simplify.
20937 2021-06-15 Richard Biener <rguenther@suse.de>
20939 * tree-ssa-threadupdate.c
20940 (jump_thread_path_registry::mark_threaded_blocks): Assert we
20941 have marked irreducible regions.
20943 2021-06-14 Martin Sebor <msebor@redhat.com>
20946 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
20947 Handle calls to placement new.
20948 (ndecl_dealloc_argno): Avoid placement delete.
20950 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
20953 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
20954 create_tmp_reg_or_ssa_name().
20956 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
20958 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
20959 (ranger_cache::enable_new_values): Set to specified value and
20960 return the old value.
20961 (ranger_cache::disable_new_values): Delete.
20962 (ranger_cache::fill_block_cache): Disable non 1st order derived
20964 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
20965 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
20967 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
20970 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
20971 Return true early when testing with V2HImode.
20972 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
20974 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
20976 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
20977 (mve_vec_unpack<US>_hi_<mode>): New pattern.
20978 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
20979 (mve_vmovntq_<supf><mode>): Prefix with '@'.
20980 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
20982 (vec_unpack<US>_lo_<mode>): Likewise.
20983 (vec_pack_trunc_<mode>): Rename to
20984 neon_quad_vec_pack_trunc_<mode>.
20985 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
20987 (vec_unpack<US>_lo_<mode>): New.
20988 (vec_pack_trunc_<mode>): New.
20990 2021-06-14 Richard Biener <rguenther@suse.de>
20992 PR tree-optimization/100934
20993 * tree-ssa-dom.c (pass_dominator::execute): Properly
20994 mark irreducible regions.
20996 2021-06-14 Martin Liska <mliska@suse.cz>
20998 * doc/invoke.texi: Put r{...} on the same line as @item.
21000 2021-06-14 Martin Liska <mliska@suse.cz>
21002 * doc/invoke.texi: Add missing newline.
21004 2021-06-14 Martin Liska <mliska@suse.cz>
21006 * doc/invoke.texi: Remove '+' charasters.
21008 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
21010 * config.gcc (arc): Add support for with_cpu option.
21011 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
21013 2021-06-14 Richard Biener <rguenther@suse.de>
21015 PR tree-optimization/101031
21016 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
21017 instead of size when accounting for a possibly string
21020 2021-06-14 Martin Liska <mliska@suse.cz>
21022 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
21024 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
21026 * value-query.cc (gimple_range_global): Call get_range_global
21027 if called after inlining.
21029 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
21032 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
21033 Emit constant permutation insn directly from here.
21035 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
21037 * attribs.c (find_attribute_namespace): Iterate over vec<> with
21039 * auto-profile.c (afdo_find_equiv_class): Likewise.
21040 * gcc.c (do_specs_vec): Likewise.
21041 (do_spec_1): Likewise.
21042 (driver::set_up_specs): Likewise.
21043 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
21044 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
21045 (imm_store_chain_info::try_coalesce_bswap): Likewise.
21046 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
21047 (get_location_for_stmts): Likewise.
21048 * graphite-poly.c (print_iteration_domains): Likewise.
21049 (free_poly_bb): Likewise.
21050 (remove_gbbs_in_scop): Likewise.
21051 (free_scop): Likewise.
21052 (dump_gbb_cases): Likewise.
21053 (dump_gbb_conditions): Likewise.
21054 (print_pdrs): Likewise.
21055 (print_scop): Likewise.
21056 * ifcvt.c (cond_move_process_if_block): Likewise.
21057 * lower-subreg.c (decompose_multiword_subregs): Likewise.
21058 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
21059 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
21060 * sel-sched-dump.c (dump_insn_vector): Likewise.
21061 * store-motion.c (store_ops_ok): Likewise.
21062 (store_killed_in_insn): Likewise.
21063 * timevar.c (timer::named_items::print): Likewise.
21064 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
21065 (cleanup_tree_cfg_noloop): Likewise.
21066 * tree-data-ref.c (dump_data_references): Likewise.
21067 (print_dir_vectors): Likewise.
21068 (print_dist_vectors): Likewise.
21069 (dump_data_dependence_relations): Likewise.
21070 (dump_dist_dir_vectors): Likewise.
21071 (dump_ddrs): Likewise.
21072 (create_runtime_alias_checks): Likewise.
21073 (free_subscripts): Likewise.
21074 (save_dist_v): Likewise.
21075 (save_dir_v): Likewise.
21076 (invariant_access_functions): Likewise.
21077 (same_access_functions): Likewise.
21078 (access_functions_are_affine_or_constant_p): Likewise.
21079 (find_data_references_in_stmt): Likewise.
21080 (graphite_find_data_references_in_stmt): Likewise.
21081 (free_dependence_relations): Likewise.
21082 (free_data_refs): Likewise.
21083 * tree-inline.c (copy_debug_stmts): Likewise.
21084 * tree-into-ssa.c (dump_currdefs): Likewise.
21085 (rewrite_update_phi_arguments): Likewise.
21086 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
21087 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
21089 (vect_slp_analyze_node_dependences): Likewise.
21090 (vect_slp_analyze_instance_dependence): Likewise.
21091 (vect_record_base_alignments): Likewise.
21092 (vect_get_peeling_costs_all_drs): Likewise.
21093 (vect_peeling_supportable): Likewise.
21094 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
21095 (vec_info::free_stmt_vec_infos): Likewise.
21097 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
21099 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
21100 (andqi3_1<cczn>): Removed.
21101 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
21102 (H8/SX bit logicals): Split out from other patterns.
21103 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
21104 mulqihi3_const_clobber_flags.
21105 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
21107 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
21110 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
21111 to true if red zone is used.
21112 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
21113 ix86_red_zone_used.
21114 * config/i386/i386.h (machine_function): Add red_zone_used.
21115 (ix86_red_zone_size): Removed.
21116 (ix86_red_zone_used): New.
21117 * config/i386/i386.md (peephole2 patterns): Replace
21118 ix86_red_zone_size with ix86_red_zone_used.
21120 2021-06-12 Jason Merrill <jason@redhat.com>
21122 * doc/extend.texi (unused variable attribute): Applies to
21123 structure fields as well.
21125 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
21127 * auto-profile.c (read_profile): fix a typo in an error string
21129 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
21131 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
21133 * tree-pretty-print.c (dump_omp_clauses): Update.
21134 (dump_generic_node) <OMP_CLAUSE>: Use it.
21136 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
21139 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
21140 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
21141 the polymorphic variants matching code.
21142 (__arm_vld1q_z): Likewise.
21143 (__arm_vld2q): Likewise.
21144 (__arm_vld4q): Likewise.
21145 (__arm_vldrbq_gather_offset): Likewise.
21146 (__arm_vldrbq_gather_offset_z): Likewise.
21148 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
21150 PR tree-optimization/96392
21151 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
21153 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
21155 PR tree-optimization/96392
21156 * fold-const.c (fold_real_zero_addition_p): Take both arguments
21157 of the addition or subtraction, not just the zero. Use this
21158 other argument in tests for signaling NaNs and signed zeros.
21159 (tree_expr_maybe_real_minus_zero_p): New predicate.
21160 * fold-const.h (fold_real_zero_addition_p): Update prototype.
21161 (tree_expr_maybe_real_minus_zero_p): New function prototype.
21162 * match.pd: Update calls to fold_real_zero_addition_p.
21163 Replace HONOR_NANS with tree_expr_maybe_nan_p.
21164 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
21165 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
21166 * tree-ssa-reassoc.c (eliminate_using_constants): Update
21167 call to fold_real_zero_addition_p.
21169 2021-06-11 Richard Biener <rguenther@suse.de>
21171 PR tree-optimization/101025
21172 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
21173 all refs that require dependence checking.
21175 2021-06-11 Richard Biener <rguenther@suse.de>
21177 PR tree-optimization/101028
21178 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
21179 reassoc discovery fails fatally, mark appropriate lanes
21182 2021-06-11 Richard Biener <rguenther@suse.de>
21184 PR tree-optimization/101026
21185 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
21186 have a representative for the associated chain nodes.
21188 2021-06-11 Jakub Jelinek <jakub@redhat.com>
21190 PR rtl-optimization/101008
21191 * simplify-rtx.c (relational_result): New function.
21192 (simplify_logical_relational_operation,
21193 simplify_relational_operation): Use it.
21195 2021-06-11 Jakub Jelinek <jakub@redhat.com>
21198 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
21200 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
21203 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
21204 false if the permutation can be implemented with constant
21205 permutation instruction in wider mode.
21206 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
21207 Handle V8QImode and V4HImode.
21209 2021-06-11 Martin Liska <mliska@suse.cz>
21211 PR gcov-profile/100788
21212 * common.opt: Add new option.
21213 * coverage.c (coverage_begin_function): Emit warning instead on
21214 the internal compiler error.
21215 * doc/invoke.texi: Document the option.
21216 * toplev.c (process_options): Enable it by default.
21218 2021-06-11 Richard Biener <rguenther@suse.de>
21220 PR middle-end/101009
21221 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
21222 to set *init_b to true when we encounter a constant equal
21224 (compute_affine_dependence): Also dump the actual DR_REF.
21226 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
21228 PR tree-optimization/100984
21229 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
21230 replacements table.
21231 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
21233 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
21235 * config/rs6000/rs6000.md
21236 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
21237 define_insn_and_split.
21239 2021-06-11 Richard Biener <rguenther@suse.de>
21241 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
21242 to sort operands of the associative chain.
21244 2021-06-11 Richard Biener <rguenther@suse.de>
21246 * system.h (gcc_stablesort_r): Declare.
21247 * sort.cc (gcc_sort_r): Support stable sort.
21248 (gcc_stablesort_r): Define.
21249 * vec.h (vec<>::stablesort): Add.
21251 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
21254 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
21255 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
21256 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
21257 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
21258 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
21259 (expand_vec_perm_interleave2): Handle 64bit modes.
21260 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
21261 (expand_vec_perm_even_odd_1): Ditto.
21262 (ix86_vectorize_vec_perm_const): Ditto.
21263 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
21264 * config/i386/sse.md: ... here.
21265 * config/i386/mmx.md (*vec_interleave_lowv2sf):
21266 New insn_and_split pattern.
21267 (*vec_interleave_highv2sf): Ditto.
21268 (mmx_pshufbv8qi3): New insn pattern.
21269 (*mmx_pblendw): Ditto.
21271 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
21273 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
21274 (build_acc): Likewise.
21275 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
21276 source operands in little-endian mode.
21277 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
21278 (mma_init_builtins): Likewise.
21279 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
21280 ordering for the MMA assemble and build source operands.
21281 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
21283 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
21286 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
21288 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
21290 * config/h8300/extensions.md: Replace _clobber_flags patterns
21293 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
21295 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
21296 (vcond_mask_<mode><tointvec>): this.
21298 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
21299 Thomas Schwinge <thomas@codesourcery.com>
21301 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
21302 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
21303 * gimple.h (enum gf_mask): Split
21304 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
21305 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
21306 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
21307 (is_gimple_omp_oacc): Update.
21308 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
21309 * gimplify.c (gimplify_omp_target_update): Likewise.
21310 * omp-expand.c (expand_omp_target, build_omp_regions_1)
21311 (omp_make_gimple_edges): Likewise.
21312 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
21315 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
21317 * value-query.cc (value_query::value_on_edge): Rename name to
21319 (range_query::range_on_edge): Same.
21320 (range_query::value_of_expr): Same.
21321 (range_query::value_on_edge): Same.
21322 * value-query.h (class value_query): Same.
21323 (class range_query): Same.
21325 2021-06-10 Richard Biener <rguenther@suse.de>
21327 PR tree-optimization/101003
21328 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
21329 use the pattern stmt defs when linearizing a chain.
21331 2021-06-10 Jakub Jelinek <jakub@redhat.com>
21334 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
21335 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
21337 2021-06-10 Clement Chigot <clement.chigot@atos.net>
21339 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
21340 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
21342 2021-06-09 Andrew Pinski <apinski@marvell.com>
21344 PR tree-optimization/100925
21345 * match.pd (a ? CST1 : CST2): Limit transformations
21346 that would produce a negative to integeral types only.
21347 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
21349 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21352 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21354 * doc/tm.texi: Correctly update.
21356 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
21358 * doc/tm.texi: Correctly update.
21360 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
21363 * doc/tm.texi.in (Trampolines): Add a missing blank line.
21365 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
21368 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
21369 and -ftrampolines work only with Ada.
21370 * doc/tm.texi.in (Trampolines): Likewise.
21371 * doc/tm.texi: Regenerated.
21373 2021-06-09 Carl Love <cel@us.ibm.com>
21375 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
21376 Add define for new builtins.
21377 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
21378 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
21379 overloaded builtin definitions.
21380 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
21381 VSIGNEXTSD2Q): Add builtin expansions.
21382 (SIGNEXT): Add P10 overload definition.
21383 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
21384 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
21385 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
21386 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
21387 vsignextend_si_v2di)[VIlong]: Add define_expand.
21388 Make define_insn vsx_sign_extend_si_v2di visible.
21389 * doc/extend.texi: Add documentation for the vec_signexti,
21390 vec_signextll builtins and vec_signextq.
21392 2021-06-09 Carl Love <cel@us.ibm.com>
21394 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
21395 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
21396 __floattikf_sw, __floatuntikf_sw respectively.
21397 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
21398 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
21399 define_insn for mode IEEE 128.
21401 2021-06-09 Carl Love <cel@us.ibm.com>
21403 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
21404 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
21405 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
21406 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
21407 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
21408 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
21409 uses of VSX_TI to VEC_TI.
21411 2021-06-09 Carl Love <cel@us.ibm.com>
21413 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
21415 2021-06-09 Carl Love <cel@us.ibm.com>
21417 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
21419 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
21420 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
21421 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
21422 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
21423 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
21424 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
21425 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
21427 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
21428 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
21429 altivec_vrlqnm): New define_expands.
21430 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
21431 VCMPGTUT_P): Add macro expansions.
21432 (BU_P10V_AV_P): Add builtin predicate definition.
21433 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
21434 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
21435 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
21436 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
21437 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
21438 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
21439 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
21440 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
21441 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
21442 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
21443 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
21444 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
21445 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
21446 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
21447 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
21448 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
21449 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
21450 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
21451 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
21452 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
21453 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
21454 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
21455 P10V_BUILTIN_MODU_V1TI):
21456 New overloaded definitions.
21457 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
21458 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
21459 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
21460 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
21461 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
21462 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
21464 (altivec_init_builtins): New E_V1TImode case statement.
21465 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
21466 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
21467 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
21468 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
21469 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
21470 E_V1TImode]: New case statements.
21471 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
21472 value RS6000_BTI_bool_V1TI.
21473 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
21474 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
21475 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
21476 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
21477 vlshrv1ti3, vashrv1ti3): New define_expands.
21478 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
21479 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
21480 UNSPEC_VSX_MODUQ): New unspecs.
21481 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
21482 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
21484 (vcmpnet): New define_expand.
21485 * doc/extend.texi: Add documentation for the new builtins vec_rl,
21486 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
21487 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
21488 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
21489 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
21490 vec_any_ge, vec_any_le.
21492 2021-06-09 Carl Love <cel@us.ibm.com>
21494 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
21495 bug in argument generation.
21497 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
21499 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
21501 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
21502 remove <supf> iterator.
21503 (mve_vclzq_u<mode>): New.
21504 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
21505 (neon_vclz<mode): Move to ...
21506 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
21507 * config/arm/vec-common.md: ... here. Add support for MVE.
21509 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
21511 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
21512 (@mve_vrhaddq_<supf><mode): Likewise.
21513 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
21514 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
21515 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
21517 2021-06-09 imba-tjd <109224573@qq.com>
21519 * doc/invoke.texi: Fix typo.
21521 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
21523 PR middle-end/53267
21524 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
21525 Support evaluation of fmod/fmodf/fmodl at compile-time.
21527 2021-06-09 Richard Biener <rguenther@suse.de>
21529 PR tree-optimization/100981
21530 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
21531 gimple_get_lhs to also handle calls.
21532 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
21535 2021-06-09 Richard Biener <rguenther@suse.de>
21537 PR tree-optimization/97832
21538 * tree-vectorizer.h (_slp_tree::failed): New.
21539 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
21541 (_slp_tree::~_slp_tree): Free failed.
21542 (vect_build_slp_tree): Retain failed nodes and record
21543 matches in them, copying that back out when running
21544 into a cached fail. Dump start and end of discovery.
21545 (dt_sort_cmp): New.
21546 (vect_build_slp_tree_2): Handle associatable chains
21547 together doing more aggressive operand swapping.
21549 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
21552 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
21554 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
21555 Linux and GNU targets.
21557 2021-06-09 Richard Biener <rguenther@suse.de>
21559 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
21562 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21564 * config/arc/arc.md (loop_end): Change it to
21565 define_insn_and_split.
21567 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21569 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
21570 (machi): New pattern.
21571 (umaddhisi4): Use VMAC2HU instruction.
21572 (umachi): New pattern.
21574 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
21576 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
21577 * config/arc/arc.c (arc_split_move_p): New function.
21578 (arc_split_move): Clean up.
21579 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
21580 (movdf_insn): Likewise.
21581 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
21583 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
21586 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
21587 argument to "raw". Do not emit segment overrides when "raw" is true.
21589 2021-06-09 Martin Liska <mliska@suse.cz>
21591 * doc/gcov.texi: Create a proper JSON files.
21592 * doc/invoke.texi: Remove dots in order to make it a valid
21595 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
21597 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
21598 (insn_is_load_p): Use pattern_is_rotate64.
21599 (insn_is_swap_p): Likewise.
21600 (quad_aligned_load_p): Likewise.
21601 (const_load_sequence_p): Likewise.
21602 (replace_swapped_aligned_load): Likewise.
21603 (recombine_lvx_pattern): Likewise.
21604 (recombine_stvx_pattern): Likewise.
21606 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
21608 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
21609 fur_stmt source record.
21610 * gimple-range.cc (fur_source::get_operand): Generic range query.
21611 (fur_source::get_phi_operand): New.
21612 (fur_source::register_dependency): New.
21613 (fur_source::query): New.
21614 (class fur_edge): New. Edge source for operands.
21615 (fur_edge::fur_edge): New.
21616 (fur_edge::get_operand): New.
21617 (fur_edge::get_phi_operand): New.
21618 (fur_edge::query): New.
21619 (fur_stmt::fur_stmt): New.
21620 (fur_stmt::get_operand): New.
21621 (fur_stmt::get_phi_operand): New.
21622 (fur_stmt::query): New.
21623 (class fur_depend): New. Statement source and process dependencies.
21624 (fur_depend::fur_depend): New.
21625 (fur_depend::register_dependency): New.
21626 (class fur_list): New. List source for operands.
21627 (fur_list::fur_list): New.
21628 (fur_list::get_operand): New.
21629 (fur_list::get_phi_operand): New.
21630 (fold_range): New. Instantiate appropriate fur_source class and fold.
21631 (fold_using_range::range_of_range_op): Use new API.
21632 (fold_using_range::range_of_address): Ditto.
21633 (fold_using_range::range_of_phi): Ditto.
21634 (imple_ranger::fold_range_internal): Use fur_depend class.
21635 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
21636 * gimple-range.h (class fur_source): Now a base class.
21637 (class fur_stmt): New.
21638 (fold_range): New prototypes.
21639 (fur_source::fur_source): Delete.
21641 2021-06-08 Andrew Pinski <apinski@marvell.com>
21643 PR tree-optimization/25290
21644 * tree-ssa-phiopt.c (xor_replacement): Delete.
21645 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
21646 (match_simplify_replacement): Allow one cheap preparation
21647 statement that can be moved to before the if.
21649 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
21651 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
21652 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
21654 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
21656 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
21657 Create length attribute on define_insn_and_split. Only split for cases which we
21659 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
21660 fix length computation.
21661 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
21663 2021-06-08 Richard Biener <rguenther@suse.de>
21665 PR tree-optimization/100923
21666 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
21667 the operand vector to be valueized.
21668 (valueize_refs): Likewise.
21669 (valueize_shared_reference_ops_from_ref): Adjust.
21670 (valueize_shared_reference_ops_from_call): Likewise.
21671 (vn_reference_lookup_3): Likewise.
21672 (vn_reference_lookup_pieces): Likewise. Re-valueize
21673 with honoring availability when we are about to create
21674 the ao_ref and valueized before.
21675 (vn_reference_lookup): Likewise.
21676 (vn_reference_insert_pieces): Adjust.
21678 2021-06-08 Richard Biener <rguenther@suse.de>
21680 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
21681 (_slp_instance::root_stmts): ... a vector.
21682 (SLP_INSTANCE_ROOT_STMT): Rename to ...
21683 (SLP_INSTANCE_ROOT_STMTS): ... this.
21684 (slp_root::root): Change to...
21685 (slp_root::roots): ... a vector.
21686 (slp_root::slp_root): Adjust.
21687 * tree-vect-slp.c (_slp_instance::location): Adjust.
21688 (vect_free_slp_instance): Release the root stmt vector.
21689 (vect_build_slp_instance): Adjust.
21690 (vect_analyze_slp): Likewise.
21691 (_bb_vec_info::~_bb_vec_info): Likewise.
21692 (vect_slp_analyze_operations): Likewise.
21693 (vect_bb_vectorization_profitable_p): Likewise. Adjust
21694 costs for the root stmt.
21695 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
21697 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
21699 (vectorize_slp_instance_root_stmt): Adjust.
21700 (vect_schedule_slp): Likewise.
21702 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
21704 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
21705 (ssa_equiv_stack::ssa_equiv_stack): New.
21706 (ssa_equiv_stack::~ssa_equiv_stack): New.
21707 (ssa_equiv_stack::enter): New.
21708 (ssa_equiv_stack::leave): New.
21709 (ssa_equiv_stack::push_replacement): New.
21710 (ssa_equiv_stack::get_replacement): New.
21711 (is_pointer_ssa): New.
21712 (class pointer_equiv_analyzer): New.
21713 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
21714 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
21715 (pointer_equiv_analyzer::set_global_equiv): New.
21716 (pointer_equiv_analyzer::set_cond_equiv): New.
21717 (pointer_equiv_analyzer::get_equiv): New.
21718 (pointer_equiv_analyzer::enter): New.
21719 (pointer_equiv_analyzer::leave): New.
21720 (pointer_equiv_analyzer::get_equiv_expr): New.
21721 (pta_valueize): New.
21722 (pointer_equiv_analyzer::visit_stmt): New.
21723 (pointer_equiv_analyzer::visit_edge): New.
21724 (hybrid_folder::value_of_expr): Call PTA.
21725 (hybrid_folder::value_on_edge): Same.
21726 (hybrid_folder::pre_fold_bb): New.
21727 (hybrid_folder::post_fold_bb): New.
21728 (hybrid_folder::pre_fold_stmt): New.
21729 (rvrp_folder::pre_fold_bb): New.
21730 (rvrp_folder::post_fold_bb): New.
21731 (rvrp_folder::pre_fold_stmt): New.
21732 (rvrp_folder::value_of_expr): Call PTA.
21733 (rvrp_folder::value_on_edge): Same.
21735 2021-06-08 Jakub Jelinek <jakub@redhat.com>
21738 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
21739 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
21741 2021-06-08 Richard Biener <rguenther@suse.de>
21743 PR middle-end/100951
21744 * tree-vect-generic.c (expand_vector_piecewise): Build a
21745 VECTOR_CST if all elements are constant.
21746 (expand_vector_condition): Likewise.
21747 (lower_vec_perm): Likewise.
21748 (expand_vector_conversion): Likewise.
21750 2021-06-08 Martin Liska <mliska@suse.cz>
21752 * doc/invoke.texi: Document new param evrp-sparse-threshold.
21754 2021-06-08 Martin Liska <mliska@suse.cz>
21756 * genautomata.c (create_automata): Fix typo.
21758 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
21760 PR tree-optimization/100794
21761 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
21762 allow_unroll_p and only allow unrolling when it's true.
21763 (tree_predictive_commoning): Add parameter allow_unroll_p and
21765 (run_tree_predictive_commoning): Likewise.
21766 (pass_predcom::gate): Check flag_tree_loop_vectorize and
21767 global_options_set.x_flag_predictive_commoning.
21768 (pass_predcom::execute): Adjust for allow_unroll_p.
21770 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
21772 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
21773 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
21774 lambda function cleanup, remove scev_reset call, and adjust return
21776 (tree_predictive_commoning): Adjust for different changed values,
21777 only set flag TODO_update_ssa_only_virtuals if changed.
21778 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
21779 from todo_flags_finish.
21781 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
21783 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
21784 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
21785 (sbr_sparse_bitmap::bitmap_set_quad): New.
21786 (sbr_sparse_bitmap::bitmap_get_quad): New.
21787 (sbr_sparse_bitmap::set_bb_range): New.
21788 (sbr_sparse_bitmap::get_bb_range): New.
21789 (sbr_sparse_bitmap::bb_range_p): New.
21790 (block_range_cache::block_range_cache): initialize bitmap obstack.
21791 (block_range_cache::~block_range_cache): Destruct obstack.
21792 (block_range_cache::set_bb_range): Decide when to utilze the
21793 sparse on entry cache.
21794 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
21795 * params.opt (-param=evrp-sparse-threshold): New.
21797 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
21799 * bitmap.c (bitmap_set_aligned_chunk): New.
21800 (bitmap_get_aligned_chunk): New.
21801 (test_aligned_chunk): New.
21802 (bitmap_c_tests): Call test_aligned_chunk.
21803 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
21805 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
21808 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
21810 (ix86_expand_vector_init_one_nonzero): Ditto.
21811 (ix86_expand_vector_init_one_var): Ditto.
21812 (ix86_expand_vector_init_general): Ditto.
21813 * config/i386/mmx.md (vec_initv4qiqi): New expander.
21815 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
21817 * config/h8300/movepush.md: Change most _clobber_flags
21818 patterns to instead use <cczn> subst.
21819 (movsi_cczn): New pattern with usable CC cases split out.
21820 (movsi_h8sx_cczn): Likewise.
21822 2021-06-07 Martin Liska <mliska@suse.cz>
21824 * common/common-target.def: Split long lines and replace them
21826 * target.def: Likewise.
21827 * doc/tm.texi: Re-generated.
21829 2021-06-07 Jakub Jelinek <jakub@redhat.com>
21832 * fold-const.c (fold_read_from_vector): Return NULL if trying to
21833 read from a CONSTRUCTOR with vector type elements.
21835 2021-06-07 Jakub Jelinek <jakub@redhat.com>
21837 PR middle-end/100898
21838 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
21839 should copy any arguments. Don't call gimple_call_num_args
21840 on id->call_stmt or call_stmt more than once.
21842 2021-06-07 liuhongt <hongtao.liu@intel.com>
21845 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
21847 (<insn>v4siv4di2): Delete constraints for define_expand.
21849 2021-06-07 liuhongt <hongtao.liu@intel.com>
21852 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
21853 assignment of cfun->machine->has_explicit_vzeroupper.
21854 * config/i386/i386-features.c
21855 (ix86_add_reg_usage_to_vzerouppers): Delete.
21856 (ix86_add_reg_usage_to_vzeroupper): Ditto.
21857 (rest_of_handle_insert_vzeroupper): Remove
21858 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
21860 (gate): Remove cfun->machine->has_explicit_vzeroupper.
21861 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
21863 * config/i386/i386.c (ix86_insn_callee_abi): New function.
21864 (ix86_initialize_callee_abi): Ditto.
21865 (ix86_expand_avx_vzeroupper): Ditto.
21866 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
21868 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
21869 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
21871 * config/i386/i386.h (struct GTY(()) machine_function): Delete
21872 has_explicit_vzeroupper.
21873 * config/i386/i386.md (enum unspec): New member
21875 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
21876 define_constants for insn callee abi index.
21877 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
21878 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
21879 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
21880 (*avx_vzeroupper): Rename to ..
21881 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
21882 call_insn which has a special vzeroupper ABI.
21883 (*avx_vzeroupper_1): Deleted.
21885 2021-06-07 liuhongt <hongtao.liu@intel.com>
21888 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
21889 it won't use stack pointer reg.
21890 * final.c (leaf_function_p): When call_insn is a fake call, it
21891 won't affect caller as a leaf function.
21892 * reg-stack.c (callee_clobbers_any_stack_reg): New.
21893 (subst_stack_regs): When call_insn doesn't clobber any stack
21894 reg, don't clear the arguments.
21895 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
21897 * shrink-wrap.c (requires_stack_frame_p): No need for stack
21898 frame for a fake call.
21899 * rtl.h (FAKE_CALL_P): New macro.
21901 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
21903 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
21905 (sparc_order_regs_for_local_alloc): ...this.
21906 (sparc_leaf_reg_remap): Declare.
21907 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
21908 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
21909 * config/sparc/sparc.c (leaf_reg_remap): Delete.
21910 (order_regs_for_local_alloc): Rename to...
21911 (sparc_order_regs_for_local_alloc): ...this.
21912 (sparc_leaf_reg_remap): New function.
21913 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
21915 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
21917 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
21918 Use assemble_name to output BSS section name.
21920 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
21922 * config/i386/constraints.md (Bs):
21923 Remove boolean operators from match_test RTX.
21926 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
21929 2021-06-06 Martin Liska <mliska@suse.cz>
21931 * doc/extend.texi: Add missing @headitem.
21932 * doc/invoke.texi: Likewise.
21933 * doc/objc.texi: Likewise.
21935 2021-06-06 Martin Liska <mliska@suse.cz>
21937 * genhooks.c (emit_findices): Remove unused function.
21938 (emit_documentation): Do not call emit_findices
21939 and do not search for @Fcode directives.
21941 2021-06-06 Martin Liska <mliska@suse.cz>
21943 * doc/invoke.texi: Remove extra character.
21945 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21947 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
21949 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21951 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
21952 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
21953 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
21954 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
21956 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21958 * config/or1k/or1k.md (*movdi): Fix empty split condition.
21960 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21962 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
21965 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
21967 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
21968 *zero_extendsidi2): Fix empty split condition.
21970 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
21972 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
21974 * config/h8300/bitfield.md: Likewise.
21975 * config/h8300/combiner.md: Likewise.
21976 * config/h8300/divmod.md: Likewise.
21977 * config/h8300/extensions.md: Likewise.
21978 * config/h8300/jumpcall.md: Likewise.
21979 * config/h8300/movepush.md: Likewise.
21980 * config/h8300/multiply.md: Likewise.
21981 * config/h8300/other.md: Likewise.
21982 * config/h8300/shiftrotate.md: Likewise.
21983 * config/h8300/logical.md: Likewise. Fix split pattern to use
21984 code iterator that somehow slipped through.
21986 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
21988 PR middle-end/100905
21989 * tree-nested.c (convert_nonlocal_omp_clauses,
21990 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
21992 2021-06-04 Martin Sebor <msebor@redhat.com>
21994 PR middle-end/100732
21995 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
21996 with either source or destination argument of invalid type.
21997 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
21998 calls with arguments of invalid type.
22000 2021-06-04 Martin Sebor <msebor@redhat.com>
22002 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
22004 (attr_access::vla_bounds): Also handle VLA bounds.
22006 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22008 * config/i386/predicates.md (GOT_memory_operand):
22009 Implement using match_code RTXes.
22010 (GOT32_symbol_operand): Ditto.
22012 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
22015 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
22017 (ix86_expand_vector_init_general): Ditto.
22018 Use SImode instead of word_mode for logic operations
22019 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
22020 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
22021 implemented by expand_vec_perm_1.
22022 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
22023 should be implemented using standard shuffle patterns.
22024 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
22025 V2HI modes to modes, implementable with shuffle for one operand.
22026 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
22027 (*pshufw_1): New insn pattern.
22028 (*vec_dupv2hi): Ditto.
22029 (vec_initv2hihi): New expander.
22031 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22033 * config/arm/vfp.md (no_literal_pool_df_immediate,
22034 no_literal_pool_sf_immediate): Fix empty split condition.
22036 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
22038 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
22039 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
22040 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
22041 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
22042 *<sse4_1_avx2>_pblendvb_lt): Likewise.
22044 2021-06-04 Jakub Jelinek <jakub@redhat.com>
22047 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
22048 concatenation from half-sized modes with TImode elements.
22050 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
22052 * config/arc/arc.c (arc_override_options): Disable millicode
22053 thunks when RF16 is on.
22055 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
22057 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
22059 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
22061 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
22062 Replace PROMOTE_MODE marco with its content.
22064 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
22066 * config/cris/cris.md (*addi_reload): Fix empty split condition.
22068 2021-06-03 Jim Wilson <jimw@sifive.com>
22070 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
22071 turn it on for all riscv targets.
22073 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
22076 * config/i386/i386-expand.c (ix86_expand_vector_set):
22077 Handle V2HI and V4QI modes.
22078 (ix86_expand_vector_extract): Ditto.
22079 * config/i386/mmx.md (*pinsrw): New insn pattern.
22082 (*pextrw_zext): Ditto.
22084 (*pextrb_zext): Ditto.
22085 (vec_setv2hi): New expander.
22086 (vec_extractv2hihi): Ditto.
22087 (vec_setv4qi): Ditto.
22088 (vec_extractv4qiqi): Ditto.
22089 (vec_setv8qi): Enable only for TARGET_SSE4_1.
22090 (vec_extractv8qiqi): Ditto.
22092 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
22094 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
22095 order to subf instruction.
22096 * config/rs6000/fusion.md: Regenerate.
22098 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22100 * calls.c (get_size_range): Use range_of_expr instead of
22101 determine_value_range.
22102 * tree-affine.c (expr_to_aff_combination): Same.
22103 * tree-data-ref.c (split_constant_offset): Same.
22104 * tree-vrp.c (determine_value_range_1): Remove.
22105 (determine_value_range): Remove.
22106 * tree-vrp.h (determine_value_range): Remove.
22108 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22110 * function-tests.c (test_ranges): Call gimple_range_tests.
22111 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
22113 * gimple-range.cc (fur_source::get_operand): Do not call
22114 get_tree_range or gimple_range_global.
22116 (get_tree_range): Move to value-query.cc.
22117 Call get_arith_expr_range.
22118 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
22119 Include gimple-range-tests.cc.
22120 * gimple-range.h (fold_range): Add argument.
22121 (get_tree_range): Remove.
22122 * selftest.h (gimple_range_tests): New.
22123 * value-query.cc (global_range_query::range_of_expr): Add
22125 (range_query::get_tree_range): Move from gimple-range.cc.
22126 * value-query.h (class range_query): Add get_tree_range and
22127 get_arith_expr_range. Make fur_source a friend.
22128 * vr-values.c (vr_values::range_of_expr): Pass stmt to
22130 * gimple-range-tests.cc: New file.
22132 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
22134 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
22135 update_global_range.
22136 * value-query.cc (update_global_range): New.
22137 * value-query.h (update_global_range): New.
22139 2021-06-03 David Malcolm <dmalcolm@redhat.com>
22141 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
22142 printing the same location twice if there are fix-it hints,
22143 multiple locations, or a label.
22145 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
22147 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
22148 thresholds to narrow the upper bound on epilogue iterations.
22150 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
22152 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
22153 (mve_vabsq_s<mode>): Likewise.
22154 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
22155 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
22156 * config/arm/vec-common.md (neg<mode>2): Rename to
22157 <absneg_str><mode>2.
22159 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
22161 * common/config/arc/arc-common.c (arc_option_optimization_table):
22162 Remove malign-call.
22163 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
22164 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
22165 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
22166 * config/arc/arc.md (abssi2_mixed): Remove pattern.
22167 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
22168 (malign-call): Likewise.
22169 (mmixed-code): Likewise.
22170 * doc/invoke.texi (ARC): Update doc.
22172 2021-06-03 Martin Liska <mliska@suse.cz>
22174 * common.opt: Use proper Enum values.
22175 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
22176 (parse_sanitizer_options): Handle only sanitizer_opts.
22177 (common_handle_option): Just assign value.
22179 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22182 * tree-inline.c (inline_forbidden_p): Remove test on return type.
22184 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22186 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
22187 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
22188 (type_byte_size): Inline into...
22189 (add_byte_size_attribute): ...this and call add_scalar_info.
22191 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22193 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
22194 (typed_binop_from_tree): New function.
22195 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
22196 turn a divide by a power of 2 into a shift.
22197 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
22198 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
22199 typed divide by calling typed_binop_from_tree.
22201 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
22203 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
22204 (is_handled_procedure_type): Likewise.
22205 (struct loc_descr_context): Add strict_signedness field.
22206 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
22207 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
22208 (resolve_args_picking): Minor tweak.
22209 (function_to_dwarf_procedure): Initialize strict_signedness field.
22210 (type_byte_size): Likewise.
22211 (field_byte_offset): Likewise.
22212 (gen_descr_array_type_die): Likewise.
22213 (gen_variant_part): Likewise.
22214 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
22215 to true when a context is present before evaluating the arguments.
22216 <COND_EXPR>: Do not generate a useless comparison with zero.
22217 When dereferencing an address, if strict_signedness is true and the
22218 type is small and signed, use DW_OP_deref_type to do the dereference
22219 and then DW_OP_convert to convert back to the generic type.
22221 2021-06-03 Jakub Jelinek <jakub@redhat.com>
22224 * tree-inline.c (copy_tree_body_r): Handle iterators on
22225 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
22227 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
22229 * config/arc/arc.md (*bbit_di): Remove.
22231 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
22233 PR rtl-optimization/100264
22234 * ree.c (get_sub_rtx): Ignore SET expressions without register
22235 destinations and remove assertion, as it is not valid anymore
22236 with this new behaviour.
22237 (merge_def_and_ext): Eliminate destination check for register
22238 as such SET expressions can't occur anymore.
22239 (combine_reaching_defs): Likewise.
22241 2021-06-02 Jakub Jelinek <jakub@redhat.com>
22244 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
22245 -Wtype-limits warnings.
22246 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
22247 in operands to avoid -Wsign-compare warnings.
22249 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
22251 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
22254 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
22256 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
22258 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
22260 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
22262 * config/s390/subst.md(cconly_subst): Use a single constraint
22263 in (match_scratch).
22265 2021-06-02 Martin Liska <mliska@suse.cz>
22267 * ipa-icf.h: Use auto_vec for memory_access_types.
22269 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
22271 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
22272 argument from prototype.
22273 (output_logical_op): Add rtx_code argument.
22274 (compute_logical_op_length): Likewise.
22275 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
22276 to compute_a_shift_length.
22277 (output_logical_op); New argument with the rtx code rather than
22278 extracting it from an operand. Handle QImode too.
22279 (compute_logical_op_length): Similary.
22280 (compute_a_shift_length): Drop unused argument.
22281 * config/h8300/h8300.md (logicals): New code iterator.
22282 * config/h8300/logical.md (<code><mode>3 expander): Combine
22283 the "and" expander with the "ior"/"xor" expander.
22284 (bclr<mode>msx): Combine the QI/HI mode patterns.
22285 (<logical><mode>3 insns): Use code iterator rather than match_operator.
22286 Handle QImode as well. Update call to output_logical_op and
22287 compute_logical_op_length to pass in rtx_code
22288 Fix split condition on all define_insn_and_split patterns.
22289 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
22290 the flags and setting ZN via existing define_subst.
22291 * config/h8300/shiftrotate.md: Drop unused argument from
22292 calls to compute_a_shift_length.
22293 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
22295 2021-06-01 Andrew Pinski <apinski@marvell.com>
22297 PR tree-optimization/25290
22298 * tree-ssa-phiopt.c (match_simplify_replacement):
22300 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
22301 (two_value_replacement): Change the comment about
22302 conditional_replacement.
22303 (conditional_replacement): Delete.
22305 2021-06-01 Andrew Pinski <apinski@marvell.com>
22307 PR tree-optimization/95481
22308 * tree-tailcall.c (find_tail_calls): Handle empty typed
22311 2021-06-01 Andrew Pinski <apinski@marvell.com>
22313 * gimplify.c (zero_sized_field_decl): Delete
22314 (zero_sized_type): Delete
22315 (gimplify_init_ctor_eval): Use is_empty_type instead
22316 of zero_sized_field_decl.
22317 (gimplify_modify_expr): Use is_empty_type instead of
22320 2021-06-01 Jason Merrill <jason@redhat.com>
22323 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
22325 2021-06-01 Jason Merrill <jason@redhat.com>
22328 * diagnostic.h (warning_enabled_at): Declare.
22329 * diagnostic.c (diagnostic_enabled): Factor out from...
22330 (diagnostic_report_diagnostic): ...here.
22331 (warning_enabled_at): New.
22333 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
22335 * gimple-ssa-evrp.c: Enable exporting of global ranges.
22337 2021-06-01 Martin Liska <mliska@suse.cz>
22340 * doc/invoke.texi: Mention that -fgcse-after-reload
22341 is enabled with -O3.
22343 2021-06-01 liuhongt <hongtao.liu@intel.com>
22345 PR tree-optimization/98365
22346 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
22347 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
22348 (convert_scalar_cond_reduction): Ditto.
22349 (predicate_scalar_phi): Ditto.
22351 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22353 PR tree-optimization/100781
22354 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
22355 value calculation by default.
22356 (ranger_cache::enable_new_values): New.
22357 (ranger_cache::disable_new_values): New.
22358 (ranger_cache::push_poor_value): Check if new values are allowed.
22359 * gimple-range-cache.h (class ranger_cache): New member/methods.
22360 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
22361 statement, and disable/renable new value calculation.
22363 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22365 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
22366 (ranger_cache::range_of_def): New.
22367 (ranger_cache::entry_range): New.
22368 (ranger_cache::exit_range): New.
22369 (ranger_cache::range_of_expr): Adjust.
22370 (ranger_cache::range_on_edge): Adjust.
22371 (ranger_cache::propagate_cache): Call exit_range directly.
22372 * gimple-range-cache.h (class ranger_cache): Adjust.
22374 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22376 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
22377 gori_compute being a member rather than base class.
22378 dervied call to member call.
22379 (ranger_cache::dump): No longer dump gori_map.
22380 (ranger_cache::dump_bb): New.
22381 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
22382 being a member rather than base class.
22383 (ranger_cache::set_global_range): Ditto.
22384 (ranger_cache::ssa_range_in_bb): Ditto.
22385 (ranger_cache::range_of_expr): New.
22386 (ranger_cache::range_on_edge): New.
22387 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
22388 (ranger_cache::propagate_cache): Adjust debugging output.
22389 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
22391 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
22392 member, and inherit from range_query instead.
22393 (ranger_cache::dump_bb): New. split from dump.
22394 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
22395 (gori_compute::expr_range_at_stmt): Delete.
22396 (gori_compute::compute_name_range_op): Delete.
22397 (gori_compute::compute_operand_range_switch): Add fur_source.
22398 (gori_compute::compute_operand_range): Add fur_source param, inline
22399 old compute_name_range_op and optimize_logical_operands.
22400 (struct tf_range): Delete.
22401 (gori_compute::logical_combine): Adjust
22402 (gori_compute::optimize_logical_operands): Delete.
22403 (gori_compute::compute_logical_operands_in_chain): Delete.
22404 (gori_compute::compute_logical_operands): Adjust.
22405 (gori_compute::compute_operand1_range): Adjust to fur_source.
22406 (gori_compute::compute_operand2_range): Ditto.
22407 (gori_compute::compute_operand1_and_operand2_range): Ditto.
22408 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
22409 and adjust to fur_source.
22410 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
22411 range_query and fur_source.
22412 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
22413 from the ranger_cache..
22414 (gimple_ranger::fold_range_internal): Adjust to base class change of
22416 (gimple_ranger::dump_bb): Adjust dump.
22417 * gimple-range.h (gimple_ranger):export gori computes object.
22419 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
22421 PR tree-optimization/100774
22422 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
22423 Constant values are also not stale.
22424 (ranger_cache::set_global_range): Range invariant values should also
22425 have the correct timestamp.
22427 2021-05-31 Martin Liska <mliska@suse.cz>
22429 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
22430 Unpack FUNCTION_DECL_DECL_TYPE.
22431 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
22432 Stream FUNCTION_DECL_DECL_TYPE instead of
22433 DECL_IS_OPERATOR_NEW_P.
22434 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
22436 (DECL_IS_OPERATOR_NEW_P): Likewise.
22437 (DECL_IS_OPERATOR_DELETE_P): Likewise.
22438 (DECL_LAMBDA_FUNCTION_P): Likewise.
22440 2021-05-31 Richard Biener <rguenther@suse.de>
22443 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
22444 * internal-fn.def (SHUFFLEVECTOR): New.
22445 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
22446 * doc/extend.texi: Document __builtin_shufflevector.
22448 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
22451 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
22452 indexed form addresses.
22454 2021-05-29 Jeff Law <jlaw@tachyum.com>
22456 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
22457 parameter. Call callers fixed.
22459 (output_plussi): Add FALLTHRU markers.
22460 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
22462 2021-05-29 Jakub Jelinek <jakub@redhat.com>
22464 PR middle-end/99928
22465 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
22466 combined with parallel, make sure to add shared clause to
22467 parallel for explicit linear clause.
22469 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
22471 PR tree-optimization/100787
22472 * gimple-ssa-evrp.c: Disable exporting of global ranges.
22474 2021-05-28 Jason Merrill <jason@redhat.com>
22476 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
22477 operator--, operator*, operator==, and operator!=.
22478 (class tsi_range): New.
22480 2021-05-28 Richard Biener <rguenther@suse.de>
22482 PR tree-optimization/100778
22483 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
22484 trapping ops in different BBs.
22486 2021-05-28 Richard Biener <rguenther@suse.de>
22489 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
22490 copy fntype from original call.
22492 2021-05-28 Martin Liska <mliska@suse.cz>
22494 PR gcov-profile/100751
22495 * doc/gcov.texi: Revert partially a hunk that was wrong.
22497 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
22499 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
22501 (HAVE_sync_compare_and_swaphi): Likewise.
22502 (HAVE_sync_compare_and_swapsi): Likewise.
22504 2021-05-28 Jakub Jelinek <jakub@redhat.com>
22506 PR middle-end/99928
22507 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
22509 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
22511 * gimplify.c (gimplify_omp_affinity): New.
22512 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
22513 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
22514 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
22515 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
22516 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
22518 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
22519 Richard Biener <rguenther@suse.de>
22521 * match.pd <popcount & / + pattern matching>:
22522 When generating popcount directly fails, try doing it in two halves.
22524 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
22526 * Makefile.in (generated_files): Add gimple-match.c and
22529 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
22531 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
22533 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
22535 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
22537 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
22539 PR tree-optimization/99398
22540 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
22541 where the fed operands are CTOR/CST and propagated through
22542 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
22543 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
22545 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
22548 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22550 * config/i386/mmx.md (addv2sf3): Do not call
22551 ix86_fixup_binary_operands_no_copy.
22554 (<smaxmin:code>v2sf3): Ditto.
22555 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
22556 (<plusminus:insn><VI_32:mode>3): Remove expander.
22557 (<plusminus:insn><VI_32:mode>3): Rename from
22558 "*<plusminus:insn><VI_32:mode>3".
22559 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
22560 (mulv2hi3): Remove expander.
22561 (mulv2hi3): Rename from *mulv2hi3.
22562 (<s>mulv2hi3_highpart): Remove expander.
22563 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
22564 (<smaxmin:code><MMXMODE14:mode>3): Rename from
22565 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
22566 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
22567 (SMAXMIN_MMXMODEI): Remove mode iterator.
22568 (<smaxmin:code>v4hi3): New expander.
22569 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
22570 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
22571 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
22572 (SMAXMIN_VI_32): Remove mode iterator.
22573 (<umaxmin:code><MMXMODE24:mode>3): Rename from
22574 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
22575 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
22576 (UMAXMIN_MMXMODEI): Remove mode iterator.
22577 (<umaxmin:code>v8qi3): New expander.
22578 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
22579 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
22580 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
22581 (UMAXMIN_VI_32): Remove mode iterator.
22582 (<any_shift:insn>v2hi3): Remove expander.
22583 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
22584 (<any_logic:code><MMXMODEI:mode>3): Do not call
22585 ix86_fixup_binary_operands_no_copy.
22586 (<any_logic:code><VI_32:mode>3): Remove expander.
22587 (<any_logic:code><VI_32:mode>3): Rename from
22588 "*<any_logic:code><VI_32:mode>3".
22589 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
22590 * config/i386/sse.md (div<VF2:mode>3): Do not call
22591 ix86_fixup_binary_operands_no_copy.
22592 (div<VF1:mode>3): Ditto.
22593 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
22594 (smulhrsv4hi3): Ditto.
22595 (smulhrsv2hi3): Ditto.
22597 2021-05-27 Martin Sebor <msebor@redhat.com>
22599 * ggc.h (gt_ggc_mx): Add overloads for all integers.
22601 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
22603 (hash_map::operator==): New function.
22605 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22608 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
22609 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
22610 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
22611 (*xop_maskcmp<VI_32:mode>3): Ditto.
22612 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
22613 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
22615 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
22618 * config/arm/arm.c (arm_configure_build_target): Remove parameter
22619 opts_set, directly check opts parameters for being non-null.
22620 (arm_option_restore): Update call to arm_configure_build_target.
22621 (arm_option_override): Likewise.
22622 (arm_can_inline_p): Likewise.
22623 (arm_valid_target_attribute_tree): Likewise.
22624 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
22625 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
22628 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
22630 * vr-values.c (simplify_conversion_using_ranges): Use
22631 get_range_query instead of get_global_range_query.
22633 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
22635 * gimple-range.cc (get_range_global): Move to value-query.cc.
22636 (gimple_range_global): Same.
22637 (get_global_range_query): Same.
22638 (global_range_query::range_of_expr): Same.
22639 * gimple-range.h (class global_range_query): Move to
22641 (gimple_range_global): Same.
22642 * tree-ssanames.c (get_range_info): Move to value-query.cc.
22643 (get_ptr_nonnull): Same.
22644 * tree-ssanames.h (get_range_info): Remove.
22645 (get_ptr_nonnull): Remove.
22646 * value-query.cc (get_ssa_name_range_info): Move from
22648 (get_ssa_name_ptr_info_nonnull): Same.
22649 (get_range_global): Move from gimple-range.cc.
22650 (gimple_range_global): Same.
22651 (get_global_range_query): Same.
22652 (global_range_query::range_of_expr): Same.
22653 * value-query.h (class global_range_query): Move from
22655 (gimple_range_global): Same.
22657 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
22660 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
22661 (uavgv2hi3_ceil): Ditto.
22663 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
22666 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
22668 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22670 * tree-ssanames.c (get_range_info): Merge both copies of
22671 get_range_info into one that works with irange.
22672 * tree-ssanames.h (get_range_info): Remove version that works on
22675 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22677 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
22678 (expand_builtin_strnlen): Same.
22679 (determine_block_size): Same.
22680 * fold-const.c (expr_not_equal_to): Same.
22681 * gimple-fold.c (size_must_be_zero_p): Same.
22682 * gimple-match-head.c: Include gimple-range.h.
22683 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
22684 * gimple-ssa-warn-restrict.c
22685 (builtin_memref::extend_offset_range): Same.
22686 * graphite-sese-to-poly.c (add_param_constraints): Same.
22687 * internal-fn.c (get_min_precision): Same.
22688 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
22689 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
22691 * tree-data-ref.c (split_constant_offset): Same.
22692 (dr_step_indicator): Same.
22693 * tree-dfa.c (get_ref_base_and_extent): Same.
22694 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
22695 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
22696 (determine_value_range): Same.
22697 (record_nonwrapping_iv): Same.
22698 (infer_loop_bounds_from_signedness): Same.
22699 (scev_var_range_cant_overflow): Same.
22700 * tree-ssa-phiopt.c (two_value_replacement): Same.
22701 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
22702 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
22703 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
22705 (dump_strlen_info): Same.
22706 (set_strlen_range): Same.
22707 (maybe_diag_stxncpy_trunc): Same.
22708 (get_len_or_size): Same.
22709 (handle_integral_assign): Same.
22710 * tree-ssa-structalias.c (find_what_p_points_to): Same.
22711 * tree-ssa-uninit.c (find_var_cmp_const): Same.
22712 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
22713 * tree-vect-patterns.c (vect_get_range_info): Same.
22714 (vect_recog_divmod_pattern): Same.
22715 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
22716 (register_edge_assert_for_2): Same.
22717 (determine_value_range_1): Same.
22718 * tree.c (get_range_pos_neg): Same.
22719 * vr-values.c (vr_values::get_lattice_entry): Same.
22720 (vr_values::update_value_range): Same.
22721 (simplify_conversion_using_ranges): Same.
22723 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22725 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
22726 get_range_query instead of query argument.
22727 (pass_walloca::execute): Enable and disable global ranger.
22729 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22731 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
22733 (rvrp_folder::~rvrp_folder): Call disable_ranger.
22734 (hybrid_folder::hybrid_folder): Call enable_ranger.
22735 (hybrid_folder::~hybrid_folder): Call disable_ranger.
22737 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
22739 * function.c (allocate_struct_function): Set cfun->x_range_query.
22740 * function.h (struct function): Declare x_range_query.
22741 (get_range_query): New.
22742 (get_global_range_query): New.
22743 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
22744 Remove call to safe_grow_cleared.
22745 * gimple-range.cc (get_range_global): New.
22746 (gimple_range_global): Move from gimple-range.h.
22747 (get_global_range_query): New.
22748 (global_range_query::range_of_expr): New.
22749 (enable_ranger): New.
22750 (disable_ranger): New.
22751 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
22752 (class global_range_query): New.
22753 (enable_ranger): New.
22754 (disable_ranger): New.
22755 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
22756 dump_all_value_ranges to dump.
22757 * tree-vrp.c (vrp_prop::finalize): Same.
22758 * value-query.cc (range_query::dump): New.
22759 * value-query.h (range_query::dump): New.
22760 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
22761 (vr_values::dump): ...this.
22762 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
22763 dump and make virtual.
22765 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
22767 * config/i386/i386.c (ix86_autovectorize_vector_modes):
22768 Add V4QImode and V16QImode for TARGET_SSE2.
22769 * doc/sourcebuild.texi (Vector-specific attributes):
22770 Add vect64 and vect32 description.
22772 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
22774 * gimple-range-gori.cc (range_def_chain::register_dependency):
22775 Resize m_def_chain when needed.
22777 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
22779 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
22780 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
22781 * config/arm/vec-common.md: .. here. Add support for MVE.
22783 2021-05-26 Jakub Jelinek <jakub@redhat.com>
22785 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
22787 * config/microblaze/microblaze.c (microblaze_legitimize_address,
22789 microblaze_option_override, print_operand): Likewise.
22790 * config/microblaze/microblaze.md (call_internal_plt,
22791 call_value_intern_plt, call_value_intern): Likewise.
22792 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
22793 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
22794 call_value_multiple_internal1): Likewise.
22795 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
22797 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
22799 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
22800 arc_ccfsm_advance, symbolic_reference_mentioned_p,
22801 arc_raw_symbolic_reference_mentioned_p): Remove register
22804 2021-05-26 Jakub Jelinek <jakub@redhat.com>
22807 * omp-low.c: Include omp-offload.h.
22808 (create_omp_child_function): If current_function_decl has
22809 "omp declare target" attribute and is_gimple_omp_offloaded,
22810 remove that attribute from the copy of attribute list and
22811 add "omp target entrypoint" attribute instead.
22812 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
22813 variables for offloading if in omp_maybe_offloaded_ctx.
22814 * omp-offload.c (pass_omp_target_link::execute): Nullify second
22815 argument to GOMP_target_data_ext in offloaded code.
22817 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22819 * config/csky/csky.c (csky_can_change_mode_class): Delete.
22820 For csky, HF/SF mode use the low bits of VREGS.
22822 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
22824 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
22825 DECL which is a reference for OMP.
22827 2021-05-26 Martin Liska <mliska@suse.cz>
22829 PR gcov-profile/100751
22830 * doc/gcov.texi: Document that __gcov_dump can be called just
22831 once and that __gcov_reset resets run-time counters.
22833 2021-05-26 Martin Liska <mliska@suse.cz>
22835 * doc/install.texi: Port relevant part from install-old.texi
22836 and re-generate list of CPUs and systems.
22838 2021-05-26 Martin Liska <mliska@suse.cz>
22840 * Makefile.in: Remove it.
22841 * doc/include/fdl.texi: Update next/previous chapters.
22842 * doc/install.texi: Likewise.
22843 * doc/install-old.texi: Removed.
22845 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22847 * config/csky/csky.c (ck810_legitimate_index_p): Support
22848 "base + index" with DF mode.
22849 * config/csky/constraints.md ("Y"): New constraint for memory operands
22850 without index register.
22851 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
22852 when mov between memory and general registers, and lower their priority.
22853 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
22855 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22857 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
22859 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22861 * config/csky/csky.md (untyped_call): Emit clobber for return
22862 registers to mark them used.
22864 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
22866 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
22868 2021-05-26 Andrew Pinski <apinski@marvell.com>
22870 * match.pd (x < 0 ? ~y : y): New patterns.
22872 2021-05-26 Andrew Pinski <apinski@marvell.com>
22874 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
22875 A?POW2:0 and A?0:POW2.
22877 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22879 * gimple-range-gori.cc (class logical_stmt_cache): Delete
22880 (logical_stmt_cache::logical_stmt_cache ): Delete.
22881 (logical_stmt_cache::~logical_stmt_cache): Delete.
22882 (logical_stmt_cache::cache_entry::dump): Delete.
22883 (logical_stmt_cache::get_range): Delete.
22884 (logical_stmt_cache::cached_name ): Delete.
22885 (logical_stmt_cache::same_cached_name): Delete.
22886 (logical_stmt_cache::cacheable_p): Delete.
22887 (logical_stmt_cache::slot_diagnostics ): Delete.
22888 (logical_stmt_cache::dump): Delete.
22889 (gori_compute_cache::gori_compute_cache): Delete.
22890 (gori_compute_cache::~gori_compute_cache): Delete.
22891 (gori_compute_cache::compute_operand_range): Delete.
22892 (gori_compute_cache::cache_stmt): Delete.
22893 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
22895 (class gori_compute_cache): Delete.
22897 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22899 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
22901 (fold_using_range::range_of_address): Adjust.
22902 (fold_using_range::range_of_phi): Adjust.
22903 * gimple-range.h (class fur_source): Adjust.
22904 (fur_source::fur_source): Adjust.
22906 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22908 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
22909 from expr_range_in_bb and adjust.
22910 (gori_compute::compute_name_range_op): Adjust.
22911 (gori_compute::optimize_logical_operands): Adjust.
22912 (gori_compute::compute_logical_operands_in_chain): Adjust.
22913 (gori_compute::compute_operand1_range): Adjust.
22914 (gori_compute::compute_operand2_range): Adjust.
22915 (ori_compute_cache::cache_stmt): Adjust.
22916 * gimple-range-gori.h (gori_compute): Rename prototype.
22918 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22920 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
22921 checked only after range_of_stmt, not range_on_entry.
22922 (gimple_ranger::range_on_entry): Check for non-null in any
22923 predecessor block, if it is not already non-null.
22924 (gimple_ranger::range_on_exit): DOnt check for non-null after
22925 range on entry call.
22926 (gimple_ranger::dump_bb): New. Split from dump.
22927 (gimple_ranger::dump): Adjust.
22928 * gimple-range.h (class gimple_ranger): Adjust.
22930 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22932 * gimple-range-cache.cc (struct range_timestamp): Delete.
22933 (class temporal_cache): Adjust.
22934 (temporal_cache::get_timestamp): Delete.
22935 (temporal_cache::set_dependency): Delete.
22936 (temporal_cache::temporal_value): Adjust.
22937 (temporal_cache::current_p): Take dependencies as params.
22938 (temporal_cache::set_timestamp): Adjust.
22939 (temporal_cache::set_always_current): Adjust.
22940 (ranger_cache::get_non_stale_global_range): Adjust.
22941 (ranger_cache::register_dependency): Delete.
22942 * gimple-range-cache.h (class range_cache): Adjust.
22944 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22946 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
22948 (range_def_chain::~range_def_chain): Dispose of obstack rather than
22949 each individual bitmap.
22950 (range_def_chain::set_import): New.
22951 (range_def_chain::get_imports): New.
22952 (range_def_chain::chain_import_p): New.
22953 (range_def_chain::register_dependency): Rename from build_def_chain
22955 (range_def_chain::def_chain_in_bitmap_p): New.
22956 (range_def_chain::add_def_chain_to_bitmap): New.
22957 (range_def_chain::has_def_chain): Just check first depenedence.
22958 (range_def_chain::get_def_chain): Process imports, use generic
22959 register_dependency routine.
22960 (range_def_chain::dump): New.
22961 (gori_map::gori_map): Allocate import list.
22962 (gori_map::~gori_map): Release imports.
22963 (gori_map::exports): Check for past allocated block size.
22964 (gori_map::imports): New.
22965 (gori_map::def_chain_in_export_p): Delete.
22966 (gori_map::is_import_p): New.
22967 (gori_map::maybe_add_gori): Handle imports.
22968 (gori_map::dump): Adjust output, add imports.
22969 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
22970 (gori_export_iterator::gori_export_iterator): New.
22971 (gori_export_iterator::next): New.
22972 (gori_export_iterator::get_name): New.
22973 * gimple-range-gori.h (range_def_chain): Add imports and direct
22974 dependecies via struct rdc.
22975 (range_def_chain::depend1): New.
22976 (range_def_chain::depend2): New.
22977 (class gori_map): Adjust.
22978 (FOR_EACH_GORI_IMPORT_NAME): New.
22979 (FOR_EACH_GORI_EXPORT_NAME): New.
22980 (class gori_export_iterator): New.
22982 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22984 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
22985 export cache filling to here.
22986 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
22988 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
22990 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
22991 (gori_map): Move to gimple-range-gori.h.
22992 (gori_compute::gori_compute): Adjust.
22993 (gori_compute::~gori_compute): Delete.
22994 (gori_compute::compute_operand_range_switch): Adjust.
22995 (gori_compute::compute_operand_range): Adjust.
22996 (gori_compute::compute_logical_operands): Adjust.
22997 (gori_compute::has_edge_range_p ): Adjust.
22998 (gori_compute::set_range_invariant): Delete.
22999 (gori_compute::dump): Adjust.
23000 (gori_compute::outgoing_edge_range_p): Adjust.
23001 * gimple-range-gori.h (class range_def_chain): Relocate here.
23002 (class gori_map): Relocate here.
23003 (class gori_compute): Inherit from gori_map, and adjust.
23005 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
23007 * value-range.cc (range_tests_legacy): Use
23008 build_nonstandard_integer_type instead of int and short.
23010 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23012 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
23013 when really creating an initialization statement for it.
23015 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
23017 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
23019 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
23021 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
23023 2021-05-25 Martin Liska <mliska@suse.cz>
23025 PR tree-optimization/92860
23027 * optc-save-gen.awk: Remove exceptions.
23029 2021-05-25 Martin Liska <mliska@suse.cz>
23031 * asan.h (sanitize_coverage_p): New function.
23032 * doc/extend.texi: Document it.
23033 * fold-const.c (fold_range_test): Use sanitize_flags_p
23034 instead of flag_sanitize_coverage.
23035 (fold_truth_andor): Likewise.
23036 * sancov.c: Likewise.
23037 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
23038 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
23039 -fsanitize-coverage when inlining.
23041 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23043 * config/csky/csky-modes.def : Fix copyright.
23045 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
23047 * config/csky/csky-modes.def : Amend copyright.
23048 * config/csky/csky_insn_fpuv2.md : Likewise.
23049 * config/csky/csky_insn_fpuv3.md : Likewise.
23051 2021-05-25 Richard Biener <rguenther@suse.de>
23053 PR middle-end/100727
23054 * calls.c (initialize_argument_information): Explicitely test
23055 for WITH_SIZE_EXPR.
23056 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
23058 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23060 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
23061 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
23062 the signle definition. The signle definition may not work well
23063 at simplify_subreg_regno().
23064 (HARD_FRAME_POINTER_REGNUM): New.
23065 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
23066 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
23067 csky_initial_elimination_offset, csky_expand_prologue,
23068 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
23070 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23072 * config/csky/csky.c (csky_option_override):
23073 Init csky_arch_isa_features[] in advance, so TARGET_DSP
23074 and TARGET_DIV can be set well.
23076 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23078 * config/csky/constraints.md ("l", "h"): Delete.
23079 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
23080 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
23081 * config/csky/csky.c (regno_reg_classm,
23082 csky_secondary_reload, csky_register_move_cost):
23083 Use HILO_REGS instead of LO_REGS and HI_REGS.
23085 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
23087 * config/csky/constraints.md ("W"): New constriant for mem operand
23088 with base reg, index register.
23089 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
23090 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
23092 ("Dv"): New constraint for const double value that can be used at
23094 * config/csky/csky-modes.def (HFmode): New mode.
23095 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
23096 to "csky_valid_mem_constraint_operand" and support new constraint
23098 (csky_get_movedouble_length): New.
23099 (fpuv3_output_move): New.
23100 (fpuv3_const_double): New.
23101 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
23102 (decompose_csky_address): Refine.
23103 (csky_print_operand): New "CONST_DOUBLE" operand.
23104 (csky_output_move): Support fpv3 instructions.
23105 (csky_get_movedouble_length): New.
23106 (fpuv3_output_move): New.
23107 (fpuv3_const_double): New.
23108 (csky_emit_compare): Cover float comparsion.
23109 (csky_emit_compare_float): Refine.
23110 (csky_vaild_fpuv2_mem_operand): Rename to
23111 "csky_valid_mem_constraint_operand" and support new constraint "W".
23112 (ck860_rtx_costs): New.
23113 (csky_rtx_costs): Add the cost calculation of CK860.
23114 (regno_reg_class): New vregs for fpuv3.
23115 (csky_dbx_regno): Likewise.
23116 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
23117 (csky_conditional_register_usage): Suporrot fpuv3.
23118 (csky_dwarf_register_span): Suporrot fpuv3.
23119 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
23120 (ck810_legitimate_index_p): Support fp16.
23121 * config/csky/csky.h (TARGET_TLS): ADD CK860.
23122 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
23123 (TARGET_SINGLE_FPU): Support fpuv3.
23124 (TARGET_SUPPORT_FPV3): New.
23125 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
23126 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
23127 REG_CLASS_CONTENTS): Support fpuv3.
23128 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
23129 (csky_movsf_fpv2): Likewise.
23130 (ck801_movsf): Likewise.
23131 (csky_movsf): Likewise.
23133 (csky_movdf_fpv2): Likewise.
23134 (ck801_movdf): Likewise.
23135 (csky_movdf): Likewise.
23136 (movsicc): Refine. Use "comparison_operatior" instead of
23137 "ordered_comparison_operatior".
23138 (addsicc): Likewise.
23139 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
23140 (call_value_internal_vh): New.
23141 * config/csky/csky_cores.def (CK860): New arch and cpu.
23146 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
23147 into emit-patterns and match-patterns, remain the emit-patterns here,
23148 and move the match-patterns to csky_insn_fpuv2.md or
23149 csky_insn_fpuv3.md.
23150 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
23151 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
23153 * config/csky/csky_isa.def (fcr): New.
23158 (CK860): New definition for ck860.
23159 * config/csky/csky_tables.opt (ck860): New processors ck860,
23160 ck860f. And new arch ck860.
23165 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
23166 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
23167 * config/csky/t-csky-elf: Support 860.
23168 * config/csky/t-csky-linux: Likewise.
23169 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
23171 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
23173 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
23174 add generation of logical-add and add-logical fusion pairs.
23175 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
23177 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
23178 logical-add and add-logical fusion by default.
23179 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
23180 -mpower10-fusion-add-logical options.
23181 * config/rs6000/fusion.md: Regenerate file.
23183 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
23185 * value-range.cc (irange::legacy_equal_p): Check type when
23186 comparing VR_VARYING types.
23187 (range_tests_legacy): Test comparing VARYING ranges of different
23190 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
23192 * config/aarch64/aarch64.c (neoversen1_tunings):
23193 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
23195 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
23197 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
23198 extern weak symbols. Limit symbol offsets for non-GOT symbols with
23201 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
23203 * config/arm/neon.md (vec_load_lanesxi<mode>)
23204 (vec_store_lanexoi<mode>): Move ...
23205 * config/arm/vec-common.md: here.
23207 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
23209 * config/arm/neon.md (vec_load_lanesoi<mode>)
23210 (vec_store_lanesoi<mode>): Move ...
23211 * config/arm/vec-common.md: here.
23213 2021-05-24 liuhongt <hongtao.liu@intel.com>
23216 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
23217 stmt with GIMPLE_NOP when lhs doesn't exist.
23219 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
23222 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
23223 New instruction pattern.
23224 (*push<VI_32:mode>2): Ditto.
23225 (push splitter for SSE registers): New splitter.
23227 2021-05-23 Andrew Pinski <apinski@marvell.com>
23229 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
23231 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
23233 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
23234 * config/rs6000/fusion.md: Regenerate file.
23236 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23238 * config/rs6000/genfusion.pl (gen_addadd): New function.
23239 * config/rs6000/fusion.md: Regenerate file.
23240 * config/rs6000/rs6000-cpus.def: Add
23241 OPTION_MASK_P10_FUSION_2ADD to masks.
23242 * config/rs6000/rs6000.c (rs6000_option_override_internal):
23243 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
23244 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
23246 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23248 PR middle-end/99928
23249 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
23250 * gimplify.c (enum gimplify_omp_var_data): Fix up
23251 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
23252 (omp_lastprivate_for_combined_outer_constructs): If combined target
23253 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
23254 GOVD_MAP | GOVD_SEEN.
23255 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
23256 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
23257 (gimplify_adjust_omp_clauses): For firstprivate clauses with
23258 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
23259 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
23260 let it be replaced by implicit map clause.
23262 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23264 PR middle-end/99928
23265 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
23267 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
23268 (gimplify_omp_for): Likewise.
23270 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23272 PR middle-end/90115
23273 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
23274 'external' in blocks.
23276 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23278 PR middle-end/90115
23279 * flag-types.h (enum openacc_privatization): New.
23280 * params.opt (-param=openacc-privatization): New.
23281 * doc/invoke.texi (openacc-privatization): Document it.
23282 * omp-general.h (get_openacc_privatization_dump_flags): New
23284 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
23285 * omp-offload.c (execute_oacc_device_lower)
23286 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
23287 * target.def (goacc.adjust_private_decl): Add 'location_t'
23289 * doc/tm.texi: Regenerate.
23290 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
23291 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
23292 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
23293 Likewise. Preserve it for...
23294 (nvptx_goacc_expand_var_decl): ... use here.
23296 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23298 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
23301 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23303 PR middle-end/90115
23304 * omp-low.c (oacc_privatization_candidate_p): New function.
23305 (oacc_privatization_scan_clause_chain)
23306 (oacc_privatization_scan_decl_chain): Use it. Also
23307 'gcc_checking_assert' that we're not seeing duplicates.
23309 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23311 PR middle-end/90115
23312 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
23315 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23317 PR middle-end/90115
23318 * omp-offload.c (execute_oacc_device_lower): Explain.
23320 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23322 PR middle-end/90115
23323 * omp-offload.c (execute_oacc_device_lower)
23324 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
23326 * internal-fn.c (expand_UNIQUE): Don't expect
23327 'IFN_UNIQUE_OACC_PRIVATE'.
23329 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23331 PR middle-end/90115
23332 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
23334 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
23336 PR middle-end/90115
23337 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
23338 (nvptx_goacc_expand_var_decl): Tighten.
23340 2021-05-21 Julian Brown <julian@codesourcery.com>
23341 Chung-Lin Tang <cltang@codesourcery.com>
23342 Thomas Schwinge <thomas@codesourcery.com>
23344 PR middle-end/90115
23345 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
23346 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
23347 * doc/tm.texi: Regenerate.
23348 * expr.c (expand_expr_real_1): Expand decls using the
23349 expand_var_decl OpenACC hook if defined.
23350 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
23351 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
23352 * omp-low.c (omp_context): Add oacc_privatization_candidates
23354 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
23356 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
23357 private marker's gimple call arguments, and pass it to
23358 lower_oacc_reductions.
23359 (oacc_privatization_scan_clause_chain)
23360 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
23362 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
23363 * omp-offload.c (convert.h): Include.
23364 (oacc_loop_xform_head_tail): Treat private-variable markers like
23365 fork/join when transforming head/tail sequences.
23366 (struct var_decl_rewrite_info): Add struct.
23367 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
23368 (execute_oacc_device_lower): Support rewriting gang-private
23369 variables using target hook, and fix up addr_expr and var_decl
23371 * target.def (adjust_private_decl, expand_var_decl): New hooks.
23372 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
23374 (gcn_goacc_adjust_private_decl): ...this.
23375 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
23377 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
23378 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
23379 definition using gcn_goacc_adjust_gangprivate_decl...
23380 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
23381 gcn_goacc_adjust_private_decl.
23382 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
23383 (gang_private_shared_size): New global variable.
23384 (gang_private_shared_align): Likewise.
23385 (gang_private_shared_sym): Likewise.
23386 (gang_private_shared_hmap): Likewise.
23387 (nvptx_option_override): Initialize these.
23388 (nvptx_file_end): Output gang_private_shared_sym.
23389 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
23391 (nvptx_set_current_function): Clear gang_private_shared_hmap.
23392 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
23393 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
23395 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
23397 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
23399 2021-05-21 Richard Biener <rguenther@suse.de>
23400 H.J. Lu <hjl.tools@gmail.com>
23402 PR middle-end/90773
23403 * expr.c (expand_constructor): Elide expand_constructor if
23404 move by pieces is preferred.
23406 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23408 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
23409 Take a flag and mode value as arguments.
23410 (aarch64_modifies_global_state_p): Likewise.
23411 (aarch64_reads_global_state_p): Likewise.
23412 (aarch64_could_trap_p): Likewise.
23413 (aarch64_get_attributes): Likewise.
23414 (aarch64_init_simd_builtins): Adjust callsite of above.
23415 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
23416 function attributes to apply to builtins.
23417 (aarch64_init_crc32_builtins): Likewise.
23418 (aarch64_init_builtin_rsqrt): Likewise.
23420 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
23422 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
23423 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
23424 (gen_2logical): Use new fusion types.
23425 * config/rs6000/fusion.md: Regenerate.
23427 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23430 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
23431 Handle V4QI and V2HI modes.
23432 (ix86_expand_sse_movcc): Ditto.
23433 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
23434 New instruction pattern.
23435 (*eq<VI_32:mode>3): Ditto.
23436 (*gt<VI_32:mode>3): Ditto.
23437 (*xop_pcmov_<VI_32:mode>): Ditto.
23438 (mmx_pblendvb32): Ditto.
23439 (mmx_pblendvb64): Rename from mmx_pblendvb.
23440 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
23441 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
23442 (vcond<VI_32:mode><VI_32:mode>): Ditto.
23443 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
23444 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
23446 2021-05-21 Jakub Jelinek <jakub@redhat.com>
23448 PR tree-optimization/94589
23449 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
23450 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
23451 not the same as cmp3) treat <= the same as < and >= the same as >.
23452 Don't require that cond2_phi_edge is true edge, instead take
23453 false/true edges into account based on cmp1/cmp2 comparison kinds.
23455 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
23458 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
23459 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
23460 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
23461 using SMAXMIN_MMXMODEI mode iterator.
23462 (*<smaxmin:code>v4qi3): New insn pattern.
23463 (*<smaxmin:code>v2hi3): Ditto.
23464 (SMAXMIN_VI_32): New mode iterator.
23465 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
23466 (UMAXMIN_MMXMODEI): New mode iterator.
23467 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
23468 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
23469 using UMAXMIN_MMXMODEI mode iterator.
23470 (*<umaxmin:code>v4qi3): New insn pattern.
23471 (*<umaxmin:code>v2hi3): Ditto.
23472 (UMAXMIN_VI_32): New mode iterator.
23473 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
23474 (abs<VI_32:mode>2): New insn pattern.
23475 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
23476 * config/i386/sse.md: ... here.
23478 2021-05-20 Clement Chigot <clement.chigot@atos.net>
23479 David Edelsohn <dje.gcc@gmail.com>
23481 * collect2.c (scan_prog_file): Issue non-fatal warning for
23484 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
23486 * doc/invoke.texi (-Wno-c++11-extensions)
23487 (-Wno-c++14-extensions, -Wno-c++17-extensions)
23488 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
23491 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
23493 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
23494 * config/darwin.c (darwin_override_options): Likewise.
23495 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
23496 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
23497 (DWARF2_FRAME_REG_OUT): Likewise.
23498 * config/mips/mips.c (mips_output_filename): Likewise.
23499 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
23501 (rs6000_dbx_register_number): Likewise.
23502 * dbxout.c: Include flags.h.
23503 * dwarf2cfi.c (cfi_label_required_p): Likewise.
23504 (dwarf2out_do_frame): Likewise.
23505 * except.c: Include flags.h.
23506 * final.c (dwarf2_debug_info_emitted_p): Likewise.
23507 (final_scan_insn_1): Likewise.
23508 * flags.h (dwarf_debuginfo_p): New function declaration.
23509 * opts.c (dwarf_debuginfo_p): New function definition.
23510 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
23511 * toplev.c (process_options): Likewise.
23513 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
23515 * common.opt: Change type to support bitmasks.
23516 * flag-types.h (enum debug_info_type): Rename enumerator constants.
23517 (NO_DEBUG): New bitmask.
23518 (DBX_DEBUG): Likewise.
23519 (DWARF2_DEBUG): Likewise.
23520 (XCOFF_DEBUG): Likewise.
23521 (VMS_DEBUG): Likewise.
23522 (VMS_AND_DWARF2_DEBUG): Likewise.
23523 * flags.h (debug_set_to_format): New function declaration.
23524 (debug_set_count): Likewise.
23525 (debug_set_names): Likewise.
23526 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
23527 (debug_set_to_format): New function definition.
23528 (debug_set_count): Likewise.
23529 (debug_set_names): Likewise.
23530 (set_debug_level): Update access to debug_type_names.
23531 * toplev.c: Likewise.
23533 2021-05-20 Martin Sebor <msebor@redhat.com>
23535 PR middle-end/100684
23536 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
23538 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
23541 * config/i386/i386.md (isa): Remove x64_bmi.
23542 (enabled): Remove x64_bmi.
23543 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
23544 Remove general register alternative.
23545 (*andnot<VI_32:mode>3): Ditto.
23546 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
23547 (*<any_logic:code><VI_32:mode>3): Ditto.
23549 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
23551 * config/arm/arm.c: Include head files tree-vectorizer.h and
23554 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
23557 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
23558 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
23559 (<s>mulv4hi3_highpart): New expander.
23560 (*<s>mulv2hi3_highpart): New insn pattern.
23561 (<s>mulv2hi3_higpart): New expander.
23562 (*<any_shift:insn>v2hi3): New insn pattern.
23563 (<any_shift:insn>v2hi3): New expander.
23564 * config/i386/sse.md (smulhrsv2hi3): New expander.
23565 (*smulhrsv2hi3): New insn pattern.
23567 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
23569 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
23571 * params.opt (vect-inner-loop-cost-factor): New.
23572 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
23573 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
23574 tree-vectorizer.h and its required ones.
23575 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
23576 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
23577 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
23578 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
23579 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
23580 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
23582 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
23583 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
23584 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
23586 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
23587 Torbjörn Svensson <torbjorn.svensson@st.com>
23590 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
23592 2021-05-20 Jakub Jelinek <jakub@redhat.com>
23594 PR middle-end/99928
23595 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
23596 explicit linear clause when combined with target, make it map(tofrom:)
23597 instead of no clause or firstprivate.
23599 2021-05-20 Jakub Jelinek <jakub@redhat.com>
23601 PR tree-optimization/94589
23602 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
23603 of integral conversions.
23605 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
23607 * gimple-range.cc (fur_source::get_operand): New.
23608 (gimple_range_fold): Delete.
23609 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
23610 (fold_using_range::range_of_range_op): Move from gimple_ranger.
23611 (fold_using_range::range_of_address): Ditto.
23612 (fold_using_range::range_of_phi): Ditto.
23613 (fold_using_range::range_of_call): Ditto.
23614 (fold_using_range::range_of_builtin_ubsan_call): Move from
23615 range_of_builtin_ubsan_call.
23616 (fold_using_range::range_of_builtin_call): Move from
23617 range_of_builtin_call.
23618 (gimple_ranger::range_of_builtin_call): Delete.
23619 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
23620 (gimple_ranger::fold_range_internal): New.
23621 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
23622 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
23623 gimple_ranger. Improve ranges of SSA_NAMES when possible.
23624 * gimple-range.h (gimple_ranger): Remove various range_of routines.
23625 (class fur_source): New.
23626 (class fold_using_range): New.
23627 (fur_source::fur_source): New.
23629 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
23630 instead of range_of_builtin_call.
23632 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
23634 * doc/cpp.texi (Common Predefined Macros): Update documentation
23635 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
23637 2021-05-19 Alex Coplan <alex.coplan@arm.com>
23640 * config/arm/arm.md (nonsecure_call_internal): Always ensure
23641 callee's address is in a register.
23643 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
23645 * common/config/riscv/riscv-common.c
23646 (riscv_subset_list::parsing_subset_version): Properly parse the letter
23648 (riscv_subset_list::parse_std_ext,
23649 riscv_subset_list::parse_multiletter_ext): To handle errors generated
23650 in riscv_subset_list::parsing_subset_version.
23652 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23654 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
23655 type attribute in patterns generating XTN(2).
23657 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23659 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
23660 Remove as duplicate of...
23661 (aarch64_xtn<mode>): This.
23662 (aarch64_xtn2<mode>_le): Move position in file.
23663 (aarch64_xtn2<mode>_be): Move position in file.
23664 (aarch64_xtn2<mode>): Move position in file.
23665 (vec_pack_trunc_<mode>): Define as an expander.
23667 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23669 * config/aarch64/aarch64-simd-builtins.def: Split builtin
23670 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
23671 separate scalar and vector generators.
23672 * config/aarch64/aarch64-simd.md
23673 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
23675 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
23676 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
23677 * config/aarch64/iterators.md: Define SD_HSDI iterator.
23679 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
23681 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
23683 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
23685 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23687 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
23688 Implement as an expand emitting a big/little endian
23689 instruction pattern.
23690 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
23691 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
23693 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
23695 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
23696 Implement as an expand emitting a big/little endian
23697 instruction pattern.
23698 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
23699 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
23700 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
23701 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
23703 2021-05-19 Richard Biener <rguenther@suse.de>
23705 PR middle-end/100672
23706 * fold-const.c (fold_negate_expr_1): Use element_precision.
23707 (negate_expr_p): Likewise.
23709 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
23711 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
23712 (pred_load): New int attribute.
23713 * config/aarch64/aarch64-sve.md
23714 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
23715 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
23716 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
23717 code_for_aarch64_load.
23719 2021-05-19 Richard Biener <rguenther@suse.de>
23721 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
23722 sure TARGET_MEM_REF bases are expanded as memory.
23723 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
23724 Do not mark TARGET_MEM_REF bases addressable.
23725 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
23726 TARGET_MEM_REF bases as never rewritable.
23727 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
23728 walk TARGET_MEM_REF bases as address-takens.
23729 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
23731 2021-05-19 Richard Biener <rguenther@suse.de>
23733 * builtins.c (get_object_alignment_1): Strip outer
23735 * tree-dfa.c (get_ref_base_and_extent): Handle outer
23736 WITH_SIZE_EXPR for size processing and process the
23738 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
23739 outer WITH_SIZE_EXPR.
23740 (ao_ref_base_alias_ptr_type): Likewise.
23741 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
23742 and handle that accordingly, stripping it for the
23743 core alias workers.
23744 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
23745 looking through it instead of returning NULL.
23747 2021-05-19 Jakub Jelinek <jakub@redhat.com>
23749 PR middle-end/100576
23750 * builtins.c (check_read_access): Convert bound to size_type_node if
23753 2021-05-19 Richard Biener <rguenther@suse.de>
23755 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
23756 (verify_types_in_gimple_reference): ... here. Sanitize.
23757 (verify_gimple_call): Verify references in LHS and arguments.
23758 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
23760 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
23762 * config/i386/i386.h (VALID_INT_MODE_P):
23763 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
23764 * config/i386/i386.md (isa): Add x64_bmi.
23765 (enabled): Handle x64_bmi.
23766 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
23767 Add alternative using 64bit general registers.
23768 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
23770 2021-05-19 Jakub Jelinek <jakub@redhat.com>
23772 PR middle-end/99928
23773 * tree.h (OMP_MASTER_COMBINED): Define.
23774 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
23775 handling for outer combined/composite constructs to a loop.
23776 Handle lastprivate on combined target.
23777 (gimplify_expr): Formatting fix.
23779 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
23781 * passes.def: Add sink_code pass before store_merging.
23782 * tree-ssa-sink.c (pass_sink_code:clone): New.
23784 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
23786 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
23787 rs6000_special_adjust_field_align_p.
23788 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
23789 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
23790 Remove ABI warning.
23791 (rs6000_function_arg): Likewise.
23792 * config/rs6000/rs6000-protos.h
23793 (rs6000_special_adjust_field_align_p): Remove prototype.
23794 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
23796 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
23797 rs6000_special_adjust_field_align_p.
23799 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
23802 * config/i386/i386.h (VALID_SSE2_REG_MODE):
23803 Add V4QI and V2HI modes.
23804 (VALID_INT_MODE_P): Ditto.
23805 * config/i386/mmx.md (VI_32): New mode iterator.
23806 (mmxvecsize): Handle V4QI and V2HI.
23808 (mov<VI_32:mode>): New expander.
23809 (*mov<mode>_internal): New insn pattern.
23810 (movmisalign<VI_32:mode>): New expander.
23811 (neg<VI_32:mode>): New expander.
23812 (<plusminus:insn><VI_32:mode>3): New expander.
23813 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
23814 (mulv2hi3): New expander.
23815 (*mulv2hi3): New insn pattern.
23816 (one_cmpl<VI_32:mode>2): New expander.
23817 (*andnot<VI_32:mode>3): New insn pattern.
23818 (<any_logic:code><VI_32:mode>3): New expander.
23819 (*<any_logic:code><VI_32:mode>3): New insn pattern.
23821 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
23823 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
23824 Fix a mode mismatch with operand 1.
23826 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
23829 * config/i386/i386-expand.c (split_double_mode): Return
23830 temporary register when simplify_gen_subreg fails with
23831 the high half od the paradoxical subreg.
23833 2021-05-18 Richard Biener <rguenther@suse.de>
23835 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
23836 and honor it when expanding.
23837 (expand_used_vars_for_block): Pass through forced_stack_var.
23838 (expand_used_vars): Likewise.
23839 (discover_nonconstant_array_refs_r): Set bits in
23840 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
23841 (avoid_type_punning_on_regs): Likewise.
23842 (discover_nonconstant_array_refs): Likewise.
23843 (pass_expand::execute): Create and pass down forced_stack_var
23844 bitmap. For parameters and returns temporarily set
23845 TREE_ADDRESSABLE when expand_function_start.
23847 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
23849 * doc/sourcebuild.texi: Document 'dg-note'.
23851 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
23854 * configure: Regenerate.
23855 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
23857 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
23859 * gimple.h (is_gimple_omp_oacc): Tighten.
23860 * omp-low.c (check_omp_nesting_restrictions): Adjust.
23862 2021-05-18 Richard Biener <rguenther@suse.de>
23864 * tree-ssa-operands.c (mark_address_taken): Simplify.
23866 2021-05-18 Martin Liska <mliska@suse.cz>
23868 * config/gcn/mkoffload.c (STR): Redefine.
23869 * config/i386/intelmic-mkoffload.c (STR): Likewise.
23870 * config/nvptx/mkoffload.c (STR): Likewise.
23872 2021-05-18 Martin Liska <mliska@suse.cz>
23874 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
23875 Use startswith function instead of strncmp.
23876 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
23877 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
23878 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
23879 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
23880 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
23881 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
23882 * config/arm/arm.c (arm_file_start): Likewise.
23883 (arm_valid_target_attribute_rec): Likewise.
23884 (thumb1_md_asm_adjust): Likewise.
23885 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
23886 * config/avr/avr.c (STR_PREFIX_P): Likewise.
23887 (avr_set_current_function): Likewise.
23888 (avr_handle_addr_attribute): Likewise.
23889 (avr_asm_output_aligned_decl_common): Likewise.
23890 (avr_asm_named_section): Likewise.
23891 (avr_section_type_flags): Likewise.
23892 (avr_asm_select_section): Likewise.
23893 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
23894 (c6x_section_type_flags): Likewise.
23895 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
23896 (darwin_objc_declare_unresolved_class_reference): Likewise.
23897 (darwin_objc_declare_class_definition): Likewise.
23898 * config/darwin.c (indirect_data): Likewise.
23899 (darwin_encode_section_info): Likewise.
23900 (darwin_objc2_section): Likewise.
23901 (darwin_objc1_section): Likewise.
23902 (machopic_select_section): Likewise.
23903 (darwin_globalize_label): Likewise.
23904 (darwin_label_is_anonymous_local_objc_name): Likewise.
23905 (darwin_asm_named_section): Likewise.
23906 (darwin_asm_output_dwarf_offset): Likewise.
23907 * config/frv/frv.c (frv_string_begins_with): Likewise.
23908 (frv_in_small_data_p): Likewise.
23909 * config/gcn/mkoffload.c (STR): Likewise.
23911 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
23912 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
23913 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
23914 (ix86_md_asm_adjust): Likewise.
23915 * config/i386/intelmic-mkoffload.c (STR): Likewise.
23916 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
23917 (i386_pe_file_end): Likewise.
23918 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
23919 (ia64_section_type_flags): Likewise.
23920 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
23921 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
23922 (mips16_stub_function_p): Likewise.
23923 (mips_function_rodata_section): Likewise.
23924 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
23925 (msp430_function_section): Likewise.
23926 (msp430_section_type_flags): Likewise.
23927 (msp430_expand_helper): Likewise.
23928 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
23929 (nios2_valid_target_attribute_rec): Likewise.
23930 * config/nvptx/mkoffload.c (process): Likewise.
23932 * config/pa/som.h: Likewise.
23933 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
23934 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
23935 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
23936 (rs6000_inner_target_options): Likewise.
23937 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
23938 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
23939 * config/vax/vax.c (vax_output_int_move): Likewise.
23940 * config/vms/vms-ld.c (startswith): Likewise.
23941 (process_args): Likewise.
23943 * config/vms/vms.c: Likewise.
23945 2021-05-18 Jakub Jelinek <jakub@redhat.com>
23947 PR rtl-optimization/100590
23948 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
23949 they are NONJUMP_INSN_P.
23951 2021-05-18 Jakub Jelinek <jakub@redhat.com>
23954 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
23955 DECL_ASSEMBLER_NAME on the fn_decl.
23957 2021-05-18 Jakub Jelinek <jakub@redhat.com>
23959 PR tree-optimization/94589
23960 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
23961 phi result used in (res & ~1) == 0 comparison as res >= 0 as
23962 res == 2 would be UB with -ffinite-math-only.
23964 2021-05-18 Martin Liska <mliska@suse.cz>
23966 * Makefile.in: genversion.o should depend on DATESTAMP.
23968 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
23970 * config/arc/simdext.md (negv2si2): Remove round bracket.
23972 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
23974 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
23975 _Bool as macro expanding to _Bool.
23977 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
23980 * tree.c (build_reference_type_for_mode)
23981 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
23983 (build_reference_type, build_pointer_type): Invoke
23984 build_*_type_for_mode with VOIDmode.
23986 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
23988 PR tree-optimization/100512
23989 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
23990 and non-zero pointer ranges as invariant.
23991 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
23992 processing from here.
23994 2021-05-17 Tom de Vries <tdevries@suse.de>
23997 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
23998 * config/nvptx/nvptx.c (nvptx_output_barrier)
23999 (nvptx_output_atomic_insn): New function.
24000 (nvptx_print_operand): Add support for 'B'.
24001 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
24004 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
24006 PR tree-optimization/100349
24007 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
24010 2021-05-17 Tamar Christina <tamar.christina@arm.com>
24012 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
24013 (host_detect_local_cpu): Use it.
24015 2021-05-17 Martin Liska <mliska@suse.cz>
24017 * doc/invoke.texi: Add 2 missing dots.
24019 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
24021 PR bootstrap/100552
24022 * configure.ac: Replace pattern substitution with call to sed.
24023 * configure: Regenerate.
24025 2021-05-17 Richard Biener <rguenther@suse.de>
24027 PR middle-end/100582
24028 * tree.c (array_at_struct_end_p): Get to the base of the
24029 reference before looking for the underlying decl.
24031 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
24033 * genoutput.c (validate_insn_alternatives) Make "wrong number of
24034 alternatives" message more specific, and remove assumption on where
24037 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24039 * config/arm/iterators.md (V16): New iterator.
24040 (VH_cvtto): New iterator.
24041 (v_cmp_result): Added V4HF and V8HF support.
24042 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
24043 (vcond<mode><mode>): Likewise.
24044 (vcond_mask_<mode><v_cmp_result>): Likewise.
24045 (vcond<VH_cvtto><mode>): New expander.
24047 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24049 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
24051 * config/arm/arm.c (arm_expand_vector_compare): Add support for
24053 (arm_expand_vcond): Likewise.
24054 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
24055 VCMPEQQ_N_S, VCMPNEQ_N_S.
24056 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
24057 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
24058 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
24059 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
24060 (@mve_vpselq_<supf><mode>): Likewise.
24061 (@mve_vpselq_f<mode>"): Likewise.
24062 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
24063 and move to vec-common.md.
24064 (vec_cmpu<mode><mode>): Likewise.
24065 (vcond<mode><mode>): Likewise.
24066 (vcond<V_cvtto><mode>): Likewise.
24067 (vcondu<mode><v_cmp_result>): Likewise.
24068 (vcond_mask_<mode><v_cmp_result>): Likewise.
24069 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
24070 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
24071 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
24072 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
24073 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
24074 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
24076 (vec_cmpu<mode><mode>): Likewise.
24077 (vcond<mode><mode>): Likewise.
24078 (vcond<V_cvtto><mode>): Likewise.
24079 (vcondu<mode><v_cmp_result>): Likewise.
24080 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
24083 2021-05-17 liuhongt <hongtao.liu@intel.com>
24086 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
24087 gsi_insert_seq_before instead.
24089 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
24091 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
24092 (arm_sat_ok): ...this.
24094 2021-05-17 Martin Liska <mliska@suse.cz>
24096 * lto-wrapper.c (merge_flto_options): Factor out a new function.
24097 (merge_and_complain): Use it.
24098 (run_gcc): Merge also linker command line -flto=foo argument
24101 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
24103 * config/arm/arm.h (CPP_SPEC): Remove error message about
24104 -mlittle-endian/-mbig-endian conflict.
24106 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24108 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
24109 __ROP_PROTECT__ if -mrop-protect is selected.
24111 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24113 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
24114 rop_hash_save_offset and rop_hash_size.
24115 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
24116 rop_hash_size and rop_hash_save_offset.
24117 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
24118 (rs6000_emit_prologue): Emit hashst[p] in prologue.
24119 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
24120 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
24122 (hashst): New define_insn.
24123 (hashchk): Likewise.
24125 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
24127 * config/rs6000/rs6000.c (rs6000_option_override_internal):
24128 Disable shrink wrap when inserting ROP-protect instructions.
24129 * config/rs6000/rs6000.opt (mrop-protect): New option.
24130 (mprivileged): Likewise.
24131 * doc/invoke.texi: Document mrop-protect and mprivileged.
24133 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
24135 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
24138 2021-05-15 Martin Jambor <mjambor@suse.cz>
24141 2021-05-13 Martin Jambor <mjambor@suse.cz>
24143 PR tree-optimization/100453
24144 * tree-sra.c (sra_modify_assign): All const base accesses do not
24145 need refreshing, not just those from decl_pool.
24146 (sra_modify_assign): Do not refresh into a const base decl.
24148 2021-05-15 Jakub Jelinek <jakub@redhat.com>
24150 PR rtl-optimization/100342
24151 * regcprop.c (copy_value): When copying a source reg in a wider
24152 mode than it has recorded for the value, adjust recorded destination
24153 mode too or punt if !REG_CAN_CHANGE_MODE_P.
24155 2021-05-14 Jason Merrill <jason@redhat.com>
24157 * intl.h: Add comments.
24159 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24161 * config/aarch64/aarch64-simd.md
24162 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
24163 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
24164 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
24165 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
24166 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
24167 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
24168 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
24169 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
24170 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
24172 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
24175 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
24176 boolean logic equivalent.
24177 (vtst_s16): Likewise.
24178 (vtst_s32): Likewise.
24179 (vtst_u8): Likewise.
24180 (vtst_u16): Likewise.
24181 (vtst_u32): Likewise.
24182 (vtst_p8): Likewise.
24183 (vtst_p16): Likewise.
24184 (vtstq_s8): Likewise.
24185 (vtstq_s16): Likewise.
24186 (vtstq_s32): Likewise.
24187 (vtstq_u8): Likewise.
24188 (vtstq_u16): Likewise.
24189 (vtstq_u32): Likewise.
24190 (vtstq_p8): Likewise.
24191 (vtstq_p16): Likewise.
24192 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
24193 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
24195 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24197 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
24198 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
24199 (aarch64_sqdmlsl2<mode>): Delete.
24200 (aarch64_sqdmlal2_lane<mode>): Merge this...
24201 (aarch64_sqdmlsl2_lane<mode>): ... And this...
24202 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
24203 (aarch64_sqdmlal2_laneq<mode>): Merge this...
24204 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
24205 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
24206 (aarch64_sqdmlal2_n<mode>): Merge this...
24207 (aarch64_sqdmlsl2_n<mode>): ... And this...
24208 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
24210 2021-05-13 Martin Sebor <msebor@redhat.com>
24212 PR middle-end/100574
24213 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
24214 all null arguments.
24216 2021-05-13 Martin Sebor <msebor@redhat.com>
24218 PR tree-optimization/93100
24219 PR middle-end/98583
24220 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
24221 don't modify referenced objects.
24223 2021-05-13 Martin Jambor <mjambor@suse.cz>
24225 PR tree-optimization/100453
24226 * tree-sra.c (sra_modify_assign): All const base accesses do not
24227 need refreshing, not just those from decl_pool.
24228 (sra_modify_assign): Do not refresh into a const base decl.
24230 2021-05-13 Martin Liska <mliska@suse.cz>
24232 * tree-ssa-dom.c: Remove m_simplifier.
24234 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
24237 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
24238 canonicalize DImode inequality comparisons against the
24239 maximum integral value.
24241 2021-05-13 Jakub Jelinek <jakub@redhat.com>
24243 PR tree-optimization/98856
24244 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
24245 Expect V2DI and V4DI arithmetic right shifts to be emulated.
24246 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
24248 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
24249 expand_vec_perm_2perm_pblendv): New functions.
24250 (ix86_expand_vec_perm_const_1): Use them.
24251 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
24252 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
24253 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
24254 (ashrv4di3): New define_expand.
24255 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
24256 and !TARGET_AVX512VL expansion.
24258 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
24261 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
24262 sizes < 16 to a register when constructing vpcmov pattern.
24263 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
24265 2021-05-13 Martin Liska <mliska@suse.cz>
24267 * gcov-io.c (gcov_write_block): Remove.
24268 (gcov_write_words): Likewise.
24269 (gcov_read_words): Re-implement using gcov_read_bytes.
24270 (gcov_allocate): Remove.
24271 (GCOV_BLOCK_SIZE): Likewise.
24272 (struct gcov_var): Remove most of the fields.
24273 (gcov_position): Implement with ftell.
24274 (gcov_rewrite): Remove setting of start and offset fields.
24275 (from_file): Re-format.
24276 (gcov_open): Remove setbuf call. It should not be needed.
24277 (gcov_close): Remove internal buffer handling.
24278 (gcov_magic): Use __builtin_bswap32.
24279 (gcov_write_counter): Use directly gcov_write_unsigned.
24280 (gcov_write_string): Use direct fwrite and do not round
24282 (gcov_seek): Use directly fseek.
24283 (gcov_write_tag): Use gcov_write_unsigned directly.
24284 (gcov_write_length): Likewise.
24285 (gcov_write_tag_length): Likewise.
24286 (gcov_read_bytes): Use directly fread.
24287 (gcov_read_unsigned): Use gcov_read_words.
24288 (gcov_read_counter): Likewise.
24289 (gcov_read_string): Use gcov_read_bytes.
24290 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
24291 that size is not in bytes, but words (4B).
24292 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
24293 (GCOV_TAG_ARCS_LENGTH): Likewise.
24294 (GCOV_TAG_ARCS_NUM): Likewise.
24295 (GCOV_TAG_COUNTER_LENGTH): Likewise.
24296 (GCOV_TAG_COUNTER_NUM): Likewise.
24297 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
24299 2021-05-13 liuhongt <hongtao.liu@intel.com>
24302 * config/i386/sse.md (ssedoublevecmode): Add attribute for
24303 V64QI/V32HI/V16SI/V4DI.
24304 (ssehalfvecmode): Add attribute for V2DI/V2DF.
24305 (*vec_concatv4si_0): Extend to VI124_128.
24306 (*vec_concat<mode>_0): New pre-reload splitter.
24307 * config/i386/predicates.md (movq_parallel): New predicate.
24309 2021-05-13 Alexandre Oliva <oliva@adacore.com>
24311 * targhooks.c (default_zero_call_used_regs): Retry using
24312 successfully-zeroed registers as sources.
24314 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
24316 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
24318 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
24321 * gimple-range.cc (range_of_builtin_call): Skip out on
24322 processing __builtin_clz when varying.
24324 2021-05-12 Tom de Vries <tdevries@suse.de>
24327 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
24328 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
24329 to ptx_version_option.
24330 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
24331 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
24332 (define_insn "nvptx_vote_ballot"): Use sync variant for
24334 * config/nvptx/nvptx.opt (ptx_version): Add enum.
24335 (mptx): Add option.
24336 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
24338 2021-05-12 Richard Biener <rguenther@suse.de>
24340 PR tree-optimization/100566
24341 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
24342 allow_back for all edge queries.
24344 2021-05-12 liuhongt <hongtao.liu@intel.com>
24347 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
24348 splitters for pblendvb of NOT mask register.
24350 2021-05-12 Richard Biener <rguenther@suse.de>
24352 PR tree-optimization/100519
24353 * tree-ssa-reassoc.c (can_associate_p): Split into...
24354 (can_associate_op_p): ... this
24355 (can_associate_type_p): ... and this.
24356 (is_reassociable_op): Call can_associate_op_p.
24357 (break_up_subtract_bb): Call the appropriate predicates.
24358 (reassociate_bb): Likewise.
24360 2021-05-12 Martin Liska <mliska@suse.cz>
24362 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
24363 (run_gcc): Use -flto argument detection for merged
24366 2021-05-12 Martin Liska <mliska@suse.cz>
24368 * lto-wrapper.c (print_lto_docs_link): New function.
24369 (run_gcc): Print warning about missing job server detection
24370 after we know NR of partitions. Do the same for -flto{,=1}.
24371 * opts.c (get_option_html_page): Support -flto option.
24373 2021-05-12 Martin Liska <mliska@suse.cz>
24375 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
24377 (append_option): Remove.
24378 (find_option): Rework to use the vector type.
24379 (remove_option): Remove.
24380 (merge_and_complain): Use vectors for cl_decoded_option data
24382 (append_compiler_options): Likewise.
24383 (append_diag_options): Likewise.
24384 (append_linker_options): Likewise.
24385 (append_offload_options): Likewise.
24386 (compile_offload_image): Likewise.
24387 (compile_images_for_offload_targets): Likewise.
24388 (find_and_merge_options): Likewise.
24389 (run_gcc): Likewise.
24391 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
24394 * dwarf2out.c (dwarf2out_finish): Set
24395 have_multiple_function_sections with multi-range text_section.
24397 2021-05-12 Martin Liska <mliska@suse.cz>
24399 PR bootstrap/100560
24400 * Makefile.in: Remove version.h from linker command line.
24402 2021-05-12 Richard Biener <rguenther@suse.de>
24404 PR middle-end/100547
24405 * rtl.h (rtvec_alloc): Make argument size_t.
24406 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
24408 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24410 PR middle-end/100508
24411 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
24412 type, don't reuse DECL_RTL if it has different mode, instead force
24413 creation of a new DEBUG_EXPR.
24415 2021-05-12 Jakub Jelinek <jakub@redhat.com>
24416 Marc Glisse <marc.glisse@inria.fr>
24418 PR tree-optimization/94589
24419 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
24420 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
24422 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
24425 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
24426 * config/i386/mmx.md (MMXMODE124): New mode iterator.
24428 (mmxintvecmode): New mode attribute.
24429 (mmxintvecmodelower): Ditto.
24430 (*mmx_maskcmpv2sf3_comm): New insn pattern.
24431 (*mmx_maskcmpv2sf3): Ditto.
24432 (vec_cmpv2sfv2si): New expander.
24433 (vcond<V2FI:mode>v2si): Ditto.
24434 (mmx_vlendvps): New insn pattern.
24435 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
24436 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
24437 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
24439 2021-05-11 Martin Sebor <msebor@redhat.com>
24441 PR middle-end/21433
24442 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
24444 2021-05-11 Richard Biener <rguenther@suse.de>
24446 * gimple-fold.c (gimple_fold_call): Do not call
24447 maybe_fold_reference on call arguments or the static chain.
24448 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
24451 2021-05-11 Martin Liska <mliska@suse.cz>
24453 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
24454 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
24455 (DEF_HSAIL_SAT_BUILTIN): Likewise.
24456 (DEF_HSAIL_INTR_BUILTIN): Likewise.
24457 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
24458 * doc/frontends.texi: Remove BRIG.
24459 * doc/install.texi: Likewise.
24460 * doc/invoke.texi: Likewise.
24461 * doc/standards.texi: Likewise.
24462 * brig-builtins.def: Removed.
24463 * brig/ChangeLog: Removed.
24464 * brig/Make-lang.in: Removed.
24465 * brig/brig-builtins.h: Removed.
24466 * brig/brig-c.h: Removed.
24467 * brig/brig-lang.c: Removed.
24468 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
24469 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
24470 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
24471 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
24472 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
24473 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
24474 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
24475 * brig/brigfrontend/brig-comment-handler.cc: Removed.
24476 * brig/brigfrontend/brig-control-handler.cc: Removed.
24477 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
24478 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
24479 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
24480 * brig/brigfrontend/brig-function-handler.cc: Removed.
24481 * brig/brigfrontend/brig-function.cc: Removed.
24482 * brig/brigfrontend/brig-function.h: Removed.
24483 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
24484 * brig/brigfrontend/brig-label-handler.cc: Removed.
24485 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
24486 * brig/brigfrontend/brig-machine.c: Removed.
24487 * brig/brigfrontend/brig-machine.h: Removed.
24488 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
24489 * brig/brigfrontend/brig-module-handler.cc: Removed.
24490 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
24491 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
24492 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
24493 * brig/brigfrontend/brig-to-generic.cc: Removed.
24494 * brig/brigfrontend/brig-to-generic.h: Removed.
24495 * brig/brigfrontend/brig-util.cc: Removed.
24496 * brig/brigfrontend/brig-util.h: Removed.
24497 * brig/brigfrontend/brig-variable-handler.cc: Removed.
24498 * brig/brigfrontend/hsa-brig-format.h: Removed.
24499 * brig/brigfrontend/phsa.h: Removed.
24500 * brig/brigspec.c: Removed.
24501 * brig/config-lang.in: Removed.
24502 * brig/gccbrig.texi: Removed.
24503 * brig/lang-specs.h: Removed.
24504 * brig/lang.opt: Removed.
24506 2021-05-11 Richard Biener <rguenther@suse.de>
24509 * ipa-param-manipulation.c
24510 (ipa_param_body_adjustments::modify_call_stmt): Avoid
24511 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
24512 via gimple_call_lhs_ptr.
24514 2021-05-11 Alex Coplan <alex.coplan@arm.com>
24517 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
24518 Avoid emitting CFA adjusts on the sp if we have the fp.
24520 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
24522 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
24523 (VMULD): New iterator.
24524 (VCOND): Handle V4HF and V8HF.
24525 (VCONQ): Fix entry for V2SF.
24526 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
24527 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
24528 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
24529 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
24530 Use a 128-bit vector mode for the indexed operand. Use stype for
24531 the scheduling type.
24533 2021-05-11 Richard Biener <rguenther@suse.de>
24535 * gimple-fold.c (maybe_fold_reference): Only return
24536 is_gimple_min_invariant values.
24538 2021-05-11 Richard Biener <rguenther@suse.de>
24540 PR middle-end/100509
24541 * gimple-fold.c (fold_gimple_assign): Only call
24542 get_symbol_constant_value on register type symbols.
24544 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
24545 Joe Ramsay <joe.ramsay@arm.com>
24548 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
24549 (__arm_vcmpneq): Remove duplicate definition.
24550 (__arm_vstrwq_scatter_offset_p): Likewise.
24551 (__arm_vmaxq_x): Likewise.
24552 (__arm_vmlsdavaq): Likewise.
24553 (__arm_vmlsdavaxq): Likewise.
24554 (__arm_vmlsdavq_p): Likewise.
24555 (__arm_vmlsdavxq_p): Likewise.
24556 (__arm_vrmlaldavhaq): Likewise.
24557 (__arm_vstrbq_p): Likewise.
24558 (__arm_vstrbq_scatter_offset): Likewise.
24559 (__arm_vstrbq_scatter_offset_p): Likewise.
24560 (__arm_vstrdq_scatter_offset): Likewise.
24561 (__arm_vstrdq_scatter_offset_p): Likewise.
24562 (__arm_vstrdq_scatter_shifted_offset): Likewise.
24563 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
24565 2021-05-11 Jakub Jelinek <jakub@redhat.com>
24567 PR middle-end/100471
24568 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
24569 is 0, bypass the reduction loop including
24570 GOMP_taskgroup_reduction_unregister call.
24572 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24574 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
24575 costing_for_scalar.
24576 (rs6000_density_test): Early return if costing_for_scalar is true.
24577 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
24579 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24581 * doc/tm.texi: Regenerated.
24582 * target.def (init_cost): Add new parameter costing_for_scalar.
24583 * targhooks.c (default_init_cost): Adjust for new parameter.
24584 * targhooks.h (default_init_cost): Likewise.
24585 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
24586 (vect_compute_single_scalar_iteration_cost): Likewise.
24587 (vect_analyze_loop_2): Likewise.
24588 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
24589 (vect_bb_vectorization_profitable_p): Likewise.
24590 * tree-vectorizer.h (init_cost): Likewise.
24591 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
24592 * config/i386/i386.c (ix86_init_cost): Likewise.
24593 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
24595 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
24597 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
24598 vect_nonmem and moved into...
24599 (struct rs6000_cost_data): ...here.
24600 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
24601 (rs6000_add_stmt_cost): Likewise.
24602 (rs6000_finish_cost): Likewise.
24604 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
24606 * range-op.cc (get_bool_state): Adjust head comment.
24607 (operator_not_equal::op1_range): Fix comment.
24608 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
24610 2021-05-10 Martin Sebor <msebor@redhat.com>
24612 PR middle-end/100425
24613 PR middle-end/100510
24614 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
24615 (pass_walloca::xlimit_certain_p): ...to this.
24616 (pass_walloca::gate): Execute for any kind of handled warning.
24617 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
24618 warnings when xlimit_certain_p is set.
24620 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
24622 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
24623 Return ALTIVEC_REGS if that is best_class.
24624 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
24626 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24628 * config/arm/arm.h (CPP_SPEC): Remove error message about
24631 2021-05-10 Martin Jambor <mjambor@suse.cz>
24633 * ipa-prop.h (IPA_NODE_REF): Removed.
24634 (IPA_NODE_REF_GET_CREATE): Likewise.
24635 (IPA_EDGE_REF): Likewise.
24636 (IPA_EDGE_REF_GET_CREATE): Likewise.
24637 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
24638 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
24639 use of ipa_node_params_sum.
24640 (ipcp_versionable_function_p): Likewise.
24641 (push_node_to_stack): Likewise.
24642 (pop_node_from_stack): Likewise.
24643 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
24644 direct use of ipa_node_params_sum.
24645 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
24646 ipa_node_params_sum.
24647 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
24649 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
24650 use of ipa_node_params_sum.
24651 (self_recursively_generated_p): Likewise.
24652 (propagate_scalar_across_jump_function): Likewise.
24653 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
24654 direct use of ipa_edge_args_sum, moved the lookup after the early
24655 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
24656 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
24657 direct uses of ipa_node_params_sum.
24658 (propagate_vr_across_jump_function): Likewise.
24659 (propagate_aggregate_lattice): Likewise.
24660 (propagate_aggs_across_jump_function): Likewise.
24661 (propagate_constants_across_call): Likewise, also replaced
24662 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
24663 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
24664 of ipa_node_params_sum.
24665 (estimate_local_effects): Likewise.
24666 (add_all_node_vals_to_toposort): Likewise.
24667 (propagate_constants_topo): Likewise.
24668 (ipcp_propagate_stage): Likewise.
24669 (ipcp_discover_new_direct_edges): Likewise.
24670 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
24671 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
24672 (get_info_about_necessary_edges): Likewise.
24673 (want_remove_some_param_p): Likewise.
24674 (create_specialized_node): Likewise.
24675 (self_recursive_pass_through_p): Likewise.
24676 (self_recursive_agg_pass_through_p): Likewise.
24677 (find_more_scalar_values_for_callers_subset): Likewise and also
24678 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
24679 case replacing two of those with a single query.
24680 (find_more_contexts_for_caller_subset): Likewise for the
24681 ipa_polymorphic_call_context overload.
24682 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
24683 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
24684 ipa_node_params_sum.
24685 (find_aggregate_values_for_callers_subset): Likewise, also reusing
24686 results of ipa_edge_args_sum->get.
24687 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
24688 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
24689 direct use of ipa_edge_args_sum.
24690 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
24691 summary query after the early exit and reused the result later.
24692 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
24693 ipa_node_params_sum.
24694 (decide_whether_version_node): Likewise. Removed re-querying for
24695 summaries after cloning.
24696 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
24697 ipa_node_params_sum.
24698 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
24700 (identify_dead_nodes): Likewise.
24701 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
24702 ipa_node_params_sum.
24703 (ipcp_store_vr_results): Likewise.
24704 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
24705 (ipa_fn_summary_t::duplicate): Likewise.
24706 (analyze_function_body): Likewise.
24707 (estimate_calls_size_and_time): Likewise.
24708 (ipa_cached_call_context::duplicate_from): Likewise.
24709 (ipa_call_context::equal_to): Likewise.
24710 (remap_edge_params): Likewise.
24711 (ipa_merge_fn_summary_after_inlining): Likewise.
24712 (inline_read_section): Likewise.
24713 * ipa-icf.c (sem_function::param_used_p): Likewise.
24714 * ipa-modref.c (compute_parm_map): Likewise.
24715 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
24717 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
24718 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
24720 * ipa-profile.c (check_argument_count): Likewise.
24721 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
24722 with a direct use of ipa_node_params_sum.
24723 (ipa_initialize_node_params): Likewise.
24724 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
24725 direct use of ipa_edge_args_sum and reused the query result.
24726 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
24727 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
24728 direct use of ipa_edge_args_sum.
24729 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
24730 ipa_node_params_sum and reused the result of the query.
24731 (ipa_analyze_node): Likewise.
24732 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
24733 of ipa_node_params_sum.
24734 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
24735 direct uses of ipa_edge_args_sum.
24736 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
24737 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
24738 direct use of ipa_edge_args_sum. Removed superficial re-querying the
24740 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
24741 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
24743 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
24744 direct use of ipa_edge_args_sum.
24745 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
24746 use of ipa_node_params_sum.
24747 (ipa_print_node_params): Likewise.
24748 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
24749 direct uses of ipa_edge_args_sum.
24750 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
24752 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
24753 ipa_node_params_sum.
24754 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
24755 scopes where it is used.
24757 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
24759 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
24760 <case E_V2SImode>: Force op_true to register.
24762 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24764 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
24765 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
24766 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
24768 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
24769 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
24770 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
24771 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
24772 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
24773 (mve_vcmpneq_n_f<mode>): Remove.
24774 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
24775 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
24776 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
24778 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24780 * config/arm/iterators.md (MVE_COMPARISONS): New.
24782 (mve_cmp_type): New.
24783 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
24785 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
24786 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
24787 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
24788 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
24789 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
24790 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
24791 (mve_vcmpneq_n_<mode>): Remove.
24793 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24795 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
24796 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
24797 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
24800 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24802 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
24803 (vcmpneq_n_u): Likewise.
24804 (vcmpeqq_u,): Likewise.
24805 (vcmpeqq_n_u): Likewise.
24806 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
24807 VCMPEQQ_N_U and VCMPNEQ_N_U.
24808 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
24809 (mve_vcmpeqq_n): Likewise.
24810 (mve_vcmpeqq): Likewise.
24811 (mve_vcmpneq_n): Likewise.
24813 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
24815 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
24816 the 's' version of the builtin.
24818 2021-05-10 Richard Biener <rguenther@suse.de>
24820 PR tree-optimization/100492
24821 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
24822 Find nothing when the loop contains an irreducible region.
24824 2021-05-10 Richard Biener <rguenther@suse.de>
24826 PR middle-end/100464
24828 * gimple-fold.c (canonicalize_constructor_val): Do not set
24831 2021-05-10 Richard Biener <rguenther@suse.de>
24833 PR tree-optimization/100434
24834 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
24836 (dse_optimize_stmt): Handle call LHS by dropping the
24837 LHS or the whole call if it doesn't have other
24839 (pass_dse::execute): Adjust.
24841 2021-05-10 Martin Liska <mliska@suse.cz>
24843 * Makefile.in: Add missing genversion rule.
24845 2021-05-10 Alex Coplan <alex.coplan@arm.com>
24848 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
24849 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
24851 2021-05-10 Martin Liska <mliska@suse.cz>
24853 * builtins.c (is_builtin_name): Use startswith
24854 function instead of strncmp.
24855 * collect2.c (main): Likewise.
24856 (has_lto_section): Likewise.
24857 (scan_libraries): Likewise.
24858 * coverage.c (coverage_checksum_string): Likewise.
24859 (coverage_init): Likewise.
24860 * dwarf2out.c (is_cxx): Likewise.
24861 (gen_compile_unit_die): Likewise.
24862 * gcc-ar.c (main): Likewise.
24863 * gcc.c (init_spec): Likewise.
24864 (read_specs): Likewise.
24865 (execute): Likewise.
24866 (check_live_switch): Likewise.
24867 * genattrtab.c (write_attr_case): Likewise.
24868 (IS_ATTR_GROUP): Likewise.
24869 * gencfn-macros.c (main): Likewise.
24870 * gengtype.c (type_for_name): Likewise.
24871 (gen_rtx_next): Likewise.
24872 (get_file_langdir): Likewise.
24873 (write_local): Likewise.
24874 * genmatch.c (get_operator): Likewise.
24875 (get_operand_type): Likewise.
24876 (expr::gen_transform): Likewise.
24877 * genoutput.c (validate_optab_operands): Likewise.
24878 * incpath.c (add_sysroot_to_chain): Likewise.
24879 * langhooks.c (lang_GNU_C): Likewise.
24880 (lang_GNU_CXX): Likewise.
24881 (lang_GNU_Fortran): Likewise.
24882 (lang_GNU_OBJC): Likewise.
24883 * lto-wrapper.c (run_gcc): Likewise.
24884 * omp-general.c (omp_max_simt_vf): Likewise.
24885 * omp-low.c (omp_runtime_api_call): Likewise.
24886 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
24887 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
24888 * real.c (real_from_string): Likewise.
24889 * selftest.c (assert_str_startswith): Likewise.
24890 * timevar.c (timer::validate_phases): Likewise.
24891 * tree.c (get_file_function_name): Likewise.
24892 * ubsan.c (ubsan_use_new_style_p): Likewise.
24893 * varasm.c (default_function_rodata_section): Likewise.
24894 (incorporeal_function_p): Likewise.
24895 (default_section_type_flags): Likewise.
24896 * system.h (startswith): Define startswith.
24898 2021-05-10 Martin Liska <mliska@suse.cz>
24900 * bitmap.h (class auto_bitmap): Remove
24901 __cplusplus >= 201103.
24902 * config/aarch64/aarch64.c: Likewise.
24903 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
24905 * sbitmap.h: Likewise.
24907 2021-05-10 Martin Liska <mliska@suse.cz>
24909 * Makefile.in: Rename gcov-iov to genversion and depend
24910 on version.h (instead of gcov-iov.h).
24911 * gcov-io.h: Include version.h instread of gcov-iov.h.
24912 * gengtype-state.c (read_state_version): Likewise.
24913 * gcov-iov.c: Moved to...
24914 * genversion.c: ...here.
24915 * lto-streamer.h (LTO_major_version): Define it with
24917 * version.c: Removed.
24918 * version.h: Removed.
24920 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24922 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
24923 * config/arc/simdext.md (VCT): Add predicates for iterator
24926 (voptab): Likewise.
24927 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
24928 (<voptab>v2si3): New patterns.
24930 (reduc_plus_scal_v4hi): Likewise.
24931 (reduc_plus_scal_v2si): Likewise.
24932 (vec_duplicatev2si): Likewise.
24933 (vec_duplicatev4hi): Likewise.
24935 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24937 * config/arc/simdext.md: Format and cleanup file.
24939 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24941 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
24942 only when munaligned-access option is on.
24943 (movmisalign<mode>): Likewise.
24945 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24947 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
24949 * config/arc/arc.c (arc_reorg): Remove underscore from string.
24951 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24953 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
24954 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
24955 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
24956 (norm_f): Likewise.
24959 (clzsi2): Use fls instruction when available.
24960 (arc_clzsi2): Likewise.
24962 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24964 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
24966 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
24968 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
24970 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
24972 PR middle-end/100467
24973 * toplev.c (compile_file): Call insn_locations_init before
24974 targetm.asm_out.code_end.
24976 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
24979 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
24981 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
24983 2021-05-07 Jakub Jelinek <jakub@redhat.com>
24984 Andrew Stubbs <amd@codesourcery.com>
24987 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
24988 emit_move_insn operands.
24990 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
24992 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
24993 location for the outgoing edges of an empty block.
24994 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
24995 type by means of the get_array_descr_info langhook, if it is set and
24996 returns true. Remove obsolete code dealing with unnamed subtypes.
24998 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25000 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
25001 (sbr_vector): Renamed from ssa_block_cache.
25002 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
25003 (ssa_block_ranges::~ssa_block_ranges): Remove.
25004 (sbr_vector::set_bb_range): Use varying and undefined cached values.
25005 (ssa_block_ranges::set_bb_varying): Remove.
25006 (sbr_vector::get_bb_range): Adjust assert.
25007 (sbr_vector::bb_range_p): Adjust assert.
25008 (~block_range_cache): No freeing loop required.
25009 (block_range_cache::get_block_ranges): Remove.
25010 (block_range_cache::set_bb_range): Inline get_block_ranges.
25011 (block_range_cache::set_bb_varying): Remove.
25012 * gimple-range-cache.h (set_bb_varying): Remove prototype.
25013 * value-range.h (irange_allocator::get_memory): New.
25015 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25017 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
25018 dominator tree is available and requested.
25019 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
25020 (ranger_cache::fill_block_cache): Don't search dom tree here either.
25021 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
25023 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25025 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
25026 only PHI nodes better.
25028 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25030 * gimple-range-edge.h (gimple_outgoing_range): Rename from
25032 (gcond_edge_range): Export prototype.
25033 * gimple-range-edge.cc (gcond_edge_range): New.
25034 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
25035 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
25037 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25039 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
25040 default range into a temp and allocate only what is needed.
25042 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25044 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
25046 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
25048 * gimple-range.h (gimple_range_global): Pick up parameter initial
25049 values, and use-before defined locals are UNDEFINED.
25051 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
25053 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
25055 * tree.h (reverse_storage_order_for_component_p): Return false if
25056 the type is a pointer.
25058 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
25060 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
25062 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
25065 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
25066 Handle V8QI, V4HI and V2SI modes.
25067 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
25068 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
25069 * config/i386/i386.md (unspec): ... here.
25071 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
25072 Tom de Vries <tdevries@suse.de>
25074 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
25075 a truth_value_p reduction variable is nonintegral.
25077 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
25080 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
25081 Return false for mode sizes < 16.
25083 2021-05-07 Jakub Jelinek <jakub@redhat.com>
25086 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
25088 2021-05-06 Martin Jambor <mjambor@suse.cz>
25090 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
25091 when there is no function summary.
25092 (ipa_sra_summarize_function): produce edge summaries even when
25095 2021-05-06 Tom Tromey <tom@tromey.com>
25097 * godump.c (string_hash_eq): Remove.
25098 (go_finish): Use htab_eq_string.
25100 2021-05-06 Tom Tromey <tom@tromey.com>
25102 * gengtype-state.c (read_state): Use htab_eq_string.
25103 (string_eq): Remove.
25105 2021-05-06 Tom Tromey <tom@tromey.com>
25107 * gensupport.c (htab_eq_string): Remove.
25109 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
25112 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
25113 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
25114 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
25115 set_ignored_loc callbacks.
25116 * debug.c (do_nothing_debug_hooks): Likewise.
25117 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
25118 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
25119 (in_text_section_p, last_text_label, last_cold_label,
25120 switch_text_ranges, switch_cold_ranges): New data items.
25121 (dwarf2out_note_section_used): Remove.
25122 (dwarf2out_begin_prologue): Set fde->ignored_debug and
25124 (mark_ignored_debug_section): New helper function.
25125 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
25126 mark_ignored_debug_section.
25127 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
25128 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
25129 (size_of_aranges): Adjust formula for multi-part text ranges size.
25130 (output_aranges): Output multi-part text ranges.
25131 (dwarf2out_set_ignored_loc): New callback function.
25132 (dwarf2out_finish): Output multi-part text ranges.
25133 (dwarf2out_c_finalize): Clear new data items.
25134 * final.c (final_start_function_1): Call set_ignored_loc callback.
25135 (final_scan_insn_1): Likewise.
25136 * ggc-page.c (gt_ggc_mx): New helper function.
25137 * stringpool.c (gt_pch_nx): Likewise.
25139 2021-05-06 Richard Biener <rguenther@suse.de>
25141 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
25142 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
25143 (TV_TREE_INTO_SSA): New.
25144 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
25145 (rewrite_blocks): Likewise.
25146 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
25148 2021-05-06 Jakub Jelinek <jakub@redhat.com>
25150 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
25151 abs_replacement, xor_replacement,
25152 cond_removal_in_popcount_clz_ctz_pattern,
25153 replace_phi_edge_with_variable): Change type of phi argument from
25154 gimple * to gphi *.
25156 2021-05-06 Richard Biener <rguenther@suse.de>
25158 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
25159 Output an opt-info message.
25160 (do_split_loop_on_cond): Likewise.
25161 (tree_ssa_split_loops): Update SSA form here.
25163 2021-05-06 Richard Biener <rguenther@suse.de>
25165 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
25166 return variable removal.
25168 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
25170 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
25171 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
25172 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
25174 * config/s390/s390.c (s390_const_operand_ok): Remove unused
25177 2021-05-06 Jakub Jelinek <jakub@redhat.com>
25179 PR tree-optimization/94589
25180 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
25181 spaceship_replacement.
25182 (cond_only_block_p, spaceship_replacement): New functions.
25184 2021-05-06 Richard Biener <rguenther@suse.de>
25187 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
25188 dealing with a debug use and only query existing addresses
25190 (lower_emutls_1): Avoid splitting out addresses for debug
25191 stmts, reset the debug stmt when we fail to find existing
25193 (lower_emutls_phi_arg): Set wi.stmt.
25195 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
25198 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
25199 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
25200 (stack_protect_test): Use cbranch helper.
25202 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25205 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
25206 always return the establisher frame for __builtin_frame_address (0).
25208 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
25211 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
25212 (ix86_cpu_model_var): Likewise.
25213 (ix86_cpu_features2_type_node): Likewise.
25214 (ix86_cpu_features2_var): Likewise.
25215 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
25218 2021-05-05 Martin Sebor <msebor@redhat.com>
25220 * passes.def (pass_warn_printf): Run after SSA.
25222 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25224 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
25225 * config/arm/predicates.md (minus_one_operand): New predicate.
25227 2021-05-05 Jeff Law <jlaw@tachyum.com>
25229 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
25231 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
25233 PR rtl-optimization/100263
25234 * postreload.c (move2add_valid_value_p): Ensure register can
25237 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25239 PR rtl-optimization/100411
25240 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
25241 and beginning of function markers.
25243 2021-05-05 Jeff Law <jlaw@tachyum.com>
25245 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
25246 * config/cr16/cr16.c (notice_update_cc): Remove.
25247 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
25249 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
25252 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
25253 Handle V8QI, V4HI and V2SI modes.
25254 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
25255 (ix86_build_signbit_mask): Ditto.
25256 * config/i386/mmx.md (MMXMODE14): New mode iterator.
25257 (<smaxmin:code><MMXMODE14:mode>3): New expander.
25258 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
25259 (<umaxmin:code><MMXMODE24:mode>3): New expander.
25260 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
25261 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
25262 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25263 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25264 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25265 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
25267 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
25269 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
25270 not expand the VALUE_EXPR of variables put in the non-local frame.
25271 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
25272 to be ignored for debug info, ensure its variable offsets are not.
25274 2021-05-05 Richard Biener <rguenther@suse.de>
25276 PR tree-optimization/79333
25277 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
25278 Fold stmt following SSA edges.
25280 2021-05-05 Richard Biener <rguenther@suse.de>
25282 PR middle-end/100394
25283 * calls.c (expand_call): Preserve possibly throwing calls.
25284 * cfgexpand.c (expand_call_stmt): When a call can throw signal
25285 RTL expansion there are side-effects.
25286 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
25287 mark all possibly throwing stmts necessary unless we can elide
25289 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
25290 -fdelete-dead-exceptions.
25291 * tree.h (DECL_PURE_P): Add note about exceptions.
25293 2021-05-05 Alexandre Oliva <oliva@adacore.com>
25295 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
25298 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
25300 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
25301 get_fnname_from_decl for name of thunk.
25302 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
25303 and ASM_OUTPUT_LABEL.
25304 (rs6000_xcoff_declare_function_name): Use assemble_name and
25306 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
25307 (rs6000_xcoff_encode_section_info): Don't add mapping class
25308 for aliases. Always add [DS] mapping class to primary
25310 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
25312 2021-05-04 Martin Sebor <msebor@redhat.com>
25314 PR middle-end/100307
25315 * builtins.c (compute_objsize_r): Clear base0 for pointers.
25317 2021-05-04 Jeff Law <jlaw@tachyum.com>
25319 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
25321 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
25323 * caller-save.c: Remove CC0.
25324 * cfgcleanup.c: Remove CC0.
25325 * cfgrtl.c: Remove CC0.
25326 * combine.c: Remove CC0.
25327 * compare-elim.c: Remove CC0.
25328 * conditions.h: Remove CC0.
25329 * config/h8300/h8300.h: Remove CC0.
25330 * config/h8300/h8300-protos.h: Remove CC0.
25331 * config/h8300/peepholes.md: Remove CC0.
25332 * config/i386/x86-tune-sched.c: Remove CC0.
25333 * config/m68k/m68k.c: Remove CC0.
25334 * config/rl78/rl78.c: Remove CC0.
25335 * config/sparc/sparc.c: Remove CC0.
25336 * config/xtensa/xtensa.c: Remove CC0.
25337 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
25338 RTL where that is used as a placeholder only.
25339 * cprop.c: Remove CC0.
25340 * cse.c: Remove CC0.
25341 * cselib.c: Remove CC0.
25342 * df-problems.c: Remove CC0.
25343 * df-scan.c: Remove CC0.
25344 * doc/md.texi: Remove CC0. Adjust an example.
25345 * doc/rtl.texi: Remove CC0. Adjust an example.
25346 * doc/tm.texi: Regenerate.
25347 * doc/tm.texi.in: Remove CC0.
25348 * emit-rtl.c: Remove CC0.
25349 * final.c: Remove CC0.
25350 * fwprop.c: Remove CC0.
25351 * gcse-common.c: Remove CC0.
25352 * gcse.c: Remove CC0.
25353 * genattrtab.c: Remove CC0.
25354 * genconfig.c: Remove CC0.
25355 * genemit.c: Remove CC0.
25356 * genextract.c: Remove CC0.
25357 * gengenrtl.c: Remove CC0.
25358 * genrecog.c: Remove CC0.
25359 * haifa-sched.c: Remove CC0.
25360 * ifcvt.c: Remove CC0.
25361 * ira-costs.c: Remove CC0.
25362 * ira.c: Remove CC0.
25363 * jump.c: Remove CC0.
25364 * loop-invariant.c: Remove CC0.
25365 * lra-constraints.c: Remove CC0.
25366 * lra-eliminations.c: Remove CC0.
25367 * optabs.c: Remove CC0.
25368 * postreload-gcse.c: Remove CC0.
25369 * postreload.c: Remove CC0.
25370 * print-rtl.c: Remove CC0.
25371 * read-rtl-function.c: Remove CC0.
25372 * reg-notes.def: Remove CC0.
25373 * reg-stack.c: Remove CC0.
25374 * reginfo.c: Remove CC0.
25375 * regrename.c: Remove CC0.
25376 * reload.c: Remove CC0.
25377 * reload1.c: Remove CC0.
25378 * reorg.c: Remove CC0.
25379 * resource.c: Remove CC0.
25380 * rtl.c: Remove CC0.
25381 * rtl.def: Remove CC0.
25382 * rtl.h: Remove CC0.
25383 * rtlanal.c: Remove CC0.
25384 * sched-deps.c: Remove CC0.
25385 * sched-rgn.c: Remove CC0.
25386 * shrink-wrap.c: Remove CC0.
25387 * simplify-rtx.c: Remove CC0.
25388 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
25389 CC_STATUS_MDEP, and CC_STATUS.
25390 * target.def: Remove CC0.
25391 * valtrack.c: Remove CC0.
25392 * var-tracking.c: Remove CC0.
25394 2021-05-04 Richard Biener <rguenther@suse.de>
25396 PR tree-optimization/100414
25397 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
25399 (tree_ssa_phiopt_worker): But unconditionally here.
25401 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
25403 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
25404 && and || with floating-point and complex arguments.
25406 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25408 * tree-inline.c (insert_debug_decl_map): Delete.
25409 (copy_debug_stmt): Minor tweak.
25410 (setup_one_parameter): Do not use a variable if the value is either
25411 a read-only DECL or a non-addressable local variable in the caller.
25412 In this case, insert the debug-only variable in the map manually.
25413 (expand_call_inline): Do not generate a CLOBBER for these values.
25414 * tree-inline.h (debug_map): Minor tweak.
25416 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
25418 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
25419 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
25421 2021-05-04 Richard Biener <rguenther@suse.de>
25423 PR tree-optimization/100329
25424 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
25426 (insert_stmt_after): Assert we're not running into asm goto.
25428 2021-05-04 Richard Biener <rguenther@suse.de>
25430 PR tree-optimization/100398
25431 * tree-ssa-dse.c (pass_dse::execute): Preserve control
25434 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25436 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
25438 2021-05-04 Alexandre Oliva <oliva@adacore.com>
25440 * builtins.c (try_store_by_multiple_pieces): New.
25441 (expand_builtin_memset_args): Use it. If target_char_cast
25442 fails, proceed as for non-constant val. Pass len's ctz to...
25443 * expr.c (clear_storage_hints): ... this. Try store by
25444 multiple pieces after setmem.
25445 (clear_storage): Adjust.
25446 * expr.h (clear_storage_hints): Likewise.
25447 (try_store_by_multiple_pieces): Declare.
25448 * passes.def: Replace the last copy_prop with ccp.
25450 2021-05-03 Tom de Vries <tdevries@suse.de>
25453 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
25456 2021-05-03 Richard Biener <rguenther@suse.de>
25458 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
25460 2021-05-03 Richard Biener <rguenther@suse.de>
25462 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
25463 (dse_dom_walker): Remove.
25464 (dse_dom_walker::dse_optimize_stmt): Rename...
25465 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
25466 (dse_dom_walker::before_dom_children): Inline ...
25467 (pass_dse::execute): ... here. Perform a reverse program
25470 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
25473 * configure: Regenerated.
25475 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
25478 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
25479 (s390_md_asm_adjust): Handle hard registers.
25481 2021-05-03 Jakub Jelinek <jakub@redhat.com>
25483 PR tree-optimization/100382
25484 * tree-ssa-dse.c: Include tree-eh.h.
25485 (dse_dom_walker::before_dom_children): Don't remove stmts if
25486 stmt_unremovable_because_of_non_call_eh_p is true.
25488 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
25490 * varasm.c (compute_reloc_for_var): Split out from...
25491 (get_variable_section): Use it.
25492 * output.h (compute_reloc_for_var): Declare.
25493 * config/rs6000/rs6000-protos.h
25494 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
25496 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
25497 Don't append storage mapping class to symbol.
25498 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
25499 Don't convert TLS BSS to common.
25500 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
25501 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
25503 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
25505 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
25506 If align is 0 from TLS class, use the same rules as varasm.c
25507 If not common, switch to BSS section manually.
25508 If common, emit appropriate comm or lcomm directive.
25509 (rs6000_xcoff_encode_section_info): Add logic to append all
25510 storage mapping classes.
25511 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
25512 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
25513 rs6000_xcoff_asm_output_aligned_decl_common.
25514 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
25515 rs6000_xcoff_asm_output_aligned_decl_common.
25516 (ASM_OUTPUT_TLS_COMMON): Use
25517 rs6000_xcoff_asm_output_aligned_decl_common.
25519 2021-05-02 Jakub Jelinek <jakub@redhat.com>
25522 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
25523 as first argument of pseudo_node_t constructors.
25525 2021-05-02 Jakub Jelinek <jakub@redhat.com>
25528 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
25530 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
25532 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
25533 (gt_pch_nx (int_range<1> *&)): New.
25534 (gt_ggc_mx (int_range<1> *&)): New.
25535 * value-range.h (class irange): Add GTY support for
25538 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
25540 * doc/options.texi (Negative): Change either or to both and.
25542 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25544 * config/aarch64/aarch64-simd-builtins.def: Add
25545 float_ml[as][q]_laneq builtin generator macros.
25546 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
25547 (aarch64_float_mla_laneq<mode>): Define.
25548 (aarch64_float_mls_laneq<mode>): Define.
25549 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
25550 instead of GCC vector extensions.
25551 (vmlaq_laneq_f32): Likewise.
25552 (vmls_laneq_f32): Likewise.
25553 (vmlsq_laneq_f32): Likewise.
25555 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25557 * config/aarch64/aarch64-simd-builtins.def: Add
25558 float_ml[as]_lane builtin generator macros.
25559 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
25561 (mul_lane<mode>3): This, and re-order arguments.
25562 (aarch64_float_mla_lane<mode>): Define.
25563 (aarch64_float_mls_lane<mode>): Define.
25564 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
25565 instead of GCC vector extensions.
25566 (vmlaq_lane_f32): Likewise.
25567 (vmls_lane_f32): Likewise.
25568 (vmlsq_lane_f32): Likewise.
25570 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25572 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
25573 builtin generator macros.
25574 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
25576 (aarch64_float_mls<mode>): Define.
25577 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
25578 instead of relying on GCC vector extensions.
25579 (vmla_f64): Likewise.
25580 (vmlaq_f32): Likewise.
25581 (vmlaq_f64): Likewise.
25582 (vmls_f32): Likewise.
25583 (vmls_f64): Likewise.
25584 (vmlsq_f32): Likewise.
25585 (vmlsq_f64): Likewise.
25586 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
25588 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
25590 * config/aarch64/aarch64-simd-builtins.def: Add
25591 float_ml[as]_n_builtin generator macros.
25592 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
25594 (mul_n<mode>3): This, and re-order arguments.
25595 (aarch64_float_mla_n<mode>): Define.
25596 (aarch64_float_mls_n<mode>): Define.
25597 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
25598 instead of inline asm.
25599 (vmlaq_n_f32): Likewise.
25600 (vmls_n_f32): Likewise.
25601 (vmlsq_n_f32): Likewise.
25603 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
25605 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
25606 builtin generator macros.
25607 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
25608 (aarch64_pmull_hiv16qi_insn): Define.
25609 (aarch64_pmull_hiv16qi): Define.
25610 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
25611 instead of inline asm.
25612 (vmull_p8): Likewise.
25614 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
25616 * config/avr/avr.md: Adjust peepholes to match and
25617 generate parallels with clobber of REG_CC.
25618 (mov<mode>_insn): Rename to mov<mode>_insn_split.
25619 (*mov<mode>_insn): Rename to mov<mode>_insn.
25621 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
25623 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
25624 for VAR_DECLs if -fdata-sections enabled.
25626 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
25628 PR bootstrap/100327
25629 * config/rs6000/rs6000.c
25630 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
25631 (rs6000_libgcc_floating_mode_supported_p): New target hook.
25633 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
25635 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
25637 (class back_threader_registry): ...to here...
25638 (class back_threader_profitability): ...and here...
25639 (thread_jumps::thread_through_all_blocks): Remove argument.
25640 (back_threader_registry::back_threader_registry): New.
25641 (back_threader_registry::~back_threader_registry): New.
25642 (back_threader_registry::thread_through_all_blocks): New.
25643 (thread_jumps::profitable_jump_thread_path): Move from here...
25644 (back_threader_profitability::profitable_path_p): ...to here.
25645 (thread_jumps::find_taken_edge): New.
25646 (thread_jumps::convert_and_register_current_path): Move...
25647 (back_threader_registry::register_path): ...to here.
25648 (thread_jumps::register_jump_thread_path_if_profitable): Move...
25649 (thread_jumps::maybe_register_path): ...to here.
25650 (thread_jumps::handle_phi): Call find_taken_edge and
25651 maybe_register_path.
25652 (thread_jumps::handle_assignment): Same.
25653 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
25654 tree argument to handle_phi and handle_assignment.
25655 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
25656 set of m_speed_p and m_max_threaded_paths.
25657 (pass_thread_jumps::execute): Remove second argument from
25658 find_jump_threads_backwards.
25659 (pass_early_thread_jumps::execute): Same.
25661 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
25663 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
25664 (class dom_opt_dom_walker): Initialize some class variables.
25665 (pass_dominator::execute): Pass evrp_range_analyzer and
25666 dom_jump_threader_simplifier to dom_opt_dom_walker.
25667 Adjust for some functions moving into classes.
25668 (simplify_stmt_for_jump_threading): Adjust and move to...
25669 (jump_threader_simplifier::simplify): ...here.
25670 (dom_opt_dom_walker::before_dom_children): Adjust for
25671 m_evrp_range_analyzer.
25672 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
25673 (test_for_singularity): Place in dom_opt_dom_walker class.
25674 (dom_opt_dom_walker::optimize_stmt): The argument
25675 evrp_range_analyzer is now a class field.
25676 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
25677 (thread_jumps::thread_through_all_blocks): New.
25678 (thread_jumps::convert_and_register_current_path): Use m_registry.
25679 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
25680 being in the threader class.
25681 (pass_early_thread_jumps::execute): Same.
25682 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
25683 (jump_threader::jump_threader): ...here.
25684 (threadedge_finalize_values): Move...
25685 (jump_threader::~jump_threader): ...here.
25686 (jump_threader::remove_jump_threads_including): New.
25687 (jump_threader::thread_through_all_blocks): New.
25688 (record_temporary_equivalences_from_phis): Move...
25689 (jump_threader::record_temporary_equivalences_from_phis): ...here.
25690 (record_temporary_equivalences_from_stmts_at_dest): Move...
25691 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
25693 (simplify_control_stmt_condition_1): Move to jump_threader class.
25694 (simplify_control_stmt_condition): Move...
25695 (jump_threader::simplify_control_stmt_condition): ...here.
25696 (thread_around_empty_blocks): Move...
25697 (jump_threader::thread_around_empty_blocks): ...here.
25698 (thread_through_normal_block): Move...
25699 (jump_threader::thread_through_normal_block): ...here.
25700 (thread_across_edge): Move...
25701 (jump_threader::thread_across_edge): ...here.
25702 (thread_outgoing_edges): Move...
25703 (jump_threader::thread_outgoing_edges): ...here.
25704 * tree-ssa-threadedge.h: Move externally facing functings...
25705 (class jump_threader): ...here...
25706 (class jump_threader_simplifier): ...and here.
25707 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
25708 (jump_thread_path_allocator::jump_thread_path_allocator): New.
25709 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
25710 (jump_thread_path_allocator::allocate_thread_edge): New.
25711 (jump_thread_path_allocator::allocate_thread_path): New.
25712 (jump_thread_path_registry::jump_thread_path_registry): New.
25713 (jump_thread_path_registry::~jump_thread_path_registry): New.
25714 (jump_thread_path_registry::allocate_thread_edge): New.
25715 (jump_thread_path_registry::allocate_thread_path): New.
25716 (dump_jump_thread_path): Make extern.
25717 (debug (const vec<jump_thread_edge *> &path)): New.
25718 (struct removed_edges): Move to tree-ssa-threadupdate.h.
25719 (struct thread_stats_d): Remove.
25720 (remove_ctrl_stmt_and_useless_edges): Make static.
25721 (lookup_redirection_data): Move...
25722 (jump_thread_path_registry::lookup_redirection_data): ...here.
25723 (ssa_redirect_edges): Make static.
25724 (thread_block_1): Move...
25725 (jump_thread_path_registry::thread_block_1): ...here.
25726 (thread_block): Move...
25727 (jump_thread_path_registry::thread_block): ...here.
25728 (thread_through_loop_header): Move...
25729 (jump_thread_path_registry::thread_through_loop_header): ...here.
25730 (mark_threaded_blocks): Move...
25731 (jump_thread_path_registry::mark_threaded_blocks): ...here.
25732 (debug_path): Move...
25733 (jump_thread_path_registry::debug_path): ...here.
25734 (debug_all_paths): Move...
25735 (jump_thread_path_registry::dump): ..here.
25736 (rewire_first_differing_edge): Move...
25737 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
25738 (adjust_paths_after_duplication): Move...
25739 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
25740 (duplicate_thread_path): Move...
25741 (jump_thread_path_registry::duplicate_thread_path): ..here.
25742 (remove_jump_threads_including): Move...
25743 (jump_thread_path_registry::remove_jump_threads_including): ...here.
25744 (thread_through_all_blocks): Move to...
25745 (jump_thread_path_registry::thread_through_all_blocks): ...here.
25746 (delete_jump_thread_path): Remove.
25747 (register_jump_thread): Move...
25748 (jump_thread_path_registry::register_jump_thread): ...here.
25749 * tree-ssa-threadupdate.h: Move externally facing functions...
25750 (class jump_thread_path_allocator): ...here...
25751 (class jump_thread_path_registry): ...and here.
25752 (thread_through_all_blocks): Remove.
25753 (struct removed_edges): New.
25754 (register_jump_thread): Remove.
25755 (remove_jump_threads_including): Remove.
25756 (delete_jump_thread_path): Remove.
25757 (remove_ctrl_stmt_and_useless_edges): Remove.
25758 (free_dom_edge_info): New prototype.
25759 * tree-vrp.c: Remove x_vr_values hack.
25760 (class vrp_jump_threader_simplifier): New.
25761 (vrp_jump_threader_simplifier::simplify): New.
25762 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
25763 Remove m_dummy_cond.
25764 Instantiate m_simplifier and m_threader.
25765 (vrp_jump_threader::thread_through_all_blocks): New.
25766 (vrp_jump_threader::simplify_stmt): Remove.
25767 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
25768 Remove x_vr_values hack.
25769 (execute_vrp): Adjust for thread_through_all_blocks being in a
25772 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
25774 * genflags.c (gen_insn): Print failed expansion string.
25776 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
25778 * expr.c (alignment_for_piecewise_move): Call mode_for_size
25779 without limit to MAX_FIXED_MODE_SIZE.
25781 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
25783 PR middle-end/90773
25784 * builtins.c (builtin_memset_gen_str): Don't use return from
25785 simplify_gen_subreg.
25787 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
25790 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
25791 (*addsi3_carry_zext_0r): Ditto.
25792 (*sub<mode>3_carry_0): Ditto.
25793 (*subsi3_carry_zext_0r): Ditto.
25794 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
25796 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
25797 Also consider ix86_carry_flag_unset_operator to calculate
25798 the cost of adc/sbb insn.
25800 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
25802 PR rtl-optimization/100225
25803 PR rtl-optimization/84878
25804 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
25805 where we have an instruction which touches (writes) any hard
25806 register from df->regular_block_artificial_uses set.
25807 Allow not-single-set instruction only right before basic block
25810 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
25812 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
25814 2021-04-30 LevyHsu <admin@levyhsu.com>
25816 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
25817 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
25818 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
25819 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
25821 2021-04-29 Alexandre Oliva <oliva@adacore.com>
25823 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
25825 2021-04-29 Alexandre Oliva <oliva@adacore.com>
25827 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
25828 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
25829 * config/i386/i386.c: Adjust.
25830 * config/i386/i386.md: Adjust.
25831 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
25832 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25833 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25834 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25835 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25836 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25837 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25838 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25839 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25840 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
25841 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
25843 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
25845 * config/i386/i386-expand.c (ix86_expand_int_compare):
25846 Swap operands of GTU and LEU comparison to emit carry flag comparison.
25847 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
25848 predicate to allow more combine opportunities with memory operands.
25849 (*sub<mode>3_carry_0): Ditto.
25851 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
25853 PR rtl-optimization/100303
25854 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
25855 boolean that indicates whether the use will only be used in
25856 debug instructions. Treat it in the same way that existing
25857 cross-EBB debug references would be handled if so.
25858 (function_info::make_uses_available): Likewise.
25859 * rtl-ssa/functions.h (function_info::make_uses_available): Update
25860 prototype accordingly.
25861 (function_info::make_uses_available): Likewise.
25862 * fwprop.c (try_fwprop_subst): Update call accordingly.
25864 2021-04-29 Jeff Law <jlaw@tachyum.com>
25866 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
25869 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
25870 Richard Biener <rguenther@suse.de>
25873 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
25874 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
25875 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
25876 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
25877 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
25878 to PURE_ARGS category.
25879 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
25880 Handle PURE_ARGS category.
25881 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
25883 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
25885 * configure.ac: Check for the presence of sys/locking.h header and
25886 for whether _LK_LOCK is supported by _locking.
25887 * configure: Regenerate.
25888 * config.in: Likewise.
25889 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
25890 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
25891 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
25893 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
25895 * config/i386/predicates.md (fcmov_comparison_operator):
25896 Do not check for trivial FP comparison operator.
25897 <case GEU, case LTU>: Allow CCGZmode.
25898 <case GTU, case LEU>: Do not allow CCCmode.
25899 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
25900 (ix86_carry_flag_operator): Match only LTU and UNLT code.
25901 Do not check for trivial FP comparison operator. Allow CCGZmode.
25903 2021-04-29 Tom de Vries <tdevries@suse.de>
25905 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
25906 fd->loop.step by either step or orig_step.
25908 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
25910 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
25911 (load_got_register): Do the PIC dance here.
25912 (sparc_legitimize_tls_address): Simplify.
25913 (sparc_emit_probe_stack_range): Likewise.
25914 (sparc32_initialize_trampoline): Likewise.
25915 (sparc64_initialize_trampoline): Likewise.
25916 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
25917 (probe_stack_range<P:mode>): Likewise.
25918 (flush<P:mode>): Likewise.
25919 (tgd_hi22<P:mode>): Likewise.
25920 (tgd_lo10<P:mode>): Likewise.
25921 (tgd_add<P:mode>): Likewise.
25922 (tgd_call<P:mode>): Likewise.
25923 (tldm_hi22<P:mode>): Likewise.
25924 (tldm_lo10<P:mode>): Likewise.
25925 (tldm_add<P:mode>): Likewise.
25926 (tldm_call<P:mode>): Likewise.
25927 (tldo_hix22<P:mode>): Likewise.
25928 (tldo_lox10<P:mode>): Likewise.
25929 (tldo_add<P:mode>): Likewise.
25930 (tie_hi22<P:mode>): Likewise.
25931 (tie_lo10<P:mode>): Likewise.
25932 (tie_add<P:mode>): Likewise.
25933 (tle_hix22<P:mode>): Likewise.
25934 (tle_lox10<P:mode>): Likewise.
25935 (stack_protect_setsi): Rename to...
25936 (stack_protect_set32): ...this.
25937 (stack_protect_setdi): Rename to...
25938 (stack_protect_set64): ...this.
25939 (stack_protect_set): Adjust calls to above.
25940 (stack_protect_testsi): Rename to...
25941 (stack_protect_test32): ...this.
25942 (stack_protect_testdi): Rename to...
25943 (stack_protect_test64): ...this.
25944 (stack_protect_test): Adjust calls to above.
25946 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
25948 PR middle-end/90773
25949 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
25950 (builtin_strncpy_read_str): Likewise.
25951 (builtin_memset_read_str): Add an argument for the previous RTL
25952 information and generate the new RTL from the previous RTL info.
25953 (builtin_memset_gen_str): Likewise.
25954 * builtins.h (builtin_strncpy_read_str): Update the prototype.
25955 (builtin_memset_read_str): Likewise.
25956 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
25957 returns true, round up size and alignment to the widest integer
25958 mode for maximum size.
25959 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
25960 and pass it to m_constfn.
25961 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
25962 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
25963 initialize m_push. Initialize m_overlap_op_by_pieces with
25964 targetm.overlap_op_by_pieces_p ().
25965 (op_by_pieces_d::run): Pass the previous RTL information to
25966 pieces_addr::adjust and generate overlapping operations if
25967 m_overlap_op_by_pieces is true.
25969 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
25971 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
25973 (can_store_by_pieces): Use by_pieces_constfn on constfun.
25974 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
25975 for op_by_pieces_d change.
25976 (clear_by_pieces_1): Add a dummy argument.
25977 (clear_by_pieces): Updated for op_by_pieces_d change.
25978 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
25979 (string_cst_read_str): Add a dummy argument.
25980 * expr.h (by_pieces_constfn): Add a dummy argument.
25981 (by_pieces_prev): New.
25982 * target.def (overlap_op_by_pieces_p): New target hook.
25983 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
25984 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
25985 * doc/tm.texi: Regenerated.
25987 2021-04-29 Richard Biener <rguenther@suse.de>
25989 PR tree-optimization/100253
25990 * tree-vect-stmts.c (vectorizable_load): Do not assume
25991 element alignment when DR_MISALIGNMENT is -1.
25992 (vectorizable_store): Likewise.
25994 2021-04-29 Jakub Jelinek <jakub@redhat.com>
25997 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
25998 absu_hwi instead of abs_hwi.
26000 2021-04-29 Richard Biener <rguenther@suse.de>
26002 PR middle-end/38474
26003 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
26004 forwarding when indirect forwarding through ESCAPED
26007 2021-04-29 Tom de Vries <tdevries@suse.de>
26010 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
26011 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
26012 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
26013 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
26015 2021-04-29 Richard Biener <rguenther@suse.de>
26017 PR tree-optimization/99912
26018 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
26019 (dse_dom_walker::todo): Likewise.
26020 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
26022 (dse_dom_walker::before_dom_children): Remove trivially
26023 dead SSA defs and schedule CFG cleanup if we removed all
26025 (pass_dse::execute): Get TODO as computed by the DOM walker
26026 and return it. Wipe dominator info earlier.
26028 2021-04-29 Richard Biener <rguenther@suse.de>
26031 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
26032 Track blocks to cleanup EH in new m_need_eh_cleanup.
26033 (ipcp_modif_dom_walker::cleanup_eh): New.
26034 (ipcp_transform_function): Release dominator info before
26037 2021-04-29 Martin Sebor <msebor@redhat.com>
26039 PR middle-end/100250
26040 * attribs.c (attr_access::array_as_string): Avoid dereferencing
26041 a pointer when it's null.
26043 2021-04-29 Martin Sebor <msebor@redhat.com>
26045 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
26046 * ipa-free-lang-data.cc: New file.
26047 * tree.c: Move pass free_lang_data to file above.
26048 (build_array_type_1): Declare extern.
26049 * tree.h (build_array_type_1): Declare.
26051 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26053 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
26054 make consistent with updated RTL pattern.
26055 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
26056 Implement using ss_truncate and us_truncate rather than
26058 * config/aarch64/iterators.md: Remove redundant unspecs and
26059 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
26061 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26063 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
26064 attributes consistent with those defined in arm_neon.h.
26066 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26068 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
26069 attributes consistent with those defined in arm_neon.h.
26071 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26073 * config/aarch64/aarch64-simd-builtins.def: Add
26074 float_trunc_rodd builtin generator macros.
26075 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
26077 (aarch64_float_trunc_rodd_lo_v2sf): Define.
26078 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
26079 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
26080 (aarch64_float_trunc_rodd_hi_v4sf): Define.
26081 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
26082 instead of inline asm.
26083 (vcvtx_high_f32_f64): Likewise.
26084 (vcvtxd_f32_f64): Likewise.
26085 * config/aarch64/iterators.md: Add FCVTXN unspec.
26087 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26089 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
26091 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
26093 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
26094 instead of inline asm.
26095 (vqtbx1_u8): Likewise.
26096 (vqtbx1_p8): Likewise.
26097 (vqtbx1q_s8): Likewise.
26098 (vqtbx1q_u8): Likewise.
26099 (vqtbx1q_p8): Likewise.
26100 (vtbx2_s8): Likewise.
26101 (vtbx2_u8): Likewise.
26102 (vtbx2_p8): Likewise.
26104 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26106 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
26108 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
26109 instead of inline asm.
26110 (vqtbl1_s8): Likewise.
26111 (vqtbl1_u8): Likewise.
26112 (vqtbl1q_p8): Likewise.
26113 (vqtbl1q_s8): Likewise.
26114 (vqtbl1q_u8): Likewise.
26115 (vtbl1_s8): Likewise.
26116 (vtbl1_u8): Likewise.
26117 (vtbl1_p8): Likewise.
26118 (vtbl2_s8): Likewise.
26119 (vtbl2_u8): Likewise.
26120 (vtbl2_p8): Likewise.
26122 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26124 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
26125 ssri_n buitin generator macro.
26126 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
26127 instead of inline asm.
26128 (vsri_n_p16): Likewise.
26129 (vsri_n_p64): Likewise.
26130 (vsriq_n_p8): Likewise.
26131 (vsriq_n_p16): Likewise.
26132 (vsriq_n_p64): Likewise.
26134 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26136 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
26137 iterator for polynomial ssli_n builtin generator macro.
26138 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
26139 instead of inline asm.
26140 (vsli_n_p16): Likewise.
26141 (vsliq_n_p8): Likewise.
26142 (vsliq_n_p16): Likewise.
26143 * config/aarch64/iterators.md: Define VALLP mode iterator.
26145 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26147 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
26148 iterator to generate [su]adalp RTL builtins.
26149 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
26150 [su]adalp RTL pattern.
26151 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
26152 instead of inline asm.
26153 (vpadal_u32): Likewise.
26155 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26157 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
26158 builtin generator macros.
26159 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
26161 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
26162 instead of inline asm.
26163 (vpaddl_s16): Likewise.
26164 (vpaddl_s32): Likewise.
26165 (vpaddl_u8): Likewise.
26166 (vpaddl_u16): Likewise.
26167 (vpaddl_u32): Likewise.
26168 (vpaddlq_s8): Likewise.
26169 (vpaddlq_s16): Likewise.
26170 (vpaddlq_s32): Likewise.
26171 (vpaddlq_u8): Likewise.
26172 (vpaddlq_u16): Likewise.
26173 (vpaddlq_u32): Liwewise.
26174 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
26175 appropriate attributes.
26177 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26179 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
26180 for aarch64_addp<mode> builtin macro generator.
26181 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
26182 aarch64_addp<mode> RTL pattern.
26183 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
26184 instead of inline asm.
26185 (vpaddq_s16): Likewise.
26186 (vpaddq_s32): Likewise.
26187 (vpaddq_s64): Likewise.
26188 (vpaddq_u8): Likewise.
26189 (vpaddq_u16): Likewise.
26190 (vpaddq_u32): Likewise.
26191 (vpaddq_u64): Likewise.
26193 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
26195 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
26196 builtin generator macros.
26197 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
26199 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
26200 instead of inline asm.
26201 (vqdmulh_n_s32): Likewise.
26202 (vqdmulhq_n_s16): Likewise.
26203 (vqdmulhq_n_s32): Likewise.
26204 (vqrdmulh_n_s16): Likewise.
26205 (vqrdmulh_n_s32): Likewise.
26206 (vqrdmulhq_n_s16): Likewise.
26207 (vqrdmulhq_n_s32): Likewise.
26209 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
26211 * doc/install.texi (--enable-offload-defaulted): Document.
26213 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
26215 * config/avr/avr-dimode.md: Turn existing patterns into
26216 define_insn_and_split style patterns where the splitter
26217 adds a clobber of the condition code register. Drop "cc"
26218 attribute. Add new patterns to match output of
26220 * config/avr/avr-fixed.md: Likewise.
26221 * config/avr/avr.c (cc_reg_rtx): New.
26222 (avr_parallel_insn_from_insns): Adjust insn count
26223 for removal of set of cc0.
26224 (avr_is_casesi_sequence): Likewise.
26225 (avr_casei_sequence_check_operands): Likewise.
26226 (avr_optimize_casesi): Likewise. Also insert
26227 new insns after jump_insn.
26228 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
26229 for removal of set of cc0.
26230 (avr_init_expanders): Initialize cc_reg_rtx.
26231 (avr_regno_reg_class): Handle REG_CC.
26232 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
26233 (avr_notice_update_cc): Remove function.
26234 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
26235 (compare_condition): Adjust for PARALLEL with
26237 (out_shift_with_cnt): Likewise.
26238 (ashlhi3_out): Likewise.
26239 (ashrhi3_out): Likewise.
26240 (lshrhi3_out): Likewise.
26241 (avr_class_max_nregs): Return single reg for REG_CC.
26242 (avr_compare_pattern): Check for REG_CC instead
26244 (avr_reorg_remove_redundant_compare): Likewise.
26245 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
26246 (avr_hard_regno_nregs): Return single reg for REG_CC.
26247 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
26248 (avr_md_asm_adjust): Clobber REG_CC.
26249 (TARGET_HARD_REGNO_NREGS): Define.
26250 (TARGET_CLASS_MAX_NREGS): Define.
26251 (TARGET_MD_ASM_ADJUST): Define.
26252 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
26254 (enum reg_class): Add CC_REG class.
26255 (NOTICE_UPDATE_CC): Remove.
26256 (CC_OVERFLOW_UNUSABLE): Remove.
26257 (CC_NO_CARRY): Remove.
26258 * config/avr/avr.md: Turn existing patterns into
26259 define_insn_and_split style patterns where the splitter
26260 adds a clobber of the condition code register. Drop "cc"
26261 attribute. Add new patterns to match output of
26263 (sez): Remove unused pattern.
26265 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
26268 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
26271 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
26274 * config/aarch64/constraints.md (Utq): Require the address to
26275 be valid for both the element mode and for V2DImode.
26277 2021-04-28 Jakub Jelinek <jakub@redhat.com>
26278 Tobias Burnus <tobias@codesourcery.com>
26280 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
26281 * gcc.c (process_command): New variable.
26282 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
26283 set it if -foffload is defaulted.
26284 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
26285 (compile_offload_image): If OFFLOAD_DEFAULTED and
26286 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
26287 if corresponding mkoffload can't be found.
26288 (compile_images_for_offload_targets): Likewise. Free and clear
26289 offload_names if no valid offload is found.
26290 * config.in: Regenerate.
26291 * configure: Regenerate.
26293 2021-04-28 Richard Biener <rguenther@suse.de>
26295 PR tree-optimization/100292
26296 * tree-vect-generic.c (expand_vector_condition): Do not fold
26299 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26301 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
26302 * config/rs6000/aix64.opt (m64): New.
26305 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26307 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
26308 (index_term_p): Handle ASHIFT too.
26310 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26312 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
26313 (sync_lock_test_and_set<mode>): Adjust accordingly.
26314 (sync_lock_release<mode>): Likewise.
26316 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
26318 * config/vax/vax-protos.h (adjacent_operands_p): Remove
26320 * config/vax/vax.c (adjacent_operands_p): Remove.
26322 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
26324 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
26325 through to the non-conditional execution case if getting the
26326 condition for conditional execution has failed.
26328 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26330 PR middle-end/100284
26331 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
26332 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
26333 than asserting on it.
26335 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26337 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
26338 with TARGET_AIX_OS.
26340 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
26343 * calls.c (precompute_register_parameters): Additionally test
26344 targetm.precompute_tls_p to pre-compute argument.
26345 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
26346 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
26347 * target.def (precompute_tls_p): New.
26348 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
26349 * doc/tm.texi: Regenerated.
26351 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26354 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
26355 back to HOST_WIDE_INT.
26357 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
26360 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
26361 memory alignment for the outer mode.
26363 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
26365 PR middle-end/90773
26366 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
26367 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
26369 2021-04-27 Alex Coplan <alex.coplan@arm.com>
26372 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
26373 with negative immediates: ensure we expand cbranchsi4_scratch
26374 correctly and ensure we satisfy its constraints.
26375 * config/arm/sync.md
26376 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
26377 attempt to tie two output operands together with constraints;
26378 collapse two alternatives.
26379 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
26380 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
26382 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26385 * config/aarch64/predicates.md (aarch64_sub_immediate,
26386 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
26387 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
26388 * config/aarch64/aarch64.c (aarch64_print_operand,
26389 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
26391 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26393 PR tree-optimization/100239
26394 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
26395 permutations with all indices from the first zero element as vec_shl.
26397 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26399 PR rtl-optimization/100254
26400 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
26401 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
26403 2021-04-27 Richard Biener <rguenther@suse.de>
26405 PR tree-optimization/99912
26406 * passes.def: Add comment about new TODO_remove_unused_locals.
26407 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
26410 2021-04-27 Richard Biener <rguenther@suse.de>
26412 PR tree-optimization/99912
26413 * passes.def (pass_all_optimizations): Add pass_dse before
26414 the first pass_dce, move the first pass_dse before the
26415 pass_dce following pass_pre.
26417 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26419 PR tree-optimization/95527
26420 * generic-match-head.c: Include tm.h.
26421 * gimple-match-head.c: Include tm.h.
26422 * match.pd (CLZ == INTEGER_CST): Don't use
26423 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
26424 if clz == CFN_CLZ. Add missing val declaration.
26425 (CTZ cmp CST): New simplifications.
26427 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26429 PR tree-optimization/96696
26430 * expr.c (expand_expr_divmod): New function.
26431 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
26432 divisions. Formatting fixes.
26433 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
26436 2021-04-27 Martin Jambor <mjambor@suse.cz>
26439 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
26440 If removing a call statement LHS SSA name, release it.
26442 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
26445 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
26446 is valid before including it in the mask.
26448 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
26451 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
26454 2021-04-27 Richard Biener <rguenther@suse.de>
26456 PR tree-optimization/100051
26457 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
26458 disambiguator based on access size vs. decl size.
26460 2021-04-27 Richard Biener <rguenther@suse.de>
26462 PR tree-optimization/100278
26463 * tree-ssa-pre.c (compute_avail): Give up when we cannot
26464 adjust TBAA beacuse of mismatching bases.
26466 2021-04-27 Jakub Jelinek <jakub@redhat.com>
26469 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
26470 For any_rotate define_insn_split and following splitters, use
26471 SWI iterator instead of SWI48.
26473 2021-04-27 Richard Biener <rguenther@suse.de>
26475 PR tree-optimization/99776
26476 * match.pd (bit_field_ref (ctor)): Relax element extract
26477 type compatibility checks.
26479 2021-04-27 Cui,Lili <lili.cui@intel.com>
26481 * common/config/i386/i386-common.c (processor_names):
26482 Sync processor_names with processor_type.
26483 * config/i386/i386-options.c (processor_cost_table):
26484 Sync processor_cost_table with processor_type.
26486 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26488 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
26489 (irange::set): Call irange_set_1bit_anti_range for handling all
26490 1-bit ranges. Fall through on ~[MIN,MAX].
26492 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26494 * value-range.cc (irange::legacy_num_pairs): Remove.
26495 (irange::invert): Change gcc_assert to gcc_checking_assert.
26496 * value-range.h (irange::num_pairs): Adjust for a cached
26497 num_pairs(). Also, rename all gcc_assert's to
26498 gcc_checking_assert's.
26500 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26502 * value-range.cc (irange::operator=): Set m_kind.
26503 (irange::copy_to_legacy): Handle varying and undefined sources
26504 as a legacy copy since they can be easily copied.
26505 (irange::irange_set): Set m_kind.
26506 (irange::irange_set_anti_range): Same.
26507 (irange::set): Rename normalize_min_max to normalize_kind.
26508 (irange::verify_range): Adjust for multi-ranges having the
26510 (irange::irange_union): Set m_kind.
26511 (irange::irange_intersect): Same.
26512 (irange::invert): Same.
26513 * value-range.h (irange::kind): Always return m_kind.
26514 (irange::varying_p): Rename to...
26515 (irange::varying_comptaible_p): ...this.
26516 (irange::undefined_p): Only look at m_kind.
26517 (irange::irange): Always set VR_UNDEFINED if applicable.
26518 (irange::set_undefined): Always set VR_UNDEFINED.
26519 (irange::set_varying): Always set m_kind to VR_VARYING.
26520 (irange::normalize_min_max): Rename to...
26521 (irange::normalize_kind): ...this.
26523 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26525 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
26526 Adjust for constant_p including varying_p.
26527 * tree-vrp.c (vrp_prop::finalize): Same.
26528 (determine_value_range): Same.
26529 * vr-values.c (vr_values::range_of_expr): Same.
26530 * value-range.cc (irange::symbolic_p): Do not check varying_p.
26531 (irange::constant_p): Same.
26533 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
26535 * value-range.cc (irange::legacy_lower_bound): Replace
26536 !undefined_p check with num_ranges > 0.
26537 (irange::legacy_upper_bound): Same.
26538 * value-range.h (irange::type): Same.
26539 (irange::lower_bound): Same.
26540 (irange::upper_bound): Same.
26542 2021-04-26 Richard Biener <rguenther@suse.de>
26544 PR tree-optimization/99956
26545 * gimple-loop-interchange.cc (compute_access_stride):
26546 Try instantiating the access in a shallower loop nest
26547 if instantiating failed.
26548 (compute_access_strides): Pass adjustable loop_nest
26549 to compute_access_stride.
26551 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
26553 * doc/sourcebuild.texi (arm_cmse_hw): Document.
26555 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26557 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
26559 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
26560 Nathan Sidwell <nathan@codesourcery.com>
26561 Tom de Vries <vries@codesourcery.com>
26562 Julian Brown <julian@codesourcery.com>
26563 Kwok Cheung Yeung <kcy@codesourcery.com>
26565 * omp-offload.c (oacc_validate_dims): Implement
26566 '-Wopenacc-parallelism'.
26567 * doc/invoke.texi (-Wopenacc-parallelism): Document.
26569 2021-04-26 Richard Biener <rguenther@suse.de>
26571 * tree-cfg.h (gimplify_build1): Remove.
26572 (gimplify_build2): Likewise.
26573 (gimplify_build3): Likewise.
26574 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
26575 (gimplify_build2): Likewise.
26576 (gimplify_build3): Likewise.
26577 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
26579 (gimplify_build2): Likewise.
26580 (gimplify_build3): Likewise.
26581 (tree_vec_extract): Use resimplify with following SSA edges.
26582 (expand_vector_parallel): Avoid passing NULL size/bitpos
26583 to tree_vec_extract.
26584 * expr.c (store_constructor): Deal with zero-element CTORs.
26585 * match.pd (bit_field_ref <vector CTOR>): Make sure to
26586 produce vector constants when possible.
26588 2021-04-26 Richard Biener <rguenther@suse.de>
26590 * tree-complex.c: Include gimple-fold.h.
26591 (expand_complex_addition): Use gimple_build.
26592 (expand_complex_multiplication_components): Likewise.
26593 (expand_complex_multiplication): Likewise.
26594 (expand_complex_div_straight): Likewise.
26595 (expand_complex_div_wide): Likewise.
26596 (expand_complex_division): Likewise.
26597 (expand_complex_conjugate): Likewise.
26598 (expand_complex_comparison): Likewise.
26600 2021-04-26 Richard Biener <rguenther@suse.de>
26602 * tree-ssa-phiopt.c (two_value_replacement): Remove use
26603 of legacy gimplify_buildN API.
26605 2021-04-26 Richard Biener <rguenther@suse.de>
26607 PR tree-optimization/99473
26608 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
26611 2021-04-26 Richard Biener <rguenther@suse.de>
26613 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
26614 Use replace_call_with_value.
26616 2021-04-26 Richard Biener <rguenther@suse.de>
26618 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
26619 (update_gimple_call): Likewise.
26620 (update_call_from_tree): Likewise.
26621 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
26622 (valid_gimple_call_p): Likewise.
26623 (move_ssa_defining_stmt_for_defs): Likewise.
26624 (finish_update_gimple_call): Likewise.
26625 (update_gimple_call): Likewise.
26626 (update_call_from_tree): Likewise.
26627 (propagate_tree_value_into_stmt): Use replace_call_with_value.
26628 * gimple-fold.h (update_gimple_call): Declare.
26629 * gimple-fold.c (valid_gimple_rhs_p): Move here from
26630 tree-ssa-propagate.c.
26631 (update_gimple_call): Likewise.
26632 (valid_gimple_call_p): Likewise.
26633 (finish_update_gimple_call): Likewise, and simplify.
26634 (gimplify_and_update_call_from_tree): Implement
26635 update_call_from_tree functionality, avoid excessive
26636 push/pop_gimplify_context.
26637 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
26638 (gimple_fold_call): Likewise.
26639 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
26640 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
26641 (pass_fold_builtins::execute): Likewise.
26642 (optimize_stack_restore): Use replace_call_with_value.
26643 * tree-cfg.c (fold_loop_internal_call): Likewise.
26644 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
26645 only gimplify_and_update_call_from_tree.
26646 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
26647 (handle_builtin_strchr): Likewise.
26648 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
26650 2021-04-26 Jakub Jelinek <jakub@redhat.com>
26653 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
26654 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
26657 2021-04-25 liuhongt <hongtao.liu@intel.com>
26660 * config/i386/i386-builtin.def (BDESC): Change the icode of
26661 the following builtins to CODE_FOR_nothing.
26662 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
26663 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
26664 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
26665 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
26666 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
26667 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
26668 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
26669 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
26670 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
26671 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
26672 (sse2_eq<mode>3): Ditto.
26673 (sse4_1_eqv2di3): Ditto.
26674 (sse2_gt<mode>3): Rename to ..
26675 (*sse2_gt<mode>3): .. this.
26677 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26680 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26683 * config/darwin.c (darwin_binds_local_p): Assume that any
26684 public symbol might be interposed for PIC code. Update function
26685 header comment to reflect current Darwin capability.
26687 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
26690 * config/darwin.c (darwin_binds_local_p): Assume that any
26691 public symbol might be interposed for PIC code. Update function
26692 header comment to reflect current Darwin capability.
26694 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
26696 * doc/sourcebuild.texi: Document no-opts and any-opts target
26699 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
26701 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
26703 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
26705 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
26707 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
26710 * config/i386/i386-options.c (ix86_option_override_internal):
26711 Error out when -m96bit-long-double is used with 64bit targets.
26712 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
26714 2021-04-23 Martin Liska <mliska@suse.cz>
26716 * lto-wrapper.c: Remove FIXME about usage of
26717 hardware_concurrency. The function is not on par with
26720 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
26723 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
26724 Copy operand 3 to operand 4. Use sse_reg_operand
26725 as operand 3 predicate.
26726 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
26727 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
26728 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
26729 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
26730 Copy operand 1 to operand 0.
26731 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
26732 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
26733 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
26735 2021-04-23 Alex Coplan <alex.coplan@arm.com>
26737 PR rtl-optimization/100230
26738 * early-remat.c (early_remat::sort_candidates): Use delete[]
26739 instead of delete for array allocated with new[].
26741 2021-04-23 Richard Biener <rguenther@suse.de>
26743 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
26744 (capture_info::capture_info): Likewise.
26745 (capture_info::walk_match): Likewise.
26746 (expr::gen_transform): Likewise.
26747 (dt_simplify::gen_1): Likewise.
26748 * gimple-match-head.c (maybe_resimplify_conditional_op):
26749 Remove VEC_COND_EXPR special-casing.
26750 (gimple_simplify): Likewise.
26751 * gimple.c (gimple_could_trap_p_1): Adjust.
26752 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
26753 to participate in PRE.
26755 2021-04-23 Richard Biener <rguenther@suse.de>
26757 * cfganal.c (connect_infinite_loops_to_exit): First call
26758 add_noreturn_fake_exit_edges.
26759 * ipa-sra.c (process_scan_results): Do not call the now redundant
26760 add_noreturn_fake_exit_edges.
26761 * predict.c (tree_estimate_probability): Likewise.
26762 (rebuild_frequencies): Likewise.
26763 * store-motion.c (one_store_motion_pass): Likewise.
26765 2021-04-23 Richard Biener <rguenther@suse.de>
26767 PR tree-optimization/100222
26768 * predict.c (pass_profile::execute): Remove redundant call to
26769 mark_irreducible_loops.
26770 (report_predictor_hitrates): Likewise.
26772 2021-04-23 Richard Biener <rguenther@suse.de>
26774 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
26775 valid_gimple_rhs_p by instead gimplifying to one.
26777 2021-04-23 Richard Biener <rguenther@suse.de>
26779 PR tree-optimization/99971
26780 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
26781 Always use TBAA for loads.
26783 2021-04-23 liuhongt <hongtao.liu@intel.com>
26786 * config/i386/i386-options.c (ix86_option_override_internal):
26787 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
26788 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
26789 by target attribute.
26791 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
26793 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
26795 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
26797 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
26799 * config.gcc (powerpc-ibm-aix6.*): Remove.
26800 * config/rs6000/aix61.h: Delete.
26802 2021-04-22 Martin Liska <mliska@suse.cz>
26804 PR testsuite/100159
26805 PR testsuite/100192
26806 * builtins.c (expand_builtin): Fix typos and missing comments.
26807 * dwarf2out.c (gen_subprogram_die): Likewise.
26808 (gen_struct_or_union_type_die): Likewise.
26810 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
26813 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
26814 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
26816 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
26818 * config/i386/darwin.h (TARGET_64BIT): Remove definition
26819 based on TARGET_ISA_64BIT.
26820 (TARGET_64BIT_P): Remove definition based on
26821 TARGET_ISA_64BIT_P().
26823 2021-04-21 Martin Liska <mliska@suse.cz>
26826 2021-04-21 Martin Liska <mliska@suse.cz>
26828 * lto-wrapper.c (cpuset_popcount): Remove.
26829 (init_num_threads): Remove and use hardware_concurrency.
26831 2021-04-21 Martin Liska <mliska@suse.cz>
26834 * main.c (main): Call toplev::finalize in CHECKING_P mode.
26835 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
26836 when incremental LTO linking happens.
26838 2021-04-21 Martin Liska <mliska@suse.cz>
26840 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
26841 makeserver cannot be detected, then use -flto=N fallback.
26843 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
26845 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
26846 default to yes for aarch64-linux-gnu.
26847 * configure: Regenerate.
26849 2021-04-21 Martin Liska <mliska@suse.cz>
26851 * lto-wrapper.c (cpuset_popcount): Remove.
26852 (init_num_threads): Remove and use hardware_concurrency.
26854 2021-04-21 Martin Liska <mliska@suse.cz>
26856 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
26857 which remains to be '(... || 0)' and clang complains about it.
26858 * dwarf2out.c (AT_vms_delta): Declare conditionally.
26859 (add_AT_vms_delta): Likewise.
26860 * tree.c (fld_simplified_type): Use rather more common pattern
26861 for disabling of something (#if 0).
26862 (get_tree_code_name): Likewise.
26863 (verify_type_variant): Likewise.
26865 2021-04-21 Martin Liska <mliska@suse.cz>
26867 * config/i386/i386-expand.c (decide_alignment): Use newly named
26868 macro TARGET_CPU_P.
26869 * config/i386/i386.c (ix86_decompose_address): Likewise.
26870 (ix86_address_cost): Likewise.
26871 (ix86_lea_outperforms): Likewise.
26872 (ix86_avoid_lea_for_addr): Likewise.
26873 (ix86_add_stmt_cost): Likewise.
26874 * config/i386/i386.h (TARGET_*): Remove.
26875 (TARGET_CPU_P): New macro.
26876 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
26877 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
26878 (swap_top_of_ready_list): Likewise.
26879 (ix86_atom_sched_reorder): Likewise.
26880 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
26881 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
26883 2021-04-21 Martin Liska <mliska@suse.cz>
26885 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
26887 (SET_TARGET_NO_SAHF): Likewise.
26888 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
26889 (SET_TARGET_PREFETCH_SSE): Likewise.
26890 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
26891 (SET_TARGET_NO_TUNE): Likewise.
26892 (TARGET_EXPLICIT_NO_80387_P): Likewise.
26893 (SET_TARGET_NO_80387): Likewise.
26895 * config/i386/i386.h (TARGET_*): Remove.
26896 * opth-gen.awk: Generate new used macros.
26898 2021-04-21 Martin Liska <mliska@suse.cz>
26900 * config/i386/i386.h (PTA_*): Remove.
26901 (enum pta_flag): New.
26902 (DEF_PTA): Generate PTA_* values from i386-isa.def.
26903 * config/i386/i386-isa.def: New file.
26905 2021-04-21 Alex Coplan <alex.coplan@arm.com>
26908 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
26909 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
26910 jump table targets.
26912 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
26914 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
26915 x86_64-*-* targets.
26916 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
26918 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
26919 (ix86_handle_option): Handle -mmwait.
26920 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
26921 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
26922 __builtin_ia32_monitor and __builtin_ia32_mwait.
26923 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
26924 (ix86_valid_target_attribute_inner_p): Likewise.
26925 (ix86_option_override_internal): Enable mwait/monitor
26926 instructions for -msse3.
26927 * config/i386/i386.h (TARGET_MWAIT): New.
26928 (TARGET_MWAIT_P): Likewise.
26929 * config/i386/i386.opt: Add -mmwait.
26930 * config/i386/mwaitintrin.h: New file.
26931 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
26932 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
26934 (@sse3_monitor_<mode>): Likewise.
26935 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
26936 * doc/extend.texi: Document mwait target attribute.
26937 * doc/invoke.texi: Document -mmwait.
26939 2021-04-21 Martin Liska <mliska@suse.cz>
26941 * config/i386/i386-options.c (DEF_ENUM): Remove it.
26942 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
26943 * config/i386/stringop.def (DEF_ENUM): Likewise.
26945 2021-04-21 Martin Liska <mliska@suse.cz>
26947 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
26948 of print_generic_expr.
26950 2021-04-21 Jakub Jelinek <jakub@redhat.com>
26952 PR rtl-optimization/100148
26953 * cprop.c (constprop_register): Use next_nondebug_insn instead of
26956 2021-04-21 Martin Liska <mliska@suse.cz>
26959 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
26960 free_dominance_info calls.
26962 2021-04-21 Richard Biener <rguenther@suse.de>
26964 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
26965 parameter (and assume it to be false).
26966 (fold_gimple_assign): Adjust, remove all callers of
26967 maybe_fold_reference calling it with is_lhs true.
26968 (gimple_fold_call): Likewise.
26969 (fold_stmt_1): Likewise.
26971 2021-04-21 Richard Biener <rguenther@suse.de>
26973 * fold-const.c (pedantic_non_lvalue_loc): Remove.
26974 (fold_binary_loc): Adjust.
26975 (fold_ternary_loc): Likewise.
26977 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
26979 PR middle-end/100130
26980 * varasm.c (get_block_for_decl): Make sure that any use of the
26981 retain attribute matches the section's retain flag.
26982 (switch_to_section): Check for retain mismatches even when
26983 changing sections, but do not warn if the given decl is the
26984 section's named.decl.
26985 (output_object_block): Pass the first decl in the block (if any)
26986 to switch_to_section.
26988 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
26990 * config/i386/i386-c.c (ix86_target_macros_internal): Define
26991 __CRC32__ for -mcrc32.
26992 * config/i386/i386-options.c (ix86_option_override_internal):
26993 Enable crc32 instruction for -msse4.2.
26994 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
26996 (sse4_2_crc32di): Likewise.
26997 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
27000 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
27003 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
27006 2021-04-20 Martin Liska <mliska@suse.cz>
27008 * doc/invoke.texi: Fix typo.
27009 * params.opt: Likewise.
27011 2021-04-20 Martin Liska <mliska@suse.cz>
27013 * doc/invoke.texi: Document new param.
27015 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
27017 PR tree-optimization/100081
27018 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
27019 rather than gori_compute_cache.
27020 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
27021 (range_def_chain::m_logical_depth): New member.
27022 (range_def_chain::range_def_chain): Initialize m_logical_depth.
27023 (range_def_chain::get_def_chain): Don't build defchains through more
27024 than LOGICAL_LIMIT logical expressions.
27025 * params.opt (param_ranger_logical_depth): New.
27027 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
27030 * config/arm/arm.c (arm_configure_build_target): Do not strip
27031 extended FPU/SIMD feature bits from the target ISA when -mfpu
27032 is specified (partial revert of r11-8168).
27034 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
27036 * params.opt (-param=openacc-kernels=): Add.
27037 * omp-oacc-kernels-decompose.cc
27038 (pass_omp_oacc_kernels_decompose::gate): Use it.
27039 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
27040 (--param): ... here, 'openacc-kernels'.
27042 2021-04-19 Martin Liska <mliska@suse.cz>
27045 * gengtype.c (finish_root_table): Align function arguments
27046 in between declaration and definition.
27048 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
27050 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
27051 frames larger than the SEH maximum frame size.
27053 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
27055 PR rtl-optimization/99927
27056 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
27057 is dead, just drop it.
27059 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
27062 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
27063 * doc/tm.texi: Regenerate.
27064 * doc/tm.texi.in (D language and ABI): Add @hook for
27065 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
27067 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
27069 * config/darwin-d.c (darwin_d_handle_target_object_format): New
27071 (darwin_d_register_target_info): New function.
27072 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27073 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
27075 (dragonfly_d_register_target_info): New function.
27076 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27077 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
27079 (freebsd_d_register_target_info): New function.
27080 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27081 * config/glibc-d.c (glibc_d_handle_target_object_format): New
27083 (glibc_d_register_target_info): New function.
27084 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27085 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
27087 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
27088 as handler for objectFormat key.
27089 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
27091 (winnt_d_register_target_info): New function.
27092 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27093 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
27095 (netbsd_d_register_target_info): New function.
27096 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27097 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
27099 (openbsd_d_register_target_info): New function.
27100 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27101 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
27102 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
27103 handler for objectFormat key.
27104 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
27106 (rs6000_d_register_target_info): Add
27107 rs6000_d_handle_target_object_format as handler for objectFormat key.
27108 * config/sol2-d.c (solaris_d_handle_target_object_format): New
27110 (solaris_d_register_target_info): New function.
27111 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
27113 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27116 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
27117 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
27119 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
27120 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
27121 the old and new alignment after applying MIN/MAX to it is different.
27123 2021-04-16 Tamar Christina <tamar.christina@arm.com>
27126 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
27127 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
27129 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
27131 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
27133 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
27134 this section and its subsections.
27136 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27139 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
27140 define_insn patterns.
27142 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
27144 PR rtl-optimization/98689
27145 * reg-notes.def (UNTYPED_CALL): New note.
27146 * combine.c (distribute_notes): Handle it.
27147 * emit-rtl.c (try_split): Likewise.
27148 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
27149 that calls with the note implicitly set all return value registers.
27150 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
27153 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
27155 PR rtl-optimization/99596
27156 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
27157 register accesses for const calls. Assume that pure functions
27158 can only read from global registers. Ignore cases in which
27159 the stack pointer has been marked global.
27161 2021-04-16 Jakub Jelinek <jakub@redhat.com>
27164 * tree-vect-loop.c (vect_transform_loop): Don't remove just
27165 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
27166 them by their last argument.
27168 2021-04-15 Martin Liska <mliska@suse.cz>
27170 * doc/invoke.texi: Other params don't use it, remove it.
27172 2021-04-15 Richard Biener <rguenther@suse.de>
27174 * gimple-builder.h: Add deprecation note.
27176 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
27179 * attribs.h (restrict_type_identity_attributes_to): Declare.
27180 * attribs.c (restrict_type_identity_attributes_to): New function.
27182 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
27185 * attribs.h (affects_type_identity_attributes): Declare.
27186 * attribs.c (remove_attributes_matching): New function.
27187 (affects_type_identity_attributes): Likewise.
27189 2021-04-15 Jakub Jelinek <jakub@redhat.com>
27192 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
27193 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
27194 ZERO_EXTEND, SIGN_EXTEND or AND.
27196 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27198 PR rtl-optimization/99929
27199 * rtl.h (same_vector_encodings_p): New function.
27200 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
27201 * cselib.c (rtx_equal_for_cselib_1): Likewise.
27202 * jump.c (rtx_renumbered_equal_p): Likewise.
27203 * lra-constraints.c (operands_match_p): Likewise.
27204 * reload.c (operands_match_p): Likewise.
27205 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
27207 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27209 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
27210 more information about variable-length CONST_VECTORs.
27212 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
27214 PR rtl-optimization/100066
27215 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
27216 ordered modes when choosing splitting mode for hard reg.
27218 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
27221 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
27223 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
27225 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
27227 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
27228 for mask operand types.
27229 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
27230 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
27232 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
27234 * config/s390/s390.c (s390_const_operand_ok): Check the new
27235 operand types and generate a list of valid values.
27237 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27239 * doc/tm.texi: Regenerate.
27240 * doc/tm.texi.in (D language and ABI): Add @hook for
27241 TARGET_D_REGISTER_OS_TARGET_INFO.
27243 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27245 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
27247 (aarch64_d_register_target_info): New function.
27248 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
27250 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
27252 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
27253 (arm_d_register_target_info): New function.
27254 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
27255 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27256 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
27257 (ix86_d_register_target_info): New function.
27258 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
27259 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27260 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
27261 (mips_d_register_target_info): New function.
27262 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
27263 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27264 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
27265 (pa_d_register_target_info): New function.
27266 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
27267 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27268 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
27270 (riscv_d_register_target_info): New function.
27271 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
27272 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27273 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
27275 (rs6000_d_register_target_info): New function.
27276 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
27278 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27279 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
27280 (s390_d_register_target_info): New function.
27281 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
27282 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27283 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
27285 (sparc_d_register_target_info): New function.
27286 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
27287 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
27288 * doc/tm.texi: Regenerate.
27289 * doc/tm.texi.in (D language and ABI): Add @hook for
27290 TARGET_D_REGISTER_CPU_TARGET_INFO.
27292 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
27294 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
27295 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
27296 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
27297 * doc/tm.texi: Regenerate.
27298 * doc/tm.texi.in (D language and ABI): Add @hook for
27299 TARGET_D_HAS_STDCALL_CONVENTION.
27301 2021-04-14 Richard Biener <rguenther@suse.de>
27303 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
27304 VEC_COND_EXPRs have a gimple_val condition.
27305 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
27306 can no longer have a GENERIC condition.
27308 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
27311 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
27312 from the isa_delta when -mfpu has been used.
27313 (arm_options_perform_arch_sanity_checks): It's the architecture that
27314 lacks an FPU not the processor.
27316 2021-04-13 Richard Biener <rguenther@suse.de>
27318 PR tree-optimization/100053
27319 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
27320 not use optimistic dominance queries for backedges to validate
27322 (dominated_by_p_w_unex): Add parameter to ignore executable
27323 state on backedges.
27324 (rpo_elim::eliminate_avail): Adjust.
27326 2021-04-13 Jakub Jelinek <jakub@redhat.com>
27329 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
27330 *aarch64_bfxilsi_extrdi): New define_insn patterns.
27332 2021-04-13 Jakub Jelinek <jakub@redhat.com>
27335 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
27336 outermode, return NULL if the result doesn't encode back to the
27337 original byte sequence.
27338 (simplify_gen_subreg): Don't create SUBREGs from constants to
27339 MODE_COMPOSITE_P outermode.
27341 2021-04-12 Jakub Jelinek <jakub@redhat.com>
27343 PR rtl-optimization/99905
27344 * combine.c (expand_compound_operation): If pos + len > modewidth,
27345 perform the right shift by pos in inner_mode and then convert to mode,
27346 instead of trying to simplify a shift of rtx with inner_mode by pos
27347 as if it was a shift in mode.
27349 2021-04-12 Jakub Jelinek <jakub@redhat.com>
27352 * combine.c (simplify_and_const_int_1): Don't optimize varop
27353 away if it has side-effects.
27355 2021-04-12 Martin Liska <mliska@suse.cz>
27357 * doc/extend.texi: Escape @smallexample content.
27359 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
27361 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
27362 alternative in order to load a DFP zero.
27364 2021-04-12 Martin Liska <mliska@suse.cz>
27366 * doc/extend.texi: Be more precise in documentation
27367 of symver attribute.
27369 2021-04-12 Martin Liska <mliska@suse.cz>
27372 * gimplify.c (gimplify_expr): Right now, we unpoison all
27373 variables before a goto <dest>. We should not do it if we are
27376 2021-04-12 Cui,Lili <lili.cui@intel.com>
27378 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
27380 * common/config/i386/i386-common.c (processor_names): Add
27382 (processor_alias_table): Add rocketlake.
27383 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
27384 INTEL_COREI7_ROCKETLAKE.
27385 * config.gcc: Add -march=rocketlake.
27386 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
27388 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
27389 (processor_cost_table): Add rocketlake cost.
27390 * config/i386/i386.h (ix86_size_cost) : Define
27392 (processor_type) : Add PROCESSOR_ROCKETLAKE.
27393 (PTA_ROCKETLAKE): Ditto.
27394 * doc/extend.texi: Add rocketlake.
27395 * doc/invoke.texi: Add rocketlake.
27397 2021-04-12 Cui,Lili <lili.cui@intel.com>
27399 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
27400 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
27401 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
27402 * doc/invoke.texi: Change alderlake ISA list.
27404 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
27406 PR middle-end/98088
27407 * omp-expand.c (expand_oacc_collapse_init): Update condition in
27410 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
27413 * config/i386/serializeintrin.h (_serialize): Defined as macro.
27415 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27418 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
27419 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
27421 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27423 PR middle-end/99989
27424 * gimple-ssa-warn-alloca.c
27425 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
27426 0 with integer precision unconditionally.
27428 2021-04-10 Jakub Jelinek <jakub@redhat.com>
27430 PR rtl-optimization/98601
27431 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
27432 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
27433 unaligned_mems handle VOIDmode like BLKmode.
27435 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
27438 * tree.c (free_lang_data_in_decl): Do not release body of
27439 declare_variant_alt.
27441 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27443 * config/aarch64/aarch64.c (aarch64_option_restore): If the
27444 architecture was specified explicitly and the tuning wasn't,
27445 tune for the architecture rather than the configured default CPU.
27447 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
27449 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
27450 as the temporary register.
27452 2021-04-09 Martin Liska <mliska@suse.cz>
27454 * doc/extend.texi: Move non-target attributes on the top level.
27456 2021-04-09 Martin Liska <mliska@suse.cz>
27458 * doc/invoke.texi: Document minimum and maximum value of the
27459 argument for both supported compression algorithms.
27461 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
27463 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
27464 TLS BSS before TLS data.
27465 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
27467 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27469 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
27471 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
27473 * match.pd: Extend vec_cond folds to handle shifts.
27475 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
27477 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
27480 2021-04-08 Alex Coplan <alex.coplan@arm.com>
27483 * config/arm/iterators.md (MVE_vecs): New.
27484 (V_elem): Also handle V2DF.
27485 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
27486 (*mve_vdup<mode>): ... this. Remove second alternative since
27487 vec_duplicate of const_int is not canonical RTL, and we don't
27488 want to match symbol_refs.
27489 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
27491 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
27493 * fold-const.c (fold_single_bit_test): Fix typo.
27494 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
27497 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
27499 PR tree-optimization/97513
27500 * tree-vect-slp.c (vect_add_slp_permutation): New function,
27502 (vectorizable_slp_permutation): ...here. Detect cases in which
27503 all VEC_PERM_EXPRs are guaranteed to have the same stepped
27504 permute vector and only generate one permute vector for that case.
27505 Extend that case to handle variable-length vectors.
27507 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
27509 PR tree-optimization/99873
27510 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
27511 (vect_build_slp_instance): Don't split store groups that could
27512 use IFN_STORE_LANES.
27514 2021-04-07 Jakub Jelinek <jakub@redhat.com>
27517 * varasm.c (output_constant_pool_contents): Don't strip name encoding
27518 from XSTR (desc->sym, 0) or from label before passing those to
27521 2021-04-07 Richard Biener <rguenther@suse.de>
27523 PR tree-optimization/99954
27524 * tree-loop-distribution.c: Include tree-affine.h.
27525 (generate_memcpy_builtin): Try using tree-affine to prove
27527 (loop_distribution::classify_builtin_ldst): Always classify
27530 2021-04-07 Richard Biener <rguenther@suse.de>
27532 PR tree-optimization/99947
27533 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
27534 steps vector to avoid pushing elements from the reallocated
27537 2021-04-07 Richard Biener <rguenther@suse.de>
27539 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
27540 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
27542 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
27544 (debug_vn_reference_ops): New.
27546 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
27548 PR tree-optimization/98736
27549 * tree-loop-distribution.c
27550 * (loop_distribution::bb_top_order_init):
27551 Compute RPO with programing order preserved by calling function
27552 rev_post_order_and_mark_dfs_back_seme.
27554 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
27557 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
27558 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
27560 (process_bb_lives): Don't update biggest mode of hard reg for
27561 implicit in multi-register group. Use the new functions for
27562 updating dead_set and unused_set by register notes.
27564 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
27566 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
27569 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
27571 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
27572 (skylake_memset): Likewise.
27573 (skylake_cost): Change CLEAR_RATIO to 17.
27574 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
27575 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
27576 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
27578 2021-04-06 Richard Biener <rguenther@suse.de>
27580 PR tree-optimization/99880
27581 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
27582 set vectorized defs of relevant PHIs.
27584 2021-04-06 Richard Biener <rguenther@suse.de>
27586 PR tree-optimization/99924
27587 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
27588 nodes w/o scalar stmts as visited.
27590 2021-04-06 Alex Coplan <alex.coplan@arm.com>
27593 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
27594 PCS for [su]fix_optab.
27596 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
27598 * config/darwin.c (machopic_legitimize_pic_address): Check
27599 that the current pic register is one of the hard reg set
27600 before setting liveness.
27602 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
27604 * config/darwin.c (machopic_legitimize_pic_address): Fix
27605 whitespace, remove unused code.
27607 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27609 PR tree-optimization/99882
27610 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
27613 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27615 PR rtl-optimization/99863
27616 * dse.c (replace_read): Drop regs_live argument. Instead of
27617 regs_live, use store_insn->fixed_regs_live if non-NULL,
27618 otherwise punt if insns sequence clobbers or sets any hard
27621 2021-04-03 Jakub Jelinek <jakub@redhat.com>
27624 * targhooks.h (default_print_patchable_function_entry_1): Declare.
27625 * targhooks.c (default_print_patchable_function_entry_1): New function,
27626 copied from default_print_patchable_function_entry with an added flags
27628 (default_print_patchable_function_entry): Rewritten into a small
27629 wrapper around default_print_patchable_function_entry_1.
27630 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
27632 (rs6000_print_patchable_function_entry): New function.
27634 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
27636 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
27638 2021-04-01 Jason Merrill <jason@redhat.com>
27641 * common.opt: Document v15 and v16.
27643 2021-04-01 Richard Biener <rguenther@suse.de>
27645 PR tree-optimization/99863
27646 * gimplify.c (gimplify_init_constructor): Recompute vector
27649 2021-04-01 Jakub Jelinek <jakub@redhat.com>
27651 * doc/extend.texi (symver attribute): Fix up syntax errors
27654 2021-04-01 Jakub Jelinek <jakub@redhat.com>
27656 PR tree-optimization/96573
27657 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
27658 also pointer types.
27660 2021-04-01 Richard Biener <rguenther@suse.de>
27662 PR tree-optimization/99856
27663 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
27664 precision to vector element precision.
27666 2021-04-01 Martin Jambor <mjambor@suse.cz>
27668 PR tree-optimization/97009
27669 * tree-sra.c (access_or_its_child_written): New function.
27670 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
27673 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
27676 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
27678 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
27681 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
27682 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
27683 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
27684 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
27685 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
27686 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
27688 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
27689 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
27690 (define_attr "prefixed"): Update initializer.
27692 2021-03-31 Jakub Jelinek <jakub@redhat.com>
27695 * dwarf2out.c (debug_ranges_dwo_section): New variable.
27696 (DW_RANGES_IDX_SKELETON): Define.
27697 (struct dw_ranges): Add begin_entry and end_entry members.
27698 (DEBUG_DWO_RNGLISTS_SECTION): Define.
27699 (add_ranges_num): Adjust r initializer for addition of *_entry
27701 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
27702 set idx to DW_RANGES_IDX_SKELETON.
27703 (use_distinct_base_address_for_range): New function.
27704 (index_rnglists): Don't set r->idx if it is equal to
27705 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
27706 r->end_entry for -gsplit-dwarf if those will be needed by
27708 (output_rnglists): Add DWO argument. If true, switch to
27709 debug_ranges_dwo_section rather than debug_ranges_section.
27710 Adjust l1/l2 label indexes. Only output the offset table when
27711 dwo is true and don't include in there the skeleton range
27712 entry if present. For -gsplit-dwarf, skip ranges that belong
27713 to the other rnglists section. Change return type from void
27714 to bool and return true if there are any range entries for
27715 the other section. For dwarf_split_debug_info use
27716 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
27717 entries instead of DW_RLE_start_end, DW_RLE_start_length and
27718 DW_RLE_base_address. Use use_distinct_base_address_for_range.
27719 (init_sections_and_labels): Initialize debug_ranges_dwo_section
27720 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
27721 and range_base_label indexes.
27722 (dwarf2out_finish): Call index_rnglists earlier before finalizing
27723 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
27724 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
27725 with different dwo arguments.
27726 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
27728 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27730 PR tree-optimization/98268
27731 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
27732 recompute_tree_invariant_for_addr_expr after successfully
27733 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
27735 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27737 PR tree-optimization/99726
27738 * tree-data-ref.c (create_intersect_range_checks_index): Bail
27739 out if there is more than one access function SCEV for the loop
27742 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27744 PR rtl-optimization/97141
27745 PR rtl-optimization/98726
27746 * emit-rtl.c (valid_for_const_vector_p): Return true for
27748 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
27749 poly_wide_int instead of a wide_int.
27750 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
27752 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
27753 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
27754 false for CONST_VECTORs that cannot be forced to memory.
27755 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
27756 is too complex to force to memory, build it up from individual
27759 2021-03-31 Jan Hubicka <jh@suse.cz>
27762 * cgraph.c (cgraph_node::release_body): Fix overactive check.
27764 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
27767 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
27770 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
27772 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
27773 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
27775 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
27776 "rep movsb/stosb" only for known sizes.
27777 * config/i386/i386-options.c (processor_cost_table): Use Ice
27778 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
27779 Rapids and Alder Lake.
27780 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
27781 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
27782 (icelake_memset): Likewise.
27783 (icelake_cost): Likewise.
27784 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
27787 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
27790 * config/aarch64/aarch64.c
27791 (aarch64_vectorize_preferred_vector_alignment): Query the size
27792 of the provided SVE vector; do not assume that all SVE vectors
27793 have the same size.
27795 2021-03-31 Jan Hubicka <jh@suse.cz>
27798 * cgraph.c (cgraph_node::release_body): Remove all callers and
27800 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
27801 * cgraphunit.c (cgraph_node::expand): And here.
27803 2021-03-31 Martin Liska <mliska@suse.cz>
27805 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
27806 and one negated condition.
27808 2021-03-31 Jakub Jelinek <jakub@redhat.com>
27809 Richard Sandiford <richard.sandiford@arm.com>
27812 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
27813 constraints on operands[2] and similarly 0 and rk constraints
27814 on operands[1] corresponding to that.
27816 2021-03-31 Jakub Jelinek <jakub@redhat.com>
27819 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
27820 linker doesn't support DWARF sections new in DWARF5.
27821 * config/i386/i386-options.c (ix86_option_override_internal): Default
27822 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
27824 * config.in: Regenerated.
27825 * configure: Regenerated.
27827 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27830 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
27831 available issue_info before using it.
27833 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27836 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
27839 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
27842 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
27843 (altivec_lvsl_reg_<mode>): ... this.
27844 (altivec_lvsr_reg): Change to ...
27845 (altivec_lvsr_reg_<mode>): ... this.
27846 * config/rs6000/predicates.md (vec_set_index_operand): New.
27847 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
27848 Enable 32bit variable vec_insert for all TARGET_VSX.
27849 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
27850 Enable 32bit variable vec_insert for p9 and above.
27851 (rs6000_expand_vector_set_var_p8): Rename to ...
27852 (rs6000_expand_vector_set_var_p7): ... this.
27853 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
27855 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
27856 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
27857 gen_altivec_lvsr_reg_di.
27859 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
27862 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
27863 (__rdtscp): Likewise.
27865 2021-03-30 Tamar Christina <tamar.christina@arm.com>
27867 PR tree-optimization/99825
27868 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
27869 Reject non-mult 2 lanes.
27871 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
27874 * config/arm/arm.c (arm_file_start): Fix emission of
27875 Tag_ABI_VFP_args attribute.
27877 2021-03-30 Richard Biener <rguenther@suse.de>
27879 PR tree-optimization/99824
27880 * stor-layout.c (set_min_and_max_values_for_integral_type):
27881 Assert the precision is within the bounds of
27882 WIDE_INT_MAX_PRECISION.
27883 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
27884 the outermost component ref only to lower the access size
27885 and initialize that from the access type.
27887 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
27890 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
27891 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
27893 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
27895 * config/aarch64/aarch64.md
27896 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
27897 attribute to disambiguate between SIMD and FP variants of the
27900 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
27902 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
27903 (analyze_ssa_name_flags): Fix typo in comment.
27905 2021-03-29 Alex Coplan <alex.coplan@arm.com>
27908 * config/aarch64/aarch64-sve-builtins.cc
27909 (function_builder::add_function): Add placeholder_p argument, use
27910 placeholder decls if this is set.
27911 (function_builder::add_unique_function): Instead of conditionally adding
27912 direct overloads, unconditionally add either a direct overload or a
27914 (function_builder::add_overloaded_function): Set placeholder_p if we're
27915 using C++ overloads. Use the obstack for string storage instead
27916 of relying on the tree nodes.
27917 (function_builder::add_overloaded_functions): Don't return early for
27918 m_direct_overloads: we need to add placeholders.
27919 * config/aarch64/aarch64-sve-builtins.h
27920 (function_builder::add_function): Add placeholder_p argument.
27922 2021-03-29 Richard Biener <rguenther@suse.de>
27924 PR tree-optimization/99807
27925 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
27926 assert below VEC_PERM handling.
27928 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27931 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
27932 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
27933 matching const_int 0.
27934 (move_lo_quad_internal_be_<mode>): Likewise.
27935 (move_lo_quad_<mode>): Update for the above.
27936 * config/aarch64/iterators.md (VQ_2E): Delete.
27938 2021-03-29 Jakub Jelinek <jakub@redhat.com>
27940 PR tree-optimization/99777
27941 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
27942 types other than scalar integral types.
27944 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
27946 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
27947 XCOFF TLS reloc decorations.
27949 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
27951 * doc/analyzer.texi (Analyzer Internals): Update link to
27952 "A Memory Model for Static Analysis of C Programs".
27954 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
27956 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
27957 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
27959 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
27960 (rs6000_special_round_type_align): Recursively check innermost first
27963 2021-03-26 Jakub Jelinek <jakub@redhat.com>
27966 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
27967 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
27968 assignment with drap_reg active, queue reg save for hfp with offset 0
27969 and flush queued reg saves. When handling a push with rule18,
27970 defer queueing reg save for hfp and just assert the offset is 0.
27971 (scan_trace): Assert that fde->rule18 is false.
27973 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
27976 * ira-costs.c (record_reg_classes): Put case with
27977 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
27978 * ira.c (ira_setup_alts): Ditto.
27979 * lra-constraints.c (process_alt_operands): Ditto.
27980 * recog.c (asm_operand_ok): Ditto.
27981 * reload.c (find_reloads): Ditto.
27983 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
27985 * config/aarch64/aarch64-protos.h
27986 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
27987 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
27988 * config/aarch64/aarch64.c (generic_addrcost_table): Update
27989 accordingly, using the same costs as for post_modify.
27990 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
27991 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
27992 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
27993 (a64fx_addrcost_table): Likewise.
27994 (neoversev1_addrcost_table): New.
27995 (neoversev1_tunings): Use neoversev1_addrcost_table.
27996 (aarch64_address_cost): Use the new post_modify costs for CImode
27999 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28001 * config/aarch64/aarch64.opt
28002 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
28003 * doc/invoke.texi: Document it.
28004 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
28005 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
28006 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
28007 (aarch64_vec_issue_info): New structures.
28008 (cpu_vector_cost): Write comments above the variables rather
28010 (cpu_vector_cost::issue_info): New member variable.
28011 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
28012 and tree-ssa-loop-niter.h.
28013 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
28014 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
28015 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
28016 (thunderx3t110_vector_cost): Initialize issue_info to null.
28017 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
28018 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
28019 (neoversev1_vector_cost): Use them.
28020 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
28021 (aarch64_vector_costs::saw_sve_only_op): New member variable.
28022 (aarch64_vector_costs::num_vector_iterations): Likewise.
28023 (aarch64_vector_costs::scalar_ops): Likewise.
28024 (aarch64_vector_costs::advsimd_ops): Likewise.
28025 (aarch64_vector_costs::sve_ops): Likewise.
28026 (aarch64_vector_costs::seen_loads): Likewise.
28027 (aarch64_simd_vec_costs_for_flags): New function.
28028 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
28029 Count the number of predicate operations required by SVE WHILE
28031 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
28032 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
28033 (aarch64_count_ops): Likewise.
28034 (aarch64_add_stmt_cost): Record whether see an SVE operation
28035 that cannot currently be implementing using Advanced SIMD.
28036 Record issue information about the scalar, Advanced SIMD
28037 and (where relevant) SVE versions of a loop.
28038 (aarch64_vec_op_count::dump): New function.
28039 (aarch64_sve_op_count::dump): Likewise.
28040 (aarch64_estimate_min_cycles_per_iter): Likewise.
28041 (aarch64_adjust_body_cost): If issue information is available,
28042 try to compare the issue rates of the various loop implementations
28043 and increase or decrease the vector body cost accordingly.
28045 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28047 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
28048 Assume a zero cost for induction phis.
28050 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28052 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
28054 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
28055 vector comparisons.
28057 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28059 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
28061 (aarch64_add_stmt_cost): Call it.
28063 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28065 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
28066 New tuning parameter.
28067 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
28068 (aarch64_estimated_sve_vq): New function.
28069 (aarch64_vector_costs::analyzed_vinfo): New member variable.
28070 (aarch64_vector_costs::is_loop): Likewise.
28071 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
28072 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
28073 (aarch64_record_potential_advsimd_unrolling): New function.
28074 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
28075 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
28076 aarch64_analyze_bb_vinfo on the first use of a costs structure.
28077 Detect whether we're vectorizing a loop for SVE that might be
28078 completely unrolled if it used Advanced SIMD instead.
28079 (aarch64_adjust_body_cost_for_latency): New function.
28080 (aarch64_finish_cost): Call it.
28082 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28084 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
28085 (aarch64_init_cost): New function.
28086 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
28087 the default unsigned[3].
28088 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
28089 (TARGET_VECTORIZE_INIT_COST): Override.
28090 (TARGET_VECTORIZE_FINISH_COST): Likewise.
28091 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
28093 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28095 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
28096 (neoversev1_sve_vector_cost): New cost structures.
28097 (neoversev1_vector_cost): Likewise.
28098 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
28100 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28102 * config/aarch64/aarch64-protos.h
28103 (sve_vec_cost::scatter_store_elt_cost): New member variable.
28104 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
28105 accordingly, taking the cost from the cost of a scalar_store.
28106 (a64fx_sve_vector_cost): Likewise.
28107 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
28109 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28111 * config/aarch64/aarch64-protos.h
28112 (simd_vec_cost::store_elt_extra_cost): New member variable.
28113 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28114 accordingly, using the vec_to_scalar cost for the new field.
28115 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28116 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28117 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28118 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28119 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28120 (thunderx3t110_advsimd_vector_cost): Likewise.
28121 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
28123 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28125 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
28126 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
28127 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
28128 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28129 accordingly, using zero for the new costs.
28130 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28131 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28132 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28133 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28134 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28135 (thunderx3t110_advsimd_vector_cost): Likewise.
28136 (aarch64_ld234_st234_vectors): New function.
28137 (aarch64_adjust_stmt_cost): Likewise.
28138 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
28139 the new vector costs.
28141 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28143 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
28144 derived class of simd_vec_cost. Add information about CLAST[AB]
28145 and FADDA instructions.
28146 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
28147 accordingly, using the vec_to_scalar costs for the new fields.
28148 (a64fx_sve_vector_cost): Likewise.
28149 (aarch64_reduc_type): New function.
28150 (aarch64_sve_in_loop_reduction_latency): Likewise.
28151 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
28152 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
28153 that occur in the loop body.
28154 (aarch64_add_stmt_cost): Update call accordingly.
28156 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
28158 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
28160 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
28161 above the fields rather than to the right.
28162 (simd_vec_cost::reduc_i8_cost): New member variable.
28163 (simd_vec_cost::reduc_i16_cost): Likewise.
28164 (simd_vec_cost::reduc_i32_cost): Likewise.
28165 (simd_vec_cost::reduc_i64_cost): Likewise.
28166 (simd_vec_cost::reduc_f16_cost): Likewise.
28167 (simd_vec_cost::reduc_f32_cost): Likewise.
28168 (simd_vec_cost::reduc_f64_cost): Likewise.
28169 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
28170 accordingly, using the vec_to_scalar_cost for the new fields.
28171 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
28172 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
28173 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
28174 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
28175 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
28176 (thunderx3t110_advsimd_vector_cost): Likewise.
28177 (aarch64_use_new_vector_costs_p): New function.
28178 (aarch64_simd_vec_costs): New function, split out from...
28179 (aarch64_builtin_vectorization_cost): ...here.
28180 (aarch64_is_reduction): New function.
28181 (aarch64_detect_vector_stmt_subtype): Likewise.
28182 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
28183 using the new vector costs.
28185 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28188 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
28189 TLS declarations as public.
28191 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28193 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
28194 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
28195 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
28196 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
28197 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
28198 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
28199 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
28200 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
28201 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
28203 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28206 * config.gcc (*-*-cygwin*): Add winnt-d.o
28207 (*-*-mingw*): Likewise.
28208 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
28209 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
28210 * config/i386/t-cygming: Add winnt-d.o.
28211 * config/i386/winnt-d.c: New file.
28213 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28215 * config/freebsd-d.c: Include memmodel.h.
28217 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
28220 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
28221 * config/t-openbsd: Add openbsd-d.o.
28222 * config/openbsd-d.c: New file.
28224 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
28226 PR tree-optimization/96974
28227 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
28228 with graceful exit.
28230 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28233 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28237 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
28238 always_inline in system headers.
28240 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
28242 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
28244 2021-03-25 Jakub Jelinek <jakub@redhat.com>
28247 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
28248 * fold-const.c (operand_compare::operand_equal_p): Don't compare
28249 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
28251 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
28255 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
28256 always_inline in system headers.
28258 2021-03-25 Richard Biener <rguenther@suse.de>
28260 PR tree-optimization/99746
28261 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
28262 the scalar stmt as patterned. Instead set up required things
28265 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
28267 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
28270 2021-03-24 Martin Liska <mliska@suse.cz>
28273 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
28275 * config/i386/i386-options.c (ix86_option_override_internal):
28276 Add run-time assert.
28278 2021-03-24 Martin Jambor <mjambor@suse.cz>
28281 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
28282 parameters with unknown type.
28283 (ipacp_value_safe_for_type): New function.
28284 (propagate_vals_across_arith_jfunc): Verify that the constant type
28285 can be used for a type of the formal parameter.
28286 (propagate_vals_across_ancestor): Likewise.
28287 (propagate_scalar_across_jump_function): Likewise. Pass the type
28288 also to propagate_vals_across_ancestor.
28290 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
28293 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
28295 (movmisalign<mode>_mve_load): Likewise.
28297 2021-03-24 Jakub Jelinek <jakub@redhat.com>
28300 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
28301 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
28303 2021-03-24 Alexandre Oliva <oliva@adacore.com>
28305 * doc/sourcebuild.texi (sysconf): New effective target.
28307 2021-03-24 Alexandre Oliva <oliva@adacore.com>
28309 * config/i386/predicates.md (reg_or_const_vec_operand): New.
28310 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
28311 the now *-prefixed insn_and_split, turn the splitter const vec
28312 into an input for the insn, making it an ignored immediate for
28313 non-split cases, and loaded into the scratch register
28316 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
28319 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
28320 Use define_relaxed_memory_constraint for them.
28322 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
28325 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
28326 colon to the diagnostic message.
28328 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
28330 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
28332 (try_fwprop_subst_note): Use set_info instead of insn_info.
28333 (try_fwprop_subst_pattern): Likewise.
28334 (try_fwprop_subst_notes): Likewise.
28335 (try_fwprop_subst): Likewise.
28336 (forward_propagate_subreg): Likewise.
28337 (forward_propagate_and_simplify): Likewise.
28338 (forward_propagate_into): Likewise.
28339 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
28341 (set_info::single_nondebug_insn_use): Likewise.
28342 (set_info::single_phi_use): Likewise.
28343 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
28345 (set_info::single_nondebug_insn_use): Likewise.
28346 (set_info::single_phi_use): Likewise.
28348 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
28350 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
28352 2021-03-23 Jakub Jelinek <jakub@redhat.com>
28355 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
28356 expand_mult to perform an unsigned rather than a signed
28359 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
28362 * config/i386/cpuid.h (__cpuid): Add __volatile__.
28363 (__cpuid_count): Likewise.
28365 2021-03-23 Richard Biener <rguenther@suse.de>
28367 PR tree-optimization/99721
28368 * tree-vect-slp.c (vect_slp_analyze_node_operations):
28369 Make sure we can schedule the node.
28371 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28373 * config/riscv/riscv.c (riscv_subword): Take endianness into
28374 account when calculating the byte offset.
28376 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28378 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
28379 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
28380 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
28381 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
28382 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
28383 new predicate "subreg_lowpart_operator"
28385 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28387 * config/riscv/riscv.c (riscv_swap_instruction): New function
28388 to byteswap an SImode rtx containing an instruction.
28389 (riscv_trampoline_init): Byteswap the generated instructions
28392 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28394 * common/config/riscv/riscv-common.c
28395 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
28396 * config.gcc (riscv32be-*, riscv64be-*): Set
28397 TARGET_BIG_ENDIAN_DEFAULT to 1.
28398 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
28399 depending on default endianness.
28400 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28401 * config/riscv/linux.h (LINK_SPEC): Likewise.
28402 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
28403 default endianness.
28404 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
28406 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
28408 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
28409 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
28410 * config/riscv/linux.h (LINK_SPEC): Likewise.
28411 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
28413 (BYTES_BIG_ENDIAN): Handle big endian.
28414 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
28415 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
28417 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
28419 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
28421 * regcprop.c (find_oldest_value_reg): Ask target whether
28422 different mode is fine for replacement register.
28424 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
28426 PR tree-optimization/99296
28427 * value-range.cc (irange::irange_set_1bit_anti_range): New.
28428 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
28429 * value-range.h (irange::irange_set_1bit_anti_range): New.
28431 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
28434 * config/aarch64/constraints.md (UtQ): Use
28435 define_relaxed_memory_constraint for it.
28436 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
28437 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28438 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
28439 (have_relaxed_memory_constraints): New static var.
28440 (relaxed_memory_start, relaxed_memory_end): Ditto.
28441 (add_constraint): Add arg is_relaxed_memory. Check name for
28442 relaxed memory. Set up is_relaxed_memory in constraint_data and
28443 have_relaxed_memory_constraints. Adjust calls.
28444 (choose_enum_order): Process relaxed memory.
28445 (write_tm_preds_h): Ditto.
28446 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28447 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
28448 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
28449 * ira-lives.c (single_reg_class): Use
28450 insn_extra_relaxed_memory_constraint.
28451 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
28452 * lra-constraints.c (valid_address_p): Use
28453 insn_extra_relaxed_memory_constraint instead of other memory
28455 (process_alt_operands): Process CT_RELAXED_MEMORY.
28456 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
28457 * recog.c (asm_operand_ok, preprocess_constraints): Process
28459 * reload.c (find_reloads): Ditto.
28460 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
28461 * stmt.c (parse_input_constraint): Use
28462 insn_extra_relaxed_memory_constraint.
28464 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
28467 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
28470 2021-03-22 Alex Coplan <alex.coplan@arm.com>
28473 * config/arm/arm-protos.h (neon_make_constant): Add generate
28474 argument to guard emitting insns, default to true.
28475 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
28476 CONST_VECTORs which neon_make_constant can't handle.
28477 (neon_vdup_constant): Add generate argument, avoid emitting
28478 insns if it's not set.
28479 (neon_make_constant): Plumb new generate argument through.
28480 * config/arm/constraints.md (Ui): New. Use it...
28481 * config/arm/mve.md (*mve_mov<mode>): ... here.
28482 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
28483 synthesize constants.
28485 2021-03-22 Richard Biener <rguenther@suse.de>
28487 * debug.h: Add deprecation warning.
28489 2021-03-22 Richard Biener <rguenther@suse.de>
28491 PR tree-optimization/99694
28492 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
28495 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
28498 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
28499 after type checking.
28501 2021-03-22 Jakub Jelinek <jakub@redhat.com>
28505 * dwarf2out.c (get_full_len): Use get_precision rather than
28507 (add_const_value_attribute): Make sure add_AT_wide argument has
28508 precision prec rather than some very wide one.
28510 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
28512 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
28513 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
28514 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
28515 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
28516 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
28517 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
28518 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
28519 *fix_trunc<mode>si2_internal): Fix empty split condition.
28520 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
28521 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
28522 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
28523 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
28525 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
28528 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
28529 Convert idx to DImode.
28530 (rs6000_expand_vector_set_var_p8): Likewise.
28532 2021-03-21 Jakub Jelinek <jakub@redhat.com>
28535 * dwarf2out.c (insert_float): Change return type from void to
28536 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
28537 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
28540 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
28543 * config/i386/i386.c (construct_container): Check cfun != NULL
28544 before accessing silent_p.
28546 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
28548 * asan.c: Fix typos in comments.
28550 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
28552 PR rtl-optimization/99680
28553 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
28554 (process_address_1): Check empty constraint before using
28557 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
28559 * config/rs6000/rs6000.c (power10_cost): New.
28560 (rs6000_option_override_internal): Set Power10 costs.
28561 (rs6000_issue_rate): Set Power10 issue rate.
28562 * config/rs6000/power10.md: Rewrite for Power10.
28564 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
28567 * lra-constraints.c (process_address_1): Don't use unknown
28568 constraint for address constraint.
28570 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
28573 * config.gcc (powerpc-*-darwin8): Delete the reference to
28574 the now removed darwin8.h.
28576 2021-03-19 Olivier Hainque <hainque@adacore.com>
28579 * config/vxworksae.h (VX_CPU_PREFIX): Define.
28581 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
28583 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
28585 2021-03-19 Tamar Christina <tamar.christina@arm.com>
28587 PR tree-optimization/99656
28588 * tree-vect-slp-patterns.c (linear_loads_p,
28589 complex_add_pattern::matches, is_eq_or_top,
28590 vect_validate_multiplication, complex_mul_pattern::matches,
28591 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
28592 * tree-vectorizer.h: (complex_load_perm_t): Removed.
28593 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
28594 complex_load_perm_t.
28596 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
28599 * config/i386/i386-options.c (ix86_init_machine_status): Set
28601 * config/i386/i386.c (init_cumulative_args): Set silent_p to
28603 (construct_container): Return early for return and argument
28604 errors if silent_p is true.
28605 * config/i386/i386.h (machine_function): Add silent_p.
28607 2021-03-19 Jakub Jelinek <jakub@redhat.com>
28610 * config/arm/constraints.md (Ds): New constraint.
28611 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
28612 constraint instead of w,Dm.
28614 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
28616 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
28619 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
28621 PR middle-end/99641
28622 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
28623 array type, do the computation of the current position in sizetype.
28625 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
28628 * lra-constraints.c (process_address_1): Use lookup_constraint
28629 only for a single constraint.
28631 2021-03-18 Martin Sebor <msebor@redhat.com>
28633 PR middle-end/99502
28634 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
28635 (inbounds_memaccess_p): ...to this. Check the ending offset of
28636 the accessed member.
28638 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
28640 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
28641 %> quote markers to error messages.
28642 (gcn_goacc_validate_dims): Likewise.
28643 (gcn_conditional_register_usage): Remove exclaimation mark from error
28645 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
28647 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
28649 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
28652 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
28653 Kito Cheng <kito.cheng@sifive.com>
28655 * config/riscv/riscv.c (riscv_block_move_straight): Change type
28656 to unsigned HOST_WIDE_INT for parameter and local variable with
28657 HOST_WIDE_INT type.
28658 (riscv_adjust_block_mem): Ditto.
28659 (riscv_block_move_loop): Ditto.
28660 (riscv_expand_block_move): Ditto.
28662 2021-03-18 Nick Clifton <nickc@redhat.com>
28664 * config/v850/v850.c (construct_restore_jr): Increase static
28666 (construct_save_jarl): Likewise.
28667 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
28669 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28671 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
28672 (aarch64_override_options_internal): Use it.
28673 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
28676 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
28678 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
28679 error message format issues.
28680 (nios2_option_override): Likewise.
28681 (nios2_expand_fpu_builtin): Likewise.
28682 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
28683 truncation warning.
28684 (nios2_expand_custom_builtin): More error message format fixes.
28685 (nios2_expand_rdwrctl_builtin): Likewise.
28686 (nios2_expand_rdprs_builtin): Likewise.
28687 (nios2_expand_eni_builtin): Likewise.
28688 (nios2_expand_builtin): Likewise.
28689 (nios2_register_custom_code): Likewise.
28690 (nios2_valid_target_attribute_rec): Likewise.
28691 (nios2_add_insn_asm): Fix uninitialized variable warning.
28693 2021-03-17 Jan Hubicka <jh@suse.cz>
28695 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
28696 of gather to match reality.
28697 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
28699 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
28701 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
28702 to compare against CC_REG rather than NE.
28704 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
28707 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
28708 inline assembly statements.
28709 (ix86_print_operand): Update 'P' handling for -fno-plt.
28711 2021-03-17 Tamar Christina <tamar.christina@arm.com>
28714 * config/aarch64/aarch64.c
28715 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
28717 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
28720 * config/rs6000/predicates.md (branch_comparison_operator): Allow
28721 ordered and unordered for CCFPmode, if flag_finite_math_only.
28723 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28726 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
28727 rather than ASHIFT.
28728 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
28730 2021-03-16 Martin Liska <mliska@suse.cz>
28733 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
28734 cl_optimization_compare function.
28736 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
28738 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
28741 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28744 * config/i386/i386.h (struct machine_function): Add
28745 has_explicit_vzeroupper bitfield.
28746 * config/i386/i386-expand.c (ix86_expand_builtin): Set
28747 cfun->machine->has_explicit_vzeroupper when expanding
28748 IX86_BUILTIN_VZEROUPPER.
28749 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
28750 Do the mode switching only when TARGET_VZEROUPPER, expensive
28751 optimizations turned on and not optimizing for size.
28752 (pass_insert_vzeroupper::gate): Enable even when
28753 cfun->machine->has_explicit_vzeroupper is set.
28755 2021-03-16 Jakub Jelinek <jakub@redhat.com>
28758 * config/aarch64/aarch64.c
28759 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
28760 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
28761 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
28763 2021-03-15 Richard Biener <rguenther@suse.de>
28765 PR tree-optimization/98834
28766 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
28767 subsetting by truncating the access size.
28769 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
28771 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
28772 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
28775 2021-03-15 Martin Liska <mliska@suse.cz>
28777 * spellcheck.c: Add missing comma in initialization.
28779 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
28781 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
28782 alternative 2 and alternative 1 with alternative 3 using
28783 YW register constraint.
28784 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
28785 using YW register constraint.
28786 (*vec_extractv16qi_zext): Ditto.
28787 (*vec_extractv4si): Merge alternatives 4 and 5
28788 using Yw register constraint.
28789 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
28791 2021-03-13 Martin Sebor <msebor@redhat.com>
28793 PR tree-optimization/99489
28794 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
28795 is not a call statement.
28797 2021-03-13 Jakub Jelinek <jakub@redhat.com>
28799 PR tree-optimization/99544
28800 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
28801 if for vector types multiplication can't be done in type's mode.
28803 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
28806 * config/sparc/constraints.md (w): Rename to...
28807 (W): ... this and ditch previous implementation.
28808 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
28809 (*movdf_insn_sp64): Likewise.
28810 (*mov<VM64:mode>_insn_sp64): Likewise.
28811 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
28813 (atomic_compare_and_swap_leon3_1): Likewise.
28814 (*atomic_compare_and_swapdi_v8plus): Likewise.
28815 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
28816 architecture and add missing address validity check during LRA.
28818 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
28821 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
28822 occuring for assumed-size arrays in use_device_{ptr,addr}.
28824 2021-03-12 Jakub Jelinek <jakub@redhat.com>
28827 * config/i386/constraints.md (YW): New internal constraint.
28828 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
28829 (*<sse2_avx2>_<insn><mode>3<mask_name>,
28830 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
28831 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
28833 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
28834 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
28835 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
28836 into one, use Yw instead of former x,v.
28837 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
28838 the last alternative.
28839 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
28840 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
28841 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
28842 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
28843 into one, use <v_Yw> instead of former x,v.
28844 (avx2_interleave_highv32qi<mask_name>,
28845 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
28846 constraints. Add && <mask_avx512bw_condition> to condition.
28847 (avx2_interleave_lowv32qi<mask_name>,
28848 vec_interleave_lowv16qi<mask_name>,
28849 avx2_interleave_highv16hi<mask_name>,
28850 vec_interleave_highv8hi<mask_name>,
28851 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
28852 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
28853 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
28854 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
28855 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
28856 Yw instead of v in constraints.
28857 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
28858 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
28859 instead of Yv in constraints.
28860 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
28861 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
28862 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
28863 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
28864 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
28866 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
28867 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
28868 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
28869 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
28870 two, one with just x, another isa avx512vl with v.
28872 2021-03-12 Martin Liska <mliska@suse.cz>
28874 * doc/invoke.texi: Add missing param documentation.
28876 2021-03-11 David Malcolm <dmalcolm@redhat.com>
28879 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
28880 analyzer/trimmed-graph.o.
28881 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
28882 feasibility checking to reflect new implementation.
28883 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
28885 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
28887 2021-03-11 David Malcolm <dmalcolm@redhat.com>
28889 * digraph.cc (selftest::test_shortest_paths): Update
28890 shortest_paths init for new param. Add test of
28891 SPS_TO_GIVEN_TARGET.
28892 * shortest-paths.h (enum shortest_path_sense): New.
28893 (shortest_paths::shortest_paths): Add "sense" param.
28894 Update for renamings. Generalize to use "sense" param.
28895 (shortest_paths::get_shortest_path): Rename param.
28896 (shortest_paths::m_sense): New field.
28897 (shortest_paths::m_prev): Rename...
28898 (shortest_paths::m_best_edge): ...to this.
28899 (shortest_paths::get_shortest_path): Update for renamings.
28900 Conditionalize flipping of path on sense of traversal.
28902 2021-03-11 David Malcolm <dmalcolm@redhat.com>
28904 * digraph.cc (selftest::test_shortest_paths): Add test coverage
28905 for paths from B and C.
28906 * shortest-paths.h (shortest_paths::shortest_paths): Handle
28907 unreachable nodes, rather than asserting.
28909 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
28912 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
28913 xcoff_tbss_section_name.
28914 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
28915 * xcoffout.c (xcoff_tbss_section_name): Delete.
28916 * xcoffout.h (xcoff_tbss_section_name): Delete.
28918 2021-03-11 Richard Biener <rguenther@suse.de>
28920 PR tree-optimization/99523
28921 * tree-cfg.c (dump_function_to_file): Dump SSA names
28922 w/o identifier to the decls section as well, not only those
28923 without a VAR_DECL.
28925 2021-03-11 Jakub Jelinek <jakub@redhat.com>
28928 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
28929 function calls with lhs fail if the lhs don't have compatible types.
28931 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
28933 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
28934 Change FRAME_POINTER_REGNUM to correspond to a new faked
28935 register faked_fp, part of GENNONACR_REGS like faked_ap.
28936 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
28937 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
28938 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
28939 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
28940 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
28942 (CRIS_REAL_FP_REGNUM): New constant.
28943 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
28944 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
28945 (cris_initial_elimination_offset): Handle elimination changes
28946 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
28947 and add one from FRAME_POINTER_REGNUM to
28948 HARD_FRAME_POINTER_REGNUM.
28949 (cris_expand_prologue, cris_expand_epilogue): Emit code for
28950 hard_frame_pointer_rtx instead of frame_pointer_rtx.
28952 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
28955 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
28956 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
28958 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
28961 * lra-constraints.c (process_address_1): Don't check unknown
28962 constraint, use X for empty constraint.
28964 2021-03-10 Alex Coplan <alex.coplan@arm.com>
28966 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
28967 Fix typo in comment describing "is_ha" argument.
28969 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
28971 * doc/sourcebuild.texi: Document LRA target selector.
28973 2021-03-10 David Malcolm <dmalcolm@redhat.com>
28975 * doc/ux.texi: Add subsection contrasting interactive versus
28976 batch usage of GCC.
28978 2021-03-10 Joel Hutton <joel.hutton@arm.com>
28981 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
28983 (vectorizable_load): Fix gather load mask check condition.
28985 2021-03-10 Richard Biener <rguenther@suse.de>
28987 PR tree-optimization/99510
28988 * tree.c (check_aligned_type): Check that the candidate
28989 has TYPE_USER_ALIGN set instead of matching with the
28992 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
28994 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
28995 float and vector integer modes only if the mode is not larger.
28997 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
28999 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
29001 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29003 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
29005 * ira-lives.c (single_reg_class): Ditto.
29007 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
29009 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
29010 the architecture-specific rtems.h.
29011 (aarch64-*-rtems*): Likewise.
29012 (arm*-*-rtems*): Likewise.
29013 (epiphany-*-rtems*): Likewise.
29014 (riscv*-*-rtems*): Likewise.
29016 2021-03-09 Jakub Jelinek <jakub@redhat.com>
29018 PR tree-optimization/99305
29019 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
29020 before integer_all_onesp instead of vice versa.
29022 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
29024 * common/config/arm/arm-common.c (arm_config_default): Change type
29025 of 'i' to unsigned.
29027 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29030 * lra-constraints.c (process_address_1): Process constraint 'g'
29031 separately and digital constraints containing more one digit.
29033 2021-03-09 Nick Clifton <nickc@redhat.com>
29035 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
29036 (DWARF"_DEBUGGING_INFO): Define.
29038 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
29041 * calls.c (initialize_argument_information): When the argument
29042 is passed by reference, do not make a copy in a thunk only if
29043 the argument is already in memory. Remove redundant test for
29044 the case of callee copy.
29046 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
29049 * lra-constraints.c (process_address_1): Process 0..9 constraints
29050 in process_address_1.
29052 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
29054 * config/s390/s390.c (struct s390_processor processor_table):
29055 Binutils name string must not be empty.
29057 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
29059 * config/arc/arc.c (arc_attr_type): Remove function.
29061 2021-03-09 Martin Liska <mliska@suse.cz>
29064 * config/i386/i386-options.c (ix86_option_override_internal):
29065 Set isa_flags for OPTS argument and not for the global
29068 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
29070 * config/rs6000/predicates.md (ds_form_mem_operand): Check
29073 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
29076 * config/rs6000/predicates.md (ds_form_mem_operand) New
29078 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
29079 ds_form_mem_operand in ld/lwa patterns.
29080 * config/rs6000/fusion.md: Regenerate file.
29082 2021-03-08 Martin Sebor <msebor@redhat.com>
29084 PR middle-end/98266
29085 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
29086 (array_bounds_checker::check_array_bounds): Call it.
29088 2021-03-08 Martin Sebor <msebor@redhat.com>
29090 PR middle-end/97631
29091 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
29092 (handle_builtin_stxncpy_strncat): Rename locals. Determine
29093 destination size from allocation calls. Issue a more appropriate
29095 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
29096 (handle_builtin_memset): Same.
29098 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
29101 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
29102 to ensure we do not have an Altivec style address.
29103 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
29104 an Altivec style address.
29105 (*vsx_le_perm_store_<mode>): Likewise.
29106 (splitters after *vsx_le_perm_store_<mode>): Likewise.
29107 (vsx_load_<mode>): Disable special expander if passed an Altivec
29109 (vsx_store_<mode>): Likewise.
29111 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29114 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
29115 (aarch64_simd_shift_imm_vec_hi): Likewise.
29116 (aarch64_simd_shift_imm_vec_si): Likewise.
29117 (aarch64_simd_shift_imm_vec_di): Likewise.
29118 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
29119 predicate from above.
29120 (aarch64_shrn<mode>_insn_be): Likewise.
29121 (aarch64_rshrn<mode>_insn_le): Likewise.
29122 (aarch64_rshrn<mode>_insn_be): Likewise.
29123 (aarch64_shrn2<mode>_insn_le): Likewise.
29124 (aarch64_shrn2<mode>_insn_be): Likewise.
29125 (aarch64_rshrn2<mode>_insn_le): Likewise.
29126 (aarch64_rshrn2<mode>_insn_be): Likewise.
29128 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
29131 * lra-constraints.c (skip_contraint_modifiers): New function.
29132 (process_address_1): Use it before lookup_constraint call.
29134 2021-03-08 Martin Liska <mliska@suse.cz>
29137 * config/i386/i386-options.c (ix86_option_override_internal):
29138 Enable UINTR and HRESET for -march that supports it.
29140 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
29142 * config/s390/s390.c (f_constraint_p): New function.
29143 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
29144 (TARGET_MD_ASM_ADJUST): Likewise.
29146 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
29149 * tree-nested.c (convert_local_reference_stmt): Avoid calling
29150 lookup_field_for_decl for Fortran module (= namespace context).
29152 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
29154 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
29155 comparison with arithmetic right shift.
29156 (s390_expand_vcond): No need for a force_reg anymore.
29157 s390_vec_compare will do it.
29158 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
29159 immediate operands.
29161 2021-03-07 Jakub Jelinek <jakub@redhat.com>
29164 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
29165 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
29167 * config/i386/sse.md (v_Yw): New define_mode_attr.
29168 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
29169 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
29171 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
29172 xYw in constraints.
29174 2021-03-06 Julian Brown <julian@codesourcery.com>
29176 * tree-pretty-print.c (dump_generic_node): Emit non-generic
29177 address space info for aggregates.
29179 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
29181 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
29183 2021-03-05 Jakub Jelinek <jakub@redhat.com>
29185 PR middle-end/99322
29186 * tree-cfg.c (bb_to_omp_idx): New variable.
29187 (execute_build_cfg): Release the bb_to_omp_idx vector after
29188 cleanup_tree_cfg returns.
29189 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
29190 for bb_to_omp_idx being a vec<int> instead of pointer to array
29192 (make_edges): Remove bb_to_omp_idx local variable, don't pass
29193 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
29194 vec<int> instead of pointer to array of ints and don't free/release
29196 (remove_bb): When removing a bb and placing forced label somewhere
29197 else, ensure it is put into the same OpenMP region during cfg
29198 pass if possible or to entry successor as fallback. Unregister
29199 bb from bb_to_omp_idx.
29201 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
29204 * lra-constraints.c (process_address_1): Skip decomposing address
29205 for asm insn operand with unknown constraint.
29207 2021-03-05 Martin Jambor <mjambor@suse.cz>
29210 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
29211 corresponding speculative edges if we are about to resolve
29212 sepculation. Make edge direct (and so resolve speculations) before
29213 removing it from call_site_hash.
29214 (cgraph_edge::make_direct): Relax the initial assert to allow calling
29215 the function on speculative direct edges.
29217 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
29219 PR rtl-optimization/99376
29220 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
29221 of low-order zero bits is too large, set the result to 0 directly.
29223 2021-03-04 Jakub Jelinek <jakub@redhat.com>
29225 PR middle-end/93235
29226 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
29227 SUBREG and a SUBREG to op_mode can't be created.
29229 2021-03-04 Alex Coplan <alex.coplan@arm.com>
29232 * config/aarch64/aarch64-sve-builtins.cc
29233 (function_resolver::require_vector_type): Handle error_mark_node.
29235 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
29237 * cfgexpand.c (expand_asm_loc): Pass new parameter.
29238 (expand_asm_stmt): Likewise.
29239 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
29241 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
29242 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
29243 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
29244 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
29245 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
29246 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
29247 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
29248 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
29249 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
29250 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
29251 * doc/tm.texi (md_asm_adjust): Likewise.
29252 * target.def (md_asm_adjust): Likewise.
29254 2021-03-04 Richard Biener <rguenther@suse.de>
29256 PR middle-end/97855
29257 * tree-pretty-print.c: Poison pp_printf.
29258 (dump_decl_name): Avoid use of pp_printf.
29259 (dump_block_node): Likewise.
29260 (dump_generic_node): Likewise.
29262 2021-03-04 Martin Sebor <msebor@redhat.com>
29264 PR middle-end/96963
29265 PR middle-end/94655
29266 * builtins.c (handle_array_ref): New helper.
29267 (handle_mem_ref): New helper.
29268 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
29269 into new helper functions. Correct a workaround for vectorized
29272 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
29274 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
29275 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
29276 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
29277 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
29278 attribute for Power10.
29279 * config/rs6000/mma.md (*movoo): Likewise.
29280 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
29281 (define_mode_attr bits): Add DD/TD modes.
29282 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
29283 store_conditionalpti): Update size attribute for Power10.
29285 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
29288 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
29289 -Wuninitialized, -Wmaybe-uninitialized.
29290 (wide-int.o-warn): Likewise.
29292 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
29294 * common/config/arm/arm-common.c: Include configargs.h.
29295 (arm_config_default): New function.
29296 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
29297 processors that do not support Thumb. Take into account the
29298 --with-mode configuration setting for selecting the default.
29299 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
29300 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
29302 2021-03-03 Martin Liska <mliska@suse.cz>
29304 PR gcov-profile/97461
29305 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
29307 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
29310 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29311 point back the hard frame pointer to its default location when the
29312 frame is larger than SEH_MAX_FRAME_SIZE.
29314 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29317 * config/i386/predicates.md (logic_operator): New define_predicate.
29318 * config/i386/i386.md (mov + mem using comm arith peephole2):
29319 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
29320 and the inner mode is [QH]Imode.
29322 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29325 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
29326 (new_loc_list): Clear end_entry.
29327 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
29328 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
29330 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
29331 initialize also end_entry.
29333 2021-03-03 Jakub Jelinek <jakub@redhat.com>
29336 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
29337 partitions, if in non-layout mode after reorder_blocks also move
29338 affected blocks to ensure a single partition transition.
29340 2021-03-03 Jason Merrill <jason@redhat.com>
29343 * cgraphunit.c (process_function_and_variable_attributes): Don't
29344 warn about flatten on an alias if the target also has it.
29345 * cgraph.h (symtab_node::get_alias_target_tree): New.
29347 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
29349 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
29350 period to symbol name.
29351 (tls_get_addr_internal<mode>): Same.
29353 2021-03-02 David Malcolm <dmalcolm@redhat.com>
29356 * diagnostic-show-locus.c
29357 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
29360 2021-03-02 Martin Sebor <msebor@redhat.com>
29362 PR middle-end/99276
29363 * builtins.c (warn_for_access): Remove stray warning text.
29365 2021-03-02 Martin Sebor <msebor@redhat.com>
29367 PR middle-end/99295
29368 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
29371 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29374 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
29375 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
29376 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
29377 DW_MACRO_define_strx and DW_MACRO_undef_strx.
29378 (save_macinfo_strings): Use DW_MACRO_*_str* even with
29379 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
29380 DW_MACRO_undef_strx.
29382 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29384 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
29386 (BT_FN_V8HI_V8HI_UINT): Likewise.
29387 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
29388 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
29389 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
29390 New builtin definitions.
29391 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
29392 vector extension version.
29393 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
29394 available with current -march level.
29395 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
29396 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
29397 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
29398 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
29399 (vec_extend_to_fp32_lo): Likewise.
29400 (vec_round_from_fp32): Likewise.
29401 (vec_convert_to_fp16): Likewise.
29402 (vec_convert_from_fp16): Likewise.
29403 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
29404 (vclfnls_v8hi): Likewise.
29405 (vcrnfs_v8hi): Likewise.
29406 (vcfn_v8hi): Likewise.
29407 (vcnf_v8hi): Likewise.
29409 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
29411 * common/config/s390/s390-common.c (processor_flags_table): New entry.
29412 * config.gcc: Enable arch14 for --with-arch and --with-tune.
29413 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
29414 arch14 for unknown CPU models.
29415 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
29416 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
29417 (s390_get_sched_attrmask): Likewise.
29418 (s390_get_unit_mask): Likewise.
29419 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
29420 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
29421 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
29422 (TARGET_NNPA_P): New macro definitions.
29423 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
29424 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
29426 2021-03-02 Jakub Jelinek <jakub@redhat.com>
29428 PR middle-end/95757
29429 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
29430 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
29431 != 1 comparisons if name is lhs of a comparison.
29433 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
29437 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
29438 * config/darwin.c (darwin_should_restore_cfa_state): New.
29439 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
29440 * doc/tm.texi: Regenerated.
29441 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
29442 * dwarf2cfi.c (connect_traces): If the target requests, restore
29443 the CFA expression after a DW_CFA_restore.
29444 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
29446 2021-03-01 Martin Liska <mliska@suse.cz>
29449 * optc-save-gen.awk: Add 4 more exceptions.
29451 2021-03-01 Nathan Sidwell <nathan@acm.org>
29454 * tree.h (TYPE_ALIGN_RAW): New accessor.
29455 (TYPE_ALIGN): Use it.
29457 2021-03-01 Jan Hubicka <jh@suse.cz>
29460 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
29462 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
29465 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
29466 point the hard frame pointer to the SSE register save area instead
29467 of the general register save area. Perform only minimal adjustment
29468 for small frames if it is initially not correctly aligned.
29469 (ix86_expand_prologue): Remove early saves for a SEH target.
29470 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
29472 2021-02-28 Jakub Jelinek <jakub@redhat.com>
29475 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
29476 typo - referneced -> referenced.
29477 * tree.c (component_ref_size): Fix comment typo -
29478 refernce -> reference.
29479 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
29480 traling -> trailing.
29481 (aliasing_component_refs_p): Fix comment typos -
29482 refernce -> reference and refernece -> reference and
29483 traling -> trailing.
29484 (nonoverlapping_refs_since_match_p): Fix comment typo -
29485 referneces -> references.
29486 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
29487 referneces -> references.
29489 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
29491 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
29492 diagnostic message to avoid use of a contraction and format
29495 2021-02-27 Jakub Jelinek <jakub@redhat.com>
29498 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
29499 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
29500 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
29501 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
29504 2021-02-26 Richard Biener <rguenther@suse.de>
29506 PR middle-end/99281
29507 * expr.c (store_field): For calls with return-slot optimization
29508 and addressable return type expand the store directly.
29510 2021-02-26 Richard Biener <rguenther@suse.de>
29513 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
29515 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
29518 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
29521 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
29523 * config.gcc: Add rs6000-pcrel-opt.o.
29524 * config/rs6000/rs6000-pcrel-opt.c: New file.
29525 * config/rs6000/pcrel-opt.md: New file.
29526 * config/rs6000/predicates.md: Add d_form_memory predicate.
29527 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
29528 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
29529 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
29530 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
29531 and make_pass_pcrel_opt().
29532 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
29533 (rs6000_option_override_internal): Add pcrel-opt.
29534 (rs6000_delegitimize_address): Support pcrel-opt.
29535 (rs6000_opt_masks): Add pcrel-opt.
29536 (pcrel_opt_valid_mem_p): New function.
29537 (reg_to_non_prefixed): Make global.
29538 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
29539 (output_pcrel_opt_reloc): New function.
29540 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
29541 (pcrel_extern_addr): Set loads_extern_addr.
29542 Add include for pcrel-opt.md.
29543 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
29544 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
29547 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
29550 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
29551 If TARGET_64BIT and dest is SUBREG, we check the width, if it
29552 equal to SImode, we use SImode operation, just like what we are
29555 2021-02-26 Marek Polacek <polacek@redhat.com>
29557 * builtins.c (warn_for_access): Fix typos.
29559 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
29561 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
29562 mark in front of the immediate quantity.
29563 (<optab>_rolsi3_uxtw): Likewise.
29565 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
29568 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
29569 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
29570 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
29571 address and disable when the FPCXT is not available.
29572 (nonsecure_call_value_reg_thumb2): Likewise.
29574 2021-02-25 Nathan Sidwell <nathan@acm.org>
29577 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
29579 2021-02-25 Tamar Christina <tamar.christina@arm.com>
29581 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
29583 2021-02-25 Richard Biener <rguenther@suse.de>
29585 PR tree-optimization/99253
29586 * tree-vect-loop.c (check_reduction_path): First compute
29587 code, then verify out-of-loop uses.
29589 2021-02-25 Jakub Jelinek <jakub@redhat.com>
29592 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
29594 2021-02-25 Jakub Jelinek <jakub@redhat.com>
29596 PR tree-optimization/80635
29597 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
29598 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
29599 has mode precision.
29601 2021-02-25 Richard Biener <rguenther@suse.de>
29603 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
29604 load_map population.
29605 (vect_match_slp_patterns_2): Revert part of last change.
29606 (vect_analyze_slp): Do not interleave optimize_load_redistribution
29607 with pattern detection but do it afterwards. Dump the
29608 whole SLP graph after pattern recognition and load
29609 redistribution optimization finished.
29611 2021-02-24 Jakub Jelinek <jakub@redhat.com>
29614 * omp-low.c (struct omp_context): Add teams_nested_p and
29615 nonteams_nested_p members.
29616 (scan_omp_target): Diagnose teams nested inside of target with other
29617 directives strictly nested inside of the same target.
29618 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
29619 ctx->nonteams_nested_p as needed.
29621 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
29623 PR inline-asm/99123
29624 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
29626 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
29628 * config/cris/cris.c (cris_expand_prologue): Set
29629 current_function_static_stack_size, if flag_stack_usage_info.
29631 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
29633 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
29634 (rs6000_final_prescan_insn): Adjust.
29635 (rs6000_asm_output_opcode): Likewise.
29637 2021-02-24 Martin Sebor <msebor@redhat.com>
29639 PR middle-end/97172
29640 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
29641 from function arguments.
29643 2021-02-24 Tamar Christina <tamar.christina@arm.com>
29645 PR tree-optimization/99220
29646 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
29647 node from cache when it's about to be deleted.
29649 2021-02-24 Jakub Jelinek <jakub@redhat.com>
29651 PR tree-optimization/99225
29652 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
29653 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
29654 build_int_cst (..., 1). Formatting fixes.
29656 2021-02-24 Tamar Christina <tamar.christina@arm.com>
29658 PR tree-optimization/99149
29659 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
29661 (vect_slp_reset_pattern): Remove.
29662 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
29663 (complex_mul_pattern::build, complex_fma_pattern::build,
29664 complex_fms_pattern::build): Fix ref counts.
29665 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
29666 when node is being deleted.
29667 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
29668 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
29670 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
29672 2021-02-24 Matthias Klose <doko@ubuntu.com>
29675 2020-12-07 Matthias Klose <doko@ubuntu.com>
29677 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
29678 and ENABLE_RTL_FLAG_CHECKING.
29680 2021-02-24 Richard Biener <rguenther@suse.de>
29683 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
29685 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
29687 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
29688 (vsx_assemble_pair): ...to this.
29689 (*mma_assemble_pair): Rename from this...
29690 (*vsx_assemble_pair): ...to this.
29691 (mma_disassemble_pair): Rename from this...
29692 (vsx_disassemble_pair): ...to this.
29693 (*mma_disassemble_pair): Rename from this...
29694 (*vsx_disassemble_pair): ...to this.
29695 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
29696 BU_COMPAT): New macros.
29697 (mma_assemble_pair): Rename from this...
29698 (vsx_assemble_pair): ...to this.
29699 (mma_disassemble_pair): Rename from this...
29700 (vsx_disassemble_pair): ...to this.
29701 (mma_assemble_pair): New compatibility built-in.
29702 (mma_disassemble_pair): Likewise.
29703 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
29704 (RS6000_BUILTIN_COMPAT): Define.
29705 (bdesc_compat): New.
29706 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
29707 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
29708 and VSX_BUILTIN_ASSEMBLE_PAIR.
29709 (rs6000_init_builtins): Register compatibility built-ins.
29710 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
29711 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
29712 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
29713 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
29714 (__builtin_vsx_assemble_pair): ...to this.
29715 (__builtin_mma_disassemble_pair): Rename from this...
29716 (__builtin_vsx_disassemble_pair): ...to this.
29718 2021-02-23 Martin Liska <mliska@suse.cz>
29721 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
29722 with different alignment. That leads to an invalid red zone
29723 size allocated in runtime.
29725 2021-02-23 Jakub Jelinek <jakub@redhat.com>
29727 PR tree-optimization/99204
29728 * fold-const.c (fold_read_from_constant_string): Check that
29729 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
29731 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
29732 Kewen Lin <linkw@gcc.gnu.org>
29734 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
29735 (rotl<mode>3_insert_3): ...this.
29736 (plus_ior_xor): New code_iterator.
29737 (define_split for GPR rl*imi): New splitter.
29738 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
29739 for integer merging.
29741 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
29743 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
29745 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
29746 into a register when the above is enabled.
29747 * config/aarch64/aarch64.c (neoversev1_tunings):
29748 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
29749 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
29751 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
29753 * config/cris/cris.c (cris_print_operand) <'T'>: Change
29754 valid operand from is now an addi mult-value to shift-value.
29755 * config/cris/cris.md (*addi): Change expression of scaled
29756 operand from mult to ashift.
29757 * config/cris/cris.md (*addi_reload): New insn_and_split.
29759 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
29762 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
29763 hook_bool_const_tree_hwi_hwi_const_tree_true.
29764 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
29766 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
29768 PR rtl-optimization/98791
29769 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
29770 for unordered modes.
29772 2021-02-22 Martin Liska <mliska@suse.cz>
29774 * tree-inline.c (inline_forbidden_p): Set
29775 inline_forbidden_reason.
29777 2021-02-22 Richard Biener <rguenther@suse.de>
29779 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
29782 2021-02-22 Richard Biener <rguenther@suse.de>
29784 PR tree-optimization/99165
29785 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
29786 Accumulate changed to ret.
29788 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
29791 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
29793 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
29795 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
29798 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
29800 (trunctf<DFP_ALL:mode>2): Likewise.
29801 (trunctdtf2_vr): Likewise.
29802 (trunctdtf2): Likewise.
29803 (extend<DFP_ALL:mode>tf2_vr): Likewise.
29804 (extend<DFP_ALL:mode>tf2): Likewise.
29805 (extendtftd2_vr): Likewise.
29806 (extendtftd2): Likewise.
29808 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
29810 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
29811 add memory alternative.
29812 (tf_to_fprx2): New pattern.
29814 2021-02-19 Martin Sebor <msebor@redhat.com>
29817 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
29818 (attr_access::free_lang_data): Remove a spurious test.
29820 2021-02-19 Nathan Sidwell <nathan@acm.org>
29822 * doc/invoke.texi (flang-info-module-read): Document.
29824 2021-02-19 Martin Liska <mliska@suse.cz>
29826 PR translation/99167
29827 * params.opt: Fix typo.
29829 2021-02-19 Richard Biener <rguenther@suse.de>
29831 PR middle-end/99122
29832 * tree-inline.c (inline_forbidden_p): Do not inline functions
29833 with VLA arguments or return value.
29835 2021-02-19 Jakub Jelinek <jakub@redhat.com>
29838 * config/arm/arm.md (*stack_protect_combined_set_insn,
29839 *stack_protect_combined_test_insn): If force_const_mem result
29840 is not valid general operand, force its address into the destination
29843 2021-02-19 Jakub Jelinek <jakub@redhat.com>
29846 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
29847 pad or non-local label, put FORCED_LABELs from bb b after that label
29848 rather than before it.
29850 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
29853 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
29854 expand_vector_broadcast' to emit the vec_duplicate operand.
29856 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
29858 PR rtl-optimization/96264
29859 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
29862 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
29865 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
29866 looking up the retain attribute.
29867 (resolve_unique_section): Likewise.
29868 (get_variable_section): Likewise.
29869 (switch_to_section): Likewise. Warn when a symbol without the
29870 retain attribute and a symbol with the retain attribute are
29871 placed in the section with the same name, instead of the used
29873 * doc/extend.texi: Document the "retain" attribute.
29875 2021-02-18 Nathan Sidwell <nathan@acm.org>
29878 * doc/invoke.texi (flang-info-include-translate): Document header
29881 2021-02-18 Richard Biener <rguenther@suse.de>
29883 PR middle-end/99122
29884 * ipa-fnsummary.c (analyze_function_body): Set
29885 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
29886 * tree-inline.c (insert_init_debug_bind): Pass NULL for
29887 error_mark_node values.
29888 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
29890 (setup_one_parameter): Delay force_value_to_type until when
29893 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
29895 PR tree-optimization/99142
29896 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
29898 2021-02-18 Jakub Jelinek <jakub@redhat.com>
29900 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
29901 wide_int_bitmask::wide_int_bitmask (uint64_t),
29902 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
29903 wide_int_bitmask::operator ~ () const,
29904 wide_int_bitmask::operator | (wide_int_bitmask) const,
29905 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
29907 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
29908 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
29909 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
29910 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
29911 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
29912 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
29913 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
29914 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
29915 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
29916 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
29917 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
29918 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
29919 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
29920 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
29921 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
29922 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
29923 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
29924 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
29925 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
29926 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
29927 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
29928 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
29929 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
29930 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
29932 2021-02-18 Jakub Jelinek <jakub@redhat.com>
29934 PR middle-end/99109
29935 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
29936 (build_printable_array_type): ... this. Add nelts argument. For
29937 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
29938 nelts, call build_array_type_nelts.
29939 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
29940 instead of build_zero_elt_array_type and build_array_type_nelts.
29942 2021-02-18 Jakub Jelinek <jakub@redhat.com>
29945 * config/i386/i386.c (distance_non_agu_define): Don't call
29946 extract_insn_cached here.
29947 (ix86_lea_outperforms): Save and restore recog_data around call
29948 to distance_non_agu_define and distance_agu_use.
29949 (ix86_ok_to_clobber_flags): Remove.
29950 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
29951 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
29952 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
29953 into define_insn. Move the splitting to define_peephole2 and
29954 check there using peep2_regno_dead_p if FLAGS_REG is dead.
29956 2021-02-17 Julian Brown <julian@codesourcery.com>
29958 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
29961 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
29964 * config/mips/mips.c (mips_symbol_insns): Do not use
29965 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
29967 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
29969 PR inline-asm/98096
29970 * stmt.c (resolve_operand_name_1): Take inout operands into account
29971 for access to labels by names.
29972 * doc/extend.texi: Describe counting operands for accessing labels.
29974 2021-02-16 Richard Biener <rguenther@suse.de>
29976 PR tree-optimization/38474
29977 * tree-ssa-structalias.c (variable_info::address_taken): New.
29978 (new_var_info): Initialize address_taken.
29979 (process_constraint): Set address_taken.
29980 (solve_constraints): Use the new address_taken flag rather
29981 than is_reg_var for sorting variables.
29982 (dump_constraint): Dump the variable number if the name
29985 2021-02-16 Jakub Jelinek <jakub@redhat.com>
29988 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
29989 multiply by 4096 and for inbranch by 8192.
29990 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
29991 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
29993 2021-02-15 Maya Rashish <coypu@sdf.org>
29995 * config/aarch64/aarch64.c (aarch64_init_builtins):
29996 Call SUBTARGET_INIT_BUILTINS.
29998 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
30000 PR rtl-optimization/98872
30001 * init-regs.c (initialize_uninitialized_regs): Skip initialization
30002 if CONST0_RTX is NULL.
30004 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
30006 PR rtl-optimization/98863
30007 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
30008 (function_info::build_info): Turn into a declaration, moving the
30009 definition to internals.h.
30010 (function_info::bb_walker): Declare.
30011 (function_info::create_reg_use): Likewise.
30012 (function_info::calculate_potential_phi_regs): Take a build_info
30014 (function_info::place_phis, function_info::create_ebbs): Declare.
30015 (function_info::calculate_ebb_live_in_for_debug): Likewise.
30016 (function_info::populate_backedge_phis): Delete.
30017 (function_info::start_block, function_info::end_block): Declare.
30018 (function_info::populate_phi_inputs): Delete.
30019 (function_info::m_potential_phi_regs): Move information to build_info.
30020 * rtl-ssa/internals.h: New file.
30021 (function_info::bb_phi_info): New class.
30022 (function_info::build_info): Moved from functions.h.
30023 Add a constructor and destructor.
30024 (function_info::build_info::ebb_use): Delete.
30025 (function_info::build_info::ebb_def): Likewise.
30026 (function_info::build_info::bb_live_out): Likewise.
30027 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
30028 (function_info::build_info::potential_phi_regs): Likewise.
30029 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
30030 (function_info::build_info::ebb_def_regs): Likewise.
30031 (function_info::build_info::bb_phis): Likewise.
30032 (function_info::build_info::bb_mem_live_out): Likewise.
30033 (function_info::build_info::bb_to_rpo): Likewise.
30034 (function_info::build_info::def_stack): Likewise.
30035 (function_info::build_info::old_def_stack_limit): Likewise.
30036 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
30037 Remove the regno argument. Push the previous definition onto the
30038 definition stack where necessary.
30039 * rtl-ssa/accesses.cc: Include internals.h.
30040 * rtl-ssa/changes.cc: Likewise.
30041 * rtl-ssa/blocks.cc: Likewise.
30042 (function_info::build_info::build_info): Define.
30043 (function_info::build_info::~build_info): Likewise.
30044 (function_info::bb_walker): New class.
30045 (function_info::bb_walker::bb_walker): Define.
30046 (function_info::add_live_out_use): Convert a logarithmic-complexity
30047 test into a linear one. Allow the same definition to be passed
30049 (function_info::calculate_potential_phi_regs): Moved from
30050 functions.cc. Take a build_info parameter and store the
30051 information there instead.
30052 (function_info::place_phis): New function.
30053 (function_info::add_entry_block_defs): Update call to record_reg_def.
30054 (function_info::calculate_ebb_live_in_for_debug): New function.
30055 (function_info::add_phi_nodes): Use bb_phis to decide which
30056 registers need phi nodes and initialize ebb_def_regs accordingly.
30057 Do not add degenerate phis here.
30058 (function_info::add_artificial_accesses): Use create_reg_use.
30059 Assert that all definitions are listed in the DF LR sets.
30060 Update call to record_reg_def.
30061 (function_info::record_block_live_out): Record live-out register
30062 values in the phis of successor blocks. Use the live-out set
30063 when processing the last block in an EBB, instead of always
30064 using the live-in sets of successor blocks. AND the live sets
30065 with the set of registers that have been defined in the EBB,
30066 rather than with all potential phi registers. Cope correctly
30067 with branches back to the start of the current EBB.
30068 (function_info::start_block): New function.
30069 (function_info::end_block): Likewise.
30070 (function_info::populate_phi_inputs): Likewise.
30071 (function_info::create_ebbs): Likewise.
30072 (function_info::process_all_blocks): Rewrite into a multi-phase
30074 * rtl-ssa/functions.cc: Include internals.h.
30075 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
30076 (function_info::init_function_data): Remove caller.
30077 * rtl-ssa/insns.cc: Include internals.h
30078 (function_info::create_reg_use): New function. Lazily any
30079 degenerate phis needed by the linear RPO view.
30080 (function_info::record_use): Use create_reg_use. When processing
30081 debug uses, use potential_phi_regs and test it before checking
30082 whether the register is live on entry to the current EBB. Lazily
30083 calculate ebb_live_in_for_debug.
30084 (function_info::record_call_clobbers): Update call to record_reg_def.
30085 (function_info::record_def): Likewise.
30087 2021-02-15 Martin Liska <mliska@suse.cz>
30089 * toplev.c (init_asm_output): Free output of
30090 gen_command_line_string function.
30091 (process_options): Likewise.
30093 2021-02-15 Martin Liska <mliska@suse.cz>
30095 * params.opt: Add 2 missing Param keywords.
30097 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
30099 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
30101 2021-02-15 Jakub Jelinek <jakub@redhat.com>
30103 PR tree-optimization/99079
30104 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
30105 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
30106 require both type and TREE_TYPE (@1) to be integral types and either
30107 type having smaller or equal precision, or TREE_TYPE (@1) being
30108 unsigned type, or type being signed type. If TREE_TYPE (@1)
30109 doesn't have wrapping overflow, perform the subtraction of one in
30112 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
30113 Richard Biener <rguether@suse.de>
30116 * ipa-reference.c (ipa_init): Only conditinally initialize
30117 reference_vars_to_consider.
30118 (propagate): Conditionally deninitialize reference_vars_to_consider.
30119 (ipa_reference_write_optimization_summary): Sanity check that
30120 reference_vars_to_consider is not allocated.
30122 2021-02-13 Levy Hsu <admin@levyhsu.com>
30125 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
30126 extend parameter to get_si_mem_base_reg declaration.
30127 (get_si_mem_base_reg): Add extend parameter. Set it.
30128 (analyze): Pass extend arg to get_si_mem_base_reg.
30129 (transform): Likewise. Use it when rewriting mems.
30130 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
30131 loads and emit sign/zero extending load followed by subreg move.
30133 2021-02-13 Jim Wilson <jimw@sifive.com>
30136 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
30137 exit when !reload_completed. Only perform check for compressed reg
30138 if reload_completed.
30139 (riscv_rtx_costs): In MEM case, when optimizing for size and
30140 shorten memrefs, if not compressible, then increase cost.
30142 2021-02-13 Jakub Jelinek <jakub@redhat.com>
30144 PR rtl-optimization/98439
30145 * recog.c (pass_split_before_regstack::gate): Enable even when
30146 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
30149 2021-02-13 Jakub Jelinek <jakub@redhat.com>
30152 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
30153 swap of V2SImode elements in memory into DImode memory rotate by 32.
30155 2021-02-12 Martin Sebor <msebor@redhat.com>
30157 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
30159 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
30161 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
30162 m_temp_obstack rather than m_obstack to allocate the temporary use.
30164 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
30166 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
30167 as read-modify operations.
30169 2021-02-12 Richard Biener <rguenther@suse.de>
30171 PR middle-end/38474
30172 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
30173 fbi->aa_walk_budget is bigger than zero. Update
30174 fbi->aa_walk_budget.
30175 (param_change_prob): Likewise.
30176 * ipa-prop.c (detect_type_change_from_memory_writes):
30177 Properly account walk_aliased_vdefs.
30178 (parm_preserved_before_stmt_p): Canonicalize updates.
30179 (parm_ref_data_preserved_p): Likewise.
30180 (parm_ref_data_pass_through_p): Likewise.
30181 (determine_known_aggregate_parts): Account own alias queries.
30183 2021-02-12 Martin Liska <mliska@suse.cz>
30185 * opts-common.c (decode_cmdline_option): Release werror_arg.
30186 * opts.c (gen_producer_string): Release output of
30187 gen_command_line_string.
30189 2021-02-12 Richard Biener <rguenther@suse.de>
30191 PR tree-optimization/38474
30192 * params.opt (-param=max-store-chains-to-track=): New param.
30193 (-param=max-stores-to-track=): Likewise.
30194 * doc/invoke.texi (max-store-chains-to-track): Document.
30195 (max-stores-to-track): Likewise.
30196 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
30198 (pass_store_merging::m_n_stores): Likewise.
30199 (pass_store_merging::terminate_and_process_chain): Update
30200 m_n_stores and m_n_chains.
30201 (pass_store_merging::process_store): Likewise. Terminate
30202 oldest chains if the number of stores or chains get too large.
30203 (imm_store_chain_info::terminate_and_process_chain): Dump
30206 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
30208 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
30209 the cold section, emit a nop before the directive if the previous
30210 active instruction can throw.
30212 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
30215 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
30216 memory addresses that are legal for quad word accesses.
30218 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
30221 * config/arm/thumb2.md (*doloop_end_internal): Generate
30222 alternative sequence to handle long range branches.
30224 2021-02-11 Joel Hutton <joel.hutton@arm.com>
30226 PR tree-optimization/98772
30227 * optabs-tree.c (supportable_half_widening_operation): New function
30228 to check for supportable V8QI->V8HI widening patterns.
30229 * optabs-tree.h (supportable_half_widening_operation): New function.
30230 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
30231 to create promotion stmts for V8QI->V8HI widening patterns.
30232 (vectorizable_conversion): Add case for V8QI->V8HI.
30234 2021-02-11 Richard Biener <rguenther@suse.de>
30236 * sparseset.h (SPARSESET_ELT_BITS): Remove.
30237 (SPARSESET_ELT_TYPE): Use unsigned int.
30238 * fwprop.c: Do not include sparseset.h.
30240 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30243 * varasm.c (declare_weak): For -fsyntax-only, allow even
30244 TREE_ASM_WRITTEN function decls.
30246 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30249 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
30250 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
30251 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
30252 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
30253 calling simplify_gen_subreg on it.
30255 2021-02-10 Martin Liska <mliska@suse.cz>
30257 * config/nvptx/nvptx.c (nvptx_option_override): Use
30258 flag_patchable_function_entry instead of the removed
30259 function_entry_patch_area_size.
30261 2021-02-10 Martin Liska <mliska@suse.cz>
30263 PR tree-optimization/99002
30264 PR tree-optimization/99026
30265 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
30266 leak when adjacent cases are merged.
30267 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
30269 (make_pass_lower_switch): Remove trailing whitespace.
30270 * tree-switch-conversion.h (release_clusters): New.
30272 2021-02-10 Richard Biener <rguenther@suse.de>
30274 PR rtl-optimization/99054
30275 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
30276 (fixup_partitions): Adjust.
30277 (rtl_verify_edges): Likewise.
30279 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30281 PR middle-end/99007
30282 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
30283 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
30286 2021-02-10 Richard Biener <rguenther@suse.de>
30289 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
30292 2021-02-10 Richard Biener <rguenther@suse.de>
30294 PR tree-optimization/99024
30295 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
30296 clear loop->aux if it is associated with the destroyed loop_vinfo.
30298 2021-02-10 Martin Liska <mliska@suse.cz>
30300 PR tree-optimization/99002
30301 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
30304 2021-02-10 Martin Liska <mliska@suse.cz>
30307 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
30308 a reference exists.
30310 2021-02-10 Jakub Jelinek <jakub@redhat.com>
30313 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
30314 at class scope for DWARF5+.
30316 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
30318 PR rtl-optimization/96015
30319 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
30320 (relax_delay_slots): When deleting a jump to the next active
30321 instruction over a barrier, first delete the barrier if the
30322 jump is the only way to reach the target label.
30324 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
30326 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
30327 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
30328 vector multiplies and vect.alu for SSRA.
30329 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
30330 vect.mul cost field.
30331 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
30332 * config/arm/arm.c: Likewise.
30334 2021-02-09 Richard Biener <rguenther@suse.de>
30336 PR tree-optimization/98863
30337 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
30338 * tree-ssa-sccvn.c (last_pushed_avail): New global.
30339 (rpo_elim::eliminate_push_avail): Chain pushed avails.
30340 (unwind_state::avail_top): Add.
30341 (do_unwind): Rewrite unwinding of avail entries.
30342 (do_rpo_vn): Initialize last_pushed_avail and
30343 avail_top of the undo state.
30345 2021-02-09 Jakub Jelinek <jakub@redhat.com>
30347 PR middle-end/99004
30348 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
30349 const char * to char * and free those pointers after use.
30351 2021-02-09 Richard Biener <rguenther@suse.de>
30353 PR tree-optimization/99017
30354 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
30355 zero vector cost entries.
30357 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
30359 PR middle-end/98974
30360 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
30361 parameter in vectorizable_condition.
30363 2021-02-08 Richard Biener <rguenther@suse.de>
30366 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
30368 2021-02-08 Martin Liska <mliska@suse.cz>
30371 * cfgexpand.c (pass_expand::execute): Parse per-function option
30372 flag_patchable_function_entry and use it.
30373 * common.opt: Remove function_entry_patch_area_size and
30374 function_entry_patch_area_start global variables.
30375 * opts.c (parse_and_check_patch_area): New function.
30376 (common_handle_option): Use it.
30377 * opts.h (parse_and_check_patch_area): New function.
30378 * toplev.c (process_options): Parse and use
30379 function_entry_patch_area_size.
30381 2021-02-08 Martin Sebor <msebor@redhat.com>
30383 * doc/extend.texi (attribute malloc): Correct typos.
30385 2021-02-05 Nathan Sidwell <nathan@acm.org>
30388 * gcc.c (driver::maybe_run_linker): Check for input file
30389 accessibility if not linking.
30391 2021-02-05 Richard Biener <rguenther@suse.de>
30393 PR tree-optimization/98855
30394 * tree-vectorizer.h (add_stmt_cost): New overload.
30395 * tree-vect-slp.c (li_cost_vec_cmp): New.
30396 (vect_bb_slp_scalar_cost): Cost individual loop regions
30397 separately. Account for the scalar instance root stmt.
30399 2021-02-05 Tom de Vries <tdevries@suse.de>
30402 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
30404 (bit_test_cluster::emit): Reuse location_t for newly created
30406 (switch_decision_tree::try_switch_expansion): Preserve
30408 * tree-switch-conversion.h: Change function signatures.
30410 2021-02-05 Jakub Jelinek <jakub@redhat.com>
30413 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
30414 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
30415 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
30416 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
30418 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30420 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
30421 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
30422 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
30423 (vget_high_f16): Reimplement using new builtin.
30424 (vget_high_f32): Likewise.
30425 (vget_high_f64): Likewise.
30426 (vget_high_p8): Likewise.
30427 (vget_high_p16): Likewise.
30428 (vget_high_p64): Likewise.
30429 (vget_high_s8): Likewise.
30430 (vget_high_s16): Likewise.
30431 (vget_high_s32): Likewise.
30432 (vget_high_s64): Likewise.
30433 (vget_high_u8): Likewise.
30434 (vget_high_u16): Likewise.
30435 (vget_high_u32): Likewise.
30436 (vget_high_u64): Likewise.
30438 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30440 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
30441 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
30442 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
30443 (vget_low_f16): Reimplement using new builtin.
30444 (vget_low_f32): Likewise.
30445 (vget_low_f64): Likewise.
30446 (vget_low_p8): Likewise.
30447 (vget_low_p16): Likewise.
30448 (vget_low_p64): Likewise.
30449 (vget_low_s8): Likewise.
30450 (vget_low_s16): Likewise.
30451 (vget_low_s32): Likewise.
30452 (vget_low_s64): Likewise.
30453 (vget_low_u8): Likewise.
30454 (vget_low_u16): Likewise.
30455 (vget_low_u32): Likewise.
30456 (vget_low_u64): Likewise.
30458 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
30460 * gcc.c (print_multilib_info): Check all required argument is provided
30463 2021-02-05 liuhongt <hongtao.liu@intel.com>
30466 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
30467 generate integer mask comparison for 128/256-bits vector when
30468 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
30469 delete redundant !maskcmp condition.
30470 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
30472 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
30473 condition directly to if (maskcmp), add extra check for
30474 cmpmode, it should be MODE_INT.
30475 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
30476 parameters op_true/op_false.
30477 (ix86_use_mask_cmp_p): New.
30479 2021-02-05 liuhongt <hongtao.liu@intel.com>
30482 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
30483 Remove m_GENERIC from ~list.
30484 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
30486 2021-02-04 David Malcolm <dmalcolm@redhat.com>
30489 * diagnostic-show-locus.c (compatible_locations_p): Require
30490 locations in the same macro map to be either both from the
30491 macro definition, or both from the macro arguments.
30493 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
30495 * config/aarch64/aarch64-simd-builtins.def: Add
30496 [su]mull_hi_lane[q] builtin generator macros.
30497 * config/aarch64/aarch64-simd.md
30498 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
30499 (aarch64_<su>mull_hi_lane<mode>): Define.
30500 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
30501 (aarch64_<su>mull_hi_laneq<mode>): Define.
30502 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
30503 builtin instead of inline asm.
30504 (vmull_high_lane_s32): Likewise.
30505 (vmull_high_lane_u16): Likewise.
30506 (vmull_high_lane_u32): Likewise.
30507 (vmull_high_laneq_s16): Likewise.
30508 (vmull_high_laneq_s32): Likewise.
30509 (vmull_high_laneq_u16): Likewise.
30510 (vmull_high_laneq_u32): Liekwise.
30512 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
30514 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
30515 builtin generator macros.
30516 * config/aarch64/aarch64-simd.md
30517 (aarch64_<su>mull_hi_n<mode>_insn): Define.
30518 (aarch64_<su>mull_hi_n<mode>): Define.
30519 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
30520 instead of inline asm.
30521 (vmull_high_n_s32): Likewise.
30522 (vmull_high_n_u16): Likewise.
30523 (vmull_high_n_u32): Likewise.
30525 2021-02-04 Richard Biener <rguenther@suse.de>
30527 PR tree-optimization/98855
30528 * tree-vect-loop.c (vectorizable_phi): Do not cost
30529 single-argument PHIs.
30530 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
30531 * tree-vect-stmts.c (vectorizable_bswap): Also perform
30532 costing for SLP operation.
30534 2021-02-04 Martin Liska <mliska@suse.cz>
30536 * doc/extend.texi: Mention -mprefer-vector-width in target
30539 2021-02-03 Martin Sebor <msebor@redhat.com>
30541 PR tree-optimization/98937
30542 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
30543 Flush pointer_query cache.
30545 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
30547 * config/rs6000/genfusion.pl (gen_2logical): Add missing
30548 fixes based on patch review.
30549 * config/rs6000/fusion.md: Regenerate file.
30551 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
30553 * config/rs6000/t-rs6000: Comment out auto generation of
30556 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
30558 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
30559 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
30560 (output_file_start): Add gfx908.
30561 * config/gcn/gcn.opt (gpu_type): Add gfx908.
30562 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
30563 (MULTILIB_DIRNAMES): Add gfx908.
30564 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
30565 (main): Recognize gfx908.
30566 * config/gcn/t-omp-device: Add gfx908.
30568 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30570 * config/aarch64/aarch64-simd-builtins.def: Add
30571 [su]mlsl_hi_lane[q] builtin macro generators.
30572 * config/aarch64/aarch64-simd.md
30573 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
30574 (aarch64_<su>mlsl_hi_lane<mode>): Define.
30575 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
30576 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
30577 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
30578 builtin instead of inline asm.
30579 (vmlsl_high_lane_s32): Likewise.
30580 (vmlsl_high_lane_u16): Likewise.
30581 (vmlsl_high_lane_u32): Likewise.
30582 (vmlsl_high_laneq_s16): Likewise.
30583 (vmlsl_high_laneq_s32): Likewise.
30584 (vmlsl_high_laneq_u16): Likewise.
30585 (vmlsl_high_laneq_u32): Likewise.
30586 (vmlal_high_laneq_u32): Likewise.
30588 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30590 * config/aarch64/aarch64-simd-builtins.def: Add
30591 [su]mlal_hi_lane[q] builtin generator macros.
30592 * config/aarch64/aarch64-simd.md
30593 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
30594 (aarch64_<su>mlal_hi_lane<mode>): Define.
30595 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
30596 (aarch64_<su>mlal_hi_laneq<mode>): Define.
30597 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
30598 builtin instead of inline asm.
30599 (vmlal_high_lane_s32): Likewise.
30600 (vmlal_high_lane_u16): Likewise.
30601 (vmlal_high_lane_u32): Likewise.
30602 (vmlal_high_laneq_s16): Likewise.
30603 (vmlal_high_laneq_s32): Likewise.
30604 (vmlal_high_laneq_u16): Likewise.
30605 (vmlal_high_laneq_u32): Likewise.
30607 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30609 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
30610 builtin generator macros.
30611 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
30613 (aarch64_<su>mlsl_hi_n<mode>): Define.
30614 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
30615 instead of inline asm.
30616 (vmlsl_high_n_s32): Likewise.
30617 (vmlsl_high_n_u16): Likewise.
30618 (vmlsl_high_n_u32): Likewise.
30620 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30622 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
30623 builtin generator macros.
30624 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
30626 (aarch64_<su>mlal_hi_n<mode>): Define.
30627 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
30628 instead of inline asm.
30629 (vmlal_high_n_s32): Likewise.
30630 (vmlal_high_n_u16): Likewise.
30631 (vmlal_high_n_u32): Likewise.
30633 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
30635 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
30637 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
30639 (aarch64_<su>mlal_hi<mode>_insn): This.
30640 (aarch64_<su>mlal_hi<mode>): Define.
30641 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
30642 instead of inline asm.
30643 (vmlal_high_s16): Likewise.
30644 (vmlal_high_s32): Likewise.
30645 (vmlal_high_u8): Likewise.
30646 (vmlal_high_u16): Likewise.
30647 (vmlal_high_u32): Likewise.
30649 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
30651 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
30652 after calling alter_subreg() on a (mem).
30654 2021-02-03 Martin Liska <mliska@suse.cz>
30657 * lto-streamer-out.c (produce_lto_section): Fill up missing
30659 * lto-streamer.h (struct lto_section): Add _padding field.
30661 2021-02-03 Richard Biener <rguenther@suse.de>
30663 * lto-streamer.c (lto_get_section_name): Free temporary
30665 * tree-loop-distribution.c
30666 (loop_distribution::merge_dep_scc_partitions): Free edge data.
30668 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30670 PR middle-end/97487
30671 * ifcvt.c (noce_can_force_operand): New function.
30672 (noce_emit_move_insn): Use it.
30673 (noce_try_sign_mask): Likewise. Formatting fix.
30675 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30677 PR middle-end/97971
30678 * lra-constraints.c (process_alt_operands): For inline asm, don't call
30679 fatal_insn, but instead return false.
30681 2021-02-03 Jakub Jelinek <jakub@redhat.com>
30683 PR tree-optimization/98287
30684 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
30687 2021-02-03 Tamar Christina <tamar.christina@arm.com>
30689 PR tree-optimization/98928
30690 * tree-vect-loop.c (vect_analyze_loop_2): Change
30691 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
30692 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
30693 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
30694 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
30696 2021-02-02 Richard Biener <rguenther@suse.de>
30698 * gimple-loop-interchange.cc (prepare_data_references):
30700 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
30701 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
30702 * tree-vect-stmts.c (vectorizable_condition): Do not
30704 (vectorizable_comparison): Likewise.
30706 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30708 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
30709 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
30710 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
30711 (vrsqrteq_u32): Likewise.
30713 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30715 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
30716 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
30717 (aarch64_sqxtun2<mode>_be): Likewise.
30718 (aarch64_sqxtun2<mode>): Likewise.
30719 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
30720 (vqmovun_high_s32): Likewise.
30721 (vqmovun_high_s64): Likewise.
30722 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
30724 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30726 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
30728 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
30730 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30732 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
30733 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
30734 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
30735 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
30736 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
30737 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
30738 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
30739 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
30741 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30743 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
30744 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
30745 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
30747 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30749 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
30750 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
30752 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30754 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
30755 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
30756 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
30757 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
30758 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
30759 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
30760 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
30761 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
30762 NONE builtin flags.
30764 2021-02-02 Jakub Jelinek <jakub@redhat.com>
30766 PR tree-optimization/98848
30767 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
30768 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
30770 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
30773 * expr.c: Check mode before calling store_expr.
30775 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
30777 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
30779 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
30780 instruction using expression ior.
30781 (mve_vornq_u<mode>): New expander.
30782 (mve_vornq_f<mode>): Use ior code instead of unspec.
30783 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
30785 2021-02-02 Alexandre Oliva <oliva@adacore.com>
30787 * tree-nested.c (convert_nonlocal_reference_op): Move
30788 current_function_decl restore after re-gimplification.
30789 (convert_local_reference_op): Likewise.
30791 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30793 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
30795 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
30797 (aarch64_rshrn<mode>_insn_be): Likewise.
30798 (aarch64_rshrn<mode>): Likewise.
30799 (aarch64_rshrn2<mode>_insn_le): Likewise.
30800 (aarch64_rshrn2<mode>_insn_be): Likewise.
30801 (aarch64_rshrn2<mode>): Likewise.
30802 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
30803 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
30805 (vrshrn_high_n_s32): Likewise.
30806 (vrshrn_high_n_s64): Likewise.
30807 (vrshrn_high_n_u16): Likewise.
30808 (vrshrn_high_n_u32): Likewise.
30809 (vrshrn_high_n_u64): Likewise.
30810 (vrshrn_n_s16): Likewise.
30811 (vrshrn_n_s32): Likewise.
30812 (vrshrn_n_s64): Likewise.
30813 (vrshrn_n_u16): Likewise.
30814 (vrshrn_n_u32): Likewise.
30815 (vrshrn_n_u64): Likewise.
30817 2021-02-01 Sergei Trofimovich <siarheit@google.com>
30819 PR tree-optimization/98499
30820 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
30821 conservatively and assume all possible side-effects.
30823 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30825 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
30826 vec_unpacku_hi_): Define builtins.
30827 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
30829 (vmovl_high_s16): Likewise.
30830 (vmovl_high_s32): Likewise.
30831 (vmovl_high_u8): Likewise.
30832 (vmovl_high_u16): Likewise.
30833 (vmovl_high_u32): Likewise.
30835 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30837 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
30839 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
30841 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
30843 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
30845 (vabdl_s16): Likewise.
30846 (vabdl_s32): Likewise.
30847 (vabdl_u8): Likewise.
30848 (vabdl_u16): Likewise.
30849 (vabdl_u32): Likewise.
30850 * config/aarch64/iterators.md (ABDL): New int iterator.
30851 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
30853 2021-02-01 Martin Sebor <msebor@redhat.com>
30855 * tree.h (BLOCK_VARS): Add comment.
30856 (BLOCK_SUBBLOCKS): Same.
30857 (BLOCK_SUPERCONTEXT): Same.
30858 (BLOCK_ABSTRACT_ORIGIN): Same.
30859 (inlined_function_outer_scope_p): Same.
30861 2021-02-01 Martin Sebor <msebor@redhat.com>
30863 PR middle-end/97172
30864 * attribs.c (attr_access::free_lang_data): Define new function.
30865 * attribs.h (attr_access::free_lang_data): Declare new function.
30867 2021-02-01 Richard Biener <rguenther@suse.de>
30869 * vec.h (auto_vec::auto_vec): Add memory stat parameters
30871 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
30873 2021-02-01 Tamar Christina <tamar.christina@arm.com>
30875 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
30876 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
30878 2021-02-01 Richard Biener <rguenther@suse.de>
30880 PR rtl-optimization/98863
30881 * config/i386/i386-features.c (convert_scalars_to_vector):
30882 Set DF_RD_PRUNE_DEAD_DEFS.
30884 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
30886 * system.h (SIZE_MAX): Define if not already defined.
30888 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
30890 * config/rs6000/genfusion.pl (gen_2logical): New function to
30891 generate patterns for logical-logical fusion.
30892 * config/rs6000/fusion.md: Regenerated patterns.
30893 * config/rs6000/rs6000-cpus.def: Add
30894 OPTION_MASK_P10_FUSION_2LOGICAL.
30895 * config/rs6000/rs6000.c (rs6000_option_override_internal):
30896 Enable logical-logical fusion for p10.
30897 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
30899 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
30901 * config/rs6000/rs6000.opt: Add periods to new AIX options.
30903 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
30905 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
30906 (mabi=vec-default): New.
30907 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
30908 __EXTABI__ for AIX Vector extended ABI.
30909 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
30911 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
30913 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
30915 2021-01-30 Jakub Jelinek <jakub@redhat.com>
30917 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
30918 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
30920 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
30923 * lra-constraints.c (in_class_p): Don't narrow class only for REG
30926 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
30928 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
30929 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
30930 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
30932 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
30934 PR tree-optimization/98866
30935 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
30936 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
30937 (gori_map::m_maybe_invariant): Rename from all_outgoing.
30938 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
30939 (gori_map::is_export_p): Ditto.
30940 (gori_map::calculate_gori): Ditto.
30941 (gori_compute::set_range_invariant): New.
30942 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
30943 invariant for pointers evaluating to [1, +INF].
30945 2021-01-29 Richard Biener <rguenther@suse.de>
30947 PR rtl-optimization/98863
30948 * config/i386/i386-features.c (remove_partial_avx_dependency):
30949 Do not perform DF analysis.
30950 (pass_data_remove_partial_avx_dependency): Remove
30953 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
30955 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
30956 builtin generator macros.
30957 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
30959 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
30960 instead of inline asm.
30961 (vmull_n_s32): Likewise.
30962 (vmull_n_u16): Likewise.
30963 (vmull_n_u32): Likewise.
30965 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30967 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
30969 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
30971 (aarch64_<sur>abdl2<mode>): ... This.
30972 (<sur>sadv16qi): Adjust use of above.
30973 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
30975 (vabdl_high_s16): Likewise.
30976 (vabdl_high_s32): Likewise.
30977 (vabdl_high_u8): Likewise.
30978 (vabdl_high_u16): Likewise.
30979 (vabdl_high_u32): Likewise.
30981 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
30983 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
30985 (uabal2): Likewise.
30986 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
30988 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
30990 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
30992 (vabal_high_s16): Likewise.
30993 (vabal_high_s32): Likewise.
30994 (vabal_high_u8): Likewise.
30995 (vabal_high_u16): Likewise.
30996 (vabal_high_u32): Likewise.
30997 * config/aarch64/iterators.md (ABAL2): New mode iterator.
30998 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
31000 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31002 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
31005 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
31007 (aarch64_<sur>abal<mode>): ... This
31008 (<sur>sadv16qi): Adust use of the above.
31009 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
31011 (vabal_s16): Likewise.
31012 (vabal_s32): Likewise.
31013 (vabal_u8): Likewise.
31014 (vabal_u16): Likewise.
31015 (vabal_u32): Likewise.
31017 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31019 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
31021 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
31023 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
31025 (vaddlv_s16): Likewise.
31026 (vaddlv_u8): Likewise.
31027 (vaddlv_u16): Likewise.
31028 (vaddlvq_s8): Likewise.
31029 (vaddlvq_s16): Likewise.
31030 (vaddlvq_s32): Likewise.
31031 (vaddlvq_u8): Likewise.
31032 (vaddlvq_u16): Likewise.
31033 (vaddlvq_u32): Likewise.
31034 (vaddlv_s32): Likewise.
31035 (vaddlv_u32): Likewise.
31036 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
31037 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
31038 (Vwstype): New mode attribute.
31040 (VWIDE_S): Likewise.
31041 (USADDLV): New int iterator.
31042 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
31044 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
31046 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
31047 builtin generator macros.
31048 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
31050 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
31051 instead of inline asm.
31052 (vmlsl_lane_s32): Likewise.
31053 (vmlsl_lane_u16): Likewise.
31054 (vmlsl_lane_u32): Likewise.
31055 (vmlsl_laneq_s16): Likewise.
31056 (vmlsl_laneq_s32): Likewise.
31057 (vmlsl_laneq_u16): Likewise.
31058 (vmlsl_laneq_u32): Likewise.
31060 2021-01-29 Richard Biener <rguenther@suse.de>
31062 * doc/invoke.texi (--param max-gcse-memory): Document unit
31064 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
31065 * params.opt (--param max-gcse-memory): Adjust default and
31066 document unit of size.
31068 2021-01-29 Richard Biener <rguenther@suse.de>
31070 PR rtl-optimization/98863
31071 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
31072 HOST_WIDE_INT for the memory estimate.
31074 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
31075 Richard Biener <rguenther@suse.de>
31077 PR tree-optimization/97627
31078 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
31079 Do not analyze fake edges.
31081 2021-01-29 Richard Biener <rguenther@suse.de>
31083 PR rtl-optimization/98144
31084 * df.h (df_mir_bb_info): Add con_visited member.
31085 * df-problems.c (df_mir_alloc): Initialize con_visited,
31086 do not fully populate IN and OUT.
31087 (df_mir_reset): Likewise.
31088 (df_mir_confluence_0): Set con_visited.
31089 (df_mir_confluence_n): Properly handle implicitely
31090 fully populated IN and OUT as designated by con_visited
31091 and update con_visited accordingly.
31093 2021-01-29 Jakub Jelinek <jakub@redhat.com>
31096 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
31097 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
31098 && !TARGET_REALLY_IWMMXT to conditions.
31100 2021-01-29 Jakub Jelinek <jakub@redhat.com>
31103 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
31106 2021-01-28 Marek Polacek <polacek@redhat.com>
31109 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
31110 the main variant, maybe reset it in its variants too.
31111 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
31112 (check_aligned_type): Check if TYPE_USER_ALIGN match.
31114 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
31117 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
31118 of constant zero for comparisons.
31120 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
31122 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
31123 support for mapping built-in function names for long double
31124 built-in functions if long double is IEEE 128-bit.
31126 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
31128 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
31129 builtin generator macros.
31130 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
31132 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
31133 instead of inline asm.
31134 (vmlsl_n_s32): Likewise.
31135 (vmlsl_n_u16): Likewise.
31136 (vmlsl_n_u32): Likewise.
31138 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
31140 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
31141 builtin generator macros.
31142 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
31144 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
31145 instead of inline asm.
31146 (vmlal_n_s32): Likewise.
31147 (vmlal_n_u16): Likewise.
31148 (vmlal_n_u32): Likewise.
31150 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31152 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
31154 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
31156 (aarch64_shrn2<mode>_insn_be): Likewise.
31157 (aarch64_shrn2<mode>): Likewise.
31158 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
31160 (vshrn_high_n_s32): Likewise.
31161 (vshrn_high_n_s64): Likewise.
31162 (vshrn_high_n_u16): Likewise.
31163 (vshrn_high_n_u32): Likewise.
31164 (vshrn_high_n_u64): Likewise.
31166 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31168 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
31170 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
31172 (aarch64_shrn<mode>_insn_be): Likewise.
31173 (aarch64_shrn<mode>): Likewise.
31174 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
31176 (vshrn_n_s32): Likewise.
31177 (vshrn_n_s64): Likewise.
31178 (vshrn_n_u16): Likewise.
31179 (vshrn_n_u32): Likewise.
31180 (vshrn_n_u64): Likewise.
31181 * config/aarch64/iterators.md (vn_mode): New mode attribute.
31183 2021-01-28 Richard Biener <rguenther@suse.de>
31185 PR rtl-optimization/80960
31186 * dse.c (check_mem_read_rtx): Call get_addr on the
31189 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
31190 David Edelsohn <dje.gcc@gmail.com>
31193 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31194 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
31196 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
31198 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
31199 wrapper call rs6000_expand_vector_set_var for cleanup. Call
31200 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
31202 (rs6000_expand_vector_set_var): Delete.
31203 (rs6000_expand_vector_set_var_p9): Make static.
31204 (rs6000_expand_vector_set_var_p8): Make static.
31206 2021-01-28 Xing GUO <higuoxing@gmail.com>
31208 * common/config/riscv/riscv-common.c
31209 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
31210 when `p` extension exists.
31212 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
31214 PR rtl-optimization/97684
31215 * ira.c (ira): Call ira_set_pseudo_classes before
31216 update_equiv_regs when it is necessary.
31218 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31221 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
31222 %w0, %w1 and %2 instead of %0, %1 and %2.
31224 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
31226 * config/rs6000/genfusion.pl: New script to generate
31227 define_insn_and_split patterns so combine can arrange fused
31228 instructions next to each other.
31229 * config/rs6000/fusion.md: New file, generated fused instruction
31230 patterns for combine.
31231 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
31232 (non_update_memory_operand): New predicate.
31233 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
31234 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
31236 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
31238 * config/rs6000/rs6000.c (rs6000_option_override_internal):
31239 Automatically set OPTION_MASK_P10_FUSION and
31240 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
31241 (rs600_opt_masks): Allow -mpower10-fusion
31242 in function attributes.
31243 (address_is_non_pfx_d_or_x): New function.
31244 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
31245 * config/rs6000/rs6000.md: Include fusion.md.
31246 * config/rs6000/rs6000.opt: Add -mpower10-fusion
31247 and -mpower10-fusion-ld-cmpi.
31248 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
31250 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31252 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
31253 builtin generator macros.
31254 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
31256 (aarch64_<su>mlal<mode>): This.
31257 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
31258 instead of inline asm.
31259 (vmlal_s16): Likewise.
31260 (vmlal_s32): Likewise.
31261 (vmlal_u8): Likewise.
31262 (vmlal_u16): Likewise.
31263 (vmlal_u32): Likewise.
31265 2021-01-27 Richard Biener <rguenther@suse.de>
31267 PR tree-optimization/98854
31268 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
31269 PHIs from scalars when the number of CTORs matches the
31270 number of children.
31272 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31274 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
31276 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
31278 (aarch64_mls_n<mode>): This.
31279 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
31281 (vmls_n_s32): Likewise.
31282 (vmls_n_u16): Likewise.
31283 (vmls_n_u32): Likewise.
31284 (vmlsq_n_s16): Likewise.
31285 (vmlsq_n_s32): Likewise.
31286 (vmlsq_n_u16): Likewise.
31287 (vmlsq_n_u32): Likewise.
31289 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31291 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
31293 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
31295 (vmls_s16): Likewise.
31296 (vmls_s32): Likewise.
31297 (vmls_u8): Likewise.
31298 (vmls_u16): Likewise.
31299 (vmls_u32): Likewise.
31300 (vmlsq_s8): Likewise.
31301 (vmlsq_s16): Likewise.
31302 (vmlsq_s32): Likewise.
31303 (vmlsq_u8): Likewise.
31304 (vmlsq_u16): Likewise.
31305 (vmlsq_u32): Likewise.
31307 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
31309 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
31311 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
31313 (aarch64_mla_n<mode>): This.
31314 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
31316 (vmla_n_s32): Likewise.
31317 (vmla_n_u16): Likewise.
31318 (vmla_n_u32): Likewise.
31319 (vmlaq_n_s16): Likewise.
31320 (vmlaq_n_s32): Likewise.
31321 (vmlaq_n_u16): Likewise.
31322 (vmlaq_n_u32): Likewise.
31324 2021-01-27 liuhongt <hongtao.liu@intel.com>
31327 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
31328 (*sse2_eq<mode>3): Ditto.
31330 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31332 * tree-pass.h (PROP_trees): Rename to ...
31333 (PROP_gimple): ... this.
31334 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
31335 * passes.c (execute_function_dump, execute_function_todo,
31336 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
31337 * varpool.c (ctor_for_folding): Likewise.
31339 2021-01-27 Jakub Jelinek <jakub@redhat.com>
31341 PR tree-optimization/97260
31342 * varpool.c: Include tree-pass.h.
31343 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
31344 non-TREE_SIDE_EFFECTS automatic variables.
31346 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
31348 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
31350 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
31351 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
31352 (gen_compile_unit_die): Recognise C++23.
31354 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31357 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
31360 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31363 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
31364 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
31365 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
31367 2021-01-26 Richard Biener <rguenther@suse.de>
31369 * gimple-pretty-print.c (dump_binary_rhs): Handle
31370 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
31372 2021-01-26 Richard Biener <rguenther@suse.de>
31374 PR middle-end/98726
31375 * tree.h (vector_cst_int_elt): Remove.
31376 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
31379 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
31381 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
31382 for V64DFmode min/max reductions.
31384 2021-01-26 Jakub Jelinek <jakub@redhat.com>
31386 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
31387 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
31388 two halves, one with x and the other with const0_rtx, ordered
31389 depending on endianity.
31391 2021-01-26 Alexandre Oliva <oliva@adacore.com>
31393 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
31394 temporaries not seen in binding block, and not about to be
31395 added as gimple variables.
31397 2021-01-25 Martin Sebor <msebor@redhat.com>
31400 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
31402 2021-01-25 Martin Liska <mliska@suse.cz>
31404 * value-prof.c (get_nth_most_common_value): Use %s instead
31407 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31410 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
31411 readelf -wi is able to read the emitted .debug_info back.
31412 * configure: Regenerated.
31414 2021-01-25 Martin Liska <mliska@suse.cz>
31416 PR gcov-profile/98739
31417 * common.opt: Add missing sign symbol.
31418 * value-prof.c (get_nth_most_common_value): Restore handling
31419 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
31420 PROFILE_REPRODUCIBILITY_MULTITHREADED.
31422 2021-01-25 Richard Biener <rguenther@suse.de>
31424 PR middle-end/98807
31425 * tree.c (vector_element_bits): Always use precision of
31426 the element type for boolean vectors.
31428 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31430 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
31431 (ENDFILE_SPEC): Evaluate qnolinkcmds.
31433 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
31435 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
31436 nostartfiles handling since this is already done by
31437 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
31438 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
31439 is already done by LINK_COMMAND_SPEC.
31440 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
31441 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
31444 2021-01-25 Jakub Jelinek <jakub@redhat.com>
31447 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
31448 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
31449 value rather than host size_t.
31450 (fold_const_call): Change type of s2 from size_t to
31451 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
31452 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
31453 instead of s2 as last argument.
31455 2021-01-25 Tamar Christina <tamar.christina@arm.com>
31457 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
31458 VCMLA_OP, VCMUL_OP): New.
31459 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
31460 * config/arm/neon.md (cmul<conj_op><mode>3): New.
31461 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
31462 UNSPEC_VCMUL_CONJ): New.
31463 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
31464 cml<fcmac1><conj_op><mode>4): New.
31466 2021-01-23 Jakub Jelinek <jakub@redhat.com>
31469 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
31471 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
31473 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
31475 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
31477 (vmla_s16): Likewise.
31478 (vmla_s32): Likewise.
31479 (vmla_u8): Likewise.
31480 (vmla_u16): Likewise.
31481 (vmla_u32): Likewise.
31482 (vmlaq_s8): Likewise.
31483 (vmlaq_s16): Likewise.
31484 (vmlaq_s32): Likewise.
31485 (vmlaq_u8): Likewise.
31486 (vmlaq_u16): Likewise.
31487 (vmlaq_u32): Likewise.
31489 2021-01-22 David Malcolm <dmalcolm@redhat.com>
31491 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
31494 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31497 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
31498 filenames to emit, still emit the required 0 index directory and
31499 filename entries that match DW_AT_comp_dir and DW_AT_name of the
31502 2021-01-22 Marek Polacek <polacek@redhat.com>
31505 * doc/invoke.texi: Update C++ ABI Version 15 description.
31507 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31509 PR tree-optimization/98766
31510 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
31511 comparing against type size with param_avoid_fma_max_bits.
31513 2021-01-22 Richard Biener <rguenther@suse.de>
31515 PR middle-end/98793
31516 * tree.c (vector_element_bits): Key single-bit bool vector on
31517 integer mode rather than not vector mode.
31519 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
31522 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31523 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
31525 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
31526 to call different path for P8 and P9.
31527 (rs6000_expand_vector_set_var_p9): New function.
31528 (rs6000_expand_vector_set_var_p8): New function.
31530 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
31534 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
31535 Ajdust variable index vec_insert from address dereference to
31536 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
31537 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
31539 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
31541 2021-01-22 Martin Liska <mliska@suse.cz>
31543 PR gcov-profile/98739
31544 * profile.c (compute_value_histograms): Drop time profile for
31545 -fprofile-reproducible=multithreaded.
31547 2021-01-22 Nathan Sidwell <nathan@acm.org>
31549 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
31552 2021-01-22 Richard Biener <rguenther@suse.de>
31554 PR middle-end/98773
31555 * tree-data-ref.c (initalize_matrix_A): Revert previous
31556 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
31558 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31560 PR tree-optimization/90248
31561 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
31562 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
31564 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
31565 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
31567 2021-01-22 Jakub Jelinek <jakub@redhat.com>
31569 PR tree-optimization/98255
31570 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
31571 extend index - low_bound from sizetype's precision rather than index
31573 (get_addr_base_and_unit_offset_1): Likewise.
31574 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
31575 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
31577 2021-01-22 Richard Biener <rguenther@suse.de>
31579 PR tree-optimization/98786
31580 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
31581 adding new uses of abnormals. Verify we deal with a conditional
31584 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
31587 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
31589 2021-01-22 liuhongt <hongtao.liu@intel.com>
31593 * config/i386/sse.md (VI_128_256): New mode iterator.
31594 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
31595 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
31596 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
31597 define_insn_and_split to lower avx512 vector comparison to avx
31598 version when dest is vector.
31599 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
31600 define_insn_and_split for negating the comparison result.
31601 * config/i386/predicates.md (float_vector_all_ones_operand):
31603 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
31604 general NOT operator without UNSPEC_MASKOP.
31606 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
31608 PR rtl-optimization/98777
31609 * lra-int.h (lra_pmode_pseudo): New extern.
31610 * lra.c (lra_pmode_pseudo): New global.
31612 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
31614 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
31616 * fwprop.c (fwprop_propagation::classify_result): Allow
31617 (subreg (mem)) simplifications.
31619 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31621 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
31623 (aarch64_sqdmlal<mode>): ... This...
31624 (aarch64_sqdmlsl<mode>): ... And this.
31625 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
31626 (aarch64_sqdmlal_lane<mode>): ... This...
31627 (aarch64_sqdmlsl_lane<mode>): ... And this.
31628 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
31629 (aarch64_sqdmlsl_laneq<mode>): ... This...
31630 (aarch64_sqdmlal_laneq<mode>): ... And this.
31631 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
31632 (aarch64_sqdmlsl_n<mode>): ... This...
31633 (aarch64_sqdmlal_n<mode>): ... And this.
31634 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
31635 (aarch64_sqdmlal2<mode>_internal): ... This...
31636 (aarch64_sqdmlsl2<mode>_internal): ... And this.
31638 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
31640 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
31642 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
31645 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
31647 2021-01-21 liuhongt <hongtao.liu@intel.com>
31649 PR rtl-optimization/98694
31650 * regcprop.c (copy_value): If SRC had been assigned a mode
31651 narrower than the copy, we can't link DEST into the chain even
31652 they have same hard_regno_nregs(i.e. HImode/SImode in i386
31655 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31657 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
31658 Convert to define_insn_and_split. Split into simple move when moving
31661 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
31663 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
31664 Adjust comment. Simplify code.
31666 2021-01-20 Jakub Jelinek <jakub@redhat.com>
31669 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
31670 with DW_FORM_line_strp form.
31671 (prune_unused_types_update_strings): Don't add into debug_str_hash
31672 indirect strings with DW_FORM_line_strp form.
31673 (adjust_name_comp_dir): New function.
31674 (dwarf2out_finish): Call it on CU DIEs after resetting
31675 debug_line_str_hash.
31677 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
31679 PR rtl-optimization/98722
31680 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
31681 has no 3-op add insn to transform insns containing two pluses.
31683 2021-01-20 Richard Biener <rguenther@suse.de>
31685 * hwint.h (add_hwi): New function.
31686 (mul_hwi): Likewise.
31687 * tree-data-ref.c (initialize_matrix_A): Properly translate
31688 tree constants and avoid HOST_WIDE_INT_MIN.
31689 (lambda_matrix_row_add): Avoid undefined integer overflow
31690 and return true on such overflow.
31691 (lambda_matrix_right_hermite): Handle overflow from
31692 lambda_matrix_row_add gracefully. Simplify previous fix.
31693 (analyze_subscript_affine_affine): Likewise.
31695 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
31697 PR tree-optimization/96674
31698 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
31699 x >= y && y != XXX_MIN --> x > y - 1
31701 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
31703 PR tree-optimization/98535
31704 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
31705 If the high and low permutes are the same, remove the high permutes
31706 from the working set and only continue with the low ones.
31708 2021-01-20 Jakub Jelinek <jakub@redhat.com>
31710 PR tree-optimization/98721
31711 * builtins.c (access_ref::inform_access): Don't assume
31712 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
31713 object whenever allocfn is NULL, rather than only when DECL_P
31714 is true. Use %qE instead of %qD for that. Formatting fixes.
31716 2021-01-20 Richard Biener <rguenther@suse.de>
31718 PR tree-optimization/98758
31719 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
31720 (lambda_matrix_right_hermite): Avoid undefinedness with
31721 signed integer abs and multiplication.
31722 (analyze_subscript_affine_affine): Use lambda_int.
31724 2021-01-20 David Malcolm <dmalcolm@redhat.com>
31727 * dwarf2out.c (output_line_info): Rename static variable
31728 "generation", moving it out of the function to...
31729 (output_line_info_generation): New.
31730 (init_sections_and_labels): Likewise, renaming the variable to...
31731 (init_sections_and_labels_generation): New.
31732 (dwarf2out_c_finalize): Reset the new variables.
31734 2021-01-19 Martin Sebor <msebor@redhat.com>
31736 PR middle-end/98664
31737 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
31738 all functions, even if they're not declared artificial or inline.
31739 * tree.c (tree_inlined_location): Use macro expansion location
31740 only if scope traversal fails to expose one.
31742 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
31744 PR rtl-optimization/92294
31745 * alias.c (compare_base_symbol_refs): Take an extra parameter
31746 and add the distance between two symbols to it. Enshrine in
31747 comments that -1 means "either 0 or 1, but we can't tell
31748 which at compile time".
31749 (memrefs_conflict_p): Update call accordingly.
31750 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
31753 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
31755 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
31756 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
31757 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
31758 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
31759 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
31760 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
31761 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
31762 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
31763 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
31764 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
31765 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
31766 sqmovun): Use NONE flags.
31768 2021-01-19 Richard Biener <rguenther@suse.de>
31771 * ipa-modref.c (analyze_stmt): Only record a summary for a
31774 2021-01-19 Richard Biener <rguenther@suse.de>
31776 PR middle-end/98638
31777 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
31779 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
31781 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
31782 built-in define __FIX_LEON3FT_TN0018.
31784 2021-01-19 Richard Biener <rguenther@suse.de>
31787 * tree-inline.c (tree_function_versioning): Set input_location
31788 to UNKNOWN_LOCATION throughout the function.
31790 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
31793 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
31795 2021-01-19 Martin Jambor <mjambor@suse.cz>
31798 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
31799 whether non-call exceptions allow removal of a statement.
31800 (isra_analyze_call): Pass the appropriate function to
31801 ssa_name_only_returned_p.
31803 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
31805 * config/riscv/arch-canonicalize (longext_sort): New function for
31806 sorting 'multi-letter'.
31807 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
31808 'alts'. The 'arch' may not be the first of 'alts'.
31809 (_expand_combination): Add underline for the 'ext' without '*'.
31810 This is because, a single-letter extension can always be treated well
31811 with a '_' prefix, but it cannot be separated out if it is appended
31814 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
31817 * ira.c (ira): Skip abnormal critical edge splitting.
31819 2021-01-18 Jakub Jelinek <jakub@redhat.com>
31821 PR tree-optimization/98727
31822 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
31823 second .MUL_OVERFLOW operand for signed multiplication with overflow
31824 checking if the second operand of multiplication is not constant.
31826 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
31828 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
31829 defaults to version 4.
31831 2021-01-18 David Malcolm <dmalcolm@redhat.com>
31833 * attribs.h (fndecl_dealloc_argno): New decl.
31834 * builtins.c (call_dealloc_argno): Split out second half of
31836 (fndecl_dealloc_argno): New.
31837 * doc/extend.texi (Common Function Attributes): Document the
31838 interaction between the analyzer and the malloc attribute.
31839 * doc/invoke.texi (Static Analyzer Options): Likewise.
31841 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
31843 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
31844 dwarf_version to 4.
31845 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
31847 2021-01-17 Martin Jambor <mjambor@suse.cz>
31850 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
31853 2021-01-17 Mark Wielaard <mark@klomp.org>
31855 * common.opt (gdwarf-): Init(5).
31856 * doc/invoke.texi (-gdwarf): Document default to 5.
31858 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
31860 * builtin-types.def
31861 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
31863 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
31864 ...this. Add extra argument.
31865 * gimplify.c (omp_default_clause): Ensure that event handle is
31866 firstprivate in a task region.
31867 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
31868 (gimplify_adjust_omp_clauses): Likewise.
31869 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
31870 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
31871 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
31872 if detach clause specified. Add detach argument when generating
31874 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
31876 (finish_taskreg_scan): Move field for variable containing the event
31877 handle to the front of the struct.
31878 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
31880 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
31881 OMP_CLAUSE_DETACH clause.
31882 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
31883 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
31884 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
31886 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
31888 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
31890 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
31892 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
31893 multilib directory names. Use MULTILIB_REQUIRED instead of
31894 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
31895 -mcustom-fpu-cfg=fph2 multilib.
31897 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
31899 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
31900 (nios2_init_fpu_configs): Provide register values for new
31901 -mcustom-fpu-cfg=fph2 option variant.
31902 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
31905 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
31907 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
31908 custom instruction warnings.
31910 2021-01-16 Jakub Jelinek <jakub@redhat.com>
31912 PR tree-optimization/96669
31913 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
31915 2021-01-16 Jakub Jelinek <jakub@redhat.com>
31917 PR tree-optimization/96271
31918 * passes.def: Pass false argument to first two pass_cd_dce
31919 instances and true to last instance. Add comment that
31920 last instance rewrites no longer addressed locals.
31921 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
31923 (pass_cd_dce::set_pass_param): New method.
31924 (pass_cd_dce::execute): Return TODO_update_address_taken from
31925 last cd_dce instance.
31927 2021-01-15 Carl Love <cel@us.ibm.com>
31929 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
31931 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
31932 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
31933 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
31934 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
31935 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
31936 Add builtin define.
31937 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
31938 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
31939 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
31940 New overloaded definitions.
31941 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
31942 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
31943 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
31944 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
31945 P10V_BUILTIN_MULHU_V4SI]: Add case
31946 statement for builtins.
31947 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
31948 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
31949 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
31950 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
31951 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
31952 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
31953 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
31954 Add define_insn, mode is VIlong.
31955 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
31956 Add builtin descriptions.
31958 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
31960 * final.c (final_start_function_1): Reset force_source_line.
31962 2021-01-15 Jakub Jelinek <jakub@redhat.com>
31964 PR tree-optimization/96669
31965 * match.pd (((1 << A) & 1) != 0 -> A == 0,
31966 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
31967 possibly different power of two constants and to right shift too.
31969 2021-01-15 Jakub Jelinek <jakub@redhat.com>
31971 PR tree-optimization/96681
31972 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
31973 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
31974 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
31975 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
31977 2021-01-15 Alexandre Oliva <oliva@adacore.com>
31979 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
31981 2021-01-15 Tamar Christina <tamar.christina@arm.com>
31983 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
31984 cmul<conj_op><mode>3): New.
31985 * config/aarch64/iterators.md (UNSPEC_FCMUL,
31986 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
31987 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
31988 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
31989 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
31990 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
31991 (rot_op): Renamed to conj_op.
31992 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
31993 cmul<conj_op><mode>3): New.
31994 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
31995 cmul<conj_op><mode>3): New.
31997 2021-01-15 David Malcolm <dmalcolm@redhat.com>
32001 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32002 Escape the tempfile name when constructing the expected output.
32004 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32006 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
32008 (aarch64_<su>mlsl_hi<mode>): ... This.
32009 (aarch64_<su>mlsl_hi<mode>): Define.
32010 (*aarch64_<su>mlsl<mode): Rename to...
32011 (aarch64_<su>mlsl<mode): ... This.
32012 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
32013 smlsl_hi, umlsl_hi): Define builtins.
32014 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
32015 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
32016 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
32017 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
32019 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
32021 * config/i386/i386-c.c (ix86_target_macros):
32022 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
32024 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32027 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
32028 * Makefile.in (RTL_SSA_H): New variable.
32029 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
32030 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
32031 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
32033 * config/aarch64/aarch64-cc-fusion.cc: New file.
32035 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32037 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
32038 calling cancel_changes for changes that no longer exist.
32040 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
32042 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
32043 (function_info::reg_defs): ...this.
32044 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
32045 (function_info::reg_defs): ...this.
32047 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32050 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32052 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32055 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32058 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32060 2021-01-15 Richard Biener <rguenther@suse.de>
32062 PR tree-optimization/96376
32063 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
32064 for VMAT_INVARIANT.
32066 2021-01-15 Martin Liska <mliska@suse.cz>
32068 * doc/install.texi: Document that some tests need pytest module.
32069 * doc/sourcebuild.texi: Likewise.
32071 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32074 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
32076 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32078 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
32079 (mve_vshrq_n_u<mode>_imm): Likewise.
32080 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
32081 * config/arm/vec-common.md: ... here.
32083 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
32085 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
32087 * config/arm/neon.md (vashl<mode>3): Delete.
32088 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
32089 (vasl<mode>3): New expander.
32091 2021-01-15 Richard Biener <rguenther@suse.de>
32093 PR tree-optimization/98685
32094 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
32095 of vector extern defs.
32097 2021-01-14 David Malcolm <dmalcolm@redhat.com>
32100 * diagnostic.c (diagnostic_kind_text): Break out this array
32102 (diagnostic_build_prefix): ...here.
32103 (fancy_abort): Detect when diagnostic_initialize has not yet been
32104 called and fall back to a minimal implementation of printing the
32105 ICE, rather than segfaulting in internal_error.
32107 2021-01-14 David Malcolm <dmalcolm@redhat.com>
32109 * diagnostic.c (diagnostic_initialize): Eliminate
32110 parseable_fixits_p in favor of initializing extra_output_kind from
32111 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
32112 (convert_column_unit): New function, split out from...
32113 (diagnostic_converted_column): ...this.
32114 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
32115 Use them to call convert_column_unit on the column values.
32116 (diagnostic_report_diagnostic): Eliminate conditional on
32117 parseable_fixits_p in favor of a switch statement on
32118 extra_output_kind, passing the appropriate values to the new
32119 params of print_parseable_fixits.
32120 (selftest::test_print_parseable_fixits_none): Update for new
32121 params of print_parseable_fixits.
32122 (selftest::test_print_parseable_fixits_insert): Likewise.
32123 (selftest::test_print_parseable_fixits_remove): Likewise.
32124 (selftest::test_print_parseable_fixits_replace): Likewise.
32125 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
32127 (selftest::diagnostic_c_tests): Call it.
32128 * diagnostic.h (enum diagnostics_extra_output_kind): New.
32129 (diagnostic_context::parseable_fixits_p): Delete field in favor
32131 (diagnostic_context::extra_output_kind): ...this new field.
32132 * doc/invoke.texi (Environment Variables): Add
32133 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
32134 * opts.c (common_handle_option): Update handling of
32135 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
32138 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32140 * tree-vect-slp-patterns.c (class complex_operations_pattern,
32141 complex_operations_pattern::matches,
32142 complex_operations_pattern::recognize,
32143 complex_operations_pattern::build): New.
32144 (slp_patterns): Use it.
32146 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32148 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
32149 * optabs.def (cmls_optab, cmls_conj_optab): New.
32150 * doc/md.texi: Document them.
32151 * tree-vect-slp-patterns.c (class complex_fms_pattern,
32152 complex_fms_pattern::matches, complex_fms_pattern::recognize,
32153 complex_fms_pattern::build): New.
32155 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32157 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
32158 * optabs.def (cmla_optab, cmla_conj_optab): New.
32159 * doc/md.texi: Document them.
32160 * tree-vect-slp-patterns.c (vect_match_call_p,
32161 class complex_fma_pattern, vect_slp_reset_pattern,
32162 complex_fma_pattern::matches, complex_fma_pattern::recognize,
32163 complex_fma_pattern::build): New.
32165 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32167 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
32168 * optabs.def (cmul_optab, cmul_conj_optab): New.
32169 * doc/md.texi: Document them.
32170 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
32171 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
32172 vect_build_combine_node, class complex_mul_pattern,
32173 complex_mul_pattern::matches, complex_mul_pattern::recognize,
32174 complex_mul_pattern::build): New.
32176 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32178 * tree-vect-slp.c (optimize_load_redistribution_1): New.
32179 (optimize_load_redistribution, vect_is_slp_load_node): New.
32180 (vect_match_slp_patterns): Use it.
32182 2021-01-14 Tamar Christina <tamar.christina@arm.com>
32184 * tree-vect-slp-patterns.c (complex_add_pattern::build):
32187 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
32189 * config/gcn/mkoffload.c (main): Create an offload image only in
32190 64-bit configurations.
32192 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
32195 * config/i386/i386-options.c (ix86_option_override_internal):
32196 Issue an error for -fcf-protection with CF_BRANCH when compiling
32197 for 32-bit non-TARGET_CMOV targets.
32199 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
32202 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
32203 Remove declaration and initialization of shadow variable "ret".
32204 (ix86_option_override_internal): Remove delcaration of
32205 shadow variable "i". Redeclare shadowed variable to unsigned.
32206 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
32207 * config/i386/i386-builtins.c (get_builtin_code_for_version):
32208 Update for redeclaration.
32209 * config/i386/i386.h (pta_size): Ditto.
32211 2021-01-14 Richard Biener <rguenther@suse.de>
32213 PR tree-optimization/98674
32214 * tree-data-ref.c (base_supports_access_fn_components_p): New.
32215 (initialize_data_dependence_relation): For two bases without
32216 possible access fns resort to type size equality when determining
32217 shape compatibility.
32219 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
32222 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
32223 <=, >= operators in vcle and vcge intrinsics respectively.
32224 * config/arm/arm_neon_builtins.def: Remove entry for
32227 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
32230 * config/i386/i386-options.c (ix86_function_specific_save):
32231 Remove redundant assignment to opts->x_ix86_branch_cost.
32232 * config/i386/i386.c (ix86_prefetch_sse):
32233 Rename from x86_prefetch_sse. Update all uses.
32234 * config/i386/i386.h: Update for rename.
32235 * config/i386/i386-options.h: Ditto.
32237 2021-01-14 Jakub Jelinek <jakub@redhat.com>
32240 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
32241 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
32242 Use Bm instead of m for non-avx. Add isa attribute.
32244 2021-01-14 Jakub Jelinek <jakub@redhat.com>
32246 PR tree-optimization/96688
32247 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
32248 ~X can be simplified.
32250 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
32252 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
32253 IFN_LOAD_LANES results.
32255 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32257 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
32259 (aarch64_xtn<mode>): Likewise.
32260 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
32263 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
32265 (vmovl_s16): Likewise.
32266 (vmovl_s32): Likewise.
32267 (vmovl_u8): Likewise.
32268 (vmovl_u16): Likewise.
32269 (vmovl_u32): Likewise.
32270 (vmovn_s16): Likewise.
32271 (vmovn_s32): Likewise.
32272 (vmovn_s64): Likewise.
32273 (vmovn_u16): Likewise.
32274 (vmovn_u32): Likewise.
32275 (vmovn_u64): Likewise.
32277 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32279 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
32281 (aarch64_<su>qxtn2<mode>_be): Likewise.
32282 (aarch64_<su>qxtn2<mode>): Likewise.
32283 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
32285 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
32286 (su): Handle ss_truncate and us_truncate.
32287 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
32289 (vqmovn_high_s32): Likewise.
32290 (vqmovn_high_s64): Likewise.
32291 (vqmovn_high_u16): Likewise.
32292 (vqmovn_high_u32): Likewise.
32293 (vqmovn_high_u64): Likewise.
32295 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32297 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
32299 (aarch64_xtn2<mode>_be): Likewise.
32300 (aarch64_xtn2<mode>): Likewise.
32301 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
32303 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
32305 (vmovn_high_s32): Likewise.
32306 (vmovn_high_s64): Likewise.
32307 (vmovn_high_u16): Likewise.
32308 (vmovn_high_u32): Likewise.
32309 (vmovn_high_u64): Likewise.
32311 2021-01-13 Stafford Horne <shorne@gmail.com>
32313 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
32315 2021-01-13 Stafford Horne <shorne@gmail.com>
32317 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
32319 2021-01-13 Stafford Horne <shorne@gmail.com>
32321 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
32322 define for __or1k_hard_float__.
32324 2021-01-13 Stafford Horne <shorne@gmail.com>
32326 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
32327 (PROFILE_HOOK): Define to call _mcount.
32328 (FUNCTION_PROFILER): Change from abort to no-op.
32330 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32332 PR tree-optimization/96691
32333 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
32334 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
32335 (~D ^ C) or (D ^ C) can be simplified.
32337 2021-01-13 Richard Biener <rguenther@suse.de>
32339 PR tree-optimization/92645
32340 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
32341 until after vector lowering.
32343 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32345 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
32347 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
32348 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
32350 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32352 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
32354 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
32355 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
32357 2021-01-13 Richard Biener <rguenther@suse.de>
32359 PR tree-optimization/92645
32360 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
32361 BIT_FIELD_REF argument.
32362 (vect_build_slp_tree_2): Record the desired vector type
32363 on the external vector def.
32364 (vectorizable_slp_permutation): Handle required punning
32365 of existing vector defs.
32367 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32369 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
32371 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
32373 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
32375 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
32377 * config.gcc [$target == *-*-gnu*]: Enable
32378 'default_gnu_indirect_function'.
32380 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32383 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
32384 registers before calling targetm.vectorize.vec_perm_const, only after
32386 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
32387 two argument permutation when one operand is zero vector and only
32388 after that force operands into registers.
32389 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
32390 define_insn_and_split pattern.
32391 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
32392 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
32393 (*avx2_zero_extendv8hiv8si2_1): Likewise.
32394 (*avx512f_zero_extendv8siv8di2_1): Likewise.
32395 (*avx2_zero_extendv4siv4di2_1): Likewise.
32396 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
32398 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
32399 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
32400 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
32401 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
32402 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
32403 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
32405 2021-01-13 Martin Liska <mliska@suse.cz>
32407 PR tree-optimization/98455
32408 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
32409 Record also virtual PHIs.
32410 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
32413 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
32415 * doc/invoke.texi (C++ Modules): Fix typos.
32417 2021-01-13 Richard Biener <rguenther@suse.de>
32419 PR tree-optimization/98640
32420 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
32421 handle plus or minus from a truncated operand to be
32424 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32427 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
32428 define_insn_and_split patterns.
32429 (splitter after *btr<mode>_2): New splitter.
32431 2021-01-13 Martin Liska <mliska@suse.cz>
32434 * cgraphunit.c (analyze_functions): Remove dead code.
32436 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
32438 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
32439 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
32440 (a64fx_regmove_cost, a64fx_vector_cost): New.
32441 (a64fx_tunings): Use the new added cost tables.
32443 2021-01-13 Jakub Jelinek <jakub@redhat.com>
32446 * config/i386/predicates.md (pmovzx_parallel): New predicate.
32447 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
32448 define_insn_and_split pattern.
32449 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
32450 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
32452 2021-01-13 Julian Brown <julian@codesourcery.com>
32454 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
32455 to fix v0 register.
32457 2021-01-13 Julian Brown <julian@codesourcery.com>
32459 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
32462 2021-01-13 Julian Brown <julian@codesourcery.com>
32464 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
32465 for reciprocal-approximation instructions.
32466 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
32467 refinement and division result.
32468 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
32470 2021-01-13 Julian Brown <julian@codesourcery.com>
32472 * config/gcn/gcn-valu.md (subdf): Rename to...
32475 2021-01-12 Martin Liska <mliska@suse.cz>
32477 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
32479 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
32481 * function-abi.h: Fix typo.
32483 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
32487 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
32488 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
32489 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
32490 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
32491 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
32492 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
32493 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
32494 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
32495 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
32496 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
32497 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
32498 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
32499 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
32500 (ARM_HAVE_V2DI_LDST): Likewise.
32501 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
32502 (*movmisalign<mode>_mve_load): New pattern.
32503 * config/arm/neon.md (movmisalign<mode>): Move to ...
32504 * config/arm/vec-common.md: ... here.
32506 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
32509 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
32510 of pattern 'plus (plus (hard reg, const), pseudo)'.
32512 2021-01-12 Richard Biener <rguenther@suse.de>
32514 PR tree-optimization/98550
32515 * tree-vect-slp.c (vect_record_max_nunits): Check whether
32516 the group size is a multiple of the vector element count.
32517 (vect_build_slp_tree_1): When we need to fail because
32518 the vector type choosen causes unrolling do so lazily
32519 without affecting matches only at the end to guide group splitting.
32521 2021-01-12 Martin Liska <mliska@suse.cz>
32524 * optc-save-gen.awk: Compare also n_target_save vars with
32527 2021-01-12 Martin Liska <mliska@suse.cz>
32529 * gcov.c (source_info::debug): New.
32530 (print_usage): Add --debug (-D) option.
32531 (process_args): Likewise.
32532 (generate_results): Call src->debug after
32533 accumulate_line_counts.
32534 (read_graph_file): Properly assign id for EXIT_BLOCK.
32535 * profile.c (branch_prob): Dump function body before it is
32538 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32540 PR tree-optimization/98629
32541 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
32542 unless returning non-zero.
32544 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32546 PR tree-optimization/95731
32547 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
32548 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
32549 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
32550 only after optimize_range_tests_var_bound.
32552 2021-01-12 Jakub Jelinek <jakub@redhat.com>
32554 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
32555 * configure: Regenerated.
32557 2021-01-12 liuhongt <hongtao.liu@intel.com>
32560 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
32562 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
32565 2021-01-12 Alexandre Oliva <oliva@adacore.com>
32567 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
32569 (auto_end_imm_use_stmt_traverse): New struct.
32570 (FOR_EACH_IMM_USE_STMT): Use it.
32571 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
32573 * gimple-ssa-strength-reduction.c: ... here, ...
32574 * graphite-scop-detection.c: ... here, ...
32575 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
32576 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
32577 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
32578 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
32579 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
32580 * tree-vect-slp.c: ... and here, ...
32581 * doc/tree-ssa.texi: ... and the example here.
32583 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32585 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
32586 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
32587 (*sdiv_pow2<mode>3): New pattern.
32588 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
32589 Wrap the ASRD in an UNSPEC_PRED_X.
32590 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
32591 predicate with a constant PTRUE, if it isn't already.
32592 (*cond_<sve_int_op><mode>_z): Replace with...
32593 (*cond_<sve_int_op><mode>_any): ...this new pattern.
32595 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32597 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
32598 SVE_FULL_I to SVE_I.
32599 (*cond_bic<mode>_any): Likewise.
32601 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32603 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
32604 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
32607 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32609 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
32610 SVE_FULL_I to SVE_I.
32611 (*aarch64_cond_<su>abd<mode>_2): Likewise.
32612 (*aarch64_cond_<su>abd<mode>_any): Likewise.
32613 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
32614 for the max and min but not for the minus.
32615 (*aarch64_cond_<su>abd<mode>_3): New pattern.
32617 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32619 * config/aarch64/iterators.md (SVE_24I): New iterator.
32620 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
32621 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
32623 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32625 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
32626 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
32628 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
32629 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
32630 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
32631 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
32633 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32635 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
32636 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
32637 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
32640 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
32642 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
32643 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
32644 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
32646 2021-01-11 Martin Liska <mliska@suse.cz>
32649 * symtab-clones.h (clone_info::release): Release
32650 symtab::m_clones with ggc_delete as it's a GGC memory.
32652 2021-01-11 Matthias Klose <doko@ubuntu.com>
32654 * Makefile.in (LINK_PROGRESS): Show the link target.
32656 2021-01-11 Richard Biener <rguenther@suse.de>
32658 PR tree-optimization/91403
32659 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
32660 single-element interleaving group size at 4096 elements.
32662 2021-01-11 Richard Biener <rguenther@suse.de>
32664 PR tree-optimization/98526
32665 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
32666 of the actual reduction op for the regular case.
32667 (vectorizable_reduction): Cost the stmts
32668 vect_transform_reduction produces here.
32670 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
32672 * tree-ssa-forwprop.c (simplify_vector_constructor): For
32673 big-endian, use UNPACK[_FLOAT]_HI.
32675 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32677 * tree-vect-slp-patterns.c (class complex_pattern,
32678 class complex_add_pattern): Add parameters to matches.
32679 (complex_add_pattern::build): Free memory.
32680 (complex_add_pattern::matches): Move validation end of match.
32681 (complex_add_pattern::recognize): Likewise.
32683 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32685 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
32687 2021-01-11 Tamar Christina <tamar.christina@arm.com>
32689 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
32691 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32693 PR tree-optimization/95867
32694 * tree-ssa-math-opts.h: New header.
32695 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
32696 (powi_as_mults): No longer static. Use build_one_cst instead of
32697 build_real. Formatting fix.
32698 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
32699 (attempt_builtin_powi): Handle multiplication reassociation without
32700 powi_fndecl using powi_as_mults.
32701 (reassociate_bb): For integral types don't require
32702 -funsafe-math-optimizations to call attempt_builtin_powi.
32704 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32706 PR tree-optimization/95852
32707 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
32708 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
32709 allow in the bb any of the stmts in that vector, div_stmt and
32710 up to 3 cast stmts.
32711 (arith_cast_equal_p): New function.
32712 (arith_overflow_check_p): Add cast_stmt argument, handle signed
32713 multiply overflow checks.
32714 (match_arith_overflow): Adjust caller. Handle signed multiply
32717 2021-01-11 Jakub Jelinek <jakub@redhat.com>
32719 PR tree-optimization/95852
32720 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
32721 (uaddsub_overflow_check_p): Renamed to ...
32722 (arith_overflow_check_p): ... this. Handle also multiplication
32723 with overflow check.
32724 (match_uaddsub_overflow): Renamed to ...
32725 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
32726 also multiplication with overflow check. Adjust function comment.
32727 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
32728 match_arith_overflow also for MULT_EXPR.
32730 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32732 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
32733 __builtin_convertvector.
32734 (vmovl_s16): Likewise.
32735 (vmovl_s32): Likewise.
32736 (vmovl_u8): Likewise.
32737 (vmovl_u16): Likewise.
32738 (vmovl_u32): Likewise.
32739 (vmovn_s16): Likewise.
32740 (vmovn_s32): Likewise.
32741 (vmovn_s64): Likewise.
32742 (vmovn_u16): Likewise.
32743 (vmovn_u32): Likewise.
32744 (vmovn_u64): Likewise.
32746 2021-01-11 Martin Liska <mliska@suse.cz>
32748 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
32749 (if_chain::is_beneficial): Delete clusters
32750 (find_conditions): Make second argument of conditions_in_bbs a
32751 pointer so that we control over it's lifetime.
32752 (pass_if_to_switch::execute): Delete them.
32754 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
32756 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
32759 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32761 * config/vax/vax.md (cc): Remove mode attribute.
32762 (subst_<cc>, subst_f<cc>): Rename to...
32763 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
32764 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
32765 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
32766 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
32768 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32770 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
32771 `const_double_zero'.
32773 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32775 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
32776 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
32779 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
32781 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
32783 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
32784 with `const_double_zero'.
32785 * doc/rtl.texi (Constant Expression Types): Document it.
32787 2021-01-09 Jakub Jelinek <jakub@redhat.com>
32790 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
32791 POINTER_DIFF_EXPR to be any integral type.
32793 2021-01-09 Jakub Jelinek <jakub@redhat.com>
32795 PR rtl-optimization/98603
32796 * function.c (instantiate_virtual_regs_in_insn): For asm goto
32797 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
32798 if any, set ASM_OPERANDS mode to VOIDmode and change
32799 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
32801 2021-01-09 Alexandre Oliva <oliva@gnu.org>
32804 * final.c (notice_source_line): Narrow down the condition to
32805 skip a line-0 marker.
32807 2021-01-08 Sergei Trofimovich <siarheit@google.com>
32809 * ipa-modref.c (merge_call_side_effects): Fix
32810 linebreak split by reordering two print calls.
32812 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
32814 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
32815 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
32816 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
32817 consistency, fix constraint.
32819 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
32821 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
32822 callables instead of mask values.
32823 (struct target_flag_set_p): New predicate.
32824 (s390_cpu_cpp_builtins_internal): Define or undefine
32825 __LONG_DOUBLE_VX__ macro.
32827 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
32830 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
32831 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
32834 2021-01-08 Richard Biener <rguenther@suse.de>
32836 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
32838 2021-01-08 Richard Biener <rguenther@suse.de>
32840 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
32841 (vect_build_slp_tree): On cache hit release the matched
32842 scalar stmts vector.
32843 * tree-vect-stmts.c (vectorizable_store): Properly free
32844 vec_oprnds before possibly gathering them again.
32846 2021-01-08 Richard Biener <rguenther@suse.de>
32848 PR tree-optimization/98544
32849 * tree-vect-slp.c (vect_optimize_slp): Always materialize
32850 permutes at a permute node.
32852 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
32855 * config/i386/i386.c (x86_function_profiler): Use R10 to call
32856 mcount in large model. Sorry for large model with PIC.
32858 2021-01-08 Jakub Jelinek <jakub@redhat.com>
32861 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
32862 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
32863 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
32864 TargetSave and initialize for variables with enum types.
32865 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
32866 mstack-protector-guard-symbol=): Add Save.
32867 * config/i386/i386-options.c (ix86_function_specific_save,
32868 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
32869 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
32870 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
32871 x_ix86_veclibabi_type.
32873 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
32875 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
32876 SVE_FULL_I to SVE_I.
32877 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
32879 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
32881 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
32882 SVE_FULL_I to SVE_I.
32883 (*cond_uxt<mode>_any): Likewise.
32885 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32887 * config/aarch64/iterators.md (Vwhalf): New iterator.
32888 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
32890 (aarch64_<sur>adalp<mode>): ... This. Make more
32892 (<sur>sadv16qi): Adjust callsite of the above.
32893 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
32895 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
32897 (vpadal_s16): Likewise.
32898 (vpadal_u8): Likewise.
32899 (vpadal_u16): Likewise.
32900 (vpadalq_s8): Likewise.
32901 (vpadalq_s16): Likewise.
32902 (vpadalq_s32): Likewise.
32903 (vpadalq_u8): Likewise.
32904 (vpadalq_u16): Likewise.
32905 (vpadalq_u32): Likewise.
32907 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32909 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
32911 (aarch64_<su>abd<mode>): ... This.
32912 (<sur>sadv16qi): Adjust callsite of the above.
32913 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
32915 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
32917 (vabd_s16): Likewise.
32918 (vabd_s32): Likewise.
32919 (vabd_u8): Likewise.
32920 (vabd_u16): Likewise.
32921 (vabd_u32): Likewise.
32922 (vabdq_s8): Likewise.
32923 (vabdq_s16): Likewise.
32924 (vabdq_s32): Likewise.
32925 (vabdq_u8): Likewise.
32926 (vabdq_u16): Likewise.
32927 (vabdq_u32): Likewise.
32929 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32931 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
32933 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
32934 (vaba_s16): Likewise.
32935 (vaba_s32): Likewise.
32936 (vaba_u8): Likewise.
32937 (vaba_u16): Likewise.
32938 (vaba_u32): Likewise.
32939 (vabaq_s8): Likewise.
32940 (vabaq_s16): Likewise.
32941 (vabaq_s32): Likewise.
32942 (vabaq_u8): Likewise.
32943 (vabaq_u16): Likewise.
32944 (vabaq_u32): Likewise.
32946 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
32948 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
32949 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
32950 Change RTL pattern to match.
32952 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
32954 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
32955 * config/riscv/riscv-c.c (riscv-subset.h): New.
32956 (INCLUDE_STRING): Define.
32957 (riscv_cpu_cpp_builtins): Add new style architecture extension
32959 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
32960 (riscv_subset_list::end): New.
32961 (riscv_current_subset_list): New.
32963 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
32965 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
32966 Move to riscv-subset.h.
32967 (struct riscv_subset_t): Ditto.
32968 (class riscv_subset_list): Ditto.
32969 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
32970 from riscv-common.c.
32971 (struct riscv_subset_t): Ditto.
32972 (class riscv_subset_list): Ditto.
32973 * config/riscv/t-riscv ($(common_out_file)): Add file
32976 2021-01-07 Jakub Jelinek <jakub@redhat.com>
32979 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
32980 New define_insn patterns.
32982 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
32984 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
32985 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
32986 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
32988 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
32990 PR tree-optimization/98560
32991 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
32992 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
32994 (vec_cond_direct): Likewise argument 2.
32995 (vec_condu_direct, vec_condeq_direct): Delete.
32996 (expand_vect_cond_optab_fn): Rename to...
32997 (expand_vec_cond_optab_fn): ...this, replacing old macro.
32998 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
32999 (expand_vect_cond_mask_optab_fn): Rename to...
33000 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
33001 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
33003 (direct_vec_cond_optab_supported_p): Likewise.
33004 (direct_vec_condu_optab_supported_p): Delete.
33005 (direct_vec_condeq_optab_supported_p): Delete.
33006 * gimple-isel.cc: Include internal-fn.h.
33007 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
33010 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
33012 PR tree-optimization/98560
33013 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
33014 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
33016 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
33018 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
33019 rotate_insn and optab code attributes.
33020 Update all uses to merged code attribute.
33021 * config/i386/sse.md: Update all uses to merged code attribute.
33022 * config/i386/mmx.md: Update all uses to merged code attribute.
33024 2021-01-07 Jakub Jelinek <jakub@redhat.com>
33026 PR tree-optimization/98568
33027 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
33028 (bswap_replace): Use it.
33030 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
33032 PR rtl-optimization/97978
33033 * lra-int.h (lra_hard_reg_split_p): New external.
33034 * lra.c (lra_hard_reg_split_p): New global.
33035 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
33036 * lra-assigns.c (lra_assign): Don't check allocation correctness
33037 after hard reg splitting.
33039 2021-01-06 Martin Sebor <msebor@redhat.com>
33042 * builtins.c (new_delete_mismatch_p): New overload.
33043 (new_delete_mismatch_p (tree, tree)): Call it.
33045 2021-01-06 Alexandre Oliva <oliva@adacore.com>
33047 * Makefile.in (T_GLIMITS_H): New.
33048 (stmp-int-hdrs): Depend on it, use it.
33049 * config/t-vxworks (T_GLIMITS_H): Override it.
33050 (vxw-glimits.h): New.
33052 2021-01-06 Richard Biener <rguenther@suse.de>
33054 PR tree-optimization/98513
33055 * value-range.cc (intersect_ranges): Compare the upper bounds
33056 for the expected relation.
33058 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
33061 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
33063 * doc/standards.texi (HSAIL): Remove section.
33065 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
33067 * configure: Re-generate.
33069 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33071 * doc/invoke.texi (-std=c++20): Adjust for the publication of
33072 ISO 14882:2020 standard.
33073 * doc/standards.texi: Likewise.
33075 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33077 PR tree-optimization/94802
33078 * expr.h (maybe_optimize_sub_cmp_0): Declare.
33079 * expr.c: Include tree-pretty-print.h and flags.h.
33080 (maybe_optimize_sub_cmp_0): New function.
33081 (do_store_flag): Use it.
33082 * cfgexpand.c (expand_gimple_cond): Likewise.
33084 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33086 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
33087 * rtlanal.c (simple_regno_set): Tweak description to clarify the
33090 2021-01-05 Richard Biener <rguenther@suse.de>
33092 PR tree-optimization/98516
33093 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
33094 lanes when materializing on a VEC_PERM node.
33095 (vectorizable_slp_permutation): Dump the permute properly.
33097 2021-01-05 Richard Biener <rguenther@suse.de>
33099 * tree-vect-slp.c (vect_slp_region): Move debug counter
33100 to cover individual subgraphs.
33102 2021-01-05 Richard Biener <rguenther@suse.de>
33104 PR tree-optimization/98428
33105 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
33106 vector lane extracts for loop vectorization.
33108 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33110 PR tree-optimization/98514
33111 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
33113 (operand_rank): Change type from hash_map<tree, long> to
33114 hash_map<tree, int64_t>.
33115 (phi_rank): Change return type from long to int64_t.
33116 (loop_carried_phi): Change block_rank variable type from long to
33118 (propagate_rank): Change return type, rank parameter type and
33119 op_rank variable type from long to int64_t.
33120 (find_operand_rank): Change return type from long to int64_t
33121 and change slot variable type from long * to int64_t *.
33122 (insert_operand_rank): Change rank parameter type from long to
33124 (get_rank): Change return type and rank variable type from long to
33125 int64_t. Use PRId64 instead of ld to print the rank.
33126 (init_reassoc): Change rank variable type from long to int64_t
33127 and adjust correspondingly bb_rank and operand_rank initialization.
33129 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33131 PR tree-optimization/96928
33132 * tree-ssa-phiopt.c (xor_replacement): New function.
33133 (tree_ssa_phiopt_worker): Call it.
33135 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33137 PR tree-optimization/96930
33138 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
33139 from narrower value which has the same type as 1 << B, perform
33140 the right shift on the narrower value followed by extension.
33142 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33144 PR tree-optimization/96239
33145 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
33147 (get_status_for_store_merging): Don't return BB_INVALID for blocks
33148 with potential bswap optimizable CONSTRUCTORs.
33149 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
33152 2021-01-05 Richard Biener <rguenther@suse.de>
33154 PR tree-optimization/98381
33155 * tree.c (vector_element_bits): Properly compute bool vector
33157 * tree-vect-loop.c (vectorizable_live_operation): Properly
33158 compute the last lane bit offset.
33160 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
33163 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
33164 Clear the top 64 bytes of the input XMM register.
33165 (sse_cvttps2pi): Ditto.
33167 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
33170 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
33172 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
33175 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
33178 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
33180 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
33181 (umaddsidi4_split): Likewise.
33183 2021-01-05 liuhongt <hongtao.liu@intel.com>
33186 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
33187 define_insn_and_split for zero_extend of subreg HI of pmovskb
33189 (*sse2_pmovskb_zexthisi): Add new combine splitters for
33190 zero_extend of not of subreg HI of pmovskb result.
33192 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33195 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
33197 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
33198 convert_memory_address to convert symbolic immediates to ptr_mode
33199 before forcing them to memory.
33201 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33203 PR rtl-optimization/97144
33204 * recog.c (constrain_operands): Initialize matching_operand
33205 for each alternative, rather than only doing it once.
33207 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33209 PR rtl-optimization/98403
33210 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
33211 why we don't remove call clobbers.
33212 (function_info::apply_changes_to_insn): Don't attempt to add
33213 call clobbers here.
33215 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
33217 PR tree-optimization/98371
33218 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
33219 (vect_analyze_loop): If an epilogue loop appears to be cheaper
33220 than the main loop, re-analyze it as a main loop before adopting
33223 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
33226 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
33227 * aclocal.m4, configure: Regenerate.
33228 * Makefile.in (NETLIBS): Define.
33229 (BACKEND): Remove $(CODYLIB).
33231 2021-01-05 Jakub Jelinek <jakub@redhat.com>
33233 PR rtl-optimization/98334
33234 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
33235 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
33237 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
33239 * tree-inline.c (expand_call_inline): Restore input_location.
33240 Return result from recursive call.
33242 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33244 PR tree-optimization/95401
33245 * config/aarch64/aarch64-sve-builtins.cc
33246 (gimple_folder::load_store_cookie): Use bits rather than bytes
33247 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
33248 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
33249 * tree-vect-stmts.c (vectorizable_store): Likewise.
33250 (vectorizable_load): Likewise.
33252 2021-01-04 Richard Biener <rguenther@suse.de>
33254 PR tree-optimization/98308
33255 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
33258 2021-01-04 Jakub Jelinek <jakub@redhat.com>
33260 PR tree-optimization/95771
33261 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
33262 with precision smaller than int's precision and types with precision
33263 twice as large as long long. Formatting fixes.
33265 2021-01-04 Richard Biener <rguenther@suse.de>
33267 PR tree-optimization/98464
33268 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
33269 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
33270 (process_bb): Adjust.
33272 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
33275 * doc/invoke.texi (-fsanitize=address): Fix wording describing
33276 clash with -fsanitize=hwaddress.
33278 2021-01-04 Richard Biener <rguenther@suse.de>
33280 PR tree-optimization/98282
33281 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
33282 invariants as VN_NARY.
33284 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33287 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
33288 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
33289 to handle zero operands.
33291 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
33293 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
33294 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
33295 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
33298 2021-01-04 Richard Biener <rguenther@suse.de>
33300 PR tree-optimization/98393
33301 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
33302 when hitting the limit.
33304 2021-01-04 Richard Biener <rguenther@suse.de>
33306 PR tree-optimization/98291
33307 * tree-vect-loop.c (vectorizable_reduction): Bypass
33308 associativity check for SLP reductions with VF 1.
33310 2021-01-04 Jakub Jelinek <jakub@redhat.com>
33312 PR tree-optimization/96782
33313 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
33315 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
33317 * collect-utils.c (collect_execute): Check dumppfx.
33318 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
33319 to collect_execute.
33320 (do_link): Add new parameter atsuffix.
33321 (main): Handle -dumpdir option. Skip one argument for
33322 -o, -isystem and -B options.
33323 * gcc.c (make_at_file): New helper function.
33324 (close_at_file): Use it.
33326 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33328 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
33329 Amend handling for LD64_VERSION fallback defaults.
33331 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33333 * config.gcc: Compute default version information
33334 from the configured target. Likewise defaults for
33336 * config/darwin10.h: Removed.
33337 * config/darwin12.h: Removed.
33338 * config/darwin9.h: Removed.
33339 * config/rs6000/darwin8.h: Removed.
33341 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33343 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
33345 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33347 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
33348 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
33350 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33352 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
33354 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
33356 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33358 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
33359 for the Darwin10 unwinder stub from here ...
33360 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
33362 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
33364 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
33365 (ASM_DEBUG_SPEC):Only define if the assembler supports
33367 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
33368 (DARWIN_PREFER_DWARF): Define.
33369 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
33370 (DARWIN_PREFER_DWARF): Likewise
33371 (DSYMUTIL_SPEC): Likewise.
33372 (COLLECT_RUN_DSYMUTIL): Likewise.
33373 (ASM_DEBUG_SPEC): Likewise.
33374 (ASM_DEBUG_OPTION_SPEC): Likewise.
33376 2021-01-02 Jan Hubicka <jh@suse.cz>
33378 * cfg.c (free_block): ggc_free bb.
33380 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33382 * gcc.c (process_command): Update copyright notice dates.
33383 * gcov-dump.c (print_version): Ditto.
33384 * gcov.c (print_version): Ditto.
33385 * gcov-tool.c (print_version): Ditto.
33386 * gengtype.c (create_file): Ditto.
33387 * doc/cpp.texi: Bump @copying's copyright year.
33388 * doc/cppinternals.texi: Ditto.
33389 * doc/gcc.texi: Ditto.
33390 * doc/gccint.texi: Ditto.
33391 * doc/gcov.texi: Ditto.
33392 * doc/install.texi: Ditto.
33393 * doc/invoke.texi: Ditto.
33395 2021-01-01 Jakub Jelinek <jakub@redhat.com>
33397 * ChangeLog-2020: Rotate ChangeLog. New file.
33400 Copyright (C) 2021 Free Software Foundation, Inc.
33402 Copying and distribution of this file, with or without modification,
33403 are permitted in any medium without royalty provided the copyright
33404 notice and this notice are preserved.