1 2021-11-01 Martin Liska <mliska@suse.cz>
3 * opt-functions.awk: Add new sanity checking.
4 * optc-gen.awk: Add new argument to integer_range_info.
5 * params.opt: Update 2 params which have negative IntegerRange.
7 2021-11-01 qing zhao <qing.zhao@oracle.com>
9 * gimplify.c (gimplify_decl_expr): Do not add call to
10 __builtin_clear_padding when a variable is a gimple register
11 or it might not have padding.
12 (gimplify_init_constructor): Likewise.
14 2021-11-01 Tamar Christina <tamar.christina@arm.com>
16 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
17 movi, dup and extract costing fields.
18 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
19 thunderx_extra_costs, thunderx2t99_extra_costs,
20 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
22 * config/arm/aarch-cost-tables.h (generic_extra_costs,
23 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
24 exynosm1_extra_costs, xgene1_extra_costs): Likewise
25 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
26 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
28 2021-11-01 Tamar Christina <tamar.christina@arm.com>
30 * cse.c (add_to_set): New.
31 (find_sets_in_insn): Register constants in sets.
32 (canonicalize_insn): Use auto_vec instead.
33 (cse_insn): Try materializing using vec_dup.
34 * rtl.h (simplify_context::simplify_gen_vec_select,
35 simplify_gen_vec_select): New.
36 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
38 2021-11-01 David Malcolm <dmalcolm@redhat.com>
40 * common.opt (fdiagnostics-escape-format=): New.
41 (diagnostics_escape_format): New enum.
42 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
43 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
44 * diagnostic-format-json.cc (json_end_diagnostic): Add
45 "escape-source" attribute.
46 * diagnostic-show-locus.c
47 (exploc_with_display_col::exploc_with_display_col): Replace
48 "tabstop" param with a cpp_char_column_policy and add an "aspect"
49 param. Use these to compute m_display_col accordingly.
50 (struct char_display_policy): New struct.
51 (layout::m_policy): New field.
52 (layout::m_escape_on_output): New field.
53 (def_policy): New function.
54 (make_range): Update for changes to exploc_with_display_col ctor.
55 (default_print_decoded_ch): New.
56 (width_per_escaped_byte): New.
57 (escape_as_bytes_width): New.
58 (escape_as_bytes_print): New.
59 (escape_as_unicode_width): New.
60 (escape_as_unicode_print): New.
62 (layout::layout): Initialize new fields. Update m_exploc ctor
63 call for above change to ctor.
64 (layout::maybe_add_location_range): Update for changes to
65 exploc_with_display_col ctor.
66 (layout::calculate_x_offset_display): Update for change to
68 (layout::print_source_line): Pass policy
69 to cpp_display_width_computation. Capture cpp_decoded_char when
70 calling process_next_codepoint. Move printing of source code to
72 (line_label::line_label): Pass in policy rather than context.
73 (layout::print_any_labels): Update for change to line_label ctor.
74 (get_affected_range): Pass in policy rather than context, updating
75 calls to location_compute_display_column accordingly.
76 (get_printed_columns): Likewise, also for cpp_display_width.
77 (correction::correction): Pass in policy rather than tabstop.
78 (correction::compute_display_cols): Pass m_policy rather than
79 m_tabstop to cpp_display_width.
80 (correction::m_tabstop): Replace with...
81 (correction::m_policy): ...this.
82 (line_corrections::line_corrections): Pass in policy rather than
84 (line_corrections::m_context): Replace with...
85 (line_corrections::m_policy): ...this.
86 (line_corrections::add_hint): Update to use m_policy rather than
88 (line_corrections::add_hint): Likewise.
89 (layout::print_trailing_fixits): Likewise.
90 (selftest::test_display_widths): New.
91 (selftest::test_layout_x_offset_display_utf8): Update to use
92 policy rather than tabstop.
93 (selftest::test_one_liner_labels_utf8): Add test of escaping
95 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
96 use policy rather than tabstop.
97 (selftest::test_overlapped_fixit_printing): Likewise.
98 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
99 (selftest::test_overlapped_fixit_printing_2): Likewise.
100 (selftest::test_tab_expansion): Likewise.
101 (selftest::test_escaping_bytes_1): New.
102 (selftest::test_escaping_bytes_2): New.
103 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
104 * diagnostic.c (diagnostic_initialize): Initialize
105 context->escape_format.
106 (convert_column_unit): Update to use default character width policy.
107 (selftest::test_diagnostic_get_location_text): Likewise.
108 * diagnostic.h (enum diagnostics_escape_format): New enum.
109 (diagnostic_context::escape_format): New field.
110 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
111 (-fdiagnostics-format=): Add "escape-source" attribute to examples
112 of JSON output, and document it.
113 * input.c (location_compute_display_column): Pass in "policy"
114 rather than "tabstop", passing to
115 cpp_byte_column_to_display_column.
116 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
117 * input.h (class cpp_char_column_policy): New forward decl.
118 (location_compute_display_column): Pass in "policy" rather than
120 * opts.c (common_handle_option): Handle
121 OPT_fdiagnostics_escape_format_.
122 * selftest.c (temp_source_file::temp_source_file): New ctor
123 overload taking a size_t.
124 * selftest.h (temp_source_file::temp_source_file): Likewise.
126 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
128 * dbgcnt.def: Add debug counter for back_thread[12] and
130 * passes.def: Pass "first" argument to each back threading pass.
131 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
133 (back_threader::debug_counter): New.
134 (back_threader::maybe_register_path): Call debug_counter.
136 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
138 * tree-ssa-threadbackward.c (BT_NONE): New.
141 (back_threader::back_threader): Add flags.
142 Move loop initialization here.
143 (back_threader::~back_threader): New.
144 (back_threader::find_taken_edge_switch): Change solver and ranger
146 (back_threader::find_taken_edge_cond): Same.
147 (back_threader::find_paths_to_names): Same.
148 (back_threader::find_paths): Same.
149 (back_threader::dump): Same.
150 (try_thread_blocks): Merge into thread_blocks.
151 (back_threader::thread_blocks): New.
152 (do_early_thread_jumps): Merge into thread_blocks.
153 (do_thread_jumps): Merge into thread_blocks.
154 (back_threader::thread_through_all_blocks): Remove.
156 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
158 PR tree-optimization/103003
159 * value-relation.cc (dom_oracle::register_relation): If the 2
160 ssa names are the same, don't register any relation.
162 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
164 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
165 redundant check for calls_eh_return.
166 * config/aarch64/aarch64.md (*do_return): Likewise.
168 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
170 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
171 duplicate_loop_to_header_edge to
172 duplicate_loop_body_to_header_edge.
173 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
174 * cfghooks.h (struct cfg_hooks): Likewise.
175 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
176 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
177 (clone_loop_to_header_edge): Likewise.
178 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
179 * cfgrtl.c (struct cfg_hooks): Likewise.
180 * doc/loop.texi: Likewise.
181 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
182 (unroll_loop_runtime_iterations): Likewise.
183 (unroll_loop_stupid): Likewise.
184 (apply_opt_in_copies): Likewise.
185 * tree-cfg.c (struct cfg_hooks): Likewise.
186 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
187 (try_peel_loop): Likewise.
188 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
189 (gimple_duplicate_loop_body_to_header_edge): Likewise.
190 (tree_transform_and_unroll_loop): Likewise.
191 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
194 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
196 * cfgloopmanip.c (loop_version): Refactor loopify to
197 loop_version. Move condition generation after loopify.
199 * cfgloopmanip.h (loopify): Delete.
201 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
203 * ipa-fnsummary.c: Include tree-dfa.h.
204 (points_to_local_or_readonly_memory_p): Return true on return
206 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
209 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
212 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
213 internal Fortran name; new permit_num_teams arg to permit
214 omp_get_num_teams and omp_get_team_num.
215 (scan_omp_1_stmt): Update call to it, add missing call for
216 reverse offload, and check for strictly nested API calls in teams.
218 2021-10-30 Jakub Jelinek <jakub@redhat.com>
220 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
222 2021-10-29 David Malcolm <dmalcolm@redhat.com>
224 * selftest.c (assert_streq): Add newlines when emitting non-equal
227 2021-10-29 David Malcolm <dmalcolm@redhat.com>
229 * Makefile.in: Fix syntax for reference to LIBDEPS in
232 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
234 * doc/install.texi: Bump required minimum DejaGnu version.
236 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
237 Andrew MacLeod <amacleod@redhat.com>
239 * value-relation.cc (path_oracle::killing_def): Add a
240 self-equivalence so we don't look to the root oracle.
242 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
244 * passes.def: Replace the pass_thread_jumps before VRP* with
245 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
246 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
247 Remove hyphen from "thread-full" name.
249 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
251 PR tree-optimization/102983
252 * gimple-range-cache.h (propagate_updated_value): Make public.
253 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
254 when processing gcond stmts.
256 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
258 * gimple.c (gimple_call_retslot_flags): New function.
259 * gimple.h (gimple_call_retslot_flags): Declare.
260 * ipa-modref.c: Include tree-cfg.h.
261 (struct escape_entry): Turn parm_index to signed.
262 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
263 (modref_summary::modref_summary): Initialize retslot_flags.
264 (struct modref_summary_lto): Likewise.
265 (modref_summary::useful_p): Check retslot_flags.
266 (modref_summary_lto::useful_p): Likewise.
267 (modref_summary::dump): Dump retslot_flags.
268 (modref_summary_lto::dump): Likewise.
269 (struct escape_point): Add hidden_args enum.
270 (analyze_ssa_name_flags): Ignore return slot return;
271 use gimple_call_retslot_flags.
272 (record_escape_points): Break out from ...
273 (analyze_parms): ... here; handle retslot_flags.
274 (modref_summaries::duplicate): Duplicate retslot_flags.
275 (modref_summaries_lto::duplicate): Likewise.
276 (modref_write_escape_summary): Stream parm_index as signed.
277 (modref_read_escape_summary): Likewise.
278 (modref_write): Stream retslot_flags.
279 (read_section): Likewise.
280 (struct escape_map): Fix typo in comment.
281 (update_escape_summary_1): Fix whitespace.
282 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
283 (modref_merge_call_site_flags): Merge retslot_flags.
284 * ipa-modref.h (struct modref_summary): Add retslot_flags.
285 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
287 2021-10-29 Tamar Christina <tamar.christina@arm.com>
289 PR tree-optimization/102977
290 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
291 (vect_detect_pair_op): Add crosslane check.
292 (vect_match_call_complex_mla): Remove.
293 (class complex_mul_pattern): Update comment.
294 (complex_mul_pattern::matches): Update detection.
295 (class complex_fma_pattern): Remove.
296 (complex_fma_pattern::matches): Remove.
297 (complex_fma_pattern::recognize): Remove.
298 (complex_fma_pattern::build): Remove.
299 (class complex_fms_pattern): Update comment.
300 (complex_fms_pattern::matches): Remove.
301 (complex_operations_pattern::recognize): Remove complex_fma_pattern
303 2021-10-29 Jakub Jelinek <jakub@redhat.com>
305 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
308 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
311 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
312 patterns match and emit for VSX xxpermdi.
314 2021-10-29 liuhongt <hongtao.liu@intel.com>
317 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
319 (V16HF_FTYPE_V16HF): Ditto.
320 (V32HF_FTYPE_V32HF): Ditto.
321 (V8HF_FTYPE_V8HF_ROUND): Ditto.
322 (V16HF_FTYPE_V16HF_ROUND): Ditto.
323 (V32HF_FTYPE_V32HF_ROUND): Ditto.
324 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
325 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
326 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
327 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
328 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
329 * config/i386/i386-builtins.c
330 (ix86_builtin_vectorized_function): Enable vectorization for
331 HFmode FLOOR/CEIL/TRUNC operation.
332 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
334 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
337 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
338 Andrew MacLeod <amacleod@redhat.com>
340 * value-relation.cc (path_oracle::killing_def): Walk the
341 equivalency list and remove SSA from any equivalencies.
343 2021-10-28 Stafford Horne <shorne@gmail.com>
345 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
348 2021-10-28 Jakub Jelinek <jakub@redhat.com>
350 PR tree-optimization/102951
351 * fold-const.h (address_compare): Declare.
352 * fold-const.c (address_compare): New function.
353 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
354 address_compare helper.
355 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
358 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
360 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
362 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
364 PR tree-optimization/102940
365 * tree-vrp.c (execute_ranger_vrp): Reset scev.
367 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
369 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
371 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
373 * configure: Regenerate.
374 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
376 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
378 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
380 2021-10-28 Richard Biener <rguenther@suse.de>
383 * fold-const.c (fold_convert_const): Avoid int to float
384 constant folding with -frounding-math and inexact result.
385 * simplify-rtx.c (simplify_const_unary_operation): Likewise
386 for both float and unsigned_float.
388 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
390 * tree-ssa-threadbackward.c
391 (back_threader::find_taken_edge_switch): Use find_case_label_range
392 instead of find_taken_edge.
394 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
396 * tree-ssa-threadbackward.c (class back_threader_registry):
397 Inherit from back_jt_path_registry.
398 (back_threader_registry::thread_through_all_blocks): Remove.
399 (back_threader_registry::register_path): Remove
400 m_lowlevel_registry prefix.
402 2021-10-28 Richard Biener <rguenther@suse.de>
405 * fold-const.c (fold_convert_const_real_from_real): Honor
406 -frounding-math if the conversion is not exact.
407 * simplify-rtx.c (simplify_const_unary_operation): Do not
408 simplify FLOAT_TRUNCATE with sign dependent rounding.
410 2021-10-28 Richard Biener <rguenther@suse.de>
412 PR tree-optimization/102949
413 * tree-vect-stmts.c (ensure_base_align): Look at the
414 dr_info of a group leader and assert we are looking at
415 one with analyzed alignment.
417 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
420 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
421 V1T1 mode for unaligned load and store.
423 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
425 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
428 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
430 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
433 2021-10-28 Alexandre Oliva <oliva@adacore.com>
435 * common.opt (fharden-compares): New.
436 (fharden-conditional-branches): New.
437 * doc/invoke.texi: Document new options.
438 * gimple-harden-conditionals.cc: New.
439 * Makefile.in (OBJS): Build it.
440 * passes.def: Add new passes.
441 * tree-pass.h (make_pass_harden_compares): Declare.
442 (make_pass_harden_conditional_branches): Declare.
444 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
447 * config/rs6000/altivec.md: Add vsx register constraints.
448 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
449 (vsx_xxsel<mode>2): Likewise.
450 (vsx_xxsel<mode>3): Likewise.
451 (vsx_xxsel<mode>4): Likewise.
453 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
456 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
457 (altivec_vsel<mode>): ... this and update define.
458 (*altivec_vsel<mode>_uns): Delete.
459 (altivec_vsel<mode>2): New define_insn.
460 (altivec_vsel<mode>3): Likewise.
461 (altivec_vsel<mode>4): Likewise.
462 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
463 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
465 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
466 selection instead of per element.
467 * config/rs6000/vector.md:
468 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
469 (vsx_xxsel<mode>): ... this and update define.
470 (*vsx_xxsel<mode>_uns): Delete.
471 (vsx_xxsel<mode>2): New define_insn.
472 (vsx_xxsel<mode>3): Likewise.
473 (vsx_xxsel<mode>4): Likewise.
475 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
477 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
478 HFmode rsqrt without TARGET_SSE_MATH.
479 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
480 btrunc condition to be restricted by -ftrapping-math, adjust
481 use_rsqrt_p function call.
482 * config/i386/i386.md (rcphf2): New define_insn.
483 (rsqrthf2): Likewise.
484 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
485 (div<mode>3): New expander for HF mode.
486 (rsqrt<mode>2): Likewise.
487 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
488 (*avx512fp16_vmrsqrtv8hf2): Likewise.
490 2021-10-27 Saagar Jha <saagar@saagarjha.com>
492 * config.gcc: Adjust for Darwin21.
493 * config/darwin-c.c (macosx_version_as_macro): Likewise.
494 * config/darwin-driver.c (validate_macosx_version_min):
496 (darwin_find_version_from_kernel): Likewise.
498 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
500 * gimple-range-path.cc
501 (path_range_query::range_defined_in_block): Call killing_def.
503 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
505 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
506 operands as well as relation.
507 * gimple-range-path.cc
508 (path_range_query::compute_ranges_in_block): Compute PHI relations
509 first. Compute outgoing relations at the end.
510 (path_range_query::compute_ranges): Remove call to compute_relations.
511 (path_range_query::compute_relations): Remove.
512 (path_range_query::maybe_register_phi_relation): New.
513 (path_range_query::compute_phi_relations): Abstract out
514 registering one PHI relation to...
515 (path_range_query::compute_outgoing_relations): ...here.
516 * gimple-range-path.h (class path_range_query): Remove
518 Add maybe_register_phi_relation.
520 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
521 Andrew MacLeod <amacleod@redhat.com>
523 * value-relation.cc (path_oracle::killing_def): Kill second
526 2021-10-27 Martin Jambor <mjambor@suse.cz>
528 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
529 profile feedback depending on their local availability.
531 2021-10-27 Martin Jambor <mjambor@suse.cz>
533 * params.opt (param_ipa_cp_profile_count_base): New parameter.
534 * doc/invoke.texi (Optimize Options): Add entry for
535 ipa-cp-profile-count-base.
536 * ipa-cp.c (max_count): Replace with base_count, replace all
537 occurrences too, unless otherwise stated.
538 (ipcp_cloning_candidate_p): identify mostly-directly called
539 functions based on their counts, not max_count.
540 (compare_edge_profile_counts): New function.
541 (ipcp_propagate_stage): Instead of setting max_count, find the
542 appropriate edge count in a sorted vector of counts of eligible
543 edges and make it the base_count.
545 2021-10-27 Martin Jambor <mjambor@suse.cz>
547 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
548 n_nonrec_calls and itself, document all fields.
549 (init_caller_stats): Initialize the above new fields.
550 (gather_caller_stats): Gather self-recursive counts and calls number.
551 (get_info_about_necessary_edges): Gather counts of self-recursive and
552 other edges bringing in the requested value separately.
553 (dump_profile_updates): Rework to dump info about a single node only.
554 (lenient_count_portion_handling): New function.
555 (struct gather_other_count_struct): New type.
556 (gather_count_of_non_rec_edges): New function.
557 (struct desc_incoming_count_struct): New type.
558 (analyze_clone_icoming_counts): New function.
559 (adjust_clone_incoming_counts): Likewise.
560 (update_counts_for_self_gen_clones): Likewise.
561 (update_profiling_info): Rewritten.
562 (update_specialized_profile): Adjust call to dump_profile_updates.
563 (create_specialized_node): Do not update profiling info.
564 (decide_about_value): New parameter self_gen_clones, either push new
565 clones into it or updat their profile counts. For self-recursively
566 generated values, use a portion of the node count instead of count
567 from self-recursive edges to estimate goodness.
568 (decide_whether_version_node): Gather clones for self-generated values
569 in a new vector, update their profiles at once at the end.
571 2021-10-27 Richard Biener <rguenther@suse.de>
573 * tree-vectorizer.c (vect_transform_loops): New function,
575 (try_vectorize_loop_1): ... here. Simplify as epilogues
576 are now fully handled in the split part.
578 2021-10-27 Jakub Jelinek <jakub@redhat.com>
580 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
581 iterators with pointer types.
582 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
584 2021-10-26 Martin Sebor <msebor@redhat.com>
586 PR tree-optimization/102238
587 PR tree-optimization/102919
588 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
589 (array_elt_at_offset): Move to pointer-query.
590 (set_aggregate_size_and_offset): New function.
591 (field_at_offset): Move to pointer-query.
592 (get_origin_and_offset): Rename...
593 (get_origin_and_offset_r): this. Add an argument. Make aggregate
594 handling more robust.
595 (get_origin_and_offset): New.
596 (alias_offset): Add an argument.
597 (format_string): Use subobject size determined by get_origin_and_offset.
598 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
599 Improve/correct handling of aggregates.
600 (array_elt_at_offset): Same.
601 * pointer-query.h (field_at_offset): Declare.
602 (array_elt_at_offset): Declare.
604 2021-10-26 Martin Sebor <msebor@redhat.com>
606 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
608 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
609 (expand_builtin_fork_or_exec): Same.
610 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
611 compute_objsize additional arguments.
612 (inbounds_memaccess_p): Same.
613 (array_bounds_checker::check_array_bounds): Add an assert. Stash
614 statement in a member.
615 (check_array_bounds_dom_walker::before_dom_children): Same.
616 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
617 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
618 (handle_printf_call): Pass a new argument.
619 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
620 (check_access): Add an argument and pass it along to callees.
621 (check_read_access): Make a member function.
622 (pass_waccess::check_strcat): Pass access_data ctor additional
624 (pass_waccess::check_strncat): Same.
625 (pass_waccess::check_stxcpy): Same.
626 (pass_waccess::check_stxncpy): Same.
627 (pass_waccess::check_strncmp): Same.
628 (pass_waccess::check_read_access): Same.
629 (pass_waccess::check_builtin): Same.
630 (pass_waccess::maybe_check_access_sizes): Same.
631 (pass_waccess::maybe_check_dealloc_call): Same.
632 * gimple-ssa-warn-access.h (check_read_access): Declare a new
634 * pointer-query.cc (compute_objsize_r): Add an argument.
635 (gimple_call_return_array): Same.
636 (gimple_call_alloc_size): Same.
637 (access_ref::access_ref): Same.
638 (access_ref::get_ref): Same.
639 (pointer_query::get_ref): Same.
640 (handle_min_max_size): Pass an arguments to callees.
641 (handle_array_ref): Add an argument.
642 (handle_mem_ref): Same.
643 (compute_objsize): Same.
644 * pointer-query.h (struct access_ref): Adjust signatures.
645 (struct access_data): Same.
646 (gimple_call_alloc_size): Add an argument.
647 (gimple_parm_array_size): Same.
648 (compute_objsize): Same.
649 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
650 argument to compute_objsize.
651 (strlen_pass::maybe_warn_overflow): Same.
652 (maybe_diag_stxncpy_trunc): Same.
654 2021-10-26 Martin Sebor <msebor@redhat.com>
657 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
658 (pass_waccess::check_atomic_builtin): Call it.
660 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
662 PR rtl-optimization/102842
663 * lra-constraints.c (match_reload): Ignore out in checking values
665 (curr_insn_transform): Collect outputs before doing reloads of operands.
667 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
670 * config/rs6000/x86intrin.h: Move some included headers to new
671 headers. Include new immintrin.h instead of those headers.
672 * config/rs6000/immintrin.h: New.
673 * config/rs6000/x86gprintrin.h: New.
674 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
676 2021-10-26 Richard Biener <rguenther@suse.de>
678 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
679 Remove byte_offset parameter.
680 (vect_create_data_ref_ptr): Likewise.
681 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
683 (vect_create_data_ref_ptr): Likewise.
684 * tree-vect-stmts.c (vectorizable_store): Adjust.
685 (vectorizable_load): Likewise.
687 2021-10-26 Richard Biener <rguenther@suse.de>
689 PR tree-optimization/96109
690 * tree-vectorizer.h (dr_misalignment): Add optional offset
692 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
693 offset applied for negative stride accesses.
694 (vect_enhance_data_refs_alignment): Compute negative stride
695 access offset and pass it to dr_misalignment.
696 * tree-vect-stmts.c (get_negative_load_store_type): Pass
697 negative offset to dr_misalignment.
698 (get_group_load_store_type): Likewise.
699 (get_load_store_type): Likewise.
700 (vectorizable_store): Remove asserts about alignment.
701 (vectorizable_load): Likewise.
703 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
705 PR tree-optimization/102897
706 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
708 2021-10-26 Richard Biener <rguenther@suse.de>
710 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
711 Take offset in bytes.
712 (vect_create_data_ref_ptr): Likewise.
713 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
714 offset by element size.
715 (vect_create_cond_for_align_checks): Likewise.
716 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
717 (vectorizable_load): Remove duplicate leftover from merge
720 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
722 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
723 function to expand V1TI mode logical shifts by integer constants.
724 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
725 mode rotations by integer constants.
726 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
727 ix86_expand_v1ti_rotate): Prototype new functions here.
728 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
729 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
731 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
733 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
734 Avoid threading circular paths.
736 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
738 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
739 Attempt to resolve all incoming paths to a PHI.
740 (back_threader::resolve_def): Always return true for PHIs.
742 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
744 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
745 Always try to resolve path without looking back.
746 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
749 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
751 PR tree-optimization/102789
752 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
753 update inits of simd_lane_access.
755 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
757 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
759 (simplify_casted_conds): Move to vrp_folder class.
760 (execute_vrp): Call via vrp_folder now.
761 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
762 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
763 * vr-values.h (simplify_casted_cond): Add prototype.
765 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
767 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
770 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
772 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
774 (_mm_testc_si128): Likewise.
775 (_mm_testnzc_si128): Likewise.
776 (_mm_blend_ps): Likewise.
777 (_mm_blendv_ps): Likewise.
778 (_mm_blend_pd): Likewise.
779 (_mm_blendv_pd): Likewise.
780 (_mm_ceil_pd): Likewise.
781 (_mm_ceil_sd): Likewise.
782 (_mm_ceil_ps): Likewise.
783 (_mm_ceil_ss): Likewise.
784 (_mm_floor_pd): Likewise.
785 (_mm_floor_sd): Likewise.
786 (_mm_floor_ps): Likewise.
787 (_mm_floor_ss): Likewise.
788 (_mm_minpos_epu16): Likewise.
789 (_mm_mul_epi32): Likewise.
790 (_mm_cvtepi8_epi16): Likewise.
791 (_mm_packus_epi32): Likewise.
792 (_mm_cmpgt_epi64): Likewise.
794 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
796 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
797 shifts of the mode's smin_value and smax_value when the bit count
798 operand doesn't have side-effects.
799 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
800 when the bit count operand doesn't have side-effects.
801 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
802 compile-time evaluation of saturating left shifts with constant
805 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
807 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
810 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
812 * flag-types.h (enum ranger_debug): Adjust values.
813 * params.opt (ranger_debug): Ditto.
815 2021-10-25 Martin Jambor <mjambor@suse.cz>
817 PR tree-optimization/102886
818 * tree-sra.c (totally_scalarize_subtree): Fix the out of
821 2021-10-25 Andrew Pinski <apinski@marvell.com>
823 * tree-ssa-dce.c (simple_dce_from_worklist):
824 Check stmt_unremovable_because_of_non_call_eh_p also
825 before removing the statement.
827 2021-10-25 Richard Biener <rguenther@suse.de>
829 PR tree-optimization/102905
830 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
831 Use vect_supportable_dr_alignment again to determine whether
832 an access is supported when not aligned.
834 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
836 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
839 2021-10-25 Jim Wilson <jimw@sifive.com>
840 Kito Cheng <kito.cheng@sifive.com>
841 Jia-Wei Chen <jiawei@iscas.ac.cn>
842 Shi-Hua Liao <shihua@iscas.ac.cn>
844 * config/riscv/bitmanip.md (shiftm1): New.
845 (*bset<mode>): Ditto.
846 (*bset<mode>_mask): Ditto.
847 (*bset<mode>_1): Ditto.
848 (*bset<mode>_1_mask): Ditto.
849 (*bseti<mode>): Ditto.
850 (*bclr<mode>): Ditto.
851 (*bclri<mode>): Ditto.
852 (*binv<mode>): Ditto.
853 (*binvi<mode>): Ditto.
854 (*bext<mode>): Ditto.
856 * config/riscv/predicates.md (splittable_const_int_operand):
858 (single_bit_mask_operand): New.
859 (not_single_bit_mask_operand): Ditto.
860 (const31_operand): Ditto.
861 (const63_operand): Ditto.
862 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
863 (riscv_output_move): Ditto.
864 (riscv_print_operand): Handle new operand type: T and S.
865 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
867 2021-10-25 Jim Wilson <jimw@sifive.com>
869 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
872 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
874 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
876 (riscv_rtx_costs): Ditto.
878 2021-10-25 Jim Wilson <jimw@sifive.com>
879 Kito Cheng <kito.cheng@sifive.com>
880 Jia-Wei Chen <jiawei@iscas.ac.cn>
882 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
883 (bitmanip_minmax): New.
885 (bitmanip_optab): New.
886 (bitmanip_insn): New.
887 (*<optab>_not<mode>): New.
888 (*xor_not<mode>): New.
889 (<bitmanip_optab>si2): New.
890 (*<bitmanip_optab>disi2): New.
891 (<bitmanip_optab>di2): New.
892 (*zero_extendhi<GPR:mode>2_bitmanip): New.
893 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
894 (*zero_extendhi<GPR:mode>2_zbb): New.
902 (<bitmanip_optab><mode>3): New.
903 * config/riscv/riscv.md (type): Add rotate.
904 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
905 (*zero_extendhi<GPR:mode>2): New.
906 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
907 (*extend<SHORT:mode><SUPERQI:mode>2): New.
909 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
911 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
913 (riscv_rtx_costs): Ditto.
915 2021-10-25 Jim Wilson <jimw@sifive.com>
916 Kito Cheng <kito.cheng@sifive.com>
917 Jia-Wei Chen <jiawei@iscas.ac.cn>
919 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
924 (riscv_rtx_costs): Ditto.
925 * config/riscv/riscv.md: Include bitmanip.md
926 (type): Add bitmanip bype.
927 (zero_extendsidi2): Change to define_expand pattern.
928 (*zero_extendsidi2_internal): New.
929 (zero_extendsidi2_shifted): Disable for ZBA.
931 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
933 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
934 Add zba, zbb, zbc and zbs.
935 (riscv_ext_flag_table): Ditto.
936 * config/riscv/riscv-opts.h (MASK_ZBA): New.
944 * config/riscv/riscv.opt (riscv_zb_subext): New.
946 2021-10-25 liuhongt <hongtao.liu@intel.com>
949 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
950 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
952 2021-10-25 Richard Biener <rguenther@suse.de>
954 PR tree-optimization/102920
955 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
956 controlling VN_TOP matching behavior.
957 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
958 (vn_phi_eq): Do not optimistically match VN_TOP.
960 2021-10-25 konglin1 <lingling.kong@intel.com>
962 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
963 define_insn_and_split.
964 (fma_<mode>_fadd_fcmul):Likewise
965 (fma_<complexopname>_<mode>_fma_zero):Likewise
967 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
969 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
970 TARGET_DISABLE_FPREGS.
971 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
973 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
974 cost of hardware integer multiplication.
975 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
976 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
977 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
978 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
979 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
980 SOFT_FLOAT. Revise comment.
981 (msoft-float): New option.
983 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
985 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
987 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
989 * doc/install.texi (Specific): Remove obsolete details
990 around GNU/Linux on Itanium.
991 (Specific): Remove reference to Windows for Itanium.
993 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
995 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
998 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
1000 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
1002 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
1004 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
1005 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
1006 (one_cmplv1ti2): New define expand.
1008 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
1011 * Makefile.in: Handle dvidir and install-dvi target.
1012 * configure: Regenerate.
1013 * configure.ac: Add install-dvi to target_list.
1015 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
1017 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
1020 2021-10-22 Richard Biener <rguenther@suse.de>
1022 PR tree-optimization/102893
1023 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
1024 test for an exit edge.
1026 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
1027 Andrew MacLeod <amacleod@redhat.com>
1029 * gimple-range-path.cc (path_range_query::compute_phi_relations):
1030 Kill any global relations we may know before registering a new
1032 * value-relation.cc (path_oracle::killing_def): New.
1033 * value-relation.h (path_oracle::killing_def): New.
1035 2021-10-22 Richard Biener <rguenther@suse.de>
1038 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
1040 (vn_phi_lookup): Likewise.
1042 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
1045 * doc/invoke.texi: Document -fcf-protection requires i686 or
1048 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
1050 PR middle-end/102764
1051 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
1053 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
1055 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
1056 macro and all invocations.
1058 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
1060 * doc/invoke.texi (ranger-debug): Document.
1061 * flag-types.h (enum ranger_debug): New.
1062 (enum evrp_mode): Remove debug values.
1063 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
1064 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
1065 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
1066 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
1067 (execute_early_vrp): Use evrp-mode directly.
1068 * params.opt (enum evrp_mode): Remove debug values.
1069 (ranger-debug): New.
1070 (ranger-logical-depth): Relocate to be in alphabetical order.
1072 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
1074 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
1075 * flag-types.h: (enum vrp_mode): New.
1076 * params.opt: (vrp1-mode, vrp2-mode): New.
1077 * tree-vrp.c (vrp_pass_num): New.
1078 (pass_vrp::pass_vrp): Set pass number.
1079 (pass_vrp::execute): Choose which VRP mode to execute.
1081 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
1083 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
1084 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
1085 * tree-vrp.c (class rvrp_folder): Relocate here.
1086 (execute_ranger_vrp): New.
1087 * tree-vrp.h (execute_ranger_vrp): Export.
1089 2021-10-21 Martin Liska <mliska@suse.cz>
1093 * opts.c (finish_options): Process flag_var_tracking* options
1094 here as they can be adjusted by optimize attribute.
1095 Process also flag_syntax_only and flag_gtoggle.
1096 * toplev.c (process_options): Remove it here.
1097 * common.opt: Make debug_nonbind_markers_p as PerFunction
1098 attribute as it depends on optimization level.
1100 2021-10-21 Martin Jambor <mjambor@suse.cz>
1102 PR tree-optimization/102505
1103 * tree-sra.c (totally_scalarize_subtree): Check that the
1104 encountered field fits within the acces we would like to put it
1107 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
1109 * tree-ssa-threadbackward.c
1110 (back_threader::maybe_register_path): Remove circular paths check.
1112 2021-10-21 Richard Biener <rguenther@suse.de>
1114 * toplev.c (process_options): Move the initial debug_hooks
1116 (toplev::main): ... before the call of the post_options
1119 2021-10-21 Richard Biener <rguenther@suse.de>
1121 PR tree-optimization/102847
1122 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
1123 load cost in the prologue for VMAT_INVARIANT.
1125 2021-10-21 Richard Biener <rguenther@suse.de>
1127 PR tree-optimization/102847
1128 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
1129 handle VMAT_INVARIANT as a splat in the prologue.
1131 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
1134 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
1135 move to use the same logic as HImode.
1137 2021-10-21 Richard Biener <rguenther@suse.de>
1139 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
1140 superfluous gimple_call_nothrow_p check.
1142 2021-10-21 Andrew Pinski <apinski@marvell.com>
1144 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
1145 Mark the ssa-name of the rhs as one to be removed.
1146 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
1147 Call simple_dce_from_worklist at the end to a simple dce.
1149 2021-10-21 Andrew Pinski <apinski@marvell.com>
1151 * tree-cfg.c (maybe_remove_writeonly_store): New function
1152 factored out from ...
1153 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
1155 2021-10-21 Andrew Pinski <apinski@marvell.com>
1157 * tree-cfg.c (execute_fixup_cfg): Remove comment
1158 about standalone pass.
1160 2021-10-21 Andrew Pinski <apinski@marvell.com>
1162 * tree-cfg.c (execute_fixup_cfg): Output when the statement
1163 is removed when it is a write only var.
1165 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
1167 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
1168 Avoid threading circular paths.
1170 2021-10-20 Alex Coplan <alex.coplan@arm.com>
1172 * calls.c (initialize_argument_information): Remove some dead
1173 code, remove handling for function_arg returning const_int.
1174 * doc/tm.texi: Delete documentation for unused target hooks.
1175 * doc/tm.texi.in: Likewise.
1176 * target.def (load_bounds_for_arg): Delete.
1177 (store_bounds_for_arg): Delete.
1178 (load_returned_bounds): Delete.
1179 (store_returned_bounds): Delete.
1180 * targhooks.c (default_load_bounds_for_arg): Delete.
1181 (default_store_bounds_for_arg): Delete.
1182 (default_load_returned_bounds): Delete.
1183 (default_store_returned_bounds): Delete.
1184 * targhooks.h (default_load_bounds_for_arg): Delete.
1185 (default_store_bounds_for_arg): Delete.
1186 (default_load_returned_bounds): Delete.
1187 (default_store_returned_bounds): Delete.
1189 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
1191 * doc/extend.texi (Basic Asm): Clarify that asm is not an
1193 * doc/invoke.texi (-fno-asm): Fix description for C++.
1195 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
1197 * doc/install.texi: Remove link to old.html
1199 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1201 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
1203 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1205 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
1207 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1209 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
1211 * config/aarch64/constraints.md (D1): New.
1213 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1215 * config/aarch64/aarch64-simd.md
1216 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
1217 (*aarch64_topbits_shuffle<mode>_le): New.
1218 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
1219 (*aarch64_topbits_shuffle<mode>_be): New.
1220 * config/aarch64/predicates.md
1221 (aarch64_simd_shift_imm_vec_exact_top): New.
1223 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1225 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
1226 *aarch64_<srn_op>shrn<mode>2_vect_le,
1227 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
1228 * config/aarch64/iterators.md (srn_op): New.
1230 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
1232 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
1234 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
1235 ctx->outer instead of ctx.
1237 2021-10-20 Martin Liska <mliska@suse.cz>
1240 2021-10-19 Martin Liska <mliska@suse.cz>
1243 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
1244 * system.h (strip_whilespaces): New function.
1246 2021-10-20 Martin Liska <mliska@suse.cz>
1249 2021-10-19 Martin Liska <mliska@suse.cz>
1252 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
1255 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
1257 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
1258 Change to 8 with -Os, 11 otherwise.
1260 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
1262 * config/aarch64/aarch64.c (neoversev1_tunings):
1263 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
1264 (neoversen2_tunings): Likewise.
1266 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
1269 * config/aarch64/aarch64.md (frint_pattern): Update comment.
1270 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
1271 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
1272 * config/aarch64/arm_neon.h: Likewise.
1273 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
1275 2021-10-20 Martin Liska <mliska@suse.cz>
1277 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
1278 already declared global variable.
1279 (arm_unwind_emit_set): Use out_file as function argument.
1280 (arm_unwind_emit): Likewise.
1281 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
1282 (machopic_output_stub_indirection): Likewise.
1283 (machopic_output_indirection): Likewise.
1284 (machopic_finish): Likewise.
1285 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
1286 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
1287 * config/ia64/ia64.c (process_epilogue): Likewise.
1288 (process_cfa_adjust_cfa): Likewise.
1289 (process_cfa_register): Likewise.
1290 (process_cfa_offset): Likewise.
1291 (ia64_asm_unwind_emit): Likewise.
1292 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
1294 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
1296 * config/aarch64/aarch64-builtins.c
1297 (aarch64_general_gimple_fold_builtin):
1298 lower vld1 and vst1 variants of the neon builtins
1299 * config/aarch64/aarch64-protos.h:
1300 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
1301 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
1304 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
1306 * match.pd: Generate IFN_TRUNC.
1308 2021-10-20 Richard Biener <rguenther@suse.de>
1310 PR tree-optimization/102853
1311 * tree-data-ref.c (split_constant_offset_1): Bail out
1312 immediately if the expression traps on overflow.
1314 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1316 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
1318 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1320 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
1321 Remove superflous debugging message.
1322 (back_jt_path_registry::duplicate_thread_path): Same.
1324 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1326 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
1328 (back_threader_registry::register_path): Remove m_threaded_paths.
1330 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1332 PR tree-optimization/102814
1333 * doc/invoke.texi: Document --param=max-fsm-thread-length.
1334 * params.opt: Add --param=max-fsm-thread-length.
1335 * tree-ssa-threadbackward.c
1336 (back_threader_profitability::profitable_path_p): Fail on paths
1337 longer than max-fsm-thread-length.
1339 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
1341 PR middle-end/102764
1342 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
1343 statement to reset the current location for the outgoing edges.
1345 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1346 Richard Biener <rguenther@suse.de>
1348 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
1349 on the same line as the threading cancellation.
1350 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
1351 Avoid threading through loop headers where the path remains in the
1354 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
1356 * tree-object-size.c (unknown): Make into a function. Adjust
1358 (unknown_object_size): Simplify implementation.
1360 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
1361 Kewen Lin <linkw@linux.ibm.com>
1363 * doc/sourcebuild.texi (Effective-Target Keywords): Document
1364 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
1365 vect_slp_v16qi_store, vect_slp_v2hi_store,
1366 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
1368 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
1370 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
1372 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
1376 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
1377 * config/rs6000/pmmintrin.h: Same.
1378 * config/rs6000/smmintrin.h: Same.
1379 * config/rs6000/tmmintrin.h: Same.
1381 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
1383 * config.gcc (extra_headers): Add nmmintrin.h.
1385 2021-10-19 Richard Biener <rguenther@suse.de>
1387 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
1388 misalignment parameter.
1389 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
1390 Do not change DR_MISALIGNMENT in place, instead pass the
1391 adjusted misalignment to vect_supportable_dr_alignment.
1392 (vect_peeling_supportable): Likewise.
1393 (vect_peeling_hash_get_lowest_cost): Adjust.
1394 (vect_enhance_data_refs_alignment): Likewise.
1395 (vect_vfa_access_size): Likewise.
1396 (vect_supportable_dr_alignment): Add misalignment
1397 parameter and simplify.
1398 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
1399 (get_group_load_store_type): Likewise.
1400 (get_load_store_type): Likewise.
1402 2021-10-19 Clément Chigot <clement.chigot@atos.net>
1404 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
1405 __tls_get_addr reference to .text csect.
1407 2021-10-19 Martin Liska <mliska@suse.cz>
1410 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
1413 2021-10-19 Richard Biener <rguenther@suse.de>
1415 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
1416 (vect_get_load_cost): Likewise.
1417 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
1418 alignment support scheme and misalignment as arguments
1420 (vect_get_peeling_costs_all_drs): Compute that info here
1421 and note that we shouldn't need to.
1422 * tree-vect-stmts.c (vect_model_store_cost): Get
1423 alignment support scheme and misalignment as arguments.
1424 (vect_get_store_cost): Likewise.
1425 (vect_model_load_cost): Likewise.
1426 (vect_get_load_cost): Likewise.
1427 (vectorizable_store): Pass down alignment support scheme
1428 and misalignment to costing.
1429 (vectorizable_load): Likewise.
1431 2021-10-19 Richard Biener <rguenther@suse.de>
1433 * tree-vect-stmts.c (get_negative_load_store_type): Add
1434 offset output parameter and initialize it.
1435 (get_group_load_store_type): Likewise.
1436 (get_load_store_type): Likewise.
1437 (vectorizable_store): Use offset as computed by
1438 get_load_store_type.
1439 (vectorizable_load): Likewise.
1441 2021-10-19 Richard Biener <rguenther@suse.de>
1443 PR tree-optimization/102827
1444 * tree-if-conv.c (predicate_statements): Add pe parameter
1445 and use that edge to insert invariant stmts on.
1446 (combine_blocks): Pass through pe.
1447 (tree_if_conversion): Compute the edge to insert invariant
1448 stmts on and pass it along.
1450 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
1453 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
1454 sssubaddv2hi3): Swap the order of operators in vec_concat.
1456 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
1458 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
1459 (altivec_vmrghb_direct): New.
1460 (*altivec_vmrghh_internal): Delete.
1461 (altivec_vmrghh_direct): New.
1462 (*altivec_vmrghw_internal): Delete.
1463 (altivec_vmrghw_direct_<mode>): New.
1464 (altivec_vmrghw_direct): Delete.
1465 (*altivec_vmrglb_internal): Delete.
1466 (altivec_vmrglb_direct): New.
1467 (*altivec_vmrglh_internal): Delete.
1468 (altivec_vmrglh_direct): New.
1469 (*altivec_vmrglw_internal): Delete.
1470 (altivec_vmrglw_direct_<mode>): New.
1471 (altivec_vmrglw_direct): Delete.
1472 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
1473 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
1475 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
1476 (vsx_xxmrglw_<mode>): Adjust.
1478 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
1480 * passes.def: Change threading comment before pass_ccp pass.
1482 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
1484 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
1485 Modify the expansion for sign extension. All extensions are done
1486 within VSX registers.
1488 2021-10-19 Richard Biener <rguenther@suse.de>
1490 * tree-vect-stmts.c (get_group_load_store_type): Add
1491 misalignment output parameter and initialize it.
1492 (get_group_load_store_type): Likewise.
1493 (vectorizable_store): Remove now redundant queries.
1494 (vectorizable_load): Likewise.
1496 2021-10-19 Richard Biener <rguenther@suse.de>
1498 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
1499 check_aligned argument.
1500 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
1502 (vect_peeling_hash_insert): Add supportable_if_not_aligned
1503 argument and do not call vect_supportable_dr_alignment here.
1504 (vect_peeling_supportable): Adjust.
1505 (vect_enhance_data_refs_alignment): Compute whether the
1506 access is supported with different alignment here and
1507 pass that down to vect_peeling_hash_insert.
1508 (vect_vfa_access_size): Adjust.
1509 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
1510 (vect_get_load_cost): Likewise.
1511 (get_negative_load_store_type): Likewise.
1512 (get_group_load_store_type): Likewise.
1513 (get_load_store_type): Likewise.
1515 2021-10-19 Martin Liska <mliska@suse.cz>
1518 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
1519 * system.h (strip_whilespaces): New function.
1521 2021-10-19 dianhong xu <dianhong.xu@intel.com>
1523 * config/i386/avx512fp16intrin.h:
1524 (_mm512_set1_pch): New intrinsic.
1525 * config/i386/avx512fp16vlintrin.h:
1526 (_mm256_set1_pch): New intrinsic.
1527 (_mm_set1_pch): Ditto.
1529 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
1531 PR tree-optimization/102796
1532 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
1533 normally. Return get_tree_range for non gimple_range_ssa_p names.
1534 (gimple_ranger::range_of_stmt): Use get_tree_range for non
1535 gimple_range_ssa_p names.
1537 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
1540 * config/i386/i386.c (ix86_print_operand_address):
1541 Error out for non-address_operand asm operands.
1543 2021-10-18 Richard Biener <rguenther@suse.de>
1545 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
1546 not auto-convert dr_alignment_support to bool.
1547 (vect_peeling_supportable): Likewise.
1548 (vect_enhance_data_refs_alignment): Likewise.
1549 (vect_supportable_dr_alignment): Commonize read/write case.
1550 * tree-vect-stmts.c (vect_get_store_cost): Use
1551 dr_alignment_support, not int, for the vect_supportable_dr_alignment
1553 (vect_get_load_cost): Likewise.
1555 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
1557 * tree-object-size.c (object_sizes_execute): Consolidate LHS
1558 null check and do it early.
1560 2021-10-18 Richard Biener <rguenther@suse.de>
1562 * tree-vect-stmts.c (vectorizable_store): Use the
1563 computed alignment scheme instead of querying
1566 2021-10-18 Richard Biener <rguenther@suse.de>
1568 * tree-vect-stmts.c (vectorizable_store): Do not recompute
1569 alignment scheme already determined by get_load_store_type.
1571 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
1573 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
1574 Clone corresponding pass.
1576 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
1578 * combine.c (recog_for_combine): For an unrecognized move/set of
1579 a constant, try force_const_mem to place it in the constant pool.
1580 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
1581 Delete global variables (that are no longer assigned a cost value).
1582 (cse_insn): Simplify logic for deciding whether to place a folded
1583 constant in the constant pool using force_const_mem.
1584 (cse_main): Remove zero initialization of constant_pool_entries_cost
1585 and constant_pool_entries_regcost.
1586 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
1587 fractionally more expensive, when optimizing for speed.
1589 2021-10-18 Martin Liska <mliska@suse.cz>
1591 PR gcov-profile/102746
1592 PR gcov-profile/102747
1593 * gcov.c (main): Return return_code.
1594 (output_gcov_file): Mark return_code when error happens.
1595 (generate_results): Likewise.
1596 (read_graph_file): Likewise.
1597 (read_count_file): Likewise.
1599 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
1601 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
1602 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
1603 a popcount, allowing compile-time evaluation/simplification.
1604 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
1606 2021-10-18 Richard Biener <rguenther@suse.de>
1608 PR tree-optimization/102788
1609 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
1611 (vect_pattern_recog_1): Likewise.
1612 (vect_recog_bool_pattern): Continue matching the pattern
1613 even if we do not have a vector type for a conversion
1616 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
1618 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
1619 Evalute SS_NEG and SS_ABS of a constant argument.
1621 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
1624 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
1625 * internal-fn.def: Add entry for COND_NEG internal function.
1626 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
1627 NEGATE_EXPR, COND_NEG mapping.
1628 * optabs.def: Add entry for cond_neg_optab.
1629 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
1630 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
1631 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
1633 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
1635 * tree-ssa-strlen.c (class strlen_pass): Rename from
1637 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
1638 (handle_assign): Same.
1639 (adjust_last_stmt): Same.
1640 (maybe_warn_overflow): Same.
1641 (handle_builtin_strlen): Same.
1642 (handle_builtin_strchr): Same.
1643 (handle_builtin_strcpy): Same.
1644 (handle_builtin_strncat): Same.
1645 (handle_builtin_stxncpy_strncat): Same.
1646 (handle_builtin_memcpy): Same.
1647 (handle_builtin_strcat): Same.
1648 (handle_alloc_call): Same.
1649 (handle_builtin_memset): Same.
1650 (handle_builtin_memcmp): Same.
1651 (get_len_or_size): Same.
1652 (strxcmp_eqz_result): Same.
1653 (handle_builtin_string_cmp): Same.
1654 (handle_pointer_plus): Same.
1655 (count_nonzero_bytes_addr): Same.
1656 (count_nonzero_bytes): Same.
1657 (handle_store): Same.
1658 (strlen_check_and_optimize_call): Same.
1659 (handle_integral_assign): Same.
1660 (check_and_optimize_stmt): Same.
1661 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
1663 2021-10-18 Richard Biener <rguenther@suse.de>
1665 PR tree-optimization/102798
1666 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
1667 Only copy points-to info to newly generated SSA names.
1669 2021-10-18 Martin Liska <mliska@suse.cz>
1671 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
1672 * gcov.c (get_cycles_count): Likewise.
1673 * lto-compress.c (lto_compression_zlib): Likewise.
1674 (lto_uncompression_zlib): Likewise.
1675 * targhooks.c (default_pch_valid_p): Likewise.
1677 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
1679 * tree-pass.h (make_pass_thread_jumps_full): New.
1680 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
1681 (try_thread_blocks): Add resolve and speed arguments.
1682 (pass_thread_jumps::execute): Inline.
1683 (do_early_thread_jumps): New.
1684 (do_thread_jumps): New.
1685 (make_pass_thread_jumps): Move.
1686 (pass_early_thread_jumps::gate): Inline.
1687 (pass_early_thread_jumps::execute): Inline.
1688 (class pass_thread_jumps_full): New.
1690 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
1692 * configure.ac: Treat powerpc64*-*-freebsd* the same as
1693 powerpc64-*-freebsd*.
1694 * configure: Regenerate.
1696 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
1698 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
1699 set_ptr_nonull to set_ptr_nonnull in comments.
1701 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
1703 PR tree-optimization/102720
1704 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
1705 of call used and clobbered sets.
1707 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
1709 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
1710 skip gomp and itm when r or nodefaultlibs is given.
1711 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
1712 Update ordering of exclusions, remove duplicate 'v' addition
1713 (collect2 will add this from the main command line).
1715 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
1717 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
1718 filelist and framework options in specs instead of code.
1719 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
1720 specs that are really driver ones.
1721 (DARWIN_CC1_SPEC): Likewise.
1722 (CPP_SPEC): Likewise.
1723 (SYSROOT_SPEC): Append space.
1724 (LINK_SYSROOT_SPEC): Remove most driver link specs.
1725 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
1726 (STARTFILE_SPEC): Likewise.
1727 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
1728 (ASM_SPEC): Update driver-related specs.
1729 (ASM_FINAL_SPEC): Likewise.
1730 * config/darwin.opt: Remove now unused option aliases.
1731 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
1732 (ASM_SPEC): Update driver-related specs.
1734 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
1736 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
1737 allow standard_sse_constant_p immediate constants.
1739 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
1741 * config.gcc: Add tm-dwarf2.h to tm_d-file.
1743 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
1745 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
1746 that occur in abnormal phis.
1747 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
1748 abnormal and EH edges.
1749 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
1750 (rvrp_folder::value_on_edge): Ditto.
1751 (rvrp_folder::value_of_stmt): Ditto.
1752 (hybrid_folder::value_of_expr): Ditto for ranger queries.
1753 (hybrid_folder::value_on_edge): Ditto.
1754 (hybrid_folder::value_of_stmt): Ditto.
1755 * value-query.cc (gimple_range_global): Always return a range if
1756 the type is supported.
1758 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
1760 * config/pa/pa.md: Consistently use "rG" constraint for copy
1761 instruction in move patterns.
1763 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
1765 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
1768 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
1769 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
1770 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
1771 (ELFABIVERSION_AMDGPU_HSA_V4): New.
1772 (SET_SRAM_ECC_UNSUPPORTED): New.
1773 (copy_early_debug_info): Create elf flags to match the other objects.
1774 (main): Just let the attribute flags pass through.
1776 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
1778 * tree-loop-distribution.c (reduction_var_overflows_first):
1779 Pass the type of reduction_var as first argument as it is also
1780 done for the load type.
1781 (loop_distribution::transform_reduction_loop): Add missing
1782 TREE_TYPE while determining precission of reduction_var.
1784 2021-10-15 Richard Biener <rguenther@suse.de>
1786 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
1788 * toplev.c (process_options): Warn when STABS debugging is
1789 enabled but not the preferred format.
1790 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
1792 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
1794 2021-10-15 Richard Biener <rguenther@suse.de>
1797 * tree-inline.c (copy_bb): Avoid underflowing nargs.
1799 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
1801 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
1802 HFmode input operand to HImode.
1803 (ix86_vectorize_vec_perm_const): Likewise.
1804 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
1806 (*avx512f_permvar_truncv8siv8hi_1_hf):
1809 2021-10-15 Richard Biener <rguenther@suse.de>
1811 PR middle-end/102682
1812 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
1813 not create a paradoxical subreg.
1815 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
1817 * config/i386/i386-expand.c (ix86_expand_vector_init):
1818 For half_vector concat for HFmode, handle them like HImode.
1820 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1822 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
1823 (back_threader::back_threader): Same.
1824 (back_threader::resolve_phi): Try to solve without looking back if
1826 (back_threader::find_paths_to_names): Same.
1827 (try_thread_blocks): Pass resolve argument to back threader.
1828 (pass_early_thread_jumps::execute): Same.
1830 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1832 * doc/invoke.texi: Remove max-fsm-thread-length,
1833 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
1835 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
1837 (back_threader_registry::back_threader_registry): Same.
1838 (back_threader_profitability::profitable_path_p): Remove
1839 param_max_fsm_thread-length.
1840 (back_threader_registry::register_path): Remove
1841 m_max_allowable_paths.
1843 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1845 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
1847 (back_threader::~back_threader): Do not release m_path.
1849 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
1851 PR tree-optimization/102738
1852 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
1854 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
1856 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
1857 (omp_mark_declare_variant): Move from c-omp.c.
1858 (omp_context_name_list_prop): Update for Fortran strings.
1859 * omp-general.h (omp_check_context_selector): New prototype.
1860 (omp_mark_declare_variant): New prototype.
1862 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
1864 * config/rs6000/rs6000.c (rs6000_density_test): Move early
1865 exit test further up the function.
1867 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
1869 * config/arm/arm.c (arm_add_stmt_cost): Delete.
1870 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
1872 2021-10-14 Martin Jambor <mjambor@suse.cz>
1874 * doc/invoke.texi (Optimize Options): Add entry for
1875 ipa-cp-recursive-freq-factor.
1877 2021-10-14 Tamar Christina <tamar.christina@arm.com>
1879 * match.pd: New rule.
1881 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
1884 * ipa-modref-tree.h (modref_access_node::update2):
1885 Also check that parm_offset is unchanged.
1886 (modref_ref_node::insert_access): Fix updating of
1889 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1891 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
1894 2021-10-14 Richard Biener <rguenther@suse.de>
1896 PR tree-optimization/102659
1897 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
1898 rewrite pointer typed undefined overflow operations.
1899 (predicate_statements): Likewise. Make sure to emit invariant
1900 conversions in the preheader.
1901 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
1902 non-empty preheaders.
1903 * tree-data-ref.c (dr_analyze_indices): Strip useless
1904 conversions to the MEM_REF base type.
1906 2021-10-14 Martin Liska <mliska@suse.cz>
1908 * common.opt: Stop using AUTODETECT_VALUE
1909 and use EnabledBy where possible.
1910 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
1911 * toplev.c (AUTODETECT_VALUE): Remove macro.
1912 (process_options): Simplify by using EnabledBy and
1913 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
1916 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
1918 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
1920 (vld1_s32_x3): Likewise.
1922 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1924 PR tree-optimization/102736
1925 PR tree-optimization/102736
1926 * gimple-range-path.cc (path_range_query::range_on_path_entry):
1927 Assert that the requested range is defined outside the path.
1928 (path_range_query::ssa_range_in_phi): Do not call
1929 range_on_path_entry for SSA names that are defined within the
1932 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
1934 * config/darwin-driver.c (darwin_find_version_from_kernel):
1935 Quote internal identifiers and avoid contractions in
1937 (darwin_default_min_version): Likewise.
1938 (darwin_driver_init): Likewise.
1940 2021-10-14 Martin Jambor <mjambor@suse.cz>
1942 * params.opt (ipa-cp-recursive-freq-factor): New.
1943 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
1944 scc_no, self_recursion_generated_level, same_scc and
1945 self_recursion_generated_p.
1946 (ipcp_lattice::add_value): Replaced parameter unlimited with
1947 same_lat_gen_level, usit it determine limit of values and store it to
1949 (ipcp_lattice<valtype>::print): Dump the new fileds.
1950 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
1951 parameter and store it to the new value.
1952 (self_recursively_generated_p): Removed.
1953 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
1954 instead of self_recursively_generated_p, store self generation level
1956 (value_topo_info<valtype>::add_val): Set scc_no.
1957 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
1958 recursively feeding values and self generated values by appropriate
1961 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
1963 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
1964 redundant test for flag_vect_cost_model.
1966 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1968 * bitmap.c (debug): New overloaded function for auto_bitmaps.
1969 * bitmap.h (debug): Same.
1971 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
1973 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
1974 Adjust builtin call.
1975 (_mm512_mask3_fcmadd_pch): Likewise.
1976 (_mm512_mask_fmadd_pch): Likewise
1977 (_mm512_mask3_fmadd_pch): Likewise
1978 (_mm512_mask_fcmadd_round_pch): Likewise
1979 (_mm512_mask3_fcmadd_round_pch): Likewise
1980 (_mm512_mask_fmadd_round_pch): Likewise
1981 (_mm512_mask3_fmadd_round_pch): Likewise
1982 (_mm_mask_fcmadd_sch): Likewise
1983 (_mm_mask3_fcmadd_sch): Likewise
1984 (_mm_mask_fmadd_sch): Likewise
1985 (_mm_mask3_fmadd_sch): Likewise
1986 (_mm_mask_fcmadd_round_sch): Likewise
1987 (_mm_mask3_fcmadd_round_sch): Likewise
1988 (_mm_mask_fmadd_round_sch): Likewise
1989 (_mm_mask3_fmadd_round_sch): Likewise
1990 (_mm_fcmadd_round_sch): Likewise
1991 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
1992 Adjust builtin call.
1993 (_mm_mask3_fmadd_pch): Likewise
1994 (_mm256_mask_fmadd_pch): Likewise
1995 (_mm256_mask3_fmadd_pch): Likewise
1996 (_mm_mask_fcmadd_pch): Likewise
1997 (_mm_mask3_fcmadd_pch): Likewise
1998 (_mm256_mask_fcmadd_pch): Likewise
1999 (_mm256_mask3_fcmadd_pch): Likewise
2000 * config/i386/i386-builtin.def: Add mask3 builtin for complex
2001 fma, and adjust mask_builtin to corresponding expander.
2002 * config/i386/i386-expand.c (ix86_expand_round_builtin):
2003 Skip eraseing embedded rounding for expanders that emits
2005 * config/i386/sse.md (complexmove): New mode_attr.
2006 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
2007 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
2008 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
2009 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
2010 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
2011 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
2012 * config/i386/subst.md (round_embedded_complex): New subst.
2014 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
2016 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
2017 (cbranchdf4): Likewise.
2018 Add missing move patterns for TARGET_SOFT_FLOAT.
2020 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
2022 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
2023 pseudo intermediate when moving a SUBREG into a hard register,
2024 by checking ix86_hardreg_mov_ok.
2025 (ix86_expand_vector_extract): Store zero-extended SImode
2026 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
2028 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
2029 complex (SUBREG) sets of (vector) hard registers before reload, by
2030 checking ix86_hardreg_mov_ok.
2032 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
2034 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
2036 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
2038 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
2040 2021-10-13 Martin Sebor <msebor@redhat.com>
2042 PR middle-end/102630
2043 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
2045 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
2047 * collect2.c (is_lto_object_file): Release simple-object
2048 resources, close files.
2050 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
2052 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
2053 when checking mve addressing modes.
2054 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
2056 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
2057 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
2058 there is a single Ux constraint.
2060 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
2062 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
2063 multiplication on 32-bit PA 1.1 and 2.0 targets.
2065 2021-10-13 Martin Liska <mliska@suse.cz>
2067 PR gcov-profile/90364
2068 * coverage.c (build_info): Emit checksum to the global variable.
2069 (build_info_type): Add new field for checksum.
2070 (coverage_obj_finish): Pass object_checksum.
2071 (coverage_init): Use 0 as checksum for .gcno files.
2072 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
2073 * gcov.c (read_graph_file): Read also checksum.
2074 * doc/invoke.texi: Document the behaviour change.
2076 2021-10-13 Richard Biener <rguenther@suse.de>
2078 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
2079 start at integer value 2.
2080 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
2081 the iterator for GSI_LAST_NEW_STMT.
2082 (gsi_insert_seq_nodes_after): Likewise.
2083 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
2084 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
2085 arguments to gsi_replace.
2087 2021-10-13 Martin Liska <mliska@suse.cz>
2090 * common.opt: Use EnabledBy instead of detection in
2091 finish_options and process_options.
2092 * opts.c (finish_options): Remove handling of
2093 x_flag_unroll_all_loops.
2094 * toplev.c (process_options): Likewise for flag_web and
2095 flag_rename_registers.
2097 2021-10-13 Richard Biener <rguenther@suse.de>
2099 PR tree-optimization/102659
2100 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
2101 (if_convertible_gimple_assign_stmt_p): Mark the loop for
2102 rewrite when stmts with undefined behavior on integer
2104 (combine_blocks): Predicate also when we need to rewrite stmts.
2105 (predicate_statements): Rewrite affected stmts to something
2106 with well-defined behavior on overflow.
2107 (tree_if_conversion): Initialize need_to_rewrite_undefined.
2109 2021-10-13 Richard Biener <rguenther@suse.de>
2112 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
2115 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
2117 * dwarf2ctf.c: Fix typo in comment.
2119 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
2122 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
2123 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
2124 avoid generating an automatic #define of vec_cpsgn. Use the
2125 correct built-in for V4SFmode that doesn't depend on VSX.
2127 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
2131 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
2132 define_insn_and_split pattern. Add alternative 1 and split it
2133 post reload to insert operand 1 into the low part of operand 0.
2134 (*sub<mode>_1_slp): Ditto.
2135 (*and<mode>_1_slp): Ditto.
2136 (*<any_or:code><mode>_1_slp): Ditto.
2137 (*ashl<mode>3_1_slp): Ditto.
2138 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
2139 (*<any_rotate:insn><mode>3_1_slp): Ditto.
2140 (*neg<mode>_1_slp): New insn_and_split pattern.
2141 (*one_cmpl<mode>_1_slp): Ditto.
2143 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
2145 * doc/install.texi: Update MinGW and mingw-64 Binaries
2148 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
2150 * doc/extend.texi (Common Variable Attributes): Fix typos in
2151 alloc_size documentation.
2153 2021-10-12 Richard Biener <rguenther@suse.de>
2155 PR tree-optimization/102696
2156 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
2157 the tree fatally failed when we reject a BIT_FIELD_REF.
2159 2021-10-12 Richard Biener <rguenther@suse.de>
2161 PR tree-optimization/102572
2162 * tree-vect-stmts.c (vect_build_gather_load_calls): When
2163 gathering the vectorized defs for the mask pass in the
2164 desired mask vector type so invariants will be handled
2167 2021-10-12 Tamar Christina <tamar.christina@arm.com>
2169 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
2170 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
2171 *fcmuo<mode>_nor_combine): New.
2173 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
2176 * config/sparc/sparc-modes.def (OI): New integer mode.
2178 2021-10-12 Jakub Jelinek <jakub@redhat.com>
2180 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
2181 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
2184 2021-10-12 Jakub Jelinek <jakub@redhat.com>
2186 * tree-vectorizer.h (loop_cost_model): New function.
2187 (unlimited_cost_model): Use it.
2188 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
2189 call instead of flag_vect_cost_model.
2190 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
2191 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
2192 of flag_simd_cost_model.
2194 2021-10-12 liuhongt <hongtao.liu@intel.com>
2197 * config/i386/i386-expand.c (emit_reduc_half): Handle
2199 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
2200 (reduc_plus_scal_v4qi): Ditto.
2202 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
2204 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
2205 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
2206 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
2208 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
2210 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
2211 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
2212 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
2213 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
2214 _mm_cvtepu32_epi64): New.
2216 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
2218 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
2219 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
2221 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
2223 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
2224 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
2225 _mm_max_epi32, _mm_max_epu32): New.
2227 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
2229 * ipa-modref-tree.h (struct modref_access_node): Revert
2231 (struct modref_ref_node): Likewise.
2233 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
2235 * ipa-modref-tree.h (modref_tree::global_access_p): New member
2238 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
2239 ignore_stores_eaf_flags): Move to ipa-modref.h
2240 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
2241 (modref_summary::global_memory_read_p): New member function.
2242 (modref_summary::global_memory_written_p): New member function.
2243 * ipa-modref.h (modref_summary::global_memory_read_p,
2244 modref_summary::global_memory_written_p): Declare.
2245 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
2246 ignore_stores_eaf_flags): move here.
2247 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
2249 (handle_rhs_call): Rewrite.
2250 (handle_call_arg): New function.
2251 (determine_global_memory_access): New function.
2252 (handle_const_call): Remove
2253 (handle_pure_call): Remove
2254 (find_func_aliases_for_call): Update use of handle_rhs_call.
2255 (compute_points_to_sets): Handle global memory acccesses
2258 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
2260 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
2261 documentation, mention UBSAN_OPTIONS, similar to what is done
2262 for AddressSanitizer.
2264 2021-10-11 Richard Biener <rguenther@suse.de>
2266 PR middle-end/102683
2267 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
2268 availability before building an integer type for storage
2271 2021-10-11 Richard Biener <rguenther@suse.de>
2273 PR middle-end/101480
2274 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
2277 2021-10-11 Martin Liska <mliska@suse.cz>
2279 * common.opt: Remove Init(2) for some options.
2280 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
2281 use rather OPTION_SET_P.
2283 2021-10-11 Martin Liska <mliska@suse.cz>
2285 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
2286 * flag-types.h (enum ira_region): Likewise.
2287 * toplev.c (process_options): Use OPTION_SET_P instead of
2288 IRA_REGION_AUTODETECT.
2290 2021-10-11 Jakub Jelinek <jakub@redhat.com>
2292 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
2293 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
2295 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
2297 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
2298 * config/s390/s390.c (s390_rawmemchr): New function.
2299 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
2300 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
2301 the pattern vfees<mode> from vx-builtins.md.
2302 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
2304 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
2306 * builtins.c (get_memory_rtx): Change to external linkage.
2307 * builtins.h (get_memory_rtx): Add function prototype.
2308 * doc/md.texi (rawmemchr<mode>): Document.
2309 * internal-fn.c (expand_RAWMEMCHR): Define.
2310 * internal-fn.def (RAWMEMCHR): Add.
2311 * optabs.def (rawmemchr_optab): Add.
2312 * tree-loop-distribution.c (find_single_drs): Change return code
2313 behaviour by also returning true if no single store was found
2315 (loop_distribution::classify_partition): Respect the new return
2316 code behaviour of function find_single_drs.
2317 (loop_distribution::execute): Call new function
2318 transform_reduction_loop in order to replace rawmemchr or strlen
2319 like loops by calls into builtins.
2320 (generate_reduction_builtin_1): New function.
2321 (generate_rawmemchr_builtin): New function.
2322 (generate_strlen_builtin_1): New function.
2323 (generate_strlen_builtin): New function.
2324 (generate_strlen_builtin_using_rawmemchr): New function.
2325 (reduction_var_overflows_first): New function.
2326 (determine_reduction_stmt_1): New function.
2327 (determine_reduction_stmt): New function.
2328 (loop_distribution::transform_reduction_loop): New function.
2330 2021-10-11 Martin Liska <mliska@suse.cz>
2332 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
2333 and remove legacy hashing code.
2335 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
2338 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
2339 CONST_INT for BEGIN and END, and use gcc_assert rather than
2342 2021-10-10 Jakub Jelinek <jakub@redhat.com>
2345 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
2346 use MO_VAL_SET if loc is not sp.
2348 2021-10-10 Andrew Pinski <apinski@marvell.com>
2350 PR tree-optimization/102622
2351 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
2352 Swap the order of a?0:pow2cst and a?0:-1 transformations.
2354 2021-10-09 liuhongt <hongtao.liu@intel.com>
2357 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
2359 (ix86_use_mask_cmp_p): Ditto.
2360 (ix86_expand_sse_movcc): Ditto.
2361 * config/i386/i386.md (setcc_hf_mask): New define_insn.
2362 (movhf_mask): Ditto.
2363 (UNSPEC_MOVCC_MASK): New unspec.
2364 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
2366 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
2368 PR rtl-optimization/102627
2369 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
2371 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
2373 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
2376 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
2378 * value-range.cc (irange::debug): New.
2379 * value-range.h (irange::debug): New.
2381 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
2383 PR tree-optimization/102385
2384 * predict.h (change_edge_frequency): Declare.
2385 * predict.c (change_edge_frequency): New function.
2386 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
2388 (tree_unroll_loop): Likewise.
2389 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
2390 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
2392 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
2393 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
2394 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
2395 to retrieve the exit edges. Make all the old profile update code
2396 conditional on !single_loop_p -- the case it was written for --
2397 and use a different approach for the single-loop case.
2399 2021-10-08 Martin Liska <mliska@suse.cz>
2401 * config/alpha/alpha.c (alpha_option_override): Use new macro
2403 * config/arc/arc.c (arc_override_options): Likewise.
2404 * config/arm/arm.c (arm_option_override): Likewise.
2405 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
2406 * config/c6x/c6x.c (c6x_option_override): Likewise.
2407 * config/csky/csky.c: Likewise.
2408 * config/darwin.c (darwin_override_options): Likewise.
2409 * config/frv/frv.c (frv_option_override): Likewise.
2410 * config/i386/djgpp.h: Likewise.
2411 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
2412 (ix86_max_noce_ifcvt_seq_cost): Likewise.
2413 * config/ia64/ia64.c (ia64_option_override): Likewise.
2414 (ia64_override_options_after_change): Likewise.
2415 * config/m32c/m32c.c (m32c_option_override): Likewise.
2416 * config/m32r/m32r.c (m32r_init): Likewise.
2417 * config/m68k/m68k.c (m68k_option_override): Likewise.
2418 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
2419 * config/mips/mips.c (mips_option_override): Likewise.
2420 * config/nios2/nios2.c (nios2_option_override): Likewise.
2421 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
2422 * config/pa/pa.c (pa_option_override): Likewise.
2423 * config/riscv/riscv.c (riscv_option_override): Likewise.
2424 * config/rs6000/aix71.h: Likewise.
2425 * config/rs6000/aix72.h: Likewise.
2426 * config/rs6000/aix73.h: Likewise.
2427 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
2428 (rs6000_override_options_after_change): Likewise.
2429 (rs6000_linux64_override_options): Likewise.
2430 (glibc_supports_ieee_128bit): Likewise.
2431 (rs6000_option_override_internal): Likewise.
2432 (rs6000_file_start): Likewise.
2433 (rs6000_darwin_file_start): Likewise.
2434 * config/rs6000/rtems.h: Likewise.
2435 * config/rs6000/sysv4.h: Likewise.
2436 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
2437 * config/s390/s390.c (s390_option_override): Likewise.
2438 * config/sh/linux.h: Likewise.
2439 * config/sh/netbsd-elf.h (while): Likewise.
2440 * config/sh/sh.c (sh_option_override): Likewise.
2441 * config/sol2.c (solaris_override_options): Likewise.
2442 * config/sparc/sparc.c (sparc_option_override): Likewise.
2443 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
2444 * config/visium/visium.c (visium_option_override): Likewise.
2445 * config/vxworks.c (vxworks_override_options): Likewise.
2446 * lto-opts.c (lto_write_options): Likewise.
2447 * omp-expand.c (expand_omp_simd): Likewise.
2448 * omp-general.c (omp_max_vf): Likewise.
2449 * omp-offload.c (oacc_xform_loop): Likewise.
2450 * opts.h (OPTION_SET_P): Likewise.
2451 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
2452 * toplev.c (process_options): Likewise.
2453 * tree-predcom.c: Likewise.
2454 * tree-sra.c (analyze_all_variable_accesses): Likewise.
2456 2021-10-08 liuhongt <hongtao.liu@intel.com>
2459 * config/i386/i386.c (ix86_optab_supported_p):
2460 Return true for HFmode.
2461 * match.pd: Simplify (_Float16) ceil ((double) x) to
2462 __builtin_ceilf16 (a) when a is _Float16 type and
2463 direct_internal_fn_supported_p.
2465 2021-10-08 liuhongt <hongtao.liu@intel.com>
2468 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
2469 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
2470 (reduc_<code>_scal_v4hi): New.
2472 2021-10-08 liuhongt <hongtao.liu@intel.com>
2474 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
2475 * doc/invoke.texi (Options That Control Optimization): Update
2477 * opts.c (default_options_table): Enable auto-vectorization at
2478 O2 with very-cheap cost model.
2479 (finish_options): Use cheap cost model for
2480 explicit -ftree{,-loop}-vectorize.
2482 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
2484 * ctfc.c (ctfc_delete_container): Free hash table contents.
2486 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
2488 * toplev.c (process_options): Do not warn for GNU GIMPLE.
2490 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
2492 * tree-object-size.c (addr_object_size,
2493 compute_builtin_object_size): Drop PDECL and POFF arguments.
2494 (addr_object_size): Adjust calls.
2495 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
2498 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
2500 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
2501 representing signed and unsigned high-part multiplication resp.
2502 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
2503 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
2504 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
2505 SS_DIV, US_DIV]: Similar simplifications for saturating
2507 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
2508 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
2509 Implement compile-time evaluation for constant operands.
2510 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
2512 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
2513 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
2514 Mention the new smul_highpart and umul_highpart RTX codes.
2515 * doc/invoke.texi: Silence @xref "compilation" warnings.
2517 2021-10-07 Martin Jambor <mjambor@suse.cz>
2520 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
2521 case when the source reference description corresponds to a
2522 referance taken in a function src->caller is inlined to.
2524 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
2527 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
2529 (modref_access_node::try_merge_with): Add sanity check that there
2530 are no redundant entries in the list.
2532 2021-10-07 Richard Biener <rguenther@suse.de>
2534 PR tree-optimization/102608
2535 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
2538 2021-10-07 Martin Liska <mliska@suse.cz>
2540 * toplev.c (toplev::main): Make
2541 save_opt_decoded_options a pointer type
2542 * toplev.h: Likewise.
2544 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
2546 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
2547 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
2548 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
2550 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
2552 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
2553 Adjust for new -msram-ecc=any behaviour.
2554 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
2555 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
2556 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
2557 (EF_AMDGPU_XNACK_V3): ... this.
2558 (EF_AMDGPU_SRAM_ECC): Rename to ...
2559 (EF_AMDGPU_SRAM_ECC_V3): ... this.
2560 (EF_AMDGPU_FEATURE_XNACK_V4): New.
2561 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
2562 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
2563 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
2564 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
2565 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
2566 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
2567 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
2568 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
2569 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
2570 (SET_XNACK_ON): New.
2571 (SET_XNACK_OFF): New.
2573 (SET_SRAM_ECC_ON): New.
2574 (SET_SRAM_ECC_ANY): New.
2575 (SET_SRAM_ECC_OFF): New.
2576 (TEST_SRAM_ECC_ANY): New.
2577 (TEST_SRAM_ECC_ON): New.
2578 (main): Implement HSACOv4 and -msram-ecc=any.
2580 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
2582 * config.in: Regenerate.
2583 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
2587 (A_FIJI): Rename to ...
2589 (A_900): Rename to ...
2591 (A_906): Rename to ...
2593 (A_908): Rename to ...
2595 (SRAMOPT): New macro.
2596 (ASM_SPEC): Adjust xnack option usage.
2597 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
2598 * configure: Regenerate.
2599 * configure.ac: Detect LLVM assembler dialect.
2601 2021-10-07 Richard Biener <rguenther@suse.de>
2603 * tree-pretty-print.c (dump_generic_node): Do not elide
2604 printing '&' when dumping with -gimple.
2606 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
2608 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
2610 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
2611 (adjust_imagpart_expr): Ditto.
2612 * value-range.cc (irange::irange_intersect): Call new routine if
2613 RHS is a single pair.
2614 (irange::intersect): New wide_int version.
2615 * value-range.h (class irange): New prototype.
2617 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
2619 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
2620 Add parameter to limit size when recognizing switches.
2621 (gimple_outgoing_range::edge_range_p): Check size limit.
2622 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
2623 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
2624 that exceed the size limit.
2625 (gori_compute::gori_compute): Add initializer.
2626 * params.opt (evrp-switch-limit): New.
2627 * doc/invoke.texi: Update docs.
2629 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
2631 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
2632 TYPE_MAX_VALUE instead of creating new trees when possible.
2634 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
2636 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
2637 zero and non-zero more efficently.
2639 2021-10-06 Richard Biener <rguenther@suse.de>
2642 * dumpfile.h (TDF_GIMPLE_VAL): New.
2643 (dump_flag): Re-order and adjust TDF_* flags. Make
2644 the enum uint32_t. Use std::underlying_type in the
2646 (optgroup_flag): Likewise for the operator overloads.
2647 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
2648 in _Literal if TDF_GIMPLE_VAL.
2649 * gimple-pretty-print.c (dump_gimple_assign): Add
2650 TDF_GIMPLE_VAL to flags when dumping operands where only
2651 is_gimple_val are allowed.
2652 (dump_gimple_cond): Likewise.
2654 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
2656 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
2659 2021-10-05 qing zhao <qing.zhao@oracle.com>
2661 PR middle-end/102359
2662 * gimplify.c (gimplify_decl_expr): Not add initialization for an
2663 auto variable when it has been initialized by frontend.
2665 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
2667 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
2670 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
2672 * common/config/avr/avr-common.c (avr_handle_option): Mark
2673 argument as ATTRIBUTE_UNUSED.
2675 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
2677 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
2678 Undefine before redefinition.
2680 2021-10-05 Richard Biener <rguenther@suse.de>
2682 * toplev.c (no_backend): Remove global var.
2683 (process_options): Pass in no_backend, move post_options
2684 langhook call to toplev::main.
2685 (do_compile): Pass in no_backend, move process_options call
2687 (toplev::run_self_tests): Check no_backend at the caller.
2688 (toplev::main): Call post_options and process_options
2689 split out from do_compile, do self-tests only if
2690 no_backend is initialized.
2692 2021-10-05 Richard Biener <rguenther@suse.de>
2694 * tree-cfg.c (dump_function_to_file): Dump the UID of the
2695 function as part of the name when requested.
2696 * tree-pretty-print.c (dump_function_name): Dump the UID when
2697 requested and the langhook produced the actual name.
2699 2021-10-05 Richard Biener <rguenther@suse.de>
2701 PR middle-end/102587
2702 PR middle-end/102285
2703 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
2704 zero-initialization as last resort, use the constant
2705 size as given by the DEFERRED_INIT argument to build
2708 2021-10-04 Marek Polacek <polacek@redhat.com>
2711 * doc/invoke.texi: Document -Warray-compare.
2713 2021-10-04 Richard Biener <rguenther@suse.de>
2715 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
2716 variables are not to be initialized.
2718 2021-10-04 Richard Biener <rguenther@suse.de>
2720 * expr.h (non_mem_decl_p): Declare.
2721 (mem_ref_refers_to_non_mem_p): Likewise.
2722 * expr.c (non_mem_decl_p): Export.
2723 (mem_ref_refers_to_non_mem_p): Likewise.
2724 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
2725 but check the base with mem_ref_refers_to_non_mem_p
2728 2021-10-04 Richard Biener <rguenther@suse.de>
2730 PR tree-optimization/102570
2731 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
2732 we are using clique for the internal function code.
2733 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
2734 internal function code.
2735 (print_vn_reference_ops): Print the internal function code.
2736 (vn_reference_op_compute_hash): Hash it.
2737 (copy_reference_ops_from_call): Record it.
2738 (visit_stmt): Remove the restriction around internal function
2740 (fully_constant_vn_reference_p): Use fold_const_call and handle
2742 (vn_reference_eq): Compare call return types.
2743 * tree-ssa-pre.c (create_expression_by_pieces): Handle
2744 generating calls to internal functions.
2745 (compute_avail): Remove the restriction around internal function
2748 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
2750 PR tree-optimization/102560
2751 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
2752 marker for invalid_range.
2754 2021-10-04 Richard Biener <rguenther@suse.de>
2756 PR middle-end/102587
2757 * internal-fn.c (expand_DEFERRED_INIT): Guard register
2758 initialization path an avoid initializing VLA registers
2761 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
2763 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
2765 2021-10-03 Martin Liska <mliska@suse.cz>
2767 * toplev.c (toplev::main): Check opt_index if it is a part
2770 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
2772 PR tree-optimization/102563
2773 * range-op.cc (operator_lshift::op1_range): Do not clobber
2776 2021-10-02 Martin Liska <mliska@suse.cz>
2778 * toplev.c (toplev::main): save_decoded_options[0] is program
2779 name and so it should be skipped.
2781 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
2783 PR tree-optimization/102546
2784 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
2785 X << Y is non-zero implies X is also non-zero.
2787 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2789 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
2791 * config/aarch64/aarch64-tune.md: Regenerate.
2792 * doc/invoke.texi: Update docs.
2794 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2796 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
2798 * config/aarch64/aarch64-tune.md: Regenerate.
2799 * doc/invoke.texi: Update docs.
2801 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2803 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
2805 * config/aarch64/aarch64-tune.md: Regenerate.
2806 * doc/invoke.texi: Update docs.
2808 2021-10-01 Martin Sebor <msebor@redhat.com>
2811 * doc/invoke.texi (-Waddress): Update.
2812 * gengtype.c (write_types): Avoid -Waddress.
2813 * poly-int.h (POLY_SET_COEFF): Avoid using null.
2815 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
2818 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
2821 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2823 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
2825 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
2827 * gimple-range-path.cc (path_range_query::compute_ranges): Use
2829 * gimple-range-path.h (class path_range_query): Remove shadowed
2831 (path_range_query::get_path_oracle): New.
2833 2021-10-01 Jakub Jelinek <jakub@redhat.com>
2834 Richard Biener <rguenther@suse.de>
2837 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
2838 INT_MIN / -1 division detection from here ...
2839 (-fsanitize=signed-integer-overflow): ... and add it here.
2841 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2843 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
2845 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
2846 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
2847 (AARCH64_ISA_V9): New ISA flag.
2848 * doc/invoke.texi: Update docs.
2850 2021-10-01 Martin Liska <mliska@suse.cz>
2852 * toplev.c (toplev::main): Save decoded optimization options.
2853 * toplev.h (save_opt_decoded_options): New.
2854 * doc/extend.texi: Be more clear about optimize and target
2857 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
2859 * explow.c: Include langhooks.h.
2860 (set_stack_check_libfunc): Build a proper function type.
2862 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
2865 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
2866 not return true for external weak function symbols in medium model.
2868 2021-10-01 Jakub Jelinek <jakub@redhat.com>
2870 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
2871 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
2872 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
2873 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
2874 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
2875 OMP_CLAUSE_SCHEDULE_STATIC.
2877 2021-10-01 Richard Biener <rguenther@suse.de>
2879 PR middle-end/102518
2880 * tree-inline.c (setup_one_parameter): Avoid substituting
2881 an invariant into contexts where a GIMPLE register is not valid.
2883 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2885 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
2886 * config/arm/arm-tables.opt: Regenerate.
2887 * config/arm/arm-tune.md: Regenerate.
2888 * doc/invoke.texi: Update docs.
2890 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
2893 * config/i386/i386.md
2894 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
2897 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
2899 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
2900 omp_{c,re}alloc, fix omp_alloc/omp_free.
2902 2021-09-30 Martin Liska <mliska@suse.cz>
2904 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
2905 asm_out_file and stream directly to MYFILE.
2907 2021-09-30 Richard Biener <rguenther@suse.de>
2909 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
2910 Restore and fix condition under which we apply npeel to
2911 the DRs misalignment value.
2913 2021-09-30 Richard Biener <rguenther@suse.de>
2915 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
2916 Fix npeel check for variable amount of peeling.
2918 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
2920 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
2922 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
2924 * gimple-range.cc (gimple_ranger::debug): New.
2925 * gimple-range.h (class gimple_ranger): Add debug.
2927 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
2929 PR middle-end/102519
2930 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
2932 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
2935 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
2936 (btf_finalize): Empty the hash_map btf_var_ids.
2938 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
2940 * tree-vrp.c (thread_through_all_blocks): Return bool.
2941 (execute_vrp_threader): Return TODO_* flags.
2942 (pass_data_vrp_threader): Set todo_flags_finish to 0.
2944 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
2946 * timevar.def (TV_TREE_VRP_THREADER): New.
2947 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
2949 2021-09-29 David Faust <david.faust@oracle.com>
2951 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
2953 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
2955 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
2956 new type qualifier enum.
2957 (TYPES_TERNOP_SSSU): Likewise.
2958 (TYPES_TERNOP_PPPU): Likewise.
2959 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
2960 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
2962 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
2963 builtin and remove casts.
2964 (vqtbl1_s8): Likewise.
2965 (vqtbl1q_p8): Likewise.
2966 (vqtbl1q_s8): Likewise.
2967 (vqtbx1_s8): Likewise.
2968 (vqtbx1_p8): Likewise.
2969 (vqtbx1q_s8): Likewise.
2970 (vqtbx1q_p8): Likewise.
2971 (vtbl1_p8): Likewise.
2972 (vtbl2_p8): Likewise.
2973 (vtbx2_p8): Likewise.
2975 2021-09-29 Richard Biener <rguenther@suse.de>
2977 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
2979 (vect_update_misalignment_for_peel): Use it to update
2980 misaligned to the value necessary for an aligned access.
2981 (vect_get_peeling_costs_all_drs): Likewise.
2982 (vect_enhance_data_refs_alignment): Likewise.
2984 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
2986 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
2987 emitted operations and adjust heuristic for code size.
2989 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
2991 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
2992 emitted operations and adjust heuristic for code size.
2994 2021-09-29 Jakub Jelinek <jakub@redhat.com>
2996 PR middle-end/102504
2997 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
2998 in OMP_SCOPE clauses, not just on worksharing construct clauses.
3000 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
3002 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
3003 (<u>mulditi3): Call <su>muldi3_highpart.
3004 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
3005 (<u>mulsidi3): Call <su>mulsi3_highpart.
3006 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
3008 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
3010 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
3012 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
3014 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
3015 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
3018 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
3020 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
3021 (AARCH64_FL_V8_7): Likewise.
3022 (AARCH64_FL_FOR_ARCH8_7): Likewise.
3023 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
3024 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
3025 * doc/invoke.texi: Document the above.
3027 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
3029 * dbgcnt.c (dbg_cnt_counter): New.
3030 * dbgcnt.h (dbg_cnt_counter): New.
3031 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
3032 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
3033 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
3034 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
3037 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
3039 * cfgcleanup.c (pass_jump::execute): Check
3040 flag_expensive_optimizations.
3041 (pass_jump_after_combine::gate): Same.
3042 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
3043 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
3044 * tree-ssa-threadupdate.c
3045 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
3048 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
3050 * tree-ssa-reassoc.c (biased_names): New global.
3051 (propagate_bias_p): New function.
3052 (loop_carried_phi): Remove.
3053 (propagate_rank): Propagate bias along single uses.
3054 (get_rank): Update biased_names when needed.
3056 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
3058 * passes.def (pass_reassoc): Rename parameter to early_p.
3059 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
3061 (phi_rank): Don't bias loop-carried phi ranks
3062 before vectorization pass.
3063 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
3064 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
3066 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
3068 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
3070 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
3072 2021-09-28 Jakub Jelinek <jakub@redhat.com>
3075 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
3076 special 80387 instruction XFmode constants if flag_rounding_math.
3078 2021-09-28 Richard Biener <rguenther@suse.de>
3080 PR tree-optimization/100112
3081 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
3082 referece into the hashtable twice in case last_vuse is
3083 different from the original vuse on the stmt.
3085 2021-09-28 Jakub Jelinek <jakub@redhat.com>
3087 PR middle-end/102492
3088 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
3089 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
3090 clauses on SIMD constructs.
3092 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
3094 PR tree-optimization/102511
3095 * gimple-range-path.cc (path_range_query::range_on_path_entry):
3096 Return VARYING when nothing found.
3098 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
3101 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
3103 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
3104 (VALID_MMX_REG_MODE): Add V4HFmode.
3105 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
3106 vector mode condition.
3107 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
3109 (function_arg_32): Add V4HFmode.
3110 (function_arg_advance_32): Likewise.
3111 * config/i386/i386.md (mode): Add V4HF/V2HF.
3112 (MODE_SIZE): Likewise.
3113 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
3114 (V_32): Add V2HF mode.
3115 (VHF_32_64): New mode iterator.
3116 (*mov<mode>_internal): Adjust sse alternatives to support
3118 (*mov<mode>_internal): Adjust sse alternatives to support
3120 (<insn><mode>3): New define_insn for add/sub/mul/div.
3122 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
3124 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
3126 (pass_early_thread_jumps::gate): Same.
3127 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
3128 Return if !flag_thread_jumps.
3129 * tree-ssa-threadupdate.c
3130 (jt_path_registry::register_jump_thread): Assert that
3131 flag_thread_jumps is true.
3133 2021-09-28 liuhongt <hongtao.liu@intel.com>
3136 (simplify_context::simplify_binary_operation_1): Relax
3137 condition of simplifying (vec_concat:M (vec_select op0
3138 index0)(vec_select op1 index1)) to allow different modes
3139 between op0 and M, but have same inner mode.
3141 2021-09-28 liuhongt <hongtao.liu@intel.com>
3143 * config/i386/i386-expand.c (emit_reduc_half): Handle
3144 V8HF/V16HF/V32HFmode.
3145 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
3146 (REDUC_SSE_SMINMAX_MODE): Ditto.
3147 (REDUC_PLUS_MODE): Add V16HF and V32HF.
3148 (REDUC_SMINMAX_MODE): Ditto.
3150 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3152 * gimple-range-path.cc
3153 (path_range_query::precompute_ranges_in_block): Rename to...
3154 (path_range_query::compute_ranges_in_block): ...this.
3155 (path_range_query::precompute_ranges): Rename to...
3156 (path_range_query::compute_ranges): ...this.
3157 (path_range_query::precompute_relations): Rename to...
3158 (path_range_query::compute_relations): ...this.
3159 (path_range_query::precompute_phi_relations): Rename to...
3160 (path_range_query::compute_phi_relations): ...this.
3161 * gimple-range-path.h: Rename precompute* to compute*.
3162 * tree-ssa-threadbackward.c
3163 (back_threader::find_taken_edge_switch): Same.
3164 (back_threader::find_taken_edge_cond): Same.
3165 * tree-ssa-threadedge.c
3166 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
3167 (hybrid_jt_state::register_equivs_stmt): Inline...
3168 * tree-ssa-threadedge.h: ...here.
3170 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3172 * tree-vrp.c (lhs_of_dominating_assert): Remove.
3173 (class vrp_jt_state): Remove.
3174 (class vrp_jt_simplifier): Remove.
3175 (vrp_jt_simplifier::simplify): Remove.
3176 (class vrp_jump_threader): Remove.
3177 (vrp_jump_threader::vrp_jump_threader): Remove.
3178 (vrp_jump_threader::~vrp_jump_threader): Remove.
3179 (vrp_jump_threader::before_dom_children): Remove.
3180 (vrp_jump_threader::after_dom_children): Remove.
3182 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3184 * passes.def (pass_vrp_threader): New.
3185 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
3186 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
3187 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
3188 (hybrid_jt_simplifier::simplify): New.
3189 (hybrid_jt_simplifier::compute_ranges_from_state): New.
3190 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
3191 (class hybrid_jt_simplifier): New.
3192 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
3194 (class hybrid_threader): New.
3195 (hybrid_threader::hybrid_threader): New.
3196 (hybrid_threader::~hybrid_threader): New.
3197 (hybrid_threader::before_dom_children): New.
3198 (hybrid_threader::after_dom_children): New.
3199 (execute_vrp_threader): New.
3200 (class pass_vrp_threader): New.
3201 (make_pass_vrp_threader): New.
3203 2021-09-27 Martin Liska <mliska@suse.cz>
3205 * output.h (enum section_flag): New.
3206 (SECTION_FORGET): Remove.
3207 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
3208 (SECTION_STYLE_MASK): Define it based on other enum
3210 * varasm.c (switch_to_section): Remove unused handling of
3213 2021-09-27 Martin Liska <mliska@suse.cz>
3215 * common.opt: Add new variable flag_default_complex_method.
3216 * opts.c (finish_options): Handle flags related to
3217 x_flag_complex_method.
3218 * toplev.c (process_options): Remove option handling related
3219 to flag_complex_method.
3221 2021-09-27 Richard Biener <rguenther@suse.de>
3223 PR middle-end/102450
3224 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
3225 type_for_size, instead use int_mode_for_size.
3227 2021-09-27 Andrew Pinski <apinski@marvell.com>
3230 * gimplify.c (gimplify_save_expr): Return early
3231 if the type of val is error_mark_node.
3233 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3235 * tree-ssanames.c (ssa_name_has_boolean_range): Use
3238 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3240 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
3241 vrp_visit_cond_stmt.
3242 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
3243 (cprop_into_stmt): Same.
3244 (dom_opt_dom_walker::optimize_stmt): Same.
3246 2021-09-27 Richard Biener <rguenther@suse.de>
3248 PR tree-optimization/97351
3249 PR tree-optimization/97352
3250 PR tree-optimization/82426
3251 * tree-vectorizer.h (dr_misalignment): Add vector type
3253 (aligned_access_p): Likewise.
3254 (known_alignment_for_access_p): Likewise.
3255 (vect_supportable_dr_alignment): Likewise.
3256 (vect_known_alignment_in_bytes): Likewise. Refactor.
3257 (DR_MISALIGNMENT): Remove.
3258 (vect_update_shared_vectype): Likewise.
3259 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
3260 a vector type with larger alignment requirement and apply
3261 the negative step adjustment here.
3262 (vect_calculate_target_alignment): Remove.
3263 (vect_compute_data_ref_alignment): Get explicit vector type
3264 argument, do not apply a negative step alignment adjustment
3266 (vect_slp_analyze_node_alignment): Re-analyze alignment
3267 when we re-visit the DR with a bigger desired alignment but
3268 keep more precise results from smaller alignments.
3269 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
3270 (vect_slp_analyze_node_operations_1): Do not update the
3271 shared vector type on stmts.
3272 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
3273 vector type of an SLP node to the representative stmt-info.
3274 (vect_transform_stmt): Likewise.
3276 2021-09-27 liuhongt <hongtao.liu@intel.com>
3279 2021-09-09 liuhongt <hongtao.liu@intel.com>
3282 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
3283 (reduc_plus_scal_v4sf): .. this, New define_expand.
3284 (reduc_plus_scal_v2df): .. and this, New define_expand.
3286 2021-09-26 liuhongt <hongtao.liu@intel.com>
3288 * doc/extend.texi (Half-Precision): Remove storage only
3289 description for _Float16 w/o avx512fp16.
3291 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
3293 * config/pru/constraints.md (Rrio): New constraint.
3294 * config/pru/predicates.md (regio_operand): New predicate.
3295 * config/pru/pru-pragma.c (pru_register_pragmas): Register
3296 the __regio_symbol address space.
3297 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
3298 * config/pru/pru.c (pru_symref2ioregno): New helper function.
3299 (pru_legitimate_address_p): Remove.
3300 (pru_addr_space_legitimate_address_p): Use the address space
3302 (pru_nongeneric_pointer_addrspace): New helper function.
3303 (pru_insert_attributes): New function to validate __regio_symbol
3305 (TARGET_INSERT_ATTRIBUTES): New macro.
3306 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
3307 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
3308 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
3309 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
3311 (*regio_nozext_writesi): New pattern to write to I/O registers.
3312 (*regio_zext_write_r30<EQS0:mode>): Ditto.
3313 * doc/extend.texi: Document the new PRU Named Address Space.
3315 2021-09-24 Patrick Palka <ppalka@redhat.com>
3319 * real.c (encode_ieee_double): Avoid unwanted sign extension.
3320 (encode_ieee_quad): Likewise.
3322 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
3324 PR rtl-optimization/102147
3325 * ira-build.c (ira_conflict_vector_profitable_p): Make
3326 profitability calculation independent of host compiler pointer and
3329 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
3331 * gimple-range-path.cc (path_range_query::path_range_query):
3332 Move debugging header...
3333 (path_range_query::precompute_ranges): ...here.
3334 (path_range_query::internal_range_of_expr): Do not call
3335 range_on_path_entry if NAME is defined in the current block.
3337 2021-09-24 Richard Biener <rguenther@suse.de>
3339 * cfghooks.c (verify_flow_info): Verify unallocated BB and
3340 edge flags are not set.
3342 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
3344 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
3346 (jt_path_registry::register_jump_thread): Call
3347 cancel_invalid_paths.
3348 * tree-ssa-threadupdate.h (class jt_path_registry): Add
3349 cancel_invalid_paths.
3351 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
3353 PR tree-optimization/102400
3354 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
3355 result_vdef to zero value.
3357 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
3359 PR tree-optimization/102451
3360 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
3363 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
3365 * config/i386/sse.md (cond_<insn><mode>): Extend to support
3367 (cond_mul<mode>): Likewise.
3368 (cond_div<mode>): Likewise.
3369 (cond_<code><mode>): Likewise.
3370 (cond_fma<mode>): Likewise.
3371 (cond_fms<mode>): Likewise.
3372 (cond_fnma<mode>): Likewise.
3373 (cond_fnms<mode>): Likewise.
3375 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
3377 PR tree-optimization/102463
3378 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
3379 there is no range-ops handler, don't look for a relation.
3381 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
3383 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
3384 non-executable_edge flag as parameter.
3385 * gimple-range-cache.h (ranger_cache): Adjust prototype.
3386 * gimple-range-gori.cc (gori_compute::gori_compute): Take
3387 non-executable_edge flag as parameter.
3388 (gori_compute::outgoing_edge_range_p): Check new flag.
3389 * gimple-range-gori.h (gori_compute): Adjust prototype.
3390 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
3391 (gimple_ranger::range_on_edge): Check new flag.
3392 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
3393 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
3394 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
3395 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
3396 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
3397 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
3398 (simplify_using_ranges::simplify_switch_using_ranges): Clear
3399 EDGE_EXECUTABLE like it originally did.
3400 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
3401 NON_EXECUTABLE flags.
3402 (simplify_using_ranges::simplify_using_ranges): Adjust.
3403 * vr-values.h (class simplify_using_ranges): Adjust.
3404 (simplify_using_ranges::set_range_query): Add non-executable flag param.
3406 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
3409 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
3410 zero-width bit fields and return indicator.
3411 (rs6000_discover_homogeneous_aggregate): Diagnose when the
3412 presence of a zero-width bit field changes parameter passing in
3415 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
3417 * gimple-range-fold.cc (fold_using_range::range_of_phi):
3418 Remove dominator check.
3420 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
3422 * gimple-range-path.cc (path_range_query::precompute_relations):
3423 Hoist edge calculations before using EDGE_SUCC.
3425 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
3427 * configure.ac: Fix --with-multilib-list description.
3428 * configure: Regenerate.
3430 2021-09-23 Richard Biener <rguenther@suse.de>
3432 PR tree-optimization/102448
3433 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
3434 Clear alignment info copied from DR_PTR_INFO.
3436 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3438 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
3440 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
3441 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
3442 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
3443 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
3444 (vcond<mode><mode>): New expander.
3445 (vcond<mode><sseintvecmodelower>): Likewise.
3446 (vcond<sseintvecmodelower><mode>): Likewise.
3447 (vcondu<mode><sseintvecmodelower>): Likewise.
3449 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3451 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
3453 (extendv4hf<mode>2): Likewise.
3454 (extendv2hfv2df2): Likewise.
3455 (trunc<mode><ssePHmodelower>2): Likewise.
3456 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
3457 (trunc<mode>v4hf2): ... this, and drop constraints.
3458 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
3459 (truncv2dfv2hf2): ... this, and likewise.
3461 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3463 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
3465 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
3467 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
3468 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
3469 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
3471 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3473 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
3474 (fixuns_trunchfhi2): Likewise.
3475 (*fixuns_trunchfsi2zext): New define_insn.
3476 * config/i386/sse.md (ssePHmodelower): New mode_attr.
3477 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
3478 New expander for same element vector fix_truncate.
3479 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
3480 Likewise for V4HF to V4SI/V4DI fix_truncate.
3481 (fix<fixunssuffix>_truncv2hfv2di2):
3482 Likeise for V2HF to V2DI fix_truncate.
3484 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3486 * config/i386/i386.md (<code>hf3): New expander.
3488 2021-09-23 liuhongt <hongtao.liu@intel.com>
3490 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
3491 (VFH_SF_AVX512VL): Extend to handle HFmode.
3492 (VF_SF_AVX512VL): Deleted.
3494 2021-09-23 liuhongt <hongtao.liu@intel.com>
3496 * config/i386/i386.md (rinthf2): New expander.
3497 (nearbyinthf2): New expander.
3499 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
3501 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
3502 (class dom_jt_state): ...this and provide virtual overrides.
3503 (dom_jt_state::register_equiv): New.
3504 (class dom_jt_simplifier): Rename from
3505 dom_jump_threader_simplifier.
3506 (dom_jump_threader_simplifier::simplify): Rename...
3507 (dom_jt_simplifier::simplify): ...to this.
3508 (pass_dominator::execute): Use dom_jt_simplifier and
3510 * tree-ssa-threadedge.c (jump_threader::jump_threader):
3512 (jt_state::register_equivs_stmt): Abstract out...
3513 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
3515 (jump_threader::thread_around_empty_blocks): Update state.
3516 (jump_threader::thread_through_normal_block): Same.
3517 (jt_state::jt_state): Remove.
3518 (jt_state::push): Remove pass specific bits. Keep block vector
3520 (jt_state::append_path): New.
3521 (jt_state::pop): Remove pass specific bits.
3522 (jt_state::register_equiv): Same.
3523 (jt_state::record_ranges_from_stmt): Same.
3524 (jt_state::register_equivs_on_edge): Same. Rename...
3525 (jt_state::register_equivs_edge): ...to this.
3526 (jt_state::dump): New.
3527 (jt_state::debug): New.
3528 (jump_threader_simplifier::simplify): Remove.
3529 (jt_state::get_path): New.
3530 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
3531 class. Expose common functionality as virtual methods.
3532 (class jump_threader_simplifier): Same. Rename...
3533 (class jt_simplifier): ...to this.
3534 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
3535 (class vrp_jt_simplifier): ...to this. Provide pass specific
3537 (class vrp_jt_state): New.
3538 (vrp_jump_threader_simplifier::simplify): Rename...
3539 (vrp_jt_simplifier::simplify): ...to this. Inline code from
3540 what used to be the base class.
3541 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
3544 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
3547 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
3550 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
3551 Richard Biener <rguenther@suse.de>
3553 * match.pd (negation simplifications): Implement some negation
3554 folding transformations from fold-const.c's fold_negate_expr.
3555 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
3556 argument, to control whether the op should be simplified prior
3557 to looking up/assigning a value number.
3558 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
3559 (vn_nary_simplify): Likewise.
3560 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
3561 now call vn_nary_build_or_lookup_1 disabling simplification.
3563 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
3565 PR tree-optimization/102087
3566 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
3567 Update bound/cmp/control for niter.
3569 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
3571 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
3572 Move check for non-empty BB here.
3573 (fur_source::register_outgoing_edges): ...from here.
3575 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
3577 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
3578 Remove call to improve_range_with_equivs.
3579 (path_range_query::improve_range_with_equivs): Remove
3580 * gimple-range-path.h: Remove improve_range_with_equivs.
3582 2021-09-22 dianhong xu <dianhong.xu@intel.com>
3584 * config/i386/avx512fp16intrin.h:
3585 (_mm512_mask_blend_ph): New intrinsic.
3586 (_mm512_permutex2var_ph): Ditto.
3587 (_mm512_permutexvar_ph): Ditto.
3588 * config/i386/avx512fp16vlintrin.h:
3589 (_mm256_mask_blend_ph): New intrinsic.
3590 (_mm256_permutex2var_ph): Ditto.
3591 (_mm256_permutexvar_ph): Ditto.
3592 (_mm_mask_blend_ph): Ditto.
3593 (_mm_permutex2var_ph): Ditto.
3594 (_mm_permutexvar_ph): Ditto.
3596 2021-09-22 dianhong xu <dianhong.xu@intel.com>
3598 * config/i386/avx512fp16intrin.h: Add new intrinsics.
3599 (_mm512_conj_pch): New intrinsic.
3600 (_mm512_mask_conj_pch): Ditto.
3601 (_mm512_maskz_conj_pch): Ditto.
3602 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
3603 (_mm256_conj_pch): New intrinsic.
3604 (_mm256_mask_conj_pch): Ditto.
3605 (_mm256_maskz_conj_pch): Ditto.
3606 (_mm_conj_pch): Ditto.
3607 (_mm_mask_conj_pch): Ditto.
3608 (_mm_maskz_conj_pch): Ditto.
3610 2021-09-22 dianhong xu <dianhong.xu@intel.com>
3612 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
3613 (_mm512_reduce_add_ph): New intrinsic.
3614 (_mm512_reduce_mul_ph): Ditto.
3615 (_mm512_reduce_min_ph): Ditto.
3616 (_mm512_reduce_max_ph): Ditto.
3617 * config/i386/avx512fp16vlintrin.h
3618 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
3619 (_mm256_reduce_add_ph): New intrinsic.
3620 (_mm256_reduce_mul_ph): Ditto.
3621 (_mm256_reduce_min_ph): Ditto.
3622 (_mm256_reduce_max_ph): Ditto.
3623 (_mm_reduce_add_ph): Ditto.
3624 (_mm_reduce_mul_ph): Ditto.
3625 (_mm_reduce_min_ph): Ditto.
3626 (_mm_reduce_max_ph): Ditto.
3628 2021-09-22 dianhong xu <dianhong.xu@intel.com>
3630 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
3631 __m128h_u): New typedef.
3632 (_mm512_load_ph): New intrinsic.
3633 (_mm256_load_ph): Ditto.
3634 (_mm_load_ph): Ditto.
3635 (_mm512_loadu_ph): Ditto.
3636 (_mm256_loadu_ph): Ditto.
3637 (_mm_loadu_ph): Ditto.
3638 (_mm512_store_ph): Ditto.
3639 (_mm256_store_ph): Ditto.
3640 (_mm_store_ph): Ditto.
3641 (_mm512_storeu_ph): Ditto.
3642 (_mm256_storeu_ph): Ditto.
3643 (_mm_storeu_ph): Ditto.
3644 (_mm512_abs_ph): Ditto.
3645 * config/i386/avx512fp16vlintrin.h
3646 (_mm_abs_ph): Ditto.
3647 (_mm256_abs_ph): Ditto.
3649 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
3651 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
3653 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
3656 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
3657 is actually a full copy of the source operand into the target.
3658 Don't emit a strict low part move if source and target mode match.
3660 2021-09-22 Jakub Jelinek <jakub@redhat.com>
3662 PR middle-end/102415
3663 * omp-expand.c (expand_omp_single): If region->exit is NULL,
3664 assert region->entry is GIMPLE_OMP_SCOPE region and return.
3666 2021-09-22 Jakub Jelinek <jakub@redhat.com>
3668 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
3669 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
3670 arguments from 2 to 3.
3671 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
3672 allocate clause allocator and print align if present.
3673 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
3674 for omp_default_mem_alloc if align modifier is present. If align
3675 modifier is present, use TREE_LIST to encode both allocator and
3677 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
3678 Handle align modifier on allocator clause if present.
3680 2021-09-22 liuhongt <hongtao.liu@intel.com>
3682 * config/i386/i386.md (define_attr "isa"): Add
3684 (define_attr "enabled"): Correspond fma_or_avx512vl to
3685 TARGET_FMA || TARGET_AVX512VL.
3686 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
3691 2021-09-22 liuhongt <hongtao.liu@intel.com>
3693 * config/i386/i386.md (cstorehf3): New define_expand.
3695 2021-09-22 liuhongt <hongtao.liu@intel.com>
3697 * config/i386/i386.md (<rounding_insn>hf2): New expander.
3698 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
3699 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
3700 Extend from VF_128 to VFH_128.
3702 2021-09-22 liuhongt <hongtao.liu@intel.com>
3704 * config/i386/i386-features.c (i386-features.c): Handle
3706 * config/i386/i386.md (sqrthf2): New expander.
3707 (*sqrthf2): New define_insn.
3708 * config/i386/sse.md
3709 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
3712 2021-09-22 liuhongt <hongtao.liu@intel.com>
3714 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
3716 (_mm_mask3_fcmadd_sch): Likewise.
3717 (_mm_maskz_fcmadd_sch): Likewise.
3718 (_mm_fcmadd_sch): Likewise.
3719 (_mm_mask_fmadd_sch): Likewise.
3720 (_mm_mask3_fmadd_sch): Likewise.
3721 (_mm_maskz_fmadd_sch): Likewise.
3722 (_mm_fmadd_sch): Likewise.
3723 (_mm_mask_fcmadd_round_sch): Likewise.
3724 (_mm_mask3_fcmadd_round_sch): Likewise.
3725 (_mm_maskz_fcmadd_round_sch): Likewise.
3726 (_mm_fcmadd_round_sch): Likewise.
3727 (_mm_mask_fmadd_round_sch): Likewise.
3728 (_mm_mask3_fmadd_round_sch): Likewise.
3729 (_mm_maskz_fmadd_round_sch): Likewise.
3730 (_mm_fmadd_round_sch): Likewise.
3731 (_mm_fcmul_sch): Likewise.
3732 (_mm_mask_fcmul_sch): Likewise.
3733 (_mm_maskz_fcmul_sch): Likewise.
3734 (_mm_fmul_sch): Likewise.
3735 (_mm_mask_fmul_sch): Likewise.
3736 (_mm_maskz_fmul_sch): Likewise.
3737 (_mm_fcmul_round_sch): Likewise.
3738 (_mm_mask_fcmul_round_sch): Likewise.
3739 (_mm_maskz_fcmul_round_sch): Likewise.
3740 (_mm_fmul_round_sch): Likewise.
3741 (_mm_mask_fmul_round_sch): Likewise.
3742 (_mm_maskz_fmul_round_sch): Likewise.
3743 * config/i386/i386-builtin.def: Add corresponding new builtins.
3744 * config/i386/sse.md
3745 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
3746 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
3747 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
3749 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
3750 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
3752 * config/i386/subst.md (mask_scalarcz_name): New.
3753 (mask_scalarc_name): Ditto.
3754 (mask_scalarc_operand3): Ditto.
3755 (mask_scalarcz_operand4): Ditto.
3756 (round_scalarcz_name): Ditto.
3757 (round_scalarc_mask_operand3): Ditto.
3758 (round_scalarcz_mask_operand4): Ditto.
3759 (round_scalarc_mask_op3): Ditto.
3760 (round_scalarcz_mask_op4): Ditto.
3761 (round_scalarcz_constraint): Ditto.
3762 (round_scalarcz_nimm_predicate): Ditto.
3763 (mask_scalarcz): Ditto.
3764 (mask_scalarc): Ditto.
3765 (round_scalarcz): Ditto.
3767 2021-09-22 liuhongt <hongtao.liu@intel.com>
3769 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
3771 (_mm512_mask_fcmadd_pch): Likewise.
3772 (_mm512_mask3_fcmadd_pch): Likewise.
3773 (_mm512_maskz_fcmadd_pch): Likewise.
3774 (_mm512_fmadd_pch): Likewise.
3775 (_mm512_mask_fmadd_pch): Likewise.
3776 (_mm512_mask3_fmadd_pch): Likewise.
3777 (_mm512_maskz_fmadd_pch): Likewise.
3778 (_mm512_fcmadd_round_pch): Likewise.
3779 (_mm512_mask_fcmadd_round_pch): Likewise.
3780 (_mm512_mask3_fcmadd_round_pch): Likewise.
3781 (_mm512_maskz_fcmadd_round_pch): Likewise.
3782 (_mm512_fmadd_round_pch): Likewise.
3783 (_mm512_mask_fmadd_round_pch): Likewise.
3784 (_mm512_mask3_fmadd_round_pch): Likewise.
3785 (_mm512_maskz_fmadd_round_pch): Likewise.
3786 (_mm512_fcmul_pch): Likewise.
3787 (_mm512_mask_fcmul_pch): Likewise.
3788 (_mm512_maskz_fcmul_pch): Likewise.
3789 (_mm512_fmul_pch): Likewise.
3790 (_mm512_mask_fmul_pch): Likewise.
3791 (_mm512_maskz_fmul_pch): Likewise.
3792 (_mm512_fcmul_round_pch): Likewise.
3793 (_mm512_mask_fcmul_round_pch): Likewise.
3794 (_mm512_maskz_fcmul_round_pch): Likewise.
3795 (_mm512_fmul_round_pch): Likewise.
3796 (_mm512_mask_fmul_round_pch): Likewise.
3797 (_mm512_maskz_fmul_round_pch): Likewise.
3798 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
3800 (_mm_mask_fmadd_pch): Likewise.
3801 (_mm_mask3_fmadd_pch): Likewise.
3802 (_mm_maskz_fmadd_pch): Likewise.
3803 (_mm256_fmadd_pch): Likewise.
3804 (_mm256_mask_fmadd_pch): Likewise.
3805 (_mm256_mask3_fmadd_pch): Likewise.
3806 (_mm256_maskz_fmadd_pch): Likewise.
3807 (_mm_fcmadd_pch): Likewise.
3808 (_mm_mask_fcmadd_pch): Likewise.
3809 (_mm_mask3_fcmadd_pch): Likewise.
3810 (_mm_maskz_fcmadd_pch): Likewise.
3811 (_mm256_fcmadd_pch): Likewise.
3812 (_mm256_mask_fcmadd_pch): Likewise.
3813 (_mm256_mask3_fcmadd_pch): Likewise.
3814 (_mm256_maskz_fcmadd_pch): Likewise.
3815 (_mm_fmul_pch): Likewise.
3816 (_mm_mask_fmul_pch): Likewise.
3817 (_mm_maskz_fmul_pch): Likewise.
3818 (_mm256_fmul_pch): Likewise.
3819 (_mm256_mask_fmul_pch): Likewise.
3820 (_mm256_maskz_fmul_pch): Likewise.
3821 (_mm_fcmul_pch): Likewise.
3822 (_mm_mask_fcmul_pch): Likewise.
3823 (_mm_maskz_fcmul_pch): Likewise.
3824 (_mm256_fcmul_pch): Likewise.
3825 (_mm256_mask_fcmul_pch): Likewise.
3826 (_mm256_maskz_fcmul_pch): Likewise.
3827 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
3828 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
3829 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
3830 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
3831 * config/i386/i386-builtin.def: Add new builtins.
3832 * config/i386/i386-expand.c: Handle new builtin types.
3833 * config/i386/subst.md (SUBST_CV): New.
3834 (maskc_name): Ditto.
3835 (maskc_operand3): Ditto.
3837 (sdc_maskz_name): Ditto.
3838 (sdc_mask_op4): Ditto.
3839 (sdc_mask_op5): Ditto.
3840 (sdc_mask_mode512bit_condition): Ditto.
3842 (round_maskc_operand3): Ditto.
3843 (round_sdc_mask_operand4): Ditto.
3844 (round_maskc_op3): Ditto.
3845 (round_sdc_mask_op4): Ditto.
3846 (round_saeonly_sdc_mask_operand5): Ditto.
3847 * config/i386/sse.md (unspec): Add complex fma unspecs.
3848 (avx512fmaskcmode): New.
3849 (UNSPEC_COMPLEX_F_C_MA): Ditto.
3850 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
3851 (complexopname): Ditto.
3852 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
3853 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
3854 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
3856 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
3857 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
3859 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
3861 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
3862 rs6000-density-size-threshold, rs6000-density-penalty,
3863 rs6000-density-load-pct-threshold,
3864 rs6000-density-load-num-threshold): New parameter.
3865 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
3866 corresponding parameters.
3868 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
3870 * gimple-range-path.cc (path_range_query::defined_outside_path):
3872 (path_range_query::range_on_path_entry): New.
3873 (path_range_query::internal_range_of_expr): Resolve unknowns
3875 (path_range_query::improve_range_with_equivs): New.
3876 (path_range_query::ssa_range_in_phi): Resolve unknowns with
3878 * gimple-range-path.h (class path_range_query): Add
3879 defined_outside_path, range_on_path_entry, and
3880 improve_range_with_equivs.
3882 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
3884 * gimple-range-path.cc (path_range_query::add_to_imports): New.
3885 (path_range_query::add_copies_to_imports): New.
3886 (path_range_query::precompute_ranges): Call
3887 add_copies_to_imports.
3888 * gimple-range-path.h (class path_range_query): Add prototypes
3889 for add_copies_to_imports and add_to_imports.
3891 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
3893 * gimple-range-path.cc (path_range_query::range_defined_in_block):
3894 Remove useless code.
3896 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
3898 * gimple-range-fold.h (class fur_source): Make oracle protected.
3899 * gimple-range-path.cc (path_range_query::path_range_query): Add
3900 resolve argument. Initialize oracle.
3901 (path_range_query::~path_range_query): Delete oracle.
3902 (path_range_query::range_of_stmt): Adapt to use relations.
3903 (path_range_query::precompute_ranges): Pre-compute relations.
3904 (class jt_fur_source): New
3905 (jt_fur_source::jt_fur_source): New.
3906 (jt_fur_source::register_relation): New.
3907 (jt_fur_source::query_relation): New.
3908 (path_range_query::precompute_relations): New.
3909 (path_range_query::precompute_phi_relations): New.
3910 * gimple-range-path.h (path_range_query): Add resolve argument.
3911 Add oracle, precompute_relations, precompute_phi_relations.
3912 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
3913 resolve argument to solver.
3915 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
3917 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
3918 Rename postfold_gcond_edges to register_outgoing_edges and
3920 (fold_using_range::postfold_gcond_edges): Rename...
3921 (fur_source::register_outgoing_edges): ...to this.
3922 * gimple-range-fold.h (postfold_gcond_edges): Rename to
3923 register_outgoing_edges and move to fur_source.
3925 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
3927 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
3928 dom_info_available_p.
3930 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
3932 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
3933 and quick_grow_cleared instead of safe_grow_cleared.
3935 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
3938 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
3941 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
3943 * configure.ac: Detect when the assembler supports new-style
3944 architecture extensions.
3945 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
3946 the full CPU string if the assembler can grok it.
3947 (arm_rewrite_march): Likewise but for the architecture.
3948 * config.in: Regenerate.
3949 * configure: Regenerate.
3951 2021-09-21 Richard Biener <rguenther@suse.de>
3953 PR tree-optimization/102421
3954 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
3955 adjust alignment info.
3957 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
3959 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
3960 bitfield stream out.
3962 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
3964 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
3965 undefined edges, apply an equivalence if appropriate.
3966 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
3967 UNDEFINED if EDGE_EXECUTABLE is not set.
3968 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
3969 as EXECUTABLE upon startup.
3970 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
3971 EDGE_EXECUTABLE set.
3972 * vr-values.c (set_and_propagate_unexecutable): New.
3973 (simplify_using_ranges::fold_cond): Call set_and_propagate.
3974 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
3975 * vr-values.h: Add prototype.
3977 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
3979 * value-relation.cc (equiv_oracle::register_initial_def): New.
3980 (equiv_oracle::register_relation): Call register_initial_def.
3981 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
3982 (relation_oracle::register_stmt): Check def block of PHI arguments.
3983 * value-relation.h (equiv_oracle): Add new prototypes.
3985 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
3987 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
3988 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
3989 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
3990 __ROUNDING_MATH__ according to their corresponding flags.
3991 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
3992 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
3993 and __ROUNDING_MATH__.
3995 2021-09-20 Richard Biener <rguenther@suse.de>
3997 * tree-vect-stmts.c (vectorizable_load): Use the vectype
4000 2021-09-20 Richard Biener <rguenther@suse.de>
4002 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
4003 Do not compute alignment of the vectorized access here.
4005 2021-09-20 Richard Biener <rguenther@suse.de>
4007 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
4008 Store -1 for runtime alias peeling iterations.
4010 2021-09-20 Richard Biener <rguenther@suse.de>
4012 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
4014 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
4016 * input.c (string_concat_db::record_string_concatenation)
4017 (string_concat_db::get_string_concatenation): Skip for
4018 'RESERVED_LOCATION_P'.
4020 2021-09-20 Richard Biener <rguenther@suse.de>
4022 PR tree-optimization/65206
4023 * tree-data-ref.h (struct data_reference): Add alt_indices,
4025 * tree-data-ref.c (free_data_ref): Release alt_indices.
4026 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
4027 (create_data_ref): Adjust.
4028 (initialize_data_dependence_relation): Split into head
4029 and tail. When the base objects fail to match up try
4030 again with pointer-based analysis of indices.
4031 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
4032 not compare the lazily computed alternate set of indices.
4034 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
4036 * gcc.c: Test for execute OK when we find the
4037 programs for assembler linker and dsymutil and those
4038 were specified at configure-time.
4040 2021-09-19 Martin Sebor <msebor@redhat.com>
4042 PR middle-end/102403
4043 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
4044 Correct a function pre/postcondition.
4046 2021-09-19 Martin Sebor <msebor@redhat.com>
4048 PR middle-end/102243
4049 * tree-ssa-strlen.c (get_range): Handle null cfun.
4051 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
4053 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
4054 unwinder shim as a convenience library.
4056 2021-09-19 Andrew Pinski <apinski@marvell.com>
4058 * doc/install.texi: Add note about
4059 binutils 2.35 is required for LTO usage.
4061 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
4063 * tree-ssa-threadbackward.c
4064 (back_threader_registry::register_path): Use push_edge.
4065 * tree-ssa-threadedge.c
4066 (jump_threader::thread_around_empty_blocks): Same.
4067 (jump_threader::thread_through_normal_block): Same.
4068 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
4070 * tree-ssa-threadupdate.c
4071 (jt_path_registry::allocate_thread_edge): Remove.
4072 (jt_path_registry::push_edge): New.
4073 (dump_jump_thread_path): Make static.
4074 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
4077 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
4079 * gimple-range-path.cc (path_range_query::path_range_query): Add
4081 (path_range_query::dump): Remove extern declaration of dump_ranger.
4082 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
4083 * gimple-range-trace.h (dump_ranger): Add prototype.
4085 2021-09-19 John Ericson <git@JohnEricson.me>
4087 * gcc.c (find_a_program): New function, factored out of...
4088 (find_a_file): Here.
4089 (execute): Use find_a_program when looking for programs rather
4092 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
4094 * config/avr/avr-mcus.def: Add atmega324pb.
4095 * doc/avr-mmcu.texi: Corresponding changes.
4097 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
4100 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
4101 the same transformations as to @1. For comparisons against NaN,
4102 don't check HONOR_SNANS but confirm that neither operand is a
4105 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
4107 * attribs.c (make_unique_name): Delete.
4108 * attribs.h (make_unique_name): Delete.
4110 2021-09-19 Andrew Pinski <apinski@marvell.com>
4112 * lra-constraints.c (check_and_process_move): Assert
4113 that dclass and sclass are greater than or equal to NO_REGS.
4115 2021-09-18 Jakub Jelinek <jakub@redhat.com>
4117 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
4118 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
4119 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
4121 2021-09-18 liuhongt <hongtao.liu@intel.com>
4123 * config/i386/i386-features.c (remove_partial_avx_dependency):
4124 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
4125 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
4127 2021-09-18 Jakub Jelinek <jakub@redhat.com>
4129 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
4130 if file/namespace scope variable doesn't have predetermined sharing,
4131 treat it as if there was default(none).
4133 2021-09-18 liuhongt <hongtao.liu@intel.com>
4135 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
4137 (_mm_mask_fmadd_sh): Likewise.
4138 (_mm_mask3_fmadd_sh): Likewise.
4139 (_mm_maskz_fmadd_sh): Likewise.
4140 (_mm_fmadd_round_sh): Likewise.
4141 (_mm_mask_fmadd_round_sh): Likewise.
4142 (_mm_mask3_fmadd_round_sh): Likewise.
4143 (_mm_maskz_fmadd_round_sh): Likewise.
4144 (_mm_fnmadd_sh): Likewise.
4145 (_mm_mask_fnmadd_sh): Likewise.
4146 (_mm_mask3_fnmadd_sh): Likewise.
4147 (_mm_maskz_fnmadd_sh): Likewise.
4148 (_mm_fnmadd_round_sh): Likewise.
4149 (_mm_mask_fnmadd_round_sh): Likewise.
4150 (_mm_mask3_fnmadd_round_sh): Likewise.
4151 (_mm_maskz_fnmadd_round_sh): Likewise.
4152 (_mm_fmsub_sh): Likewise.
4153 (_mm_mask_fmsub_sh): Likewise.
4154 (_mm_mask3_fmsub_sh): Likewise.
4155 (_mm_maskz_fmsub_sh): Likewise.
4156 (_mm_fmsub_round_sh): Likewise.
4157 (_mm_mask_fmsub_round_sh): Likewise.
4158 (_mm_mask3_fmsub_round_sh): Likewise.
4159 (_mm_maskz_fmsub_round_sh): Likewise.
4160 (_mm_fnmsub_sh): Likewise.
4161 (_mm_mask_fnmsub_sh): Likewise.
4162 (_mm_mask3_fnmsub_sh): Likewise.
4163 (_mm_maskz_fnmsub_sh): Likewise.
4164 (_mm_fnmsub_round_sh): Likewise.
4165 (_mm_mask_fnmsub_round_sh): Likewise.
4166 (_mm_mask3_fnmsub_round_sh): Likewise.
4167 (_mm_maskz_fnmsub_round_sh): Likewise.
4168 * config/i386/i386-builtin-types.def
4169 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
4170 * config/i386/i386-builtin.def: Add new builtins.
4171 * config/i386/i386-expand.c: Handle new builtin type.
4172 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
4173 Ajdust to support FP16.
4174 (fmai_vmfmsub_<mode><round_name>): Ditto.
4175 (fmai_vmfnmadd_<mode><round_name>): Ditto.
4176 (fmai_vmfnmsub_<mode><round_name>): Ditto.
4177 (*fmai_fmadd_<mode>): Ditto.
4178 (*fmai_fmsub_<mode>): Ditto.
4179 (*fmai_fnmadd_<mode><round_name>): Ditto.
4180 (*fmai_fnmsub_<mode><round_name>): Ditto.
4181 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
4182 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
4183 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
4184 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
4185 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
4186 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
4187 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
4188 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
4189 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
4190 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
4191 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
4192 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
4193 adjust to support FP16.
4194 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
4195 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
4196 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
4199 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
4201 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
4203 (maskload<mode><avx512fmaskmodelower>): Ditto.
4204 (maskstore<mode><avx512fmaskmodelower>): Ditto.
4206 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
4208 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
4210 (ix86_expand_copysign): Ditto.
4211 (ix86_expand_xorsign): Ditto.
4212 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
4214 (ix86_build_signbit_mask): Ditto.
4215 (ix86_can_change_mode_class): Ditto.
4216 * config/i386/i386.md
4217 (SSEMODEF): Add HFmode.
4218 (ssevecmodef): Ditto.
4219 (<code>hf2): New define_expand.
4220 (*<code>hf2_1): New define_insn_and_split.
4221 (copysign<mode>): Extend to support HFmode under AVX512FP16.
4222 (xorsign<mode>): Ditto.
4223 * config/i386/sse.md (VFB): New mode iterator.
4224 (VFB_128_256): Ditto.
4226 (sseintvecmode2): Support HF vector mode.
4227 (<code><mode>2): Use new mode iterator.
4228 (*<code><mode>2): Ditto.
4229 (copysign<mode>3): Ditto.
4230 (xorsign<mode>3): Ditto.
4231 (<code><mode>3<mask_name>): Ditto.
4232 (<code><mode>3<mask_name>): Ditto.
4233 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
4234 (<sse>_andnot<mode>3<mask_name>): Ditto.
4235 (*<code><mode>3<mask_name>): Ditto.
4236 (*<code><mode>3<mask_name>): Ditto.
4238 2021-09-18 liuhongt <hongtao.liu@intel.com>
4240 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
4242 (_mm512_mask3_fmadd_ph): Likewise.
4243 (_mm512_maskz_fmadd_ph): Likewise.
4244 (_mm512_fmadd_round_ph): Likewise.
4245 (_mm512_mask_fmadd_round_ph): Likewise.
4246 (_mm512_mask3_fmadd_round_ph): Likewise.
4247 (_mm512_maskz_fmadd_round_ph): Likewise.
4248 (_mm512_fnmadd_ph): Likewise.
4249 (_mm512_mask_fnmadd_ph): Likewise.
4250 (_mm512_mask3_fnmadd_ph): Likewise.
4251 (_mm512_maskz_fnmadd_ph): Likewise.
4252 (_mm512_fnmadd_round_ph): Likewise.
4253 (_mm512_mask_fnmadd_round_ph): Likewise.
4254 (_mm512_mask3_fnmadd_round_ph): Likewise.
4255 (_mm512_maskz_fnmadd_round_ph): Likewise.
4256 (_mm512_fmsub_ph): Likewise.
4257 (_mm512_mask_fmsub_ph): Likewise.
4258 (_mm512_mask3_fmsub_ph): Likewise.
4259 (_mm512_maskz_fmsub_ph): Likewise.
4260 (_mm512_fmsub_round_ph): Likewise.
4261 (_mm512_mask_fmsub_round_ph): Likewise.
4262 (_mm512_mask3_fmsub_round_ph): Likewise.
4263 (_mm512_maskz_fmsub_round_ph): Likewise.
4264 (_mm512_fnmsub_ph): Likewise.
4265 (_mm512_mask_fnmsub_ph): Likewise.
4266 (_mm512_mask3_fnmsub_ph): Likewise.
4267 (_mm512_maskz_fnmsub_ph): Likewise.
4268 (_mm512_fnmsub_round_ph): Likewise.
4269 (_mm512_mask_fnmsub_round_ph): Likewise.
4270 (_mm512_mask3_fnmsub_round_ph): Likewise.
4271 (_mm512_maskz_fnmsub_round_ph): Likewise.
4272 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
4274 (_mm256_mask_fmadd_ph): Likewise.
4275 (_mm256_mask3_fmadd_ph): Likewise.
4276 (_mm256_maskz_fmadd_ph): Likewise.
4277 (_mm_fmadd_ph): Likewise.
4278 (_mm_mask_fmadd_ph): Likewise.
4279 (_mm_mask3_fmadd_ph): Likewise.
4280 (_mm_maskz_fmadd_ph): Likewise.
4281 (_mm256_fnmadd_ph): Likewise.
4282 (_mm256_mask_fnmadd_ph): Likewise.
4283 (_mm256_mask3_fnmadd_ph): Likewise.
4284 (_mm256_maskz_fnmadd_ph): Likewise.
4285 (_mm_fnmadd_ph): Likewise.
4286 (_mm_mask_fnmadd_ph): Likewise.
4287 (_mm_mask3_fnmadd_ph): Likewise.
4288 (_mm_maskz_fnmadd_ph): Likewise.
4289 (_mm256_fmsub_ph): Likewise.
4290 (_mm256_mask_fmsub_ph): Likewise.
4291 (_mm256_mask3_fmsub_ph): Likewise.
4292 (_mm256_maskz_fmsub_ph): Likewise.
4293 (_mm_fmsub_ph): Likewise.
4294 (_mm_mask_fmsub_ph): Likewise.
4295 (_mm_mask3_fmsub_ph): Likewise.
4296 (_mm_maskz_fmsub_ph): Likewise.
4297 (_mm256_fnmsub_ph): Likewise.
4298 (_mm256_mask_fnmsub_ph): Likewise.
4299 (_mm256_mask3_fnmsub_ph): Likewise.
4300 (_mm256_maskz_fnmsub_ph): Likewise.
4301 (_mm_fnmsub_ph): Likewise.
4302 (_mm_mask_fnmsub_ph): Likewise.
4303 (_mm_mask3_fnmsub_ph): Likewise.
4304 (_mm_maskz_fnmsub_ph): Likewise.
4305 * config/i386/i386-builtin.def: Add corresponding new builtins.
4306 * config/i386/sse.md
4307 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
4308 support HF vector modes.
4309 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
4311 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
4312 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
4313 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
4314 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
4315 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
4316 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
4317 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
4319 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
4320 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
4321 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
4322 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
4323 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
4324 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
4326 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
4327 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
4328 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
4329 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
4330 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
4331 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
4332 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
4334 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
4335 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
4336 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
4337 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
4338 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
4340 2021-09-18 liuhongt <hongtao.liu@intel.com>
4342 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
4344 (_mm512_mask_fmaddsub_ph): Likewise.
4345 (_mm512_mask3_fmaddsub_ph): Likewise.
4346 (_mm512_maskz_fmaddsub_ph): Likewise.
4347 (_mm512_fmaddsub_round_ph): Likewise.
4348 (_mm512_mask_fmaddsub_round_ph): Likewise.
4349 (_mm512_mask3_fmaddsub_round_ph): Likewise.
4350 (_mm512_maskz_fmaddsub_round_ph): Likewise.
4351 (_mm512_mask_fmsubadd_ph): Likewise.
4352 (_mm512_mask3_fmsubadd_ph): Likewise.
4353 (_mm512_maskz_fmsubadd_ph): Likewise.
4354 (_mm512_fmsubadd_round_ph): Likewise.
4355 (_mm512_mask_fmsubadd_round_ph): Likewise.
4356 (_mm512_mask3_fmsubadd_round_ph): Likewise.
4357 (_mm512_maskz_fmsubadd_round_ph): Likewise.
4358 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
4360 (_mm256_mask_fmaddsub_ph): Likewise.
4361 (_mm256_mask3_fmaddsub_ph): Likewise.
4362 (_mm256_maskz_fmaddsub_ph): Likewise.
4363 (_mm_fmaddsub_ph): Likewise.
4364 (_mm_mask_fmaddsub_ph): Likewise.
4365 (_mm_mask3_fmaddsub_ph): Likewise.
4366 (_mm_maskz_fmaddsub_ph): Likewise.
4367 (_mm256_fmsubadd_ph): Likewise.
4368 (_mm256_mask_fmsubadd_ph): Likewise.
4369 (_mm256_mask3_fmsubadd_ph): Likewise.
4370 (_mm256_maskz_fmsubadd_ph): Likewise.
4371 (_mm_fmsubadd_ph): Likewise.
4372 (_mm_mask_fmsubadd_ph): Likewise.
4373 (_mm_mask3_fmsubadd_ph): Likewise.
4374 (_mm_maskz_fmsubadd_ph): Likewise.
4375 * config/i386/i386-builtin.def: Add corresponding new builtins.
4376 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
4377 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
4378 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
4380 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
4382 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
4383 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
4384 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
4386 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
4387 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
4389 2021-09-18 liuhongt <hongtao.liu@intel.com>
4392 * config/i386/i386.c (ix86_print_operand): Handle
4393 V8HF/V16HF/V32HFmode.
4394 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
4395 * config/i386/sse.md (avx512bcst): Remove.
4397 2021-09-17 Martin Sebor <msebor@redhat.com>
4399 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
4400 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
4401 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
4402 (check_defs): Add comment.
4403 (can_skip_redundant_opnd): Update comment.
4404 (compute_uninit_opnds_pos): Adjust to namespace change.
4405 (find_pdom): Move to gimple-predicate-analysis.cc.
4407 (struct uninit_undef_val_t): New.
4408 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
4409 (find_control_equiv_block): Same.
4410 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
4411 (MAX_SWITCH_CASES): Same.
4412 (compute_control_dep_chain): Same.
4413 (find_uninit_use): Use predicate analyzer.
4414 (struct pred_info): Move to gimple-predicate-analysis.
4415 (convert_control_dep_chain_into_preds): Same.
4416 (find_predicates): Same.
4417 (collect_phi_def_edges): Same.
4418 (warn_uninitialized_phi): Use predicate analyzer.
4419 (find_def_preds): Move to gimple-predicate-analysis.
4420 (dump_pred_info): Same.
4421 (dump_pred_chain): Same.
4422 (dump_predicates): Same.
4423 (destroy_predicate_vecs): Remove.
4424 (execute_late_warn_uninitialized): New.
4425 (get_cmp_code): Move to gimple-predicate-analysis.
4426 (is_value_included_in): Same.
4427 (value_sat_pred_p): Same.
4428 (find_matching_predicate_in_rest_chains): Same.
4429 (is_use_properly_guarded): Same.
4430 (prune_uninit_phi_opnds): Same.
4431 (find_var_cmp_const): Same.
4432 (use_pred_not_overlap_with_undef_path_pred): Same.
4433 (pred_equal_p): Same.
4434 (is_neq_relop_p): Same.
4435 (is_neq_zero_form_p): Same.
4436 (pred_expr_equal_p): Same.
4437 (is_pred_expr_subset_of): Same.
4438 (is_pred_chain_subset_of): Same.
4439 (is_included_in): Same.
4440 (is_superset_of): Same.
4442 (simplify_pred): Same.
4443 (simplify_preds_2): Same.
4444 (simplify_preds_3): Same.
4445 (simplify_preds_4): Same.
4446 (simplify_preds): Same.
4448 (push_to_worklist): Same.
4449 (get_pred_info_from_cmp): Same.
4450 (is_degenerated_phi): Same.
4451 (normalize_one_pred_1): Same.
4452 (normalize_one_pred): Same.
4453 (normalize_one_pred_chain): Same.
4454 (normalize_preds): Same.
4455 (can_one_predicate_be_invalidated_p): Same.
4456 (can_chain_union_be_invalidated_p): Same.
4457 (uninit_uses_cannot_happen): Same.
4458 (pass_late_warn_uninitialized::execute): Define.
4459 * gimple-predicate-analysis.cc: New file.
4460 * gimple-predicate-analysis.h: New file.
4462 2021-09-17 Julian Brown <julian@codesourcery.com>
4464 * config/gcn/gcn.c (gimple.h): Include.
4465 (gcn_fork_join): Emit barrier for worker-level joins.
4466 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
4467 writes_gang_private bitmap parameter. Set bit for blocks
4468 containing gang-private variable writes.
4469 (worker_single_simple): Don't emit barrier after predicated block.
4470 (worker_single_copy): Don't emit barrier if we're not broadcasting
4471 anything and the block contains no gang-private writes.
4472 (neuter_worker_single): Don't predicate blocks that only contain
4473 NOPs or internal marker functions. Pass has_gang_private_write
4474 argument to worker_single_copy.
4475 (oacc_do_neutering): Add writes_gang_private bitmap handling.
4477 2021-09-17 Julian Brown <julian@codesourcery.com>
4479 * config/gcn/gcn-protos.h
4480 (gcn_goacc_create_worker_broadcast_record): Update prototype.
4481 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
4482 preallocated block of LDS memory. Do not cache/share decls for
4483 reduction temporaries between invocations.
4484 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
4485 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
4486 and return temporary LDS space at that offset. Return pointer in
4488 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
4490 (ACC_LDS_SIZE): Define as acc_lds_size.
4491 (gcn_init_machine_status): Don't initialise lds_allocated,
4492 lds_allocs, reduc_decls fields of machine function struct.
4493 (gcn_option_override): Handle default size for gang-private
4494 variables and -mgang-private-size option.
4495 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
4496 initialising M0_REG.
4497 (gcn_shared_mem_layout): New function.
4498 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
4499 gang_private_hwm variable.
4500 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
4501 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
4502 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
4503 * config/gcn/gcn.opt (gang_private_size_opt): New global.
4504 (mgang-private-size=): New option.
4505 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
4507 * doc/tm.texi: Regenerate.
4508 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
4510 (build_sender_ref): Handle sender_decl being pointer.
4511 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
4512 parameters. Pass placement argument to
4513 create_worker_broadcast_record hook invocations. Handle
4514 sender_decl being pointer and isolate_broadcasts inserting extra
4516 (blk_offset_map_t): Add typedef.
4517 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
4518 preallocated range to worker_single_copy call.
4519 (dfs_broadcast_reachable_1): New function.
4520 (idx_decl_pair_t, used_range_vec_t): New typedefs.
4521 (sort_size_descending): New function.
4522 (addr_range): New class.
4523 (splay_tree_compare_addr_range, splay_tree_free_key)
4524 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
4525 (execute_omp_oacc_neuter_broadcast): Rename to...
4526 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
4527 parameters. Arrange layout of shared memory for broadcast
4529 (execute_omp_oacc_neuter_broadcast): New function.
4530 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
4531 handling from here. Enable pass for all OpenACC routines in order
4532 to call shared memory-layout hook.
4533 * target.def (create_worker_broadcast_record): Add OFFSET
4535 (shared_mem_layout): New hook.
4537 2021-09-17 Julian Brown <julian@codesourcery.com>
4538 Thomas Schwinge <thomas@codesourcery.com>
4540 * omp-oacc-neuter-broadcast.cc
4541 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
4543 (execute_omp_oacc_neuter_broadcast): Adjust.
4545 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
4547 * value-relation.cc (class equiv_chain): Move to header file.
4548 (path_oracle::path_oracle): New.
4549 (path_oracle::~path_oracle): New.
4550 (path_oracle::register_relation): New.
4551 (path_oracle::query_relation): New.
4552 (path_oracle::reset_path): New.
4553 (path_oracle::dump): New.
4554 * value-relation.h (class equiv_chain): Move to here.
4555 (class path_oracle): New.
4557 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
4559 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
4561 * gimple-range-fold.cc (fur_depend::register_relation): Use
4562 register_stmt/edge routines.
4563 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
4564 (equiv_oracle::equiv_oracle): Create self equivalence cache.
4565 (equiv_oracle::~equiv_oracle): Release same.
4566 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
4567 are no equivalences.
4568 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
4569 (equiv_oracle::register_relation): Rename from register_equiv.
4570 (relation_chain_head::find_relation): Relocate from dom_oracle.
4571 (relation_oracle::register_stmt): New.
4572 (relation_oracle::register_edge): New.
4573 (dom_oracle::*): Rename from relation_oracle.
4574 (dom_oracle::register_relation): Adjust to call equiv_oracle.
4575 (dom_oracle::set_one_relation): Split from register_relation.
4576 (dom_oracle::register_transitives): Consolidate 2 methods.
4577 (dom_oracle::find_relation_block): Move core to relation_chain.
4578 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
4579 * value-relation.h (class relation_oracle): New pure virtual base.
4580 (class equiv_oracle): Inherit from relation_oracle and adjust.
4581 (class dom_oracle): Rename from old relation_oracle and adjust.
4583 2021-09-17 Martin Sebor <msebor@redhat.com>
4585 PR middle-end/102200
4586 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
4587 (handle_min_max_size): Change argument. Store original SSA_NAME for
4588 operands to potentially distinct (sub)objects.
4589 (compute_objsize_r): Adjust call to the above.
4591 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
4593 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
4594 (rs6000_new_builtin_vectorized_function): New function.
4595 (rs6000_new_builtin_md_vectorized_function): Likewise.
4596 (rs6000_builtin_vectorized_function): Call
4597 rs6000_new_builtin_vectorized_function.
4598 (rs6000_builtin_md_vectorized_function): Call
4599 rs6000_new_builtin_md_vectorized_function.
4601 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
4603 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
4604 (ASSEMBLE_PAIR): Likewise.
4605 (BUILD_ACC): Likewise.
4606 (DISASSEMBLE_ACC): Likewise.
4607 (DISASSEMBLE_PAIR): Likewise.
4608 (PMXVBF16GER2): Likewise.
4609 (PMXVBF16GER2NN): Likewise.
4610 (PMXVBF16GER2NP): Likewise.
4611 (PMXVBF16GER2PN): Likewise.
4612 (PMXVBF16GER2PP): Likewise.
4613 (PMXVF16GER2): Likewise.
4614 (PMXVF16GER2NN): Likewise.
4615 (PMXVF16GER2NP): Likewise.
4616 (PMXVF16GER2PN): Likewise.
4617 (PMXVF16GER2PP): Likewise.
4618 (PMXVF32GER): Likewise.
4619 (PMXVF32GERNN): Likewise.
4620 (PMXVF32GERNP): Likewise.
4621 (PMXVF32GERPN): Likewise.
4622 (PMXVF32GERPP): Likewise.
4623 (PMXVF64GER): Likewise.
4624 (PMXVF64GERNN): Likewise.
4625 (PMXVF64GERNP): Likewise.
4626 (PMXVF64GERPN): Likewise.
4627 (PMXVF64GERPP): Likewise.
4628 (PMXVI16GER2): Likewise.
4629 (PMXVI16GER2PP): Likewise.
4630 (PMXVI16GER2S): Likewise.
4631 (PMXVI16GER2SPP): Likewise.
4632 (PMXVI4GER8): Likewise.
4633 (PMXVI4GER8PP): Likewise.
4634 (PMXVI8GER4): Likewise.
4635 (PMXVI8GER4PP): Likewise.
4636 (PMXVI8GER4SPP): Likewise.
4637 (XVBF16GER2): Likewise.
4638 (XVBF16GER2NN): Likewise.
4639 (XVBF16GER2NP): Likewise.
4640 (XVBF16GER2PN): Likewise.
4641 (XVBF16GER2PP): Likewise.
4642 (XVF16GER2): Likewise.
4643 (XVF16GER2NN): Likewise.
4644 (XVF16GER2NP): Likewise.
4645 (XVF16GER2PN): Likewise.
4646 (XVF16GER2PP): Likewise.
4647 (XVF32GER): Likewise.
4648 (XVF32GERNN): Likewise.
4649 (XVF32GERNP): Likewise.
4650 (XVF32GERPN): Likewise.
4651 (XVF32GERPP): Likewise.
4652 (XVF64GER): Likewise.
4653 (XVF64GERNN): Likewise.
4654 (XVF64GERNP): Likewise.
4655 (XVF64GERPN): Likewise.
4656 (XVF64GERPP): Likewise.
4657 (XVI16GER2): Likewise.
4658 (XVI16GER2PP): Likewise.
4659 (XVI16GER2S): Likewise.
4660 (XVI16GER2SPP): Likewise.
4661 (XVI4GER8): Likewise.
4662 (XVI4GER8PP): Likewise.
4663 (XVI8GER4): Likewise.
4664 (XVI8GER4PP): Likewise.
4665 (XVI8GER4SPP): Likewise.
4666 (XXMFACC): Likewise.
4667 (XXMTACC): Likewise.
4668 (XXSETACCZ): Likewise.
4669 (ASSEMBLE_PAIR_V): Likewise.
4670 (BUILD_PAIR): Likewise.
4671 (DISASSEMBLE_PAIR_V): Likewise.
4674 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
4675 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
4676 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
4677 (parse_bif_attrs): Handle ismmaint.
4678 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
4679 (write_bif_static_init): Handle ismmaint.
4681 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
4683 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
4685 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
4686 (rs6000_new_builtin_valid_without_lhs): New function.
4687 (rs6000_gimple_fold_new_mma_builtin): Likewise.
4688 (rs6000_gimple_fold_new_builtin): Likewise.
4690 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
4692 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
4693 Destruct stale Value objects.
4694 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
4697 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
4700 * match.pd (shift optimizations): Disable recent sign-changing
4701 optimization for shifts by zero, these will be folded later.
4703 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
4705 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
4706 [power9] to [always].
4708 2021-09-17 Richard Biener <rguenther@suse.de>
4710 * tree-vect-stmts.c (vectorizable_load): Do not frob
4713 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
4715 * config/i386/i386-features.c (remove_partial_avx_dependency):
4716 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
4717 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
4719 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
4721 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
4722 * config/i386/i386.md (SSE FP to FP splitters): Replace
4723 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
4724 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
4725 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
4726 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
4727 * config/i386/x86-tune.def
4728 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
4729 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
4731 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
4734 * config/i386/i386-features.c (remove_partial_avx_dependency):
4735 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
4736 before generating vxorps.
4738 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
4740 * config/i386/i386-options.c (processor_cost_table): Use
4741 tremont_cost for Tremont.
4742 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
4743 (tremont_memset): Likewise.
4744 (tremont_cost): Likewise.
4745 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
4748 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
4750 * common/config/i386/i386-common.c: Use Haswell scheduling model
4752 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
4754 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
4756 (ix86_adjust_cost): Handle Tremont.
4757 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
4759 (X86_TUNE_USE_LEAVE): Likewise.
4760 (X86_TUNE_PUSH_MEMORY): Likewise.
4761 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
4762 (X86_TUNE_USE_CLTD): Likewise.
4763 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
4764 (X86_TUNE_AVOID_MFENCE): Likewise.
4765 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
4766 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
4767 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
4768 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
4769 (X86_TUNE_OPT_AGU): Likewise.
4770 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
4771 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
4772 (X86_TUNE_EXPAND_ABS): Likewise.
4773 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
4774 (X86_TUNE_SLOW_PSHUFB): Likewise.
4776 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
4778 PR rtl-optimization/102306
4779 * combine.c (try_combine): Abort the combination if we are about to
4780 duplicate volatile references.
4782 2021-09-17 liuhongt <hongtao.liu@intel.com>
4784 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
4786 (_mm256_undefined_ph): Likewise.
4787 (_mm512_undefined_ph): Likewise.
4788 (_mm_cvtsh_h): Likewise.
4789 (_mm256_cvtsh_h): Likewise.
4790 (_mm512_cvtsh_h): Likewise.
4791 (_mm512_castph_ps): Likewise.
4792 (_mm512_castph_pd): Likewise.
4793 (_mm512_castph_si512): Likewise.
4794 (_mm512_castph512_ph128): Likewise.
4795 (_mm512_castph512_ph256): Likewise.
4796 (_mm512_castph128_ph512): Likewise.
4797 (_mm512_castph256_ph512): Likewise.
4798 (_mm512_zextph128_ph512): Likewise.
4799 (_mm512_zextph256_ph512): Likewise.
4800 (_mm512_castps_ph): Likewise.
4801 (_mm512_castpd_ph): Likewise.
4802 (_mm512_castsi512_ph): Likewise.
4803 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
4805 (_mm256_castph_ps): Likewise.
4806 (_mm_castph_pd): Likewise.
4807 (_mm256_castph_pd): Likewise.
4808 (_mm_castph_si128): Likewise.
4809 (_mm256_castph_si256): Likewise.
4810 (_mm_castps_ph): Likewise.
4811 (_mm256_castps_ph): Likewise.
4812 (_mm_castpd_ph): Likewise.
4813 (_mm256_castpd_ph): Likewise.
4814 (_mm_castsi128_ph): Likewise.
4815 (_mm256_castsi256_ph): Likewise.
4816 (_mm256_castph256_ph128): Likewise.
4817 (_mm256_castph128_ph256): Likewise.
4818 (_mm256_zextph128_ph256): Likewise.
4820 2021-09-17 liuhongt <hongtao.liu@intel.com>
4822 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
4824 (_mm_mask_cvtsh_ss): Likewise.
4825 (_mm_maskz_cvtsh_ss): Likewise.
4826 (_mm_cvtsh_sd): Likewise.
4827 (_mm_mask_cvtsh_sd): Likewise.
4828 (_mm_maskz_cvtsh_sd): Likewise.
4829 (_mm_cvt_roundsh_ss): Likewise.
4830 (_mm_mask_cvt_roundsh_ss): Likewise.
4831 (_mm_maskz_cvt_roundsh_ss): Likewise.
4832 (_mm_cvt_roundsh_sd): Likewise.
4833 (_mm_mask_cvt_roundsh_sd): Likewise.
4834 (_mm_maskz_cvt_roundsh_sd): Likewise.
4835 (_mm_cvtss_sh): Likewise.
4836 (_mm_mask_cvtss_sh): Likewise.
4837 (_mm_maskz_cvtss_sh): Likewise.
4838 (_mm_cvtsd_sh): Likewise.
4839 (_mm_mask_cvtsd_sh): Likewise.
4840 (_mm_maskz_cvtsd_sh): Likewise.
4841 (_mm_cvt_roundss_sh): Likewise.
4842 (_mm_mask_cvt_roundss_sh): Likewise.
4843 (_mm_maskz_cvt_roundss_sh): Likewise.
4844 (_mm_cvt_roundsd_sh): Likewise.
4845 (_mm_mask_cvt_roundsd_sh): Likewise.
4846 (_mm_maskz_cvt_roundsd_sh): Likewise.
4847 * config/i386/i386-builtin-types.def
4848 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
4849 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
4850 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
4851 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
4852 * config/i386/i386-builtin.def: Add corrresponding new builtins.
4853 * config/i386/i386-expand.c: Handle new builtin types.
4854 * config/i386/sse.md (VF48_128): New mode iterator.
4855 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
4857 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
4860 2021-09-17 liuhongt <hongtao.liu@intel.com>
4862 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
4864 (_mm512_mask_cvtph_pd): Likewise.
4865 (_mm512_maskz_cvtph_pd): Likewise.
4866 (_mm512_cvt_roundph_pd): Likewise.
4867 (_mm512_mask_cvt_roundph_pd): Likewise.
4868 (_mm512_maskz_cvt_roundph_pd): Likewise.
4869 (_mm512_cvtxph_ps): Likewise.
4870 (_mm512_mask_cvtxph_ps): Likewise.
4871 (_mm512_maskz_cvtxph_ps): Likewise.
4872 (_mm512_cvtx_roundph_ps): Likewise.
4873 (_mm512_mask_cvtx_roundph_ps): Likewise.
4874 (_mm512_maskz_cvtx_roundph_ps): Likewise.
4875 (_mm512_cvtxps_ph): Likewise.
4876 (_mm512_mask_cvtxps_ph): Likewise.
4877 (_mm512_maskz_cvtxps_ph): Likewise.
4878 (_mm512_cvtx_roundps_ph): Likewise.
4879 (_mm512_mask_cvtx_roundps_ph): Likewise.
4880 (_mm512_maskz_cvtx_roundps_ph): Likewise.
4881 (_mm512_cvtpd_ph): Likewise.
4882 (_mm512_mask_cvtpd_ph): Likewise.
4883 (_mm512_maskz_cvtpd_ph): Likewise.
4884 (_mm512_cvt_roundpd_ph): Likewise.
4885 (_mm512_mask_cvt_roundpd_ph): Likewise.
4886 (_mm512_maskz_cvt_roundpd_ph): Likewise.
4887 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
4889 (_mm_mask_cvtph_pd): Likewise.
4890 (_mm_maskz_cvtph_pd): Likewise.
4891 (_mm256_cvtph_pd): Likewise.
4892 (_mm256_mask_cvtph_pd): Likewise.
4893 (_mm256_maskz_cvtph_pd): Likewise.
4894 (_mm_cvtxph_ps): Likewise.
4895 (_mm_mask_cvtxph_ps): Likewise.
4896 (_mm_maskz_cvtxph_ps): Likewise.
4897 (_mm256_cvtxph_ps): Likewise.
4898 (_mm256_mask_cvtxph_ps): Likewise.
4899 (_mm256_maskz_cvtxph_ps): Likewise.
4900 (_mm_cvtxps_ph): Likewise.
4901 (_mm_mask_cvtxps_ph): Likewise.
4902 (_mm_maskz_cvtxps_ph): Likewise.
4903 (_mm256_cvtxps_ph): Likewise.
4904 (_mm256_mask_cvtxps_ph): Likewise.
4905 (_mm256_maskz_cvtxps_ph): Likewise.
4906 (_mm_cvtpd_ph): Likewise.
4907 (_mm_mask_cvtpd_ph): Likewise.
4908 (_mm_maskz_cvtpd_ph): Likewise.
4909 (_mm256_cvtpd_ph): Likewise.
4910 (_mm256_mask_cvtpd_ph): Likewise.
4911 (_mm256_maskz_cvtpd_ph): Likewise.
4912 * config/i386/i386-builtin.def: Add corresponding new builtins.
4913 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
4914 * config/i386/i386-expand.c: Handle new builtin types.
4915 * config/i386/sse.md
4916 (VF4_128_8_256): New.
4917 (VF48H_AVX512VL): Ditto.
4918 (ssePHmode): Add HF vector modes.
4919 (castmode): Add new convertable modes.
4922 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
4923 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
4924 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
4925 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
4926 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
4927 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
4928 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
4930 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
4931 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
4932 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
4933 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
4935 2021-09-17 liuhongt <hongtao.liu@intel.com>
4937 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
4939 (_mm_cvttsh_u32): Likewise.
4940 (_mm_cvtt_roundsh_i32): Likewise.
4941 (_mm_cvtt_roundsh_u32): Likewise.
4942 (_mm_cvttsh_i64): Likewise.
4943 (_mm_cvttsh_u64): Likewise.
4944 (_mm_cvtt_roundsh_i64): Likewise.
4945 (_mm_cvtt_roundsh_u64): Likewise.
4946 * config/i386/i386-builtin.def: Add corresponding new builtins.
4947 * config/i386/sse.md
4948 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
4951 2021-09-17 liuhongt <hongtao.liu@intel.com>
4953 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
4955 (_mm512_mask_cvttph_epi32): Likewise.
4956 (_mm512_maskz_cvttph_epi32): Likewise.
4957 (_mm512_cvtt_roundph_epi32): Likewise.
4958 (_mm512_mask_cvtt_roundph_epi32): Likewise.
4959 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
4960 (_mm512_cvttph_epu32): Likewise.
4961 (_mm512_mask_cvttph_epu32): Likewise.
4962 (_mm512_maskz_cvttph_epu32): Likewise.
4963 (_mm512_cvtt_roundph_epu32): Likewise.
4964 (_mm512_mask_cvtt_roundph_epu32): Likewise.
4965 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
4966 (_mm512_cvttph_epi64): Likewise.
4967 (_mm512_mask_cvttph_epi64): Likewise.
4968 (_mm512_maskz_cvttph_epi64): Likewise.
4969 (_mm512_cvtt_roundph_epi64): Likewise.
4970 (_mm512_mask_cvtt_roundph_epi64): Likewise.
4971 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
4972 (_mm512_cvttph_epu64): Likewise.
4973 (_mm512_mask_cvttph_epu64): Likewise.
4974 (_mm512_maskz_cvttph_epu64): Likewise.
4975 (_mm512_cvtt_roundph_epu64): Likewise.
4976 (_mm512_mask_cvtt_roundph_epu64): Likewise.
4977 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
4978 (_mm512_cvttph_epi16): Likewise.
4979 (_mm512_mask_cvttph_epi16): Likewise.
4980 (_mm512_maskz_cvttph_epi16): Likewise.
4981 (_mm512_cvtt_roundph_epi16): Likewise.
4982 (_mm512_mask_cvtt_roundph_epi16): Likewise.
4983 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
4984 (_mm512_cvttph_epu16): Likewise.
4985 (_mm512_mask_cvttph_epu16): Likewise.
4986 (_mm512_maskz_cvttph_epu16): Likewise.
4987 (_mm512_cvtt_roundph_epu16): Likewise.
4988 (_mm512_mask_cvtt_roundph_epu16): Likewise.
4989 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
4990 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
4992 (_mm_mask_cvttph_epi32): Likewise.
4993 (_mm_maskz_cvttph_epi32): Likewise.
4994 (_mm256_cvttph_epi32): Likewise.
4995 (_mm256_mask_cvttph_epi32): Likewise.
4996 (_mm256_maskz_cvttph_epi32): Likewise.
4997 (_mm_cvttph_epu32): Likewise.
4998 (_mm_mask_cvttph_epu32): Likewise.
4999 (_mm_maskz_cvttph_epu32): Likewise.
5000 (_mm256_cvttph_epu32): Likewise.
5001 (_mm256_mask_cvttph_epu32): Likewise.
5002 (_mm256_maskz_cvttph_epu32): Likewise.
5003 (_mm_cvttph_epi64): Likewise.
5004 (_mm_mask_cvttph_epi64): Likewise.
5005 (_mm_maskz_cvttph_epi64): Likewise.
5006 (_mm256_cvttph_epi64): Likewise.
5007 (_mm256_mask_cvttph_epi64): Likewise.
5008 (_mm256_maskz_cvttph_epi64): Likewise.
5009 (_mm_cvttph_epu64): Likewise.
5010 (_mm_mask_cvttph_epu64): Likewise.
5011 (_mm_maskz_cvttph_epu64): Likewise.
5012 (_mm256_cvttph_epu64): Likewise.
5013 (_mm256_mask_cvttph_epu64): Likewise.
5014 (_mm256_maskz_cvttph_epu64): Likewise.
5015 (_mm_cvttph_epi16): Likewise.
5016 (_mm_mask_cvttph_epi16): Likewise.
5017 (_mm_maskz_cvttph_epi16): Likewise.
5018 (_mm256_cvttph_epi16): Likewise.
5019 (_mm256_mask_cvttph_epi16): Likewise.
5020 (_mm256_maskz_cvttph_epi16): Likewise.
5021 (_mm_cvttph_epu16): Likewise.
5022 (_mm_mask_cvttph_epu16): Likewise.
5023 (_mm_maskz_cvttph_epu16): Likewise.
5024 (_mm256_cvttph_epu16): Likewise.
5025 (_mm256_mask_cvttph_epu16): Likewise.
5026 (_mm256_maskz_cvttph_epu16): Likewise.
5027 * config/i386/i386-builtin.def: Add new builtins.
5028 * config/i386/sse.md
5029 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
5031 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
5032 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
5033 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
5034 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
5036 2021-09-17 liuhongt <hongtao.liu@intel.com>
5038 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
5039 (_mm_cvtsh_u32): Likewise.
5040 (_mm_cvt_roundsh_i32): Likewise.
5041 (_mm_cvt_roundsh_u32): Likewise.
5042 (_mm_cvtsh_i64): Likewise.
5043 (_mm_cvtsh_u64): Likewise.
5044 (_mm_cvt_roundsh_i64): Likewise.
5045 (_mm_cvt_roundsh_u64): Likewise.
5046 (_mm_cvti32_sh): Likewise.
5047 (_mm_cvtu32_sh): Likewise.
5048 (_mm_cvt_roundi32_sh): Likewise.
5049 (_mm_cvt_roundu32_sh): Likewise.
5050 (_mm_cvti64_sh): Likewise.
5051 (_mm_cvtu64_sh): Likewise.
5052 (_mm_cvt_roundi64_sh): Likewise.
5053 (_mm_cvt_roundu64_sh): Likewise.
5054 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5055 * config/i386/i386-builtin.def: Add corresponding new builtins.
5056 * config/i386/i386-expand.c (ix86_expand_round_builtin):
5057 Handle new builtin types.
5058 * config/i386/sse.md
5059 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
5061 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
5062 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
5064 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
5066 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
5067 (altivec_resolve_new_overloaded_builtin): New forward decl.
5068 (rs6000_new_builtin_type_compatible): New function.
5069 (altivec_resolve_overloaded_builtin): Call
5070 altivec_resolve_new_overloaded_builtin.
5071 (altivec_build_new_resolved_builtin): New function.
5072 (altivec_resolve_new_overloaded_builtin): Likewise.
5073 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
5075 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
5076 name of rs6000_new_builtin_is_supported.
5078 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
5080 * config/i386/i386-protos.h (ix86_decompose_address):
5081 Change return type to bool.
5082 * config/i386/i386.c (ix86_decompose_address): Ditto.
5084 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
5087 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
5088 Added 'build/' to target, use build/%.o rule.
5089 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
5090 '$(build_exeext)' to target and 'build/' for the *.o files.
5091 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
5094 2021-09-16 Martin Jambor <mjambor@suse.cz>
5096 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
5097 the body has been removed. Remove trailing whitespace.
5099 2021-09-16 Richard Biener <rguenther@suse.de>
5101 PR middle-end/102360
5102 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
5103 of non-memory more robust.
5105 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5107 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
5108 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
5109 (leon5_adjust_cost): Increase cost of store with data dependency
5110 on ALU instruction and FPU anti-dependencies.
5111 (sparc_option_override): Add LEON5 costs
5112 (sparc_adjust_cost): Add LEON5 cost adjustments
5113 * config/sparc/sparc.h: Add LEON5
5114 * config/sparc/sparc.md: Include LEON5 scheduling information
5115 * config/sparc/sparc.opt: Add LEON5
5116 * doc/invoke.texi: Add LEON5
5117 * config/sparc/leon5.md: New file.
5119 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5121 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
5122 sensitive sequence for B2BST errata workaround.
5124 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5126 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
5127 functions with atomic instruction in the UT700 errata workaround.
5129 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5131 * config/sparc/sparc.c (next_active_non_empty_insn): New function
5132 that returns next active non empty assembly instruction.
5133 (sparc_do_work_around_errata): Use new function.
5135 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5137 * config/sparc/sparc.c (store_insn_p): Add predicate for store
5139 (load_insn_p): Add predicate for load attributes.
5140 (sparc_do_work_around_errata): Use new predicates.
5142 2021-09-16 Andreas Larsson <andreas@gaisler.com>
5144 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
5147 2021-09-16 Martin Liska <mliska@suse.cz>
5149 * config/mips/netbsd.h: Fix typo in name of a macro.
5151 2021-09-16 liuhongt <hongtao.liu@intel.com>
5153 PR middle-end/102080
5154 * match.pd: Check mask type when doing cond_op related gimple
5156 * tree.c (is_truth_type_for): New function.
5157 * tree.h (is_truth_type_for): New declaration.
5159 2021-09-16 liuhongt <hongtao.liu@intel.com>
5161 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
5163 (_mm512_mask_cvtepi32_ph): Likewise.
5164 (_mm512_maskz_cvtepi32_ph): Likewise.
5165 (_mm512_cvt_roundepi32_ph): Likewise.
5166 (_mm512_mask_cvt_roundepi32_ph): Likewise.
5167 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
5168 (_mm512_cvtepu32_ph): Likewise.
5169 (_mm512_mask_cvtepu32_ph): Likewise.
5170 (_mm512_maskz_cvtepu32_ph): Likewise.
5171 (_mm512_cvt_roundepu32_ph): Likewise.
5172 (_mm512_mask_cvt_roundepu32_ph): Likewise.
5173 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
5174 (_mm512_cvtepi64_ph): Likewise.
5175 (_mm512_mask_cvtepi64_ph): Likewise.
5176 (_mm512_maskz_cvtepi64_ph): Likewise.
5177 (_mm512_cvt_roundepi64_ph): Likewise.
5178 (_mm512_mask_cvt_roundepi64_ph): Likewise.
5179 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
5180 (_mm512_cvtepu64_ph): Likewise.
5181 (_mm512_mask_cvtepu64_ph): Likewise.
5182 (_mm512_maskz_cvtepu64_ph): Likewise.
5183 (_mm512_cvt_roundepu64_ph): Likewise.
5184 (_mm512_mask_cvt_roundepu64_ph): Likewise.
5185 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
5186 (_mm512_cvtepi16_ph): Likewise.
5187 (_mm512_mask_cvtepi16_ph): Likewise.
5188 (_mm512_maskz_cvtepi16_ph): Likewise.
5189 (_mm512_cvt_roundepi16_ph): Likewise.
5190 (_mm512_mask_cvt_roundepi16_ph): Likewise.
5191 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
5192 (_mm512_cvtepu16_ph): Likewise.
5193 (_mm512_mask_cvtepu16_ph): Likewise.
5194 (_mm512_maskz_cvtepu16_ph): Likewise.
5195 (_mm512_cvt_roundepu16_ph): Likewise.
5196 (_mm512_mask_cvt_roundepu16_ph): Likewise.
5197 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
5198 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
5200 (_mm_mask_cvtepi32_ph): Likewise.
5201 (_mm_maskz_cvtepi32_ph): Likewise.
5202 (_mm256_cvtepi32_ph): Likewise.
5203 (_mm256_mask_cvtepi32_ph): Likewise.
5204 (_mm256_maskz_cvtepi32_ph): Likewise.
5205 (_mm_cvtepu32_ph): Likewise.
5206 (_mm_mask_cvtepu32_ph): Likewise.
5207 (_mm_maskz_cvtepu32_ph): Likewise.
5208 (_mm256_cvtepu32_ph): Likewise.
5209 (_mm256_mask_cvtepu32_ph): Likewise.
5210 (_mm256_maskz_cvtepu32_ph): Likewise.
5211 (_mm_cvtepi64_ph): Likewise.
5212 (_mm_mask_cvtepi64_ph): Likewise.
5213 (_mm_maskz_cvtepi64_ph): Likewise.
5214 (_mm256_cvtepi64_ph): Likewise.
5215 (_mm256_mask_cvtepi64_ph): Likewise.
5216 (_mm256_maskz_cvtepi64_ph): Likewise.
5217 (_mm_cvtepu64_ph): Likewise.
5218 (_mm_mask_cvtepu64_ph): Likewise.
5219 (_mm_maskz_cvtepu64_ph): Likewise.
5220 (_mm256_cvtepu64_ph): Likewise.
5221 (_mm256_mask_cvtepu64_ph): Likewise.
5222 (_mm256_maskz_cvtepu64_ph): Likewise.
5223 (_mm_cvtepi16_ph): Likewise.
5224 (_mm_mask_cvtepi16_ph): Likewise.
5225 (_mm_maskz_cvtepi16_ph): Likewise.
5226 (_mm256_cvtepi16_ph): Likewise.
5227 (_mm256_mask_cvtepi16_ph): Likewise.
5228 (_mm256_maskz_cvtepi16_ph): Likewise.
5229 (_mm_cvtepu16_ph): Likewise.
5230 (_mm_mask_cvtepu16_ph): Likewise.
5231 (_mm_maskz_cvtepu16_ph): Likewise.
5232 (_mm256_cvtepu16_ph): Likewise.
5233 (_mm256_mask_cvtepu16_ph): Likewise.
5234 (_mm256_maskz_cvtepu16_ph): Likewise.
5235 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5236 * config/i386/i386-builtin.def: Add corresponding new builtins.
5237 * config/i386/i386-expand.c
5238 (ix86_expand_args_builtin): Handle new builtin types.
5239 (ix86_expand_round_builtin): Ditto.
5240 * config/i386/i386-modes.def: Declare V2HF and V6HF.
5241 * config/i386/sse.md (VI2H_AVX512VL): New.
5243 (sseintvecmode): Add HF vector modes.
5244 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
5246 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
5247 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
5248 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
5249 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
5250 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
5251 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
5252 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
5253 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
5254 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
5255 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
5256 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
5258 2021-09-16 liuhongt <hongtao.liu@intel.com>
5260 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
5262 (_mm512_mask_cvtph_epi32): Likewise.
5263 (_mm512_maskz_cvtph_epi32): Likewise.
5264 (_mm512_cvt_roundph_epi32): Likewise.
5265 (_mm512_mask_cvt_roundph_epi32): Likewise.
5266 (_mm512_maskz_cvt_roundph_epi32): Likewise.
5267 (_mm512_cvtph_epu32): Likewise.
5268 (_mm512_mask_cvtph_epu32): Likewise.
5269 (_mm512_maskz_cvtph_epu32): Likewise.
5270 (_mm512_cvt_roundph_epu32): Likewise.
5271 (_mm512_mask_cvt_roundph_epu32): Likewise.
5272 (_mm512_maskz_cvt_roundph_epu32): Likewise.
5273 (_mm512_cvtph_epi64): Likewise.
5274 (_mm512_mask_cvtph_epi64): Likewise.
5275 (_mm512_maskz_cvtph_epi64): Likewise.
5276 (_mm512_cvt_roundph_epi64): Likewise.
5277 (_mm512_mask_cvt_roundph_epi64): Likewise.
5278 (_mm512_maskz_cvt_roundph_epi64): Likewise.
5279 (_mm512_cvtph_epu64): Likewise.
5280 (_mm512_mask_cvtph_epu64): Likewise.
5281 (_mm512_maskz_cvtph_epu64): Likewise.
5282 (_mm512_cvt_roundph_epu64): Likewise.
5283 (_mm512_mask_cvt_roundph_epu64): Likewise.
5284 (_mm512_maskz_cvt_roundph_epu64): Likewise.
5285 (_mm512_cvtph_epi16): Likewise.
5286 (_mm512_mask_cvtph_epi16): Likewise.
5287 (_mm512_maskz_cvtph_epi16): Likewise.
5288 (_mm512_cvt_roundph_epi16): Likewise.
5289 (_mm512_mask_cvt_roundph_epi16): Likewise.
5290 (_mm512_maskz_cvt_roundph_epi16): Likewise.
5291 (_mm512_cvtph_epu16): Likewise.
5292 (_mm512_mask_cvtph_epu16): Likewise.
5293 (_mm512_maskz_cvtph_epu16): Likewise.
5294 (_mm512_cvt_roundph_epu16): Likewise.
5295 (_mm512_mask_cvt_roundph_epu16): Likewise.
5296 (_mm512_maskz_cvt_roundph_epu16): Likewise.
5297 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
5299 (_mm_mask_cvtph_epi32): Likewise.
5300 (_mm_maskz_cvtph_epi32): Likewise.
5301 (_mm256_cvtph_epi32): Likewise.
5302 (_mm256_mask_cvtph_epi32): Likewise.
5303 (_mm256_maskz_cvtph_epi32): Likewise.
5304 (_mm_cvtph_epu32): Likewise.
5305 (_mm_mask_cvtph_epu32): Likewise.
5306 (_mm_maskz_cvtph_epu32): Likewise.
5307 (_mm256_cvtph_epu32): Likewise.
5308 (_mm256_mask_cvtph_epu32): Likewise.
5309 (_mm256_maskz_cvtph_epu32): Likewise.
5310 (_mm_cvtph_epi64): Likewise.
5311 (_mm_mask_cvtph_epi64): Likewise.
5312 (_mm_maskz_cvtph_epi64): Likewise.
5313 (_mm256_cvtph_epi64): Likewise.
5314 (_mm256_mask_cvtph_epi64): Likewise.
5315 (_mm256_maskz_cvtph_epi64): Likewise.
5316 (_mm_cvtph_epu64): Likewise.
5317 (_mm_mask_cvtph_epu64): Likewise.
5318 (_mm_maskz_cvtph_epu64): Likewise.
5319 (_mm256_cvtph_epu64): Likewise.
5320 (_mm256_mask_cvtph_epu64): Likewise.
5321 (_mm256_maskz_cvtph_epu64): Likewise.
5322 (_mm_cvtph_epi16): Likewise.
5323 (_mm_mask_cvtph_epi16): Likewise.
5324 (_mm_maskz_cvtph_epi16): Likewise.
5325 (_mm256_cvtph_epi16): Likewise.
5326 (_mm256_mask_cvtph_epi16): Likewise.
5327 (_mm256_maskz_cvtph_epi16): Likewise.
5328 (_mm_cvtph_epu16): Likewise.
5329 (_mm_mask_cvtph_epu16): Likewise.
5330 (_mm_maskz_cvtph_epu16): Likewise.
5331 (_mm256_cvtph_epu16): Likewise.
5332 (_mm256_mask_cvtph_epu16): Likewise.
5333 (_mm256_maskz_cvtph_epu16): Likewise.
5334 * config/i386/i386-builtin-types.def: Add new builtin types.
5335 * config/i386/i386-builtin.def: Add new builtins.
5336 * config/i386/i386-expand.c
5337 (ix86_expand_args_builtin): Handle new builtin types.
5338 (ix86_expand_round_builtin): Ditto.
5339 * config/i386/sse.md (sseintconvert): New.
5341 (UNSPEC_US_FIX_NOTRUNC): Ditto.
5342 (sseintconvertsignprefix): Ditto.
5343 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
5346 2021-09-16 liuhongt <hongtao.liu@intel.com>
5348 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
5350 (_mm_cvtsi128_si16): Likewise.
5351 (_mm_mask_load_sh): Likewise.
5352 (_mm_maskz_load_sh): Likewise.
5353 (_mm_mask_store_sh): Likewise.
5354 (_mm_move_sh): Likewise.
5355 (_mm_mask_move_sh): Likewise.
5356 (_mm_maskz_move_sh): Likewise.
5357 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5358 * config/i386/i386-builtin.def: Add corresponding new builtins.
5359 * config/i386/i386-expand.c
5360 (ix86_expand_special_args_builtin): Handle new builtin types.
5361 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
5362 * config/i386/sse.md (VI2F): New mode iterator.
5363 (vec_set<mode>_0): Use new mode iterator.
5364 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
5365 (avx512f_store<mode>_mask): Ditto.
5367 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
5369 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
5371 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
5373 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
5374 Proceed if no symbol summary or the symbol alias flag is false.
5376 2021-09-15 Jakub Jelinek <jakub@redhat.com>
5380 * varasm.c (output_constructor_regular_field): Instead of assertion
5381 that array_size_for_constructor result is equal to size of
5382 TREE_TYPE (local->val) in bytes, assert that the type size is greater
5383 or equal to array_size_for_constructor result and use type size as
5386 2021-09-15 Martin Liska <mliska@suse.cz>
5389 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
5391 2021-09-15 Martin Liska <mliska@suse.cz>
5394 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
5395 Check that we have a symbol summary for a symbol.
5397 2021-09-15 Richard Biener <rguenther@suse.de>
5400 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
5401 to inherit from elfos.h
5403 2021-09-15 liuhongt <hongtao.liu@intel.com>
5406 * config/i386/i386-expand.c
5407 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
5409 (ix86_expand_vector_set): Use blendw instead of pinsrw.
5410 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
5411 AVX512FP16 which supports 16bit vector load.
5412 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
5414 (avx512bw_interleave_high<mode><mask_name>): .. this, and
5415 extend to V32HFmode.
5416 (avx2_interleave_highv16hi<mask_name>): Rename to ..
5417 (avx2_interleave_high<mode><mask_name>): .. this, and extend
5419 (vec_interleave_highv8hi<mask_name>): Rename to ..
5420 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
5421 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
5423 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
5424 this, and extend to V32HFmode.
5425 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
5426 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
5427 (vec_interleave_lowv8hi<mask_name>): Rename to ..
5428 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
5429 (sse4_1_pblendw): Rename to ..
5430 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
5431 (avx2_pblendph): New define_expand.
5432 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
5433 sseintmodesuffix instead of ssemodesuffix.
5434 (blendsuf): New mode attr.
5436 2021-09-15 Richard Biener <rguenther@suse.de>
5438 * tree-vectorizer.h (dr_misalignment): Move out of line.
5439 (dr_target_alignment): New.
5440 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
5441 (set_dr_target_alignment): New.
5442 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
5443 * tree-vect-data-refs.c (dr_misalignment): Compute and
5444 return the group members misalignment.
5445 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
5446 (vect_analyze_data_refs_alignment): Compute alignment only
5447 for the first element of a DR group.
5448 (vect_slp_analyze_node_alignment): Likewise.
5450 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
5452 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
5453 * config/i386/avx512fp16vlintrin.h: Likewise.
5454 * config/i386/i386-builtin.def: Adjust builtin name and
5455 enumeration to match AVX512F style.
5457 2021-09-15 Richard Biener <rguenther@suse.de>
5459 PR tree-optimization/102318
5460 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
5461 previous change and do the mode conversion separately from
5462 the sign conversion.
5464 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
5465 Peter Cordes <peter@cordes.ca>
5468 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
5469 (*vec_extract<mode><ssescalarmodelower>_valign): Output
5470 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
5473 2021-09-15 Richard Biener <rguenther@suse.de>
5475 * config.gcc: Remove vax-*-openbsd* configuration.
5477 2021-09-15 Richard Biener <rguenther@suse.de>
5479 * config.gcc: Remove m68k-openbsd.
5481 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
5484 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
5486 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
5488 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
5489 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
5490 (*mma_xxsetaccz): Delete.
5491 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
5492 Use UNSPECV_MMA_XXSETACCZ. Update comment.
5493 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
5495 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
5497 * Makefile.in: Remove variables related to applying no-PIE
5498 to the exes on $build.
5499 * configure: Regenerate.
5500 * configure.ac: Remove configuration related to applying
5501 no-PIE to the exes on $build.
5503 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
5505 * config/arc/arc.md (doloop_end): Add missing mode.
5506 (loop_end): Likewise.
5508 2021-09-14 Jakub Jelinek <jakub@redhat.com>
5510 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
5511 it to recursive calls, for depth above 7 just gimplify or return.
5512 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
5513 __builtin_clear_padding and TARGET_EXPR.
5514 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
5516 2021-09-14 liuhongt <hongtao.liu@intel.com>
5518 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
5520 (_mm_mask_fpclass_sh_mask): Likewise.
5521 (_mm512_mask_fpclass_ph_mask): Likewise.
5522 (_mm512_fpclass_ph_mask): Likewise.
5523 (_mm_getexp_sh): Likewise.
5524 (_mm_mask_getexp_sh): Likewise.
5525 (_mm_maskz_getexp_sh): Likewise.
5526 (_mm512_getexp_ph): Likewise.
5527 (_mm512_mask_getexp_ph): Likewise.
5528 (_mm512_maskz_getexp_ph): Likewise.
5529 (_mm_getexp_round_sh): Likewise.
5530 (_mm_mask_getexp_round_sh): Likewise.
5531 (_mm_maskz_getexp_round_sh): Likewise.
5532 (_mm512_getexp_round_ph): Likewise.
5533 (_mm512_mask_getexp_round_ph): Likewise.
5534 (_mm512_maskz_getexp_round_ph): Likewise.
5535 (_mm_getmant_sh): Likewise.
5536 (_mm_mask_getmant_sh): Likewise.
5537 (_mm_maskz_getmant_sh): Likewise.
5538 (_mm512_getmant_ph): Likewise.
5539 (_mm512_mask_getmant_ph): Likewise.
5540 (_mm512_maskz_getmant_ph): Likewise.
5541 (_mm_getmant_round_sh): Likewise.
5542 (_mm_mask_getmant_round_sh): Likewise.
5543 (_mm_maskz_getmant_round_sh): Likewise.
5544 (_mm512_getmant_round_ph): Likewise.
5545 (_mm512_mask_getmant_round_ph): Likewise.
5546 (_mm512_maskz_getmant_round_ph): Likewise.
5547 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
5549 (_mm_fpclass_ph_mask): Likewise.
5550 (_mm256_mask_fpclass_ph_mask): Likewise.
5551 (_mm256_fpclass_ph_mask): Likewise.
5552 (_mm256_getexp_ph): Likewise.
5553 (_mm256_mask_getexp_ph): Likewise.
5554 (_mm256_maskz_getexp_ph): Likewise.
5555 (_mm_getexp_ph): Likewise.
5556 (_mm_mask_getexp_ph): Likewise.
5557 (_mm_maskz_getexp_ph): Likewise.
5558 (_mm256_getmant_ph): Likewise.
5559 (_mm256_mask_getmant_ph): Likewise.
5560 (_mm256_maskz_getmant_ph): Likewise.
5561 (_mm_getmant_ph): Likewise.
5562 (_mm_mask_getmant_ph): Likewise.
5563 (_mm_maskz_getmant_ph): Likewise.
5564 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5565 * config/i386/i386-builtin.def: Add corresponding new builtins.
5566 * config/i386/i386-expand.c
5567 (ix86_expand_args_builtin): Handle new builtin types.
5568 (ix86_expand_round_builtin): Ditto.
5569 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
5570 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
5571 to support HF vector modes.
5572 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
5574 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
5575 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
5576 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
5577 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
5580 2021-09-14 liuhongt <hongtao.liu@intel.com>
5582 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
5584 (_mm512_mask_reduce_ph): Likewise.
5585 (_mm512_maskz_reduce_ph): Likewise.
5586 (_mm512_reduce_round_ph): Likewise.
5587 (_mm512_mask_reduce_round_ph): Likewise.
5588 (_mm512_maskz_reduce_round_ph): Likewise.
5589 (_mm_reduce_sh): Likewise.
5590 (_mm_mask_reduce_sh): Likewise.
5591 (_mm_maskz_reduce_sh): Likewise.
5592 (_mm_reduce_round_sh): Likewise.
5593 (_mm_mask_reduce_round_sh): Likewise.
5594 (_mm_maskz_reduce_round_sh): Likewise.
5595 (_mm512_roundscale_ph): Likewise.
5596 (_mm512_mask_roundscale_ph): Likewise.
5597 (_mm512_maskz_roundscale_ph): Likewise.
5598 (_mm512_roundscale_round_ph): Likewise.
5599 (_mm512_mask_roundscale_round_ph): Likewise.
5600 (_mm512_maskz_roundscale_round_ph): Likewise.
5601 (_mm_roundscale_sh): Likewise.
5602 (_mm_mask_roundscale_sh): Likewise.
5603 (_mm_maskz_roundscale_sh): Likewise.
5604 (_mm_roundscale_round_sh): Likewise.
5605 (_mm_mask_roundscale_round_sh): Likewise.
5606 (_mm_maskz_roundscale_round_sh): Likewise.
5607 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
5609 (_mm_mask_reduce_ph): Likewise.
5610 (_mm_maskz_reduce_ph): Likewise.
5611 (_mm256_reduce_ph): Likewise.
5612 (_mm256_mask_reduce_ph): Likewise.
5613 (_mm256_maskz_reduce_ph): Likewise.
5614 (_mm_roundscale_ph): Likewise.
5615 (_mm_mask_roundscale_ph): Likewise.
5616 (_mm_maskz_roundscale_ph): Likewise.
5617 (_mm256_roundscale_ph): Likewise.
5618 (_mm256_mask_roundscale_ph): Likewise.
5619 (_mm256_maskz_roundscale_ph): Likewise.
5620 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5621 * config/i386/i386-builtin.def: Add corresponding new builtins.
5622 * config/i386/i386-expand.c
5623 (ix86_expand_args_builtin): Handle new builtin types.
5624 (ix86_expand_round_builtin): Ditto.
5625 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
5627 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
5628 ... this, and adjust for round operands.
5629 (reduces<mode><mask_scalar_name>): Likewise, with ...
5630 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
5632 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
5633 Adjust for HF vector modes.
5634 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
5636 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
5638 2021-09-14 liuhongt <hongtao.liu@intel.com>
5640 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
5642 (_mm512_mask_rcp_ph): Likewise.
5643 (_mm512_maskz_rcp_ph): Likewise.
5644 (_mm_rcp_sh): Likewise.
5645 (_mm_mask_rcp_sh): Likewise.
5646 (_mm_maskz_rcp_sh): Likewise.
5647 (_mm512_scalef_ph): Likewise.
5648 (_mm512_mask_scalef_ph): Likewise.
5649 (_mm512_maskz_scalef_ph): Likewise.
5650 (_mm512_scalef_round_ph): Likewise.
5651 (_mm512_mask_scalef_round_ph): Likewise.
5652 (_mm512_maskz_scalef_round_ph): Likewise.
5653 (_mm_scalef_sh): Likewise.
5654 (_mm_mask_scalef_sh): Likewise.
5655 (_mm_maskz_scalef_sh): Likewise.
5656 (_mm_scalef_round_sh): Likewise.
5657 (_mm_mask_scalef_round_sh): Likewise.
5658 (_mm_maskz_scalef_round_sh): Likewise.
5659 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
5661 (_mm256_rcp_ph): Likewise.
5662 (_mm_mask_rcp_ph): Likewise.
5663 (_mm256_mask_rcp_ph): Likewise.
5664 (_mm_maskz_rcp_ph): Likewise.
5665 (_mm256_maskz_rcp_ph): Likewise.
5666 (_mm_scalef_ph): Likewise.
5667 (_mm256_scalef_ph): Likewise.
5668 (_mm_mask_scalef_ph): Likewise.
5669 (_mm256_mask_scalef_ph): Likewise.
5670 (_mm_maskz_scalef_ph): Likewise.
5671 (_mm256_maskz_scalef_ph): Likewise.
5672 * config/i386/i386-builtin.def: Add new builtins.
5673 * config/i386/sse.md (VFH_AVX512VL): New.
5674 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
5675 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
5676 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
5677 Adjust to support HF vector modes.
5678 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
5680 2021-09-14 liuhongt <hongtao.liu@intel.com>
5682 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
5684 (_mm512_mask_sqrt_ph): Likewise.
5685 (_mm512_maskz_sqrt_ph): Likewise.
5686 (_mm512_sqrt_round_ph): Likewise.
5687 (_mm512_mask_sqrt_round_ph): Likewise.
5688 (_mm512_maskz_sqrt_round_ph): Likewise.
5689 (_mm512_rsqrt_ph): Likewise.
5690 (_mm512_mask_rsqrt_ph): Likewise.
5691 (_mm512_maskz_rsqrt_ph): Likewise.
5692 (_mm_rsqrt_sh): Likewise.
5693 (_mm_mask_rsqrt_sh): Likewise.
5694 (_mm_maskz_rsqrt_sh): Likewise.
5695 (_mm_sqrt_sh): Likewise.
5696 (_mm_mask_sqrt_sh): Likewise.
5697 (_mm_maskz_sqrt_sh): Likewise.
5698 (_mm_sqrt_round_sh): Likewise.
5699 (_mm_mask_sqrt_round_sh): Likewise.
5700 (_mm_maskz_sqrt_round_sh): Likewise.
5701 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
5702 (_mm256_sqrt_ph): Likewise.
5703 (_mm_mask_sqrt_ph): Likewise.
5704 (_mm256_mask_sqrt_ph): Likewise.
5705 (_mm_maskz_sqrt_ph): Likewise.
5706 (_mm256_maskz_sqrt_ph): Likewise.
5707 (_mm_rsqrt_ph): Likewise.
5708 (_mm256_rsqrt_ph): Likewise.
5709 (_mm_mask_rsqrt_ph): Likewise.
5710 (_mm256_mask_rsqrt_ph): Likewise.
5711 (_mm_maskz_rsqrt_ph): Likewise.
5712 (_mm256_maskz_rsqrt_ph): Likewise.
5713 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5714 * config/i386/i386-builtin.def: Add corresponding new builtins.
5715 * config/i386/i386-expand.c
5716 (ix86_expand_args_builtin): Handle new builtins.
5717 (ix86_expand_round_builtin): Ditto.
5718 * config/i386/sse.md (VF_AVX512FP16VL): New.
5719 (sqrt<mode>2): Adjust for HF vector modes.
5720 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
5721 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
5723 (<sse>_rsqrt<mode>2<mask_name>): New.
5724 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
5726 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
5729 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
5730 'RESERVED_LOCATION_P' locations.
5731 * warning-control.cc (get_nowarn_spec, suppress_warning)
5732 (copy_warning): Likewise.
5734 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
5736 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
5737 (typedef key_type_t): ... this. Remove.
5738 (nowarn_map): Document.
5739 * diagnostic-spec.c (nowarn_map): Likewise.
5740 * warning-control.cc (convert_to_key): Evolve functions into...
5741 (get_location): ... these. Adjust all users.
5743 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
5745 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
5747 2021-09-13 Jason Merrill <jason@redhat.com>
5749 * params.opt: Add destructive-interference-size and
5750 constructive-interference-size.
5751 * doc/invoke.texi: Document them.
5752 * config/aarch64/aarch64.c (aarch64_override_options_internal):
5754 * config/arm/arm.c (arm_option_override): Set them.
5755 * config/i386/i386-options.c (ix86_option_override_internal):
5758 2021-09-13 Martin Liska <mliska@suse.cz>
5759 H.J. Lu <hjl.tools@gmail.com>
5762 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
5763 for x86-64 micro levels for __builtin_cpu_supports.
5764 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
5765 Add priorities for the micro-arch levels.
5766 (enum processor_features): Add new features.
5767 * common/config/i386/i386-isas.h: Add micro-arch features.
5768 * config/i386/i386-builtins.c (get_builtin_code_for_version):
5769 Support the micro-arch levels by callsing
5770 __builtin_cpu_supports.
5771 * doc/extend.texi: Document that the levels are support by
5772 __builtin_cpu_supports.
5774 2021-09-13 Andrew Pinski <apinski@marvell.com>
5777 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
5779 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
5780 (aarch64_general_gimple_fold_builtin): Likewise.
5782 2021-09-13 Andrew Pinski <apinski@marvell.com>
5784 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
5785 to the Unsupported targets list.
5786 Remove support for m32r-*-linux* and m32rle-*-linux*.
5787 * config/m32r/linux.h: Removed.
5788 * config/m32r/t-linux: Removed.
5790 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
5793 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
5794 register index for SVE predicate modes.
5796 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
5798 * tree-ssa-threadbackward.c
5799 (back_threader_profitability::profitable_path_p): Remove FSM
5801 (back_threader_registry::register_path): Same.
5802 * tree-ssa-threadedge.c
5803 (jump_threader::simplify_control_stmt_condition): Same.
5804 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
5805 Add backedge_threads argument.
5806 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
5807 backedge_threads argument.
5808 (back_jt_path_registry::back_jt_path_registry): Same.
5809 (dump_jump_thread_path): Adjust for FSM removal.
5810 (back_jt_path_registry::rewire_first_differing_edge): Same.
5811 (back_jt_path_registry::adjust_paths_after_duplication): Same.
5812 (back_jt_path_registry::update_cfg): Same.
5813 (jt_path_registry::register_jump_thread): Same.
5814 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
5816 (class back_jt_path_registry): Add backedge_threads to
5819 2021-09-13 Martin Liska <mliska@suse.cz>
5822 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
5824 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
5827 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
5828 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
5829 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
5830 TARGET_AVX256_STORE_BY_PIECES instead of
5831 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
5832 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
5833 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
5834 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
5835 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
5836 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
5838 2021-09-13 liuhongt <hongtao.liu@intel.com>
5841 * expmed.c (extract_bit_field_using_extv): Use
5842 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
5844 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
5846 * Makefile.in (OBJS): Add value-pointer-equiv.o.
5847 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
5848 value-pointer-equiv.*.
5849 (ssa_equiv_stack::ssa_equiv_stack): Same.
5850 (ssa_equiv_stack::enter): Same.
5851 (ssa_equiv_stack::leave): Same.
5852 (ssa_equiv_stack::push_replacement): Same.
5853 (ssa_equiv_stack::get_replacement): Same.
5854 (is_pointer_ssa): Same.
5855 (class pointer_equiv_analyzer): Same.
5856 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
5857 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
5858 (pointer_equiv_analyzer::set_global_equiv): Same.
5859 (pointer_equiv_analyzer::set_cond_equiv): Same.
5860 (pointer_equiv_analyzer::get_equiv): Same.
5861 (pointer_equiv_analyzer::enter): Same.
5862 (pointer_equiv_analyzer::leave): Same.
5863 (pointer_equiv_analyzer::get_equiv_expr): Same.
5864 (pta_valueize): Same.
5865 (pointer_equiv_analyzer::visit_stmt): Same.
5866 (pointer_equiv_analyzer::visit_edge): Same.
5867 (hybrid_folder::value_of_expr): Same.
5868 (hybrid_folder::value_on_edge): Same.
5869 * value-pointer-equiv.cc: New file.
5870 * value-pointer-equiv.h: New file.
5872 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
5875 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
5876 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
5878 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
5881 * config/arm/arm.md (movmisaligndi): New define_expand.
5882 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
5884 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
5887 * emit-rtl.c (gen_highpart): Use adjust_address to handle
5888 MEM rather than calling simplify_gen_subreg.
5890 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
5892 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
5893 define into a block.
5895 2021-09-13 Richard Biener <rguenther@suse.de>
5897 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
5898 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
5899 change based on DARWIN_PREFER_DWARF not being defined.
5901 2021-09-13 Richard Biener <rguenther@suse.de>
5903 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
5904 to inherit from elfos.h
5906 2021-09-13 Richard Biener <rguenther@suse.de>
5908 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
5910 2021-09-13 Richard Biener <rguenther@suse.de>
5912 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
5913 override, pick up DWARF2_DEBUG define from elfos.h
5915 2021-09-13 Richard Biener <rguenther@suse.de>
5917 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
5920 2021-09-13 Richard Biener <rguenther@suse.de>
5922 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
5925 2021-09-13 Richard Biener <rguenther@suse.de>
5927 * config/i386/cygming.h: Always default to DWARF2 debugging.
5928 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
5930 * doc/install.texi: Document binutils 2.16 as minimum
5931 requirement for mingw.
5933 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
5935 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
5936 nstmts, nloads and extra_ctor_cost.
5937 (rs6000_density_test): Add load density related heuristics. Do
5938 extra costing on vector construction statements if need.
5939 (rs6000_init_cost): Init new members.
5940 (rs6000_update_target_cost_per_stmt): New function.
5941 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
5942 rs6000_update_target_cost_per_stmt and call it.
5944 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
5946 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
5947 (rs6000_init_cost): Adjust.
5949 2021-09-13 liuhongt <hongtao.liu@intel.com>
5951 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
5952 (UNSPEC_XORSIGN): Ditto.
5954 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
5956 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
5957 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
5960 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
5962 * tree-ssa-threadbackward.c (class back_threader_registry): Use
5963 back_jt_path_registry.
5964 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
5965 fwd_jt_path_registry.
5966 * tree-ssa-threadedge.h (class jump_threader): Same..
5967 * tree-ssa-threadupdate.c
5968 (jump_thread_path_registry::jump_thread_path_registry): Rename...
5969 (jt_path_registry::jt_path_registry): ...to this.
5970 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
5971 (jt_path_registry::~jt_path_registry): ...this.
5972 (fwd_jt_path_registry::fwd_jt_path_registry): New.
5973 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
5974 (jump_thread_path_registry::allocate_thread_edge): Rename...
5975 (jt_path_registry::allocate_thread_edge): ...to this.
5976 (jump_thread_path_registry::allocate_thread_path): Rename...
5977 (jt_path_registry::allocate_thread_path): ...to this.
5978 (jump_thread_path_registry::lookup_redirection_data): Rename...
5979 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
5980 (jump_thread_path_registry::thread_block_1): Rename...
5981 (fwd_jt_path_registry::thread_block_1): ...to this.
5982 (jump_thread_path_registry::thread_block): Rename...
5983 (fwd_jt_path_registry::thread_block): ...to this.
5984 (jt_path_registry::thread_through_loop_header): Rename...
5985 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
5986 (jump_thread_path_registry::mark_threaded_blocks): Rename...
5987 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
5988 (jump_thread_path_registry::debug_path): Rename...
5989 (jt_path_registry::debug_path): ...to this.
5990 (jump_thread_path_registry::dump): Rename...
5991 (jt_path_registry::debug): ...to this.
5992 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
5993 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
5994 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
5995 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
5996 (jump_thread_path_registry::duplicate_thread_path): Rename...
5997 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
5998 drop ill-formed candidates.
5999 (jump_thread_path_registry::remove_jump_threads_including): Rename...
6000 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
6001 (jt_path_registry::thread_through_all_blocks): New.
6002 (back_jt_path_registry::update_cfg): New.
6003 (fwd_jt_path_registry::update_cfg): New.
6004 (jump_thread_path_registry::register_jump_thread): Rename...
6005 (jt_path_registry::register_jump_thread): ...to this.
6006 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
6008 (class jt_path_registry): ...here.
6009 (class fwd_jt_path_registry): New.
6010 (class back_jt_path_registry): New.
6012 2021-09-10 liuhongt <hongtao.liu@intel.com>
6015 2021-09-01 liuhongt <hongtao.liu@intel.com>
6017 * emit-rtl.c (validate_subreg): Get rid of all float-int
6020 2021-09-10 Jakub Jelinek <jakub@redhat.com>
6022 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
6023 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
6024 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
6025 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
6026 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
6027 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
6028 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
6029 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
6031 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
6032 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
6034 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
6036 * tree.h (OMP_ATOMIC_WEAK): Define.
6037 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
6038 fail memory order being encoded in the same enum and also print
6039 fail clause if present.
6040 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
6041 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
6042 handle pre_p == NULL case as a test mode that only returns value
6043 but doesn't change gimplify nor change anything otherwise, adjust
6044 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
6045 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
6046 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
6048 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
6049 COND_EXPR rhs. Set weak flag on gimple load/store for
6051 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
6052 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
6054 (expand_omp_atomic_cas): New function.
6055 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
6057 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
6058 using expand_omp_atomic_cas.
6060 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
6061 Michael Matz <matz@suse.de>
6063 * tree-pass.h (PROP_loop_opts_done): New.
6064 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
6065 Intersect with global range.
6066 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
6067 * tree-ssa-threadbackward.c
6068 (back_threader_profitability::profitable_path_p): Disable
6069 threading through latches until after loop optimizations have run.
6071 2021-09-10 David Faust <david.faust@oracle.com>
6073 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
6076 2021-09-10 David Faust <david.faust@oracle.com>
6078 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
6079 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
6080 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
6081 (bpf_expand_cbranch): New function.
6082 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
6084 (zero_extendsidi2): Only use mov32 instruction if it is available.
6085 (SIM mode iterator): Conditionalize support for SI mode.
6086 (JM mode iterator): New.
6087 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
6088 (*branch_on_di): Update name, use new JM iterator.
6089 * config/bpf/bpf.opt: (mjmpext): New option.
6093 (bpf_isa): New enum.
6095 2021-09-10 David Faust <david.faust@oracle.com>
6097 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
6098 for register-to-register extensions.
6099 (zero_extendqidi2): Likewise.
6101 2021-09-10 Richard Biener <rguenther@suse.de>
6103 PR middle-end/102273
6104 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
6106 2021-09-10 Richard Biener <rguenther@suse.de>
6108 PR middle-end/102269
6109 * gimplify.c (is_var_need_auto_init): Empty types do not need
6112 2021-09-10 Richard Biener <rguenther@suse.de>
6114 * configure.ac (--with-stabs): Remove.
6115 * configure: Regenerate.
6116 * doc/install.texi: Remove --with-stabs documentation.
6118 2021-09-10 liuhongt <hongtao.liu@intel.com>
6120 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
6122 (_mm512_mask_cmp_ph_mask): Likewise.
6123 (_mm512_cmp_round_ph_mask): Likewise.
6124 (_mm512_mask_cmp_round_ph_mask): Likewise.
6125 (_mm_cmp_sh_mask): Likewise.
6126 (_mm_mask_cmp_sh_mask): Likewise.
6127 (_mm_cmp_round_sh_mask): Likewise.
6128 (_mm_mask_cmp_round_sh_mask): Likewise.
6129 (_mm_comieq_sh): Likewise.
6130 (_mm_comilt_sh): Likewise.
6131 (_mm_comile_sh): Likewise.
6132 (_mm_comigt_sh): Likewise.
6133 (_mm_comige_sh): Likewise.
6134 (_mm_comineq_sh): Likewise.
6135 (_mm_ucomieq_sh): Likewise.
6136 (_mm_ucomilt_sh): Likewise.
6137 (_mm_ucomile_sh): Likewise.
6138 (_mm_ucomigt_sh): Likewise.
6139 (_mm_ucomige_sh): Likewise.
6140 (_mm_ucomineq_sh): Likewise.
6141 (_mm_comi_round_sh): Likewise.
6142 (_mm_comi_sh): Likewise.
6143 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
6144 (_mm_mask_cmp_ph_mask): Likewise.
6145 (_mm256_cmp_ph_mask): Likewise.
6146 (_mm256_mask_cmp_ph_mask): Likewise.
6147 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
6148 * config/i386/i386-builtin.def: Add corresponding new builtins.
6149 * config/i386/i386-expand.c
6150 (ix86_expand_args_builtin): Handle new builtin types.
6151 (ix86_expand_round_builtin): Ditto.
6152 * config/i386/i386.md (ssevecmode): Add HF mode.
6153 (MODEFH): New mode iterator.
6154 * config/i386/sse.md
6155 (V48H_AVX512VL): New mode iterator to support HF vector modes.
6156 Ajdust corresponding description.
6157 (ssecmpintprefix): New.
6158 (VI12_AVX512VL): Adjust to support HF vector modes.
6159 (cmp_imm_predicate): Likewise.
6160 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
6162 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
6163 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
6164 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
6166 2021-09-10 liuhongt <hongtao.liu@intel.com>
6168 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
6169 (_mm512_mask_max_ph): Likewise.
6170 (_mm512_maskz_max_ph): Likewise.
6171 (_mm512_min_ph): Likewise.
6172 (_mm512_mask_min_ph): Likewise.
6173 (_mm512_maskz_min_ph): Likewise.
6174 (_mm512_max_round_ph): Likewise.
6175 (_mm512_mask_max_round_ph): Likewise.
6176 (_mm512_maskz_max_round_ph): Likewise.
6177 (_mm512_min_round_ph): Likewise.
6178 (_mm512_mask_min_round_ph): Likewise.
6179 (_mm512_maskz_min_round_ph): Likewise.
6180 (_mm_max_sh): Likewise.
6181 (_mm_mask_max_sh): Likewise.
6182 (_mm_maskz_max_sh): Likewise.
6183 (_mm_min_sh): Likewise.
6184 (_mm_mask_min_sh): Likewise.
6185 (_mm_maskz_min_sh): Likewise.
6186 (_mm_max_round_sh): Likewise.
6187 (_mm_mask_max_round_sh): Likewise.
6188 (_mm_maskz_max_round_sh): Likewise.
6189 (_mm_min_round_sh): Likewise.
6190 (_mm_mask_min_round_sh): Likewise.
6191 (_mm_maskz_min_round_sh): Likewise.
6192 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
6193 (_mm256_max_ph): Likewise.
6194 (_mm_mask_max_ph): Likewise.
6195 (_mm256_mask_max_ph): Likewise.
6196 (_mm_maskz_max_ph): Likewise.
6197 (_mm256_maskz_max_ph): Likewise.
6198 (_mm_min_ph): Likewise.
6199 (_mm256_min_ph): Likewise.
6200 (_mm_mask_min_ph): Likewise.
6201 (_mm256_mask_min_ph): Likewise.
6202 (_mm_maskz_min_ph): Likewise.
6203 (_mm256_maskz_min_ph): Likewise.
6204 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
6205 * config/i386/i386-builtin.def: Add corresponding new builtins.
6206 * config/i386/i386-expand.c
6207 (ix86_expand_args_builtin): Handle new builtin types.
6208 * config/i386/sse.md
6209 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
6210 support HF vector modes.
6211 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
6212 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
6214 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
6216 * config/i386/subst.md (round_saeonly_mode512bit_condition):
6217 Adjust for HF vector modes.
6219 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
6221 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
6222 (_mm_mask_add_sh): Likewise.
6223 (_mm_maskz_add_sh): Likewise.
6224 (_mm_sub_sh): Likewise.
6225 (_mm_mask_sub_sh): Likewise.
6226 (_mm_maskz_sub_sh): Likewise.
6227 (_mm_mul_sh): Likewise.
6228 (_mm_mask_mul_sh): Likewise.
6229 (_mm_maskz_mul_sh): Likewise.
6230 (_mm_div_sh): Likewise.
6231 (_mm_mask_div_sh): Likewise.
6232 (_mm_maskz_div_sh): Likewise.
6233 (_mm_add_round_sh): Likewise.
6234 (_mm_mask_add_round_sh): Likewise.
6235 (_mm_maskz_add_round_sh): Likewise.
6236 (_mm_sub_round_sh): Likewise.
6237 (_mm_mask_sub_round_sh): Likewise.
6238 (_mm_maskz_sub_round_sh): Likewise.
6239 (_mm_mul_round_sh): Likewise.
6240 (_mm_mask_mul_round_sh): Likewise.
6241 (_mm_maskz_mul_round_sh): Likewise.
6242 (_mm_div_round_sh): Likewise.
6243 (_mm_mask_div_round_sh): Likewise.
6244 (_mm_maskz_div_round_sh): Likewise.
6245 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
6246 * config/i386/i386-builtin.def: Add corresponding new builtins.
6247 * config/i386/i386-expand.c
6248 (ix86_expand_round_builtin): Handle new builtins.
6249 * config/i386/sse.md (VF_128): Change description.
6250 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
6251 Adjust to support HF vector modes.
6252 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
6255 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
6257 * config/i386/i386-expand.c
6258 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
6259 * config/i386/i386.c
6260 (ix86_preferred_simd_mode): Handle HF mode.
6261 * config/i386/sse.md (V_256H): New mode iterator.
6262 (avx_vextractf128<mode>): Use it.
6263 (VEC_INIT_MODE): Align vector HFmode condition to vector
6264 HImodes since there're no real HF instruction used.
6265 (VEC_INIT_HALF_MODE): Ditto.
6267 (VIHF_AVX512BW): Ditto.
6268 (*vec_extracthf): Ditto.
6269 (VEC_EXTRACT_MODE): Ditto.
6271 2021-09-10 Richard Biener <rguenther@suse.de>
6274 * config/dbx.h: Remove.
6275 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
6276 * config/lynx.h: Likewise.
6278 2021-09-10 liuhongt <hongtao.liu@intel.com>
6280 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
6281 right into ANDNOT + AND + IOR, using paradoxical subregs.
6282 (ix86_split_copysign_const): Remove.
6283 (ix86_split_copysign_var): Ditto.
6284 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
6285 (ix86_split_copysign_var): Ditto.
6286 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
6287 (@copysign<mode>3_var): Ditto.
6289 2021-09-09 qing zhao <qing.zhao@oracle.com>
6291 * builtins.c (expand_builtin_memset): Make external visible.
6292 * builtins.h (expand_builtin_memset): Declare extern.
6293 * common.opt (ftrivial-auto-var-init=): New option.
6294 * doc/extend.texi: Document the uninitialized attribute.
6295 * doc/invoke.texi: Document -ftrivial-auto-var-init.
6296 * flag-types.h (enum auto_init_type): New enumerated type
6298 * gimple-fold.c (clear_padding_type): Add one new parameter.
6299 (clear_padding_union): Likewise.
6300 (clear_padding_emit_loop): Likewise.
6301 (clear_type_padding_in_mask): Likewise.
6302 (gimple_fold_builtin_clear_padding): Handle this new parameter.
6303 * gimplify.c (gimple_add_init_for_auto_var): New function.
6304 (gimple_add_padding_init_for_auto_var): New function.
6305 (is_var_need_auto_init): New function.
6306 (gimplify_decl_expr): Add initialization to automatic variables per
6308 (gimplify_call_expr): Add one new parameter for call to
6309 __builtin_clear_padding.
6310 (gimplify_init_constructor): Add padding initialization in the end.
6311 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
6312 (expand_DEFERRED_INIT): New function.
6313 * internal-fn.def (DEFERRED_INIT): New internal function.
6314 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
6315 * tree-sra.c (generate_subtree_deferred_init): New function.
6316 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
6317 calls to .DEFERRED_INIT.
6318 (sra_modify_deferred_init): New function.
6319 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
6320 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
6321 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
6323 (check_defs): Likewise.
6324 (warn_uninitialized_vars): Likewise.
6325 * tree-ssa.c (ssa_undefined_value_p): Likewise.
6326 * tree.c (build_common_builtin_nodes): Build tree node for
6327 BUILT_IN_CLEAR_PADDING when needed.
6329 2021-09-09 Richard Biener <rguenther@suse.de>
6331 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
6334 2021-09-09 Richard Biener <rguenther@suse.de>
6336 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
6337 DOM walk from get_loop_body_in_dom_order using a worklist
6340 2021-09-09 liuhongt <hongtao.liu@intel.com>
6342 * config.gcc: Add avx512fp16vlintrin.h.
6343 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
6344 (_mm512_mask_add_ph): Likewise.
6345 (_mm512_maskz_add_ph): Likewise.
6346 (_mm512_sub_ph): Likewise.
6347 (_mm512_mask_sub_ph): Likewise.
6348 (_mm512_maskz_sub_ph): Likewise.
6349 (_mm512_mul_ph): Likewise.
6350 (_mm512_mask_mul_ph): Likewise.
6351 (_mm512_maskz_mul_ph): Likewise.
6352 (_mm512_div_ph): Likewise.
6353 (_mm512_mask_div_ph): Likewise.
6354 (_mm512_maskz_div_ph): Likewise.
6355 (_mm512_add_round_ph): Likewise.
6356 (_mm512_mask_add_round_ph): Likewise.
6357 (_mm512_maskz_add_round_ph): Likewise.
6358 (_mm512_sub_round_ph): Likewise.
6359 (_mm512_mask_sub_round_ph): Likewise.
6360 (_mm512_maskz_sub_round_ph): Likewise.
6361 (_mm512_mul_round_ph): Likewise.
6362 (_mm512_mask_mul_round_ph): Likewise.
6363 (_mm512_maskz_mul_round_ph): Likewise.
6364 (_mm512_div_round_ph): Likewise.
6365 (_mm512_mask_div_round_ph): Likewise.
6366 (_mm512_maskz_div_round_ph): Likewise.
6367 * config/i386/avx512fp16vlintrin.h: New header.
6368 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
6369 Add new builtin types.
6370 * config/i386/i386-builtin.def: Add corresponding builtins.
6371 * config/i386/i386-expand.c
6372 (ix86_expand_args_builtin): Handle new builtin types.
6373 (ix86_expand_round_builtin): Likewise.
6374 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
6375 * config/i386/sse.md (VFH): New mode_iterator.
6377 (avx512fmaskmode): Add HF vector modes.
6378 (avx512fmaskhalfmode): Likewise.
6379 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
6381 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
6382 (mul<mode>3<mask_name><round_name>): Likewise.
6383 (*mul<mode>3<mask_name><round_name>): Likewise.
6384 (div<mode>3): Likewise.
6385 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
6386 * config/i386/subst.md (SUBST_V): Add HF vector modes.
6387 (SUBST_A): Likewise.
6388 (round_mode512bit_condition): Adjust for V32HFmode.
6390 2021-09-09 liuhongt <hongtao.liu@intel.com>
6393 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
6394 (reduc_plus_scal_v4sf): .. this, New define_expand.
6395 (reduc_plus_scal_v2df): .. and this, New define_expand.
6397 2021-09-09 liuhongt <hongtao.liu@intel.com>
6400 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
6403 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
6406 * doc/trouble.texi (Copy Assignment): Fix description of
6407 behaviour and fix code in example.
6409 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
6412 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
6413 r11 instead of r12 for restoring CR.
6415 2021-09-08 Jakub Jelinek <jakub@redhat.com>
6416 liuhongt <hongtao.liu@intel.com>
6419 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
6420 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
6421 into AND with mask and XOR, using paradoxical subregs.
6422 (ix86_split_xorsign): Remove.
6423 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
6425 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
6427 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
6429 2021-09-08 Jakub Jelinek <jakub@redhat.com>
6432 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
6433 operands[2], emit abs<mode>2 instead.
6434 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
6435 first alternative even for avx, add another alternative with
6436 =&Yv <- 0, Yv, Yvm constraints.
6437 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
6438 to op1, emit vpandn instead.
6440 2021-09-08 liuhongt <hongtao.liu@intel.com>
6442 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
6443 (_mm256_set_ph): Likewise.
6444 (_mm512_set_ph): Likewise.
6445 (_mm_setr_ph): Likewise.
6446 (_mm256_setr_ph): Likewise.
6447 (_mm512_setr_ph): Likewise.
6448 (_mm_set1_ph): Likewise.
6449 (_mm256_set1_ph): Likewise.
6450 (_mm512_set1_ph): Likewise.
6451 (_mm_setzero_ph): Likewise.
6452 (_mm256_setzero_ph): Likewise.
6453 (_mm512_setzero_ph): Likewise.
6454 (_mm_set_sh): Likewise.
6455 (_mm_load_sh): Likewise.
6456 (_mm_store_sh): Likewise.
6457 * config/i386/i386-builtin-types.def (V8HF): New type.
6458 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
6459 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
6460 Support vector HFmodes.
6461 (ix86_expand_vector_init_one_nonzero): Likewise.
6462 (ix86_expand_vector_init_one_var): Likewise.
6463 (ix86_expand_vector_init_interleave): Likewise.
6464 (ix86_expand_vector_init_general): Likewise.
6465 (ix86_expand_vector_set): Likewise.
6466 (ix86_expand_vector_extract): Likewise.
6467 (ix86_expand_vector_init_concat): Likewise.
6468 (ix86_expand_sse_movcc): Handle vector HFmodes.
6469 (ix86_expand_vector_set_var): Ditto.
6470 * config/i386/i386-modes.def: Add HF vector modes in comment.
6471 * config/i386/i386.c (classify_argument): Add HF vector modes.
6472 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
6473 (ix86_vector_mode_supported_p): Likewise.
6474 (ix86_set_reg_reg_cost): Handle vector HFmode.
6475 (ix86_get_ssemov): Handle vector HFmode.
6476 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
6478 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
6480 (function_arg_advance_32): Ditto.
6481 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
6482 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
6483 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
6484 (VALID_SSE2_REG_VHF_MODE): New.
6485 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
6486 (SSE_REG_MODE_P): Add vector HFmode.
6487 * config/i386/i386.md (mode): Add HF vector modes.
6488 (MODE_SIZE): Likewise.
6489 (ssemodesuffix): Add ph suffix for HF vector modes.
6490 * config/i386/sse.md (VFH_128): New mode iterator.
6491 (VMOVE): Adjust for HF vector modes.
6493 (V_256_512): Likewise.
6495 (avx512fmaskmode): Likewise.
6496 (shuffletype): Likewise.
6497 (sseinsnmode): Likewise.
6498 (ssedoublevecmode): Likewise.
6499 (ssehalfvecmode): Likewise.
6500 (ssehalfvecmodelower): Likewise.
6501 (ssePScmode): Likewise.
6502 (ssescalarmode): Likewise.
6503 (ssescalarmodelower): Likewise.
6504 (sseintprefix): Likewise.
6506 (bcstscalarsuff): Likewise.
6507 (xtg_mode): Likewise.
6508 (VI12HF_AVX512VL): New mode_iterator.
6509 (VF_AVX512FP16): Likewise.
6511 (VIHF_256): Likewise.
6512 (VIHF_AVX512BW): Likewise.
6513 (V16_256): Likewise.
6514 (V32_512): Likewise.
6515 (sseintmodesuffix): New mode_attr.
6516 (sse): Add scalar and vector HFmodes.
6517 (ssescalarmode): Add vector HFmode mapping.
6518 (ssescalarmodesuffix): Add sh suffix for HFmode.
6519 (*<sse>_vm<insn><mode>3): Use VFH_128.
6520 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
6521 (*ieee_<ieee_maxmin><mode>3): Likewise.
6522 (<avx512>_blendm<mode>): New define_insn.
6523 (vec_setv8hf): New define_expand.
6524 (vec_set<mode>_0): New define_insn for HF vector set.
6525 (*avx512fp16_movsh): Likewise.
6526 (avx512fp16_movsh): Likewise.
6527 (vec_extract_lo_v32hi): Rename to ...
6528 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
6530 (vec_extract_hi_v32hi): Likewise.
6531 (vec_extract_hi_<mode>): Likewise.
6532 (vec_extract_lo_v16hi): Likewise.
6533 (vec_extract_lo_<mode>): Likewise.
6534 (vec_extract_hi_v16hi): Likewise.
6535 (vec_extract_hi_<mode>): Likewise.
6536 (vec_set_hi_v16hi): Likewise.
6537 (vec_set_hi_<mode>): Likewise.
6538 (vec_set_lo_v16hi): Likewise.
6539 (vec_set_lo_<mode>): Likewise.
6540 (*vec_extract<mode>_0): New define_insn_and_split for HF
6542 (*vec_extracthf): New define_insn.
6543 (VEC_EXTRACT_MODE): Add HF vector modes.
6544 (PINSR_MODE): Add V8HF.
6545 (sse2p4_1): Likewise.
6546 (pinsr_evex_isa): Likewise.
6547 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
6548 insert for V8HFmode.
6549 (pbroadcast_evex_isa): Add HF vector modes.
6550 (AVX2_VEC_DUP_MODE): Likewise.
6551 (VEC_INIT_MODE): Likewise.
6552 (VEC_INIT_HALF_MODE): Likewise.
6553 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
6555 (avx2_pbroadcast<mode>_1): Likewise.
6556 (<avx512>_vec_dup<mode>_1): Likewise.
6557 (<avx512>_vec_dup<mode><mask_name>): Likewise.
6558 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
6561 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
6562 H.J. Lu <hongjiu.lu@intel.com>
6563 Liu Hongtao <hongtao.liu@intel.com>
6564 Wang Hongyu <hongyu.wang@intel.com>
6565 Xu Dianhong <dianhong.xu@intel.com>
6567 * common/config/i386/cpuinfo.h (get_available_features):
6568 Detect FEATURE_AVX512FP16.
6569 * common/config/i386/i386-common.c
6570 (OPTION_MASK_ISA_AVX512FP16_SET,
6571 OPTION_MASK_ISA_AVX512FP16_UNSET,
6572 OPTION_MASK_ISA2_AVX512FP16_SET,
6573 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
6574 (OPTION_MASK_ISA2_AVX512BW_UNSET,
6575 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
6576 (ix86_handle_option): Handle -mavx512fp16.
6577 * common/config/i386/i386-cpuinfo.h (enum processor_features):
6578 Add FEATURE_AVX512FP16.
6579 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
6580 * config.gcc: Add avx512fp16intrin.h.
6581 * config/i386/avx512fp16intrin.h: New intrinsic header.
6582 * config/i386/cpuid.h: Add bit_AVX512FP16.
6583 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
6584 * config/i386/i386-builtins.c: Support _Float16 type for i386
6586 (ix86_register_float16_builtin_type): New function.
6587 (ix86_float16_type_node): New.
6588 * config/i386/i386-c.c (ix86_target_macros_internal): Define
6590 * config/i386/i386-expand.c (ix86_expand_branch): Support
6592 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
6593 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
6594 (ix86_expand_fp_movcc): Ditto.
6595 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
6596 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
6597 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
6598 * config/i386/i386.c (ix86_get_ssemov): Use
6599 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
6600 (ix86_get_excess_precision): Use
6601 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
6603 (sse_store_index): Use SFmode cost for HFmode cost.
6604 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
6605 GPR cost for HFmode.
6606 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
6607 (ix86_mangle_type): Add manlging for _Float16 type.
6608 (inline_secondary_memory_needed): No memory is needed for
6609 16bit movement between gpr and sse reg under
6611 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
6612 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
6613 (ix86_division_cost): Ditto.
6614 (ix86_rtx_costs): Ditto.
6615 (ix86_add_stmt_cost): Ditto.
6616 (ix86_optab_supported_p): Ditto.
6617 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
6618 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
6619 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
6620 * config/i386/i386.md (mode): Add HFmode.
6621 (MODE_SIZE): Add HFmode.
6622 (isa): Add avx512fp16.
6623 (enabled): Handle avx512fp16.
6624 (ssemodesuffix): Add sh suffix for HFmode.
6625 (comm): Add mult, div.
6626 (plusminusmultdiv): New code iterator.
6627 (insn): Add mult, div.
6628 (*movhf_internal): Adjust for avx512fp16 instruction.
6629 (*movhi_internal): Ditto.
6630 (*cmpi<unord>hf): New define_insn for HFmode.
6631 (*ieee_s<ieee_maxmin>hf3): Likewise.
6632 (extendhf<mode>2): Likewise.
6633 (trunc<mode>hf2): Likewise.
6634 (float<floatunssuffix><mode>hf2): Likewise.
6635 (*<insn>hf): Likewise.
6636 (cbranchhf4): New expander.
6637 (movhfcc): Likewise.
6638 (<insn>hf3): Likewise.
6641 * config/i386/i386.opt: Add mavx512fp16.
6642 * config/i386/immintrin.h: Include avx512fp16intrin.h.
6643 * doc/invoke.texi: Add mavx512fp16.
6644 * doc/extend.texi: Add avx512fp16 Usage Notes.
6646 2021-09-08 liuhongt <hongtao.liu@intel.com>
6648 * common.opt: Support -fexcess-precision=16.
6649 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
6650 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
6651 EXCESS_PRECISION_TYPE_FLOAT16.
6652 * config/arm/arm.c (arm_excess_precision): Ditto.
6653 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
6654 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
6655 when EXCESS_PRECISION_TYPE_FLOAT16.
6656 * config/s390/s390.c (s390_excess_precision): Ditto.
6657 * coretypes.h (enum excess_precision_type): Add
6658 EXCESS_PRECISION_TYPE_FLOAT16.
6659 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
6660 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
6661 * doc/extend.texi (Half-Precision): Document
6662 -fexcess-precision=16.
6663 * flag-types.h (enum excess_precision): Add
6664 EXCESS_PRECISION_FLOAT16.
6665 * target.def (excess_precision): Update document.
6666 * tree.c (excess_precision_type): Set excess_precision_type to
6667 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
6669 2021-09-08 liuhongt <hongtao.liu@intel.com>
6671 * doc/extend.texi: (@node Floating Types): Adjust the wording.
6672 (@node Half-Precision): Ditto.
6674 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
6677 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
6678 'CONST_INT_P (src)' to the condition of the block that tries to
6679 eliminate literal when loading integer contant.
6681 2021-09-07 David Faust <david.faust@oracle.com>
6683 * doc/extend.texi (BPF Type Attributes) New node.
6684 Document new preserve_access_index attribute.
6685 Document new preserve_access_index builtin.
6686 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
6688 2021-09-07 David Faust <david.faust@oracle.com>
6690 * config/bpf/bpf.c: Adjust includes.
6691 (bpf_handle_preserve_access_index_attribute): New function.
6692 (bpf_attribute_table): Use it here.
6693 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
6694 (bpf_option_override): Handle "-mco-re" option.
6695 (bpf_asm_init_sections): New.
6696 (TARGET_ASM_INIT_SECTIONS): Redefine.
6697 (bpf_file_end): New.
6698 (TARGET_ASM_FILE_END): Redefine.
6699 (bpf_init_builtins): Add "__builtin_preserve_access_index".
6700 (bpf_core_compute, bpf_core_get_index): New.
6701 (is_attr_preserve_access): New.
6702 (bpf_expand_builtin): Handle new builtins.
6703 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
6704 (bpf_core_walk): New.
6705 (bpf_resolve_overloaded_builtin): New.
6706 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
6708 (pass_bpf_core_attr): New RTL pass.
6709 * config/bpf/bpf-passes.def: New file.
6710 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
6711 * config/bpf/coreout.c: New file.
6712 * config/bpf/coreout.h: Likewise.
6713 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
6714 (coreout.o): New rule.
6715 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
6716 * config.gcc (bpf): Add coreout.h to extra_headers.
6717 Add coreout.o to extra_objs.
6718 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
6720 2021-09-07 David Faust <david.faust@oracle.com>
6722 * btfout.c (get_btf_id): Function is no longer static.
6723 * ctfc.h: Expose it here.
6725 2021-09-07 David Faust <david.faust@oracle.com>
6727 * ctfc.c (ctf_lookup_tree_type): New function.
6730 2021-09-07 David Faust <david.faust@oracle.com>
6732 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
6733 * ctfc.h: Analogous change.
6735 2021-09-07 David Faust <david.faust@oracle.com>
6737 * dwarf2out.c (lookup_type_die): Function is no longer static.
6738 * dwarf2out.h: Expose it here.
6740 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
6742 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
6743 (ctf_debug_early_finish): New definition.
6744 (ctf_debug_finish): Likewise.
6745 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
6746 (ctf_debug_early_finish): New declaration.
6747 (ctf_debug_finish): Likewise.
6748 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
6749 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
6751 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
6753 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
6754 support when compiling for CO-RE.
6755 * config/bpf/bpf.opt: Add new command line option -mco-re.
6757 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
6759 * flag-types.h (enum debug_info_type): Add new enum
6760 DINFO_TYPE_BTF_WITH_CORE.
6761 (BTF_WITH_CORE_DEBUG): New bitmask.
6762 * flags.h (btf_with_core_debuginfo_p): New declaration.
6763 * opts.c (btf_with_core_debuginfo_p): New definition.
6765 2021-09-07 Jason Merrill <jason@redhat.com>
6767 * tree.h (error_operand_p): Change to inline function.
6769 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
6771 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
6772 (empty_block_with_phis_p): ...this.
6773 (potentially_threadable_block): Same.
6774 (jump_threader::thread_through_normal_block): Same.
6776 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
6779 * dwarf2out.c (mark_base_types): New overloaded function.
6780 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
6781 as the compilation unit, and call move_marked_base_types afterward.
6783 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
6786 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
6788 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
6790 2021-09-07 Richard Biener <rguenther@suse.de>
6792 PR tree-optimization/102226
6793 * tree-vect-loop.c (vect_transform_cycle_phi): Record
6794 the converted value for the epilogue PHI use.
6796 2021-09-07 Martin Liska <mliska@suse.cz>
6798 PR gcov-profile/80223
6799 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
6800 options, do not inline when no_profile_instrument_function
6801 attributes are different in early inliner. It's fine to inline
6802 it after PGO instrumentation.
6804 2021-09-07 Richard Biener <rguenther@suse.de>
6806 PR tree-optimization/101555
6807 * tree-ssa-pre.c (translate_vuse_through_block): Do not
6808 perform an alias walk to determine the validity of the
6809 mem at the start of the block which is already guaranteed
6810 by means of prune_clobbered_mems.
6811 (phi_translate_1): Pass edge to translate_vuse_through_block.
6813 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
6816 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
6817 (remainder<mode>3): Likewise.
6819 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
6821 * config/mips/mips.c (mips_file_start): add .module for
6824 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
6826 * wide-int.cc (wi::clz): Reorder tests to ensure the result
6827 is zero for all negative values.
6829 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
6831 * doc/invoke.texi (-foffload-options): Fix @opindex.
6833 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
6836 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
6837 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
6838 source alternative for AVX.
6840 2021-09-06 liuhongt <hongtao.liu@intel.com>
6842 PR middle-end/102182
6843 * optabs.c (expand_fix): Add from1 to avoid from being
6846 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
6848 * dwarf2out.c (modified_type_die): Deal with all array types earlier
6849 and use local variable consistently throughout the function.
6851 2021-09-06 Jakub Jelinek <jakub@redhat.com>
6853 PR tree-optimization/102207
6854 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
6855 were promoted from signed to wider unsigned type.
6857 2021-09-06 Andrew Pinski <apinski@marvell.com>
6859 PR tree-optimization/63184
6860 * match.pd: Add simplification of pointer_diff of two pointer_plus
6861 with addr_expr in the first operand of each pointer_plus.
6862 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
6863 in the first operand of each pointer_plus.
6865 2021-09-06 Richard Biener <rguenther@suse.de>
6867 PR tree-optimization/102176
6868 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
6870 (vect_bb_slp_scalar_cost): Use the computed set of
6871 vectorized scalar stmts instead of relying on the out-of-date
6872 and not accurate PURE_SLP_STMT.
6873 (vect_bb_vectorization_profitable_p): Compute the set
6874 of vectorized scalar stmts.
6876 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
6878 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
6879 GIMPLE_COND special casing.
6880 (path_range_query::range_defined_in_block): Use range_of_stmt
6881 instead of calling fold_range directly.
6883 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
6885 * gimple-range-path.cc (path_range_query::range_of_expr): Set
6886 m_undefined_path when appropriate.
6887 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
6888 (path_range_query::unreachable_path_p): New.
6889 (path_range_query::precompute_ranges): Set m_undefined_path.
6890 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
6891 (path_range_query::internal_range_of_expr): New.
6892 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
6893 Use unreachable_path_p.
6895 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
6897 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
6898 Remove argument and call find_taken_edge.
6899 (back_threader::resolve_phi): Do not calculate taken edge before
6900 calling maybe_register_path.
6901 (back_threader::find_paths_to_names): Same.
6903 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
6905 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
6906 * config/h8300/testcompare.md (store_c): Update name, use new
6908 (store_neg_c, store_shifted_c): New patterns.
6910 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
6913 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
6914 instead of r12 for CR save, in all cases.
6916 2021-09-03 Andrew Pinski <apinski@marvell.com>
6918 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
6919 Handle error_mark_node as the type of the type_decl.
6921 2021-09-03 Andrew Pinski <apinski@marvell.com>
6923 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
6925 (aarch64_simd_types): Likewise.
6926 (aarch64_simd_intOI_type_node): Likewise.
6927 (aarch64_simd_intCI_type_node): Likewise.
6928 (aarch64_simd_intXI_type_node): Likewise.
6929 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
6930 (aarch64_fp16_ptr_type_node): Likewise.
6931 (aarch64_bf16_type_node): Likewise.
6932 (aarch64_bf16_ptr_type_node): Likewise.
6934 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6936 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
6938 (minus_op1_op2_relation_effect): ...here.
6939 (class operator_pointer_diff): New.
6940 (operator_pointer_diff::op1_op2_relation_effect): Call
6941 minus_op1_op2_relation_effect.
6942 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
6944 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6946 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
6947 Add may_peel_loop_headers.
6948 (back_threader_registry::thread_through_all_blocks): Same.
6949 (try_thread_blocks): Pass may_peel_loop_headers argument.
6950 (pass_early_thread_jumps::execute): Same.
6952 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6954 * tree-ssa-threadedge.c (has_phis_p): New.
6955 (forwarder_block_p): New.
6956 (potentially_threadable_block): Call forwarder_block_p.
6957 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
6958 (jump_threader::thread_through_normal_block): Call
6961 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6963 * tree-ssa-threadbackward.c (back_threader::dump): New.
6964 (back_threader::debug): New.
6965 (back_threader_profitability::profitable_path_p): Dump blocks
6966 even if we are bailing early.
6968 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6970 * tree-ssa-threadupdate.c (cancel_thread): New.
6971 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
6972 (jump_thread_path_registry::mark_threaded_blocks): Same.
6973 (jump_thread_path_registry::register_jump_thread): Same.
6975 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6977 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
6978 which objects are available.
6979 (jt_state::pop): Same.
6980 (jt_state::register_equiv): Same.
6981 (jt_state::register_equivs_on_edge): Same.
6983 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6985 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
6986 Move pop until after a thread is registered.
6988 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6990 * tree-ssa-threadupdate.c (debug): New.
6992 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
6994 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
6995 (push_dump_file::~push_dump_file): New.
6996 (dump_ranger): Change dump_file temporarily while dumping
6998 * gimple-range-trace.h (class push_dump_file): New.
7000 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7002 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
7003 (dump_ranger): Dump function name.
7005 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7007 * gimple-range-path.cc (path_range_query::range_defined_in_block):
7008 Adjust for non-null.
7009 (path_range_query::adjust_for_non_null_uses): New.
7010 (path_range_query::precompute_ranges): Call
7011 adjust_for_non_null_uses.
7012 * gimple-range-path.h: Add m_non_null and
7013 adjust_for_non_null_uses.
7015 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7017 * gimple-range-path.cc (path_range_query::dump): Dump path
7019 (path_range_query::precompute_ranges): Dump entire path.
7021 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7023 * value-relation.cc (relation_oracle::debug): New.
7024 * value-relation.h (relation_oracle::debug): New.
7026 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7028 * tree-ssa-loop-ch.c: Remove unnecessary include file.
7030 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7032 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
7033 Skip statements with no defining BB.
7034 * gimple-range-path.cc (path_range_query::range_defined_in_block):
7035 Do not get confused by statements with no defining BB.
7037 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7039 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
7040 gimple_range_adjustment. Add support for constants.
7041 (adjust_realpart_expr): New.
7042 (gimple_range_adjustment): Move IMAGPART_EXPR code to
7043 adjust_imagpart_expr.
7044 * range-op.cc (integral_table::integral_table): Add entry for
7047 2021-09-03 Jakub Jelinek <jakub@redhat.com>
7049 * omp-expand.c (expand_omp_atomic_pipeline): Use
7050 IFN_ATOMIC_COMPARE_EXCHANGE instead of
7051 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
7054 2021-09-03 Jakub Jelinek <jakub@redhat.com>
7057 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
7058 that is false if DECL_BIT_FIELD.
7059 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
7060 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
7061 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
7062 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
7063 of DECL_FIELD_ABI_IGNORED.
7064 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
7065 SET_DECL_FIELD_ABI_IGNORED instead of writing to
7066 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
7067 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
7068 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
7069 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
7071 2021-09-03 liuhongt <hongtao.liu@intel.com>
7074 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
7075 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
7076 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
7078 2021-09-02 Martin Sebor <msebor@redhat.com>
7080 PR tree-optimization/17506
7082 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
7084 2021-09-02 Richard Biener <rguenther@suse.de>
7086 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
7087 fix for PR78185 and continue processing when leaving
7090 2021-09-02 Jakub Jelinek <jakub@redhat.com>
7092 PR tree-optimization/99591
7093 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
7096 2021-09-02 Richard Biener <rguenther@suse.de>
7099 2021-09-02 Richard Biener <rguenther@suse.de>
7101 PR tree-optimization/102155
7102 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
7103 over a part of the RPO array and do not recurse here.
7104 Dump blocks marked as always executed.
7105 (fill_always_executed_in): Walk over the RPO array and
7106 process loops whose header we run into.
7107 (loop_invariant_motion_in_fun): Compute the first RPO
7108 using rev_post_order_and_mark_dfs_back_seme in iteration
7109 order and pass that to fill_always_executed_in.
7111 2021-09-02 liuhongt <hongtao.liu@intel.com>
7113 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
7114 * config/i386/i386.c (enum x86_64_reg_class): Add
7116 (merge_classes): Handle X86_64_SSEHF_CLASS.
7117 (examine_argument): Ditto.
7118 (construct_container): Ditto.
7119 (classify_argument): Ditto, and set HFmode/HCmode to
7121 (function_value_32): Return _FLoat16/Complex Float16 by
7123 (function_value_64): Return _Float16/Complex Float16 by SSE
7125 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
7126 (ix86_secondary_reload): Require gpr as intermediate register
7127 to store _Float16 from sse register when sse4 is not
7129 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
7131 (ix86_scalar_mode_supported_p): Ditto.
7132 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
7133 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
7134 (VALID_INT_MODE_P): Add HFmode and HCmode.
7135 * config/i386/i386.md (*pushhf_rex64): New define_insn.
7137 (*movhf_internal): Ditto.
7138 * doc/extend.texi (Half-Precision Floating Point): Documemt
7141 2021-09-02 Richard Biener <rguenther@suse.de>
7143 PR tree-optimization/102155
7144 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
7145 over a part of the RPO array and do not recurse here.
7146 Dump blocks marked as always executed.
7147 (fill_always_executed_in): Walk over the RPO array and
7148 process loops whose header we run into.
7149 (loop_invariant_motion_in_fun): Compute the first RPO
7150 using rev_post_order_and_mark_dfs_back_seme in iteration
7151 order and pass that to fill_always_executed_in.
7153 2021-09-02 YunQiang Su <syq@debian.org>
7156 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
7158 * config/mips/mips.c (mips_module_isa_name): New.
7159 mips_file_start: add .module mipsREV to all asm output
7161 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
7163 PR tree-optimization/102152
7164 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
7165 comparison to a scalar comparison before calling
7166 update_stmt_if_modified.
7168 2021-09-01 Andrew Pinski <apinski@marvell.com>
7171 * config/aarch64/aarch64.c (aarch64_expand_setmem):
7172 Check STRICT_ALIGNMENT before creating an overlapping
7175 2021-09-01 Martin Sebor <msebor@redhat.com>
7177 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
7178 (check_access): Pass additional argument.
7179 (check_memop_access): Remove template and make a member function.
7180 (maybe_check_dealloc_call): Make a pass_waccess member function.
7181 (class pass_waccess): Add, rename, and remove members.
7182 (pass_waccess::pass_waccess): Adjust to name change.
7183 (pass_waccess::~pass_waccess): Same.
7184 (check_alloca): Make a member function.
7185 (check_alloc_size_call): Same.
7186 (check_strcat): Same.
7187 (check_strncat): Same.
7188 (check_stxcpy): Same.
7189 (check_stxncpy): Same.
7190 (check_strncmp): Same.
7191 (maybe_warn_rdwr_sizes): Rename...
7192 (pass_waccess::maybe_check_access_sizes): ...to this.
7193 (pass_waccess::check_call): Adjust to name changes.
7194 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
7196 (pass_waccess::execute): Adjust to name changes.
7197 * gimple-ssa-warn-access.h (check_memop_access): Remove.
7198 * pointer-query.cc (access_ref::phi): Handle null pointer.
7199 (access_ref::inform_access): Same.
7200 (pointer_query::put_ref): Modify a cached value, not a copy of it.
7201 (pointer_query::dump): New function.
7202 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
7204 * pointer-query.h (pointer_query::dump): Declare.
7205 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
7206 (dump_strlen_info): Use function query.
7207 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
7209 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
7211 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
7213 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
7215 * doc/extend.texi: Document unavailable attribute.
7216 * print-tree.c (print_node): Handle unavailable attribute.
7217 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
7218 * tree.c (error_unavailable_use): New.
7219 * tree.h (TREE_UNAVAILABLE): New.
7220 (error_unavailable_use): New.
7222 2021-09-01 Jakub Jelinek <jakub@redhat.com>
7224 PR tree-optimization/102124
7225 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
7226 MINUS_EXPR, if itype is unsigned with smaller precision than type,
7227 add an extra cast to signed variant of itype to ensure sign-extension.
7229 2021-09-01 Martin Liska <mliska@suse.cz>
7231 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
7232 edges and rather use colors for TRUE and FALSE edges.
7234 2021-09-01 Richard Biener <rguenther@suse.de>
7236 PR tree-optimization/93491
7237 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
7238 after processing the stmt itself. Do not consider
7239 pure functions possibly not returning. Properly avoid
7240 adding possibly trapping calls to EXP_GEN when there's
7241 a preceeding possibly not returning call.
7242 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
7245 2021-09-01 Richard Biener <rguenther@suse.de>
7247 PR tree-optimization/102139
7248 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
7249 type to record a std::pair of the stmt-info and the innermost
7251 (dr_vec_info::group): New member.
7252 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
7253 (vect_compute_data_ref_alignment): Verify the recorded
7254 base alignment can be used.
7255 (data_ref_pair): Remove.
7256 (dr_group_sort_cmp): Adjust.
7257 (vect_analyze_data_ref_accesses): Store the group-ID in the
7258 dr_vec_info and operate on a vector of dr_vec_infos.
7260 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
7262 * read-md.c (md_reader::handle_enum): support value assignation.
7263 * doc/md.texi: record define_c_enum value assignation support.
7265 2021-09-01 Jakub Jelinek <jakub@redhat.com>
7267 PR tree-optimization/102141
7268 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
7269 argument. If false, emit stmts after gsi instead of before, and
7271 (bswap_replace): Adjust callers. When converting output of bswap,
7272 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
7275 2021-09-01 liuhongt <hongtao.liu@intel.com>
7277 * emit-rtl.c (validate_subreg): Get rid of all float-int
7280 2021-09-01 liuhongt <hongtao.liu@intel.com>
7283 2021-08-30 liuhongt <hongtao.liu@intel.com>
7285 * expmed.c (extract_bit_field_1): Make sure we're playing with
7286 integral modes before call extract_integral_bit_field.
7287 (extract_integral_bit_field): Add a parameter of type
7288 scalar_int_mode which corresponds to of tmode.
7289 And call extract_and_convert_fixed_bit_field instead of
7290 extract_fixed_bit_field and convert_extracted_bit_field.
7291 (extract_and_convert_fixed_bit_field): New function, it's a
7292 combination of extract_fixed_bit_field and
7293 convert_extracted_bit_field.
7295 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
7297 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
7299 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
7301 * omp-general.h (omp_is_reference): Rename to...
7302 (omp_privatize_by_reference): ... this. Adjust all users...
7303 * omp-general.c: ... here, ...
7304 * gimplify.c: ... here, ...
7305 * omp-expand.c: ... here, ...
7306 * omp-low.c: ... here.
7308 2021-08-31 Martin Sebor <msebor@redhat.com>
7310 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
7311 pointer element for equality to zero, not that of the cotaining
7314 2021-08-31 Martin Sebor <msebor@redhat.com>
7316 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
7318 2021-08-31 Martin Sebor <msebor@redhat.com>
7320 * function.h (function): Add comments.
7321 (get_range_query): Same. Add attribute returns nonnull.
7323 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
7325 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
7326 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
7327 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
7329 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
7331 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
7333 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
7334 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
7335 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
7336 (m32c-*-elf*, m32c-*-rtems*): Likewise.
7337 * config/cris/cris.h (DBX_NO_XREFS): Remove.
7338 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
7339 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
7340 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
7341 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
7342 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
7344 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
7346 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
7347 allowed on target constructs and only with particular other clauses.
7348 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
7350 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
7351 structs when 'ancestor' is used.
7352 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
7354 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
7355 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
7357 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
7359 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
7360 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
7362 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
7363 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
7364 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
7365 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
7366 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
7367 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
7368 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
7369 would be paradoxical.
7371 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
7373 * combine.c (combine_simplify_rtx): Avoid converting an explicit
7374 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
7375 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
7377 2021-08-31 Richard Biener <rguenther@suse.de>
7379 PR tree-optimization/102142
7380 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
7381 condition under which to unset the visited flag.
7383 2021-08-31 Richard Biener <rguenther@suse.de>
7385 PR middle-end/102129
7386 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
7387 possibly trapping expressions across calls.
7389 2021-08-31 Jakub Jelinek <jakub@redhat.com>
7391 PR tree-optimization/102134
7392 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
7393 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
7396 2021-08-31 Andrew Pinski <apinski@marvell.com>
7399 * collect-utils.c (setup_signals): New declaration.
7400 * collect-utils.h (setup_signals): New function.
7401 * collect2.c (handler): Delete.
7402 (main): Instead of manually setting up the signals,
7403 just call setup_signals.
7404 * lto-wrapper.c (main): Likewise.
7406 2021-08-31 Andrew Pinski <apinski@marvell.com>
7409 * config/i386/i386-protos.h (x86_output_aligned_bss):
7410 Change align argument to unsigned type.
7411 (x86_elf_aligned_decl_common): Likewise.
7412 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
7413 (x86_output_aligned_bss): Likewise.
7415 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
7417 * config/mips/mips.c (mips_module_isa_name): New.
7418 mips_file_start: add .module mipsREV to all asm output
7420 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
7422 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
7423 use enum instead of int for 'isa' member.
7424 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
7425 config/mips/netbsd.h: replace hardcoded numbers with enum.
7427 2021-08-31 liuhongt <hongtao.liu@intel.com>
7429 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
7430 define_split to define_insn_and_split.
7431 (*avx2_eq<mode>3): Removed.
7432 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
7433 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
7434 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
7436 (*avx2_gt<mode>3): Removed.
7437 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
7438 define_insn to define_expand, and adjust pattern.
7439 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
7441 2021-08-30 David Malcolm <dmalcolm@redhat.com>
7444 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
7446 2021-08-30 Jason Merrill <jason@redhat.com>
7448 * doc/invoke.texi: Document -Wmissing-requires.
7450 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
7452 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
7453 TARGET_EXTRA_BUILTINS guard.
7455 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
7457 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
7458 initialization of V2DI_type_node and unsigned_V2DI_type_node.
7460 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
7462 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
7463 decl when new_builtins_are_live.
7464 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
7467 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
7469 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
7470 OPTION_MASK_P10_FUSION_2STORE.
7471 (POWERPC_MASKS): Likewise.
7472 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
7473 store fusion for Power10.
7474 (is_fusable_store): New.
7475 (power10_sched_reorder): Likewise.
7476 (rs6000_sched_reorder): Do Power10 specific reordering.
7477 (rs6000_sched_reorder2): Likewise.
7478 * config/rs6000/rs6000.opt: Add new option.
7480 2021-08-30 Richard Biener <rguenther@suse.de>
7482 PR tree-optimization/102128
7483 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
7484 Move scanning for if-converted scalar code to the caller
7485 and instead delay clearing the visited flag for profitable
7487 (vect_slp_region): Cost all subgraphs before scheduling.
7488 For if-converted BB vectorization scan for scalar COND_EXPRs
7489 and do not vectorize if any found and the cost model is
7492 2021-08-30 Richard Biener <rguenther@suse.de>
7494 * common.opt (fexceptions): Mark
7495 EnabledBy(fnon-call-exceptions).
7496 * doc/invoke.texi (fnon-call-exceptions): Document this
7497 enables -fexceptions.
7499 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
7501 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
7502 is not already defined.
7504 2021-08-30 liuhongt <hongtao.liu@intel.com>
7506 * expmed.c (extract_bit_field_1): Make sure we're playing with
7507 integral modes before call extract_integral_bit_field.
7508 (extract_integral_bit_field): Add a parameter of type
7509 scalar_int_mode which corresponds to of tmode.
7510 And call extract_and_convert_fixed_bit_field instead of
7511 extract_fixed_bit_field and convert_extracted_bit_field.
7512 (extract_and_convert_fixed_bit_field): New function, it's a
7513 combination of extract_fixed_bit_field and
7514 convert_extracted_bit_field.
7516 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
7518 * config/darwin.c (darwin_libc_has_function): Do not run
7519 the checks for x86 or modern Darwin. Make sure that there
7520 is a value set for darwin_macosx_version_min before testing.
7522 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
7524 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
7526 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
7528 * ipa-modref-tree.h (modref_access_node::merge): Break out
7529 logic combining offsets and logic merging ranges to ...
7530 (modref_access_node::combined_offsets): ... here
7531 (modref_access_node::update2): ... here
7532 (modref_access_node::closer_pair_p): New member function.
7533 (modref_access_node::forced_merge): New member function.
7534 (modre_ref_node::insert): Do merging when table is full.
7536 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
7539 * config.gcc: MIPS: use N64 ABI by default if the triple end
7540 with -gnuabi64, which is used by Debian since 2013.
7542 2021-08-28 Alexandre Oliva <oliva@adacore.com>
7544 * ipa-modref.c (analyze_function): Skip debug stmts.
7545 * tree-inline.c (estimate_num_insn): Consider builtins even
7546 without a cgraph_node.
7548 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
7550 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
7551 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
7552 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
7553 * config/h8300/h8300.md (eqne): New code iterator.
7554 (geultu, geultu_to_c): Similarly.
7555 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
7556 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
7557 (cmp<mode>_c): New pattern
7559 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
7561 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
7563 (dom_opt_dom_walker::optimize_stmt): Use it.
7565 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
7567 * config/darwin.c (finalize_ctors): Add a section-start linker-
7569 (finalize_dtors): Likewise.
7570 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
7572 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
7574 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
7575 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
7576 old initialization logic when new builtins are enabled.
7577 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
7578 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
7579 (write_init_file): Likewise.
7581 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
7583 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
7584 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
7585 (amdgcn-* | gcn-*) Likewise.
7587 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
7589 * symbol-summary.h: Added missing template keyword.
7591 2021-08-27 Richard Biener <rguenther@suse.de>
7593 PR tree-optimization/45178
7594 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
7595 infinite loops without exit do not mark control dependent
7596 edges of the latch necessary.
7598 2021-08-27 konglin1 <lingling.kong@intel.com>
7601 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
7603 (<avx512>scattersi<mode>): Likewise.
7604 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
7605 (*avx512f_scatterdi<VI48F:mode>): Likewise
7607 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
7609 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
7610 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
7611 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
7612 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
7613 P10_BUILTIN_PEXTD on Power10.
7615 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
7617 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
7618 signedness for some Power10 bifs.
7620 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
7623 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
7624 computed alignment if the entire struct has attribute packed.
7626 2021-08-27 liuhongt <hongtao.liu@intel.com>
7630 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
7631 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
7632 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
7633 IX86_BUILTIN_SHUFPS256.
7634 (ix86_masked_all_ones): New function.
7636 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
7638 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
7639 (conditional moves with memory inputs splitters): Ditto.
7640 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
7642 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
7644 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
7645 search after merging.
7647 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
7649 * config/rs6000/rs6000-overload.def: Add remaining overloads.
7651 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
7653 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
7655 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
7657 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
7658 crypto, and htm stanzas.
7660 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
7662 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
7664 2021-08-26 Martin Sebor <msebor@redhat.com>
7666 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
7667 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
7668 (warn_uninitialized_vars): Same. Reduce visibility of locals.
7669 (warn_uninitialized_phi): Same.
7671 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
7673 * tree-ssa-ccp.c (get_individual_bits): Helper function to
7674 extract the individual bits from a widest_int constant (mask).
7675 (gray_code_bit_flips): New read-only table for effiently
7676 enumerating permutations/combinations of bits.
7677 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
7678 by unknown counts that are guaranteed less than the target
7679 precision and four or fewer unknown bits by enumeration.
7680 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
7681 enumeration under the same conditions. Handle remaining
7682 shifts as a mask based upon the minimum possible shift value.
7684 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
7685 Richard Biener <rguenther@suse.de>
7687 * match.pd (shift transformations): Remove a redundant
7688 !POINTER_TYPE_P check.
7690 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
7693 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
7694 Set all_regs to true in the call to replace_rtx.
7696 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
7698 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
7699 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
7700 hiting --param modref-max-refs limit.
7701 (modref_tree:insert_base): Be smarter when hitting
7702 --param modref-max-bases limit. Add new parameter REF.
7703 (modref_tree:insert): Update.
7704 (modref_tree:merge): Update.
7705 * ipa-modref.c (read_modref_records): Update.
7707 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
7709 * params.opt: (modref-max-adjustments): Add full stop.
7711 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
7713 * ipa-modref-tree.h (modref_ref_node::verify): New member
7715 (modref_ref_node::insert): Use it.
7716 (modref_ref_node::try_mere_with): Fix off by one error.
7718 2021-08-26 Martin Liska <mliska@suse.cz>
7719 Stefan Kneifel <stefan.kneifel@bluewin.ch>
7721 * cgraph.h (create_version_clone_with_body): Add new parameter.
7722 * cgraphclones.c: Likewise.
7723 * multiple_target.c (create_dispatcher_calls): Do not use
7725 (create_target_clone): Likewise here.
7727 2021-08-26 Jonathan Yong <10walls@gmail.com>
7729 * doc/extend.texi: Add note about reserved priorities
7730 to the constructor attribute.
7732 2021-08-25 Martin Sebor <msebor@redhat.com>
7734 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
7735 range table header alone.
7736 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
7738 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
7740 * doc/invoke.texi: Document --param modref-max-adjustments.
7741 * ipa-modref-tree.c (test_insert_search_collapse): Update.
7742 (test_merge): Update.
7743 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
7744 (modref_access_node::operator==): Fix handling of access ranges.
7745 (modref_access_node::contains): Constify parameter; handle also
7746 mismatched parm offsets.
7747 (modref_access_node::update): New function.
7748 (modref_access_node::merge): New function.
7749 (unspecified_modref_access_node): Update constructor.
7750 (modref_ref_node::insert_access): Add record_adjustments parameter;
7752 (modref_ref_node::try_merge_with): New private function.
7753 (modref_tree::insert): New record_adjustments parameter.
7754 (modref_tree::merge): New record_adjustments parameter.
7755 (modref_tree::copy_from): Update.
7756 * ipa-modref.c (dump_access): Dump adjustments field.
7757 (get_access): Update constructor.
7758 (record_access): Update call of insert.
7759 (record_access_lto): Update call of insert.
7760 (merge_call_side_effects): Add record_adjustments parameter.
7761 (get_access_for_fnspec): Update.
7762 (process_fnspec): Update.
7763 (analyze_call): Update.
7764 (analyze_function): Update.
7765 (read_modref_records): Update.
7766 (ipa_merge_modref_summary_after_inlining): Update.
7767 (propagate_unknown_call): Update.
7768 (modref_propagate_in_scc): Update.
7769 * params.opt (param-max-modref-adjustments=): New.
7771 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
7773 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
7775 (xxspltiw_v4si): Use vecperm type attribute.
7776 (xxspltiw_v4si_inst): Use vecperm type attribute.
7777 (xxspltiw_v4sf_inst): Likewise.
7778 (xxspltidp_v2df): Use vecperm type attribute. Use
7779 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
7780 (xxspltidp_v2df_inst): Likewise.
7781 (xxsplti32dx_v4si): Use vecperm type attribute.
7782 (xxsplti32dx_v4si_inst): Likewise.
7783 (xxsplti32dx_v4sf_inst): Likewise.
7784 (xxblend_<mode>): Likewise.
7785 (xxpermx): Likewise.
7786 (xxpermx_inst): Likewise.
7789 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
7792 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
7793 * diagnostic.c (diagnostic_initialize_input_context): New function.
7794 * diagnostic.h (diagnostic_initialize_input_context): Declare.
7795 * input.c (default_charset_callback): New function.
7796 (file_cache::initialize_input_context): New function.
7797 (file_cache_slot::create): Added ability to convert the input
7798 according to the input context.
7799 (file_cache::file_cache): Initialize the new input context.
7800 (class file_cache_slot): Added new m_alloc_offset member.
7801 (file_cache_slot::file_cache_slot): Initialize the new member.
7802 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
7803 (file_cache_slot::maybe_grow): Likewise.
7804 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
7805 (file_cache_slot::get_next_line): Likewise.
7806 * input.h (class file_cache): Added input context member.
7808 2021-08-25 Richard Biener <rguenther@suse.de>
7810 PR tree-optimization/102046
7811 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
7812 update ->any_pattern when swapping operands.
7814 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
7817 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
7818 (ix86_decompose_address): Remove retval check for ASHIFT,
7819 allow non-canonical zero extend if AND mask covers ASHIFT
7821 (ix86_legitimate_address_p): Adjust condition for decompose.
7822 (ix86_rtx_costs): Adjust cost for lea with non-canonical
7824 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
7826 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
7828 PR tree-optimization/101145
7829 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
7831 (number_of_iterations_lt): Invoke above function.
7832 (adjust_cond_for_loop_until_wrap):
7833 Merge to number_of_iterations_until_wrap.
7834 (number_of_iterations_cond): Update invokes for
7835 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
7837 2021-08-25 konglin1 <lingling.kong@intel.com>
7840 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
7842 (_mm512_mask_fpclass_ps_mask): Ditto.
7844 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
7846 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
7847 (vec_unpacku_hi_v8hi): Likewise.
7848 (vec_unpacku_lo_v16qi): Likewise.
7849 (vec_unpacku_lo_v8hi): Likewise.
7850 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
7851 (vec_unpacku_lo_<VP_small_lc>): Likewise.
7853 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
7855 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
7856 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
7857 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
7858 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
7860 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
7862 PR middle-end/102031
7863 * simplify-rtx.c (simplify_truncation): When comparing precisions
7864 use "subreg_prec" variable, not "subreg_mode".
7866 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
7868 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
7871 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
7873 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
7874 various pointer type nodes.
7875 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
7876 values for various pointer types.
7877 (ptr_V16QI_type_node): New macro.
7878 (ptr_V1TI_type_node): New macro.
7879 (ptr_V2DI_type_node): New macro.
7880 (ptr_V2DF_type_node): New macro.
7881 (ptr_V4SI_type_node): New macro.
7882 (ptr_V4SF_type_node): New macro.
7883 (ptr_V8HI_type_node): New macro.
7884 (ptr_unsigned_V16QI_type_node): New macro.
7885 (ptr_unsigned_V1TI_type_node): New macro.
7886 (ptr_unsigned_V8HI_type_node): New macro.
7887 (ptr_unsigned_V4SI_type_node): New macro.
7888 (ptr_unsigned_V2DI_type_node): New macro.
7889 (ptr_bool_V16QI_type_node): New macro.
7890 (ptr_bool_V8HI_type_node): New macro.
7891 (ptr_bool_V4SI_type_node): New macro.
7892 (ptr_bool_V2DI_type_node): New macro.
7893 (ptr_bool_V1TI_type_node): New macro.
7894 (ptr_pixel_type_node): New macro.
7895 (ptr_intQI_type_node): New macro.
7896 (ptr_uintQI_type_node): New macro.
7897 (ptr_intHI_type_node): New macro.
7898 (ptr_uintHI_type_node): New macro.
7899 (ptr_intSI_type_node): New macro.
7900 (ptr_uintSI_type_node): New macro.
7901 (ptr_intDI_type_node): New macro.
7902 (ptr_uintDI_type_node): New macro.
7903 (ptr_intTI_type_node): New macro.
7904 (ptr_uintTI_type_node): New macro.
7905 (ptr_long_integer_type_node): New macro.
7906 (ptr_long_unsigned_type_node): New macro.
7907 (ptr_float_type_node): New macro.
7908 (ptr_double_type_node): New macro.
7909 (ptr_long_double_type_node): New macro.
7910 (ptr_dfloat64_type_node): New macro.
7911 (ptr_dfloat128_type_node): New macro.
7912 (ptr_ieee128_type_node): New macro.
7913 (ptr_ibm128_type_node): New macro.
7914 (ptr_vector_pair_type_node): New macro.
7915 (ptr_vector_quad_type_node): New macro.
7916 (ptr_long_long_integer_type_node): New macro.
7917 (ptr_long_long_unsigned_type_node): New macro.
7919 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
7921 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
7922 and power9-64 stanzas.
7924 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
7925 Tom de Vries <tdevries@suse.de>
7927 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
7928 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
7929 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
7930 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
7931 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
7932 * config/nvptx/nvptx-c.c: New source file.
7933 (nvptx_cpu_cpp_builtins): Move implementation here.
7935 2021-08-24 Martin Sebor <msebor@redhat.com>
7937 PR middle-end/101600
7938 PR middle-end/101977
7939 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
7940 the phrasing of a warning.
7941 (check_access): Use the remaining size after subtracting any offset
7942 rather than the whole object size.
7943 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
7944 clear for any nonnull PHI argument.
7945 (compute_objsize): Clear argument.
7947 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
7949 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
7951 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
7953 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
7956 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
7958 * value-relation.cc (rr_transitive_table): New.
7959 (relation_transitive): New.
7960 (value_relation::swap): Remove.
7961 (value_relation::apply_transitive): New.
7962 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
7963 (relation_oracle::register_relation): Call register_transitives.
7964 (relation_oracle::register_transitives): New.
7965 * value-relation.h (relation_oracle): Add new temporary bitmap and
7968 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
7971 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
7972 from integer to a pseudo vector register.
7974 2021-08-24 Richard Biener <rguenther@suse.de>
7976 PR tree-optimization/100089
7977 * tree-vectorizer.h (vect_slp_bb): Rename to ...
7978 (vect_slp_if_converted_bb): ... this and get the original
7979 loop as new argument.
7980 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
7981 pass original loop to vect_slp_if_converted_bb.
7982 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
7983 If orig_loop was passed scan the not vectorized stmts
7984 for COND_EXPRs and force not profitable if found.
7985 (vect_slp_region): Pass down all SLP instances to costing
7986 if orig_loop was specified.
7987 (vect_slp_bbs): Pass through orig_loop.
7988 (vect_slp_bb): Rename to ...
7989 (vect_slp_if_converted_bb): ... this and get the original
7990 loop as new argument.
7991 (vect_slp_function): Adjust.
7993 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
7996 * config/arm/arm.md (attribute arch): Add fix_vlldm.
7997 (arch_enabled): Use it.
7998 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
7999 use when erratum mitigation is needed.
8001 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
8004 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
8005 * doc/invoke.texi (Arm Options): Document it.
8006 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
8007 (ALL_QUIRKS): Add quirk_vlldm.
8008 (cortex-m33): Add quirk_vlldm.
8009 (cortex-m35p, cortex-m55): Likewise.
8010 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
8011 targetting an affected CPU and not explicitly controlled on
8014 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
8016 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
8017 (lazy_load_multiple_insn): Likewise.
8019 2021-08-24 liuhongt <hongtao.liu@intel.com>
8022 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
8023 Enable avx512 embedded broadcast.
8024 (*<avx512>_vternlog<mode>_all): Ditto.
8025 (<avx512>_vternlog<mode>_mask): Ditto.
8027 2021-08-24 liuhongt <hongtao.liu@intel.com>
8030 * config/i386/i386.c (ix86_rtx_costs): Define cost for
8032 * config/i386/i386.h (STRIP_UNARY): New macro.
8033 * config/i386/predicates.md (reg_or_notreg_operand): New
8035 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
8036 (*<avx512>_vternlog<mode>_1): New pre_reload
8037 define_insn_and_split.
8038 (*<avx512>_vternlog<mode>_2): Ditto.
8039 (*<avx512>_vternlog<mode>_3): Ditto.
8040 (any_logic1,any_logic2): New code iterator.
8041 (logic_op): New code attribute.
8042 (ternlogsuffix): Extend to VNxDF and VNxSF.
8044 2021-08-24 Richard Biener <rguenther@suse.de>
8046 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
8047 * params.opt (--param vect-inner-loop-cost-factor): Adjust
8049 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
8050 inner_loop_cost_factor to the minimum of the estimated number
8051 of iterations of the inner loop and vect-inner-loop-cost-factor.
8053 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
8054 Richard Biener <rguenther@suse.de>
8056 * config/i386/i386-features.c (compute_convert_gain): Provide
8057 more accurate values for CONST_INT, when optimizing for size.
8058 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
8059 * config/i386/i386.h (COSTS_N_BYTES): to here.
8061 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
8062 Jakub Jelinek <jakub@redhat.com>
8064 PR middle-end/102029
8065 * match.pd (shift transformations): Add an additional check for
8066 !POINTER_TYPE_P in the recently added left shift transformation.
8068 2021-08-24 liuhongt <hongtao.liu@intel.com>
8070 PR tree-optimization/100089
8071 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
8072 loop vectorizer when cost model is very-cheap.
8074 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
8076 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
8077 asprintf, which is not available on AIX.
8079 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
8081 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
8082 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
8084 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
8086 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
8087 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
8089 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
8090 (rbtree.o): Likewise.
8091 (rs6000-gen-builtins): Likewise.
8092 (rs6000-builtins.c): Likewise.
8093 (rs6000-builtins.h): Likewise.
8094 (rs6000.o): Add dependency.
8095 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
8096 (rs6000-vecdefines.h): New target.
8097 (rs6000-builtins.o): Likewise.
8098 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
8099 (rs6000-c.o): Likewise.
8101 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
8104 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
8105 Diagnose buffer overrun.
8106 (safe_inc_pos): Fix overrun detection.
8107 (match_identifier): Diagnose buffer overrun.
8108 (match_integer): Likewise.
8109 (match_to_right_bracket): Likewise.
8111 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
8113 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
8114 Improve range compare.
8115 (modref_access_node::contains): New member function.
8116 (modref_access_node::search): Remove.
8117 (modref_access_node::insert): Be smarter about subaccesses.
8119 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
8121 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
8122 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
8123 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
8126 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
8128 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
8130 * config/h8300/jumpcall.md (call, call_value): Restrict to
8131 !SIBLING_CALL_P cases.
8132 (subcall, sibcall_value): New patterns & expanders.
8133 * config/h8300/proepi.md (epilogue): Pass new argument to
8134 h8300_expand_epilogue.
8135 (sibcall_epilogue): New expander.
8136 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
8138 (h8300_ok_for_sibcall_p): New function.
8139 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
8141 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
8143 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
8144 Handle case where the operand is already the desired mode.
8146 2021-08-23 Richard Biener <rguenther@suse.de>
8149 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
8150 in addition to has_gimple_body.
8152 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
8154 PR middle-end/101949
8155 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
8158 2021-08-23 Martin Liska <mliska@suse.cz>
8160 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
8163 2021-08-23 Richard Biener <rguenther@suse.de>
8165 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
8166 Properly scale the inner loop cost only once.
8168 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
8170 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
8171 Provide bounds for unsigned (and signed with non-negative operands)
8172 division and modulus.
8174 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
8176 * simplify-rtx.c (simplify_truncation): Generalize simplification
8177 of (truncate:A (subreg:B X)).
8178 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
8179 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
8180 already has the desired machine mode.
8181 (test_scalar_int_ops): Add tests that useless extensions and
8182 truncations are optimized away.
8183 (test_scalar_int_ext_ops): New self-test function to confirm
8184 that truncations of extensions are correctly simplified.
8185 (test_scalar_int_ext_ops2): New self-test function to check
8186 truncations of truncations, extensions of extensions, and
8187 truncations of extensions.
8188 (test_scalar_ops): Call the above two functions with a
8189 representative sampling of integer machine modes.
8191 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
8193 * match.pd (shift transformations): Change the sign of an
8194 LSHIFT_EXPR if it reduces the number of explicit conversions.
8196 2021-08-23 Jakub Jelinek <jakub@redhat.com>
8198 PR tree-optimization/86723
8199 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
8200 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
8201 non-memory permutation of 64-bit src which only has bytes of
8202 0 or [5..8] and n->range is 4.
8203 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
8204 find_bswap_or_nop_finalize caller, support bswap with some bytes
8205 zeroed, as long as at least two bytes are not zeroed.
8206 (bswap_replace): Add mask argument and handle masking of bswap
8208 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
8209 caller, punt if cast64_to_32 or mask is not all ones.
8210 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
8211 caller, for now punt if cast64_to_32.
8213 2021-08-23 Richard Biener <rguenther@suse.de>
8215 PR tree-optimization/79334
8216 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
8217 a type also for COMPONENT_REFs.
8218 (vn_reference_may_trap): Check ARRAY_REF with constant index
8219 against the array domain.
8221 2021-08-23 liuhongt <hongtao.liu@intel.com>
8224 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
8225 TARGET_AVX512BW to condition.
8227 2021-08-23 Jakub Jelinek <jakub@redhat.com>
8230 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
8231 register variables already during early_dwarf if possible.
8233 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
8235 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
8237 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
8240 * config/arm/arm.opt: Fix typo.
8241 * config/arm/t-rmprofile: Fix typo.
8243 2021-08-23 Jakub Jelinek <jakub@redhat.com>
8245 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
8246 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
8247 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
8248 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
8249 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
8250 if either grainsize or num_tasks clause has the strict modifier.
8252 2021-08-23 Martin Liska <mliska@suse.cz>
8254 * dbgcnt.def (DEBUG_COUNTER): New counter.
8255 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
8257 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
8259 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
8261 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
8264 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
8266 * config/mips/mips.c (mips_msa_output_shift_immediate): New
8268 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
8269 vlshr<mode>3): Call it.
8271 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
8272 Martin Liska <mliska@suse.cz>
8274 PR middle-end/101949
8275 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
8278 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
8280 * config/mips/mips.c (mips_function_rodata_section,
8281 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
8283 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
8285 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
8287 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
8289 * configure.ac (thread-local storage support): Remove tls_first_major
8290 and tls_first_minor. Use "$conftest_s" to check support.
8291 * configure: Regenerate.
8293 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
8295 * configure.ac: Fixup formatting.
8297 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
8299 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
8300 * configure.ac: ... update all callers.
8302 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
8305 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
8306 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
8307 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
8308 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
8309 Remove remaining checks for in-tree assembler.
8310 * configure: Regenerate.
8312 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
8314 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
8315 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
8317 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
8318 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
8319 Improve logical shifts by 27 bits for H8/S.
8320 (get_shift_alg): Corresponding changes.
8321 (h8300_option_override): Revert to loops for -Os when profitable.
8323 2021-08-20 Richard Biener <rguenther@suse.de>
8325 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
8327 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
8328 index as group_id when dataref_groups were not computed.
8329 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
8330 we advace to the next BB.
8332 2021-08-20 Jakub Jelinek <jakub@redhat.com>
8334 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
8337 2021-08-20 Martin Liska <mliska@suse.cz>
8339 PR gcov-profile/89961
8340 * gcov.c (make_gcov_file_name): Rewrite using std::string.
8341 (mangle_name): Simplify, do not used the second argument.
8342 (strip_extention): New function.
8343 (get_md5sum): Likewise.
8344 (get_gcov_intermediate_filename): Handle properly -p and -x
8346 (output_gcov_file): Use string type.
8347 (generate_results): Likewise.
8348 (md5sum_to_hex): Remove.
8350 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
8352 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
8353 (UNSPEC_XXSPLTIW): Move to vsx.md.
8354 (UNSPEC_XXSPLTID): Move to vsx.md.
8355 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
8356 (UNSPEC_XXBLEND): Move to vsx.md.
8357 (UNSPEC_XXPERMX): Move to vsx.md.
8358 (VM3): Move to vsx.md.
8359 (VM3_char): Move to vsx.md.
8360 (xxspltiw_v4si): Move to vsx.md.
8361 (xxspltiw_v4sf): Move to vsx.md.
8362 (xxspltiw_v4sf_inst): Move to vsx.md.
8363 (xxspltidp_v2df): Move to vsx.md.
8364 (xxspltidp_v2df_inst): Move to vsx.md.
8365 (xxsplti32dx_v4si_inst): Move to vsx.md.
8366 (xxsplti32dx_v4sf): Move to vsx.md.
8367 (xxsplti32dx_v4sf_inst): Move to vsx.md.
8368 (xxblend_<mode>): Move to vsx.md.
8369 (xxpermx): Move to vsx.md.
8370 (xxpermx_inst): Move to vsx.md.
8371 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
8372 (UNSPEC_XXSPLTIW): Move from altivec.md.
8373 (UNSPEC_XXSPLTID): Move from altivec.md.
8374 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
8375 (UNSPEC_XXBLEND): Move from altivec.md.
8376 (UNSPEC_XXPERMX): Move from altivec.md.
8377 (VM3): Move from altivec.md.
8378 (VM3_char): Move from altivec.md.
8379 (xxspltiw_v4si): Move from altivec.md.
8380 (xxspltiw_v4sf): Move from altivec.md.
8381 (xxspltiw_v4sf_inst): Move from altivec.md.
8382 (xxspltidp_v2df): Move from altivec.md.
8383 (xxspltidp_v2df_inst): Move from altivec.md.
8384 (xxsplti32dx_v4si_inst): Move from altivec.md.
8385 (xxsplti32dx_v4sf): Move from altivec.md.
8386 (xxsplti32dx_v4sf_inst): Move from altivec.md.
8387 (xxblend_<mode>): Move from altivec.md.
8388 (xxpermx): Move from altivec.md.
8389 (xxpermx_inst): Move from altivec.md.
8391 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
8393 * tree-vect-generic.c (expand_vector_operations_1): Use either
8394 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
8395 when constructing scalar splat expressions.
8397 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
8400 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
8401 pointer to __vector_pair *.
8403 2021-08-19 Martin Sebor <msebor@redhat.com>
8405 * gimple-range.cc: Add comments.
8406 * gimple-range.h: Same.
8408 2021-08-19 Martin Sebor <msebor@redhat.com>
8410 PR middle-end/101984
8411 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
8414 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
8416 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
8417 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
8418 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
8420 2021-08-19 Jakub Jelinek <jakub@redhat.com>
8422 PR middle-end/101950
8423 * optabs.c (expand_clrsb_using_clz): New function.
8424 (expand_unop): Use it as another clrsb expansion fallback.
8426 2021-08-19 liuhongt <hongtao.liu@intel.com>
8429 2021-07-28 liuhongt <hongtao.liu@intel.com>
8432 * config/i386/i386.h (processor_costs): Add new member
8434 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
8435 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
8436 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
8437 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
8438 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
8439 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
8440 generic_cost, core_cost): Initialize integer_to_sse same value
8442 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
8443 * config/i386/i386.c (ix86_builtin_vectorization_cost):
8444 Use integer_to_sse instead of sse_op to calculate the cost of
8447 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
8449 * config.gcc: Include rpath.opt for Darwin.
8450 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
8452 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
8455 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
8458 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
8460 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
8461 pattern and type cast.
8462 (vld4_lane_f32): Use float RTL pattern.
8463 (vld4q_lane_f64): Use float type cast.
8465 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
8467 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
8470 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
8472 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
8473 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
8474 (hash_map_tests_c_tests): Call it.
8476 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
8478 * ggc.h (enum ggc_collect): New.
8479 (ggc_collect): Use it.
8480 * ggc-page.c: Adjust.
8481 * ggc-common.c: Likewise.
8482 * ggc-tests.c: Likewise.
8483 * read-rtl-function.c: Likewise.
8484 * selftest-run-tests.c: Likewise.
8485 * doc/gty.texi (Invoking the garbage collector): Likewise.
8487 2021-08-18 liuhongt <hongtao.liu@intel.com>
8490 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
8492 * config/i386/sse.md (*sse3_haddv2df3_low): Add
8493 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
8494 (*sse3_hsubv2df3_low): Ditto.
8495 * config/i386/x86-tune.def
8496 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
8498 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
8500 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
8501 (gori_compute::compute_operand_range): Add tracing.
8502 (gori_compute::logical_combine): Ditto.
8503 (gori_compute::compute_logical_operands): Ditto.
8504 (gori_compute::compute_operand1_range): Ditto.
8505 (gori_compute::compute_operand2_range): Ditto.
8506 (gori_compute::outgoing_edge_range_p): Ditto.
8507 * gimple-range-gori.h (class gori_compute): Add range_tracer.
8509 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
8511 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
8512 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
8513 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
8514 * params.opt (--param=evrp-mode): Adjust options.
8516 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
8518 * Makefile.in (OBJS): Add gimple-range-trace.o.
8519 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
8520 * gimple-range-fold.cc: Adjust headers.
8521 * gimple-range-trace.cc: New.
8522 * gimple-range-trace.h: New.
8523 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
8524 (gimple_ranger::range_of_expr): Add tracing.
8525 (gimple_ranger::range_on_entry): Ditto.
8526 (gimple_ranger::range_on_exit): Ditto.
8527 (gimple_ranger::range_on_edge): Ditto.
8528 (gimple_ranger::fold_range_internal): Ditto.
8529 (gimple_ranger::dump_bb): Do not calculate edge range twice.
8530 (trace_ranger::*): Remove.
8531 (enable_ranger): Never create a trace_ranger.
8532 (debug_seed_ranger): Move to gimple-range-trace.cc.
8533 (dump_ranger): Ditto.
8534 (debug_ranger): Ditto.
8535 * gimple-range.h: Include gimple-range-trace.h.
8536 (range_on_entry, range_on_exit): No longer virtual.
8537 (class trace_ranger): Remove.
8538 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
8540 2021-08-17 Martin Sebor <msebor@redhat.com>
8542 PR middle-end/101854
8543 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
8544 in gimple-ssa-warn-access.cc.
8545 * calls.c (alloc_max_size): Move code to check_alloca.
8546 (get_size_range): Move to pointer-query.cc.
8547 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
8548 (get_attr_nonstring_decl): Move to tree.c.
8549 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
8550 (append_attrname): Same.
8551 (maybe_warn_rdwr_sizes): Same.
8552 (initialize_argument_information): Move code to
8553 gimple-ssa-warn-access.cc.
8554 * calls.h (maybe_warn_alloc_args_overflow): Move to
8555 gimple-ssa-warn-access.h.
8556 (get_attr_nonstring_decl): Move to tree.h.
8557 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
8558 (enum size_range_flags): Move to pointer-query.h.
8559 (get_size_range): Same.
8560 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
8561 to avoid Clang -Wunused-function.
8562 (get_size_range): Declare static.
8563 (maybe_emit_free_warning): Rename...
8564 (maybe_check_dealloc_call): ...to this for consistency.
8565 (class pass_waccess): Add members.
8566 (pass_waccess::~pass_waccess): Defined.
8567 (alloc_max_size): Move here from calls.c.
8568 (maybe_warn_alloc_args_overflow): Same.
8569 (check_alloca): New function.
8570 (check_alloc_size_call): New function.
8571 (check_strncat): Handle another warning flag.
8572 (pass_waccess::check_builtin): Handle alloca.
8573 (fntype_argno_type): Move here from calls.c.
8574 (append_attrname): Same.
8575 (maybe_warn_rdwr_sizes): Same.
8576 (pass_waccess::check_call): Define.
8577 (check_nonstring_args): New function.
8578 (pass_waccess::check): Call new member functions.
8579 (pass_waccess::execute): Enable ranger.
8580 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
8581 (maybe_warn_nonstring_arg): Same.
8582 * gimple-ssa-warn-restrict.c: Remove #include.
8583 * pointer-query.cc (get_size_range): Move here from calls.c.
8584 * pointer-query.h (enum size_range_flags): Same.
8585 (get_size_range): Same.
8586 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
8587 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
8589 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
8591 * ggc.h (ggc_collect): Add 'force_collect' parameter.
8592 * ggc-page.c (ggc_collect): Use that one instead of global
8593 'ggc_force_collect'. Adjust all users.
8594 * doc/gty.texi (Invoking the garbage collector): Update.
8595 * ggc-internal.h (ggc_force_collect): Remove.
8596 * ggc-common.c (ggc_force_collect): Likewise.
8597 * selftest.h (forcibly_ggc_collect): Remove.
8598 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
8599 * read-rtl-function.c (test_loading_labels): Adjust.
8600 * selftest-run-tests.c (run_tests): Likewise.
8602 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
8604 * config/darwin.c (darwin_file_end): Reset and reclaim the
8605 section names table at the end of compile.
8607 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
8610 * config.in: Regenerate.
8611 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
8612 (ASM_SPEC): Pass options to disable branch shortening where
8614 * configure: Regenerate.
8615 * configure.ac: Detect versions of 'as' that support the
8616 optimisation which has the bug.
8618 2021-08-17 Richard Biener <rguenther@suse.de>
8620 * optabs-query.c (supports_vec_gather_load_p): Also check
8622 (supports_vec_scatter_store_p): Likewise.
8623 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
8624 back to masked variants if non-masked are not supported.
8625 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
8626 When we need to use masked gather/scatter but do not have
8627 a mask set up a constant true one.
8628 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
8631 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
8633 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
8634 algorithm as PLUS_EXPR to improve subtraction bit bounds.
8635 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
8637 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
8639 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
8640 calculate the mask-value pair result of a multiplication by an
8642 (bit_value_binop) [MULT_EXPR]: Call it from here for
8643 multiplications by (sparse) non-negative constants.
8645 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
8648 * config.gcc (gcc_cv_initfini_array): Leave undefined for
8649 uclinuxfdpiceabi targets.
8651 2021-08-17 Alexandre Oliva <oliva@adacore.com>
8653 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
8654 reverse debug stmts.
8656 2021-08-17 Alexandre Oliva <oliva@adacore.com>
8658 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
8660 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
8662 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
8663 (__LD4Q_LANE_FUNC): Likewise.
8664 (vld4_lane_u8): Define without macro.
8665 (vld4_lane_u16): Likewise.
8666 (vld4_lane_u32): Likewise.
8667 (vld4_lane_u64): Likewise.
8668 (vld4_lane_s8): Likewise.
8669 (vld4_lane_s16): Likewise.
8670 (vld4_lane_s32): Likewise.
8671 (vld4_lane_s64): Likewise.
8672 (vld4_lane_f16): Likewise.
8673 (vld4_lane_f32): Likewise.
8674 (vld4_lane_f64): Likewise.
8675 (vld4_lane_p8): Likewise.
8676 (vld4_lane_p16): Likewise.
8677 (vld4_lane_p64): Likewise.
8678 (vld4q_lane_u8): Likewise.
8679 (vld4q_lane_u16): Likewise.
8680 (vld4q_lane_u32): Likewise.
8681 (vld4q_lane_u64): Likewise.
8682 (vld4q_lane_s8): Likewise.
8683 (vld4q_lane_s16): Likewise.
8684 (vld4q_lane_s32): Likewise.
8685 (vld4q_lane_s64): Likewise.
8686 (vld4q_lane_f16): Likewise.
8687 (vld4q_lane_f32): Likewise.
8688 (vld4q_lane_f64): Likewise.
8689 (vld4q_lane_p8): Likewise.
8690 (vld4q_lane_p16): Likewise.
8691 (vld4q_lane_p64): Likewise.
8692 (vld4_lane_bf16): Likewise.
8693 (vld4q_lane_bf16): Likewise.
8695 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
8697 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
8698 (__LD3Q_LANE_FUNC): Delete.
8699 (vld3_lane_u8): Define without macro.
8700 (vld3_lane_u16): Likewise.
8701 (vld3_lane_u32): Likewise.
8702 (vld3_lane_u64): Likewise.
8703 (vld3_lane_s8): Likewise.
8704 (vld3_lane_s16): Likewise.
8705 (vld3_lane_s32): Likewise.
8706 (vld3_lane_s64): Likewise.
8707 (vld3_lane_f16): Likewise.
8708 (vld3_lane_f32): Likewise.
8709 (vld3_lane_f64): Likewise.
8710 (vld3_lane_p8): Likewise.
8711 (vld3_lane_p16): Likewise.
8712 (vld3_lane_p64): Likewise.
8713 (vld3q_lane_u8): Likewise.
8714 (vld3q_lane_u16): Likewise.
8715 (vld3q_lane_u32): Likewise.
8716 (vld3q_lane_u64): Likewise.
8717 (vld3q_lane_s8): Likewise.
8718 (vld3q_lane_s16): Likewise.
8719 (vld3q_lane_s32): Likewise.
8720 (vld3q_lane_s64): Likewise.
8721 (vld3q_lane_f16): Likewise.
8722 (vld3q_lane_f32): Likewise.
8723 (vld3q_lane_f64): Likewise.
8724 (vld3q_lane_p8): Likewise.
8725 (vld3q_lane_p16): Likewise.
8726 (vld3q_lane_p64): Likewise.
8727 (vld3_lane_bf16): Likewise.
8728 (vld3q_lane_bf16): Likewise.
8730 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
8732 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
8733 (__LD2Q_LANE_FUNC): Likewise.
8734 (vld2_lane_u8): Define without macro.
8735 (vld2_lane_u16): Likewise.
8736 (vld2_lane_u32): Likewise.
8737 (vld2_lane_u64): Likewise.
8738 (vld2_lane_s8): Likewise.
8739 (vld2_lane_s16): Likewise.
8740 (vld2_lane_s32): Likewise.
8741 (vld2_lane_s64): Likewise.
8742 (vld2_lane_f16): Likewise.
8743 (vld2_lane_f32): Likewise.
8744 (vld2_lane_f64): Likewise.
8745 (vld2_lane_p8): Likewise.
8746 (vld2_lane_p16): Likewise.
8747 (vld2_lane_p64): Likewise.
8748 (vld2q_lane_u8): Likewise.
8749 (vld2q_lane_u16): Likewise.
8750 (vld2q_lane_u32): Likewise.
8751 (vld2q_lane_u64): Likewise.
8752 (vld2q_lane_s8): Likewise.
8753 (vld2q_lane_s16): Likewise.
8754 (vld2q_lane_s32): Likewise.
8755 (vld2q_lane_s64): Likewise.
8756 (vld2q_lane_f16): Likewise.
8757 (vld2q_lane_f32): Likewise.
8758 (vld2q_lane_f64): Likewise.
8759 (vld2q_lane_p8): Likewise.
8760 (vld2q_lane_p16): Likewise.
8761 (vld2q_lane_p64): Likewise.
8762 (vld2_lane_bf16): Likewise.
8763 (vld2q_lane_bf16): Likewise.
8765 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
8767 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
8770 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
8772 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
8773 (rank_for_schedule): Use it.
8775 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
8777 PR rtl-optimization/91598
8778 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
8779 insns after memory reads and before memory writes.
8781 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
8783 * rtl.h (CONST_VECTOR_P): New macro.
8784 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
8785 code testing macros.
8786 (aarch64_ptrue_all_mode): Likewise.
8787 (aarch64_expand_mov_immediate): Likewise.
8788 (aarch64_const_vec_all_in_range_p): Likewise.
8789 (aarch64_rtx_costs): Likewise.
8790 (aarch64_legitimate_constant_p): Likewise.
8791 (aarch64_simd_valid_immediate): Likewise.
8792 (aarch64_simd_make_constant): Likewise.
8793 (aarch64_convert_mult_to_shift): Likewise.
8794 (aarch64_expand_sve_vec_perm): Likewise.
8795 (aarch64_vec_fpconst_pow_of_2): Likewise.
8797 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
8799 PR tree-optimization/101938
8800 * range-op.cc (operator_abs::op1_range): Special case
8801 -TYPE_MIN_VALUE for flag_wrapv.
8803 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
8805 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
8808 2021-08-17 Jakub Jelinek <jakub@redhat.com>
8810 * tree.def (OMP_SCOPE): New tree code.
8811 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
8812 * tree-nested.c (convert_nonlocal_reference_stmt,
8813 convert_local_reference_stmt, convert_gimple_call): Handle
8815 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
8816 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
8817 * gimple.c (gimple_build_omp_scope): New function.
8818 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
8819 * gimple.h (gimple_build_omp_scope): Declare.
8820 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
8821 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
8822 gimple_omp_scope_set_clauses): New inline functions.
8823 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
8824 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
8825 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
8826 * gimple-walk.c (walk_gimple_stmt): Likewise.
8827 * gimple-low.c (lower_stmt): Likewise.
8828 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
8829 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
8830 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
8831 allowing task reductions. Reject inscan reductions on scope.
8832 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
8833 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
8834 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
8835 (estimate_num_insns): Likewise.
8836 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
8837 contexts if var isn't privatized there.
8838 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
8839 (scan_omp_1_stmt): Likewise.
8840 (maybe_add_implicit_barrier_cancel): Look through outer
8842 (lower_omp_scope): New function.
8843 (lower_omp_task_reductions): Handle OMP_SCOPE.
8844 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
8845 (diagnose_sb_1, diagnose_sb_2): Likewise.
8846 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
8847 (expand_omp): Handle GIMPLE_OMP_SCOPE.
8848 (omp_make_gimple_edges): Likewise.
8849 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
8851 2021-08-17 Richard Biener <rguenther@suse.de>
8853 PR tree-optimization/101925
8854 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
8855 reverse on COMPONENT_REF and ARRAY_REF according to
8856 what reverse_storage_order_for_component_p does.
8857 (vn_reference_eq): Compare reversed on reference ops.
8858 (reverse_storage_order_for_component_p): New overload.
8859 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
8860 on the reference looked up.
8862 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
8864 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
8866 (h8300_option_override): Use loops on H8/S more often when optimizing
8868 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
8869 accordingly. Handle various arithmetic right shifts with special
8870 sequences that we couldn't handle before.
8872 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
8874 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
8876 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
8878 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
8879 * config/sparc/sparc.c (sparc_gcov_type_size): New.
8880 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
8881 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
8882 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
8883 * doc/tm.texi.in: Regenerate.
8884 * target.def (gcov_type_size): New target hook.
8885 * targhooks.c (default_gcov_type_size): New.
8886 * targhooks.h (default_gcov_type_size): Declare.
8887 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
8889 (gimple_gen_time_profiler): Likewise.
8891 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
8893 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
8895 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
8897 PR middle-end/101931
8898 * omp-low.c (omp_runtime_api_call): Update for routines
8899 added in the meanwhile.
8901 2021-08-16 Martin Liska <mliska@suse.cz>
8903 PR tree-optimization/100393
8904 * tree-switch-conversion.c (group_cluster::dump): Use
8905 get_comparison_count.
8906 (jump_table_cluster::find_jump_tables): Pre-compute number of
8907 comparisons and then decrement it. Cache also max_ratio.
8908 (jump_table_cluster::can_be_handled): Change signature.
8909 * tree-switch-conversion.h (get_comparison_count): New.
8911 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
8913 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
8914 encodings only when -fgnat-encodings=all is specified.
8915 (add_bound_info): Likewise.
8916 (add_byte_size_attribute): Likewise.
8917 (gen_member_die): Likewise.
8919 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
8921 * omp-oacc-neuter-broadcast.cc
8922 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
8924 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
8926 * omp-oacc-neuter-broadcast.cc
8927 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
8930 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
8932 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
8933 (execute_omp_oacc_neuter_broadcast): ... here.
8934 (install_var_field, build_receiver_ref, build_sender_ref): Take
8935 'field_map_t *' parameter. Adjust all users.
8936 (worker_single_copy, neuter_worker_single): Take a
8937 'record_field_map_t *' parameter. Adjust all users.
8939 2021-08-16 liuhongt <hongtao.liu@intel.com>
8942 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
8945 2021-08-16 Martin Liska <mliska@suse.cz>
8948 * multiple_target.c (create_dispatcher_calls): Make default
8949 function local only if it is a definition.
8951 2021-08-16 Martin Liska <mliska@suse.cz>
8954 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
8955 consider equal SSA_NAMEs when one is a param.
8957 2021-08-16 liuhongt <hongtao.liu@intel.com>
8960 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
8961 Support vpermi2b for V32QI/V16QImode.
8962 (ix86_extract_perm_from_pool_constant): New function.
8963 (ix86_expand_vec_one_operand_perm_avx512): Support
8964 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
8965 (expand_vec_perm_1): Adjust comments for upper.
8966 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
8968 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
8969 (pshufb_truncv4siv4hi_operand): Ditto.
8970 (pshufb_truncv8hiv8qi_operand): Ditto.
8971 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
8972 New pre_reload define_insn_and_split.
8973 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
8974 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
8975 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
8976 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
8977 (*avx512f_permvar_truncv4div4si_1): Ditto.
8978 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
8979 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
8980 (*avx512f_pshufd_truncv2div2si_1): Ditto.
8982 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
8984 * config/riscv/multilib-generator: Support code model option for
8986 * doc/install.texi: Add document of new option for
8987 --with-multilib-generator.
8989 2021-08-15 Clément Chigot <clement.chigot@atos.net>
8991 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
8992 (rs6000_legitimize_tls_address_aix): Use it.
8993 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
8994 xcoff_tls_exec_model_detected is true.
8996 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
8998 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
8999 to allow a bit more code growth, saving many dozens of cycles.
9000 (h8300_option_override): Adjus shift_alg_si if optimizing for
9002 (get_shift_alg): Use special + inline shifts for residuals
9005 2021-08-14 Stafford Horne <shorne@gmail.com>
9008 * config/or1k/or1k-opts.h: New file.
9009 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
9010 Support generating gotha relocations if -mcmodel=large is
9012 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
9014 * config/or1k/or1k.opt (mcmodel=): New option.
9015 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
9017 2021-08-14 Martin Sebor <msebor@redhat.com>
9019 PR middle-end/101791
9020 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
9021 to valid_new_delete_pair_p.
9022 * tree.c (valid_new_delete_pair_p): Add argument.
9023 * tree.h (valid_new_delete_pair_p): Same.
9025 2021-08-14 Jakub Jelinek <jakub@redhat.com>
9028 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
9029 <case E_V64QImode>: For this mode assert
9030 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
9032 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
9035 * config/rs6000/altivec.md (xxeval): Use register_predicate
9036 instead of altivec_register_predicate.
9038 2021-08-13 Martin Sebor <msebor@redhat.com>
9040 PR middle-end/101734
9041 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
9042 (maybe_warn_operand): Call it.
9044 2021-08-13 Martin Liska <mliska@suse.cz>
9047 * attribs.c (decl_attributes): Make naked functions "noipa"
9050 2021-08-13 Martin Liska <mliska@suse.cz>
9053 * symtab.c (symtab_node::noninterposable_alias): Do not create
9054 local aliases for target_clone functions as the clonning pass
9057 2021-08-13 Martin Liska <mliska@suse.cz>
9059 * opts.c (LIVE_PATCHING_OPTION): Define.
9060 (control_options_for_live_patching): Use it in error messages.
9062 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
9064 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
9065 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
9066 ignore_stores_eaf_flags): New constants.
9067 (remove_useless_eaf_flags): New function.
9068 (eaf_flags_useful_p): Use it.
9069 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
9071 (modref_lattice::init): Add EAF_NOREAD.
9072 (modref_lattice::add_escape_point): Do not reacord escape point if
9074 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
9075 use remove_useless_eaf_flags.
9076 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
9077 (modref_lattice::merge_direct_load): Add EAF_NOREAD
9078 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
9079 (analyze_parms): Use remove_useless_eaf_flags.
9080 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
9081 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
9082 use remove_useless_eaf_flags.
9083 (modref_propagate_flags_in_scc): Update.
9084 * ipa-modref.h: Turn eaf_flags_t back to char.
9085 * tree-core.h (EAF_NOT_RETURNED): Fix.
9086 (EAF_NOREAD): New constant
9087 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
9089 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
9090 (handle_pure_call): Likewise.
9092 2021-08-12 Jakub Jelinek <jakub@redhat.com>
9094 * tree.def (OMP_MASKED): New tree code.
9095 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
9096 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
9097 OMP_CLAUSE_FILTER_EXPR): Define.
9098 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
9099 (omp_clause_code_name): Likewise.
9100 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
9101 * tree-nested.c (convert_nonlocal_omp_clauses,
9102 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
9103 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
9104 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
9105 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
9106 (dump_generic_node): Handle OMP_MASTER.
9107 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
9108 * gimple.c (gimple_build_omp_masked): New function.
9109 (gimple_copy): Handle GIMPLE_OMP_MASKED.
9110 * gimple.h (gimple_build_omp_masked): Declare.
9111 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
9112 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
9113 gimple_omp_masked_set_clauses): New inline functions.
9114 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
9115 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
9116 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
9117 * gimple-walk.c (walk_gimple_stmt): Likewise.
9118 * gimple-low.c (lower_stmt): Likewise.
9119 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
9120 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
9121 that take one expression rather than decl or constant, force
9122 gimplification of that into a SSA_NAME or temporary unless min
9124 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
9125 (gimplify_expr): Handle OMP_MASKED.
9126 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
9127 (estimate_num_insns): Likewise.
9128 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
9129 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
9130 diagnostics for existence of masked construct.
9131 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
9132 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
9133 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
9136 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
9139 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
9140 (ldexp<mode>3): Use avx512f_scalef<mode>2.
9141 (UNSPEC_SCALEF): Move from ...
9142 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
9144 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
9146 * ipa-split.c (consider_split): Fix condition testing void functions.
9148 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
9150 * doc/invoke.texi: Remove docs for threader-mode param.
9151 * flag-types.h (enum threader_mode): Remove.
9152 * params.opt: Remove threader-mode param.
9153 * tree-ssa-threadbackward.c (class back_threader): Remove
9154 path_is_unreachable_p.
9155 Make find_paths private.
9156 Add maybe_thread and thread_through_all_blocks.
9157 Remove reference marker for m_registry.
9158 Remove reference marker for m_profit.
9159 (back_threader::back_threader): Adjust for registry and profit not
9161 (dump_path): Move down.
9163 (class thread_jumps): Remove.
9164 (class back_threader_registry): Remove m_all_paths.
9166 (thread_jumps::thread_through_all_blocks): Move to back_threader
9168 (fsm_find_thread_path): Remove
9169 (back_threader::maybe_thread): New.
9170 (back_threader::thread_through_all_blocks): Move from
9172 (back_threader_registry::back_threader_registry): Remove
9174 (back_threader_registry::~back_threader_registry): Remove.
9175 (thread_jumps::find_taken_edge): Remove.
9176 (thread_jumps::check_subpath_and_update_thread_path): Remove.
9177 (thread_jumps::maybe_register_path): Remove.
9178 (thread_jumps::handle_phi): Remove.
9179 (handle_assignment_p): Remove.
9180 (thread_jumps::handle_assignment): Remove.
9181 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
9182 (thread_jumps::find_jump_threads_backwards): Remove.
9183 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
9184 (try_thread_blocks): Rename find_jump_threads_backwards to
9186 (pass_early_thread_jumps::execute): Same.
9188 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
9190 * tree-core.h (omp_clause_proc_bind_kind): Add
9191 OMP_CLAUSE_PROC_BIND_PRIMARY.
9192 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
9193 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
9195 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
9197 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
9198 fno-common reference.
9199 * config/arc/arc.c (arc_override_options): Remove overriding of
9202 2021-08-12 Jakub Jelinek <jakub@redhat.com>
9205 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
9206 If d->testing_p, return true after performing checks instead of
9207 actually expanding the insn.
9208 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
9209 !TARGET_AVX512BW and return false.
9211 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
9213 * configure.ac (PE linker --disable-dynamicbase support): New check.
9214 * configure: Regenerate.
9215 * config.in: Likewise.
9216 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
9217 (LINK_SPEC): Use it.
9218 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
9219 (LINK_SPEC): Likewise.
9221 2021-08-12 liuhongt <hongtao.liu@intel.com>
9224 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
9225 post_reload define_insn_and_split.
9226 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
9227 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
9228 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
9229 (*avx2_zero_extendv8hiv8si2_2): Ditto.
9230 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
9231 (*avx512f_zero_extendv8siv8di2_2): Ditto.
9232 (*avx2_zero_extendv4siv4di2_2): Ditto.
9233 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
9234 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
9237 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
9239 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
9242 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
9244 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
9246 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
9248 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
9249 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
9250 initialization of pcvoid_type_node here...
9251 (altivec_init_builtins): ...from here.
9252 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
9253 RS6000_BTI_const_ptr_void.
9254 (pcvoid_type_node): New macro.
9256 2021-08-11 Richard Biener <rguenther@suse.de>
9259 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
9260 hard-register accesses.
9262 2021-08-11 Richard Biener <rguenther@suse.de>
9264 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
9265 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
9266 to determine has_volatile_ops.
9268 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
9270 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
9272 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
9273 Alexandre Oliva <oliva@adacore.com>
9275 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
9277 (modref_lattice::merge_deref): Fix handling of indirect scape points.
9278 (update_escape_summary_1): Likewise.
9279 (update_escape_summary): Likewise.
9280 (ipa_merge_modref_summary_after_inlining): Likewise.
9282 2021-08-11 Richard Biener <rguenther@suse.de>
9284 PR middle-end/101858
9285 * fold-const.c (fold_binary_loc): Guard simplification
9286 of X < (cast) (1 << Y) to integer types.
9288 2021-08-11 Richard Biener <rguenther@suse.de>
9290 PR tree-optimization/101861
9291 * tree-vect-stmts.c (vectorizable_load): Fix error in
9292 previous change with regard to gather vectorization.
9294 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
9297 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
9299 (vdup_n_s16): Likewise.
9300 (vdup_n_s32): Likewise.
9301 (vdup_n_s64): Likewise.
9302 (vdup_n_u8): Likewise.
9303 (vdup_n_u16): Likewise.
9304 (vdup_n_u32): Likewise.
9305 (vdup_n_u64): Likewise.
9306 (vdup_n_p8): Likewise.
9307 (vdup_n_p16): Likewise.
9308 (vdup_n_p64): Likewise.
9309 (vdup_n_f16): Likewise.
9310 (vdup_n_f32): Likewise.
9311 (vdupq_n_s8): Likewise.
9312 (vdupq_n_s16): Likewise.
9313 (vdupq_n_s32): Likewise.
9314 (vdupq_n_s64): Likewise.
9315 (vdupq_n_u8): Likewise.
9316 (vdupq_n_u16): Likewise.
9317 (vdupq_n_u32): Likewise.
9318 (vdupq_n_u64): Likewise.
9319 (vdupq_n_p8): Likewise.
9320 (vdupq_n_p16): Likewise.
9321 (vdupq_n_p64): Likewise.
9322 (vdupq_n_f16): Likewise.
9323 (vdupq_n_f32): Likewise.
9324 (vmov_n_s8): Replace call to builtin with call to corresponding
9326 (vmov_n_s16): Likewise.
9327 (vmov_n_s32): Likewise.
9328 (vmov_n_s64): Likewise.
9329 (vmov_n_u8): Likewise.
9330 (vmov_n_u16): Likewise.
9331 (vmov_n_u32): Likewise.
9332 (vmov_n_u64): Likewise.
9333 (vmov_n_p8): Likewise.
9334 (vmov_n_p16): Likewise.
9335 (vmov_n_f16): Likewise.
9336 (vmov_n_f32): Likewise.
9337 (vmovq_n_s8): Likewise.
9338 (vmovq_n_s16): Likewise.
9339 (vmovq_n_s32): Likewise.
9340 (vmovq_n_s64): Likewise.
9341 (vmovq_n_u8): Likewise.
9342 (vmovq_n_u16): Likewise.
9343 (vmovq_n_u32): Likewise.
9344 (vmovq_n_u64): Likewise.
9345 (vmovq_n_p8): Likewise.
9346 (vmovq_n_p16): Likewise.
9347 (vmovq_n_f16): Likewise.
9348 (vmovq_n_f32): Likewise.
9349 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
9351 2021-08-11 liuhongt <hongtao.liu@intel.com>
9354 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
9355 when TARGET_AVX512F and TARGET_SSE_MATH.
9357 2021-08-10 Jakub Jelinek <jakub@redhat.com>
9360 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
9361 for V32HImode if !TARGET_AVX512BW.
9362 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
9363 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
9364 early, but actually check the permutation.
9366 2021-08-10 Richard Biener <rguenther@suse.de>
9368 PR tree-optimization/101809
9369 * tree-vect-stmts.c (get_load_store_type): Allow emulated
9370 gathers with offset vector nunits being a constant multiple
9371 of the data vector nunits.
9372 (vect_get_gather_scatter_ops): Use the appropriate nunits
9373 for the offset vector defs.
9374 (vectorizable_store): Adjust call to
9375 vect_get_gather_scatter_ops.
9376 (vectorizable_load): Likewise. Handle the case of less
9377 offset vectors than data vectors.
9379 2021-08-10 Jakub Jelinek <jakub@redhat.com>
9382 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
9383 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
9386 2021-08-10 Richard Biener <rguenther@suse.de>
9388 PR tree-optimization/101801
9389 PR tree-optimization/101819
9390 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
9391 * tree-vect-loop.c (vect_emulated_vector_p): New function.
9392 (vectorizable_reduction): Re-instantiate a check for emulated
9394 * tree-vect-stmts.c (vectorizable_shift): Likewise.
9395 (vectorizable_operation): Likewise. Cost emulated vector
9396 operations according to the scalar sequence synthesized by
9399 2021-08-10 Richard Biener <rguenther@suse.de>
9401 PR middle-end/101824
9402 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
9403 volatile in case the variable was.
9405 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
9408 * config/i386/constraints.md (BC): Document for integer SSE
9409 constant all bits set operand.
9410 (BF): New constraint for const floating-point all bits set
9412 * config/i386/i386.c (standard_sse_constant_p): Likewise.
9413 (standard_sse_constant_opcode): Likewise.
9414 * config/i386/sse.md (sseconstm1): New mode attribute.
9415 (mov<mode>_internal): Replace BC with <sseconstm1>.
9417 2021-08-10 liuhongt <hongtao.liu@intel.com>
9419 * config/i386/sse.md (cond_<insn><mode>): New expander.
9420 (VI248_AVX512VLBW): New mode iterator.
9421 * config/i386/predicates.md
9422 (nonimmediate_or_const_vec_dup_operand): New predicate.
9424 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
9426 PR tree-optimization/101741
9427 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
9428 type of parameter for toupper/tolower.
9430 2021-08-09 Martin Jambor <mjambor@suse.cz>
9433 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
9435 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
9437 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
9439 (is_store_insn1): Verify source is a register.
9441 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
9444 * config/i386/mmx.md (<any_logic:code>v2sf3):
9445 Rename from *mmx_<any_logic:code>v2sf3
9447 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9449 * config/nvptx/nvptx.c: Cross-reference parts adapted in
9450 'gcc/omp-oacc-neuter-broadcast.cc'.
9451 * omp-low.c: Likewise.
9452 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
9455 2021-08-09 Julian Brown <julian@codesourcery.com>
9456 Kwok Cheung Yeung <kcy@codesourcery.com>
9457 Thomas Schwinge <thomas@codesourcery.com>
9459 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
9460 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
9461 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
9462 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
9463 (gcn_fork_join): Update comment.
9464 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
9465 (macc_experimental_workers): Remove unused option.
9467 2021-08-09 Julian Brown <julian@codesourcery.com>
9468 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
9469 Kwok Cheung Yeung <kcy@codesourcery.com>
9470 Thomas Schwinge <thomas@codesourcery.com>
9472 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
9473 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
9474 Add documentation hook.
9475 * doc/tm.texi: Regenerate.
9476 * omp-oacc-neuter-broadcast.cc: New file.
9477 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
9478 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
9479 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
9480 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
9481 * target.def (goacc.create_worker_broadcast_record): Add target
9483 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
9485 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
9486 Rename prototype to...
9487 (gcn_goacc_create_worker_broadcast_record): ... this.
9488 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
9490 (gcn_goacc_create_worker_broadcast_record): ... this.
9491 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
9493 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
9495 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
9498 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
9501 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9503 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
9505 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9507 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
9508 consider '-foffload-abi'.
9509 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
9510 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
9513 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9515 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
9518 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9520 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
9522 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9524 * doc/gty.texi (Files): Update.
9526 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9528 * doc/gty.texi (Files): Fix GTY header file example.
9530 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
9532 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
9533 determine the upper and lower bounds from a mask-value pair.
9534 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
9535 absolute value and unsigned absolute value expressions.
9536 (bit_value_binop): Initialize *VAL's precision.
9537 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
9538 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
9539 support when the operands are unknown but potentially equal.
9540 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
9542 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
9544 * config/aarch64/aarch64.md
9545 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
9547 2021-08-08 Sergei Trofimovich <siarheit@google.com>
9549 * lra-constraints.c: Fix s/otput/output/ typo.
9551 2021-08-06 Martin Sebor <msebor@redhat.com>
9553 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
9554 (expand_builtin_strcat): Same.
9555 (expand_builtin_stpncpy): Same.
9556 (expand_builtin_strncat): Same.
9557 (check_read_access): Same.
9558 (check_memop_access): Same.
9559 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
9560 (expand_builtin_strnlen): Same.
9561 (expand_builtin_memcpy): Same.
9562 (expand_builtin_memmove): Same.
9563 (expand_builtin_mempcpy): Same.
9564 (expand_builtin_strcpy): Same.
9565 (expand_builtin_strcpy_args): Same.
9566 (expand_builtin_stpcpy_1): Same.
9567 (expand_builtin_strncpy): Same.
9568 (expand_builtin_memset): Same.
9569 (expand_builtin_bzero): Same.
9570 (expand_builtin_strcmp): Same.
9571 (expand_builtin_strncmp): Same.
9572 (expand_builtin): Remove handlers.
9573 (fold_builtin_strlen): Add a comment.
9574 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
9575 * calls.c (maybe_warn_nonstring_arg): Same.
9576 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
9577 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
9578 (gimple_fold_builtin_stpcpy): Same.
9579 * gimple-ssa-warn-access.cc (has_location): New function.
9580 (get_location): Same.
9581 (get_callee_fndecl): Same.
9584 (warn_string_no_nul): Define.
9585 (unterminated_array): Same.
9586 (check_nul_terminated_array): Same.
9587 (maybe_warn_nonstring_arg): Same.
9588 (maybe_warn_for_bound): Same.
9589 (warn_for_access): Same.
9590 (check_access): Same.
9591 (check_memop_access): Same.
9592 (check_read_access): Same.
9593 (warn_dealloc_offset): Use helper functions.
9594 (maybe_emit_free_warning): Same.
9595 (class pass_waccess): Add members.
9596 (check_strcat): New function.
9597 (check_strncat): New function.
9598 (check_stxcpy): New function.
9599 (check_stxncpy): New function.
9600 (check_strncmp): New function.
9601 (pass_waccess::check_builtin): New function.
9602 (pass_waccess::check): Call it.
9603 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
9605 (maybe_warn_for_bound): Same.
9606 (check_access): Same.
9607 (check_memop_access): Same.
9608 (check_read_access): Same.
9609 * pointer-query.h (struct access_data): Define a ctor overload.
9611 2021-08-06 Richard Biener <rguenther@suse.de>
9613 PR tree-optimization/101801
9614 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
9615 (vect_can_vectorize_without_simd_p): ... to this.
9616 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
9617 (vect_can_vectorize_without_simd_p): ... to this and fold
9618 in vect_min_worthwhile_factor.
9619 (vect_min_worthwhile_factor): Remove.
9620 (vectorizable_reduction): Adjust and remove the cost part.
9621 * tree-vect-stmts.c (vectorizable_shift): Likewise.
9622 (vectorizable_operation): Likewise.
9624 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
9627 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
9628 Add general_gr_operand predicate to operand 3.
9630 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
9632 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
9633 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
9635 2021-08-06 Tamar Christina <tamar.christina@arm.com>
9637 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
9638 register_svprfop): Pass vec<> by pointer.
9639 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
9640 * langhooks.c (lhd_simulate_enum_decl): Likewise.
9641 * langhooks.h (struct lang_hooks_for_types): Likewise.
9643 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
9645 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
9646 __builtin_memcpy instead of constructing an additional
9647 __builtin_aarch64_simd_oi one vector at a time.
9648 (vst1q_bf16_x2): Likewise.
9649 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
9650 an additional __builtin_aarch64_simd_ci one vector at a time.
9651 (vst1q_bf16_x3): Likewise.
9652 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
9653 (vst1q_bf16_x4): Likewise.
9654 (vst2_bf16): Use __builtin_memcpy instead of constructing an
9655 additional __builtin_aarch64_simd_oi one vector at a time.
9656 (vst2q_bf16): Likewise.
9657 (vst3_bf16): Use __builtin_memcpy instead of constructing an
9658 additional __builtin_aarch64_simd_ci mode one vector at a
9660 (vst3q_bf16): Likewise.
9661 (vst4_bf16): Use __builtin_memcpy instead of constructing an
9662 additional __builtin_aarch64_simd_xi one vector at a time.
9663 (vst4q_bf16): Likewise.
9665 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
9667 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
9668 (__ST2Q_LANE_FUNC): Delete.
9669 (vst2_lane_f16): Use __builtin_memcpy to copy vector
9670 structure instead of constructing __builtin_aarch64_simd_oi
9671 one vector at a time.
9672 (vst2_lane_f32): Likewise.
9673 (vst2_lane_f64): Likewise.
9674 (vst2_lane_p8): Likewise.
9675 (vst2_lane_p16): Likewise.
9676 (vst2_lane_p64): Likewise.
9677 (vst2_lane_s8): Likewise.
9678 (vst2_lane_s16): Likewise.
9679 (vst2_lane_s32): Likewise.
9680 (vst2_lane_s64): Likewise.
9681 (vst2_lane_u8): Likewise.
9682 (vst2_lane_u16): Likewise.
9683 (vst2_lane_u32): Likewise.
9684 (vst2_lane_u64): Likewise.
9685 (vst2_lane_bf16): Likewise.
9686 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
9687 structure instead of using a union.
9688 (vst2q_lane_f32): Likewise.
9689 (vst2q_lane_f64): Likewise.
9690 (vst2q_lane_p8): Likewise.
9691 (vst2q_lane_p16): Likewise.
9692 (vst2q_lane_p64): Likewise.
9693 (vst2q_lane_s8): Likewise.
9694 (vst2q_lane_s16): Likewise.
9695 (vst2q_lane_s32): Likewise.
9696 (vst2q_lane_s64): Likewise.
9697 (vst2q_lane_u8): Likewise.
9698 (vst2q_lane_u16): Likewise.
9699 (vst2q_lane_u32): Likewise.
9700 (vst2q_lane_u64): Likewise.
9701 (vst2q_lane_bf16): Likewise.
9703 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
9705 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
9706 (__ST3Q_LANE_FUNC): Delete.
9707 (vst3_lane_f16): Use __builtin_memcpy to copy vector
9708 structure instead of constructing __builtin_aarch64_simd_ci
9709 one vector at a time.
9710 (vst3_lane_f32): Likewise.
9711 (vst3_lane_f64): Likewise.
9712 (vst3_lane_p8): Likewise.
9713 (vst3_lane_p16): Likewise.
9714 (vst3_lane_p64): Likewise.
9715 (vst3_lane_s8): Likewise.
9716 (vst3_lane_s16): Likewise.
9717 (vst3_lane_s32): Likewise.
9718 (vst3_lane_s64): Likewise.
9719 (vst3_lane_u8): Likewise.
9720 (vst3_lane_u16): Likewise.
9721 (vst3_lane_u32): Likewise.
9722 (vst3_lane_u64): Likewise.
9723 (vst3_lane_bf16): Likewise.
9724 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
9725 structure instead of using a union.
9726 (vst3q_lane_f32): Likewise.
9727 (vst3q_lane_f64): Likewise.
9728 (vst3q_lane_p8): Likewise.
9729 (vst3q_lane_p16): Likewise.
9730 (vst3q_lane_p64): Likewise.
9731 (vst3q_lane_s8): Likewise.
9732 (vst3q_lane_s16): Likewise.
9733 (vst3q_lane_s32): Likewise.
9734 (vst3q_lane_s64): Likewise.
9735 (vst3q_lane_u8): Likewise.
9736 (vst3q_lane_u16): Likewise.
9737 (vst3q_lane_u32): Likewise.
9738 (vst3q_lane_u64): Likewise.
9739 (vst3q_lane_bf16): Likewise.
9741 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
9743 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
9744 (__ST4Q_LANE_FUNC): Delete.
9745 (vst4_lane_f16): Use __builtin_memcpy to copy vector
9746 structure instead of constructing __builtin_aarch64_simd_xi
9747 one vector at a time.
9748 (vst4_lane_f32): Likewise.
9749 (vst4_lane_f64): Likewise.
9750 (vst4_lane_p8): Likewise.
9751 (vst4_lane_p16): Likewise.
9752 (vst4_lane_p64): Likewise.
9753 (vst4_lane_s8): Likewise.
9754 (vst4_lane_s16): Likewise.
9755 (vst4_lane_s32): Likewise.
9756 (vst4_lane_s64): Likewise.
9757 (vst4_lane_u8): Likewise.
9758 (vst4_lane_u16): Likewise.
9759 (vst4_lane_u32): Likewise.
9760 (vst4_lane_u64): Likewise.
9761 (vst4_lane_bf16): Likewise.
9762 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
9763 structure instead of using a union.
9764 (vst4q_lane_f32): Likewise.
9765 (vst4q_lane_f64): Likewise.
9766 (vst4q_lane_p8): Likewise.
9767 (vst4q_lane_p16): Likewise.
9768 (vst4q_lane_p64): Likewise.
9769 (vst4q_lane_s8): Likewise.
9770 (vst4q_lane_s16): Likewise.
9771 (vst4q_lane_s32): Likewise.
9772 (vst4q_lane_s64): Likewise.
9773 (vst4q_lane_u8): Likewise.
9774 (vst4q_lane_u16): Likewise.
9775 (vst4q_lane_u32): Likewise.
9776 (vst4q_lane_u64): Likewise.
9777 (vst4q_lane_bf16): Likewise.
9779 2021-08-06 Martin Liska <mliska@suse.cz>
9781 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
9782 a target option is restored, it can have
9783 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
9784 and error should not be emitted.
9786 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
9788 * gcov-io.h (gcov_write): Declare.
9789 * gcov-io.c (gcov_write): New.
9790 (gcov_write_counter): Remove.
9791 (gcov_write_tag_length): Likewise.
9792 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
9793 gcov_write_unsigned().
9794 * doc/invoke.texi (fprofile-info-section): Mention
9795 __gcov_info_to_gdca().
9797 2021-08-06 Martin Sebor <msebor@redhat.com>
9799 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
9800 arguments to by-reference.
9801 (iterate_fix_dominators): Same.
9802 * dominance.h (iterate_fix_dominators): Same.
9803 * ipa-prop.h: Call auto_vec::to_vec_legacy.
9804 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
9805 arguments to by-reference.
9806 (debug_data_dependence_relation): Same.
9807 (dump_data_dependence_relations): Same.
9808 * tree-data-ref.h (debug_data_dependence_relation): Same.
9809 (dump_data_dependence_relations): Same.
9810 * tree-predcom.c (dump_chains): Same.
9811 (initialize_root_vars_lm): Same.
9812 (determine_unroll_factor): Same.
9813 (replace_phis_by_defined_names): Same.
9814 (insert_init_seqs): Same.
9815 (pcom_worker::tree_predictive_commoning_loop): Call
9816 auto_vec::to_vec_legacy.
9817 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
9818 arguments to by-reference.
9819 * tree-ssa-threadbackward.c (populate_worklist): Same.
9820 (back_threader::resolve_def): Same.
9821 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
9822 (vect_enhance_data_refs_alignment): Same.
9823 (vect_check_lower_bound): Same.
9824 (vect_prune_runtime_alias_test_list): Same.
9825 (vect_permute_store_chain): Same.
9826 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
9827 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
9828 * tree-vectorizer.h (vect_permute_store_chain): Same.
9829 * vec.c (test_init): New function.
9830 (vec_c_tests): Call new function.
9831 * vec.h (vec): Declare ctors, dtor, and assignment.
9832 (auto_vec::vec_to_legacy): New function.
9833 (vec::copy): Adjust initialization.
9835 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
9838 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
9839 callee only uses GPRs.
9840 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
9841 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
9842 * config/i386/x86gprintrin.h: Add
9843 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
9844 to disable non-GPR ISAs.
9846 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
9848 PR middle-end/101787
9849 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
9851 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
9853 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
9854 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
9855 (vect_is_extending_load, vect_is_integer_truncation): New functions,
9856 moved from aarch64.c but given different names.
9857 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
9858 (aarch64_is_reduction, aarch64_reduc_type)
9859 (aarch64_embedded_comparison_type, aarch64_comparison_type)
9860 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
9861 in favor of the above. Update callers accordingly.
9863 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
9866 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
9867 writing .cpu directive in asm output.
9868 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
9869 (arm_last_printed_arch_string): Delete.
9870 (arm_last-printed_fpu_string): Delete.
9871 (arm_configure_build_target): If use of floating-point/SIMD is
9872 disabled, remove all fp/simd related features from the target ISA.
9873 (last_arm_targ_options): New variable.
9874 (arm_print_asm_arch_directives): Add new parameters. Change order
9875 of emitted directives and handle all cases here.
9876 (arm_file_start): Always call arm_print_asm_arch_directives, move
9877 all generation of .arch/.arch_extension here.
9878 (arm_file_end): Call arm_print_asm_arch.
9879 (arm_declare_function_name): Call arm_print_asm_arch_directives
9880 instead of printing .arch/.fpu directives directly.
9882 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
9884 * config/arm/arm.c (arm_configure_build_target): Don't call
9885 arm_option_reconfigure_globals.
9886 (arm_option_restore): Call arm_option_reconfigure_globals after
9887 reconfiguring the target.
9888 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
9890 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
9892 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
9893 arch_name is always set.
9895 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
9897 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
9898 of vec_select high-half from being added into Neon subtract
9901 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
9903 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
9904 of vec_select high-half from being added into Neon add cost.
9906 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
9908 * cfgloop.h (loops_list::loops_list): Add one optional argument
9909 root and adjust accordingly, update loop tree walking and factor
9911 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
9913 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
9915 PR tree-optimization/101626
9916 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
9917 reverse scalar storage order on a pointer or vector component.
9919 2021-08-05 liuhongt <hongtao.liu@intel.com>
9921 * config/i386/sse.md (cond_<code><mode>): New expander.
9923 2021-08-05 liuhongt <hongtao.liu@intel.com>
9925 * config/i386/sse.md (cond_<code><mode>): New expander.
9927 2021-08-05 liuhongt <hongtao.liu@intel.com>
9929 * config/i386/sse.md (cond_<code><mode>): New expander.
9931 2021-08-04 David Malcolm <dmalcolm@redhat.com>
9934 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
9936 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
9939 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
9940 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
9942 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
9945 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
9946 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
9947 data with SSE register from one memory location to another.
9949 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
9951 * config/s390/s390.c (expand_perm_with_vpdi): New function.
9952 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
9953 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
9954 parameterized expander.
9955 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
9957 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
9959 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
9960 (struct expand_vec_perm_d): Define struct.
9961 (expand_perm_with_merge): New function.
9962 (vectorize_vec_perm_const_1): New function.
9963 (s390_vectorize_vec_perm_const): New function.
9964 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
9966 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
9968 * config/s390/vector.md (V_HW_64): Remove mode iterator.
9969 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
9970 * config/s390/vx-builtins.md
9971 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
9974 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
9976 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
9978 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
9980 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
9981 instead of an unspec.
9983 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
9985 * config/s390/s390-modes.def: Add more vector modes to support
9986 concatenation of two vectors.
9987 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
9989 (s390_expand_merge): Likewise.
9990 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
9991 (s390_expand_merge): New function.
9992 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
9993 Remove constant definitions.
9994 * config/s390/vector.md (V_HW_2): Add mode iterators.
9995 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
9996 (vec_2x_nelts, vec_2x_wide): New mode attributes.
9997 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
9998 New pattern definitions.
9999 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
10000 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
10001 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
10002 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
10004 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
10006 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
10007 emit vec merge pattern.
10009 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
10011 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
10013 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
10014 vec_select high-half from being added into Neon multiply
10016 * rtlanal.c (vec_series_highpart_p): Define.
10017 * rtlanal.h (vec_series_highpart_p): Declare.
10019 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
10021 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
10023 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
10024 vec_select cost from being added into Neon multiply cost.
10026 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
10028 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
10029 which old_loop_vinfo is an epilogue loop that handles a constant
10030 number of iterations.
10032 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
10034 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
10035 when a reanalyzed loop fails to be cheaper than the current
10038 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
10040 * config/aarch64/aarch64.c: Fix a typo.
10042 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
10044 PR gcov-profile/101773
10045 * gcov-io.c (gcov_close): Check return code of a fclose.
10047 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
10050 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
10051 .file statement when needed.
10053 2021-08-04 Richard Biener <rguenther@suse.de>
10055 * tree-vect-data-refs.c (vect_check_gather_scatter):
10056 Include widening conversions only when the result is
10057 still handed by native gather or the current offset
10058 size not already matches the data size.
10059 Also succeed analysis in case there's no native support,
10060 noted by a IFN_LAST ifn and a NULL decl.
10061 (vect_analyze_data_refs): Always consider gathers.
10062 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
10063 Test for no IFN gather rather than decl gather.
10064 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
10065 gather-scatter info and cost emulated gathers accordingly.
10066 (vect_truncate_gather_scatter_offset): Properly test for
10068 (vect_use_strided_gather_scatters_p): Likewise.
10069 (get_load_store_type): Handle emulated gathers and its
10071 (vectorizable_load): Likewise. Emulate them by extracting
10072 scalar offsets, doing scalar loads and a vector construct.
10074 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
10077 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
10078 argument to set m_max_size.
10079 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
10080 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
10081 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
10083 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
10084 Marc Glisse <marc.glisse@inria.fr>
10086 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
10087 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
10088 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
10090 2021-08-04 Richard Biener <rguenther@suse.de>
10092 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
10093 out code to decompose vector loads ...
10094 (optimize_vector_load): ... here. Generalize it to
10095 handle intermediate widening and TARGET_MEM_REF loads
10096 and apply it to loads with a supported vector mode as well.
10098 2021-08-04 Richard Biener <rguenther@suse.de>
10100 PR tree-optimization/101756
10101 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
10102 the result of the reduction epilogue is compatible to the original
10105 2021-08-04 liuhongt <hongtao.liu@intel.com>
10108 * config/i386/i386.md (peephole2): Refine predicate from
10109 register_operand to general_reg_operand.
10111 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
10113 * gimple-range-path.h (path_range_query::dump): Mark override.
10115 2021-08-04 Richard Biener <rguenther@suse.de>
10117 PR tree-optimization/101769
10118 * tree-tailcall.c (eliminate_tail_call): Add the created loop
10119 for the first recursion and return it via the new output parameter.
10120 (optimize_tail_call): Pass through new output param.
10121 (tree_optimize_tail_calls_1): After creating all latches,
10122 add the created loop to the loop tree. Do not mark loops for fixup.
10124 2021-08-04 Martin Liska <mliska@suse.cz>
10126 * doc/invoke.texi: Document threader-mode param.
10128 2021-08-04 liuhongt <hongtao.liu@intel.com>
10130 * config/i386/sse.md (cond_fma<mode>): New expander.
10131 (cond_fms<mode>): Ditto.
10132 (cond_fnma<mode>): Ditto.
10133 (cond_fnms<mode>): Ditto.
10135 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
10137 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
10139 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
10141 * config/rs6000/constraints.md: Remove "e" from the list of available
10142 constraint characters.
10144 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
10146 PR gcov-profile/71672
10147 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
10149 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
10151 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
10153 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
10155 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
10156 try XMM31 to avoid vzeroupper.
10158 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10160 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
10161 -mcpu=neoverse-512tvb.
10162 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
10163 * config/aarch64/aarch64-tune.md: Regenerate.
10164 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
10165 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
10166 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
10167 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
10168 (aarch64_adjust_body_cost): Likewise.
10170 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10172 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
10173 record issue information for operations that occur in the
10176 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10178 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
10179 parameter. Detect cases in which an Advanced SIMD MLA would almost
10180 certainly require a MOV.
10181 (aarch64_count_ops): Update accordingly.
10183 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10185 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
10186 function, split out from...
10187 (aarch64_detect_vector_stmt_subtype): ...here.
10188 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
10190 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10192 * config/aarch64/aarch64-protos.h (sve_vec_cost):
10193 Add gather_load_x32_cost and gather_load_x64_cost.
10194 * config/aarch64/aarch64.c (generic_sve_vector_cost)
10195 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
10196 accordingly, using the values given by the scalar_load * number
10197 of elements calculation that we used previously.
10198 (aarch64_detect_vector_stmt_subtype): Use the new fields.
10200 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10202 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
10203 function, split out from...
10204 (aarch64_adjust_body_cost): ...here.
10206 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10208 * config/aarch64/fractional-cost.h: New file.
10209 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
10210 and cost_fraction.h.
10211 (vec_cost_fraction): New typedef.
10212 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
10213 (aarch64_detect_vector_stmt_subtype): Likewise.
10214 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
10215 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
10217 (aarch64_adjust_body_cost): Likewise.
10218 (aarch64_test_cost_fraction): New function.
10219 (aarch64_run_selftests): Call it.
10221 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10223 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
10225 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
10227 (aarch64_estimated_poly_value): Likewise. Use the least significant
10228 set bit for the minimum and likely values. Use the most significant
10229 set bit for the maximum value.
10231 2021-08-03 liuhongt <hongtao.liu@intel.com>
10233 * config/i386/sse.md (cond_<insn><mode>): New expander.
10234 (cond_mul<mode>): Ditto.
10236 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
10238 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
10240 2021-08-03 liuhongt <hongtao.liu@intel.com>
10242 * config/i386/sse.md (cond_<insn><mode>):New expander.
10243 (cond_mul<mode>): Ditto.
10244 (cond_div<mode>): Ditto.
10246 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
10248 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
10249 check stack_realign_needed for stack realignment.
10250 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
10251 than the largest integer supported by vector register.
10252 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
10253 (MOVE_MAX): Set to bytes of the largest integer supported by
10255 (STORE_MAX_PIECES): New.
10257 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
10259 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
10260 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
10261 data from one memory location to another.
10263 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
10265 PR middle-end/90773
10266 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
10268 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
10270 PR tree-optimization/101724
10271 * params.opt: Remove --param=threader-iterative.
10272 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
10275 2021-08-02 Tom de Vries <tdevries@suse.de>
10277 PR middle-end/101665
10278 * doc/extend.texi (nonnull attribute): Improve documentation.
10280 2021-08-02 Andrew Pinski <apinski@marvell.com>
10282 PR rtl-optimization/101683
10283 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
10285 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
10287 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
10288 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
10289 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
10290 (tree_ssa_phiop_worker): Update call to function above.
10292 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
10295 * config/i386/i386.md (bsr_rex64_1_zext): New.
10296 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
10297 with gen_bsr_rex64_1_zext.
10299 2021-07-31 Jakub Jelinek <jakub@redhat.com>
10302 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
10303 define_insn patterns.
10304 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
10305 Add combine splitters for constant - clz.
10306 (clz<mode>2): Use a temporary pseudo for bsr result.
10308 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
10310 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
10311 _mm_floor_sd, _mm_floor_ss): New.
10313 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
10315 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
10316 _mm_ceil_sd, _mm_ceil_ss): New.
10318 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
10320 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
10321 _mm_blend_ps, _mm_blendv_ps): New.
10323 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
10324 Uroš Bizjak <ubizjak@gmail.com>
10326 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
10327 to generate a conditional move using the carry flag after sub $1.
10328 (peephole2): Eliminate a register-to-register move by inverting
10329 the condition of a conditional move.
10331 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
10333 * config/mmix/mmix.md ("call", "call_value", "*call_real")
10334 ("*call_value_real"): Don't generate rtx mentioning the generic
10335 operands 1 and 2 to "call", and similarly for "call_value".
10336 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
10337 (mmix_print_operand): Use '!' instead of 'p'.
10339 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
10341 * doc/md.texi (call): Correct information about operand 2.
10342 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
10344 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
10346 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
10348 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
10350 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
10352 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
10354 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
10355 const_basic_block..
10356 (*::get_bb_range): Ditto.
10357 (*::bb_range_p): Ditto.
10358 * gimple-range-cache.h: Change prototypes.
10360 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
10362 PR middle-end/90773
10363 * builtins.c (builtin_memcpy_read_str): Change the mode argument
10364 from scalar_int_mode to fixed_size_mode.
10365 (builtin_strncpy_read_str): Likewise.
10366 (gen_memset_value_from_prev): New function.
10367 (builtin_memset_read_str): Change the mode argument from
10368 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
10369 and support CONST_VECTOR.
10370 (builtin_memset_gen_str): Likewise.
10371 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
10373 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
10374 with fixed_size_mode.
10375 (builtin_memset_read_str): Likewise.
10376 * expr.c (widest_int_mode_for_size): Renamed to ...
10377 (widest_fixed_size_mode_for_size): Add a bool argument to
10378 indicate if QI vector mode can be used.
10379 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
10380 instead of widest_int_mode_for_size.
10381 (pieces_addr::adjust): Change the mode argument from
10382 scalar_int_mode to fixed_size_mode.
10383 (op_by_pieces_d): Make m_len read-only. Add a bool member,
10384 m_qi_vector_mode, to indicate that QI vector mode can be used.
10385 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
10386 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
10387 instead of widest_int_mode_for_size.
10388 (op_by_pieces_d::get_usable_mode): Change the mode argument from
10389 scalar_int_mode to fixed_size_mode. Call
10390 widest_fixed_size_mode_for_size instead of
10391 widest_int_mode_for_size.
10392 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
10393 function to return the smallest integer or QI vector mode.
10394 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
10395 instead of widest_int_mode_for_size. Call
10396 smallest_fixed_size_mode_for_size instead of
10397 smallest_int_mode_for_size.
10398 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
10399 indicate that QI vector mode can be used and pass it to
10400 op_by_pieces_d::op_by_pieces_d.
10401 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
10402 instead of widest_int_mode_for_size. Pass memsetp to
10403 widest_fixed_size_mode_for_size to support QI vector mode.
10404 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
10405 (store_by_pieces): Pass memsetp to
10406 store_by_pieces_d::store_by_pieces_d.
10407 (clear_by_pieces_1): Removed.
10408 (clear_by_pieces): Replace clear_by_pieces_1 with
10409 builtin_memset_read_str and pass true to store_by_pieces_d to
10410 support vector mode broadcast.
10411 (string_cst_read_str): Change the mode argument from
10412 scalar_int_mode to fixed_size_mode.
10413 * expr.h (by_pieces_constfn): Change scalar_int_mode to
10415 (by_pieces_prev): Likewise.
10416 * rtl.h (lowpart_subreg_regno): New.
10417 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
10418 simplify_subreg_regno.
10419 * target.def (gen_memset_scratch_rtx): New hook.
10420 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
10421 * doc/tm.texi: Regenerated.
10423 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
10426 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
10427 TARGET_EXPR instead of MODIFY_EXPR.
10429 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
10432 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
10433 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
10434 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
10436 (vec_cmpu<IMSA:mode><mode_i>): New expander.
10438 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
10441 * config/i386/i386-options.c (ix86_option_override_internal):
10442 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
10444 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
10447 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
10448 explicitly dereferencing __a.
10449 (vld1_s64): Likewise.
10450 (vld1_u64): Likewise.
10451 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
10452 and change to VAR13.
10454 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
10456 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
10457 use of m_range_analyzer.
10458 (loop_versioning::lv_dom_walker::before_dom_children): Same.
10459 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
10460 (loop_versioning::prune_loop_conditions): Replace vr_values use
10461 with range_query interface.
10462 (pass_loop_versioning::execute): Use ranger.
10464 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
10467 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
10468 enum values, and emit a warning if they mismatch.
10470 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
10472 * cfgloop.h (as_const): New function.
10473 (class loop_iterator): Rename to ...
10474 (class loops_list): ... this.
10475 (loop_iterator::next): Rename to ...
10476 (loops_list::Iter::fill_curr_loop): ... this and adjust.
10477 (loop_iterator::loop_iterator): Rename to ...
10478 (loops_list::loops_list): ... this and adjust.
10479 (loops_list::Iter): New class.
10480 (loops_list::iterator): New type.
10481 (loops_list::const_iterator): New type.
10482 (loops_list::begin): New function.
10483 (loops_list::end): Likewise.
10484 (loops_list::begin const): Likewise.
10485 (loops_list::end const): Likewise.
10486 (FOR_EACH_LOOP): Remove.
10487 (FOR_EACH_LOOP_FN): Remove.
10488 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
10489 for loop with loops_list instance.
10490 (sort_sibling_loops): Likewise.
10491 (disambiguate_loops_with_multiple_latches): Likewise.
10492 (verify_loop_structure): Likewise.
10493 * cfgloopmanip.c (create_preheaders): Likewise.
10494 (force_single_succ_latches): Likewise.
10495 * config/aarch64/falkor-tag-collision-avoidance.c
10496 (execute_tag_collision_avoidance): Likewise.
10497 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
10498 * config/s390/s390.c (s390_adjust_loops): Likewise.
10499 * doc/loop.texi: Likewise.
10500 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
10501 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
10502 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
10503 (loop_versioning::make_versioning_decisions): Likewise.
10504 * gimple-ssa-split-paths.c (split_paths): Likewise.
10505 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
10506 * graphite.c (canonicalize_loop_form): Likewise.
10507 (graphite_transform_loops): Likewise.
10508 * ipa-fnsummary.c (analyze_function_body): Likewise.
10509 * ipa-pure-const.c (analyze_function): Likewise.
10510 * loop-doloop.c (doloop_optimize_loops): Likewise.
10511 * loop-init.c (loop_optimizer_finalize): Likewise.
10512 (fix_loop_structure): Likewise.
10513 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
10514 (move_loop_invariants): Likewise.
10515 * loop-unroll.c (decide_unrolling): Likewise.
10516 (unroll_loops): Likewise.
10517 * modulo-sched.c (sms_schedule): Likewise.
10518 * predict.c (predict_loops): Likewise.
10519 (pass_profile::execute): Likewise.
10520 * profile.c (branch_prob): Likewise.
10521 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
10522 (sel_find_rgns): Likewise.
10523 * tree-cfg.c (replace_loop_annotate): Likewise.
10524 (replace_uses_by): Likewise.
10525 (move_sese_region_to_fn): Likewise.
10526 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
10527 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
10528 * tree-parloops.c (parallelize_loops): Likewise.
10529 * tree-predcom.c (tree_predictive_commoning): Likewise.
10530 * tree-scalar-evolution.c (scev_initialize): Likewise.
10531 (scev_reset): Likewise.
10532 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
10533 * tree-ssa-live.c (remove_unused_locals): Likewise.
10534 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
10535 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
10536 (tree_ssa_lim_initialize): Likewise.
10537 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
10538 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
10539 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
10540 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
10541 (free_numbers_of_iterations_estimates): Likewise.
10542 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
10543 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
10544 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
10545 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
10546 (pass_scev_cprop::execute): Likewise.
10547 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
10548 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
10549 * tree-ssa-threadupdate.c
10550 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
10551 * tree-vectorizer.c (vectorize_loops): Likewise.
10552 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
10554 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
10556 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
10557 generating a VOIDmode register for e.g the
10558 function_arg_info::end_marker.
10560 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
10562 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
10563 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
10564 (cc mode_attr): Similarly.
10565 (ccz subst_attr): Similarly.
10566 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
10567 * config/h8300/testcompare.md: Remove various cc0 based patterns
10568 that had been commented out. Add pattern to set CCZ from a bit
10571 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
10572 Julian Brown <julian@codesourcery.com>
10573 Kwok Cheung Yeung <kcy@codesourcery.com>
10575 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
10576 'update_stmt' after modification.
10577 (pass_oacc_loop_designation): New function, extracted out of...
10578 (pass_oacc_device_lower): ... this.
10579 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
10580 (make_pass_oacc_loop_designation): New
10581 * passes.def: Add it.
10582 * tree-parloops.c (create_parallel_loop): Adjust.
10583 * tree-pass.h (make_pass_oacc_loop_designation): New.
10585 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
10587 * flag-types.h (enum threader_mode): New.
10588 * params.opt: Add entry for --param=threader-mode.
10589 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
10590 (class back_threader): New.
10591 (back_threader::back_threader): New.
10592 (back_threader::~back_threader): New.
10593 (back_threader::maybe_register_path): New.
10594 (back_threader::find_taken_edge): New.
10595 (back_threader::find_taken_edge_switch): New.
10596 (back_threader::find_taken_edge_cond): New.
10597 (back_threader::resolve_def): New.
10598 (back_threader::resolve_phi): New.
10599 (back_threader::find_paths_to_names): New.
10600 (back_threader::find_paths): New.
10603 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
10604 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
10605 (pass_thread_jumps::execute): Abstract out code...
10606 (try_thread_blocks): ...here.
10607 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
10608 Abstract out threading candidate code to...
10609 (single_succ_to_potentially_threadable_block): ...here.
10610 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
10612 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
10613 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
10614 Return bool from register_jump_thread.
10616 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
10618 * target.def: in0 and in1 do not need to be registers.
10619 * doc/tm.texi: Regenerate.
10621 2021-07-29 liuhongt <hongtao.liu@intel.com>
10624 * config/i386/i386.c (ix86_widen_mult_cost): New function.
10625 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
10628 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
10631 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
10632 (rs6000_preferred_doloop_mode): New hook.
10633 * doc/tm.texi: Regenerate.
10634 * doc/tm.texi.in: Add hook preferred_doloop_mode.
10635 * target.def (preferred_doloop_mode): New hook.
10636 * targhooks.c (default_preferred_doloop_mode): New hook.
10637 * targhooks.h (default_preferred_doloop_mode): New hook.
10638 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
10639 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
10640 and compute_doloop_base_on_mode.
10642 2021-07-28 Martin Sebor <msebor@redhat.com>
10644 PR middle-end/101494
10645 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
10646 and size computation.
10648 2021-07-28 Martin Sebor <msebor@redhat.com>
10650 PR middle-end/101601
10651 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
10653 Handle pointers to functions.
10655 2021-07-28 Martin Sebor <msebor@redhat.com>
10657 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
10658 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
10659 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
10660 (access_ref::access_ref): Same.
10661 (access_ref::phi): Same.
10662 (access_ref::get_ref): Same.
10663 (access_ref::size_remaining): Same.
10664 (access_ref::offset_in_range): Same.
10665 (access_ref::add_offset): Same.
10666 (access_ref::inform_access): Same.
10667 (ssa_name_limit_t::visit_phi): Same.
10668 (ssa_name_limit_t::leave_phi): Same.
10669 (ssa_name_limit_t::next): Same.
10670 (ssa_name_limit_t::next_phi): Same.
10671 (ssa_name_limit_t::~ssa_name_limit_t): Same.
10672 (pointer_query::pointer_query): Same.
10673 (pointer_query::get_ref): Same.
10674 (pointer_query::put_ref): Same.
10675 (pointer_query::flush_cache): Same.
10676 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
10677 (check_nul_terminated_array): Same.
10678 (unterminated_array): Same.
10679 (maybe_warn_for_bound): Same.
10680 (check_read_access): Same.
10681 (warn_for_access): Same.
10682 (get_size_range): Same.
10683 (check_access): Same.
10684 (gimple_call_alloc_size): Move to tree.c.
10685 (gimple_parm_array_size): Move to pointer-query.cc.
10686 (get_offset_range): Same.
10687 (gimple_call_return_array): Same.
10688 (handle_min_max_size): Same.
10689 (handle_array_ref): Same.
10690 (handle_mem_ref): Same.
10691 (compute_objsize): Same.
10692 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
10693 (call_dealloc_argno): Same.
10694 (fndecl_dealloc_argno): Same.
10695 (new_delete_mismatch_p): Same.
10696 (matching_alloc_calls_p): Same.
10697 (warn_dealloc_offset): Same.
10698 (maybe_emit_free_warning): Same.
10699 * builtins.h (check_nul_terminated_array): Move to
10700 gimple-ssa-warn-access.h.
10701 (check_nul_terminated_array): Same.
10702 (warn_string_no_nul): Same.
10703 (unterminated_array): Same.
10704 (class ssa_name_limit_t): Same.
10705 (class pointer_query): Same.
10706 (struct access_ref): Same.
10707 (class range_query): Same.
10708 (struct access_data): Same.
10709 (gimple_call_alloc_size): Same.
10710 (gimple_parm_array_size): Same.
10711 (compute_objsize): Same.
10712 (class access_data): Same.
10713 (maybe_emit_free_warning): Same.
10714 * calls.c (initialize_argument_information): Remove call to
10715 maybe_emit_free_warning.
10716 * gimple-array-bounds.cc: Include new header..
10717 * gimple-fold.c: Same.
10718 * gimple-ssa-sprintf.c: Same.
10719 * gimple-ssa-warn-restrict.c: Same.
10720 * passes.def: Add pass_warn_access.
10721 * tree-pass.h (make_pass_warn_access): Declare.
10722 * tree-ssa-strlen.c: Include new headers.
10723 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
10724 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
10725 * gimple-ssa-warn-access.cc: New file.
10726 * gimple-ssa-warn-access.h: New file.
10727 * pointer-query.cc: New file.
10728 * pointer-query.h: New file.
10730 2021-07-28 Jakub Jelinek <jakub@redhat.com>
10732 PR middle-end/101624
10733 * ubsan.c (maybe_instrument_pointer_overflow,
10734 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
10735 PARM_DECLs or RESULT_DECLs.
10736 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
10738 2021-07-28 Jakub Jelinek <jakub@redhat.com>
10740 PR middle-end/101642
10741 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
10742 to type of bswap16 for comparison.
10743 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
10745 2021-07-28 Richard Biener <rguenther@suse.de>
10747 PR tree-optimization/101615
10748 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
10749 at CTOR SLP graph entries.
10751 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
10753 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
10754 Add "r,w" alternative.
10756 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
10759 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
10760 AVX_U128_DIRTY when all bits are zero.
10762 2021-07-28 Richard Biener <rguenther@suse.de>
10764 PR tree-optimization/101615
10765 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
10766 external nodes cannot be permuted so make them perm_out 0.
10768 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
10771 * config.in: Regenerate.
10772 * config/gcn/gcn-hsa.h (A_FIJI): New define.
10773 (A_900): New define.
10774 (A_906): New define.
10775 (A_908): New define.
10776 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
10777 * config/gcn/gcn.c (output_file_start): Adjust attributes according
10778 to the assembler capabilities.
10779 * config/gcn/mkoffload.c (main): Likewise.
10780 * configure: Regenerate.
10781 * configure.ac: Add tests for LLVM assembler attribute features.
10783 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
10785 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
10786 cond_false and cond_true on branches.
10788 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
10790 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
10793 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
10795 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
10797 2021-07-28 Jakub Jelinek <jakub@redhat.com>
10800 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
10801 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
10802 and has special !TARGET_AVX512VL expansion.
10803 (vashrv2di3<mask_name>): Rename to ...
10804 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
10805 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
10807 2021-07-28 Martin Uecker <muecker@gwdg.de>
10809 * calls.c (maybe_warn_rdwr_sizes): Correct argument
10810 numbers in warning that were switched.
10812 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
10814 PR tree-optimization/101596
10815 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
10816 by using new_type's precision instead.
10818 2021-07-28 liuhongt <hongtao.liu@intel.com>
10821 * config/i386/i386.h (processor_costs): Add new member
10823 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
10824 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
10825 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
10826 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
10827 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
10828 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
10829 generic_cost, core_cost): Initialize integer_to_sse same value
10831 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
10832 * config/i386/i386.c (ix86_builtin_vectorization_cost):
10833 Use integer_to_sse instead of sse_op to calculate the cost of
10836 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
10838 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
10840 (write_init_file): Call write_ovld_static_init.
10842 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
10844 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
10846 (write_init_file): Call write_bif_static_init.
10848 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
10850 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
10851 (TYPE_MAP_SIZE): New macro.
10852 (type_map): New initialized variable.
10853 (typemap_cmp): New function.
10854 (write_type_node): Likewise.
10855 (write_fntype_init): Implement.
10857 2021-07-27 Martin Sebor <msebor@redhat.com>
10859 PR tree-optimization/101584
10860 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
10861 (check_defs): Call it.
10863 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
10865 * tree-ssa-dom.c (dom_jump_threader_simplifier):
10866 Put avail_exprs_stack in the class, instead of passing it to
10867 jump_threader_simplifier.
10868 (dom_jump_threader_simplifier::simplify): Add state argument.
10869 (dom_opt_dom_walker): Add state.
10870 (pass_dominator::execute): Pass state to threader.
10871 (dom_opt_dom_walker::before_dom_children): Use state.
10872 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
10873 arguments by state.
10874 (jump_threader::record_temporary_equivalences_from_phis):
10875 Register equivalences through the state variable.
10876 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
10877 Record ranges in a statement through the state variable.
10878 (jump_threader::simplify_control_stmt_condition): Pass state to
10880 (jump_threader::simplify_control_stmt_condition_1): Same.
10881 (jump_threader::thread_around_empty_blocks): Remove obsolete
10883 (jump_threader::thread_through_normal_block): Record equivalences
10884 on edge through the state variable.
10885 (jump_threader::thread_across_edge): Abstract state pushing.
10886 (jt_state::jt_state): New.
10887 (jt_state::push): New.
10888 (jt_state::pop): New.
10889 (jt_state::register_equiv): New.
10890 (jt_state::record_ranges_from_stmt): New.
10891 (jt_state::register_equivs_on_edge): New.
10892 (jump_threader_simplifier::jump_threader_simplifier): Move from
10894 (jump_threader_simplifier::simplify): Add state argument.
10895 * tree-ssa-threadedge.h (class jt_state): New.
10896 (class jump_threader): Add state to constructor.
10897 (class jump_threader_simplifier): Add state to simplify. Remove
10898 avail_exprs_stack from class.
10899 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
10901 (vrp_jump_threader::vrp_jump_threader): Add state.
10902 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
10904 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
10906 * Makefile.in (OBJS): Add gimple-range-path.o.
10907 * gimple-range-path.cc: New file.
10908 * gimple-range-path.h: New file.
10910 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
10912 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
10913 inside vec_duplicate for all patterns.
10914 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
10915 Push sign/zero-extension inside vec_duplicate.
10917 2021-07-27 Richard Biener <rguenther@suse.de>
10919 PR tree-optimization/101573
10920 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
10921 looking at uninitialized PHI arg defs in some constrained cases.
10922 (warn_uninitialized_vars): Call it.
10923 (execute_early_warn_uninitialized): Calculate dominators.
10925 2021-07-27 Richard Biener <rguenther@suse.de>
10927 PR tree-optimization/39821
10928 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
10929 vector_stmt for widening arithmetic.
10930 (vectorizable_conversion): Adjust.
10932 2021-07-27 Martin Jambor <mjambor@suse.cz>
10934 * cgraph.h (ipa_replace_map): New field force_load_ref.
10935 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
10936 aded new flag load_dereferenced, adjusted comments.
10937 (ipa_get_param_dereferenced): New function.
10938 (ipa_set_param_dereferenced): Likewise.
10939 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
10940 * ipa-cp.c: Include gimple.h.
10941 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
10942 (get_replacement_map): New parameter force_load_ref, set the
10943 appropriate flag in ipa_replace_map if set.
10944 (struct symbol_and_index_together): New type.
10945 (adjust_refs_in_act_callers): New function.
10946 (adjust_references_in_caller): Likewise.
10947 (create_specialized_node): When appropriate, call
10948 adjust_references_in_caller and force only load references.
10949 * ipa-prop.c (load_from_dereferenced_name): New function.
10950 (ipa_analyze_controlled_uses): Also detect loads from a
10951 dereference, harden testing of call statements.
10952 (ipa_write_node_info): Stream the dereferenced flag.
10953 (ipa_read_node_info): Likewise.
10954 (ipa_set_jf_constant): Also create refdesc when jump function
10955 references a variable.
10956 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
10957 also on references of variables and return a symtab_node. Adjust
10959 (propagate_controlled_uses): Also remove references to VAR_DECLs.
10961 2021-07-27 Jakub Jelinek <jakub@redhat.com>
10963 PR middle-end/101586
10964 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
10965 positions above or equal to sz except for diagnostics of flexible
10968 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
10970 PR tree-optimization/78888
10971 * gimple-range-fold.cc (get_letter_range): New.
10972 (fold_using_range::range_of_builtin_call): Call get_letter_range.
10974 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
10976 PR tree-optimization/78888
10977 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
10978 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
10980 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
10981 Marc Glisse <marc.glisse@inria.fr>
10983 * match.pd (rotate): Simplify equality/inequality of rotations.
10984 (bswap): Simplify equality/inequality tests of byte swapping.
10986 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
10988 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
10991 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
10993 * range-op.cc (operator_lshift::fold_range): Pass rel to
10994 base class fold_range.
10995 (operator_rshift::fold_range): Same.
10997 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
11000 * toplev.h (min_align_loops_log): Remove declaration.
11001 (min_align_jumps_log, min_align_labels_log): Likewise.
11002 (min_align_functions_log): Likewise.
11004 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
11006 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
11008 (execute_vrp): Abstract out simplification of conditionals...
11009 (simplify_casted_conds): ...here.
11011 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
11013 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
11014 Add gimple argument.
11015 (array_bounds_checker::check_array_ref): Same.
11016 (array_bounds_checker::check_addr_expr): Same.
11017 (array_bounds_checker::check_array_bounds): Pass statement to
11018 check_array_bounds and check_addr_expr.
11019 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
11020 (check_addr_expr): Same.
11021 (get_value_range): Same.
11023 2021-07-26 Tamar Christina <tamar.christina@arm.com>
11025 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
11026 (sdot_prod, udot_prod): ... This.
11027 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
11029 (<sur>dot_prod<vsi2qi>): ... this.
11030 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
11031 Change operands order.
11032 (<sur>sadv16qi): Use new operands order.
11033 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
11034 vdotq_s32): Use new RTL ordering.
11036 2021-07-26 Tamar Christina <tamar.christina@arm.com>
11038 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
11039 aarch64_types_ternop_suss_qualifiers): New.
11040 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
11041 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
11042 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
11044 2021-07-23 Jakub Jelinek <jakub@redhat.com>
11046 PR rtl-optimization/101562
11047 * expmed.c (store_integral_bit_field): Only use movstrict_optab
11048 if the operand isn't paradoxical.
11050 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
11052 * gimple-array-bounds.h (class array_bounds_checker): Change
11053 ranges type to range_query.
11055 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11057 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
11058 __builtin_memcpy instead of constructing
11059 __builtin_aarch64_simd_oi one vector at a time.
11060 (vst1_u64_x2): Likewise.
11061 (vst1_f64_x2): Likewise.
11062 (vst1_s8_x2): Likewise.
11063 (vst1_p8_x2): Likewise.
11064 (vst1_s16_x2): Likewise.
11065 (vst1_p16_x2): Likewise.
11066 (vst1_s32_x2): Likewise.
11067 (vst1_u8_x2): Likewise.
11068 (vst1_u16_x2): Likewise.
11069 (vst1_u32_x2): Likewise.
11070 (vst1_f16_x2): Likewise.
11071 (vst1_f32_x2): Likewise.
11072 (vst1_p64_x2): Likewise.
11073 (vst1q_s8_x2): Likewise.
11074 (vst1q_p8_x2): Likewise.
11075 (vst1q_s16_x2): Likewise.
11076 (vst1q_p16_x2): Likewise.
11077 (vst1q_s32_x2): Likewise.
11078 (vst1q_s64_x2): Likewise.
11079 (vst1q_u8_x2): Likewise.
11080 (vst1q_u16_x2): Likewise.
11081 (vst1q_u32_x2): Likewise.
11082 (vst1q_u64_x2): Likewise.
11083 (vst1q_f16_x2): Likewise.
11084 (vst1q_f32_x2): Likewise.
11085 (vst1q_f64_x2): Likewise.
11086 (vst1q_p64_x2): Likewise.
11088 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11090 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
11091 __builtin_memcpy instead of constructing
11092 __builtin_aarch64_simd_ci one vector at a time.
11093 (vst1_u64_x3): Likewise.
11094 (vst1_f64_x3): Likewise.
11095 (vst1_s8_x3): Likewise.
11096 (vst1_p8_x3): Likewise.
11097 (vst1_s16_x3): Likewise.
11098 (vst1_p16_x3): Likewise.
11099 (vst1_s32_x3): Likewise.
11100 (vst1_u8_x3): Likewise.
11101 (vst1_u16_x3): Likewise.
11102 (vst1_u32_x3): Likewise.
11103 (vst1_f16_x3): Likewise.
11104 (vst1_f32_x3): Likewise.
11105 (vst1_p64_x3): Likewise.
11106 (vst1q_s8_x3): Likewise.
11107 (vst1q_p8_x3): Likewise.
11108 (vst1q_s16_x3): Likewise.
11109 (vst1q_p16_x3): Likewise.
11110 (vst1q_s32_x3): Likewise.
11111 (vst1q_s64_x3): Likewise.
11112 (vst1q_u8_x3): Likewise.
11113 (vst1q_u16_x3): Likewise.
11114 (vst1q_u32_x3): Likewise.
11115 (vst1q_u64_x3): Likewise.
11116 (vst1q_f16_x3): Likewise.
11117 (vst1q_f32_x3): Likewise.
11118 (vst1q_f64_x3): Likewise.
11119 (vst1q_p64_x3): Likewise.
11121 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
11124 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
11125 hard register when LRA is in progress.
11127 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11129 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
11130 __builtin_memcpy instead of using a union.
11131 (vst1q_s8_x4): Likewise.
11132 (vst1_s16_x4): Likewise.
11133 (vst1q_s16_x4): Likewise.
11134 (vst1_s32_x4): Likewise.
11135 (vst1q_s32_x4): Likewise.
11136 (vst1_u8_x4): Likewise.
11137 (vst1q_u8_x4): Likewise.
11138 (vst1_u16_x4): Likewise.
11139 (vst1q_u16_x4): Likewise.
11140 (vst1_u32_x4): Likewise.
11141 (vst1q_u32_x4): Likewise.
11142 (vst1_f16_x4): Likewise.
11143 (vst1q_f16_x4): Likewise.
11144 (vst1_f32_x4): Likewise.
11145 (vst1q_f32_x4): Likewise.
11146 (vst1_p8_x4): Likewise.
11147 (vst1q_p8_x4): Likewise.
11148 (vst1_p16_x4): Likewise.
11149 (vst1q_p16_x4): Likewise.
11150 (vst1_s64_x4): Likewise.
11151 (vst1_u64_x4): Likewise.
11152 (vst1_p64_x4): Likewise.
11153 (vst1q_s64_x4): Likewise.
11154 (vst1q_u64_x4): Likewise.
11155 (vst1q_p64_x4): Likewise.
11156 (vst1_f64_x4): Likewise.
11157 (vst1q_f64_x4): Likewise.
11159 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
11161 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
11162 instead of constructing __builtin_aarch64_simd_oi one vector
11164 (vst2_u64): Likewise.
11165 (vst2_f64): Likewise.
11166 (vst2_s8): Likewise.
11167 (vst2_p8): Likewise.
11168 (vst2_s16): Likewise.
11169 (vst2_p16): Likewise.
11170 (vst2_s32): Likewise.
11171 (vst2_u8): Likewise.
11172 (vst2_u16): Likewise.
11173 (vst2_u32): Likewise.
11174 (vst2_f16): Likewise.
11175 (vst2_f32): Likewise.
11176 (vst2_p64): Likewise.
11177 (vst2q_s8): Likewise.
11178 (vst2q_p8): Likewise.
11179 (vst2q_s16): Likewise.
11180 (vst2q_p16): Likewise.
11181 (vst2q_s32): Likewise.
11182 (vst2q_s64): Likewise.
11183 (vst2q_u8): Likewise.
11184 (vst2q_u16): Likewise.
11185 (vst2q_u32): Likewise.
11186 (vst2q_u64): Likewise.
11187 (vst2q_f16): Likewise.
11188 (vst2q_f32): Likewise.
11189 (vst2q_f64): Likewise.
11190 (vst2q_p64): Likewise.
11192 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11194 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
11195 instead of constructing __builtin_aarch64_simd_ci one vector
11197 (vst3_u64): Likewise.
11198 (vst3_f64): Likewise.
11199 (vst3_s8): Likewise.
11200 (vst3_p8): Likewise.
11201 (vst3_s16): Likewise.
11202 (vst3_p16): Likewise.
11203 (vst3_s32): Likewise.
11204 (vst3_u8): Likewise.
11205 (vst3_u16): Likewise.
11206 (vst3_u32): Likewise.
11207 (vst3_f16): Likewise.
11208 (vst3_f32): Likewise.
11209 (vst3_p64): Likewise.
11210 (vst3q_s8): Likewise.
11211 (vst3q_p8): Likewise.
11212 (vst3q_s16): Likewise.
11213 (vst3q_p16): Likewise.
11214 (vst3q_s32): Likewise.
11215 (vst3q_s64): Likewise.
11216 (vst3q_u8): Likewise.
11217 (vst3q_u16): Likewise.
11218 (vst3q_u32): Likewise.
11219 (vst3q_u64): Likewise.
11220 (vst3q_f16): Likewise.
11221 (vst3q_f32): Likewise.
11222 (vst3q_f64): Likewise.
11223 (vst3q_p64): Likewise.
11225 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11227 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
11228 instead of constructing __builtin_aarch64_simd_xi one vector
11230 (vst4_u64): Likewise.
11231 (vst4_f64): Likewise.
11232 (vst4_s8): Likewise.
11233 (vst4_p8): Likewise.
11234 (vst4_s16): Likewise.
11235 (vst4_p16): Likewise.
11236 (vst4_s32): Likewise.
11237 (vst4_u8): Likewise.
11238 (vst4_u16): Likewise.
11239 (vst4_u32): Likewise.
11240 (vst4_f16): Likewise.
11241 (vst4_f32): Likewise.
11242 (vst4_p64): Likewise.
11243 (vst4q_s8): Likewise.
11244 (vst4q_p8): Likewise.
11245 (vst4q_s16): Likewise.
11246 (vst4q_p16): Likewise.
11247 (vst4q_s32): Likewise.
11248 (vst4q_s64): Likewise.
11249 (vst4q_u8): Likewise.
11250 (vst4q_u16): Likewise.
11251 (vst4q_u32): Likewise.
11252 (vst4q_u64): Likewise.
11253 (vst4q_f16): Likewise.
11254 (vst4q_f32): Likewise.
11255 (vst4q_f64): Likewise.
11256 (vst4q_p64): Likewise.
11258 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11260 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
11261 instead of constructing __builtin_aarch64_simd_oi one vector
11263 (vtbx4_u8): Likewise.
11264 (vtbx4_p8): Likewise.
11266 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11268 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
11269 instead of constructing __builtin_aarch64_simd_oi one vector
11271 (vtbl3_u8): Likewise.
11272 (vtbl3_p8): Likewise.
11273 (vtbl4_s8): Likewise.
11274 (vtbl4_u8): Likewise.
11275 (vtbl4_p8): Likewise.
11277 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11279 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
11280 instead of constructing __builtin_aarch64_simd_oi one vector
11282 (vqtbx2_u8): Likewise.
11283 (vqtbx2_p8): Likewise.
11284 (vqtbx2q_s8): Likewise.
11285 (vqtbx2q_u8): Likewise.
11286 (vqtbx2q_p8): Likewise.
11287 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
11288 __builtin_aarch64_simd_ci one vector at a time.
11289 (vqtbx3_u8): Likewise.
11290 (vqtbx3_p8): Likewise.
11291 (vqtbx3q_s8): Likewise.
11292 (vqtbx3q_u8): Likewise.
11293 (vqtbx3q_p8): Likewise.
11294 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
11295 __builtin_aarch64_simd_xi one vector at a time.
11296 (vqtbx4_u8): Likewise.
11297 (vqtbx4_p8): Likewise.
11298 (vqtbx4q_s8): Likewise.
11299 (vqtbx4q_u8): Likewise.
11300 (vqtbx4q_p8): Likewise.
11302 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11304 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
11305 instead of constructing __builtin_aarch64_simd_oi one vector
11307 (vqtbl2_u8): Likewise.
11308 (vqtbl2_p8): Likewise.
11309 (vqtbl2q_s8): Likewise.
11310 (vqtbl2q_u8): Likewise.
11311 (vqtbl2q_p8): Likewise.
11312 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
11313 __builtin_aarch64_simd_ci one vector at a time.
11314 (vqtbl3_u8): Likewise.
11315 (vqtbl3_p8): Likewise.
11316 (vqtbl3q_s8): Likewise.
11317 (vqtbl3q_u8): Likewise.
11318 (vqtbl3q_p8): Likewise.
11319 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
11320 __builtin_aarch64_simd_xi one vector at a time.
11321 (vqtbl4_u8): Likewise.
11322 (vqtbl4_p8): Likewise.
11323 (vqtbl4q_s8): Likewise.
11324 (vqtbl4q_u8): Likewise.
11325 (vqtbl4q_p8): Likewise.
11327 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
11330 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
11332 2021-07-22 Andrew Pinski <apinski@marvell.com>
11334 PR tree-optimization/10153
11335 * tree-tailcall.c (create_tailcall_accumulator):
11336 Don't call fold_convert as the type should be correct already.
11337 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
11338 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
11340 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
11342 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
11343 varying_p check for null/non-null check.
11345 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
11347 PR tree-optimization/101511
11348 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
11349 is in ssa2's equiv set, and don't trap if so.
11351 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
11353 PR tree-optimization/101497
11354 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
11357 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
11359 PR tree-optimization/101496
11360 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
11361 first, then vrp_visit_cond_Stmt.
11363 2021-07-22 liuhongt <hongtao.liu@intel.com>
11365 * config/i386/i386-expand.c
11366 (ix86_broadcast_from_integer_constant): Rename to ..
11367 (ix86_broadcast_from_constant): .. this, and extend it to
11369 (ix86_expand_vector_move): Extend to float mode.
11370 * config/i386/i386-features.c
11371 (replace_constant_pool_with_broadcast): Remove.
11372 (remove_partial_avx_dependency_gate): Ditto.
11373 (constant_pool_broadcast): Ditto.
11374 (class pass_constant_pool_broadcast): Ditto.
11375 (make_pass_constant_pool_broadcast): Ditto.
11376 (remove_partial_avx_dependency): Adjust gate.
11377 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
11378 * config/i386/i386-protos.h
11379 (make_pass_constant_pool_broadcast): Remove.
11381 2021-07-22 liuhongt <hongtao.liu@intel.com>
11383 * config/i386/constraints.md (Wb): New constraint.
11385 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
11387 (*ashlqi3_1): Ditto.
11388 (*<insn><mode>3_1): Split to ..
11389 (*ashr<mode>3_1): this, ...
11390 (*lshr<mode>3_1): and this, also extend this pattern to avx512
11392 (*<insn><mode>3_1): Split to ..
11393 (*ashr<mode>3_1): this, ...
11394 (*lshrqi3_1): and this, also extend this pattern to avx512
11396 (*lshrhi3_1): And this, also extend this pattern to avx512
11398 * config/i386/sse.md (k<code><mode>): New define_split after
11399 it to convert generic shift pattern to mask shift ones.
11401 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
11402 Joseph Myers <joseph@codesourcery.com>
11403 Cesar Philippidis <cesar@codesourcery.com>
11405 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
11406 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
11408 * tree-pretty-print.c (dump_omp_clause): Likewise.
11409 * omp-general.c (oacc_verify_routine_clauses): Likewise.
11410 * gimplify.c (gimplify_scan_omp_clauses)
11411 (gimplify_adjust_omp_clauses): Likewise.
11412 * tree-nested.c (convert_nonlocal_omp_clauses)
11413 (convert_local_omp_clauses): Likewise.
11414 * omp-low.c (scan_sharing_clauses): Likewise.
11415 * omp-offload.c (execute_oacc_device_lower): Update.
11417 2021-07-21 Martin Sebor <msebor@redhat.com>
11419 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
11421 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11423 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
11426 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11428 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
11430 (write_fntype_init): New stub function.
11431 (write_init_bif_table): Likewise.
11432 (write_init_ovld_table): New function.
11433 (write_init_file): Implement.
11435 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11437 * config/rs6000/rs6000-gen-builtins.c
11438 (write_autogenerated_header): New function.
11439 (write_decls): Likewise.
11440 (write_extern_fntype): New callback function.
11441 (write_header_file): Implement.
11443 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11445 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
11448 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11450 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
11452 (complete_base_type): Likewise.
11453 (construct_fntype_id): Likewise.
11454 (parse_bif_entry): Call contruct_fntype_id.
11455 (parse_ovld_entry): Likewise.
11457 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11459 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
11460 (MAXOVLDSTANZAS): New macro.
11461 (ovld_stanzas): New variable.
11462 (curr_ovld_stanza): Likewise.
11463 (MAXOVLDS): New macro.
11464 (ovlddata): New struct.
11465 (ovlds): New variable.
11466 (curr_ovld): Likewise.
11467 (max_ovld_args): Likewise.
11468 (parse_ovld_entry): New function.
11469 (parse_ovld_stanza): Likewise.
11470 (parse_ovld): Implement.
11472 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11474 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
11477 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11479 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
11480 (parse_prototype): Implement.
11482 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11484 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
11485 (curr_bif_stanza): New variable.
11486 (stanza_entry): New struct.
11487 (stanza_map): New initialized variable.
11488 (enable_string): Likewise.
11489 (fnkinds): New enum.
11490 (typelist): New struct.
11491 (attrinfo): Likewise.
11492 (MAXRESTROPNDS): New macro.
11493 (prototype): New struct.
11494 (MAXBIFS): New macro.
11495 (bifdata): New struct.
11496 (bifs): New variable.
11497 (curr_bif): Likewise.
11498 (bif_order): Likewise.
11499 (bif_index): Likewise.
11500 (fatal): New function.
11501 (stanza_name_to_stanza): Likewise.
11502 (parse_bif_attrs): New stub function.
11503 (parse_prototype): Likewise.
11504 (parse_bif_entry): New function.
11505 (parse_bif_stanza): Likewise.
11506 (parse_bif): Implement.
11507 (set_bif_order): New function.
11508 (create_bif_order): Implement.
11510 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11512 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
11513 (num_bifs): New variable.
11514 (num_ovld_stanzas): Likewise.
11515 (num_ovlds): Likewise.
11516 (parse_codes): New enum.
11517 (bif_rbt): New variable.
11518 (ovld_rbt): Likewise.
11519 (fntype_rbt): Likewise.
11520 (bifo_rbt): Likewise.
11521 (parse_bif): New stub function.
11522 (create_bif_order): Likewise.
11523 (parse_ovld): Likewise.
11524 (write_header_file): Likewise.
11525 (write_init_file): Likewise.
11526 (write_defines_file): Likewise.
11527 (delete_output_files): New function.
11530 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
11533 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
11534 from CRC32 _builtin functions.
11536 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
11538 * coverage.c (build_gcov_info_var_registration): Mark the object placed
11539 in the linker set as referenced so that it does not get optimized away.
11541 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
11544 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
11546 * config.gcc (riscv*-*-*): Detect which python is available.
11548 2021-07-21 Jakub Jelinek <jakub@redhat.com>
11550 PR middle-end/101535
11551 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
11552 contexts in which decl isn't privatized and for ORT_TARGET return
11553 false if decl is mapped.
11555 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
11557 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
11558 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
11559 has been successfully unrolled.
11561 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
11563 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
11564 exit condition if no tail loop is needed, and if the original exit
11565 condition should therefore be kept as-is.
11566 (tree_transform_and_unroll_loop): Handle that case here too.
11568 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
11570 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
11572 (free_data_refs): Likewise.
11573 * tree-data-ref.h (free_dependence_relations): Likewise.
11574 (free_data_refs): Likewise.
11575 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
11577 (struct component): Likewise.
11578 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
11579 (pcom_worker::~pcom_worker): Likewise.
11580 (pcom_worker::release_chain): Adjust as auto_vec changes.
11581 (pcom_worker::loop): Rename to ...
11582 (pcom_worker::m_loop): ... this.
11583 (pcom_worker::datarefs): Rename to ...
11584 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
11585 (pcom_worker::dependences): Rename to ...
11586 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
11587 (pcom_worker::chains): Rename to ...
11588 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
11589 (pcom_worker::looparound_phis): Rename to ...
11590 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
11592 (pcom_worker::cache): Rename to ...
11593 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
11594 (pcom_worker::release_chain): Adjust for auto_vec changes.
11595 (pcom_worker::release_chains): Adjust for auto_vec and renaming
11597 (release_component): Remove.
11598 (release_components): Adjust for release_component removal.
11599 (component_of): Adjust to use vec.
11600 (merge_comps): Likewise.
11601 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
11602 (pcom_worker::determine_offset): Likewise.
11603 (class comp_ptrs): Remove.
11604 (pcom_worker::split_data_refs_to_components): Adjust for renaming
11605 changes, for comp_ptrs removal with auto_vec.
11606 (pcom_worker::suitable_component_p): Adjust for renaming changes.
11607 (pcom_worker::filter_suitable_components): Adjust for release_component
11609 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
11610 (pcom_worker::find_looparound_phi): Likewise.
11611 (pcom_worker::add_looparound_copies): Likewise.
11612 (pcom_worker::determine_roots_comp): Likewise.
11613 (pcom_worker::single_nonlooparound_use): Likewise.
11614 (pcom_worker::execute_pred_commoning_chain): Likewise.
11615 (pcom_worker::execute_pred_commoning): Likewise.
11616 (pcom_worker::try_combine_chains): Likewise.
11617 (pcom_worker::prepare_initializers_chain): Likewise.
11618 (pcom_worker::prepare_initializers): Likewise.
11619 (pcom_worker::prepare_finalizers_chain): Likewise.
11620 (pcom_worker::prepare_finalizers): Likewise.
11621 (pcom_worker::tree_predictive_commoning_loop): Likewise.
11623 2021-07-20 Martin Sebor <msebor@redhat.com>
11625 PR middle-end/101397
11626 * builtins.c (gimple_call_return_array): Add argument. Correct
11627 offsets for memchr, mempcpy, stpcpy, and stpncpy.
11628 (compute_objsize_r): Adjust offset computation for argument returning
11631 2021-07-20 Martin Sebor <msebor@redhat.com>
11633 PR middle-end/101300
11634 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
11636 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
11638 * function.c (assign_parm_setup_block): Use adjust_address instead
11639 of change_address to preserve MEM_EXPR and friends.
11641 2021-07-20 Martin Sebor <msebor@redhat.com>
11643 * cfgloop.h (single_likely_exit): Adjust by-value argument to
11644 by-const-reference.
11645 * cfgloopanal.c (single_likely_exit): Same.
11646 * cgraph.h (struct cgraph_node): Same.
11647 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
11648 * genautomata.c (merge_states): Same.
11649 * genextract.c (VEC_char_to_string): Same.
11650 * genmatch.c (dt_node::gen_kids_1): Same.
11651 (walk_captures): Adjust by-value argument to by-reference.
11652 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
11653 to by-const-reference.
11654 * gimple.c (gimple_build_call_vec): Same.
11655 (gimple_build_call_internal_vec): Same.
11656 (gimple_build_switch): Same.
11657 (sort_case_labels): Same.
11658 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
11660 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
11661 by-const-reference.
11662 (gimple_build_call_internal_vec): Same.
11663 (gimple_build_switch): Same.
11664 (sort_case_labels): Same.
11665 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
11667 * haifa-sched.c (calc_priorities): Adjust by-value argument to
11668 by-const-reference.
11669 (sched_init_luids): Same.
11670 (haifa_init_h_i_d): Same.
11671 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
11672 (adjust_callers_for_value_intersection): Adjust by-value argument to
11674 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
11675 by-const-reference.
11676 (find_more_contexts_for_caller_subset): Same.
11677 (find_aggregate_values_for_callers_subset): Same.
11678 (copy_useful_known_contexts): Same.
11679 * ipa-fnsummary.c (remap_edge_summaries): Same.
11680 (remap_freqcounting_predicate): Same.
11681 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
11683 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
11684 to by-const-reference.
11685 * ipa-predicate.h (predicate::remap_after_inlining): Same.
11686 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
11687 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
11688 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
11689 * read-rtl.c (add_overload_instance): Same.
11690 * rtl.h (native_decode_rtx): Same.
11691 (native_decode_vector_rtx): Same.
11692 * sched-int.h (sched_init_luids): Same.
11693 (haifa_init_h_i_d): Same.
11694 * simplify-rtx.c (native_decode_vector_rtx): Same.
11695 (native_decode_rtx): Same.
11696 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
11697 (shrink_wrap_one_built_in_call_with_conds): Same.
11698 (shrink_wrap_conditional_dead_built_in_calls): Same.
11699 * tree-data-ref.c (create_runtime_alias_checks): Same.
11700 (compute_all_dependences): Same.
11701 * tree-data-ref.h (compute_all_dependences): Same.
11702 (create_runtime_alias_checks): Same.
11703 (index_in_loop_nest): Same.
11704 * tree-if-conv.c (mask_exists): Same.
11705 * tree-loop-distribution.c (class loop_distribution): Same.
11706 (loop_distribution::create_rdg_vertices): Same.
11707 (dump_rdg_partitions): Same.
11708 (debug_rdg_partitions): Same.
11709 (partition_contains_all_rw): Same.
11710 (loop_distribution::distribute_loop): Same.
11711 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
11712 (oacc_entry_exit_single_gang): Same.
11713 * tree-ssa-loop-im.c (hoist_memory_references): Same.
11714 (loop_suitable_for_sm): Same.
11715 * tree-ssa-loop-niter.c (bound_index): Same.
11716 * tree-ssa-reassoc.c (update_ops): Same.
11717 (swap_ops_for_binary_stmt): Same.
11718 (rewrite_expr_tree): Same.
11719 (rewrite_expr_tree_parallel): Same.
11720 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
11721 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
11722 * tree-ssa-structalias.c (process_all_all_constraints): Same.
11723 (make_constraints_to): Same.
11724 (handle_lhs_call): Same.
11725 (find_func_aliases_for_builtin_call): Same.
11726 (sort_fieldstack): Same.
11727 (check_for_overlaps): Same.
11728 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
11729 (vect_create_cond_for_unequal_addrs): Same.
11730 (vect_create_cond_for_lower_bounds): Same.
11731 (vect_create_cond_for_alias_checks): Same.
11732 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
11733 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
11734 (vect_make_slp_decision): Same.
11735 (vect_slp_bbs): Same.
11736 (duplicate_and_interleave): Same.
11737 (vect_transform_slp_perm_load): Same.
11738 (vect_schedule_slp): Same.
11739 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
11740 (vect_schedule_slp): Same.
11741 (duplicate_and_interleave): Same.
11742 * tree.c (build_vector_from_ctor): Same.
11743 (build_vector): Same.
11744 (check_vector_cst): Same.
11745 (check_vector_cst_duplicate): Same.
11746 (check_vector_cst_fill): Same.
11747 (check_vector_cst_stepped): Same.
11748 * tree.h (build_vector_from_ctor): Same.
11750 2021-07-20 Jakub Jelinek <jakub@redhat.com>
11753 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
11754 type from bool to int.
11755 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
11756 EASY_VECTOR_MSB case if either step or copies is not 1.
11757 (vspltis_shifted): Fix comment typo.
11758 (easy_altivec_constant): Change return type from bool to int, instead
11759 of returning true return byte size of the element mode that should be
11760 used to synthetize the constant.
11761 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
11762 that vspltis_shifted is 0, handle the case where easy_altivec_constant
11763 assumes using different vector mode from CONST_VECTOR's mode.
11764 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
11765 easy_altivec_constant to determine mode in which -1 >> -1 should be
11766 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
11768 2021-07-20 Richard Biener <rguenther@suse.de>
11771 * dwarf2out.h (dwarf_file_data): Add key member.
11772 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
11773 (dwarf_file_hasher::hash): Hash key.
11774 (lookup_filename): Remap the filename and store it in the
11775 filename member of dwarf_file_data when creating a new
11777 (file_name_acquire): Do not remap the filename again.
11778 (maybe_emit_file): Likewise.
11780 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
11782 * config/aarch64/aarch64-simd-builtins.def: Use two variant
11783 generators for all TBL/TBX intrinsics and rename to
11784 consistent forms: qtbl[1234] or qtbx[1234].
11785 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
11787 (aarch64_qtbl1<mode>): This.
11788 (aarch64_tbx1<mode>): Rename to...
11789 (aarch64_qtbx1<mode>): This.
11790 (aarch64_tbl2v16qi): Delete.
11791 (aarch64_tbl3<mode>): Rename to...
11792 (aarch64_qtbl2<mode>): This.
11793 (aarch64_tbx4<mode>): Rename to...
11794 (aarch64_qtbx2<mode>): This.
11795 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
11796 renamed qtbl1 and qtbl2 RTL patterns.
11797 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
11799 (vqtbl1_s8): Likewise.
11800 (vqtbl1_u8): Likewise.
11801 (vqtbl1q_p8): Likewise.
11802 (vqtbl1q_s8): Likewise.
11803 (vqtbl1q_u8): Likewise.
11804 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
11805 (vqtbx1_u8): Likewise.
11806 (vqtbx1_p8): Likewise.
11807 (vqtbx1q_s8): Likewise.
11808 (vqtbx1q_u8): Likewise.
11809 (vqtbx1q_p8): Likewise.
11810 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
11811 (vtbl1_u8): Likewise.
11812 (vtbl1_p8): Likewise.
11813 (vtbl2_s8): Likewise
11814 (vtbl2_u8): Likewise.
11815 (vtbl2_p8): Likewise.
11816 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
11817 (vtbl3_u8): Likewise.
11818 (vtbl3_p8): Likewise.
11819 (vtbl4_s8): Likewise.
11820 (vtbl4_u8): Likewise.
11821 (vtbl4_p8): Likewise.
11822 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
11823 (vtbx2_u8): Likewise.
11824 (vtbx2_p8): Likewise.
11825 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
11826 (vqtbl2_u8): Likewise.
11827 (vqtbl2_p8): Likewise.
11828 (vqtbl2q_s8): Likewise.
11829 (vqtbl2q_u8): Likewise.
11830 (vqtbl2q_p8): Likewise.
11831 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
11832 (vqtbx2_u8): Likewise.
11833 (vqtbx2_p8): Likewise.
11834 (vqtbx2q_s8): Likewise.
11835 (vqtbx2q_u8): Likewise.
11836 (vqtbx2q_p8): Likewise.
11837 (vtbx4_s8): Likewise.
11838 (vtbx4_u8): Likewise.
11839 (vtbx4_p8): Likewise.
11841 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
11844 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
11846 (define_peephole2 atomic_loaddi_fpu): Ditto.
11848 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
11850 * config.gcc (riscv*-*-*): Detect which python is available.
11852 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
11854 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
11855 (smul<mode>3_highpart): ... this.
11856 (mulhu_<mode>): Rename to...
11857 (umul<mode>3_highpart): ... this.
11858 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
11859 MULHU_V2DI, MULHU_V4SI): Adjust.
11861 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
11863 PR tree-optimization/100696
11864 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
11865 * internal-fn.def (IFN_MULH): New internal function.
11866 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
11867 recog normal multiply highpart as IFN_MULH.
11868 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
11871 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
11873 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
11874 (BTF_DEBUGGING_INFO): Likewise.
11875 * doc/tm.texi.in: Document the new macros.
11876 * doc/tm.texi: Regenerated.
11877 * toplev.c: Guard initialization of debug hooks.
11879 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
11881 * flags.h (ctf_debuginfo_p): New function declaration.
11882 * opts.c (ctf_debuginfo_p): New function definition.
11884 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
11887 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
11888 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
11889 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
11890 * config/gcn/gcn-valu.md: Add a warning comment.
11891 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
11892 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
11893 * config/gcn/gcn.md: Add a warning comment.
11894 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
11895 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
11897 (EF_AMDGPU_XNACK): New.
11898 (EF_AMDGPU_SRAM_ECC): New.
11900 (copy_early_debug_info): Use elf_flags.
11901 (main): Handle -mxnack and -msram-ecc options.
11902 * doc/invoke.texi: Document -mxnack and -msram-ecc.
11904 2021-07-19 Andrew Pinski <apinski@marvell.com>
11907 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
11908 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
11910 2021-07-19 Richard Biener <rguenther@suse.de>
11912 PR tree-optimization/101505
11913 * tree-vect-patterns.c (vect_determine_precisions): Walk
11914 PHIs also for loop vectorization.
11916 2021-07-19 Richard Biener <rguenther@suse.de>
11918 * gimple.h (gimple_expr_type): Remove.
11919 * doc/gimple.texi: Remove gimple_expr_type documentation.
11921 2021-07-19 Richard Biener <rguenther@suse.de>
11923 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
11924 (ao_ref_init_from_vn_reference): Likewise.
11925 (fully_constant_reference): Likewise.
11926 (vn_reference_lookup_call): Do not set vr->type to random
11928 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
11930 * tree-vect-generic.c (expand_vector_piecewise): Pass in
11931 whether we expanded parallel.
11932 (expand_vector_parallel): Adjust.
11933 (expand_vector_addition): Likewise.
11934 (expand_vector_comparison): Likewise.
11935 (expand_vector_operation): Likewise.
11936 (expand_vector_scalar_condition): Likewise.
11937 (expand_vector_conversion): Likewise.
11939 2021-07-19 Richard Biener <rguenther@suse.de>
11941 * tree-vrp.c (register_edge_assert_for_2): Use the
11943 (vrp_folder::fold_predicate_in): Likewise.
11944 * vr-values.c (gimple_assign_nonzero_p): Likewise.
11945 (vr_values::extract_range_from_comparison): Likewise.
11946 (vr_values::extract_range_from_ubsan_builtin): Use the
11947 type of the first operand.
11948 (vr_values::extract_range_basic): Push down type
11949 computation, use the appropriate LHS.
11950 (vr_values::extract_range_from_assignment): Use the
11953 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
11956 * common/config/i386/i386-common.c (ix86_handle_option): For
11957 -mgeneral-regs-only, enable the GPR only instructions which are
11958 enabled implicitly by SSE ISAs unless they have been disabled
11961 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
11964 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
11965 ix86_avx_u128_mode_needed.
11966 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
11967 returns AVX register.
11969 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
11971 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
11972 (handle_const_call): Liekise
11973 (handle_pure_call): Liekise
11975 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
11977 PR tree-optimization/96542
11978 * range-op.cc (range_operator::wi_fold_in_parts): New.
11979 (range_operator::fold_range): Call wi_fold_in_parts.
11980 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
11981 * range-op.h (wi_fold_in_parts): Add prototype.
11983 2021-07-16 David Malcolm <dmalcolm@redhat.com>
11985 * doc/analyzer.texi: Add __analyzer_dump_state.
11987 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
11989 * config/rs6000/rbtree.c: New file.
11990 * config/rs6000/rbtree.h: New file.
11992 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
11994 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
11995 (typeinfo): Add restr field.
11996 (match_bracketed_pair): New function.
11997 (match_const_restriction): Implement.
11999 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12001 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
12003 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12005 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
12006 (basetype): Likewise.
12007 (typeinfo): Likewise.
12008 (handle_pointer): New function.
12009 (match_basetype): New stub function.
12010 (match_const_restriction): Likewise.
12011 (match_type): New function.
12013 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12015 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
12017 (advance_line): Likewise.
12018 (safe_inc_pos): Likewise.
12019 (match_identifier): Likewise.
12020 (match_integer): Likewise.
12021 (match_to_right_bracket): Likewise.
12023 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12025 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
12026 (ovld_file): Likewise.
12027 (header_file): Likewise.
12028 (init_file): Likewise.
12029 (defines_file): Likewise.
12030 (pgm_path): Likewise.
12031 (bif_path): Likewise.
12032 (ovld_path): Likewise.
12033 (header_path): Likewise.
12034 (init_path): Likewise.
12035 (defines_path): Likewise.
12036 (LINELEN): New macro.
12037 (linebuf): New variable.
12041 (bif_diag): New function.
12042 (ovld_diag): Likewise.
12044 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12046 * config/rs6000/rs6000-builtin-new.def: New.
12047 * config/rs6000/rs6000-overload.def: New.
12049 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12051 * config/rs6000/rs6000-gen-builtins.c: New.
12053 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12055 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
12056 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
12057 * gengtype-state.c (state_writer::write_state_file_list): Add a
12058 parameter to the fileslist expression for the number of build
12060 (read_state_files_list): Detect build headers and strip the
12061 initial "./" or ".\" from their names.
12062 * gengtype.c (build_headers): New global variable.
12063 (num_build_headers): Likewise.
12064 (open_base_files): Emit #include for each build header.
12065 (main): Detect and count build headers.
12066 * gengtype.h (build_headers): New extern variable.
12067 (num_build_headers): Likewise.
12069 2021-07-16 Richard Biener <rguenther@suse.de>
12071 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
12072 the type of the LHS.
12073 (find_bswap_or_nop_1): Likewise.
12074 (find_bswap_or_nop): Likewise.
12075 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
12077 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
12078 Remove unused parameters, pass in the scalar type. Fix
12079 internal store function handling.
12080 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
12081 (vect_get_vector_types_for_stmt): Move down check for
12082 existing vector stmt after we've determined a scalar type.
12083 Pass down the used scalar type to vect_get_smallest_scalar_type.
12084 * tree-vect-generic.c (expand_vector_condition): Use
12085 the type of the LHS.
12086 (expand_vector_scalar_condition): Likewise.
12087 (expand_vector_operations_1): Likewise.
12088 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
12089 (vect_recog_dot_prod_pattern): Likewise.
12090 (vect_recog_sad_pattern): Likewise.
12091 (vect_recog_widen_op_pattern): Likewise.
12092 (vect_recog_widen_sum_pattern): Likewise.
12093 (vect_recog_mixed_size_cond_pattern): Likewise.
12095 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
12097 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
12098 (dump_eaf_flags): Dump EAF_NOT_RETURNED
12099 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
12100 and EAF_NOT_RETURNED.
12101 (modref_summary::useful_p): Likewise.
12102 (modref_summary_lto::useful_p): Likewise.
12103 (struct) modref_summary_lto: Use eaf_fleags_t.
12104 (deref_flags): Handle EAF_NOT_RETURNED.
12105 (struct escape_point): Use min_flags.
12106 (modref_lattice::init): Add EAF_NOT_RETURNED.
12107 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
12108 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
12110 (analyze_parms): Also analyze const functions; update conition on
12112 (modref_write): Update streaming.
12113 (read_section): Update streaming.
12114 (remap_arg_flags): Use eaf_flags_t.
12115 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
12116 * ipa-modref.h: (eaf_flags_t): New typedef.
12117 (struct modref_summary): Use eaf_flags_t.
12118 * tree-core.h (EAF_NOT_RETURNED): New constant.
12120 2021-07-16 Richard Biener <rguenther@suse.de>
12122 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
12123 the type of the LHS.
12124 (gimple_assign_nonnegative_warnv_p): Likewise.
12125 (gimple_call_nonnegative_warnv_p): Likewise. Return false
12126 if the call has no LHS.
12127 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
12128 * tree-eh.c (stmt_could_throw_1_p): Likewise.
12129 * tree-inline.c (insert_init_stmt): Likewise.
12130 * tree-ssa-loop-niter.c (get_val_for): Likewise.
12131 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
12133 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
12134 gassign *. Use the type of the lhs.
12135 (vn_nary_op_lookup_stmt): Adjust.
12136 (vn_nary_op_insert_stmt): Likewise.
12138 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
12140 * config/s390/predicates.md (bras_sym_operand): Accept all
12141 functions in 64-bit mode, use UNSPEC_PLT31.
12142 (larl_operand): Use UNSPEC_PLT31.
12143 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
12144 (legitimize_pic_address): Likewise.
12145 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
12147 (s390_delegitimize_address): Use UNSPEC_PLT31.
12148 (s390_output_addr_const_extra): Likewise.
12149 (print_operand): Add @PLT to TLS calls, handle %K.
12150 (s390_function_profiler): Mark __fentry__/_mcount as function,
12151 use %K, use UNSPEC_PLT31.
12152 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
12153 (s390_emit_call): Use UNSPEC_PLT31.
12154 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
12155 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
12156 (*movdi_64): Use %K.
12157 (reload_base_64): Likewise.
12158 (*sibcall_brc): Likewise.
12159 (*sibcall_brcl): Likewise.
12160 (*sibcall_value_brc): Likewise.
12161 (*sibcall_value_brcl): Likewise.
12163 (*brasl): Likewise.
12164 (*bras_r): Likewise.
12165 (*brasl_r): Likewise.
12166 (*bras_tls): Likewise.
12167 (*brasl_tls): Likewise.
12168 (main_base_64): Likewise.
12169 (reload_base_64): Likewise.
12170 (@split_stack_call<mode>): Likewise.
12172 2021-07-16 Richard Biener <rguenther@suse.de>
12174 PR tree-optimization/101467
12175 * tree-vect-stmts.c (vect_gen_while): Properly guard
12176 make_temp_ssa_name usage.
12178 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
12180 * config.gcc: Don't use forked print-sysroot-suffix.sh and
12181 t-sysroot-suffix for C-SKY.
12182 * config/csky/print-sysroot-suffix.sh: Delete.
12183 * config/csky/t-csky-linux: Delete.
12184 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
12185 instead of CSKY_MULTILIB_DIRNAMES.
12187 2021-07-16 Richard Biener <rguenther@suse.de>
12189 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
12190 conversion issues with the partial reduction of the reused
12191 vector accumulator.
12193 2021-07-16 Richard Biener <rguenther@suse.de>
12195 * config/i386/i386-options.c (ix86_option_override_internal): Set
12196 param_vect_partial_vector_usage to zero if not set.
12198 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
12201 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
12202 (VALID_INT_MODE_P): Add SDmode and DDmode.
12203 Add TDmode for TARGET_64BIT.
12204 (VALID_DFP_MODE_P): Remove.
12205 * config/i386/i386.c (ix86_hard_regno_mode_ok):
12206 Do not use VALID_DFP_MODE_P.
12208 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
12210 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
12212 (fold_using_range::fold_stmt): Ditto.
12213 (fold_using_range::range_of_range_op): Ditto.
12214 (fold_using_range::range_of_phi): Ditto.
12215 (fold_using_range::range_of_call): Ditto.
12216 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
12217 (fold_using_range::range_of_builtin_call): Ditto.
12218 (fold_using_range::range_of_cond_expr): Ditto.
12219 * gimple-range-fold.h (gimple_range_type): New.
12221 2021-07-15 Martin Sebor <msebor@redhat.com>
12223 PR middle-end/97027
12224 * tree-ssa-strlen.c (handle_assign): New function.
12225 (maybe_warn_overflow): Add argument.
12226 (nonzero_bytes_for_type): New function.
12227 (count_nonzero_bytes): Handle more tree types. Call
12228 nonzero_bytes_for_tye.
12229 (count_nonzero_bytes): Handle types.
12230 (handle_store): Handle stores from function calls.
12231 (strlen_check_and_optimize_call): Move code to handle_assign. Call
12232 it for assignments from function calls.
12234 2021-07-15 David Malcolm <dmalcolm@redhat.com>
12239 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
12241 2021-07-15 David Malcolm <dmalcolm@redhat.com>
12243 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
12245 2021-07-15 Martin Sebor <msebor@redhat.com>
12249 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
12250 (operand_compare::verify_hash_value): Same.
12251 * tree-core.h (OEP_DECL_NAME): New.
12253 2021-07-15 Martin Jambor <mjambor@suse.cz>
12255 * profile-count.h (profile_count::value): Change the return type to
12257 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
12259 * tree-cfg.c (dump_function_to_file): Likewise.
12261 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
12264 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
12265 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
12268 2021-07-15 Richard Biener <rguenther@suse.de>
12270 * tree-vectorizer.h (vect_gen_while): Match up with
12271 vect_gen_while_not.
12272 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
12273 of vect_gen_while_not.
12274 (vect_gen_while_not): Adjust.
12275 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
12277 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
12279 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
12280 (ranger_cache::range_of_def): Call adjust_range.
12281 (ranger_cache::entry_range): Same.
12282 * gimple-range-cache.h (non_null_ref::adjust_range): New.
12283 * gimple-range.cc (gimple_ranger::range_of_expr): Call
12285 (gimple_ranger::range_on_entry): Same.
12287 2021-07-15 Tamar Christina <tamar.christina@arm.com>
12290 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12292 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
12294 2021-07-15 Tamar Christina <tamar.christina@arm.com>
12297 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12299 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
12300 (sdot_prod, udot_prod): ...These.
12301 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
12302 (aarch64_<sur>dot<vsi2qi>): Rename to...
12303 (<sur>dot_prod<vsi2qi>): ...This.
12304 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
12307 2021-07-15 Jakub Jelinek <jakub@redhat.com>
12309 PR middle-end/101437
12310 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
12311 types even if they have non-BLKmode TYPE_MODE.
12313 2021-07-15 Richard Biener <rguenther@suse.de>
12316 * gcc.c (process_command): Process -gtoggle like process_options
12317 would after parsing options.
12319 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
12321 * cfgexpand.c (expand_asm_loc): Adjust.
12322 (expand_asm_stmt): Likewise.
12323 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
12324 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
12325 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
12326 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
12327 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
12328 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
12329 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
12330 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
12331 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
12332 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
12333 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
12334 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
12335 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
12336 * doc/tm.texi: Regenerate.
12337 * target.def: Add location argument to md_asm_adjust.
12339 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
12341 * tree-diagnostic.c (diagnostic_report_current_function): Use the
12342 diagnostic's location, not input_location.
12344 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
12346 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
12348 (expand_asm_stmt): Likewise.
12350 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
12352 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
12353 addressed memory rtx, if any.
12354 (rs6000_split_multireg_move): Fix code formatting.
12355 Handle MMA build built-ins with operands in adjacent memory locations.
12357 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
12359 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
12362 2021-07-14 Jason Merrill <jason@redhat.com>
12364 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
12366 2021-07-14 Jason Merrill <jason@redhat.com>
12368 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
12371 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12373 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
12374 Remove erroneous line.
12376 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
12378 * params.opt (param_evrp_mode): Change default.
12380 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12382 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
12383 (sdot_prod, udot_prod): ...These.
12384 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
12385 (aarch64_<sur>dot<vsi2qi>): Rename to...
12386 (<sur>dot_prod<vsi2qi>): ...This.
12387 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
12390 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12392 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
12394 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12396 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
12398 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12400 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
12402 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12404 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
12405 (usdot_prod<vsi2qi>): ... This.
12406 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
12407 (usdot_prod): ...This.
12408 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
12409 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
12411 (@<sur>dot_prod<vsi2qi>): ...This.
12412 * config/aarch64/aarch64-sve-builtins-base.cc
12413 (svusdot_impl::expand): Use it.
12415 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12417 * optabs.def (usdot_prod_optab): New.
12418 * doc/md.texi: Document it and clarify other dot prod optabs.
12419 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
12420 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
12421 * optabs.c (expand_widen_pattern_expr): Likewise.
12422 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
12423 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
12424 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
12426 (vect_widened_op_tree): Optionally ignore
12428 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
12430 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
12433 * config/i386/driver-i386.c (host_detect_local_cpu): Check
12434 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
12435 Enable UINTR only for 64-bit codegen.
12436 * config/i386/i386-options.c
12437 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
12439 * config/i386/i386.h (ARCH_ARG): New.
12440 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
12441 "[arch|tune] 64" for 64-bit codegen.
12443 2021-07-14 Richard Biener <rguenther@suse.de>
12445 PR tree-optimization/101445
12446 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
12447 of the IV in the correct direction for negative stride
12450 2021-07-14 Jakub Jelinek <jakub@redhat.com>
12453 * godump.c (godump_str_hash): New type.
12454 (godump_container::pot_dummy_types): Use string_hash instead of
12455 ptr_hash in the hash_set.
12457 2021-07-14 Richard Biener <rguenther@suse.de>
12459 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
12460 vector types where the old vector type has a multiple of
12461 the new vector type elements.
12462 (vect_create_partial_epilog): New function, split out from...
12463 (vect_create_epilog_for_reduction): ... here.
12464 (vect_transform_cycle_phi): Reduce the re-used accumulator
12465 to the new vector type.
12467 2021-07-14 Alexandre Oliva <oliva@adacore.com>
12469 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
12470 non-'t'-sized arg check.
12472 2021-07-14 Alexandre Oliva <oliva@adacore.com>
12474 * tree-cfg.c (cleanup_dead_labels_eh): Update
12475 post_landing_pad label upon change of landing pad block's
12477 (cleanup_dead_labels): Check that a removed label is not that
12480 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
12482 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
12484 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
12485 Add Neon to general purpose register case for zero-extend
12487 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
12488 case to prevent some cases opting to go through memory.
12489 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
12490 * rtl.c (rtvec_series_p): Define predicate to determine
12491 whether a vector contains a linear series of integers.
12492 * rtl.h (rtvec_series_p): Define.
12493 * rtlanal.c (vec_series_lowpart_p): Define predicate to
12494 determine if a vector selection is equivalent to the low part
12496 * rtlanal.h (vec_series_lowpart_p): Define.
12497 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
12498 Add vec_select -> subreg simplification.
12500 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
12502 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
12503 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
12504 _mm_test_mix_ones_zeros): New.
12506 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
12507 Richard Biener <rguenther@suse.de>
12509 * gimple.c (gimple_could_trap_p_1): Make S argument a
12510 "const gimple*". Preserve constness in call to
12511 gimple_asm_volatile_p.
12512 (gimple_could_trap_p): Make S argument a "const gimple*".
12513 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
12514 Update function prototypes.
12516 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12518 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
12519 (_loop_vec_info::main_loop_edge): New field.
12520 (_loop_vec_info::skip_main_loop_edge): Likewise.
12521 (_loop_vec_info::skip_this_loop_edge): Likewise.
12522 (_loop_vec_info::reusable_accumulators): Likewise.
12523 (_stmt_vec_info::reduc_scalar_results): Likewise.
12524 (_stmt_vec_info::reused_accumulator): Likewise.
12525 (vect_get_main_loop_result): Declare.
12526 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
12527 reduc_scalar_inputs.
12528 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
12529 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
12530 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
12531 skip_main_loop_edge and skip_this_loop_edge fields.
12532 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
12533 (vect_emit_reduction_init_stmts): New function.
12534 (get_initial_def_for_reduction): Use it.
12535 (get_initial_defs_for_reduction): Likewise. Change the vinfo
12536 parameter to a loop_vec_info.
12537 (vect_create_epilog_for_reduction): Store the scalar results
12538 in the reduc_info. If an epilogue loop is reusing an accumulator
12539 from the main loop, and if the epilogue loop can also be skipped,
12540 try to place the reduction code in the join block. Record
12541 accumulators that could potentially be reused by epilogue loops.
12542 (vect_transform_cycle_phi): When vectorizing epilogue loops,
12543 try to reuse accumulators from the main loop. Record the initial
12544 value in reduc_info for non-SLP reductions too.
12546 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12548 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
12549 adjustment handling. Take the neutral value as an argument,
12550 in place of the code argument.
12551 (vect_transform_cycle_phi): Update accordingly. Handle the
12552 initial values of cond reductions separately from code reductions.
12553 Choose the adjustment here rather than in
12554 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
12556 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12558 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
12559 (neutral_op_for_reduction): ...this, providing a more general
12561 (vect_create_epilog_for_reduction): Update accordingly.
12562 (vectorizable_reduction): Likewise.
12563 (vect_transform_cycle_phi): Likewise.
12565 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12567 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
12568 reduc_info instead of the original stmt_vec_info.
12569 (vect_transform_cycle_phi): Update accordingly.
12571 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12573 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
12574 reduc_info as an additional parameter.
12575 (vect_transform_cycle_phi): Update accordingly.
12577 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12579 * tree-vectorizer.h: Include tree-ssa-operands.h.
12580 (vect_phi_initial_value): New function.
12581 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
12582 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
12583 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
12584 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
12586 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12588 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
12589 the phi results to vectype after creating them. Remove later
12590 conversion code that thus becomes redundant.
12592 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12594 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
12595 the new_phis vector with a reduc_inputs vector. Combine handling
12596 of reduction chains and ncopies > 1.
12598 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12600 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
12601 scalar_results to group_size elements after reducing down from
12602 N*group_size elements. Construct an array_slice of the live-out
12603 stmts and assert that there is one stmt per scalar result.
12605 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12607 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
12608 nested_in_vect_loop and use double_reduc everywhere. Remove dead
12609 assignment to "loop".
12611 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12613 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
12614 vector types first. For scalar types, consider both the preferred
12615 vector mode and the alternative vector modes.
12616 * optabs-query.c (can_vec_mask_load_store_p): Use the same
12617 structure as above, in particular using related_vector_mode
12618 for modes provided by autovectorize_vector_modes.
12620 2021-07-13 Jakub Jelinek <jakub@redhat.com>
12621 Richard Biener <rguenther@suse.de>
12623 PR tree-optimization/101419
12624 * tree-pass.h (PROP_objsz): Define.
12625 (make_pass_early_object_sizes): Declare.
12626 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
12627 there to pass_early_object_sizes, drop parameter.
12628 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
12629 drop parameter, move pass_post_ipa_warn right after that.
12630 * tree-object-size.c (pass_object_sizes::execute): Rename to...
12631 (object_sizes_execute): ... this. Add insert_min_max_p argument.
12632 (pass_data_object_sizes): Move after object_sizes_execute.
12633 (pass_object_sizes): Likewise. In execute method call
12634 object_sizes_execute, drop set_pass_param method and insert_min_max_p
12635 non-static data member and its initializer in the ctor.
12636 (pass_data_early_object_sizes, pass_early_object_sizes,
12637 make_pass_early_object_sizes): New.
12638 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
12639 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
12641 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
12644 * config/riscv/constraints.md ("S"): Update description and remove
12646 * doc/md.texi (Machine Constraints): Document the 'S' constraints
12649 2021-07-13 Richard Biener <rguenther@suse.de>
12652 2021-07-12 Richard Biener <rguenther@suse.de>
12654 * tree-vect-slp.c (vect_slp_region): Show the number of
12655 SLP graph entries in the optimization message.
12657 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
12659 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
12661 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
12662 return type to long.
12663 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
12666 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
12668 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
12669 Query relation between the 2 operands and use it.
12671 2021-07-12 Sergei Trofimovich <siarheit@google.com>
12673 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
12675 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
12678 * config/i386/predicates.md (vec_setm_sse41_operand):
12679 Rename from vec_setm_operand.
12680 (vec_setm_avx2_operand): New predicate.
12681 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
12682 Use vec_setm_sse41_operand as operand 2 predicate.
12683 (vec_set<V_256_512:mode): New expander.
12684 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
12685 as operand 2 predicate.
12687 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
12689 PR tree-optimization/101335
12690 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
12692 2021-07-12 Andrew Pinski <apinski@marvell.com>
12694 * tree-ssa-phiopt.c (match_simplify_replacement): Move
12695 insert of the sequence before the movement of the
12696 statement. Check if to see if the statement is used
12697 outside of the original phi to see if we should move it.
12699 2021-07-12 Richard Biener <rguenther@suse.de>
12701 * dump-context.h (debug_dump_context::debug_dump_context):
12702 Add FILE * parameter defaulted to stderr.
12703 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
12704 * tree-vect-slp.c (dot_slp_tree): New functions.
12706 2021-07-12 Richard Biener <rguenther@suse.de>
12708 PR tree-optimization/101373
12709 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
12710 references when the BB may not return.
12711 (compute_avail): Pass in the function we're working on and
12712 replace cfun references with it. Externally throwing
12713 const calls also possibly terminate the function.
12714 (pass_pre::execute): Pass down the function we're working on.
12715 * gcse.c (compute_hash_table_work): Externally throwing
12716 const/pure calls also need record_last_mem_set_info.
12717 * postreload-gcse.c (record_opr_changes): Looping or externally
12718 throwing const/pure calls also need record_last_mem_set_info.
12720 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
12722 * recog.c (memory_address_addr_space_p): Change the type to bool.
12723 Return true/false instead of 1/0.
12724 (offsettable_memref_p): Ditto.
12725 (offsettable_nonstrict_memref_p): Ditto.
12726 (offsettable_address_addr_space_p): Ditto.
12727 Change the type of addressp indirect function to bool.
12728 * recog.h (memory_address_addr_space_p): Change the type to bool.
12729 (strict_memory_address_addr_space_p): Ditto.
12730 (offsettable_memref_p): Ditto.
12731 (offsettable_nonstrict_memref_p): Ditto.
12732 (offsettable_address_addr_space_p): Ditto.
12733 * reload.c (maybe_memory_address_addr_space_p): Ditto.
12734 (strict_memory_address_addr_space_p): Change the type to bool.
12735 Return true/false instead of 1/0.
12736 (maybe_memory_address_addr_space_p): Change the type to bool.
12738 2021-07-12 Richard Biener <rguenther@suse.de>
12740 * tree-vect-slp.c (vect_slp_region): Show the number of
12741 SLP graph entries in the optimization message.
12743 2021-07-12 Richard Biener <rguenther@suse.de>
12745 PR tree-optimization/101394
12746 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
12747 copies from abnormals for a full redundancy.
12749 2021-07-12 Richard Biener <rguenther@suse.de>
12751 PR middle-end/101423
12752 * gimple.c (gimple_could_trap_p_1): Internal function calls
12754 * tree-eh.c (tree_could_trap_p): Likewise.
12756 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
12759 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
12761 (vmulq_n_u32): Likewise.
12762 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
12763 (vmulq_n_f32): Likewise.
12764 (vmul_n_f16): Likewise.
12765 (vmulq_n_f16): Likewise.
12767 2021-07-12 Martin Liska <mliska@suse.cz>
12769 PR sanitizer/101425
12770 * gcc.c (check_offload_target_name): Call
12771 candidates_list_and_hint only if we have a candidate.
12773 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
12776 * config/arm/neon.md (vec_init): Move to ...
12777 * config/arm/vec-common.md (vec_init): ... here.
12778 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
12780 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
12782 PR tree-optimization/101403
12783 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
12784 signedness of the shift is not the same as the signedness of
12785 the type extension.
12787 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
12788 Uroš Bizjak <ubizjak@gmail.com>
12790 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
12791 divmod of a constant numerator with new define_insn_and_split.
12793 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
12796 * config/i386/i386-expand.c (ix86_expand_call): If a call is
12797 to a non-local-binding, or local but to a public symbol, then
12798 assume that it might be indirected via the lazy symbol binder.
12799 Mark R10 and R10 as clobbered in that case.
12801 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
12804 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
12805 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
12806 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
12808 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
12810 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
12811 log2 range of operands[3] to [1,31].
12812 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
12814 2021-07-09 Sergei Trofimovich <siarheit@google.com>
12816 * doc/md.texi: Don't split @smallexample in multiple @groups.
12818 2021-07-09 Sergei Trofimovich <siarheit@google.com>
12820 * doc/md.texi: Add missing 'see' word.
12822 2021-07-09 Andrew Pinski <apinski@marvell.com>
12824 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
12825 to take sequence and gimple_match_op. Accept the case where
12826 op is a SSA_NAME and one statement in the sequence.
12827 Also allow constants.
12828 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
12829 Update call to phiopt_early_allow. Discard the sequence if not
12832 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
12837 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
12838 instead of ISA_HAS_MSA.
12839 (mips_expand_vec_unpack): Likewise.
12840 (mips_expand_vector_init): Likewise.
12842 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
12844 * config/rs6000/vsx.md (mods_<mode>): Rename to...
12845 (mod<mode>3): ... this.
12846 (modu_<mode>): Rename to...
12847 (umod<mode>3): ... this.
12848 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
12849 MODU_V4SI): Adjust.
12851 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
12853 * config/h8300/shiftrotate.md (variable shifts): Expose condition
12854 code handling for the test before the loop.
12856 2021-07-08 Martin Jambor <mjambor@suse.cz>
12859 * ipa-sra.c (class isra_call_summary): New member
12860 m_before_any_store, initialize it in the constructor.
12861 (isra_call_summary::dump): Dump the new field.
12862 (ipa_sra_call_summaries::duplicate): Copy it.
12863 (process_scan_results): Set it.
12864 (isra_write_edge_summary): Stream it.
12865 (isra_read_edge_summary): Likewise.
12866 (param_splitting_across_edge): Only override
12867 safe_to_import_accesses if m_before_any_store is set.
12869 2021-07-08 Martin Sebor <msebor@redhat.com>
12871 PR bootstrap/101374
12872 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
12873 Use Object Size Type 0 instead of 1.
12875 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
12877 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
12880 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
12882 * match.pd: Simplify an extend-operate-truncate sequence involving
12885 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
12886 Richard Biener <rguenther@suse.de>
12888 PR tree-optimization/40210
12889 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
12890 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
12891 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
12893 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
12896 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
12898 * config/i386/mmx.md (V_32): New mode iterator.
12899 (mov<V_32:mode>): Use V_32 mode iterator.
12900 (*mov<V_32:mode>_internal): Ditto.
12901 (*push<V_32:mode>2_rex64): Ditto.
12902 (*push<V_32:mode>2): Ditto.
12903 (movmisalign<V_32:mode>): Ditto.
12904 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
12905 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
12906 (vec_unpacks_lo_v4qi): New expander.
12907 (vec_unpacks_hi_v4qi): Ditto.
12908 (vec_unpacku_lo_v4qi): Ditto.
12909 (vec_unpacku_hi_v4qi): Ditto.
12910 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
12911 (VALID_INT_MODE_P): Ditto.
12913 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
12916 * config/rs6000/rs6000.md (udivti3): New insn.
12917 (divti3): New insn.
12918 (umodti3): New insn.
12919 (modti3): New insn.
12921 2021-07-07 Martin Sebor <msebor@redhat.com>
12923 PR tree-optimization/100137
12924 PR tree-optimization/99121
12925 PR tree-optimization/97027
12926 * builtins.c (access_ref::access_ref): Also set offmax.
12927 (access_ref::offset_in_range): Define new function.
12928 (access_ref::add_offset): Set offmax.
12929 (access_ref::inform_access): Handle access_none.
12930 (handle_mem_ref): Clear ostype.
12931 (compute_objsize_r): Handle ASSERT_EXPR.
12932 * builtins.h (struct access_ref): Add offmax member.
12933 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
12934 compute_objsize() and simplify.
12936 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
12938 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
12939 and VSX_BUILTIN_STXVP.
12941 2021-07-07 Martin Sebor <msebor@redhat.com>
12944 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
12945 a stray %K from error_at() missed in r12-2088.
12947 2021-07-07 Richard Biener <rguenther@suse.de>
12949 PR tree-optimization/99728
12950 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
12952 (mem_refs_may_alias_p): Add assert we handled aggregate
12954 (sm_seq_valid_bb): Give up when running into aggregate copies.
12955 (ref_indep_loop_p): Handle aggregate copies as never
12956 being invariant themselves but allow other refs to be
12957 disambiguated against them.
12958 (can_sm_ref_p): Do not try to apply store-motion to aggregate
12961 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
12964 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
12967 2021-07-06 Martin Sebor <msebor@redhat.com>
12969 * gimple-pretty-print.c (percent_G_format): Remove.
12970 * tree-diagnostic.c (default_tree_printer): Remove calls.
12971 * tree-pretty-print.c (percent_K_format): Remove.
12972 * tree-pretty-print.h (percent_K_format): Remove.
12974 2021-07-06 Martin Sebor <msebor@redhat.com>
12976 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
12977 Remove %K and use error_at.
12978 (aarch64_expand_fcmla_builtin): Same.
12979 (aarch64_expand_builtin_tme): Same.
12980 (aarch64_expand_builtin_memtag): Same.
12981 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
12982 (arm_expand_builtin): Same.
12983 * config/arm/arm.c (bounds_check): Same.
12985 2021-07-06 Martin Sebor <msebor@redhat.com>
12987 * builtins.c (warn_string_no_nul): Remove %G.
12988 (maybe_warn_for_bound): Same.
12989 (warn_for_access): Same.
12990 (check_access): Same.
12991 (check_strncat_sizes): Same.
12992 (expand_builtin_strncat): Same.
12993 (expand_builtin_strncmp): Same.
12994 (expand_builtin): Same.
12995 (expand_builtin_object_size): Same.
12996 (warn_dealloc_offset): Same.
12997 (maybe_emit_free_warning): Same.
12998 * calls.c (maybe_warn_alloc_args_overflow): Same.
12999 (maybe_warn_nonstring_arg): Same.
13000 (maybe_warn_rdwr_sizes): Same.
13001 * expr.c (expand_expr_real_1): Remove %K.
13002 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
13003 (gimple_fold_builtin_strncat): Same.
13004 * gimple-ssa-sprintf.c (format_directive): Same.
13005 (handle_printf_call): Same.
13006 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
13007 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
13008 (maybe_diag_access_bounds): Same. Call gimple_location.
13009 (check_bounds_or_overlap): Same.
13010 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
13011 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
13012 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
13013 (maybe_diag_stxncpy_trunc): Same.
13014 (handle_builtin_stxncpy_strncat): Same.
13015 (maybe_warn_pointless_strcmp): Same.
13016 * tree-ssa-uninit.c (maybe_warn_operand): Same.
13018 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
13021 * config/i386/predicates.md (vec_setm_operand): Enable
13022 register_operand for TARGET_SSE4_1.
13023 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
13024 as operand 2 predicate. Call ix86_expand_vector_set_var
13025 for non-constant index operand.
13026 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
13027 Call ix86_expand_vector_set_var for non-constant index operand.
13029 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
13031 * config/h8300/jumpcall.md (*branch): When possible, generate
13032 the comparison in CCZN mode.
13033 * config/h8300/predicates.md (simple_memory_operand): Reject all
13034 auto-increment addressing modes.
13036 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
13038 PR bootstrap/100246
13039 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
13042 2021-07-06 Richard Biener <rguenther@suse.de>
13044 * doc/md.texi (vec_fmaddsub<mode>4): Document.
13045 (vec_fmsubadd<mode>4): Likewise.
13046 * optabs.def (vec_fmaddsub$a4): Add.
13047 (vec_fmsubadd$a4): Likewise.
13048 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
13049 (IFN_VEC_FMSUBADD): Likewise.
13050 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
13051 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
13052 (addsub_pattern::build): Likewise.
13053 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
13054 and CFN_VEC_FMSUBADD are not transparent for permutes.
13055 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
13056 (vec_fmsubadd<mode>4): Likewise.
13058 2021-07-06 Richard Biener <rguenther@suse.de>
13060 * doc/invoke.texi (fmove-loop-stores): Document.
13061 * common.opt (fmove-loop-stores): New option.
13062 * opts.c (default_options_table): Enable -fmove-loop-stores
13063 at -O1 but not -Og.
13064 * tree-ssa-loop-im.c (pass_lim::execute): Pass
13065 flag_move_loop_stores instead of true to
13066 loop_invariant_motion_in_fun.
13068 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
13070 * doc/install.texi: Document --with-dsymutil.
13072 2021-07-06 Andrew Pinski <apinski@marvell.com>
13074 PR tree-optimization/101256
13075 * dbgcnt.def (phiopt_edge_range): New counter.
13076 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
13077 Check to make sure the new name is defined in the same
13078 bb as the conditional before duplicating range info.
13079 Also add debug counter.
13081 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
13083 PR rtl-optimization/100328
13084 * config/i386/i386-options.c (ix86_option_override_internal):
13085 Set param_ira_consider_dup_in_all_alts to 0.
13087 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
13089 PR rtl-optimization/100328
13090 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
13092 * ira.c (ira_get_dup_out_num): Adjust as parameter
13093 param_ira_consider_dup_in_all_alts.
13094 * params.opt (ira-consider-dup-in-all-alts): New.
13095 * ira-conflicts.c (process_regs_for_copy): Add one parameter
13096 single_input_op_has_cstr_p.
13097 (get_freq_for_shuffle_copy): New function.
13098 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
13099 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
13101 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
13103 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
13104 generate condition code aware RTL directly.
13106 2021-07-05 Andrew Pinski <apinski@marvell.com>
13108 PR tree-optimization/101039
13109 * match.pd (A CMP 0 ? A : -A): New patterns.
13110 * tree-ssa-phiopt.c (abs_replacement): Delete function.
13111 (tree_ssa_phiopt_worker): Don't call abs_replacement.
13112 Update comment about abs_replacement.
13114 2021-07-05 Andrew Pinski <apinski@marvell.com>
13116 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
13117 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
13119 2021-07-05 Andrew Pinski <apinski@marvell.com>
13121 * tree-ssa-phiopt.c (match_simplify_replacement):
13122 Add early_p argument. Call gimple_simplify_phiopt
13123 instead of gimple_simplify.
13124 (tree_ssa_phiopt_worker): Update call to
13125 match_simplify_replacement and allow unconditionally.
13126 (phiopt_early_allow): New function.
13127 (gimple_simplify_phiopt): New function.
13129 2021-07-05 Andrew Pinski <apinski@marvell.com>
13131 PR middle-end/101237
13132 * fold-const.c (negate_expr_p): Remove call to element_mode
13133 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
13134 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
13135 (fold_negate_expr_1): Likewise.
13136 (const_unop): Likewise.
13137 (fold_cond_expr_with_comparison): Likewise.
13138 (fold_binary_loc): Likewise.
13139 (fold_ternary_loc): Likewise.
13140 (tree_call_nonnegative_warnv_p): Likewise.
13141 * match.pd (-(A + B) -> (-B) - A): Likewise.
13143 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
13145 * configure.ac: Handle --with-dsymutil in the same way as we
13146 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
13147 Extract the type and version for the dsymutil configured or
13148 found by the default searches.
13149 * config.in: Regenerated.
13150 * configure: Regenerated.
13151 * collect2.c (do_dsymutil): Handle locating dsymutil in the
13152 same way as for the assembler and linker.
13153 * config/darwin.h (DSYMUTIL): Delete.
13154 * gcc.c: Report a configured dsymutil correctly.
13155 * exec-tool.in: Allow for dsymutil.
13157 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
13159 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
13160 Handle V4QI and V2HI modes.
13161 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
13162 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
13163 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
13164 Handle 4-byte dual operands with TARGET_XOP and single operands
13165 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
13166 mmx_pshufbv4qi3 for TARGET_SSSE3.
13167 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
13168 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
13169 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
13170 (expand_vec_perm_even_odd_1): Handle V4QI mode.
13171 (expand_vec_perm_broadcast_1): Handle V4QI mode.
13172 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
13173 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
13174 (mmx_pshufbv4qi3): Ditto.
13175 (*mmx_pblendw32): Ditto.
13176 (*mmx_pblendw64): Rename from *mmx_pblendw.
13177 (mmx_punpckhbw_low): New insn_and_split pattern.
13178 (mmx_punpcklbw_low): Ditto.
13180 2021-07-05 Richard Biener <rguenther@suse.de>
13182 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
13183 set LOOP_C_INFINITE on the vectorized loop.
13185 2021-07-05 Richard Biener <rguenther@suse.de>
13187 PR middle-end/101291
13188 * cfgloopmanip.c (loop_version): Set the loop copy of the
13189 versioned loop to the new loop.
13191 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
13194 * config.gcc: Ensure that Darwin biarch definitions are
13195 added before i386.h.
13196 * config/i386/darwin.h (TARGET_64BIT): Remove.
13197 (PR80556_WORKAROUND): New.
13198 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
13199 (DARWIN_SUBARCH_SPEC): New.
13200 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
13201 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
13202 (REAL_LIBGCC_SPEC): Remove.
13203 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
13204 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
13205 (REAL_LIBGCC_SPEC): Remove.
13207 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
13209 PR middle-end/101294
13210 * expr.c (store_constructor): Don't use vec_duplicate on vector.
13212 2021-07-02 Martin Sebor <msebor@redhat.com>
13214 PR middle-end/98871
13215 PR middle-end/98512
13216 * diagnostic.c (get_any_inlining_info): New.
13217 (update_effective_level_from_pragmas): Handle inlining context.
13218 (diagnostic_enabled): Same.
13219 (diagnostic_report_diagnostic): Same.
13220 * diagnostic.h (struct diagnostic_info): Add ctor.
13221 (struct diagnostic_context): Add new member.
13222 * tree-diagnostic.c (set_inlining_locations): New.
13223 (tree_diagnostics_defaults): Set new callback pointer.
13225 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
13227 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
13229 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
13230 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
13231 lxvp and stxvp built-ins.
13232 (mma_init_builtins): Handle lxvp and stxvp built-ins.
13233 (builtin_function_type): Likewise.
13234 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
13236 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13238 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
13239 additional argument for the code.
13240 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
13241 argument for the code. Just return if the ZN bits are useful or
13242 not rather than the old style CC_* enums.
13243 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
13244 more generic shiftqi patterns.
13245 (shifthi_noscratch, shiftsi_noscratch): Similarly.
13246 (shiftqi_noscratch_set_flags): New pattern.
13247 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
13249 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
13251 PR tree-optimization/101223
13252 * range-op.cc (build_lt): Add -1 for signed values.
13253 (built_gt): Subtract -1 for signed values.
13255 2021-07-02 David Faust <david.faust@oracle.com>
13257 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
13258 (btf_asm_type): Likewise.
13260 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13262 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
13263 an array of rtx rather than a pointer to rtx. Add code argument.
13264 (compute_a_shift_length): Similarly.
13265 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
13266 shift itself isn't an operand. Create dummy operand[0] to carry
13267 a mode and pass a suitable rtx code to compute_a_shift_length.
13268 (get_shift_alg): Adjust operand number of clobber in output templates.
13269 (output_a_shift): Make first argument an array of rtx rather than
13270 a pointer to rtx. Add code argument for the type of shift.
13271 Adjust now that the shift itself is no longer an operand.
13272 (compute_a_shift_length): Similarly.
13273 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
13274 iterator rather than nshift_operator.
13275 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
13276 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
13277 and compute_a_shift_length.
13278 (shiftqi_noscratch_clobber_flags): Likewise.
13279 (shifthi_noscratch_clobber_flags): Likewise.
13280 (shiftsi_noscratch_clobber_flags): Likewise.
13282 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
13285 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
13286 dsymutil for BTF/CTF.
13288 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
13291 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
13292 segment to include BTF.
13293 (BTF_INFO_SECTION_NAME): New.
13295 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13297 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
13298 (small_data_operand, memreg_operand, small_insn_p): Likewise.
13299 * config/m32r/m32r.c (call_operand): Adjust return type.
13300 (small_data_operand, memreg_operand): Likewise.
13302 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13304 * config/frv/frv-protos.h (integer_register_operand): Adjust return
13306 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
13307 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
13308 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
13309 (move_source_operand, move_destination_operand): Likewise.
13310 (condexec_source_operand, condexec_dest_operand): Likewise.
13311 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
13312 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
13313 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
13314 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
13315 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
13316 (even_fpr_operand, odd_fpr_operand): Likewise.
13317 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
13318 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
13319 (uint4_operand, uint1_operand, int_2word_operand): Likewise
13320 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
13321 (relational_operator, float_relational_operator): Likewise.
13322 (ccr_eqne_operator, minmax_operator): Likewise.
13323 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
13324 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
13325 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
13326 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
13327 (quad_acc_operand, accg_operand): Likewise.
13329 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13331 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
13332 return type to a bool.
13333 (nonimmediate_nonstack_operand): Likewise.
13334 (xstormy16_splittable_below100_operand): Likewise.
13335 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
13337 (xstormy16_splittable_below100_operand): Likewise.
13339 2021-07-02 Richard Biener <rguenther@suse.de>
13341 PR tree-optimization/101293
13342 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
13343 with combined offsets.
13344 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
13345 combined with the rest of the offset.
13347 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
13349 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
13350 PIC encodings for PE-COFF targets.
13352 2021-07-02 Jakub Jelinek <jakub@redhat.com>
13355 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
13356 Return nullptr for TImode inner mode.
13358 2021-07-02 Richard Biener <rguenther@suse.de>
13360 PR tree-optimization/101280
13361 PR tree-optimization/101173
13362 * gimple-loop-interchange.cc
13363 (tree_loop_interchange::valid_data_dependences): Properly
13364 guard all dependence checks with DDR_REVERSED_P or its
13367 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
13369 * config/i386/i386-expand.c (ix86_expand_builtin):
13370 Add branch to clear odata when ZF is set for asedecenc_expand
13371 and wideaesdecenc_expand.
13373 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
13375 * config/i386/gcc-auto-profile: regenerate
13377 2021-07-02 liuhongt <hongtao.liu@intel.com>
13379 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
13380 (trunc<mode><pmov_dst_4_lower>2): this.
13382 2021-07-01 David Malcolm <dmalcolm@redhat.com>
13384 * diagnostic.h (diagnostic_context::m_file_cache): New field.
13385 * input.c (class fcache): Rename to...
13386 (class file_cache_slot): ...this, making most members private and
13387 prefixing fields with "m_".
13388 (file_cache_slot::get_file_path): New accessor.
13389 (file_cache_slot::get_use_count): New accessor.
13390 (file_cache_slot::missing_trailing_newline_p): New accessor.
13391 (file_cache_slot::inc_use_count): New.
13392 (fcache_buffer_size): Move to...
13393 (file_cache_slot::buffer_size): ...here.
13394 (fcache_line_record_size): Move to...
13395 (file_cache_slot::line_record_size): ...here.
13396 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
13397 (fcache_tab_size): Move to file_cache::num_file_slots.
13398 (diagnostic_file_cache_init): Update for move of fcache_tab
13399 to global_dc->m_file_cache.
13400 (diagnostic_file_cache_fini): Likewise.
13401 (lookup_file_in_cache_tab): Convert to...
13402 (file_cache::lookup_file): ...this.
13403 (diagnostics_file_cache_forcibly_evict_file): Update for move of
13404 fcache_tab to global_dc->m_file_cache, moving most of
13405 implementation to...
13406 (file_cache::forcibly_evict_file): ...this new function and...
13407 (file_cache_slot::evict): ...this new function.
13408 (evicted_cache_tab_entry): Convert to...
13409 (file_cache::evicted_cache_tab_entry): ...this.
13410 (add_file_to_cache_tab): Convert to...
13411 (file_cache::add_file): ...this, moving bulk of implementation
13413 (file_cache_slot::create): ..this new function.
13414 (file_cache::file_cache): New.
13415 (file_cache::~file_cache): New.
13416 (lookup_or_add_file_to_cache_tab): Convert to...
13417 (file_cache::lookup_or_add_file): ..this new function.
13418 (fcache::fcache): Rename to...
13419 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
13421 (fcache::~fcache): Rename to...
13422 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
13424 (needs_read): Convert to...
13425 (file_cache_slot::needs_read_p): ...this.
13426 (needs_grow): Convert to...
13427 (file_cache_slot::needs_grow_p): ...this.
13428 (maybe_grow): Convert to...
13429 (file_cache_slot::maybe_grow): ...this.
13430 (read_data): Convert to...
13431 (file_cache_slot::read_data): ...this.
13432 (maybe_read_data): Convert to...
13433 (file_cache_slot::maybe_read_data): ...this.
13434 (get_next_line): Convert to...
13435 (file_cache_slot::get_next_line): ...this.
13436 (goto_next_line): Convert to...
13437 (file_cache_slot::goto_next_line): ...this.
13438 (read_line_num): Convert to...
13439 (file_cache_slot::read_line_num): ...this.
13440 (location_get_source_line): Update for moving of globals to
13441 global_dc->m_file_cache.
13442 (location_missing_trailing_newline): Likewise.
13443 * input.h (class file_cache_slot): New forward decl.
13444 (class file_cache): New.
13446 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
13448 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
13449 128-bit floating point conditional move support.
13450 (have_compare_and_set_mask): Add IEEE 128-bit floating point
13452 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
13453 (mov<mode>cc_p10, IEEE128 iterator): New insn.
13454 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
13455 (fpmask<mode>, IEEE128 iterator): New insn.
13456 (xxsel<mode>, IEEE128 iterator): New insn.
13458 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
13461 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
13463 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
13465 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
13467 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
13469 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
13470 (vec_duplicate<mode>): New expander.
13472 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
13475 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
13477 (ix86_byte_broadcast): New function.
13478 (ix86_convert_const_wide_int_to_broadcast): Likewise.
13479 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
13480 size is 16 bytes or bigger.
13481 (ix86_broadcast_from_integer_constant): New function.
13482 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
13483 to broadcast if mode size is 16 bytes or bigger.
13484 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
13486 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
13488 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
13490 * config/i386/predicates.md (ix86_endbr_immediate_operand):
13491 Return true/false instead of 1/0.
13492 (movq_parallel): Ditto.
13494 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
13496 * recog.c (general_operand): Return true/false instead of 1/0.
13497 (register_operand): Ditto.
13498 (immediate_operand): Ditto.
13499 (const_int_operand): Ditto.
13500 (const_scalar_int_operand): Ditto.
13501 (const_double_operand): Ditto.
13502 (push_operand): Ditto.
13503 (pop_operand): Ditto.
13504 (memory_operand): Ditto.
13505 (indirect_operand): Ditto.
13507 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
13509 * genpreds.c (write_predicate_subfunction):
13510 Change the type of written subfunction to bool.
13511 (write_one_predicate_function):
13512 Change the type of written function to bool.
13513 (write_tm_preds_h): Ditto.
13514 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
13515 * recog.c (general_operand): Change the type to bool.
13516 (address_operand): Ditto.
13517 (register_operand): Ditto.
13518 (pmode_register_operand): Ditto.
13519 (scratch_operand): Ditto.
13520 (immediate_operand): Ditto.
13521 (const_int_operand): Ditto.
13522 (const_scalar_int_operand): Ditto.
13523 (const_double_operand): Ditto.
13524 (nonimmediate_operand): Ditto.
13525 (nonmemory_operand): Ditto.
13526 (push_operand): Ditto.
13527 (pop_operand): Ditto.
13528 (memory_operand): Ditto.
13529 (indirect_operand): Ditto.
13530 (ordered_comparison_operator): Ditto.
13531 (comparison_operator): Ditto.
13532 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
13533 Change the type of indirect predicate function to bool.
13534 * config/rs6000/rs6000.c (easy_vector_constant):
13535 Change the type to bool.
13536 * config/mips/mips-protos.h (m16_based_address_p):
13537 Change the type of operand 3 to bool.
13539 2021-07-01 Richard Biener <rguenther@suse.de>
13541 PR tree-optimization/101280
13542 PR tree-optimization/101173
13543 * gimple-loop-interchange.cc
13544 (tree_loop_interchange::valid_data_dependences): Revert
13545 previous change and instead correctly handle DDR_REVERSED_P
13548 2021-07-01 Richard Biener <rguenther@suse.de>
13550 PR tree-optimization/101278
13551 * tree-ssa-dse.c (dse_classify_store): First check for
13552 uses, then ignore stmt for chaining purposes.
13554 2021-07-01 Richard Biener <rguenther@suse.de>
13556 PR tree-optimization/100778
13557 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
13558 vectorized ops ahead of their scalar BB.
13560 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
13563 * config/i386/i386.md (*nabs<dwi>2_doubleword):
13564 New insn_and_split pattern.
13565 (*nabs<dwi>2_1): Ditto.
13566 * config/i386/i386-features.c
13567 (general_scalar_chain::compute_convert_gain):
13568 Handle (NEG (ABS (...))) RTX. Rewrite src code
13569 scanner as switch statement.
13570 (general_scalar_chain::convert_insn):
13571 Handle (NEG (ABS (...))) RTX.
13572 (general_scalar_to_vector_candidate_p):
13573 Detect (NEG (ABS (...))) RTX. Reorder case statements
13574 for (AND (NOT (...) ...)) fallthrough.
13576 2021-07-01 Richard Biener <rguenther@suse.de>
13578 PR tree-optimization/101178
13579 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
13580 (slpg::perm_in): Add.
13581 (slpg::get_perm_in): Remove.
13582 (slpg::get_perm_materialized): Add.
13583 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
13584 during permute propagation and materialization.
13586 2021-07-01 Jakub Jelinek <jakub@redhat.com>
13589 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
13591 2021-07-01 Jakub Jelinek <jakub@redhat.com>
13593 PR middle-end/94366
13594 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
13595 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
13596 use boolean_type_node instead of integer_type_node as NE_EXPR type.
13597 (lower_reduction_clauses): Likewise.
13599 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
13601 * config/gcn/gcn.c: Include dwarf2.h.
13602 (gcn_addr_space_debug): New function.
13603 (TARGET_ADDR_SPACE_DEBUG): New hook.
13605 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
13607 * common/config/gcn/gcn-common.c
13608 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
13609 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
13611 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
13612 (gcn_frame_pointer_rqd): New function.
13613 (TARGET_FRAME_POINTER_REQUIRED): New hook.
13615 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
13617 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
13618 prologue register saves.
13619 (gcn_debug_unwind_info): Use UI_DWARF2.
13620 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
13621 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
13622 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
13623 (DWARF_LINK_REGISTER): New define.
13624 (FIRST_PSEUDO_REGISTER): Increment.
13625 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
13626 (CALL_USED_REGISTERS): Likewise.
13627 (REGISTER_NAMES): Likewise.
13629 2021-06-30 Richard Biener <rguenther@suse.de>
13631 PR tree-optimization/101267
13632 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
13633 API and use SLP compatible interface of vect_is_simple_use.
13634 Reject not vectorized SLP defs for callers that do not support
13636 (vect_check_store_rhs): Handle masked stores and pass down
13637 the appropriate operator index.
13638 (vectorizable_call): Adjust.
13639 (vectorizable_store): Likewise.
13640 (vectorizable_load): Likewise. Handle SLP pecularity of
13642 (vect_is_simple_use): Remove special-casing of masked stores.
13644 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
13646 * common.opt (foffload): Remove help as Driver only.
13647 * gcc.c (display_help): Add -foffload.
13649 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
13651 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
13652 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
13653 a candidate list; better inform no offload target is configured
13654 and fix hint extraction when passed target is not '\0' at [len].
13655 * common.opt (foffload): Add tailing '.'.
13656 (foffload-options): Likewise; fix flag name in the help string.
13658 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
13661 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
13662 (vcage_f32): Gate comparison on __FAST_MATH__.
13663 (vcageq_f32): Likewise.
13664 (vcale_f32): Likewise.
13665 (vcaleq_f32): Likewise.
13666 (vcagt_f32): Likewise.
13667 (vcagtq_f32): Likewise.
13668 (vcalt_f32): Likewise.
13669 (vcaltq_f32): Likewise.
13670 (vcage_f16): Likewise.
13671 (vcageq_f16): Likewise.
13672 (vcale_f16): Likewise.
13673 (vcaleq_f16): Likewise.
13674 (vcagt_f16): Likewise.
13675 (vcagtq_f16): Likewise.
13676 (vcalt_f16): Likewise.
13677 (vcaltq_f16): Likewise.
13679 2021-06-30 Richard Biener <rguenther@suse.de>
13681 PR tree-optimization/101264
13682 * tree-vect-slp.c (vect_optimize_slp): Propagate the
13683 computed perm_in to all "any" permute successors
13684 we cannot de-duplicate immediately.
13686 2021-06-30 liuhongt <hongtao.liu@intel.com>
13689 * config/i386/sse.md
13690 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
13692 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
13694 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
13695 * config/i386/subst.md (maskz_scalar): New define_subst.
13696 (maskz_scalar_name): New subst_attr.
13697 (maskz_scalar_op5): Ditto.
13698 (round_saeonly_maskz_scalar_op5): Ditto.
13699 (round_saeonly_maskz_scalar_operand5): Ditto.
13701 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
13703 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
13704 Increase code CSECT alignment to at least 32 bytes.
13705 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
13706 alignment designation.
13708 2021-06-29 Sergei Trofimovich <siarheit@google.com>
13710 * doc/generic.texi: Fix s/net yet/not yet/ typo.
13712 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
13714 PR tree-optimization/101254
13715 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
13716 wrapping/non-wrapping when setting the result range.
13718 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
13720 * value-query.cc (gimple_range_global): Allow phis.
13722 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
13724 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
13725 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
13726 statement for location context.
13727 (check_for_binary_op_overflow): Ditto.
13728 (simplify_using_ranges::get_vr_for_comparison): Ditto.
13729 (simplify_using_ranges::compare_name_with_value): Ditto.
13730 (simplify_using_ranges::compare_names): Ditto.
13731 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
13732 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
13733 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
13734 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
13735 (simplify_using_ranges::two_valued_val_range_p): Ditto.
13736 (simplify_using_ranges::simplify): Ditto.
13737 * vr-values.h: Adjust prototypes.
13739 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
13742 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
13744 2021-06-29 Julian Brown <julian@codesourcery.com>
13746 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
13747 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
13748 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
13750 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
13751 (MAX_FIXED_MODE_SIZE): Change to 128.
13753 2021-06-29 Julian Brown <julian@codesourcery.com>
13755 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
13756 (s_mnemonic): Add clrsb.
13757 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
13758 (clrsb<mode>2): Add expander for SImode/DImode.
13760 2021-06-29 Julian Brown <julian@codesourcery.com>
13762 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
13763 muldi3): Add patterns.
13765 2021-06-29 Julian Brown <julian@codesourcery.com>
13767 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
13768 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
13770 2021-06-29 Julian Brown <julian@codesourcery.com>
13772 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
13774 2021-06-29 Joseph Myers <joseph@codesourcery.com>
13776 * btfout.c, ctfout.c: Include "memmodel.h".
13778 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
13780 * gcc.c (check_offload_target_name): Cast len argument to
13781 %q.*s to 'int'; avoid -Wstringop-truncation warning.
13783 2021-06-29 Richard Biener <rguenther@suse.de>
13785 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
13786 to "any" permute nodes and relax "any" permute proapgation
13787 during iterative backward propagation.
13789 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
13792 * common.opt (-foffload=): Update description.
13793 (-foffload-options=): New.
13794 * doc/invoke.texi (C Language Options): Document
13795 -foffload and -foffload-options.
13796 * gcc.c (check_offload_target_name): New, split off from
13797 handle_foffload_option.
13798 (check_foffload_target_names): New.
13799 (handle_foffload_option): Handle -foffload=default.
13800 (driver_handle_option): Update for -foffload-options.
13801 * lto-opts.c (lto_write_options): Use -foffload-options
13802 instead of -foffload.
13803 * lto-wrapper.c (merge_and_complain, append_offload_options):
13805 * opts.c (common_handle_option): Likewise.
13807 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
13809 * doc/invoke.texi (C Language Options): Sort options
13810 alphabetically in optlist and also the description itself.
13811 Remove leftover -fallow-single-precision from and add missing
13812 -fgnu-tm to the optlist.
13814 2021-06-29 Richard Biener <rguenther@suse.de>
13816 * tree-vect-slp.c (slpg_vertex::visited): Remove.
13817 (vect_slp_perms_eq): Handle -1 permutes.
13818 (vect_optimize_slp): Rewrite permute propagation.
13820 2021-06-29 Jakub Jelinek <jakub@redhat.com>
13823 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
13824 perform the optimization in GENERIC when sanitizing and x has a
13827 2021-06-29 Richard Biener <rguenther@suse.de>
13829 PR tree-optimization/101242
13830 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
13831 PHIs with not represented initial values as leafs.
13833 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
13835 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
13836 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
13837 "register" keyword.
13838 (pdp11_initial_elimination_offset) Remove unused variable.
13839 (pdp11_cmp_length) Ditto.
13840 (pdp11_insn_cost): Ditto, and fix signedness warning.
13842 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
13844 * btfout.c: Include tm_p.h.
13847 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
13849 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
13851 (bpf_expand_epilogue): Likewise.
13852 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
13853 Do not define DBX_DEBUGGING_INFO.
13855 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
13857 * doc/invoke.texi: Document the CTF and BTF debug info options.
13859 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
13860 David Faust <david.faust@oracle.com>
13861 Jose E. Marchesi <jose.marchesi@oracle.com>
13862 Weimin Pan <weimin.pan@oracle.com>
13864 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
13865 GTFILES. Add new object files.
13866 * common.opt: Add CTF and BTF debug info options.
13867 * btfout.c: New file.
13868 * ctfc.c: Likewise.
13869 * ctfc.h: Likewise.
13870 * ctfout.c: Likewise.
13871 * dwarf2ctf.c: Likewise.
13872 * dwarf2ctf.h: Likewise.
13873 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
13875 * dwarf2out.c (dwarf2out_source_line): Likewise.
13876 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
13878 (debug_format_do_cu): New function.
13879 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
13881 Include dwarf2ctf.c.
13882 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
13884 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
13885 (CTF_DEBUG): New bitmask.
13886 (BTF_DEBUG): Likewise.
13887 (enum ctf_debug_info_levels): New enum.
13888 * gengtype.c (open_base_files): Handle ctfc.h.
13889 (main): Handle uint32_t type.
13890 * flags.h (btf_debuginfo_p): New definition.
13891 (dwarf_based_debuginfo_p): Likewise.
13892 * opts.c (debug_type_names): Add entries for CTF and BTF.
13893 (btf_debuginfo_p): New function.
13894 (dwarf_based_debuginfo_p): Likewise.
13895 (common_handle_option): Handle -gctfN and -gbtf options.
13896 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
13897 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
13900 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
13902 * dwarf2out.c (AT_class): Function is no longer static.
13903 (AT_int): Likewise.
13904 (AT_unsigned): Likewise.
13905 (AT_loc): Likewise.
13906 (get_AT): Likewise.
13907 (get_AT_string): Likewise.
13908 (get_AT_flag): Likewise.
13909 (get_AT_unsigned): Likewise.
13910 (get_AT_ref): Likewise.
13911 (new_die_raw): Likewise.
13912 (lookup_decl_die): Likewise.
13913 (base_type_die): Likewise.
13914 (add_name_attribute): Likewise.
13915 (add_AT_int): Likewise.
13916 (add_AT_unsigned): Likewise.
13917 (add_AT_loc): Likewise.
13918 (dw_get_die_tag): New function.
13919 (dw_get_die_child): Likewise.
13920 (dw_get_die_sib): Likewise.
13921 (struct dwarf_file_data): Move from here to dwarf2out.h
13922 (struct dw_attr_struct): Likewise.
13923 * dwarf2out.h: Analogous changes.
13925 2021-06-28 Martin Jambor <mjambor@suse.cz>
13928 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
13929 members m_dead_stmts and m_dead_ssas.
13930 * ipa-param-manipulation.c
13931 (ipa_param_body_adjustments::mark_dead_statements): New function.
13932 (ipa_param_body_adjustments::common_initialization): Call it on
13933 all removed but not split parameters.
13934 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
13936 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
13938 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
13939 dead debug statements.
13940 (copy_phis_for_bb): Do not copy dead PHI nodes.
13942 2021-06-28 Martin Jambor <mjambor@suse.cz>
13945 * symtab-clones.h (clone_info): Removed member param_adjustments.
13946 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
13947 deal with pass-through splits now.
13948 (ipa_param_performed_split): Removed.
13949 (ipa_param_adjustments::modify_call): Adjusted parameters.
13950 (class ipa_param_body_adjustments): Adjusted parameters of
13951 register_replacement, modify_gimple_stmt and modify_call_stmt.
13952 (ipa_verify_edge_has_no_modifications): Declare.
13953 (ipa_edge_modifications_finalize): Declare.
13954 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
13955 performed_splits processing, pas only edge to padjs->modify_call,
13956 check that call arguments were not modified if they should not have
13958 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
13960 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
13961 (ipa_edge_modification_info): Likewise.
13962 (ipa_edge_modification_sum): Likewise.
13963 (ipa_edge_modifications): New edge summary.
13964 (ipa_verify_edge_has_no_modifications): New function.
13965 (transitive_split_p): Removed.
13966 (transitive_split_map): Likewise.
13967 (init_transitive_splits): Likewise.
13968 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
13969 summary instead of performed_splits.
13970 (ipa_param_body_adjustments::register_replacement): Drop dummy
13971 parameter, set base_index of the created ipa_param_body_replacement.
13972 (phi_arg_will_live_p): New function.
13973 (ipa_param_body_adjustments::common_initialization): Do not create
13974 IPA_SRA dummy decls.
13975 (simple_tree_swap_info): Removed.
13976 (remap_split_decl_to_dummy): Likewise.
13977 (record_argument_state_1): New function.
13978 (record_argument_state): Likewise.
13979 (ipa_param_body_adjustments::modify_call_stmt): New parameter
13980 orig_stmt. Do not work with dummy decls, save necessary info about
13981 changes to ipa_edge_modifications.
13982 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
13983 orig_stmt, pass it to modify_call_stmt.
13984 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
13985 modify_gimple_stmt.
13986 (ipa_edge_modifications_finalize): New function.
13987 * tree-inline.c (remap_gimple_stmt): Pass original statement to
13988 modify_gimple_stmt.
13989 (copy_phis_for_bb): Do not copy dead PHI nodes.
13990 (expand_call_inline): Do not remap performed_splits.
13991 (update_clone_info): Likewise.
13992 * toplev.c: Include ipa-param-manipulation.h.
13993 (toplev::finalize): Call ipa_edge_modifications_finalize.
13995 2021-06-28 Andrew Pinski <apinski@marvell.com>
13997 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
13998 info if we're the only things setting the target PHI.
13999 (value_replacement): Don't duplicate range here.
14000 (minmax_replacement): Likewise.
14002 2021-06-28 Richard Biener <rguenther@suse.de>
14004 PR tree-optimization/101229
14005 * gimple-walk.c (gimple_walk_op): Handle PHIs.
14007 2021-06-28 Martin Liska <mliska@suse.cz>
14009 * config/v850/v850.c (construct_dispose_instruction): Allocate
14011 (construct_prepare_instruction): Likewise.
14013 2021-06-28 Martin Liska <mliska@suse.cz>
14015 * config/v850/v850.c (v850_option_override): Build default
14017 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
14018 ignored for inlining.
14019 (TARGET_CAN_INLINE_P): New.
14021 2021-06-28 Richard Biener <rguenther@suse.de>
14023 PR tree-optimization/101207
14024 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
14025 permute eliding for load permutations properly.
14027 2021-06-28 Richard Biener <rguenther@suse.de>
14029 PR tree-optimization/101173
14030 * gimple-loop-interchange.cc
14031 (tree_loop_interchange::valid_data_dependences): Disallow outer
14032 loop dependence distance of zero.
14034 2021-06-28 liuhongt <hongtao.liu@intel.com>
14037 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
14038 define_insn_and_split.
14039 (*avx_cmp<mode>3_ltint): Ditto.
14040 (*avx2_pcmp<mode>3_3): Ditto.
14041 (*avx2_pcmp<mode>3_4): Ditto.
14042 (*avx2_pcmp<mode>3_5): Ditto.
14044 2021-06-28 liuhongt <hongtao.liu@intel.com>
14046 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
14047 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
14048 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
14049 IX86_BUILTIN_PBLENDVB128): Replace icode with
14051 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
14053 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
14054 New pre_reload splitter.
14056 2021-06-27 Andrew Pinski <apinski@marvell.com>
14058 PR middle-end/101230
14059 * fold-const.c (fold_ternary_loc): Check
14060 the return value of invert_tree_comparison.
14062 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
14064 * config.gcc: Add SPDX License Identifier.
14065 (powerpc-ibm-aix789): Default to aix73.h.
14066 (powerpc-ibm-aix7.2.*.*): New stanza.
14067 * config/rs6000/aix72.h: Add SPDX License Identifier.
14068 * config/rs6000/aix73.h: New file.
14070 2021-06-26 Jason Merrill <jason@redhat.com>
14072 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
14074 2021-06-26 Andrew Pinski <apinski@marvell.com>
14076 * genmatch.c (lower_cond): Copy for_subst_vec
14077 for the simplify also.
14078 (lower): Swap the order for lower_for and lower_cond.
14080 2021-06-26 Andrew Pinski <apinski@marvell.com>
14082 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
14083 flow senatitive info on the moved ssa set.
14085 2021-06-26 Andrew Pinski <apinski@marvell.com>
14087 * fold-const.c (fold_cond_expr_with_comparison):
14088 Exand arg0 into comp_code, arg00, and arg01.
14089 (fold_ternary_loc): Use invert_tree_comparison
14090 instead of fold_invert_truthvalue for the case
14091 where we have A CMP B ? C : A.
14093 2021-06-25 Martin Sebor <msebor@redhat.com>
14095 PR middle-end/101216
14096 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
14098 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
14100 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
14102 2021-06-25 Richard Biener <rguenther@suse.de>
14104 PR tree-optimization/101202
14105 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
14108 2021-06-25 Richard Biener <rguenther@suse.de>
14110 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
14111 STMT_VINFO_REDUC_DEF from the original representative.
14113 2021-06-25 Martin Sebor <msebor@redhat.com>
14115 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
14116 gimple_no_warning_p and gimple_set_no_warning with
14117 warning_suppressed_p, and suppress_warning.
14119 (maybe_warn_for_bound): Same.
14120 (warn_for_access): Same.
14121 (check_access): Same.
14122 (expand_builtin_strncmp): Same.
14123 (fold_builtin_varargs): Same.
14124 * calls.c (maybe_warn_nonstring_arg): Same.
14125 (maybe_warn_rdwr_sizes): Same.
14126 * cfgexpand.c (expand_call_stmt): Same.
14127 * cgraphunit.c (check_global_declaration): Same.
14128 * fold-const.c (fold_undefer_overflow_warnings): Same.
14129 (fold_truth_not_expr): Same.
14130 (fold_unary_loc): Same.
14131 (fold_checksum_tree): Same.
14132 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
14133 (array_bounds_checker::check_mem_ref): Same.
14134 (array_bounds_checker::check_addr_expr): Same.
14135 (array_bounds_checker::check_array_bounds): Same.
14136 * gimple-expr.c (copy_var_decl): Same.
14137 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
14138 (gimple_fold_builtin_strncat): Same.
14139 (gimple_fold_builtin_stxcpy_chk): Same.
14140 (gimple_fold_builtin_stpcpy): Same.
14141 (gimple_fold_builtin_sprintf): Same.
14142 (fold_stmt_1): Same.
14143 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
14144 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
14145 * gimple-ssa-sprintf.c (handle_printf_call): Same.
14146 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
14147 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
14148 * gimple-ssa-warn-restrict.h: Adjust declarations.
14149 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
14150 gimple_no_warning_p and gimple_set_no_warning with
14151 warning_suppressed_p, and suppress_warning.
14152 (check_call): Same.
14153 (check_bounds_or_overlap): Same.
14154 * gimple.c (gimple_build_call_from_tree): Same.
14155 * gimplify.c (gimplify_return_expr): Same.
14156 (gimplify_cond_expr): Same.
14157 (gimplify_modify_expr_complex_part): Same.
14158 (gimplify_modify_expr): Same.
14159 (gimple_push_cleanup): Same.
14160 (gimplify_expr): Same.
14161 * omp-expand.c (expand_omp_for_generic): Same.
14162 (expand_omp_taskloop_for_outer): Same.
14163 * omp-low.c (lower_rec_input_clauses): Same.
14164 (lower_lastprivate_clauses): Same.
14165 (lower_send_clauses): Same.
14166 (lower_omp_target): Same.
14167 * tree-cfg.c (pass_warn_function_return::execute): Same.
14168 * tree-complex.c (create_one_component_var): Same.
14169 * tree-inline.c (remap_gimple_op_r): Same.
14170 (copy_tree_body_r): Same.
14171 (declare_return_variable): Same.
14172 (expand_call_inline): Same.
14173 * tree-nested.c (lookup_field_for_decl): Same.
14174 * tree-sra.c (create_access_replacement): Same.
14175 (generate_subtree_copies): Same.
14176 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
14177 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
14178 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
14179 * tree-ssa-loop-im.c (execute_sm): Same.
14180 * tree-ssa-phiopt.c (cond_store_replacement): Same.
14181 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
14182 (handle_builtin_strcpy): Same.
14183 (maybe_diag_stxncpy_trunc): Same.
14184 (handle_builtin_stxncpy_strncat): Same.
14185 (handle_builtin_strcat): Same.
14186 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
14187 (set_no_uninit_warning): Same.
14188 (uninit_undefined_value_p): Same.
14189 (warn_uninit): Same.
14190 (maybe_warn_operand): Same.
14191 * tree-vrp.c (compare_values_warnv): Same.
14192 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
14193 (test_for_singularity): Same.
14194 * gimple.h (warning_suppressed_p): New function.
14195 (suppress_warning): Same.
14196 (copy_no_warning): Same.
14197 (gimple_set_block): Call gimple_set_location.
14198 (gimple_set_location): Call copy_warning.
14200 2021-06-25 Martin Sebor <msebor@redhat.com>
14202 * tree.h (warning_suppressed_at, copy_warning,
14203 warning_suppressed_p, suppress_warning): New functions.
14205 2021-06-25 Martin Sebor <msebor@redhat.com>
14207 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
14208 * gengtype.c (open_base_files): Add diagnostic-spec.h.
14209 * diagnostic-spec.c: New file.
14210 * diagnostic-spec.h: New file.
14211 * tree.h (no_warning, all_warnings, suppress_warning_at): New
14213 * warning-control.cc: New file.
14215 2021-06-25 liuhongt <hongtao.liu@intel.com>
14218 * config/i386/i386.c (x86_order_regs_for_local_alloc):
14221 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
14223 PR tree-optimization/101189
14224 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
14225 LHS range of condition to postfold routine.
14226 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
14227 FALSE edge if the LHS range supports it being taken.
14228 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
14230 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
14232 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
14233 (relation_oracle::find_relation_block): Check correct bitmap.
14234 (relation_oracle::dump): Do not dump NULL blocks.
14236 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
14238 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
14239 range_on_edge instead of manually calculating.
14241 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
14243 * range-op.cc: Fix comment.
14245 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
14248 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
14249 Handle V8QI and V4HI modes.
14250 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
14252 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
14253 (mmxpackmode): New mode attribute.
14254 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
14255 (mmxunpackmode): New mode attribute.
14256 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
14257 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
14258 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
14259 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
14260 * config/i386/i386.md (extsuffix): Move from ...
14261 * config/i386/sse.md: ... here.
14263 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
14265 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
14266 (dwarf2out_finish): ...instead of here.
14268 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
14270 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
14271 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
14272 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
14273 * configure: Regenerate.
14275 2021-06-24 Richard Biener <rguenther@suse.de>
14277 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
14278 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
14279 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
14281 2021-06-24 Richard Biener <rguenther@suse.de>
14283 * config/i386/sse.md (avx_addsubv4df3): Rename to
14285 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
14286 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
14287 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
14288 * config/i386/i386-builtin.def: Adjust.
14289 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
14290 * optabs.def (vec_addsub_optab): New optab.
14291 * tree-vect-slp-patterns.c (class addsub_pattern): New.
14292 (slp_patterns): Add addsub_pattern.
14293 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
14294 across CFN_VEC_ADDSUB.
14295 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
14297 * doc/md.texi (vec_addsub<mode>3): Document.
14299 2021-06-24 Jakub Jelinek <jakub@redhat.com>
14301 PR middle-end/101170
14302 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
14303 where regno + subreg_regno_offset wraps around use 0 as starting
14306 2021-06-24 Jakub Jelinek <jakub@redhat.com>
14308 PR middle-end/101172
14309 * stor-layout.c (finish_bitfield_representative): If nextf has
14310 error_mark_node type, set repr type to error_mark_node too.
14312 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
14314 * config/s390/s390.c (s390_function_profiler): Ignore labelno
14316 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
14318 2021-06-24 Richard Biener <rguenther@suse.de>
14320 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
14321 across operations that have different semantics on different
14324 2021-06-24 Jakub Jelinek <jakub@redhat.com>
14326 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
14327 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
14328 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
14329 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
14330 OMP_TARGET user outer_ctx instead of ctx for placeholders and
14331 initializer/combiner gimplification.
14332 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
14333 on target constructs.
14334 (lower_rec_input_clauses): Likewise.
14335 (lower_omp_target): Likewise.
14336 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
14337 on target if in_reduction is present.
14339 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
14341 * tree-predcom.c (class pcom_worker): New class.
14342 (release_chain): Renamed to...
14343 (pcom_worker::release_chain): ...this.
14344 (release_chains): Renamed to...
14345 (pcom_worker::release_chains): ...this.
14346 (aff_combination_dr_offset): Renamed to...
14347 (pcom_worker::aff_combination_dr_offset): ...this.
14348 (determine_offset): Renamed to...
14349 (pcom_worker::determine_offset): ...this.
14350 (class comp_ptrs): New class.
14351 (split_data_refs_to_components): Renamed to...
14352 (pcom_worker::split_data_refs_to_components): ...this,
14353 and update with class comp_ptrs.
14354 (suitable_component_p): Renamed to...
14355 (pcom_worker::suitable_component_p): ...this.
14356 (filter_suitable_components): Renamed to...
14357 (pcom_worker::filter_suitable_components): ...this.
14358 (valid_initializer_p): Renamed to...
14359 (pcom_worker::valid_initializer_p): ...this.
14360 (find_looparound_phi): Renamed to...
14361 (pcom_worker::find_looparound_phi): ...this.
14362 (add_looparound_copies): Renamed to...
14363 (pcom_worker::add_looparound_copies): ...this.
14364 (determine_roots_comp): Renamed to...
14365 (pcom_worker::determine_roots_comp): ...this.
14366 (determine_roots): Renamed to...
14367 (pcom_worker::determine_roots): ...this.
14368 (single_nonlooparound_use): Renamed to...
14369 (pcom_worker::single_nonlooparound_use): ...this.
14370 (remove_stmt): Renamed to...
14371 (pcom_worker::remove_stmt): ...this.
14372 (execute_pred_commoning_chain): Renamed to...
14373 (pcom_worker::execute_pred_commoning_chain): ...this.
14374 (execute_pred_commoning): Renamed to...
14375 (pcom_worker::execute_pred_commoning): ...this.
14376 (struct epcc_data): New member worker.
14377 (execute_pred_commoning_cbck): Call execute_pred_commoning
14378 with pcom_worker pointer.
14379 (find_use_stmt): Renamed to...
14380 (pcom_worker::find_use_stmt): ...this.
14381 (find_associative_operation_root): Renamed to...
14382 (pcom_worker::find_associative_operation_root): ...this.
14383 (find_common_use_stmt): Renamed to...
14384 (pcom_worker::find_common_use_stmt): ...this.
14385 (combinable_refs_p): Renamed to...
14386 (pcom_worker::combinable_refs_p): ...this.
14387 (reassociate_to_the_same_stmt): Renamed to...
14388 (pcom_worker::reassociate_to_the_same_stmt): ...this.
14389 (stmt_combining_refs): Renamed to...
14390 (pcom_worker::stmt_combining_refs): ...this.
14391 (combine_chains): Renamed to...
14392 (pcom_worker::combine_chains): ...this.
14393 (try_combine_chains): Renamed to...
14394 (pcom_worker::try_combine_chains): ...this.
14395 (prepare_initializers_chain): Renamed to...
14396 (pcom_worker::prepare_initializers_chain): ...this.
14397 (prepare_initializers): Renamed to...
14398 (pcom_worker::prepare_initializers): ...this.
14399 (prepare_finalizers_chain): Renamed to...
14400 (pcom_worker::prepare_finalizers_chain): ...this.
14401 (prepare_finalizers): Renamed to...
14402 (pcom_worker::prepare_finalizers): ...this.
14403 (tree_predictive_commoning_loop): Renamed to...
14404 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
14405 some calls and remove some cleanup code.
14406 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
14407 (static variable looparound_phis): Remove.
14408 (static variable name_expansions): Remove.
14410 2021-06-24 Richard Biener <rguenther@suse.de>
14412 * tree-vect-slp.c (slpg_vertex): New struct.
14413 (vect_slp_build_vertices): Adjust.
14414 (vect_optimize_slp): Likewise. Maintain an outgoing permute
14415 and a materialized one.
14417 2021-06-24 Richard Biener <rguenther@suse.de>
14419 PR tree-optimization/101105
14420 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
14421 Only ignore steps when they are equal or scalar order is preserved.
14423 2021-06-24 liuhongt <hongtao.liu@intel.com>
14426 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
14427 Adjust comments for ix86_expand_vecop_qihi2.
14428 (ix86_expand_vecmul_qihi): Renamed to ..
14429 (ix86_expand_vecop_qihi2): Adjust function prototype to
14430 support shift operation, add static to definition.
14431 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
14432 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
14433 ix86_expand_vec_shift_qihi_constant.
14434 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
14435 (ix86_expand_vec_shift_qihi_constant): Deleted.
14436 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
14438 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
14439 condition TARGET_64BIT.
14440 (mul<mode>3): Ditto.
14441 (<insn><mode>3): Ditto.
14442 (vlshr<mode>3): Extend to support avx512 vlshr.
14443 (v<insn><mode>3): New expander for
14445 (v<insn>v8qi3): Ditto.
14446 (vashrv8hi3<mask_name>): Renamed to ..
14447 (vashr<mode>3): And extend to support V16QImode for avx512.
14448 (vashrv16qi3): Deleted.
14449 (vashrv2di3<mask_name>): Extend expander to support avx512
14452 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
14454 * doc/lto.texi (Design Overview): Update that slim objects are
14457 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
14459 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
14460 of OTHER_POWER10_MASKS so it will not be enabled by default.
14462 2021-06-23 Richard Biener <rguenther@suse.de>
14463 Martin Jambor <mjambor@suse.cz>
14465 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
14466 param replacement unconditionally. Adjust comment.
14468 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
14470 * Makefile.in (OBJS): Add gimple-range-fold.o
14471 * gimple-range-fold.cc: New.
14472 * gimple-range-fold.h: New.
14473 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
14474 (gimple_range_calc_op2): Ditto.
14475 * gimple-range-gori.h: Move prototypes to here.
14476 * gimple-range.cc: Adjust include files.
14477 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
14478 (fur_source::get_operand): Ditto.
14479 (fur_source::get_phi_operand): Ditto.
14480 (fur_source::query_relation): Ditto.
14481 (fur_source::register_relation): Ditto.
14482 (class fur_edge): Ditto.
14483 (fur_edge::fur_edge): Ditto.
14484 (fur_edge::get_operand): Ditto.
14485 (fur_edge::get_phi_operand): Ditto.
14486 (fur_stmt::fur_stmt): Ditto.
14487 (fur_stmt::get_operand): Ditto.
14488 (fur_stmt::get_phi_operand): Ditto.
14489 (fur_stmt::query_relation): Ditto.
14490 (class fur_depend): Relocate to gimple-range-fold.h.
14491 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
14492 (fur_depend::register_relation): Ditto.
14493 (fur_depend::register_relation): Ditto.
14494 (class fur_list): Ditto.
14495 (fur_list::fur_list): Ditto.
14496 (fur_list::get_operand): Ditto.
14497 (fur_list::get_phi_operand): Ditto.
14498 (fold_range): Ditto.
14499 (adjust_pointer_diff_expr): Ditto.
14500 (gimple_range_adjustment): Ditto.
14501 (gimple_range_base_of_assignment): Ditto.
14502 (gimple_range_operand1): Ditto.
14503 (gimple_range_operand2): Ditto.
14504 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
14505 (gimple_range_calc_op2): Ditto.
14506 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
14507 (fold_using_range::range_of_range_op): Ditto.
14508 (fold_using_range::range_of_address): Ditto.
14509 (fold_using_range::range_of_phi): Ditto.
14510 (fold_using_range::range_of_call): Ditto.
14511 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
14512 (fold_using_range::range_of_builtin_call): Ditto.
14513 (fold_using_range::range_of_cond_expr): Ditto.
14514 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
14515 (fold_using_range::relation_fold_and_or): Ditto.
14516 (fold_using_range::postfold_gcond_edges): Ditto.
14517 * gimple-range.h: Add gimple-range-fold.h to include files. Change
14518 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
14519 (gimple_range_handler): Relocate to gimple-range-fold.h.
14520 (gimple_range_ssa_p): Ditto.
14521 (range_compatible_p): Ditto.
14522 (class fur_source): Ditto.
14523 (class fur_stmt): Ditto.
14524 (class fold_using_range): Ditto.
14525 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
14526 (gimple_range_calc_op2): Ditto.
14528 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
14530 PR tree-optimization/101148
14531 PR tree-optimization/101014
14532 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
14533 (ranger_cache::~ranger_cache): Adjust.
14534 (ranger_cache::block_range): Check if propagation disallowed.
14535 (ranger_cache::propagate_cache): Disallow propagation if new value
14536 can't be stored properly.
14537 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
14539 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
14541 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
14542 (sbr_vector::set_bb_range): Return true.
14543 (class sbr_sparse_bitmap): Adjust.
14544 (sbr_sparse_bitmap::set_bb_range): Return value.
14545 (block_range_cache::set_bb_range): Return value.
14546 (ranger_cache::propagate_cache): Use return value to print msg.
14547 * gimple-range-cache.h (class block_range_cache): Adjust.
14549 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
14551 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
14553 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
14555 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
14556 so this pattern can be used for test/compare removal. Pass
14557 current insn to compute_logical_op_length and output_logical_op.
14558 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
14559 (h8300_and_costs): Add argument to compute_logical_op_length.
14560 (output_logical_op): Add new argument. Use it to determine if the
14561 condition codes are used and adjust the output accordingly.
14562 (compute_logical_op_length): Add new argument and update length
14563 computations when condition codes are used.
14564 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
14566 (output_logical_op): Likewise.
14568 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
14571 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
14572 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
14573 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
14574 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
14575 * config/i386/sse.md (unspec): ... here.
14577 2021-06-23 Martin Liska <mliska@suse.cz>
14580 * optc-save-gen.awk: Put back arm_fp16_format to
14583 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
14586 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
14589 (clz<mode>2): Update RTX pattern for additions.
14591 2021-06-23 Jakub Jelinek <jakub@redhat.com>
14593 PR middle-end/101167
14594 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
14595 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
14597 2021-06-22 Sergei Trofimovich <siarheit@google.com>
14599 * doc/rtl.texi: drop unbalanced parenthesis.
14601 2021-06-22 Richard Biener <rguenther@suse.de>
14603 PR middle-end/101156
14604 * gimplify.c (gimplify_expr): Remove premature incorrect
14607 2021-06-22 Jakub Jelinek <jakub@redhat.com>
14609 PR tree-optimization/101159
14610 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
14613 2021-06-22 Jakub Jelinek <jakub@redhat.com>
14615 PR middle-end/101160
14616 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
14617 clear crtl->return_rtx instead of keeping it referencing a pseudo.
14619 2021-06-22 Jakub Jelinek <jakub@redhat.com>
14620 Andrew Pinski <apinski@marvell.com>
14622 PR tree-optimization/101162
14623 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
14626 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14628 * range-op.cc (range_relational_tests): New.
14629 (range_op_tests): Call range_relational_tests.
14631 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14633 * range-op.cc (operator_cast::lhs_op1_relation): New.
14634 (operator_identity::lhs_op1_relation): Mew.
14636 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14638 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
14640 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14642 * range-op.cc (operator_plus::lhs_op1_relation): New.
14643 (operator_plus::lhs_op2_relation): New.
14645 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14647 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
14648 relation_oracle if dominators exist.
14649 (ranger_cache::~ranger_cache): Dispose of oracle.
14650 (ranger_cache::dump_bb): Dump oracle.
14651 * gimple-range.cc (fur_source::fur_source): New.
14652 (fur_source::get_operand): Use mmeber query.
14653 (fur_source::get_phi_operand): Use member_query.
14654 (fur_source::query_relation): New.
14655 (fur_source::register_dependency): Delete.
14656 (fur_source::register_relation): New.
14657 (fur_edge::fur_edge): Adjust.
14658 (fur_edge::get_phi_operand): Fix comment.
14659 (fur_edge::query): Delete.
14660 (fur_stmt::fur_stmt): Adjust.
14661 (fur_stmt::query): Delete.
14662 (fur_depend::fur_depend): Adjust.
14663 (fur_depend::register_relation): New.
14664 (fur_depend::register_relation): New.
14665 (fur_list::fur_list): Adjust.
14666 (fur_list::get_operand): Use member query.
14667 (fold_using_range::range_of_range_op): Process and query relations.
14668 (fold_using_range::range_of_address): Adjust dependency call.
14669 (fold_using_range::range_of_phi): Ditto.
14670 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
14671 (fold_using_range::relation_fold_and_or): New.
14672 (fold_using_range::postfold_gcond_edges): New.
14673 * gimple-range.h (class gimple_ranger): Adjust.
14674 (class fur_source): Adjust members.
14675 (class fur_stmt): Ditto.
14676 (class fold_using_range): Ditto.
14678 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14680 * range-op.cc (range_operator::wi_fold): Apply relation effect.
14681 (range_operator::fold_range): Adjust and apply relation effect.
14682 (*::fold_range): Add relation parameters.
14683 (*::op1_range): Ditto.
14684 (*::op2_range): Ditto.
14685 (range_operator::lhs_op1_relation): New.
14686 (range_operator::lhs_op2_relation): New.
14687 (range_operator::op1_op2_relation): New.
14688 (range_operator::op1_op2_relation_effect): New.
14689 (relop_early_resolve): New.
14690 (operator_equal::op1_op2_relation): New.
14691 (operator_equal::fold_range): Call relop_early_resolve.
14692 (operator_not_equal::op1_op2_relation): New.
14693 (operator_not_equal::fold_range): Call relop_early_resolve.
14694 (operator_lt::op1_op2_relation): New.
14695 (operator_lt::fold_range): Call relop_early_resolve.
14696 (operator_le::op1_op2_relation): New.
14697 (operator_le::fold_range): Call relop_early_resolve.
14698 (operator_gt::op1_op2_relation): New.
14699 (operator_gt::fold_range): Call relop_early_resolve.
14700 (operator_ge::op1_op2_relation): New.
14701 (operator_ge::fold_range): Call relop_early_resolve.
14702 * range-op.h (class range_operator): Adjust parameters and methods.
14704 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14706 * Makefile.in (OBJS): Add value-relation.o.
14707 * gimple-range.h: Adjust include files.
14708 * tree-data-ref.c: Adjust include file order.
14709 * value-query.cc (range_query::get_value_range): Default to no oracle.
14710 (range_query::query_relation): New.
14711 (range_query::query_relation): New.
14712 * value-query.h (class range_query): Adjust.
14713 * value-relation.cc: New.
14714 * value-relation.h: New.
14716 2021-06-22 Richard Biener <rguenther@suse.de>
14718 PR tree-optimization/101151
14719 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
14722 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
14724 * config/riscv/riscv.c (thead_c906_tune_info): New.
14725 (riscv_tune_info_table): Use new tune.
14727 2021-06-22 Richard Biener <rguenther@suse.de>
14729 PR tree-optimization/101158
14730 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
14731 checking after checking for matching operation.
14733 2021-06-22 Richard Biener <rguenther@suse.de>
14735 PR tree-optimization/101159
14736 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
14737 missing NULL vectype check.
14739 2021-06-22 Richard Biener <rguenther@suse.de>
14741 PR tree-optimization/101154
14742 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
14744 2021-06-22 Jakub Jelinek <jakub@redhat.com>
14747 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
14748 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
14749 * config/i386/i386.c (ix86_expand_prologue): Clear it.
14750 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
14751 Remove "" from match_operand. Emit new insns using emit_move_insn and
14752 set ix86_last_zero_store_uid to INSN_UID of the last store.
14753 Add peephole2s for 1/2/4 stores of const0_rtx following previous
14756 2021-06-22 Martin Liska <mliska@suse.cz>
14758 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
14761 2021-06-22 Martin Liska <mliska@suse.cz>
14763 * gcov-io.h: Remove padding entries.
14765 2021-06-22 liuhongt <hongtao.liu@intel.com>
14767 PR tree-optimization/97770
14768 * tree-vect-patterns.c (vect_recog_popcount_pattern):
14770 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
14772 2021-06-22 liuhongt <hongtao.liu@intel.com>
14775 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
14776 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
14777 (expand<mode>_mask): this ..
14778 (*expand<mode>_mask): New pre_reload splitter to transform
14779 v{,p}expand* to vmov* when mask is zero, all ones, or has all
14780 ones in it's lower part, otherwise still generate
14783 2021-06-22 liuhongt <hongtao.liu@intel.com>
14786 * config/i386/i386-expand.c
14787 (ix86_expand_special_args_builtin): Keep constm1_operand only
14788 if it satisfies insn's operand predicate.
14790 2021-06-21 Jason Merrill <jason@redhat.com>
14793 * df-scan.c (df_ref_record): Check that regno < endregno.
14794 * function.c (assign_parms, expand_function_end): Do nothing with a
14795 TYPE_EMPTY_P result.
14797 2021-06-21 Richard Biener <rguenther@suse.de>
14799 PR tree-optimization/101120
14800 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
14802 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
14803 DR chain DCE capability.
14804 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
14805 * tree-vect-stmts.c (vectorizable_load): Remove unused
14806 loads in the DR chain for SLP.
14808 2021-06-21 Jakub Jelinek <jakub@redhat.com>
14810 PR inline-asm/100785
14811 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
14812 output or input operands were already error_mark_node.
14813 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
14814 remove all inputs, outputs and clobbers from the asm and
14815 set template to "".
14817 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
14819 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
14820 (vceq_s16): Likewise.
14821 (vceq_s32): Likewise.
14822 (vceq_u8): Likewise.
14823 (vceq_u16): Likewise.
14824 (vceq_u32): Likewise.
14825 (vceq_p8): Likewise.
14826 (vceqq_s8): Likewise.
14827 (vceqq_s16): Likewise.
14828 (vceqq_s32): Likewise.
14829 (vceqq_u8): Likewise.
14830 (vceqq_u16): Likewise.
14831 (vceqq_u32): Likewise.
14832 (vceqq_p8): Likewise.
14833 (vceq_f32): Gate __a == __b on __FAST_MATH__.
14834 (vceqq_f32): Likewise.
14835 (vceq_f16): Likewise.
14836 (vceqq_f16): Likewise.
14838 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
14841 * config/arm/iterators.md (NEON_VACMP): Remove.
14842 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
14844 (neon_vca<cmp_op><mode>_insn): Likewise.
14845 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
14848 2021-06-21 Richard Biener <rguenther@suse.de>
14850 PR tree-optimization/101121
14851 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
14852 when we just lack a stmt with the desired op when doing permutation.
14853 (vect_build_slp_tree): When caching a failed SLP build attempt
14854 assert that at least one lane is marked as not matching.
14856 2021-06-21 liuhongt <hongtao.liu@intel.com>
14859 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
14860 register alternative.
14861 (*and<mode>_1): Ditto.
14863 (*andn<mode>_1): Ditto.
14864 (*<code><mode>_1): Ditto.
14865 (*<code>qi_1): Ditto.
14866 (*one_cmpl<mode>2_1): Ditto.
14867 (*one_cmplsi2_1_zext): Ditto.
14868 (*one_cmplqi2_1): Ditto.
14869 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
14870 the order of mask registers to be before general registers.
14872 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
14875 * config/i386/i386.md: New define_peephole2s to shrink writing
14876 1, 2 or 4 consecutive zeros to memory when optimizing for size.
14878 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
14880 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
14881 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
14882 more efficient code when the source can be trivially simplified.
14884 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
14886 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
14887 a range if global is not available.
14888 (ranger_cache::entry_range): Fallback to range_of_def.
14889 * gimple-range-cache.h (range_of_def): Adjust prototype.
14891 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
14893 PR tree-optimization/101014
14894 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
14896 (ranger_cache::~ranger_cache): Ditto.
14897 (ranger_cache::enable_new_values): Delete.
14898 (ranger_cache::push_poor_value): Delete.
14899 (ranger_cache::range_of_def): Remove poor value processing.
14900 (ranger_cache::entry_range): Ditto.
14901 (ranger_cache::fill_block_cache): Ditto.
14902 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
14903 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
14904 * gimple-range.h (class gimple_ranger): Adjust.
14906 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
14909 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
14910 derived from arm_canon_arch.
14911 (arm_canon_arch_option): Call it.
14912 (arm_canon_arch_multilib_option): New function.
14913 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
14914 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
14915 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
14916 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
14917 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
14918 * config/arm/arm.opt (mlibarch): New option.
14919 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
14920 of march on RHS with mlibarch.
14922 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
14924 * config.in: Regenerate.
14925 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
14927 * configure: Regenerate.
14928 * configure.ac: Fix for global_load assembler functions.
14930 2021-06-18 Richard Biener <rguenther@suse.de>
14932 PR tree-optimization/101112
14933 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
14934 to lookup a pattern stmt def.
14936 2021-06-18 Jakub Jelinek <jakub@redhat.com>
14938 PR middle-end/101062
14939 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
14940 representatives in QUAL_UNION_TYPE.
14942 2021-06-18 Andrew Pinski <apinski@marvell.com>
14944 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
14945 Add counting of how many times it is done.
14946 (factor_out_conditional_conversion): Likewise.
14947 (match_simplify_replacement): Likewise.
14948 (value_replacement): Likewise.
14949 (spaceship_replacement): Likewise.
14950 (cond_store_replacement): Likewise.
14951 (cond_if_else_store_replacement_1): Likewise.
14952 (hoist_adjacent_loads): Likewise.
14954 2021-06-18 Andrew Pinski <apinski@marvell.com>
14956 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
14957 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
14958 (verify_gimple_assign_binary): Reject point and offset types on
14959 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
14960 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
14961 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
14963 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
14965 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
14966 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
14967 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
14970 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
14972 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
14973 earlyclobber to alts 0/1.
14974 (gen_addadd): Add earlyclobber to alts 0/1.
14975 * config/rs6000/fusion.md: Regenerate file.
14977 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
14979 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
14981 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
14983 * gimple-range-cache.cc: Comment cleanups.
14984 * gimple-range-gori.cc: Comment cleanups.
14985 * gimple-range.cc: Comment/spacing cleanups
14986 * value-range.h: Comment cleanups.
14988 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
14991 * calls.c (expand_call): Replace PUSH_ARGS with
14992 targetm.calls.push_argument (0).
14993 (emit_library_call_value_1): Likewise.
14994 * defaults.h (PUSH_ARGS): Removed.
14995 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
14996 targetm.calls.push_argument (0).
14997 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
14998 (emit_push_insn): Pass the number bytes to push to
14999 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
15000 * hooks.c (hook_bool_uint_true): New.
15001 * hooks.h (hook_bool_uint_true): Likewise.
15002 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
15003 targetm.calls.push_argument (0).
15004 * target.def (push_argument): Add a targetm.calls hook.
15005 * targhooks.c (default_push_argument): New.
15006 * targhooks.h (default_push_argument): Likewise.
15007 * config/bpf/bpf.h (PUSH_ARGS): Removed.
15008 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
15009 * config/cr16/cr16.h (PUSH_ARGS): Removed.
15010 * config/i386/i386.c (ix86_push_argument): New.
15011 (TARGET_PUSH_ARGUMENT): Likewise.
15012 * config/i386/i386.h (PUSH_ARGS): Removed.
15013 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
15014 * config/m32c/m32c.h (PUSH_ARGS): Removed.
15015 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
15016 * config/pru/pru.h (PUSH_ARGS): Likewise.
15017 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
15018 TARGET_PUSH_ARGUMENT hook.
15019 * doc/tm.texi: Regenerated.
15021 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
15024 * config/i386/i386-expand.c (expand_vector_set_var):
15025 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
15026 ix86_expand_vector_init_duplicate.
15027 (ix86_expand_vector_init_duplicate): Emit insv_1 for
15028 QImode for !TARGET_PARTIAL_REG_STALL.
15029 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
15030 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
15031 as operand 2 predicate. Call ix86_expand_vector_set_var
15032 for non-constant index operand.
15033 (vec_setv2si): Ditto.
15034 (vec_setv4hi): Ditto.
15035 (vec_setv8qi): ditto.
15037 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
15039 PR tree-optimization/100790
15040 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
15043 2021-06-17 Martin Liska <mliska@suse.cz>
15045 * doc/invoke.texi: Use consistently -O1 instead of -O.
15047 2021-06-17 Martin Liska <mliska@suse.cz>
15049 * gcov-io.h: Update documentation entry about string format.
15051 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
15054 * config/s390/vecintrin.h (vec_doublee): Fix to use
15055 __builtin_s390_vflls.
15056 (vec_floate): Fix to use __builtin_s390_vflrd.
15058 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15060 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
15061 * dominance.h (get_dominated_to_depth): Likewise.
15062 (get_all_dominated_blocks): Likewise.
15063 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
15064 * gcse.c (hoist_code): Likewise.
15065 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
15066 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
15067 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
15068 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
15070 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15072 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
15073 * dominance.h (get_dominated_by_region): Likewise.
15074 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
15075 (gimple_duplicate_sese_tail): Likewise.
15076 (move_sese_region_to_fn): Likewise.
15078 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15080 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
15081 * dominance.h (get_dominated_by): Likewise.
15082 * auto-profile.c (afdo_find_equiv_class): Adjust.
15083 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
15084 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
15085 * tree-cfg.c (test_linear_chain): Likewise.
15086 (test_diamond): Likewise.
15088 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15090 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
15091 * cfgloopanal.c (get_loop_hot_path): Likewise.
15092 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
15094 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15096 * cgraph.c (cgraph_node::collect_callers): Return
15097 auto_vec<cgraph_edge *>.
15098 * cgraph.h (cgraph_node::collect_callers): Likewise.
15099 * ipa-cp.c (create_specialized_node): Adjust.
15100 (decide_about_value): Likewise.
15101 (decide_whether_version_node): Likewise.
15102 * ipa-sra.c (process_isra_node_results): Likewise.
15104 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15106 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
15107 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
15109 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
15112 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
15114 * gimple-range.cc (debug_seed_ranger): New.
15115 (dump_ranger): New.
15116 (debug_ranger): New.
15118 2021-06-17 Richard Biener <rguenther@suse.de>
15120 PR tree-optimization/54400
15121 * tree-vectorizer.h (enum slp_instance_kind): Add
15122 slp_inst_kind_bb_reduc.
15123 (reduction_fn_for_scalar_code): Declare.
15124 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
15125 Check SLP_INSTANCE_KIND instead of looking at the
15127 (vect_slp_analyze_instance_alignment): Likewise.
15128 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
15129 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
15130 chain linearization from vect_build_slp_tree_2 and generalize
15131 for the use of BB reduction vectorization.
15132 (vect_build_slp_tree_2): Adjust accordingly.
15133 (vect_optimize_slp): Elide permutes at the root of BB reduction
15135 (vectorizable_bb_reduc_epilogue): New function.
15136 (vect_slp_prune_covered_roots): Likewise.
15137 (vect_slp_analyze_operations): Use them.
15138 (vect_slp_check_for_constructors): Recognize associatable
15139 chains for BB reduction vectorization.
15140 (vectorize_slp_instance_root_stmt): Generate code for the
15141 BB reduction epilogue.
15143 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
15145 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
15147 (gori_compute::may_recompute_p): New.
15148 (gori_compute::outgoing_edge_range_p): Perform recomputations.
15149 * gimple-range-gori.h (class gori_compute): Add prototype.
15151 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
15153 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
15154 true when a range can be calculated.
15155 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
15157 2021-06-16 Martin Sebor <msebor@redhat.com>
15159 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
15160 Correct documented defaults.
15162 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
15164 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
15165 m_new_value_p directly.
15167 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
15170 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
15171 Handle 64bit modes for TARGET_SSE4_1.
15172 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
15173 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
15174 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
15175 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
15176 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
15177 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
15179 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
15181 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
15182 Change to an expander that emits the correct instruction
15183 depending on endianness.
15184 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
15185 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
15187 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
15189 * config/aarch64/aarch64-simd-builtins.def: Split generator
15190 for aarch64_<su>qmovn builtins into scalar and vector
15192 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
15194 (aarch64_<su>qmovn<mode>_insn_be): Define.
15195 (aarch64_<su>qmovn<mode>): Split into scalar and vector
15196 variants. Change vector variant to an expander that emits the
15197 correct instruction depending on endianness.
15199 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
15201 * config/aarch64/aarch64-simd-builtins.def: Split generator
15202 for aarch64_sqmovun builtins into scalar and vector variants.
15203 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
15204 Split into scalar and vector variants. Change vector variant
15205 to an expander that emits the correct instruction depending
15207 (aarch64_sqmovun<mode>_insn_le): Define.
15208 (aarch64_sqmovun<mode>_insn_be): Define.
15210 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
15212 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
15213 Define - modeling zero-high-half semantics.
15214 (aarch64_xtn<mode>): Change to an expander that emits the
15215 appropriate instruction depending on endianness.
15216 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
15218 (aarch64_xtn2<mode>_le): Rename to...
15219 (aarch64_xtn2<mode>_insn_le): This.
15220 (aarch64_xtn2<mode>_be): Rename to...
15221 (aarch64_xtn2<mode>_insn_be): This.
15222 (vec_pack_trunc_<mode>): Emit truncation instruction instead
15224 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
15225 attribute iterator.
15227 2021-06-16 Martin Jambor <mjambor@suse.cz>
15229 PR tree-optimization/100453
15230 * tree-sra.c (create_access): Disqualify any const candidates
15231 which are written to.
15232 (sra_modify_expr): Do not store sub-replacements back to a const base.
15233 (handle_unscalarized_data_in_subtree): Likewise.
15234 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
15235 instead of constant_decl_p.
15237 2021-06-16 Jakub Jelinek <jakub@redhat.com>
15239 PR middle-end/101062
15240 * stor-layout.c (finish_bitfield_representative): For fields in unions
15241 assume nextf is always NULL.
15242 (finish_bitfield_layout): Compute bit field representatives also in
15243 unions, but handle it as if each bitfield was the only field in the
15246 2021-06-16 Richard Biener <rguenther@suse.de>
15248 PR tree-optimization/101088
15249 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
15250 supported refs on edges. Do not assert same ref but
15251 different kind stores are unsuported but mark them so.
15252 (hoist_memory_references): Only look for supported refs
15255 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
15257 PR rtl-optimization/46235
15258 * config/i386/i386.md: New define_split for bt followed by cmov.
15259 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
15260 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
15261 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
15262 by setnc with zero extension.
15264 2021-06-16 Richard Biener <rguenther@suse.de>
15266 PR tree-optimization/101083
15267 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
15268 vectype as argument.
15269 (vect_build_slp_tree_2): Adjust.
15271 2021-06-15 Martin Sebor <msebor@redhat.com>
15273 PR middle-end/100876
15274 * builtins.c: (gimple_call_return_array): Account for size_t
15275 mangling as either unsigned int or unsigned long
15277 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
15279 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
15280 up before eliminating comparisons.
15282 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
15284 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
15285 nonzero|X is nonzero.
15286 (range_op_bitwise_and_tests): Add tests for above.
15288 2021-06-15 Carl Love <cel@us.ibm.com>
15291 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
15293 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
15296 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
15299 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
15300 (struct gimplify_omp_ctx): Extend defaultmap array by one.
15301 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
15302 (omp_notice_variable): Update type classification for Fortran.
15303 (gimplify_scan_omp_clauses): Update calls for new argument; handle
15304 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
15305 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
15306 * langhooks.c (lhd_omp_scalar_p): Likewise.
15307 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
15308 (LANG_HOOKS_DECLS): Add them.
15309 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
15310 omp_scalar_p pointer type to include the new bool argument.
15312 2021-06-15 David Malcolm <dmalcolm@redhat.com>
15314 * doc/analyzer.texi
15315 (Special Functions for Debugging the Analyzer): Add
15316 __analyzer_dump_capacity.
15318 2021-06-15 Jakub Jelinek <jakub@redhat.com>
15321 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
15322 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
15324 2021-06-15 Richard Biener <rguenther@suse.de>
15326 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
15327 check to identify loop latches.
15328 * cfgloop.c (verify_loop_structure): Likewise.
15329 * loop-init.c (apply_loop_flags): Allow marked irreducible
15330 regions even with multiple latches.
15331 * predict.c (rebuild_frequencies): Simplify.
15333 2021-06-15 Richard Biener <rguenther@suse.de>
15335 * tree-ssa-threadupdate.c
15336 (jump_thread_path_registry::mark_threaded_blocks): Assert we
15337 have marked irreducible regions.
15339 2021-06-14 Martin Sebor <msebor@redhat.com>
15342 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
15343 Handle calls to placement new.
15344 (ndecl_dealloc_argno): Avoid placement delete.
15346 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
15349 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
15350 create_tmp_reg_or_ssa_name().
15352 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
15354 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
15355 (ranger_cache::enable_new_values): Set to specified value and
15356 return the old value.
15357 (ranger_cache::disable_new_values): Delete.
15358 (ranger_cache::fill_block_cache): Disable non 1st order derived
15360 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
15361 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
15363 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
15366 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
15367 Return true early when testing with V2HImode.
15368 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
15370 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
15372 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
15373 (mve_vec_unpack<US>_hi_<mode>): New pattern.
15374 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
15375 (mve_vmovntq_<supf><mode>): Prefix with '@'.
15376 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
15378 (vec_unpack<US>_lo_<mode>): Likewise.
15379 (vec_pack_trunc_<mode>): Rename to
15380 neon_quad_vec_pack_trunc_<mode>.
15381 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
15383 (vec_unpack<US>_lo_<mode>): New.
15384 (vec_pack_trunc_<mode>): New.
15386 2021-06-14 Richard Biener <rguenther@suse.de>
15388 PR tree-optimization/100934
15389 * tree-ssa-dom.c (pass_dominator::execute): Properly
15390 mark irreducible regions.
15392 2021-06-14 Martin Liska <mliska@suse.cz>
15394 * doc/invoke.texi: Put r{...} on the same line as @item.
15396 2021-06-14 Martin Liska <mliska@suse.cz>
15398 * doc/invoke.texi: Add missing newline.
15400 2021-06-14 Martin Liska <mliska@suse.cz>
15402 * doc/invoke.texi: Remove '+' charasters.
15404 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
15406 * config.gcc (arc): Add support for with_cpu option.
15407 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
15409 2021-06-14 Richard Biener <rguenther@suse.de>
15411 PR tree-optimization/101031
15412 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
15413 instead of size when accounting for a possibly string
15416 2021-06-14 Martin Liska <mliska@suse.cz>
15418 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
15420 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
15422 * value-query.cc (gimple_range_global): Call get_range_global
15423 if called after inlining.
15425 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
15428 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
15429 Emit constant permutation insn directly from here.
15431 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
15433 * attribs.c (find_attribute_namespace): Iterate over vec<> with
15435 * auto-profile.c (afdo_find_equiv_class): Likewise.
15436 * gcc.c (do_specs_vec): Likewise.
15437 (do_spec_1): Likewise.
15438 (driver::set_up_specs): Likewise.
15439 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
15440 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
15441 (imm_store_chain_info::try_coalesce_bswap): Likewise.
15442 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
15443 (get_location_for_stmts): Likewise.
15444 * graphite-poly.c (print_iteration_domains): Likewise.
15445 (free_poly_bb): Likewise.
15446 (remove_gbbs_in_scop): Likewise.
15447 (free_scop): Likewise.
15448 (dump_gbb_cases): Likewise.
15449 (dump_gbb_conditions): Likewise.
15450 (print_pdrs): Likewise.
15451 (print_scop): Likewise.
15452 * ifcvt.c (cond_move_process_if_block): Likewise.
15453 * lower-subreg.c (decompose_multiword_subregs): Likewise.
15454 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
15455 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
15456 * sel-sched-dump.c (dump_insn_vector): Likewise.
15457 * store-motion.c (store_ops_ok): Likewise.
15458 (store_killed_in_insn): Likewise.
15459 * timevar.c (timer::named_items::print): Likewise.
15460 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
15461 (cleanup_tree_cfg_noloop): Likewise.
15462 * tree-data-ref.c (dump_data_references): Likewise.
15463 (print_dir_vectors): Likewise.
15464 (print_dist_vectors): Likewise.
15465 (dump_data_dependence_relations): Likewise.
15466 (dump_dist_dir_vectors): Likewise.
15467 (dump_ddrs): Likewise.
15468 (create_runtime_alias_checks): Likewise.
15469 (free_subscripts): Likewise.
15470 (save_dist_v): Likewise.
15471 (save_dir_v): Likewise.
15472 (invariant_access_functions): Likewise.
15473 (same_access_functions): Likewise.
15474 (access_functions_are_affine_or_constant_p): Likewise.
15475 (find_data_references_in_stmt): Likewise.
15476 (graphite_find_data_references_in_stmt): Likewise.
15477 (free_dependence_relations): Likewise.
15478 (free_data_refs): Likewise.
15479 * tree-inline.c (copy_debug_stmts): Likewise.
15480 * tree-into-ssa.c (dump_currdefs): Likewise.
15481 (rewrite_update_phi_arguments): Likewise.
15482 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
15483 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
15485 (vect_slp_analyze_node_dependences): Likewise.
15486 (vect_slp_analyze_instance_dependence): Likewise.
15487 (vect_record_base_alignments): Likewise.
15488 (vect_get_peeling_costs_all_drs): Likewise.
15489 (vect_peeling_supportable): Likewise.
15490 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
15491 (vec_info::free_stmt_vec_infos): Likewise.
15493 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
15495 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
15496 (andqi3_1<cczn>): Removed.
15497 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
15498 (H8/SX bit logicals): Split out from other patterns.
15499 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
15500 mulqihi3_const_clobber_flags.
15501 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
15503 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
15506 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
15507 to true if red zone is used.
15508 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
15509 ix86_red_zone_used.
15510 * config/i386/i386.h (machine_function): Add red_zone_used.
15511 (ix86_red_zone_size): Removed.
15512 (ix86_red_zone_used): New.
15513 * config/i386/i386.md (peephole2 patterns): Replace
15514 ix86_red_zone_size with ix86_red_zone_used.
15516 2021-06-12 Jason Merrill <jason@redhat.com>
15518 * doc/extend.texi (unused variable attribute): Applies to
15519 structure fields as well.
15521 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
15523 * auto-profile.c (read_profile): fix a typo in an error string
15525 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
15527 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
15529 * tree-pretty-print.c (dump_omp_clauses): Update.
15530 (dump_generic_node) <OMP_CLAUSE>: Use it.
15532 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
15535 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
15536 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
15537 the polymorphic variants matching code.
15538 (__arm_vld1q_z): Likewise.
15539 (__arm_vld2q): Likewise.
15540 (__arm_vld4q): Likewise.
15541 (__arm_vldrbq_gather_offset): Likewise.
15542 (__arm_vldrbq_gather_offset_z): Likewise.
15544 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
15546 PR tree-optimization/96392
15547 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
15549 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
15551 PR tree-optimization/96392
15552 * fold-const.c (fold_real_zero_addition_p): Take both arguments
15553 of the addition or subtraction, not just the zero. Use this
15554 other argument in tests for signaling NaNs and signed zeros.
15555 (tree_expr_maybe_real_minus_zero_p): New predicate.
15556 * fold-const.h (fold_real_zero_addition_p): Update prototype.
15557 (tree_expr_maybe_real_minus_zero_p): New function prototype.
15558 * match.pd: Update calls to fold_real_zero_addition_p.
15559 Replace HONOR_NANS with tree_expr_maybe_nan_p.
15560 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
15561 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
15562 * tree-ssa-reassoc.c (eliminate_using_constants): Update
15563 call to fold_real_zero_addition_p.
15565 2021-06-11 Richard Biener <rguenther@suse.de>
15567 PR tree-optimization/101025
15568 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
15569 all refs that require dependence checking.
15571 2021-06-11 Richard Biener <rguenther@suse.de>
15573 PR tree-optimization/101028
15574 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
15575 reassoc discovery fails fatally, mark appropriate lanes
15578 2021-06-11 Richard Biener <rguenther@suse.de>
15580 PR tree-optimization/101026
15581 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
15582 have a representative for the associated chain nodes.
15584 2021-06-11 Jakub Jelinek <jakub@redhat.com>
15586 PR rtl-optimization/101008
15587 * simplify-rtx.c (relational_result): New function.
15588 (simplify_logical_relational_operation,
15589 simplify_relational_operation): Use it.
15591 2021-06-11 Jakub Jelinek <jakub@redhat.com>
15594 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
15596 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
15599 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
15600 false if the permutation can be implemented with constant
15601 permutation instruction in wider mode.
15602 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
15603 Handle V8QImode and V4HImode.
15605 2021-06-11 Martin Liska <mliska@suse.cz>
15607 PR gcov-profile/100788
15608 * common.opt: Add new option.
15609 * coverage.c (coverage_begin_function): Emit warning instead on
15610 the internal compiler error.
15611 * doc/invoke.texi: Document the option.
15612 * toplev.c (process_options): Enable it by default.
15614 2021-06-11 Richard Biener <rguenther@suse.de>
15616 PR middle-end/101009
15617 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
15618 to set *init_b to true when we encounter a constant equal
15620 (compute_affine_dependence): Also dump the actual DR_REF.
15622 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
15624 PR tree-optimization/100984
15625 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
15626 replacements table.
15627 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
15629 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
15631 * config/rs6000/rs6000.md
15632 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
15633 define_insn_and_split.
15635 2021-06-11 Richard Biener <rguenther@suse.de>
15637 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
15638 to sort operands of the associative chain.
15640 2021-06-11 Richard Biener <rguenther@suse.de>
15642 * system.h (gcc_stablesort_r): Declare.
15643 * sort.cc (gcc_sort_r): Support stable sort.
15644 (gcc_stablesort_r): Define.
15645 * vec.h (vec<>::stablesort): Add.
15647 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
15650 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
15651 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
15652 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
15653 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
15654 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
15655 (expand_vec_perm_interleave2): Handle 64bit modes.
15656 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
15657 (expand_vec_perm_even_odd_1): Ditto.
15658 (ix86_vectorize_vec_perm_const): Ditto.
15659 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
15660 * config/i386/sse.md: ... here.
15661 * config/i386/mmx.md (*vec_interleave_lowv2sf):
15662 New insn_and_split pattern.
15663 (*vec_interleave_highv2sf): Ditto.
15664 (mmx_pshufbv8qi3): New insn pattern.
15665 (*mmx_pblendw): Ditto.
15667 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
15669 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
15670 (build_acc): Likewise.
15671 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
15672 source operands in little-endian mode.
15673 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
15674 (mma_init_builtins): Likewise.
15675 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
15676 ordering for the MMA assemble and build source operands.
15677 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
15679 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
15682 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
15684 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
15686 * config/h8300/extensions.md: Replace _clobber_flags patterns
15689 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
15691 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
15692 (vcond_mask_<mode><tointvec>): this.
15694 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
15695 Thomas Schwinge <thomas@codesourcery.com>
15697 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
15698 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
15699 * gimple.h (enum gf_mask): Split
15700 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
15701 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
15702 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
15703 (is_gimple_omp_oacc): Update.
15704 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
15705 * gimplify.c (gimplify_omp_target_update): Likewise.
15706 * omp-expand.c (expand_omp_target, build_omp_regions_1)
15707 (omp_make_gimple_edges): Likewise.
15708 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
15711 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
15713 * value-query.cc (value_query::value_on_edge): Rename name to
15715 (range_query::range_on_edge): Same.
15716 (range_query::value_of_expr): Same.
15717 (range_query::value_on_edge): Same.
15718 * value-query.h (class value_query): Same.
15719 (class range_query): Same.
15721 2021-06-10 Richard Biener <rguenther@suse.de>
15723 PR tree-optimization/101003
15724 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
15725 use the pattern stmt defs when linearizing a chain.
15727 2021-06-10 Jakub Jelinek <jakub@redhat.com>
15730 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
15731 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
15733 2021-06-10 Clement Chigot <clement.chigot@atos.net>
15735 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
15736 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
15738 2021-06-09 Andrew Pinski <apinski@marvell.com>
15740 PR tree-optimization/100925
15741 * match.pd (a ? CST1 : CST2): Limit transformations
15742 that would produce a negative to integeral types only.
15743 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
15745 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
15748 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
15750 * doc/tm.texi: Correctly update.
15752 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
15754 * doc/tm.texi: Correctly update.
15756 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
15759 * doc/tm.texi.in (Trampolines): Add a missing blank line.
15761 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
15764 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
15765 and -ftrampolines work only with Ada.
15766 * doc/tm.texi.in (Trampolines): Likewise.
15767 * doc/tm.texi: Regenerated.
15769 2021-06-09 Carl Love <cel@us.ibm.com>
15771 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
15772 Add define for new builtins.
15773 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
15774 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
15775 overloaded builtin definitions.
15776 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
15777 VSIGNEXTSD2Q): Add builtin expansions.
15778 (SIGNEXT): Add P10 overload definition.
15779 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
15780 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
15781 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
15782 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
15783 vsignextend_si_v2di)[VIlong]: Add define_expand.
15784 Make define_insn vsx_sign_extend_si_v2di visible.
15785 * doc/extend.texi: Add documentation for the vec_signexti,
15786 vec_signextll builtins and vec_signextq.
15788 2021-06-09 Carl Love <cel@us.ibm.com>
15790 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
15791 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
15792 __floattikf_sw, __floatuntikf_sw respectively.
15793 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
15794 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
15795 define_insn for mode IEEE 128.
15797 2021-06-09 Carl Love <cel@us.ibm.com>
15799 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
15800 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
15801 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
15802 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
15803 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
15804 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
15805 uses of VSX_TI to VEC_TI.
15807 2021-06-09 Carl Love <cel@us.ibm.com>
15809 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
15811 2021-06-09 Carl Love <cel@us.ibm.com>
15813 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
15815 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
15816 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
15817 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
15818 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
15819 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
15820 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
15821 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
15823 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
15824 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
15825 altivec_vrlqnm): New define_expands.
15826 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
15827 VCMPGTUT_P): Add macro expansions.
15828 (BU_P10V_AV_P): Add builtin predicate definition.
15829 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
15830 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
15831 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
15832 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
15833 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
15834 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
15835 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
15836 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
15837 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
15838 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
15839 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
15840 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
15841 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
15842 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
15843 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
15844 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
15845 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
15846 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
15847 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
15848 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
15849 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
15850 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
15851 P10V_BUILTIN_MODU_V1TI):
15852 New overloaded definitions.
15853 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
15854 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
15855 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
15856 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
15857 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
15858 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
15860 (altivec_init_builtins): New E_V1TImode case statement.
15861 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
15862 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
15863 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
15864 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
15865 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
15866 E_V1TImode]: New case statements.
15867 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
15868 value RS6000_BTI_bool_V1TI.
15869 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
15870 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
15871 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
15872 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
15873 vlshrv1ti3, vashrv1ti3): New define_expands.
15874 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
15875 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
15876 UNSPEC_VSX_MODUQ): New unspecs.
15877 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
15878 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
15880 (vcmpnet): New define_expand.
15881 * doc/extend.texi: Add documentation for the new builtins vec_rl,
15882 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
15883 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
15884 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
15885 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
15886 vec_any_ge, vec_any_le.
15888 2021-06-09 Carl Love <cel@us.ibm.com>
15890 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
15891 bug in argument generation.
15893 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
15895 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
15897 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
15898 remove <supf> iterator.
15899 (mve_vclzq_u<mode>): New.
15900 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
15901 (neon_vclz<mode): Move to ...
15902 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
15903 * config/arm/vec-common.md: ... here. Add support for MVE.
15905 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
15907 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
15908 (@mve_vrhaddq_<supf><mode): Likewise.
15909 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
15910 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
15911 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
15913 2021-06-09 imba-tjd <109224573@qq.com>
15915 * doc/invoke.texi: Fix typo.
15917 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
15919 PR middle-end/53267
15920 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
15921 Support evaluation of fmod/fmodf/fmodl at compile-time.
15923 2021-06-09 Richard Biener <rguenther@suse.de>
15925 PR tree-optimization/100981
15926 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
15927 gimple_get_lhs to also handle calls.
15928 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
15931 2021-06-09 Richard Biener <rguenther@suse.de>
15933 PR tree-optimization/97832
15934 * tree-vectorizer.h (_slp_tree::failed): New.
15935 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
15937 (_slp_tree::~_slp_tree): Free failed.
15938 (vect_build_slp_tree): Retain failed nodes and record
15939 matches in them, copying that back out when running
15940 into a cached fail. Dump start and end of discovery.
15941 (dt_sort_cmp): New.
15942 (vect_build_slp_tree_2): Handle associatable chains
15943 together doing more aggressive operand swapping.
15945 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
15948 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
15950 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
15951 Linux and GNU targets.
15953 2021-06-09 Richard Biener <rguenther@suse.de>
15955 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
15958 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
15960 * config/arc/arc.md (loop_end): Change it to
15961 define_insn_and_split.
15963 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
15965 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
15966 (machi): New pattern.
15967 (umaddhisi4): Use VMAC2HU instruction.
15968 (umachi): New pattern.
15970 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
15972 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
15973 * config/arc/arc.c (arc_split_move_p): New function.
15974 (arc_split_move): Clean up.
15975 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
15976 (movdf_insn): Likewise.
15977 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
15979 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
15982 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
15983 argument to "raw". Do not emit segment overrides when "raw" is true.
15985 2021-06-09 Martin Liska <mliska@suse.cz>
15987 * doc/gcov.texi: Create a proper JSON files.
15988 * doc/invoke.texi: Remove dots in order to make it a valid
15991 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
15993 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
15994 (insn_is_load_p): Use pattern_is_rotate64.
15995 (insn_is_swap_p): Likewise.
15996 (quad_aligned_load_p): Likewise.
15997 (const_load_sequence_p): Likewise.
15998 (replace_swapped_aligned_load): Likewise.
15999 (recombine_lvx_pattern): Likewise.
16000 (recombine_stvx_pattern): Likewise.
16002 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
16004 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
16005 fur_stmt source record.
16006 * gimple-range.cc (fur_source::get_operand): Generic range query.
16007 (fur_source::get_phi_operand): New.
16008 (fur_source::register_dependency): New.
16009 (fur_source::query): New.
16010 (class fur_edge): New. Edge source for operands.
16011 (fur_edge::fur_edge): New.
16012 (fur_edge::get_operand): New.
16013 (fur_edge::get_phi_operand): New.
16014 (fur_edge::query): New.
16015 (fur_stmt::fur_stmt): New.
16016 (fur_stmt::get_operand): New.
16017 (fur_stmt::get_phi_operand): New.
16018 (fur_stmt::query): New.
16019 (class fur_depend): New. Statement source and process dependencies.
16020 (fur_depend::fur_depend): New.
16021 (fur_depend::register_dependency): New.
16022 (class fur_list): New. List source for operands.
16023 (fur_list::fur_list): New.
16024 (fur_list::get_operand): New.
16025 (fur_list::get_phi_operand): New.
16026 (fold_range): New. Instantiate appropriate fur_source class and fold.
16027 (fold_using_range::range_of_range_op): Use new API.
16028 (fold_using_range::range_of_address): Ditto.
16029 (fold_using_range::range_of_phi): Ditto.
16030 (imple_ranger::fold_range_internal): Use fur_depend class.
16031 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
16032 * gimple-range.h (class fur_source): Now a base class.
16033 (class fur_stmt): New.
16034 (fold_range): New prototypes.
16035 (fur_source::fur_source): Delete.
16037 2021-06-08 Andrew Pinski <apinski@marvell.com>
16039 PR tree-optimization/25290
16040 * tree-ssa-phiopt.c (xor_replacement): Delete.
16041 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
16042 (match_simplify_replacement): Allow one cheap preparation
16043 statement that can be moved to before the if.
16045 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
16047 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
16048 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
16050 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
16052 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
16053 Create length attribute on define_insn_and_split. Only split for cases which we
16055 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
16056 fix length computation.
16057 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
16059 2021-06-08 Richard Biener <rguenther@suse.de>
16061 PR tree-optimization/100923
16062 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
16063 the operand vector to be valueized.
16064 (valueize_refs): Likewise.
16065 (valueize_shared_reference_ops_from_ref): Adjust.
16066 (valueize_shared_reference_ops_from_call): Likewise.
16067 (vn_reference_lookup_3): Likewise.
16068 (vn_reference_lookup_pieces): Likewise. Re-valueize
16069 with honoring availability when we are about to create
16070 the ao_ref and valueized before.
16071 (vn_reference_lookup): Likewise.
16072 (vn_reference_insert_pieces): Adjust.
16074 2021-06-08 Richard Biener <rguenther@suse.de>
16076 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
16077 (_slp_instance::root_stmts): ... a vector.
16078 (SLP_INSTANCE_ROOT_STMT): Rename to ...
16079 (SLP_INSTANCE_ROOT_STMTS): ... this.
16080 (slp_root::root): Change to...
16081 (slp_root::roots): ... a vector.
16082 (slp_root::slp_root): Adjust.
16083 * tree-vect-slp.c (_slp_instance::location): Adjust.
16084 (vect_free_slp_instance): Release the root stmt vector.
16085 (vect_build_slp_instance): Adjust.
16086 (vect_analyze_slp): Likewise.
16087 (_bb_vec_info::~_bb_vec_info): Likewise.
16088 (vect_slp_analyze_operations): Likewise.
16089 (vect_bb_vectorization_profitable_p): Likewise. Adjust
16090 costs for the root stmt.
16091 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
16093 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
16095 (vectorize_slp_instance_root_stmt): Adjust.
16096 (vect_schedule_slp): Likewise.
16098 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
16100 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
16101 (ssa_equiv_stack::ssa_equiv_stack): New.
16102 (ssa_equiv_stack::~ssa_equiv_stack): New.
16103 (ssa_equiv_stack::enter): New.
16104 (ssa_equiv_stack::leave): New.
16105 (ssa_equiv_stack::push_replacement): New.
16106 (ssa_equiv_stack::get_replacement): New.
16107 (is_pointer_ssa): New.
16108 (class pointer_equiv_analyzer): New.
16109 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
16110 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
16111 (pointer_equiv_analyzer::set_global_equiv): New.
16112 (pointer_equiv_analyzer::set_cond_equiv): New.
16113 (pointer_equiv_analyzer::get_equiv): New.
16114 (pointer_equiv_analyzer::enter): New.
16115 (pointer_equiv_analyzer::leave): New.
16116 (pointer_equiv_analyzer::get_equiv_expr): New.
16117 (pta_valueize): New.
16118 (pointer_equiv_analyzer::visit_stmt): New.
16119 (pointer_equiv_analyzer::visit_edge): New.
16120 (hybrid_folder::value_of_expr): Call PTA.
16121 (hybrid_folder::value_on_edge): Same.
16122 (hybrid_folder::pre_fold_bb): New.
16123 (hybrid_folder::post_fold_bb): New.
16124 (hybrid_folder::pre_fold_stmt): New.
16125 (rvrp_folder::pre_fold_bb): New.
16126 (rvrp_folder::post_fold_bb): New.
16127 (rvrp_folder::pre_fold_stmt): New.
16128 (rvrp_folder::value_of_expr): Call PTA.
16129 (rvrp_folder::value_on_edge): Same.
16131 2021-06-08 Jakub Jelinek <jakub@redhat.com>
16134 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
16135 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
16137 2021-06-08 Richard Biener <rguenther@suse.de>
16139 PR middle-end/100951
16140 * tree-vect-generic.c (expand_vector_piecewise): Build a
16141 VECTOR_CST if all elements are constant.
16142 (expand_vector_condition): Likewise.
16143 (lower_vec_perm): Likewise.
16144 (expand_vector_conversion): Likewise.
16146 2021-06-08 Martin Liska <mliska@suse.cz>
16148 * doc/invoke.texi: Document new param evrp-sparse-threshold.
16150 2021-06-08 Martin Liska <mliska@suse.cz>
16152 * genautomata.c (create_automata): Fix typo.
16154 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
16156 PR tree-optimization/100794
16157 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
16158 allow_unroll_p and only allow unrolling when it's true.
16159 (tree_predictive_commoning): Add parameter allow_unroll_p and
16161 (run_tree_predictive_commoning): Likewise.
16162 (pass_predcom::gate): Check flag_tree_loop_vectorize and
16163 global_options_set.x_flag_predictive_commoning.
16164 (pass_predcom::execute): Adjust for allow_unroll_p.
16166 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
16168 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
16169 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
16170 lambda function cleanup, remove scev_reset call, and adjust return
16172 (tree_predictive_commoning): Adjust for different changed values,
16173 only set flag TODO_update_ssa_only_virtuals if changed.
16174 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
16175 from todo_flags_finish.
16177 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
16179 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
16180 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
16181 (sbr_sparse_bitmap::bitmap_set_quad): New.
16182 (sbr_sparse_bitmap::bitmap_get_quad): New.
16183 (sbr_sparse_bitmap::set_bb_range): New.
16184 (sbr_sparse_bitmap::get_bb_range): New.
16185 (sbr_sparse_bitmap::bb_range_p): New.
16186 (block_range_cache::block_range_cache): initialize bitmap obstack.
16187 (block_range_cache::~block_range_cache): Destruct obstack.
16188 (block_range_cache::set_bb_range): Decide when to utilze the
16189 sparse on entry cache.
16190 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
16191 * params.opt (-param=evrp-sparse-threshold): New.
16193 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
16195 * bitmap.c (bitmap_set_aligned_chunk): New.
16196 (bitmap_get_aligned_chunk): New.
16197 (test_aligned_chunk): New.
16198 (bitmap_c_tests): Call test_aligned_chunk.
16199 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
16201 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
16204 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
16206 (ix86_expand_vector_init_one_nonzero): Ditto.
16207 (ix86_expand_vector_init_one_var): Ditto.
16208 (ix86_expand_vector_init_general): Ditto.
16209 * config/i386/mmx.md (vec_initv4qiqi): New expander.
16211 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
16213 * config/h8300/movepush.md: Change most _clobber_flags
16214 patterns to instead use <cczn> subst.
16215 (movsi_cczn): New pattern with usable CC cases split out.
16216 (movsi_h8sx_cczn): Likewise.
16218 2021-06-07 Martin Liska <mliska@suse.cz>
16220 * common/common-target.def: Split long lines and replace them
16222 * target.def: Likewise.
16223 * doc/tm.texi: Re-generated.
16225 2021-06-07 Jakub Jelinek <jakub@redhat.com>
16228 * fold-const.c (fold_read_from_vector): Return NULL if trying to
16229 read from a CONSTRUCTOR with vector type elements.
16231 2021-06-07 Jakub Jelinek <jakub@redhat.com>
16233 PR middle-end/100898
16234 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
16235 should copy any arguments. Don't call gimple_call_num_args
16236 on id->call_stmt or call_stmt more than once.
16238 2021-06-07 liuhongt <hongtao.liu@intel.com>
16241 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
16243 (<insn>v4siv4di2): Delete constraints for define_expand.
16245 2021-06-07 liuhongt <hongtao.liu@intel.com>
16248 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
16249 assignment of cfun->machine->has_explicit_vzeroupper.
16250 * config/i386/i386-features.c
16251 (ix86_add_reg_usage_to_vzerouppers): Delete.
16252 (ix86_add_reg_usage_to_vzeroupper): Ditto.
16253 (rest_of_handle_insert_vzeroupper): Remove
16254 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
16256 (gate): Remove cfun->machine->has_explicit_vzeroupper.
16257 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
16259 * config/i386/i386.c (ix86_insn_callee_abi): New function.
16260 (ix86_initialize_callee_abi): Ditto.
16261 (ix86_expand_avx_vzeroupper): Ditto.
16262 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
16264 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
16265 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
16267 * config/i386/i386.h (struct GTY(()) machine_function): Delete
16268 has_explicit_vzeroupper.
16269 * config/i386/i386.md (enum unspec): New member
16271 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
16272 define_constants for insn callee abi index.
16273 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
16274 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
16275 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
16276 (*avx_vzeroupper): Rename to ..
16277 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
16278 call_insn which has a special vzeroupper ABI.
16279 (*avx_vzeroupper_1): Deleted.
16281 2021-06-07 liuhongt <hongtao.liu@intel.com>
16284 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
16285 it won't use stack pointer reg.
16286 * final.c (leaf_function_p): When call_insn is a fake call, it
16287 won't affect caller as a leaf function.
16288 * reg-stack.c (callee_clobbers_any_stack_reg): New.
16289 (subst_stack_regs): When call_insn doesn't clobber any stack
16290 reg, don't clear the arguments.
16291 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
16293 * shrink-wrap.c (requires_stack_frame_p): No need for stack
16294 frame for a fake call.
16295 * rtl.h (FAKE_CALL_P): New macro.
16297 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
16299 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
16301 (sparc_order_regs_for_local_alloc): ...this.
16302 (sparc_leaf_reg_remap): Declare.
16303 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
16304 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
16305 * config/sparc/sparc.c (leaf_reg_remap): Delete.
16306 (order_regs_for_local_alloc): Rename to...
16307 (sparc_order_regs_for_local_alloc): ...this.
16308 (sparc_leaf_reg_remap): New function.
16309 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
16311 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
16313 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
16314 Use assemble_name to output BSS section name.
16316 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
16318 * config/i386/constraints.md (Bs):
16319 Remove boolean operators from match_test RTX.
16322 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
16325 2021-06-06 Martin Liska <mliska@suse.cz>
16327 * doc/extend.texi: Add missing @headitem.
16328 * doc/invoke.texi: Likewise.
16329 * doc/objc.texi: Likewise.
16331 2021-06-06 Martin Liska <mliska@suse.cz>
16333 * genhooks.c (emit_findices): Remove unused function.
16334 (emit_documentation): Do not call emit_findices
16335 and do not search for @Fcode directives.
16337 2021-06-06 Martin Liska <mliska@suse.cz>
16339 * doc/invoke.texi: Remove extra character.
16341 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16343 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
16345 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16347 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
16348 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
16349 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
16350 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
16352 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16354 * config/or1k/or1k.md (*movdi): Fix empty split condition.
16356 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16358 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
16361 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16363 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
16364 *zero_extendsidi2): Fix empty split condition.
16366 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
16368 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
16370 * config/h8300/bitfield.md: Likewise.
16371 * config/h8300/combiner.md: Likewise.
16372 * config/h8300/divmod.md: Likewise.
16373 * config/h8300/extensions.md: Likewise.
16374 * config/h8300/jumpcall.md: Likewise.
16375 * config/h8300/movepush.md: Likewise.
16376 * config/h8300/multiply.md: Likewise.
16377 * config/h8300/other.md: Likewise.
16378 * config/h8300/shiftrotate.md: Likewise.
16379 * config/h8300/logical.md: Likewise. Fix split pattern to use
16380 code iterator that somehow slipped through.
16382 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
16384 PR middle-end/100905
16385 * tree-nested.c (convert_nonlocal_omp_clauses,
16386 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
16388 2021-06-04 Martin Sebor <msebor@redhat.com>
16390 PR middle-end/100732
16391 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
16392 with either source or destination argument of invalid type.
16393 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
16394 calls with arguments of invalid type.
16396 2021-06-04 Martin Sebor <msebor@redhat.com>
16398 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
16400 (attr_access::vla_bounds): Also handle VLA bounds.
16402 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
16404 * config/i386/predicates.md (GOT_memory_operand):
16405 Implement using match_code RTXes.
16406 (GOT32_symbol_operand): Ditto.
16408 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
16411 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
16413 (ix86_expand_vector_init_general): Ditto.
16414 Use SImode instead of word_mode for logic operations
16415 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
16416 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
16417 implemented by expand_vec_perm_1.
16418 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
16419 should be implemented using standard shuffle patterns.
16420 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
16421 V2HI modes to modes, implementable with shuffle for one operand.
16422 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
16423 (*pshufw_1): New insn pattern.
16424 (*vec_dupv2hi): Ditto.
16425 (vec_initv2hihi): New expander.
16427 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
16429 * config/arm/vfp.md (no_literal_pool_df_immediate,
16430 no_literal_pool_sf_immediate): Fix empty split condition.
16432 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
16434 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
16435 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
16436 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
16437 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
16438 *<sse4_1_avx2>_pblendvb_lt): Likewise.
16440 2021-06-04 Jakub Jelinek <jakub@redhat.com>
16443 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
16444 concatenation from half-sized modes with TImode elements.
16446 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
16448 * config/arc/arc.c (arc_override_options): Disable millicode
16449 thunks when RF16 is on.
16451 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
16453 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
16455 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
16457 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
16458 Replace PROMOTE_MODE marco with its content.
16460 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
16462 * config/cris/cris.md (*addi_reload): Fix empty split condition.
16464 2021-06-03 Jim Wilson <jimw@sifive.com>
16466 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
16467 turn it on for all riscv targets.
16469 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
16472 * config/i386/i386-expand.c (ix86_expand_vector_set):
16473 Handle V2HI and V4QI modes.
16474 (ix86_expand_vector_extract): Ditto.
16475 * config/i386/mmx.md (*pinsrw): New insn pattern.
16478 (*pextrw_zext): Ditto.
16480 (*pextrb_zext): Ditto.
16481 (vec_setv2hi): New expander.
16482 (vec_extractv2hihi): Ditto.
16483 (vec_setv4qi): Ditto.
16484 (vec_extractv4qiqi): Ditto.
16485 (vec_setv8qi): Enable only for TARGET_SSE4_1.
16486 (vec_extractv8qiqi): Ditto.
16488 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
16490 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
16491 order to subf instruction.
16492 * config/rs6000/fusion.md: Regenerate.
16494 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
16496 * calls.c (get_size_range): Use range_of_expr instead of
16497 determine_value_range.
16498 * tree-affine.c (expr_to_aff_combination): Same.
16499 * tree-data-ref.c (split_constant_offset): Same.
16500 * tree-vrp.c (determine_value_range_1): Remove.
16501 (determine_value_range): Remove.
16502 * tree-vrp.h (determine_value_range): Remove.
16504 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
16506 * function-tests.c (test_ranges): Call gimple_range_tests.
16507 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
16509 * gimple-range.cc (fur_source::get_operand): Do not call
16510 get_tree_range or gimple_range_global.
16512 (get_tree_range): Move to value-query.cc.
16513 Call get_arith_expr_range.
16514 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
16515 Include gimple-range-tests.cc.
16516 * gimple-range.h (fold_range): Add argument.
16517 (get_tree_range): Remove.
16518 * selftest.h (gimple_range_tests): New.
16519 * value-query.cc (global_range_query::range_of_expr): Add
16521 (range_query::get_tree_range): Move from gimple-range.cc.
16522 * value-query.h (class range_query): Add get_tree_range and
16523 get_arith_expr_range. Make fur_source a friend.
16524 * vr-values.c (vr_values::range_of_expr): Pass stmt to
16526 * gimple-range-tests.cc: New file.
16528 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
16530 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
16531 update_global_range.
16532 * value-query.cc (update_global_range): New.
16533 * value-query.h (update_global_range): New.
16535 2021-06-03 David Malcolm <dmalcolm@redhat.com>
16537 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
16538 printing the same location twice if there are fix-it hints,
16539 multiple locations, or a label.
16541 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
16543 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
16544 thresholds to narrow the upper bound on epilogue iterations.
16546 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
16548 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
16549 (mve_vabsq_s<mode>): Likewise.
16550 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
16551 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
16552 * config/arm/vec-common.md (neg<mode>2): Rename to
16553 <absneg_str><mode>2.
16555 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
16557 * common/config/arc/arc-common.c (arc_option_optimization_table):
16558 Remove malign-call.
16559 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
16560 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
16561 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
16562 * config/arc/arc.md (abssi2_mixed): Remove pattern.
16563 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
16564 (malign-call): Likewise.
16565 (mmixed-code): Likewise.
16566 * doc/invoke.texi (ARC): Update doc.
16568 2021-06-03 Martin Liska <mliska@suse.cz>
16570 * common.opt: Use proper Enum values.
16571 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
16572 (parse_sanitizer_options): Handle only sanitizer_opts.
16573 (common_handle_option): Just assign value.
16575 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
16578 * tree-inline.c (inline_forbidden_p): Remove test on return type.
16580 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
16582 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
16583 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
16584 (type_byte_size): Inline into...
16585 (add_byte_size_attribute): ...this and call add_scalar_info.
16587 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
16589 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
16590 (typed_binop_from_tree): New function.
16591 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
16592 turn a divide by a power of 2 into a shift.
16593 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
16594 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
16595 typed divide by calling typed_binop_from_tree.
16597 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
16599 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
16600 (is_handled_procedure_type): Likewise.
16601 (struct loc_descr_context): Add strict_signedness field.
16602 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
16603 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
16604 (resolve_args_picking): Minor tweak.
16605 (function_to_dwarf_procedure): Initialize strict_signedness field.
16606 (type_byte_size): Likewise.
16607 (field_byte_offset): Likewise.
16608 (gen_descr_array_type_die): Likewise.
16609 (gen_variant_part): Likewise.
16610 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
16611 to true when a context is present before evaluating the arguments.
16612 <COND_EXPR>: Do not generate a useless comparison with zero.
16613 When dereferencing an address, if strict_signedness is true and the
16614 type is small and signed, use DW_OP_deref_type to do the dereference
16615 and then DW_OP_convert to convert back to the generic type.
16617 2021-06-03 Jakub Jelinek <jakub@redhat.com>
16620 * tree-inline.c (copy_tree_body_r): Handle iterators on
16621 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
16623 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
16625 * config/arc/arc.md (*bbit_di): Remove.
16627 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
16629 PR rtl-optimization/100264
16630 * ree.c (get_sub_rtx): Ignore SET expressions without register
16631 destinations and remove assertion, as it is not valid anymore
16632 with this new behaviour.
16633 (merge_def_and_ext): Eliminate destination check for register
16634 as such SET expressions can't occur anymore.
16635 (combine_reaching_defs): Likewise.
16637 2021-06-02 Jakub Jelinek <jakub@redhat.com>
16640 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
16641 -Wtype-limits warnings.
16642 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
16643 in operands to avoid -Wsign-compare warnings.
16645 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
16647 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
16650 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
16652 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
16654 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
16656 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
16658 * config/s390/subst.md(cconly_subst): Use a single constraint
16659 in (match_scratch).
16661 2021-06-02 Martin Liska <mliska@suse.cz>
16663 * ipa-icf.h: Use auto_vec for memory_access_types.
16665 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
16667 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
16668 argument from prototype.
16669 (output_logical_op): Add rtx_code argument.
16670 (compute_logical_op_length): Likewise.
16671 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
16672 to compute_a_shift_length.
16673 (output_logical_op); New argument with the rtx code rather than
16674 extracting it from an operand. Handle QImode too.
16675 (compute_logical_op_length): Similary.
16676 (compute_a_shift_length): Drop unused argument.
16677 * config/h8300/h8300.md (logicals): New code iterator.
16678 * config/h8300/logical.md (<code><mode>3 expander): Combine
16679 the "and" expander with the "ior"/"xor" expander.
16680 (bclr<mode>msx): Combine the QI/HI mode patterns.
16681 (<logical><mode>3 insns): Use code iterator rather than match_operator.
16682 Handle QImode as well. Update call to output_logical_op and
16683 compute_logical_op_length to pass in rtx_code
16684 Fix split condition on all define_insn_and_split patterns.
16685 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
16686 the flags and setting ZN via existing define_subst.
16687 * config/h8300/shiftrotate.md: Drop unused argument from
16688 calls to compute_a_shift_length.
16689 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
16691 2021-06-01 Andrew Pinski <apinski@marvell.com>
16693 PR tree-optimization/25290
16694 * tree-ssa-phiopt.c (match_simplify_replacement):
16696 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
16697 (two_value_replacement): Change the comment about
16698 conditional_replacement.
16699 (conditional_replacement): Delete.
16701 2021-06-01 Andrew Pinski <apinski@marvell.com>
16703 PR tree-optimization/95481
16704 * tree-tailcall.c (find_tail_calls): Handle empty typed
16707 2021-06-01 Andrew Pinski <apinski@marvell.com>
16709 * gimplify.c (zero_sized_field_decl): Delete
16710 (zero_sized_type): Delete
16711 (gimplify_init_ctor_eval): Use is_empty_type instead
16712 of zero_sized_field_decl.
16713 (gimplify_modify_expr): Use is_empty_type instead of
16716 2021-06-01 Jason Merrill <jason@redhat.com>
16719 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
16721 2021-06-01 Jason Merrill <jason@redhat.com>
16724 * diagnostic.h (warning_enabled_at): Declare.
16725 * diagnostic.c (diagnostic_enabled): Factor out from...
16726 (diagnostic_report_diagnostic): ...here.
16727 (warning_enabled_at): New.
16729 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
16731 * gimple-ssa-evrp.c: Enable exporting of global ranges.
16733 2021-06-01 Martin Liska <mliska@suse.cz>
16736 * doc/invoke.texi: Mention that -fgcse-after-reload
16737 is enabled with -O3.
16739 2021-06-01 liuhongt <hongtao.liu@intel.com>
16741 PR tree-optimization/98365
16742 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
16743 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
16744 (convert_scalar_cond_reduction): Ditto.
16745 (predicate_scalar_phi): Ditto.
16747 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
16749 PR tree-optimization/100781
16750 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
16751 value calculation by default.
16752 (ranger_cache::enable_new_values): New.
16753 (ranger_cache::disable_new_values): New.
16754 (ranger_cache::push_poor_value): Check if new values are allowed.
16755 * gimple-range-cache.h (class ranger_cache): New member/methods.
16756 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
16757 statement, and disable/renable new value calculation.
16759 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
16761 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
16762 (ranger_cache::range_of_def): New.
16763 (ranger_cache::entry_range): New.
16764 (ranger_cache::exit_range): New.
16765 (ranger_cache::range_of_expr): Adjust.
16766 (ranger_cache::range_on_edge): Adjust.
16767 (ranger_cache::propagate_cache): Call exit_range directly.
16768 * gimple-range-cache.h (class ranger_cache): Adjust.
16770 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
16772 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
16773 gori_compute being a member rather than base class.
16774 dervied call to member call.
16775 (ranger_cache::dump): No longer dump gori_map.
16776 (ranger_cache::dump_bb): New.
16777 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
16778 being a member rather than base class.
16779 (ranger_cache::set_global_range): Ditto.
16780 (ranger_cache::ssa_range_in_bb): Ditto.
16781 (ranger_cache::range_of_expr): New.
16782 (ranger_cache::range_on_edge): New.
16783 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
16784 (ranger_cache::propagate_cache): Adjust debugging output.
16785 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
16787 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
16788 member, and inherit from range_query instead.
16789 (ranger_cache::dump_bb): New. split from dump.
16790 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
16791 (gori_compute::expr_range_at_stmt): Delete.
16792 (gori_compute::compute_name_range_op): Delete.
16793 (gori_compute::compute_operand_range_switch): Add fur_source.
16794 (gori_compute::compute_operand_range): Add fur_source param, inline
16795 old compute_name_range_op and optimize_logical_operands.
16796 (struct tf_range): Delete.
16797 (gori_compute::logical_combine): Adjust
16798 (gori_compute::optimize_logical_operands): Delete.
16799 (gori_compute::compute_logical_operands_in_chain): Delete.
16800 (gori_compute::compute_logical_operands): Adjust.
16801 (gori_compute::compute_operand1_range): Adjust to fur_source.
16802 (gori_compute::compute_operand2_range): Ditto.
16803 (gori_compute::compute_operand1_and_operand2_range): Ditto.
16804 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
16805 and adjust to fur_source.
16806 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
16807 range_query and fur_source.
16808 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
16809 from the ranger_cache..
16810 (gimple_ranger::fold_range_internal): Adjust to base class change of
16812 (gimple_ranger::dump_bb): Adjust dump.
16813 * gimple-range.h (gimple_ranger):export gori computes object.
16815 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
16817 PR tree-optimization/100774
16818 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
16819 Constant values are also not stale.
16820 (ranger_cache::set_global_range): Range invariant values should also
16821 have the correct timestamp.
16823 2021-05-31 Martin Liska <mliska@suse.cz>
16825 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
16826 Unpack FUNCTION_DECL_DECL_TYPE.
16827 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
16828 Stream FUNCTION_DECL_DECL_TYPE instead of
16829 DECL_IS_OPERATOR_NEW_P.
16830 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
16832 (DECL_IS_OPERATOR_NEW_P): Likewise.
16833 (DECL_IS_OPERATOR_DELETE_P): Likewise.
16834 (DECL_LAMBDA_FUNCTION_P): Likewise.
16836 2021-05-31 Richard Biener <rguenther@suse.de>
16839 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
16840 * internal-fn.def (SHUFFLEVECTOR): New.
16841 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
16842 * doc/extend.texi: Document __builtin_shufflevector.
16844 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
16847 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
16848 indexed form addresses.
16850 2021-05-29 Jeff Law <jlaw@tachyum.com>
16852 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
16853 parameter. Call callers fixed.
16855 (output_plussi): Add FALLTHRU markers.
16856 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
16858 2021-05-29 Jakub Jelinek <jakub@redhat.com>
16860 PR middle-end/99928
16861 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
16862 combined with parallel, make sure to add shared clause to
16863 parallel for explicit linear clause.
16865 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
16867 PR tree-optimization/100787
16868 * gimple-ssa-evrp.c: Disable exporting of global ranges.
16870 2021-05-28 Jason Merrill <jason@redhat.com>
16872 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
16873 operator--, operator*, operator==, and operator!=.
16874 (class tsi_range): New.
16876 2021-05-28 Richard Biener <rguenther@suse.de>
16878 PR tree-optimization/100778
16879 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
16880 trapping ops in different BBs.
16882 2021-05-28 Richard Biener <rguenther@suse.de>
16885 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
16886 copy fntype from original call.
16888 2021-05-28 Martin Liska <mliska@suse.cz>
16890 PR gcov-profile/100751
16891 * doc/gcov.texi: Revert partially a hunk that was wrong.
16893 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
16895 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
16897 (HAVE_sync_compare_and_swaphi): Likewise.
16898 (HAVE_sync_compare_and_swapsi): Likewise.
16900 2021-05-28 Jakub Jelinek <jakub@redhat.com>
16902 PR middle-end/99928
16903 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
16905 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
16907 * gimplify.c (gimplify_omp_affinity): New.
16908 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
16909 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
16910 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
16911 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
16912 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
16914 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
16915 Richard Biener <rguenther@suse.de>
16917 * match.pd <popcount & / + pattern matching>:
16918 When generating popcount directly fails, try doing it in two halves.
16920 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
16922 * Makefile.in (generated_files): Add gimple-match.c and
16925 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
16927 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
16929 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
16931 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
16933 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
16935 PR tree-optimization/99398
16936 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
16937 where the fed operands are CTOR/CST and propagated through
16938 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
16939 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
16941 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
16944 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
16946 * config/i386/mmx.md (addv2sf3): Do not call
16947 ix86_fixup_binary_operands_no_copy.
16950 (<smaxmin:code>v2sf3): Ditto.
16951 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
16952 (<plusminus:insn><VI_32:mode>3): Remove expander.
16953 (<plusminus:insn><VI_32:mode>3): Rename from
16954 "*<plusminus:insn><VI_32:mode>3".
16955 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
16956 (mulv2hi3): Remove expander.
16957 (mulv2hi3): Rename from *mulv2hi3.
16958 (<s>mulv2hi3_highpart): Remove expander.
16959 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
16960 (<smaxmin:code><MMXMODE14:mode>3): Rename from
16961 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
16962 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
16963 (SMAXMIN_MMXMODEI): Remove mode iterator.
16964 (<smaxmin:code>v4hi3): New expander.
16965 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
16966 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
16967 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
16968 (SMAXMIN_VI_32): Remove mode iterator.
16969 (<umaxmin:code><MMXMODE24:mode>3): Rename from
16970 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
16971 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
16972 (UMAXMIN_MMXMODEI): Remove mode iterator.
16973 (<umaxmin:code>v8qi3): New expander.
16974 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
16975 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
16976 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
16977 (UMAXMIN_VI_32): Remove mode iterator.
16978 (<any_shift:insn>v2hi3): Remove expander.
16979 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
16980 (<any_logic:code><MMXMODEI:mode>3): Do not call
16981 ix86_fixup_binary_operands_no_copy.
16982 (<any_logic:code><VI_32:mode>3): Remove expander.
16983 (<any_logic:code><VI_32:mode>3): Rename from
16984 "*<any_logic:code><VI_32:mode>3".
16985 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
16986 * config/i386/sse.md (div<VF2:mode>3): Do not call
16987 ix86_fixup_binary_operands_no_copy.
16988 (div<VF1:mode>3): Ditto.
16989 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
16990 (smulhrsv4hi3): Ditto.
16991 (smulhrsv2hi3): Ditto.
16993 2021-05-27 Martin Sebor <msebor@redhat.com>
16995 * ggc.h (gt_ggc_mx): Add overloads for all integers.
16997 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
16999 (hash_map::operator==): New function.
17001 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
17004 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
17005 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
17006 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
17007 (*xop_maskcmp<VI_32:mode>3): Ditto.
17008 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
17009 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
17011 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
17014 * config/arm/arm.c (arm_configure_build_target): Remove parameter
17015 opts_set, directly check opts parameters for being non-null.
17016 (arm_option_restore): Update call to arm_configure_build_target.
17017 (arm_option_override): Likewise.
17018 (arm_can_inline_p): Likewise.
17019 (arm_valid_target_attribute_tree): Likewise.
17020 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
17021 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
17024 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
17026 * vr-values.c (simplify_conversion_using_ranges): Use
17027 get_range_query instead of get_global_range_query.
17029 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
17031 * gimple-range.cc (get_range_global): Move to value-query.cc.
17032 (gimple_range_global): Same.
17033 (get_global_range_query): Same.
17034 (global_range_query::range_of_expr): Same.
17035 * gimple-range.h (class global_range_query): Move to
17037 (gimple_range_global): Same.
17038 * tree-ssanames.c (get_range_info): Move to value-query.cc.
17039 (get_ptr_nonnull): Same.
17040 * tree-ssanames.h (get_range_info): Remove.
17041 (get_ptr_nonnull): Remove.
17042 * value-query.cc (get_ssa_name_range_info): Move from
17044 (get_ssa_name_ptr_info_nonnull): Same.
17045 (get_range_global): Move from gimple-range.cc.
17046 (gimple_range_global): Same.
17047 (get_global_range_query): Same.
17048 (global_range_query::range_of_expr): Same.
17049 * value-query.h (class global_range_query): Move from
17051 (gimple_range_global): Same.
17053 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
17056 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
17057 (uavgv2hi3_ceil): Ditto.
17059 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
17062 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
17064 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17066 * tree-ssanames.c (get_range_info): Merge both copies of
17067 get_range_info into one that works with irange.
17068 * tree-ssanames.h (get_range_info): Remove version that works on
17071 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17073 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
17074 (expand_builtin_strnlen): Same.
17075 (determine_block_size): Same.
17076 * fold-const.c (expr_not_equal_to): Same.
17077 * gimple-fold.c (size_must_be_zero_p): Same.
17078 * gimple-match-head.c: Include gimple-range.h.
17079 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
17080 * gimple-ssa-warn-restrict.c
17081 (builtin_memref::extend_offset_range): Same.
17082 * graphite-sese-to-poly.c (add_param_constraints): Same.
17083 * internal-fn.c (get_min_precision): Same.
17084 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
17085 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
17087 * tree-data-ref.c (split_constant_offset): Same.
17088 (dr_step_indicator): Same.
17089 * tree-dfa.c (get_ref_base_and_extent): Same.
17090 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
17091 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
17092 (determine_value_range): Same.
17093 (record_nonwrapping_iv): Same.
17094 (infer_loop_bounds_from_signedness): Same.
17095 (scev_var_range_cant_overflow): Same.
17096 * tree-ssa-phiopt.c (two_value_replacement): Same.
17097 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
17098 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
17099 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
17101 (dump_strlen_info): Same.
17102 (set_strlen_range): Same.
17103 (maybe_diag_stxncpy_trunc): Same.
17104 (get_len_or_size): Same.
17105 (handle_integral_assign): Same.
17106 * tree-ssa-structalias.c (find_what_p_points_to): Same.
17107 * tree-ssa-uninit.c (find_var_cmp_const): Same.
17108 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
17109 * tree-vect-patterns.c (vect_get_range_info): Same.
17110 (vect_recog_divmod_pattern): Same.
17111 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
17112 (register_edge_assert_for_2): Same.
17113 (determine_value_range_1): Same.
17114 * tree.c (get_range_pos_neg): Same.
17115 * vr-values.c (vr_values::get_lattice_entry): Same.
17116 (vr_values::update_value_range): Same.
17117 (simplify_conversion_using_ranges): Same.
17119 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17121 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
17122 get_range_query instead of query argument.
17123 (pass_walloca::execute): Enable and disable global ranger.
17125 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17127 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
17129 (rvrp_folder::~rvrp_folder): Call disable_ranger.
17130 (hybrid_folder::hybrid_folder): Call enable_ranger.
17131 (hybrid_folder::~hybrid_folder): Call disable_ranger.
17133 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17135 * function.c (allocate_struct_function): Set cfun->x_range_query.
17136 * function.h (struct function): Declare x_range_query.
17137 (get_range_query): New.
17138 (get_global_range_query): New.
17139 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
17140 Remove call to safe_grow_cleared.
17141 * gimple-range.cc (get_range_global): New.
17142 (gimple_range_global): Move from gimple-range.h.
17143 (get_global_range_query): New.
17144 (global_range_query::range_of_expr): New.
17145 (enable_ranger): New.
17146 (disable_ranger): New.
17147 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
17148 (class global_range_query): New.
17149 (enable_ranger): New.
17150 (disable_ranger): New.
17151 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
17152 dump_all_value_ranges to dump.
17153 * tree-vrp.c (vrp_prop::finalize): Same.
17154 * value-query.cc (range_query::dump): New.
17155 * value-query.h (range_query::dump): New.
17156 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
17157 (vr_values::dump): ...this.
17158 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
17159 dump and make virtual.
17161 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
17163 * config/i386/i386.c (ix86_autovectorize_vector_modes):
17164 Add V4QImode and V16QImode for TARGET_SSE2.
17165 * doc/sourcebuild.texi (Vector-specific attributes):
17166 Add vect64 and vect32 description.
17168 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
17170 * gimple-range-gori.cc (range_def_chain::register_dependency):
17171 Resize m_def_chain when needed.
17173 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
17175 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
17176 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
17177 * config/arm/vec-common.md: .. here. Add support for MVE.
17179 2021-05-26 Jakub Jelinek <jakub@redhat.com>
17181 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
17183 * config/microblaze/microblaze.c (microblaze_legitimize_address,
17185 microblaze_option_override, print_operand): Likewise.
17186 * config/microblaze/microblaze.md (call_internal_plt,
17187 call_value_intern_plt, call_value_intern): Likewise.
17188 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
17189 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
17190 call_value_multiple_internal1): Likewise.
17191 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
17193 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
17195 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
17196 arc_ccfsm_advance, symbolic_reference_mentioned_p,
17197 arc_raw_symbolic_reference_mentioned_p): Remove register
17200 2021-05-26 Jakub Jelinek <jakub@redhat.com>
17203 * omp-low.c: Include omp-offload.h.
17204 (create_omp_child_function): If current_function_decl has
17205 "omp declare target" attribute and is_gimple_omp_offloaded,
17206 remove that attribute from the copy of attribute list and
17207 add "omp target entrypoint" attribute instead.
17208 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
17209 variables for offloading if in omp_maybe_offloaded_ctx.
17210 * omp-offload.c (pass_omp_target_link::execute): Nullify second
17211 argument to GOMP_target_data_ext in offloaded code.
17213 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17215 * config/csky/csky.c (csky_can_change_mode_class): Delete.
17216 For csky, HF/SF mode use the low bits of VREGS.
17218 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
17220 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
17221 DECL which is a reference for OMP.
17223 2021-05-26 Martin Liska <mliska@suse.cz>
17225 PR gcov-profile/100751
17226 * doc/gcov.texi: Document that __gcov_dump can be called just
17227 once and that __gcov_reset resets run-time counters.
17229 2021-05-26 Martin Liska <mliska@suse.cz>
17231 * doc/install.texi: Port relevant part from install-old.texi
17232 and re-generate list of CPUs and systems.
17234 2021-05-26 Martin Liska <mliska@suse.cz>
17236 * Makefile.in: Remove it.
17237 * doc/include/fdl.texi: Update next/previous chapters.
17238 * doc/install.texi: Likewise.
17239 * doc/install-old.texi: Removed.
17241 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17243 * config/csky/csky.c (ck810_legitimate_index_p): Support
17244 "base + index" with DF mode.
17245 * config/csky/constraints.md ("Y"): New constraint for memory operands
17246 without index register.
17247 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
17248 when mov between memory and general registers, and lower their priority.
17249 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
17251 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17253 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
17255 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17257 * config/csky/csky.md (untyped_call): Emit clobber for return
17258 registers to mark them used.
17260 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17262 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
17264 2021-05-26 Andrew Pinski <apinski@marvell.com>
17266 * match.pd (x < 0 ? ~y : y): New patterns.
17268 2021-05-26 Andrew Pinski <apinski@marvell.com>
17270 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
17271 A?POW2:0 and A?0:POW2.
17273 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17275 * gimple-range-gori.cc (class logical_stmt_cache): Delete
17276 (logical_stmt_cache::logical_stmt_cache ): Delete.
17277 (logical_stmt_cache::~logical_stmt_cache): Delete.
17278 (logical_stmt_cache::cache_entry::dump): Delete.
17279 (logical_stmt_cache::get_range): Delete.
17280 (logical_stmt_cache::cached_name ): Delete.
17281 (logical_stmt_cache::same_cached_name): Delete.
17282 (logical_stmt_cache::cacheable_p): Delete.
17283 (logical_stmt_cache::slot_diagnostics ): Delete.
17284 (logical_stmt_cache::dump): Delete.
17285 (gori_compute_cache::gori_compute_cache): Delete.
17286 (gori_compute_cache::~gori_compute_cache): Delete.
17287 (gori_compute_cache::compute_operand_range): Delete.
17288 (gori_compute_cache::cache_stmt): Delete.
17289 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
17291 (class gori_compute_cache): Delete.
17293 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17295 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
17297 (fold_using_range::range_of_address): Adjust.
17298 (fold_using_range::range_of_phi): Adjust.
17299 * gimple-range.h (class fur_source): Adjust.
17300 (fur_source::fur_source): Adjust.
17302 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17304 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
17305 from expr_range_in_bb and adjust.
17306 (gori_compute::compute_name_range_op): Adjust.
17307 (gori_compute::optimize_logical_operands): Adjust.
17308 (gori_compute::compute_logical_operands_in_chain): Adjust.
17309 (gori_compute::compute_operand1_range): Adjust.
17310 (gori_compute::compute_operand2_range): Adjust.
17311 (ori_compute_cache::cache_stmt): Adjust.
17312 * gimple-range-gori.h (gori_compute): Rename prototype.
17314 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17316 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
17317 checked only after range_of_stmt, not range_on_entry.
17318 (gimple_ranger::range_on_entry): Check for non-null in any
17319 predecessor block, if it is not already non-null.
17320 (gimple_ranger::range_on_exit): DOnt check for non-null after
17321 range on entry call.
17322 (gimple_ranger::dump_bb): New. Split from dump.
17323 (gimple_ranger::dump): Adjust.
17324 * gimple-range.h (class gimple_ranger): Adjust.
17326 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17328 * gimple-range-cache.cc (struct range_timestamp): Delete.
17329 (class temporal_cache): Adjust.
17330 (temporal_cache::get_timestamp): Delete.
17331 (temporal_cache::set_dependency): Delete.
17332 (temporal_cache::temporal_value): Adjust.
17333 (temporal_cache::current_p): Take dependencies as params.
17334 (temporal_cache::set_timestamp): Adjust.
17335 (temporal_cache::set_always_current): Adjust.
17336 (ranger_cache::get_non_stale_global_range): Adjust.
17337 (ranger_cache::register_dependency): Delete.
17338 * gimple-range-cache.h (class range_cache): Adjust.
17340 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17342 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
17344 (range_def_chain::~range_def_chain): Dispose of obstack rather than
17345 each individual bitmap.
17346 (range_def_chain::set_import): New.
17347 (range_def_chain::get_imports): New.
17348 (range_def_chain::chain_import_p): New.
17349 (range_def_chain::register_dependency): Rename from build_def_chain
17351 (range_def_chain::def_chain_in_bitmap_p): New.
17352 (range_def_chain::add_def_chain_to_bitmap): New.
17353 (range_def_chain::has_def_chain): Just check first depenedence.
17354 (range_def_chain::get_def_chain): Process imports, use generic
17355 register_dependency routine.
17356 (range_def_chain::dump): New.
17357 (gori_map::gori_map): Allocate import list.
17358 (gori_map::~gori_map): Release imports.
17359 (gori_map::exports): Check for past allocated block size.
17360 (gori_map::imports): New.
17361 (gori_map::def_chain_in_export_p): Delete.
17362 (gori_map::is_import_p): New.
17363 (gori_map::maybe_add_gori): Handle imports.
17364 (gori_map::dump): Adjust output, add imports.
17365 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
17366 (gori_export_iterator::gori_export_iterator): New.
17367 (gori_export_iterator::next): New.
17368 (gori_export_iterator::get_name): New.
17369 * gimple-range-gori.h (range_def_chain): Add imports and direct
17370 dependecies via struct rdc.
17371 (range_def_chain::depend1): New.
17372 (range_def_chain::depend2): New.
17373 (class gori_map): Adjust.
17374 (FOR_EACH_GORI_IMPORT_NAME): New.
17375 (FOR_EACH_GORI_EXPORT_NAME): New.
17376 (class gori_export_iterator): New.
17378 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17380 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
17381 export cache filling to here.
17382 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
17384 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17386 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
17387 (gori_map): Move to gimple-range-gori.h.
17388 (gori_compute::gori_compute): Adjust.
17389 (gori_compute::~gori_compute): Delete.
17390 (gori_compute::compute_operand_range_switch): Adjust.
17391 (gori_compute::compute_operand_range): Adjust.
17392 (gori_compute::compute_logical_operands): Adjust.
17393 (gori_compute::has_edge_range_p ): Adjust.
17394 (gori_compute::set_range_invariant): Delete.
17395 (gori_compute::dump): Adjust.
17396 (gori_compute::outgoing_edge_range_p): Adjust.
17397 * gimple-range-gori.h (class range_def_chain): Relocate here.
17398 (class gori_map): Relocate here.
17399 (class gori_compute): Inherit from gori_map, and adjust.
17401 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
17403 * value-range.cc (range_tests_legacy): Use
17404 build_nonstandard_integer_type instead of int and short.
17406 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
17408 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
17409 when really creating an initialization statement for it.
17411 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
17413 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
17415 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
17417 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
17419 2021-05-25 Martin Liska <mliska@suse.cz>
17421 PR tree-optimization/92860
17423 * optc-save-gen.awk: Remove exceptions.
17425 2021-05-25 Martin Liska <mliska@suse.cz>
17427 * asan.h (sanitize_coverage_p): New function.
17428 * doc/extend.texi: Document it.
17429 * fold-const.c (fold_range_test): Use sanitize_flags_p
17430 instead of flag_sanitize_coverage.
17431 (fold_truth_andor): Likewise.
17432 * sancov.c: Likewise.
17433 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
17434 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
17435 -fsanitize-coverage when inlining.
17437 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
17439 * config/csky/csky-modes.def : Fix copyright.
17441 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
17443 * config/csky/csky-modes.def : Amend copyright.
17444 * config/csky/csky_insn_fpuv2.md : Likewise.
17445 * config/csky/csky_insn_fpuv3.md : Likewise.
17447 2021-05-25 Richard Biener <rguenther@suse.de>
17449 PR middle-end/100727
17450 * calls.c (initialize_argument_information): Explicitely test
17451 for WITH_SIZE_EXPR.
17452 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
17454 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
17456 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
17457 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
17458 the signle definition. The signle definition may not work well
17459 at simplify_subreg_regno().
17460 (HARD_FRAME_POINTER_REGNUM): New.
17461 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
17462 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
17463 csky_initial_elimination_offset, csky_expand_prologue,
17464 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
17466 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
17468 * config/csky/csky.c (csky_option_override):
17469 Init csky_arch_isa_features[] in advance, so TARGET_DSP
17470 and TARGET_DIV can be set well.
17472 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
17474 * config/csky/constraints.md ("l", "h"): Delete.
17475 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
17476 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
17477 * config/csky/csky.c (regno_reg_classm,
17478 csky_secondary_reload, csky_register_move_cost):
17479 Use HILO_REGS instead of LO_REGS and HI_REGS.
17481 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
17483 * config/csky/constraints.md ("W"): New constriant for mem operand
17484 with base reg, index register.
17485 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
17486 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
17488 ("Dv"): New constraint for const double value that can be used at
17490 * config/csky/csky-modes.def (HFmode): New mode.
17491 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
17492 to "csky_valid_mem_constraint_operand" and support new constraint
17494 (csky_get_movedouble_length): New.
17495 (fpuv3_output_move): New.
17496 (fpuv3_const_double): New.
17497 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
17498 (decompose_csky_address): Refine.
17499 (csky_print_operand): New "CONST_DOUBLE" operand.
17500 (csky_output_move): Support fpv3 instructions.
17501 (csky_get_movedouble_length): New.
17502 (fpuv3_output_move): New.
17503 (fpuv3_const_double): New.
17504 (csky_emit_compare): Cover float comparsion.
17505 (csky_emit_compare_float): Refine.
17506 (csky_vaild_fpuv2_mem_operand): Rename to
17507 "csky_valid_mem_constraint_operand" and support new constraint "W".
17508 (ck860_rtx_costs): New.
17509 (csky_rtx_costs): Add the cost calculation of CK860.
17510 (regno_reg_class): New vregs for fpuv3.
17511 (csky_dbx_regno): Likewise.
17512 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
17513 (csky_conditional_register_usage): Suporrot fpuv3.
17514 (csky_dwarf_register_span): Suporrot fpuv3.
17515 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
17516 (ck810_legitimate_index_p): Support fp16.
17517 * config/csky/csky.h (TARGET_TLS): ADD CK860.
17518 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
17519 (TARGET_SINGLE_FPU): Support fpuv3.
17520 (TARGET_SUPPORT_FPV3): New.
17521 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
17522 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
17523 REG_CLASS_CONTENTS): Support fpuv3.
17524 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
17525 (csky_movsf_fpv2): Likewise.
17526 (ck801_movsf): Likewise.
17527 (csky_movsf): Likewise.
17529 (csky_movdf_fpv2): Likewise.
17530 (ck801_movdf): Likewise.
17531 (csky_movdf): Likewise.
17532 (movsicc): Refine. Use "comparison_operatior" instead of
17533 "ordered_comparison_operatior".
17534 (addsicc): Likewise.
17535 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
17536 (call_value_internal_vh): New.
17537 * config/csky/csky_cores.def (CK860): New arch and cpu.
17542 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
17543 into emit-patterns and match-patterns, remain the emit-patterns here,
17544 and move the match-patterns to csky_insn_fpuv2.md or
17545 csky_insn_fpuv3.md.
17546 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
17547 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
17549 * config/csky/csky_isa.def (fcr): New.
17554 (CK860): New definition for ck860.
17555 * config/csky/csky_tables.opt (ck860): New processors ck860,
17556 ck860f. And new arch ck860.
17561 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
17562 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
17563 * config/csky/t-csky-elf: Support 860.
17564 * config/csky/t-csky-linux: Likewise.
17565 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
17567 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
17569 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
17570 add generation of logical-add and add-logical fusion pairs.
17571 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
17573 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
17574 logical-add and add-logical fusion by default.
17575 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
17576 -mpower10-fusion-add-logical options.
17577 * config/rs6000/fusion.md: Regenerate file.
17579 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
17581 * value-range.cc (irange::legacy_equal_p): Check type when
17582 comparing VR_VARYING types.
17583 (range_tests_legacy): Test comparing VARYING ranges of different
17586 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
17588 * config/aarch64/aarch64.c (neoversen1_tunings):
17589 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
17591 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
17593 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
17594 extern weak symbols. Limit symbol offsets for non-GOT symbols with
17597 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
17599 * config/arm/neon.md (vec_load_lanesxi<mode>)
17600 (vec_store_lanexoi<mode>): Move ...
17601 * config/arm/vec-common.md: here.
17603 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
17605 * config/arm/neon.md (vec_load_lanesoi<mode>)
17606 (vec_store_lanesoi<mode>): Move ...
17607 * config/arm/vec-common.md: here.
17609 2021-05-24 liuhongt <hongtao.liu@intel.com>
17612 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
17613 stmt with GIMPLE_NOP when lhs doesn't exist.
17615 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
17618 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
17619 New instruction pattern.
17620 (*push<VI_32:mode>2): Ditto.
17621 (push splitter for SSE registers): New splitter.
17623 2021-05-23 Andrew Pinski <apinski@marvell.com>
17625 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
17627 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
17629 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
17630 * config/rs6000/fusion.md: Regenerate file.
17632 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
17634 * config/rs6000/genfusion.pl (gen_addadd): New function.
17635 * config/rs6000/fusion.md: Regenerate file.
17636 * config/rs6000/rs6000-cpus.def: Add
17637 OPTION_MASK_P10_FUSION_2ADD to masks.
17638 * config/rs6000/rs6000.c (rs6000_option_override_internal):
17639 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
17640 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
17642 2021-05-21 Jakub Jelinek <jakub@redhat.com>
17644 PR middle-end/99928
17645 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
17646 * gimplify.c (enum gimplify_omp_var_data): Fix up
17647 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
17648 (omp_lastprivate_for_combined_outer_constructs): If combined target
17649 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
17650 GOVD_MAP | GOVD_SEEN.
17651 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
17652 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
17653 (gimplify_adjust_omp_clauses): For firstprivate clauses with
17654 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
17655 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
17656 let it be replaced by implicit map clause.
17658 2021-05-21 Jakub Jelinek <jakub@redhat.com>
17660 PR middle-end/99928
17661 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
17663 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
17664 (gimplify_omp_for): Likewise.
17666 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17668 PR middle-end/90115
17669 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
17670 'external' in blocks.
17672 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17674 PR middle-end/90115
17675 * flag-types.h (enum openacc_privatization): New.
17676 * params.opt (-param=openacc-privatization): New.
17677 * doc/invoke.texi (openacc-privatization): Document it.
17678 * omp-general.h (get_openacc_privatization_dump_flags): New
17680 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
17681 * omp-offload.c (execute_oacc_device_lower)
17682 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
17683 * target.def (goacc.adjust_private_decl): Add 'location_t'
17685 * doc/tm.texi: Regenerate.
17686 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
17687 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
17688 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
17689 Likewise. Preserve it for...
17690 (nvptx_goacc_expand_var_decl): ... use here.
17692 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17694 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
17697 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17699 PR middle-end/90115
17700 * omp-low.c (oacc_privatization_candidate_p): New function.
17701 (oacc_privatization_scan_clause_chain)
17702 (oacc_privatization_scan_decl_chain): Use it. Also
17703 'gcc_checking_assert' that we're not seeing duplicates.
17705 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17707 PR middle-end/90115
17708 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
17711 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17713 PR middle-end/90115
17714 * omp-offload.c (execute_oacc_device_lower): Explain.
17716 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17718 PR middle-end/90115
17719 * omp-offload.c (execute_oacc_device_lower)
17720 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
17722 * internal-fn.c (expand_UNIQUE): Don't expect
17723 'IFN_UNIQUE_OACC_PRIVATE'.
17725 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17727 PR middle-end/90115
17728 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
17730 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17732 PR middle-end/90115
17733 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
17734 (nvptx_goacc_expand_var_decl): Tighten.
17736 2021-05-21 Julian Brown <julian@codesourcery.com>
17737 Chung-Lin Tang <cltang@codesourcery.com>
17738 Thomas Schwinge <thomas@codesourcery.com>
17740 PR middle-end/90115
17741 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
17742 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
17743 * doc/tm.texi: Regenerate.
17744 * expr.c (expand_expr_real_1): Expand decls using the
17745 expand_var_decl OpenACC hook if defined.
17746 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
17747 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
17748 * omp-low.c (omp_context): Add oacc_privatization_candidates
17750 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
17752 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
17753 private marker's gimple call arguments, and pass it to
17754 lower_oacc_reductions.
17755 (oacc_privatization_scan_clause_chain)
17756 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
17758 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
17759 * omp-offload.c (convert.h): Include.
17760 (oacc_loop_xform_head_tail): Treat private-variable markers like
17761 fork/join when transforming head/tail sequences.
17762 (struct var_decl_rewrite_info): Add struct.
17763 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
17764 (execute_oacc_device_lower): Support rewriting gang-private
17765 variables using target hook, and fix up addr_expr and var_decl
17767 * target.def (adjust_private_decl, expand_var_decl): New hooks.
17768 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
17770 (gcn_goacc_adjust_private_decl): ...this.
17771 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
17773 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
17774 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
17775 definition using gcn_goacc_adjust_gangprivate_decl...
17776 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
17777 gcn_goacc_adjust_private_decl.
17778 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
17779 (gang_private_shared_size): New global variable.
17780 (gang_private_shared_align): Likewise.
17781 (gang_private_shared_sym): Likewise.
17782 (gang_private_shared_hmap): Likewise.
17783 (nvptx_option_override): Initialize these.
17784 (nvptx_file_end): Output gang_private_shared_sym.
17785 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
17787 (nvptx_set_current_function): Clear gang_private_shared_hmap.
17788 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
17789 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
17791 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
17793 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
17795 2021-05-21 Richard Biener <rguenther@suse.de>
17796 H.J. Lu <hjl.tools@gmail.com>
17798 PR middle-end/90773
17799 * expr.c (expand_constructor): Elide expand_constructor if
17800 move by pieces is preferred.
17802 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
17804 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
17805 Take a flag and mode value as arguments.
17806 (aarch64_modifies_global_state_p): Likewise.
17807 (aarch64_reads_global_state_p): Likewise.
17808 (aarch64_could_trap_p): Likewise.
17809 (aarch64_get_attributes): Likewise.
17810 (aarch64_init_simd_builtins): Adjust callsite of above.
17811 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
17812 function attributes to apply to builtins.
17813 (aarch64_init_crc32_builtins): Likewise.
17814 (aarch64_init_builtin_rsqrt): Likewise.
17816 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
17818 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
17819 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
17820 (gen_2logical): Use new fusion types.
17821 * config/rs6000/fusion.md: Regenerate.
17823 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
17826 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
17827 Handle V4QI and V2HI modes.
17828 (ix86_expand_sse_movcc): Ditto.
17829 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
17830 New instruction pattern.
17831 (*eq<VI_32:mode>3): Ditto.
17832 (*gt<VI_32:mode>3): Ditto.
17833 (*xop_pcmov_<VI_32:mode>): Ditto.
17834 (mmx_pblendvb32): Ditto.
17835 (mmx_pblendvb64): Rename from mmx_pblendvb.
17836 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
17837 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
17838 (vcond<VI_32:mode><VI_32:mode>): Ditto.
17839 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
17840 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
17842 2021-05-21 Jakub Jelinek <jakub@redhat.com>
17844 PR tree-optimization/94589
17845 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
17846 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
17847 not the same as cmp3) treat <= the same as < and >= the same as >.
17848 Don't require that cond2_phi_edge is true edge, instead take
17849 false/true edges into account based on cmp1/cmp2 comparison kinds.
17851 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
17854 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
17855 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
17856 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
17857 using SMAXMIN_MMXMODEI mode iterator.
17858 (*<smaxmin:code>v4qi3): New insn pattern.
17859 (*<smaxmin:code>v2hi3): Ditto.
17860 (SMAXMIN_VI_32): New mode iterator.
17861 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
17862 (UMAXMIN_MMXMODEI): New mode iterator.
17863 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
17864 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
17865 using UMAXMIN_MMXMODEI mode iterator.
17866 (*<umaxmin:code>v4qi3): New insn pattern.
17867 (*<umaxmin:code>v2hi3): Ditto.
17868 (UMAXMIN_VI_32): New mode iterator.
17869 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
17870 (abs<VI_32:mode>2): New insn pattern.
17871 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
17872 * config/i386/sse.md: ... here.
17874 2021-05-20 Clement Chigot <clement.chigot@atos.net>
17875 David Edelsohn <dje.gcc@gmail.com>
17877 * collect2.c (scan_prog_file): Issue non-fatal warning for
17880 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
17882 * doc/invoke.texi (-Wno-c++11-extensions)
17883 (-Wno-c++14-extensions, -Wno-c++17-extensions)
17884 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
17887 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
17889 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
17890 * config/darwin.c (darwin_override_options): Likewise.
17891 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
17892 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
17893 (DWARF2_FRAME_REG_OUT): Likewise.
17894 * config/mips/mips.c (mips_output_filename): Likewise.
17895 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
17897 (rs6000_dbx_register_number): Likewise.
17898 * dbxout.c: Include flags.h.
17899 * dwarf2cfi.c (cfi_label_required_p): Likewise.
17900 (dwarf2out_do_frame): Likewise.
17901 * except.c: Include flags.h.
17902 * final.c (dwarf2_debug_info_emitted_p): Likewise.
17903 (final_scan_insn_1): Likewise.
17904 * flags.h (dwarf_debuginfo_p): New function declaration.
17905 * opts.c (dwarf_debuginfo_p): New function definition.
17906 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
17907 * toplev.c (process_options): Likewise.
17909 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
17911 * common.opt: Change type to support bitmasks.
17912 * flag-types.h (enum debug_info_type): Rename enumerator constants.
17913 (NO_DEBUG): New bitmask.
17914 (DBX_DEBUG): Likewise.
17915 (DWARF2_DEBUG): Likewise.
17916 (XCOFF_DEBUG): Likewise.
17917 (VMS_DEBUG): Likewise.
17918 (VMS_AND_DWARF2_DEBUG): Likewise.
17919 * flags.h (debug_set_to_format): New function declaration.
17920 (debug_set_count): Likewise.
17921 (debug_set_names): Likewise.
17922 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
17923 (debug_set_to_format): New function definition.
17924 (debug_set_count): Likewise.
17925 (debug_set_names): Likewise.
17926 (set_debug_level): Update access to debug_type_names.
17927 * toplev.c: Likewise.
17929 2021-05-20 Martin Sebor <msebor@redhat.com>
17931 PR middle-end/100684
17932 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
17934 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
17937 * config/i386/i386.md (isa): Remove x64_bmi.
17938 (enabled): Remove x64_bmi.
17939 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
17940 Remove general register alternative.
17941 (*andnot<VI_32:mode>3): Ditto.
17942 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
17943 (*<any_logic:code><VI_32:mode>3): Ditto.
17945 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
17947 * config/arm/arm.c: Include head files tree-vectorizer.h and
17950 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
17953 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
17954 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
17955 (<s>mulv4hi3_highpart): New expander.
17956 (*<s>mulv2hi3_highpart): New insn pattern.
17957 (<s>mulv2hi3_higpart): New expander.
17958 (*<any_shift:insn>v2hi3): New insn pattern.
17959 (<any_shift:insn>v2hi3): New expander.
17960 * config/i386/sse.md (smulhrsv2hi3): New expander.
17961 (*smulhrsv2hi3): New insn pattern.
17963 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
17965 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
17967 * params.opt (vect-inner-loop-cost-factor): New.
17968 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
17969 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
17970 tree-vectorizer.h and its required ones.
17971 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
17972 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
17973 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
17974 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
17975 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
17976 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
17978 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
17979 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
17980 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
17982 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
17983 Torbjörn Svensson <torbjorn.svensson@st.com>
17986 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
17988 2021-05-20 Jakub Jelinek <jakub@redhat.com>
17990 PR middle-end/99928
17991 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
17992 explicit linear clause when combined with target, make it map(tofrom:)
17993 instead of no clause or firstprivate.
17995 2021-05-20 Jakub Jelinek <jakub@redhat.com>
17997 PR tree-optimization/94589
17998 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
17999 of integral conversions.
18001 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
18003 * gimple-range.cc (fur_source::get_operand): New.
18004 (gimple_range_fold): Delete.
18005 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
18006 (fold_using_range::range_of_range_op): Move from gimple_ranger.
18007 (fold_using_range::range_of_address): Ditto.
18008 (fold_using_range::range_of_phi): Ditto.
18009 (fold_using_range::range_of_call): Ditto.
18010 (fold_using_range::range_of_builtin_ubsan_call): Move from
18011 range_of_builtin_ubsan_call.
18012 (fold_using_range::range_of_builtin_call): Move from
18013 range_of_builtin_call.
18014 (gimple_ranger::range_of_builtin_call): Delete.
18015 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
18016 (gimple_ranger::fold_range_internal): New.
18017 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
18018 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
18019 gimple_ranger. Improve ranges of SSA_NAMES when possible.
18020 * gimple-range.h (gimple_ranger): Remove various range_of routines.
18021 (class fur_source): New.
18022 (class fold_using_range): New.
18023 (fur_source::fur_source): New.
18025 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
18026 instead of range_of_builtin_call.
18028 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
18030 * doc/cpp.texi (Common Predefined Macros): Update documentation
18031 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
18033 2021-05-19 Alex Coplan <alex.coplan@arm.com>
18036 * config/arm/arm.md (nonsecure_call_internal): Always ensure
18037 callee's address is in a register.
18039 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
18041 * common/config/riscv/riscv-common.c
18042 (riscv_subset_list::parsing_subset_version): Properly parse the letter
18044 (riscv_subset_list::parse_std_ext,
18045 riscv_subset_list::parse_multiletter_ext): To handle errors generated
18046 in riscv_subset_list::parsing_subset_version.
18048 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18050 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
18051 type attribute in patterns generating XTN(2).
18053 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18055 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
18056 Remove as duplicate of...
18057 (aarch64_xtn<mode>): This.
18058 (aarch64_xtn2<mode>_le): Move position in file.
18059 (aarch64_xtn2<mode>_be): Move position in file.
18060 (aarch64_xtn2<mode>): Move position in file.
18061 (vec_pack_trunc_<mode>): Define as an expander.
18063 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18065 * config/aarch64/aarch64-simd-builtins.def: Split builtin
18066 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
18067 separate scalar and vector generators.
18068 * config/aarch64/aarch64-simd.md
18069 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
18071 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
18072 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
18073 * config/aarch64/iterators.md: Define SD_HSDI iterator.
18075 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
18077 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
18079 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
18081 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18083 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
18084 Implement as an expand emitting a big/little endian
18085 instruction pattern.
18086 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
18087 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
18089 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18091 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
18092 Implement as an expand emitting a big/little endian
18093 instruction pattern.
18094 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
18095 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
18096 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
18097 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
18099 2021-05-19 Richard Biener <rguenther@suse.de>
18101 PR middle-end/100672
18102 * fold-const.c (fold_negate_expr_1): Use element_precision.
18103 (negate_expr_p): Likewise.
18105 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
18107 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
18108 (pred_load): New int attribute.
18109 * config/aarch64/aarch64-sve.md
18110 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
18111 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
18112 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
18113 code_for_aarch64_load.
18115 2021-05-19 Richard Biener <rguenther@suse.de>
18117 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
18118 sure TARGET_MEM_REF bases are expanded as memory.
18119 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
18120 Do not mark TARGET_MEM_REF bases addressable.
18121 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
18122 TARGET_MEM_REF bases as never rewritable.
18123 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
18124 walk TARGET_MEM_REF bases as address-takens.
18125 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
18127 2021-05-19 Richard Biener <rguenther@suse.de>
18129 * builtins.c (get_object_alignment_1): Strip outer
18131 * tree-dfa.c (get_ref_base_and_extent): Handle outer
18132 WITH_SIZE_EXPR for size processing and process the
18134 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
18135 outer WITH_SIZE_EXPR.
18136 (ao_ref_base_alias_ptr_type): Likewise.
18137 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
18138 and handle that accordingly, stripping it for the
18139 core alias workers.
18140 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
18141 looking through it instead of returning NULL.
18143 2021-05-19 Jakub Jelinek <jakub@redhat.com>
18145 PR middle-end/100576
18146 * builtins.c (check_read_access): Convert bound to size_type_node if
18149 2021-05-19 Richard Biener <rguenther@suse.de>
18151 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
18152 (verify_types_in_gimple_reference): ... here. Sanitize.
18153 (verify_gimple_call): Verify references in LHS and arguments.
18154 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
18156 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
18158 * config/i386/i386.h (VALID_INT_MODE_P):
18159 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
18160 * config/i386/i386.md (isa): Add x64_bmi.
18161 (enabled): Handle x64_bmi.
18162 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
18163 Add alternative using 64bit general registers.
18164 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
18166 2021-05-19 Jakub Jelinek <jakub@redhat.com>
18168 PR middle-end/99928
18169 * tree.h (OMP_MASTER_COMBINED): Define.
18170 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
18171 handling for outer combined/composite constructs to a loop.
18172 Handle lastprivate on combined target.
18173 (gimplify_expr): Formatting fix.
18175 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
18177 * passes.def: Add sink_code pass before store_merging.
18178 * tree-ssa-sink.c (pass_sink_code:clone): New.
18180 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
18182 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
18183 rs6000_special_adjust_field_align_p.
18184 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
18185 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
18186 Remove ABI warning.
18187 (rs6000_function_arg): Likewise.
18188 * config/rs6000/rs6000-protos.h
18189 (rs6000_special_adjust_field_align_p): Remove prototype.
18190 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
18192 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
18193 rs6000_special_adjust_field_align_p.
18195 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
18198 * config/i386/i386.h (VALID_SSE2_REG_MODE):
18199 Add V4QI and V2HI modes.
18200 (VALID_INT_MODE_P): Ditto.
18201 * config/i386/mmx.md (VI_32): New mode iterator.
18202 (mmxvecsize): Handle V4QI and V2HI.
18204 (mov<VI_32:mode>): New expander.
18205 (*mov<mode>_internal): New insn pattern.
18206 (movmisalign<VI_32:mode>): New expander.
18207 (neg<VI_32:mode>): New expander.
18208 (<plusminus:insn><VI_32:mode>3): New expander.
18209 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
18210 (mulv2hi3): New expander.
18211 (*mulv2hi3): New insn pattern.
18212 (one_cmpl<VI_32:mode>2): New expander.
18213 (*andnot<VI_32:mode>3): New insn pattern.
18214 (<any_logic:code><VI_32:mode>3): New expander.
18215 (*<any_logic:code><VI_32:mode>3): New insn pattern.
18217 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
18219 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
18220 Fix a mode mismatch with operand 1.
18222 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
18225 * config/i386/i386-expand.c (split_double_mode): Return
18226 temporary register when simplify_gen_subreg fails with
18227 the high half od the paradoxical subreg.
18229 2021-05-18 Richard Biener <rguenther@suse.de>
18231 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
18232 and honor it when expanding.
18233 (expand_used_vars_for_block): Pass through forced_stack_var.
18234 (expand_used_vars): Likewise.
18235 (discover_nonconstant_array_refs_r): Set bits in
18236 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
18237 (avoid_type_punning_on_regs): Likewise.
18238 (discover_nonconstant_array_refs): Likewise.
18239 (pass_expand::execute): Create and pass down forced_stack_var
18240 bitmap. For parameters and returns temporarily set
18241 TREE_ADDRESSABLE when expand_function_start.
18243 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
18245 * doc/sourcebuild.texi: Document 'dg-note'.
18247 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
18250 * configure: Regenerate.
18251 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
18253 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
18255 * gimple.h (is_gimple_omp_oacc): Tighten.
18256 * omp-low.c (check_omp_nesting_restrictions): Adjust.
18258 2021-05-18 Richard Biener <rguenther@suse.de>
18260 * tree-ssa-operands.c (mark_address_taken): Simplify.
18262 2021-05-18 Martin Liska <mliska@suse.cz>
18264 * config/gcn/mkoffload.c (STR): Redefine.
18265 * config/i386/intelmic-mkoffload.c (STR): Likewise.
18266 * config/nvptx/mkoffload.c (STR): Likewise.
18268 2021-05-18 Martin Liska <mliska@suse.cz>
18270 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
18271 Use startswith function instead of strncmp.
18272 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
18273 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
18274 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
18275 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
18276 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
18277 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
18278 * config/arm/arm.c (arm_file_start): Likewise.
18279 (arm_valid_target_attribute_rec): Likewise.
18280 (thumb1_md_asm_adjust): Likewise.
18281 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
18282 * config/avr/avr.c (STR_PREFIX_P): Likewise.
18283 (avr_set_current_function): Likewise.
18284 (avr_handle_addr_attribute): Likewise.
18285 (avr_asm_output_aligned_decl_common): Likewise.
18286 (avr_asm_named_section): Likewise.
18287 (avr_section_type_flags): Likewise.
18288 (avr_asm_select_section): Likewise.
18289 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
18290 (c6x_section_type_flags): Likewise.
18291 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
18292 (darwin_objc_declare_unresolved_class_reference): Likewise.
18293 (darwin_objc_declare_class_definition): Likewise.
18294 * config/darwin.c (indirect_data): Likewise.
18295 (darwin_encode_section_info): Likewise.
18296 (darwin_objc2_section): Likewise.
18297 (darwin_objc1_section): Likewise.
18298 (machopic_select_section): Likewise.
18299 (darwin_globalize_label): Likewise.
18300 (darwin_label_is_anonymous_local_objc_name): Likewise.
18301 (darwin_asm_named_section): Likewise.
18302 (darwin_asm_output_dwarf_offset): Likewise.
18303 * config/frv/frv.c (frv_string_begins_with): Likewise.
18304 (frv_in_small_data_p): Likewise.
18305 * config/gcn/mkoffload.c (STR): Likewise.
18307 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
18308 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
18309 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
18310 (ix86_md_asm_adjust): Likewise.
18311 * config/i386/intelmic-mkoffload.c (STR): Likewise.
18312 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
18313 (i386_pe_file_end): Likewise.
18314 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
18315 (ia64_section_type_flags): Likewise.
18316 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
18317 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
18318 (mips16_stub_function_p): Likewise.
18319 (mips_function_rodata_section): Likewise.
18320 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
18321 (msp430_function_section): Likewise.
18322 (msp430_section_type_flags): Likewise.
18323 (msp430_expand_helper): Likewise.
18324 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
18325 (nios2_valid_target_attribute_rec): Likewise.
18326 * config/nvptx/mkoffload.c (process): Likewise.
18328 * config/pa/som.h: Likewise.
18329 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
18330 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
18331 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
18332 (rs6000_inner_target_options): Likewise.
18333 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
18334 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
18335 * config/vax/vax.c (vax_output_int_move): Likewise.
18336 * config/vms/vms-ld.c (startswith): Likewise.
18337 (process_args): Likewise.
18339 * config/vms/vms.c: Likewise.
18341 2021-05-18 Jakub Jelinek <jakub@redhat.com>
18343 PR rtl-optimization/100590
18344 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
18345 they are NONJUMP_INSN_P.
18347 2021-05-18 Jakub Jelinek <jakub@redhat.com>
18350 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
18351 DECL_ASSEMBLER_NAME on the fn_decl.
18353 2021-05-18 Jakub Jelinek <jakub@redhat.com>
18355 PR tree-optimization/94589
18356 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
18357 phi result used in (res & ~1) == 0 comparison as res >= 0 as
18358 res == 2 would be UB with -ffinite-math-only.
18360 2021-05-18 Martin Liska <mliska@suse.cz>
18362 * Makefile.in: genversion.o should depend on DATESTAMP.
18364 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
18366 * config/arc/simdext.md (negv2si2): Remove round bracket.
18368 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
18370 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
18371 _Bool as macro expanding to _Bool.
18373 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
18376 * tree.c (build_reference_type_for_mode)
18377 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
18379 (build_reference_type, build_pointer_type): Invoke
18380 build_*_type_for_mode with VOIDmode.
18382 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
18384 PR tree-optimization/100512
18385 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
18386 and non-zero pointer ranges as invariant.
18387 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
18388 processing from here.
18390 2021-05-17 Tom de Vries <tdevries@suse.de>
18393 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
18394 * config/nvptx/nvptx.c (nvptx_output_barrier)
18395 (nvptx_output_atomic_insn): New function.
18396 (nvptx_print_operand): Add support for 'B'.
18397 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
18400 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
18402 PR tree-optimization/100349
18403 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
18406 2021-05-17 Tamar Christina <tamar.christina@arm.com>
18408 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
18409 (host_detect_local_cpu): Use it.
18411 2021-05-17 Martin Liska <mliska@suse.cz>
18413 * doc/invoke.texi: Add 2 missing dots.
18415 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
18417 PR bootstrap/100552
18418 * configure.ac: Replace pattern substitution with call to sed.
18419 * configure: Regenerate.
18421 2021-05-17 Richard Biener <rguenther@suse.de>
18423 PR middle-end/100582
18424 * tree.c (array_at_struct_end_p): Get to the base of the
18425 reference before looking for the underlying decl.
18427 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
18429 * genoutput.c (validate_insn_alternatives) Make "wrong number of
18430 alternatives" message more specific, and remove assumption on where
18433 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
18435 * config/arm/iterators.md (V16): New iterator.
18436 (VH_cvtto): New iterator.
18437 (v_cmp_result): Added V4HF and V8HF support.
18438 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
18439 (vcond<mode><mode>): Likewise.
18440 (vcond_mask_<mode><v_cmp_result>): Likewise.
18441 (vcond<VH_cvtto><mode>): New expander.
18443 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
18445 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
18447 * config/arm/arm.c (arm_expand_vector_compare): Add support for
18449 (arm_expand_vcond): Likewise.
18450 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
18451 VCMPEQQ_N_S, VCMPNEQ_N_S.
18452 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
18453 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
18454 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
18455 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
18456 (@mve_vpselq_<supf><mode>): Likewise.
18457 (@mve_vpselq_f<mode>"): Likewise.
18458 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
18459 and move to vec-common.md.
18460 (vec_cmpu<mode><mode>): Likewise.
18461 (vcond<mode><mode>): Likewise.
18462 (vcond<V_cvtto><mode>): Likewise.
18463 (vcondu<mode><v_cmp_result>): Likewise.
18464 (vcond_mask_<mode><v_cmp_result>): Likewise.
18465 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
18466 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
18467 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
18468 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
18469 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
18470 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
18472 (vec_cmpu<mode><mode>): Likewise.
18473 (vcond<mode><mode>): Likewise.
18474 (vcond<V_cvtto><mode>): Likewise.
18475 (vcondu<mode><v_cmp_result>): Likewise.
18476 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
18479 2021-05-17 liuhongt <hongtao.liu@intel.com>
18482 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
18483 gsi_insert_seq_before instead.
18485 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
18487 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
18488 (arm_sat_ok): ...this.
18490 2021-05-17 Martin Liska <mliska@suse.cz>
18492 * lto-wrapper.c (merge_flto_options): Factor out a new function.
18493 (merge_and_complain): Use it.
18494 (run_gcc): Merge also linker command line -flto=foo argument
18497 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
18499 * config/arm/arm.h (CPP_SPEC): Remove error message about
18500 -mlittle-endian/-mbig-endian conflict.
18502 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
18504 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
18505 __ROP_PROTECT__ if -mrop-protect is selected.
18507 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
18509 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
18510 rop_hash_save_offset and rop_hash_size.
18511 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
18512 rop_hash_size and rop_hash_save_offset.
18513 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
18514 (rs6000_emit_prologue): Emit hashst[p] in prologue.
18515 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
18516 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
18518 (hashst): New define_insn.
18519 (hashchk): Likewise.
18521 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
18523 * config/rs6000/rs6000.c (rs6000_option_override_internal):
18524 Disable shrink wrap when inserting ROP-protect instructions.
18525 * config/rs6000/rs6000.opt (mrop-protect): New option.
18526 (mprivileged): Likewise.
18527 * doc/invoke.texi: Document mrop-protect and mprivileged.
18529 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
18531 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
18534 2021-05-15 Martin Jambor <mjambor@suse.cz>
18537 2021-05-13 Martin Jambor <mjambor@suse.cz>
18539 PR tree-optimization/100453
18540 * tree-sra.c (sra_modify_assign): All const base accesses do not
18541 need refreshing, not just those from decl_pool.
18542 (sra_modify_assign): Do not refresh into a const base decl.
18544 2021-05-15 Jakub Jelinek <jakub@redhat.com>
18546 PR rtl-optimization/100342
18547 * regcprop.c (copy_value): When copying a source reg in a wider
18548 mode than it has recorded for the value, adjust recorded destination
18549 mode too or punt if !REG_CAN_CHANGE_MODE_P.
18551 2021-05-14 Jason Merrill <jason@redhat.com>
18553 * intl.h: Add comments.
18555 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
18557 * config/aarch64/aarch64-simd.md
18558 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
18559 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
18560 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
18561 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
18562 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
18563 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
18564 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
18565 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
18566 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
18568 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
18571 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
18572 boolean logic equivalent.
18573 (vtst_s16): Likewise.
18574 (vtst_s32): Likewise.
18575 (vtst_u8): Likewise.
18576 (vtst_u16): Likewise.
18577 (vtst_u32): Likewise.
18578 (vtst_p8): Likewise.
18579 (vtst_p16): Likewise.
18580 (vtstq_s8): Likewise.
18581 (vtstq_s16): Likewise.
18582 (vtstq_s32): Likewise.
18583 (vtstq_u8): Likewise.
18584 (vtstq_u16): Likewise.
18585 (vtstq_u32): Likewise.
18586 (vtstq_p8): Likewise.
18587 (vtstq_p16): Likewise.
18588 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
18589 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
18591 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
18593 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
18594 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
18595 (aarch64_sqdmlsl2<mode>): Delete.
18596 (aarch64_sqdmlal2_lane<mode>): Merge this...
18597 (aarch64_sqdmlsl2_lane<mode>): ... And this...
18598 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
18599 (aarch64_sqdmlal2_laneq<mode>): Merge this...
18600 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
18601 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
18602 (aarch64_sqdmlal2_n<mode>): Merge this...
18603 (aarch64_sqdmlsl2_n<mode>): ... And this...
18604 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
18606 2021-05-13 Martin Sebor <msebor@redhat.com>
18608 PR middle-end/100574
18609 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
18610 all null arguments.
18612 2021-05-13 Martin Sebor <msebor@redhat.com>
18614 PR tree-optimization/93100
18615 PR middle-end/98583
18616 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
18617 don't modify referenced objects.
18619 2021-05-13 Martin Jambor <mjambor@suse.cz>
18621 PR tree-optimization/100453
18622 * tree-sra.c (sra_modify_assign): All const base accesses do not
18623 need refreshing, not just those from decl_pool.
18624 (sra_modify_assign): Do not refresh into a const base decl.
18626 2021-05-13 Martin Liska <mliska@suse.cz>
18628 * tree-ssa-dom.c: Remove m_simplifier.
18630 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
18633 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
18634 canonicalize DImode inequality comparisons against the
18635 maximum integral value.
18637 2021-05-13 Jakub Jelinek <jakub@redhat.com>
18639 PR tree-optimization/98856
18640 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
18641 Expect V2DI and V4DI arithmetic right shifts to be emulated.
18642 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
18644 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
18645 expand_vec_perm_2perm_pblendv): New functions.
18646 (ix86_expand_vec_perm_const_1): Use them.
18647 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
18648 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
18649 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
18650 (ashrv4di3): New define_expand.
18651 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
18652 and !TARGET_AVX512VL expansion.
18654 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
18657 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
18658 sizes < 16 to a register when constructing vpcmov pattern.
18659 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
18661 2021-05-13 Martin Liska <mliska@suse.cz>
18663 * gcov-io.c (gcov_write_block): Remove.
18664 (gcov_write_words): Likewise.
18665 (gcov_read_words): Re-implement using gcov_read_bytes.
18666 (gcov_allocate): Remove.
18667 (GCOV_BLOCK_SIZE): Likewise.
18668 (struct gcov_var): Remove most of the fields.
18669 (gcov_position): Implement with ftell.
18670 (gcov_rewrite): Remove setting of start and offset fields.
18671 (from_file): Re-format.
18672 (gcov_open): Remove setbuf call. It should not be needed.
18673 (gcov_close): Remove internal buffer handling.
18674 (gcov_magic): Use __builtin_bswap32.
18675 (gcov_write_counter): Use directly gcov_write_unsigned.
18676 (gcov_write_string): Use direct fwrite and do not round
18678 (gcov_seek): Use directly fseek.
18679 (gcov_write_tag): Use gcov_write_unsigned directly.
18680 (gcov_write_length): Likewise.
18681 (gcov_write_tag_length): Likewise.
18682 (gcov_read_bytes): Use directly fread.
18683 (gcov_read_unsigned): Use gcov_read_words.
18684 (gcov_read_counter): Likewise.
18685 (gcov_read_string): Use gcov_read_bytes.
18686 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
18687 that size is not in bytes, but words (4B).
18688 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
18689 (GCOV_TAG_ARCS_LENGTH): Likewise.
18690 (GCOV_TAG_ARCS_NUM): Likewise.
18691 (GCOV_TAG_COUNTER_LENGTH): Likewise.
18692 (GCOV_TAG_COUNTER_NUM): Likewise.
18693 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
18695 2021-05-13 liuhongt <hongtao.liu@intel.com>
18698 * config/i386/sse.md (ssedoublevecmode): Add attribute for
18699 V64QI/V32HI/V16SI/V4DI.
18700 (ssehalfvecmode): Add attribute for V2DI/V2DF.
18701 (*vec_concatv4si_0): Extend to VI124_128.
18702 (*vec_concat<mode>_0): New pre-reload splitter.
18703 * config/i386/predicates.md (movq_parallel): New predicate.
18705 2021-05-13 Alexandre Oliva <oliva@adacore.com>
18707 * targhooks.c (default_zero_call_used_regs): Retry using
18708 successfully-zeroed registers as sources.
18710 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
18712 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
18714 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
18717 * gimple-range.cc (range_of_builtin_call): Skip out on
18718 processing __builtin_clz when varying.
18720 2021-05-12 Tom de Vries <tdevries@suse.de>
18723 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
18724 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
18725 to ptx_version_option.
18726 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
18727 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
18728 (define_insn "nvptx_vote_ballot"): Use sync variant for
18730 * config/nvptx/nvptx.opt (ptx_version): Add enum.
18731 (mptx): Add option.
18732 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
18734 2021-05-12 Richard Biener <rguenther@suse.de>
18736 PR tree-optimization/100566
18737 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
18738 allow_back for all edge queries.
18740 2021-05-12 liuhongt <hongtao.liu@intel.com>
18743 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
18744 splitters for pblendvb of NOT mask register.
18746 2021-05-12 Richard Biener <rguenther@suse.de>
18748 PR tree-optimization/100519
18749 * tree-ssa-reassoc.c (can_associate_p): Split into...
18750 (can_associate_op_p): ... this
18751 (can_associate_type_p): ... and this.
18752 (is_reassociable_op): Call can_associate_op_p.
18753 (break_up_subtract_bb): Call the appropriate predicates.
18754 (reassociate_bb): Likewise.
18756 2021-05-12 Martin Liska <mliska@suse.cz>
18758 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
18759 (run_gcc): Use -flto argument detection for merged
18762 2021-05-12 Martin Liska <mliska@suse.cz>
18764 * lto-wrapper.c (print_lto_docs_link): New function.
18765 (run_gcc): Print warning about missing job server detection
18766 after we know NR of partitions. Do the same for -flto{,=1}.
18767 * opts.c (get_option_html_page): Support -flto option.
18769 2021-05-12 Martin Liska <mliska@suse.cz>
18771 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
18773 (append_option): Remove.
18774 (find_option): Rework to use the vector type.
18775 (remove_option): Remove.
18776 (merge_and_complain): Use vectors for cl_decoded_option data
18778 (append_compiler_options): Likewise.
18779 (append_diag_options): Likewise.
18780 (append_linker_options): Likewise.
18781 (append_offload_options): Likewise.
18782 (compile_offload_image): Likewise.
18783 (compile_images_for_offload_targets): Likewise.
18784 (find_and_merge_options): Likewise.
18785 (run_gcc): Likewise.
18787 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
18790 * dwarf2out.c (dwarf2out_finish): Set
18791 have_multiple_function_sections with multi-range text_section.
18793 2021-05-12 Martin Liska <mliska@suse.cz>
18795 PR bootstrap/100560
18796 * Makefile.in: Remove version.h from linker command line.
18798 2021-05-12 Richard Biener <rguenther@suse.de>
18800 PR middle-end/100547
18801 * rtl.h (rtvec_alloc): Make argument size_t.
18802 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
18804 2021-05-12 Jakub Jelinek <jakub@redhat.com>
18806 PR middle-end/100508
18807 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
18808 type, don't reuse DECL_RTL if it has different mode, instead force
18809 creation of a new DEBUG_EXPR.
18811 2021-05-12 Jakub Jelinek <jakub@redhat.com>
18812 Marc Glisse <marc.glisse@inria.fr>
18814 PR tree-optimization/94589
18815 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
18816 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
18818 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
18821 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
18822 * config/i386/mmx.md (MMXMODE124): New mode iterator.
18824 (mmxintvecmode): New mode attribute.
18825 (mmxintvecmodelower): Ditto.
18826 (*mmx_maskcmpv2sf3_comm): New insn pattern.
18827 (*mmx_maskcmpv2sf3): Ditto.
18828 (vec_cmpv2sfv2si): New expander.
18829 (vcond<V2FI:mode>v2si): Ditto.
18830 (mmx_vlendvps): New insn pattern.
18831 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
18832 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
18833 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
18835 2021-05-11 Martin Sebor <msebor@redhat.com>
18837 PR middle-end/21433
18838 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
18840 2021-05-11 Richard Biener <rguenther@suse.de>
18842 * gimple-fold.c (gimple_fold_call): Do not call
18843 maybe_fold_reference on call arguments or the static chain.
18844 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
18847 2021-05-11 Martin Liska <mliska@suse.cz>
18849 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
18850 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
18851 (DEF_HSAIL_SAT_BUILTIN): Likewise.
18852 (DEF_HSAIL_INTR_BUILTIN): Likewise.
18853 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
18854 * doc/frontends.texi: Remove BRIG.
18855 * doc/install.texi: Likewise.
18856 * doc/invoke.texi: Likewise.
18857 * doc/standards.texi: Likewise.
18858 * brig-builtins.def: Removed.
18859 * brig/ChangeLog: Removed.
18860 * brig/Make-lang.in: Removed.
18861 * brig/brig-builtins.h: Removed.
18862 * brig/brig-c.h: Removed.
18863 * brig/brig-lang.c: Removed.
18864 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
18865 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
18866 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
18867 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
18868 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
18869 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
18870 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
18871 * brig/brigfrontend/brig-comment-handler.cc: Removed.
18872 * brig/brigfrontend/brig-control-handler.cc: Removed.
18873 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
18874 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
18875 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
18876 * brig/brigfrontend/brig-function-handler.cc: Removed.
18877 * brig/brigfrontend/brig-function.cc: Removed.
18878 * brig/brigfrontend/brig-function.h: Removed.
18879 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
18880 * brig/brigfrontend/brig-label-handler.cc: Removed.
18881 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
18882 * brig/brigfrontend/brig-machine.c: Removed.
18883 * brig/brigfrontend/brig-machine.h: Removed.
18884 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
18885 * brig/brigfrontend/brig-module-handler.cc: Removed.
18886 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
18887 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
18888 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
18889 * brig/brigfrontend/brig-to-generic.cc: Removed.
18890 * brig/brigfrontend/brig-to-generic.h: Removed.
18891 * brig/brigfrontend/brig-util.cc: Removed.
18892 * brig/brigfrontend/brig-util.h: Removed.
18893 * brig/brigfrontend/brig-variable-handler.cc: Removed.
18894 * brig/brigfrontend/hsa-brig-format.h: Removed.
18895 * brig/brigfrontend/phsa.h: Removed.
18896 * brig/brigspec.c: Removed.
18897 * brig/config-lang.in: Removed.
18898 * brig/gccbrig.texi: Removed.
18899 * brig/lang-specs.h: Removed.
18900 * brig/lang.opt: Removed.
18902 2021-05-11 Richard Biener <rguenther@suse.de>
18905 * ipa-param-manipulation.c
18906 (ipa_param_body_adjustments::modify_call_stmt): Avoid
18907 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
18908 via gimple_call_lhs_ptr.
18910 2021-05-11 Alex Coplan <alex.coplan@arm.com>
18913 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
18914 Avoid emitting CFA adjusts on the sp if we have the fp.
18916 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
18918 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
18919 (VMULD): New iterator.
18920 (VCOND): Handle V4HF and V8HF.
18921 (VCONQ): Fix entry for V2SF.
18922 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
18923 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
18924 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
18925 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
18926 Use a 128-bit vector mode for the indexed operand. Use stype for
18927 the scheduling type.
18929 2021-05-11 Richard Biener <rguenther@suse.de>
18931 * gimple-fold.c (maybe_fold_reference): Only return
18932 is_gimple_min_invariant values.
18934 2021-05-11 Richard Biener <rguenther@suse.de>
18936 PR middle-end/100509
18937 * gimple-fold.c (fold_gimple_assign): Only call
18938 get_symbol_constant_value on register type symbols.
18940 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
18941 Joe Ramsay <joe.ramsay@arm.com>
18944 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
18945 (__arm_vcmpneq): Remove duplicate definition.
18946 (__arm_vstrwq_scatter_offset_p): Likewise.
18947 (__arm_vmaxq_x): Likewise.
18948 (__arm_vmlsdavaq): Likewise.
18949 (__arm_vmlsdavaxq): Likewise.
18950 (__arm_vmlsdavq_p): Likewise.
18951 (__arm_vmlsdavxq_p): Likewise.
18952 (__arm_vrmlaldavhaq): Likewise.
18953 (__arm_vstrbq_p): Likewise.
18954 (__arm_vstrbq_scatter_offset): Likewise.
18955 (__arm_vstrbq_scatter_offset_p): Likewise.
18956 (__arm_vstrdq_scatter_offset): Likewise.
18957 (__arm_vstrdq_scatter_offset_p): Likewise.
18958 (__arm_vstrdq_scatter_shifted_offset): Likewise.
18959 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
18961 2021-05-11 Jakub Jelinek <jakub@redhat.com>
18963 PR middle-end/100471
18964 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
18965 is 0, bypass the reduction loop including
18966 GOMP_taskgroup_reduction_unregister call.
18968 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
18970 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
18971 costing_for_scalar.
18972 (rs6000_density_test): Early return if costing_for_scalar is true.
18973 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
18975 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
18977 * doc/tm.texi: Regenerated.
18978 * target.def (init_cost): Add new parameter costing_for_scalar.
18979 * targhooks.c (default_init_cost): Adjust for new parameter.
18980 * targhooks.h (default_init_cost): Likewise.
18981 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
18982 (vect_compute_single_scalar_iteration_cost): Likewise.
18983 (vect_analyze_loop_2): Likewise.
18984 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
18985 (vect_bb_vectorization_profitable_p): Likewise.
18986 * tree-vectorizer.h (init_cost): Likewise.
18987 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
18988 * config/i386/i386.c (ix86_init_cost): Likewise.
18989 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
18991 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
18993 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
18994 vect_nonmem and moved into...
18995 (struct rs6000_cost_data): ...here.
18996 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
18997 (rs6000_add_stmt_cost): Likewise.
18998 (rs6000_finish_cost): Likewise.
19000 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
19002 * range-op.cc (get_bool_state): Adjust head comment.
19003 (operator_not_equal::op1_range): Fix comment.
19004 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
19006 2021-05-10 Martin Sebor <msebor@redhat.com>
19008 PR middle-end/100425
19009 PR middle-end/100510
19010 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
19011 (pass_walloca::xlimit_certain_p): ...to this.
19012 (pass_walloca::gate): Execute for any kind of handled warning.
19013 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
19014 warnings when xlimit_certain_p is set.
19016 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
19018 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
19019 Return ALTIVEC_REGS if that is best_class.
19020 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
19022 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19024 * config/arm/arm.h (CPP_SPEC): Remove error message about
19027 2021-05-10 Martin Jambor <mjambor@suse.cz>
19029 * ipa-prop.h (IPA_NODE_REF): Removed.
19030 (IPA_NODE_REF_GET_CREATE): Likewise.
19031 (IPA_EDGE_REF): Likewise.
19032 (IPA_EDGE_REF_GET_CREATE): Likewise.
19033 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
19034 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
19035 use of ipa_node_params_sum.
19036 (ipcp_versionable_function_p): Likewise.
19037 (push_node_to_stack): Likewise.
19038 (pop_node_from_stack): Likewise.
19039 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
19040 direct use of ipa_node_params_sum.
19041 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
19042 ipa_node_params_sum.
19043 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
19045 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
19046 use of ipa_node_params_sum.
19047 (self_recursively_generated_p): Likewise.
19048 (propagate_scalar_across_jump_function): Likewise.
19049 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
19050 direct use of ipa_edge_args_sum, moved the lookup after the early
19051 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
19052 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
19053 direct uses of ipa_node_params_sum.
19054 (propagate_vr_across_jump_function): Likewise.
19055 (propagate_aggregate_lattice): Likewise.
19056 (propagate_aggs_across_jump_function): Likewise.
19057 (propagate_constants_across_call): Likewise, also replaced
19058 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
19059 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
19060 of ipa_node_params_sum.
19061 (estimate_local_effects): Likewise.
19062 (add_all_node_vals_to_toposort): Likewise.
19063 (propagate_constants_topo): Likewise.
19064 (ipcp_propagate_stage): Likewise.
19065 (ipcp_discover_new_direct_edges): Likewise.
19066 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
19067 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
19068 (get_info_about_necessary_edges): Likewise.
19069 (want_remove_some_param_p): Likewise.
19070 (create_specialized_node): Likewise.
19071 (self_recursive_pass_through_p): Likewise.
19072 (self_recursive_agg_pass_through_p): Likewise.
19073 (find_more_scalar_values_for_callers_subset): Likewise and also
19074 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
19075 case replacing two of those with a single query.
19076 (find_more_contexts_for_caller_subset): Likewise for the
19077 ipa_polymorphic_call_context overload.
19078 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
19079 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
19080 ipa_node_params_sum.
19081 (find_aggregate_values_for_callers_subset): Likewise, also reusing
19082 results of ipa_edge_args_sum->get.
19083 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
19084 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
19085 direct use of ipa_edge_args_sum.
19086 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
19087 summary query after the early exit and reused the result later.
19088 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
19089 ipa_node_params_sum.
19090 (decide_whether_version_node): Likewise. Removed re-querying for
19091 summaries after cloning.
19092 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
19093 ipa_node_params_sum.
19094 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
19096 (identify_dead_nodes): Likewise.
19097 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
19098 ipa_node_params_sum.
19099 (ipcp_store_vr_results): Likewise.
19100 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
19101 (ipa_fn_summary_t::duplicate): Likewise.
19102 (analyze_function_body): Likewise.
19103 (estimate_calls_size_and_time): Likewise.
19104 (ipa_cached_call_context::duplicate_from): Likewise.
19105 (ipa_call_context::equal_to): Likewise.
19106 (remap_edge_params): Likewise.
19107 (ipa_merge_fn_summary_after_inlining): Likewise.
19108 (inline_read_section): Likewise.
19109 * ipa-icf.c (sem_function::param_used_p): Likewise.
19110 * ipa-modref.c (compute_parm_map): Likewise.
19111 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
19113 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
19114 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
19116 * ipa-profile.c (check_argument_count): Likewise.
19117 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
19118 with a direct use of ipa_node_params_sum.
19119 (ipa_initialize_node_params): Likewise.
19120 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
19121 direct use of ipa_edge_args_sum and reused the query result.
19122 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
19123 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
19124 direct use of ipa_edge_args_sum.
19125 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
19126 ipa_node_params_sum and reused the result of the query.
19127 (ipa_analyze_node): Likewise.
19128 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
19129 of ipa_node_params_sum.
19130 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
19131 direct uses of ipa_edge_args_sum.
19132 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
19133 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
19134 direct use of ipa_edge_args_sum. Removed superficial re-querying the
19136 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
19137 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
19139 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
19140 direct use of ipa_edge_args_sum.
19141 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
19142 use of ipa_node_params_sum.
19143 (ipa_print_node_params): Likewise.
19144 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
19145 direct uses of ipa_edge_args_sum.
19146 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
19148 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
19149 ipa_node_params_sum.
19150 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
19151 scopes where it is used.
19153 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
19155 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
19156 <case E_V2SImode>: Force op_true to register.
19158 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19160 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
19161 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
19162 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
19164 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
19165 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
19166 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
19167 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
19168 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
19169 (mve_vcmpneq_n_f<mode>): Remove.
19170 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
19171 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
19172 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
19174 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19176 * config/arm/iterators.md (MVE_COMPARISONS): New.
19178 (mve_cmp_type): New.
19179 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
19181 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
19182 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
19183 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
19184 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
19185 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
19186 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
19187 (mve_vcmpneq_n_<mode>): Remove.
19189 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19191 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
19192 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
19193 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
19196 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19198 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
19199 (vcmpneq_n_u): Likewise.
19200 (vcmpeqq_u,): Likewise.
19201 (vcmpeqq_n_u): Likewise.
19202 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
19203 VCMPEQQ_N_U and VCMPNEQ_N_U.
19204 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
19205 (mve_vcmpeqq_n): Likewise.
19206 (mve_vcmpeqq): Likewise.
19207 (mve_vcmpneq_n): Likewise.
19209 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19211 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
19212 the 's' version of the builtin.
19214 2021-05-10 Richard Biener <rguenther@suse.de>
19216 PR tree-optimization/100492
19217 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
19218 Find nothing when the loop contains an irreducible region.
19220 2021-05-10 Richard Biener <rguenther@suse.de>
19222 PR middle-end/100464
19224 * gimple-fold.c (canonicalize_constructor_val): Do not set
19227 2021-05-10 Richard Biener <rguenther@suse.de>
19229 PR tree-optimization/100434
19230 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
19232 (dse_optimize_stmt): Handle call LHS by dropping the
19233 LHS or the whole call if it doesn't have other
19235 (pass_dse::execute): Adjust.
19237 2021-05-10 Martin Liska <mliska@suse.cz>
19239 * Makefile.in: Add missing genversion rule.
19241 2021-05-10 Alex Coplan <alex.coplan@arm.com>
19244 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
19245 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
19247 2021-05-10 Martin Liska <mliska@suse.cz>
19249 * builtins.c (is_builtin_name): Use startswith
19250 function instead of strncmp.
19251 * collect2.c (main): Likewise.
19252 (has_lto_section): Likewise.
19253 (scan_libraries): Likewise.
19254 * coverage.c (coverage_checksum_string): Likewise.
19255 (coverage_init): Likewise.
19256 * dwarf2out.c (is_cxx): Likewise.
19257 (gen_compile_unit_die): Likewise.
19258 * gcc-ar.c (main): Likewise.
19259 * gcc.c (init_spec): Likewise.
19260 (read_specs): Likewise.
19261 (execute): Likewise.
19262 (check_live_switch): Likewise.
19263 * genattrtab.c (write_attr_case): Likewise.
19264 (IS_ATTR_GROUP): Likewise.
19265 * gencfn-macros.c (main): Likewise.
19266 * gengtype.c (type_for_name): Likewise.
19267 (gen_rtx_next): Likewise.
19268 (get_file_langdir): Likewise.
19269 (write_local): Likewise.
19270 * genmatch.c (get_operator): Likewise.
19271 (get_operand_type): Likewise.
19272 (expr::gen_transform): Likewise.
19273 * genoutput.c (validate_optab_operands): Likewise.
19274 * incpath.c (add_sysroot_to_chain): Likewise.
19275 * langhooks.c (lang_GNU_C): Likewise.
19276 (lang_GNU_CXX): Likewise.
19277 (lang_GNU_Fortran): Likewise.
19278 (lang_GNU_OBJC): Likewise.
19279 * lto-wrapper.c (run_gcc): Likewise.
19280 * omp-general.c (omp_max_simt_vf): Likewise.
19281 * omp-low.c (omp_runtime_api_call): Likewise.
19282 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
19283 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
19284 * real.c (real_from_string): Likewise.
19285 * selftest.c (assert_str_startswith): Likewise.
19286 * timevar.c (timer::validate_phases): Likewise.
19287 * tree.c (get_file_function_name): Likewise.
19288 * ubsan.c (ubsan_use_new_style_p): Likewise.
19289 * varasm.c (default_function_rodata_section): Likewise.
19290 (incorporeal_function_p): Likewise.
19291 (default_section_type_flags): Likewise.
19292 * system.h (startswith): Define startswith.
19294 2021-05-10 Martin Liska <mliska@suse.cz>
19296 * bitmap.h (class auto_bitmap): Remove
19297 __cplusplus >= 201103.
19298 * config/aarch64/aarch64.c: Likewise.
19299 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
19301 * sbitmap.h: Likewise.
19303 2021-05-10 Martin Liska <mliska@suse.cz>
19305 * Makefile.in: Rename gcov-iov to genversion and depend
19306 on version.h (instead of gcov-iov.h).
19307 * gcov-io.h: Include version.h instread of gcov-iov.h.
19308 * gengtype-state.c (read_state_version): Likewise.
19309 * gcov-iov.c: Moved to...
19310 * genversion.c: ...here.
19311 * lto-streamer.h (LTO_major_version): Define it with
19313 * version.c: Removed.
19314 * version.h: Removed.
19316 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19318 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
19319 * config/arc/simdext.md (VCT): Add predicates for iterator
19322 (voptab): Likewise.
19323 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
19324 (<voptab>v2si3): New patterns.
19326 (reduc_plus_scal_v4hi): Likewise.
19327 (reduc_plus_scal_v2si): Likewise.
19328 (vec_duplicatev2si): Likewise.
19329 (vec_duplicatev4hi): Likewise.
19331 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19333 * config/arc/simdext.md: Format and cleanup file.
19335 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19337 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
19338 only when munaligned-access option is on.
19339 (movmisalign<mode>): Likewise.
19341 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19343 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
19345 * config/arc/arc.c (arc_reorg): Remove underscore from string.
19347 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19349 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
19350 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
19351 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
19352 (norm_f): Likewise.
19355 (clzsi2): Use fls instruction when available.
19356 (arc_clzsi2): Likewise.
19358 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19360 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
19362 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19364 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
19366 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
19368 PR middle-end/100467
19369 * toplev.c (compile_file): Call insn_locations_init before
19370 targetm.asm_out.code_end.
19372 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
19375 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
19377 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
19379 2021-05-07 Jakub Jelinek <jakub@redhat.com>
19380 Andrew Stubbs <amd@codesourcery.com>
19383 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
19384 emit_move_insn operands.
19386 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
19388 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
19389 location for the outgoing edges of an empty block.
19390 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
19391 type by means of the get_array_descr_info langhook, if it is set and
19392 returns true. Remove obsolete code dealing with unnamed subtypes.
19394 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19396 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
19397 (sbr_vector): Renamed from ssa_block_cache.
19398 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
19399 (ssa_block_ranges::~ssa_block_ranges): Remove.
19400 (sbr_vector::set_bb_range): Use varying and undefined cached values.
19401 (ssa_block_ranges::set_bb_varying): Remove.
19402 (sbr_vector::get_bb_range): Adjust assert.
19403 (sbr_vector::bb_range_p): Adjust assert.
19404 (~block_range_cache): No freeing loop required.
19405 (block_range_cache::get_block_ranges): Remove.
19406 (block_range_cache::set_bb_range): Inline get_block_ranges.
19407 (block_range_cache::set_bb_varying): Remove.
19408 * gimple-range-cache.h (set_bb_varying): Remove prototype.
19409 * value-range.h (irange_allocator::get_memory): New.
19411 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19413 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
19414 dominator tree is available and requested.
19415 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
19416 (ranger_cache::fill_block_cache): Don't search dom tree here either.
19417 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
19419 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19421 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
19422 only PHI nodes better.
19424 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19426 * gimple-range-edge.h (gimple_outgoing_range): Rename from
19428 (gcond_edge_range): Export prototype.
19429 * gimple-range-edge.cc (gcond_edge_range): New.
19430 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
19431 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
19433 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19435 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
19436 default range into a temp and allocate only what is needed.
19438 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19440 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
19442 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19444 * gimple-range.h (gimple_range_global): Pick up parameter initial
19445 values, and use-before defined locals are UNDEFINED.
19447 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
19449 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
19451 * tree.h (reverse_storage_order_for_component_p): Return false if
19452 the type is a pointer.
19454 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
19456 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
19458 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
19461 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
19462 Handle V8QI, V4HI and V2SI modes.
19463 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
19464 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
19465 * config/i386/i386.md (unspec): ... here.
19467 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
19468 Tom de Vries <tdevries@suse.de>
19470 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
19471 a truth_value_p reduction variable is nonintegral.
19473 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
19476 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
19477 Return false for mode sizes < 16.
19479 2021-05-07 Jakub Jelinek <jakub@redhat.com>
19482 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
19484 2021-05-06 Martin Jambor <mjambor@suse.cz>
19486 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
19487 when there is no function summary.
19488 (ipa_sra_summarize_function): produce edge summaries even when
19491 2021-05-06 Tom Tromey <tom@tromey.com>
19493 * godump.c (string_hash_eq): Remove.
19494 (go_finish): Use htab_eq_string.
19496 2021-05-06 Tom Tromey <tom@tromey.com>
19498 * gengtype-state.c (read_state): Use htab_eq_string.
19499 (string_eq): Remove.
19501 2021-05-06 Tom Tromey <tom@tromey.com>
19503 * gensupport.c (htab_eq_string): Remove.
19505 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
19508 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
19509 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
19510 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
19511 set_ignored_loc callbacks.
19512 * debug.c (do_nothing_debug_hooks): Likewise.
19513 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
19514 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
19515 (in_text_section_p, last_text_label, last_cold_label,
19516 switch_text_ranges, switch_cold_ranges): New data items.
19517 (dwarf2out_note_section_used): Remove.
19518 (dwarf2out_begin_prologue): Set fde->ignored_debug and
19520 (mark_ignored_debug_section): New helper function.
19521 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
19522 mark_ignored_debug_section.
19523 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
19524 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
19525 (size_of_aranges): Adjust formula for multi-part text ranges size.
19526 (output_aranges): Output multi-part text ranges.
19527 (dwarf2out_set_ignored_loc): New callback function.
19528 (dwarf2out_finish): Output multi-part text ranges.
19529 (dwarf2out_c_finalize): Clear new data items.
19530 * final.c (final_start_function_1): Call set_ignored_loc callback.
19531 (final_scan_insn_1): Likewise.
19532 * ggc-page.c (gt_ggc_mx): New helper function.
19533 * stringpool.c (gt_pch_nx): Likewise.
19535 2021-05-06 Richard Biener <rguenther@suse.de>
19537 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
19538 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
19539 (TV_TREE_INTO_SSA): New.
19540 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
19541 (rewrite_blocks): Likewise.
19542 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
19544 2021-05-06 Jakub Jelinek <jakub@redhat.com>
19546 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
19547 abs_replacement, xor_replacement,
19548 cond_removal_in_popcount_clz_ctz_pattern,
19549 replace_phi_edge_with_variable): Change type of phi argument from
19550 gimple * to gphi *.
19552 2021-05-06 Richard Biener <rguenther@suse.de>
19554 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
19555 Output an opt-info message.
19556 (do_split_loop_on_cond): Likewise.
19557 (tree_ssa_split_loops): Update SSA form here.
19559 2021-05-06 Richard Biener <rguenther@suse.de>
19561 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
19562 return variable removal.
19564 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
19566 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
19567 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
19568 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
19570 * config/s390/s390.c (s390_const_operand_ok): Remove unused
19573 2021-05-06 Jakub Jelinek <jakub@redhat.com>
19575 PR tree-optimization/94589
19576 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
19577 spaceship_replacement.
19578 (cond_only_block_p, spaceship_replacement): New functions.
19580 2021-05-06 Richard Biener <rguenther@suse.de>
19583 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
19584 dealing with a debug use and only query existing addresses
19586 (lower_emutls_1): Avoid splitting out addresses for debug
19587 stmts, reset the debug stmt when we fail to find existing
19589 (lower_emutls_phi_arg): Set wi.stmt.
19591 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
19594 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
19595 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
19596 (stack_protect_test): Use cbranch helper.
19598 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
19601 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
19602 always return the establisher frame for __builtin_frame_address (0).
19604 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
19607 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
19608 (ix86_cpu_model_var): Likewise.
19609 (ix86_cpu_features2_type_node): Likewise.
19610 (ix86_cpu_features2_var): Likewise.
19611 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
19614 2021-05-05 Martin Sebor <msebor@redhat.com>
19616 * passes.def (pass_warn_printf): Run after SSA.
19618 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
19620 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
19621 * config/arm/predicates.md (minus_one_operand): New predicate.
19623 2021-05-05 Jeff Law <jlaw@tachyum.com>
19625 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
19627 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
19629 PR rtl-optimization/100263
19630 * postreload.c (move2add_valid_value_p): Ensure register can
19633 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
19635 PR rtl-optimization/100411
19636 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
19637 and beginning of function markers.
19639 2021-05-05 Jeff Law <jlaw@tachyum.com>
19641 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
19642 * config/cr16/cr16.c (notice_update_cc): Remove.
19643 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
19645 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
19648 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
19649 Handle V8QI, V4HI and V2SI modes.
19650 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
19651 (ix86_build_signbit_mask): Ditto.
19652 * config/i386/mmx.md (MMXMODE14): New mode iterator.
19653 (<smaxmin:code><MMXMODE14:mode>3): New expander.
19654 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
19655 (<umaxmin:code><MMXMODE24:mode>3): New expander.
19656 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
19657 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
19658 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
19659 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
19660 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
19661 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
19663 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
19665 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
19666 not expand the VALUE_EXPR of variables put in the non-local frame.
19667 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
19668 to be ignored for debug info, ensure its variable offsets are not.
19670 2021-05-05 Richard Biener <rguenther@suse.de>
19672 PR tree-optimization/79333
19673 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
19674 Fold stmt following SSA edges.
19676 2021-05-05 Richard Biener <rguenther@suse.de>
19678 PR middle-end/100394
19679 * calls.c (expand_call): Preserve possibly throwing calls.
19680 * cfgexpand.c (expand_call_stmt): When a call can throw signal
19681 RTL expansion there are side-effects.
19682 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
19683 mark all possibly throwing stmts necessary unless we can elide
19685 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
19686 -fdelete-dead-exceptions.
19687 * tree.h (DECL_PURE_P): Add note about exceptions.
19689 2021-05-05 Alexandre Oliva <oliva@adacore.com>
19691 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
19694 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
19696 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
19697 get_fnname_from_decl for name of thunk.
19698 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
19699 and ASM_OUTPUT_LABEL.
19700 (rs6000_xcoff_declare_function_name): Use assemble_name and
19702 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
19703 (rs6000_xcoff_encode_section_info): Don't add mapping class
19704 for aliases. Always add [DS] mapping class to primary
19706 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
19708 2021-05-04 Martin Sebor <msebor@redhat.com>
19710 PR middle-end/100307
19711 * builtins.c (compute_objsize_r): Clear base0 for pointers.
19713 2021-05-04 Jeff Law <jlaw@tachyum.com>
19715 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
19717 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
19719 * caller-save.c: Remove CC0.
19720 * cfgcleanup.c: Remove CC0.
19721 * cfgrtl.c: Remove CC0.
19722 * combine.c: Remove CC0.
19723 * compare-elim.c: Remove CC0.
19724 * conditions.h: Remove CC0.
19725 * config/h8300/h8300.h: Remove CC0.
19726 * config/h8300/h8300-protos.h: Remove CC0.
19727 * config/h8300/peepholes.md: Remove CC0.
19728 * config/i386/x86-tune-sched.c: Remove CC0.
19729 * config/m68k/m68k.c: Remove CC0.
19730 * config/rl78/rl78.c: Remove CC0.
19731 * config/sparc/sparc.c: Remove CC0.
19732 * config/xtensa/xtensa.c: Remove CC0.
19733 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
19734 RTL where that is used as a placeholder only.
19735 * cprop.c: Remove CC0.
19736 * cse.c: Remove CC0.
19737 * cselib.c: Remove CC0.
19738 * df-problems.c: Remove CC0.
19739 * df-scan.c: Remove CC0.
19740 * doc/md.texi: Remove CC0. Adjust an example.
19741 * doc/rtl.texi: Remove CC0. Adjust an example.
19742 * doc/tm.texi: Regenerate.
19743 * doc/tm.texi.in: Remove CC0.
19744 * emit-rtl.c: Remove CC0.
19745 * final.c: Remove CC0.
19746 * fwprop.c: Remove CC0.
19747 * gcse-common.c: Remove CC0.
19748 * gcse.c: Remove CC0.
19749 * genattrtab.c: Remove CC0.
19750 * genconfig.c: Remove CC0.
19751 * genemit.c: Remove CC0.
19752 * genextract.c: Remove CC0.
19753 * gengenrtl.c: Remove CC0.
19754 * genrecog.c: Remove CC0.
19755 * haifa-sched.c: Remove CC0.
19756 * ifcvt.c: Remove CC0.
19757 * ira-costs.c: Remove CC0.
19758 * ira.c: Remove CC0.
19759 * jump.c: Remove CC0.
19760 * loop-invariant.c: Remove CC0.
19761 * lra-constraints.c: Remove CC0.
19762 * lra-eliminations.c: Remove CC0.
19763 * optabs.c: Remove CC0.
19764 * postreload-gcse.c: Remove CC0.
19765 * postreload.c: Remove CC0.
19766 * print-rtl.c: Remove CC0.
19767 * read-rtl-function.c: Remove CC0.
19768 * reg-notes.def: Remove CC0.
19769 * reg-stack.c: Remove CC0.
19770 * reginfo.c: Remove CC0.
19771 * regrename.c: Remove CC0.
19772 * reload.c: Remove CC0.
19773 * reload1.c: Remove CC0.
19774 * reorg.c: Remove CC0.
19775 * resource.c: Remove CC0.
19776 * rtl.c: Remove CC0.
19777 * rtl.def: Remove CC0.
19778 * rtl.h: Remove CC0.
19779 * rtlanal.c: Remove CC0.
19780 * sched-deps.c: Remove CC0.
19781 * sched-rgn.c: Remove CC0.
19782 * shrink-wrap.c: Remove CC0.
19783 * simplify-rtx.c: Remove CC0.
19784 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
19785 CC_STATUS_MDEP, and CC_STATUS.
19786 * target.def: Remove CC0.
19787 * valtrack.c: Remove CC0.
19788 * var-tracking.c: Remove CC0.
19790 2021-05-04 Richard Biener <rguenther@suse.de>
19792 PR tree-optimization/100414
19793 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
19795 (tree_ssa_phiopt_worker): But unconditionally here.
19797 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
19799 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
19800 && and || with floating-point and complex arguments.
19802 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
19804 * tree-inline.c (insert_debug_decl_map): Delete.
19805 (copy_debug_stmt): Minor tweak.
19806 (setup_one_parameter): Do not use a variable if the value is either
19807 a read-only DECL or a non-addressable local variable in the caller.
19808 In this case, insert the debug-only variable in the map manually.
19809 (expand_call_inline): Do not generate a CLOBBER for these values.
19810 * tree-inline.h (debug_map): Minor tweak.
19812 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
19814 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
19815 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
19817 2021-05-04 Richard Biener <rguenther@suse.de>
19819 PR tree-optimization/100329
19820 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
19822 (insert_stmt_after): Assert we're not running into asm goto.
19824 2021-05-04 Richard Biener <rguenther@suse.de>
19826 PR tree-optimization/100398
19827 * tree-ssa-dse.c (pass_dse::execute): Preserve control
19830 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
19832 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
19834 2021-05-04 Alexandre Oliva <oliva@adacore.com>
19836 * builtins.c (try_store_by_multiple_pieces): New.
19837 (expand_builtin_memset_args): Use it. If target_char_cast
19838 fails, proceed as for non-constant val. Pass len's ctz to...
19839 * expr.c (clear_storage_hints): ... this. Try store by
19840 multiple pieces after setmem.
19841 (clear_storage): Adjust.
19842 * expr.h (clear_storage_hints): Likewise.
19843 (try_store_by_multiple_pieces): Declare.
19844 * passes.def: Replace the last copy_prop with ccp.
19846 2021-05-03 Tom de Vries <tdevries@suse.de>
19849 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
19852 2021-05-03 Richard Biener <rguenther@suse.de>
19854 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
19856 2021-05-03 Richard Biener <rguenther@suse.de>
19858 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
19859 (dse_dom_walker): Remove.
19860 (dse_dom_walker::dse_optimize_stmt): Rename...
19861 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
19862 (dse_dom_walker::before_dom_children): Inline ...
19863 (pass_dse::execute): ... here. Perform a reverse program
19866 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
19869 * configure: Regenerated.
19871 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
19874 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
19875 (s390_md_asm_adjust): Handle hard registers.
19877 2021-05-03 Jakub Jelinek <jakub@redhat.com>
19879 PR tree-optimization/100382
19880 * tree-ssa-dse.c: Include tree-eh.h.
19881 (dse_dom_walker::before_dom_children): Don't remove stmts if
19882 stmt_unremovable_because_of_non_call_eh_p is true.
19884 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
19886 * varasm.c (compute_reloc_for_var): Split out from...
19887 (get_variable_section): Use it.
19888 * output.h (compute_reloc_for_var): Declare.
19889 * config/rs6000/rs6000-protos.h
19890 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
19892 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
19893 Don't append storage mapping class to symbol.
19894 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
19895 Don't convert TLS BSS to common.
19896 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
19897 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
19899 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
19901 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
19902 If align is 0 from TLS class, use the same rules as varasm.c
19903 If not common, switch to BSS section manually.
19904 If common, emit appropriate comm or lcomm directive.
19905 (rs6000_xcoff_encode_section_info): Add logic to append all
19906 storage mapping classes.
19907 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
19908 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
19909 rs6000_xcoff_asm_output_aligned_decl_common.
19910 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
19911 rs6000_xcoff_asm_output_aligned_decl_common.
19912 (ASM_OUTPUT_TLS_COMMON): Use
19913 rs6000_xcoff_asm_output_aligned_decl_common.
19915 2021-05-02 Jakub Jelinek <jakub@redhat.com>
19918 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
19919 as first argument of pseudo_node_t constructors.
19921 2021-05-02 Jakub Jelinek <jakub@redhat.com>
19924 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
19926 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
19928 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
19929 (gt_pch_nx (int_range<1> *&)): New.
19930 (gt_ggc_mx (int_range<1> *&)): New.
19931 * value-range.h (class irange): Add GTY support for
19934 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
19936 * doc/options.texi (Negative): Change either or to both and.
19938 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
19940 * config/aarch64/aarch64-simd-builtins.def: Add
19941 float_ml[as][q]_laneq builtin generator macros.
19942 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
19943 (aarch64_float_mla_laneq<mode>): Define.
19944 (aarch64_float_mls_laneq<mode>): Define.
19945 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
19946 instead of GCC vector extensions.
19947 (vmlaq_laneq_f32): Likewise.
19948 (vmls_laneq_f32): Likewise.
19949 (vmlsq_laneq_f32): Likewise.
19951 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
19953 * config/aarch64/aarch64-simd-builtins.def: Add
19954 float_ml[as]_lane builtin generator macros.
19955 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
19957 (mul_lane<mode>3): This, and re-order arguments.
19958 (aarch64_float_mla_lane<mode>): Define.
19959 (aarch64_float_mls_lane<mode>): Define.
19960 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
19961 instead of GCC vector extensions.
19962 (vmlaq_lane_f32): Likewise.
19963 (vmls_lane_f32): Likewise.
19964 (vmlsq_lane_f32): Likewise.
19966 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
19968 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
19969 builtin generator macros.
19970 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
19972 (aarch64_float_mls<mode>): Define.
19973 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
19974 instead of relying on GCC vector extensions.
19975 (vmla_f64): Likewise.
19976 (vmlaq_f32): Likewise.
19977 (vmlaq_f64): Likewise.
19978 (vmls_f32): Likewise.
19979 (vmls_f64): Likewise.
19980 (vmlsq_f32): Likewise.
19981 (vmlsq_f64): Likewise.
19982 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
19984 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
19986 * config/aarch64/aarch64-simd-builtins.def: Add
19987 float_ml[as]_n_builtin generator macros.
19988 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
19990 (mul_n<mode>3): This, and re-order arguments.
19991 (aarch64_float_mla_n<mode>): Define.
19992 (aarch64_float_mls_n<mode>): Define.
19993 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
19994 instead of inline asm.
19995 (vmlaq_n_f32): Likewise.
19996 (vmls_n_f32): Likewise.
19997 (vmlsq_n_f32): Likewise.
19999 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
20001 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
20002 builtin generator macros.
20003 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
20004 (aarch64_pmull_hiv16qi_insn): Define.
20005 (aarch64_pmull_hiv16qi): Define.
20006 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
20007 instead of inline asm.
20008 (vmull_p8): Likewise.
20010 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
20012 * config/avr/avr.md: Adjust peepholes to match and
20013 generate parallels with clobber of REG_CC.
20014 (mov<mode>_insn): Rename to mov<mode>_insn_split.
20015 (*mov<mode>_insn): Rename to mov<mode>_insn.
20017 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
20019 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
20020 for VAR_DECLs if -fdata-sections enabled.
20022 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
20024 PR bootstrap/100327
20025 * config/rs6000/rs6000.c
20026 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
20027 (rs6000_libgcc_floating_mode_supported_p): New target hook.
20029 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
20031 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
20033 (class back_threader_registry): ...to here...
20034 (class back_threader_profitability): ...and here...
20035 (thread_jumps::thread_through_all_blocks): Remove argument.
20036 (back_threader_registry::back_threader_registry): New.
20037 (back_threader_registry::~back_threader_registry): New.
20038 (back_threader_registry::thread_through_all_blocks): New.
20039 (thread_jumps::profitable_jump_thread_path): Move from here...
20040 (back_threader_profitability::profitable_path_p): ...to here.
20041 (thread_jumps::find_taken_edge): New.
20042 (thread_jumps::convert_and_register_current_path): Move...
20043 (back_threader_registry::register_path): ...to here.
20044 (thread_jumps::register_jump_thread_path_if_profitable): Move...
20045 (thread_jumps::maybe_register_path): ...to here.
20046 (thread_jumps::handle_phi): Call find_taken_edge and
20047 maybe_register_path.
20048 (thread_jumps::handle_assignment): Same.
20049 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
20050 tree argument to handle_phi and handle_assignment.
20051 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
20052 set of m_speed_p and m_max_threaded_paths.
20053 (pass_thread_jumps::execute): Remove second argument from
20054 find_jump_threads_backwards.
20055 (pass_early_thread_jumps::execute): Same.
20057 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
20059 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
20060 (class dom_opt_dom_walker): Initialize some class variables.
20061 (pass_dominator::execute): Pass evrp_range_analyzer and
20062 dom_jump_threader_simplifier to dom_opt_dom_walker.
20063 Adjust for some functions moving into classes.
20064 (simplify_stmt_for_jump_threading): Adjust and move to...
20065 (jump_threader_simplifier::simplify): ...here.
20066 (dom_opt_dom_walker::before_dom_children): Adjust for
20067 m_evrp_range_analyzer.
20068 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
20069 (test_for_singularity): Place in dom_opt_dom_walker class.
20070 (dom_opt_dom_walker::optimize_stmt): The argument
20071 evrp_range_analyzer is now a class field.
20072 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
20073 (thread_jumps::thread_through_all_blocks): New.
20074 (thread_jumps::convert_and_register_current_path): Use m_registry.
20075 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
20076 being in the threader class.
20077 (pass_early_thread_jumps::execute): Same.
20078 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
20079 (jump_threader::jump_threader): ...here.
20080 (threadedge_finalize_values): Move...
20081 (jump_threader::~jump_threader): ...here.
20082 (jump_threader::remove_jump_threads_including): New.
20083 (jump_threader::thread_through_all_blocks): New.
20084 (record_temporary_equivalences_from_phis): Move...
20085 (jump_threader::record_temporary_equivalences_from_phis): ...here.
20086 (record_temporary_equivalences_from_stmts_at_dest): Move...
20087 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
20089 (simplify_control_stmt_condition_1): Move to jump_threader class.
20090 (simplify_control_stmt_condition): Move...
20091 (jump_threader::simplify_control_stmt_condition): ...here.
20092 (thread_around_empty_blocks): Move...
20093 (jump_threader::thread_around_empty_blocks): ...here.
20094 (thread_through_normal_block): Move...
20095 (jump_threader::thread_through_normal_block): ...here.
20096 (thread_across_edge): Move...
20097 (jump_threader::thread_across_edge): ...here.
20098 (thread_outgoing_edges): Move...
20099 (jump_threader::thread_outgoing_edges): ...here.
20100 * tree-ssa-threadedge.h: Move externally facing functings...
20101 (class jump_threader): ...here...
20102 (class jump_threader_simplifier): ...and here.
20103 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
20104 (jump_thread_path_allocator::jump_thread_path_allocator): New.
20105 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
20106 (jump_thread_path_allocator::allocate_thread_edge): New.
20107 (jump_thread_path_allocator::allocate_thread_path): New.
20108 (jump_thread_path_registry::jump_thread_path_registry): New.
20109 (jump_thread_path_registry::~jump_thread_path_registry): New.
20110 (jump_thread_path_registry::allocate_thread_edge): New.
20111 (jump_thread_path_registry::allocate_thread_path): New.
20112 (dump_jump_thread_path): Make extern.
20113 (debug (const vec<jump_thread_edge *> &path)): New.
20114 (struct removed_edges): Move to tree-ssa-threadupdate.h.
20115 (struct thread_stats_d): Remove.
20116 (remove_ctrl_stmt_and_useless_edges): Make static.
20117 (lookup_redirection_data): Move...
20118 (jump_thread_path_registry::lookup_redirection_data): ...here.
20119 (ssa_redirect_edges): Make static.
20120 (thread_block_1): Move...
20121 (jump_thread_path_registry::thread_block_1): ...here.
20122 (thread_block): Move...
20123 (jump_thread_path_registry::thread_block): ...here.
20124 (thread_through_loop_header): Move...
20125 (jump_thread_path_registry::thread_through_loop_header): ...here.
20126 (mark_threaded_blocks): Move...
20127 (jump_thread_path_registry::mark_threaded_blocks): ...here.
20128 (debug_path): Move...
20129 (jump_thread_path_registry::debug_path): ...here.
20130 (debug_all_paths): Move...
20131 (jump_thread_path_registry::dump): ..here.
20132 (rewire_first_differing_edge): Move...
20133 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
20134 (adjust_paths_after_duplication): Move...
20135 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
20136 (duplicate_thread_path): Move...
20137 (jump_thread_path_registry::duplicate_thread_path): ..here.
20138 (remove_jump_threads_including): Move...
20139 (jump_thread_path_registry::remove_jump_threads_including): ...here.
20140 (thread_through_all_blocks): Move to...
20141 (jump_thread_path_registry::thread_through_all_blocks): ...here.
20142 (delete_jump_thread_path): Remove.
20143 (register_jump_thread): Move...
20144 (jump_thread_path_registry::register_jump_thread): ...here.
20145 * tree-ssa-threadupdate.h: Move externally facing functions...
20146 (class jump_thread_path_allocator): ...here...
20147 (class jump_thread_path_registry): ...and here.
20148 (thread_through_all_blocks): Remove.
20149 (struct removed_edges): New.
20150 (register_jump_thread): Remove.
20151 (remove_jump_threads_including): Remove.
20152 (delete_jump_thread_path): Remove.
20153 (remove_ctrl_stmt_and_useless_edges): Remove.
20154 (free_dom_edge_info): New prototype.
20155 * tree-vrp.c: Remove x_vr_values hack.
20156 (class vrp_jump_threader_simplifier): New.
20157 (vrp_jump_threader_simplifier::simplify): New.
20158 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
20159 Remove m_dummy_cond.
20160 Instantiate m_simplifier and m_threader.
20161 (vrp_jump_threader::thread_through_all_blocks): New.
20162 (vrp_jump_threader::simplify_stmt): Remove.
20163 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
20164 Remove x_vr_values hack.
20165 (execute_vrp): Adjust for thread_through_all_blocks being in a
20168 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
20170 * genflags.c (gen_insn): Print failed expansion string.
20172 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
20174 * expr.c (alignment_for_piecewise_move): Call mode_for_size
20175 without limit to MAX_FIXED_MODE_SIZE.
20177 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
20179 PR middle-end/90773
20180 * builtins.c (builtin_memset_gen_str): Don't use return from
20181 simplify_gen_subreg.
20183 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
20186 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
20187 (*addsi3_carry_zext_0r): Ditto.
20188 (*sub<mode>3_carry_0): Ditto.
20189 (*subsi3_carry_zext_0r): Ditto.
20190 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
20192 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
20193 Also consider ix86_carry_flag_unset_operator to calculate
20194 the cost of adc/sbb insn.
20196 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
20198 PR rtl-optimization/100225
20199 PR rtl-optimization/84878
20200 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
20201 where we have an instruction which touches (writes) any hard
20202 register from df->regular_block_artificial_uses set.
20203 Allow not-single-set instruction only right before basic block
20206 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
20208 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
20210 2021-04-30 LevyHsu <admin@levyhsu.com>
20212 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
20213 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
20214 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
20215 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
20217 2021-04-29 Alexandre Oliva <oliva@adacore.com>
20219 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
20221 2021-04-29 Alexandre Oliva <oliva@adacore.com>
20223 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
20224 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
20225 * config/i386/i386.c: Adjust.
20226 * config/i386/i386.md: Adjust.
20227 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
20228 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20229 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20230 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20231 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20232 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20233 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20234 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20235 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20236 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20237 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
20239 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
20241 * config/i386/i386-expand.c (ix86_expand_int_compare):
20242 Swap operands of GTU and LEU comparison to emit carry flag comparison.
20243 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
20244 predicate to allow more combine opportunities with memory operands.
20245 (*sub<mode>3_carry_0): Ditto.
20247 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
20249 PR rtl-optimization/100303
20250 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
20251 boolean that indicates whether the use will only be used in
20252 debug instructions. Treat it in the same way that existing
20253 cross-EBB debug references would be handled if so.
20254 (function_info::make_uses_available): Likewise.
20255 * rtl-ssa/functions.h (function_info::make_uses_available): Update
20256 prototype accordingly.
20257 (function_info::make_uses_available): Likewise.
20258 * fwprop.c (try_fwprop_subst): Update call accordingly.
20260 2021-04-29 Jeff Law <jlaw@tachyum.com>
20262 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
20265 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
20266 Richard Biener <rguenther@suse.de>
20269 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
20270 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
20271 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
20272 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
20273 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
20274 to PURE_ARGS category.
20275 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
20276 Handle PURE_ARGS category.
20277 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
20279 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
20281 * configure.ac: Check for the presence of sys/locking.h header and
20282 for whether _LK_LOCK is supported by _locking.
20283 * configure: Regenerate.
20284 * config.in: Likewise.
20285 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
20286 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
20287 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
20289 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
20291 * config/i386/predicates.md (fcmov_comparison_operator):
20292 Do not check for trivial FP comparison operator.
20293 <case GEU, case LTU>: Allow CCGZmode.
20294 <case GTU, case LEU>: Do not allow CCCmode.
20295 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
20296 (ix86_carry_flag_operator): Match only LTU and UNLT code.
20297 Do not check for trivial FP comparison operator. Allow CCGZmode.
20299 2021-04-29 Tom de Vries <tdevries@suse.de>
20301 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
20302 fd->loop.step by either step or orig_step.
20304 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
20306 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
20307 (load_got_register): Do the PIC dance here.
20308 (sparc_legitimize_tls_address): Simplify.
20309 (sparc_emit_probe_stack_range): Likewise.
20310 (sparc32_initialize_trampoline): Likewise.
20311 (sparc64_initialize_trampoline): Likewise.
20312 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
20313 (probe_stack_range<P:mode>): Likewise.
20314 (flush<P:mode>): Likewise.
20315 (tgd_hi22<P:mode>): Likewise.
20316 (tgd_lo10<P:mode>): Likewise.
20317 (tgd_add<P:mode>): Likewise.
20318 (tgd_call<P:mode>): Likewise.
20319 (tldm_hi22<P:mode>): Likewise.
20320 (tldm_lo10<P:mode>): Likewise.
20321 (tldm_add<P:mode>): Likewise.
20322 (tldm_call<P:mode>): Likewise.
20323 (tldo_hix22<P:mode>): Likewise.
20324 (tldo_lox10<P:mode>): Likewise.
20325 (tldo_add<P:mode>): Likewise.
20326 (tie_hi22<P:mode>): Likewise.
20327 (tie_lo10<P:mode>): Likewise.
20328 (tie_add<P:mode>): Likewise.
20329 (tle_hix22<P:mode>): Likewise.
20330 (tle_lox10<P:mode>): Likewise.
20331 (stack_protect_setsi): Rename to...
20332 (stack_protect_set32): ...this.
20333 (stack_protect_setdi): Rename to...
20334 (stack_protect_set64): ...this.
20335 (stack_protect_set): Adjust calls to above.
20336 (stack_protect_testsi): Rename to...
20337 (stack_protect_test32): ...this.
20338 (stack_protect_testdi): Rename to...
20339 (stack_protect_test64): ...this.
20340 (stack_protect_test): Adjust calls to above.
20342 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
20344 PR middle-end/90773
20345 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
20346 (builtin_strncpy_read_str): Likewise.
20347 (builtin_memset_read_str): Add an argument for the previous RTL
20348 information and generate the new RTL from the previous RTL info.
20349 (builtin_memset_gen_str): Likewise.
20350 * builtins.h (builtin_strncpy_read_str): Update the prototype.
20351 (builtin_memset_read_str): Likewise.
20352 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
20353 returns true, round up size and alignment to the widest integer
20354 mode for maximum size.
20355 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
20356 and pass it to m_constfn.
20357 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
20358 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
20359 initialize m_push. Initialize m_overlap_op_by_pieces with
20360 targetm.overlap_op_by_pieces_p ().
20361 (op_by_pieces_d::run): Pass the previous RTL information to
20362 pieces_addr::adjust and generate overlapping operations if
20363 m_overlap_op_by_pieces is true.
20365 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
20367 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
20369 (can_store_by_pieces): Use by_pieces_constfn on constfun.
20370 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
20371 for op_by_pieces_d change.
20372 (clear_by_pieces_1): Add a dummy argument.
20373 (clear_by_pieces): Updated for op_by_pieces_d change.
20374 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
20375 (string_cst_read_str): Add a dummy argument.
20376 * expr.h (by_pieces_constfn): Add a dummy argument.
20377 (by_pieces_prev): New.
20378 * target.def (overlap_op_by_pieces_p): New target hook.
20379 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
20380 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
20381 * doc/tm.texi: Regenerated.
20383 2021-04-29 Richard Biener <rguenther@suse.de>
20385 PR tree-optimization/100253
20386 * tree-vect-stmts.c (vectorizable_load): Do not assume
20387 element alignment when DR_MISALIGNMENT is -1.
20388 (vectorizable_store): Likewise.
20390 2021-04-29 Jakub Jelinek <jakub@redhat.com>
20393 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
20394 absu_hwi instead of abs_hwi.
20396 2021-04-29 Richard Biener <rguenther@suse.de>
20398 PR middle-end/38474
20399 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
20400 forwarding when indirect forwarding through ESCAPED
20403 2021-04-29 Tom de Vries <tdevries@suse.de>
20406 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
20407 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
20408 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
20409 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
20411 2021-04-29 Richard Biener <rguenther@suse.de>
20413 PR tree-optimization/99912
20414 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
20415 (dse_dom_walker::todo): Likewise.
20416 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
20418 (dse_dom_walker::before_dom_children): Remove trivially
20419 dead SSA defs and schedule CFG cleanup if we removed all
20421 (pass_dse::execute): Get TODO as computed by the DOM walker
20422 and return it. Wipe dominator info earlier.
20424 2021-04-29 Richard Biener <rguenther@suse.de>
20427 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
20428 Track blocks to cleanup EH in new m_need_eh_cleanup.
20429 (ipcp_modif_dom_walker::cleanup_eh): New.
20430 (ipcp_transform_function): Release dominator info before
20433 2021-04-29 Martin Sebor <msebor@redhat.com>
20435 PR middle-end/100250
20436 * attribs.c (attr_access::array_as_string): Avoid dereferencing
20437 a pointer when it's null.
20439 2021-04-29 Martin Sebor <msebor@redhat.com>
20441 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
20442 * ipa-free-lang-data.cc: New file.
20443 * tree.c: Move pass free_lang_data to file above.
20444 (build_array_type_1): Declare extern.
20445 * tree.h (build_array_type_1): Declare.
20447 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20449 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
20450 make consistent with updated RTL pattern.
20451 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
20452 Implement using ss_truncate and us_truncate rather than
20454 * config/aarch64/iterators.md: Remove redundant unspecs and
20455 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
20457 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20459 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
20460 attributes consistent with those defined in arm_neon.h.
20462 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20464 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
20465 attributes consistent with those defined in arm_neon.h.
20467 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20469 * config/aarch64/aarch64-simd-builtins.def: Add
20470 float_trunc_rodd builtin generator macros.
20471 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
20473 (aarch64_float_trunc_rodd_lo_v2sf): Define.
20474 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
20475 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
20476 (aarch64_float_trunc_rodd_hi_v4sf): Define.
20477 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
20478 instead of inline asm.
20479 (vcvtx_high_f32_f64): Likewise.
20480 (vcvtxd_f32_f64): Likewise.
20481 * config/aarch64/iterators.md: Add FCVTXN unspec.
20483 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20485 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
20487 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
20489 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
20490 instead of inline asm.
20491 (vqtbx1_u8): Likewise.
20492 (vqtbx1_p8): Likewise.
20493 (vqtbx1q_s8): Likewise.
20494 (vqtbx1q_u8): Likewise.
20495 (vqtbx1q_p8): Likewise.
20496 (vtbx2_s8): Likewise.
20497 (vtbx2_u8): Likewise.
20498 (vtbx2_p8): Likewise.
20500 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20502 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
20504 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
20505 instead of inline asm.
20506 (vqtbl1_s8): Likewise.
20507 (vqtbl1_u8): Likewise.
20508 (vqtbl1q_p8): Likewise.
20509 (vqtbl1q_s8): Likewise.
20510 (vqtbl1q_u8): Likewise.
20511 (vtbl1_s8): Likewise.
20512 (vtbl1_u8): Likewise.
20513 (vtbl1_p8): Likewise.
20514 (vtbl2_s8): Likewise.
20515 (vtbl2_u8): Likewise.
20516 (vtbl2_p8): Likewise.
20518 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20520 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
20521 ssri_n buitin generator macro.
20522 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
20523 instead of inline asm.
20524 (vsri_n_p16): Likewise.
20525 (vsri_n_p64): Likewise.
20526 (vsriq_n_p8): Likewise.
20527 (vsriq_n_p16): Likewise.
20528 (vsriq_n_p64): Likewise.
20530 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20532 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
20533 iterator for polynomial ssli_n builtin generator macro.
20534 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
20535 instead of inline asm.
20536 (vsli_n_p16): Likewise.
20537 (vsliq_n_p8): Likewise.
20538 (vsliq_n_p16): Likewise.
20539 * config/aarch64/iterators.md: Define VALLP mode iterator.
20541 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20543 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
20544 iterator to generate [su]adalp RTL builtins.
20545 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
20546 [su]adalp RTL pattern.
20547 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
20548 instead of inline asm.
20549 (vpadal_u32): Likewise.
20551 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20553 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
20554 builtin generator macros.
20555 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
20557 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
20558 instead of inline asm.
20559 (vpaddl_s16): Likewise.
20560 (vpaddl_s32): Likewise.
20561 (vpaddl_u8): Likewise.
20562 (vpaddl_u16): Likewise.
20563 (vpaddl_u32): Likewise.
20564 (vpaddlq_s8): Likewise.
20565 (vpaddlq_s16): Likewise.
20566 (vpaddlq_s32): Likewise.
20567 (vpaddlq_u8): Likewise.
20568 (vpaddlq_u16): Likewise.
20569 (vpaddlq_u32): Liwewise.
20570 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
20571 appropriate attributes.
20573 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20575 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
20576 for aarch64_addp<mode> builtin macro generator.
20577 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
20578 aarch64_addp<mode> RTL pattern.
20579 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
20580 instead of inline asm.
20581 (vpaddq_s16): Likewise.
20582 (vpaddq_s32): Likewise.
20583 (vpaddq_s64): Likewise.
20584 (vpaddq_u8): Likewise.
20585 (vpaddq_u16): Likewise.
20586 (vpaddq_u32): Likewise.
20587 (vpaddq_u64): Likewise.
20589 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20591 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
20592 builtin generator macros.
20593 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
20595 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
20596 instead of inline asm.
20597 (vqdmulh_n_s32): Likewise.
20598 (vqdmulhq_n_s16): Likewise.
20599 (vqdmulhq_n_s32): Likewise.
20600 (vqrdmulh_n_s16): Likewise.
20601 (vqrdmulh_n_s32): Likewise.
20602 (vqrdmulhq_n_s16): Likewise.
20603 (vqrdmulhq_n_s32): Likewise.
20605 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
20607 * doc/install.texi (--enable-offload-defaulted): Document.
20609 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
20611 * config/avr/avr-dimode.md: Turn existing patterns into
20612 define_insn_and_split style patterns where the splitter
20613 adds a clobber of the condition code register. Drop "cc"
20614 attribute. Add new patterns to match output of
20616 * config/avr/avr-fixed.md: Likewise.
20617 * config/avr/avr.c (cc_reg_rtx): New.
20618 (avr_parallel_insn_from_insns): Adjust insn count
20619 for removal of set of cc0.
20620 (avr_is_casesi_sequence): Likewise.
20621 (avr_casei_sequence_check_operands): Likewise.
20622 (avr_optimize_casesi): Likewise. Also insert
20623 new insns after jump_insn.
20624 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
20625 for removal of set of cc0.
20626 (avr_init_expanders): Initialize cc_reg_rtx.
20627 (avr_regno_reg_class): Handle REG_CC.
20628 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
20629 (avr_notice_update_cc): Remove function.
20630 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
20631 (compare_condition): Adjust for PARALLEL with
20633 (out_shift_with_cnt): Likewise.
20634 (ashlhi3_out): Likewise.
20635 (ashrhi3_out): Likewise.
20636 (lshrhi3_out): Likewise.
20637 (avr_class_max_nregs): Return single reg for REG_CC.
20638 (avr_compare_pattern): Check for REG_CC instead
20640 (avr_reorg_remove_redundant_compare): Likewise.
20641 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
20642 (avr_hard_regno_nregs): Return single reg for REG_CC.
20643 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
20644 (avr_md_asm_adjust): Clobber REG_CC.
20645 (TARGET_HARD_REGNO_NREGS): Define.
20646 (TARGET_CLASS_MAX_NREGS): Define.
20647 (TARGET_MD_ASM_ADJUST): Define.
20648 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
20650 (enum reg_class): Add CC_REG class.
20651 (NOTICE_UPDATE_CC): Remove.
20652 (CC_OVERFLOW_UNUSABLE): Remove.
20653 (CC_NO_CARRY): Remove.
20654 * config/avr/avr.md: Turn existing patterns into
20655 define_insn_and_split style patterns where the splitter
20656 adds a clobber of the condition code register. Drop "cc"
20657 attribute. Add new patterns to match output of
20659 (sez): Remove unused pattern.
20661 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
20664 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
20667 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
20670 * config/aarch64/constraints.md (Utq): Require the address to
20671 be valid for both the element mode and for V2DImode.
20673 2021-04-28 Jakub Jelinek <jakub@redhat.com>
20674 Tobias Burnus <tobias@codesourcery.com>
20676 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
20677 * gcc.c (process_command): New variable.
20678 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
20679 set it if -foffload is defaulted.
20680 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
20681 (compile_offload_image): If OFFLOAD_DEFAULTED and
20682 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
20683 if corresponding mkoffload can't be found.
20684 (compile_images_for_offload_targets): Likewise. Free and clear
20685 offload_names if no valid offload is found.
20686 * config.in: Regenerate.
20687 * configure: Regenerate.
20689 2021-04-28 Richard Biener <rguenther@suse.de>
20691 PR tree-optimization/100292
20692 * tree-vect-generic.c (expand_vector_condition): Do not fold
20695 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
20697 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
20698 * config/rs6000/aix64.opt (m64): New.
20701 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
20703 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
20704 (index_term_p): Handle ASHIFT too.
20706 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
20708 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
20709 (sync_lock_test_and_set<mode>): Adjust accordingly.
20710 (sync_lock_release<mode>): Likewise.
20712 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
20714 * config/vax/vax-protos.h (adjacent_operands_p): Remove
20716 * config/vax/vax.c (adjacent_operands_p): Remove.
20718 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
20720 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
20721 through to the non-conditional execution case if getting the
20722 condition for conditional execution has failed.
20724 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
20726 PR middle-end/100284
20727 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
20728 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
20729 than asserting on it.
20731 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
20733 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
20734 with TARGET_AIX_OS.
20736 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
20739 * calls.c (precompute_register_parameters): Additionally test
20740 targetm.precompute_tls_p to pre-compute argument.
20741 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
20742 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
20743 * target.def (precompute_tls_p): New.
20744 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
20745 * doc/tm.texi: Regenerated.
20747 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20750 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
20751 back to HOST_WIDE_INT.
20753 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
20756 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
20757 memory alignment for the outer mode.
20759 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
20761 PR middle-end/90773
20762 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
20763 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
20765 2021-04-27 Alex Coplan <alex.coplan@arm.com>
20768 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
20769 with negative immediates: ensure we expand cbranchsi4_scratch
20770 correctly and ensure we satisfy its constraints.
20771 * config/arm/sync.md
20772 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
20773 attempt to tie two output operands together with constraints;
20774 collapse two alternatives.
20775 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
20776 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
20778 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20781 * config/aarch64/predicates.md (aarch64_sub_immediate,
20782 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
20783 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
20784 * config/aarch64/aarch64.c (aarch64_print_operand,
20785 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
20787 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20789 PR tree-optimization/100239
20790 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
20791 permutations with all indices from the first zero element as vec_shl.
20793 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20795 PR rtl-optimization/100254
20796 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
20797 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
20799 2021-04-27 Richard Biener <rguenther@suse.de>
20801 PR tree-optimization/99912
20802 * passes.def: Add comment about new TODO_remove_unused_locals.
20803 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
20806 2021-04-27 Richard Biener <rguenther@suse.de>
20808 PR tree-optimization/99912
20809 * passes.def (pass_all_optimizations): Add pass_dse before
20810 the first pass_dce, move the first pass_dse before the
20811 pass_dce following pass_pre.
20813 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20815 PR tree-optimization/95527
20816 * generic-match-head.c: Include tm.h.
20817 * gimple-match-head.c: Include tm.h.
20818 * match.pd (CLZ == INTEGER_CST): Don't use
20819 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
20820 if clz == CFN_CLZ. Add missing val declaration.
20821 (CTZ cmp CST): New simplifications.
20823 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20825 PR tree-optimization/96696
20826 * expr.c (expand_expr_divmod): New function.
20827 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
20828 divisions. Formatting fixes.
20829 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
20832 2021-04-27 Martin Jambor <mjambor@suse.cz>
20835 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
20836 If removing a call statement LHS SSA name, release it.
20838 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
20841 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
20842 is valid before including it in the mask.
20844 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
20847 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
20850 2021-04-27 Richard Biener <rguenther@suse.de>
20852 PR tree-optimization/100051
20853 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
20854 disambiguator based on access size vs. decl size.
20856 2021-04-27 Richard Biener <rguenther@suse.de>
20858 PR tree-optimization/100278
20859 * tree-ssa-pre.c (compute_avail): Give up when we cannot
20860 adjust TBAA beacuse of mismatching bases.
20862 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20865 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
20866 For any_rotate define_insn_split and following splitters, use
20867 SWI iterator instead of SWI48.
20869 2021-04-27 Richard Biener <rguenther@suse.de>
20871 PR tree-optimization/99776
20872 * match.pd (bit_field_ref (ctor)): Relax element extract
20873 type compatibility checks.
20875 2021-04-27 Cui,Lili <lili.cui@intel.com>
20877 * common/config/i386/i386-common.c (processor_names):
20878 Sync processor_names with processor_type.
20879 * config/i386/i386-options.c (processor_cost_table):
20880 Sync processor_cost_table with processor_type.
20882 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
20884 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
20885 (irange::set): Call irange_set_1bit_anti_range for handling all
20886 1-bit ranges. Fall through on ~[MIN,MAX].
20888 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
20890 * value-range.cc (irange::legacy_num_pairs): Remove.
20891 (irange::invert): Change gcc_assert to gcc_checking_assert.
20892 * value-range.h (irange::num_pairs): Adjust for a cached
20893 num_pairs(). Also, rename all gcc_assert's to
20894 gcc_checking_assert's.
20896 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
20898 * value-range.cc (irange::operator=): Set m_kind.
20899 (irange::copy_to_legacy): Handle varying and undefined sources
20900 as a legacy copy since they can be easily copied.
20901 (irange::irange_set): Set m_kind.
20902 (irange::irange_set_anti_range): Same.
20903 (irange::set): Rename normalize_min_max to normalize_kind.
20904 (irange::verify_range): Adjust for multi-ranges having the
20906 (irange::irange_union): Set m_kind.
20907 (irange::irange_intersect): Same.
20908 (irange::invert): Same.
20909 * value-range.h (irange::kind): Always return m_kind.
20910 (irange::varying_p): Rename to...
20911 (irange::varying_comptaible_p): ...this.
20912 (irange::undefined_p): Only look at m_kind.
20913 (irange::irange): Always set VR_UNDEFINED if applicable.
20914 (irange::set_undefined): Always set VR_UNDEFINED.
20915 (irange::set_varying): Always set m_kind to VR_VARYING.
20916 (irange::normalize_min_max): Rename to...
20917 (irange::normalize_kind): ...this.
20919 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
20921 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
20922 Adjust for constant_p including varying_p.
20923 * tree-vrp.c (vrp_prop::finalize): Same.
20924 (determine_value_range): Same.
20925 * vr-values.c (vr_values::range_of_expr): Same.
20926 * value-range.cc (irange::symbolic_p): Do not check varying_p.
20927 (irange::constant_p): Same.
20929 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
20931 * value-range.cc (irange::legacy_lower_bound): Replace
20932 !undefined_p check with num_ranges > 0.
20933 (irange::legacy_upper_bound): Same.
20934 * value-range.h (irange::type): Same.
20935 (irange::lower_bound): Same.
20936 (irange::upper_bound): Same.
20938 2021-04-26 Richard Biener <rguenther@suse.de>
20940 PR tree-optimization/99956
20941 * gimple-loop-interchange.cc (compute_access_stride):
20942 Try instantiating the access in a shallower loop nest
20943 if instantiating failed.
20944 (compute_access_strides): Pass adjustable loop_nest
20945 to compute_access_stride.
20947 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
20949 * doc/sourcebuild.texi (arm_cmse_hw): Document.
20951 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
20953 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
20955 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
20956 Nathan Sidwell <nathan@codesourcery.com>
20957 Tom de Vries <vries@codesourcery.com>
20958 Julian Brown <julian@codesourcery.com>
20959 Kwok Cheung Yeung <kcy@codesourcery.com>
20961 * omp-offload.c (oacc_validate_dims): Implement
20962 '-Wopenacc-parallelism'.
20963 * doc/invoke.texi (-Wopenacc-parallelism): Document.
20965 2021-04-26 Richard Biener <rguenther@suse.de>
20967 * tree-cfg.h (gimplify_build1): Remove.
20968 (gimplify_build2): Likewise.
20969 (gimplify_build3): Likewise.
20970 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
20971 (gimplify_build2): Likewise.
20972 (gimplify_build3): Likewise.
20973 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
20975 (gimplify_build2): Likewise.
20976 (gimplify_build3): Likewise.
20977 (tree_vec_extract): Use resimplify with following SSA edges.
20978 (expand_vector_parallel): Avoid passing NULL size/bitpos
20979 to tree_vec_extract.
20980 * expr.c (store_constructor): Deal with zero-element CTORs.
20981 * match.pd (bit_field_ref <vector CTOR>): Make sure to
20982 produce vector constants when possible.
20984 2021-04-26 Richard Biener <rguenther@suse.de>
20986 * tree-complex.c: Include gimple-fold.h.
20987 (expand_complex_addition): Use gimple_build.
20988 (expand_complex_multiplication_components): Likewise.
20989 (expand_complex_multiplication): Likewise.
20990 (expand_complex_div_straight): Likewise.
20991 (expand_complex_div_wide): Likewise.
20992 (expand_complex_division): Likewise.
20993 (expand_complex_conjugate): Likewise.
20994 (expand_complex_comparison): Likewise.
20996 2021-04-26 Richard Biener <rguenther@suse.de>
20998 * tree-ssa-phiopt.c (two_value_replacement): Remove use
20999 of legacy gimplify_buildN API.
21001 2021-04-26 Richard Biener <rguenther@suse.de>
21003 PR tree-optimization/99473
21004 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
21007 2021-04-26 Richard Biener <rguenther@suse.de>
21009 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
21010 Use replace_call_with_value.
21012 2021-04-26 Richard Biener <rguenther@suse.de>
21014 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
21015 (update_gimple_call): Likewise.
21016 (update_call_from_tree): Likewise.
21017 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
21018 (valid_gimple_call_p): Likewise.
21019 (move_ssa_defining_stmt_for_defs): Likewise.
21020 (finish_update_gimple_call): Likewise.
21021 (update_gimple_call): Likewise.
21022 (update_call_from_tree): Likewise.
21023 (propagate_tree_value_into_stmt): Use replace_call_with_value.
21024 * gimple-fold.h (update_gimple_call): Declare.
21025 * gimple-fold.c (valid_gimple_rhs_p): Move here from
21026 tree-ssa-propagate.c.
21027 (update_gimple_call): Likewise.
21028 (valid_gimple_call_p): Likewise.
21029 (finish_update_gimple_call): Likewise, and simplify.
21030 (gimplify_and_update_call_from_tree): Implement
21031 update_call_from_tree functionality, avoid excessive
21032 push/pop_gimplify_context.
21033 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
21034 (gimple_fold_call): Likewise.
21035 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
21036 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
21037 (pass_fold_builtins::execute): Likewise.
21038 (optimize_stack_restore): Use replace_call_with_value.
21039 * tree-cfg.c (fold_loop_internal_call): Likewise.
21040 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
21041 only gimplify_and_update_call_from_tree.
21042 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
21043 (handle_builtin_strchr): Likewise.
21044 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
21046 2021-04-26 Jakub Jelinek <jakub@redhat.com>
21049 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
21050 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
21053 2021-04-25 liuhongt <hongtao.liu@intel.com>
21056 * config/i386/i386-builtin.def (BDESC): Change the icode of
21057 the following builtins to CODE_FOR_nothing.
21058 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
21059 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
21060 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
21061 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
21062 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
21063 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
21064 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
21065 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
21066 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
21067 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
21068 (sse2_eq<mode>3): Ditto.
21069 (sse4_1_eqv2di3): Ditto.
21070 (sse2_gt<mode>3): Rename to ..
21071 (*sse2_gt<mode>3): .. this.
21073 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
21076 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
21079 * config/darwin.c (darwin_binds_local_p): Assume that any
21080 public symbol might be interposed for PIC code. Update function
21081 header comment to reflect current Darwin capability.
21083 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
21086 * config/darwin.c (darwin_binds_local_p): Assume that any
21087 public symbol might be interposed for PIC code. Update function
21088 header comment to reflect current Darwin capability.
21090 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
21092 * doc/sourcebuild.texi: Document no-opts and any-opts target
21095 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
21097 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
21099 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
21101 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
21103 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
21106 * config/i386/i386-options.c (ix86_option_override_internal):
21107 Error out when -m96bit-long-double is used with 64bit targets.
21108 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
21110 2021-04-23 Martin Liska <mliska@suse.cz>
21112 * lto-wrapper.c: Remove FIXME about usage of
21113 hardware_concurrency. The function is not on par with
21116 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
21119 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
21120 Copy operand 3 to operand 4. Use sse_reg_operand
21121 as operand 3 predicate.
21122 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
21123 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
21124 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
21125 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
21126 Copy operand 1 to operand 0.
21127 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
21128 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
21129 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
21131 2021-04-23 Alex Coplan <alex.coplan@arm.com>
21133 PR rtl-optimization/100230
21134 * early-remat.c (early_remat::sort_candidates): Use delete[]
21135 instead of delete for array allocated with new[].
21137 2021-04-23 Richard Biener <rguenther@suse.de>
21139 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
21140 (capture_info::capture_info): Likewise.
21141 (capture_info::walk_match): Likewise.
21142 (expr::gen_transform): Likewise.
21143 (dt_simplify::gen_1): Likewise.
21144 * gimple-match-head.c (maybe_resimplify_conditional_op):
21145 Remove VEC_COND_EXPR special-casing.
21146 (gimple_simplify): Likewise.
21147 * gimple.c (gimple_could_trap_p_1): Adjust.
21148 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
21149 to participate in PRE.
21151 2021-04-23 Richard Biener <rguenther@suse.de>
21153 * cfganal.c (connect_infinite_loops_to_exit): First call
21154 add_noreturn_fake_exit_edges.
21155 * ipa-sra.c (process_scan_results): Do not call the now redundant
21156 add_noreturn_fake_exit_edges.
21157 * predict.c (tree_estimate_probability): Likewise.
21158 (rebuild_frequencies): Likewise.
21159 * store-motion.c (one_store_motion_pass): Likewise.
21161 2021-04-23 Richard Biener <rguenther@suse.de>
21163 PR tree-optimization/100222
21164 * predict.c (pass_profile::execute): Remove redundant call to
21165 mark_irreducible_loops.
21166 (report_predictor_hitrates): Likewise.
21168 2021-04-23 Richard Biener <rguenther@suse.de>
21170 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
21171 valid_gimple_rhs_p by instead gimplifying to one.
21173 2021-04-23 Richard Biener <rguenther@suse.de>
21175 PR tree-optimization/99971
21176 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
21177 Always use TBAA for loads.
21179 2021-04-23 liuhongt <hongtao.liu@intel.com>
21182 * config/i386/i386-options.c (ix86_option_override_internal):
21183 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
21184 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
21185 by target attribute.
21187 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
21189 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
21191 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
21193 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
21195 * config.gcc (powerpc-ibm-aix6.*): Remove.
21196 * config/rs6000/aix61.h: Delete.
21198 2021-04-22 Martin Liska <mliska@suse.cz>
21200 PR testsuite/100159
21201 PR testsuite/100192
21202 * builtins.c (expand_builtin): Fix typos and missing comments.
21203 * dwarf2out.c (gen_subprogram_die): Likewise.
21204 (gen_struct_or_union_type_die): Likewise.
21206 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
21209 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
21210 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
21212 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
21214 * config/i386/darwin.h (TARGET_64BIT): Remove definition
21215 based on TARGET_ISA_64BIT.
21216 (TARGET_64BIT_P): Remove definition based on
21217 TARGET_ISA_64BIT_P().
21219 2021-04-21 Martin Liska <mliska@suse.cz>
21222 2021-04-21 Martin Liska <mliska@suse.cz>
21224 * lto-wrapper.c (cpuset_popcount): Remove.
21225 (init_num_threads): Remove and use hardware_concurrency.
21227 2021-04-21 Martin Liska <mliska@suse.cz>
21230 * main.c (main): Call toplev::finalize in CHECKING_P mode.
21231 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
21232 when incremental LTO linking happens.
21234 2021-04-21 Martin Liska <mliska@suse.cz>
21236 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
21237 makeserver cannot be detected, then use -flto=N fallback.
21239 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
21241 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
21242 default to yes for aarch64-linux-gnu.
21243 * configure: Regenerate.
21245 2021-04-21 Martin Liska <mliska@suse.cz>
21247 * lto-wrapper.c (cpuset_popcount): Remove.
21248 (init_num_threads): Remove and use hardware_concurrency.
21250 2021-04-21 Martin Liska <mliska@suse.cz>
21252 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
21253 which remains to be '(... || 0)' and clang complains about it.
21254 * dwarf2out.c (AT_vms_delta): Declare conditionally.
21255 (add_AT_vms_delta): Likewise.
21256 * tree.c (fld_simplified_type): Use rather more common pattern
21257 for disabling of something (#if 0).
21258 (get_tree_code_name): Likewise.
21259 (verify_type_variant): Likewise.
21261 2021-04-21 Martin Liska <mliska@suse.cz>
21263 * config/i386/i386-expand.c (decide_alignment): Use newly named
21264 macro TARGET_CPU_P.
21265 * config/i386/i386.c (ix86_decompose_address): Likewise.
21266 (ix86_address_cost): Likewise.
21267 (ix86_lea_outperforms): Likewise.
21268 (ix86_avoid_lea_for_addr): Likewise.
21269 (ix86_add_stmt_cost): Likewise.
21270 * config/i386/i386.h (TARGET_*): Remove.
21271 (TARGET_CPU_P): New macro.
21272 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
21273 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
21274 (swap_top_of_ready_list): Likewise.
21275 (ix86_atom_sched_reorder): Likewise.
21276 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
21277 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
21279 2021-04-21 Martin Liska <mliska@suse.cz>
21281 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
21283 (SET_TARGET_NO_SAHF): Likewise.
21284 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
21285 (SET_TARGET_PREFETCH_SSE): Likewise.
21286 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
21287 (SET_TARGET_NO_TUNE): Likewise.
21288 (TARGET_EXPLICIT_NO_80387_P): Likewise.
21289 (SET_TARGET_NO_80387): Likewise.
21291 * config/i386/i386.h (TARGET_*): Remove.
21292 * opth-gen.awk: Generate new used macros.
21294 2021-04-21 Martin Liska <mliska@suse.cz>
21296 * config/i386/i386.h (PTA_*): Remove.
21297 (enum pta_flag): New.
21298 (DEF_PTA): Generate PTA_* values from i386-isa.def.
21299 * config/i386/i386-isa.def: New file.
21301 2021-04-21 Alex Coplan <alex.coplan@arm.com>
21304 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
21305 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
21306 jump table targets.
21308 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
21310 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
21311 x86_64-*-* targets.
21312 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
21314 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
21315 (ix86_handle_option): Handle -mmwait.
21316 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
21317 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
21318 __builtin_ia32_monitor and __builtin_ia32_mwait.
21319 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
21320 (ix86_valid_target_attribute_inner_p): Likewise.
21321 (ix86_option_override_internal): Enable mwait/monitor
21322 instructions for -msse3.
21323 * config/i386/i386.h (TARGET_MWAIT): New.
21324 (TARGET_MWAIT_P): Likewise.
21325 * config/i386/i386.opt: Add -mmwait.
21326 * config/i386/mwaitintrin.h: New file.
21327 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
21328 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
21330 (@sse3_monitor_<mode>): Likewise.
21331 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
21332 * doc/extend.texi: Document mwait target attribute.
21333 * doc/invoke.texi: Document -mmwait.
21335 2021-04-21 Martin Liska <mliska@suse.cz>
21337 * config/i386/i386-options.c (DEF_ENUM): Remove it.
21338 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
21339 * config/i386/stringop.def (DEF_ENUM): Likewise.
21341 2021-04-21 Martin Liska <mliska@suse.cz>
21343 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
21344 of print_generic_expr.
21346 2021-04-21 Jakub Jelinek <jakub@redhat.com>
21348 PR rtl-optimization/100148
21349 * cprop.c (constprop_register): Use next_nondebug_insn instead of
21352 2021-04-21 Martin Liska <mliska@suse.cz>
21355 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
21356 free_dominance_info calls.
21358 2021-04-21 Richard Biener <rguenther@suse.de>
21360 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
21361 parameter (and assume it to be false).
21362 (fold_gimple_assign): Adjust, remove all callers of
21363 maybe_fold_reference calling it with is_lhs true.
21364 (gimple_fold_call): Likewise.
21365 (fold_stmt_1): Likewise.
21367 2021-04-21 Richard Biener <rguenther@suse.de>
21369 * fold-const.c (pedantic_non_lvalue_loc): Remove.
21370 (fold_binary_loc): Adjust.
21371 (fold_ternary_loc): Likewise.
21373 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
21375 PR middle-end/100130
21376 * varasm.c (get_block_for_decl): Make sure that any use of the
21377 retain attribute matches the section's retain flag.
21378 (switch_to_section): Check for retain mismatches even when
21379 changing sections, but do not warn if the given decl is the
21380 section's named.decl.
21381 (output_object_block): Pass the first decl in the block (if any)
21382 to switch_to_section.
21384 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
21386 * config/i386/i386-c.c (ix86_target_macros_internal): Define
21387 __CRC32__ for -mcrc32.
21388 * config/i386/i386-options.c (ix86_option_override_internal):
21389 Enable crc32 instruction for -msse4.2.
21390 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
21392 (sse4_2_crc32di): Likewise.
21393 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
21396 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
21399 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
21402 2021-04-20 Martin Liska <mliska@suse.cz>
21404 * doc/invoke.texi: Fix typo.
21405 * params.opt: Likewise.
21407 2021-04-20 Martin Liska <mliska@suse.cz>
21409 * doc/invoke.texi: Document new param.
21411 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
21413 PR tree-optimization/100081
21414 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
21415 rather than gori_compute_cache.
21416 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
21417 (range_def_chain::m_logical_depth): New member.
21418 (range_def_chain::range_def_chain): Initialize m_logical_depth.
21419 (range_def_chain::get_def_chain): Don't build defchains through more
21420 than LOGICAL_LIMIT logical expressions.
21421 * params.opt (param_ranger_logical_depth): New.
21423 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
21426 * config/arm/arm.c (arm_configure_build_target): Do not strip
21427 extended FPU/SIMD feature bits from the target ISA when -mfpu
21428 is specified (partial revert of r11-8168).
21430 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
21432 * params.opt (-param=openacc-kernels=): Add.
21433 * omp-oacc-kernels-decompose.cc
21434 (pass_omp_oacc_kernels_decompose::gate): Use it.
21435 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
21436 (--param): ... here, 'openacc-kernels'.
21438 2021-04-19 Martin Liska <mliska@suse.cz>
21441 * gengtype.c (finish_root_table): Align function arguments
21442 in between declaration and definition.
21444 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
21446 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
21447 frames larger than the SEH maximum frame size.
21449 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
21451 PR rtl-optimization/99927
21452 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
21453 is dead, just drop it.
21455 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
21458 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
21459 * doc/tm.texi: Regenerate.
21460 * doc/tm.texi.in (D language and ABI): Add @hook for
21461 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
21463 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
21465 * config/darwin-d.c (darwin_d_handle_target_object_format): New
21467 (darwin_d_register_target_info): New function.
21468 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21469 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
21471 (dragonfly_d_register_target_info): New function.
21472 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21473 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
21475 (freebsd_d_register_target_info): New function.
21476 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21477 * config/glibc-d.c (glibc_d_handle_target_object_format): New
21479 (glibc_d_register_target_info): New function.
21480 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21481 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
21483 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
21484 as handler for objectFormat key.
21485 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
21487 (winnt_d_register_target_info): New function.
21488 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21489 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
21491 (netbsd_d_register_target_info): New function.
21492 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21493 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
21495 (openbsd_d_register_target_info): New function.
21496 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21497 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
21498 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
21499 handler for objectFormat key.
21500 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
21502 (rs6000_d_register_target_info): Add
21503 rs6000_d_handle_target_object_format as handler for objectFormat key.
21504 * config/sol2-d.c (solaris_d_handle_target_object_format): New
21506 (solaris_d_register_target_info): New function.
21507 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21509 2021-04-16 Jakub Jelinek <jakub@redhat.com>
21512 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
21513 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
21515 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
21516 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
21517 the old and new alignment after applying MIN/MAX to it is different.
21519 2021-04-16 Tamar Christina <tamar.christina@arm.com>
21522 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
21523 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
21525 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
21527 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
21529 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
21530 this section and its subsections.
21532 2021-04-16 Jakub Jelinek <jakub@redhat.com>
21535 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
21536 define_insn patterns.
21538 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
21540 PR rtl-optimization/98689
21541 * reg-notes.def (UNTYPED_CALL): New note.
21542 * combine.c (distribute_notes): Handle it.
21543 * emit-rtl.c (try_split): Likewise.
21544 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
21545 that calls with the note implicitly set all return value registers.
21546 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
21549 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
21551 PR rtl-optimization/99596
21552 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
21553 register accesses for const calls. Assume that pure functions
21554 can only read from global registers. Ignore cases in which
21555 the stack pointer has been marked global.
21557 2021-04-16 Jakub Jelinek <jakub@redhat.com>
21560 * tree-vect-loop.c (vect_transform_loop): Don't remove just
21561 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
21562 them by their last argument.
21564 2021-04-15 Martin Liska <mliska@suse.cz>
21566 * doc/invoke.texi: Other params don't use it, remove it.
21568 2021-04-15 Richard Biener <rguenther@suse.de>
21570 * gimple-builder.h: Add deprecation note.
21572 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
21575 * attribs.h (restrict_type_identity_attributes_to): Declare.
21576 * attribs.c (restrict_type_identity_attributes_to): New function.
21578 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
21581 * attribs.h (affects_type_identity_attributes): Declare.
21582 * attribs.c (remove_attributes_matching): New function.
21583 (affects_type_identity_attributes): Likewise.
21585 2021-04-15 Jakub Jelinek <jakub@redhat.com>
21588 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
21589 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
21590 ZERO_EXTEND, SIGN_EXTEND or AND.
21592 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
21594 PR rtl-optimization/99929
21595 * rtl.h (same_vector_encodings_p): New function.
21596 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
21597 * cselib.c (rtx_equal_for_cselib_1): Likewise.
21598 * jump.c (rtx_renumbered_equal_p): Likewise.
21599 * lra-constraints.c (operands_match_p): Likewise.
21600 * reload.c (operands_match_p): Likewise.
21601 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
21603 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
21605 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
21606 more information about variable-length CONST_VECTORs.
21608 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
21610 PR rtl-optimization/100066
21611 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
21612 ordered modes when choosing splitting mode for hard reg.
21614 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
21617 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
21619 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
21621 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
21623 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
21624 for mask operand types.
21625 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
21626 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
21628 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
21630 * config/s390/s390.c (s390_const_operand_ok): Check the new
21631 operand types and generate a list of valid values.
21633 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
21635 * doc/tm.texi: Regenerate.
21636 * doc/tm.texi.in (D language and ABI): Add @hook for
21637 TARGET_D_REGISTER_OS_TARGET_INFO.
21639 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
21641 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
21643 (aarch64_d_register_target_info): New function.
21644 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
21646 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
21648 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
21649 (arm_d_register_target_info): New function.
21650 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
21651 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21652 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
21653 (ix86_d_register_target_info): New function.
21654 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
21655 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21656 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
21657 (mips_d_register_target_info): New function.
21658 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
21659 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21660 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
21661 (pa_d_register_target_info): New function.
21662 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
21663 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21664 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
21666 (riscv_d_register_target_info): New function.
21667 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
21668 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21669 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
21671 (rs6000_d_register_target_info): New function.
21672 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
21674 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21675 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
21676 (s390_d_register_target_info): New function.
21677 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
21678 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21679 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
21681 (sparc_d_register_target_info): New function.
21682 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
21683 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21684 * doc/tm.texi: Regenerate.
21685 * doc/tm.texi.in (D language and ABI): Add @hook for
21686 TARGET_D_REGISTER_CPU_TARGET_INFO.
21688 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
21690 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
21691 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
21692 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
21693 * doc/tm.texi: Regenerate.
21694 * doc/tm.texi.in (D language and ABI): Add @hook for
21695 TARGET_D_HAS_STDCALL_CONVENTION.
21697 2021-04-14 Richard Biener <rguenther@suse.de>
21699 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
21700 VEC_COND_EXPRs have a gimple_val condition.
21701 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
21702 can no longer have a GENERIC condition.
21704 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
21707 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
21708 from the isa_delta when -mfpu has been used.
21709 (arm_options_perform_arch_sanity_checks): It's the architecture that
21710 lacks an FPU not the processor.
21712 2021-04-13 Richard Biener <rguenther@suse.de>
21714 PR tree-optimization/100053
21715 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
21716 not use optimistic dominance queries for backedges to validate
21718 (dominated_by_p_w_unex): Add parameter to ignore executable
21719 state on backedges.
21720 (rpo_elim::eliminate_avail): Adjust.
21722 2021-04-13 Jakub Jelinek <jakub@redhat.com>
21725 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
21726 *aarch64_bfxilsi_extrdi): New define_insn patterns.
21728 2021-04-13 Jakub Jelinek <jakub@redhat.com>
21731 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
21732 outermode, return NULL if the result doesn't encode back to the
21733 original byte sequence.
21734 (simplify_gen_subreg): Don't create SUBREGs from constants to
21735 MODE_COMPOSITE_P outermode.
21737 2021-04-12 Jakub Jelinek <jakub@redhat.com>
21739 PR rtl-optimization/99905
21740 * combine.c (expand_compound_operation): If pos + len > modewidth,
21741 perform the right shift by pos in inner_mode and then convert to mode,
21742 instead of trying to simplify a shift of rtx with inner_mode by pos
21743 as if it was a shift in mode.
21745 2021-04-12 Jakub Jelinek <jakub@redhat.com>
21748 * combine.c (simplify_and_const_int_1): Don't optimize varop
21749 away if it has side-effects.
21751 2021-04-12 Martin Liska <mliska@suse.cz>
21753 * doc/extend.texi: Escape @smallexample content.
21755 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
21757 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
21758 alternative in order to load a DFP zero.
21760 2021-04-12 Martin Liska <mliska@suse.cz>
21762 * doc/extend.texi: Be more precise in documentation
21763 of symver attribute.
21765 2021-04-12 Martin Liska <mliska@suse.cz>
21768 * gimplify.c (gimplify_expr): Right now, we unpoison all
21769 variables before a goto <dest>. We should not do it if we are
21772 2021-04-12 Cui,Lili <lili.cui@intel.com>
21774 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
21776 * common/config/i386/i386-common.c (processor_names): Add
21778 (processor_alias_table): Add rocketlake.
21779 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
21780 INTEL_COREI7_ROCKETLAKE.
21781 * config.gcc: Add -march=rocketlake.
21782 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
21784 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
21785 (processor_cost_table): Add rocketlake cost.
21786 * config/i386/i386.h (ix86_size_cost) : Define
21788 (processor_type) : Add PROCESSOR_ROCKETLAKE.
21789 (PTA_ROCKETLAKE): Ditto.
21790 * doc/extend.texi: Add rocketlake.
21791 * doc/invoke.texi: Add rocketlake.
21793 2021-04-12 Cui,Lili <lili.cui@intel.com>
21795 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
21796 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
21797 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
21798 * doc/invoke.texi: Change alderlake ISA list.
21800 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
21802 PR middle-end/98088
21803 * omp-expand.c (expand_oacc_collapse_init): Update condition in
21806 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
21809 * config/i386/serializeintrin.h (_serialize): Defined as macro.
21811 2021-04-10 Jakub Jelinek <jakub@redhat.com>
21814 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
21815 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
21817 2021-04-10 Jakub Jelinek <jakub@redhat.com>
21819 PR middle-end/99989
21820 * gimple-ssa-warn-alloca.c
21821 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
21822 0 with integer precision unconditionally.
21824 2021-04-10 Jakub Jelinek <jakub@redhat.com>
21826 PR rtl-optimization/98601
21827 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
21828 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
21829 unaligned_mems handle VOIDmode like BLKmode.
21831 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
21834 * tree.c (free_lang_data_in_decl): Do not release body of
21835 declare_variant_alt.
21837 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
21839 * config/aarch64/aarch64.c (aarch64_option_restore): If the
21840 architecture was specified explicitly and the tuning wasn't,
21841 tune for the architecture rather than the configured default CPU.
21843 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
21845 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
21846 as the temporary register.
21848 2021-04-09 Martin Liska <mliska@suse.cz>
21850 * doc/extend.texi: Move non-target attributes on the top level.
21852 2021-04-09 Martin Liska <mliska@suse.cz>
21854 * doc/invoke.texi: Document minimum and maximum value of the
21855 argument for both supported compression algorithms.
21857 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
21859 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
21860 TLS BSS before TLS data.
21861 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
21863 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
21865 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
21867 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
21869 * match.pd: Extend vec_cond folds to handle shifts.
21871 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
21873 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
21876 2021-04-08 Alex Coplan <alex.coplan@arm.com>
21879 * config/arm/iterators.md (MVE_vecs): New.
21880 (V_elem): Also handle V2DF.
21881 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
21882 (*mve_vdup<mode>): ... this. Remove second alternative since
21883 vec_duplicate of const_int is not canonical RTL, and we don't
21884 want to match symbol_refs.
21885 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
21887 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
21889 * fold-const.c (fold_single_bit_test): Fix typo.
21890 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
21893 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
21895 PR tree-optimization/97513
21896 * tree-vect-slp.c (vect_add_slp_permutation): New function,
21898 (vectorizable_slp_permutation): ...here. Detect cases in which
21899 all VEC_PERM_EXPRs are guaranteed to have the same stepped
21900 permute vector and only generate one permute vector for that case.
21901 Extend that case to handle variable-length vectors.
21903 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
21905 PR tree-optimization/99873
21906 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
21907 (vect_build_slp_instance): Don't split store groups that could
21908 use IFN_STORE_LANES.
21910 2021-04-07 Jakub Jelinek <jakub@redhat.com>
21913 * varasm.c (output_constant_pool_contents): Don't strip name encoding
21914 from XSTR (desc->sym, 0) or from label before passing those to
21917 2021-04-07 Richard Biener <rguenther@suse.de>
21919 PR tree-optimization/99954
21920 * tree-loop-distribution.c: Include tree-affine.h.
21921 (generate_memcpy_builtin): Try using tree-affine to prove
21923 (loop_distribution::classify_builtin_ldst): Always classify
21926 2021-04-07 Richard Biener <rguenther@suse.de>
21928 PR tree-optimization/99947
21929 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
21930 steps vector to avoid pushing elements from the reallocated
21933 2021-04-07 Richard Biener <rguenther@suse.de>
21935 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
21936 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
21938 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
21940 (debug_vn_reference_ops): New.
21942 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
21944 PR tree-optimization/98736
21945 * tree-loop-distribution.c
21946 * (loop_distribution::bb_top_order_init):
21947 Compute RPO with programing order preserved by calling function
21948 rev_post_order_and_mark_dfs_back_seme.
21950 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
21953 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
21954 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
21956 (process_bb_lives): Don't update biggest mode of hard reg for
21957 implicit in multi-register group. Use the new functions for
21958 updating dead_set and unused_set by register notes.
21960 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
21962 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
21965 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
21967 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
21968 (skylake_memset): Likewise.
21969 (skylake_cost): Change CLEAR_RATIO to 17.
21970 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
21971 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
21972 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
21974 2021-04-06 Richard Biener <rguenther@suse.de>
21976 PR tree-optimization/99880
21977 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
21978 set vectorized defs of relevant PHIs.
21980 2021-04-06 Richard Biener <rguenther@suse.de>
21982 PR tree-optimization/99924
21983 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
21984 nodes w/o scalar stmts as visited.
21986 2021-04-06 Alex Coplan <alex.coplan@arm.com>
21989 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
21990 PCS for [su]fix_optab.
21992 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
21994 * config/darwin.c (machopic_legitimize_pic_address): Check
21995 that the current pic register is one of the hard reg set
21996 before setting liveness.
21998 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
22000 * config/darwin.c (machopic_legitimize_pic_address): Fix
22001 whitespace, remove unused code.
22003 2021-04-03 Jakub Jelinek <jakub@redhat.com>
22005 PR tree-optimization/99882
22006 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
22009 2021-04-03 Jakub Jelinek <jakub@redhat.com>
22011 PR rtl-optimization/99863
22012 * dse.c (replace_read): Drop regs_live argument. Instead of
22013 regs_live, use store_insn->fixed_regs_live if non-NULL,
22014 otherwise punt if insns sequence clobbers or sets any hard
22017 2021-04-03 Jakub Jelinek <jakub@redhat.com>
22020 * targhooks.h (default_print_patchable_function_entry_1): Declare.
22021 * targhooks.c (default_print_patchable_function_entry_1): New function,
22022 copied from default_print_patchable_function_entry with an added flags
22024 (default_print_patchable_function_entry): Rewritten into a small
22025 wrapper around default_print_patchable_function_entry_1.
22026 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
22028 (rs6000_print_patchable_function_entry): New function.
22030 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
22032 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
22034 2021-04-01 Jason Merrill <jason@redhat.com>
22037 * common.opt: Document v15 and v16.
22039 2021-04-01 Richard Biener <rguenther@suse.de>
22041 PR tree-optimization/99863
22042 * gimplify.c (gimplify_init_constructor): Recompute vector
22045 2021-04-01 Jakub Jelinek <jakub@redhat.com>
22047 * doc/extend.texi (symver attribute): Fix up syntax errors
22050 2021-04-01 Jakub Jelinek <jakub@redhat.com>
22052 PR tree-optimization/96573
22053 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
22054 also pointer types.
22056 2021-04-01 Richard Biener <rguenther@suse.de>
22058 PR tree-optimization/99856
22059 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
22060 precision to vector element precision.
22062 2021-04-01 Martin Jambor <mjambor@suse.cz>
22064 PR tree-optimization/97009
22065 * tree-sra.c (access_or_its_child_written): New function.
22066 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
22069 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
22072 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
22074 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
22077 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
22078 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
22079 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
22080 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
22081 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
22082 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
22084 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
22085 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
22086 (define_attr "prefixed"): Update initializer.
22088 2021-03-31 Jakub Jelinek <jakub@redhat.com>
22091 * dwarf2out.c (debug_ranges_dwo_section): New variable.
22092 (DW_RANGES_IDX_SKELETON): Define.
22093 (struct dw_ranges): Add begin_entry and end_entry members.
22094 (DEBUG_DWO_RNGLISTS_SECTION): Define.
22095 (add_ranges_num): Adjust r initializer for addition of *_entry
22097 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
22098 set idx to DW_RANGES_IDX_SKELETON.
22099 (use_distinct_base_address_for_range): New function.
22100 (index_rnglists): Don't set r->idx if it is equal to
22101 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
22102 r->end_entry for -gsplit-dwarf if those will be needed by
22104 (output_rnglists): Add DWO argument. If true, switch to
22105 debug_ranges_dwo_section rather than debug_ranges_section.
22106 Adjust l1/l2 label indexes. Only output the offset table when
22107 dwo is true and don't include in there the skeleton range
22108 entry if present. For -gsplit-dwarf, skip ranges that belong
22109 to the other rnglists section. Change return type from void
22110 to bool and return true if there are any range entries for
22111 the other section. For dwarf_split_debug_info use
22112 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
22113 entries instead of DW_RLE_start_end, DW_RLE_start_length and
22114 DW_RLE_base_address. Use use_distinct_base_address_for_range.
22115 (init_sections_and_labels): Initialize debug_ranges_dwo_section
22116 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
22117 and range_base_label indexes.
22118 (dwarf2out_finish): Call index_rnglists earlier before finalizing
22119 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
22120 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
22121 with different dwo arguments.
22122 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
22124 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
22126 PR tree-optimization/98268
22127 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
22128 recompute_tree_invariant_for_addr_expr after successfully
22129 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
22131 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
22133 PR tree-optimization/99726
22134 * tree-data-ref.c (create_intersect_range_checks_index): Bail
22135 out if there is more than one access function SCEV for the loop
22138 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
22140 PR rtl-optimization/97141
22141 PR rtl-optimization/98726
22142 * emit-rtl.c (valid_for_const_vector_p): Return true for
22144 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
22145 poly_wide_int instead of a wide_int.
22146 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
22148 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
22149 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
22150 false for CONST_VECTORs that cannot be forced to memory.
22151 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
22152 is too complex to force to memory, build it up from individual
22155 2021-03-31 Jan Hubicka <jh@suse.cz>
22158 * cgraph.c (cgraph_node::release_body): Fix overactive check.
22160 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
22163 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
22166 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
22168 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
22169 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
22171 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
22172 "rep movsb/stosb" only for known sizes.
22173 * config/i386/i386-options.c (processor_cost_table): Use Ice
22174 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
22175 Rapids and Alder Lake.
22176 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
22177 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
22178 (icelake_memset): Likewise.
22179 (icelake_cost): Likewise.
22180 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
22183 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
22186 * config/aarch64/aarch64.c
22187 (aarch64_vectorize_preferred_vector_alignment): Query the size
22188 of the provided SVE vector; do not assume that all SVE vectors
22189 have the same size.
22191 2021-03-31 Jan Hubicka <jh@suse.cz>
22194 * cgraph.c (cgraph_node::release_body): Remove all callers and
22196 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
22197 * cgraphunit.c (cgraph_node::expand): And here.
22199 2021-03-31 Martin Liska <mliska@suse.cz>
22201 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
22202 and one negated condition.
22204 2021-03-31 Jakub Jelinek <jakub@redhat.com>
22205 Richard Sandiford <richard.sandiford@arm.com>
22208 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
22209 constraints on operands[2] and similarly 0 and rk constraints
22210 on operands[1] corresponding to that.
22212 2021-03-31 Jakub Jelinek <jakub@redhat.com>
22215 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
22216 linker doesn't support DWARF sections new in DWARF5.
22217 * config/i386/i386-options.c (ix86_option_override_internal): Default
22218 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
22220 * config.in: Regenerated.
22221 * configure: Regenerated.
22223 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
22226 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
22227 available issue_info before using it.
22229 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
22232 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
22235 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
22238 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
22239 (altivec_lvsl_reg_<mode>): ... this.
22240 (altivec_lvsr_reg): Change to ...
22241 (altivec_lvsr_reg_<mode>): ... this.
22242 * config/rs6000/predicates.md (vec_set_index_operand): New.
22243 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
22244 Enable 32bit variable vec_insert for all TARGET_VSX.
22245 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
22246 Enable 32bit variable vec_insert for p9 and above.
22247 (rs6000_expand_vector_set_var_p8): Rename to ...
22248 (rs6000_expand_vector_set_var_p7): ... this.
22249 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
22251 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
22252 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
22253 gen_altivec_lvsr_reg_di.
22255 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
22258 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
22259 (__rdtscp): Likewise.
22261 2021-03-30 Tamar Christina <tamar.christina@arm.com>
22263 PR tree-optimization/99825
22264 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
22265 Reject non-mult 2 lanes.
22267 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
22270 * config/arm/arm.c (arm_file_start): Fix emission of
22271 Tag_ABI_VFP_args attribute.
22273 2021-03-30 Richard Biener <rguenther@suse.de>
22275 PR tree-optimization/99824
22276 * stor-layout.c (set_min_and_max_values_for_integral_type):
22277 Assert the precision is within the bounds of
22278 WIDE_INT_MAX_PRECISION.
22279 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
22280 the outermost component ref only to lower the access size
22281 and initialize that from the access type.
22283 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
22286 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
22287 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
22289 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
22291 * config/aarch64/aarch64.md
22292 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
22293 attribute to disambiguate between SIMD and FP variants of the
22296 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
22298 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
22299 (analyze_ssa_name_flags): Fix typo in comment.
22301 2021-03-29 Alex Coplan <alex.coplan@arm.com>
22304 * config/aarch64/aarch64-sve-builtins.cc
22305 (function_builder::add_function): Add placeholder_p argument, use
22306 placeholder decls if this is set.
22307 (function_builder::add_unique_function): Instead of conditionally adding
22308 direct overloads, unconditionally add either a direct overload or a
22310 (function_builder::add_overloaded_function): Set placeholder_p if we're
22311 using C++ overloads. Use the obstack for string storage instead
22312 of relying on the tree nodes.
22313 (function_builder::add_overloaded_functions): Don't return early for
22314 m_direct_overloads: we need to add placeholders.
22315 * config/aarch64/aarch64-sve-builtins.h
22316 (function_builder::add_function): Add placeholder_p argument.
22318 2021-03-29 Richard Biener <rguenther@suse.de>
22320 PR tree-optimization/99807
22321 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
22322 assert below VEC_PERM handling.
22324 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
22327 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
22328 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
22329 matching const_int 0.
22330 (move_lo_quad_internal_be_<mode>): Likewise.
22331 (move_lo_quad_<mode>): Update for the above.
22332 * config/aarch64/iterators.md (VQ_2E): Delete.
22334 2021-03-29 Jakub Jelinek <jakub@redhat.com>
22336 PR tree-optimization/99777
22337 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
22338 types other than scalar integral types.
22340 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
22342 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
22343 XCOFF TLS reloc decorations.
22345 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
22347 * doc/analyzer.texi (Analyzer Internals): Update link to
22348 "A Memory Model for Static Analysis of C Programs".
22350 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
22352 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
22353 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
22355 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
22356 (rs6000_special_round_type_align): Recursively check innermost first
22359 2021-03-26 Jakub Jelinek <jakub@redhat.com>
22362 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
22363 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
22364 assignment with drap_reg active, queue reg save for hfp with offset 0
22365 and flush queued reg saves. When handling a push with rule18,
22366 defer queueing reg save for hfp and just assert the offset is 0.
22367 (scan_trace): Assert that fde->rule18 is false.
22369 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
22372 * ira-costs.c (record_reg_classes): Put case with
22373 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
22374 * ira.c (ira_setup_alts): Ditto.
22375 * lra-constraints.c (process_alt_operands): Ditto.
22376 * recog.c (asm_operand_ok): Ditto.
22377 * reload.c (find_reloads): Ditto.
22379 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22381 * config/aarch64/aarch64-protos.h
22382 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
22383 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
22384 * config/aarch64/aarch64.c (generic_addrcost_table): Update
22385 accordingly, using the same costs as for post_modify.
22386 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
22387 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
22388 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
22389 (a64fx_addrcost_table): Likewise.
22390 (neoversev1_addrcost_table): New.
22391 (neoversev1_tunings): Use neoversev1_addrcost_table.
22392 (aarch64_address_cost): Use the new post_modify costs for CImode
22395 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22397 * config/aarch64/aarch64.opt
22398 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
22399 * doc/invoke.texi: Document it.
22400 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
22401 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
22402 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
22403 (aarch64_vec_issue_info): New structures.
22404 (cpu_vector_cost): Write comments above the variables rather
22406 (cpu_vector_cost::issue_info): New member variable.
22407 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
22408 and tree-ssa-loop-niter.h.
22409 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
22410 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
22411 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
22412 (thunderx3t110_vector_cost): Initialize issue_info to null.
22413 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
22414 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
22415 (neoversev1_vector_cost): Use them.
22416 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
22417 (aarch64_vector_costs::saw_sve_only_op): New member variable.
22418 (aarch64_vector_costs::num_vector_iterations): Likewise.
22419 (aarch64_vector_costs::scalar_ops): Likewise.
22420 (aarch64_vector_costs::advsimd_ops): Likewise.
22421 (aarch64_vector_costs::sve_ops): Likewise.
22422 (aarch64_vector_costs::seen_loads): Likewise.
22423 (aarch64_simd_vec_costs_for_flags): New function.
22424 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
22425 Count the number of predicate operations required by SVE WHILE
22427 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
22428 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
22429 (aarch64_count_ops): Likewise.
22430 (aarch64_add_stmt_cost): Record whether see an SVE operation
22431 that cannot currently be implementing using Advanced SIMD.
22432 Record issue information about the scalar, Advanced SIMD
22433 and (where relevant) SVE versions of a loop.
22434 (aarch64_vec_op_count::dump): New function.
22435 (aarch64_sve_op_count::dump): Likewise.
22436 (aarch64_estimate_min_cycles_per_iter): Likewise.
22437 (aarch64_adjust_body_cost): If issue information is available,
22438 try to compare the issue rates of the various loop implementations
22439 and increase or decrease the vector body cost accordingly.
22441 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22443 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
22444 Assume a zero cost for induction phis.
22446 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22448 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
22450 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
22451 vector comparisons.
22453 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22455 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
22457 (aarch64_add_stmt_cost): Call it.
22459 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22461 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
22462 New tuning parameter.
22463 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
22464 (aarch64_estimated_sve_vq): New function.
22465 (aarch64_vector_costs::analyzed_vinfo): New member variable.
22466 (aarch64_vector_costs::is_loop): Likewise.
22467 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
22468 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
22469 (aarch64_record_potential_advsimd_unrolling): New function.
22470 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
22471 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
22472 aarch64_analyze_bb_vinfo on the first use of a costs structure.
22473 Detect whether we're vectorizing a loop for SVE that might be
22474 completely unrolled if it used Advanced SIMD instead.
22475 (aarch64_adjust_body_cost_for_latency): New function.
22476 (aarch64_finish_cost): Call it.
22478 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22480 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
22481 (aarch64_init_cost): New function.
22482 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
22483 the default unsigned[3].
22484 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
22485 (TARGET_VECTORIZE_INIT_COST): Override.
22486 (TARGET_VECTORIZE_FINISH_COST): Likewise.
22487 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
22489 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22491 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
22492 (neoversev1_sve_vector_cost): New cost structures.
22493 (neoversev1_vector_cost): Likewise.
22494 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
22496 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22498 * config/aarch64/aarch64-protos.h
22499 (sve_vec_cost::scatter_store_elt_cost): New member variable.
22500 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
22501 accordingly, taking the cost from the cost of a scalar_store.
22502 (a64fx_sve_vector_cost): Likewise.
22503 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
22505 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22507 * config/aarch64/aarch64-protos.h
22508 (simd_vec_cost::store_elt_extra_cost): New member variable.
22509 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
22510 accordingly, using the vec_to_scalar cost for the new field.
22511 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
22512 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
22513 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
22514 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
22515 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
22516 (thunderx3t110_advsimd_vector_cost): Likewise.
22517 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
22519 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22521 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
22522 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
22523 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
22524 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
22525 accordingly, using zero for the new costs.
22526 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
22527 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
22528 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
22529 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
22530 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
22531 (thunderx3t110_advsimd_vector_cost): Likewise.
22532 (aarch64_ld234_st234_vectors): New function.
22533 (aarch64_adjust_stmt_cost): Likewise.
22534 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
22535 the new vector costs.
22537 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22539 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
22540 derived class of simd_vec_cost. Add information about CLAST[AB]
22541 and FADDA instructions.
22542 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
22543 accordingly, using the vec_to_scalar costs for the new fields.
22544 (a64fx_sve_vector_cost): Likewise.
22545 (aarch64_reduc_type): New function.
22546 (aarch64_sve_in_loop_reduction_latency): Likewise.
22547 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
22548 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
22549 that occur in the loop body.
22550 (aarch64_add_stmt_cost): Update call accordingly.
22552 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22554 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
22556 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
22557 above the fields rather than to the right.
22558 (simd_vec_cost::reduc_i8_cost): New member variable.
22559 (simd_vec_cost::reduc_i16_cost): Likewise.
22560 (simd_vec_cost::reduc_i32_cost): Likewise.
22561 (simd_vec_cost::reduc_i64_cost): Likewise.
22562 (simd_vec_cost::reduc_f16_cost): Likewise.
22563 (simd_vec_cost::reduc_f32_cost): Likewise.
22564 (simd_vec_cost::reduc_f64_cost): Likewise.
22565 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
22566 accordingly, using the vec_to_scalar_cost for the new fields.
22567 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
22568 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
22569 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
22570 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
22571 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
22572 (thunderx3t110_advsimd_vector_cost): Likewise.
22573 (aarch64_use_new_vector_costs_p): New function.
22574 (aarch64_simd_vec_costs): New function, split out from...
22575 (aarch64_builtin_vectorization_cost): ...here.
22576 (aarch64_is_reduction): New function.
22577 (aarch64_detect_vector_stmt_subtype): Likewise.
22578 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
22579 using the new vector costs.
22581 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22584 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
22585 TLS declarations as public.
22587 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22589 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
22590 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
22591 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
22592 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
22593 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
22594 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
22595 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
22596 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
22597 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
22599 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22602 * config.gcc (*-*-cygwin*): Add winnt-d.o
22603 (*-*-mingw*): Likewise.
22604 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
22605 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
22606 * config/i386/t-cygming: Add winnt-d.o.
22607 * config/i386/winnt-d.c: New file.
22609 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22611 * config/freebsd-d.c: Include memmodel.h.
22613 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22616 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
22617 * config/t-openbsd: Add openbsd-d.o.
22618 * config/openbsd-d.c: New file.
22620 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
22622 PR tree-optimization/96974
22623 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
22624 with graceful exit.
22626 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
22629 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
22633 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
22634 always_inline in system headers.
22636 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
22638 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
22640 2021-03-25 Jakub Jelinek <jakub@redhat.com>
22643 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
22644 * fold-const.c (operand_compare::operand_equal_p): Don't compare
22645 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
22647 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
22651 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
22652 always_inline in system headers.
22654 2021-03-25 Richard Biener <rguenther@suse.de>
22656 PR tree-optimization/99746
22657 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
22658 the scalar stmt as patterned. Instead set up required things
22661 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
22663 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
22666 2021-03-24 Martin Liska <mliska@suse.cz>
22669 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
22671 * config/i386/i386-options.c (ix86_option_override_internal):
22672 Add run-time assert.
22674 2021-03-24 Martin Jambor <mjambor@suse.cz>
22677 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
22678 parameters with unknown type.
22679 (ipacp_value_safe_for_type): New function.
22680 (propagate_vals_across_arith_jfunc): Verify that the constant type
22681 can be used for a type of the formal parameter.
22682 (propagate_vals_across_ancestor): Likewise.
22683 (propagate_scalar_across_jump_function): Likewise. Pass the type
22684 also to propagate_vals_across_ancestor.
22686 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
22689 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
22691 (movmisalign<mode>_mve_load): Likewise.
22693 2021-03-24 Jakub Jelinek <jakub@redhat.com>
22696 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
22697 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
22699 2021-03-24 Alexandre Oliva <oliva@adacore.com>
22701 * doc/sourcebuild.texi (sysconf): New effective target.
22703 2021-03-24 Alexandre Oliva <oliva@adacore.com>
22705 * config/i386/predicates.md (reg_or_const_vec_operand): New.
22706 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
22707 the now *-prefixed insn_and_split, turn the splitter const vec
22708 into an input for the insn, making it an ignored immediate for
22709 non-split cases, and loaded into the scratch register
22712 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
22715 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
22716 Use define_relaxed_memory_constraint for them.
22718 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
22721 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
22722 colon to the diagnostic message.
22724 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
22726 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
22728 (try_fwprop_subst_note): Use set_info instead of insn_info.
22729 (try_fwprop_subst_pattern): Likewise.
22730 (try_fwprop_subst_notes): Likewise.
22731 (try_fwprop_subst): Likewise.
22732 (forward_propagate_subreg): Likewise.
22733 (forward_propagate_and_simplify): Likewise.
22734 (forward_propagate_into): Likewise.
22735 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
22737 (set_info::single_nondebug_insn_use): Likewise.
22738 (set_info::single_phi_use): Likewise.
22739 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
22741 (set_info::single_nondebug_insn_use): Likewise.
22742 (set_info::single_phi_use): Likewise.
22744 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
22746 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
22748 2021-03-23 Jakub Jelinek <jakub@redhat.com>
22751 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
22752 expand_mult to perform an unsigned rather than a signed
22755 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
22758 * config/i386/cpuid.h (__cpuid): Add __volatile__.
22759 (__cpuid_count): Likewise.
22761 2021-03-23 Richard Biener <rguenther@suse.de>
22763 PR tree-optimization/99721
22764 * tree-vect-slp.c (vect_slp_analyze_node_operations):
22765 Make sure we can schedule the node.
22767 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22769 * config/riscv/riscv.c (riscv_subword): Take endianness into
22770 account when calculating the byte offset.
22772 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22774 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
22775 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
22776 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
22777 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
22778 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
22779 new predicate "subreg_lowpart_operator"
22781 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22783 * config/riscv/riscv.c (riscv_swap_instruction): New function
22784 to byteswap an SImode rtx containing an instruction.
22785 (riscv_trampoline_init): Byteswap the generated instructions
22788 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22790 * common/config/riscv/riscv-common.c
22791 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
22792 * config.gcc (riscv32be-*, riscv64be-*): Set
22793 TARGET_BIG_ENDIAN_DEFAULT to 1.
22794 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
22795 depending on default endianness.
22796 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
22797 * config/riscv/linux.h (LINK_SPEC): Likewise.
22798 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
22799 default endianness.
22800 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
22802 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22804 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
22805 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
22806 * config/riscv/linux.h (LINK_SPEC): Likewise.
22807 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
22809 (BYTES_BIG_ENDIAN): Handle big endian.
22810 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
22811 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
22813 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
22815 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
22817 * regcprop.c (find_oldest_value_reg): Ask target whether
22818 different mode is fine for replacement register.
22820 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
22822 PR tree-optimization/99296
22823 * value-range.cc (irange::irange_set_1bit_anti_range): New.
22824 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
22825 * value-range.h (irange::irange_set_1bit_anti_range): New.
22827 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
22830 * config/aarch64/constraints.md (UtQ): Use
22831 define_relaxed_memory_constraint for it.
22832 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
22833 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
22834 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
22835 (have_relaxed_memory_constraints): New static var.
22836 (relaxed_memory_start, relaxed_memory_end): Ditto.
22837 (add_constraint): Add arg is_relaxed_memory. Check name for
22838 relaxed memory. Set up is_relaxed_memory in constraint_data and
22839 have_relaxed_memory_constraints. Adjust calls.
22840 (choose_enum_order): Process relaxed memory.
22841 (write_tm_preds_h): Ditto.
22842 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
22843 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
22844 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
22845 * ira-lives.c (single_reg_class): Use
22846 insn_extra_relaxed_memory_constraint.
22847 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
22848 * lra-constraints.c (valid_address_p): Use
22849 insn_extra_relaxed_memory_constraint instead of other memory
22851 (process_alt_operands): Process CT_RELAXED_MEMORY.
22852 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
22853 * recog.c (asm_operand_ok, preprocess_constraints): Process
22855 * reload.c (find_reloads): Ditto.
22856 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
22857 * stmt.c (parse_input_constraint): Use
22858 insn_extra_relaxed_memory_constraint.
22860 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
22863 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
22866 2021-03-22 Alex Coplan <alex.coplan@arm.com>
22869 * config/arm/arm-protos.h (neon_make_constant): Add generate
22870 argument to guard emitting insns, default to true.
22871 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
22872 CONST_VECTORs which neon_make_constant can't handle.
22873 (neon_vdup_constant): Add generate argument, avoid emitting
22874 insns if it's not set.
22875 (neon_make_constant): Plumb new generate argument through.
22876 * config/arm/constraints.md (Ui): New. Use it...
22877 * config/arm/mve.md (*mve_mov<mode>): ... here.
22878 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
22879 synthesize constants.
22881 2021-03-22 Richard Biener <rguenther@suse.de>
22883 * debug.h: Add deprecation warning.
22885 2021-03-22 Richard Biener <rguenther@suse.de>
22887 PR tree-optimization/99694
22888 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
22891 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
22894 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
22895 after type checking.
22897 2021-03-22 Jakub Jelinek <jakub@redhat.com>
22901 * dwarf2out.c (get_full_len): Use get_precision rather than
22903 (add_const_value_attribute): Make sure add_AT_wide argument has
22904 precision prec rather than some very wide one.
22906 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
22908 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
22909 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
22910 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
22911 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
22912 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
22913 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
22914 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
22915 *fix_trunc<mode>si2_internal): Fix empty split condition.
22916 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
22917 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
22918 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
22919 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
22921 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
22924 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
22925 Convert idx to DImode.
22926 (rs6000_expand_vector_set_var_p8): Likewise.
22928 2021-03-21 Jakub Jelinek <jakub@redhat.com>
22931 * dwarf2out.c (insert_float): Change return type from void to
22932 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
22933 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
22936 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
22939 * config/i386/i386.c (construct_container): Check cfun != NULL
22940 before accessing silent_p.
22942 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
22944 * asan.c: Fix typos in comments.
22946 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
22948 PR rtl-optimization/99680
22949 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
22950 (process_address_1): Check empty constraint before using
22953 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
22955 * config/rs6000/rs6000.c (power10_cost): New.
22956 (rs6000_option_override_internal): Set Power10 costs.
22957 (rs6000_issue_rate): Set Power10 issue rate.
22958 * config/rs6000/power10.md: Rewrite for Power10.
22960 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
22963 * lra-constraints.c (process_address_1): Don't use unknown
22964 constraint for address constraint.
22966 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
22969 * config.gcc (powerpc-*-darwin8): Delete the reference to
22970 the now removed darwin8.h.
22972 2021-03-19 Olivier Hainque <hainque@adacore.com>
22975 * config/vxworksae.h (VX_CPU_PREFIX): Define.
22977 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
22979 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
22981 2021-03-19 Tamar Christina <tamar.christina@arm.com>
22983 PR tree-optimization/99656
22984 * tree-vect-slp-patterns.c (linear_loads_p,
22985 complex_add_pattern::matches, is_eq_or_top,
22986 vect_validate_multiplication, complex_mul_pattern::matches,
22987 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
22988 * tree-vectorizer.h: (complex_load_perm_t): Removed.
22989 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
22990 complex_load_perm_t.
22992 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
22995 * config/i386/i386-options.c (ix86_init_machine_status): Set
22997 * config/i386/i386.c (init_cumulative_args): Set silent_p to
22999 (construct_container): Return early for return and argument
23000 errors if silent_p is true.
23001 * config/i386/i386.h (machine_function): Add silent_p.
23003 2021-03-19 Jakub Jelinek <jakub@redhat.com>
23006 * config/arm/constraints.md (Ds): New constraint.
23007 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
23008 constraint instead of w,Dm.
23010 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
23012 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
23015 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
23017 PR middle-end/99641
23018 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
23019 array type, do the computation of the current position in sizetype.
23021 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
23024 * lra-constraints.c (process_address_1): Use lookup_constraint
23025 only for a single constraint.
23027 2021-03-18 Martin Sebor <msebor@redhat.com>
23029 PR middle-end/99502
23030 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
23031 (inbounds_memaccess_p): ...to this. Check the ending offset of
23032 the accessed member.
23034 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
23036 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
23037 %> quote markers to error messages.
23038 (gcn_goacc_validate_dims): Likewise.
23039 (gcn_conditional_register_usage): Remove exclaimation mark from error
23041 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
23043 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
23045 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
23048 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
23049 Kito Cheng <kito.cheng@sifive.com>
23051 * config/riscv/riscv.c (riscv_block_move_straight): Change type
23052 to unsigned HOST_WIDE_INT for parameter and local variable with
23053 HOST_WIDE_INT type.
23054 (riscv_adjust_block_mem): Ditto.
23055 (riscv_block_move_loop): Ditto.
23056 (riscv_expand_block_move): Ditto.
23058 2021-03-18 Nick Clifton <nickc@redhat.com>
23060 * config/v850/v850.c (construct_restore_jr): Increase static
23062 (construct_save_jarl): Likewise.
23063 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
23065 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23067 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
23068 (aarch64_override_options_internal): Use it.
23069 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
23072 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
23074 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
23075 error message format issues.
23076 (nios2_option_override): Likewise.
23077 (nios2_expand_fpu_builtin): Likewise.
23078 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
23079 truncation warning.
23080 (nios2_expand_custom_builtin): More error message format fixes.
23081 (nios2_expand_rdwrctl_builtin): Likewise.
23082 (nios2_expand_rdprs_builtin): Likewise.
23083 (nios2_expand_eni_builtin): Likewise.
23084 (nios2_expand_builtin): Likewise.
23085 (nios2_register_custom_code): Likewise.
23086 (nios2_valid_target_attribute_rec): Likewise.
23087 (nios2_add_insn_asm): Fix uninitialized variable warning.
23089 2021-03-17 Jan Hubicka <jh@suse.cz>
23091 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
23092 of gather to match reality.
23093 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
23095 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23097 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
23098 to compare against CC_REG rather than NE.
23100 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
23103 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
23104 inline assembly statements.
23105 (ix86_print_operand): Update 'P' handling for -fno-plt.
23107 2021-03-17 Tamar Christina <tamar.christina@arm.com>
23110 * config/aarch64/aarch64.c
23111 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
23113 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
23116 * config/rs6000/predicates.md (branch_comparison_operator): Allow
23117 ordered and unordered for CCFPmode, if flag_finite_math_only.
23119 2021-03-16 Jakub Jelinek <jakub@redhat.com>
23122 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
23123 rather than ASHIFT.
23124 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
23126 2021-03-16 Martin Liska <mliska@suse.cz>
23129 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
23130 cl_optimization_compare function.
23132 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
23134 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
23137 2021-03-16 Jakub Jelinek <jakub@redhat.com>
23140 * config/i386/i386.h (struct machine_function): Add
23141 has_explicit_vzeroupper bitfield.
23142 * config/i386/i386-expand.c (ix86_expand_builtin): Set
23143 cfun->machine->has_explicit_vzeroupper when expanding
23144 IX86_BUILTIN_VZEROUPPER.
23145 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
23146 Do the mode switching only when TARGET_VZEROUPPER, expensive
23147 optimizations turned on and not optimizing for size.
23148 (pass_insert_vzeroupper::gate): Enable even when
23149 cfun->machine->has_explicit_vzeroupper is set.
23151 2021-03-16 Jakub Jelinek <jakub@redhat.com>
23154 * config/aarch64/aarch64.c
23155 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
23156 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
23157 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
23159 2021-03-15 Richard Biener <rguenther@suse.de>
23161 PR tree-optimization/98834
23162 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
23163 subsetting by truncating the access size.
23165 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
23167 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
23168 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
23171 2021-03-15 Martin Liska <mliska@suse.cz>
23173 * spellcheck.c: Add missing comma in initialization.
23175 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
23177 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
23178 alternative 2 and alternative 1 with alternative 3 using
23179 YW register constraint.
23180 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
23181 using YW register constraint.
23182 (*vec_extractv16qi_zext): Ditto.
23183 (*vec_extractv4si): Merge alternatives 4 and 5
23184 using Yw register constraint.
23185 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
23187 2021-03-13 Martin Sebor <msebor@redhat.com>
23189 PR tree-optimization/99489
23190 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
23191 is not a call statement.
23193 2021-03-13 Jakub Jelinek <jakub@redhat.com>
23195 PR tree-optimization/99544
23196 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
23197 if for vector types multiplication can't be done in type's mode.
23199 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
23202 * config/sparc/constraints.md (w): Rename to...
23203 (W): ... this and ditch previous implementation.
23204 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
23205 (*movdf_insn_sp64): Likewise.
23206 (*mov<VM64:mode>_insn_sp64): Likewise.
23207 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
23209 (atomic_compare_and_swap_leon3_1): Likewise.
23210 (*atomic_compare_and_swapdi_v8plus): Likewise.
23211 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
23212 architecture and add missing address validity check during LRA.
23214 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
23217 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
23218 occuring for assumed-size arrays in use_device_{ptr,addr}.
23220 2021-03-12 Jakub Jelinek <jakub@redhat.com>
23223 * config/i386/constraints.md (YW): New internal constraint.
23224 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
23225 (*<sse2_avx2>_<insn><mode>3<mask_name>,
23226 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
23227 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
23229 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
23230 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
23231 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
23232 into one, use Yw instead of former x,v.
23233 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
23234 the last alternative.
23235 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
23236 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
23237 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
23238 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
23239 into one, use <v_Yw> instead of former x,v.
23240 (avx2_interleave_highv32qi<mask_name>,
23241 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
23242 constraints. Add && <mask_avx512bw_condition> to condition.
23243 (avx2_interleave_lowv32qi<mask_name>,
23244 vec_interleave_lowv16qi<mask_name>,
23245 avx2_interleave_highv16hi<mask_name>,
23246 vec_interleave_highv8hi<mask_name>,
23247 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
23248 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
23249 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
23250 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
23251 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
23252 Yw instead of v in constraints.
23253 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
23254 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
23255 instead of Yv in constraints.
23256 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
23257 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
23258 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
23259 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
23260 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
23262 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
23263 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
23264 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
23265 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
23266 two, one with just x, another isa avx512vl with v.
23268 2021-03-12 Martin Liska <mliska@suse.cz>
23270 * doc/invoke.texi: Add missing param documentation.
23272 2021-03-11 David Malcolm <dmalcolm@redhat.com>
23275 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
23276 analyzer/trimmed-graph.o.
23277 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
23278 feasibility checking to reflect new implementation.
23279 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
23281 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
23283 2021-03-11 David Malcolm <dmalcolm@redhat.com>
23285 * digraph.cc (selftest::test_shortest_paths): Update
23286 shortest_paths init for new param. Add test of
23287 SPS_TO_GIVEN_TARGET.
23288 * shortest-paths.h (enum shortest_path_sense): New.
23289 (shortest_paths::shortest_paths): Add "sense" param.
23290 Update for renamings. Generalize to use "sense" param.
23291 (shortest_paths::get_shortest_path): Rename param.
23292 (shortest_paths::m_sense): New field.
23293 (shortest_paths::m_prev): Rename...
23294 (shortest_paths::m_best_edge): ...to this.
23295 (shortest_paths::get_shortest_path): Update for renamings.
23296 Conditionalize flipping of path on sense of traversal.
23298 2021-03-11 David Malcolm <dmalcolm@redhat.com>
23300 * digraph.cc (selftest::test_shortest_paths): Add test coverage
23301 for paths from B and C.
23302 * shortest-paths.h (shortest_paths::shortest_paths): Handle
23303 unreachable nodes, rather than asserting.
23305 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
23308 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
23309 xcoff_tbss_section_name.
23310 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
23311 * xcoffout.c (xcoff_tbss_section_name): Delete.
23312 * xcoffout.h (xcoff_tbss_section_name): Delete.
23314 2021-03-11 Richard Biener <rguenther@suse.de>
23316 PR tree-optimization/99523
23317 * tree-cfg.c (dump_function_to_file): Dump SSA names
23318 w/o identifier to the decls section as well, not only those
23319 without a VAR_DECL.
23321 2021-03-11 Jakub Jelinek <jakub@redhat.com>
23324 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
23325 function calls with lhs fail if the lhs don't have compatible types.
23327 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
23329 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
23330 Change FRAME_POINTER_REGNUM to correspond to a new faked
23331 register faked_fp, part of GENNONACR_REGS like faked_ap.
23332 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
23333 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
23334 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
23335 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
23336 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
23338 (CRIS_REAL_FP_REGNUM): New constant.
23339 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
23340 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
23341 (cris_initial_elimination_offset): Handle elimination changes
23342 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
23343 and add one from FRAME_POINTER_REGNUM to
23344 HARD_FRAME_POINTER_REGNUM.
23345 (cris_expand_prologue, cris_expand_epilogue): Emit code for
23346 hard_frame_pointer_rtx instead of frame_pointer_rtx.
23348 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
23351 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
23352 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
23354 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
23357 * lra-constraints.c (process_address_1): Don't check unknown
23358 constraint, use X for empty constraint.
23360 2021-03-10 Alex Coplan <alex.coplan@arm.com>
23362 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
23363 Fix typo in comment describing "is_ha" argument.
23365 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
23367 * doc/sourcebuild.texi: Document LRA target selector.
23369 2021-03-10 David Malcolm <dmalcolm@redhat.com>
23371 * doc/ux.texi: Add subsection contrasting interactive versus
23372 batch usage of GCC.
23374 2021-03-10 Joel Hutton <joel.hutton@arm.com>
23377 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
23379 (vectorizable_load): Fix gather load mask check condition.
23381 2021-03-10 Richard Biener <rguenther@suse.de>
23383 PR tree-optimization/99510
23384 * tree.c (check_aligned_type): Check that the candidate
23385 has TYPE_USER_ALIGN set instead of matching with the
23388 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
23390 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
23391 float and vector integer modes only if the mode is not larger.
23393 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
23395 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
23397 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
23399 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
23401 * ira-lives.c (single_reg_class): Ditto.
23403 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
23405 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
23406 the architecture-specific rtems.h.
23407 (aarch64-*-rtems*): Likewise.
23408 (arm*-*-rtems*): Likewise.
23409 (epiphany-*-rtems*): Likewise.
23410 (riscv*-*-rtems*): Likewise.
23412 2021-03-09 Jakub Jelinek <jakub@redhat.com>
23414 PR tree-optimization/99305
23415 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
23416 before integer_all_onesp instead of vice versa.
23418 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
23420 * common/config/arm/arm-common.c (arm_config_default): Change type
23421 of 'i' to unsigned.
23423 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
23426 * lra-constraints.c (process_address_1): Process constraint 'g'
23427 separately and digital constraints containing more one digit.
23429 2021-03-09 Nick Clifton <nickc@redhat.com>
23431 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
23432 (DWARF"_DEBUGGING_INFO): Define.
23434 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
23437 * calls.c (initialize_argument_information): When the argument
23438 is passed by reference, do not make a copy in a thunk only if
23439 the argument is already in memory. Remove redundant test for
23440 the case of callee copy.
23442 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
23445 * lra-constraints.c (process_address_1): Process 0..9 constraints
23446 in process_address_1.
23448 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
23450 * config/s390/s390.c (struct s390_processor processor_table):
23451 Binutils name string must not be empty.
23453 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
23455 * config/arc/arc.c (arc_attr_type): Remove function.
23457 2021-03-09 Martin Liska <mliska@suse.cz>
23460 * config/i386/i386-options.c (ix86_option_override_internal):
23461 Set isa_flags for OPTS argument and not for the global
23464 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
23466 * config/rs6000/predicates.md (ds_form_mem_operand): Check
23469 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
23472 * config/rs6000/predicates.md (ds_form_mem_operand) New
23474 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
23475 ds_form_mem_operand in ld/lwa patterns.
23476 * config/rs6000/fusion.md: Regenerate file.
23478 2021-03-08 Martin Sebor <msebor@redhat.com>
23480 PR middle-end/98266
23481 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
23482 (array_bounds_checker::check_array_bounds): Call it.
23484 2021-03-08 Martin Sebor <msebor@redhat.com>
23486 PR middle-end/97631
23487 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
23488 (handle_builtin_stxncpy_strncat): Rename locals. Determine
23489 destination size from allocation calls. Issue a more appropriate
23491 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
23492 (handle_builtin_memset): Same.
23494 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
23497 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
23498 to ensure we do not have an Altivec style address.
23499 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
23500 an Altivec style address.
23501 (*vsx_le_perm_store_<mode>): Likewise.
23502 (splitters after *vsx_le_perm_store_<mode>): Likewise.
23503 (vsx_load_<mode>): Disable special expander if passed an Altivec
23505 (vsx_store_<mode>): Likewise.
23507 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23510 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
23511 (aarch64_simd_shift_imm_vec_hi): Likewise.
23512 (aarch64_simd_shift_imm_vec_si): Likewise.
23513 (aarch64_simd_shift_imm_vec_di): Likewise.
23514 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
23515 predicate from above.
23516 (aarch64_shrn<mode>_insn_be): Likewise.
23517 (aarch64_rshrn<mode>_insn_le): Likewise.
23518 (aarch64_rshrn<mode>_insn_be): Likewise.
23519 (aarch64_shrn2<mode>_insn_le): Likewise.
23520 (aarch64_shrn2<mode>_insn_be): Likewise.
23521 (aarch64_rshrn2<mode>_insn_le): Likewise.
23522 (aarch64_rshrn2<mode>_insn_be): Likewise.
23524 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
23527 * lra-constraints.c (skip_contraint_modifiers): New function.
23528 (process_address_1): Use it before lookup_constraint call.
23530 2021-03-08 Martin Liska <mliska@suse.cz>
23533 * config/i386/i386-options.c (ix86_option_override_internal):
23534 Enable UINTR and HRESET for -march that supports it.
23536 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
23538 * config/s390/s390.c (f_constraint_p): New function.
23539 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
23540 (TARGET_MD_ASM_ADJUST): Likewise.
23542 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
23545 * tree-nested.c (convert_local_reference_stmt): Avoid calling
23546 lookup_field_for_decl for Fortran module (= namespace context).
23548 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
23550 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
23551 comparison with arithmetic right shift.
23552 (s390_expand_vcond): No need for a force_reg anymore.
23553 s390_vec_compare will do it.
23554 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
23555 immediate operands.
23557 2021-03-07 Jakub Jelinek <jakub@redhat.com>
23560 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
23561 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
23563 * config/i386/sse.md (v_Yw): New define_mode_attr.
23564 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
23565 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
23567 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
23568 xYw in constraints.
23570 2021-03-06 Julian Brown <julian@codesourcery.com>
23572 * tree-pretty-print.c (dump_generic_node): Emit non-generic
23573 address space info for aggregates.
23575 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
23577 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
23579 2021-03-05 Jakub Jelinek <jakub@redhat.com>
23581 PR middle-end/99322
23582 * tree-cfg.c (bb_to_omp_idx): New variable.
23583 (execute_build_cfg): Release the bb_to_omp_idx vector after
23584 cleanup_tree_cfg returns.
23585 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
23586 for bb_to_omp_idx being a vec<int> instead of pointer to array
23588 (make_edges): Remove bb_to_omp_idx local variable, don't pass
23589 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
23590 vec<int> instead of pointer to array of ints and don't free/release
23592 (remove_bb): When removing a bb and placing forced label somewhere
23593 else, ensure it is put into the same OpenMP region during cfg
23594 pass if possible or to entry successor as fallback. Unregister
23595 bb from bb_to_omp_idx.
23597 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
23600 * lra-constraints.c (process_address_1): Skip decomposing address
23601 for asm insn operand with unknown constraint.
23603 2021-03-05 Martin Jambor <mjambor@suse.cz>
23606 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
23607 corresponding speculative edges if we are about to resolve
23608 sepculation. Make edge direct (and so resolve speculations) before
23609 removing it from call_site_hash.
23610 (cgraph_edge::make_direct): Relax the initial assert to allow calling
23611 the function on speculative direct edges.
23613 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
23615 PR rtl-optimization/99376
23616 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
23617 of low-order zero bits is too large, set the result to 0 directly.
23619 2021-03-04 Jakub Jelinek <jakub@redhat.com>
23621 PR middle-end/93235
23622 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
23623 SUBREG and a SUBREG to op_mode can't be created.
23625 2021-03-04 Alex Coplan <alex.coplan@arm.com>
23628 * config/aarch64/aarch64-sve-builtins.cc
23629 (function_resolver::require_vector_type): Handle error_mark_node.
23631 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
23633 * cfgexpand.c (expand_asm_loc): Pass new parameter.
23634 (expand_asm_stmt): Likewise.
23635 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
23637 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
23638 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
23639 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
23640 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
23641 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
23642 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
23643 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
23644 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
23645 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
23646 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
23647 * doc/tm.texi (md_asm_adjust): Likewise.
23648 * target.def (md_asm_adjust): Likewise.
23650 2021-03-04 Richard Biener <rguenther@suse.de>
23652 PR middle-end/97855
23653 * tree-pretty-print.c: Poison pp_printf.
23654 (dump_decl_name): Avoid use of pp_printf.
23655 (dump_block_node): Likewise.
23656 (dump_generic_node): Likewise.
23658 2021-03-04 Martin Sebor <msebor@redhat.com>
23660 PR middle-end/96963
23661 PR middle-end/94655
23662 * builtins.c (handle_array_ref): New helper.
23663 (handle_mem_ref): New helper.
23664 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
23665 into new helper functions. Correct a workaround for vectorized
23668 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
23670 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
23671 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
23672 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
23673 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
23674 attribute for Power10.
23675 * config/rs6000/mma.md (*movoo): Likewise.
23676 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
23677 (define_mode_attr bits): Add DD/TD modes.
23678 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
23679 store_conditionalpti): Update size attribute for Power10.
23681 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
23684 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
23685 -Wuninitialized, -Wmaybe-uninitialized.
23686 (wide-int.o-warn): Likewise.
23688 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
23690 * common/config/arm/arm-common.c: Include configargs.h.
23691 (arm_config_default): New function.
23692 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
23693 processors that do not support Thumb. Take into account the
23694 --with-mode configuration setting for selecting the default.
23695 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
23696 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
23698 2021-03-03 Martin Liska <mliska@suse.cz>
23700 PR gcov-profile/97461
23701 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
23703 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
23706 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
23707 point back the hard frame pointer to its default location when the
23708 frame is larger than SEH_MAX_FRAME_SIZE.
23710 2021-03-03 Jakub Jelinek <jakub@redhat.com>
23713 * config/i386/predicates.md (logic_operator): New define_predicate.
23714 * config/i386/i386.md (mov + mem using comm arith peephole2):
23715 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
23716 and the inner mode is [QH]Imode.
23718 2021-03-03 Jakub Jelinek <jakub@redhat.com>
23721 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
23722 (new_loc_list): Clear end_entry.
23723 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
23724 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
23726 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
23727 initialize also end_entry.
23729 2021-03-03 Jakub Jelinek <jakub@redhat.com>
23732 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
23733 partitions, if in non-layout mode after reorder_blocks also move
23734 affected blocks to ensure a single partition transition.
23736 2021-03-03 Jason Merrill <jason@redhat.com>
23739 * cgraphunit.c (process_function_and_variable_attributes): Don't
23740 warn about flatten on an alias if the target also has it.
23741 * cgraph.h (symtab_node::get_alias_target_tree): New.
23743 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
23745 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
23746 period to symbol name.
23747 (tls_get_addr_internal<mode>): Same.
23749 2021-03-02 David Malcolm <dmalcolm@redhat.com>
23752 * diagnostic-show-locus.c
23753 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
23756 2021-03-02 Martin Sebor <msebor@redhat.com>
23758 PR middle-end/99276
23759 * builtins.c (warn_for_access): Remove stray warning text.
23761 2021-03-02 Martin Sebor <msebor@redhat.com>
23763 PR middle-end/99295
23764 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
23767 2021-03-02 Jakub Jelinek <jakub@redhat.com>
23770 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
23771 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
23772 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
23773 DW_MACRO_define_strx and DW_MACRO_undef_strx.
23774 (save_macinfo_strings): Use DW_MACRO_*_str* even with
23775 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
23776 DW_MACRO_undef_strx.
23778 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
23780 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
23782 (BT_FN_V8HI_V8HI_UINT): Likewise.
23783 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
23784 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
23785 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
23786 New builtin definitions.
23787 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
23788 vector extension version.
23789 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
23790 available with current -march level.
23791 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
23792 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
23793 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
23794 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
23795 (vec_extend_to_fp32_lo): Likewise.
23796 (vec_round_from_fp32): Likewise.
23797 (vec_convert_to_fp16): Likewise.
23798 (vec_convert_from_fp16): Likewise.
23799 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
23800 (vclfnls_v8hi): Likewise.
23801 (vcrnfs_v8hi): Likewise.
23802 (vcfn_v8hi): Likewise.
23803 (vcnf_v8hi): Likewise.
23805 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
23807 * common/config/s390/s390-common.c (processor_flags_table): New entry.
23808 * config.gcc: Enable arch14 for --with-arch and --with-tune.
23809 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
23810 arch14 for unknown CPU models.
23811 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
23812 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
23813 (s390_get_sched_attrmask): Likewise.
23814 (s390_get_unit_mask): Likewise.
23815 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
23816 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
23817 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
23818 (TARGET_NNPA_P): New macro definitions.
23819 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
23820 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
23822 2021-03-02 Jakub Jelinek <jakub@redhat.com>
23824 PR middle-end/95757
23825 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
23826 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
23827 != 1 comparisons if name is lhs of a comparison.
23829 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
23833 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
23834 * config/darwin.c (darwin_should_restore_cfa_state): New.
23835 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
23836 * doc/tm.texi: Regenerated.
23837 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
23838 * dwarf2cfi.c (connect_traces): If the target requests, restore
23839 the CFA expression after a DW_CFA_restore.
23840 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
23842 2021-03-01 Martin Liska <mliska@suse.cz>
23845 * optc-save-gen.awk: Add 4 more exceptions.
23847 2021-03-01 Nathan Sidwell <nathan@acm.org>
23850 * tree.h (TYPE_ALIGN_RAW): New accessor.
23851 (TYPE_ALIGN): Use it.
23853 2021-03-01 Jan Hubicka <jh@suse.cz>
23856 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
23858 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
23861 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
23862 point the hard frame pointer to the SSE register save area instead
23863 of the general register save area. Perform only minimal adjustment
23864 for small frames if it is initially not correctly aligned.
23865 (ix86_expand_prologue): Remove early saves for a SEH target.
23866 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
23868 2021-02-28 Jakub Jelinek <jakub@redhat.com>
23871 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
23872 typo - referneced -> referenced.
23873 * tree.c (component_ref_size): Fix comment typo -
23874 refernce -> reference.
23875 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
23876 traling -> trailing.
23877 (aliasing_component_refs_p): Fix comment typos -
23878 refernce -> reference and refernece -> reference and
23879 traling -> trailing.
23880 (nonoverlapping_refs_since_match_p): Fix comment typo -
23881 referneces -> references.
23882 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
23883 referneces -> references.
23885 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
23887 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
23888 diagnostic message to avoid use of a contraction and format
23891 2021-02-27 Jakub Jelinek <jakub@redhat.com>
23894 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
23895 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
23896 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
23897 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
23900 2021-02-26 Richard Biener <rguenther@suse.de>
23902 PR middle-end/99281
23903 * expr.c (store_field): For calls with return-slot optimization
23904 and addressable return type expand the store directly.
23906 2021-02-26 Richard Biener <rguenther@suse.de>
23909 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
23911 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
23914 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
23917 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
23919 * config.gcc: Add rs6000-pcrel-opt.o.
23920 * config/rs6000/rs6000-pcrel-opt.c: New file.
23921 * config/rs6000/pcrel-opt.md: New file.
23922 * config/rs6000/predicates.md: Add d_form_memory predicate.
23923 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
23924 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
23925 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
23926 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
23927 and make_pass_pcrel_opt().
23928 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
23929 (rs6000_option_override_internal): Add pcrel-opt.
23930 (rs6000_delegitimize_address): Support pcrel-opt.
23931 (rs6000_opt_masks): Add pcrel-opt.
23932 (pcrel_opt_valid_mem_p): New function.
23933 (reg_to_non_prefixed): Make global.
23934 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
23935 (output_pcrel_opt_reloc): New function.
23936 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
23937 (pcrel_extern_addr): Set loads_extern_addr.
23938 Add include for pcrel-opt.md.
23939 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
23940 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
23943 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
23946 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
23947 If TARGET_64BIT and dest is SUBREG, we check the width, if it
23948 equal to SImode, we use SImode operation, just like what we are
23951 2021-02-26 Marek Polacek <polacek@redhat.com>
23953 * builtins.c (warn_for_access): Fix typos.
23955 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
23957 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
23958 mark in front of the immediate quantity.
23959 (<optab>_rolsi3_uxtw): Likewise.
23961 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
23964 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
23965 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
23966 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
23967 address and disable when the FPCXT is not available.
23968 (nonsecure_call_value_reg_thumb2): Likewise.
23970 2021-02-25 Nathan Sidwell <nathan@acm.org>
23973 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
23975 2021-02-25 Tamar Christina <tamar.christina@arm.com>
23977 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
23979 2021-02-25 Richard Biener <rguenther@suse.de>
23981 PR tree-optimization/99253
23982 * tree-vect-loop.c (check_reduction_path): First compute
23983 code, then verify out-of-loop uses.
23985 2021-02-25 Jakub Jelinek <jakub@redhat.com>
23988 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
23990 2021-02-25 Jakub Jelinek <jakub@redhat.com>
23992 PR tree-optimization/80635
23993 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
23994 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
23995 has mode precision.
23997 2021-02-25 Richard Biener <rguenther@suse.de>
23999 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
24000 load_map population.
24001 (vect_match_slp_patterns_2): Revert part of last change.
24002 (vect_analyze_slp): Do not interleave optimize_load_redistribution
24003 with pattern detection but do it afterwards. Dump the
24004 whole SLP graph after pattern recognition and load
24005 redistribution optimization finished.
24007 2021-02-24 Jakub Jelinek <jakub@redhat.com>
24010 * omp-low.c (struct omp_context): Add teams_nested_p and
24011 nonteams_nested_p members.
24012 (scan_omp_target): Diagnose teams nested inside of target with other
24013 directives strictly nested inside of the same target.
24014 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
24015 ctx->nonteams_nested_p as needed.
24017 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
24019 PR inline-asm/99123
24020 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
24022 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
24024 * config/cris/cris.c (cris_expand_prologue): Set
24025 current_function_static_stack_size, if flag_stack_usage_info.
24027 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
24029 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
24030 (rs6000_final_prescan_insn): Adjust.
24031 (rs6000_asm_output_opcode): Likewise.
24033 2021-02-24 Martin Sebor <msebor@redhat.com>
24035 PR middle-end/97172
24036 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
24037 from function arguments.
24039 2021-02-24 Tamar Christina <tamar.christina@arm.com>
24041 PR tree-optimization/99220
24042 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
24043 node from cache when it's about to be deleted.
24045 2021-02-24 Jakub Jelinek <jakub@redhat.com>
24047 PR tree-optimization/99225
24048 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
24049 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
24050 build_int_cst (..., 1). Formatting fixes.
24052 2021-02-24 Tamar Christina <tamar.christina@arm.com>
24054 PR tree-optimization/99149
24055 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
24057 (vect_slp_reset_pattern): Remove.
24058 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
24059 (complex_mul_pattern::build, complex_fma_pattern::build,
24060 complex_fms_pattern::build): Fix ref counts.
24061 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
24062 when node is being deleted.
24063 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
24064 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
24066 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
24068 2021-02-24 Matthias Klose <doko@ubuntu.com>
24071 2020-12-07 Matthias Klose <doko@ubuntu.com>
24073 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
24074 and ENABLE_RTL_FLAG_CHECKING.
24076 2021-02-24 Richard Biener <rguenther@suse.de>
24079 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
24081 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
24083 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
24084 (vsx_assemble_pair): ...to this.
24085 (*mma_assemble_pair): Rename from this...
24086 (*vsx_assemble_pair): ...to this.
24087 (mma_disassemble_pair): Rename from this...
24088 (vsx_disassemble_pair): ...to this.
24089 (*mma_disassemble_pair): Rename from this...
24090 (*vsx_disassemble_pair): ...to this.
24091 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
24092 BU_COMPAT): New macros.
24093 (mma_assemble_pair): Rename from this...
24094 (vsx_assemble_pair): ...to this.
24095 (mma_disassemble_pair): Rename from this...
24096 (vsx_disassemble_pair): ...to this.
24097 (mma_assemble_pair): New compatibility built-in.
24098 (mma_disassemble_pair): Likewise.
24099 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
24100 (RS6000_BUILTIN_COMPAT): Define.
24101 (bdesc_compat): New.
24102 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
24103 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
24104 and VSX_BUILTIN_ASSEMBLE_PAIR.
24105 (rs6000_init_builtins): Register compatibility built-ins.
24106 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
24107 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
24108 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
24109 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
24110 (__builtin_vsx_assemble_pair): ...to this.
24111 (__builtin_mma_disassemble_pair): Rename from this...
24112 (__builtin_vsx_disassemble_pair): ...to this.
24114 2021-02-23 Martin Liska <mliska@suse.cz>
24117 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
24118 with different alignment. That leads to an invalid red zone
24119 size allocated in runtime.
24121 2021-02-23 Jakub Jelinek <jakub@redhat.com>
24123 PR tree-optimization/99204
24124 * fold-const.c (fold_read_from_constant_string): Check that
24125 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
24127 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
24128 Kewen Lin <linkw@gcc.gnu.org>
24130 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
24131 (rotl<mode>3_insert_3): ...this.
24132 (plus_ior_xor): New code_iterator.
24133 (define_split for GPR rl*imi): New splitter.
24134 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
24135 for integer merging.
24137 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24139 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
24141 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
24142 into a register when the above is enabled.
24143 * config/aarch64/aarch64.c (neoversev1_tunings):
24144 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
24145 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
24147 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
24149 * config/cris/cris.c (cris_print_operand) <'T'>: Change
24150 valid operand from is now an addi mult-value to shift-value.
24151 * config/cris/cris.md (*addi): Change expression of scaled
24152 operand from mult to ashift.
24153 * config/cris/cris.md (*addi_reload): New insn_and_split.
24155 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
24158 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
24159 hook_bool_const_tree_hwi_hwi_const_tree_true.
24160 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
24162 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
24164 PR rtl-optimization/98791
24165 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
24166 for unordered modes.
24168 2021-02-22 Martin Liska <mliska@suse.cz>
24170 * tree-inline.c (inline_forbidden_p): Set
24171 inline_forbidden_reason.
24173 2021-02-22 Richard Biener <rguenther@suse.de>
24175 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
24178 2021-02-22 Richard Biener <rguenther@suse.de>
24180 PR tree-optimization/99165
24181 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
24182 Accumulate changed to ret.
24184 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
24187 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
24189 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
24191 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
24194 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
24196 (trunctf<DFP_ALL:mode>2): Likewise.
24197 (trunctdtf2_vr): Likewise.
24198 (trunctdtf2): Likewise.
24199 (extend<DFP_ALL:mode>tf2_vr): Likewise.
24200 (extend<DFP_ALL:mode>tf2): Likewise.
24201 (extendtftd2_vr): Likewise.
24202 (extendtftd2): Likewise.
24204 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
24206 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
24207 add memory alternative.
24208 (tf_to_fprx2): New pattern.
24210 2021-02-19 Martin Sebor <msebor@redhat.com>
24213 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
24214 (attr_access::free_lang_data): Remove a spurious test.
24216 2021-02-19 Nathan Sidwell <nathan@acm.org>
24218 * doc/invoke.texi (flang-info-module-read): Document.
24220 2021-02-19 Martin Liska <mliska@suse.cz>
24222 PR translation/99167
24223 * params.opt: Fix typo.
24225 2021-02-19 Richard Biener <rguenther@suse.de>
24227 PR middle-end/99122
24228 * tree-inline.c (inline_forbidden_p): Do not inline functions
24229 with VLA arguments or return value.
24231 2021-02-19 Jakub Jelinek <jakub@redhat.com>
24234 * config/arm/arm.md (*stack_protect_combined_set_insn,
24235 *stack_protect_combined_test_insn): If force_const_mem result
24236 is not valid general operand, force its address into the destination
24239 2021-02-19 Jakub Jelinek <jakub@redhat.com>
24242 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
24243 pad or non-local label, put FORCED_LABELs from bb b after that label
24244 rather than before it.
24246 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
24249 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
24250 expand_vector_broadcast' to emit the vec_duplicate operand.
24252 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
24254 PR rtl-optimization/96264
24255 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
24258 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
24261 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
24262 looking up the retain attribute.
24263 (resolve_unique_section): Likewise.
24264 (get_variable_section): Likewise.
24265 (switch_to_section): Likewise. Warn when a symbol without the
24266 retain attribute and a symbol with the retain attribute are
24267 placed in the section with the same name, instead of the used
24269 * doc/extend.texi: Document the "retain" attribute.
24271 2021-02-18 Nathan Sidwell <nathan@acm.org>
24274 * doc/invoke.texi (flang-info-include-translate): Document header
24277 2021-02-18 Richard Biener <rguenther@suse.de>
24279 PR middle-end/99122
24280 * ipa-fnsummary.c (analyze_function_body): Set
24281 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
24282 * tree-inline.c (insert_init_debug_bind): Pass NULL for
24283 error_mark_node values.
24284 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
24286 (setup_one_parameter): Delay force_value_to_type until when
24289 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
24291 PR tree-optimization/99142
24292 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
24294 2021-02-18 Jakub Jelinek <jakub@redhat.com>
24296 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
24297 wide_int_bitmask::wide_int_bitmask (uint64_t),
24298 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
24299 wide_int_bitmask::operator ~ () const,
24300 wide_int_bitmask::operator | (wide_int_bitmask) const,
24301 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
24303 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
24304 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
24305 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
24306 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
24307 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
24308 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
24309 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
24310 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
24311 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
24312 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
24313 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
24314 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
24315 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
24316 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
24317 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
24318 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
24319 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
24320 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
24321 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
24322 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
24323 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
24324 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
24325 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
24326 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
24328 2021-02-18 Jakub Jelinek <jakub@redhat.com>
24330 PR middle-end/99109
24331 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
24332 (build_printable_array_type): ... this. Add nelts argument. For
24333 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
24334 nelts, call build_array_type_nelts.
24335 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
24336 instead of build_zero_elt_array_type and build_array_type_nelts.
24338 2021-02-18 Jakub Jelinek <jakub@redhat.com>
24341 * config/i386/i386.c (distance_non_agu_define): Don't call
24342 extract_insn_cached here.
24343 (ix86_lea_outperforms): Save and restore recog_data around call
24344 to distance_non_agu_define and distance_agu_use.
24345 (ix86_ok_to_clobber_flags): Remove.
24346 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
24347 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
24348 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
24349 into define_insn. Move the splitting to define_peephole2 and
24350 check there using peep2_regno_dead_p if FLAGS_REG is dead.
24352 2021-02-17 Julian Brown <julian@codesourcery.com>
24354 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
24357 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
24360 * config/mips/mips.c (mips_symbol_insns): Do not use
24361 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
24363 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
24365 PR inline-asm/98096
24366 * stmt.c (resolve_operand_name_1): Take inout operands into account
24367 for access to labels by names.
24368 * doc/extend.texi: Describe counting operands for accessing labels.
24370 2021-02-16 Richard Biener <rguenther@suse.de>
24372 PR tree-optimization/38474
24373 * tree-ssa-structalias.c (variable_info::address_taken): New.
24374 (new_var_info): Initialize address_taken.
24375 (process_constraint): Set address_taken.
24376 (solve_constraints): Use the new address_taken flag rather
24377 than is_reg_var for sorting variables.
24378 (dump_constraint): Dump the variable number if the name
24381 2021-02-16 Jakub Jelinek <jakub@redhat.com>
24384 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
24385 multiply by 4096 and for inbranch by 8192.
24386 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
24387 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
24389 2021-02-15 Maya Rashish <coypu@sdf.org>
24391 * config/aarch64/aarch64.c (aarch64_init_builtins):
24392 Call SUBTARGET_INIT_BUILTINS.
24394 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
24396 PR rtl-optimization/98872
24397 * init-regs.c (initialize_uninitialized_regs): Skip initialization
24398 if CONST0_RTX is NULL.
24400 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
24402 PR rtl-optimization/98863
24403 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
24404 (function_info::build_info): Turn into a declaration, moving the
24405 definition to internals.h.
24406 (function_info::bb_walker): Declare.
24407 (function_info::create_reg_use): Likewise.
24408 (function_info::calculate_potential_phi_regs): Take a build_info
24410 (function_info::place_phis, function_info::create_ebbs): Declare.
24411 (function_info::calculate_ebb_live_in_for_debug): Likewise.
24412 (function_info::populate_backedge_phis): Delete.
24413 (function_info::start_block, function_info::end_block): Declare.
24414 (function_info::populate_phi_inputs): Delete.
24415 (function_info::m_potential_phi_regs): Move information to build_info.
24416 * rtl-ssa/internals.h: New file.
24417 (function_info::bb_phi_info): New class.
24418 (function_info::build_info): Moved from functions.h.
24419 Add a constructor and destructor.
24420 (function_info::build_info::ebb_use): Delete.
24421 (function_info::build_info::ebb_def): Likewise.
24422 (function_info::build_info::bb_live_out): Likewise.
24423 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
24424 (function_info::build_info::potential_phi_regs): Likewise.
24425 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
24426 (function_info::build_info::ebb_def_regs): Likewise.
24427 (function_info::build_info::bb_phis): Likewise.
24428 (function_info::build_info::bb_mem_live_out): Likewise.
24429 (function_info::build_info::bb_to_rpo): Likewise.
24430 (function_info::build_info::def_stack): Likewise.
24431 (function_info::build_info::old_def_stack_limit): Likewise.
24432 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
24433 Remove the regno argument. Push the previous definition onto the
24434 definition stack where necessary.
24435 * rtl-ssa/accesses.cc: Include internals.h.
24436 * rtl-ssa/changes.cc: Likewise.
24437 * rtl-ssa/blocks.cc: Likewise.
24438 (function_info::build_info::build_info): Define.
24439 (function_info::build_info::~build_info): Likewise.
24440 (function_info::bb_walker): New class.
24441 (function_info::bb_walker::bb_walker): Define.
24442 (function_info::add_live_out_use): Convert a logarithmic-complexity
24443 test into a linear one. Allow the same definition to be passed
24445 (function_info::calculate_potential_phi_regs): Moved from
24446 functions.cc. Take a build_info parameter and store the
24447 information there instead.
24448 (function_info::place_phis): New function.
24449 (function_info::add_entry_block_defs): Update call to record_reg_def.
24450 (function_info::calculate_ebb_live_in_for_debug): New function.
24451 (function_info::add_phi_nodes): Use bb_phis to decide which
24452 registers need phi nodes and initialize ebb_def_regs accordingly.
24453 Do not add degenerate phis here.
24454 (function_info::add_artificial_accesses): Use create_reg_use.
24455 Assert that all definitions are listed in the DF LR sets.
24456 Update call to record_reg_def.
24457 (function_info::record_block_live_out): Record live-out register
24458 values in the phis of successor blocks. Use the live-out set
24459 when processing the last block in an EBB, instead of always
24460 using the live-in sets of successor blocks. AND the live sets
24461 with the set of registers that have been defined in the EBB,
24462 rather than with all potential phi registers. Cope correctly
24463 with branches back to the start of the current EBB.
24464 (function_info::start_block): New function.
24465 (function_info::end_block): Likewise.
24466 (function_info::populate_phi_inputs): Likewise.
24467 (function_info::create_ebbs): Likewise.
24468 (function_info::process_all_blocks): Rewrite into a multi-phase
24470 * rtl-ssa/functions.cc: Include internals.h.
24471 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
24472 (function_info::init_function_data): Remove caller.
24473 * rtl-ssa/insns.cc: Include internals.h
24474 (function_info::create_reg_use): New function. Lazily any
24475 degenerate phis needed by the linear RPO view.
24476 (function_info::record_use): Use create_reg_use. When processing
24477 debug uses, use potential_phi_regs and test it before checking
24478 whether the register is live on entry to the current EBB. Lazily
24479 calculate ebb_live_in_for_debug.
24480 (function_info::record_call_clobbers): Update call to record_reg_def.
24481 (function_info::record_def): Likewise.
24483 2021-02-15 Martin Liska <mliska@suse.cz>
24485 * toplev.c (init_asm_output): Free output of
24486 gen_command_line_string function.
24487 (process_options): Likewise.
24489 2021-02-15 Martin Liska <mliska@suse.cz>
24491 * params.opt: Add 2 missing Param keywords.
24493 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
24495 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
24497 2021-02-15 Jakub Jelinek <jakub@redhat.com>
24499 PR tree-optimization/99079
24500 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
24501 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
24502 require both type and TREE_TYPE (@1) to be integral types and either
24503 type having smaller or equal precision, or TREE_TYPE (@1) being
24504 unsigned type, or type being signed type. If TREE_TYPE (@1)
24505 doesn't have wrapping overflow, perform the subtraction of one in
24508 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
24509 Richard Biener <rguether@suse.de>
24512 * ipa-reference.c (ipa_init): Only conditinally initialize
24513 reference_vars_to_consider.
24514 (propagate): Conditionally deninitialize reference_vars_to_consider.
24515 (ipa_reference_write_optimization_summary): Sanity check that
24516 reference_vars_to_consider is not allocated.
24518 2021-02-13 Levy Hsu <admin@levyhsu.com>
24521 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
24522 extend parameter to get_si_mem_base_reg declaration.
24523 (get_si_mem_base_reg): Add extend parameter. Set it.
24524 (analyze): Pass extend arg to get_si_mem_base_reg.
24525 (transform): Likewise. Use it when rewriting mems.
24526 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
24527 loads and emit sign/zero extending load followed by subreg move.
24529 2021-02-13 Jim Wilson <jimw@sifive.com>
24532 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
24533 exit when !reload_completed. Only perform check for compressed reg
24534 if reload_completed.
24535 (riscv_rtx_costs): In MEM case, when optimizing for size and
24536 shorten memrefs, if not compressible, then increase cost.
24538 2021-02-13 Jakub Jelinek <jakub@redhat.com>
24540 PR rtl-optimization/98439
24541 * recog.c (pass_split_before_regstack::gate): Enable even when
24542 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
24545 2021-02-13 Jakub Jelinek <jakub@redhat.com>
24548 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
24549 swap of V2SImode elements in memory into DImode memory rotate by 32.
24551 2021-02-12 Martin Sebor <msebor@redhat.com>
24553 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
24555 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
24557 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
24558 m_temp_obstack rather than m_obstack to allocate the temporary use.
24560 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
24562 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
24563 as read-modify operations.
24565 2021-02-12 Richard Biener <rguenther@suse.de>
24567 PR middle-end/38474
24568 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
24569 fbi->aa_walk_budget is bigger than zero. Update
24570 fbi->aa_walk_budget.
24571 (param_change_prob): Likewise.
24572 * ipa-prop.c (detect_type_change_from_memory_writes):
24573 Properly account walk_aliased_vdefs.
24574 (parm_preserved_before_stmt_p): Canonicalize updates.
24575 (parm_ref_data_preserved_p): Likewise.
24576 (parm_ref_data_pass_through_p): Likewise.
24577 (determine_known_aggregate_parts): Account own alias queries.
24579 2021-02-12 Martin Liska <mliska@suse.cz>
24581 * opts-common.c (decode_cmdline_option): Release werror_arg.
24582 * opts.c (gen_producer_string): Release output of
24583 gen_command_line_string.
24585 2021-02-12 Richard Biener <rguenther@suse.de>
24587 PR tree-optimization/38474
24588 * params.opt (-param=max-store-chains-to-track=): New param.
24589 (-param=max-stores-to-track=): Likewise.
24590 * doc/invoke.texi (max-store-chains-to-track): Document.
24591 (max-stores-to-track): Likewise.
24592 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
24594 (pass_store_merging::m_n_stores): Likewise.
24595 (pass_store_merging::terminate_and_process_chain): Update
24596 m_n_stores and m_n_chains.
24597 (pass_store_merging::process_store): Likewise. Terminate
24598 oldest chains if the number of stores or chains get too large.
24599 (imm_store_chain_info::terminate_and_process_chain): Dump
24602 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
24604 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
24605 the cold section, emit a nop before the directive if the previous
24606 active instruction can throw.
24608 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
24611 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
24612 memory addresses that are legal for quad word accesses.
24614 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
24617 * config/arm/thumb2.md (*doloop_end_internal): Generate
24618 alternative sequence to handle long range branches.
24620 2021-02-11 Joel Hutton <joel.hutton@arm.com>
24622 PR tree-optimization/98772
24623 * optabs-tree.c (supportable_half_widening_operation): New function
24624 to check for supportable V8QI->V8HI widening patterns.
24625 * optabs-tree.h (supportable_half_widening_operation): New function.
24626 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
24627 to create promotion stmts for V8QI->V8HI widening patterns.
24628 (vectorizable_conversion): Add case for V8QI->V8HI.
24630 2021-02-11 Richard Biener <rguenther@suse.de>
24632 * sparseset.h (SPARSESET_ELT_BITS): Remove.
24633 (SPARSESET_ELT_TYPE): Use unsigned int.
24634 * fwprop.c: Do not include sparseset.h.
24636 2021-02-10 Jakub Jelinek <jakub@redhat.com>
24639 * varasm.c (declare_weak): For -fsyntax-only, allow even
24640 TREE_ASM_WRITTEN function decls.
24642 2021-02-10 Jakub Jelinek <jakub@redhat.com>
24645 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
24646 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
24647 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
24648 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
24649 calling simplify_gen_subreg on it.
24651 2021-02-10 Martin Liska <mliska@suse.cz>
24653 * config/nvptx/nvptx.c (nvptx_option_override): Use
24654 flag_patchable_function_entry instead of the removed
24655 function_entry_patch_area_size.
24657 2021-02-10 Martin Liska <mliska@suse.cz>
24659 PR tree-optimization/99002
24660 PR tree-optimization/99026
24661 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
24662 leak when adjacent cases are merged.
24663 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
24665 (make_pass_lower_switch): Remove trailing whitespace.
24666 * tree-switch-conversion.h (release_clusters): New.
24668 2021-02-10 Richard Biener <rguenther@suse.de>
24670 PR rtl-optimization/99054
24671 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
24672 (fixup_partitions): Adjust.
24673 (rtl_verify_edges): Likewise.
24675 2021-02-10 Jakub Jelinek <jakub@redhat.com>
24677 PR middle-end/99007
24678 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
24679 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
24682 2021-02-10 Richard Biener <rguenther@suse.de>
24685 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
24688 2021-02-10 Richard Biener <rguenther@suse.de>
24690 PR tree-optimization/99024
24691 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
24692 clear loop->aux if it is associated with the destroyed loop_vinfo.
24694 2021-02-10 Martin Liska <mliska@suse.cz>
24696 PR tree-optimization/99002
24697 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
24700 2021-02-10 Martin Liska <mliska@suse.cz>
24703 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
24704 a reference exists.
24706 2021-02-10 Jakub Jelinek <jakub@redhat.com>
24709 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
24710 at class scope for DWARF5+.
24712 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
24714 PR rtl-optimization/96015
24715 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
24716 (relax_delay_slots): When deleting a jump to the next active
24717 instruction over a barrier, first delete the barrier if the
24718 jump is the only way to reach the target label.
24720 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
24722 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
24723 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
24724 vector multiplies and vect.alu for SSRA.
24725 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
24726 vect.mul cost field.
24727 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
24728 * config/arm/arm.c: Likewise.
24730 2021-02-09 Richard Biener <rguenther@suse.de>
24732 PR tree-optimization/98863
24733 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
24734 * tree-ssa-sccvn.c (last_pushed_avail): New global.
24735 (rpo_elim::eliminate_push_avail): Chain pushed avails.
24736 (unwind_state::avail_top): Add.
24737 (do_unwind): Rewrite unwinding of avail entries.
24738 (do_rpo_vn): Initialize last_pushed_avail and
24739 avail_top of the undo state.
24741 2021-02-09 Jakub Jelinek <jakub@redhat.com>
24743 PR middle-end/99004
24744 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
24745 const char * to char * and free those pointers after use.
24747 2021-02-09 Richard Biener <rguenther@suse.de>
24749 PR tree-optimization/99017
24750 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
24751 zero vector cost entries.
24753 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
24755 PR middle-end/98974
24756 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
24757 parameter in vectorizable_condition.
24759 2021-02-08 Richard Biener <rguenther@suse.de>
24762 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
24764 2021-02-08 Martin Liska <mliska@suse.cz>
24767 * cfgexpand.c (pass_expand::execute): Parse per-function option
24768 flag_patchable_function_entry and use it.
24769 * common.opt: Remove function_entry_patch_area_size and
24770 function_entry_patch_area_start global variables.
24771 * opts.c (parse_and_check_patch_area): New function.
24772 (common_handle_option): Use it.
24773 * opts.h (parse_and_check_patch_area): New function.
24774 * toplev.c (process_options): Parse and use
24775 function_entry_patch_area_size.
24777 2021-02-08 Martin Sebor <msebor@redhat.com>
24779 * doc/extend.texi (attribute malloc): Correct typos.
24781 2021-02-05 Nathan Sidwell <nathan@acm.org>
24784 * gcc.c (driver::maybe_run_linker): Check for input file
24785 accessibility if not linking.
24787 2021-02-05 Richard Biener <rguenther@suse.de>
24789 PR tree-optimization/98855
24790 * tree-vectorizer.h (add_stmt_cost): New overload.
24791 * tree-vect-slp.c (li_cost_vec_cmp): New.
24792 (vect_bb_slp_scalar_cost): Cost individual loop regions
24793 separately. Account for the scalar instance root stmt.
24795 2021-02-05 Tom de Vries <tdevries@suse.de>
24798 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
24800 (bit_test_cluster::emit): Reuse location_t for newly created
24802 (switch_decision_tree::try_switch_expansion): Preserve
24804 * tree-switch-conversion.h: Change function signatures.
24806 2021-02-05 Jakub Jelinek <jakub@redhat.com>
24809 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
24810 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
24811 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
24812 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
24814 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24816 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
24817 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
24818 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
24819 (vget_high_f16): Reimplement using new builtin.
24820 (vget_high_f32): Likewise.
24821 (vget_high_f64): Likewise.
24822 (vget_high_p8): Likewise.
24823 (vget_high_p16): Likewise.
24824 (vget_high_p64): Likewise.
24825 (vget_high_s8): Likewise.
24826 (vget_high_s16): Likewise.
24827 (vget_high_s32): Likewise.
24828 (vget_high_s64): Likewise.
24829 (vget_high_u8): Likewise.
24830 (vget_high_u16): Likewise.
24831 (vget_high_u32): Likewise.
24832 (vget_high_u64): Likewise.
24834 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24836 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
24837 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
24838 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
24839 (vget_low_f16): Reimplement using new builtin.
24840 (vget_low_f32): Likewise.
24841 (vget_low_f64): Likewise.
24842 (vget_low_p8): Likewise.
24843 (vget_low_p16): Likewise.
24844 (vget_low_p64): Likewise.
24845 (vget_low_s8): Likewise.
24846 (vget_low_s16): Likewise.
24847 (vget_low_s32): Likewise.
24848 (vget_low_s64): Likewise.
24849 (vget_low_u8): Likewise.
24850 (vget_low_u16): Likewise.
24851 (vget_low_u32): Likewise.
24852 (vget_low_u64): Likewise.
24854 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
24856 * gcc.c (print_multilib_info): Check all required argument is provided
24859 2021-02-05 liuhongt <hongtao.liu@intel.com>
24862 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
24863 generate integer mask comparison for 128/256-bits vector when
24864 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
24865 delete redundant !maskcmp condition.
24866 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
24868 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
24869 condition directly to if (maskcmp), add extra check for
24870 cmpmode, it should be MODE_INT.
24871 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
24872 parameters op_true/op_false.
24873 (ix86_use_mask_cmp_p): New.
24875 2021-02-05 liuhongt <hongtao.liu@intel.com>
24878 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
24879 Remove m_GENERIC from ~list.
24880 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
24882 2021-02-04 David Malcolm <dmalcolm@redhat.com>
24885 * diagnostic-show-locus.c (compatible_locations_p): Require
24886 locations in the same macro map to be either both from the
24887 macro definition, or both from the macro arguments.
24889 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
24891 * config/aarch64/aarch64-simd-builtins.def: Add
24892 [su]mull_hi_lane[q] builtin generator macros.
24893 * config/aarch64/aarch64-simd.md
24894 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
24895 (aarch64_<su>mull_hi_lane<mode>): Define.
24896 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
24897 (aarch64_<su>mull_hi_laneq<mode>): Define.
24898 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
24899 builtin instead of inline asm.
24900 (vmull_high_lane_s32): Likewise.
24901 (vmull_high_lane_u16): Likewise.
24902 (vmull_high_lane_u32): Likewise.
24903 (vmull_high_laneq_s16): Likewise.
24904 (vmull_high_laneq_s32): Likewise.
24905 (vmull_high_laneq_u16): Likewise.
24906 (vmull_high_laneq_u32): Liekwise.
24908 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
24910 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
24911 builtin generator macros.
24912 * config/aarch64/aarch64-simd.md
24913 (aarch64_<su>mull_hi_n<mode>_insn): Define.
24914 (aarch64_<su>mull_hi_n<mode>): Define.
24915 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
24916 instead of inline asm.
24917 (vmull_high_n_s32): Likewise.
24918 (vmull_high_n_u16): Likewise.
24919 (vmull_high_n_u32): Likewise.
24921 2021-02-04 Richard Biener <rguenther@suse.de>
24923 PR tree-optimization/98855
24924 * tree-vect-loop.c (vectorizable_phi): Do not cost
24925 single-argument PHIs.
24926 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
24927 * tree-vect-stmts.c (vectorizable_bswap): Also perform
24928 costing for SLP operation.
24930 2021-02-04 Martin Liska <mliska@suse.cz>
24932 * doc/extend.texi: Mention -mprefer-vector-width in target
24935 2021-02-03 Martin Sebor <msebor@redhat.com>
24937 PR tree-optimization/98937
24938 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
24939 Flush pointer_query cache.
24941 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
24943 * config/rs6000/genfusion.pl (gen_2logical): Add missing
24944 fixes based on patch review.
24945 * config/rs6000/fusion.md: Regenerate file.
24947 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
24949 * config/rs6000/t-rs6000: Comment out auto generation of
24952 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
24954 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
24955 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
24956 (output_file_start): Add gfx908.
24957 * config/gcn/gcn.opt (gpu_type): Add gfx908.
24958 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
24959 (MULTILIB_DIRNAMES): Add gfx908.
24960 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
24961 (main): Recognize gfx908.
24962 * config/gcn/t-omp-device: Add gfx908.
24964 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
24966 * config/aarch64/aarch64-simd-builtins.def: Add
24967 [su]mlsl_hi_lane[q] builtin macro generators.
24968 * config/aarch64/aarch64-simd.md
24969 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
24970 (aarch64_<su>mlsl_hi_lane<mode>): Define.
24971 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
24972 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
24973 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
24974 builtin instead of inline asm.
24975 (vmlsl_high_lane_s32): Likewise.
24976 (vmlsl_high_lane_u16): Likewise.
24977 (vmlsl_high_lane_u32): Likewise.
24978 (vmlsl_high_laneq_s16): Likewise.
24979 (vmlsl_high_laneq_s32): Likewise.
24980 (vmlsl_high_laneq_u16): Likewise.
24981 (vmlsl_high_laneq_u32): Likewise.
24982 (vmlal_high_laneq_u32): Likewise.
24984 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
24986 * config/aarch64/aarch64-simd-builtins.def: Add
24987 [su]mlal_hi_lane[q] builtin generator macros.
24988 * config/aarch64/aarch64-simd.md
24989 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
24990 (aarch64_<su>mlal_hi_lane<mode>): Define.
24991 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
24992 (aarch64_<su>mlal_hi_laneq<mode>): Define.
24993 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
24994 builtin instead of inline asm.
24995 (vmlal_high_lane_s32): Likewise.
24996 (vmlal_high_lane_u16): Likewise.
24997 (vmlal_high_lane_u32): Likewise.
24998 (vmlal_high_laneq_s16): Likewise.
24999 (vmlal_high_laneq_s32): Likewise.
25000 (vmlal_high_laneq_u16): Likewise.
25001 (vmlal_high_laneq_u32): Likewise.
25003 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
25005 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
25006 builtin generator macros.
25007 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
25009 (aarch64_<su>mlsl_hi_n<mode>): Define.
25010 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
25011 instead of inline asm.
25012 (vmlsl_high_n_s32): Likewise.
25013 (vmlsl_high_n_u16): Likewise.
25014 (vmlsl_high_n_u32): Likewise.
25016 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
25018 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
25019 builtin generator macros.
25020 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
25022 (aarch64_<su>mlal_hi_n<mode>): Define.
25023 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
25024 instead of inline asm.
25025 (vmlal_high_n_s32): Likewise.
25026 (vmlal_high_n_u16): Likewise.
25027 (vmlal_high_n_u32): Likewise.
25029 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
25031 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
25033 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
25035 (aarch64_<su>mlal_hi<mode>_insn): This.
25036 (aarch64_<su>mlal_hi<mode>): Define.
25037 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
25038 instead of inline asm.
25039 (vmlal_high_s16): Likewise.
25040 (vmlal_high_s32): Likewise.
25041 (vmlal_high_u8): Likewise.
25042 (vmlal_high_u16): Likewise.
25043 (vmlal_high_u32): Likewise.
25045 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
25047 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
25048 after calling alter_subreg() on a (mem).
25050 2021-02-03 Martin Liska <mliska@suse.cz>
25053 * lto-streamer-out.c (produce_lto_section): Fill up missing
25055 * lto-streamer.h (struct lto_section): Add _padding field.
25057 2021-02-03 Richard Biener <rguenther@suse.de>
25059 * lto-streamer.c (lto_get_section_name): Free temporary
25061 * tree-loop-distribution.c
25062 (loop_distribution::merge_dep_scc_partitions): Free edge data.
25064 2021-02-03 Jakub Jelinek <jakub@redhat.com>
25066 PR middle-end/97487
25067 * ifcvt.c (noce_can_force_operand): New function.
25068 (noce_emit_move_insn): Use it.
25069 (noce_try_sign_mask): Likewise. Formatting fix.
25071 2021-02-03 Jakub Jelinek <jakub@redhat.com>
25073 PR middle-end/97971
25074 * lra-constraints.c (process_alt_operands): For inline asm, don't call
25075 fatal_insn, but instead return false.
25077 2021-02-03 Jakub Jelinek <jakub@redhat.com>
25079 PR tree-optimization/98287
25080 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
25083 2021-02-03 Tamar Christina <tamar.christina@arm.com>
25085 PR tree-optimization/98928
25086 * tree-vect-loop.c (vect_analyze_loop_2): Change
25087 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
25088 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
25089 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
25090 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
25092 2021-02-02 Richard Biener <rguenther@suse.de>
25094 * gimple-loop-interchange.cc (prepare_data_references):
25096 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
25097 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
25098 * tree-vect-stmts.c (vectorizable_condition): Do not
25100 (vectorizable_comparison): Likewise.
25102 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25104 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
25105 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
25106 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
25107 (vrsqrteq_u32): Likewise.
25109 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25111 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
25112 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
25113 (aarch64_sqxtun2<mode>_be): Likewise.
25114 (aarch64_sqxtun2<mode>): Likewise.
25115 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
25116 (vqmovun_high_s32): Likewise.
25117 (vqmovun_high_s64): Likewise.
25118 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
25120 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25122 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
25124 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
25126 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25128 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
25129 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
25130 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
25131 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
25132 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
25133 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
25134 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
25135 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
25137 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25139 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
25140 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
25141 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
25143 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25145 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
25146 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
25148 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25150 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
25151 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
25152 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
25153 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
25154 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
25155 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
25156 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
25157 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
25158 NONE builtin flags.
25160 2021-02-02 Jakub Jelinek <jakub@redhat.com>
25162 PR tree-optimization/98848
25163 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
25164 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
25166 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
25169 * expr.c: Check mode before calling store_expr.
25171 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
25173 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
25175 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
25176 instruction using expression ior.
25177 (mve_vornq_u<mode>): New expander.
25178 (mve_vornq_f<mode>): Use ior code instead of unspec.
25179 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
25181 2021-02-02 Alexandre Oliva <oliva@adacore.com>
25183 * tree-nested.c (convert_nonlocal_reference_op): Move
25184 current_function_decl restore after re-gimplification.
25185 (convert_local_reference_op): Likewise.
25187 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25189 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
25191 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
25193 (aarch64_rshrn<mode>_insn_be): Likewise.
25194 (aarch64_rshrn<mode>): Likewise.
25195 (aarch64_rshrn2<mode>_insn_le): Likewise.
25196 (aarch64_rshrn2<mode>_insn_be): Likewise.
25197 (aarch64_rshrn2<mode>): Likewise.
25198 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
25199 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
25201 (vrshrn_high_n_s32): Likewise.
25202 (vrshrn_high_n_s64): Likewise.
25203 (vrshrn_high_n_u16): Likewise.
25204 (vrshrn_high_n_u32): Likewise.
25205 (vrshrn_high_n_u64): Likewise.
25206 (vrshrn_n_s16): Likewise.
25207 (vrshrn_n_s32): Likewise.
25208 (vrshrn_n_s64): Likewise.
25209 (vrshrn_n_u16): Likewise.
25210 (vrshrn_n_u32): Likewise.
25211 (vrshrn_n_u64): Likewise.
25213 2021-02-01 Sergei Trofimovich <siarheit@google.com>
25215 PR tree-optimization/98499
25216 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
25217 conservatively and assume all possible side-effects.
25219 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25221 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
25222 vec_unpacku_hi_): Define builtins.
25223 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
25225 (vmovl_high_s16): Likewise.
25226 (vmovl_high_s32): Likewise.
25227 (vmovl_high_u8): Likewise.
25228 (vmovl_high_u16): Likewise.
25229 (vmovl_high_u32): Likewise.
25231 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25233 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
25235 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
25237 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
25239 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
25241 (vabdl_s16): Likewise.
25242 (vabdl_s32): Likewise.
25243 (vabdl_u8): Likewise.
25244 (vabdl_u16): Likewise.
25245 (vabdl_u32): Likewise.
25246 * config/aarch64/iterators.md (ABDL): New int iterator.
25247 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
25249 2021-02-01 Martin Sebor <msebor@redhat.com>
25251 * tree.h (BLOCK_VARS): Add comment.
25252 (BLOCK_SUBBLOCKS): Same.
25253 (BLOCK_SUPERCONTEXT): Same.
25254 (BLOCK_ABSTRACT_ORIGIN): Same.
25255 (inlined_function_outer_scope_p): Same.
25257 2021-02-01 Martin Sebor <msebor@redhat.com>
25259 PR middle-end/97172
25260 * attribs.c (attr_access::free_lang_data): Define new function.
25261 * attribs.h (attr_access::free_lang_data): Declare new function.
25263 2021-02-01 Richard Biener <rguenther@suse.de>
25265 * vec.h (auto_vec::auto_vec): Add memory stat parameters
25267 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
25269 2021-02-01 Tamar Christina <tamar.christina@arm.com>
25271 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
25272 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
25274 2021-02-01 Richard Biener <rguenther@suse.de>
25276 PR rtl-optimization/98863
25277 * config/i386/i386-features.c (convert_scalars_to_vector):
25278 Set DF_RD_PRUNE_DEAD_DEFS.
25280 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
25282 * system.h (SIZE_MAX): Define if not already defined.
25284 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
25286 * config/rs6000/genfusion.pl (gen_2logical): New function to
25287 generate patterns for logical-logical fusion.
25288 * config/rs6000/fusion.md: Regenerated patterns.
25289 * config/rs6000/rs6000-cpus.def: Add
25290 OPTION_MASK_P10_FUSION_2LOGICAL.
25291 * config/rs6000/rs6000.c (rs6000_option_override_internal):
25292 Enable logical-logical fusion for p10.
25293 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
25295 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
25297 * config/rs6000/rs6000.opt: Add periods to new AIX options.
25299 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
25301 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
25302 (mabi=vec-default): New.
25303 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
25304 __EXTABI__ for AIX Vector extended ABI.
25305 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
25307 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
25309 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
25311 2021-01-30 Jakub Jelinek <jakub@redhat.com>
25313 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
25314 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
25316 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
25319 * lra-constraints.c (in_class_p): Don't narrow class only for REG
25322 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
25324 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
25325 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
25326 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
25328 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
25330 PR tree-optimization/98866
25331 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
25332 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
25333 (gori_map::m_maybe_invariant): Rename from all_outgoing.
25334 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
25335 (gori_map::is_export_p): Ditto.
25336 (gori_map::calculate_gori): Ditto.
25337 (gori_compute::set_range_invariant): New.
25338 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
25339 invariant for pointers evaluating to [1, +INF].
25341 2021-01-29 Richard Biener <rguenther@suse.de>
25343 PR rtl-optimization/98863
25344 * config/i386/i386-features.c (remove_partial_avx_dependency):
25345 Do not perform DF analysis.
25346 (pass_data_remove_partial_avx_dependency): Remove
25349 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
25351 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
25352 builtin generator macros.
25353 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
25355 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
25356 instead of inline asm.
25357 (vmull_n_s32): Likewise.
25358 (vmull_n_u16): Likewise.
25359 (vmull_n_u32): Likewise.
25361 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25363 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
25365 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
25367 (aarch64_<sur>abdl2<mode>): ... This.
25368 (<sur>sadv16qi): Adjust use of above.
25369 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
25371 (vabdl_high_s16): Likewise.
25372 (vabdl_high_s32): Likewise.
25373 (vabdl_high_u8): Likewise.
25374 (vabdl_high_u16): Likewise.
25375 (vabdl_high_u32): Likewise.
25377 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25379 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
25381 (uabal2): Likewise.
25382 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
25384 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
25386 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
25388 (vabal_high_s16): Likewise.
25389 (vabal_high_s32): Likewise.
25390 (vabal_high_u8): Likewise.
25391 (vabal_high_u16): Likewise.
25392 (vabal_high_u32): Likewise.
25393 * config/aarch64/iterators.md (ABAL2): New mode iterator.
25394 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
25396 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25398 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
25401 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
25403 (aarch64_<sur>abal<mode>): ... This
25404 (<sur>sadv16qi): Adust use of the above.
25405 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
25407 (vabal_s16): Likewise.
25408 (vabal_s32): Likewise.
25409 (vabal_u8): Likewise.
25410 (vabal_u16): Likewise.
25411 (vabal_u32): Likewise.
25413 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25415 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
25417 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
25419 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
25421 (vaddlv_s16): Likewise.
25422 (vaddlv_u8): Likewise.
25423 (vaddlv_u16): Likewise.
25424 (vaddlvq_s8): Likewise.
25425 (vaddlvq_s16): Likewise.
25426 (vaddlvq_s32): Likewise.
25427 (vaddlvq_u8): Likewise.
25428 (vaddlvq_u16): Likewise.
25429 (vaddlvq_u32): Likewise.
25430 (vaddlv_s32): Likewise.
25431 (vaddlv_u32): Likewise.
25432 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
25433 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
25434 (Vwstype): New mode attribute.
25436 (VWIDE_S): Likewise.
25437 (USADDLV): New int iterator.
25438 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
25440 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
25442 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
25443 builtin generator macros.
25444 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
25446 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
25447 instead of inline asm.
25448 (vmlsl_lane_s32): Likewise.
25449 (vmlsl_lane_u16): Likewise.
25450 (vmlsl_lane_u32): Likewise.
25451 (vmlsl_laneq_s16): Likewise.
25452 (vmlsl_laneq_s32): Likewise.
25453 (vmlsl_laneq_u16): Likewise.
25454 (vmlsl_laneq_u32): Likewise.
25456 2021-01-29 Richard Biener <rguenther@suse.de>
25458 * doc/invoke.texi (--param max-gcse-memory): Document unit
25460 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
25461 * params.opt (--param max-gcse-memory): Adjust default and
25462 document unit of size.
25464 2021-01-29 Richard Biener <rguenther@suse.de>
25466 PR rtl-optimization/98863
25467 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
25468 HOST_WIDE_INT for the memory estimate.
25470 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
25471 Richard Biener <rguenther@suse.de>
25473 PR tree-optimization/97627
25474 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
25475 Do not analyze fake edges.
25477 2021-01-29 Richard Biener <rguenther@suse.de>
25479 PR rtl-optimization/98144
25480 * df.h (df_mir_bb_info): Add con_visited member.
25481 * df-problems.c (df_mir_alloc): Initialize con_visited,
25482 do not fully populate IN and OUT.
25483 (df_mir_reset): Likewise.
25484 (df_mir_confluence_0): Set con_visited.
25485 (df_mir_confluence_n): Properly handle implicitely
25486 fully populated IN and OUT as designated by con_visited
25487 and update con_visited accordingly.
25489 2021-01-29 Jakub Jelinek <jakub@redhat.com>
25492 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
25493 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
25494 && !TARGET_REALLY_IWMMXT to conditions.
25496 2021-01-29 Jakub Jelinek <jakub@redhat.com>
25499 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
25502 2021-01-28 Marek Polacek <polacek@redhat.com>
25505 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
25506 the main variant, maybe reset it in its variants too.
25507 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
25508 (check_aligned_type): Check if TYPE_USER_ALIGN match.
25510 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
25513 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
25514 of constant zero for comparisons.
25516 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
25518 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
25519 support for mapping built-in function names for long double
25520 built-in functions if long double is IEEE 128-bit.
25522 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
25524 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
25525 builtin generator macros.
25526 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
25528 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
25529 instead of inline asm.
25530 (vmlsl_n_s32): Likewise.
25531 (vmlsl_n_u16): Likewise.
25532 (vmlsl_n_u32): Likewise.
25534 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
25536 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
25537 builtin generator macros.
25538 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
25540 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
25541 instead of inline asm.
25542 (vmlal_n_s32): Likewise.
25543 (vmlal_n_u16): Likewise.
25544 (vmlal_n_u32): Likewise.
25546 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25548 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
25550 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
25552 (aarch64_shrn2<mode>_insn_be): Likewise.
25553 (aarch64_shrn2<mode>): Likewise.
25554 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
25556 (vshrn_high_n_s32): Likewise.
25557 (vshrn_high_n_s64): Likewise.
25558 (vshrn_high_n_u16): Likewise.
25559 (vshrn_high_n_u32): Likewise.
25560 (vshrn_high_n_u64): Likewise.
25562 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25564 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
25566 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
25568 (aarch64_shrn<mode>_insn_be): Likewise.
25569 (aarch64_shrn<mode>): Likewise.
25570 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
25572 (vshrn_n_s32): Likewise.
25573 (vshrn_n_s64): Likewise.
25574 (vshrn_n_u16): Likewise.
25575 (vshrn_n_u32): Likewise.
25576 (vshrn_n_u64): Likewise.
25577 * config/aarch64/iterators.md (vn_mode): New mode attribute.
25579 2021-01-28 Richard Biener <rguenther@suse.de>
25581 PR rtl-optimization/80960
25582 * dse.c (check_mem_read_rtx): Call get_addr on the
25585 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
25586 David Edelsohn <dje.gcc@gmail.com>
25589 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
25590 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
25592 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
25594 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
25595 wrapper call rs6000_expand_vector_set_var for cleanup. Call
25596 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
25598 (rs6000_expand_vector_set_var): Delete.
25599 (rs6000_expand_vector_set_var_p9): Make static.
25600 (rs6000_expand_vector_set_var_p8): Make static.
25602 2021-01-28 Xing GUO <higuoxing@gmail.com>
25604 * common/config/riscv/riscv-common.c
25605 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
25606 when `p` extension exists.
25608 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
25610 PR rtl-optimization/97684
25611 * ira.c (ira): Call ira_set_pseudo_classes before
25612 update_equiv_regs when it is necessary.
25614 2021-01-27 Jakub Jelinek <jakub@redhat.com>
25617 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
25618 %w0, %w1 and %2 instead of %0, %1 and %2.
25620 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
25622 * config/rs6000/genfusion.pl: New script to generate
25623 define_insn_and_split patterns so combine can arrange fused
25624 instructions next to each other.
25625 * config/rs6000/fusion.md: New file, generated fused instruction
25626 patterns for combine.
25627 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
25628 (non_update_memory_operand): New predicate.
25629 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
25630 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
25632 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
25634 * config/rs6000/rs6000.c (rs6000_option_override_internal):
25635 Automatically set OPTION_MASK_P10_FUSION and
25636 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
25637 (rs600_opt_masks): Allow -mpower10-fusion
25638 in function attributes.
25639 (address_is_non_pfx_d_or_x): New function.
25640 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
25641 * config/rs6000/rs6000.md: Include fusion.md.
25642 * config/rs6000/rs6000.opt: Add -mpower10-fusion
25643 and -mpower10-fusion-ld-cmpi.
25644 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
25646 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
25648 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
25649 builtin generator macros.
25650 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
25652 (aarch64_<su>mlal<mode>): This.
25653 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
25654 instead of inline asm.
25655 (vmlal_s16): Likewise.
25656 (vmlal_s32): Likewise.
25657 (vmlal_u8): Likewise.
25658 (vmlal_u16): Likewise.
25659 (vmlal_u32): Likewise.
25661 2021-01-27 Richard Biener <rguenther@suse.de>
25663 PR tree-optimization/98854
25664 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
25665 PHIs from scalars when the number of CTORs matches the
25666 number of children.
25668 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
25670 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
25672 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
25674 (aarch64_mls_n<mode>): This.
25675 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
25677 (vmls_n_s32): Likewise.
25678 (vmls_n_u16): Likewise.
25679 (vmls_n_u32): Likewise.
25680 (vmlsq_n_s16): Likewise.
25681 (vmlsq_n_s32): Likewise.
25682 (vmlsq_n_u16): Likewise.
25683 (vmlsq_n_u32): Likewise.
25685 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
25687 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
25689 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
25691 (vmls_s16): Likewise.
25692 (vmls_s32): Likewise.
25693 (vmls_u8): Likewise.
25694 (vmls_u16): Likewise.
25695 (vmls_u32): Likewise.
25696 (vmlsq_s8): Likewise.
25697 (vmlsq_s16): Likewise.
25698 (vmlsq_s32): Likewise.
25699 (vmlsq_u8): Likewise.
25700 (vmlsq_u16): Likewise.
25701 (vmlsq_u32): Likewise.
25703 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
25705 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
25707 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
25709 (aarch64_mla_n<mode>): This.
25710 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
25712 (vmla_n_s32): Likewise.
25713 (vmla_n_u16): Likewise.
25714 (vmla_n_u32): Likewise.
25715 (vmlaq_n_s16): Likewise.
25716 (vmlaq_n_s32): Likewise.
25717 (vmlaq_n_u16): Likewise.
25718 (vmlaq_n_u32): Likewise.
25720 2021-01-27 liuhongt <hongtao.liu@intel.com>
25723 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
25724 (*sse2_eq<mode>3): Ditto.
25726 2021-01-27 Jakub Jelinek <jakub@redhat.com>
25728 * tree-pass.h (PROP_trees): Rename to ...
25729 (PROP_gimple): ... this.
25730 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
25731 * passes.c (execute_function_dump, execute_function_todo,
25732 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
25733 * varpool.c (ctor_for_folding): Likewise.
25735 2021-01-27 Jakub Jelinek <jakub@redhat.com>
25737 PR tree-optimization/97260
25738 * varpool.c: Include tree-pass.h.
25739 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
25740 non-TREE_SIDE_EFFECTS automatic variables.
25742 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
25744 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
25746 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
25747 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
25748 (gen_compile_unit_die): Recognise C++23.
25750 2021-01-26 Jakub Jelinek <jakub@redhat.com>
25753 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
25756 2021-01-26 Jakub Jelinek <jakub@redhat.com>
25759 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
25760 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
25761 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
25763 2021-01-26 Richard Biener <rguenther@suse.de>
25765 * gimple-pretty-print.c (dump_binary_rhs): Handle
25766 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
25768 2021-01-26 Richard Biener <rguenther@suse.de>
25770 PR middle-end/98726
25771 * tree.h (vector_cst_int_elt): Remove.
25772 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
25775 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
25777 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
25778 for V64DFmode min/max reductions.
25780 2021-01-26 Jakub Jelinek <jakub@redhat.com>
25782 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
25783 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
25784 two halves, one with x and the other with const0_rtx, ordered
25785 depending on endianity.
25787 2021-01-26 Alexandre Oliva <oliva@adacore.com>
25789 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
25790 temporaries not seen in binding block, and not about to be
25791 added as gimple variables.
25793 2021-01-25 Martin Sebor <msebor@redhat.com>
25796 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
25798 2021-01-25 Martin Liska <mliska@suse.cz>
25800 * value-prof.c (get_nth_most_common_value): Use %s instead
25803 2021-01-25 Jakub Jelinek <jakub@redhat.com>
25806 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
25807 readelf -wi is able to read the emitted .debug_info back.
25808 * configure: Regenerated.
25810 2021-01-25 Martin Liska <mliska@suse.cz>
25812 PR gcov-profile/98739
25813 * common.opt: Add missing sign symbol.
25814 * value-prof.c (get_nth_most_common_value): Restore handling
25815 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
25816 PROFILE_REPRODUCIBILITY_MULTITHREADED.
25818 2021-01-25 Richard Biener <rguenther@suse.de>
25820 PR middle-end/98807
25821 * tree.c (vector_element_bits): Always use precision of
25822 the element type for boolean vectors.
25824 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
25826 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
25827 (ENDFILE_SPEC): Evaluate qnolinkcmds.
25829 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
25831 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
25832 nostartfiles handling since this is already done by
25833 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
25834 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
25835 is already done by LINK_COMMAND_SPEC.
25836 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
25837 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
25840 2021-01-25 Jakub Jelinek <jakub@redhat.com>
25843 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
25844 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
25845 value rather than host size_t.
25846 (fold_const_call): Change type of s2 from size_t to
25847 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
25848 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
25849 instead of s2 as last argument.
25851 2021-01-25 Tamar Christina <tamar.christina@arm.com>
25853 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
25854 VCMLA_OP, VCMUL_OP): New.
25855 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
25856 * config/arm/neon.md (cmul<conj_op><mode>3): New.
25857 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
25858 UNSPEC_VCMUL_CONJ): New.
25859 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
25860 cml<fcmac1><conj_op><mode>4): New.
25862 2021-01-23 Jakub Jelinek <jakub@redhat.com>
25865 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
25867 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
25869 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
25871 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
25873 (vmla_s16): Likewise.
25874 (vmla_s32): Likewise.
25875 (vmla_u8): Likewise.
25876 (vmla_u16): Likewise.
25877 (vmla_u32): Likewise.
25878 (vmlaq_s8): Likewise.
25879 (vmlaq_s16): Likewise.
25880 (vmlaq_s32): Likewise.
25881 (vmlaq_u8): Likewise.
25882 (vmlaq_u16): Likewise.
25883 (vmlaq_u32): Likewise.
25885 2021-01-22 David Malcolm <dmalcolm@redhat.com>
25887 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
25890 2021-01-22 Jakub Jelinek <jakub@redhat.com>
25893 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
25894 filenames to emit, still emit the required 0 index directory and
25895 filename entries that match DW_AT_comp_dir and DW_AT_name of the
25898 2021-01-22 Marek Polacek <polacek@redhat.com>
25901 * doc/invoke.texi: Update C++ ABI Version 15 description.
25903 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25905 PR tree-optimization/98766
25906 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
25907 comparing against type size with param_avoid_fma_max_bits.
25909 2021-01-22 Richard Biener <rguenther@suse.de>
25911 PR middle-end/98793
25912 * tree.c (vector_element_bits): Key single-bit bool vector on
25913 integer mode rather than not vector mode.
25915 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
25918 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
25919 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
25921 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
25922 to call different path for P8 and P9.
25923 (rs6000_expand_vector_set_var_p9): New function.
25924 (rs6000_expand_vector_set_var_p8): New function.
25926 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
25930 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
25931 Ajdust variable index vec_insert from address dereference to
25932 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
25933 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
25935 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
25937 2021-01-22 Martin Liska <mliska@suse.cz>
25939 PR gcov-profile/98739
25940 * profile.c (compute_value_histograms): Drop time profile for
25941 -fprofile-reproducible=multithreaded.
25943 2021-01-22 Nathan Sidwell <nathan@acm.org>
25945 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
25948 2021-01-22 Richard Biener <rguenther@suse.de>
25950 PR middle-end/98773
25951 * tree-data-ref.c (initalize_matrix_A): Revert previous
25952 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
25954 2021-01-22 Jakub Jelinek <jakub@redhat.com>
25956 PR tree-optimization/90248
25957 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
25958 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
25960 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
25961 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
25963 2021-01-22 Jakub Jelinek <jakub@redhat.com>
25965 PR tree-optimization/98255
25966 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
25967 extend index - low_bound from sizetype's precision rather than index
25969 (get_addr_base_and_unit_offset_1): Likewise.
25970 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
25971 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
25973 2021-01-22 Richard Biener <rguenther@suse.de>
25975 PR tree-optimization/98786
25976 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
25977 adding new uses of abnormals. Verify we deal with a conditional
25980 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
25983 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
25985 2021-01-22 liuhongt <hongtao.liu@intel.com>
25989 * config/i386/sse.md (VI_128_256): New mode iterator.
25990 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
25991 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
25992 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
25993 define_insn_and_split to lower avx512 vector comparison to avx
25994 version when dest is vector.
25995 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
25996 define_insn_and_split for negating the comparison result.
25997 * config/i386/predicates.md (float_vector_all_ones_operand):
25999 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
26000 general NOT operator without UNSPEC_MASKOP.
26002 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
26004 PR rtl-optimization/98777
26005 * lra-int.h (lra_pmode_pseudo): New extern.
26006 * lra.c (lra_pmode_pseudo): New global.
26008 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
26010 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
26012 * fwprop.c (fwprop_propagation::classify_result): Allow
26013 (subreg (mem)) simplifications.
26015 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26017 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
26019 (aarch64_sqdmlal<mode>): ... This...
26020 (aarch64_sqdmlsl<mode>): ... And this.
26021 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
26022 (aarch64_sqdmlal_lane<mode>): ... This...
26023 (aarch64_sqdmlsl_lane<mode>): ... And this.
26024 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
26025 (aarch64_sqdmlsl_laneq<mode>): ... This...
26026 (aarch64_sqdmlal_laneq<mode>): ... And this.
26027 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
26028 (aarch64_sqdmlsl_n<mode>): ... This...
26029 (aarch64_sqdmlal_n<mode>): ... And this.
26030 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
26031 (aarch64_sqdmlal2<mode>_internal): ... This...
26032 (aarch64_sqdmlsl2<mode>_internal): ... And this.
26034 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
26036 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
26038 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
26041 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
26043 2021-01-21 liuhongt <hongtao.liu@intel.com>
26045 PR rtl-optimization/98694
26046 * regcprop.c (copy_value): If SRC had been assigned a mode
26047 narrower than the copy, we can't link DEST into the chain even
26048 they have same hard_regno_nregs(i.e. HImode/SImode in i386
26051 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26053 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
26054 Convert to define_insn_and_split. Split into simple move when moving
26057 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
26059 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
26060 Adjust comment. Simplify code.
26062 2021-01-20 Jakub Jelinek <jakub@redhat.com>
26065 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
26066 with DW_FORM_line_strp form.
26067 (prune_unused_types_update_strings): Don't add into debug_str_hash
26068 indirect strings with DW_FORM_line_strp form.
26069 (adjust_name_comp_dir): New function.
26070 (dwarf2out_finish): Call it on CU DIEs after resetting
26071 debug_line_str_hash.
26073 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
26075 PR rtl-optimization/98722
26076 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
26077 has no 3-op add insn to transform insns containing two pluses.
26079 2021-01-20 Richard Biener <rguenther@suse.de>
26081 * hwint.h (add_hwi): New function.
26082 (mul_hwi): Likewise.
26083 * tree-data-ref.c (initialize_matrix_A): Properly translate
26084 tree constants and avoid HOST_WIDE_INT_MIN.
26085 (lambda_matrix_row_add): Avoid undefined integer overflow
26086 and return true on such overflow.
26087 (lambda_matrix_right_hermite): Handle overflow from
26088 lambda_matrix_row_add gracefully. Simplify previous fix.
26089 (analyze_subscript_affine_affine): Likewise.
26091 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
26093 PR tree-optimization/96674
26094 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
26095 x >= y && y != XXX_MIN --> x > y - 1
26097 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
26099 PR tree-optimization/98535
26100 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
26101 If the high and low permutes are the same, remove the high permutes
26102 from the working set and only continue with the low ones.
26104 2021-01-20 Jakub Jelinek <jakub@redhat.com>
26106 PR tree-optimization/98721
26107 * builtins.c (access_ref::inform_access): Don't assume
26108 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
26109 object whenever allocfn is NULL, rather than only when DECL_P
26110 is true. Use %qE instead of %qD for that. Formatting fixes.
26112 2021-01-20 Richard Biener <rguenther@suse.de>
26114 PR tree-optimization/98758
26115 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
26116 (lambda_matrix_right_hermite): Avoid undefinedness with
26117 signed integer abs and multiplication.
26118 (analyze_subscript_affine_affine): Use lambda_int.
26120 2021-01-20 David Malcolm <dmalcolm@redhat.com>
26123 * dwarf2out.c (output_line_info): Rename static variable
26124 "generation", moving it out of the function to...
26125 (output_line_info_generation): New.
26126 (init_sections_and_labels): Likewise, renaming the variable to...
26127 (init_sections_and_labels_generation): New.
26128 (dwarf2out_c_finalize): Reset the new variables.
26130 2021-01-19 Martin Sebor <msebor@redhat.com>
26132 PR middle-end/98664
26133 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
26134 all functions, even if they're not declared artificial or inline.
26135 * tree.c (tree_inlined_location): Use macro expansion location
26136 only if scope traversal fails to expose one.
26138 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
26140 PR rtl-optimization/92294
26141 * alias.c (compare_base_symbol_refs): Take an extra parameter
26142 and add the distance between two symbols to it. Enshrine in
26143 comments that -1 means "either 0 or 1, but we can't tell
26144 which at compile time".
26145 (memrefs_conflict_p): Update call accordingly.
26146 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
26149 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26151 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
26152 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
26153 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
26154 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
26155 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
26156 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
26157 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
26158 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
26159 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
26160 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
26161 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
26162 sqmovun): Use NONE flags.
26164 2021-01-19 Richard Biener <rguenther@suse.de>
26167 * ipa-modref.c (analyze_stmt): Only record a summary for a
26170 2021-01-19 Richard Biener <rguenther@suse.de>
26172 PR middle-end/98638
26173 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
26175 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
26177 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
26178 built-in define __FIX_LEON3FT_TN0018.
26180 2021-01-19 Richard Biener <rguenther@suse.de>
26183 * tree-inline.c (tree_function_versioning): Set input_location
26184 to UNKNOWN_LOCATION throughout the function.
26186 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
26189 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
26191 2021-01-19 Martin Jambor <mjambor@suse.cz>
26194 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
26195 whether non-call exceptions allow removal of a statement.
26196 (isra_analyze_call): Pass the appropriate function to
26197 ssa_name_only_returned_p.
26199 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
26201 * config/riscv/arch-canonicalize (longext_sort): New function for
26202 sorting 'multi-letter'.
26203 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
26204 'alts'. The 'arch' may not be the first of 'alts'.
26205 (_expand_combination): Add underline for the 'ext' without '*'.
26206 This is because, a single-letter extension can always be treated well
26207 with a '_' prefix, but it cannot be separated out if it is appended
26210 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
26213 * ira.c (ira): Skip abnormal critical edge splitting.
26215 2021-01-18 Jakub Jelinek <jakub@redhat.com>
26217 PR tree-optimization/98727
26218 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
26219 second .MUL_OVERFLOW operand for signed multiplication with overflow
26220 checking if the second operand of multiplication is not constant.
26222 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
26224 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
26225 defaults to version 4.
26227 2021-01-18 David Malcolm <dmalcolm@redhat.com>
26229 * attribs.h (fndecl_dealloc_argno): New decl.
26230 * builtins.c (call_dealloc_argno): Split out second half of
26232 (fndecl_dealloc_argno): New.
26233 * doc/extend.texi (Common Function Attributes): Document the
26234 interaction between the analyzer and the malloc attribute.
26235 * doc/invoke.texi (Static Analyzer Options): Likewise.
26237 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
26239 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
26240 dwarf_version to 4.
26241 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
26243 2021-01-17 Martin Jambor <mjambor@suse.cz>
26246 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
26249 2021-01-17 Mark Wielaard <mark@klomp.org>
26251 * common.opt (gdwarf-): Init(5).
26252 * doc/invoke.texi (-gdwarf): Document default to 5.
26254 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
26256 * builtin-types.def
26257 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
26259 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
26260 ...this. Add extra argument.
26261 * gimplify.c (omp_default_clause): Ensure that event handle is
26262 firstprivate in a task region.
26263 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
26264 (gimplify_adjust_omp_clauses): Likewise.
26265 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
26266 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
26267 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
26268 if detach clause specified. Add detach argument when generating
26270 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
26272 (finish_taskreg_scan): Move field for variable containing the event
26273 handle to the front of the struct.
26274 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
26276 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
26277 OMP_CLAUSE_DETACH clause.
26278 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
26279 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
26280 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
26282 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
26284 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
26286 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
26288 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
26289 multilib directory names. Use MULTILIB_REQUIRED instead of
26290 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
26291 -mcustom-fpu-cfg=fph2 multilib.
26293 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
26295 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
26296 (nios2_init_fpu_configs): Provide register values for new
26297 -mcustom-fpu-cfg=fph2 option variant.
26298 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
26301 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
26303 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
26304 custom instruction warnings.
26306 2021-01-16 Jakub Jelinek <jakub@redhat.com>
26308 PR tree-optimization/96669
26309 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
26311 2021-01-16 Jakub Jelinek <jakub@redhat.com>
26313 PR tree-optimization/96271
26314 * passes.def: Pass false argument to first two pass_cd_dce
26315 instances and true to last instance. Add comment that
26316 last instance rewrites no longer addressed locals.
26317 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
26319 (pass_cd_dce::set_pass_param): New method.
26320 (pass_cd_dce::execute): Return TODO_update_address_taken from
26321 last cd_dce instance.
26323 2021-01-15 Carl Love <cel@us.ibm.com>
26325 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
26327 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
26328 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
26329 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
26330 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
26331 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
26332 Add builtin define.
26333 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
26334 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
26335 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
26336 New overloaded definitions.
26337 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
26338 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
26339 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
26340 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
26341 P10V_BUILTIN_MULHU_V4SI]: Add case
26342 statement for builtins.
26343 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
26344 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
26345 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
26346 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
26347 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
26348 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
26349 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
26350 Add define_insn, mode is VIlong.
26351 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
26352 Add builtin descriptions.
26354 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
26356 * final.c (final_start_function_1): Reset force_source_line.
26358 2021-01-15 Jakub Jelinek <jakub@redhat.com>
26360 PR tree-optimization/96669
26361 * match.pd (((1 << A) & 1) != 0 -> A == 0,
26362 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
26363 possibly different power of two constants and to right shift too.
26365 2021-01-15 Jakub Jelinek <jakub@redhat.com>
26367 PR tree-optimization/96681
26368 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
26369 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
26370 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
26371 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
26373 2021-01-15 Alexandre Oliva <oliva@adacore.com>
26375 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
26377 2021-01-15 Tamar Christina <tamar.christina@arm.com>
26379 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
26380 cmul<conj_op><mode>3): New.
26381 * config/aarch64/iterators.md (UNSPEC_FCMUL,
26382 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
26383 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
26384 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
26385 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
26386 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
26387 (rot_op): Renamed to conj_op.
26388 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
26389 cmul<conj_op><mode>3): New.
26390 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
26391 cmul<conj_op><mode>3): New.
26393 2021-01-15 David Malcolm <dmalcolm@redhat.com>
26397 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
26398 Escape the tempfile name when constructing the expected output.
26400 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26402 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
26404 (aarch64_<su>mlsl_hi<mode>): ... This.
26405 (aarch64_<su>mlsl_hi<mode>): Define.
26406 (*aarch64_<su>mlsl<mode): Rename to...
26407 (aarch64_<su>mlsl<mode): ... This.
26408 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
26409 smlsl_hi, umlsl_hi): Define builtins.
26410 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
26411 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
26412 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
26413 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
26415 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
26417 * config/i386/i386-c.c (ix86_target_macros):
26418 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
26420 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
26423 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
26424 * Makefile.in (RTL_SSA_H): New variable.
26425 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
26426 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
26427 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
26429 * config/aarch64/aarch64-cc-fusion.cc: New file.
26431 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
26433 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
26434 calling cancel_changes for changes that no longer exist.
26436 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
26438 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
26439 (function_info::reg_defs): ...this.
26440 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
26441 (function_info::reg_defs): ...this.
26443 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26446 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
26448 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26451 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26454 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
26456 2021-01-15 Richard Biener <rguenther@suse.de>
26458 PR tree-optimization/96376
26459 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
26460 for VMAT_INVARIANT.
26462 2021-01-15 Martin Liska <mliska@suse.cz>
26464 * doc/install.texi: Document that some tests need pytest module.
26465 * doc/sourcebuild.texi: Likewise.
26467 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26470 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
26472 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26474 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
26475 (mve_vshrq_n_u<mode>_imm): Likewise.
26476 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
26477 * config/arm/vec-common.md: ... here.
26479 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26481 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
26483 * config/arm/neon.md (vashl<mode>3): Delete.
26484 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
26485 (vasl<mode>3): New expander.
26487 2021-01-15 Richard Biener <rguenther@suse.de>
26489 PR tree-optimization/98685
26490 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
26491 of vector extern defs.
26493 2021-01-14 David Malcolm <dmalcolm@redhat.com>
26496 * diagnostic.c (diagnostic_kind_text): Break out this array
26498 (diagnostic_build_prefix): ...here.
26499 (fancy_abort): Detect when diagnostic_initialize has not yet been
26500 called and fall back to a minimal implementation of printing the
26501 ICE, rather than segfaulting in internal_error.
26503 2021-01-14 David Malcolm <dmalcolm@redhat.com>
26505 * diagnostic.c (diagnostic_initialize): Eliminate
26506 parseable_fixits_p in favor of initializing extra_output_kind from
26507 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
26508 (convert_column_unit): New function, split out from...
26509 (diagnostic_converted_column): ...this.
26510 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
26511 Use them to call convert_column_unit on the column values.
26512 (diagnostic_report_diagnostic): Eliminate conditional on
26513 parseable_fixits_p in favor of a switch statement on
26514 extra_output_kind, passing the appropriate values to the new
26515 params of print_parseable_fixits.
26516 (selftest::test_print_parseable_fixits_none): Update for new
26517 params of print_parseable_fixits.
26518 (selftest::test_print_parseable_fixits_insert): Likewise.
26519 (selftest::test_print_parseable_fixits_remove): Likewise.
26520 (selftest::test_print_parseable_fixits_replace): Likewise.
26521 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
26523 (selftest::diagnostic_c_tests): Call it.
26524 * diagnostic.h (enum diagnostics_extra_output_kind): New.
26525 (diagnostic_context::parseable_fixits_p): Delete field in favor
26527 (diagnostic_context::extra_output_kind): ...this new field.
26528 * doc/invoke.texi (Environment Variables): Add
26529 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
26530 * opts.c (common_handle_option): Update handling of
26531 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
26534 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26536 * tree-vect-slp-patterns.c (class complex_operations_pattern,
26537 complex_operations_pattern::matches,
26538 complex_operations_pattern::recognize,
26539 complex_operations_pattern::build): New.
26540 (slp_patterns): Use it.
26542 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26544 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
26545 * optabs.def (cmls_optab, cmls_conj_optab): New.
26546 * doc/md.texi: Document them.
26547 * tree-vect-slp-patterns.c (class complex_fms_pattern,
26548 complex_fms_pattern::matches, complex_fms_pattern::recognize,
26549 complex_fms_pattern::build): New.
26551 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26553 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
26554 * optabs.def (cmla_optab, cmla_conj_optab): New.
26555 * doc/md.texi: Document them.
26556 * tree-vect-slp-patterns.c (vect_match_call_p,
26557 class complex_fma_pattern, vect_slp_reset_pattern,
26558 complex_fma_pattern::matches, complex_fma_pattern::recognize,
26559 complex_fma_pattern::build): New.
26561 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26563 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
26564 * optabs.def (cmul_optab, cmul_conj_optab): New.
26565 * doc/md.texi: Document them.
26566 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
26567 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
26568 vect_build_combine_node, class complex_mul_pattern,
26569 complex_mul_pattern::matches, complex_mul_pattern::recognize,
26570 complex_mul_pattern::build): New.
26572 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26574 * tree-vect-slp.c (optimize_load_redistribution_1): New.
26575 (optimize_load_redistribution, vect_is_slp_load_node): New.
26576 (vect_match_slp_patterns): Use it.
26578 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26580 * tree-vect-slp-patterns.c (complex_add_pattern::build):
26583 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
26585 * config/gcn/mkoffload.c (main): Create an offload image only in
26586 64-bit configurations.
26588 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
26591 * config/i386/i386-options.c (ix86_option_override_internal):
26592 Issue an error for -fcf-protection with CF_BRANCH when compiling
26593 for 32-bit non-TARGET_CMOV targets.
26595 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
26598 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
26599 Remove declaration and initialization of shadow variable "ret".
26600 (ix86_option_override_internal): Remove delcaration of
26601 shadow variable "i". Redeclare shadowed variable to unsigned.
26602 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
26603 * config/i386/i386-builtins.c (get_builtin_code_for_version):
26604 Update for redeclaration.
26605 * config/i386/i386.h (pta_size): Ditto.
26607 2021-01-14 Richard Biener <rguenther@suse.de>
26609 PR tree-optimization/98674
26610 * tree-data-ref.c (base_supports_access_fn_components_p): New.
26611 (initialize_data_dependence_relation): For two bases without
26612 possible access fns resort to type size equality when determining
26613 shape compatibility.
26615 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
26618 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
26619 <=, >= operators in vcle and vcge intrinsics respectively.
26620 * config/arm/arm_neon_builtins.def: Remove entry for
26623 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
26626 * config/i386/i386-options.c (ix86_function_specific_save):
26627 Remove redundant assignment to opts->x_ix86_branch_cost.
26628 * config/i386/i386.c (ix86_prefetch_sse):
26629 Rename from x86_prefetch_sse. Update all uses.
26630 * config/i386/i386.h: Update for rename.
26631 * config/i386/i386-options.h: Ditto.
26633 2021-01-14 Jakub Jelinek <jakub@redhat.com>
26636 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
26637 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
26638 Use Bm instead of m for non-avx. Add isa attribute.
26640 2021-01-14 Jakub Jelinek <jakub@redhat.com>
26642 PR tree-optimization/96688
26643 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
26644 ~X can be simplified.
26646 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
26648 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
26649 IFN_LOAD_LANES results.
26651 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26653 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
26655 (aarch64_xtn<mode>): Likewise.
26656 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
26659 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
26661 (vmovl_s16): Likewise.
26662 (vmovl_s32): Likewise.
26663 (vmovl_u8): Likewise.
26664 (vmovl_u16): Likewise.
26665 (vmovl_u32): Likewise.
26666 (vmovn_s16): Likewise.
26667 (vmovn_s32): Likewise.
26668 (vmovn_s64): Likewise.
26669 (vmovn_u16): Likewise.
26670 (vmovn_u32): Likewise.
26671 (vmovn_u64): Likewise.
26673 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26675 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
26677 (aarch64_<su>qxtn2<mode>_be): Likewise.
26678 (aarch64_<su>qxtn2<mode>): Likewise.
26679 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
26681 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
26682 (su): Handle ss_truncate and us_truncate.
26683 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
26685 (vqmovn_high_s32): Likewise.
26686 (vqmovn_high_s64): Likewise.
26687 (vqmovn_high_u16): Likewise.
26688 (vqmovn_high_u32): Likewise.
26689 (vqmovn_high_u64): Likewise.
26691 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26693 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
26695 (aarch64_xtn2<mode>_be): Likewise.
26696 (aarch64_xtn2<mode>): Likewise.
26697 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
26699 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
26701 (vmovn_high_s32): Likewise.
26702 (vmovn_high_s64): Likewise.
26703 (vmovn_high_u16): Likewise.
26704 (vmovn_high_u32): Likewise.
26705 (vmovn_high_u64): Likewise.
26707 2021-01-13 Stafford Horne <shorne@gmail.com>
26709 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
26711 2021-01-13 Stafford Horne <shorne@gmail.com>
26713 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
26715 2021-01-13 Stafford Horne <shorne@gmail.com>
26717 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
26718 define for __or1k_hard_float__.
26720 2021-01-13 Stafford Horne <shorne@gmail.com>
26722 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
26723 (PROFILE_HOOK): Define to call _mcount.
26724 (FUNCTION_PROFILER): Change from abort to no-op.
26726 2021-01-13 Jakub Jelinek <jakub@redhat.com>
26728 PR tree-optimization/96691
26729 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
26730 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
26731 (~D ^ C) or (D ^ C) can be simplified.
26733 2021-01-13 Richard Biener <rguenther@suse.de>
26735 PR tree-optimization/92645
26736 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
26737 until after vector lowering.
26739 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
26741 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
26743 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
26744 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
26746 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
26748 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
26750 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
26751 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
26753 2021-01-13 Richard Biener <rguenther@suse.de>
26755 PR tree-optimization/92645
26756 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
26757 BIT_FIELD_REF argument.
26758 (vect_build_slp_tree_2): Record the desired vector type
26759 on the external vector def.
26760 (vectorizable_slp_permutation): Handle required punning
26761 of existing vector defs.
26763 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
26765 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
26767 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
26769 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
26771 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
26773 * config.gcc [$target == *-*-gnu*]: Enable
26774 'default_gnu_indirect_function'.
26776 2021-01-13 Jakub Jelinek <jakub@redhat.com>
26779 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
26780 registers before calling targetm.vectorize.vec_perm_const, only after
26782 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
26783 two argument permutation when one operand is zero vector and only
26784 after that force operands into registers.
26785 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
26786 define_insn_and_split pattern.
26787 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
26788 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
26789 (*avx2_zero_extendv8hiv8si2_1): Likewise.
26790 (*avx512f_zero_extendv8siv8di2_1): Likewise.
26791 (*avx2_zero_extendv4siv4di2_1): Likewise.
26792 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
26794 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
26795 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
26796 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
26797 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
26798 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
26799 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
26801 2021-01-13 Martin Liska <mliska@suse.cz>
26803 PR tree-optimization/98455
26804 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
26805 Record also virtual PHIs.
26806 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
26809 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
26811 * doc/invoke.texi (C++ Modules): Fix typos.
26813 2021-01-13 Richard Biener <rguenther@suse.de>
26815 PR tree-optimization/98640
26816 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
26817 handle plus or minus from a truncated operand to be
26820 2021-01-13 Jakub Jelinek <jakub@redhat.com>
26823 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
26824 define_insn_and_split patterns.
26825 (splitter after *btr<mode>_2): New splitter.
26827 2021-01-13 Martin Liska <mliska@suse.cz>
26830 * cgraphunit.c (analyze_functions): Remove dead code.
26832 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
26834 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
26835 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
26836 (a64fx_regmove_cost, a64fx_vector_cost): New.
26837 (a64fx_tunings): Use the new added cost tables.
26839 2021-01-13 Jakub Jelinek <jakub@redhat.com>
26842 * config/i386/predicates.md (pmovzx_parallel): New predicate.
26843 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
26844 define_insn_and_split pattern.
26845 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
26846 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
26848 2021-01-13 Julian Brown <julian@codesourcery.com>
26850 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
26851 to fix v0 register.
26853 2021-01-13 Julian Brown <julian@codesourcery.com>
26855 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
26858 2021-01-13 Julian Brown <julian@codesourcery.com>
26860 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
26861 for reciprocal-approximation instructions.
26862 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
26863 refinement and division result.
26864 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
26866 2021-01-13 Julian Brown <julian@codesourcery.com>
26868 * config/gcn/gcn-valu.md (subdf): Rename to...
26871 2021-01-12 Martin Liska <mliska@suse.cz>
26873 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
26875 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
26877 * function-abi.h: Fix typo.
26879 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
26883 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
26884 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
26885 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
26886 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
26887 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
26888 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
26889 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
26890 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
26891 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
26892 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
26893 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
26894 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
26895 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
26896 (ARM_HAVE_V2DI_LDST): Likewise.
26897 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
26898 (*movmisalign<mode>_mve_load): New pattern.
26899 * config/arm/neon.md (movmisalign<mode>): Move to ...
26900 * config/arm/vec-common.md: ... here.
26902 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
26905 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
26906 of pattern 'plus (plus (hard reg, const), pseudo)'.
26908 2021-01-12 Richard Biener <rguenther@suse.de>
26910 PR tree-optimization/98550
26911 * tree-vect-slp.c (vect_record_max_nunits): Check whether
26912 the group size is a multiple of the vector element count.
26913 (vect_build_slp_tree_1): When we need to fail because
26914 the vector type choosen causes unrolling do so lazily
26915 without affecting matches only at the end to guide group splitting.
26917 2021-01-12 Martin Liska <mliska@suse.cz>
26920 * optc-save-gen.awk: Compare also n_target_save vars with
26923 2021-01-12 Martin Liska <mliska@suse.cz>
26925 * gcov.c (source_info::debug): New.
26926 (print_usage): Add --debug (-D) option.
26927 (process_args): Likewise.
26928 (generate_results): Call src->debug after
26929 accumulate_line_counts.
26930 (read_graph_file): Properly assign id for EXIT_BLOCK.
26931 * profile.c (branch_prob): Dump function body before it is
26934 2021-01-12 Jakub Jelinek <jakub@redhat.com>
26936 PR tree-optimization/98629
26937 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
26938 unless returning non-zero.
26940 2021-01-12 Jakub Jelinek <jakub@redhat.com>
26942 PR tree-optimization/95731
26943 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
26944 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
26945 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
26946 only after optimize_range_tests_var_bound.
26948 2021-01-12 Jakub Jelinek <jakub@redhat.com>
26950 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
26951 * configure: Regenerated.
26953 2021-01-12 liuhongt <hongtao.liu@intel.com>
26956 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
26958 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
26961 2021-01-12 Alexandre Oliva <oliva@adacore.com>
26963 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
26965 (auto_end_imm_use_stmt_traverse): New struct.
26966 (FOR_EACH_IMM_USE_STMT): Use it.
26967 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
26969 * gimple-ssa-strength-reduction.c: ... here, ...
26970 * graphite-scop-detection.c: ... here, ...
26971 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
26972 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
26973 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
26974 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
26975 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
26976 * tree-vect-slp.c: ... and here, ...
26977 * doc/tree-ssa.texi: ... and the example here.
26979 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
26981 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
26982 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
26983 (*sdiv_pow2<mode>3): New pattern.
26984 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
26985 Wrap the ASRD in an UNSPEC_PRED_X.
26986 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
26987 predicate with a constant PTRUE, if it isn't already.
26988 (*cond_<sve_int_op><mode>_z): Replace with...
26989 (*cond_<sve_int_op><mode>_any): ...this new pattern.
26991 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
26993 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
26994 SVE_FULL_I to SVE_I.
26995 (*cond_bic<mode>_any): Likewise.
26997 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
26999 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
27000 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
27003 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27005 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
27006 SVE_FULL_I to SVE_I.
27007 (*aarch64_cond_<su>abd<mode>_2): Likewise.
27008 (*aarch64_cond_<su>abd<mode>_any): Likewise.
27009 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
27010 for the max and min but not for the minus.
27011 (*aarch64_cond_<su>abd<mode>_3): New pattern.
27013 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27015 * config/aarch64/iterators.md (SVE_24I): New iterator.
27016 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
27017 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
27019 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27021 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
27022 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
27024 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
27025 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
27026 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
27027 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
27029 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27031 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
27032 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
27033 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
27036 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27038 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
27039 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
27040 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
27042 2021-01-11 Martin Liska <mliska@suse.cz>
27045 * symtab-clones.h (clone_info::release): Release
27046 symtab::m_clones with ggc_delete as it's a GGC memory.
27048 2021-01-11 Matthias Klose <doko@ubuntu.com>
27050 * Makefile.in (LINK_PROGRESS): Show the link target.
27052 2021-01-11 Richard Biener <rguenther@suse.de>
27054 PR tree-optimization/91403
27055 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
27056 single-element interleaving group size at 4096 elements.
27058 2021-01-11 Richard Biener <rguenther@suse.de>
27060 PR tree-optimization/98526
27061 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
27062 of the actual reduction op for the regular case.
27063 (vectorizable_reduction): Cost the stmts
27064 vect_transform_reduction produces here.
27066 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
27068 * tree-ssa-forwprop.c (simplify_vector_constructor): For
27069 big-endian, use UNPACK[_FLOAT]_HI.
27071 2021-01-11 Tamar Christina <tamar.christina@arm.com>
27073 * tree-vect-slp-patterns.c (class complex_pattern,
27074 class complex_add_pattern): Add parameters to matches.
27075 (complex_add_pattern::build): Free memory.
27076 (complex_add_pattern::matches): Move validation end of match.
27077 (complex_add_pattern::recognize): Likewise.
27079 2021-01-11 Tamar Christina <tamar.christina@arm.com>
27081 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
27083 2021-01-11 Tamar Christina <tamar.christina@arm.com>
27085 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
27087 2021-01-11 Jakub Jelinek <jakub@redhat.com>
27089 PR tree-optimization/95867
27090 * tree-ssa-math-opts.h: New header.
27091 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
27092 (powi_as_mults): No longer static. Use build_one_cst instead of
27093 build_real. Formatting fix.
27094 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
27095 (attempt_builtin_powi): Handle multiplication reassociation without
27096 powi_fndecl using powi_as_mults.
27097 (reassociate_bb): For integral types don't require
27098 -funsafe-math-optimizations to call attempt_builtin_powi.
27100 2021-01-11 Jakub Jelinek <jakub@redhat.com>
27102 PR tree-optimization/95852
27103 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
27104 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
27105 allow in the bb any of the stmts in that vector, div_stmt and
27106 up to 3 cast stmts.
27107 (arith_cast_equal_p): New function.
27108 (arith_overflow_check_p): Add cast_stmt argument, handle signed
27109 multiply overflow checks.
27110 (match_arith_overflow): Adjust caller. Handle signed multiply
27113 2021-01-11 Jakub Jelinek <jakub@redhat.com>
27115 PR tree-optimization/95852
27116 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
27117 (uaddsub_overflow_check_p): Renamed to ...
27118 (arith_overflow_check_p): ... this. Handle also multiplication
27119 with overflow check.
27120 (match_uaddsub_overflow): Renamed to ...
27121 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
27122 also multiplication with overflow check. Adjust function comment.
27123 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
27124 match_arith_overflow also for MULT_EXPR.
27126 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27128 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
27129 __builtin_convertvector.
27130 (vmovl_s16): Likewise.
27131 (vmovl_s32): Likewise.
27132 (vmovl_u8): Likewise.
27133 (vmovl_u16): Likewise.
27134 (vmovl_u32): Likewise.
27135 (vmovn_s16): Likewise.
27136 (vmovn_s32): Likewise.
27137 (vmovn_s64): Likewise.
27138 (vmovn_u16): Likewise.
27139 (vmovn_u32): Likewise.
27140 (vmovn_u64): Likewise.
27142 2021-01-11 Martin Liska <mliska@suse.cz>
27144 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
27145 (if_chain::is_beneficial): Delete clusters
27146 (find_conditions): Make second argument of conditions_in_bbs a
27147 pointer so that we control over it's lifetime.
27148 (pass_if_to_switch::execute): Delete them.
27150 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
27152 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
27155 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
27157 * config/vax/vax.md (cc): Remove mode attribute.
27158 (subst_<cc>, subst_f<cc>): Rename to...
27159 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
27160 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
27161 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
27162 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
27164 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
27166 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
27167 `const_double_zero'.
27169 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
27171 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
27172 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
27175 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
27177 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
27179 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
27180 with `const_double_zero'.
27181 * doc/rtl.texi (Constant Expression Types): Document it.
27183 2021-01-09 Jakub Jelinek <jakub@redhat.com>
27186 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
27187 POINTER_DIFF_EXPR to be any integral type.
27189 2021-01-09 Jakub Jelinek <jakub@redhat.com>
27191 PR rtl-optimization/98603
27192 * function.c (instantiate_virtual_regs_in_insn): For asm goto
27193 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
27194 if any, set ASM_OPERANDS mode to VOIDmode and change
27195 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
27197 2021-01-09 Alexandre Oliva <oliva@gnu.org>
27200 * final.c (notice_source_line): Narrow down the condition to
27201 skip a line-0 marker.
27203 2021-01-08 Sergei Trofimovich <siarheit@google.com>
27205 * ipa-modref.c (merge_call_side_effects): Fix
27206 linebreak split by reordering two print calls.
27208 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
27210 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
27211 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
27212 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
27213 consistency, fix constraint.
27215 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
27217 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
27218 callables instead of mask values.
27219 (struct target_flag_set_p): New predicate.
27220 (s390_cpu_cpp_builtins_internal): Define or undefine
27221 __LONG_DOUBLE_VX__ macro.
27223 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
27226 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
27227 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
27230 2021-01-08 Richard Biener <rguenther@suse.de>
27232 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
27234 2021-01-08 Richard Biener <rguenther@suse.de>
27236 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
27237 (vect_build_slp_tree): On cache hit release the matched
27238 scalar stmts vector.
27239 * tree-vect-stmts.c (vectorizable_store): Properly free
27240 vec_oprnds before possibly gathering them again.
27242 2021-01-08 Richard Biener <rguenther@suse.de>
27244 PR tree-optimization/98544
27245 * tree-vect-slp.c (vect_optimize_slp): Always materialize
27246 permutes at a permute node.
27248 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
27251 * config/i386/i386.c (x86_function_profiler): Use R10 to call
27252 mcount in large model. Sorry for large model with PIC.
27254 2021-01-08 Jakub Jelinek <jakub@redhat.com>
27257 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
27258 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
27259 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
27260 TargetSave and initialize for variables with enum types.
27261 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
27262 mstack-protector-guard-symbol=): Add Save.
27263 * config/i386/i386-options.c (ix86_function_specific_save,
27264 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
27265 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
27266 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
27267 x_ix86_veclibabi_type.
27269 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
27271 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
27272 SVE_FULL_I to SVE_I.
27273 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
27275 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
27277 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
27278 SVE_FULL_I to SVE_I.
27279 (*cond_uxt<mode>_any): Likewise.
27281 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27283 * config/aarch64/iterators.md (Vwhalf): New iterator.
27284 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
27286 (aarch64_<sur>adalp<mode>): ... This. Make more
27288 (<sur>sadv16qi): Adjust callsite of the above.
27289 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
27291 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
27293 (vpadal_s16): Likewise.
27294 (vpadal_u8): Likewise.
27295 (vpadal_u16): Likewise.
27296 (vpadalq_s8): Likewise.
27297 (vpadalq_s16): Likewise.
27298 (vpadalq_s32): Likewise.
27299 (vpadalq_u8): Likewise.
27300 (vpadalq_u16): Likewise.
27301 (vpadalq_u32): Likewise.
27303 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27305 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
27307 (aarch64_<su>abd<mode>): ... This.
27308 (<sur>sadv16qi): Adjust callsite of the above.
27309 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
27311 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
27313 (vabd_s16): Likewise.
27314 (vabd_s32): Likewise.
27315 (vabd_u8): Likewise.
27316 (vabd_u16): Likewise.
27317 (vabd_u32): Likewise.
27318 (vabdq_s8): Likewise.
27319 (vabdq_s16): Likewise.
27320 (vabdq_s32): Likewise.
27321 (vabdq_u8): Likewise.
27322 (vabdq_u16): Likewise.
27323 (vabdq_u32): Likewise.
27325 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27327 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
27329 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
27330 (vaba_s16): Likewise.
27331 (vaba_s32): Likewise.
27332 (vaba_u8): Likewise.
27333 (vaba_u16): Likewise.
27334 (vaba_u32): Likewise.
27335 (vabaq_s8): Likewise.
27336 (vabaq_s16): Likewise.
27337 (vabaq_s32): Likewise.
27338 (vabaq_u8): Likewise.
27339 (vabaq_u16): Likewise.
27340 (vabaq_u32): Likewise.
27342 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27344 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
27345 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
27346 Change RTL pattern to match.
27348 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
27350 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
27351 * config/riscv/riscv-c.c (riscv-subset.h): New.
27352 (INCLUDE_STRING): Define.
27353 (riscv_cpu_cpp_builtins): Add new style architecture extension
27355 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
27356 (riscv_subset_list::end): New.
27357 (riscv_current_subset_list): New.
27359 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
27361 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
27362 Move to riscv-subset.h.
27363 (struct riscv_subset_t): Ditto.
27364 (class riscv_subset_list): Ditto.
27365 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
27366 from riscv-common.c.
27367 (struct riscv_subset_t): Ditto.
27368 (class riscv_subset_list): Ditto.
27369 * config/riscv/t-riscv ($(common_out_file)): Add file
27372 2021-01-07 Jakub Jelinek <jakub@redhat.com>
27375 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
27376 New define_insn patterns.
27378 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
27380 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
27381 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
27382 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
27384 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
27386 PR tree-optimization/98560
27387 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
27388 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
27390 (vec_cond_direct): Likewise argument 2.
27391 (vec_condu_direct, vec_condeq_direct): Delete.
27392 (expand_vect_cond_optab_fn): Rename to...
27393 (expand_vec_cond_optab_fn): ...this, replacing old macro.
27394 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
27395 (expand_vect_cond_mask_optab_fn): Rename to...
27396 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
27397 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
27399 (direct_vec_cond_optab_supported_p): Likewise.
27400 (direct_vec_condu_optab_supported_p): Delete.
27401 (direct_vec_condeq_optab_supported_p): Delete.
27402 * gimple-isel.cc: Include internal-fn.h.
27403 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
27406 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
27408 PR tree-optimization/98560
27409 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
27410 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
27412 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
27414 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
27415 rotate_insn and optab code attributes.
27416 Update all uses to merged code attribute.
27417 * config/i386/sse.md: Update all uses to merged code attribute.
27418 * config/i386/mmx.md: Update all uses to merged code attribute.
27420 2021-01-07 Jakub Jelinek <jakub@redhat.com>
27422 PR tree-optimization/98568
27423 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
27424 (bswap_replace): Use it.
27426 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
27428 PR rtl-optimization/97978
27429 * lra-int.h (lra_hard_reg_split_p): New external.
27430 * lra.c (lra_hard_reg_split_p): New global.
27431 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
27432 * lra-assigns.c (lra_assign): Don't check allocation correctness
27433 after hard reg splitting.
27435 2021-01-06 Martin Sebor <msebor@redhat.com>
27438 * builtins.c (new_delete_mismatch_p): New overload.
27439 (new_delete_mismatch_p (tree, tree)): Call it.
27441 2021-01-06 Alexandre Oliva <oliva@adacore.com>
27443 * Makefile.in (T_GLIMITS_H): New.
27444 (stmp-int-hdrs): Depend on it, use it.
27445 * config/t-vxworks (T_GLIMITS_H): Override it.
27446 (vxw-glimits.h): New.
27448 2021-01-06 Richard Biener <rguenther@suse.de>
27450 PR tree-optimization/98513
27451 * value-range.cc (intersect_ranges): Compare the upper bounds
27452 for the expected relation.
27454 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
27457 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
27459 * doc/standards.texi (HSAIL): Remove section.
27461 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
27463 * configure: Re-generate.
27465 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27467 * doc/invoke.texi (-std=c++20): Adjust for the publication of
27468 ISO 14882:2020 standard.
27469 * doc/standards.texi: Likewise.
27471 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27473 PR tree-optimization/94802
27474 * expr.h (maybe_optimize_sub_cmp_0): Declare.
27475 * expr.c: Include tree-pretty-print.h and flags.h.
27476 (maybe_optimize_sub_cmp_0): New function.
27477 (do_store_flag): Use it.
27478 * cfgexpand.c (expand_gimple_cond): Likewise.
27480 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27482 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
27483 * rtlanal.c (simple_regno_set): Tweak description to clarify the
27486 2021-01-05 Richard Biener <rguenther@suse.de>
27488 PR tree-optimization/98516
27489 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
27490 lanes when materializing on a VEC_PERM node.
27491 (vectorizable_slp_permutation): Dump the permute properly.
27493 2021-01-05 Richard Biener <rguenther@suse.de>
27495 * tree-vect-slp.c (vect_slp_region): Move debug counter
27496 to cover individual subgraphs.
27498 2021-01-05 Richard Biener <rguenther@suse.de>
27500 PR tree-optimization/98428
27501 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
27502 vector lane extracts for loop vectorization.
27504 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27506 PR tree-optimization/98514
27507 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
27509 (operand_rank): Change type from hash_map<tree, long> to
27510 hash_map<tree, int64_t>.
27511 (phi_rank): Change return type from long to int64_t.
27512 (loop_carried_phi): Change block_rank variable type from long to
27514 (propagate_rank): Change return type, rank parameter type and
27515 op_rank variable type from long to int64_t.
27516 (find_operand_rank): Change return type from long to int64_t
27517 and change slot variable type from long * to int64_t *.
27518 (insert_operand_rank): Change rank parameter type from long to
27520 (get_rank): Change return type and rank variable type from long to
27521 int64_t. Use PRId64 instead of ld to print the rank.
27522 (init_reassoc): Change rank variable type from long to int64_t
27523 and adjust correspondingly bb_rank and operand_rank initialization.
27525 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27527 PR tree-optimization/96928
27528 * tree-ssa-phiopt.c (xor_replacement): New function.
27529 (tree_ssa_phiopt_worker): Call it.
27531 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27533 PR tree-optimization/96930
27534 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
27535 from narrower value which has the same type as 1 << B, perform
27536 the right shift on the narrower value followed by extension.
27538 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27540 PR tree-optimization/96239
27541 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
27543 (get_status_for_store_merging): Don't return BB_INVALID for blocks
27544 with potential bswap optimizable CONSTRUCTORs.
27545 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
27548 2021-01-05 Richard Biener <rguenther@suse.de>
27550 PR tree-optimization/98381
27551 * tree.c (vector_element_bits): Properly compute bool vector
27553 * tree-vect-loop.c (vectorizable_live_operation): Properly
27554 compute the last lane bit offset.
27556 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
27559 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
27560 Clear the top 64 bytes of the input XMM register.
27561 (sse_cvttps2pi): Ditto.
27563 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
27566 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
27568 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
27571 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
27574 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
27576 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
27577 (umaddsidi4_split): Likewise.
27579 2021-01-05 liuhongt <hongtao.liu@intel.com>
27582 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
27583 define_insn_and_split for zero_extend of subreg HI of pmovskb
27585 (*sse2_pmovskb_zexthisi): Add new combine splitters for
27586 zero_extend of not of subreg HI of pmovskb result.
27588 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27591 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
27593 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
27594 convert_memory_address to convert symbolic immediates to ptr_mode
27595 before forcing them to memory.
27597 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27599 PR rtl-optimization/97144
27600 * recog.c (constrain_operands): Initialize matching_operand
27601 for each alternative, rather than only doing it once.
27603 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27605 PR rtl-optimization/98403
27606 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
27607 why we don't remove call clobbers.
27608 (function_info::apply_changes_to_insn): Don't attempt to add
27609 call clobbers here.
27611 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27613 PR tree-optimization/98371
27614 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
27615 (vect_analyze_loop): If an epilogue loop appears to be cheaper
27616 than the main loop, re-analyze it as a main loop before adopting
27619 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
27622 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
27623 * aclocal.m4, configure: Regenerate.
27624 * Makefile.in (NETLIBS): Define.
27625 (BACKEND): Remove $(CODYLIB).
27627 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27629 PR rtl-optimization/98334
27630 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
27631 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
27633 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
27635 * tree-inline.c (expand_call_inline): Restore input_location.
27636 Return result from recursive call.
27638 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
27640 PR tree-optimization/95401
27641 * config/aarch64/aarch64-sve-builtins.cc
27642 (gimple_folder::load_store_cookie): Use bits rather than bytes
27643 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
27644 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
27645 * tree-vect-stmts.c (vectorizable_store): Likewise.
27646 (vectorizable_load): Likewise.
27648 2021-01-04 Richard Biener <rguenther@suse.de>
27650 PR tree-optimization/98308
27651 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
27654 2021-01-04 Jakub Jelinek <jakub@redhat.com>
27656 PR tree-optimization/95771
27657 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
27658 with precision smaller than int's precision and types with precision
27659 twice as large as long long. Formatting fixes.
27661 2021-01-04 Richard Biener <rguenther@suse.de>
27663 PR tree-optimization/98464
27664 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
27665 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
27666 (process_bb): Adjust.
27668 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
27671 * doc/invoke.texi (-fsanitize=address): Fix wording describing
27672 clash with -fsanitize=hwaddress.
27674 2021-01-04 Richard Biener <rguenther@suse.de>
27676 PR tree-optimization/98282
27677 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
27678 invariants as VN_NARY.
27680 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
27683 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
27684 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
27685 to handle zero operands.
27687 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
27689 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
27690 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
27691 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
27694 2021-01-04 Richard Biener <rguenther@suse.de>
27696 PR tree-optimization/98393
27697 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
27698 when hitting the limit.
27700 2021-01-04 Richard Biener <rguenther@suse.de>
27702 PR tree-optimization/98291
27703 * tree-vect-loop.c (vectorizable_reduction): Bypass
27704 associativity check for SLP reductions with VF 1.
27706 2021-01-04 Jakub Jelinek <jakub@redhat.com>
27708 PR tree-optimization/96782
27709 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
27711 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
27713 * collect-utils.c (collect_execute): Check dumppfx.
27714 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
27715 to collect_execute.
27716 (do_link): Add new parameter atsuffix.
27717 (main): Handle -dumpdir option. Skip one argument for
27718 -o, -isystem and -B options.
27719 * gcc.c (make_at_file): New helper function.
27720 (close_at_file): Use it.
27722 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27724 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
27725 Amend handling for LD64_VERSION fallback defaults.
27727 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27729 * config.gcc: Compute default version information
27730 from the configured target. Likewise defaults for
27732 * config/darwin10.h: Removed.
27733 * config/darwin12.h: Removed.
27734 * config/darwin9.h: Removed.
27735 * config/rs6000/darwin8.h: Removed.
27737 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27739 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
27741 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27743 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
27744 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
27746 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27748 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
27750 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
27752 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27754 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
27755 for the Darwin10 unwinder stub from here ...
27756 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
27758 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27760 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
27761 (ASM_DEBUG_SPEC):Only define if the assembler supports
27763 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
27764 (DARWIN_PREFER_DWARF): Define.
27765 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
27766 (DARWIN_PREFER_DWARF): Likewise
27767 (DSYMUTIL_SPEC): Likewise.
27768 (COLLECT_RUN_DSYMUTIL): Likewise.
27769 (ASM_DEBUG_SPEC): Likewise.
27770 (ASM_DEBUG_OPTION_SPEC): Likewise.
27772 2021-01-02 Jan Hubicka <jh@suse.cz>
27774 * cfg.c (free_block): ggc_free bb.
27776 2021-01-01 Jakub Jelinek <jakub@redhat.com>
27778 * gcc.c (process_command): Update copyright notice dates.
27779 * gcov-dump.c (print_version): Ditto.
27780 * gcov.c (print_version): Ditto.
27781 * gcov-tool.c (print_version): Ditto.
27782 * gengtype.c (create_file): Ditto.
27783 * doc/cpp.texi: Bump @copying's copyright year.
27784 * doc/cppinternals.texi: Ditto.
27785 * doc/gcc.texi: Ditto.
27786 * doc/gccint.texi: Ditto.
27787 * doc/gcov.texi: Ditto.
27788 * doc/install.texi: Ditto.
27789 * doc/invoke.texi: Ditto.
27791 2021-01-01 Jakub Jelinek <jakub@redhat.com>
27793 * ChangeLog-2020: Rotate ChangeLog. New file.
27796 Copyright (C) 2021 Free Software Foundation, Inc.
27798 Copying and distribution of this file, with or without modification,
27799 are permitted in any medium without royalty provided the copyright
27800 notice and this notice are preserved.