1 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
3 * ipa-modref.c (modref_eaf_analysis::analyze_ssa_name): Revert
6 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
7 Uroš Bizjak <ubizjak@gmail.com>
9 * config/i386/i386.md (<any_rotate>ti3): Provide expansion for
10 rotations by non-constant amounts.
12 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
14 * ipa-modref.c: Fix anonymous namespace placement.
15 (class modref_eaf_analysis): New class.
16 (analyze_ssa_name_flags): Turn to ...
17 (modref_eaf_analysis::analyze_ssa_name): ... this one.
18 (merge_call_lhs_flags): Turn to ...
19 (modref_eaf_analysis::merge_call_lhs_flags): .. this one
20 (modref_eaf_analysis::merge_with_ssa_name): New member function.
21 (record_escape_points): Turn to ...
22 (modref_eaf_analysis::record_escape_points): ... this one.
23 (analyze_parms): Updat
24 (ipa_merge_modref_summary_after_inlining): Move to the end of file.
26 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
28 * gimple.c (gimple_call_static_chain_flags): New function.
29 * gimple.h (gimple_call_static_chain_flags): Declare
30 * ipa-modref.c (modref_summary::modref_summary): Initialize
32 (modref_summary_lto::modref_summary_lto): Likewise.
33 (modref_summary::useful_p): Test static_chain_flags.
34 (modref_summary_lto::useful_p): Likewise.
35 (struct modref_summary_lto): Add static_chain_flags.
36 (modref_summary::dump): Dump static_chain_flags.
37 (modref_summary_lto::dump): Likewise.
38 (struct escape_point): Add static_cahin_arg.
39 (analyze_ssa_name_flags): Use gimple_call_static_chain_flags.
40 (analyze_parms): Handle static chains.
41 (modref_summaries::duplicate): Duplicate static_chain_flags.
42 (modref_summaries_lto::duplicate): Likewise.
43 (modref_write): Stream static_chain_flags.
44 (read_section): Likewise.
45 (modref_merge_call_site_flags): Handle static_chain_flags.
46 * ipa-modref.h (struct modref_summary): Add static_chain_flags.
47 * tree-ssa-structalias.c (handle_rhs_call): Use
48 gimple_static_chain_flags.
50 2021-11-02 Richard Biener <rguenther@suse.de>
52 PR tree-optimization/103029
53 * tree-vect-loop-manip.c (vect_loop_versioning): Ensure
54 the PHI nodes in the loop maintain their original operand
57 2021-11-02 Jan Hubicka <hubicka@ucw.cz>
59 * tree-core.h (EAF_NOT_RETURNED_DIRECTLY): New flag.
60 (EAF_NOREAD): Renumber.
61 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOT_RETURNED_DIRECTLY.
62 (remove_useless_eaf_flags): Handle EAF_NOT_RETURNED_DIRECTLY
63 (deref_flags): Likewise.
64 (modref_lattice::init): Likewise.
65 (modref_lattice::merge): Likewise.
66 (merge_call_lhs_flags): Likewise.
67 (analyze_ssa_name_flags): Likewise.
68 (modref_merge_call_site_flags): Likewise.
69 * tree-ssa-structalias.c (handle_call_arg): Likewise.
71 2021-11-02 Maciej W. Rozycki <macro@embecosm.com>
73 * config/riscv/riscv.c (riscv_rtx_costs): Correct a CONST_INT_P
74 check and remove an unused local variable with shNadd/shNadd.uw
77 2021-11-02 Tamar Christina <tamar.christina@arm.com>
79 PR tree-optimization/103007
80 * tree-vect-slp-patterns.c (complex_fms_pattern::matches): Add elem
83 2021-11-02 Richard Biener <rguenther@suse.de>
86 * fold-const.c (native_interpret_expr): Handle OFFSET_TYPE.
87 (can_native_interpret_type_p): Likewise.
88 * internal-fn.c (expand_DEFERRED_INIT): View-convert the
89 RHS if the LHS is an SSA name.
91 2021-11-02 Richard Sandiford <richard.sandiford@arm.com>
93 * langhooks.h (lang_hooks_for_types::simulate_record_decl): New hook.
94 * langhooks-def.h (lhd_simulate_record_decl): Declare.
95 (LANG_HOOKS_SIMULATE_RECORD_DECL): Define.
96 (LANG_HOOKS_FOR_TYPES_INITIALIZER): Include it.
97 * langhooks.c (lhd_simulate_record_decl): New function.
99 2021-11-02 Jakub Jelinek <jakub@redhat.com>
102 * config/i386/i386.c (ix86_vector_mode_supported_p): Reject vector
103 modes with TImode inner mode if 32-bit.
105 2021-11-02 liuhongt <hongtao.liu@intel.com>
107 * doc/sourcebuild.texi (vect_slp_v4qi_store_unalign,
108 vect_slp_v2hi_store_unalign, vect_slp_v4hi_store_unalign,
109 vect_slp_v4si_store_unalign): Document efficient target.
110 (vect_slp_v4qi_store_unalign_1, vect_slp_v8qi_store_unalign_1,
111 vect_slp_v16qi_store_unalign_1): Ditto.
112 (vect_slp_v2hi_store_align,vect_slp_v2qi_store_align,
113 vect_slp_v2si_store_align, vect_slp_v4qi_store_align): Ditto.
114 (struct_4char_block_move, struct_8char_block_move,
115 struct_16char_block_move): Ditto.
117 2021-11-02 Roger Sayle <roger@nextmovesoftware.com>
118 Jakub Jelinek <jakub@redhat.com>
121 * config/i386/i386-expand.c (ix86_expand_v1ti_to_ti,
122 ix86_expand_ti_to_v1ti): New helper functions.
123 (ix86_expand_v1ti_shift): Check if the amount operand is an
124 integer constant, and expand as a TImode shift if it isn't.
125 (ix86_expand_v1ti_rotate): Check if the amount operand is an
126 integer constant, and expand as a TImode rotate if it isn't.
127 (ix86_expand_v1ti_ashiftrt): New function to expand arithmetic
128 right shifts of V1TImode quantities.
129 * config/i386/i386-protos.h (ix86_expand_v1ti_ashift): Prototype.
130 * config/i386/sse.md (ashlv1ti3, lshrv1ti3): Change constraints
131 to QImode general_operand, and let the helper functions lower
132 shifts by non-constant operands, as TImode shifts. Make
133 conditional on TARGET_64BIT.
134 (ashrv1ti3): New expander calling ix86_expand_v1ti_ashiftrt.
135 (rotlv1ti3, rotrv1ti3): Change shift operand to QImode.
136 Make conditional on TARGET_64BIT.
138 2021-11-02 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
140 * config/s390/s390.md ("*cc_to_int", "tabort", "*tabort_1",
141 "*tabort_1_plus"): Remove operands non-null check.
143 2021-11-01 Martin Liska <mliska@suse.cz>
145 * opt-functions.awk: Add new sanity checking.
146 * optc-gen.awk: Add new argument to integer_range_info.
147 * params.opt: Update 2 params which have negative IntegerRange.
149 2021-11-01 qing zhao <qing.zhao@oracle.com>
151 * gimplify.c (gimplify_decl_expr): Do not add call to
152 __builtin_clear_padding when a variable is a gimple register
153 or it might not have padding.
154 (gimplify_init_constructor): Likewise.
156 2021-11-01 Tamar Christina <tamar.christina@arm.com>
158 * config/arm/aarch-common-protos.h (struct vector_cost_table): Add
159 movi, dup and extract costing fields.
160 * config/aarch64/aarch64-cost-tables.h (qdf24xx_extra_costs,
161 thunderx_extra_costs, thunderx2t99_extra_costs,
162 thunderx3t110_extra_costs, tsv110_extra_costs, a64fx_extra_costs): Use
164 * config/arm/aarch-cost-tables.h (generic_extra_costs,
165 cortexa53_extra_costs, cortexa57_extra_costs, cortexa76_extra_costs,
166 exynosm1_extra_costs, xgene1_extra_costs): Likewise
167 * config/aarch64/aarch64-simd.md (aarch64_simd_dup<mode>): Add r->w dup.
168 * config/aarch64/aarch64.c (aarch64_rtx_costs): Add extra costs.
170 2021-11-01 Tamar Christina <tamar.christina@arm.com>
172 * cse.c (add_to_set): New.
173 (find_sets_in_insn): Register constants in sets.
174 (canonicalize_insn): Use auto_vec instead.
175 (cse_insn): Try materializing using vec_dup.
176 * rtl.h (simplify_context::simplify_gen_vec_select,
177 simplify_gen_vec_select): New.
178 * simplify-rtx.c (simplify_context::simplify_gen_vec_select): New.
180 2021-11-01 David Malcolm <dmalcolm@redhat.com>
182 * common.opt (fdiagnostics-escape-format=): New.
183 (diagnostics_escape_format): New enum.
184 (DIAGNOSTICS_ESCAPE_FORMAT_UNICODE): New enum value.
185 (DIAGNOSTICS_ESCAPE_FORMAT_BYTES): Likewise.
186 * diagnostic-format-json.cc (json_end_diagnostic): Add
187 "escape-source" attribute.
188 * diagnostic-show-locus.c
189 (exploc_with_display_col::exploc_with_display_col): Replace
190 "tabstop" param with a cpp_char_column_policy and add an "aspect"
191 param. Use these to compute m_display_col accordingly.
192 (struct char_display_policy): New struct.
193 (layout::m_policy): New field.
194 (layout::m_escape_on_output): New field.
195 (def_policy): New function.
196 (make_range): Update for changes to exploc_with_display_col ctor.
197 (default_print_decoded_ch): New.
198 (width_per_escaped_byte): New.
199 (escape_as_bytes_width): New.
200 (escape_as_bytes_print): New.
201 (escape_as_unicode_width): New.
202 (escape_as_unicode_print): New.
204 (layout::layout): Initialize new fields. Update m_exploc ctor
205 call for above change to ctor.
206 (layout::maybe_add_location_range): Update for changes to
207 exploc_with_display_col ctor.
208 (layout::calculate_x_offset_display): Update for change to
210 (layout::print_source_line): Pass policy
211 to cpp_display_width_computation. Capture cpp_decoded_char when
212 calling process_next_codepoint. Move printing of source code to
214 (line_label::line_label): Pass in policy rather than context.
215 (layout::print_any_labels): Update for change to line_label ctor.
216 (get_affected_range): Pass in policy rather than context, updating
217 calls to location_compute_display_column accordingly.
218 (get_printed_columns): Likewise, also for cpp_display_width.
219 (correction::correction): Pass in policy rather than tabstop.
220 (correction::compute_display_cols): Pass m_policy rather than
221 m_tabstop to cpp_display_width.
222 (correction::m_tabstop): Replace with...
223 (correction::m_policy): ...this.
224 (line_corrections::line_corrections): Pass in policy rather than
226 (line_corrections::m_context): Replace with...
227 (line_corrections::m_policy): ...this.
228 (line_corrections::add_hint): Update to use m_policy rather than
230 (line_corrections::add_hint): Likewise.
231 (layout::print_trailing_fixits): Likewise.
232 (selftest::test_display_widths): New.
233 (selftest::test_layout_x_offset_display_utf8): Update to use
234 policy rather than tabstop.
235 (selftest::test_one_liner_labels_utf8): Add test of escaping
237 (selftest::test_diagnostic_show_locus_one_liner_utf8): Update to
238 use policy rather than tabstop.
239 (selftest::test_overlapped_fixit_printing): Likewise.
240 (selftest::test_overlapped_fixit_printing_utf8): Likewise.
241 (selftest::test_overlapped_fixit_printing_2): Likewise.
242 (selftest::test_tab_expansion): Likewise.
243 (selftest::test_escaping_bytes_1): New.
244 (selftest::test_escaping_bytes_2): New.
245 (selftest::diagnostic_show_locus_c_tests): Call the new tests.
246 * diagnostic.c (diagnostic_initialize): Initialize
247 context->escape_format.
248 (convert_column_unit): Update to use default character width policy.
249 (selftest::test_diagnostic_get_location_text): Likewise.
250 * diagnostic.h (enum diagnostics_escape_format): New enum.
251 (diagnostic_context::escape_format): New field.
252 * doc/invoke.texi (-fdiagnostics-escape-format=): New option.
253 (-fdiagnostics-format=): Add "escape-source" attribute to examples
254 of JSON output, and document it.
255 * input.c (location_compute_display_column): Pass in "policy"
256 rather than "tabstop", passing to
257 cpp_byte_column_to_display_column.
258 (selftest::test_cpp_utf8): Update to use cpp_char_column_policy.
259 * input.h (class cpp_char_column_policy): New forward decl.
260 (location_compute_display_column): Pass in "policy" rather than
262 * opts.c (common_handle_option): Handle
263 OPT_fdiagnostics_escape_format_.
264 * selftest.c (temp_source_file::temp_source_file): New ctor
265 overload taking a size_t.
266 * selftest.h (temp_source_file::temp_source_file): Likewise.
268 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
270 * dbgcnt.def: Add debug counter for back_thread[12] and
272 * passes.def: Pass "first" argument to each back threading pass.
273 * tree-ssa-threadbackward.c (back_threader::back_threader): Add
275 (back_threader::debug_counter): New.
276 (back_threader::maybe_register_path): Call debug_counter.
278 2021-11-01 Aldy Hernandez <aldyh@redhat.com>
280 * tree-ssa-threadbackward.c (BT_NONE): New.
283 (back_threader::back_threader): Add flags.
284 Move loop initialization here.
285 (back_threader::~back_threader): New.
286 (back_threader::find_taken_edge_switch): Change solver and ranger
288 (back_threader::find_taken_edge_cond): Same.
289 (back_threader::find_paths_to_names): Same.
290 (back_threader::find_paths): Same.
291 (back_threader::dump): Same.
292 (try_thread_blocks): Merge into thread_blocks.
293 (back_threader::thread_blocks): New.
294 (do_early_thread_jumps): Merge into thread_blocks.
295 (do_thread_jumps): Merge into thread_blocks.
296 (back_threader::thread_through_all_blocks): Remove.
298 2021-11-01 Andrew MacLeod <amacleod@redhat.com>
300 PR tree-optimization/103003
301 * value-relation.cc (dom_oracle::register_relation): If the 2
302 ssa names are the same, don't register any relation.
304 2021-11-01 Dan Li <ashimida@linux.alibaba.com>
306 * config/aarch64/aarch64.c (aarch64_expand_epilogue): Remove
307 redundant check for calls_eh_return.
308 * config/aarch64/aarch64.md (*do_return): Likewise.
310 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
312 * cfghooks.c (cfg_hook_duplicate_loop_to_header_edge): Rename
313 duplicate_loop_to_header_edge to
314 duplicate_loop_body_to_header_edge.
315 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
316 * cfghooks.h (struct cfg_hooks): Likewise.
317 (cfg_hook_duplicate_loop_body_to_header_edge): Likewise.
318 * cfgloopmanip.c (duplicate_loop_body_to_header_edge): Likewise.
319 (clone_loop_to_header_edge): Likewise.
320 * cfgloopmanip.h (duplicate_loop_body_to_header_edge): Likewise.
321 * cfgrtl.c (struct cfg_hooks): Likewise.
322 * doc/loop.texi: Likewise.
323 * loop-unroll.c (unroll_loop_constant_iterations): Likewise.
324 (unroll_loop_runtime_iterations): Likewise.
325 (unroll_loop_stupid): Likewise.
326 (apply_opt_in_copies): Likewise.
327 * tree-cfg.c (struct cfg_hooks): Likewise.
328 * tree-ssa-loop-ivcanon.c (try_unroll_loop_completely): Likewise.
329 (try_peel_loop): Likewise.
330 * tree-ssa-loop-manip.c (copy_phi_node_args): Likewise.
331 (gimple_duplicate_loop_body_to_header_edge): Likewise.
332 (tree_transform_and_unroll_loop): Likewise.
333 * tree-ssa-loop-manip.h (gimple_duplicate_loop_body_to_header_edge):
336 2021-11-01 Xionghu Luo <luoxhu@linux.ibm.com>
338 * cfgloopmanip.c (loop_version): Refactor loopify to
339 loop_version. Move condition generation after loopify.
341 * cfgloopmanip.h (loopify): Delete.
343 2021-10-31 Jan Hubicka <hubicka@ucw.cz>
345 * ipa-fnsummary.c: Include tree-dfa.h.
346 (points_to_local_or_readonly_memory_p): Return true on return
348 * ipa-modref.c (analyze_ssa_name_flags): Fix handling of copy
351 2021-10-30 Tobias Burnus <tobias@codesourcery.com>
354 * omp-low.c (omp_runtime_api_call): Use DECL_ASSEMBLER_NAME to get
355 internal Fortran name; new permit_num_teams arg to permit
356 omp_get_num_teams and omp_get_team_num.
357 (scan_omp_1_stmt): Update call to it, add missing call for
358 reverse offload, and check for strictly nested API calls in teams.
360 2021-10-30 Jakub Jelinek <jakub@redhat.com>
362 * gimplify.c (gimplify_omp_for): Diagnose threadprivate iterators.
364 2021-10-29 David Malcolm <dmalcolm@redhat.com>
366 * selftest.c (assert_streq): Add newlines when emitting non-equal
369 2021-10-29 David Malcolm <dmalcolm@redhat.com>
371 * Makefile.in: Fix syntax for reference to LIBDEPS in
374 2021-10-29 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
376 * doc/install.texi: Bump required minimum DejaGnu version.
378 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
379 Andrew MacLeod <amacleod@redhat.com>
381 * value-relation.cc (path_oracle::killing_def): Add a
382 self-equivalence so we don't look to the root oracle.
384 2021-10-29 Aldy Hernandez <aldyh@redhat.com>
386 * passes.def: Replace the pass_thread_jumps before VRP* with
387 pass_thread_jumps_full. Remove all pass_vrp_threader instances.
388 * tree-ssa-threadbackward.c (pass_data_thread_jumps_full):
389 Remove hyphen from "thread-full" name.
391 2021-10-29 Andrew MacLeod <amacleod@redhat.com>
393 PR tree-optimization/102983
394 * gimple-range-cache.h (propagate_updated_value): Make public.
395 * gimple-range.cc (gimple_ranger::range_of_stmt): Propagate exports
396 when processing gcond stmts.
398 2021-10-29 Jan Hubicka <hubicka@ucw.cz>
400 * gimple.c (gimple_call_retslot_flags): New function.
401 * gimple.h (gimple_call_retslot_flags): Declare.
402 * ipa-modref.c: Include tree-cfg.h.
403 (struct escape_entry): Turn parm_index to signed.
404 (modref_summary_lto::modref_summary_lto): Add retslot_flags.
405 (modref_summary::modref_summary): Initialize retslot_flags.
406 (struct modref_summary_lto): Likewise.
407 (modref_summary::useful_p): Check retslot_flags.
408 (modref_summary_lto::useful_p): Likewise.
409 (modref_summary::dump): Dump retslot_flags.
410 (modref_summary_lto::dump): Likewise.
411 (struct escape_point): Add hidden_args enum.
412 (analyze_ssa_name_flags): Ignore return slot return;
413 use gimple_call_retslot_flags.
414 (record_escape_points): Break out from ...
415 (analyze_parms): ... here; handle retslot_flags.
416 (modref_summaries::duplicate): Duplicate retslot_flags.
417 (modref_summaries_lto::duplicate): Likewise.
418 (modref_write_escape_summary): Stream parm_index as signed.
419 (modref_read_escape_summary): Likewise.
420 (modref_write): Stream retslot_flags.
421 (read_section): Likewise.
422 (struct escape_map): Fix typo in comment.
423 (update_escape_summary_1): Fix whitespace.
424 (ipa_merge_modref_summary_after_inlining): Drop retslot_flags.
425 (modref_merge_call_site_flags): Merge retslot_flags.
426 * ipa-modref.h (struct modref_summary): Add retslot_flags.
427 * tree-ssa-structalias.c (handle_rhs_call): Handle retslot_flags.
429 2021-10-29 Tamar Christina <tamar.christina@arm.com>
431 PR tree-optimization/102977
432 * tree-vect-slp-patterns.c (vect_match_call_p): Remove.
433 (vect_detect_pair_op): Add crosslane check.
434 (vect_match_call_complex_mla): Remove.
435 (class complex_mul_pattern): Update comment.
436 (complex_mul_pattern::matches): Update detection.
437 (class complex_fma_pattern): Remove.
438 (complex_fma_pattern::matches): Remove.
439 (complex_fma_pattern::recognize): Remove.
440 (complex_fma_pattern::build): Remove.
441 (class complex_fms_pattern): Update comment.
442 (complex_fms_pattern::matches): Remove.
443 (complex_operations_pattern::recognize): Remove complex_fma_pattern
445 2021-10-29 Jakub Jelinek <jakub@redhat.com>
447 * gimple-fold.c (gimple_fold_builtin_memset): Copy over location from
450 2021-10-29 Xionghu Luo <luoxhu@linux.ibm.com>
453 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const): Add
454 patterns match and emit for VSX xxpermdi.
456 2021-10-29 liuhongt <hongtao.liu@intel.com>
459 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF): New
461 (V16HF_FTYPE_V16HF): Ditto.
462 (V32HF_FTYPE_V32HF): Ditto.
463 (V8HF_FTYPE_V8HF_ROUND): Ditto.
464 (V16HF_FTYPE_V16HF_ROUND): Ditto.
465 (V32HF_FTYPE_V32HF_ROUND): Ditto.
466 * config/i386/i386-builtin.def ( IX86_BUILTIN_FLOORPH,
467 IX86_BUILTIN_CEILPH, IX86_BUILTIN_TRUNCPH,
468 IX86_BUILTIN_FLOORPH256, IX86_BUILTIN_CEILPH256,
469 IX86_BUILTIN_TRUNCPH256, IX86_BUILTIN_FLOORPH512,
470 IX86_BUILTIN_CEILPH512, IX86_BUILTIN_TRUNCPH512): New builtin.
471 * config/i386/i386-builtins.c
472 (ix86_builtin_vectorized_function): Enable vectorization for
473 HFmode FLOOR/CEIL/TRUNC operation.
474 * config/i386/i386-expand.c (ix86_expand_args_builtin): Handle
476 * config/i386/sse.md (rint<mode>2, nearbyint<mode>2): Extend
479 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
480 Andrew MacLeod <amacleod@redhat.com>
482 * value-relation.cc (path_oracle::killing_def): Walk the
483 equivalency list and remove SSA from any equivalencies.
485 2021-10-28 Stafford Horne <shorne@gmail.com>
487 * config/or1k/or1k.h (PROFILE_HOOK): Add return address argument
490 2021-10-28 Jakub Jelinek <jakub@redhat.com>
492 PR tree-optimization/102951
493 * fold-const.h (address_compare): Declare.
494 * fold-const.c (address_compare): New function.
495 * match.pd (cmp (convert1?@2 addr@0) (convert2? addr@1)): Use
496 address_compare helper.
497 (minmax cmp (convert1?@2 addr@0) (convert2?@3 addr@1)): New
500 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
502 * vr-values.c (simplify_using_ranges::fold_cond): Change fold message.
504 2021-10-28 Andrew MacLeod <amacleod@redhat.com>
506 PR tree-optimization/102940
507 * tree-vrp.c (execute_ranger_vrp): Reset scev.
509 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
511 * config/nios2/linux.h (MUSL_DYNAMIC_LINKER): Add musl linker
513 2021-10-28 Richard Purdie <richard.purdie@linuxfoundation.org>
515 * configure: Regenerate.
516 * configure.ac: Use CPPFLAGS_FOR_BUILD for GMPINC
518 2021-10-28 Eric Botcazou <ebotcazou@adacore.com>
520 * doc/invoke.texi (%X): Remove obsolete reference to -Wl.
522 2021-10-28 Richard Biener <rguenther@suse.de>
525 * fold-const.c (fold_convert_const): Avoid int to float
526 constant folding with -frounding-math and inexact result.
527 * simplify-rtx.c (simplify_const_unary_operation): Likewise
528 for both float and unsigned_float.
530 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
532 * tree-ssa-threadbackward.c
533 (back_threader::find_taken_edge_switch): Use find_case_label_range
534 instead of find_taken_edge.
536 2021-10-28 Aldy Hernandez <aldyh@redhat.com>
538 * tree-ssa-threadbackward.c (class back_threader_registry):
539 Inherit from back_jt_path_registry.
540 (back_threader_registry::thread_through_all_blocks): Remove.
541 (back_threader_registry::register_path): Remove
542 m_lowlevel_registry prefix.
544 2021-10-28 Richard Biener <rguenther@suse.de>
547 * fold-const.c (fold_convert_const_real_from_real): Honor
548 -frounding-math if the conversion is not exact.
549 * simplify-rtx.c (simplify_const_unary_operation): Do not
550 simplify FLOAT_TRUNCATE with sign dependent rounding.
552 2021-10-28 Richard Biener <rguenther@suse.de>
554 PR tree-optimization/102949
555 * tree-vect-stmts.c (ensure_base_align): Look at the
556 dr_info of a group leader and assert we are looking at
557 one with analyzed alignment.
559 2021-10-28 Kewen Lin <linkw@linux.ibm.com>
562 * config/rs6000/rs6000.c (rs6000_builtin_vectorization_cost): Consider
563 V1T1 mode for unaligned load and store.
565 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
567 * config/riscv/riscv.md (zero_extendsidi2_internal): Allow ZBB
570 2021-10-28 Kito Cheng <kito.cheng@sifive.com>
572 * config/riscv/arch-canonicalize (CANONICAL_ORDER): Add `i` to
575 2021-10-28 Alexandre Oliva <oliva@adacore.com>
577 * common.opt (fharden-compares): New.
578 (fharden-conditional-branches): New.
579 * doc/invoke.texi: Document new options.
580 * gimple-harden-conditionals.cc: New.
581 * Makefile.in (OBJS): Build it.
582 * passes.def: Add new passes.
583 * tree-pass.h (make_pass_harden_compares): Declare.
584 (make_pass_harden_conditional_branches): Declare.
586 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
589 * config/rs6000/altivec.md: Add vsx register constraints.
590 * config/rs6000/vsx.md (vsx_xxsel<mode>): Delete.
591 (vsx_xxsel<mode>2): Likewise.
592 (vsx_xxsel<mode>3): Likewise.
593 (vsx_xxsel<mode>4): Likewise.
595 2021-10-28 Xionghu Luo <luoxhu@linux.ibm.com>
598 * config/rs6000/altivec.md (*altivec_vsel<mode>): Change to ...
599 (altivec_vsel<mode>): ... this and update define.
600 (*altivec_vsel<mode>_uns): Delete.
601 (altivec_vsel<mode>2): New define_insn.
602 (altivec_vsel<mode>3): Likewise.
603 (altivec_vsel<mode>4): Likewise.
604 * config/rs6000/rs6000-call.c (altivec_expand_vec_sel_builtin): New.
605 (altivec_expand_builtin): Call altivec_expand_vec_sel_builtin to expand
607 * config/rs6000/rs6000.c (rs6000_emit_vector_cond_expr): Use bit-wise
608 selection instead of per element.
609 * config/rs6000/vector.md:
610 * config/rs6000/vsx.md (*vsx_xxsel<mode>): Change to ...
611 (vsx_xxsel<mode>): ... this and update define.
612 (*vsx_xxsel<mode>_uns): Delete.
613 (vsx_xxsel<mode>2): New define_insn.
614 (vsx_xxsel<mode>3): Likewise.
615 (vsx_xxsel<mode>4): Likewise.
617 2021-10-28 Hongyu Wang <hongyu.wang@intel.com>
619 * config/i386/i386.c (use_rsqrt_p): Add mode parameter, enable
620 HFmode rsqrt without TARGET_SSE_MATH.
621 (ix86_optab_supported_p): Refactor rint, adjust floor, ceil,
622 btrunc condition to be restricted by -ftrapping-math, adjust
623 use_rsqrt_p function call.
624 * config/i386/i386.md (rcphf2): New define_insn.
625 (rsqrthf2): Likewise.
626 * config/i386/sse.md (div<mode>3): Change VF2H to VF2.
627 (div<mode>3): New expander for HF mode.
628 (rsqrt<mode>2): Likewise.
629 (*avx512fp16_vmrcpv8hf2): New define_insn for rpad pass.
630 (*avx512fp16_vmrsqrtv8hf2): Likewise.
632 2021-10-27 Saagar Jha <saagar@saagarjha.com>
634 * config.gcc: Adjust for Darwin21.
635 * config/darwin-c.c (macosx_version_as_macro): Likewise.
636 * config/darwin-driver.c (validate_macosx_version_min):
638 (darwin_find_version_from_kernel): Likewise.
640 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
642 * gimple-range-path.cc
643 (path_range_query::range_defined_in_block): Call killing_def.
645 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
647 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Dump
648 operands as well as relation.
649 * gimple-range-path.cc
650 (path_range_query::compute_ranges_in_block): Compute PHI relations
651 first. Compute outgoing relations at the end.
652 (path_range_query::compute_ranges): Remove call to compute_relations.
653 (path_range_query::compute_relations): Remove.
654 (path_range_query::maybe_register_phi_relation): New.
655 (path_range_query::compute_phi_relations): Abstract out
656 registering one PHI relation to...
657 (path_range_query::compute_outgoing_relations): ...here.
658 * gimple-range-path.h (class path_range_query): Remove
660 Add maybe_register_phi_relation.
662 2021-10-27 Aldy Hernandez <aldyh@redhat.com>
663 Andrew MacLeod <amacleod@redhat.com>
665 * value-relation.cc (path_oracle::killing_def): Kill second
668 2021-10-27 Martin Jambor <mjambor@suse.cz>
670 * ipa-cp.c (good_cloning_opportunity_p): Decide whether to use
671 profile feedback depending on their local availability.
673 2021-10-27 Martin Jambor <mjambor@suse.cz>
675 * params.opt (param_ipa_cp_profile_count_base): New parameter.
676 * doc/invoke.texi (Optimize Options): Add entry for
677 ipa-cp-profile-count-base.
678 * ipa-cp.c (max_count): Replace with base_count, replace all
679 occurrences too, unless otherwise stated.
680 (ipcp_cloning_candidate_p): identify mostly-directly called
681 functions based on their counts, not max_count.
682 (compare_edge_profile_counts): New function.
683 (ipcp_propagate_stage): Instead of setting max_count, find the
684 appropriate edge count in a sorted vector of counts of eligible
685 edges and make it the base_count.
687 2021-10-27 Martin Jambor <mjambor@suse.cz>
689 * ipa-cp.c (struct caller_statistics): New fields rec_count_sum,
690 n_nonrec_calls and itself, document all fields.
691 (init_caller_stats): Initialize the above new fields.
692 (gather_caller_stats): Gather self-recursive counts and calls number.
693 (get_info_about_necessary_edges): Gather counts of self-recursive and
694 other edges bringing in the requested value separately.
695 (dump_profile_updates): Rework to dump info about a single node only.
696 (lenient_count_portion_handling): New function.
697 (struct gather_other_count_struct): New type.
698 (gather_count_of_non_rec_edges): New function.
699 (struct desc_incoming_count_struct): New type.
700 (analyze_clone_icoming_counts): New function.
701 (adjust_clone_incoming_counts): Likewise.
702 (update_counts_for_self_gen_clones): Likewise.
703 (update_profiling_info): Rewritten.
704 (update_specialized_profile): Adjust call to dump_profile_updates.
705 (create_specialized_node): Do not update profiling info.
706 (decide_about_value): New parameter self_gen_clones, either push new
707 clones into it or updat their profile counts. For self-recursively
708 generated values, use a portion of the node count instead of count
709 from self-recursive edges to estimate goodness.
710 (decide_whether_version_node): Gather clones for self-generated values
711 in a new vector, update their profiles at once at the end.
713 2021-10-27 Richard Biener <rguenther@suse.de>
715 * tree-vectorizer.c (vect_transform_loops): New function,
717 (try_vectorize_loop_1): ... here. Simplify as epilogues
718 are now fully handled in the split part.
720 2021-10-27 Jakub Jelinek <jakub@redhat.com>
722 * omp-expand.c (expand_omp_for_init_counts): Handle non-rectangular
723 iterators with pointer types.
724 (expand_omp_for_init_vars, extract_omp_for_update_vars): Likewise.
726 2021-10-26 Martin Sebor <msebor@redhat.com>
728 PR tree-optimization/102238
729 PR tree-optimization/102919
730 * gimple-ssa-sprintf.c (get_string_length): Add an argument.
731 (array_elt_at_offset): Move to pointer-query.
732 (set_aggregate_size_and_offset): New function.
733 (field_at_offset): Move to pointer-query.
734 (get_origin_and_offset): Rename...
735 (get_origin_and_offset_r): this. Add an argument. Make aggregate
736 handling more robust.
737 (get_origin_and_offset): New.
738 (alias_offset): Add an argument.
739 (format_string): Use subobject size determined by get_origin_and_offset.
740 * pointer-query.cc (field_at_offset): Move from gimple-ssa-sprintf.c.
741 Improve/correct handling of aggregates.
742 (array_elt_at_offset): Same.
743 * pointer-query.h (field_at_offset): Declare.
744 (array_elt_at_offset): Declare.
746 2021-10-26 Martin Sebor <msebor@redhat.com>
748 * builtins.c (check_strncat_sizes): Pass access_data ctor additional
750 (expand_builtin_memcmp): Move code to gimple-ssa-warn-access.cc.
751 (expand_builtin_fork_or_exec): Same.
752 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Pass
753 compute_objsize additional arguments.
754 (inbounds_memaccess_p): Same.
755 (array_bounds_checker::check_array_bounds): Add an assert. Stash
756 statement in a member.
757 (check_array_bounds_dom_walker::before_dom_children): Same.
758 * gimple-array-bounds.h (array_bounds_checker::m_stmt): New member.
759 * gimple-ssa-sprintf.c (get_destination_size): Add an argument.
760 (handle_printf_call): Pass a new argument.
761 * gimple-ssa-warn-access.cc (get_size_range): Add an argument.
762 (check_access): Add an argument and pass it along to callees.
763 (check_read_access): Make a member function.
764 (pass_waccess::check_strcat): Pass access_data ctor additional
766 (pass_waccess::check_strncat): Same.
767 (pass_waccess::check_stxcpy): Same.
768 (pass_waccess::check_stxncpy): Same.
769 (pass_waccess::check_strncmp): Same.
770 (pass_waccess::check_read_access): Same.
771 (pass_waccess::check_builtin): Same.
772 (pass_waccess::maybe_check_access_sizes): Same.
773 (pass_waccess::maybe_check_dealloc_call): Same.
774 * gimple-ssa-warn-access.h (check_read_access): Declare a new
776 * pointer-query.cc (compute_objsize_r): Add an argument.
777 (gimple_call_return_array): Same.
778 (gimple_call_alloc_size): Same.
779 (access_ref::access_ref): Same.
780 (access_ref::get_ref): Same.
781 (pointer_query::get_ref): Same.
782 (handle_min_max_size): Pass an arguments to callees.
783 (handle_array_ref): Add an argument.
784 (handle_mem_ref): Same.
785 (compute_objsize): Same.
786 * pointer-query.h (struct access_ref): Adjust signatures.
787 (struct access_data): Same.
788 (gimple_call_alloc_size): Add an argument.
789 (gimple_parm_array_size): Same.
790 (compute_objsize): Same.
791 * tree-ssa-strlen.c (strlen_pass::adjust_last_stmt): Pass an additional
792 argument to compute_objsize.
793 (strlen_pass::maybe_warn_overflow): Same.
794 (maybe_diag_stxncpy_trunc): Same.
796 2021-10-26 Martin Sebor <msebor@redhat.com>
799 * gimple-ssa-warn-access.cc (pass_waccess::check_atomic_builtin): New.
800 (pass_waccess::check_atomic_builtin): Call it.
802 2021-10-26 Vladimir N. Makarov <vmakarov@redhat.com>
804 PR rtl-optimization/102842
805 * lra-constraints.c (match_reload): Ignore out in checking values
807 (curr_insn_transform): Collect outputs before doing reloads of operands.
809 2021-10-26 Paul A. Clarke <pc@us.ibm.com>
812 * config/rs6000/x86intrin.h: Move some included headers to new
813 headers. Include new immintrin.h instead of those headers.
814 * config/rs6000/immintrin.h: New.
815 * config/rs6000/x86gprintrin.h: New.
816 * config.gcc (powerpc*-*-*): Add new headers to extra_headers.
818 2021-10-26 Richard Biener <rguenther@suse.de>
820 * tree-vectorizer.h (vect_create_addr_base_for_vector_ref):
821 Remove byte_offset parameter.
822 (vect_create_data_ref_ptr): Likewise.
823 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
825 (vect_create_data_ref_ptr): Likewise.
826 * tree-vect-stmts.c (vectorizable_store): Adjust.
827 (vectorizable_load): Likewise.
829 2021-10-26 Richard Biener <rguenther@suse.de>
831 PR tree-optimization/96109
832 * tree-vectorizer.h (dr_misalignment): Add optional offset
834 * tree-vect-data-refs.c (dr_misalignment): Likewise. Remove
835 offset applied for negative stride accesses.
836 (vect_enhance_data_refs_alignment): Compute negative stride
837 access offset and pass it to dr_misalignment.
838 * tree-vect-stmts.c (get_negative_load_store_type): Pass
839 negative offset to dr_misalignment.
840 (get_group_load_store_type): Likewise.
841 (get_load_store_type): Likewise.
842 (vectorizable_store): Remove asserts about alignment.
843 (vectorizable_load): Likewise.
845 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
847 PR tree-optimization/102897
848 * tree-ssa-forwprop.c (simplify_permutation): Remove a wrong assertion.
850 2021-10-26 Richard Biener <rguenther@suse.de>
852 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
853 Take offset in bytes.
854 (vect_create_data_ref_ptr): Likewise.
855 * tree-vect-loop-manip.c (get_misalign_in_elems): Multiply
856 offset by element size.
857 (vect_create_cond_for_align_checks): Likewise.
858 * tree-vect-stmts.c (get_negative_load_store_type): Likewise.
859 (vectorizable_load): Remove duplicate leftover from merge
862 2021-10-26 Roger Sayle <roger@nextmovesoftware.com>
864 * config/i386/i386-expand.c (ix86_expand_v1ti_shift): New helper
865 function to expand V1TI mode logical shifts by integer constants.
866 (ix86_expand_v1ti_rotate): New helper function to expand V1TI
867 mode rotations by integer constants.
868 * config/i386/i386-protos.h (ix86_expand_v1ti_shift,
869 ix86_expand_v1ti_rotate): Prototype new functions here.
870 * config/i386/sse.md (ashlv1ti3, lshrv1ti3, rotlv1ti3, rotrv1ti3):
871 New TARGET_SSE2 expanders to implement V1TI shifts and rotations.
873 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
875 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
876 Avoid threading circular paths.
878 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
880 * tree-ssa-threadbackward.c (back_threader::resolve_phi):
881 Attempt to resolve all incoming paths to a PHI.
882 (back_threader::resolve_def): Always return true for PHIs.
884 2021-10-26 Aldy Hernandez <aldyh@redhat.com>
886 * tree-ssa-threadbackward.c (back_threader::find_paths_to_names):
887 Always try to resolve path without looking back.
888 * tree-ssa-threadupdate.c (dump_jump_thread): Indidicate whether
891 2021-10-26 Kewen Lin <linkw@linux.ibm.com>
893 PR tree-optimization/102789
894 * tree-vect-loop-manip.c (vect_update_inits_of_drs): Do not
895 update inits of simd_lane_access.
897 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
899 * tree-vrp.c (vrp_simplify_cond_using_ranges): Add return type and
901 (simplify_casted_conds): Move to vrp_folder class.
902 (execute_vrp): Call via vrp_folder now.
903 * vr-values.c (simplify_cond_using_ranges_1): Call simplify_casted_cond.
904 (simplify_using_ranges::simplify_casted_cond): Relocate from tree-vrp.c.
905 * vr-values.h (simplify_casted_cond): Add prototype.
907 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
909 * tree-vrp.c (rvrp_folder::fold_stmt): If simplification fails, try
912 2021-10-25 Paul A. Clarke <pc@us.ibm.com>
914 * config/rs6000/smmintrin.h (_mm_testz_si128): Add "extern" to
916 (_mm_testc_si128): Likewise.
917 (_mm_testnzc_si128): Likewise.
918 (_mm_blend_ps): Likewise.
919 (_mm_blendv_ps): Likewise.
920 (_mm_blend_pd): Likewise.
921 (_mm_blendv_pd): Likewise.
922 (_mm_ceil_pd): Likewise.
923 (_mm_ceil_sd): Likewise.
924 (_mm_ceil_ps): Likewise.
925 (_mm_ceil_ss): Likewise.
926 (_mm_floor_pd): Likewise.
927 (_mm_floor_sd): Likewise.
928 (_mm_floor_ps): Likewise.
929 (_mm_floor_ss): Likewise.
930 (_mm_minpos_epu16): Likewise.
931 (_mm_mul_epi32): Likewise.
932 (_mm_cvtepi8_epi16): Likewise.
933 (_mm_packus_epi32): Likewise.
934 (_mm_cmpgt_epi64): Likewise.
936 2021-10-25 Roger Sayle <roger@nextmovesoftware.com>
938 * simplify-rtx.c (simplify_binary_operation_1) [SS_ASHIFT]: Simplify
939 shifts of the mode's smin_value and smax_value when the bit count
940 operand doesn't have side-effects.
941 [US_ASHIFT]: Likewise, simplify shifts of the mode's umax_value
942 when the bit count operand doesn't have side-effects.
943 (simplify_const_binary_operation) [SS_ASHIFT, US_ASHIFT]: Perform
944 compile-time evaluation of saturating left shifts with constant
947 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
949 * gimple-range.cc (gimple_ranger::export_global_ranges): Remove check
952 2021-10-25 Andrew MacLeod <amacleod@redhat.com>
954 * flag-types.h (enum ranger_debug): Adjust values.
955 * params.opt (ranger_debug): Ditto.
957 2021-10-25 Martin Jambor <mjambor@suse.cz>
959 PR tree-optimization/102886
960 * tree-sra.c (totally_scalarize_subtree): Fix the out of
963 2021-10-25 Andrew Pinski <apinski@marvell.com>
965 * tree-ssa-dce.c (simple_dce_from_worklist):
966 Check stmt_unremovable_because_of_non_call_eh_p also
967 before removing the statement.
969 2021-10-25 Richard Biener <rguenther@suse.de>
971 PR tree-optimization/102905
972 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
973 Use vect_supportable_dr_alignment again to determine whether
974 an access is supported when not aligned.
976 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
978 * config/riscv/riscv.c (riscv_rtx_costs): Handle cost model
981 2021-10-25 Jim Wilson <jimw@sifive.com>
982 Kito Cheng <kito.cheng@sifive.com>
983 Jia-Wei Chen <jiawei@iscas.ac.cn>
984 Shi-Hua Liao <shihua@iscas.ac.cn>
986 * config/riscv/bitmanip.md (shiftm1): New.
987 (*bset<mode>): Ditto.
988 (*bset<mode>_mask): Ditto.
989 (*bset<mode>_1): Ditto.
990 (*bset<mode>_1_mask): Ditto.
991 (*bseti<mode>): Ditto.
992 (*bclr<mode>): Ditto.
993 (*bclri<mode>): Ditto.
994 (*binv<mode>): Ditto.
995 (*binvi<mode>): Ditto.
996 (*bext<mode>): Ditto.
998 * config/riscv/predicates.md (splittable_const_int_operand):
1000 (single_bit_mask_operand): New.
1001 (not_single_bit_mask_operand): Ditto.
1002 (const31_operand): Ditto.
1003 (const63_operand): Ditto.
1004 * config/riscv/riscv.c (riscv_build_integer_1): Handle bseti.
1005 (riscv_output_move): Ditto.
1006 (riscv_print_operand): Handle new operand type: T and S.
1007 * config/riscv/riscv.h (SINGLE_BIT_MASK_OPERAND): New.
1009 2021-10-25 Jim Wilson <jimw@sifive.com>
1011 * config/riscv/riscv.c (riscv_build_integer_1): Build integer
1014 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
1016 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
1018 (riscv_rtx_costs): Ditto.
1020 2021-10-25 Jim Wilson <jimw@sifive.com>
1021 Kito Cheng <kito.cheng@sifive.com>
1022 Jia-Wei Chen <jiawei@iscas.ac.cn>
1024 * config/riscv/bitmanip.md (bitmanip_bitwise): New.
1025 (bitmanip_minmax): New.
1026 (clz_ctz_pcnt): New.
1027 (bitmanip_optab): New.
1028 (bitmanip_insn): New.
1029 (*<optab>_not<mode>): New.
1030 (*xor_not<mode>): New.
1031 (<bitmanip_optab>si2): New.
1032 (*<bitmanip_optab>disi2): New.
1033 (<bitmanip_optab>di2): New.
1034 (*zero_extendhi<GPR:mode>2_bitmanip): New.
1035 (*extend<SHORT:mode><SUPERQI:mode>2_zbb): New.
1036 (*zero_extendhi<GPR:mode>2_zbb): New.
1039 (rotrsi3_sext): New.
1042 (rotlsi3_sext): New.
1043 (bswap<mode>2): New.
1044 (<bitmanip_optab><mode>3): New.
1045 * config/riscv/riscv.md (type): Add rotate.
1046 (zero_extendhi<GPR:mode>2): Change to define_expand pattern.
1047 (*zero_extendhi<GPR:mode>2): New.
1048 (extend<SHORT:mode><SUPERQI:mode>2): Change to define_expand pattern.
1049 (*extend<SHORT:mode><SUPERQI:mode>2): New.
1051 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
1053 * config/riscv/riscv.c (riscv_extend_cost): Handle cost model
1055 (riscv_rtx_costs): Ditto.
1057 2021-10-25 Jim Wilson <jimw@sifive.com>
1058 Kito Cheng <kito.cheng@sifive.com>
1059 Jia-Wei Chen <jiawei@iscas.ac.cn>
1061 * config/riscv/bitmanip.md (*zero_extendsidi2_bitmanip): New.
1066 (riscv_rtx_costs): Ditto.
1067 * config/riscv/riscv.md: Include bitmanip.md
1068 (type): Add bitmanip bype.
1069 (zero_extendsidi2): Change to define_expand pattern.
1070 (*zero_extendsidi2_internal): New.
1071 (zero_extendsidi2_shifted): Disable for ZBA.
1073 2021-10-25 Kito Cheng <kito.cheng@sifive.com>
1075 * common/config/riscv/riscv-common.c (riscv_ext_version_table):
1076 Add zba, zbb, zbc and zbs.
1077 (riscv_ext_flag_table): Ditto.
1078 * config/riscv/riscv-opts.h (MASK_ZBA): New.
1082 (TARGET_ZBA): Ditto.
1083 (TARGET_ZBB): Ditto.
1084 (TARGET_ZBC): Ditto.
1085 (TARGET_ZBS): Ditto.
1086 * config/riscv/riscv.opt (riscv_zb_subext): New.
1088 2021-10-25 liuhongt <hongtao.liu@intel.com>
1091 * match.pd: Simplify (_Float16) sqrtf((float) a) to .SQRT(a)
1092 when direct_internal_fn_supported_p, similar for sqrt/sqrtl.
1094 2021-10-25 Richard Biener <rguenther@suse.de>
1096 PR tree-optimization/102920
1097 * tree-ssa-sccvn.h (expressions_equal_p): Add argument
1098 controlling VN_TOP matching behavior.
1099 * tree-ssa-sccvn.c (expressions_equal_p): Likewise.
1100 (vn_phi_eq): Do not optimistically match VN_TOP.
1102 2021-10-25 konglin1 <lingling.kong@intel.com>
1104 * config/i386/sse.md (fma_<mode>_fadd_fmul): Add new
1105 define_insn_and_split.
1106 (fma_<mode>_fadd_fcmul):Likewise
1107 (fma_<complexopname>_<mode>_fma_zero):Likewise
1109 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
1111 * config/pa/pa-d.c (pa_d_handle_target_float_abi): Don't check
1112 TARGET_DISABLE_FPREGS.
1113 * config/pa/pa.c (fix_range): Use MASK_SOFT_FLOAT instead of
1114 MASK_DISABLE_FPREGS.
1115 (hppa_rtx_costs): Don't check TARGET_DISABLE_FPREGS. Adjust
1116 cost of hardware integer multiplication.
1117 (pa_conditional_register_usage): Don't check TARGET_DISABLE_FPREGS.
1118 * config/pa/pa.h (INT14_OK_STRICT): Likewise.
1119 * config/pa/pa.md: Don't check TARGET_DISABLE_FPREGS. Check
1120 TARGET_SOFT_FLOAT in patterns that use xmpyu instruction.
1121 * config/pa/pa.opt (mdisable-fpregs): Change target mask to
1122 SOFT_FLOAT. Revise comment.
1123 (msoft-float): New option.
1125 2021-10-24 John David Anglin <danglin@gcc.gnu.org>
1127 * config/pa/pa.md: Don't use 'G' constraint in integer move patterns.
1129 2021-10-24 Gerald Pfeifer <gerald@pfeifer.com>
1131 * doc/install.texi (Specific): Remove obsolete details
1132 around GNU/Linux on Itanium.
1133 (Specific): Remove reference to Windows for Itanium.
1135 2021-10-23 Bernhard Reutner-Fischer <aldot@gcc.gnu.org>
1137 * config/i386/x86-tune-sched-bd.c (dispatch_group): Commentary
1140 2021-10-23 Jan Hubicka <hubicka@ucw.cz>
1142 * tree-ssa-structalias.c (compute_points_to_sets): Cleanup.
1144 2021-10-23 Roger Sayle <roger@nextmovesoftware.com>
1146 * config/i386/sse.md (<any_logic>v1ti3): New define_insn to
1147 implement V1TImode AND, IOR and XOR on TARGET_SSE2 (and above).
1148 (one_cmplv1ti2): New define expand.
1150 2021-10-22 Eric Gallager <egallager@gcc.gnu.org>
1153 * Makefile.in: Handle dvidir and install-dvi target.
1154 * configure: Regenerate.
1155 * configure.ac: Add install-dvi to target_list.
1157 2021-10-22 Gerald Pfeifer <gerald@pfeifer.com>
1159 * doc/install.texi (Binaries): Convert mingw-w64.org to https.
1162 2021-10-22 Richard Biener <rguenther@suse.de>
1164 PR tree-optimization/102893
1165 * tree-ssa-dce.c (find_obviously_necessary_stmts): Fix the
1166 test for an exit edge.
1168 2021-10-22 Aldy Hernandez <aldyh@redhat.com>
1169 Andrew MacLeod <amacleod@redhat.com>
1171 * gimple-range-path.cc (path_range_query::compute_phi_relations):
1172 Kill any global relations we may know before registering a new
1174 * value-relation.cc (path_oracle::killing_def): New.
1175 * value-relation.h (path_oracle::killing_def): New.
1177 2021-10-22 Richard Biener <rguenther@suse.de>
1180 * tree-ssa-sccvn.c (vn_phi_insert): For undefined SSA args
1182 (vn_phi_lookup): Likewise.
1184 2021-10-21 H.J. Lu <hjl.tools@gmail.com>
1187 * doc/invoke.texi: Document -fcf-protection requires i686 or
1190 2021-10-21 Eric Botcazou <ebotcazou@adacore.com>
1192 PR middle-end/102764
1193 * cfgexpand.c (expand_gimple_basic_block): Robustify latest change.
1195 2021-10-21 Jonathan Wright <jonathan.wright@arm.com>
1197 * config/aarch64/arm_neon.h (__STRUCTN): Delete function
1198 macro and all invocations.
1200 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
1202 * doc/invoke.texi (ranger-debug): Document.
1203 * flag-types.h (enum ranger_debug): New.
1204 (enum evrp_mode): Remove debug values.
1205 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Use new debug flag.
1206 * gimple-range-gori.cc (gori_compute::gori_compute): Ditto.
1207 * gimple-range.cc (gimple_ranger::gimple_ranger): Ditto.
1208 * gimple-ssa-evrp.c (hybrid_folder::choose_value): Ditto.
1209 (execute_early_vrp): Use evrp-mode directly.
1210 * params.opt (enum evrp_mode): Remove debug values.
1211 (ranger-debug): New.
1212 (ranger-logical-depth): Relocate to be in alphabetical order.
1214 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
1216 * doc/invoke.texi: (vrp1-mode, vrp2-mode): Document.
1217 * flag-types.h: (enum vrp_mode): New.
1218 * params.opt: (vrp1-mode, vrp2-mode): New.
1219 * tree-vrp.c (vrp_pass_num): New.
1220 (pass_vrp::pass_vrp): Set pass number.
1221 (pass_vrp::execute): Choose which VRP mode to execute.
1223 2021-10-21 Andrew MacLeod <amacleod@redhat.com>
1225 * gimple-ssa-evrp.c (class rvrp_folder): Move to tree-vrp.c.
1226 (execute_early_vrp): For ranger only mode, invoke ranger_vrp.
1227 * tree-vrp.c (class rvrp_folder): Relocate here.
1228 (execute_ranger_vrp): New.
1229 * tree-vrp.h (execute_ranger_vrp): Export.
1231 2021-10-21 Martin Liska <mliska@suse.cz>
1235 * opts.c (finish_options): Process flag_var_tracking* options
1236 here as they can be adjusted by optimize attribute.
1237 Process also flag_syntax_only and flag_gtoggle.
1238 * toplev.c (process_options): Remove it here.
1239 * common.opt: Make debug_nonbind_markers_p as PerFunction
1240 attribute as it depends on optimization level.
1242 2021-10-21 Martin Jambor <mjambor@suse.cz>
1244 PR tree-optimization/102505
1245 * tree-sra.c (totally_scalarize_subtree): Check that the
1246 encountered field fits within the acces we would like to put it
1249 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
1251 * tree-ssa-threadbackward.c
1252 (back_threader::maybe_register_path): Remove circular paths check.
1254 2021-10-21 Richard Biener <rguenther@suse.de>
1256 * toplev.c (process_options): Move the initial debug_hooks
1258 (toplev::main): ... before the call of the post_options
1261 2021-10-21 Richard Biener <rguenther@suse.de>
1263 PR tree-optimization/102847
1264 * tree-vect-stmts.c (vect_model_load_cost): Add the scalar
1265 load cost in the prologue for VMAT_INVARIANT.
1267 2021-10-21 Richard Biener <rguenther@suse.de>
1269 PR tree-optimization/102847
1270 * tree-vect-stmts.c (vect_model_load_cost): Explicitely
1271 handle VMAT_INVARIANT as a splat in the prologue.
1273 2021-10-21 Hongyu Wang <hongyu.wang@intel.com>
1276 * config/i386/i386.c (ix86_get_ssemov): Adjust HFmode vector
1277 move to use the same logic as HImode.
1279 2021-10-21 Richard Biener <rguenther@suse.de>
1281 * tree-vect-slp.c (vect_build_slp_tree_1): Remove
1282 superfluous gimple_call_nothrow_p check.
1284 2021-10-21 Andrew Pinski <apinski@marvell.com>
1286 * tree-cfg.c (maybe_remove_writeonly_store): Add dce_ssa_names argument.
1287 Mark the ssa-name of the rhs as one to be removed.
1288 (execute_fixup_cfg): Update call to maybe_remove_writeonly_store.
1289 Call simple_dce_from_worklist at the end to a simple dce.
1291 2021-10-21 Andrew Pinski <apinski@marvell.com>
1293 * tree-cfg.c (maybe_remove_writeonly_store): New function
1294 factored out from ...
1295 (execute_fixup_cfg): Here. Call maybe_remove_writeonly_store.
1297 2021-10-21 Andrew Pinski <apinski@marvell.com>
1299 * tree-cfg.c (execute_fixup_cfg): Remove comment
1300 about standalone pass.
1302 2021-10-21 Andrew Pinski <apinski@marvell.com>
1304 * tree-cfg.c (execute_fixup_cfg): Output when the statement
1305 is removed when it is a write only var.
1307 2021-10-21 Aldy Hernandez <aldyh@redhat.com>
1309 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
1310 Avoid threading circular paths.
1312 2021-10-20 Alex Coplan <alex.coplan@arm.com>
1314 * calls.c (initialize_argument_information): Remove some dead
1315 code, remove handling for function_arg returning const_int.
1316 * doc/tm.texi: Delete documentation for unused target hooks.
1317 * doc/tm.texi.in: Likewise.
1318 * target.def (load_bounds_for_arg): Delete.
1319 (store_bounds_for_arg): Delete.
1320 (load_returned_bounds): Delete.
1321 (store_returned_bounds): Delete.
1322 * targhooks.c (default_load_bounds_for_arg): Delete.
1323 (default_store_bounds_for_arg): Delete.
1324 (default_load_returned_bounds): Delete.
1325 (default_store_returned_bounds): Delete.
1326 * targhooks.h (default_load_bounds_for_arg): Delete.
1327 (default_store_bounds_for_arg): Delete.
1328 (default_load_returned_bounds): Delete.
1329 (default_store_returned_bounds): Delete.
1331 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
1333 * doc/extend.texi (Basic Asm): Clarify that asm is not an
1335 * doc/invoke.texi (-fno-asm): Fix description for C++.
1337 2021-10-20 Jonathan Wakely <jwakely@redhat.com>
1339 * doc/install.texi: Remove link to old.html
1341 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1343 * config/aarch64/aarch64-simd.md (*aarch64_cmtst_same_<mode>): New.
1345 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1347 * config/aarch64/aarch64-simd.md (*aarch64_narrow_trunc<mode>): New.
1349 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1351 * config/aarch64/aarch64-simd.md (aarch64_simd_ashr<mode>): Add case cmp
1353 * config/aarch64/constraints.md (D1): New.
1355 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1357 * config/aarch64/aarch64-simd.md
1358 (*aarch64_<srn_op>topbits_shuffle<mode>_le): New.
1359 (*aarch64_topbits_shuffle<mode>_le): New.
1360 (*aarch64_<srn_op>topbits_shuffle<mode>_be): New.
1361 (*aarch64_topbits_shuffle<mode>_be): New.
1362 * config/aarch64/predicates.md
1363 (aarch64_simd_shift_imm_vec_exact_top): New.
1365 2021-10-20 Tamar Christina <tamar.christina@arm.com>
1367 * config/aarch64/aarch64-simd.md (*aarch64_<srn_op>shrn<mode>_vect,
1368 *aarch64_<srn_op>shrn<mode>2_vect_le,
1369 *aarch64_<srn_op>shrn<mode>2_vect_be): New.
1370 * config/aarch64/iterators.md (srn_op): New.
1372 2021-10-20 Chung-Lin Tang <cltang@codesourcery.com>
1374 * omp-low.c (omp_copy_decl_2): For !ctx, use record_vars to add new copy
1376 (scan_sharing_clauses): Place copy of OMP_CLAUSE_IN_REDUCTION decl in
1377 ctx->outer instead of ctx.
1379 2021-10-20 Martin Liska <mliska@suse.cz>
1382 2021-10-19 Martin Liska <mliska@suse.cz>
1385 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
1386 * system.h (strip_whilespaces): New function.
1388 2021-10-20 Martin Liska <mliska@suse.cz>
1391 2021-10-19 Martin Liska <mliska@suse.cz>
1394 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
1397 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
1399 * config/aarch64/aarch64.c (aarch64_case_values_threshold):
1400 Change to 8 with -Os, 11 otherwise.
1402 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
1404 * config/aarch64/aarch64.c (neoversev1_tunings):
1405 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
1406 (neoversen2_tunings): Likewise.
1408 2021-10-20 Wilco Dijkstra <wdijkstr@arm.com>
1411 * config/aarch64/aarch64.md (frint_pattern): Update comment.
1412 * config/aarch64/aarch64-simd-builtins.def: Change frintn to roundeven.
1413 * config/aarch64/arm_fp16.h: Change frintn to roundeven.
1414 * config/aarch64/arm_neon.h: Likewise.
1415 * config/aarch64/iterators.md (frint_pattern): Use roundeven for FRINTN.
1417 2021-10-20 Martin Liska <mliska@suse.cz>
1419 * config/arm/arm.c (arm_unwind_emit_sequence): Do not declare
1420 already declared global variable.
1421 (arm_unwind_emit_set): Use out_file as function argument.
1422 (arm_unwind_emit): Likewise.
1423 * config/darwin.c (machopic_output_data_section_indirection): Likewise.
1424 (machopic_output_stub_indirection): Likewise.
1425 (machopic_output_indirection): Likewise.
1426 (machopic_finish): Likewise.
1427 * config/i386/i386.c (ix86_asm_output_function_label): Likewise.
1428 * config/i386/winnt.c (i386_pe_seh_unwind_emit): Likewise.
1429 * config/ia64/ia64.c (process_epilogue): Likewise.
1430 (process_cfa_adjust_cfa): Likewise.
1431 (process_cfa_register): Likewise.
1432 (process_cfa_offset): Likewise.
1433 (ia64_asm_unwind_emit): Likewise.
1434 * config/s390/s390.c (s390_asm_output_function_label): Likewise.
1436 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
1438 * config/aarch64/aarch64-builtins.c
1439 (aarch64_general_gimple_fold_builtin):
1440 lower vld1 and vst1 variants of the neon builtins
1441 * config/aarch64/aarch64-protos.h:
1442 (aarch64_general_gimple_fold_builtin): Add gsi parameter.
1443 * config/aarch64/aarch64.c (aarch64_general_gimple_fold_builtin):
1446 2021-10-20 Andre Simoes Dias Vieira <andre.simoesdiasvieira@arm.com>
1448 * match.pd: Generate IFN_TRUNC.
1450 2021-10-20 Richard Biener <rguenther@suse.de>
1452 PR tree-optimization/102853
1453 * tree-data-ref.c (split_constant_offset_1): Bail out
1454 immediately if the expression traps on overflow.
1456 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1458 * tree-ssa-threadbackward.c (back_threader::~back_threader): Remove.
1460 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1462 * tree-ssa-threadupdate.c (back_jt_path_registry::adjust_paths_after_duplication):
1463 Remove superflous debugging message.
1464 (back_jt_path_registry::duplicate_thread_path): Same.
1466 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1468 * tree-ssa-threadbackward.c (back_threader_registry::back_threader_registry):
1470 (back_threader_registry::register_path): Remove m_threaded_paths.
1472 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1474 PR tree-optimization/102814
1475 * doc/invoke.texi: Document --param=max-fsm-thread-length.
1476 * params.opt: Add --param=max-fsm-thread-length.
1477 * tree-ssa-threadbackward.c
1478 (back_threader_profitability::profitable_path_p): Fail on paths
1479 longer than max-fsm-thread-length.
1481 2021-10-20 Eric Botcazou <ebotcazou@adacore.com>
1483 PR middle-end/102764
1484 * cfgexpand.c (expand_gimple_basic_block): Disregard a final debug
1485 statement to reset the current location for the outgoing edges.
1487 2021-10-20 Aldy Hernandez <aldyh@redhat.com>
1488 Richard Biener <rguenther@suse.de>
1490 * tree-ssa-threadupdate.c (cancel_thread): Dump threading reason
1491 on the same line as the threading cancellation.
1492 (jt_path_registry::cancel_invalid_paths): Avoid rotating loops.
1493 Avoid threading through loop headers where the path remains in the
1496 2021-10-20 Siddhesh Poyarekar <siddhesh@gotplt.org>
1498 * tree-object-size.c (unknown): Make into a function. Adjust
1500 (unknown_object_size): Simplify implementation.
1502 2021-10-20 Hongtao Liu <hongtao.liu@intel.com>
1503 Kewen Lin <linkw@linux.ibm.com>
1505 * doc/sourcebuild.texi (Effective-Target Keywords): Document
1506 vect_slp_v2qi_store, vect_slp_v4qi_store, vect_slp_v8qi_store,
1507 vect_slp_v16qi_store, vect_slp_v2hi_store,
1508 vect_slp_v4hi_store, vect_slp_v2si_store, vect_slp_v4si_store.
1510 2021-10-19 Jonathan Wakely <jwakely@redhat.com>
1512 * doc/extend.texi (Basic PowerPC Built-in Functions): Fix typo.
1514 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
1518 * config/rs6000/emmintrin.h: Guard POWER8 intrinsics.
1519 * config/rs6000/pmmintrin.h: Same.
1520 * config/rs6000/smmintrin.h: Same.
1521 * config/rs6000/tmmintrin.h: Same.
1523 2021-10-19 Paul A. Clarke <pc@us.ibm.com>
1525 * config.gcc (extra_headers): Add nmmintrin.h.
1527 2021-10-19 Richard Biener <rguenther@suse.de>
1529 * tree-vectorizer.h (vect_supportable_dr_alignment): Add
1530 misalignment parameter.
1531 * tree-vect-data-refs.c (vect_get_peeling_costs_all_drs):
1532 Do not change DR_MISALIGNMENT in place, instead pass the
1533 adjusted misalignment to vect_supportable_dr_alignment.
1534 (vect_peeling_supportable): Likewise.
1535 (vect_peeling_hash_get_lowest_cost): Adjust.
1536 (vect_enhance_data_refs_alignment): Likewise.
1537 (vect_vfa_access_size): Likewise.
1538 (vect_supportable_dr_alignment): Add misalignment
1539 parameter and simplify.
1540 * tree-vect-stmts.c (get_negative_load_store_type): Adjust.
1541 (get_group_load_store_type): Likewise.
1542 (get_load_store_type): Likewise.
1544 2021-10-19 Clément Chigot <clement.chigot@atos.net>
1546 * config/rs6000/rs6000.c (rs6000_xcoff_file_end): Move
1547 __tls_get_addr reference to .text csect.
1549 2021-10-19 Martin Liska <mliska@suse.cz>
1552 * config/aarch64/aarch64.c (aarch64_process_one_target_attr):
1555 2021-10-19 Richard Biener <rguenther@suse.de>
1557 * tree-vectorizer.h (vect_get_store_cost): Adjust signature.
1558 (vect_get_load_cost): Likewise.
1559 * tree-vect-data-refs.c (vect_get_data_access_cost): Get
1560 alignment support scheme and misalignment as arguments
1562 (vect_get_peeling_costs_all_drs): Compute that info here
1563 and note that we shouldn't need to.
1564 * tree-vect-stmts.c (vect_model_store_cost): Get
1565 alignment support scheme and misalignment as arguments.
1566 (vect_get_store_cost): Likewise.
1567 (vect_model_load_cost): Likewise.
1568 (vect_get_load_cost): Likewise.
1569 (vectorizable_store): Pass down alignment support scheme
1570 and misalignment to costing.
1571 (vectorizable_load): Likewise.
1573 2021-10-19 Richard Biener <rguenther@suse.de>
1575 * tree-vect-stmts.c (get_negative_load_store_type): Add
1576 offset output parameter and initialize it.
1577 (get_group_load_store_type): Likewise.
1578 (get_load_store_type): Likewise.
1579 (vectorizable_store): Use offset as computed by
1580 get_load_store_type.
1581 (vectorizable_load): Likewise.
1583 2021-10-19 Richard Biener <rguenther@suse.de>
1585 PR tree-optimization/102827
1586 * tree-if-conv.c (predicate_statements): Add pe parameter
1587 and use that edge to insert invariant stmts on.
1588 (combine_blocks): Pass through pe.
1589 (tree_if_conversion): Compute the edge to insert invariant
1590 stmts on and pass it along.
1592 2021-10-19 Roger Sayle <roger@nextmovesoftware.com>
1595 * config/bfin/bfin.md (addsubv2hi3, subaddv2hi3, ssaddsubv2hi3,
1596 sssubaddv2hi3): Swap the order of operators in vec_concat.
1598 2021-10-19 Xionghu Luo <luoxhu@linux.ibm.com>
1600 * config/rs6000/altivec.md (*altivec_vmrghb_internal): Delete.
1601 (altivec_vmrghb_direct): New.
1602 (*altivec_vmrghh_internal): Delete.
1603 (altivec_vmrghh_direct): New.
1604 (*altivec_vmrghw_internal): Delete.
1605 (altivec_vmrghw_direct_<mode>): New.
1606 (altivec_vmrghw_direct): Delete.
1607 (*altivec_vmrglb_internal): Delete.
1608 (altivec_vmrglb_direct): New.
1609 (*altivec_vmrglh_internal): Delete.
1610 (altivec_vmrglh_direct): New.
1611 (*altivec_vmrglw_internal): Delete.
1612 (altivec_vmrglw_direct_<mode>): New.
1613 (altivec_vmrglw_direct): Delete.
1614 * config/rs6000/rs6000-p8swap.c (rtx_is_swappable_p): Adjust.
1615 * config/rs6000/rs6000.c (altivec_expand_vec_perm_const):
1617 * config/rs6000/vsx.md (vsx_xxmrghw_<mode>): Adjust.
1618 (vsx_xxmrglw_<mode>): Adjust.
1620 2021-10-19 Aldy Hernandez <aldyh@redhat.com>
1622 * passes.def: Change threading comment before pass_ccp pass.
1624 2021-10-19 Haochen Gui <guihaoc@gcc.gnu.org>
1626 * config/rs6000/rs6000-call.c (altivec_expand_lxvr_builtin):
1627 Modify the expansion for sign extension. All extensions are done
1628 within VSX registers.
1630 2021-10-19 Richard Biener <rguenther@suse.de>
1632 * tree-vect-stmts.c (get_group_load_store_type): Add
1633 misalignment output parameter and initialize it.
1634 (get_group_load_store_type): Likewise.
1635 (vectorizable_store): Remove now redundant queries.
1636 (vectorizable_load): Likewise.
1638 2021-10-19 Richard Biener <rguenther@suse.de>
1640 * tree-vectorizer.h (vect_supportable_dr_alignment): Remove
1641 check_aligned argument.
1642 * tree-vect-data-refs.c (vect_supportable_dr_alignment):
1644 (vect_peeling_hash_insert): Add supportable_if_not_aligned
1645 argument and do not call vect_supportable_dr_alignment here.
1646 (vect_peeling_supportable): Adjust.
1647 (vect_enhance_data_refs_alignment): Compute whether the
1648 access is supported with different alignment here and
1649 pass that down to vect_peeling_hash_insert.
1650 (vect_vfa_access_size): Adjust.
1651 * tree-vect-stmts.c (vect_get_store_cost): Likewise.
1652 (vect_get_load_cost): Likewise.
1653 (get_negative_load_store_type): Likewise.
1654 (get_group_load_store_type): Likewise.
1655 (get_load_store_type): Likewise.
1657 2021-10-19 Martin Liska <mliska@suse.cz>
1660 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p): Strip whitespaces.
1661 * system.h (strip_whilespaces): New function.
1663 2021-10-19 dianhong xu <dianhong.xu@intel.com>
1665 * config/i386/avx512fp16intrin.h:
1666 (_mm512_set1_pch): New intrinsic.
1667 * config/i386/avx512fp16vlintrin.h:
1668 (_mm256_set1_pch): New intrinsic.
1669 (_mm_set1_pch): Ditto.
1671 2021-10-18 Andrew MacLeod <amacleod@redhat.com>
1673 PR tree-optimization/102796
1674 * gimple-range.cc (gimple_ranger::range_on_edge): Process EH edges
1675 normally. Return get_tree_range for non gimple_range_ssa_p names.
1676 (gimple_ranger::range_of_stmt): Use get_tree_range for non
1677 gimple_range_ssa_p names.
1679 2021-10-18 Uroš Bizjak <ubizjak@gmail.com>
1682 * config/i386/i386.c (ix86_print_operand_address):
1683 Error out for non-address_operand asm operands.
1685 2021-10-18 Richard Biener <rguenther@suse.de>
1687 * tree-vect-data-refs.c (vect_peeling_hash_insert): Do
1688 not auto-convert dr_alignment_support to bool.
1689 (vect_peeling_supportable): Likewise.
1690 (vect_enhance_data_refs_alignment): Likewise.
1691 (vect_supportable_dr_alignment): Commonize read/write case.
1692 * tree-vect-stmts.c (vect_get_store_cost): Use
1693 dr_alignment_support, not int, for the vect_supportable_dr_alignment
1695 (vect_get_load_cost): Likewise.
1697 2021-10-18 Siddhesh Poyarekar <siddhesh@gotplt.org>
1699 * tree-object-size.c (object_sizes_execute): Consolidate LHS
1700 null check and do it early.
1702 2021-10-18 Richard Biener <rguenther@suse.de>
1704 * tree-vect-stmts.c (vectorizable_store): Use the
1705 computed alignment scheme instead of querying
1708 2021-10-18 Richard Biener <rguenther@suse.de>
1710 * tree-vect-stmts.c (vectorizable_store): Do not recompute
1711 alignment scheme already determined by get_load_store_type.
1713 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
1715 * tree-ssa-threadbackward.c (class pass_thread_jumps_full):
1716 Clone corresponding pass.
1718 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
1720 * combine.c (recog_for_combine): For an unrecognized move/set of
1721 a constant, try force_const_mem to place it in the constant pool.
1722 * cse.c (constant_pool_entries_cost, constant_pool_entries_regcost):
1723 Delete global variables (that are no longer assigned a cost value).
1724 (cse_insn): Simplify logic for deciding whether to place a folded
1725 constant in the constant pool using force_const_mem.
1726 (cse_main): Remove zero initialization of constant_pool_entries_cost
1727 and constant_pool_entries_regcost.
1728 * config/i386/i386.c (ix86_rtx_costs): Make memory accesses
1729 fractionally more expensive, when optimizing for speed.
1731 2021-10-18 Martin Liska <mliska@suse.cz>
1733 PR gcov-profile/102746
1734 PR gcov-profile/102747
1735 * gcov.c (main): Return return_code.
1736 (output_gcov_file): Mark return_code when error happens.
1737 (generate_results): Likewise.
1738 (read_graph_file): Likewise.
1739 (read_count_file): Likewise.
1741 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
1743 * config/bfin/bfin.md (define_constants): Remove UNSPEC_ONES.
1744 (define_insn "ones"): Replace UNSPEC_ONES with a truncate of
1745 a popcount, allowing compile-time evaluation/simplification.
1746 (popcountsi2, popcounthi2): New expanders using a "ones" insn.
1748 2021-10-18 Richard Biener <rguenther@suse.de>
1750 PR tree-optimization/102788
1751 * tree-vect-patterns.c (vect_init_pattern_stmt): Allow
1753 (vect_pattern_recog_1): Likewise.
1754 (vect_recog_bool_pattern): Continue matching the pattern
1755 even if we do not have a vector type for a conversion
1758 2021-10-18 Roger Sayle <roger@nextmovesoftware.com>
1760 * simplify-rtx.c (simplify_const_unary_operation) [SS_NEG, SS_ABS]:
1761 Evalute SS_NEG and SS_ABS of a constant argument.
1763 2021-10-18 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
1766 * gimple-match-head.c (try_conditional_simplification): Add case for single operand.
1767 * internal-fn.def: Add entry for COND_NEG internal function.
1768 * internal-fn.c (FOR_EACH_CODE_MAPPING): Add entry for
1769 NEGATE_EXPR, COND_NEG mapping.
1770 * optabs.def: Add entry for cond_neg_optab.
1771 * match.pd (UNCOND_UNARY, COND_UNARY): New operator lists.
1772 (vec_cond COND (foo A) B) -> (IFN_COND_FOO COND A B): New pattern.
1773 (vec_cond COND B (foo A)) -> (IFN_COND_FOO ~COND A B): Likewise.
1775 2021-10-18 Aldy Hernandez <aldyh@redhat.com>
1777 * tree-ssa-strlen.c (class strlen_pass): Rename from
1779 (handle_builtin_stxncpy_strncat): Move to strlen_pass.
1780 (handle_assign): Same.
1781 (adjust_last_stmt): Same.
1782 (maybe_warn_overflow): Same.
1783 (handle_builtin_strlen): Same.
1784 (handle_builtin_strchr): Same.
1785 (handle_builtin_strcpy): Same.
1786 (handle_builtin_strncat): Same.
1787 (handle_builtin_stxncpy_strncat): Same.
1788 (handle_builtin_memcpy): Same.
1789 (handle_builtin_strcat): Same.
1790 (handle_alloc_call): Same.
1791 (handle_builtin_memset): Same.
1792 (handle_builtin_memcmp): Same.
1793 (get_len_or_size): Same.
1794 (strxcmp_eqz_result): Same.
1795 (handle_builtin_string_cmp): Same.
1796 (handle_pointer_plus): Same.
1797 (count_nonzero_bytes_addr): Same.
1798 (count_nonzero_bytes): Same.
1799 (handle_store): Same.
1800 (strlen_check_and_optimize_call): Same.
1801 (handle_integral_assign): Same.
1802 (check_and_optimize_stmt): Same.
1803 (printf_strlen_execute): Rename strlen_dom_walker to strlen_pass.
1805 2021-10-18 Richard Biener <rguenther@suse.de>
1807 PR tree-optimization/102798
1808 * tree-vect-data-refs.c (vect_create_addr_base_for_vector_ref):
1809 Only copy points-to info to newly generated SSA names.
1811 2021-10-18 Martin Liska <mliska@suse.cz>
1813 * dbgcnt.c (dbg_cnt_process_opt): Remove unused but set variable.
1814 * gcov.c (get_cycles_count): Likewise.
1815 * lto-compress.c (lto_compression_zlib): Likewise.
1816 (lto_uncompression_zlib): Likewise.
1817 * targhooks.c (default_pch_valid_p): Likewise.
1819 2021-10-17 Aldy Hernandez <aldyh@redhat.com>
1821 * tree-pass.h (make_pass_thread_jumps_full): New.
1822 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Inline.
1823 (try_thread_blocks): Add resolve and speed arguments.
1824 (pass_thread_jumps::execute): Inline.
1825 (do_early_thread_jumps): New.
1826 (do_thread_jumps): New.
1827 (make_pass_thread_jumps): Move.
1828 (pass_early_thread_jumps::gate): Inline.
1829 (pass_early_thread_jumps::execute): Inline.
1830 (class pass_thread_jumps_full): New.
1832 2021-10-16 Piotr Kubaj <pkubaj@FreeBSD.org>
1834 * configure.ac: Treat powerpc64*-*-freebsd* the same as
1835 powerpc64-*-freebsd*.
1836 * configure: Regenerate.
1838 2021-10-16 H.J. Lu <hjl.tools@gmail.com>
1840 * value-query.cc (get_ssa_name_ptr_info_nonnull): Change
1841 set_ptr_nonull to set_ptr_nonnull in comments.
1843 2021-10-16 Jan Hubicka <hubicka@ucw.cz>
1845 PR tree-optimization/102720
1846 * tree-ssa-structalias.c (compute_points_to_sets): Fix producing
1847 of call used and clobbered sets.
1849 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
1851 * config/darwin.h (LINK_COMMAND_SPEC_A): Update 'r' handling to
1852 skip gomp and itm when r or nodefaultlibs is given.
1853 (DSYMUTIL_SPEC): Do not call dsymutil for '-r' link lines.
1854 Update ordering of exclusions, remove duplicate 'v' addition
1855 (collect2 will add this from the main command line).
1857 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
1859 * config/darwin-driver.c (darwin_driver_init): Revise comments, handle
1860 filelist and framework options in specs instead of code.
1861 * config/darwin.h (SUBTARGET_DRIVER_SELF_SPECS): Update to handle link
1862 specs that are really driver ones.
1863 (DARWIN_CC1_SPEC): Likewise.
1864 (CPP_SPEC): Likewise.
1865 (SYSROOT_SPEC): Append space.
1866 (LINK_SYSROOT_SPEC): Remove most driver link specs.
1867 (STANDARD_STARTFILE_PREFIX_2): Update link-related specs.
1868 (STARTFILE_SPEC): Likewise.
1869 (ASM_MMACOSX_VERSION_MIN_SPEC): Fix line wrap.
1870 (ASM_SPEC): Update driver-related specs.
1871 (ASM_FINAL_SPEC): Likewise.
1872 * config/darwin.opt: Remove now unused option aliases.
1873 * config/i386/darwin.h (EXTRA_ASM_OPTS): Ensure space after opt.
1874 (ASM_SPEC): Update driver-related specs.
1876 2021-10-15 Roger Sayle <roger@nextmovesoftware.com>
1878 * config/i386/i386.c (ix86_hardreg_mov_ok): For vector modes,
1879 allow standard_sse_constant_p immediate constants.
1881 2021-10-15 Iain Sandoe <iain@sandoe.co.uk>
1883 * config.gcc: Add tm-dwarf2.h to tm_d-file.
1885 2021-10-15 Andrew MacLeod <amacleod@redhat.com>
1887 * gimple-range-fold.h (gimple_range_ssa_p): Don't process names
1888 that occur in abnormal phis.
1889 * gimple-range.cc (gimple_ranger::range_on_edge): Return false for
1890 abnormal and EH edges.
1891 * gimple-ssa-evrp.c (rvrp_folder::value_of_expr): Ditto.
1892 (rvrp_folder::value_on_edge): Ditto.
1893 (rvrp_folder::value_of_stmt): Ditto.
1894 (hybrid_folder::value_of_expr): Ditto for ranger queries.
1895 (hybrid_folder::value_on_edge): Ditto.
1896 (hybrid_folder::value_of_stmt): Ditto.
1897 * value-query.cc (gimple_range_global): Always return a range if
1898 the type is supported.
1900 2021-10-15 John David Anglin <danglin@gcc.gnu.org>
1902 * config/pa/pa.md: Consistently use "rG" constraint for copy
1903 instruction in move patterns.
1905 2021-10-15 Andrew Stubbs <ams@codesourcery.com>
1907 * config/gcn/gcn-hsa.h (S_FIJI): Set unconditionally.
1910 * config/gcn/gcn.c: Hard code SRAM ECC settings for old architectures.
1911 * config/gcn/mkoffload.c (ELFABIVERSION_AMDGPU_HSA): Rename to ...
1912 (ELFABIVERSION_AMDGPU_HSA_V3): ... this.
1913 (ELFABIVERSION_AMDGPU_HSA_V4): New.
1914 (SET_SRAM_ECC_UNSUPPORTED): New.
1915 (copy_early_debug_info): Create elf flags to match the other objects.
1916 (main): Just let the attribute flags pass through.
1918 2021-10-15 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
1920 * tree-loop-distribution.c (reduction_var_overflows_first):
1921 Pass the type of reduction_var as first argument as it is also
1922 done for the load type.
1923 (loop_distribution::transform_reduction_loop): Add missing
1924 TREE_TYPE while determining precission of reduction_var.
1926 2021-10-15 Richard Biener <rguenther@suse.de>
1928 * defaults.h (PREFERRED_DEBUGGING_TYPE): Choose DWARF2_DEBUG
1930 * toplev.c (process_options): Warn when STABS debugging is
1931 enabled but not the preferred format.
1932 * config/pa/som.h (PREFERRED_DEBUGGING_TYPE): Define to
1934 * config/pdp11/pdp11.h (PREFERRED_DEBUGGING_TYPE): Likewise.
1936 2021-10-15 Richard Biener <rguenther@suse.de>
1939 * tree-inline.c (copy_bb): Avoid underflowing nargs.
1941 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
1943 * config/i386/i386-expand.c (ix86_expand_vec_perm): Convert
1944 HFmode input operand to HImode.
1945 (ix86_vectorize_vec_perm_const): Likewise.
1946 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1_hf):
1948 (*avx512f_permvar_truncv8siv8hi_1_hf):
1951 2021-10-15 Richard Biener <rguenther@suse.de>
1953 PR middle-end/102682
1954 * expmed.c (store_bit_field_1): Ensure a LHS subreg would
1955 not create a paradoxical subreg.
1957 2021-10-15 Hongyu Wang <hongyu.wang@intel.com>
1959 * config/i386/i386-expand.c (ix86_expand_vector_init):
1960 For half_vector concat for HFmode, handle them like HImode.
1962 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1964 * tree-ssa-threadbackward.c (class back_threader): Add m_resolve.
1965 (back_threader::back_threader): Same.
1966 (back_threader::resolve_phi): Try to solve without looking back if
1968 (back_threader::find_paths_to_names): Same.
1969 (try_thread_blocks): Pass resolve argument to back threader.
1970 (pass_early_thread_jumps::execute): Same.
1972 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1974 * doc/invoke.texi: Remove max-fsm-thread-length,
1975 max-fsm-thread-paths, and fsm-maximum-phi-arguments.
1977 * tree-ssa-threadbackward.c (back_threader::back_threader): Remove
1979 (back_threader_registry::back_threader_registry): Same.
1980 (back_threader_profitability::profitable_path_p): Remove
1981 param_max_fsm_thread-length.
1982 (back_threader_registry::register_path): Remove
1983 m_max_allowable_paths.
1985 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
1987 * tree-ssa-threadbackward.c (class back_threader): Make m_imports
1989 (back_threader::~back_threader): Do not release m_path.
1991 2021-10-14 Andrew MacLeod <amacleod@redhat.com>
1993 PR tree-optimization/102738
1994 * vr-values.c (simplify_using_ranges::simplify): Handle RSHIFT_EXPR.
1996 2021-10-14 Kwok Cheung Yeung <kcy@codesourcery.com>
1998 * omp-general.c (omp_check_context_selector): Move from c-omp.c.
1999 (omp_mark_declare_variant): Move from c-omp.c.
2000 (omp_context_name_list_prop): Update for Fortran strings.
2001 * omp-general.h (omp_check_context_selector): New prototype.
2002 (omp_mark_declare_variant): New prototype.
2004 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
2006 * config/rs6000/rs6000.c (rs6000_density_test): Move early
2007 exit test further up the function.
2009 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
2011 * config/arm/arm.c (arm_add_stmt_cost): Delete.
2012 (TARGET_VECTORIZE_ADD_STMT_COST): Delete.
2014 2021-10-14 Martin Jambor <mjambor@suse.cz>
2016 * doc/invoke.texi (Optimize Options): Add entry for
2017 ipa-cp-recursive-freq-factor.
2019 2021-10-14 Tamar Christina <tamar.christina@arm.com>
2021 * match.pd: New rule.
2023 2021-10-14 Jan Hubicka <hubicka@ucw.cz>
2026 * ipa-modref-tree.h (modref_access_node::update2):
2027 Also check that parm_offset is unchanged.
2028 (modref_ref_node::insert_access): Fix updating of
2031 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
2033 * tree-ssa-threadbackward.c (back_threader::resolve_phi): Add
2036 2021-10-14 Richard Biener <rguenther@suse.de>
2038 PR tree-optimization/102659
2039 * tree-if-conv.c (if_convertible_gimple_assign_stmt_p): Also
2040 rewrite pointer typed undefined overflow operations.
2041 (predicate_statements): Likewise. Make sure to emit invariant
2042 conversions in the preheader.
2043 * tree-vectorizer.c (vect_loop_vectorized_call): Look through
2044 non-empty preheaders.
2045 * tree-data-ref.c (dr_analyze_indices): Strip useless
2046 conversions to the MEM_REF base type.
2048 2021-10-14 Martin Liska <mliska@suse.cz>
2050 * common.opt: Stop using AUTODETECT_VALUE
2051 and use EnabledBy where possible.
2052 * opts.c: Enable OPT_fvar_tracking with optimize >= 1.
2053 * toplev.c (AUTODETECT_VALUE): Remove macro.
2054 (process_options): Simplify by using EnabledBy and
2055 OPT_fvar_tracking. Use OPTION_SET_P macro instead of
2058 2021-10-14 Jonathan Wright <jonathan.wright@arm.com>
2060 * config/aarch64/arm_neon.h (vld1_s8_x3): Use signed type for
2062 (vld1_s32_x3): Likewise.
2064 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
2066 PR tree-optimization/102736
2067 PR tree-optimization/102736
2068 * gimple-range-path.cc (path_range_query::range_on_path_entry):
2069 Assert that the requested range is defined outside the path.
2070 (path_range_query::ssa_range_in_phi): Do not call
2071 range_on_path_entry for SSA names that are defined within the
2074 2021-10-14 Iain Sandoe <iain@sandoe.co.uk>
2076 * config/darwin-driver.c (darwin_find_version_from_kernel):
2077 Quote internal identifiers and avoid contractions in
2079 (darwin_default_min_version): Likewise.
2080 (darwin_driver_init): Likewise.
2082 2021-10-14 Martin Jambor <mjambor@suse.cz>
2084 * params.opt (ipa-cp-recursive-freq-factor): New.
2085 * ipa-cp.c (ipcp_value): Switch to inline initialization. New members
2086 scc_no, self_recursion_generated_level, same_scc and
2087 self_recursion_generated_p.
2088 (ipcp_lattice::add_value): Replaced parameter unlimited with
2089 same_lat_gen_level, usit it determine limit of values and store it to
2091 (ipcp_lattice<valtype>::print): Dump the new fileds.
2092 (allocate_and_init_ipcp_value): Take same_lat_gen_level as a new
2093 parameter and store it to the new value.
2094 (self_recursively_generated_p): Removed.
2095 (propagate_vals_across_arith_jfunc): Use self_recursion_generated_p
2096 instead of self_recursively_generated_p, store self generation level
2098 (value_topo_info<valtype>::add_val): Set scc_no.
2099 (value_topo_info<valtype>::propagate_effects): Multiply frequencies of
2100 recursively feeding values and self generated values by appropriate
2103 2021-10-14 Richard Sandiford <richard.sandiford@arm.com>
2105 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Remove
2106 redundant test for flag_vect_cost_model.
2108 2021-10-14 Aldy Hernandez <aldyh@redhat.com>
2110 * bitmap.c (debug): New overloaded function for auto_bitmaps.
2111 * bitmap.h (debug): Same.
2113 2021-10-14 Hongyu Wang <hongyu.wang@intel.com>
2115 * config/i386/avx512fp16intrin.h (_mm512_mask_fcmadd_pch):
2116 Adjust builtin call.
2117 (_mm512_mask3_fcmadd_pch): Likewise.
2118 (_mm512_mask_fmadd_pch): Likewise
2119 (_mm512_mask3_fmadd_pch): Likewise
2120 (_mm512_mask_fcmadd_round_pch): Likewise
2121 (_mm512_mask3_fcmadd_round_pch): Likewise
2122 (_mm512_mask_fmadd_round_pch): Likewise
2123 (_mm512_mask3_fmadd_round_pch): Likewise
2124 (_mm_mask_fcmadd_sch): Likewise
2125 (_mm_mask3_fcmadd_sch): Likewise
2126 (_mm_mask_fmadd_sch): Likewise
2127 (_mm_mask3_fmadd_sch): Likewise
2128 (_mm_mask_fcmadd_round_sch): Likewise
2129 (_mm_mask3_fcmadd_round_sch): Likewise
2130 (_mm_mask_fmadd_round_sch): Likewise
2131 (_mm_mask3_fmadd_round_sch): Likewise
2132 (_mm_fcmadd_round_sch): Likewise
2133 * config/i386/avx512fp16vlintrin.h (_mm_mask_fmadd_pch):
2134 Adjust builtin call.
2135 (_mm_mask3_fmadd_pch): Likewise
2136 (_mm256_mask_fmadd_pch): Likewise
2137 (_mm256_mask3_fmadd_pch): Likewise
2138 (_mm_mask_fcmadd_pch): Likewise
2139 (_mm_mask3_fcmadd_pch): Likewise
2140 (_mm256_mask_fcmadd_pch): Likewise
2141 (_mm256_mask3_fcmadd_pch): Likewise
2142 * config/i386/i386-builtin.def: Add mask3 builtin for complex
2143 fma, and adjust mask_builtin to corresponding expander.
2144 * config/i386/i386-expand.c (ix86_expand_round_builtin):
2145 Skip eraseing embedded rounding for expanders that emits
2147 * config/i386/sse.md (complexmove): New mode_attr.
2148 (<avx512>_fmaddc_<mode>_mask1<round_expand_name>): New expander.
2149 (<avx512>_fcmaddc_<mode>_mask1<round_expand_name>): Likewise.
2150 (avx512fp16_fmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
2151 (avx512fp16_fcmaddcsh_v8hf_mask1<round_expand_name>): Likewise.
2152 (avx512fp16_fcmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
2153 (avx512fp16_fmaddcsh_v8hf_mask3<round_expand_name>): Likewise.
2154 * config/i386/subst.md (round_embedded_complex): New subst.
2156 2021-10-14 John David Anglin <danglin@gcc.gnu.org>
2158 * config/pa/pa.md (cbranchsf4): Disable if TARGET_SOFT_FLOAT.
2159 (cbranchdf4): Likewise.
2160 Add missing move patterns for TARGET_SOFT_FLOAT.
2162 2021-10-13 Roger Sayle <roger@nextmovesoftware.com>
2164 * config/i386/i386-expand.c (ix86_expand_vector_move): Use a
2165 pseudo intermediate when moving a SUBREG into a hard register,
2166 by checking ix86_hardreg_mov_ok.
2167 (ix86_expand_vector_extract): Store zero-extended SImode
2168 intermediate in a pseudo, then set target using a SUBREG_PROMOTED
2170 * config/i386/sse.md (mov<VMOVE>_internal): Prevent CSE creating
2171 complex (SUBREG) sets of (vector) hard registers before reload, by
2172 checking ix86_hardreg_mov_ok.
2174 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
2176 * ctfc.h (enum ctf_dtu_d_union_enum): Remove redundant comma.
2178 2021-10-13 Indu Bhagat <indu.bhagat@oracle.com>
2180 * dwarf2ctf.c (gen_ctf_array_type): Fix typo in comment.
2182 2021-10-13 Martin Sebor <msebor@redhat.com>
2184 PR middle-end/102630
2185 * pointer-query.cc (compute_objsize_r): Handle named address spaces.
2187 2021-10-13 Iain Sandoe <iain@sandoe.co.uk>
2189 * collect2.c (is_lto_object_file): Release simple-object
2190 resources, close files.
2192 2021-10-13 Andre Vieira <andre.simoesdiasvieira@arm.com>
2194 * config/arm/arm.c (thumb2_legitimate_address_p): Use VALID_MVE_MODE
2195 when checking mve addressing modes.
2196 (mve_vector_mem_operand): Fix the way we handle pre, post and offset
2198 (arm_print_operand): Fix printing of POST_ and PRE_MODIFY.
2199 * config/arm/mve.md: Use mve_memory_operand predicate everywhere where
2200 there is a single Ux constraint.
2202 2021-10-13 John David Anglin <danglin@gcc.gnu.org>
2204 * config/pa/pa.md (muldi3): Add support for inlining 64-bit
2205 multiplication on 32-bit PA 1.1 and 2.0 targets.
2207 2021-10-13 Martin Liska <mliska@suse.cz>
2209 PR gcov-profile/90364
2210 * coverage.c (build_info): Emit checksum to the global variable.
2211 (build_info_type): Add new field for checksum.
2212 (coverage_obj_finish): Pass object_checksum.
2213 (coverage_init): Use 0 as checksum for .gcno files.
2214 * gcov-dump.c (dump_gcov_file): Dump also new checksum field.
2215 * gcov.c (read_graph_file): Read also checksum.
2216 * doc/invoke.texi: Document the behaviour change.
2218 2021-10-13 Richard Biener <rguenther@suse.de>
2220 * gimple-iterator.h (gsi_iterator_update): Add GSI_LAST_NEW_STMT,
2221 start at integer value 2.
2222 * gimple-iterator.c (gsi_insert_seq_nodes_before): Update
2223 the iterator for GSI_LAST_NEW_STMT.
2224 (gsi_insert_seq_nodes_after): Likewise.
2225 * tree-if-conv.c (predicate_statements): Use GSI_LAST_NEW_STMT.
2226 * tree-ssa.c (execute_update_addresses_taken): Correct bogus
2227 arguments to gsi_replace.
2229 2021-10-13 Martin Liska <mliska@suse.cz>
2232 * common.opt: Use EnabledBy instead of detection in
2233 finish_options and process_options.
2234 * opts.c (finish_options): Remove handling of
2235 x_flag_unroll_all_loops.
2236 * toplev.c (process_options): Likewise for flag_web and
2237 flag_rename_registers.
2239 2021-10-13 Richard Biener <rguenther@suse.de>
2241 PR tree-optimization/102659
2242 * tree-if-conv.c (need_to_rewrite_undefined): New flag.
2243 (if_convertible_gimple_assign_stmt_p): Mark the loop for
2244 rewrite when stmts with undefined behavior on integer
2246 (combine_blocks): Predicate also when we need to rewrite stmts.
2247 (predicate_statements): Rewrite affected stmts to something
2248 with well-defined behavior on overflow.
2249 (tree_if_conversion): Initialize need_to_rewrite_undefined.
2251 2021-10-13 Richard Biener <rguenther@suse.de>
2254 * ipa-sra.c (ptr_parm_has_nonarg_uses): Fix volatileness
2257 2021-10-13 Jose E. Marchesi <jose.marchesi@oracle.com>
2259 * dwarf2ctf.c: Fix typo in comment.
2261 2021-10-12 Bill Schmidt <wschmidt@linux.ibm.com>
2264 * config/rs6000/altivec.h (vec_cpsgn): Swap operand order.
2265 * config/rs6000/rs6000-overload.def (VEC_COPYSIGN): Use SKIP to
2266 avoid generating an automatic #define of vec_cpsgn. Use the
2267 correct built-in for V4SFmode that doesn't depend on VSX.
2269 2021-10-12 Uroš Bizjak <ubizjak@gmail.com>
2273 * config/i386/i386.md (*add<mode>_1_slp): Rewrite as
2274 define_insn_and_split pattern. Add alternative 1 and split it
2275 post reload to insert operand 1 into the low part of operand 0.
2276 (*sub<mode>_1_slp): Ditto.
2277 (*and<mode>_1_slp): Ditto.
2278 (*<any_or:code><mode>_1_slp): Ditto.
2279 (*ashl<mode>3_1_slp): Ditto.
2280 (*<any_shiftrt:insn><mode>3_1_slp): Ditto.
2281 (*<any_rotate:insn><mode>3_1_slp): Ditto.
2282 (*neg<mode>_1_slp): New insn_and_split pattern.
2283 (*one_cmpl<mode>_1_slp): Ditto.
2285 2021-10-12 David Edelsohn <dje.gcc@gmail.com>
2287 * doc/install.texi: Update MinGW and mingw-64 Binaries
2290 2021-10-12 Daniel Le Duc Khoi Nguyen <greenrecyclebin@gmail.com>
2292 * doc/extend.texi (Common Variable Attributes): Fix typos in
2293 alloc_size documentation.
2295 2021-10-12 Richard Biener <rguenther@suse.de>
2297 PR tree-optimization/102696
2298 * tree-vect-slp.c (vect_build_slp_tree_2): Properly mark
2299 the tree fatally failed when we reject a BIT_FIELD_REF.
2301 2021-10-12 Richard Biener <rguenther@suse.de>
2303 PR tree-optimization/102572
2304 * tree-vect-stmts.c (vect_build_gather_load_calls): When
2305 gathering the vectorized defs for the mask pass in the
2306 desired mask vector type so invariants will be handled
2309 2021-10-12 Tamar Christina <tamar.christina@arm.com>
2311 * config/aarch64/aarch64-sve.md (*fcm<cmp_op><mode>_bic_combine,
2312 *fcm<cmp_op><mode>_nor_combine, *fcmuo<mode>_bic_combine,
2313 *fcmuo<mode>_nor_combine): New.
2315 2021-10-12 Eric Botcazou <ebotcazou@adacore.com>
2318 * config/sparc/sparc-modes.def (OI): New integer mode.
2320 2021-10-12 Jakub Jelinek <jakub@redhat.com>
2322 * gimple-fold.h (clear_padding_type_may_have_padding_p): Declare.
2323 * gimple-fold.c (clear_padding_type_may_have_padding_p): No longer
2326 2021-10-12 Jakub Jelinek <jakub@redhat.com>
2328 * tree-vectorizer.h (loop_cost_model): New function.
2329 (unlimited_cost_model): Use it.
2330 * tree-vect-loop.c (vect_analyze_loop_costing): Use loop_cost_model
2331 call instead of flag_vect_cost_model.
2332 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment): Likewise.
2333 (vect_prune_runtime_alias_test_list): Likewise. Also use it instead
2334 of flag_simd_cost_model.
2336 2021-10-12 liuhongt <hongtao.liu@intel.com>
2339 * config/i386/i386-expand.c (emit_reduc_half): Handle
2341 * config/i386/mmx.md (reduc_<code>_scal_v4qi): New expander.
2342 (reduc_plus_scal_v4qi): Ditto.
2344 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
2346 * config/rs6000/smmintrin.h (_mm_cmpeq_epi64, _mm_cmpgt_epi64,
2347 _mm_mullo_epi32, _mm_mul_epi32, _mm_packus_epi32): New.
2348 * config/rs6000/nmmintrin.h: Copy from i386, tweak to suit.
2350 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
2352 * config/rs6000/smmintrin.h (_mm_cvtepi8_epi16, _mm_cvtepi8_epi32,
2353 _mm_cvtepi8_epi64, _mm_cvtepi16_epi32, _mm_cvtepi16_epi64,
2354 _mm_cvtepi32_epi64, _mm_cvtepu8_epi16, _mm_cvtepu8_epi32,
2355 _mm_cvtepu8_epi64, _mm_cvtepu16_epi32, _mm_cvtepu16_epi64,
2356 _mm_cvtepu32_epi64): New.
2358 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
2360 * config/rs6000/smmintrin.h (_mm_test_all_zeros,
2361 _mm_test_all_ones, _mm_test_mix_ones_zeros): Rewrite as macro.
2363 2021-10-12 Paul A. Clarke <pc@us.ibm.com>
2365 * config/rs6000/smmintrin.h (_mm_min_epi8, _mm_min_epu16,
2366 _mm_min_epi32, _mm_min_epu32, _mm_max_epi8, _mm_max_epu16,
2367 _mm_max_epi32, _mm_max_epu32): New.
2369 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
2371 * ipa-modref-tree.h (struct modref_access_node): Revert
2373 (struct modref_ref_node): Likewise.
2375 2021-10-11 Jan Hubicka <hubicka@ucw.cz>
2377 * ipa-modref-tree.h (modref_tree::global_access_p): New member
2380 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
2381 ignore_stores_eaf_flags): Move to ipa-modref.h
2382 (remove_useless_eaf_flags): Remove early exit on NOCLOBBER.
2383 (modref_summary::global_memory_read_p): New member function.
2384 (modref_summary::global_memory_written_p): New member function.
2385 * ipa-modref.h (modref_summary::global_memory_read_p,
2386 modref_summary::global_memory_written_p): Declare.
2387 (implicint_const_eaf_flags,implicit_pure_eaf_flags,
2388 ignore_stores_eaf_flags): move here.
2389 * tree-ssa-structalias.c: Include ipa-modref-tree.h, ipa-modref.h
2391 (handle_rhs_call): Rewrite.
2392 (handle_call_arg): New function.
2393 (determine_global_memory_access): New function.
2394 (handle_const_call): Remove
2395 (handle_pure_call): Remove
2396 (find_func_aliases_for_call): Update use of handle_rhs_call.
2397 (compute_points_to_sets): Handle global memory acccesses
2400 2021-10-11 Diane Meirowitz <diane.meirowitz@oracle.com>
2402 * doc/invoke.texi: Add link to UndefinedBehaviorSanitizer
2403 documentation, mention UBSAN_OPTIONS, similar to what is done
2404 for AddressSanitizer.
2406 2021-10-11 Richard Biener <rguenther@suse.de>
2408 PR middle-end/102683
2409 * internal-fn.c (expand_DEFERRED_INIT): Check for mode
2410 availability before building an integer type for storage
2413 2021-10-11 Richard Biener <rguenther@suse.de>
2415 PR middle-end/101480
2416 * gimple.c (gimple_call_fnspec): Do not mark operator new/delete
2419 2021-10-11 Martin Liska <mliska@suse.cz>
2421 * common.opt: Remove Init(2) for some options.
2422 * toplev.c (process_options): Do not use AUTODETECT_VALUE, but
2423 use rather OPTION_SET_P.
2425 2021-10-11 Martin Liska <mliska@suse.cz>
2427 * common.opt: Remove usage of IRA_REGION_AUTODETECT.
2428 * flag-types.h (enum ira_region): Likewise.
2429 * toplev.c (process_options): Use OPTION_SET_P instead of
2430 IRA_REGION_AUTODETECT.
2432 2021-10-11 Jakub Jelinek <jakub@redhat.com>
2434 * omp-low.c (omp_runtime_api_call): Handle omp_get_max_teams,
2435 omp_[sg]et_teams_thread_limit and omp_set_num_teams.
2437 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
2439 * config/s390/s390-protos.h (s390_rawmemchr): Add prototype.
2440 * config/s390/s390.c (s390_rawmemchr): New function.
2441 * config/s390/s390.md (rawmemchr<SINT:mode>): New expander.
2442 * config/s390/vector.md (@vec_vfees<mode>): Basically a copy of
2443 the pattern vfees<mode> from vx-builtins.md.
2444 * config/s390/vx-builtins.md (*vfees<mode>): Remove.
2446 2021-10-11 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
2448 * builtins.c (get_memory_rtx): Change to external linkage.
2449 * builtins.h (get_memory_rtx): Add function prototype.
2450 * doc/md.texi (rawmemchr<mode>): Document.
2451 * internal-fn.c (expand_RAWMEMCHR): Define.
2452 * internal-fn.def (RAWMEMCHR): Add.
2453 * optabs.def (rawmemchr_optab): Add.
2454 * tree-loop-distribution.c (find_single_drs): Change return code
2455 behaviour by also returning true if no single store was found
2457 (loop_distribution::classify_partition): Respect the new return
2458 code behaviour of function find_single_drs.
2459 (loop_distribution::execute): Call new function
2460 transform_reduction_loop in order to replace rawmemchr or strlen
2461 like loops by calls into builtins.
2462 (generate_reduction_builtin_1): New function.
2463 (generate_rawmemchr_builtin): New function.
2464 (generate_strlen_builtin_1): New function.
2465 (generate_strlen_builtin): New function.
2466 (generate_strlen_builtin_using_rawmemchr): New function.
2467 (reduction_var_overflows_first): New function.
2468 (determine_reduction_stmt_1): New function.
2469 (determine_reduction_stmt): New function.
2470 (loop_distribution::transform_reduction_loop): New function.
2472 2021-10-11 Martin Liska <mliska@suse.cz>
2474 * tree.c (cl_option_hasher::hash): Use cl_optimization_hash
2475 and remove legacy hashing code.
2477 2021-10-11 Kito Cheng <kito.cheng@sifive.com>
2480 * builtins.c (maybe_emit_call_builtin___clear_cache): Allow
2481 CONST_INT for BEGIN and END, and use gcc_assert rather than
2484 2021-10-10 Jakub Jelinek <jakub@redhat.com>
2487 * var-tracking.c (add_stores): For cselib_sp_derived_value_p values
2488 use MO_VAL_SET if loc is not sp.
2490 2021-10-10 Andrew Pinski <apinski@marvell.com>
2492 PR tree-optimization/102622
2493 * match.pd: Swap the order of a?pow2cst:0 and a?-1:0 transformations.
2494 Swap the order of a?0:pow2cst and a?0:-1 transformations.
2496 2021-10-09 liuhongt <hongtao.liu@intel.com>
2499 * config/i386/i386-expand.c (ix86_valid_mask_cmp_mode): Handle
2501 (ix86_use_mask_cmp_p): Ditto.
2502 (ix86_expand_sse_movcc): Ditto.
2503 * config/i386/i386.md (setcc_hf_mask): New define_insn.
2504 (movhf_mask): Ditto.
2505 (UNSPEC_MOVCC_MASK): New unspec.
2506 * config/i386/sse.md (UNSPEC_PCMP): Move to i386.md.
2508 2021-10-08 Vladimir N. Makarov <vmakarov@redhat.com>
2510 PR rtl-optimization/102627
2511 * lra-constraints.c (split_reg): Use at least natural mode of hard reg.
2513 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
2515 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Grow
2518 2021-10-08 Aldy Hernandez <aldyh@redhat.com>
2520 * value-range.cc (irange::debug): New.
2521 * value-range.h (irange::debug): New.
2523 2021-10-08 Richard Sandiford <richard.sandiford@arm.com>
2525 PR tree-optimization/102385
2526 * predict.h (change_edge_frequency): Declare.
2527 * predict.c (change_edge_frequency): New function.
2528 * tree-ssa-loop-manip.h (tree_transform_and_unroll_loop): Remove
2530 (tree_unroll_loop): Likewise.
2531 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Update accordingly.
2532 * tree-predcom.c (pcom_worker::tree_predictive_commoning_loop):
2534 * tree-ssa-loop-prefetch.c (loop_prefetch_arrays): Likewise.
2535 * tree-ssa-loop-manip.c (tree_unroll_loop): Likewise.
2536 (tree_transform_and_unroll_loop): Likewise. Use single_dom_exit
2537 to retrieve the exit edges. Make all the old profile update code
2538 conditional on !single_loop_p -- the case it was written for --
2539 and use a different approach for the single-loop case.
2541 2021-10-08 Martin Liska <mliska@suse.cz>
2543 * config/alpha/alpha.c (alpha_option_override): Use new macro
2545 * config/arc/arc.c (arc_override_options): Likewise.
2546 * config/arm/arm.c (arm_option_override): Likewise.
2547 * config/bfin/bfin.c (bfin_load_pic_reg): Likewise.
2548 * config/c6x/c6x.c (c6x_option_override): Likewise.
2549 * config/csky/csky.c: Likewise.
2550 * config/darwin.c (darwin_override_options): Likewise.
2551 * config/frv/frv.c (frv_option_override): Likewise.
2552 * config/i386/djgpp.h: Likewise.
2553 * config/i386/i386.c (ix86_stack_protect_guard): Likewise.
2554 (ix86_max_noce_ifcvt_seq_cost): Likewise.
2555 * config/ia64/ia64.c (ia64_option_override): Likewise.
2556 (ia64_override_options_after_change): Likewise.
2557 * config/m32c/m32c.c (m32c_option_override): Likewise.
2558 * config/m32r/m32r.c (m32r_init): Likewise.
2559 * config/m68k/m68k.c (m68k_option_override): Likewise.
2560 * config/microblaze/microblaze.c (microblaze_option_override): Likewise.
2561 * config/mips/mips.c (mips_option_override): Likewise.
2562 * config/nios2/nios2.c (nios2_option_override): Likewise.
2563 * config/nvptx/nvptx.c (nvptx_option_override): Likewise.
2564 * config/pa/pa.c (pa_option_override): Likewise.
2565 * config/riscv/riscv.c (riscv_option_override): Likewise.
2566 * config/rs6000/aix71.h: Likewise.
2567 * config/rs6000/aix72.h: Likewise.
2568 * config/rs6000/aix73.h: Likewise.
2569 * config/rs6000/rs6000.c (darwin_rs6000_override_options): Likewise.
2570 (rs6000_override_options_after_change): Likewise.
2571 (rs6000_linux64_override_options): Likewise.
2572 (glibc_supports_ieee_128bit): Likewise.
2573 (rs6000_option_override_internal): Likewise.
2574 (rs6000_file_start): Likewise.
2575 (rs6000_darwin_file_start): Likewise.
2576 * config/rs6000/rtems.h: Likewise.
2577 * config/rs6000/sysv4.h: Likewise.
2578 * config/rs6000/vxworks.h (SUB3TARGET_OVERRIDE_OPTIONS): Likewise.
2579 * config/s390/s390.c (s390_option_override): Likewise.
2580 * config/sh/linux.h: Likewise.
2581 * config/sh/netbsd-elf.h (while): Likewise.
2582 * config/sh/sh.c (sh_option_override): Likewise.
2583 * config/sol2.c (solaris_override_options): Likewise.
2584 * config/sparc/sparc.c (sparc_option_override): Likewise.
2585 * config/tilegx/tilegx.c (tilegx_option_override): Likewise.
2586 * config/visium/visium.c (visium_option_override): Likewise.
2587 * config/vxworks.c (vxworks_override_options): Likewise.
2588 * lto-opts.c (lto_write_options): Likewise.
2589 * omp-expand.c (expand_omp_simd): Likewise.
2590 * omp-general.c (omp_max_vf): Likewise.
2591 * omp-offload.c (oacc_xform_loop): Likewise.
2592 * opts.h (OPTION_SET_P): Likewise.
2593 * targhooks.c (default_max_noce_ifcvt_seq_cost): Likewise.
2594 * toplev.c (process_options): Likewise.
2595 * tree-predcom.c: Likewise.
2596 * tree-sra.c (analyze_all_variable_accesses): Likewise.
2598 2021-10-08 liuhongt <hongtao.liu@intel.com>
2601 * config/i386/i386.c (ix86_optab_supported_p):
2602 Return true for HFmode.
2603 * match.pd: Simplify (_Float16) ceil ((double) x) to
2604 __builtin_ceilf16 (a) when a is _Float16 type and
2605 direct_internal_fn_supported_p.
2607 2021-10-08 liuhongt <hongtao.liu@intel.com>
2610 * config/i386/i386-expand.c (emit_reduc_half): Hanlde V4HImode.
2611 * config/i386/mmx.md (reduc_plus_scal_v4hi): New.
2612 (reduc_<code>_scal_v4hi): New.
2614 2021-10-08 liuhongt <hongtao.liu@intel.com>
2616 * common.opt (ftree-vectorize): Add Var(flag_tree_vectorize).
2617 * doc/invoke.texi (Options That Control Optimization): Update
2619 * opts.c (default_options_table): Enable auto-vectorization at
2620 O2 with very-cheap cost model.
2621 (finish_options): Use cheap cost model for
2622 explicit -ftree{,-loop}-vectorize.
2624 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
2626 * ctfc.c (ctfc_delete_container): Free hash table contents.
2628 2021-10-07 Indu Bhagat <indu.bhagat@oracle.com>
2630 * toplev.c (process_options): Do not warn for GNU GIMPLE.
2632 2021-10-07 Siddhesh Poyarekar <siddhesh@gotplt.org>
2634 * tree-object-size.c (addr_object_size,
2635 compute_builtin_object_size): Drop PDECL and POFF arguments.
2636 (addr_object_size): Adjust calls.
2637 * tree-object-size.h (compute_builtin_object_size): Drop PDECL
2640 2021-10-07 Roger Sayle <roger@nextmovesoftware.com>
2642 * rtl.def (SMUL_HIGHPART, UMUL_HIGHPART): New RTX codes for
2643 representing signed and unsigned high-part multiplication resp.
2644 * simplify-rtx.c (simplify_binary_operation_1) [SMUL_HIGHPART,
2645 UMUL_HIGHPART]: Simplify high-part multiplications by zero.
2646 [SS_PLUS, US_PLUS, SS_MINUS, US_MINUS, SS_MULT, US_MULT,
2647 SS_DIV, US_DIV]: Similar simplifications for saturating
2649 (simplify_const_binary_operation) [SS_PLUS, US_PLUS, SS_MINUS,
2650 US_MINUS, SS_MULT, US_MULT, SMUL_HIGHPART, UMUL_HIGHPART]:
2651 Implement compile-time evaluation for constant operands.
2652 * dwarf2out.c (mem_loc_descriptor): Skip SMUL_HIGHPART and
2654 * doc/rtl.texi (smul_highpart, umul_highpart): Document RTX codes.
2655 * doc/md.texi (smul@var{m}3_highpart, umul@var{m3}_highpart):
2656 Mention the new smul_highpart and umul_highpart RTX codes.
2657 * doc/invoke.texi: Silence @xref "compilation" warnings.
2659 2021-10-07 Martin Jambor <mjambor@suse.cz>
2662 * ipa-prop.c (ipa_edge_args_sum_t::duplicate): Also handle the
2663 case when the source reference description corresponds to a
2664 referance taken in a function src->caller is inlined to.
2666 2021-10-07 Jan Hubicka <hubicka@ucw.cz>
2669 * ipa-modref-tree.h (modref_access_node::contains_p): Handle offsets
2671 (modref_access_node::try_merge_with): Add sanity check that there
2672 are no redundant entries in the list.
2674 2021-10-07 Richard Biener <rguenther@suse.de>
2676 PR tree-optimization/102608
2677 * tree-ssa-sccvn.c (visit_stmt): Drop .DEFERRED_INIT to
2680 2021-10-07 Martin Liska <mliska@suse.cz>
2682 * toplev.c (toplev::main): Make
2683 save_opt_decoded_options a pointer type
2684 * toplev.h: Likewise.
2686 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
2688 * config/gcn/gcn-valu.md (gather<mode>_insn_2offsets<exec>): Apply
2689 HAVE_GCN_ASM_GLOBAL_LOAD_FIXED.
2690 (scatter<mode>_insn_2offsets<exec_scatter>): Likewise.
2692 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
2694 * config/gcn/gcn-hsa.h (SRAMOPT): Include the whole option string.
2695 Adjust for new -msram-ecc=any behaviour.
2696 (ASM_SPEC): Adjust -mxnack and -msram-ecc usage.
2697 * config/gcn/gcn.c (output_file_start): Implement -msram-ecc=any.
2698 * config/gcn/mkoffload.c (EF_AMDGPU_XNACK): Rename to ...
2699 (EF_AMDGPU_XNACK_V3): ... this.
2700 (EF_AMDGPU_SRAM_ECC): Rename to ...
2701 (EF_AMDGPU_SRAM_ECC_V3): ... this.
2702 (EF_AMDGPU_FEATURE_XNACK_V4): New.
2703 (EF_AMDGPU_FEATURE_XNACK_UNSUPPORTED_V4): New.
2704 (EF_AMDGPU_FEATURE_XNACK_ANY_V4): New.
2705 (EF_AMDGPU_FEATURE_XNACK_OFF_V4): New.
2706 (EF_AMDGPU_FEATURE_XNACK_ON_V4): New.
2707 (EF_AMDGPU_FEATURE_SRAMECC_V4): New.
2708 (EF_AMDGPU_FEATURE_SRAMECC_UNSUPPORTED_V4): New.
2709 (EF_AMDGPU_FEATURE_SRAMECC_ANY_V4): New.
2710 (EF_AMDGPU_FEATURE_SRAMECC_OFF_V4): New.
2711 (EF_AMDGPU_FEATURE_SRAMECC_ON_V4): New.
2712 (SET_XNACK_ON): New.
2713 (SET_XNACK_OFF): New.
2715 (SET_SRAM_ECC_ON): New.
2716 (SET_SRAM_ECC_ANY): New.
2717 (SET_SRAM_ECC_OFF): New.
2718 (TEST_SRAM_ECC_ANY): New.
2719 (TEST_SRAM_ECC_ON): New.
2720 (main): Implement HSACOv4 and -msram-ecc=any.
2722 2021-10-07 Andrew Stubbs <ams@codesourcery.com>
2724 * config.in: Regenerate.
2725 * config/gcn/gcn-hsa.h (X_FIJI): New macro.
2729 (A_FIJI): Rename to ...
2731 (A_900): Rename to ...
2733 (A_906): Rename to ...
2735 (A_908): Rename to ...
2737 (SRAMOPT): New macro.
2738 (ASM_SPEC): Adjust xnack option usage.
2739 * config/gcn/gcn.c (output_file_start): Adjust amdgcn_target usage.
2740 * configure: Regenerate.
2741 * configure.ac: Detect LLVM assembler dialect.
2743 2021-10-07 Richard Biener <rguenther@suse.de>
2745 * tree-pretty-print.c (dump_generic_node): Do not elide
2746 printing '&' when dumping with -gimple.
2748 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
2750 * gimple-range-cache.cc (non_null_ref::adjust_range): Call new
2752 * gimple-range-fold.cc (adjust_pointer_diff_expr): Ditto.
2753 (adjust_imagpart_expr): Ditto.
2754 * value-range.cc (irange::irange_intersect): Call new routine if
2755 RHS is a single pair.
2756 (irange::intersect): New wide_int version.
2757 * value-range.h (class irange): New prototype.
2759 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
2761 * gimple-range-edge.cc (gimple_outgoing_range::gimple_outgoing_range):
2762 Add parameter to limit size when recognizing switches.
2763 (gimple_outgoing_range::edge_range_p): Check size limit.
2764 * gimple-range-edge.h (gimple_outgoing_range): Add size field.
2765 * gimple-range-gori.cc (gori_map::calculate_gori): Ignore switches
2766 that exceed the size limit.
2767 (gori_compute::gori_compute): Add initializer.
2768 * params.opt (evrp-switch-limit): New.
2769 * doc/invoke.texi: Update docs.
2771 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
2773 * value-range.h (irange::set_varying): Use TYPE_MIN_VALUE and
2774 TYPE_MAX_VALUE instead of creating new trees when possible.
2776 2021-10-06 Andrew MacLeod <amacleod@redhat.com>
2778 * gimple-range-cache.cc (non_null_ref::adjust_range): Check for
2779 zero and non-zero more efficently.
2781 2021-10-06 Richard Biener <rguenther@suse.de>
2784 * dumpfile.h (TDF_GIMPLE_VAL): New.
2785 (dump_flag): Re-order and adjust TDF_* flags. Make
2786 the enum uint32_t. Use std::underlying_type in the
2788 (optgroup_flag): Likewise for the operator overloads.
2789 * tree-pretty-print.c (dump_generic_node): Wrap ADDR_EXPR
2790 in _Literal if TDF_GIMPLE_VAL.
2791 * gimple-pretty-print.c (dump_gimple_assign): Add
2792 TDF_GIMPLE_VAL to flags when dumping operands where only
2793 is_gimple_val are allowed.
2794 (dump_gimple_cond): Likewise.
2796 2021-10-06 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
2798 * gimple-isel.cc (gimple_expand_vec_cond_expr): Remove redundant if
2801 2021-10-05 qing zhao <qing.zhao@oracle.com>
2803 PR middle-end/102359
2804 * gimplify.c (gimplify_decl_expr): Not add initialization for an
2805 auto variable when it has been initialized by frontend.
2807 2021-10-05 Aldy Hernandez <aldyh@redhat.com>
2809 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
2812 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
2814 * common/config/avr/avr-common.c (avr_handle_option): Mark
2815 argument as ATTRIBUTE_UNUSED.
2817 2021-10-05 Jan-Benedict Glaw <jbglaw@lug-owl.de>
2819 * config/lm32/uclinux-elf.h (LINK_GCC_C_SEQUENCE_SPEC):
2820 Undefine before redefinition.
2822 2021-10-05 Richard Biener <rguenther@suse.de>
2824 * toplev.c (no_backend): Remove global var.
2825 (process_options): Pass in no_backend, move post_options
2826 langhook call to toplev::main.
2827 (do_compile): Pass in no_backend, move process_options call
2829 (toplev::run_self_tests): Check no_backend at the caller.
2830 (toplev::main): Call post_options and process_options
2831 split out from do_compile, do self-tests only if
2832 no_backend is initialized.
2834 2021-10-05 Richard Biener <rguenther@suse.de>
2836 * tree-cfg.c (dump_function_to_file): Dump the UID of the
2837 function as part of the name when requested.
2838 * tree-pretty-print.c (dump_function_name): Dump the UID when
2839 requested and the langhook produced the actual name.
2841 2021-10-05 Richard Biener <rguenther@suse.de>
2843 PR middle-end/102587
2844 PR middle-end/102285
2845 * internal-fn.c (expand_DEFERRED_INIT): Fall back to
2846 zero-initialization as last resort, use the constant
2847 size as given by the DEFERRED_INIT argument to build
2850 2021-10-04 Marek Polacek <polacek@redhat.com>
2853 * doc/invoke.texi: Document -Warray-compare.
2855 2021-10-04 Richard Biener <rguenther@suse.de>
2857 * gimplify.c (is_var_need_auto_init): DECL_HARD_REGISTER
2858 variables are not to be initialized.
2860 2021-10-04 Richard Biener <rguenther@suse.de>
2862 * expr.h (non_mem_decl_p): Declare.
2863 (mem_ref_refers_to_non_mem_p): Likewise.
2864 * expr.c (non_mem_decl_p): Export.
2865 (mem_ref_refers_to_non_mem_p): Likewise.
2866 * internal-fn.c (expand_DEFERRED_INIT): Do not expand the LHS
2867 but check the base with mem_ref_refers_to_non_mem_p
2870 2021-10-04 Richard Biener <rguenther@suse.de>
2872 PR tree-optimization/102570
2873 * tree-ssa-sccvn.h (vn_reference_op_struct): Document
2874 we are using clique for the internal function code.
2875 * tree-ssa-sccvn.c (vn_reference_op_eq): Compare the
2876 internal function code.
2877 (print_vn_reference_ops): Print the internal function code.
2878 (vn_reference_op_compute_hash): Hash it.
2879 (copy_reference_ops_from_call): Record it.
2880 (visit_stmt): Remove the restriction around internal function
2882 (fully_constant_vn_reference_p): Use fold_const_call and handle
2884 (vn_reference_eq): Compare call return types.
2885 * tree-ssa-pre.c (create_expression_by_pieces): Handle
2886 generating calls to internal functions.
2887 (compute_avail): Remove the restriction around internal function
2890 2021-10-04 Aldy Hernandez <aldyh@redhat.com>
2892 PR tree-optimization/102560
2893 * gimple-ssa-warn-alloca.c (alloca_call_type): Remove static
2894 marker for invalid_range.
2896 2021-10-04 Richard Biener <rguenther@suse.de>
2898 PR middle-end/102587
2899 * internal-fn.c (expand_DEFERRED_INIT): Guard register
2900 initialization path an avoid initializing VLA registers
2903 2021-10-04 Eric Botcazou <ebotcazou@adacore.com>
2905 * config/rs6000/vxworks.h (TARGET_INIT_LIBFUNCS): Delete.
2907 2021-10-03 Martin Liska <mliska@suse.cz>
2909 * toplev.c (toplev::main): Check opt_index if it is a part
2912 2021-10-02 Aldy Hernandez <aldyh@redhat.com>
2914 PR tree-optimization/102563
2915 * range-op.cc (operator_lshift::op1_range): Do not clobber
2918 2021-10-02 Martin Liska <mliska@suse.cz>
2920 * toplev.c (toplev::main): save_decoded_options[0] is program
2921 name and so it should be skipped.
2923 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
2925 PR tree-optimization/102546
2926 * range-op.cc (operator_lshift::op1_range): Teach range-ops that
2927 X << Y is non-zero implies X is also non-zero.
2929 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2931 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
2933 * config/aarch64/aarch64-tune.md: Regenerate.
2934 * doc/invoke.texi: Update docs.
2936 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2938 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
2940 * config/aarch64/aarch64-tune.md: Regenerate.
2941 * doc/invoke.texi: Update docs.
2943 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2945 * config/aarch64/aarch64-cores.def (AARCH64_CORE): New
2947 * config/aarch64/aarch64-tune.md: Regenerate.
2948 * doc/invoke.texi: Update docs.
2950 2021-10-01 Martin Sebor <msebor@redhat.com>
2953 * doc/invoke.texi (-Waddress): Update.
2954 * gengtype.c (write_types): Avoid -Waddress.
2955 * poly-int.h (POLY_SET_COEFF): Avoid using null.
2957 2021-10-01 John David Anglin <danglin@gcc.gnu.org>
2960 * config/pa/pa.c (pa_option_override): Default to dwarf version 4
2963 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2965 * config/aarch64/aarch64.h (AARCH64_FL_V9): Update value.
2967 2021-10-01 Aldy Hernandez <aldyh@redhat.com>
2969 * gimple-range-path.cc (path_range_query::compute_ranges): Use
2971 * gimple-range-path.h (class path_range_query): Remove shadowed
2973 (path_range_query::get_path_oracle): New.
2975 2021-10-01 Jakub Jelinek <jakub@redhat.com>
2976 Richard Biener <rguenther@suse.de>
2979 * doc/invoke.texi (-fsanitize=integer-divide-by-zero): Remove
2980 INT_MIN / -1 division detection from here ...
2981 (-fsanitize=signed-integer-overflow): ... and add it here.
2983 2021-10-01 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
2985 * config/aarch64/aarch64-arches.def (AARCH64_ARCH): Added
2987 * config/aarch64/aarch64.h (AARCH64_FL_V9): New.
2988 (AARCH64_FL_FOR_ARCH9): New flags for Armv9-A.
2989 (AARCH64_ISA_V9): New ISA flag.
2990 * doc/invoke.texi: Update docs.
2992 2021-10-01 Martin Liska <mliska@suse.cz>
2994 * toplev.c (toplev::main): Save decoded optimization options.
2995 * toplev.h (save_opt_decoded_options): New.
2996 * doc/extend.texi: Be more clear about optimize and target
2999 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
3001 * explow.c: Include langhooks.h.
3002 (set_stack_check_libfunc): Build a proper function type.
3004 2021-10-01 Eric Botcazou <ebotcazou@adacore.com>
3007 * config/i386/i386.c (legitimate_pic_address_disp_p): For PE-COFF do
3008 not return true for external weak function symbols in medium model.
3010 2021-10-01 Jakub Jelinek <jakub@redhat.com>
3012 * tree.h (OMP_CLAUSE_ORDER_REPRODUCIBLE): Define.
3013 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_ORDER>: Print
3014 reproducible: for OMP_CLAUSE_ORDER_REPRODUCIBLE.
3015 * omp-general.c (omp_extract_for_data): If OMP_CLAUSE_ORDER is seen
3016 without OMP_CLAUSE_ORDER_UNCONSTRAINED, overwrite sched_kind to
3017 OMP_CLAUSE_SCHEDULE_STATIC.
3019 2021-10-01 Richard Biener <rguenther@suse.de>
3021 PR middle-end/102518
3022 * tree-inline.c (setup_one_parameter): Avoid substituting
3023 an invariant into contexts where a GIMPLE register is not valid.
3025 2021-09-30 Przemyslaw Wirkus <przemyslaw.wirkus@arm.com>
3027 * config/arm/arm-cpus.in: Add Cortex-R52+ CPU.
3028 * config/arm/arm-tables.opt: Regenerate.
3029 * config/arm/arm-tune.md: Regenerate.
3030 * doc/invoke.texi: Update docs.
3032 2021-09-30 Uroš Bizjak <ubizjak@gmail.com>
3035 * config/i386/i386.md
3036 (sign_extend:WIDE (any_logic:NARROW (memory, immediate)) splitters):
3039 2021-09-30 Tobias Burnus <tobias@codesourcery.com>
3041 * omp-low.c (omp_runtime_api_call): Add omp_aligned_{,c}alloc and
3042 omp_{c,re}alloc, fix omp_alloc/omp_free.
3044 2021-09-30 Martin Liska <mliska@suse.cz>
3046 * defaults.h (ASM_OUTPUT_ASCII): Do not hide global variable
3047 asm_out_file and stream directly to MYFILE.
3049 2021-09-30 Richard Biener <rguenther@suse.de>
3051 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
3052 Restore and fix condition under which we apply npeel to
3053 the DRs misalignment value.
3055 2021-09-30 Richard Biener <rguenther@suse.de>
3057 * tree-vect-data-refs.c (vect_update_misalignment_for_peel):
3058 Fix npeel check for variable amount of peeling.
3060 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
3062 * lto-wrapper.c (run_gcc): Plug snprintf overflow.
3064 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
3066 * gimple-range.cc (gimple_ranger::debug): New.
3067 * gimple-range.h (class gimple_ranger): Add debug.
3069 2021-09-30 Aldy Hernandez <aldyh@redhat.com>
3071 PR middle-end/102519
3072 * tree-vrp.c (hybrid_threader::~hybrid_threader): Free m_query.
3074 2021-09-29 Indu Bhagat <indu.bhagat@oracle.com>
3077 * btfout.c (GTY): Add GTY (()) albeit for cosmetic only purpose.
3078 (btf_finalize): Empty the hash_map btf_var_ids.
3080 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
3082 * tree-vrp.c (thread_through_all_blocks): Return bool.
3083 (execute_vrp_threader): Return TODO_* flags.
3084 (pass_data_vrp_threader): Set todo_flags_finish to 0.
3086 2021-09-29 Aldy Hernandez <aldyh@redhat.com>
3088 * timevar.def (TV_TREE_VRP_THREADER): New.
3089 * tree-vrp.c: Use TV_TREE_VRP_THREADER for VRP threader pass.
3091 2021-09-29 David Faust <david.faust@oracle.com>
3093 * config.gcc (bpf-*-*): Do not overwrite extra_headers.
3095 2021-09-29 Jonathan Wright <jonathan.wright@arm.com>
3097 * config/aarch64/aarch64-builtins.c (TYPES_BINOP_PPU): Define
3098 new type qualifier enum.
3099 (TYPES_TERNOP_SSSU): Likewise.
3100 (TYPES_TERNOP_PPPU): Likewise.
3101 * config/aarch64/aarch64-simd-builtins.def: Define PPU, SSU,
3102 PPPU and SSSU builtin generator macros for qtbl1 and qtbx1
3104 * config/aarch64/arm_neon.h (vqtbl1_p8): Use type-qualified
3105 builtin and remove casts.
3106 (vqtbl1_s8): Likewise.
3107 (vqtbl1q_p8): Likewise.
3108 (vqtbl1q_s8): Likewise.
3109 (vqtbx1_s8): Likewise.
3110 (vqtbx1_p8): Likewise.
3111 (vqtbx1q_s8): Likewise.
3112 (vqtbx1q_p8): Likewise.
3113 (vtbl1_p8): Likewise.
3114 (vtbl2_p8): Likewise.
3115 (vtbx2_p8): Likewise.
3117 2021-09-29 Richard Biener <rguenther@suse.de>
3119 * tree-vect-data-refs.c (vect_dr_misalign_for_aligned_access):
3121 (vect_update_misalignment_for_peel): Use it to update
3122 misaligned to the value necessary for an aligned access.
3123 (vect_get_peeling_costs_all_drs): Likewise.
3124 (vect_enhance_data_refs_alignment): Likewise.
3126 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
3128 * config/aarch64/aarch64.c (aarch64_expand_cpymem): Count number of
3129 emitted operations and adjust heuristic for code size.
3131 2021-09-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
3133 * config/aarch64/aarch64.c (aarch64_expand_setmem): Count number of
3134 emitted operations and adjust heuristic for code size.
3136 2021-09-29 Jakub Jelinek <jakub@redhat.com>
3138 PR middle-end/102504
3139 * gimplify.c (gimplify_scan_omp_clauses): Use omp_check_private even
3140 in OMP_SCOPE clauses, not just on worksharing construct clauses.
3142 2021-09-28 Geng Qi <gengqi@linux.alibaba.com>
3144 * config/riscv/riscv.md (mulv<mode>4): Call gen_smul<mode>3_highpart.
3145 (<u>mulditi3): Call <su>muldi3_highpart.
3146 (<u>muldi3_highpart): Rename to <su>muldi3_highpart.
3147 (<u>mulsidi3): Call <su>mulsi3_highpart.
3148 (<u>mulsi3_highpart): Rename to <su>mulsi3_highpart.
3150 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
3152 * config/darwin.h (DSYMUTIL_SPEC): Recognize D sources.
3154 2021-09-28 Iain Sandoe <iain@sandoe.co.uk>
3156 * config/rs6000/darwin.h (FIXED_R13): Add for PPC64.
3157 (FIRST_SAVED_GP_REGNO): Save from R13 even when it is one
3160 2021-09-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
3162 * config/aarch64/aarch64.h (AARCH64_FL_LS64): Define
3163 (AARCH64_FL_V8_7): Likewise.
3164 (AARCH64_FL_FOR_ARCH8_7): Likewise.
3165 * config/aarch64/aarch64-arches.def (armv8.7-a): Define.
3166 * config/aarch64/aarch64-option-extensions.def (ls64): Define.
3167 * doc/invoke.texi: Document the above.
3169 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
3171 * dbgcnt.c (dbg_cnt_counter): New.
3172 * dbgcnt.h (dbg_cnt_counter): New.
3173 * dumpfile.c (dump_options): Add entry for TDF_THREADING.
3174 * dumpfile.h (enum dump_flag): Add TDF_THREADING.
3175 * gimple-range-path.cc (DEBUG_SOLVER): Use TDF_THREADING.
3176 * tree-ssa-threadupdate.c (dump_jump_thread_path): Dump out
3179 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
3181 * cfgcleanup.c (pass_jump::execute): Check
3182 flag_expensive_optimizations.
3183 (pass_jump_after_combine::gate): Same.
3184 * doc/invoke.texi (-fthread-jumps): Enable for -O1.
3185 * opts.c (default_options_table): Enable -fthread-jumps at -O1.
3186 * tree-ssa-threadupdate.c
3187 (fwd_jt_path_registry::remove_jump_threads_including): Bail unless
3190 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
3192 * tree-ssa-reassoc.c (biased_names): New global.
3193 (propagate_bias_p): New function.
3194 (loop_carried_phi): Remove.
3195 (propagate_rank): Propagate bias along single uses.
3196 (get_rank): Update biased_names when needed.
3198 2021-09-28 Ilya Leoshkevich <iii@linux.ibm.com>
3200 * passes.def (pass_reassoc): Rename parameter to early_p.
3201 * tree-ssa-reassoc.c (reassoc_bias_loop_carried_phi_ranks_p):
3203 (phi_rank): Don't bias loop-carried phi ranks
3204 before vectorization pass.
3205 (execute_reassoc): Add bias_loop_carried_phi_ranks_p parameter.
3206 (pass_reassoc::pass_reassoc): Add bias_loop_carried_phi_ranks_p
3208 (pass_reassoc::set_param): Set bias_loop_carried_phi_ranks_p
3210 (pass_reassoc::execute): Pass bias_loop_carried_phi_ranks_p to
3212 (pass_reassoc::bias_loop_carried_phi_ranks_p): New member.
3214 2021-09-28 Jakub Jelinek <jakub@redhat.com>
3217 * config/i386/i386.c (standard_80387_constant_p): Don't recognize
3218 special 80387 instruction XFmode constants if flag_rounding_math.
3220 2021-09-28 Richard Biener <rguenther@suse.de>
3222 PR tree-optimization/100112
3223 * tree-ssa-sccvn.c (visit_reference_op_load): Record the
3224 referece into the hashtable twice in case last_vuse is
3225 different from the original vuse on the stmt.
3227 2021-09-28 Jakub Jelinek <jakub@redhat.com>
3229 PR middle-end/102492
3230 * gimplify.c (gimplify_adjust_omp_clauses_1): Don't call the
3231 omp_finish_clause langhook on implicitly added OMP_CLAUSE_PRIVATE
3232 clauses on SIMD constructs.
3234 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
3236 PR tree-optimization/102511
3237 * gimple-range-path.cc (path_range_query::range_on_path_entry):
3238 Return VARYING when nothing found.
3240 2021-09-28 Hongyu Wang <hongyu.wang@intel.com>
3243 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): Add
3245 (VALID_SSE2_REG_VHF_MODE): Add V4HFmode and V2HFmode.
3246 (VALID_MMX_REG_MODE): Add V4HFmode.
3247 (SSE_REG_MODE_P): Replace VALID_AVX512FP16_REG_MODE with
3248 vector mode condition.
3249 * config/i386/i386.c (classify_argument): Parse V4HF/V2HF
3251 (function_arg_32): Add V4HFmode.
3252 (function_arg_advance_32): Likewise.
3253 * config/i386/i386.md (mode): Add V4HF/V2HF.
3254 (MODE_SIZE): Likewise.
3255 * config/i386/mmx.md (MMXMODE): Add V4HF mode.
3256 (V_32): Add V2HF mode.
3257 (VHF_32_64): New mode iterator.
3258 (*mov<mode>_internal): Adjust sse alternatives to support
3260 (*mov<mode>_internal): Adjust sse alternatives to support
3262 (<insn><mode>3): New define_insn for add/sub/mul/div.
3264 2021-09-28 Aldy Hernandez <aldyh@redhat.com>
3266 * tree-ssa-threadbackward.c (pass_thread_jumps::gate): Check
3268 (pass_early_thread_jumps::gate): Same.
3269 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
3270 Return if !flag_thread_jumps.
3271 * tree-ssa-threadupdate.c
3272 (jt_path_registry::register_jump_thread): Assert that
3273 flag_thread_jumps is true.
3275 2021-09-28 liuhongt <hongtao.liu@intel.com>
3278 (simplify_context::simplify_binary_operation_1): Relax
3279 condition of simplifying (vec_concat:M (vec_select op0
3280 index0)(vec_select op1 index1)) to allow different modes
3281 between op0 and M, but have same inner mode.
3283 2021-09-28 liuhongt <hongtao.liu@intel.com>
3285 * config/i386/i386-expand.c (emit_reduc_half): Handle
3286 V8HF/V16HF/V32HFmode.
3287 * config/i386/sse.md (REDUC_SSE_PLUS_MODE): Add V8HF.
3288 (REDUC_SSE_SMINMAX_MODE): Ditto.
3289 (REDUC_PLUS_MODE): Add V16HF and V32HF.
3290 (REDUC_SMINMAX_MODE): Ditto.
3292 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3294 * gimple-range-path.cc
3295 (path_range_query::precompute_ranges_in_block): Rename to...
3296 (path_range_query::compute_ranges_in_block): ...this.
3297 (path_range_query::precompute_ranges): Rename to...
3298 (path_range_query::compute_ranges): ...this.
3299 (path_range_query::precompute_relations): Rename to...
3300 (path_range_query::compute_relations): ...this.
3301 (path_range_query::precompute_phi_relations): Rename to...
3302 (path_range_query::compute_phi_relations): ...this.
3303 * gimple-range-path.h: Rename precompute* to compute*.
3304 * tree-ssa-threadbackward.c
3305 (back_threader::find_taken_edge_switch): Same.
3306 (back_threader::find_taken_edge_cond): Same.
3307 * tree-ssa-threadedge.c
3308 (hybrid_jt_simplifier::compute_ranges_from_state): Same.
3309 (hybrid_jt_state::register_equivs_stmt): Inline...
3310 * tree-ssa-threadedge.h: ...here.
3312 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3314 * tree-vrp.c (lhs_of_dominating_assert): Remove.
3315 (class vrp_jt_state): Remove.
3316 (class vrp_jt_simplifier): Remove.
3317 (vrp_jt_simplifier::simplify): Remove.
3318 (class vrp_jump_threader): Remove.
3319 (vrp_jump_threader::vrp_jump_threader): Remove.
3320 (vrp_jump_threader::~vrp_jump_threader): Remove.
3321 (vrp_jump_threader::before_dom_children): Remove.
3322 (vrp_jump_threader::after_dom_children): Remove.
3324 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3326 * passes.def (pass_vrp_threader): New.
3327 * tree-pass.h (make_pass_vrp_threader): Add make_pass_vrp_threader.
3328 * tree-ssa-threadedge.c (hybrid_jt_state::register_equivs_stmt): New.
3329 (hybrid_jt_simplifier::hybrid_jt_simplifier): New.
3330 (hybrid_jt_simplifier::simplify): New.
3331 (hybrid_jt_simplifier::compute_ranges_from_state): New.
3332 * tree-ssa-threadedge.h (class hybrid_jt_state): New.
3333 (class hybrid_jt_simplifier): New.
3334 * tree-vrp.c (execute_vrp): Remove ASSERT_EXPR based jump
3336 (class hybrid_threader): New.
3337 (hybrid_threader::hybrid_threader): New.
3338 (hybrid_threader::~hybrid_threader): New.
3339 (hybrid_threader::before_dom_children): New.
3340 (hybrid_threader::after_dom_children): New.
3341 (execute_vrp_threader): New.
3342 (class pass_vrp_threader): New.
3343 (make_pass_vrp_threader): New.
3345 2021-09-27 Martin Liska <mliska@suse.cz>
3347 * output.h (enum section_flag): New.
3348 (SECTION_FORGET): Remove.
3349 (SECTION_ENTSIZE): Make it (1UL << 8) - 1.
3350 (SECTION_STYLE_MASK): Define it based on other enum
3352 * varasm.c (switch_to_section): Remove unused handling of
3355 2021-09-27 Martin Liska <mliska@suse.cz>
3357 * common.opt: Add new variable flag_default_complex_method.
3358 * opts.c (finish_options): Handle flags related to
3359 x_flag_complex_method.
3360 * toplev.c (process_options): Remove option handling related
3361 to flag_complex_method.
3363 2021-09-27 Richard Biener <rguenther@suse.de>
3365 PR middle-end/102450
3366 * gimple-fold.c (gimple_fold_builtin_memory_op): Avoid using
3367 type_for_size, instead use int_mode_for_size.
3369 2021-09-27 Andrew Pinski <apinski@marvell.com>
3372 * gimplify.c (gimplify_save_expr): Return early
3373 if the type of val is error_mark_node.
3375 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3377 * tree-ssanames.c (ssa_name_has_boolean_range): Use
3380 2021-09-27 Aldy Hernandez <aldyh@redhat.com>
3382 * gimple-ssa-evrp-analyze.h (class evrp_range_analyzer): Remove
3383 vrp_visit_cond_stmt.
3384 * tree-ssa-dom.c (cprop_operand): Convert to range_query API.
3385 (cprop_into_stmt): Same.
3386 (dom_opt_dom_walker::optimize_stmt): Same.
3388 2021-09-27 Richard Biener <rguenther@suse.de>
3390 PR tree-optimization/97351
3391 PR tree-optimization/97352
3392 PR tree-optimization/82426
3393 * tree-vectorizer.h (dr_misalignment): Add vector type
3395 (aligned_access_p): Likewise.
3396 (known_alignment_for_access_p): Likewise.
3397 (vect_supportable_dr_alignment): Likewise.
3398 (vect_known_alignment_in_bytes): Likewise. Refactor.
3399 (DR_MISALIGNMENT): Remove.
3400 (vect_update_shared_vectype): Likewise.
3401 * tree-vect-data-refs.c (dr_misalignment): Refactor, handle
3402 a vector type with larger alignment requirement and apply
3403 the negative step adjustment here.
3404 (vect_calculate_target_alignment): Remove.
3405 (vect_compute_data_ref_alignment): Get explicit vector type
3406 argument, do not apply a negative step alignment adjustment
3408 (vect_slp_analyze_node_alignment): Re-analyze alignment
3409 when we re-visit the DR with a bigger desired alignment but
3410 keep more precise results from smaller alignments.
3411 * tree-vect-slp.c (vect_update_shared_vectype): Remove.
3412 (vect_slp_analyze_node_operations_1): Do not update the
3413 shared vector type on stmts.
3414 * tree-vect-stmts.c (vect_analyze_stmt): Push/pop the
3415 vector type of an SLP node to the representative stmt-info.
3416 (vect_transform_stmt): Likewise.
3418 2021-09-27 liuhongt <hongtao.liu@intel.com>
3421 2021-09-09 liuhongt <hongtao.liu@intel.com>
3424 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
3425 (reduc_plus_scal_v4sf): .. this, New define_expand.
3426 (reduc_plus_scal_v2df): .. and this, New define_expand.
3428 2021-09-26 liuhongt <hongtao.liu@intel.com>
3430 * doc/extend.texi (Half-Precision): Remove storage only
3431 description for _Float16 w/o avx512fp16.
3433 2021-09-25 Dimitar Dimitrov <dimitar@dinux.eu>
3435 * config/pru/constraints.md (Rrio): New constraint.
3436 * config/pru/predicates.md (regio_operand): New predicate.
3437 * config/pru/pru-pragma.c (pru_register_pragmas): Register
3438 the __regio_symbol address space.
3439 * config/pru/pru-protos.h (pru_symref2ioregno): Declaration.
3440 * config/pru/pru.c (pru_symref2ioregno): New helper function.
3441 (pru_legitimate_address_p): Remove.
3442 (pru_addr_space_legitimate_address_p): Use the address space
3444 (pru_nongeneric_pointer_addrspace): New helper function.
3445 (pru_insert_attributes): New function to validate __regio_symbol
3447 (TARGET_INSERT_ATTRIBUTES): New macro.
3448 (TARGET_LEGITIMATE_ADDRESS_P): Remove.
3449 (TARGET_ADDR_SPACE_LEGITIMATE_ADDRESS_P): New macro.
3450 * config/pru/pru.h (enum reg_class): Add REGIO_REGS class.
3451 * config/pru/pru.md (*regio_readsi): New pattern to read I/O
3453 (*regio_nozext_writesi): New pattern to write to I/O registers.
3454 (*regio_zext_write_r30<EQS0:mode>): Ditto.
3455 * doc/extend.texi: Document the new PRU Named Address Space.
3457 2021-09-24 Patrick Palka <ppalka@redhat.com>
3461 * real.c (encode_ieee_double): Avoid unwanted sign extension.
3462 (encode_ieee_quad): Likewise.
3464 2021-09-24 Vladimir Makarov <vmakarov@redhat.com>
3466 PR rtl-optimization/102147
3467 * ira-build.c (ira_conflict_vector_profitable_p): Make
3468 profitability calculation independent of host compiler pointer and
3471 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
3473 * gimple-range-path.cc (path_range_query::path_range_query):
3474 Move debugging header...
3475 (path_range_query::precompute_ranges): ...here.
3476 (path_range_query::internal_range_of_expr): Do not call
3477 range_on_path_entry if NAME is defined in the current block.
3479 2021-09-24 Richard Biener <rguenther@suse.de>
3481 * cfghooks.c (verify_flow_info): Verify unallocated BB and
3482 edge flags are not set.
3484 2021-09-24 Aldy Hernandez <aldyh@redhat.com>
3486 * tree-ssa-threadupdate.c (jt_path_registry::cancel_invalid_paths):
3488 (jt_path_registry::register_jump_thread): Call
3489 cancel_invalid_paths.
3490 * tree-ssa-threadupdate.h (class jt_path_registry): Add
3491 cancel_invalid_paths.
3493 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
3495 PR tree-optimization/102400
3496 * tree-ssa-sccvn.c (vn_reference_insert_pieces): Initialize
3497 result_vdef to zero value.
3499 2021-09-24 Feng Xue <fxue@os.amperecomputing.com>
3501 PR tree-optimization/102451
3502 * tree-ssa-dse.c (delete_dead_or_redundant_call): Record bb of stmt
3505 2021-09-24 Hongyu Wang <hongyu.wang@intel.com>
3507 * config/i386/sse.md (cond_<insn><mode>): Extend to support
3509 (cond_mul<mode>): Likewise.
3510 (cond_div<mode>): Likewise.
3511 (cond_<code><mode>): Likewise.
3512 (cond_fma<mode>): Likewise.
3513 (cond_fms<mode>): Likewise.
3514 (cond_fnma<mode>): Likewise.
3515 (cond_fnms<mode>): Likewise.
3517 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
3519 PR tree-optimization/102463
3520 * gimple-range-fold.cc (fold_using_range::relation_fold_and_or): If
3521 there is no range-ops handler, don't look for a relation.
3523 2021-09-23 Andrew MacLeod <amacleod@redhat.com>
3525 * gimple-range-cache.cc (ranger_cache::ranger_cache): Take
3526 non-executable_edge flag as parameter.
3527 * gimple-range-cache.h (ranger_cache): Adjust prototype.
3528 * gimple-range-gori.cc (gori_compute::gori_compute): Take
3529 non-executable_edge flag as parameter.
3530 (gori_compute::outgoing_edge_range_p): Check new flag.
3531 * gimple-range-gori.h (gori_compute): Adjust prototype.
3532 * gimple-range.cc (gimple_ranger::gimple_ranger): Create new flag.
3533 (gimple_ranger::range_on_edge): Check new flag.
3534 * gimple-range.h (gimple_ranger::non_executable_edge_flag): New.
3535 * gimple-ssa-evrp.c (rvrp_folder): Pass ranger flag to simplifer.
3536 (hybrid_folder::hybrid_folder): Set ranger non-executable flag value.
3537 (hybrid_folder::fold_stmt): Set flag value in the simplifer.
3538 * vr-values.c (simplify_using_ranges::set_and_propagate_unexecutable):
3539 Use not_executable flag if provided inmstead of EDGE_EXECUTABLE.
3540 (simplify_using_ranges::simplify_switch_using_ranges): Clear
3541 EDGE_EXECUTABLE like it originally did.
3542 (simplify_using_ranges::cleanup_edges_and_switches): Clear any
3543 NON_EXECUTABLE flags.
3544 (simplify_using_ranges::simplify_using_ranges): Adjust.
3545 * vr-values.h (class simplify_using_ranges): Adjust.
3546 (simplify_using_ranges::set_range_query): Add non-executable flag param.
3548 2021-09-23 Bill Schmidt <wschmidt@linux.ibm.com>
3551 * config/rs6000/rs6000-call.c (rs6000_aggregate_candidate): Detect
3552 zero-width bit fields and return indicator.
3553 (rs6000_discover_homogeneous_aggregate): Diagnose when the
3554 presence of a zero-width bit field changes parameter passing in
3557 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
3559 * gimple-range-fold.cc (fold_using_range::range_of_phi):
3560 Remove dominator check.
3562 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
3564 * gimple-range-path.cc (path_range_query::precompute_relations):
3565 Hoist edge calculations before using EDGE_SUCC.
3567 2021-09-23 Jonathan Wakely <jwakely@redhat.com>
3569 * configure.ac: Fix --with-multilib-list description.
3570 * configure: Regenerate.
3572 2021-09-23 Richard Biener <rguenther@suse.de>
3574 PR tree-optimization/102448
3575 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
3576 Clear alignment info copied from DR_PTR_INFO.
3578 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3580 * config/i386/i386-expand.c (ix86_use_mask_cmp_p): Enable
3582 * config/i386/sse.md (sseintvecmodelower): Add HF vector modes.
3583 (<avx512>_store<mode>_mask): Extend to support HF vector modes.
3584 (vec_cmp<mode><avx512fmaskmodelower>): Likewise.
3585 (vcond_mask_<mode><avx512fmaskmodelower>): Likewise.
3586 (vcond<mode><mode>): New expander.
3587 (vcond<mode><sseintvecmodelower>): Likewise.
3588 (vcond<sseintvecmodelower><mode>): Likewise.
3589 (vcondu<mode><sseintvecmodelower>): Likewise.
3591 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3593 * config/i386/sse.md (extend<ssePHmodelower><mode>2):
3595 (extendv4hf<mode>2): Likewise.
3596 (extendv2hfv2df2): Likewise.
3597 (trunc<mode><ssePHmodelower>2): Likewise.
3598 (avx512fp16_vcvt<castmode>2ph_<mode>): Rename to ...
3599 (trunc<mode>v4hf2): ... this, and drop constraints.
3600 (avx512fp16_vcvtpd2ph_v2df): Rename to ...
3601 (truncv2dfv2hf2): ... this, and likewise.
3603 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3605 * config/i386/sse.md (float<floatunssuffix><mode><ssePHmodelower>2):
3607 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>):
3609 (float<floatunssuffix><mode>v4hf2): ... this, and drop constraints.
3610 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Rename to ...
3611 (float<floatunssuffix>v2div2hf2): ... this, and likewise.
3613 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3615 * config/i386/i386.md (fix<fixunssuffix>_trunchf<mode>2): New expander.
3616 (fixuns_trunchfhi2): Likewise.
3617 (*fixuns_trunchfsi2zext): New define_insn.
3618 * config/i386/sse.md (ssePHmodelower): New mode_attr.
3619 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
3620 New expander for same element vector fix_truncate.
3621 (fix<fixunssuffix>_trunc<ssePHmodelower><mode>2):
3622 Likewise for V4HF to V4SI/V4DI fix_truncate.
3623 (fix<fixunssuffix>_truncv2hfv2di2):
3624 Likeise for V2HF to V2DI fix_truncate.
3626 2021-09-23 Hongyu Wang <hongyu.wang@intel.com>
3628 * config/i386/i386.md (<code>hf3): New expander.
3630 2021-09-23 liuhongt <hongtao.liu@intel.com>
3632 * config/i386/sse.md (FMAMODEM): extend to handle FP16.
3633 (VFH_SF_AVX512VL): Extend to handle HFmode.
3634 (VF_SF_AVX512VL): Deleted.
3636 2021-09-23 liuhongt <hongtao.liu@intel.com>
3638 * config/i386/i386.md (rinthf2): New expander.
3639 (nearbyinthf2): New expander.
3641 2021-09-23 Aldy Hernandez <aldyh@redhat.com>
3643 * tree-ssa-dom.c (class dom_jump_threader_simplifier): Rename...
3644 (class dom_jt_state): ...this and provide virtual overrides.
3645 (dom_jt_state::register_equiv): New.
3646 (class dom_jt_simplifier): Rename from
3647 dom_jump_threader_simplifier.
3648 (dom_jump_threader_simplifier::simplify): Rename...
3649 (dom_jt_simplifier::simplify): ...to this.
3650 (pass_dominator::execute): Use dom_jt_simplifier and
3652 * tree-ssa-threadedge.c (jump_threader::jump_threader):
3654 (jt_state::register_equivs_stmt): Abstract out...
3655 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
3657 (jump_threader::thread_around_empty_blocks): Update state.
3658 (jump_threader::thread_through_normal_block): Same.
3659 (jt_state::jt_state): Remove.
3660 (jt_state::push): Remove pass specific bits. Keep block vector
3662 (jt_state::append_path): New.
3663 (jt_state::pop): Remove pass specific bits.
3664 (jt_state::register_equiv): Same.
3665 (jt_state::record_ranges_from_stmt): Same.
3666 (jt_state::register_equivs_on_edge): Same. Rename...
3667 (jt_state::register_equivs_edge): ...to this.
3668 (jt_state::dump): New.
3669 (jt_state::debug): New.
3670 (jump_threader_simplifier::simplify): Remove.
3671 (jt_state::get_path): New.
3672 * tree-ssa-threadedge.h (class jt_simplifier): Make into a base
3673 class. Expose common functionality as virtual methods.
3674 (class jump_threader_simplifier): Same. Rename...
3675 (class jt_simplifier): ...to this.
3676 * tree-vrp.c (class vrp_jump_threader_simplifier): Rename...
3677 (class vrp_jt_simplifier): ...to this. Provide pass specific
3679 (class vrp_jt_state): New.
3680 (vrp_jump_threader_simplifier::simplify): Rename...
3681 (vrp_jt_simplifier::simplify): ...to this. Inline code from
3682 what used to be the base class.
3683 (vrp_jump_threader::vrp_jump_threader): Use vrp_jt_state and
3686 2021-09-22 Tobias Burnus <tobias@codesourcery.com>
3689 * doc/invoke.texi (-Wno-missing-include-dirs.): Document Fortran
3692 2021-09-22 Roger Sayle <roger@nextmovesoftware.com>
3693 Richard Biener <rguenther@suse.de>
3695 * match.pd (negation simplifications): Implement some negation
3696 folding transformations from fold-const.c's fold_negate_expr.
3697 * tree-ssa-sccvn.c (vn_nary_build_or_lookup_1): Add a SIMPLIFY
3698 argument, to control whether the op should be simplified prior
3699 to looking up/assigning a value number.
3700 (vn_nary_build_or_lookup): Update call to vn_nary_build_or_lookup_1.
3701 (vn_nary_simplify): Likewise.
3702 (visit_nary_op): Likewise, but when constructing a NEGATE_EXPR
3703 now call vn_nary_build_or_lookup_1 disabling simplification.
3705 2021-09-22 Jiufu Guo <guojiufu@linux.ibm.com>
3707 PR tree-optimization/102087
3708 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
3709 Update bound/cmp/control for niter.
3711 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
3713 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
3714 Move check for non-empty BB here.
3715 (fur_source::register_outgoing_edges): ...from here.
3717 2021-09-22 Aldy Hernandez <aldyh@redhat.com>
3719 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
3720 Remove call to improve_range_with_equivs.
3721 (path_range_query::improve_range_with_equivs): Remove
3722 * gimple-range-path.h: Remove improve_range_with_equivs.
3724 2021-09-22 dianhong xu <dianhong.xu@intel.com>
3726 * config/i386/avx512fp16intrin.h:
3727 (_mm512_mask_blend_ph): New intrinsic.
3728 (_mm512_permutex2var_ph): Ditto.
3729 (_mm512_permutexvar_ph): Ditto.
3730 * config/i386/avx512fp16vlintrin.h:
3731 (_mm256_mask_blend_ph): New intrinsic.
3732 (_mm256_permutex2var_ph): Ditto.
3733 (_mm256_permutexvar_ph): Ditto.
3734 (_mm_mask_blend_ph): Ditto.
3735 (_mm_permutex2var_ph): Ditto.
3736 (_mm_permutexvar_ph): Ditto.
3738 2021-09-22 dianhong xu <dianhong.xu@intel.com>
3740 * config/i386/avx512fp16intrin.h: Add new intrinsics.
3741 (_mm512_conj_pch): New intrinsic.
3742 (_mm512_mask_conj_pch): Ditto.
3743 (_mm512_maskz_conj_pch): Ditto.
3744 * config/i386/avx512fp16vlintrin.h: Add new intrinsics.
3745 (_mm256_conj_pch): New intrinsic.
3746 (_mm256_mask_conj_pch): Ditto.
3747 (_mm256_maskz_conj_pch): Ditto.
3748 (_mm_conj_pch): Ditto.
3749 (_mm_mask_conj_pch): Ditto.
3750 (_mm_maskz_conj_pch): Ditto.
3752 2021-09-22 dianhong xu <dianhong.xu@intel.com>
3754 * config/i386/avx512fp16intrin.h (_MM512_REDUCE_OP): New macro
3755 (_mm512_reduce_add_ph): New intrinsic.
3756 (_mm512_reduce_mul_ph): Ditto.
3757 (_mm512_reduce_min_ph): Ditto.
3758 (_mm512_reduce_max_ph): Ditto.
3759 * config/i386/avx512fp16vlintrin.h
3760 (_MM256_REDUCE_OP/_MM_REDUCE_OP): New macro.
3761 (_mm256_reduce_add_ph): New intrinsic.
3762 (_mm256_reduce_mul_ph): Ditto.
3763 (_mm256_reduce_min_ph): Ditto.
3764 (_mm256_reduce_max_ph): Ditto.
3765 (_mm_reduce_add_ph): Ditto.
3766 (_mm_reduce_mul_ph): Ditto.
3767 (_mm_reduce_min_ph): Ditto.
3768 (_mm_reduce_max_ph): Ditto.
3770 2021-09-22 dianhong xu <dianhong.xu@intel.com>
3772 * config/i386/avx512fp16intrin.h (__m512h_u, __m256h_u,
3773 __m128h_u): New typedef.
3774 (_mm512_load_ph): New intrinsic.
3775 (_mm256_load_ph): Ditto.
3776 (_mm_load_ph): Ditto.
3777 (_mm512_loadu_ph): Ditto.
3778 (_mm256_loadu_ph): Ditto.
3779 (_mm_loadu_ph): Ditto.
3780 (_mm512_store_ph): Ditto.
3781 (_mm256_store_ph): Ditto.
3782 (_mm_store_ph): Ditto.
3783 (_mm512_storeu_ph): Ditto.
3784 (_mm256_storeu_ph): Ditto.
3785 (_mm_storeu_ph): Ditto.
3786 (_mm512_abs_ph): Ditto.
3787 * config/i386/avx512fp16vlintrin.h
3788 (_mm_abs_ph): Ditto.
3789 (_mm256_abs_ph): Ditto.
3791 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
3793 * config/s390/tpf.md (prologue_tpf, epilogue_tpf): Add cc clobber.
3795 2021-09-22 Andreas Krebbel <krebbel@linux.ibm.com>
3798 * config/s390/s390.c (s390_expand_insv): Emit a normal move if it
3799 is actually a full copy of the source operand into the target.
3800 Don't emit a strict low part move if source and target mode match.
3802 2021-09-22 Jakub Jelinek <jakub@redhat.com>
3804 PR middle-end/102415
3805 * omp-expand.c (expand_omp_single): If region->exit is NULL,
3806 assert region->entry is GIMPLE_OMP_SCOPE region and return.
3808 2021-09-22 Jakub Jelinek <jakub@redhat.com>
3810 * tree.h (OMP_CLAUSE_ALLOCATE_ALIGN): Define.
3811 * tree.c (omp_clause_num_ops): Change number of OMP_CLAUSE_ALLOCATE
3812 arguments from 2 to 3.
3813 * tree-pretty-print.c (dump_omp_clause): Print allocator() around
3814 allocate clause allocator and print align if present.
3815 * omp-low.c (scan_sharing_clauses): Force allocate_map entry even
3816 for omp_default_mem_alloc if align modifier is present. If align
3817 modifier is present, use TREE_LIST to encode both allocator and
3819 (lower_private_allocate, lower_rec_input_clauses, create_task_copyfn):
3820 Handle align modifier on allocator clause if present.
3822 2021-09-22 liuhongt <hongtao.liu@intel.com>
3824 * config/i386/i386.md (define_attr "isa"): Add
3826 (define_attr "enabled"): Correspond fma_or_avx512vl to
3827 TARGET_FMA || TARGET_AVX512VL.
3828 * config/i386/mmx.md (fmav2sf4): Extend to AVX512 fma.
3833 2021-09-22 liuhongt <hongtao.liu@intel.com>
3835 * config/i386/i386.md (cstorehf3): New define_expand.
3837 2021-09-22 liuhongt <hongtao.liu@intel.com>
3839 * config/i386/i386.md (<rounding_insn>hf2): New expander.
3840 (sse4_1_round<mode>2): Extend from MODEF to MODEFH.
3841 * config/i386/sse.md (*sse4_1_round<ssescalarmodesuffix>):
3842 Extend from VF_128 to VFH_128.
3844 2021-09-22 liuhongt <hongtao.liu@intel.com>
3846 * config/i386/i386-features.c (i386-features.c): Handle
3848 * config/i386/i386.md (sqrthf2): New expander.
3849 (*sqrthf2): New define_insn.
3850 * config/i386/sse.md
3851 (*<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
3854 2021-09-22 liuhongt <hongtao.liu@intel.com>
3856 * config/i386/avx512fp16intrin.h (_mm_mask_fcmadd_sch):
3858 (_mm_mask3_fcmadd_sch): Likewise.
3859 (_mm_maskz_fcmadd_sch): Likewise.
3860 (_mm_fcmadd_sch): Likewise.
3861 (_mm_mask_fmadd_sch): Likewise.
3862 (_mm_mask3_fmadd_sch): Likewise.
3863 (_mm_maskz_fmadd_sch): Likewise.
3864 (_mm_fmadd_sch): Likewise.
3865 (_mm_mask_fcmadd_round_sch): Likewise.
3866 (_mm_mask3_fcmadd_round_sch): Likewise.
3867 (_mm_maskz_fcmadd_round_sch): Likewise.
3868 (_mm_fcmadd_round_sch): Likewise.
3869 (_mm_mask_fmadd_round_sch): Likewise.
3870 (_mm_mask3_fmadd_round_sch): Likewise.
3871 (_mm_maskz_fmadd_round_sch): Likewise.
3872 (_mm_fmadd_round_sch): Likewise.
3873 (_mm_fcmul_sch): Likewise.
3874 (_mm_mask_fcmul_sch): Likewise.
3875 (_mm_maskz_fcmul_sch): Likewise.
3876 (_mm_fmul_sch): Likewise.
3877 (_mm_mask_fmul_sch): Likewise.
3878 (_mm_maskz_fmul_sch): Likewise.
3879 (_mm_fcmul_round_sch): Likewise.
3880 (_mm_mask_fcmul_round_sch): Likewise.
3881 (_mm_maskz_fcmul_round_sch): Likewise.
3882 (_mm_fmul_round_sch): Likewise.
3883 (_mm_mask_fmul_round_sch): Likewise.
3884 (_mm_maskz_fmul_round_sch): Likewise.
3885 * config/i386/i386-builtin.def: Add corresponding new builtins.
3886 * config/i386/sse.md
3887 (avx512fp16_fmaddcsh_v8hf_maskz<round_expand_name>): New expander.
3888 (avx512fp16_fcmaddcsh_v8hf_maskz<round_expand_name>): Ditto.
3889 (avx512fp16_fma_<complexopname>sh_v8hf<mask_scalarcz_name><round_scalarcz_name>):
3891 (avx512fp16_<complexopname>sh_v8hf_mask<round_name>): Ditto.
3892 (avx512fp16_<complexopname>sh_v8hf<mask_scalarc_name><round_scalarcz_name>):
3894 * config/i386/subst.md (mask_scalarcz_name): New.
3895 (mask_scalarc_name): Ditto.
3896 (mask_scalarc_operand3): Ditto.
3897 (mask_scalarcz_operand4): Ditto.
3898 (round_scalarcz_name): Ditto.
3899 (round_scalarc_mask_operand3): Ditto.
3900 (round_scalarcz_mask_operand4): Ditto.
3901 (round_scalarc_mask_op3): Ditto.
3902 (round_scalarcz_mask_op4): Ditto.
3903 (round_scalarcz_constraint): Ditto.
3904 (round_scalarcz_nimm_predicate): Ditto.
3905 (mask_scalarcz): Ditto.
3906 (mask_scalarc): Ditto.
3907 (round_scalarcz): Ditto.
3909 2021-09-22 liuhongt <hongtao.liu@intel.com>
3911 * config/i386/avx512fp16intrin.h (_mm512_fcmadd_pch):
3913 (_mm512_mask_fcmadd_pch): Likewise.
3914 (_mm512_mask3_fcmadd_pch): Likewise.
3915 (_mm512_maskz_fcmadd_pch): Likewise.
3916 (_mm512_fmadd_pch): Likewise.
3917 (_mm512_mask_fmadd_pch): Likewise.
3918 (_mm512_mask3_fmadd_pch): Likewise.
3919 (_mm512_maskz_fmadd_pch): Likewise.
3920 (_mm512_fcmadd_round_pch): Likewise.
3921 (_mm512_mask_fcmadd_round_pch): Likewise.
3922 (_mm512_mask3_fcmadd_round_pch): Likewise.
3923 (_mm512_maskz_fcmadd_round_pch): Likewise.
3924 (_mm512_fmadd_round_pch): Likewise.
3925 (_mm512_mask_fmadd_round_pch): Likewise.
3926 (_mm512_mask3_fmadd_round_pch): Likewise.
3927 (_mm512_maskz_fmadd_round_pch): Likewise.
3928 (_mm512_fcmul_pch): Likewise.
3929 (_mm512_mask_fcmul_pch): Likewise.
3930 (_mm512_maskz_fcmul_pch): Likewise.
3931 (_mm512_fmul_pch): Likewise.
3932 (_mm512_mask_fmul_pch): Likewise.
3933 (_mm512_maskz_fmul_pch): Likewise.
3934 (_mm512_fcmul_round_pch): Likewise.
3935 (_mm512_mask_fcmul_round_pch): Likewise.
3936 (_mm512_maskz_fcmul_round_pch): Likewise.
3937 (_mm512_fmul_round_pch): Likewise.
3938 (_mm512_mask_fmul_round_pch): Likewise.
3939 (_mm512_maskz_fmul_round_pch): Likewise.
3940 * config/i386/avx512fp16vlintrin.h (_mm_fmadd_pch):
3942 (_mm_mask_fmadd_pch): Likewise.
3943 (_mm_mask3_fmadd_pch): Likewise.
3944 (_mm_maskz_fmadd_pch): Likewise.
3945 (_mm256_fmadd_pch): Likewise.
3946 (_mm256_mask_fmadd_pch): Likewise.
3947 (_mm256_mask3_fmadd_pch): Likewise.
3948 (_mm256_maskz_fmadd_pch): Likewise.
3949 (_mm_fcmadd_pch): Likewise.
3950 (_mm_mask_fcmadd_pch): Likewise.
3951 (_mm_mask3_fcmadd_pch): Likewise.
3952 (_mm_maskz_fcmadd_pch): Likewise.
3953 (_mm256_fcmadd_pch): Likewise.
3954 (_mm256_mask_fcmadd_pch): Likewise.
3955 (_mm256_mask3_fcmadd_pch): Likewise.
3956 (_mm256_maskz_fcmadd_pch): Likewise.
3957 (_mm_fmul_pch): Likewise.
3958 (_mm_mask_fmul_pch): Likewise.
3959 (_mm_maskz_fmul_pch): Likewise.
3960 (_mm256_fmul_pch): Likewise.
3961 (_mm256_mask_fmul_pch): Likewise.
3962 (_mm256_maskz_fmul_pch): Likewise.
3963 (_mm_fcmul_pch): Likewise.
3964 (_mm_mask_fcmul_pch): Likewise.
3965 (_mm_maskz_fcmul_pch): Likewise.
3966 (_mm256_fcmul_pch): Likewise.
3967 (_mm256_mask_fcmul_pch): Likewise.
3968 (_mm256_maskz_fcmul_pch): Likewise.
3969 * config/i386/i386-builtin-types.def (V8HF_FTYPE_V8HF_V8HF_V8HF,
3970 V8HF_FTYPE_V16HF_V16HF_V16HF, V16HF_FTYPE_V16HF_V16HF_V16HF_UQI,
3971 V32HF_FTYPE_V32HF_V32HF_V32HF_INT,
3972 V32HF_FTYPE_V32HF_V32HF_V32HF_UHI_INT): Add new builtin types.
3973 * config/i386/i386-builtin.def: Add new builtins.
3974 * config/i386/i386-expand.c: Handle new builtin types.
3975 * config/i386/subst.md (SUBST_CV): New.
3976 (maskc_name): Ditto.
3977 (maskc_operand3): Ditto.
3979 (sdc_maskz_name): Ditto.
3980 (sdc_mask_op4): Ditto.
3981 (sdc_mask_op5): Ditto.
3982 (sdc_mask_mode512bit_condition): Ditto.
3984 (round_maskc_operand3): Ditto.
3985 (round_sdc_mask_operand4): Ditto.
3986 (round_maskc_op3): Ditto.
3987 (round_sdc_mask_op4): Ditto.
3988 (round_saeonly_sdc_mask_operand5): Ditto.
3989 * config/i386/sse.md (unspec): Add complex fma unspecs.
3990 (avx512fmaskcmode): New.
3991 (UNSPEC_COMPLEX_F_C_MA): Ditto.
3992 (UNSPEC_COMPLEX_F_C_MUL): Ditto.
3993 (complexopname): Ditto.
3994 (<avx512>_fmaddc_<mode>_maskz<round_expand_name>): New expander.
3995 (<avx512>_fcmaddc_<mode>_maskz<round_expand_name>): Ditto.
3996 (fma_<complexopname>_<mode><sdc_maskz_name><round_name>): New
3998 (<avx512>_<complexopname>_<mode>_mask<round_name>): Ditto.
3999 (<avx512>_<complexopname>_<mode><maskc_name><round_name>): Ditto.
4001 2021-09-22 Kewen Lin <linkw@linux.ibm.com>
4003 * config/rs6000/rs6000.opt (rs6000-density-pct-threshold,
4004 rs6000-density-size-threshold, rs6000-density-penalty,
4005 rs6000-density-load-pct-threshold,
4006 rs6000-density-load-num-threshold): New parameter.
4007 * config/rs6000/rs6000.c (rs6000_density_test): Adjust with
4008 corresponding parameters.
4010 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
4012 * gimple-range-path.cc (path_range_query::defined_outside_path):
4014 (path_range_query::range_on_path_entry): New.
4015 (path_range_query::internal_range_of_expr): Resolve unknowns
4017 (path_range_query::improve_range_with_equivs): New.
4018 (path_range_query::ssa_range_in_phi): Resolve unknowns with
4020 * gimple-range-path.h (class path_range_query): Add
4021 defined_outside_path, range_on_path_entry, and
4022 improve_range_with_equivs.
4024 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
4026 * gimple-range-path.cc (path_range_query::add_to_imports): New.
4027 (path_range_query::add_copies_to_imports): New.
4028 (path_range_query::precompute_ranges): Call
4029 add_copies_to_imports.
4030 * gimple-range-path.h (class path_range_query): Add prototypes
4031 for add_copies_to_imports and add_to_imports.
4033 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
4035 * gimple-range-path.cc (path_range_query::range_defined_in_block):
4036 Remove useless code.
4038 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
4040 * gimple-range-fold.h (class fur_source): Make oracle protected.
4041 * gimple-range-path.cc (path_range_query::path_range_query): Add
4042 resolve argument. Initialize oracle.
4043 (path_range_query::~path_range_query): Delete oracle.
4044 (path_range_query::range_of_stmt): Adapt to use relations.
4045 (path_range_query::precompute_ranges): Pre-compute relations.
4046 (class jt_fur_source): New
4047 (jt_fur_source::jt_fur_source): New.
4048 (jt_fur_source::register_relation): New.
4049 (jt_fur_source::query_relation): New.
4050 (path_range_query::precompute_relations): New.
4051 (path_range_query::precompute_phi_relations): New.
4052 * gimple-range-path.h (path_range_query): Add resolve argument.
4053 Add oracle, precompute_relations, precompute_phi_relations.
4054 * tree-ssa-threadbackward.c (back_threader::back_threader): Pass
4055 resolve argument to solver.
4057 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
4059 * gimple-range-fold.cc (fold_using_range::range_of_range_op):
4060 Rename postfold_gcond_edges to register_outgoing_edges and
4062 (fold_using_range::postfold_gcond_edges): Rename...
4063 (fur_source::register_outgoing_edges): ...to this.
4064 * gimple-range-fold.h (postfold_gcond_edges): Rename to
4065 register_outgoing_edges and move to fur_source.
4067 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
4069 * gimple-range-fold.cc (fold_using_range::range_of_phi): Check
4070 dom_info_available_p.
4072 2021-09-21 Aldy Hernandez <aldyh@redhat.com>
4074 * gimple-range-cache.cc (non_null_ref::non_null_ref): Use create
4075 and quick_grow_cleared instead of safe_grow_cleared.
4077 2021-09-21 Thomas Schwinge <thomas@codesourcery.com>
4080 * omp-oacc-neuter-broadcast.cc (oacc_do_neutering): Evaluate
4083 2021-09-21 Richard Earnshaw <rearnsha@arm.com>
4085 * configure.ac: Detect when the assembler supports new-style
4086 architecture extensions.
4087 * common/config/arm/arm-common.c (arm_rewrite_mcpu): Return
4088 the full CPU string if the assembler can grok it.
4089 (arm_rewrite_march): Likewise but for the architecture.
4090 * config.in: Regenerate.
4091 * configure: Regenerate.
4093 2021-09-21 Richard Biener <rguenther@suse.de>
4095 PR tree-optimization/102421
4096 * tree-vect-loop.c (vect_dissolve_slp_only_groups): Copy and
4097 adjust alignment info.
4099 2021-09-21 Kewen Lin <linkw@linux.ibm.com>
4101 * ipa-fnsummary.c (ipa_fn_summary_write): Remove inconsistent
4102 bitfield stream out.
4104 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
4106 * gimple-range-fold.cc (fold_using_range::range_of_phi): Ignore
4107 undefined edges, apply an equivalence if appropriate.
4108 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Return
4109 UNDEFINED if EDGE_EXECUTABLE is not set.
4110 * gimple-range.cc (gimple_ranger::gimple_ranger): Set all edges
4111 as EXECUTABLE upon startup.
4112 (gimple_ranger::range_on_edge): Return UNDEFINED for edges without
4113 EDGE_EXECUTABLE set.
4114 * vr-values.c (set_and_propagate_unexecutable): New.
4115 (simplify_using_ranges::fold_cond): Call set_and_propagate.
4116 (simplify_using_ranges::simplify_switch_using_ranges): Ditto.
4117 * vr-values.h: Add prototype.
4119 2021-09-20 Andrew MacLeod <amacleod@redhat.com>
4121 * value-relation.cc (equiv_oracle::register_initial_def): New.
4122 (equiv_oracle::register_relation): Call register_initial_def.
4123 (equiv_oracle::add_equiv_to_block): New. Split register_relation.
4124 (relation_oracle::register_stmt): Check def block of PHI arguments.
4125 * value-relation.h (equiv_oracle): Add new prototypes.
4127 2021-09-20 Matthias Kretz <m.kretz@gsi.de>
4129 * cppbuiltin.c (define_builtin_macros_for_compilation_flags):
4130 Define __RECIPROCAL_MATH__, __NO_SIGNED_ZEROS__,
4131 __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__, and
4132 __ROUNDING_MATH__ according to their corresponding flags.
4133 * doc/cpp.texi: Document __RECIPROCAL_MATH__,
4134 __NO_SIGNED_ZEROS__, __NO_TRAPPING_MATH__, __ASSOCIATIVE_MATH__,
4135 and __ROUNDING_MATH__.
4137 2021-09-20 Richard Biener <rguenther@suse.de>
4139 * tree-vect-stmts.c (vectorizable_load): Use the vectype
4142 2021-09-20 Richard Biener <rguenther@suse.de>
4144 * tree-vect-data-refs.c (vect_duplicate_ssa_name_ptr_info):
4145 Do not compute alignment of the vectorized access here.
4147 2021-09-20 Richard Biener <rguenther@suse.de>
4149 * tree-vect-data-refs.c (vect_enhance_data_refs_alignment):
4150 Store -1 for runtime alias peeling iterations.
4152 2021-09-20 Richard Biener <rguenther@suse.de>
4154 * config.gcc: Obsolete hppa[12]*-*-hpux10* and hppa[12]*-*-hpux11*.
4156 2021-09-20 Thomas Schwinge <thomas@codesourcery.com>
4158 * input.c (string_concat_db::record_string_concatenation)
4159 (string_concat_db::get_string_concatenation): Skip for
4160 'RESERVED_LOCATION_P'.
4162 2021-09-20 Richard Biener <rguenther@suse.de>
4164 PR tree-optimization/65206
4165 * tree-data-ref.h (struct data_reference): Add alt_indices,
4167 * tree-data-ref.c (free_data_ref): Release alt_indices.
4168 (dr_analyze_indices): Work on struct indices and get DR_REF as tree.
4169 (create_data_ref): Adjust.
4170 (initialize_data_dependence_relation): Split into head
4171 and tail. When the base objects fail to match up try
4172 again with pointer-based analysis of indices.
4173 * tree-vectorizer.c (vec_info_shared::check_datarefs): Do
4174 not compare the lazily computed alternate set of indices.
4176 2021-09-20 Iain Sandoe <iain@sandoe.co.uk>
4178 * gcc.c: Test for execute OK when we find the
4179 programs for assembler linker and dsymutil and those
4180 were specified at configure-time.
4182 2021-09-19 Martin Sebor <msebor@redhat.com>
4184 PR middle-end/102403
4185 * gimple-predicate-analysis.cc (predicate::init_from_control_deps):
4186 Correct a function pre/postcondition.
4188 2021-09-19 Martin Sebor <msebor@redhat.com>
4190 PR middle-end/102243
4191 * tree-ssa-strlen.c (get_range): Handle null cfun.
4193 2021-09-19 Iain Sandoe <iain@sandoe.co.uk>
4195 * config/darwin.h (LINK_COMMAND_SPEC_A): Use Darwin10
4196 unwinder shim as a convenience library.
4198 2021-09-19 Andrew Pinski <apinski@marvell.com>
4200 * doc/install.texi: Add note about
4201 binutils 2.35 is required for LTO usage.
4203 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
4205 * tree-ssa-threadbackward.c
4206 (back_threader_registry::register_path): Use push_edge.
4207 * tree-ssa-threadedge.c
4208 (jump_threader::thread_around_empty_blocks): Same.
4209 (jump_threader::thread_through_normal_block): Same.
4210 (jump_threader::thread_across_edge): Same. Also, use auto_bitmap.
4212 * tree-ssa-threadupdate.c
4213 (jt_path_registry::allocate_thread_edge): Remove.
4214 (jt_path_registry::push_edge): New.
4215 (dump_jump_thread_path): Make static.
4216 * tree-ssa-threadupdate.h (allocate_thread_edge): Remove.
4219 2021-09-19 Aldy Hernandez <aldyh@redhat.com>
4221 * gimple-range-path.cc (path_range_query::path_range_query): Add
4223 (path_range_query::dump): Remove extern declaration of dump_ranger.
4224 * gimple-range-trace.cc (dump_ranger): Add DEBUG_FUNCTION marker.
4225 * gimple-range-trace.h (dump_ranger): Add prototype.
4227 2021-09-19 John Ericson <git@JohnEricson.me>
4229 * gcc.c (find_a_program): New function, factored out of...
4230 (find_a_file): Here.
4231 (execute): Use find_a_program when looking for programs rather
4234 2021-09-19 Matwey V. Kornilov <matwey.kornilov@gmail.com>
4236 * config/avr/avr-mcus.def: Add atmega324pb.
4237 * doc/avr-mmcu.texi: Corresponding changes.
4239 2021-09-19 Roger Sayle <roger@nextmovesoftware.com>
4242 * match.pd (cmp @0 REAL_CST@1): When @0 is also REAL_CST, apply
4243 the same transformations as to @1. For comparisons against NaN,
4244 don't check HONOR_SNANS but confirm that neither operand is a
4247 2021-09-19 Benjamin Peterson <benjamin@locrian.net>
4249 * attribs.c (make_unique_name): Delete.
4250 * attribs.h (make_unique_name): Delete.
4252 2021-09-19 Andrew Pinski <apinski@marvell.com>
4254 * lra-constraints.c (check_and_process_move): Assert
4255 that dclass and sclass are greater than or equal to NO_REGS.
4257 2021-09-18 Jakub Jelinek <jakub@redhat.com>
4259 * tree.h (OMP_CLAUSE_ORDER_UNCONSTRAINED): Define.
4260 * tree-pretty-print.c (dump_omp_clause): Print unconstrained:
4261 for OMP_CLAUSE_ORDER_UNCONSTRAINED.
4263 2021-09-18 liuhongt <hongtao.liu@intel.com>
4265 * config/i386/i386-features.c (remove_partial_avx_dependency):
4266 Restrict TARGET_USE_VECTOR_FP_CONVERTS and
4267 TARGET_USE_VECTOR_CONVERTS to conversion instructions only.
4269 2021-09-18 Jakub Jelinek <jakub@redhat.com>
4271 * gimplify.c (omp_default_clause): For C/C++ default({,first}private),
4272 if file/namespace scope variable doesn't have predetermined sharing,
4273 treat it as if there was default(none).
4275 2021-09-18 liuhongt <hongtao.liu@intel.com>
4277 * config/i386/avx512fp16intrin.h (_mm_fmadd_sh):
4279 (_mm_mask_fmadd_sh): Likewise.
4280 (_mm_mask3_fmadd_sh): Likewise.
4281 (_mm_maskz_fmadd_sh): Likewise.
4282 (_mm_fmadd_round_sh): Likewise.
4283 (_mm_mask_fmadd_round_sh): Likewise.
4284 (_mm_mask3_fmadd_round_sh): Likewise.
4285 (_mm_maskz_fmadd_round_sh): Likewise.
4286 (_mm_fnmadd_sh): Likewise.
4287 (_mm_mask_fnmadd_sh): Likewise.
4288 (_mm_mask3_fnmadd_sh): Likewise.
4289 (_mm_maskz_fnmadd_sh): Likewise.
4290 (_mm_fnmadd_round_sh): Likewise.
4291 (_mm_mask_fnmadd_round_sh): Likewise.
4292 (_mm_mask3_fnmadd_round_sh): Likewise.
4293 (_mm_maskz_fnmadd_round_sh): Likewise.
4294 (_mm_fmsub_sh): Likewise.
4295 (_mm_mask_fmsub_sh): Likewise.
4296 (_mm_mask3_fmsub_sh): Likewise.
4297 (_mm_maskz_fmsub_sh): Likewise.
4298 (_mm_fmsub_round_sh): Likewise.
4299 (_mm_mask_fmsub_round_sh): Likewise.
4300 (_mm_mask3_fmsub_round_sh): Likewise.
4301 (_mm_maskz_fmsub_round_sh): Likewise.
4302 (_mm_fnmsub_sh): Likewise.
4303 (_mm_mask_fnmsub_sh): Likewise.
4304 (_mm_mask3_fnmsub_sh): Likewise.
4305 (_mm_maskz_fnmsub_sh): Likewise.
4306 (_mm_fnmsub_round_sh): Likewise.
4307 (_mm_mask_fnmsub_round_sh): Likewise.
4308 (_mm_mask3_fnmsub_round_sh): Likewise.
4309 (_mm_maskz_fnmsub_round_sh): Likewise.
4310 * config/i386/i386-builtin-types.def
4311 (V8HF_FTYPE_V8HF_V8HF_V8HF_UQI_INT): New builtin type.
4312 * config/i386/i386-builtin.def: Add new builtins.
4313 * config/i386/i386-expand.c: Handle new builtin type.
4314 * config/i386/sse.md (fmai_vmfmadd_<mode><round_name>):
4315 Ajdust to support FP16.
4316 (fmai_vmfmsub_<mode><round_name>): Ditto.
4317 (fmai_vmfnmadd_<mode><round_name>): Ditto.
4318 (fmai_vmfnmsub_<mode><round_name>): Ditto.
4319 (*fmai_fmadd_<mode>): Ditto.
4320 (*fmai_fmsub_<mode>): Ditto.
4321 (*fmai_fnmadd_<mode><round_name>): Ditto.
4322 (*fmai_fnmsub_<mode><round_name>): Ditto.
4323 (avx512f_vmfmadd_<mode>_mask<round_name>): Ditto.
4324 (avx512f_vmfmadd_<mode>_mask3<round_name>): Ditto.
4325 (avx512f_vmfmadd_<mode>_maskz<round_expand_name>): Ditto.
4326 (avx512f_vmfmadd_<mode>_maskz_1<round_name>): Ditto.
4327 (*avx512f_vmfmsub_<mode>_mask<round_name>): Ditto.
4328 (avx512f_vmfmsub_<mode>_mask3<round_name>): Ditto.
4329 (*avx512f_vmfmsub_<mode>_maskz_1<round_name>): Ditto.
4330 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
4331 (*avx512f_vmfnmsub_<mode>_mask3<round_name>): Ditto.
4332 (*avx512f_vmfnmsub_<mode>_mask<round_name>): Ditto.
4333 (*avx512f_vmfnmadd_<mode>_mask<round_name>): Renamed to ...
4334 (avx512f_vmfnmadd_<mode>_mask<round_name>) ... this, and
4335 adjust to support FP16.
4336 (avx512f_vmfnmadd_<mode>_mask3<round_name>): Ditto.
4337 (avx512f_vmfnmadd_<mode>_maskz_1<round_name>): Ditto.
4338 (avx512f_vmfnmadd_<mode>_maskz<round_expand_name>): New
4341 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
4343 * config/i386/sse.md (avx512fmaskmodelower): Extend to support
4345 (maskload<mode><avx512fmaskmodelower>): Ditto.
4346 (maskstore<mode><avx512fmaskmodelower>): Ditto.
4348 2021-09-18 H.J. Lu <hjl.tools@gmail.com>
4350 * config/i386/i386-expand.c (ix86_expand_fp_absneg_operator):
4352 (ix86_expand_copysign): Ditto.
4353 (ix86_expand_xorsign): Ditto.
4354 * config/i386/i386.c (ix86_build_const_vector): Handle HF vector
4356 (ix86_build_signbit_mask): Ditto.
4357 (ix86_can_change_mode_class): Ditto.
4358 * config/i386/i386.md
4359 (SSEMODEF): Add HFmode.
4360 (ssevecmodef): Ditto.
4361 (<code>hf2): New define_expand.
4362 (*<code>hf2_1): New define_insn_and_split.
4363 (copysign<mode>): Extend to support HFmode under AVX512FP16.
4364 (xorsign<mode>): Ditto.
4365 * config/i386/sse.md (VFB): New mode iterator.
4366 (VFB_128_256): Ditto.
4368 (sseintvecmode2): Support HF vector mode.
4369 (<code><mode>2): Use new mode iterator.
4370 (*<code><mode>2): Ditto.
4371 (copysign<mode>3): Ditto.
4372 (xorsign<mode>3): Ditto.
4373 (<code><mode>3<mask_name>): Ditto.
4374 (<code><mode>3<mask_name>): Ditto.
4375 (<sse>_andnot<mode>3<mask_name>): Adjust for HF vector mode.
4376 (<sse>_andnot<mode>3<mask_name>): Ditto.
4377 (*<code><mode>3<mask_name>): Ditto.
4378 (*<code><mode>3<mask_name>): Ditto.
4380 2021-09-18 liuhongt <hongtao.liu@intel.com>
4382 * config/i386/avx512fp16intrin.h (_mm512_mask_fmadd_ph):
4384 (_mm512_mask3_fmadd_ph): Likewise.
4385 (_mm512_maskz_fmadd_ph): Likewise.
4386 (_mm512_fmadd_round_ph): Likewise.
4387 (_mm512_mask_fmadd_round_ph): Likewise.
4388 (_mm512_mask3_fmadd_round_ph): Likewise.
4389 (_mm512_maskz_fmadd_round_ph): Likewise.
4390 (_mm512_fnmadd_ph): Likewise.
4391 (_mm512_mask_fnmadd_ph): Likewise.
4392 (_mm512_mask3_fnmadd_ph): Likewise.
4393 (_mm512_maskz_fnmadd_ph): Likewise.
4394 (_mm512_fnmadd_round_ph): Likewise.
4395 (_mm512_mask_fnmadd_round_ph): Likewise.
4396 (_mm512_mask3_fnmadd_round_ph): Likewise.
4397 (_mm512_maskz_fnmadd_round_ph): Likewise.
4398 (_mm512_fmsub_ph): Likewise.
4399 (_mm512_mask_fmsub_ph): Likewise.
4400 (_mm512_mask3_fmsub_ph): Likewise.
4401 (_mm512_maskz_fmsub_ph): Likewise.
4402 (_mm512_fmsub_round_ph): Likewise.
4403 (_mm512_mask_fmsub_round_ph): Likewise.
4404 (_mm512_mask3_fmsub_round_ph): Likewise.
4405 (_mm512_maskz_fmsub_round_ph): Likewise.
4406 (_mm512_fnmsub_ph): Likewise.
4407 (_mm512_mask_fnmsub_ph): Likewise.
4408 (_mm512_mask3_fnmsub_ph): Likewise.
4409 (_mm512_maskz_fnmsub_ph): Likewise.
4410 (_mm512_fnmsub_round_ph): Likewise.
4411 (_mm512_mask_fnmsub_round_ph): Likewise.
4412 (_mm512_mask3_fnmsub_round_ph): Likewise.
4413 (_mm512_maskz_fnmsub_round_ph): Likewise.
4414 * config/i386/avx512fp16vlintrin.h (_mm256_fmadd_ph):
4416 (_mm256_mask_fmadd_ph): Likewise.
4417 (_mm256_mask3_fmadd_ph): Likewise.
4418 (_mm256_maskz_fmadd_ph): Likewise.
4419 (_mm_fmadd_ph): Likewise.
4420 (_mm_mask_fmadd_ph): Likewise.
4421 (_mm_mask3_fmadd_ph): Likewise.
4422 (_mm_maskz_fmadd_ph): Likewise.
4423 (_mm256_fnmadd_ph): Likewise.
4424 (_mm256_mask_fnmadd_ph): Likewise.
4425 (_mm256_mask3_fnmadd_ph): Likewise.
4426 (_mm256_maskz_fnmadd_ph): Likewise.
4427 (_mm_fnmadd_ph): Likewise.
4428 (_mm_mask_fnmadd_ph): Likewise.
4429 (_mm_mask3_fnmadd_ph): Likewise.
4430 (_mm_maskz_fnmadd_ph): Likewise.
4431 (_mm256_fmsub_ph): Likewise.
4432 (_mm256_mask_fmsub_ph): Likewise.
4433 (_mm256_mask3_fmsub_ph): Likewise.
4434 (_mm256_maskz_fmsub_ph): Likewise.
4435 (_mm_fmsub_ph): Likewise.
4436 (_mm_mask_fmsub_ph): Likewise.
4437 (_mm_mask3_fmsub_ph): Likewise.
4438 (_mm_maskz_fmsub_ph): Likewise.
4439 (_mm256_fnmsub_ph): Likewise.
4440 (_mm256_mask_fnmsub_ph): Likewise.
4441 (_mm256_mask3_fnmsub_ph): Likewise.
4442 (_mm256_maskz_fnmsub_ph): Likewise.
4443 (_mm_fnmsub_ph): Likewise.
4444 (_mm_mask_fnmsub_ph): Likewise.
4445 (_mm_mask3_fnmsub_ph): Likewise.
4446 (_mm_maskz_fnmsub_ph): Likewise.
4447 * config/i386/i386-builtin.def: Add corresponding new builtins.
4448 * config/i386/sse.md
4449 (<avx512>_fmadd_<mode>_maskz<round_expand_name>): Adjust to
4450 support HF vector modes.
4451 (<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name><round_name>):
4453 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
4454 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
4455 (*<sd_mask_codefor>fma_fmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
4456 (<avx512>_fmadd_<mode>_mask<round_name>): Ditto.
4457 (<avx512>_fmadd_<mode>_mask3<round_name>): Ditto.
4458 (<avx512>_fmsub_<mode>_maskz<round_expand_name>): Ditto.
4459 (<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name><round_name>):
4461 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
4462 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
4463 (*<sd_mask_codefor>fma_fmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
4464 (<avx512>_fmsub_<mode>_mask<round_name>): Ditto.
4465 (<avx512>_fmsub_<mode>_mask3<round_name>): Ditto.
4466 (<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name><round_name>):
4468 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_1): Ditto.
4469 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_2): Ditto.
4470 (*<sd_mask_codefor>fma_fnmadd_<mode><sd_maskz_name>_bcst_3): Ditto.
4471 (<avx512>_fnmadd_<mode>_mask<round_name>): Ditto.
4472 (<avx512>_fnmadd_<mode>_mask3<round_name>): Ditto.
4473 (<avx512>_fnmsub_<mode>_maskz<round_expand_name>): Ditto.
4474 (<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name><round_name>):
4476 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_1): Ditto.
4477 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_2): Ditto.
4478 (*<sd_mask_codefor>fma_fnmsub_<mode><sd_maskz_name>_bcst_3): Ditto.
4479 (<avx512>_fnmsub_<mode>_mask<round_name>): Ditto.
4480 (<avx512>_fnmsub_<mode>_mask3<round_name>): Ditto.
4482 2021-09-18 liuhongt <hongtao.liu@intel.com>
4484 * config/i386/avx512fp16intrin.h (_mm512_fmaddsub_ph):
4486 (_mm512_mask_fmaddsub_ph): Likewise.
4487 (_mm512_mask3_fmaddsub_ph): Likewise.
4488 (_mm512_maskz_fmaddsub_ph): Likewise.
4489 (_mm512_fmaddsub_round_ph): Likewise.
4490 (_mm512_mask_fmaddsub_round_ph): Likewise.
4491 (_mm512_mask3_fmaddsub_round_ph): Likewise.
4492 (_mm512_maskz_fmaddsub_round_ph): Likewise.
4493 (_mm512_mask_fmsubadd_ph): Likewise.
4494 (_mm512_mask3_fmsubadd_ph): Likewise.
4495 (_mm512_maskz_fmsubadd_ph): Likewise.
4496 (_mm512_fmsubadd_round_ph): Likewise.
4497 (_mm512_mask_fmsubadd_round_ph): Likewise.
4498 (_mm512_mask3_fmsubadd_round_ph): Likewise.
4499 (_mm512_maskz_fmsubadd_round_ph): Likewise.
4500 * config/i386/avx512fp16vlintrin.h (_mm256_fmaddsub_ph):
4502 (_mm256_mask_fmaddsub_ph): Likewise.
4503 (_mm256_mask3_fmaddsub_ph): Likewise.
4504 (_mm256_maskz_fmaddsub_ph): Likewise.
4505 (_mm_fmaddsub_ph): Likewise.
4506 (_mm_mask_fmaddsub_ph): Likewise.
4507 (_mm_mask3_fmaddsub_ph): Likewise.
4508 (_mm_maskz_fmaddsub_ph): Likewise.
4509 (_mm256_fmsubadd_ph): Likewise.
4510 (_mm256_mask_fmsubadd_ph): Likewise.
4511 (_mm256_mask3_fmsubadd_ph): Likewise.
4512 (_mm256_maskz_fmsubadd_ph): Likewise.
4513 (_mm_fmsubadd_ph): Likewise.
4514 (_mm_mask_fmsubadd_ph): Likewise.
4515 (_mm_mask3_fmsubadd_ph): Likewise.
4516 (_mm_maskz_fmsubadd_ph): Likewise.
4517 * config/i386/i386-builtin.def: Add corresponding new builtins.
4518 * config/i386/sse.md (VFH_SF_AVX512VL): New mode iterator.
4519 * (<avx512>_fmsubadd_<mode>_maskz<round_expand_name>): New expander.
4520 * (<avx512>_fmaddsub_<mode>_maskz<round_expand_name>): Use
4522 * (<sd_mask_codefor>fma_fmaddsub_<mode><sd_maskz_name><round_name>):
4524 * (<avx512>_fmaddsub_<mode>_mask<round_name>): Ditto.
4525 * (<avx512>_fmaddsub_<mode>_mask3<round_name>): Ditto.
4526 * (<sd_mask_codefor>fma_fmsubadd_<mode><sd_maskz_name><round_name>):
4528 * (<avx512>_fmsubadd_<mode>_mask<round_name>): Ditto.
4529 * (<avx512>_fmsubadd_<mode>_mask3<round_name>): Ditto.
4531 2021-09-18 liuhongt <hongtao.liu@intel.com>
4534 * config/i386/i386.c (ix86_print_operand): Handle
4535 V8HF/V16HF/V32HFmode.
4536 * config/i386/i386.h (VALID_BCST_MODE_P): Add HFmode.
4537 * config/i386/sse.md (avx512bcst): Remove.
4539 2021-09-17 Martin Sebor <msebor@redhat.com>
4541 * Makefile.in (OBJS): Add gimple-predicate-analysis.o.
4542 * tree-ssa-uninit.c (max_phi_args): Move to gimple-predicate-analysis.
4543 (MASK_SET_BIT, MASK_TEST_BIT, MASK_EMPTY): Same.
4544 (check_defs): Add comment.
4545 (can_skip_redundant_opnd): Update comment.
4546 (compute_uninit_opnds_pos): Adjust to namespace change.
4547 (find_pdom): Move to gimple-predicate-analysis.cc.
4549 (struct uninit_undef_val_t): New.
4550 (is_non_loop_exit_postdominating): Move to gimple-predicate-analysis.cc.
4551 (find_control_equiv_block): Same.
4552 (MAX_NUM_CHAINS, MAX_CHAIN_LEN, MAX_POSTDOM_CHECK): Same.
4553 (MAX_SWITCH_CASES): Same.
4554 (compute_control_dep_chain): Same.
4555 (find_uninit_use): Use predicate analyzer.
4556 (struct pred_info): Move to gimple-predicate-analysis.
4557 (convert_control_dep_chain_into_preds): Same.
4558 (find_predicates): Same.
4559 (collect_phi_def_edges): Same.
4560 (warn_uninitialized_phi): Use predicate analyzer.
4561 (find_def_preds): Move to gimple-predicate-analysis.
4562 (dump_pred_info): Same.
4563 (dump_pred_chain): Same.
4564 (dump_predicates): Same.
4565 (destroy_predicate_vecs): Remove.
4566 (execute_late_warn_uninitialized): New.
4567 (get_cmp_code): Move to gimple-predicate-analysis.
4568 (is_value_included_in): Same.
4569 (value_sat_pred_p): Same.
4570 (find_matching_predicate_in_rest_chains): Same.
4571 (is_use_properly_guarded): Same.
4572 (prune_uninit_phi_opnds): Same.
4573 (find_var_cmp_const): Same.
4574 (use_pred_not_overlap_with_undef_path_pred): Same.
4575 (pred_equal_p): Same.
4576 (is_neq_relop_p): Same.
4577 (is_neq_zero_form_p): Same.
4578 (pred_expr_equal_p): Same.
4579 (is_pred_expr_subset_of): Same.
4580 (is_pred_chain_subset_of): Same.
4581 (is_included_in): Same.
4582 (is_superset_of): Same.
4584 (simplify_pred): Same.
4585 (simplify_preds_2): Same.
4586 (simplify_preds_3): Same.
4587 (simplify_preds_4): Same.
4588 (simplify_preds): Same.
4590 (push_to_worklist): Same.
4591 (get_pred_info_from_cmp): Same.
4592 (is_degenerated_phi): Same.
4593 (normalize_one_pred_1): Same.
4594 (normalize_one_pred): Same.
4595 (normalize_one_pred_chain): Same.
4596 (normalize_preds): Same.
4597 (can_one_predicate_be_invalidated_p): Same.
4598 (can_chain_union_be_invalidated_p): Same.
4599 (uninit_uses_cannot_happen): Same.
4600 (pass_late_warn_uninitialized::execute): Define.
4601 * gimple-predicate-analysis.cc: New file.
4602 * gimple-predicate-analysis.h: New file.
4604 2021-09-17 Julian Brown <julian@codesourcery.com>
4606 * config/gcn/gcn.c (gimple.h): Include.
4607 (gcn_fork_join): Emit barrier for worker-level joins.
4608 * omp-oacc-neuter-broadcast.cc (find_local_vars_to_propagate): Add
4609 writes_gang_private bitmap parameter. Set bit for blocks
4610 containing gang-private variable writes.
4611 (worker_single_simple): Don't emit barrier after predicated block.
4612 (worker_single_copy): Don't emit barrier if we're not broadcasting
4613 anything and the block contains no gang-private writes.
4614 (neuter_worker_single): Don't predicate blocks that only contain
4615 NOPs or internal marker functions. Pass has_gang_private_write
4616 argument to worker_single_copy.
4617 (oacc_do_neutering): Add writes_gang_private bitmap handling.
4619 2021-09-17 Julian Brown <julian@codesourcery.com>
4621 * config/gcn/gcn-protos.h
4622 (gcn_goacc_create_worker_broadcast_record): Update prototype.
4623 * config/gcn/gcn-tree.c (gcn_goacc_get_worker_red_decl): Use
4624 preallocated block of LDS memory. Do not cache/share decls for
4625 reduction temporaries between invocations.
4626 (gcn_goacc_reduction_teardown): Unshare VAR on second use.
4627 (gcn_goacc_create_worker_broadcast_record): Add OFFSET parameter
4628 and return temporary LDS space at that offset. Return pointer in
4630 * config/gcn/gcn.c (acc_lds_size, gang_private_hwm, lds_allocs):
4632 (ACC_LDS_SIZE): Define as acc_lds_size.
4633 (gcn_init_machine_status): Don't initialise lds_allocated,
4634 lds_allocs, reduc_decls fields of machine function struct.
4635 (gcn_option_override): Handle default size for gang-private
4636 variables and -mgang-private-size option.
4637 (gcn_expand_prologue): Use LDS_SIZE instead of LDS_SIZE-1 when
4638 initialising M0_REG.
4639 (gcn_shared_mem_layout): New function.
4640 (gcn_print_lds_decl): Update comment. Use global lds_allocs map and
4641 gang_private_hwm variable.
4642 (TARGET_GOACC_SHARED_MEM_LAYOUT): Define target hook.
4643 * config/gcn/gcn.h (machine_function): Remove lds_allocated,
4644 lds_allocs, reduc_decls. Add reduction_base, reduction_limit.
4645 * config/gcn/gcn.opt (gang_private_size_opt): New global.
4646 (mgang-private-size=): New option.
4647 * doc/tm.texi.in (TARGET_GOACC_SHARED_MEM_LAYOUT): Place
4649 * doc/tm.texi: Regenerate.
4650 * omp-oacc-neuter-broadcast.cc (targhooks.h, diagnostic-core.h):
4652 (build_sender_ref): Handle sender_decl being pointer.
4653 (worker_single_copy): Add PLACEMENT and ISOLATE_BROADCASTS
4654 parameters. Pass placement argument to
4655 create_worker_broadcast_record hook invocations. Handle
4656 sender_decl being pointer and isolate_broadcasts inserting extra
4658 (blk_offset_map_t): Add typedef.
4659 (neuter_worker_single): Add BLK_OFFSET_MAP parameter. Pass
4660 preallocated range to worker_single_copy call.
4661 (dfs_broadcast_reachable_1): New function.
4662 (idx_decl_pair_t, used_range_vec_t): New typedefs.
4663 (sort_size_descending): New function.
4664 (addr_range): New class.
4665 (splay_tree_compare_addr_range, splay_tree_free_key)
4666 (first_fit_range, merge_ranges_1, merge_ranges): New functions.
4667 (execute_omp_oacc_neuter_broadcast): Rename to...
4668 (oacc_do_neutering): ... this. Add BOUNDS_LO, BOUNDS_HI
4669 parameters. Arrange layout of shared memory for broadcast
4671 (execute_omp_oacc_neuter_broadcast): New function.
4672 (pass_omp_oacc_neuter_broadcast::gate): Remove num_workers==1
4673 handling from here. Enable pass for all OpenACC routines in order
4674 to call shared memory-layout hook.
4675 * target.def (create_worker_broadcast_record): Add OFFSET
4677 (shared_mem_layout): New hook.
4679 2021-09-17 Julian Brown <julian@codesourcery.com>
4680 Thomas Schwinge <thomas@codesourcery.com>
4682 * omp-oacc-neuter-broadcast.cc
4683 (pass_omp_oacc_neuter_broadcast::gate): Disable if num_workers is
4685 (execute_omp_oacc_neuter_broadcast): Adjust.
4687 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
4689 * value-relation.cc (class equiv_chain): Move to header file.
4690 (path_oracle::path_oracle): New.
4691 (path_oracle::~path_oracle): New.
4692 (path_oracle::register_relation): New.
4693 (path_oracle::query_relation): New.
4694 (path_oracle::reset_path): New.
4695 (path_oracle::dump): New.
4696 * value-relation.h (class equiv_chain): Move to here.
4697 (class path_oracle): New.
4699 2021-09-17 Andrew MacLeod <amacleod@redhat.com>
4701 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a DOM
4703 * gimple-range-fold.cc (fur_depend::register_relation): Use
4704 register_stmt/edge routines.
4705 * value-relation.cc (equiv_chain::find): Relocate from equiv_oracle.
4706 (equiv_oracle::equiv_oracle): Create self equivalence cache.
4707 (equiv_oracle::~equiv_oracle): Release same.
4708 (equiv_oracle::equiv_set): Return entry from self equiv cache if there
4709 are no equivalences.
4710 (equiv_oracle::find_equiv_block): Move list find to equiv_chain.
4711 (equiv_oracle::register_relation): Rename from register_equiv.
4712 (relation_chain_head::find_relation): Relocate from dom_oracle.
4713 (relation_oracle::register_stmt): New.
4714 (relation_oracle::register_edge): New.
4715 (dom_oracle::*): Rename from relation_oracle.
4716 (dom_oracle::register_relation): Adjust to call equiv_oracle.
4717 (dom_oracle::set_one_relation): Split from register_relation.
4718 (dom_oracle::register_transitives): Consolidate 2 methods.
4719 (dom_oracle::find_relation_block): Move core to relation_chain.
4720 (dom_oracle::query_relation): Rename from find_relation_dom and adjust.
4721 * value-relation.h (class relation_oracle): New pure virtual base.
4722 (class equiv_oracle): Inherit from relation_oracle and adjust.
4723 (class dom_oracle): Rename from old relation_oracle and adjust.
4725 2021-09-17 Martin Sebor <msebor@redhat.com>
4727 PR middle-end/102200
4728 * pointer-query.cc (access_ref::inform_access): Handle MIN/MAX_EXPR.
4729 (handle_min_max_size): Change argument. Store original SSA_NAME for
4730 operands to potentially distinct (sub)objects.
4731 (compute_objsize_r): Adjust call to the above.
4733 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
4735 * config/rs6000/rs6000.c (rs6000-builtins.h): New include.
4736 (rs6000_new_builtin_vectorized_function): New function.
4737 (rs6000_new_builtin_md_vectorized_function): Likewise.
4738 (rs6000_builtin_vectorized_function): Call
4739 rs6000_new_builtin_vectorized_function.
4740 (rs6000_builtin_md_vectorized_function): Call
4741 rs6000_new_builtin_md_vectorized_function.
4743 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
4745 * config/rs6000/rs6000-builtin-new.def (ASSEMBLE_ACC): Add mmaint flag.
4746 (ASSEMBLE_PAIR): Likewise.
4747 (BUILD_ACC): Likewise.
4748 (DISASSEMBLE_ACC): Likewise.
4749 (DISASSEMBLE_PAIR): Likewise.
4750 (PMXVBF16GER2): Likewise.
4751 (PMXVBF16GER2NN): Likewise.
4752 (PMXVBF16GER2NP): Likewise.
4753 (PMXVBF16GER2PN): Likewise.
4754 (PMXVBF16GER2PP): Likewise.
4755 (PMXVF16GER2): Likewise.
4756 (PMXVF16GER2NN): Likewise.
4757 (PMXVF16GER2NP): Likewise.
4758 (PMXVF16GER2PN): Likewise.
4759 (PMXVF16GER2PP): Likewise.
4760 (PMXVF32GER): Likewise.
4761 (PMXVF32GERNN): Likewise.
4762 (PMXVF32GERNP): Likewise.
4763 (PMXVF32GERPN): Likewise.
4764 (PMXVF32GERPP): Likewise.
4765 (PMXVF64GER): Likewise.
4766 (PMXVF64GERNN): Likewise.
4767 (PMXVF64GERNP): Likewise.
4768 (PMXVF64GERPN): Likewise.
4769 (PMXVF64GERPP): Likewise.
4770 (PMXVI16GER2): Likewise.
4771 (PMXVI16GER2PP): Likewise.
4772 (PMXVI16GER2S): Likewise.
4773 (PMXVI16GER2SPP): Likewise.
4774 (PMXVI4GER8): Likewise.
4775 (PMXVI4GER8PP): Likewise.
4776 (PMXVI8GER4): Likewise.
4777 (PMXVI8GER4PP): Likewise.
4778 (PMXVI8GER4SPP): Likewise.
4779 (XVBF16GER2): Likewise.
4780 (XVBF16GER2NN): Likewise.
4781 (XVBF16GER2NP): Likewise.
4782 (XVBF16GER2PN): Likewise.
4783 (XVBF16GER2PP): Likewise.
4784 (XVF16GER2): Likewise.
4785 (XVF16GER2NN): Likewise.
4786 (XVF16GER2NP): Likewise.
4787 (XVF16GER2PN): Likewise.
4788 (XVF16GER2PP): Likewise.
4789 (XVF32GER): Likewise.
4790 (XVF32GERNN): Likewise.
4791 (XVF32GERNP): Likewise.
4792 (XVF32GERPN): Likewise.
4793 (XVF32GERPP): Likewise.
4794 (XVF64GER): Likewise.
4795 (XVF64GERNN): Likewise.
4796 (XVF64GERNP): Likewise.
4797 (XVF64GERPN): Likewise.
4798 (XVF64GERPP): Likewise.
4799 (XVI16GER2): Likewise.
4800 (XVI16GER2PP): Likewise.
4801 (XVI16GER2S): Likewise.
4802 (XVI16GER2SPP): Likewise.
4803 (XVI4GER8): Likewise.
4804 (XVI4GER8PP): Likewise.
4805 (XVI8GER4): Likewise.
4806 (XVI8GER4PP): Likewise.
4807 (XVI8GER4SPP): Likewise.
4808 (XXMFACC): Likewise.
4809 (XXMTACC): Likewise.
4810 (XXSETACCZ): Likewise.
4811 (ASSEMBLE_PAIR_V): Likewise.
4812 (BUILD_PAIR): Likewise.
4813 (DISASSEMBLE_PAIR_V): Likewise.
4816 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_mma_builtin):
4817 Handle RS6000_BIF_LXVP and RS6000_BIF_STXVP.
4818 * config/rs6000/rs6000-gen-builtins.c (attrinfo): Add ismmaint.
4819 (parse_bif_attrs): Handle ismmaint.
4820 (write_decls): Add bif_mmaint_bit and bif_is_mmaint.
4821 (write_bif_static_init): Handle ismmaint.
4823 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
4825 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_new_builtin): New
4827 (rs6000_gimple_fold_builtin): Call rs6000_gimple_fold_new_builtin.
4828 (rs6000_new_builtin_valid_without_lhs): New function.
4829 (rs6000_gimple_fold_new_mma_builtin): Likewise.
4830 (rs6000_gimple_fold_new_builtin): Likewise.
4832 2021-09-17 Thomas Schwinge <thomas@codesourcery.com>
4834 * hash-table.h (hash_table<Descriptor, Lazy, Allocator>::expand):
4835 Destruct stale Value objects.
4836 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
4839 2021-09-17 Roger Sayle <roger@nextmovesoftware.com>
4842 * match.pd (shift optimizations): Disable recent sign-changing
4843 optimization for shifts by zero, these will be folded later.
4845 2021-09-17 Bill Schmidt <wschmidt@linux.ibm.com>
4847 * config/rs6000/rs6000-builtin-new.def (__builtin_mffsl): Move from
4848 [power9] to [always].
4850 2021-09-17 Richard Biener <rguenther@suse.de>
4852 * tree-vect-stmts.c (vectorizable_load): Do not frob
4855 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
4857 * config/i386/i386-features.c (remove_partial_avx_dependency):
4858 Also check TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY and
4859 and TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY before generating
4861 * config/i386/i386.h (TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY):
4863 (TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
4864 * config/i386/i386.md (SSE FP to FP splitters): Replace
4865 TARGET_SSE_PARTIAL_REG_DEPENDENCY with
4866 TARGET_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY.
4867 (SSE INT to FP splitter): Replace TARGET_SSE_PARTIAL_REG_DEPENDENCY
4868 with TARGET_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY.
4869 * config/i386/x86-tune.def
4870 (X86_TUNE_SSE_PARTIAL_REG_FP_CONVERTS_DEPENDENCY): New.
4871 (X86_TUNE_SSE_PARTIAL_REG_CONVERTS_DEPENDENCY): Likewise.
4873 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
4876 * config/i386/i386-features.c (remove_partial_avx_dependency):
4877 Check TARGET_USE_VECTOR_FP_CONVERTS and TARGET_USE_VECTOR_CONVERTS
4878 before generating vxorps.
4880 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
4882 * config/i386/i386-options.c (processor_cost_table): Use
4883 tremont_cost for Tremont.
4884 * config/i386/x86-tune-costs.h (tremont_memcpy): New.
4885 (tremont_memset): Likewise.
4886 (tremont_cost): Likewise.
4887 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
4890 2021-09-17 H.J. Lu <hjl.tools@gmail.com>
4892 * common/config/i386/i386-common.c: Use Haswell scheduling model
4894 * config/i386/i386.c (ix86_sched_init_global): Prepare for Tremont
4896 * config/i386/x86-tune-sched.c (ix86_issue_rate): Change Tremont
4898 (ix86_adjust_cost): Handle Tremont.
4899 * config/i386/x86-tune.def (X86_TUNE_SSE_PARTIAL_REG_DEPENDENCY):
4901 (X86_TUNE_USE_LEAVE): Likewise.
4902 (X86_TUNE_PUSH_MEMORY): Likewise.
4903 (X86_TUNE_MISALIGNED_MOVE_STRING_PRO_EPILOGUES): Likewise.
4904 (X86_TUNE_USE_CLTD): Likewise.
4905 (X86_TUNE_AVOID_FALSE_DEP_FOR_BMI): Likewise.
4906 (X86_TUNE_AVOID_MFENCE): Likewise.
4907 (X86_TUNE_SSE_TYPELESS_STORES): Likewise.
4908 (X86_TUNE_SSE_LOAD0_BY_PXOR): Likewise.
4909 (X86_TUNE_ACCUMULATE_OUTGOING_ARGS): Disable for Tremont.
4910 (X86_TUNE_FOUR_JUMP_LIMIT): Likewise.
4911 (X86_TUNE_OPT_AGU): Likewise.
4912 (X86_TUNE_AVOID_LEA_FOR_ADDR): Likewise.
4913 (X86_TUNE_AVOID_MEM_OPND_FOR_CMOVE): Likewise.
4914 (X86_TUNE_EXPAND_ABS): Likewise.
4915 (X86_TUNE_SPLIT_MEM_OPND_FOR_FP_CONVERTS): Likewise.
4916 (X86_TUNE_SLOW_PSHUFB): Likewise.
4918 2021-09-17 Eric Botcazou <ebotcazou@adacore.com>
4920 PR rtl-optimization/102306
4921 * combine.c (try_combine): Abort the combination if we are about to
4922 duplicate volatile references.
4924 2021-09-17 liuhongt <hongtao.liu@intel.com>
4926 * config/i386/avx512fp16intrin.h (_mm_undefined_ph):
4928 (_mm256_undefined_ph): Likewise.
4929 (_mm512_undefined_ph): Likewise.
4930 (_mm_cvtsh_h): Likewise.
4931 (_mm256_cvtsh_h): Likewise.
4932 (_mm512_cvtsh_h): Likewise.
4933 (_mm512_castph_ps): Likewise.
4934 (_mm512_castph_pd): Likewise.
4935 (_mm512_castph_si512): Likewise.
4936 (_mm512_castph512_ph128): Likewise.
4937 (_mm512_castph512_ph256): Likewise.
4938 (_mm512_castph128_ph512): Likewise.
4939 (_mm512_castph256_ph512): Likewise.
4940 (_mm512_zextph128_ph512): Likewise.
4941 (_mm512_zextph256_ph512): Likewise.
4942 (_mm512_castps_ph): Likewise.
4943 (_mm512_castpd_ph): Likewise.
4944 (_mm512_castsi512_ph): Likewise.
4945 * config/i386/avx512fp16vlintrin.h (_mm_castph_ps):
4947 (_mm256_castph_ps): Likewise.
4948 (_mm_castph_pd): Likewise.
4949 (_mm256_castph_pd): Likewise.
4950 (_mm_castph_si128): Likewise.
4951 (_mm256_castph_si256): Likewise.
4952 (_mm_castps_ph): Likewise.
4953 (_mm256_castps_ph): Likewise.
4954 (_mm_castpd_ph): Likewise.
4955 (_mm256_castpd_ph): Likewise.
4956 (_mm_castsi128_ph): Likewise.
4957 (_mm256_castsi256_ph): Likewise.
4958 (_mm256_castph256_ph128): Likewise.
4959 (_mm256_castph128_ph256): Likewise.
4960 (_mm256_zextph128_ph256): Likewise.
4962 2021-09-17 liuhongt <hongtao.liu@intel.com>
4964 * config/i386/avx512fp16intrin.h (_mm_cvtsh_ss):
4966 (_mm_mask_cvtsh_ss): Likewise.
4967 (_mm_maskz_cvtsh_ss): Likewise.
4968 (_mm_cvtsh_sd): Likewise.
4969 (_mm_mask_cvtsh_sd): Likewise.
4970 (_mm_maskz_cvtsh_sd): Likewise.
4971 (_mm_cvt_roundsh_ss): Likewise.
4972 (_mm_mask_cvt_roundsh_ss): Likewise.
4973 (_mm_maskz_cvt_roundsh_ss): Likewise.
4974 (_mm_cvt_roundsh_sd): Likewise.
4975 (_mm_mask_cvt_roundsh_sd): Likewise.
4976 (_mm_maskz_cvt_roundsh_sd): Likewise.
4977 (_mm_cvtss_sh): Likewise.
4978 (_mm_mask_cvtss_sh): Likewise.
4979 (_mm_maskz_cvtss_sh): Likewise.
4980 (_mm_cvtsd_sh): Likewise.
4981 (_mm_mask_cvtsd_sh): Likewise.
4982 (_mm_maskz_cvtsd_sh): Likewise.
4983 (_mm_cvt_roundss_sh): Likewise.
4984 (_mm_mask_cvt_roundss_sh): Likewise.
4985 (_mm_maskz_cvt_roundss_sh): Likewise.
4986 (_mm_cvt_roundsd_sh): Likewise.
4987 (_mm_mask_cvt_roundsd_sh): Likewise.
4988 (_mm_maskz_cvt_roundsd_sh): Likewise.
4989 * config/i386/i386-builtin-types.def
4990 (V8HF_FTYPE_V2DF_V8HF_V8HF_UQI_INT,
4991 V8HF_FTYPE_V4SF_V8HF_V8HF_UQI_INT,
4992 V2DF_FTYPE_V8HF_V2DF_V2DF_UQI_INT,
4993 V4SF_FTYPE_V8HF_V4SF_V4SF_UQI_INT): Add new builtin types.
4994 * config/i386/i386-builtin.def: Add corrresponding new builtins.
4995 * config/i386/i386-expand.c: Handle new builtin types.
4996 * config/i386/sse.md (VF48_128): New mode iterator.
4997 (avx512fp16_vcvtsh2<ssescalarmodesuffix><mask_scalar_name><round_saeonly_scalar_name>):
4999 (avx512fp16_vcvt<ssescalarmodesuffix>2sh<mask_scalar_name><round_scalar_name>):
5002 2021-09-17 liuhongt <hongtao.liu@intel.com>
5004 * config/i386/avx512fp16intrin.h (_mm512_cvtph_pd):
5006 (_mm512_mask_cvtph_pd): Likewise.
5007 (_mm512_maskz_cvtph_pd): Likewise.
5008 (_mm512_cvt_roundph_pd): Likewise.
5009 (_mm512_mask_cvt_roundph_pd): Likewise.
5010 (_mm512_maskz_cvt_roundph_pd): Likewise.
5011 (_mm512_cvtxph_ps): Likewise.
5012 (_mm512_mask_cvtxph_ps): Likewise.
5013 (_mm512_maskz_cvtxph_ps): Likewise.
5014 (_mm512_cvtx_roundph_ps): Likewise.
5015 (_mm512_mask_cvtx_roundph_ps): Likewise.
5016 (_mm512_maskz_cvtx_roundph_ps): Likewise.
5017 (_mm512_cvtxps_ph): Likewise.
5018 (_mm512_mask_cvtxps_ph): Likewise.
5019 (_mm512_maskz_cvtxps_ph): Likewise.
5020 (_mm512_cvtx_roundps_ph): Likewise.
5021 (_mm512_mask_cvtx_roundps_ph): Likewise.
5022 (_mm512_maskz_cvtx_roundps_ph): Likewise.
5023 (_mm512_cvtpd_ph): Likewise.
5024 (_mm512_mask_cvtpd_ph): Likewise.
5025 (_mm512_maskz_cvtpd_ph): Likewise.
5026 (_mm512_cvt_roundpd_ph): Likewise.
5027 (_mm512_mask_cvt_roundpd_ph): Likewise.
5028 (_mm512_maskz_cvt_roundpd_ph): Likewise.
5029 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_pd):
5031 (_mm_mask_cvtph_pd): Likewise.
5032 (_mm_maskz_cvtph_pd): Likewise.
5033 (_mm256_cvtph_pd): Likewise.
5034 (_mm256_mask_cvtph_pd): Likewise.
5035 (_mm256_maskz_cvtph_pd): Likewise.
5036 (_mm_cvtxph_ps): Likewise.
5037 (_mm_mask_cvtxph_ps): Likewise.
5038 (_mm_maskz_cvtxph_ps): Likewise.
5039 (_mm256_cvtxph_ps): Likewise.
5040 (_mm256_mask_cvtxph_ps): Likewise.
5041 (_mm256_maskz_cvtxph_ps): Likewise.
5042 (_mm_cvtxps_ph): Likewise.
5043 (_mm_mask_cvtxps_ph): Likewise.
5044 (_mm_maskz_cvtxps_ph): Likewise.
5045 (_mm256_cvtxps_ph): Likewise.
5046 (_mm256_mask_cvtxps_ph): Likewise.
5047 (_mm256_maskz_cvtxps_ph): Likewise.
5048 (_mm_cvtpd_ph): Likewise.
5049 (_mm_mask_cvtpd_ph): Likewise.
5050 (_mm_maskz_cvtpd_ph): Likewise.
5051 (_mm256_cvtpd_ph): Likewise.
5052 (_mm256_mask_cvtpd_ph): Likewise.
5053 (_mm256_maskz_cvtpd_ph): Likewise.
5054 * config/i386/i386-builtin.def: Add corresponding new builtins.
5055 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5056 * config/i386/i386-expand.c: Handle new builtin types.
5057 * config/i386/sse.md
5058 (VF4_128_8_256): New.
5059 (VF48H_AVX512VL): Ditto.
5060 (ssePHmode): Add HF vector modes.
5061 (castmode): Add new convertable modes.
5064 (avx512fp16_vcvt<castmode>2ph_<mode><mask_name><round_name>): Ditto.
5065 (avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
5066 (*avx512fp16_vcvt<castmode>2ph_<mode>): Ditto.
5067 (avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
5068 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask): Ditto.
5069 (*avx512fp16_vcvt<castmode>2ph_<mode>_mask_1): Ditto.
5070 (avx512fp16_float_extend_ph<mode>2<mask_name><round_saeonly_name>):
5072 (avx512fp16_float_extend_ph<mode>2<mask_name>): Ditto.
5073 (*avx512fp16_float_extend_ph<mode>2_load<mask_name>): Ditto.
5074 (avx512fp16_float_extend_phv2df2<mask_name>): Ditto.
5075 (*avx512fp16_float_extend_phv2df2_load<mask_name>): Ditto.
5077 2021-09-17 liuhongt <hongtao.liu@intel.com>
5079 * config/i386/avx512fp16intrin.h (_mm_cvttsh_i32):
5081 (_mm_cvttsh_u32): Likewise.
5082 (_mm_cvtt_roundsh_i32): Likewise.
5083 (_mm_cvtt_roundsh_u32): Likewise.
5084 (_mm_cvttsh_i64): Likewise.
5085 (_mm_cvttsh_u64): Likewise.
5086 (_mm_cvtt_roundsh_i64): Likewise.
5087 (_mm_cvtt_roundsh_u64): Likewise.
5088 * config/i386/i386-builtin.def: Add corresponding new builtins.
5089 * config/i386/sse.md
5090 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<round_saeonly_name>):
5093 2021-09-17 liuhongt <hongtao.liu@intel.com>
5095 * config/i386/avx512fp16intrin.h (_mm512_cvttph_epi32):
5097 (_mm512_mask_cvttph_epi32): Likewise.
5098 (_mm512_maskz_cvttph_epi32): Likewise.
5099 (_mm512_cvtt_roundph_epi32): Likewise.
5100 (_mm512_mask_cvtt_roundph_epi32): Likewise.
5101 (_mm512_maskz_cvtt_roundph_epi32): Likewise.
5102 (_mm512_cvttph_epu32): Likewise.
5103 (_mm512_mask_cvttph_epu32): Likewise.
5104 (_mm512_maskz_cvttph_epu32): Likewise.
5105 (_mm512_cvtt_roundph_epu32): Likewise.
5106 (_mm512_mask_cvtt_roundph_epu32): Likewise.
5107 (_mm512_maskz_cvtt_roundph_epu32): Likewise.
5108 (_mm512_cvttph_epi64): Likewise.
5109 (_mm512_mask_cvttph_epi64): Likewise.
5110 (_mm512_maskz_cvttph_epi64): Likewise.
5111 (_mm512_cvtt_roundph_epi64): Likewise.
5112 (_mm512_mask_cvtt_roundph_epi64): Likewise.
5113 (_mm512_maskz_cvtt_roundph_epi64): Likewise.
5114 (_mm512_cvttph_epu64): Likewise.
5115 (_mm512_mask_cvttph_epu64): Likewise.
5116 (_mm512_maskz_cvttph_epu64): Likewise.
5117 (_mm512_cvtt_roundph_epu64): Likewise.
5118 (_mm512_mask_cvtt_roundph_epu64): Likewise.
5119 (_mm512_maskz_cvtt_roundph_epu64): Likewise.
5120 (_mm512_cvttph_epi16): Likewise.
5121 (_mm512_mask_cvttph_epi16): Likewise.
5122 (_mm512_maskz_cvttph_epi16): Likewise.
5123 (_mm512_cvtt_roundph_epi16): Likewise.
5124 (_mm512_mask_cvtt_roundph_epi16): Likewise.
5125 (_mm512_maskz_cvtt_roundph_epi16): Likewise.
5126 (_mm512_cvttph_epu16): Likewise.
5127 (_mm512_mask_cvttph_epu16): Likewise.
5128 (_mm512_maskz_cvttph_epu16): Likewise.
5129 (_mm512_cvtt_roundph_epu16): Likewise.
5130 (_mm512_mask_cvtt_roundph_epu16): Likewise.
5131 (_mm512_maskz_cvtt_roundph_epu16): Likewise.
5132 * config/i386/avx512fp16vlintrin.h (_mm_cvttph_epi32):
5134 (_mm_mask_cvttph_epi32): Likewise.
5135 (_mm_maskz_cvttph_epi32): Likewise.
5136 (_mm256_cvttph_epi32): Likewise.
5137 (_mm256_mask_cvttph_epi32): Likewise.
5138 (_mm256_maskz_cvttph_epi32): Likewise.
5139 (_mm_cvttph_epu32): Likewise.
5140 (_mm_mask_cvttph_epu32): Likewise.
5141 (_mm_maskz_cvttph_epu32): Likewise.
5142 (_mm256_cvttph_epu32): Likewise.
5143 (_mm256_mask_cvttph_epu32): Likewise.
5144 (_mm256_maskz_cvttph_epu32): Likewise.
5145 (_mm_cvttph_epi64): Likewise.
5146 (_mm_mask_cvttph_epi64): Likewise.
5147 (_mm_maskz_cvttph_epi64): Likewise.
5148 (_mm256_cvttph_epi64): Likewise.
5149 (_mm256_mask_cvttph_epi64): Likewise.
5150 (_mm256_maskz_cvttph_epi64): Likewise.
5151 (_mm_cvttph_epu64): Likewise.
5152 (_mm_mask_cvttph_epu64): Likewise.
5153 (_mm_maskz_cvttph_epu64): Likewise.
5154 (_mm256_cvttph_epu64): Likewise.
5155 (_mm256_mask_cvttph_epu64): Likewise.
5156 (_mm256_maskz_cvttph_epu64): Likewise.
5157 (_mm_cvttph_epi16): Likewise.
5158 (_mm_mask_cvttph_epi16): Likewise.
5159 (_mm_maskz_cvttph_epi16): Likewise.
5160 (_mm256_cvttph_epi16): Likewise.
5161 (_mm256_mask_cvttph_epi16): Likewise.
5162 (_mm256_maskz_cvttph_epi16): Likewise.
5163 (_mm_cvttph_epu16): Likewise.
5164 (_mm_mask_cvttph_epu16): Likewise.
5165 (_mm_maskz_cvttph_epu16): Likewise.
5166 (_mm256_cvttph_epu16): Likewise.
5167 (_mm256_mask_cvttph_epu16): Likewise.
5168 (_mm256_maskz_cvttph_epu16): Likewise.
5169 * config/i386/i386-builtin.def: Add new builtins.
5170 * config/i386/sse.md
5171 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name><round_saeonly_name>):
5173 (avx512fp16_fix<fixunssuffix>_trunc<mode>2<mask_name>): Ditto.
5174 (*avx512fp16_fix<fixunssuffix>_trunc<mode>2_load<mask_name>): Ditto.
5175 (avx512fp16_fix<fixunssuffix>_truncv2di2<mask_name>): Ditto.
5176 (avx512fp16_fix<fixunssuffix>_truncv2di2_load<mask_name>): Ditto.
5178 2021-09-17 liuhongt <hongtao.liu@intel.com>
5180 * config/i386/avx512fp16intrin.h (_mm_cvtsh_i32): New intrinsic.
5181 (_mm_cvtsh_u32): Likewise.
5182 (_mm_cvt_roundsh_i32): Likewise.
5183 (_mm_cvt_roundsh_u32): Likewise.
5184 (_mm_cvtsh_i64): Likewise.
5185 (_mm_cvtsh_u64): Likewise.
5186 (_mm_cvt_roundsh_i64): Likewise.
5187 (_mm_cvt_roundsh_u64): Likewise.
5188 (_mm_cvti32_sh): Likewise.
5189 (_mm_cvtu32_sh): Likewise.
5190 (_mm_cvt_roundi32_sh): Likewise.
5191 (_mm_cvt_roundu32_sh): Likewise.
5192 (_mm_cvti64_sh): Likewise.
5193 (_mm_cvtu64_sh): Likewise.
5194 (_mm_cvt_roundi64_sh): Likewise.
5195 (_mm_cvt_roundu64_sh): Likewise.
5196 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5197 * config/i386/i386-builtin.def: Add corresponding new builtins.
5198 * config/i386/i386-expand.c (ix86_expand_round_builtin):
5199 Handle new builtin types.
5200 * config/i386/sse.md
5201 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix><round_name>):
5203 (avx512fp16_vcvtsh2<sseintconvertsignprefix>si<rex64namesuffix>_2): Likewise.
5204 (avx512fp16_vcvt<floatsuffix>si2sh<rex64namesuffix><round_name>): Likewise.
5206 2021-09-16 Bill Schmidt <wschmidt@linux.ibm.com>
5208 * config/rs6000/rs6000-c.c (rs6000-builtins.h): New include.
5209 (altivec_resolve_new_overloaded_builtin): New forward decl.
5210 (rs6000_new_builtin_type_compatible): New function.
5211 (altivec_resolve_overloaded_builtin): Call
5212 altivec_resolve_new_overloaded_builtin.
5213 (altivec_build_new_resolved_builtin): New function.
5214 (altivec_resolve_new_overloaded_builtin): Likewise.
5215 * config/rs6000/rs6000-call.c (rs6000_new_builtin_is_supported):
5217 * config/rs6000/rs6000-gen-builtins.c (write_decls): Remove _p from
5218 name of rs6000_new_builtin_is_supported.
5220 2021-09-16 Uroš Bizjak <ubizjak@gmail.com>
5222 * config/i386/i386-protos.h (ix86_decompose_address):
5223 Change return type to bool.
5224 * config/i386/i386.c (ix86_decompose_address): Ditto.
5226 2021-09-16 Tobias Burnus <tobias@codesourcery.com>
5229 * config/rs6000/t-rs6000 (build/rs6000-gen-builtins.o, build/rbtree.o):
5230 Added 'build/' to target, use build/%.o rule.
5231 (build/rs6000-gen-builtins$(build_exeext)): Add 'build/' and
5232 '$(build_exeext)' to target and 'build/' for the *.o files.
5233 (rs6000-builtins.c): Update for those changes; run rs6000-gen-builtins
5236 2021-09-16 Martin Jambor <mjambor@suse.cz>
5238 * cgraph.c (cgraph_node::dump): Do not check caller count sums if
5239 the body has been removed. Remove trailing whitespace.
5241 2021-09-16 Richard Biener <rguenther@suse.de>
5243 PR middle-end/102360
5244 * internal-fn.c (expand_DEFERRED_INIT): Make pattern-init
5245 of non-memory more robust.
5247 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5249 * config/sparc/sparc-opts.h (enum sparc_processor_type): Add LEON5
5250 * config/sparc/sparc.c (struct processor_costs): Add LEON5 costs
5251 (leon5_adjust_cost): Increase cost of store with data dependency
5252 on ALU instruction and FPU anti-dependencies.
5253 (sparc_option_override): Add LEON5 costs
5254 (sparc_adjust_cost): Add LEON5 cost adjustments
5255 * config/sparc/sparc.h: Add LEON5
5256 * config/sparc/sparc.md: Include LEON5 scheduling information
5257 * config/sparc/sparc.opt: Add LEON5
5258 * doc/invoke.texi: Add LEON5
5259 * config/sparc/leon5.md: New file.
5261 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5263 * config/sparc/sparc.md (stack_protect_set32): Add NOP to prevent
5264 sensitive sequence for B2BST errata workaround.
5266 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5268 * config/sparc/sparc.c (sparc_do_work_around_errata): Do not begin
5269 functions with atomic instruction in the UT700 errata workaround.
5271 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5273 * config/sparc/sparc.c (next_active_non_empty_insn): New function
5274 that returns next active non empty assembly instruction.
5275 (sparc_do_work_around_errata): Use new function.
5277 2021-09-16 Daniel Cederman <cederman@gaisler.com>
5279 * config/sparc/sparc.c (store_insn_p): Add predicate for store
5281 (load_insn_p): Add predicate for load attributes.
5282 (sparc_do_work_around_errata): Use new predicates.
5284 2021-09-16 Andreas Larsson <andreas@gaisler.com>
5286 * config/sparc/sparc.c (dump_target_flag_bits): Print bit names for
5289 2021-09-16 Martin Liska <mliska@suse.cz>
5291 * config/mips/netbsd.h: Fix typo in name of a macro.
5293 2021-09-16 liuhongt <hongtao.liu@intel.com>
5295 PR middle-end/102080
5296 * match.pd: Check mask type when doing cond_op related gimple
5298 * tree.c (is_truth_type_for): New function.
5299 * tree.h (is_truth_type_for): New declaration.
5301 2021-09-16 liuhongt <hongtao.liu@intel.com>
5303 * config/i386/avx512fp16intrin.h (_mm512_cvtepi32_ph): New
5305 (_mm512_mask_cvtepi32_ph): Likewise.
5306 (_mm512_maskz_cvtepi32_ph): Likewise.
5307 (_mm512_cvt_roundepi32_ph): Likewise.
5308 (_mm512_mask_cvt_roundepi32_ph): Likewise.
5309 (_mm512_maskz_cvt_roundepi32_ph): Likewise.
5310 (_mm512_cvtepu32_ph): Likewise.
5311 (_mm512_mask_cvtepu32_ph): Likewise.
5312 (_mm512_maskz_cvtepu32_ph): Likewise.
5313 (_mm512_cvt_roundepu32_ph): Likewise.
5314 (_mm512_mask_cvt_roundepu32_ph): Likewise.
5315 (_mm512_maskz_cvt_roundepu32_ph): Likewise.
5316 (_mm512_cvtepi64_ph): Likewise.
5317 (_mm512_mask_cvtepi64_ph): Likewise.
5318 (_mm512_maskz_cvtepi64_ph): Likewise.
5319 (_mm512_cvt_roundepi64_ph): Likewise.
5320 (_mm512_mask_cvt_roundepi64_ph): Likewise.
5321 (_mm512_maskz_cvt_roundepi64_ph): Likewise.
5322 (_mm512_cvtepu64_ph): Likewise.
5323 (_mm512_mask_cvtepu64_ph): Likewise.
5324 (_mm512_maskz_cvtepu64_ph): Likewise.
5325 (_mm512_cvt_roundepu64_ph): Likewise.
5326 (_mm512_mask_cvt_roundepu64_ph): Likewise.
5327 (_mm512_maskz_cvt_roundepu64_ph): Likewise.
5328 (_mm512_cvtepi16_ph): Likewise.
5329 (_mm512_mask_cvtepi16_ph): Likewise.
5330 (_mm512_maskz_cvtepi16_ph): Likewise.
5331 (_mm512_cvt_roundepi16_ph): Likewise.
5332 (_mm512_mask_cvt_roundepi16_ph): Likewise.
5333 (_mm512_maskz_cvt_roundepi16_ph): Likewise.
5334 (_mm512_cvtepu16_ph): Likewise.
5335 (_mm512_mask_cvtepu16_ph): Likewise.
5336 (_mm512_maskz_cvtepu16_ph): Likewise.
5337 (_mm512_cvt_roundepu16_ph): Likewise.
5338 (_mm512_mask_cvt_roundepu16_ph): Likewise.
5339 (_mm512_maskz_cvt_roundepu16_ph): Likewise.
5340 * config/i386/avx512fp16vlintrin.h (_mm_cvtepi32_ph): New
5342 (_mm_mask_cvtepi32_ph): Likewise.
5343 (_mm_maskz_cvtepi32_ph): Likewise.
5344 (_mm256_cvtepi32_ph): Likewise.
5345 (_mm256_mask_cvtepi32_ph): Likewise.
5346 (_mm256_maskz_cvtepi32_ph): Likewise.
5347 (_mm_cvtepu32_ph): Likewise.
5348 (_mm_mask_cvtepu32_ph): Likewise.
5349 (_mm_maskz_cvtepu32_ph): Likewise.
5350 (_mm256_cvtepu32_ph): Likewise.
5351 (_mm256_mask_cvtepu32_ph): Likewise.
5352 (_mm256_maskz_cvtepu32_ph): Likewise.
5353 (_mm_cvtepi64_ph): Likewise.
5354 (_mm_mask_cvtepi64_ph): Likewise.
5355 (_mm_maskz_cvtepi64_ph): Likewise.
5356 (_mm256_cvtepi64_ph): Likewise.
5357 (_mm256_mask_cvtepi64_ph): Likewise.
5358 (_mm256_maskz_cvtepi64_ph): Likewise.
5359 (_mm_cvtepu64_ph): Likewise.
5360 (_mm_mask_cvtepu64_ph): Likewise.
5361 (_mm_maskz_cvtepu64_ph): Likewise.
5362 (_mm256_cvtepu64_ph): Likewise.
5363 (_mm256_mask_cvtepu64_ph): Likewise.
5364 (_mm256_maskz_cvtepu64_ph): Likewise.
5365 (_mm_cvtepi16_ph): Likewise.
5366 (_mm_mask_cvtepi16_ph): Likewise.
5367 (_mm_maskz_cvtepi16_ph): Likewise.
5368 (_mm256_cvtepi16_ph): Likewise.
5369 (_mm256_mask_cvtepi16_ph): Likewise.
5370 (_mm256_maskz_cvtepi16_ph): Likewise.
5371 (_mm_cvtepu16_ph): Likewise.
5372 (_mm_mask_cvtepu16_ph): Likewise.
5373 (_mm_maskz_cvtepu16_ph): Likewise.
5374 (_mm256_cvtepu16_ph): Likewise.
5375 (_mm256_mask_cvtepu16_ph): Likewise.
5376 (_mm256_maskz_cvtepu16_ph): Likewise.
5377 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5378 * config/i386/i386-builtin.def: Add corresponding new builtins.
5379 * config/i386/i386-expand.c
5380 (ix86_expand_args_builtin): Handle new builtin types.
5381 (ix86_expand_round_builtin): Ditto.
5382 * config/i386/i386-modes.def: Declare V2HF and V6HF.
5383 * config/i386/sse.md (VI2H_AVX512VL): New.
5385 (sseintvecmode): Add HF vector modes.
5386 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode><mask_name><round_name>):
5388 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
5389 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>): Ditto.
5390 (avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
5391 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask): Ditto.
5392 (*avx512fp16_vcvt<floatsuffix><sseintconvert>2ph_<mode>_mask_1): Ditto.
5393 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
5394 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di): Ditto.
5395 (avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
5396 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask): Ditto.
5397 (*avx512fp16_vcvt<floatsuffix>qq2ph_v2di_mask_1): Ditto.
5398 * config/i386/subst.md (round_qq2phsuff): New subst_attr.
5400 2021-09-16 liuhongt <hongtao.liu@intel.com>
5402 * config/i386/avx512fp16intrin.h (_mm512_cvtph_epi32):
5404 (_mm512_mask_cvtph_epi32): Likewise.
5405 (_mm512_maskz_cvtph_epi32): Likewise.
5406 (_mm512_cvt_roundph_epi32): Likewise.
5407 (_mm512_mask_cvt_roundph_epi32): Likewise.
5408 (_mm512_maskz_cvt_roundph_epi32): Likewise.
5409 (_mm512_cvtph_epu32): Likewise.
5410 (_mm512_mask_cvtph_epu32): Likewise.
5411 (_mm512_maskz_cvtph_epu32): Likewise.
5412 (_mm512_cvt_roundph_epu32): Likewise.
5413 (_mm512_mask_cvt_roundph_epu32): Likewise.
5414 (_mm512_maskz_cvt_roundph_epu32): Likewise.
5415 (_mm512_cvtph_epi64): Likewise.
5416 (_mm512_mask_cvtph_epi64): Likewise.
5417 (_mm512_maskz_cvtph_epi64): Likewise.
5418 (_mm512_cvt_roundph_epi64): Likewise.
5419 (_mm512_mask_cvt_roundph_epi64): Likewise.
5420 (_mm512_maskz_cvt_roundph_epi64): Likewise.
5421 (_mm512_cvtph_epu64): Likewise.
5422 (_mm512_mask_cvtph_epu64): Likewise.
5423 (_mm512_maskz_cvtph_epu64): Likewise.
5424 (_mm512_cvt_roundph_epu64): Likewise.
5425 (_mm512_mask_cvt_roundph_epu64): Likewise.
5426 (_mm512_maskz_cvt_roundph_epu64): Likewise.
5427 (_mm512_cvtph_epi16): Likewise.
5428 (_mm512_mask_cvtph_epi16): Likewise.
5429 (_mm512_maskz_cvtph_epi16): Likewise.
5430 (_mm512_cvt_roundph_epi16): Likewise.
5431 (_mm512_mask_cvt_roundph_epi16): Likewise.
5432 (_mm512_maskz_cvt_roundph_epi16): Likewise.
5433 (_mm512_cvtph_epu16): Likewise.
5434 (_mm512_mask_cvtph_epu16): Likewise.
5435 (_mm512_maskz_cvtph_epu16): Likewise.
5436 (_mm512_cvt_roundph_epu16): Likewise.
5437 (_mm512_mask_cvt_roundph_epu16): Likewise.
5438 (_mm512_maskz_cvt_roundph_epu16): Likewise.
5439 * config/i386/avx512fp16vlintrin.h (_mm_cvtph_epi32):
5441 (_mm_mask_cvtph_epi32): Likewise.
5442 (_mm_maskz_cvtph_epi32): Likewise.
5443 (_mm256_cvtph_epi32): Likewise.
5444 (_mm256_mask_cvtph_epi32): Likewise.
5445 (_mm256_maskz_cvtph_epi32): Likewise.
5446 (_mm_cvtph_epu32): Likewise.
5447 (_mm_mask_cvtph_epu32): Likewise.
5448 (_mm_maskz_cvtph_epu32): Likewise.
5449 (_mm256_cvtph_epu32): Likewise.
5450 (_mm256_mask_cvtph_epu32): Likewise.
5451 (_mm256_maskz_cvtph_epu32): Likewise.
5452 (_mm_cvtph_epi64): Likewise.
5453 (_mm_mask_cvtph_epi64): Likewise.
5454 (_mm_maskz_cvtph_epi64): Likewise.
5455 (_mm256_cvtph_epi64): Likewise.
5456 (_mm256_mask_cvtph_epi64): Likewise.
5457 (_mm256_maskz_cvtph_epi64): Likewise.
5458 (_mm_cvtph_epu64): Likewise.
5459 (_mm_mask_cvtph_epu64): Likewise.
5460 (_mm_maskz_cvtph_epu64): Likewise.
5461 (_mm256_cvtph_epu64): Likewise.
5462 (_mm256_mask_cvtph_epu64): Likewise.
5463 (_mm256_maskz_cvtph_epu64): Likewise.
5464 (_mm_cvtph_epi16): Likewise.
5465 (_mm_mask_cvtph_epi16): Likewise.
5466 (_mm_maskz_cvtph_epi16): Likewise.
5467 (_mm256_cvtph_epi16): Likewise.
5468 (_mm256_mask_cvtph_epi16): Likewise.
5469 (_mm256_maskz_cvtph_epi16): Likewise.
5470 (_mm_cvtph_epu16): Likewise.
5471 (_mm_mask_cvtph_epu16): Likewise.
5472 (_mm_maskz_cvtph_epu16): Likewise.
5473 (_mm256_cvtph_epu16): Likewise.
5474 (_mm256_mask_cvtph_epu16): Likewise.
5475 (_mm256_maskz_cvtph_epu16): Likewise.
5476 * config/i386/i386-builtin-types.def: Add new builtin types.
5477 * config/i386/i386-builtin.def: Add new builtins.
5478 * config/i386/i386-expand.c
5479 (ix86_expand_args_builtin): Handle new builtin types.
5480 (ix86_expand_round_builtin): Ditto.
5481 * config/i386/sse.md (sseintconvert): New.
5483 (UNSPEC_US_FIX_NOTRUNC): Ditto.
5484 (sseintconvertsignprefix): Ditto.
5485 (avx512fp16_vcvtph2<sseintconvertsignprefix><sseintconvert>_<mode><mask_name><round_name>):
5488 2021-09-16 liuhongt <hongtao.liu@intel.com>
5490 * config/i386/avx512fp16intrin.h: (_mm_cvtsi16_si128):
5492 (_mm_cvtsi128_si16): Likewise.
5493 (_mm_mask_load_sh): Likewise.
5494 (_mm_maskz_load_sh): Likewise.
5495 (_mm_mask_store_sh): Likewise.
5496 (_mm_move_sh): Likewise.
5497 (_mm_mask_move_sh): Likewise.
5498 (_mm_maskz_move_sh): Likewise.
5499 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5500 * config/i386/i386-builtin.def: Add corresponding new builtins.
5501 * config/i386/i386-expand.c
5502 (ix86_expand_special_args_builtin): Handle new builtin types.
5503 (ix86_expand_vector_init_one_nonzero): Adjust for FP16 target.
5504 * config/i386/sse.md (VI2F): New mode iterator.
5505 (vec_set<mode>_0): Use new mode iterator.
5506 (avx512f_mov<ssescalarmodelower>_mask): Adjust for HF vector mode.
5507 (avx512f_store<mode>_mask): Ditto.
5509 2021-09-16 Kewen Lin <linkw@linux.ibm.com>
5511 * config/rs6000/rs6000.opt (-mtoc-fusion): Remove.
5513 2021-09-15 David Edelsohn <dje.gcc@gmail.com>
5515 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
5516 Proceed if no symbol summary or the symbol alias flag is false.
5518 2021-09-15 Jakub Jelinek <jakub@redhat.com>
5522 * varasm.c (output_constructor_regular_field): Instead of assertion
5523 that array_size_for_constructor result is equal to size of
5524 TREE_TYPE (local->val) in bytes, assert that the type size is greater
5525 or equal to array_size_for_constructor result and use type size as
5528 2021-09-15 Martin Liska <mliska@suse.cz>
5531 * config/i386/vxworks.h: Use new macro TARGET_CPU_P.
5533 2021-09-15 Martin Liska <mliska@suse.cz>
5536 * config/rs6000/rs6000.c (rs6000_xcoff_encode_section_info):
5537 Check that we have a symbol summary for a symbol.
5539 2021-09-15 Richard Biener <rguenther@suse.de>
5542 * config/rs6000/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
5543 to inherit from elfos.h
5545 2021-09-15 liuhongt <hongtao.liu@intel.com>
5548 * config/i386/i386-expand.c
5549 (ix86_expand_vector_init_interleave): Use puncklwd to pack 2
5551 (ix86_expand_vector_set): Use blendw instead of pinsrw.
5552 * config/i386/i386.c (ix86_can_change_mode_class): Adjust for
5553 AVX512FP16 which supports 16bit vector load.
5554 * config/i386/sse.md (avx512bw_interleave_highv32hi<mask_name>):
5556 (avx512bw_interleave_high<mode><mask_name>): .. this, and
5557 extend to V32HFmode.
5558 (avx2_interleave_highv16hi<mask_name>): Rename to ..
5559 (avx2_interleave_high<mode><mask_name>): .. this, and extend
5561 (vec_interleave_highv8hi<mask_name>): Rename to ..
5562 (vec_interleave_high<mode><mask_name>): .. this, and extend to V8HFmode.
5563 (<mask_codefor>avx512bw_interleave_lowv32hi<mask_name>):
5565 (<mask_codefor>avx512bw_interleave_low<mode><mask_name>):
5566 this, and extend to V32HFmode.
5567 (avx2_interleave_lowv16hi<mask_name>): Rename to ..
5568 (avx2_interleave_low<mode><mask_name>): .. this, and extend to V16HFmode.
5569 (vec_interleave_lowv8hi<mask_name>): Rename to ..
5570 (vec_interleave_low<mode><mask_name>): .. this, and extend to V8HFmode.
5571 (sse4_1_pblendw): Rename to ..
5572 (sse4_1_pblend<blendsuf>): .. this, and extend to V8HFmode.
5573 (avx2_pblendph): New define_expand.
5574 (<sse2p4_1>_pinsr<ssemodesuffix>): Refactor, use
5575 sseintmodesuffix instead of ssemodesuffix.
5576 (blendsuf): New mode attr.
5578 2021-09-15 Richard Biener <rguenther@suse.de>
5580 * tree-vectorizer.h (dr_misalignment): Move out of line.
5581 (dr_target_alignment): New.
5582 (DR_TARGET_ALIGNMENT): Wrap dr_target_alignment.
5583 (set_dr_target_alignment): New.
5584 (SET_DR_TARGET_ALIGNMENT): Wrap set_dr_target_alignment.
5585 * tree-vect-data-refs.c (dr_misalignment): Compute and
5586 return the group members misalignment.
5587 (vect_compute_data_ref_alignment): Use SET_DR_TARGET_ALIGNMENT.
5588 (vect_analyze_data_refs_alignment): Compute alignment only
5589 for the first element of a DR group.
5590 (vect_slp_analyze_node_alignment): Likewise.
5592 2021-09-15 Hongyu Wang <hongyu.wang@intel.com>
5594 * config/i386/avx512fp16intrin.h: Adjust all builtin calls.
5595 * config/i386/avx512fp16vlintrin.h: Likewise.
5596 * config/i386/i386-builtin.def: Adjust builtin name and
5597 enumeration to match AVX512F style.
5599 2021-09-15 Richard Biener <rguenther@suse.de>
5601 PR tree-optimization/102318
5602 * tree-vect-loop.c (vect_transform_cycle_phi): Revert
5603 previous change and do the mode conversion separately from
5604 the sign conversion.
5606 2021-09-15 Hongtao Liu <hongtao.liu@intel.com>
5607 Peter Cordes <peter@cordes.ca>
5610 * config/i386/sse.md (extract_suf): Add V8SF/V8SI/V4DF/V4DI.
5611 (*vec_extract<mode><ssescalarmodelower>_valign): Output
5612 vextract{i,f}{32x4,64x2} instruction when byte_offset % 16 ==
5615 2021-09-15 Richard Biener <rguenther@suse.de>
5617 * config.gcc: Remove vax-*-openbsd* configuration.
5619 2021-09-15 Richard Biener <rguenther@suse.de>
5621 * config.gcc: Remove m68k-openbsd.
5623 2021-09-15 Max Filippov <jcmvbkbc@gmail.com>
5626 * config/xtensa/t-xtensa (TM_H): Add include/xtensa-config.h.
5628 2021-09-14 Peter Bergner <bergner@linux.ibm.com>
5630 * config/rs6000/mma.md (unspec): Delete UNSPEC_MMA_XXSETACCZ.
5631 (unspecv): Add UNSPECV_MMA_XXSETACCZ.
5632 (*mma_xxsetaccz): Delete.
5633 (mma_xxsetaccz): Change to define_insn. Remove operand 1.
5634 Use UNSPECV_MMA_XXSETACCZ. Update comment.
5635 * config/rs6000/rs6000.c (rs6000_rtx_costs): Use UNSPECV_MMA_XXSETACCZ.
5637 2021-09-14 Iain Sandoe <iain@sandoe.co.uk>
5639 * Makefile.in: Remove variables related to applying no-PIE
5640 to the exes on $build.
5641 * configure: Regenerate.
5642 * configure.ac: Remove configuration related to applying
5643 no-PIE to the exes on $build.
5645 2021-09-14 Claudiu Zissulescu <claziss@synopsys.com>
5647 * config/arc/arc.md (doloop_end): Add missing mode.
5648 (loop_end): Likewise.
5650 2021-09-14 Jakub Jelinek <jakub@redhat.com>
5652 * gimplify.c (goa_stabilize_expr): Add depth argument, propagate
5653 it to recursive calls, for depth above 7 just gimplify or return.
5654 Perform a test even for MODIFY_EXPR, ADDR_EXPR, COMPOUND_EXPR with
5655 __builtin_clear_padding and TARGET_EXPR.
5656 (gimplify_omp_atomic): Adjust goa_stabilize_expr callers.
5658 2021-09-14 liuhongt <hongtao.liu@intel.com>
5660 * config/i386/avx512fp16intrin.h (_mm_fpclass_sh_mask):
5662 (_mm_mask_fpclass_sh_mask): Likewise.
5663 (_mm512_mask_fpclass_ph_mask): Likewise.
5664 (_mm512_fpclass_ph_mask): Likewise.
5665 (_mm_getexp_sh): Likewise.
5666 (_mm_mask_getexp_sh): Likewise.
5667 (_mm_maskz_getexp_sh): Likewise.
5668 (_mm512_getexp_ph): Likewise.
5669 (_mm512_mask_getexp_ph): Likewise.
5670 (_mm512_maskz_getexp_ph): Likewise.
5671 (_mm_getexp_round_sh): Likewise.
5672 (_mm_mask_getexp_round_sh): Likewise.
5673 (_mm_maskz_getexp_round_sh): Likewise.
5674 (_mm512_getexp_round_ph): Likewise.
5675 (_mm512_mask_getexp_round_ph): Likewise.
5676 (_mm512_maskz_getexp_round_ph): Likewise.
5677 (_mm_getmant_sh): Likewise.
5678 (_mm_mask_getmant_sh): Likewise.
5679 (_mm_maskz_getmant_sh): Likewise.
5680 (_mm512_getmant_ph): Likewise.
5681 (_mm512_mask_getmant_ph): Likewise.
5682 (_mm512_maskz_getmant_ph): Likewise.
5683 (_mm_getmant_round_sh): Likewise.
5684 (_mm_mask_getmant_round_sh): Likewise.
5685 (_mm_maskz_getmant_round_sh): Likewise.
5686 (_mm512_getmant_round_ph): Likewise.
5687 (_mm512_mask_getmant_round_ph): Likewise.
5688 (_mm512_maskz_getmant_round_ph): Likewise.
5689 * config/i386/avx512fp16vlintrin.h (_mm_mask_fpclass_ph_mask):
5691 (_mm_fpclass_ph_mask): Likewise.
5692 (_mm256_mask_fpclass_ph_mask): Likewise.
5693 (_mm256_fpclass_ph_mask): Likewise.
5694 (_mm256_getexp_ph): Likewise.
5695 (_mm256_mask_getexp_ph): Likewise.
5696 (_mm256_maskz_getexp_ph): Likewise.
5697 (_mm_getexp_ph): Likewise.
5698 (_mm_mask_getexp_ph): Likewise.
5699 (_mm_maskz_getexp_ph): Likewise.
5700 (_mm256_getmant_ph): Likewise.
5701 (_mm256_mask_getmant_ph): Likewise.
5702 (_mm256_maskz_getmant_ph): Likewise.
5703 (_mm_getmant_ph): Likewise.
5704 (_mm_mask_getmant_ph): Likewise.
5705 (_mm_maskz_getmant_ph): Likewise.
5706 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5707 * config/i386/i386-builtin.def: Add corresponding new builtins.
5708 * config/i386/i386-expand.c
5709 (ix86_expand_args_builtin): Handle new builtin types.
5710 (ix86_expand_round_builtin): Ditto.
5711 * config/i386/sse.md (vecmemsuffix): Add HF vector modes.
5712 (<avx512>_getexp<mode><mask_name><round_saeonly_name>): Adjust
5713 to support HF vector modes.
5714 (avx512f_sgetexp<mode><mask_scalar_name><round_saeonly_scalar_name):
5716 (avx512dq_fpclass<mode><mask_scalar_merge_name>): Ditto.
5717 (avx512dq_vmfpclass<mode><mask_scalar_merge_name>): Ditto.
5718 (<avx512>_getmant<mode><mask_name><round_saeonly_name>): Ditto.
5719 (avx512f_vgetmant<mode><mask_scalar_name><round_saeonly_scalar_name>):
5722 2021-09-14 liuhongt <hongtao.liu@intel.com>
5724 * config/i386/avx512fp16intrin.h (_mm512_reduce_ph):
5726 (_mm512_mask_reduce_ph): Likewise.
5727 (_mm512_maskz_reduce_ph): Likewise.
5728 (_mm512_reduce_round_ph): Likewise.
5729 (_mm512_mask_reduce_round_ph): Likewise.
5730 (_mm512_maskz_reduce_round_ph): Likewise.
5731 (_mm_reduce_sh): Likewise.
5732 (_mm_mask_reduce_sh): Likewise.
5733 (_mm_maskz_reduce_sh): Likewise.
5734 (_mm_reduce_round_sh): Likewise.
5735 (_mm_mask_reduce_round_sh): Likewise.
5736 (_mm_maskz_reduce_round_sh): Likewise.
5737 (_mm512_roundscale_ph): Likewise.
5738 (_mm512_mask_roundscale_ph): Likewise.
5739 (_mm512_maskz_roundscale_ph): Likewise.
5740 (_mm512_roundscale_round_ph): Likewise.
5741 (_mm512_mask_roundscale_round_ph): Likewise.
5742 (_mm512_maskz_roundscale_round_ph): Likewise.
5743 (_mm_roundscale_sh): Likewise.
5744 (_mm_mask_roundscale_sh): Likewise.
5745 (_mm_maskz_roundscale_sh): Likewise.
5746 (_mm_roundscale_round_sh): Likewise.
5747 (_mm_mask_roundscale_round_sh): Likewise.
5748 (_mm_maskz_roundscale_round_sh): Likewise.
5749 * config/i386/avx512fp16vlintrin.h: (_mm_reduce_ph):
5751 (_mm_mask_reduce_ph): Likewise.
5752 (_mm_maskz_reduce_ph): Likewise.
5753 (_mm256_reduce_ph): Likewise.
5754 (_mm256_mask_reduce_ph): Likewise.
5755 (_mm256_maskz_reduce_ph): Likewise.
5756 (_mm_roundscale_ph): Likewise.
5757 (_mm_mask_roundscale_ph): Likewise.
5758 (_mm_maskz_roundscale_ph): Likewise.
5759 (_mm256_roundscale_ph): Likewise.
5760 (_mm256_mask_roundscale_ph): Likewise.
5761 (_mm256_maskz_roundscale_ph): Likewise.
5762 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5763 * config/i386/i386-builtin.def: Add corresponding new builtins.
5764 * config/i386/i386-expand.c
5765 (ix86_expand_args_builtin): Handle new builtin types.
5766 (ix86_expand_round_builtin): Ditto.
5767 * config/i386/sse.md (<mask_codefor>reducep<mode><mask_name>):
5769 (<mask_codefor>reducep<mode><mask_name><round_saeonly_name>):
5770 ... this, and adjust for round operands.
5771 (reduces<mode><mask_scalar_name>): Likewise, with ...
5772 (reduces<mode><mask_scalar_name><round_saeonly_scalar_name):
5774 (<avx512>_rndscale<mode><mask_name><round_saeonly_name>):
5775 Adjust for HF vector modes.
5776 (avx512f_rndscale<mode><mask_scalar_name><round_saeonly_scalar_name>):
5778 (*avx512f_rndscale<mode><round_saeonly_name>): Ditto.
5780 2021-09-14 liuhongt <hongtao.liu@intel.com>
5782 * config/i386/avx512fp16intrin.h: (_mm512_rcp_ph):
5784 (_mm512_mask_rcp_ph): Likewise.
5785 (_mm512_maskz_rcp_ph): Likewise.
5786 (_mm_rcp_sh): Likewise.
5787 (_mm_mask_rcp_sh): Likewise.
5788 (_mm_maskz_rcp_sh): Likewise.
5789 (_mm512_scalef_ph): Likewise.
5790 (_mm512_mask_scalef_ph): Likewise.
5791 (_mm512_maskz_scalef_ph): Likewise.
5792 (_mm512_scalef_round_ph): Likewise.
5793 (_mm512_mask_scalef_round_ph): Likewise.
5794 (_mm512_maskz_scalef_round_ph): Likewise.
5795 (_mm_scalef_sh): Likewise.
5796 (_mm_mask_scalef_sh): Likewise.
5797 (_mm_maskz_scalef_sh): Likewise.
5798 (_mm_scalef_round_sh): Likewise.
5799 (_mm_mask_scalef_round_sh): Likewise.
5800 (_mm_maskz_scalef_round_sh): Likewise.
5801 * config/i386/avx512fp16vlintrin.h (_mm_rcp_ph):
5803 (_mm256_rcp_ph): Likewise.
5804 (_mm_mask_rcp_ph): Likewise.
5805 (_mm256_mask_rcp_ph): Likewise.
5806 (_mm_maskz_rcp_ph): Likewise.
5807 (_mm256_maskz_rcp_ph): Likewise.
5808 (_mm_scalef_ph): Likewise.
5809 (_mm256_scalef_ph): Likewise.
5810 (_mm_mask_scalef_ph): Likewise.
5811 (_mm256_mask_scalef_ph): Likewise.
5812 (_mm_maskz_scalef_ph): Likewise.
5813 (_mm256_maskz_scalef_ph): Likewise.
5814 * config/i386/i386-builtin.def: Add new builtins.
5815 * config/i386/sse.md (VFH_AVX512VL): New.
5816 (avx512fp16_rcp<mode>2<mask_name>): Ditto.
5817 (avx512fp16_vmrcpv8hf2<mask_scalar_name>): Ditto.
5818 (avx512f_vmscalef<mode><mask_scalar_name><round_scalar_name>):
5819 Adjust to support HF vector modes.
5820 (<avx512>_scalef<mode><mask_name><round_name>): Ditto.
5822 2021-09-14 liuhongt <hongtao.liu@intel.com>
5824 * config/i386/avx512fp16intrin.h: (_mm512_sqrt_ph):
5826 (_mm512_mask_sqrt_ph): Likewise.
5827 (_mm512_maskz_sqrt_ph): Likewise.
5828 (_mm512_sqrt_round_ph): Likewise.
5829 (_mm512_mask_sqrt_round_ph): Likewise.
5830 (_mm512_maskz_sqrt_round_ph): Likewise.
5831 (_mm512_rsqrt_ph): Likewise.
5832 (_mm512_mask_rsqrt_ph): Likewise.
5833 (_mm512_maskz_rsqrt_ph): Likewise.
5834 (_mm_rsqrt_sh): Likewise.
5835 (_mm_mask_rsqrt_sh): Likewise.
5836 (_mm_maskz_rsqrt_sh): Likewise.
5837 (_mm_sqrt_sh): Likewise.
5838 (_mm_mask_sqrt_sh): Likewise.
5839 (_mm_maskz_sqrt_sh): Likewise.
5840 (_mm_sqrt_round_sh): Likewise.
5841 (_mm_mask_sqrt_round_sh): Likewise.
5842 (_mm_maskz_sqrt_round_sh): Likewise.
5843 * config/i386/avx512fp16vlintrin.h (_mm_sqrt_ph): New intrinsic.
5844 (_mm256_sqrt_ph): Likewise.
5845 (_mm_mask_sqrt_ph): Likewise.
5846 (_mm256_mask_sqrt_ph): Likewise.
5847 (_mm_maskz_sqrt_ph): Likewise.
5848 (_mm256_maskz_sqrt_ph): Likewise.
5849 (_mm_rsqrt_ph): Likewise.
5850 (_mm256_rsqrt_ph): Likewise.
5851 (_mm_mask_rsqrt_ph): Likewise.
5852 (_mm256_mask_rsqrt_ph): Likewise.
5853 (_mm_maskz_rsqrt_ph): Likewise.
5854 (_mm256_maskz_rsqrt_ph): Likewise.
5855 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
5856 * config/i386/i386-builtin.def: Add corresponding new builtins.
5857 * config/i386/i386-expand.c
5858 (ix86_expand_args_builtin): Handle new builtins.
5859 (ix86_expand_round_builtin): Ditto.
5860 * config/i386/sse.md (VF_AVX512FP16VL): New.
5861 (sqrt<mode>2): Adjust for HF vector modes.
5862 (<sse>_sqrt<mode>2<mask_name><round_name>): Likewise.
5863 (<sse>_vmsqrt<mode>2<mask_scalar_name><round_scalar_name>):
5865 (<sse>_rsqrt<mode>2<mask_name>): New.
5866 (avx512fp16_vmrsqrtv8hf2<mask_scalar_name>): Likewise.
5868 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
5871 * diagnostic-spec.c (warning_suppressed_at, copy_warning): Handle
5872 'RESERVED_LOCATION_P' locations.
5873 * warning-control.cc (get_nowarn_spec, suppress_warning)
5874 (copy_warning): Likewise.
5876 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
5878 * diagnostic-spec.h (typedef xint_hash_t): Use 'location_t' instead of...
5879 (typedef key_type_t): ... this. Remove.
5880 (nowarn_map): Document.
5881 * diagnostic-spec.c (nowarn_map): Likewise.
5882 * warning-control.cc (convert_to_key): Evolve functions into...
5883 (get_location): ... these. Adjust all users.
5885 2021-09-13 Thomas Schwinge <thomas@codesourcery.com>
5887 * warning-control.cc (copy_warning): Remove 'nowarn_map' setup.
5889 2021-09-13 Jason Merrill <jason@redhat.com>
5891 * params.opt: Add destructive-interference-size and
5892 constructive-interference-size.
5893 * doc/invoke.texi: Document them.
5894 * config/aarch64/aarch64.c (aarch64_override_options_internal):
5896 * config/arm/arm.c (arm_option_override): Set them.
5897 * config/i386/i386-options.c (ix86_option_override_internal):
5900 2021-09-13 Martin Liska <mliska@suse.cz>
5901 H.J. Lu <hjl.tools@gmail.com>
5904 * common/config/i386/cpuinfo.h (cpu_indicator_init): Add support
5905 for x86-64 micro levels for __builtin_cpu_supports.
5906 * common/config/i386/i386-cpuinfo.h (enum feature_priority):
5907 Add priorities for the micro-arch levels.
5908 (enum processor_features): Add new features.
5909 * common/config/i386/i386-isas.h: Add micro-arch features.
5910 * config/i386/i386-builtins.c (get_builtin_code_for_version):
5911 Support the micro-arch levels by callsing
5912 __builtin_cpu_supports.
5913 * doc/extend.texi: Document that the levels are support by
5914 __builtin_cpu_supports.
5916 2021-09-13 Andrew Pinski <apinski@marvell.com>
5919 * config/aarch64/aarch64-builtins.c (aarch64_fold_builtin_lane_check):
5921 (aarch64_general_fold_builtin): Handle AARCH64_SIMD_BUILTIN_LANE_CHECK.
5922 (aarch64_general_gimple_fold_builtin): Likewise.
5924 2021-09-13 Andrew Pinski <apinski@marvell.com>
5926 * config.gcc: Add m32r-*-linux* and m32rle-*-linux*
5927 to the Unsupported targets list.
5928 Remove support for m32r-*-linux* and m32rle-*-linux*.
5929 * config/m32r/linux.h: Removed.
5930 * config/m32r/t-linux: Removed.
5932 2021-09-13 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
5935 * config/aarch64/aarch64.c (aarch64_classify_address): Don't allow
5936 register index for SVE predicate modes.
5938 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
5940 * tree-ssa-threadbackward.c
5941 (back_threader_profitability::profitable_path_p): Remove FSM
5943 (back_threader_registry::register_path): Same.
5944 * tree-ssa-threadedge.c
5945 (jump_threader::simplify_control_stmt_condition): Same.
5946 * tree-ssa-threadupdate.c (jt_path_registry::jt_path_registry):
5947 Add backedge_threads argument.
5948 (fwd_jt_path_registry::fwd_jt_path_registry): Pass
5949 backedge_threads argument.
5950 (back_jt_path_registry::back_jt_path_registry): Same.
5951 (dump_jump_thread_path): Adjust for FSM removal.
5952 (back_jt_path_registry::rewire_first_differing_edge): Same.
5953 (back_jt_path_registry::adjust_paths_after_duplication): Same.
5954 (back_jt_path_registry::update_cfg): Same.
5955 (jt_path_registry::register_jump_thread): Same.
5956 * tree-ssa-threadupdate.h (enum jump_thread_edge_type): Remove
5958 (class back_jt_path_registry): Add backedge_threads to
5961 2021-09-13 Martin Liska <mliska@suse.cz>
5964 * asan.h (sanitize_coverage_p): Handle when fn == NULL.
5966 2021-09-13 H.J. Lu <hjl.tools@gmail.com>
5969 * config/i386/i386.h (TARGET_AVX256_MOVE_BY_PIECES): New.
5970 (TARGET_AVX256_STORE_BY_PIECES): Likewise.
5971 (MOVE_MAX): Check TARGET_AVX256_MOVE_BY_PIECES and
5972 TARGET_AVX256_STORE_BY_PIECES instead of
5973 TARGET_AVX256_SPLIT_UNALIGNED_LOAD and
5974 TARGET_AVX256_SPLIT_UNALIGNED_STORE.
5975 (STORE_MAX_PIECES): Check TARGET_AVX256_STORE_BY_PIECES instead
5976 of TARGET_AVX256_SPLIT_UNALIGNED_STORE.
5977 * config/i386/x86-tune.def (X86_TUNE_AVX256_MOVE_BY_PIECES): New.
5978 (X86_TUNE_AVX256_STORE_BY_PIECES): Likewise.
5980 2021-09-13 liuhongt <hongtao.liu@intel.com>
5983 * expmed.c (extract_bit_field_using_extv): Use
5984 gen_lowpart_if_possible instead of gen_lowpart to avoid ICE.
5986 2021-09-13 Aldy Hernandez <aldyh@redhat.com>
5988 * Makefile.in (OBJS): Add value-pointer-equiv.o.
5989 * gimple-ssa-evrp.c (class ssa_equiv_stack): Move to
5990 value-pointer-equiv.*.
5991 (ssa_equiv_stack::ssa_equiv_stack): Same.
5992 (ssa_equiv_stack::enter): Same.
5993 (ssa_equiv_stack::leave): Same.
5994 (ssa_equiv_stack::push_replacement): Same.
5995 (ssa_equiv_stack::get_replacement): Same.
5996 (is_pointer_ssa): Same.
5997 (class pointer_equiv_analyzer): Same.
5998 (pointer_equiv_analyzer::pointer_equiv_analyzer): Same.
5999 (pointer_equiv_analyzer::~pointer_equiv_analyzer): Same.
6000 (pointer_equiv_analyzer::set_global_equiv): Same.
6001 (pointer_equiv_analyzer::set_cond_equiv): Same.
6002 (pointer_equiv_analyzer::get_equiv): Same.
6003 (pointer_equiv_analyzer::enter): Same.
6004 (pointer_equiv_analyzer::leave): Same.
6005 (pointer_equiv_analyzer::get_equiv_expr): Same.
6006 (pta_valueize): Same.
6007 (pointer_equiv_analyzer::visit_stmt): Same.
6008 (pointer_equiv_analyzer::visit_edge): Same.
6009 (hybrid_folder::value_of_expr): Same.
6010 (hybrid_folder::value_on_edge): Same.
6011 * value-pointer-equiv.cc: New file.
6012 * value-pointer-equiv.h: New file.
6014 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
6017 * gimple-fold.c (gimple_fold_builtin_memory_op): Allow folding
6018 memcpy if the size is not more than MOVE_MAX * MOVE_RATIO.
6020 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
6023 * config/arm/arm.md (movmisaligndi): New define_expand.
6024 * config/arm/vec-common.md (movmisalign<mode>): Iterate over VDQ mode.
6026 2021-09-13 Richard Earnshaw <rearnsha@arm.com>
6029 * emit-rtl.c (gen_highpart): Use adjust_address to handle
6030 MEM rather than calling simplify_gen_subreg.
6032 2021-09-13 Jan-Benedict Glaw <jbglaw@ług-owl.de>
6034 * config/alpha/vms.h (INIT_CUMULATIVE_ARGS): Wrap multi-statment
6035 define into a block.
6037 2021-09-13 Richard Biener <rguenther@suse.de>
6039 * config/darwin.h (DARWIN_PREFER_DWARF): Do not define.
6040 * config/i386/darwin.h (PREFERRED_DEBUGGING_TYPE): Do not
6041 change based on DARWIN_PREFER_DWARF not being defined.
6043 2021-09-13 Richard Biener <rguenther@suse.de>
6045 * config/i386/lynx.h: Remove undef of PREFERRED_DEBUGGING_TYPE
6046 to inherit from elfos.h
6048 2021-09-13 Richard Biener <rguenther@suse.de>
6050 * config.gcc: Add cr16-*-* to the list of obsoleted targets.
6052 2021-09-13 Richard Biener <rguenther@suse.de>
6054 * config/avr/elf.h (PREFERRED_DEBUGGING_TYPE): Remove
6055 override, pick up DWARF2_DEBUG define from elfos.h
6057 2021-09-13 Richard Biener <rguenther@suse.de>
6059 * config/rx/rx.h (PREFERRED_DEBUGGING_TYPE): Always define to
6062 2021-09-13 Richard Biener <rguenther@suse.de>
6064 * config/alpha/vms.h (PREFERRED_DEBUGGING_TYPE): Define to
6067 2021-09-13 Richard Biener <rguenther@suse.de>
6069 * config/i386/cygming.h: Always default to DWARF2 debugging.
6070 Do not define DBX_DEBUGGING_INFO, that's done via dbxcoff.h
6072 * doc/install.texi: Document binutils 2.16 as minimum
6073 requirement for mingw.
6075 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
6077 * config/rs6000/rs6000.c (struct rs6000_cost_data): New members
6078 nstmts, nloads and extra_ctor_cost.
6079 (rs6000_density_test): Add load density related heuristics. Do
6080 extra costing on vector construction statements if need.
6081 (rs6000_init_cost): Init new members.
6082 (rs6000_update_target_cost_per_stmt): New function.
6083 (rs6000_add_stmt_cost): Factor vect_nonmem hunk out to function
6084 rs6000_update_target_cost_per_stmt and call it.
6086 2021-09-13 Kewen Lin <linkw@linux.ibm.com>
6088 * config/rs6000/rs6000.c (struct rs6000_cost_data): Remove typedef.
6089 (rs6000_init_cost): Adjust.
6091 2021-09-13 liuhongt <hongtao.liu@intel.com>
6093 * config/i386/i386.md: (UNSPEC_COPYSIGN): Remove.
6094 (UNSPEC_XORSIGN): Ditto.
6096 2021-09-12 Roger Sayle <roger@nextmovesoftware.com>
6098 * expr.c (convert_move): Preserve SUBREG_PROMOTED_VAR_P when
6099 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
6102 2021-09-11 Aldy Hernandez <aldyh@redhat.com>
6104 * tree-ssa-threadbackward.c (class back_threader_registry): Use
6105 back_jt_path_registry.
6106 * tree-ssa-threadedge.c (jump_threader::jump_threader): Use
6107 fwd_jt_path_registry.
6108 * tree-ssa-threadedge.h (class jump_threader): Same..
6109 * tree-ssa-threadupdate.c
6110 (jump_thread_path_registry::jump_thread_path_registry): Rename...
6111 (jt_path_registry::jt_path_registry): ...to this.
6112 (jump_thread_path_registry::~jump_thread_path_registry): Rename...
6113 (jt_path_registry::~jt_path_registry): ...this.
6114 (fwd_jt_path_registry::fwd_jt_path_registry): New.
6115 (fwd_jt_path_registry::~fwd_jt_path_registry): New.
6116 (jump_thread_path_registry::allocate_thread_edge): Rename...
6117 (jt_path_registry::allocate_thread_edge): ...to this.
6118 (jump_thread_path_registry::allocate_thread_path): Rename...
6119 (jt_path_registry::allocate_thread_path): ...to this.
6120 (jump_thread_path_registry::lookup_redirection_data): Rename...
6121 (fwd_jt_path_registry::lookup_redirection_data): ...to this.
6122 (jump_thread_path_registry::thread_block_1): Rename...
6123 (fwd_jt_path_registry::thread_block_1): ...to this.
6124 (jump_thread_path_registry::thread_block): Rename...
6125 (fwd_jt_path_registry::thread_block): ...to this.
6126 (jt_path_registry::thread_through_loop_header): Rename...
6127 (fwd_jt_path_registry::thread_through_loop_header): ...to this.
6128 (jump_thread_path_registry::mark_threaded_blocks): Rename...
6129 (fwd_jt_path_registry::mark_threaded_blocks): ...to this.
6130 (jump_thread_path_registry::debug_path): Rename...
6131 (jt_path_registry::debug_path): ...to this.
6132 (jump_thread_path_registry::dump): Rename...
6133 (jt_path_registry::debug): ...to this.
6134 (jump_thread_path_registry::rewire_first_differing_edge): Rename...
6135 (back_jt_path_registry::rewire_first_differing_edge): ...to this.
6136 (jump_thread_path_registry::adjust_paths_after_duplication): Rename...
6137 (back_jt_path_registry::adjust_paths_after_duplication): ...to this.
6138 (jump_thread_path_registry::duplicate_thread_path): Rename...
6139 (back_jt_path_registry::duplicate_thread_path): ...to this. Also,
6140 drop ill-formed candidates.
6141 (jump_thread_path_registry::remove_jump_threads_including): Rename...
6142 (fwd_jt_path_registry::remove_jump_threads_including): ...to this.
6143 (jt_path_registry::thread_through_all_blocks): New.
6144 (back_jt_path_registry::update_cfg): New.
6145 (fwd_jt_path_registry::update_cfg): New.
6146 (jump_thread_path_registry::register_jump_thread): Rename...
6147 (jt_path_registry::register_jump_thread): ...to this.
6148 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
6150 (class jt_path_registry): ...here.
6151 (class fwd_jt_path_registry): New.
6152 (class back_jt_path_registry): New.
6154 2021-09-10 liuhongt <hongtao.liu@intel.com>
6157 2021-09-01 liuhongt <hongtao.liu@intel.com>
6159 * emit-rtl.c (validate_subreg): Get rid of all float-int
6162 2021-09-10 Jakub Jelinek <jakub@redhat.com>
6164 * tree-core.h (enum omp_memory_order): Add OMP_MEMORY_ORDER_MASK,
6165 OMP_FAIL_MEMORY_ORDER_UNSPECIFIED, OMP_FAIL_MEMORY_ORDER_RELAXED,
6166 OMP_FAIL_MEMORY_ORDER_ACQUIRE, OMP_FAIL_MEMORY_ORDER_RELEASE,
6167 OMP_FAIL_MEMORY_ORDER_ACQ_REL, OMP_FAIL_MEMORY_ORDER_SEQ_CST and
6168 OMP_FAIL_MEMORY_ORDER_MASK enumerators.
6169 (OMP_FAIL_MEMORY_ORDER_SHIFT): Define.
6170 * gimple-pretty-print.c (dump_gimple_omp_atomic_load,
6171 dump_gimple_omp_atomic_store): Print [weak] for weak atomic
6173 * gimple.h (enum gf_mask): Change GF_OMP_ATOMIC_MEMORY_ORDER
6174 to 6-bit mask, adjust GF_OMP_ATOMIC_NEED_VALUE value and add
6176 (gimple_omp_atomic_weak_p, gimple_omp_atomic_set_weak): New inline
6178 * tree.h (OMP_ATOMIC_WEAK): Define.
6179 * tree-pretty-print.c (dump_omp_atomic_memory_order): Adjust for
6180 fail memory order being encoded in the same enum and also print
6181 fail clause if present.
6182 (dump_generic_node): Print weak clause if OMP_ATOMIC_WEAK.
6183 * gimplify.c (goa_stabilize_expr): Add target_expr and rhs arguments,
6184 handle pre_p == NULL case as a test mode that only returns value
6185 but doesn't change gimplify nor change anything otherwise, adjust
6186 recursive calls, add MODIFY_EXPR, ADDR_EXPR, COND_EXPR, TARGET_EXPR
6187 and CALL_EXPR handling, adjust COMPOUND_EXPR handling for
6188 __builtin_clear_padding calls, for !rhs gimplify as lvalue rather
6190 (gimplify_omp_atomic): Adjust goa_stabilize_expr caller. Handle
6191 COND_EXPR rhs. Set weak flag on gimple load/store for
6193 * omp-expand.c (omp_memory_order_to_fail_memmodel): New function.
6194 (omp_memory_order_to_memmodel): Adjust for fail clause encoded
6196 (expand_omp_atomic_cas): New function.
6197 (expand_omp_atomic_pipeline): Use omp_memory_order_to_fail_memmodel
6199 (expand_omp_atomic): Attempt to optimize atomic compare and exchange
6200 using expand_omp_atomic_cas.
6202 2021-09-10 Aldy Hernandez <aldyh@redhat.com>
6203 Michael Matz <matz@suse.de>
6205 * tree-pass.h (PROP_loop_opts_done): New.
6206 * gimple-range-path.cc (path_range_query::internal_range_of_expr):
6207 Intersect with global range.
6208 * tree-ssa-loop.c (tree_ssa_loop_done): Set PROP_loop_opts_done.
6209 * tree-ssa-threadbackward.c
6210 (back_threader_profitability::profitable_path_p): Disable
6211 threading through latches until after loop optimizations have run.
6213 2021-09-10 David Faust <david.faust@oracle.com>
6215 * doc/invoke.texi: Document BPF -mcpu, -mjmpext, -mjmp32 and -malu32
6218 2021-09-10 David Faust <david.faust@oracle.com>
6220 * config/bpf/bpf-opts.h (bpf_isa_version): New enum.
6221 * config/bpf/bpf-protos.h (bpf_expand_cbranch): New.
6222 * config/bpf/bpf.c (bpf_option_override): Handle -mcpu option.
6223 (bpf_expand_cbranch): New function.
6224 * config/bpf/bpf.md (AM mode iterator): Conditionalize support for SI
6226 (zero_extendsidi2): Only use mov32 instruction if it is available.
6227 (SIM mode iterator): Conditionalize support for SI mode.
6228 (JM mode iterator): New.
6229 (cbranchdi4): Update name, use new JM iterator. Use bpf_expand_cbranch.
6230 (*branch_on_di): Update name, use new JM iterator.
6231 * config/bpf/bpf.opt: (mjmpext): New option.
6235 (bpf_isa): New enum.
6237 2021-09-10 David Faust <david.faust@oracle.com>
6239 * config/bpf/bpf.md (zero_extendhidi2): Add new output template
6240 for register-to-register extensions.
6241 (zero_extendqidi2): Likewise.
6243 2021-09-10 Richard Biener <rguenther@suse.de>
6245 PR middle-end/102273
6246 * internal-fn.c (expand_DEFERRED_INIT): Always expand non-SSA vars.
6248 2021-09-10 Richard Biener <rguenther@suse.de>
6250 PR middle-end/102269
6251 * gimplify.c (is_var_need_auto_init): Empty types do not need
6254 2021-09-10 Richard Biener <rguenther@suse.de>
6256 * configure.ac (--with-stabs): Remove.
6257 * configure: Regenerate.
6258 * doc/install.texi: Remove --with-stabs documentation.
6260 2021-09-10 liuhongt <hongtao.liu@intel.com>
6262 * config/i386/avx512fp16intrin.h: (_mm512_cmp_ph_mask):
6264 (_mm512_mask_cmp_ph_mask): Likewise.
6265 (_mm512_cmp_round_ph_mask): Likewise.
6266 (_mm512_mask_cmp_round_ph_mask): Likewise.
6267 (_mm_cmp_sh_mask): Likewise.
6268 (_mm_mask_cmp_sh_mask): Likewise.
6269 (_mm_cmp_round_sh_mask): Likewise.
6270 (_mm_mask_cmp_round_sh_mask): Likewise.
6271 (_mm_comieq_sh): Likewise.
6272 (_mm_comilt_sh): Likewise.
6273 (_mm_comile_sh): Likewise.
6274 (_mm_comigt_sh): Likewise.
6275 (_mm_comige_sh): Likewise.
6276 (_mm_comineq_sh): Likewise.
6277 (_mm_ucomieq_sh): Likewise.
6278 (_mm_ucomilt_sh): Likewise.
6279 (_mm_ucomile_sh): Likewise.
6280 (_mm_ucomigt_sh): Likewise.
6281 (_mm_ucomige_sh): Likewise.
6282 (_mm_ucomineq_sh): Likewise.
6283 (_mm_comi_round_sh): Likewise.
6284 (_mm_comi_sh): Likewise.
6285 * config/i386/avx512fp16vlintrin.h (_mm_cmp_ph_mask): New intrinsic.
6286 (_mm_mask_cmp_ph_mask): Likewise.
6287 (_mm256_cmp_ph_mask): Likewise.
6288 (_mm256_mask_cmp_ph_mask): Likewise.
6289 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
6290 * config/i386/i386-builtin.def: Add corresponding new builtins.
6291 * config/i386/i386-expand.c
6292 (ix86_expand_args_builtin): Handle new builtin types.
6293 (ix86_expand_round_builtin): Ditto.
6294 * config/i386/i386.md (ssevecmode): Add HF mode.
6295 (MODEFH): New mode iterator.
6296 * config/i386/sse.md
6297 (V48H_AVX512VL): New mode iterator to support HF vector modes.
6298 Ajdust corresponding description.
6299 (ssecmpintprefix): New.
6300 (VI12_AVX512VL): Adjust to support HF vector modes.
6301 (cmp_imm_predicate): Likewise.
6302 (<avx512>_cmp<mode>3<mask_scalar_merge_name><round_saeonly_name>):
6304 (avx512f_vmcmp<mode>3<round_saeonly_name>): Likewise.
6305 (avx512f_vmcmp<mode>3_mask<round_saeonly_name>): Likewise.
6306 (<sse>_<unord>comi<round_saeonly_name>): Likewise.
6308 2021-09-10 liuhongt <hongtao.liu@intel.com>
6310 * config/i386/avx512fp16intrin.h: (_mm512_max_ph): New intrinsic.
6311 (_mm512_mask_max_ph): Likewise.
6312 (_mm512_maskz_max_ph): Likewise.
6313 (_mm512_min_ph): Likewise.
6314 (_mm512_mask_min_ph): Likewise.
6315 (_mm512_maskz_min_ph): Likewise.
6316 (_mm512_max_round_ph): Likewise.
6317 (_mm512_mask_max_round_ph): Likewise.
6318 (_mm512_maskz_max_round_ph): Likewise.
6319 (_mm512_min_round_ph): Likewise.
6320 (_mm512_mask_min_round_ph): Likewise.
6321 (_mm512_maskz_min_round_ph): Likewise.
6322 (_mm_max_sh): Likewise.
6323 (_mm_mask_max_sh): Likewise.
6324 (_mm_maskz_max_sh): Likewise.
6325 (_mm_min_sh): Likewise.
6326 (_mm_mask_min_sh): Likewise.
6327 (_mm_maskz_min_sh): Likewise.
6328 (_mm_max_round_sh): Likewise.
6329 (_mm_mask_max_round_sh): Likewise.
6330 (_mm_maskz_max_round_sh): Likewise.
6331 (_mm_min_round_sh): Likewise.
6332 (_mm_mask_min_round_sh): Likewise.
6333 (_mm_maskz_min_round_sh): Likewise.
6334 * config/i386/avx512fp16vlintrin.h (_mm_max_ph): New intrinsic.
6335 (_mm256_max_ph): Likewise.
6336 (_mm_mask_max_ph): Likewise.
6337 (_mm256_mask_max_ph): Likewise.
6338 (_mm_maskz_max_ph): Likewise.
6339 (_mm256_maskz_max_ph): Likewise.
6340 (_mm_min_ph): Likewise.
6341 (_mm256_min_ph): Likewise.
6342 (_mm_mask_min_ph): Likewise.
6343 (_mm256_mask_min_ph): Likewise.
6344 (_mm_maskz_min_ph): Likewise.
6345 (_mm256_maskz_min_ph): Likewise.
6346 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
6347 * config/i386/i386-builtin.def: Add corresponding new builtins.
6348 * config/i386/i386-expand.c
6349 (ix86_expand_args_builtin): Handle new builtin types.
6350 * config/i386/sse.md
6351 (<code><mode>3<mask_name><round_saeonly_name>): Adjust to
6352 support HF vector modes.
6353 (*<code><mode>3<mask_name><round_saeonly_name>): Likewise.
6354 (ieee_<ieee_maxmin><mode>3<mask_name><round_saeonly_name>):
6356 (<sse>_vm<code><mode>3<mask_scalar_name><round_saeonly_scalar_name>):
6358 * config/i386/subst.md (round_saeonly_mode512bit_condition):
6359 Adjust for HF vector modes.
6361 2021-09-10 Liu, Hongtao <hongtao.liu@intel.com>
6363 * config/i386/avx512fp16intrin.h (_mm_add_sh): New intrinsic.
6364 (_mm_mask_add_sh): Likewise.
6365 (_mm_maskz_add_sh): Likewise.
6366 (_mm_sub_sh): Likewise.
6367 (_mm_mask_sub_sh): Likewise.
6368 (_mm_maskz_sub_sh): Likewise.
6369 (_mm_mul_sh): Likewise.
6370 (_mm_mask_mul_sh): Likewise.
6371 (_mm_maskz_mul_sh): Likewise.
6372 (_mm_div_sh): Likewise.
6373 (_mm_mask_div_sh): Likewise.
6374 (_mm_maskz_div_sh): Likewise.
6375 (_mm_add_round_sh): Likewise.
6376 (_mm_mask_add_round_sh): Likewise.
6377 (_mm_maskz_add_round_sh): Likewise.
6378 (_mm_sub_round_sh): Likewise.
6379 (_mm_mask_sub_round_sh): Likewise.
6380 (_mm_maskz_sub_round_sh): Likewise.
6381 (_mm_mul_round_sh): Likewise.
6382 (_mm_mask_mul_round_sh): Likewise.
6383 (_mm_maskz_mul_round_sh): Likewise.
6384 (_mm_div_round_sh): Likewise.
6385 (_mm_mask_div_round_sh): Likewise.
6386 (_mm_maskz_div_round_sh): Likewise.
6387 * config/i386/i386-builtin-types.def: Add corresponding builtin types.
6388 * config/i386/i386-builtin.def: Add corresponding new builtins.
6389 * config/i386/i386-expand.c
6390 (ix86_expand_round_builtin): Handle new builtins.
6391 * config/i386/sse.md (VF_128): Change description.
6392 (<sse>_vm<plusminus_insn><mode>3<mask_scalar_name><round_scalar_name>):
6393 Adjust to support HF vector modes.
6394 (<sse>_vm<multdiv_mnemonic><mode>3<mask_scalar_name><round_scalar_name>):
6397 2021-09-10 H.J. Lu <hjl.tools@gmail.com>
6399 * config/i386/i386-expand.c
6400 (ix86_avx256_split_vector_move_misalign): Handle V16HF mode.
6401 * config/i386/i386.c
6402 (ix86_preferred_simd_mode): Handle HF mode.
6403 * config/i386/sse.md (V_256H): New mode iterator.
6404 (avx_vextractf128<mode>): Use it.
6405 (VEC_INIT_MODE): Align vector HFmode condition to vector
6406 HImodes since there're no real HF instruction used.
6407 (VEC_INIT_HALF_MODE): Ditto.
6409 (VIHF_AVX512BW): Ditto.
6410 (*vec_extracthf): Ditto.
6411 (VEC_EXTRACT_MODE): Ditto.
6413 2021-09-10 Richard Biener <rguenther@suse.de>
6416 * config/dbx.h: Remove.
6417 * config/dbxcoff.h: Do not define PREFERRED_DEBUGGING_TYPE.
6418 * config/lynx.h: Likewise.
6420 2021-09-10 liuhongt <hongtao.liu@intel.com>
6422 * config/i386/i386-expand.c (ix86_expand_copysign): Expand
6423 right into ANDNOT + AND + IOR, using paradoxical subregs.
6424 (ix86_split_copysign_const): Remove.
6425 (ix86_split_copysign_var): Ditto.
6426 * config/i386/i386-protos.h (ix86_split_copysign_const): Dotto.
6427 (ix86_split_copysign_var): Ditto.
6428 * config/i386/i386.md (@copysign<mode>3_const): Ditto.
6429 (@copysign<mode>3_var): Ditto.
6431 2021-09-09 qing zhao <qing.zhao@oracle.com>
6433 * builtins.c (expand_builtin_memset): Make external visible.
6434 * builtins.h (expand_builtin_memset): Declare extern.
6435 * common.opt (ftrivial-auto-var-init=): New option.
6436 * doc/extend.texi: Document the uninitialized attribute.
6437 * doc/invoke.texi: Document -ftrivial-auto-var-init.
6438 * flag-types.h (enum auto_init_type): New enumerated type
6440 * gimple-fold.c (clear_padding_type): Add one new parameter.
6441 (clear_padding_union): Likewise.
6442 (clear_padding_emit_loop): Likewise.
6443 (clear_type_padding_in_mask): Likewise.
6444 (gimple_fold_builtin_clear_padding): Handle this new parameter.
6445 * gimplify.c (gimple_add_init_for_auto_var): New function.
6446 (gimple_add_padding_init_for_auto_var): New function.
6447 (is_var_need_auto_init): New function.
6448 (gimplify_decl_expr): Add initialization to automatic variables per
6450 (gimplify_call_expr): Add one new parameter for call to
6451 __builtin_clear_padding.
6452 (gimplify_init_constructor): Add padding initialization in the end.
6453 * internal-fn.c (INIT_PATTERN_VALUE): New macro.
6454 (expand_DEFERRED_INIT): New function.
6455 * internal-fn.def (DEFERRED_INIT): New internal function.
6456 * tree-cfg.c (verify_gimple_call): Verify calls to .DEFERRED_INIT.
6457 * tree-sra.c (generate_subtree_deferred_init): New function.
6458 (scan_function): Avoid setting cannot_scalarize_away_bitmap for
6459 calls to .DEFERRED_INIT.
6460 (sra_modify_deferred_init): New function.
6461 (sra_modify_function_body): Handle calls to DEFERRED_INIT specially.
6462 * tree-ssa-structalias.c (find_func_aliases_for_call): Likewise.
6463 * tree-ssa-uninit.c (warn_uninit): Handle calls to DEFERRED_INIT
6465 (check_defs): Likewise.
6466 (warn_uninitialized_vars): Likewise.
6467 * tree-ssa.c (ssa_undefined_value_p): Likewise.
6468 * tree.c (build_common_builtin_nodes): Build tree node for
6469 BUILT_IN_CLEAR_PADDING when needed.
6471 2021-09-09 Richard Biener <rguenther@suse.de>
6473 * tree-ssa-loop-im.c (fill_always_executed_in_1): Walk
6476 2021-09-09 Richard Biener <rguenther@suse.de>
6478 * tree-ssa-loop-im.c (fill_always_executed_in_1): Integrate
6479 DOM walk from get_loop_body_in_dom_order using a worklist
6482 2021-09-09 liuhongt <hongtao.liu@intel.com>
6484 * config.gcc: Add avx512fp16vlintrin.h.
6485 * config/i386/avx512fp16intrin.h: (_mm512_add_ph): New intrinsic.
6486 (_mm512_mask_add_ph): Likewise.
6487 (_mm512_maskz_add_ph): Likewise.
6488 (_mm512_sub_ph): Likewise.
6489 (_mm512_mask_sub_ph): Likewise.
6490 (_mm512_maskz_sub_ph): Likewise.
6491 (_mm512_mul_ph): Likewise.
6492 (_mm512_mask_mul_ph): Likewise.
6493 (_mm512_maskz_mul_ph): Likewise.
6494 (_mm512_div_ph): Likewise.
6495 (_mm512_mask_div_ph): Likewise.
6496 (_mm512_maskz_div_ph): Likewise.
6497 (_mm512_add_round_ph): Likewise.
6498 (_mm512_mask_add_round_ph): Likewise.
6499 (_mm512_maskz_add_round_ph): Likewise.
6500 (_mm512_sub_round_ph): Likewise.
6501 (_mm512_mask_sub_round_ph): Likewise.
6502 (_mm512_maskz_sub_round_ph): Likewise.
6503 (_mm512_mul_round_ph): Likewise.
6504 (_mm512_mask_mul_round_ph): Likewise.
6505 (_mm512_maskz_mul_round_ph): Likewise.
6506 (_mm512_div_round_ph): Likewise.
6507 (_mm512_mask_div_round_ph): Likewise.
6508 (_mm512_maskz_div_round_ph): Likewise.
6509 * config/i386/avx512fp16vlintrin.h: New header.
6510 * config/i386/i386-builtin-types.def (V16HF, V8HF, V32HF):
6511 Add new builtin types.
6512 * config/i386/i386-builtin.def: Add corresponding builtins.
6513 * config/i386/i386-expand.c
6514 (ix86_expand_args_builtin): Handle new builtin types.
6515 (ix86_expand_round_builtin): Likewise.
6516 * config/i386/immintrin.h: Include avx512fp16vlintrin.h
6517 * config/i386/sse.md (VFH): New mode_iterator.
6519 (avx512fmaskmode): Add HF vector modes.
6520 (avx512fmaskhalfmode): Likewise.
6521 (<plusminus_insn><mode>3<mask_name><round_name>): Adjust to for
6523 (*<plusminus_insn><mode>3<mask_name><round_name>): Likewise.
6524 (mul<mode>3<mask_name><round_name>): Likewise.
6525 (*mul<mode>3<mask_name><round_name>): Likewise.
6526 (div<mode>3): Likewise.
6527 (<sse>_div<mode>3<mask_name><round_name>): Likewise.
6528 * config/i386/subst.md (SUBST_V): Add HF vector modes.
6529 (SUBST_A): Likewise.
6530 (round_mode512bit_condition): Adjust for V32HFmode.
6532 2021-09-09 liuhongt <hongtao.liu@intel.com>
6535 * config/i386/sse.md (reduc_plus_scal_<mode>): Split to ..
6536 (reduc_plus_scal_v4sf): .. this, New define_expand.
6537 (reduc_plus_scal_v2df): .. and this, New define_expand.
6539 2021-09-09 liuhongt <hongtao.liu@intel.com>
6542 * config/i386/sse.md (*vec_extract<mode><ssescalarmodelower>_valign):
6545 2021-09-08 Jonathan Wakely <jwakely@redhat.com>
6548 * doc/trouble.texi (Copy Assignment): Fix description of
6549 behaviour and fix code in example.
6551 2021-09-08 Segher Boessenkool <segher@kernel.crashing.org>
6554 * config/rs6000/rs6000-logue.c (rs6000_emit_epilogue): For ELFv2 use
6555 r11 instead of r12 for restoring CR.
6557 2021-09-08 Jakub Jelinek <jakub@redhat.com>
6558 liuhongt <hongtao.liu@intel.com>
6561 * config/i386/i386.md (@xorsign<mode>3_1): Remove.
6562 * config/i386/i386-expand.c (ix86_expand_xorsign): Expand right away
6563 into AND with mask and XOR, using paradoxical subregs.
6564 (ix86_split_xorsign): Remove.
6565 * config/i386/i386-protos.h (ix86_split_xorsign): Remove.
6567 2021-09-08 Di Zhao <dizhao@os.amperecomputing.com>
6569 * tree-ssa-sccvn.c (vn_nary_op_insert_into): fix result compare
6571 2021-09-08 Jakub Jelinek <jakub@redhat.com>
6574 * config/i386/i386.md (xorsign<mode>3): If operands[1] is equal to
6575 operands[2], emit abs<mode>2 instead.
6576 (@xorsign<mode>3_1): Add early-clobbers for output operand, enable
6577 first alternative even for avx, add another alternative with
6578 =&Yv <- 0, Yv, Yvm constraints.
6579 * config/i386/i386-expand.c (ix86_split_xorsign): If op0 is equal
6580 to op1, emit vpandn instead.
6582 2021-09-08 liuhongt <hongtao.liu@intel.com>
6584 * config/i386/avx512fp16intrin.h (_mm_set_ph): New intrinsic.
6585 (_mm256_set_ph): Likewise.
6586 (_mm512_set_ph): Likewise.
6587 (_mm_setr_ph): Likewise.
6588 (_mm256_setr_ph): Likewise.
6589 (_mm512_setr_ph): Likewise.
6590 (_mm_set1_ph): Likewise.
6591 (_mm256_set1_ph): Likewise.
6592 (_mm512_set1_ph): Likewise.
6593 (_mm_setzero_ph): Likewise.
6594 (_mm256_setzero_ph): Likewise.
6595 (_mm512_setzero_ph): Likewise.
6596 (_mm_set_sh): Likewise.
6597 (_mm_load_sh): Likewise.
6598 (_mm_store_sh): Likewise.
6599 * config/i386/i386-builtin-types.def (V8HF): New type.
6600 (DEF_FUNCTION_TYPE (V8HF, V8HI)): New builtin function type
6601 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
6602 Support vector HFmodes.
6603 (ix86_expand_vector_init_one_nonzero): Likewise.
6604 (ix86_expand_vector_init_one_var): Likewise.
6605 (ix86_expand_vector_init_interleave): Likewise.
6606 (ix86_expand_vector_init_general): Likewise.
6607 (ix86_expand_vector_set): Likewise.
6608 (ix86_expand_vector_extract): Likewise.
6609 (ix86_expand_vector_init_concat): Likewise.
6610 (ix86_expand_sse_movcc): Handle vector HFmodes.
6611 (ix86_expand_vector_set_var): Ditto.
6612 * config/i386/i386-modes.def: Add HF vector modes in comment.
6613 * config/i386/i386.c (classify_argument): Add HF vector modes.
6614 (ix86_hard_regno_mode_ok): Allow HF vector modes for AVX512FP16.
6615 (ix86_vector_mode_supported_p): Likewise.
6616 (ix86_set_reg_reg_cost): Handle vector HFmode.
6617 (ix86_get_ssemov): Handle vector HFmode.
6618 (function_arg_advance_64): Pass unamed V16HFmode and V32HFmode
6620 (function_arg_advance_32): Pass V8HF/V16HF/V32HF by sse reg for 32bit
6622 (function_arg_advance_32): Ditto.
6623 * config/i386/i386.h (VALID_AVX512FP16_REG_MODE): New.
6624 (VALID_AVX256_REG_OR_OI_MODE): Rename to ..
6625 (VALID_AVX256_REG_OR_OI_VHF_MODE): .. this, and add V16HF.
6626 (VALID_SSE2_REG_VHF_MODE): New.
6627 (VALID_AVX512VL_128_REG_MODE): Add V8HF and TImode.
6628 (SSE_REG_MODE_P): Add vector HFmode.
6629 * config/i386/i386.md (mode): Add HF vector modes.
6630 (MODE_SIZE): Likewise.
6631 (ssemodesuffix): Add ph suffix for HF vector modes.
6632 * config/i386/sse.md (VFH_128): New mode iterator.
6633 (VMOVE): Adjust for HF vector modes.
6635 (V_256_512): Likewise.
6637 (avx512fmaskmode): Likewise.
6638 (shuffletype): Likewise.
6639 (sseinsnmode): Likewise.
6640 (ssedoublevecmode): Likewise.
6641 (ssehalfvecmode): Likewise.
6642 (ssehalfvecmodelower): Likewise.
6643 (ssePScmode): Likewise.
6644 (ssescalarmode): Likewise.
6645 (ssescalarmodelower): Likewise.
6646 (sseintprefix): Likewise.
6648 (bcstscalarsuff): Likewise.
6649 (xtg_mode): Likewise.
6650 (VI12HF_AVX512VL): New mode_iterator.
6651 (VF_AVX512FP16): Likewise.
6653 (VIHF_256): Likewise.
6654 (VIHF_AVX512BW): Likewise.
6655 (V16_256): Likewise.
6656 (V32_512): Likewise.
6657 (sseintmodesuffix): New mode_attr.
6658 (sse): Add scalar and vector HFmodes.
6659 (ssescalarmode): Add vector HFmode mapping.
6660 (ssescalarmodesuffix): Add sh suffix for HFmode.
6661 (*<sse>_vm<insn><mode>3): Use VFH_128.
6662 (*<sse>_vm<multdiv_mnemonic><mode>3): Likewise.
6663 (*ieee_<ieee_maxmin><mode>3): Likewise.
6664 (<avx512>_blendm<mode>): New define_insn.
6665 (vec_setv8hf): New define_expand.
6666 (vec_set<mode>_0): New define_insn for HF vector set.
6667 (*avx512fp16_movsh): Likewise.
6668 (avx512fp16_movsh): Likewise.
6669 (vec_extract_lo_v32hi): Rename to ...
6670 (vec_extract_lo_<mode>): ... this, and adjust to allow HF
6672 (vec_extract_hi_v32hi): Likewise.
6673 (vec_extract_hi_<mode>): Likewise.
6674 (vec_extract_lo_v16hi): Likewise.
6675 (vec_extract_lo_<mode>): Likewise.
6676 (vec_extract_hi_v16hi): Likewise.
6677 (vec_extract_hi_<mode>): Likewise.
6678 (vec_set_hi_v16hi): Likewise.
6679 (vec_set_hi_<mode>): Likewise.
6680 (vec_set_lo_v16hi): Likewise.
6681 (vec_set_lo_<mode>): Likewise.
6682 (*vec_extract<mode>_0): New define_insn_and_split for HF
6684 (*vec_extracthf): New define_insn.
6685 (VEC_EXTRACT_MODE): Add HF vector modes.
6686 (PINSR_MODE): Add V8HF.
6687 (sse2p4_1): Likewise.
6688 (pinsr_evex_isa): Likewise.
6689 (<sse2p4_1>_pinsr<ssemodesuffix>): Adjust to support
6690 insert for V8HFmode.
6691 (pbroadcast_evex_isa): Add HF vector modes.
6692 (AVX2_VEC_DUP_MODE): Likewise.
6693 (VEC_INIT_MODE): Likewise.
6694 (VEC_INIT_HALF_MODE): Likewise.
6695 (avx2_pbroadcast<mode>): Adjust to support HF vector mode
6697 (avx2_pbroadcast<mode>_1): Likewise.
6698 (<avx512>_vec_dup<mode>_1): Likewise.
6699 (<avx512>_vec_dup<mode><mask_name>): Likewise.
6700 (<mask_codefor><avx512>_vec_dup_gpr<mode><mask_name>):
6703 2021-09-08 Guo, Xuepeng <xuepeng.guo@intel.com>
6704 H.J. Lu <hongjiu.lu@intel.com>
6705 Liu Hongtao <hongtao.liu@intel.com>
6706 Wang Hongyu <hongyu.wang@intel.com>
6707 Xu Dianhong <dianhong.xu@intel.com>
6709 * common/config/i386/cpuinfo.h (get_available_features):
6710 Detect FEATURE_AVX512FP16.
6711 * common/config/i386/i386-common.c
6712 (OPTION_MASK_ISA_AVX512FP16_SET,
6713 OPTION_MASK_ISA_AVX512FP16_UNSET,
6714 OPTION_MASK_ISA2_AVX512FP16_SET,
6715 OPTION_MASK_ISA2_AVX512FP16_UNSET): New.
6716 (OPTION_MASK_ISA2_AVX512BW_UNSET,
6717 OPTION_MASK_ISA2_AVX512BF16_UNSET): Add AVX512FP16.
6718 (ix86_handle_option): Handle -mavx512fp16.
6719 * common/config/i386/i386-cpuinfo.h (enum processor_features):
6720 Add FEATURE_AVX512FP16.
6721 * common/config/i386/i386-isas.h: Add entry for AVX512FP16.
6722 * config.gcc: Add avx512fp16intrin.h.
6723 * config/i386/avx512fp16intrin.h: New intrinsic header.
6724 * config/i386/cpuid.h: Add bit_AVX512FP16.
6725 * config/i386/i386-builtin-types.def: (FLOAT16): New primitive type.
6726 * config/i386/i386-builtins.c: Support _Float16 type for i386
6728 (ix86_register_float16_builtin_type): New function.
6729 (ix86_float16_type_node): New.
6730 * config/i386/i386-c.c (ix86_target_macros_internal): Define
6732 * config/i386/i386-expand.c (ix86_expand_branch): Support
6734 (ix86_prepare_fp_compare_args): Adjust TARGET_SSE_MATH &&
6735 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
6736 (ix86_expand_fp_movcc): Ditto.
6737 * config/i386/i386-isa.def: Add PTA define for AVX512FP16.
6738 * config/i386/i386-options.c (isa2_opts): Add -mavx512fp16.
6739 (ix86_valid_target_attribute_inner_p): Add avx512fp16 attribute.
6740 * config/i386/i386.c (ix86_get_ssemov): Use
6741 vmovdqu16/vmovw/vmovsh for HFmode/HImode scalar or vector.
6742 (ix86_get_excess_precision): Use
6743 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when TARGET_AVX512FP16
6745 (sse_store_index): Use SFmode cost for HFmode cost.
6746 (inline_memory_move_cost): Add HFmode, and perfer SSE cost over
6747 GPR cost for HFmode.
6748 (ix86_hard_regno_mode_ok): Allow HImode in sse register.
6749 (ix86_mangle_type): Add manlging for _Float16 type.
6750 (inline_secondary_memory_needed): No memory is needed for
6751 16bit movement between gpr and sse reg under
6753 (ix86_multiplication_cost): Adjust TARGET_SSE_MATH &&
6754 SSE_FLOAT_MODE_P to SSE_FLOAT_MODE_SSEMATH_OR_HF_P.
6755 (ix86_division_cost): Ditto.
6756 (ix86_rtx_costs): Ditto.
6757 (ix86_add_stmt_cost): Ditto.
6758 (ix86_optab_supported_p): Ditto.
6759 * config/i386/i386.h (VALID_AVX512F_SCALAR_MODE): Add HFmode.
6760 (SSE_FLOAT_MODE_SSEMATH_OR_HF_P): Add HFmode.
6761 (PTA_SAPPHIRERAPIDS): Add PTA_AVX512FP16.
6762 * config/i386/i386.md (mode): Add HFmode.
6763 (MODE_SIZE): Add HFmode.
6764 (isa): Add avx512fp16.
6765 (enabled): Handle avx512fp16.
6766 (ssemodesuffix): Add sh suffix for HFmode.
6767 (comm): Add mult, div.
6768 (plusminusmultdiv): New code iterator.
6769 (insn): Add mult, div.
6770 (*movhf_internal): Adjust for avx512fp16 instruction.
6771 (*movhi_internal): Ditto.
6772 (*cmpi<unord>hf): New define_insn for HFmode.
6773 (*ieee_s<ieee_maxmin>hf3): Likewise.
6774 (extendhf<mode>2): Likewise.
6775 (trunc<mode>hf2): Likewise.
6776 (float<floatunssuffix><mode>hf2): Likewise.
6777 (*<insn>hf): Likewise.
6778 (cbranchhf4): New expander.
6779 (movhfcc): Likewise.
6780 (<insn>hf3): Likewise.
6783 * config/i386/i386.opt: Add mavx512fp16.
6784 * config/i386/immintrin.h: Include avx512fp16intrin.h.
6785 * doc/invoke.texi: Add mavx512fp16.
6786 * doc/extend.texi: Add avx512fp16 Usage Notes.
6788 2021-09-08 liuhongt <hongtao.liu@intel.com>
6790 * common.opt: Support -fexcess-precision=16.
6791 * config/aarch64/aarch64.c (aarch64_excess_precision): Return
6792 FLT_EVAL_METHOD_PROMOTE_TO_FLOAT16 when
6793 EXCESS_PRECISION_TYPE_FLOAT16.
6794 * config/arm/arm.c (arm_excess_precision): Ditto.
6795 * config/i386/i386.c (ix86_get_excess_precision): Ditto.
6796 * config/m68k/m68k.c (m68k_excess_precision): Issue an error
6797 when EXCESS_PRECISION_TYPE_FLOAT16.
6798 * config/s390/s390.c (s390_excess_precision): Ditto.
6799 * coretypes.h (enum excess_precision_type): Add
6800 EXCESS_PRECISION_TYPE_FLOAT16.
6801 * doc/tm.texi (TARGET_C_EXCESS_PRECISION): Update documents.
6802 * doc/tm.texi.in (TARGET_C_EXCESS_PRECISION): Ditto.
6803 * doc/extend.texi (Half-Precision): Document
6804 -fexcess-precision=16.
6805 * flag-types.h (enum excess_precision): Add
6806 EXCESS_PRECISION_FLOAT16.
6807 * target.def (excess_precision): Update document.
6808 * tree.c (excess_precision_type): Set excess_precision_type to
6809 EXCESS_PRECISION_FLOAT16 when -fexcess-precision=16.
6811 2021-09-08 liuhongt <hongtao.liu@intel.com>
6813 * doc/extend.texi: (@node Floating Types): Adjust the wording.
6814 (@node Half-Precision): Ditto.
6816 2021-09-07 Takayuki 'January June' Suwa <jjsuwa_sys3175@yahoo.co.jp>
6819 * config/xtensa/xtensa.c (xtensa_emit_move_sequence): Add
6820 'CONST_INT_P (src)' to the condition of the block that tries to
6821 eliminate literal when loading integer contant.
6823 2021-09-07 David Faust <david.faust@oracle.com>
6825 * doc/extend.texi (BPF Type Attributes) New node.
6826 Document new preserve_access_index attribute.
6827 Document new preserve_access_index builtin.
6828 * doc/invoke.texi: Document -mco-re and -mno-co-re options.
6830 2021-09-07 David Faust <david.faust@oracle.com>
6832 * config/bpf/bpf.c: Adjust includes.
6833 (bpf_handle_preserve_access_index_attribute): New function.
6834 (bpf_attribute_table): Use it here.
6835 (bpf_builtins): Add BPF_BUILTIN_PRESERVE_ACCESS_INDEX.
6836 (bpf_option_override): Handle "-mco-re" option.
6837 (bpf_asm_init_sections): New.
6838 (TARGET_ASM_INIT_SECTIONS): Redefine.
6839 (bpf_file_end): New.
6840 (TARGET_ASM_FILE_END): Redefine.
6841 (bpf_init_builtins): Add "__builtin_preserve_access_index".
6842 (bpf_core_compute, bpf_core_get_index): New.
6843 (is_attr_preserve_access): New.
6844 (bpf_expand_builtin): Handle new builtins.
6845 (bpf_core_newdecl, bpf_core_is_maybe_aggregate_access): New.
6846 (bpf_core_walk): New.
6847 (bpf_resolve_overloaded_builtin): New.
6848 (TARGET_RESOLVE_OVERLOADED_BUILTIN): Redefine.
6850 (pass_bpf_core_attr): New RTL pass.
6851 * config/bpf/bpf-passes.def: New file.
6852 * config/bpf/bpf-protos.h (make_pass_bpf_core_attr): New.
6853 * config/bpf/coreout.c: New file.
6854 * config/bpf/coreout.h: Likewise.
6855 * config/bpf/t-bpf (TM_H): Add $(srcdir)/config/bpf/coreout.h.
6856 (coreout.o): New rule.
6857 (PASSES_EXTRA): Add $(srcdir)/config/bpf/bpf-passes.def.
6858 * config.gcc (bpf): Add coreout.h to extra_headers.
6859 Add coreout.o to extra_objs.
6860 Add $(srcdir)/config/bpf/coreout.c to target_gtfiles.
6862 2021-09-07 David Faust <david.faust@oracle.com>
6864 * btfout.c (get_btf_id): Function is no longer static.
6865 * ctfc.h: Expose it here.
6867 2021-09-07 David Faust <david.faust@oracle.com>
6869 * ctfc.c (ctf_lookup_tree_type): New function.
6872 2021-09-07 David Faust <david.faust@oracle.com>
6874 * ctfc.c (ctf_dtd_lookup): Function is no longer static.
6875 * ctfc.h: Analogous change.
6877 2021-09-07 David Faust <david.faust@oracle.com>
6879 * dwarf2out.c (lookup_type_die): Function is no longer static.
6880 * dwarf2out.h: Expose it here.
6882 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
6884 * dwarf2ctf.c (ctf_debug_finalize): Make it static.
6885 (ctf_debug_early_finish): New definition.
6886 (ctf_debug_finish): Likewise.
6887 * dwarf2ctf.h (ctf_debug_finalize): Remove declaration.
6888 (ctf_debug_early_finish): New declaration.
6889 (ctf_debug_finish): Likewise.
6890 * dwarf2out.c (dwarf2out_finish): Invoke ctf_debug_finish.
6891 (dwarf2out_early_finish): Invoke ctf_debug_early_finish.
6893 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
6895 * config/bpf/bpf.c (bpf_option_override): For BPF backend, disable LTO
6896 support when compiling for CO-RE.
6897 * config/bpf/bpf.opt: Add new command line option -mco-re.
6899 2021-09-07 Indu Bhagat <indu.bhagat@oracle.com>
6901 * flag-types.h (enum debug_info_type): Add new enum
6902 DINFO_TYPE_BTF_WITH_CORE.
6903 (BTF_WITH_CORE_DEBUG): New bitmask.
6904 * flags.h (btf_with_core_debuginfo_p): New declaration.
6905 * opts.c (btf_with_core_debuginfo_p): New definition.
6907 2021-09-07 Jason Merrill <jason@redhat.com>
6909 * tree.h (error_operand_p): Change to inline function.
6911 2021-09-07 Aldy Hernandez <aldyh@redhat.com>
6913 * tree-ssa-threadedge.c (forwarder_block_p): Rename to...
6914 (empty_block_with_phis_p): ...this.
6915 (potentially_threadable_block): Same.
6916 (jump_threader::thread_through_normal_block): Same.
6918 2021-09-07 Eric Botcazou <ebotcazou@adacore.com>
6921 * dwarf2out.c (mark_base_types): New overloaded function.
6922 (dwarf2out_early_finish): Invoke it on the COMDAT type list as well
6923 as the compilation unit, and call move_marked_base_types afterward.
6925 2021-09-07 H.J. Lu <hjl.tools@gmail.com>
6928 * config/i386/i386-expand.c (ix86_expand_convert_uns_sisf_sse):
6930 (ix86_expand_vector_convert_uns_vsivsf): Likewise.
6932 2021-09-07 Richard Biener <rguenther@suse.de>
6934 PR tree-optimization/102226
6935 * tree-vect-loop.c (vect_transform_cycle_phi): Record
6936 the converted value for the epilogue PHI use.
6938 2021-09-07 Martin Liska <mliska@suse.cz>
6940 PR gcov-profile/80223
6941 * ipa-inline.c (can_inline_edge_p): Similarly to sanitizer
6942 options, do not inline when no_profile_instrument_function
6943 attributes are different in early inliner. It's fine to inline
6944 it after PGO instrumentation.
6946 2021-09-07 Richard Biener <rguenther@suse.de>
6948 PR tree-optimization/101555
6949 * tree-ssa-pre.c (translate_vuse_through_block): Do not
6950 perform an alias walk to determine the validity of the
6951 mem at the start of the block which is already guaranteed
6952 by means of prune_clobbered_mems.
6953 (phi_translate_1): Pass edge to translate_vuse_through_block.
6955 2021-09-07 Xionghu Luo <luoxhu@linux.ibm.com>
6958 * config/rs6000/rs6000.md (fmod<mode>3): New define_expand.
6959 (remainder<mode>3): Likewise.
6961 2021-09-07 YunQiang Su <yunqiang.su@cipunited.com>
6963 * config/mips/mips.c (mips_file_start): add .module for
6966 2021-09-06 Roger Sayle <roger@nextmovesoftware.com>
6968 * wide-int.cc (wi::clz): Reorder tests to ensure the result
6969 is zero for all negative values.
6971 2021-09-06 Tobias Burnus <tobias@codesourcery.com>
6973 * doc/invoke.texi (-foffload-options): Fix @opindex.
6975 2021-09-06 H.J. Lu <hjl.tools@gmail.com>
6978 * config/i386/i386-expand.c (ix86_split_xorsign): Use operands[2].
6979 * config/i386/i386.md (@xorsign<mode>3_1): Add non-destructive
6980 source alternative for AVX.
6982 2021-09-06 liuhongt <hongtao.liu@intel.com>
6984 PR middle-end/102182
6985 * optabs.c (expand_fix): Add from1 to avoid from being
6988 2021-09-06 Eric Botcazou <ebotcazou@adacore.com>
6990 * dwarf2out.c (modified_type_die): Deal with all array types earlier
6991 and use local variable consistently throughout the function.
6993 2021-09-06 Jakub Jelinek <jakub@redhat.com>
6995 PR tree-optimization/102207
6996 * match.pd: Don't demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
6997 were promoted from signed to wider unsigned type.
6999 2021-09-06 Andrew Pinski <apinski@marvell.com>
7001 PR tree-optimization/63184
7002 * match.pd: Add simplification of pointer_diff of two pointer_plus
7003 with addr_expr in the first operand of each pointer_plus.
7004 Add simplificatoin of ne/eq of two pointer_plus with addr_expr
7005 in the first operand of each pointer_plus.
7007 2021-09-06 Richard Biener <rguenther@suse.de>
7009 PR tree-optimization/102176
7010 * tree-vect-slp.c (vect_slp_gather_vectorized_scalar_stmts):
7012 (vect_bb_slp_scalar_cost): Use the computed set of
7013 vectorized scalar stmts instead of relying on the out-of-date
7014 and not accurate PURE_SLP_STMT.
7015 (vect_bb_vectorization_profitable_p): Compute the set
7016 of vectorized scalar stmts.
7018 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
7020 * gimple-range-path.cc (path_range_query::range_of_stmt): Remove
7021 GIMPLE_COND special casing.
7022 (path_range_query::range_defined_in_block): Use range_of_stmt
7023 instead of calling fold_range directly.
7025 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
7027 * gimple-range-path.cc (path_range_query::range_of_expr): Set
7028 m_undefined_path when appropriate.
7029 (path_range_query::internal_range_of_expr): Copy from range_of_expr.
7030 (path_range_query::unreachable_path_p): New.
7031 (path_range_query::precompute_ranges): Set m_undefined_path.
7032 * gimple-range-path.h (path_range_query::unreachable_path_p): New.
7033 (path_range_query::internal_range_of_expr): New.
7034 * tree-ssa-threadbackward.c (back_threader::find_taken_edge_cond):
7035 Use unreachable_path_p.
7037 2021-09-05 Aldy Hernandez <aldyh@redhat.com>
7039 * tree-ssa-threadbackward.c (back_threader::maybe_register_path):
7040 Remove argument and call find_taken_edge.
7041 (back_threader::resolve_phi): Do not calculate taken edge before
7042 calling maybe_register_path.
7043 (back_threader::find_paths_to_names): Same.
7045 2021-09-05 Jeff Law <jlaw@localhost.localdomain>
7047 * config/h8300/h8300.md (QHSI2 mode iterator): New mode iterator.
7048 * config/h8300/testcompare.md (store_c): Update name, use new
7050 (store_neg_c, store_shifted_c): New patterns.
7052 2021-09-03 Segher Boessenkool <segher@kernel.crashing.org>
7055 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): On ELFv2 use r11
7056 instead of r12 for CR save, in all cases.
7058 2021-09-03 Andrew Pinski <apinski@marvell.com>
7060 * config/aarch64/aarch64-sve-builtins.cc (register_vector_type):
7061 Handle error_mark_node as the type of the type_decl.
7063 2021-09-03 Andrew Pinski <apinski@marvell.com>
7065 * config/aarch64/aarch64-builtins.c (struct aarch64_simd_type_info):
7067 (aarch64_simd_types): Likewise.
7068 (aarch64_simd_intOI_type_node): Likewise.
7069 (aarch64_simd_intCI_type_node): Likewise.
7070 (aarch64_simd_intXI_type_node): Likewise.
7071 * config/aarch64/aarch64.h (aarch64_fp16_type_node): Likewise.
7072 (aarch64_fp16_ptr_type_node): Likewise.
7073 (aarch64_bf16_type_node): Likewise.
7074 (aarch64_bf16_ptr_type_node): Likewise.
7076 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7078 * range-op.cc (operator_minus::op1_op2_relation_effect): Abstract
7080 (minus_op1_op2_relation_effect): ...here.
7081 (class operator_pointer_diff): New.
7082 (operator_pointer_diff::op1_op2_relation_effect): Call
7083 minus_op1_op2_relation_effect.
7084 (integral_table::integral_table): Add entry for POINTER_DIFF_EXPR.
7086 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7088 * tree-ssa-threadbackward.c (back_threader::thread_through_all_blocks):
7089 Add may_peel_loop_headers.
7090 (back_threader_registry::thread_through_all_blocks): Same.
7091 (try_thread_blocks): Pass may_peel_loop_headers argument.
7092 (pass_early_thread_jumps::execute): Same.
7094 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7096 * tree-ssa-threadedge.c (has_phis_p): New.
7097 (forwarder_block_p): New.
7098 (potentially_threadable_block): Call forwarder_block_p.
7099 (jump_threader::thread_around_empty_blocks): Call has_phis_p.
7100 (jump_threader::thread_through_normal_block): Call
7103 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7105 * tree-ssa-threadbackward.c (back_threader::dump): New.
7106 (back_threader::debug): New.
7107 (back_threader_profitability::profitable_path_p): Dump blocks
7108 even if we are bailing early.
7110 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7112 * tree-ssa-threadupdate.c (cancel_thread): New.
7113 (jump_thread_path_registry::thread_block_1): Use cancel_thread.
7114 (jump_thread_path_registry::mark_threaded_blocks): Same.
7115 (jump_thread_path_registry::register_jump_thread): Same.
7117 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7119 * tree-ssa-threadedge.c (jt_state::push): Only call methods for
7120 which objects are available.
7121 (jt_state::pop): Same.
7122 (jt_state::register_equiv): Same.
7123 (jt_state::register_equivs_on_edge): Same.
7125 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7127 * tree-ssa-threadedge.c (jump_threader::thread_across_edge):
7128 Move pop until after a thread is registered.
7130 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7132 * tree-ssa-threadupdate.c (debug): New.
7134 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7136 * gimple-range-trace.cc (push_dump_file::push_dump_file): New.
7137 (push_dump_file::~push_dump_file): New.
7138 (dump_ranger): Change dump_file temporarily while dumping
7140 * gimple-range-trace.h (class push_dump_file): New.
7142 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7144 * gimple-range-trace.cc (debug_seed_ranger): Remove static.
7145 (dump_ranger): Dump function name.
7147 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7149 * gimple-range-path.cc (path_range_query::range_defined_in_block):
7150 Adjust for non-null.
7151 (path_range_query::adjust_for_non_null_uses): New.
7152 (path_range_query::precompute_ranges): Call
7153 adjust_for_non_null_uses.
7154 * gimple-range-path.h: Add m_non_null and
7155 adjust_for_non_null_uses.
7157 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7159 * gimple-range-path.cc (path_range_query::dump): Dump path
7161 (path_range_query::precompute_ranges): Dump entire path.
7163 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7165 * value-relation.cc (relation_oracle::debug): New.
7166 * value-relation.h (relation_oracle::debug): New.
7168 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7170 * tree-ssa-loop-ch.c: Remove unnecessary include file.
7172 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7174 * gimple-range-fold.cc (fold_using_range::postfold_gcond_edges):
7175 Skip statements with no defining BB.
7176 * gimple-range-path.cc (path_range_query::range_defined_in_block):
7177 Do not get confused by statements with no defining BB.
7179 2021-09-03 Aldy Hernandez <aldyh@redhat.com>
7181 * gimple-range-fold.cc (adjust_imagpart_expr): Move from
7182 gimple_range_adjustment. Add support for constants.
7183 (adjust_realpart_expr): New.
7184 (gimple_range_adjustment): Move IMAGPART_EXPR code to
7185 adjust_imagpart_expr.
7186 * range-op.cc (integral_table::integral_table): Add entry for
7189 2021-09-03 Jakub Jelinek <jakub@redhat.com>
7191 * omp-expand.c (expand_omp_atomic_pipeline): Use
7192 IFN_ATOMIC_COMPARE_EXCHANGE instead of
7193 BUILT_IN_SYNC_VAL_COMPARE_AND_SWAP_? so that memory order
7196 2021-09-03 Jakub Jelinek <jakub@redhat.com>
7199 * tree.h (DECL_FIELD_ABI_IGNORED): Changed into rvalue only macro
7200 that is false if DECL_BIT_FIELD.
7201 (SET_DECL_FIELD_ABI_IGNORED, DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD,
7202 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD): Define.
7203 * tree-streamer-out.c (pack_ts_decl_common_value_fields): For
7204 DECL_BIT_FIELD stream DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead
7205 of DECL_FIELD_ABI_IGNORED.
7206 * tree-streamer-in.c (unpack_ts_decl_common_value_fields): Use
7207 SET_DECL_FIELD_ABI_IGNORED instead of writing to
7208 DECL_FIELD_ABI_IGNORED and for DECL_BIT_FIELD use
7209 SET_DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead.
7210 * lto-streamer-out.c (hash_tree): For DECL_BIT_FIELD hash
7211 DECL_FIELD_CXX_ZERO_WIDTH_BIT_FIELD instead of DECL_FIELD_ABI_IGNORED.
7213 2021-09-03 liuhongt <hongtao.liu@intel.com>
7216 * config/i386/amxbf16intrin.h : Remove macro check for __AMX_BF16__.
7217 * config/i386/amxint8intrin.h : Remove macro check for __AMX_INT8__.
7218 * config/i386/amxtileintrin.h : Remove macro check for __AMX_TILE__.
7220 2021-09-02 Martin Sebor <msebor@redhat.com>
7222 PR tree-optimization/17506
7224 * tree-ssa-uninit.c (warn_uninit): Remove conditional guarding note.
7226 2021-09-02 Richard Biener <rguenther@suse.de>
7228 * tree-ssa-loop-im.c (fill_always_executed_in_1): Refine
7229 fix for PR78185 and continue processing when leaving
7232 2021-09-02 Jakub Jelinek <jakub@redhat.com>
7234 PR tree-optimization/99591
7235 * match.pd: Demote operands of IFN_{ADD,SUB,MUL}_OVERFLOW if they
7238 2021-09-02 Richard Biener <rguenther@suse.de>
7241 2021-09-02 Richard Biener <rguenther@suse.de>
7243 PR tree-optimization/102155
7244 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
7245 over a part of the RPO array and do not recurse here.
7246 Dump blocks marked as always executed.
7247 (fill_always_executed_in): Walk over the RPO array and
7248 process loops whose header we run into.
7249 (loop_invariant_motion_in_fun): Compute the first RPO
7250 using rev_post_order_and_mark_dfs_back_seme in iteration
7251 order and pass that to fill_always_executed_in.
7253 2021-09-02 liuhongt <hongtao.liu@intel.com>
7255 * config/i386/i386-modes.def (FLOAT_MODE): Define ieee HFmode.
7256 * config/i386/i386.c (enum x86_64_reg_class): Add
7258 (merge_classes): Handle X86_64_SSEHF_CLASS.
7259 (examine_argument): Ditto.
7260 (construct_container): Ditto.
7261 (classify_argument): Ditto, and set HFmode/HCmode to
7263 (function_value_32): Return _FLoat16/Complex Float16 by
7265 (function_value_64): Return _Float16/Complex Float16 by SSE
7267 (ix86_print_operand): Handle CONST_DOUBLE HFmode.
7268 (ix86_secondary_reload): Require gpr as intermediate register
7269 to store _Float16 from sse register when sse4 is not
7271 (ix86_libgcc_floating_mode_supported_p): Enable _FLoat16 under
7273 (ix86_scalar_mode_supported_p): Ditto.
7274 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Defined.
7275 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add HFmode.
7276 (VALID_INT_MODE_P): Add HFmode and HCmode.
7277 * config/i386/i386.md (*pushhf_rex64): New define_insn.
7279 (*movhf_internal): Ditto.
7280 * doc/extend.texi (Half-Precision Floating Point): Documemt
7283 2021-09-02 Richard Biener <rguenther@suse.de>
7285 PR tree-optimization/102155
7286 * tree-ssa-loop-im.c (fill_always_executed_in_1): Iterate
7287 over a part of the RPO array and do not recurse here.
7288 Dump blocks marked as always executed.
7289 (fill_always_executed_in): Walk over the RPO array and
7290 process loops whose header we run into.
7291 (loop_invariant_motion_in_fun): Compute the first RPO
7292 using rev_post_order_and_mark_dfs_back_seme in iteration
7293 order and pass that to fill_always_executed_in.
7295 2021-09-02 YunQiang Su <syq@debian.org>
7298 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
7300 * config/mips/mips.c (mips_module_isa_name): New.
7301 mips_file_start: add .module mipsREV to all asm output
7303 2021-09-01 Jeff Law <jlaw@localhost.localdomain>
7305 PR tree-optimization/102152
7306 * tree-ssa-dom.c (dom_opt_dom_walker::optimize_stmt): Reduce a vector
7307 comparison to a scalar comparison before calling
7308 update_stmt_if_modified.
7310 2021-09-01 Andrew Pinski <apinski@marvell.com>
7313 * config/aarch64/aarch64.c (aarch64_expand_setmem):
7314 Check STRICT_ALIGNMENT before creating an overlapping
7317 2021-09-01 Martin Sebor <msebor@redhat.com>
7319 * gimple-ssa-warn-access.cc (get_size_range): Add argument.
7320 (check_access): Pass additional argument.
7321 (check_memop_access): Remove template and make a member function.
7322 (maybe_check_dealloc_call): Make a pass_waccess member function.
7323 (class pass_waccess): Add, rename, and remove members.
7324 (pass_waccess::pass_waccess): Adjust to name change.
7325 (pass_waccess::~pass_waccess): Same.
7326 (check_alloca): Make a member function.
7327 (check_alloc_size_call): Same.
7328 (check_strcat): Same.
7329 (check_strncat): Same.
7330 (check_stxcpy): Same.
7331 (check_stxncpy): Same.
7332 (check_strncmp): Same.
7333 (maybe_warn_rdwr_sizes): Rename...
7334 (pass_waccess::maybe_check_access_sizes): ...to this.
7335 (pass_waccess::check_call): Adjust to name changes.
7336 (pass_waccess::maybe_check_dealloc_call): Make a pass_waccess member
7338 (pass_waccess::execute): Adjust to name changes.
7339 * gimple-ssa-warn-access.h (check_memop_access): Remove.
7340 * pointer-query.cc (access_ref::phi): Handle null pointer.
7341 (access_ref::inform_access): Same.
7342 (pointer_query::put_ref): Modify a cached value, not a copy of it.
7343 (pointer_query::dump): New function.
7344 (compute_objsize_r): Avoid overwriting access_ref::bndrng. Cache
7346 * pointer-query.h (pointer_query::dump): Declare.
7347 * tree-ssa-strlen.c (get_range): Simplify. Use function query.
7348 (dump_strlen_info): Use function query.
7349 (printf_strlen_execute): Factor code out into pointer_query::put_ref.
7351 2021-09-01 Thomas Schwinge <thomas@codesourcery.com>
7353 * tree.c (walk_tree_1) <OMP_CLAUSE>: Simplify.
7355 2021-09-01 Iain Sandoe <iain@sandoe.co.uk>
7357 * doc/extend.texi: Document unavailable attribute.
7358 * print-tree.c (print_node): Handle unavailable attribute.
7359 * tree-core.h (struct tree_base): Add a bit to carry unavailability.
7360 * tree.c (error_unavailable_use): New.
7361 * tree.h (TREE_UNAVAILABLE): New.
7362 (error_unavailable_use): New.
7364 2021-09-01 Jakub Jelinek <jakub@redhat.com>
7366 PR tree-optimization/102124
7367 * tree-vect-patterns.c (vect_recog_widen_op_pattern): For ORIG_CODE
7368 MINUS_EXPR, if itype is unsigned with smaller precision than type,
7369 add an extra cast to signed variant of itype to ensure sign-extension.
7371 2021-09-01 Martin Liska <mliska@suse.cz>
7373 * graph.c (draw_cfg_node_succ_edges): Do not color fallthru
7374 edges and rather use colors for TRUE and FALSE edges.
7376 2021-09-01 Richard Biener <rguenther@suse.de>
7378 PR tree-optimization/93491
7379 * tree-ssa-pre.c (compute_avail): Set BB_MAY_NOTRETURN
7380 after processing the stmt itself. Do not consider
7381 pure functions possibly not returning. Properly avoid
7382 adding possibly trapping calls to EXP_GEN when there's
7383 a preceeding possibly not returning call.
7384 * tree-ssa-sccvn.c (vn_reference_may_trap): Conservatively
7387 2021-09-01 Richard Biener <rguenther@suse.de>
7389 PR tree-optimization/102139
7390 * tree-vectorizer.h (vec_base_alignments): Adjust hash-map
7391 type to record a std::pair of the stmt-info and the innermost
7393 (dr_vec_info::group): New member.
7394 * tree-vect-data-refs.c (vect_record_base_alignment): Adjust.
7395 (vect_compute_data_ref_alignment): Verify the recorded
7396 base alignment can be used.
7397 (data_ref_pair): Remove.
7398 (dr_group_sort_cmp): Adjust.
7399 (vect_analyze_data_ref_accesses): Store the group-ID in the
7400 dr_vec_info and operate on a vector of dr_vec_infos.
7402 2021-09-01 YunQiang Su <yunqiang.su@cipunited.com>
7404 * read-md.c (md_reader::handle_enum): support value assignation.
7405 * doc/md.texi: record define_c_enum value assignation support.
7407 2021-09-01 Jakub Jelinek <jakub@redhat.com>
7409 PR tree-optimization/102141
7410 * gimple-ssa-store-merging.c (bswap_view_convert): Add BEFORE
7411 argument. If false, emit stmts after gsi instead of before, and
7413 (bswap_replace): Adjust callers. When converting output of bswap,
7414 emit VIEW_CONVERT prepratation stmts after a copy of gsi instead
7417 2021-09-01 liuhongt <hongtao.liu@intel.com>
7419 * emit-rtl.c (validate_subreg): Get rid of all float-int
7422 2021-09-01 liuhongt <hongtao.liu@intel.com>
7425 2021-08-30 liuhongt <hongtao.liu@intel.com>
7427 * expmed.c (extract_bit_field_1): Make sure we're playing with
7428 integral modes before call extract_integral_bit_field.
7429 (extract_integral_bit_field): Add a parameter of type
7430 scalar_int_mode which corresponds to of tmode.
7431 And call extract_and_convert_fixed_bit_field instead of
7432 extract_fixed_bit_field and convert_extracted_bit_field.
7433 (extract_and_convert_fixed_bit_field): New function, it's a
7434 combination of extract_fixed_bit_field and
7435 convert_extracted_bit_field.
7437 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
7439 * tree.c (walk_tree_1) <OMP_CLAUSE_TILE>: Handle three operands.
7441 2021-08-31 Thomas Schwinge <thomas@codesourcery.com>
7443 * omp-general.h (omp_is_reference): Rename to...
7444 (omp_privatize_by_reference): ... this. Adjust all users...
7445 * omp-general.c: ... here, ...
7446 * gimplify.c: ... here, ...
7447 * omp-expand.c: ... here, ...
7448 * omp-low.c: ... here.
7450 2021-08-31 Martin Sebor <msebor@redhat.com>
7452 * gimple-ssa-warn-access.cc (maybe_warn_alloc_args_overflow): Test
7453 pointer element for equality to zero, not that of the cotaining
7456 2021-08-31 Martin Sebor <msebor@redhat.com>
7458 * gcc-rich-location.h (gcc_rich_location): Make ctor explicit.
7460 2021-08-31 Martin Sebor <msebor@redhat.com>
7462 * function.h (function): Add comments.
7463 (get_range_query): Same. Add attribute returns nonnull.
7465 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
7467 * expr.c (convert_modes): Don't use subreg_promoted_mode on a
7468 SUBREG if it can't be guaranteed to a SUBREG_PROMOTED_VAR_P set.
7469 Instead use the standard (safer) is_a <scalar_int_mode> idiom.
7471 2021-08-31 Jeff Law <jlaw@localhost.localdomain>
7473 * config.gcc (cris-*-elf, cris-*-none): Remove dbxelf.h from
7475 (m32r-*-elf, m32rle-*-elf, m32r-*-linux): Likewise.
7476 (mn10300-*-*, am33_2.0-*-linux*): Likewise.
7477 (xtensa*-*-elf, xtensa*-*-linux, xtensa*-*-uclinux): Likewise.
7478 (m32c-*-elf*, m32c-*-rtems*): Likewise.
7479 * config/cris/cris.h (DBX_NO_XREFS): Remove.
7480 (DBX_CONTIN_LENGTH, DBX_CONTIN_CHAR): Likewise.
7481 * config/m32r/m32r.h (DBXOUT_SOURCE_LINE): Likewise.
7482 (DBX_DEBUGGING_INFO, DBX_CONTIN_LENGTH): Likewise.
7483 * config/mn10300/mn10300.h (DEFAULT_GDB_EXTENSIONS): Likewise.
7484 * config/mn10300/linux.h (DBX_REGISTER_NAMES): Likewise.
7486 2021-08-31 Marcel Vollweiler <marcel@codesourcery.com>
7488 * gimplify.c (gimplify_scan_omp_clauses): Error handling. 'ancestor' only
7489 allowed on target constructs and only with particular other clauses.
7490 * omp-expand.c (expand_omp_target): Output of 'sorry, not supported' if
7492 * omp-low.c (check_omp_nesting_restrictions): Error handling. No nested OpenMP
7493 structs when 'ancestor' is used.
7494 (scan_omp_1_stmt): No usage of OpenMP runtime routines in a target region when
7496 * tree-pretty-print.c (dump_omp_clause): Append 'ancestor'.
7497 * tree.h (OMP_CLAUSE_DEVICE_ANCESTOR): Define macro.
7499 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
7501 * expr.c (convert_modes): Preserve SUBREG_PROMOTED_VAR_P when
7502 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
7504 * simplify-rtx.c (simplify_unary_operation_1) [SIGN_EXTEND]:
7505 Likewise, preserve SUBREG_PROMOTED_VAR_P when creating a (wider)
7506 partial subreg from a SUBREG_PROMOTED_VAR_P subreg. Generate
7507 SIGN_EXTEND of the SUBREG_REG when a subreg would be paradoxical.
7508 [ZERO_EXTEND]: Likewise, preserve SUBREG_PROMOTED_VAR_P when
7509 creating a (wider) partial subreg from a SUBREG_PROMOTED_VAR_P
7510 subreg. Generate ZERO_EXTEND of the SUBREG_REG when a subreg
7511 would be paradoxical.
7513 2021-08-31 Roger Sayle <roger@nextmovesoftware.com>
7515 * combine.c (combine_simplify_rtx): Avoid converting an explicit
7516 TRUNCATE into a lowpart SUBREG on !TRULY_NOOP_TRUNCATION targets.
7517 * simplify-rtx.c (simplify_unary_operation_1): Likewise.
7519 2021-08-31 Richard Biener <rguenther@suse.de>
7521 PR tree-optimization/102142
7522 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Fix
7523 condition under which to unset the visited flag.
7525 2021-08-31 Richard Biener <rguenther@suse.de>
7527 PR middle-end/102129
7528 * tree-ssa-ter.c (find_replaceable_in_bb): Do not move
7529 possibly trapping expressions across calls.
7531 2021-08-31 Jakub Jelinek <jakub@redhat.com>
7533 PR tree-optimization/102134
7534 * tree-ssa-ccp.c (bit_value_binop) <case RSHIFT_EXPR>: If sgn is
7535 UNSIGNED and r1val | r1mask has MSB set, ensure lzcount doesn't
7538 2021-08-31 Andrew Pinski <apinski@marvell.com>
7541 * collect-utils.c (setup_signals): New declaration.
7542 * collect-utils.h (setup_signals): New function.
7543 * collect2.c (handler): Delete.
7544 (main): Instead of manually setting up the signals,
7545 just call setup_signals.
7546 * lto-wrapper.c (main): Likewise.
7548 2021-08-31 Andrew Pinski <apinski@marvell.com>
7551 * config/i386/i386-protos.h (x86_output_aligned_bss):
7552 Change align argument to unsigned type.
7553 (x86_elf_aligned_decl_common): Likewise.
7554 * config/i386/i386.c (x86_elf_aligned_decl_common): Likewise.
7555 (x86_output_aligned_bss): Likewise.
7557 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
7559 * config/mips/mips.c (mips_module_isa_name): New.
7560 mips_file_start: add .module mipsREV to all asm output
7562 2021-08-31 YunQiang Su <yunqiang.su@cipunited.com>
7564 * config/mips/mips.h (struct mips_cpu_info): define enum mips_isa;
7565 use enum instead of int for 'isa' member.
7566 * config.gcc, config/mips/mips.c, config/mips/mips-cpus.def,
7567 config/mips/netbsd.h: replace hardcoded numbers with enum.
7569 2021-08-31 liuhongt <hongtao.liu@intel.com>
7571 * config/i386/sse.md (*<avx512>_ucmp<mode>3_1): Change from
7572 define_split to define_insn_and_split.
7573 (*avx2_eq<mode>3): Removed.
7574 (<avx512>_eq<mode>3<mask_scalar_merge_name>): Adjust pattern
7575 (<avx512>_eq<mode>3<mask_scalar_merge_name>_1): Rename to ..
7576 (*<avx512>_eq<mode>3<mask_scalar_merge_name>_1): .. this, and
7578 (*avx2_gt<mode>3): Removed.
7579 (<avx512>_gt<mode>3<mask_scalar_merge_name>): Change from
7580 define_insn to define_expand, and adjust pattern.
7581 (UNSPEC_MASKED_EQ, UNSPEC_MASKED_GT): Removed.
7583 2021-08-30 David Malcolm <dmalcolm@redhat.com>
7586 * Makefile.in (ANALYZER_OBJS): Add analyzer/call-info.o.
7588 2021-08-30 Jason Merrill <jason@redhat.com>
7590 * doc/invoke.texi: Document -Wmissing-requires.
7592 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
7594 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Remove
7595 TARGET_EXTRA_BUILTINS guard.
7597 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
7599 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Change
7600 initialization of V2DI_type_node and unsigned_V2DI_type_node.
7602 2021-08-30 Bill Schmidt <wschmidt@linux.ibm.com>
7604 * config/rs6000/darwin.h (SUBTARGET_INIT_BUILTINS): Use the new
7605 decl when new_builtins_are_live.
7606 * config/rs6000/rs6000-builtin-new.def (__builtin_cfstring): New
7609 2021-08-30 Pat Haugen <pthaugen@linux.ibm.com>
7611 * config/rs6000/rs6000-cpus.def (ISA_3_1_MASKS_SERVER): Add
7612 OPTION_MASK_P10_FUSION_2STORE.
7613 (POWERPC_MASKS): Likewise.
7614 * config/rs6000/rs6000.c (rs6000_option_override_internal): Enable
7615 store fusion for Power10.
7616 (is_fusable_store): New.
7617 (power10_sched_reorder): Likewise.
7618 (rs6000_sched_reorder): Do Power10 specific reordering.
7619 (rs6000_sched_reorder2): Likewise.
7620 * config/rs6000/rs6000.opt: Add new option.
7622 2021-08-30 Richard Biener <rguenther@suse.de>
7624 PR tree-optimization/102128
7625 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
7626 Move scanning for if-converted scalar code to the caller
7627 and instead delay clearing the visited flag for profitable
7629 (vect_slp_region): Cost all subgraphs before scheduling.
7630 For if-converted BB vectorization scan for scalar COND_EXPRs
7631 and do not vectorize if any found and the cost model is
7634 2021-08-30 Richard Biener <rguenther@suse.de>
7636 * common.opt (fexceptions): Mark
7637 EnabledBy(fnon-call-exceptions).
7638 * doc/invoke.texi (fnon-call-exceptions): Document this
7639 enables -fexceptions.
7641 2021-08-30 Sebastian Huber <sebastian.huber@embedded-brains.de>
7643 * tsystem.h (abort): Define abort() if inhibit_libc is defined and it
7644 is not already defined.
7646 2021-08-30 liuhongt <hongtao.liu@intel.com>
7648 * expmed.c (extract_bit_field_1): Make sure we're playing with
7649 integral modes before call extract_integral_bit_field.
7650 (extract_integral_bit_field): Add a parameter of type
7651 scalar_int_mode which corresponds to of tmode.
7652 And call extract_and_convert_fixed_bit_field instead of
7653 extract_fixed_bit_field and convert_extracted_bit_field.
7654 (extract_and_convert_fixed_bit_field): New function, it's a
7655 combination of extract_fixed_bit_field and
7656 convert_extracted_bit_field.
7658 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
7660 * config/darwin.c (darwin_libc_has_function): Do not run
7661 the checks for x86 or modern Darwin. Make sure that there
7662 is a value set for darwin_macosx_version_min before testing.
7664 2021-08-29 Iain Sandoe <iain@sandoe.co.uk>
7666 * config/i386/darwin.h (CLEAR_INSN_CACHE): New.
7668 2021-08-28 Jan Hubicka <hubicka@ucw.cz>
7670 * ipa-modref-tree.h (modref_access_node::merge): Break out
7671 logic combining offsets and logic merging ranges to ...
7672 (modref_access_node::combined_offsets): ... here
7673 (modref_access_node::update2): ... here
7674 (modref_access_node::closer_pair_p): New member function.
7675 (modref_access_node::forced_merge): New member function.
7676 (modre_ref_node::insert): Do merging when table is full.
7678 2021-08-28 YunQiang Su <yunqiang.su@cipunited.com>
7681 * config.gcc: MIPS: use N64 ABI by default if the triple end
7682 with -gnuabi64, which is used by Debian since 2013.
7684 2021-08-28 Alexandre Oliva <oliva@adacore.com>
7686 * ipa-modref.c (analyze_function): Skip debug stmts.
7687 * tree-inline.c (estimate_num_insn): Consider builtins even
7688 without a cgraph_node.
7690 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
7692 * config/h8300/bitfield.md (cstore<mode>4): Remove expander.
7693 * config/h8300/h8300.c (h8300_expand_branch): Remove function.
7694 * config/h8300/h8300-protos.h (h8300_expadn_branch): Remove prototype.
7695 * config/h8300/h8300.md (eqne): New code iterator.
7696 (geultu, geultu_to_c): Similarly.
7697 * config/h8300/testcompare.md (cstore<mode>4): Dummy expander.
7698 (store_c_<mode>, store_c_i_<mode>): New define_insn_and_splits
7699 (cmp<mode>_c): New pattern
7701 2021-08-27 Jeff Law <jlaw@localhost.localdomain>
7703 * tree-ssa-dom.c (reduce_vector_comparison_to_scalar_comparison): New
7705 (dom_opt_dom_walker::optimize_stmt): Use it.
7707 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
7709 * config/darwin.c (finalize_ctors): Add a section-start linker-
7711 (finalize_dtors): Likewise.
7712 * config/darwin.h (MIN_LD64_INIT_TERM_START_LABELS): New.
7714 2021-08-27 Bill Schmidt <wschmidt@linux.ibm.com>
7716 * config/rs6000/rs6000-call.c (rs6000-builtins.h): New #include.
7717 (rs6000_init_builtins): Call rs6000_init_generated_builtins. Skip the
7718 old initialization logic when new builtins are enabled.
7719 * config/rs6000/rs6000-gen-builtins.c (write_decls): Rename
7720 rs6000_autoinit_builtins to rs6000_init_generated_builtins.
7721 (write_init_file): Likewise.
7723 2021-08-27 Iain Sandoe <iain@sandoe.co.uk>
7725 * configure.ac (darwin2[[0-9]]* | darwin19*): Alter use of
7726 gcc_GAS_CHECK_FEATURE to remove an extraneous parameter.
7727 (amdgcn-* | gcn-*) Likewise.
7729 2021-08-27 Anthony Sharp <anthonysharp15@gmail.com>
7731 * symbol-summary.h: Added missing template keyword.
7733 2021-08-27 Richard Biener <rguenther@suse.de>
7735 PR tree-optimization/45178
7736 * tree-ssa-dce.c (find_obviously_necessary_stmts): For
7737 infinite loops without exit do not mark control dependent
7738 edges of the latch necessary.
7740 2021-08-27 konglin1 <lingling.kong@intel.com>
7743 * config/i386/sse.md: (<avx512>scattersi<mode>): Add mask operand to
7745 (<avx512>scattersi<mode>): Likewise.
7746 (*avx512f_scattersi<VI48F:mode>): Merge mask operand to set_dest.
7747 (*avx512f_scatterdi<VI48F:mode>): Likewise
7749 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
7751 * config/rs6000/rs6000.c (rs6000_builtin_md_vectorized_function): Add
7752 support for built-in functions MISC_BUILTIN_DIVWE, MISC_BUILTIN_DIVWEU,
7753 MISC_BUILTIN_DIVDE, MISC_BUILTIN_DIVDEU, P10_BUILTIN_CFUGED,
7754 P10_BUILTIN_CNTLZDM, P10_BUILTIN_CNTTZDM, P10_BUILTIN_PDEPD and
7755 P10_BUILTIN_PEXTD on Power10.
7757 2021-08-27 Kewen Lin <linkw@linux.ibm.com>
7759 * config/rs6000/rs6000-call.c (builtin_function_type): Add unsigned
7760 signedness for some Power10 bifs.
7762 2021-08-27 David Edelsohn <dje.gcc@gmail.com>
7765 * config/rs6000/rs6000.c (rs6000_adjust_field_align): Use
7766 computed alignment if the entire struct has attribute packed.
7768 2021-08-27 liuhongt <hongtao.liu@intel.com>
7772 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
7773 IX86_BUILTIN_SHUFPD512, IX86_BUILTIN_SHUFPS512,
7774 IX86_BUILTIN_SHUFPD256, IX86_BUILTIN_SHUFPS,
7775 IX86_BUILTIN_SHUFPS256.
7776 (ix86_masked_all_ones): New function.
7778 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
7780 * config/i386/i386.md (*btr<mode>_1): Call force_reg unconditionally.
7781 (conditional moves with memory inputs splitters): Ditto.
7782 * config/i386/sse.md (one_cmpl<mode>2): Simplify.
7784 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
7786 * ipa-modref-tree.h (modref_access_node::try_merge_with): Restart
7787 search after merging.
7789 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
7791 * config/rs6000/rs6000-overload.def: Add remaining overloads.
7793 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
7795 * config/rs6000/rs6000-builtin-new.def: Add cell stanza.
7797 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
7799 * config/rs6000/rs6000-builtin-new.def: Add ieee128-hw, dfp,
7800 crypto, and htm stanzas.
7802 2021-08-26 Bill Schmidt <wschmidt@linux.ibm.com>
7804 * config/rs6000/rs6000-builtin-new.def: Add mma stanza.
7806 2021-08-26 Martin Sebor <msebor@redhat.com>
7808 * tree-ssa-uninit.c (warn_uninit): Refactor and simplify.
7809 (warn_uninit_phi_uses): Remove argument from calls to warn_uninit.
7810 (warn_uninitialized_vars): Same. Reduce visibility of locals.
7811 (warn_uninitialized_phi): Same.
7813 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
7815 * tree-ssa-ccp.c (get_individual_bits): Helper function to
7816 extract the individual bits from a widest_int constant (mask).
7817 (gray_code_bit_flips): New read-only table for effiently
7818 enumerating permutations/combinations of bits.
7819 (bit_value_binop) [LROTATE_EXPR, RROTATE_EXPR]: Handle rotates
7820 by unknown counts that are guaranteed less than the target
7821 precision and four or fewer unknown bits by enumeration.
7822 [LSHIFT_EXPR, RSHIFT_EXPR]: Likewise, also handle shifts by
7823 enumeration under the same conditions. Handle remaining
7824 shifts as a mask based upon the minimum possible shift value.
7826 2021-08-26 Roger Sayle <roger@nextmovesoftware.com>
7827 Richard Biener <rguenther@suse.de>
7829 * match.pd (shift transformations): Remove a redundant
7830 !POINTER_TYPE_P check.
7832 2021-08-26 Uroš Bizjak <ubizjak@gmail.com>
7835 * config/i386/i386.md (cmove reg-reg move elimination peephole2s):
7836 Set all_regs to true in the call to replace_rtx.
7838 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
7840 * ipa-modref-tree.c (test_insert_search_collapse): Update test.
7841 * ipa-modref-tree.h (modref_base_node::insert): Be smarter when
7842 hiting --param modref-max-refs limit.
7843 (modref_tree:insert_base): Be smarter when hitting
7844 --param modref-max-bases limit. Add new parameter REF.
7845 (modref_tree:insert): Update.
7846 (modref_tree:merge): Update.
7847 * ipa-modref.c (read_modref_records): Update.
7849 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
7851 * params.opt: (modref-max-adjustments): Add full stop.
7853 2021-08-26 Jan Hubicka <hubicka@ucw.cz>
7855 * ipa-modref-tree.h (modref_ref_node::verify): New member
7857 (modref_ref_node::insert): Use it.
7858 (modref_ref_node::try_mere_with): Fix off by one error.
7860 2021-08-26 Martin Liska <mliska@suse.cz>
7861 Stefan Kneifel <stefan.kneifel@bluewin.ch>
7863 * cgraph.h (create_version_clone_with_body): Add new parameter.
7864 * cgraphclones.c: Likewise.
7865 * multiple_target.c (create_dispatcher_calls): Do not use
7867 (create_target_clone): Likewise here.
7869 2021-08-26 Jonathan Yong <10walls@gmail.com>
7871 * doc/extend.texi: Add note about reserved priorities
7872 to the constructor attribute.
7874 2021-08-25 Martin Sebor <msebor@redhat.com>
7876 * gimple-range-cache.cc (ssa_global_cache::dump): Avoid printing
7877 range table header alone.
7878 * gimple-range.cc (gimple_ranger::export_global_ranges): Same.
7880 2021-08-25 Jan Hubicka <hubicka@ucw.cz>
7882 * doc/invoke.texi: Document --param modref-max-adjustments.
7883 * ipa-modref-tree.c (test_insert_search_collapse): Update.
7884 (test_merge): Update.
7885 * ipa-modref-tree.h (struct modref_access_node): Add adjustments;
7886 (modref_access_node::operator==): Fix handling of access ranges.
7887 (modref_access_node::contains): Constify parameter; handle also
7888 mismatched parm offsets.
7889 (modref_access_node::update): New function.
7890 (modref_access_node::merge): New function.
7891 (unspecified_modref_access_node): Update constructor.
7892 (modref_ref_node::insert_access): Add record_adjustments parameter;
7894 (modref_ref_node::try_merge_with): New private function.
7895 (modref_tree::insert): New record_adjustments parameter.
7896 (modref_tree::merge): New record_adjustments parameter.
7897 (modref_tree::copy_from): Update.
7898 * ipa-modref.c (dump_access): Dump adjustments field.
7899 (get_access): Update constructor.
7900 (record_access): Update call of insert.
7901 (record_access_lto): Update call of insert.
7902 (merge_call_side_effects): Add record_adjustments parameter.
7903 (get_access_for_fnspec): Update.
7904 (process_fnspec): Update.
7905 (analyze_call): Update.
7906 (analyze_function): Update.
7907 (read_modref_records): Update.
7908 (ipa_merge_modref_summary_after_inlining): Update.
7909 (propagate_unknown_call): Update.
7910 (modref_propagate_in_scc): Update.
7911 * params.opt (param-max-modref-adjustments=): New.
7913 2021-08-25 Michael Meissner <meissner@linux.ibm.com>
7915 * config/rs6000/vsx.md (UNSPEC_XXSPLTIDP): Rename from
7917 (xxspltiw_v4si): Use vecperm type attribute.
7918 (xxspltiw_v4si_inst): Use vecperm type attribute.
7919 (xxspltiw_v4sf_inst): Likewise.
7920 (xxspltidp_v2df): Use vecperm type attribute. Use
7921 UNSPEC_XXSPLTIDP instead of UNSPEC_XXSPLTID.
7922 (xxspltidp_v2df_inst): Likewise.
7923 (xxsplti32dx_v4si): Use vecperm type attribute.
7924 (xxsplti32dx_v4si_inst): Likewise.
7925 (xxsplti32dx_v4sf_inst): Likewise.
7926 (xxblend_<mode>): Likewise.
7927 (xxpermx): Likewise.
7928 (xxpermx_inst): Likewise.
7931 2021-08-25 Lewis Hyatt <lhyatt@gmail.com>
7934 * coretypes.h (typedef diagnostic_input_charset_callback): Declare.
7935 * diagnostic.c (diagnostic_initialize_input_context): New function.
7936 * diagnostic.h (diagnostic_initialize_input_context): Declare.
7937 * input.c (default_charset_callback): New function.
7938 (file_cache::initialize_input_context): New function.
7939 (file_cache_slot::create): Added ability to convert the input
7940 according to the input context.
7941 (file_cache::file_cache): Initialize the new input context.
7942 (class file_cache_slot): Added new m_alloc_offset member.
7943 (file_cache_slot::file_cache_slot): Initialize the new member.
7944 (file_cache_slot::~file_cache_slot): Handle potentially offset buffer.
7945 (file_cache_slot::maybe_grow): Likewise.
7946 (file_cache_slot::needs_read_p): Handle NULL fp, which is now possible.
7947 (file_cache_slot::get_next_line): Likewise.
7948 * input.h (class file_cache): Added input context member.
7950 2021-08-25 Richard Biener <rguenther@suse.de>
7952 PR tree-optimization/102046
7953 * tree-vect-slp.c (vect_build_slp_tree_2): Conservatively
7954 update ->any_pattern when swapping operands.
7956 2021-08-25 Hongyu Wang <hongyu.wang@intel.com>
7959 * config/i386/i386.c (ix86_live_on_entry): Adjust comment.
7960 (ix86_decompose_address): Remove retval check for ASHIFT,
7961 allow non-canonical zero extend if AND mask covers ASHIFT
7963 (ix86_legitimate_address_p): Adjust condition for decompose.
7964 (ix86_rtx_costs): Adjust cost for lea with non-canonical
7966 Co-Authored by: Uros Bizjak <ubizjak@gmail.com>
7968 2021-08-25 Jiufu Guo <guojiufu@linux.ibm.com>
7970 PR tree-optimization/101145
7971 * tree-ssa-loop-niter.c (number_of_iterations_until_wrap):
7973 (number_of_iterations_lt): Invoke above function.
7974 (adjust_cond_for_loop_until_wrap):
7975 Merge to number_of_iterations_until_wrap.
7976 (number_of_iterations_cond): Update invokes for
7977 adjust_cond_for_loop_until_wrap and number_of_iterations_lt.
7979 2021-08-25 konglin1 <lingling.kong@intel.com>
7982 * config/i386/avx512dqintrin.h (_mm512_fpclass_ps_mask): Fix
7984 (_mm512_mask_fpclass_ps_mask): Ditto.
7986 2021-08-25 Kewen Lin <linkw@linux.ibm.com>
7988 * config/rs6000/altivec.md (vec_unpacku_hi_v16qi): Remove.
7989 (vec_unpacku_hi_v8hi): Likewise.
7990 (vec_unpacku_lo_v16qi): Likewise.
7991 (vec_unpacku_lo_v8hi): Likewise.
7992 (vec_unpacku_hi_<VP_small_lc>): New define_expand.
7993 (vec_unpacku_lo_<VP_small_lc>): Likewise.
7995 2021-08-24 David Edelsohn <dje.gcc@gmail.com>
7997 * config/rs6000/aix.h (SYSTEM_IMPLICIT_EXTERN_C): Delete.
7998 * config/rs6000/aix71.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
7999 * config/rs6000/aix72.h (SYSTEM_IMPLICIT_EXTERN_C): Define.
8000 * config/rs6000/aix73.h (TARGET_AIX_VERSION): Increase to 73.
8002 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
8004 PR middle-end/102031
8005 * simplify-rtx.c (simplify_truncation): When comparing precisions
8006 use "subreg_prec" variable, not "subreg_mode".
8008 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
8010 * config/rs6000/rs6000-builtin-new.def: Add power10 and power10-64
8013 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
8015 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Initialize
8016 various pointer type nodes.
8017 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add enum
8018 values for various pointer types.
8019 (ptr_V16QI_type_node): New macro.
8020 (ptr_V1TI_type_node): New macro.
8021 (ptr_V2DI_type_node): New macro.
8022 (ptr_V2DF_type_node): New macro.
8023 (ptr_V4SI_type_node): New macro.
8024 (ptr_V4SF_type_node): New macro.
8025 (ptr_V8HI_type_node): New macro.
8026 (ptr_unsigned_V16QI_type_node): New macro.
8027 (ptr_unsigned_V1TI_type_node): New macro.
8028 (ptr_unsigned_V8HI_type_node): New macro.
8029 (ptr_unsigned_V4SI_type_node): New macro.
8030 (ptr_unsigned_V2DI_type_node): New macro.
8031 (ptr_bool_V16QI_type_node): New macro.
8032 (ptr_bool_V8HI_type_node): New macro.
8033 (ptr_bool_V4SI_type_node): New macro.
8034 (ptr_bool_V2DI_type_node): New macro.
8035 (ptr_bool_V1TI_type_node): New macro.
8036 (ptr_pixel_type_node): New macro.
8037 (ptr_intQI_type_node): New macro.
8038 (ptr_uintQI_type_node): New macro.
8039 (ptr_intHI_type_node): New macro.
8040 (ptr_uintHI_type_node): New macro.
8041 (ptr_intSI_type_node): New macro.
8042 (ptr_uintSI_type_node): New macro.
8043 (ptr_intDI_type_node): New macro.
8044 (ptr_uintDI_type_node): New macro.
8045 (ptr_intTI_type_node): New macro.
8046 (ptr_uintTI_type_node): New macro.
8047 (ptr_long_integer_type_node): New macro.
8048 (ptr_long_unsigned_type_node): New macro.
8049 (ptr_float_type_node): New macro.
8050 (ptr_double_type_node): New macro.
8051 (ptr_long_double_type_node): New macro.
8052 (ptr_dfloat64_type_node): New macro.
8053 (ptr_dfloat128_type_node): New macro.
8054 (ptr_ieee128_type_node): New macro.
8055 (ptr_ibm128_type_node): New macro.
8056 (ptr_vector_pair_type_node): New macro.
8057 (ptr_vector_quad_type_node): New macro.
8058 (ptr_long_long_integer_type_node): New macro.
8059 (ptr_long_long_unsigned_type_node): New macro.
8061 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
8063 * config/rs6000/rs6000-builtin-new.def: Add power9-vector, power9,
8064 and power9-64 stanzas.
8066 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
8067 Tom de Vries <tdevries@suse.de>
8069 * config.gcc (nvptx-*-*): Define {c,c++}_target_objs.
8070 * config/nvptx/nvptx-protos.h (nvptx_cpu_cpp_builtins): Prototype.
8071 * config/nvptx/nvptx.h (TARGET_CPU_CPP_BUILTINS): Implement with
8072 a call to the new nvptx_cpu_cpp_builtins function in nvptx-c.c.
8073 * config/nvptx/t-nvptx (nvptx-c.o): New rule.
8074 * config/nvptx/nvptx-c.c: New source file.
8075 (nvptx_cpu_cpp_builtins): Move implementation here.
8077 2021-08-24 Martin Sebor <msebor@redhat.com>
8079 PR middle-end/101600
8080 PR middle-end/101977
8081 * gimple-ssa-warn-access.cc (maybe_warn_for_bound): Tighten up
8082 the phrasing of a warning.
8083 (check_access): Use the remaining size after subtracting any offset
8084 rather than the whole object size.
8085 * pointer-query.cc (access_ref::get_ref): Clear BASE0 flag if it's
8086 clear for any nonnull PHI argument.
8087 (compute_objsize): Clear argument.
8089 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
8091 * config/rs6000/rs6000-builtin-new.def: Add power8-vector stanza.
8093 2021-08-24 Bill Schmidt <wschmidt@linux.ibm.com>
8095 * config/rs6000/rs6000-builtin-new.def: Add power7 and power7-64
8098 2021-08-24 Andrew MacLeod <amacleod@redhat.com>
8100 * value-relation.cc (rr_transitive_table): New.
8101 (relation_transitive): New.
8102 (value_relation::swap): Remove.
8103 (value_relation::apply_transitive): New.
8104 (relation_oracle::relation_oracle): Allocate a new tmp bitmap.
8105 (relation_oracle::register_relation): Call register_transitives.
8106 (relation_oracle::register_transitives): New.
8107 * value-relation.h (relation_oracle): Add new temporary bitmap and
8110 2021-08-24 H.J. Lu <hjl.tools@gmail.com>
8113 * config/i386/i386-expand.c (ix86_expand_vector_move): Broadcast
8114 from integer to a pseudo vector register.
8116 2021-08-24 Richard Biener <rguenther@suse.de>
8118 PR tree-optimization/100089
8119 * tree-vectorizer.h (vect_slp_bb): Rename to ...
8120 (vect_slp_if_converted_bb): ... this and get the original
8121 loop as new argument.
8122 * tree-vectorizer.c (try_vectorize_loop_1): Revert previous fix,
8123 pass original loop to vect_slp_if_converted_bb.
8124 * tree-vect-slp.c (vect_bb_vectorization_profitable_p):
8125 If orig_loop was passed scan the not vectorized stmts
8126 for COND_EXPRs and force not profitable if found.
8127 (vect_slp_region): Pass down all SLP instances to costing
8128 if orig_loop was specified.
8129 (vect_slp_bbs): Pass through orig_loop.
8130 (vect_slp_bb): Rename to ...
8131 (vect_slp_if_converted_bb): ... this and get the original
8132 loop as new argument.
8133 (vect_slp_function): Adjust.
8135 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
8138 * config/arm/arm.md (attribute arch): Add fix_vlldm.
8139 (arch_enabled): Use it.
8140 * config/arm/vfp.md (lazy_store_multiple_insn): Add alternative to
8141 use when erratum mitigation is needed.
8143 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
8146 * config/arm/arm.opt (mfix-cmse-cve-2021-35465): New option.
8147 * doc/invoke.texi (Arm Options): Document it.
8148 * config/arm/arm-cpus.in (quirk_vlldm): New feature bit.
8149 (ALL_QUIRKS): Add quirk_vlldm.
8150 (cortex-m33): Add quirk_vlldm.
8151 (cortex-m35p, cortex-m55): Likewise.
8152 * config/arm/arm.c (arm_option_override): Enable fix_vlldm if
8153 targetting an affected CPU and not explicitly controlled on
8156 2021-08-24 Richard Earnshaw <rearnsha@arm.com>
8158 * config/arm/vfp.md (lazy_store_multiple_insn): Rewrite as valid RTL.
8159 (lazy_load_multiple_insn): Likewise.
8161 2021-08-24 liuhongt <hongtao.liu@intel.com>
8164 * config/i386/sse.md (<avx512>_vternlog<mode><sd_maskz_name>):
8165 Enable avx512 embedded broadcast.
8166 (*<avx512>_vternlog<mode>_all): Ditto.
8167 (<avx512>_vternlog<mode>_mask): Ditto.
8169 2021-08-24 liuhongt <hongtao.liu@intel.com>
8172 * config/i386/i386.c (ix86_rtx_costs): Define cost for
8174 * config/i386/i386.h (STRIP_UNARY): New macro.
8175 * config/i386/predicates.md (reg_or_notreg_operand): New
8177 * config/i386/sse.md (*<avx512>_vternlog<mode>_all): New define_insn.
8178 (*<avx512>_vternlog<mode>_1): New pre_reload
8179 define_insn_and_split.
8180 (*<avx512>_vternlog<mode>_2): Ditto.
8181 (*<avx512>_vternlog<mode>_3): Ditto.
8182 (any_logic1,any_logic2): New code iterator.
8183 (logic_op): New code attribute.
8184 (ternlogsuffix): Extend to VNxDF and VNxSF.
8186 2021-08-24 Richard Biener <rguenther@suse.de>
8188 * doc/invoke.texi (vect-inner-loop-cost-factor): Adjust.
8189 * params.opt (--param vect-inner-loop-cost-factor): Adjust
8191 * tree-vect-loop.c (vect_analyze_loop_form): Initialize
8192 inner_loop_cost_factor to the minimum of the estimated number
8193 of iterations of the inner loop and vect-inner-loop-cost-factor.
8195 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
8196 Richard Biener <rguenther@suse.de>
8198 * config/i386/i386-features.c (compute_convert_gain): Provide
8199 more accurate values for CONST_INT, when optimizing for size.
8200 * config/i386/i386.c (COSTS_N_BYTES): Move definition from here...
8201 * config/i386/i386.h (COSTS_N_BYTES): to here.
8203 2021-08-24 Roger Sayle <roger@nextmovesoftware.com>
8204 Jakub Jelinek <jakub@redhat.com>
8206 PR middle-end/102029
8207 * match.pd (shift transformations): Add an additional check for
8208 !POINTER_TYPE_P in the recently added left shift transformation.
8210 2021-08-24 liuhongt <hongtao.liu@intel.com>
8212 PR tree-optimization/100089
8213 * tree-vectorizer.c (try_vectorize_loop_1): Disable slp in
8214 loop vectorizer when cost model is very-cheap.
8216 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
8218 * config/rs6000/rs6000-gen-builtins.c (parse_bif_entry): Don't call
8219 asprintf, which is not available on AIX.
8221 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
8223 * config.gcc (target_gtfiles): Add ./rs6000-builtins.h.
8224 * config/rs6000/t-rs6000 (EXTRA_GTYPE_DEPS): Set.
8226 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
8228 * config.gcc (powerpc*-*-*): Add rs6000-builtins.o to extra_objs.
8229 * config/rs6000/rs6000-gen-builtins.c (main): Close init_file
8231 * config/rs6000/t-rs6000 (rs6000-gen-builtins.o): New target.
8232 (rbtree.o): Likewise.
8233 (rs6000-gen-builtins): Likewise.
8234 (rs6000-builtins.c): Likewise.
8235 (rs6000-builtins.h): Likewise.
8236 (rs6000.o): Add dependency.
8237 (EXTRA_HEADERS): Add rs6000-vecdefines.h.
8238 (rs6000-vecdefines.h): New target.
8239 (rs6000-builtins.o): Likewise.
8240 (rs6000-call.o): Add rs6000-builtins.h as a dependency.
8241 (rs6000-c.o): Likewise.
8243 2021-08-23 Bill Schmidt <wschmidt@linux.ibm.com>
8246 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace):
8247 Diagnose buffer overrun.
8248 (safe_inc_pos): Fix overrun detection.
8249 (match_identifier): Diagnose buffer overrun.
8250 (match_integer): Likewise.
8251 (match_to_right_bracket): Likewise.
8253 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
8255 * ipa-modref-tree.h (modref_access_node::range_info_useful_p):
8256 Improve range compare.
8257 (modref_access_node::contains): New member function.
8258 (modref_access_node::search): Remove.
8259 (modref_access_node::insert): Be smarter about subaccesses.
8261 2021-08-23 Thomas Schwinge <thomas@codesourcery.com>
8263 * config/i386/i386-options.c (ix86_omp_device_kind_arch_isa)
8264 <omp_device_arch> [ACCEL_COMPILER]: Match "intel_mic".
8265 * config/i386/t-omp-device (omp-device-properties-i386) <arch>:
8268 2021-08-23 Jeff Law <jlaw@localhost.localdomain>
8270 * config/h8300/h8300-protos.h (h8300_expand_epilogue): Add new
8272 * config/h8300/jumpcall.md (call, call_value): Restrict to
8273 !SIBLING_CALL_P cases.
8274 (subcall, sibcall_value): New patterns & expanders.
8275 * config/h8300/proepi.md (epilogue): Pass new argument to
8276 h8300_expand_epilogue.
8277 (sibcall_epilogue): New expander.
8278 * config/h8300/h8300.c (h8300_expand_epilogue): Handle sibcall
8280 (h8300_ok_for_sibcall_p): New function.
8281 (TARGET_FUNCTION_OK_FOR_SIBCALL): define.
8283 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
8285 * simplify-rtx.c (simplify_unary_operation_1): [TRUNCATE]:
8286 Handle case where the operand is already the desired mode.
8288 2021-08-23 Richard Biener <rguenther@suse.de>
8291 * tree-ssa-structalias.c (ipa_pta_execute): Check in_other_partition
8292 in addition to has_gimple_body.
8294 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
8296 PR middle-end/101949
8297 * ipa-modref.c (analyze_ssa_name_flags): Fix merging of
8300 2021-08-23 Martin Liska <mliska@suse.cz>
8302 * doc/invoke.texi: Put the option out of -mxl-mode-app-model
8305 2021-08-23 Richard Biener <rguenther@suse.de>
8307 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
8308 Properly scale the inner loop cost only once.
8310 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
8312 * tree-ssa-ccp.c (bit_value_binop) [TRUNC_MOD_EXPR, TRUNC_DIV_EXPR]:
8313 Provide bounds for unsigned (and signed with non-negative operands)
8314 division and modulus.
8316 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
8318 * simplify-rtx.c (simplify_truncation): Generalize simplification
8319 of (truncate:A (subreg:B X)).
8320 (simplify_unary_operation_1) [FLOAT_TRUNCATE, FLOAT_EXTEND,
8321 SIGN_EXTEND, ZERO_EXTEND]: Handle cases where the operand
8322 already has the desired machine mode.
8323 (test_scalar_int_ops): Add tests that useless extensions and
8324 truncations are optimized away.
8325 (test_scalar_int_ext_ops): New self-test function to confirm
8326 that truncations of extensions are correctly simplified.
8327 (test_scalar_int_ext_ops2): New self-test function to check
8328 truncations of truncations, extensions of extensions, and
8329 truncations of extensions.
8330 (test_scalar_ops): Call the above two functions with a
8331 representative sampling of integer machine modes.
8333 2021-08-23 Roger Sayle <roger@nextmovesoftware.com>
8335 * match.pd (shift transformations): Change the sign of an
8336 LSHIFT_EXPR if it reduces the number of explicit conversions.
8338 2021-08-23 Jakub Jelinek <jakub@redhat.com>
8340 PR tree-optimization/86723
8341 * gimple-ssa-store-merging.c (find_bswap_or_nop_finalize): Add
8342 cast64_to_32 argument, set *cast64_to_32 to false, unless n is
8343 non-memory permutation of 64-bit src which only has bytes of
8344 0 or [5..8] and n->range is 4.
8345 (find_bswap_or_nop): Add cast64_to_32 and mask arguments, adjust
8346 find_bswap_or_nop_finalize caller, support bswap with some bytes
8347 zeroed, as long as at least two bytes are not zeroed.
8348 (bswap_replace): Add mask argument and handle masking of bswap
8350 (maybe_optimize_vector_constructor): Adjust find_bswap_or_nop
8351 caller, punt if cast64_to_32 or mask is not all ones.
8352 (pass_optimize_bswap::execute): Adjust find_bswap_or_nop_finalize
8353 caller, for now punt if cast64_to_32.
8355 2021-08-23 Richard Biener <rguenther@suse.de>
8357 PR tree-optimization/79334
8358 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Record
8359 a type also for COMPONENT_REFs.
8360 (vn_reference_may_trap): Check ARRAY_REF with constant index
8361 against the array domain.
8363 2021-08-23 liuhongt <hongtao.liu@intel.com>
8366 * config/i386/sse.md (*avx512f_pshufb_truncv8hiv8qi_1): Add
8367 TARGET_AVX512BW to condition.
8369 2021-08-23 Jakub Jelinek <jakub@redhat.com>
8372 * dwarf2out.c (gen_variable_die): Add DW_AT_location for global
8373 register variables already during early_dwarf if possible.
8375 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
8377 * config/arm/arm_mve.h: Fix __arm_vctp16q return type.
8379 2021-08-23 Christophe Lyon <christophe.lyon@foss.st.com>
8382 * config/arm/arm.opt: Fix typo.
8383 * config/arm/t-rmprofile: Fix typo.
8385 2021-08-23 Jakub Jelinek <jakub@redhat.com>
8387 * tree.h (OMP_CLAUSE_GRAINSIZE_STRICT): Define.
8388 (OMP_CLAUSE_NUM_TASKS_STRICT): Define.
8389 * tree-pretty-print.c (dump_omp_clause) <case OMP_CLAUSE_GRAINSIZE,
8390 case OMP_CLAUSE_NUM_TASKS>: Print strict: modifier.
8391 * omp-expand.c (expand_task_call): Use GOMP_TASK_FLAG_STRICT in iflags
8392 if either grainsize or num_tasks clause has the strict modifier.
8394 2021-08-23 Martin Liska <mliska@suse.cz>
8396 * dbgcnt.def (DEBUG_COUNTER): New counter.
8397 * gimple.c (gimple_call_arg_flags): Use it in IPA PTA.
8399 2021-08-23 Jan Hubicka <hubicka@ucw.cz>
8401 * ipa-modref.c (analyze_ssa_name_flags): Improve handling of return slot.
8403 2021-08-23 Xi Ruoyao <xry111@mengyan1223.wang>
8406 * config/mips/mips-protos.h (mips_msa_output_shift_immediate):
8408 * config/mips/mips.c (mips_msa_output_shift_immediate): New
8410 * config/mips/mips-msa.md (vashl<mode>3, vashr<mode>3,
8411 vlshr<mode>3): Call it.
8413 2021-08-22 Jan Hubicka <hubicka@ucw.cz>
8414 Martin Liska <mliska@suse.cz>
8416 PR middle-end/101949
8417 * ipa-modref.c (analyze_ssa_name_flags): Indirect call implies
8420 2021-08-21 Dragan Mladjenovic <OT_Dragan.Mladjenovic@mediatek.com>
8422 * config/mips/mips.c (mips_function_rodata_section,
8423 TARGET_ASM_FUNCTION_RODATA_SECTION): Removed.
8425 2021-08-21 John David Anglin <danglin@gcc.gnu.org>
8427 * config/pa/pa.c (pa_asm_output_aligned_common): Remove warning.
8429 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
8431 * configure.ac (thread-local storage support): Remove tls_first_major
8432 and tls_first_minor. Use "$conftest_s" to check support.
8433 * configure: Regenerate.
8435 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
8437 * configure.ac: Fixup formatting.
8439 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
8441 * acinclude.m4 (gcc_GAS_CHECK_FEATURE): Remove third argument and ...
8442 * configure.ac: ... update all callers.
8444 2021-08-20 Serge Belyshev <belyshev@depni.sinp.msu.ru>
8447 * acinclude.m4 (_gcc_COMPUTE_GAS_VERSION, _gcc_GAS_VERSION_GTE_IFELSE)
8448 (gcc_GAS_VERSION_GTE_IFELSE): Remove.
8449 (gcc_GAS_CHECK_FEATURE): Do not handle in-tree case specially.
8450 * configure.ac: Remove gcc_cv_gas_major_version, gcc_cv_gas_minor_version.
8451 Remove remaining checks for in-tree assembler.
8452 * configure: Regenerate.
8454 2021-08-20 Jeff Law <jlaw@localhost.localdomain>
8456 * config/h8300/h8300.c (shift_alg_hi): Improve arithmetic shift right
8457 by 15 bits for H8/300H and H8/S. Improve logical shifts by 12
8459 (shift_alg_si): Improve arithmetic right shift by 28-30 bits for
8460 H8/300H. Improve arithmetic shift right by 15 bits for H8/S.
8461 Improve logical shifts by 27 bits for H8/S.
8462 (get_shift_alg): Corresponding changes.
8463 (h8300_option_override): Revert to loops for -Os when profitable.
8465 2021-08-20 Richard Biener <rguenther@suse.de>
8467 * tree-vect-data-refs.c (dr_group_sort_cmp): Do not compare
8469 (vect_analyze_data_ref_accesses): Likewise. Assign the BB
8470 index as group_id when dataref_groups were not computed.
8471 * tree-vect-slp.c (vect_slp_bbs): Bump current_group when
8472 we advace to the next BB.
8474 2021-08-20 Jakub Jelinek <jakub@redhat.com>
8476 * omp-builtins.def (BUILT_IN_GOMP_WARNING, BUILT_IN_GOMP_ERROR): New
8479 2021-08-20 Martin Liska <mliska@suse.cz>
8481 PR gcov-profile/89961
8482 * gcov.c (make_gcov_file_name): Rewrite using std::string.
8483 (mangle_name): Simplify, do not used the second argument.
8484 (strip_extention): New function.
8485 (get_md5sum): Likewise.
8486 (get_gcov_intermediate_filename): Handle properly -p and -x
8488 (output_gcov_file): Use string type.
8489 (generate_results): Likewise.
8490 (md5sum_to_hex): Remove.
8492 2021-08-20 Michael Meissner <meissner@linux.ibm.com>
8494 * config/rs6000/altivec.md (UNSPEC_XXEVAL): Move to vsx.md.
8495 (UNSPEC_XXSPLTIW): Move to vsx.md.
8496 (UNSPEC_XXSPLTID): Move to vsx.md.
8497 (UNSPEC_XXSPLTI32DX): Move to vsx.md.
8498 (UNSPEC_XXBLEND): Move to vsx.md.
8499 (UNSPEC_XXPERMX): Move to vsx.md.
8500 (VM3): Move to vsx.md.
8501 (VM3_char): Move to vsx.md.
8502 (xxspltiw_v4si): Move to vsx.md.
8503 (xxspltiw_v4sf): Move to vsx.md.
8504 (xxspltiw_v4sf_inst): Move to vsx.md.
8505 (xxspltidp_v2df): Move to vsx.md.
8506 (xxspltidp_v2df_inst): Move to vsx.md.
8507 (xxsplti32dx_v4si_inst): Move to vsx.md.
8508 (xxsplti32dx_v4sf): Move to vsx.md.
8509 (xxsplti32dx_v4sf_inst): Move to vsx.md.
8510 (xxblend_<mode>): Move to vsx.md.
8511 (xxpermx): Move to vsx.md.
8512 (xxpermx_inst): Move to vsx.md.
8513 * config/rs6000/vsx.md (UNSPEC_XXEVAL): Move from altivec.md.
8514 (UNSPEC_XXSPLTIW): Move from altivec.md.
8515 (UNSPEC_XXSPLTID): Move from altivec.md.
8516 (UNSPEC_XXSPLTI32DX): Move from altivec.md.
8517 (UNSPEC_XXBLEND): Move from altivec.md.
8518 (UNSPEC_XXPERMX): Move from altivec.md.
8519 (VM3): Move from altivec.md.
8520 (VM3_char): Move from altivec.md.
8521 (xxspltiw_v4si): Move from altivec.md.
8522 (xxspltiw_v4sf): Move from altivec.md.
8523 (xxspltiw_v4sf_inst): Move from altivec.md.
8524 (xxspltidp_v2df): Move from altivec.md.
8525 (xxspltidp_v2df_inst): Move from altivec.md.
8526 (xxsplti32dx_v4si_inst): Move from altivec.md.
8527 (xxsplti32dx_v4sf): Move from altivec.md.
8528 (xxsplti32dx_v4sf_inst): Move from altivec.md.
8529 (xxblend_<mode>): Move from altivec.md.
8530 (xxpermx): Move from altivec.md.
8531 (xxpermx_inst): Move from altivec.md.
8533 2021-08-19 Roger Sayle <roger@nextmovesoftware.com>
8535 * tree-vect-generic.c (expand_vector_operations_1): Use either
8536 gimplify_build1 or gimplify_build2 instead of gimple_build_assign
8537 when constructing scalar splat expressions.
8539 2021-08-19 Peter Bergner <bergner@linux.ibm.com>
8542 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Cast
8543 pointer to __vector_pair *.
8545 2021-08-19 Martin Sebor <msebor@redhat.com>
8547 * gimple-range.cc: Add comments.
8548 * gimple-range.h: Same.
8550 2021-08-19 Martin Sebor <msebor@redhat.com>
8552 PR middle-end/101984
8553 * gimple-ssa-warn-access.cc (pass_waccess::execute): Also call
8556 2021-08-19 Jeff Law <jlaw@localhost.localdomain>
8558 * config.gcc (h8300-*-elf*): Do not include dbxelf.h.
8559 (h8300-*-linux*, v850-*-rtems*, v850*-elf*): Likewise.
8560 * config/v850/v850.h (DEFAULT_GDB_EXTENSIONS): Remove.
8562 2021-08-19 Jakub Jelinek <jakub@redhat.com>
8564 PR middle-end/101950
8565 * optabs.c (expand_clrsb_using_clz): New function.
8566 (expand_unop): Use it as another clrsb expansion fallback.
8568 2021-08-19 liuhongt <hongtao.liu@intel.com>
8571 2021-07-28 liuhongt <hongtao.liu@intel.com>
8574 * config/i386/i386.h (processor_costs): Add new member
8576 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
8577 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
8578 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
8579 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
8580 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
8581 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
8582 generic_cost, core_cost): Initialize integer_to_sse same value
8584 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
8585 * config/i386/i386.c (ix86_builtin_vectorization_cost):
8586 Use integer_to_sse instead of sse_op to calculate the cost of
8589 2021-08-18 Iain Sandoe <iain@sandoe.co.uk>
8591 * config.gcc: Include rpath.opt for Darwin.
8592 * config/darwin.h (DRIVER_SELF_SPECS): Handle -rpath.
8594 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
8597 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor_expand):
8600 2021-08-18 Jonathan Wright <jonathan.wright@arm.com>
8602 * config/aarch64/arm_neon.h (vld3_lane_f64): Use float RTL
8603 pattern and type cast.
8604 (vld4_lane_f32): Use float RTL pattern.
8605 (vld4q_lane_f64): Use float type cast.
8607 2021-08-18 Jan Hubicka <hubicka@ucw.cz>
8609 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Check also
8612 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
8614 * hash-map-tests.c (test_map_of_type_with_ctor_and_dtor): Extend.
8615 (test_map_of_type_with_ctor_and_dtor_expand): Add function.
8616 (hash_map_tests_c_tests): Call it.
8618 2021-08-18 Thomas Schwinge <thomas@codesourcery.com>
8620 * ggc.h (enum ggc_collect): New.
8621 (ggc_collect): Use it.
8622 * ggc-page.c: Adjust.
8623 * ggc-common.c: Likewise.
8624 * ggc-tests.c: Likewise.
8625 * read-rtl-function.c: Likewise.
8626 * selftest-run-tests.c: Likewise.
8627 * doc/gty.texi (Invoking the garbage collector): Likewise.
8629 2021-08-18 liuhongt <hongtao.liu@intel.com>
8632 * config/i386/i386.h (TARGET_V2DF_REDUCTION_PREFER_HADDPD):
8634 * config/i386/sse.md (*sse3_haddv2df3_low): Add
8635 TARGET_V2DF_REDUCTION_PREFER_HADDPD.
8636 (*sse3_hsubv2df3_low): Ditto.
8637 * config/i386/x86-tune.def
8638 (X86_TUNE_V2DF_REDUCTION_PREFER_HADDPD): New tune.
8640 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
8642 * gimple-range-gori.cc (gori_compute::gori_compute): Enable tracing.
8643 (gori_compute::compute_operand_range): Add tracing.
8644 (gori_compute::logical_combine): Ditto.
8645 (gori_compute::compute_logical_operands): Ditto.
8646 (gori_compute::compute_operand1_range): Ditto.
8647 (gori_compute::compute_operand2_range): Ditto.
8648 (gori_compute::outgoing_edge_range_p): Ditto.
8649 * gimple-range-gori.h (class gori_compute): Add range_tracer.
8651 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
8653 * flag-types.h (enum evrp_mode): Adjust evrp-mode values.
8654 * gimple-range-cache.cc (DEBUG_RANGE_CACHE): Relocate from.
8655 * gimple-range-trace.h (DEBUG_RANGE_CACHE): Here.
8656 * params.opt (--param=evrp-mode): Adjust options.
8658 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
8660 * Makefile.in (OBJS): Add gimple-range-trace.o.
8661 * gimple-range-cache.h (enable_new_values): Remove unused prototype.
8662 * gimple-range-fold.cc: Adjust headers.
8663 * gimple-range-trace.cc: New.
8664 * gimple-range-trace.h: New.
8665 * gimple-range.cc (gimple_ranger::gimple_ranger): Enable tracer.
8666 (gimple_ranger::range_of_expr): Add tracing.
8667 (gimple_ranger::range_on_entry): Ditto.
8668 (gimple_ranger::range_on_exit): Ditto.
8669 (gimple_ranger::range_on_edge): Ditto.
8670 (gimple_ranger::fold_range_internal): Ditto.
8671 (gimple_ranger::dump_bb): Do not calculate edge range twice.
8672 (trace_ranger::*): Remove.
8673 (enable_ranger): Never create a trace_ranger.
8674 (debug_seed_ranger): Move to gimple-range-trace.cc.
8675 (dump_ranger): Ditto.
8676 (debug_ranger): Ditto.
8677 * gimple-range.h: Include gimple-range-trace.h.
8678 (range_on_entry, range_on_exit): No longer virtual.
8679 (class trace_ranger): Remove.
8680 (DEBUG_RANGE_CACHE): Move to gimple-range-trace.h.
8682 2021-08-17 Martin Sebor <msebor@redhat.com>
8684 PR middle-end/101854
8685 * builtins.c (expand_builtin_alloca): Move warning code to check_alloca
8686 in gimple-ssa-warn-access.cc.
8687 * calls.c (alloc_max_size): Move code to check_alloca.
8688 (get_size_range): Move to pointer-query.cc.
8689 (maybe_warn_alloc_args_overflow): Move to gimple-ssa-warn-access.cc.
8690 (get_attr_nonstring_decl): Move to tree.c.
8691 (fntype_argno_type): Move to gimple-ssa-warn-access.cc.
8692 (append_attrname): Same.
8693 (maybe_warn_rdwr_sizes): Same.
8694 (initialize_argument_information): Move code to
8695 gimple-ssa-warn-access.cc.
8696 * calls.h (maybe_warn_alloc_args_overflow): Move to
8697 gimple-ssa-warn-access.h.
8698 (get_attr_nonstring_decl): Move to tree.h.
8699 (maybe_warn_nonstring_arg): Move to gimple-ssa-warn-access.h.
8700 (enum size_range_flags): Move to pointer-query.h.
8701 (get_size_range): Same.
8702 * gimple-ssa-warn-access.cc (has_location): Remove unused overload
8703 to avoid Clang -Wunused-function.
8704 (get_size_range): Declare static.
8705 (maybe_emit_free_warning): Rename...
8706 (maybe_check_dealloc_call): ...to this for consistency.
8707 (class pass_waccess): Add members.
8708 (pass_waccess::~pass_waccess): Defined.
8709 (alloc_max_size): Move here from calls.c.
8710 (maybe_warn_alloc_args_overflow): Same.
8711 (check_alloca): New function.
8712 (check_alloc_size_call): New function.
8713 (check_strncat): Handle another warning flag.
8714 (pass_waccess::check_builtin): Handle alloca.
8715 (fntype_argno_type): Move here from calls.c.
8716 (append_attrname): Same.
8717 (maybe_warn_rdwr_sizes): Same.
8718 (pass_waccess::check_call): Define.
8719 (check_nonstring_args): New function.
8720 (pass_waccess::check): Call new member functions.
8721 (pass_waccess::execute): Enable ranger.
8722 * gimple-ssa-warn-access.h (get_size_range): Move here from calls.h.
8723 (maybe_warn_nonstring_arg): Same.
8724 * gimple-ssa-warn-restrict.c: Remove #include.
8725 * pointer-query.cc (get_size_range): Move here from calls.c.
8726 * pointer-query.h (enum size_range_flags): Same.
8727 (get_size_range): Same.
8728 * tree.c (get_attr_nonstring_decl): Move here from calls.c.
8729 * tree.h (get_attr_nonstring_decl): Move here from calls.h.
8731 2021-08-17 Thomas Schwinge <thomas@codesourcery.com>
8733 * ggc.h (ggc_collect): Add 'force_collect' parameter.
8734 * ggc-page.c (ggc_collect): Use that one instead of global
8735 'ggc_force_collect'. Adjust all users.
8736 * doc/gty.texi (Invoking the garbage collector): Update.
8737 * ggc-internal.h (ggc_force_collect): Remove.
8738 * ggc-common.c (ggc_force_collect): Likewise.
8739 * selftest.h (forcibly_ggc_collect): Remove.
8740 * ggc-tests.c (selftest::forcibly_ggc_collect): Likewise.
8741 * read-rtl-function.c (test_loading_labels): Adjust.
8742 * selftest-run-tests.c (run_tests): Likewise.
8744 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
8746 * config/darwin.c (darwin_file_end): Reset and reclaim the
8747 section names table at the end of compile.
8749 2021-08-17 Iain Sandoe <iain@sandoe.co.uk>
8752 * config.in: Regenerate.
8753 * config/i386/darwin.h (EXTRA_ASM_OPTS): New
8754 (ASM_SPEC): Pass options to disable branch shortening where
8756 * configure: Regenerate.
8757 * configure.ac: Detect versions of 'as' that support the
8758 optimisation which has the bug.
8760 2021-08-17 Richard Biener <rguenther@suse.de>
8762 * optabs-query.c (supports_vec_gather_load_p): Also check
8764 (supports_vec_scatter_store_p): Likewise.
8765 * tree-vect-data-refs.c (vect_gather_scatter_fn_p): Fall
8766 back to masked variants if non-masked are not supported.
8767 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
8768 When we need to use masked gather/scatter but do not have
8769 a mask set up a constant true one.
8770 * tree-vect-stmts.c (vect_check_scalar_mask): Also allow
8773 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
8775 * tree-ssa-ccp.c (bit_value_binop) [MINUS_EXPR]: Use same
8776 algorithm as PLUS_EXPR to improve subtraction bit bounds.
8777 [POINTER_DIFF_EXPR]: Treat as synonymous with MINUS_EXPR.
8779 2021-08-17 Roger Sayle <roger@nextmovesoftware.com>
8781 * tree-ssa-ccp.c (bit_value_mult_const): New helper function to
8782 calculate the mask-value pair result of a multiplication by an
8784 (bit_value_binop) [MULT_EXPR]: Call it from here for
8785 multiplications by (sparse) non-negative constants.
8787 2021-08-17 Christophe Lyon <christophe.lyon@foss.st.com>
8790 * config.gcc (gcc_cv_initfini_array): Leave undefined for
8791 uclinuxfdpiceabi targets.
8793 2021-08-17 Alexandre Oliva <oliva@adacore.com>
8795 * tree-inline.c (maybe_move_debug_stmts_to_successors): Don't
8796 reverse debug stmts.
8798 2021-08-17 Alexandre Oliva <oliva@adacore.com>
8800 * tree-cfg.c (dump_function_to_file): Use fun, not cfun.
8802 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
8804 * config/aarch64/arm_neon.h (__LD4_LANE_FUNC): Delete.
8805 (__LD4Q_LANE_FUNC): Likewise.
8806 (vld4_lane_u8): Define without macro.
8807 (vld4_lane_u16): Likewise.
8808 (vld4_lane_u32): Likewise.
8809 (vld4_lane_u64): Likewise.
8810 (vld4_lane_s8): Likewise.
8811 (vld4_lane_s16): Likewise.
8812 (vld4_lane_s32): Likewise.
8813 (vld4_lane_s64): Likewise.
8814 (vld4_lane_f16): Likewise.
8815 (vld4_lane_f32): Likewise.
8816 (vld4_lane_f64): Likewise.
8817 (vld4_lane_p8): Likewise.
8818 (vld4_lane_p16): Likewise.
8819 (vld4_lane_p64): Likewise.
8820 (vld4q_lane_u8): Likewise.
8821 (vld4q_lane_u16): Likewise.
8822 (vld4q_lane_u32): Likewise.
8823 (vld4q_lane_u64): Likewise.
8824 (vld4q_lane_s8): Likewise.
8825 (vld4q_lane_s16): Likewise.
8826 (vld4q_lane_s32): Likewise.
8827 (vld4q_lane_s64): Likewise.
8828 (vld4q_lane_f16): Likewise.
8829 (vld4q_lane_f32): Likewise.
8830 (vld4q_lane_f64): Likewise.
8831 (vld4q_lane_p8): Likewise.
8832 (vld4q_lane_p16): Likewise.
8833 (vld4q_lane_p64): Likewise.
8834 (vld4_lane_bf16): Likewise.
8835 (vld4q_lane_bf16): Likewise.
8837 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
8839 * config/aarch64/arm_neon.h (__LD3_LANE_FUNC): Delete.
8840 (__LD3Q_LANE_FUNC): Delete.
8841 (vld3_lane_u8): Define without macro.
8842 (vld3_lane_u16): Likewise.
8843 (vld3_lane_u32): Likewise.
8844 (vld3_lane_u64): Likewise.
8845 (vld3_lane_s8): Likewise.
8846 (vld3_lane_s16): Likewise.
8847 (vld3_lane_s32): Likewise.
8848 (vld3_lane_s64): Likewise.
8849 (vld3_lane_f16): Likewise.
8850 (vld3_lane_f32): Likewise.
8851 (vld3_lane_f64): Likewise.
8852 (vld3_lane_p8): Likewise.
8853 (vld3_lane_p16): Likewise.
8854 (vld3_lane_p64): Likewise.
8855 (vld3q_lane_u8): Likewise.
8856 (vld3q_lane_u16): Likewise.
8857 (vld3q_lane_u32): Likewise.
8858 (vld3q_lane_u64): Likewise.
8859 (vld3q_lane_s8): Likewise.
8860 (vld3q_lane_s16): Likewise.
8861 (vld3q_lane_s32): Likewise.
8862 (vld3q_lane_s64): Likewise.
8863 (vld3q_lane_f16): Likewise.
8864 (vld3q_lane_f32): Likewise.
8865 (vld3q_lane_f64): Likewise.
8866 (vld3q_lane_p8): Likewise.
8867 (vld3q_lane_p16): Likewise.
8868 (vld3q_lane_p64): Likewise.
8869 (vld3_lane_bf16): Likewise.
8870 (vld3q_lane_bf16): Likewise.
8872 2021-08-17 Jonathan Wright <jonathan.wright@arm.com>
8874 * config/aarch64/arm_neon.h (__LD2_LANE_FUNC): Delete.
8875 (__LD2Q_LANE_FUNC): Likewise.
8876 (vld2_lane_u8): Define without macro.
8877 (vld2_lane_u16): Likewise.
8878 (vld2_lane_u32): Likewise.
8879 (vld2_lane_u64): Likewise.
8880 (vld2_lane_s8): Likewise.
8881 (vld2_lane_s16): Likewise.
8882 (vld2_lane_s32): Likewise.
8883 (vld2_lane_s64): Likewise.
8884 (vld2_lane_f16): Likewise.
8885 (vld2_lane_f32): Likewise.
8886 (vld2_lane_f64): Likewise.
8887 (vld2_lane_p8): Likewise.
8888 (vld2_lane_p16): Likewise.
8889 (vld2_lane_p64): Likewise.
8890 (vld2q_lane_u8): Likewise.
8891 (vld2q_lane_u16): Likewise.
8892 (vld2q_lane_u32): Likewise.
8893 (vld2q_lane_u64): Likewise.
8894 (vld2q_lane_s8): Likewise.
8895 (vld2q_lane_s16): Likewise.
8896 (vld2q_lane_s32): Likewise.
8897 (vld2q_lane_s64): Likewise.
8898 (vld2q_lane_f16): Likewise.
8899 (vld2q_lane_f32): Likewise.
8900 (vld2q_lane_f64): Likewise.
8901 (vld2q_lane_p8): Likewise.
8902 (vld2q_lane_p16): Likewise.
8903 (vld2q_lane_p64): Likewise.
8904 (vld2_lane_bf16): Likewise.
8905 (vld2q_lane_bf16): Likewise.
8907 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
8909 * haifa-sched.c (advance_one_cycle): Output more context-synchronization
8912 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
8914 * haifa-sched.c (enum rfs_decision, rfs_str): Add RFS_AUTOPREF.
8915 (rank_for_schedule): Use it.
8917 2021-08-17 Maxim Kuvyrkov <maxim.kuvyrkov@linaro.org>
8919 PR rtl-optimization/91598
8920 * haifa-sched.c (autopref_rank_for_schedule): Prioritize "irrelevant"
8921 insns after memory reads and before memory writes.
8923 2021-08-17 Alistair_Lee <alistair.lee@arm.com>
8925 * rtl.h (CONST_VECTOR_P): New macro.
8926 * config/aarch64/aarch64.c (aarch64_get_sve_pred_bits): Use RTL
8927 code testing macros.
8928 (aarch64_ptrue_all_mode): Likewise.
8929 (aarch64_expand_mov_immediate): Likewise.
8930 (aarch64_const_vec_all_in_range_p): Likewise.
8931 (aarch64_rtx_costs): Likewise.
8932 (aarch64_legitimate_constant_p): Likewise.
8933 (aarch64_simd_valid_immediate): Likewise.
8934 (aarch64_simd_make_constant): Likewise.
8935 (aarch64_convert_mult_to_shift): Likewise.
8936 (aarch64_expand_sve_vec_perm): Likewise.
8937 (aarch64_vec_fpconst_pow_of_2): Likewise.
8939 2021-08-17 Andrew MacLeod <amacleod@redhat.com>
8941 PR tree-optimization/101938
8942 * range-op.cc (operator_abs::op1_range): Special case
8943 -TYPE_MIN_VALUE for flag_wrapv.
8945 2021-08-17 Kewen Lin <linkw@linux.ibm.com>
8947 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Add the cost for
8950 2021-08-17 Jakub Jelinek <jakub@redhat.com>
8952 * tree.def (OMP_SCOPE): New tree code.
8953 * tree.h (OMP_SCOPE_BODY, OMP_SCOPE_CLAUSES): Define.
8954 * tree-nested.c (convert_nonlocal_reference_stmt,
8955 convert_local_reference_stmt, convert_gimple_call): Handle
8957 * tree-pretty-print.c (dump_generic_node): Handle OMP_SCOPE.
8958 * gimple.def (GIMPLE_OMP_SCOPE): New gimple code.
8959 * gimple.c (gimple_build_omp_scope): New function.
8960 (gimple_copy): Handle GIMPLE_OMP_SCOPE.
8961 * gimple.h (gimple_build_omp_scope): Declare.
8962 (gimple_has_substatements): Handle GIMPLE_OMP_SCOPE.
8963 (gimple_omp_scope_clauses, gimple_omp_scope_clauses_ptr,
8964 gimple_omp_scope_set_clauses): New inline functions.
8965 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_SCOPE.
8966 * gimple-pretty-print.c (dump_gimple_omp_scope): New function.
8967 (pp_gimple_stmt_1): Handle GIMPLE_OMP_SCOPE.
8968 * gimple-walk.c (walk_gimple_stmt): Likewise.
8969 * gimple-low.c (lower_stmt): Likewise.
8970 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
8971 (gimplify_scan_omp_clauses): For task reductions, handle OMP_SCOPE
8972 like ORT_WORKSHARE constructs. Adjust diagnostics for %<scope%>
8973 allowing task reductions. Reject inscan reductions on scope.
8974 (omp_find_stores_stmt): Handle GIMPLE_OMP_SCOPE.
8975 (gimplify_omp_workshare, gimplify_expr): Handle OMP_SCOPE.
8976 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_SCOPE.
8977 (estimate_num_insns): Likewise.
8978 * omp-low.c (build_outer_var_ref): Look through GIMPLE_OMP_SCOPE
8979 contexts if var isn't privatized there.
8980 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_SCOPE.
8981 (scan_omp_1_stmt): Likewise.
8982 (maybe_add_implicit_barrier_cancel): Look through outer
8984 (lower_omp_scope): New function.
8985 (lower_omp_task_reductions): Handle OMP_SCOPE.
8986 (lower_omp_1): Handle GIMPLE_OMP_SCOPE.
8987 (diagnose_sb_1, diagnose_sb_2): Likewise.
8988 * omp-expand.c (expand_omp_single): Support also GIMPLE_OMP_SCOPE.
8989 (expand_omp): Handle GIMPLE_OMP_SCOPE.
8990 (omp_make_gimple_edges): Likewise.
8991 * omp-builtins.def (BUILT_IN_GOMP_SCOPE_START): New built-in.
8993 2021-08-17 Richard Biener <rguenther@suse.de>
8995 PR tree-optimization/101925
8996 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Set
8997 reverse on COMPONENT_REF and ARRAY_REF according to
8998 what reverse_storage_order_for_component_p does.
8999 (vn_reference_eq): Compare reversed on reference ops.
9000 (reverse_storage_order_for_component_p): New overload.
9001 (vn_reference_lookup_3): Check reverse_storage_order_for_component_p
9002 on the reference looked up.
9004 2021-08-17 Jeff Law <jlaw@localhost.localdomain>
9006 * config/h8300/h8300.c (shift_alg_si): Avoid loops for most SImode
9008 (h8300_option_override): Use loops on H8/S more often when optimizing
9010 (get_shift_alg): Handle new "special" cases on H8/S. Simplify
9011 accordingly. Handle various arithmetic right shifts with special
9012 sequences that we couldn't handle before.
9014 2021-08-16 Jeff Law <jlaw@localhost.localdomain>
9016 * config.gcc (rl78-*-elf*): Do not include dbxelf.h.
9018 2021-08-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
9020 * config/sparc/rtemself.h (SPARC_GCOV_TYPE_SIZE): Define.
9021 * config/sparc/sparc.c (sparc_gcov_type_size): New.
9022 (TARGET_GCOV_TYPE_SIZE): Redefine if SPARC_GCOV_TYPE_SIZE is defined.
9023 * coverage.c (get_gcov_type): Use targetm.gcov_type_size().
9024 * doc/tm.texi (TARGET_GCOV_TYPE_SIZE): Add hook under "Misc".
9025 * doc/tm.texi.in: Regenerate.
9026 * target.def (gcov_type_size): New target hook.
9027 * targhooks.c (default_gcov_type_size): New.
9028 * targhooks.h (default_gcov_type_size): Declare.
9029 * tree-profile.c (gimple_gen_edge_profiler): Use precision of
9031 (gimple_gen_time_profiler): Likewise.
9033 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
9035 * dwarf2out.c (add_scalar_info): Deal with DW_AT_data_bit_offset.
9037 2021-08-16 Tobias Burnus <tobias@codesourcery.com>
9039 PR middle-end/101931
9040 * omp-low.c (omp_runtime_api_call): Update for routines
9041 added in the meanwhile.
9043 2021-08-16 Martin Liska <mliska@suse.cz>
9045 PR tree-optimization/100393
9046 * tree-switch-conversion.c (group_cluster::dump): Use
9047 get_comparison_count.
9048 (jump_table_cluster::find_jump_tables): Pre-compute number of
9049 comparisons and then decrement it. Cache also max_ratio.
9050 (jump_table_cluster::can_be_handled): Change signature.
9051 * tree-switch-conversion.h (get_comparison_count): New.
9053 2021-08-16 Eric Botcazou <ebotcazou@gcc.gnu.org>
9055 * dwarf2out.c (add_data_member_location_attribute): Use GNAT
9056 encodings only when -fgnat-encodings=all is specified.
9057 (add_bound_info): Likewise.
9058 (add_byte_size_attribute): Likewise.
9059 (gen_member_die): Likewise.
9061 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
9063 * omp-oacc-neuter-broadcast.cc
9064 (execute_omp_oacc_neuter_broadcast): Plug 'par' memory leak.
9066 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
9068 * omp-oacc-neuter-broadcast.cc
9069 (execute_omp_oacc_neuter_broadcast): Clarify memory management for
9072 2021-08-16 Thomas Schwinge <thomas@codesourcery.com>
9074 * omp-oacc-neuter-broadcast.cc (field_map): Move variable into...
9075 (execute_omp_oacc_neuter_broadcast): ... here.
9076 (install_var_field, build_receiver_ref, build_sender_ref): Take
9077 'field_map_t *' parameter. Adjust all users.
9078 (worker_single_copy, neuter_worker_single): Take a
9079 'record_field_map_t *' parameter. Adjust all users.
9081 2021-08-16 liuhongt <hongtao.liu@intel.com>
9084 * config/i386/i386.md (ldexp<mode>3): Force operands[1] to
9087 2021-08-16 Martin Liska <mliska@suse.cz>
9090 * multiple_target.c (create_dispatcher_calls): Make default
9091 function local only if it is a definition.
9093 2021-08-16 Martin Liska <mliska@suse.cz>
9096 * ipa-icf-gimple.c (func_checker::compare_ssa_name): Do not
9097 consider equal SSA_NAMEs when one is a param.
9099 2021-08-16 liuhongt <hongtao.liu@intel.com>
9102 * config/i386/i386-expand.c (ix86_expand_vec_perm_vpermt2):
9103 Support vpermi2b for V32QI/V16QImode.
9104 (ix86_extract_perm_from_pool_constant): New function.
9105 (ix86_expand_vec_one_operand_perm_avx512): Support
9106 vpermw/vpermb under TARGET_AVX512BW/TARGET_AVX512VBMI.
9107 (expand_vec_perm_1): Adjust comments for upper.
9108 * config/i386/i386-protos.h (ix86_extract_perm_from_pool_constant):
9110 * config/i386/predicates.md (permvar_truncate_operand): New predicate.
9111 (pshufb_truncv4siv4hi_operand): Ditto.
9112 (pshufb_truncv8hiv8qi_operand): Ditto.
9113 * config/i386/sse.md (*avx512bw_permvar_truncv16siv16hi_1):
9114 New pre_reload define_insn_and_split.
9115 (*avx512f_permvar_truncv8siv8hi_1): Ditto.
9116 (*avx512f_vpermvar_truncv8div8si_1): Ditto.
9117 (*avx512f_permvar_truncv32hiv32qi_1): Ditto.
9118 (*avx512f_permvar_truncv16hiv16qi_1): Ditto.
9119 (*avx512f_permvar_truncv4div4si_1): Ditto.
9120 (*avx512f_pshufb_truncv8hiv8qi_1): Ditto.
9121 (*avx512f_pshufb_truncv4siv4hi_1): Ditto.
9122 (*avx512f_pshufd_truncv2div2si_1): Ditto.
9124 2021-08-16 Kito Cheng <kito.cheng@sifive.com>
9126 * config/riscv/multilib-generator: Support code model option for
9128 * doc/install.texi: Add document of new option for
9129 --with-multilib-generator.
9131 2021-08-15 Clément Chigot <clement.chigot@atos.net>
9133 * config/rs6000/rs6000.c (xcoff_tls_exec_model_detected): New.
9134 (rs6000_legitimize_tls_address_aix): Use it.
9135 (rs6000_xcoff_file_end): Add ".ref __tls_get_addr" when
9136 xcoff_tls_exec_model_detected is true.
9138 2021-08-15 Jeff Law <jlaw@localhost.localdomain>
9140 * config/h8300/h8300.c (shift_alg_si): Retune H8/300H shifts
9141 to allow a bit more code growth, saving many dozens of cycles.
9142 (h8300_option_override): Adjus shift_alg_si if optimizing for
9144 (get_shift_alg): Use special + inline shifts for residuals
9147 2021-08-14 Stafford Horne <shorne@gmail.com>
9150 * config/or1k/or1k-opts.h: New file.
9151 * config/or1k/or1k.c (or1k_legitimize_address_1, print_reloc):
9152 Support generating gotha relocations if -mcmodel=large is
9154 * config/or1k/or1k.h (TARGET_CMODEL_SMALL, TARGET_CMODEL_LARGE):
9156 * config/or1k/or1k.opt (mcmodel=): New option.
9157 * doc/invoke.texi (OpenRISC Options): Document mcmodel.
9159 2021-08-14 Martin Sebor <msebor@redhat.com>
9161 PR middle-end/101791
9162 * gimple-ssa-warn-access.cc (new_delete_mismatch_p): Use new argument
9163 to valid_new_delete_pair_p.
9164 * tree.c (valid_new_delete_pair_p): Add argument.
9165 * tree.h (valid_new_delete_pair_p): Same.
9167 2021-08-14 Jakub Jelinek <jakub@redhat.com>
9170 * config/i386/i386-expand.c (expand_vec_perm_broadcast_1)
9171 <case E_V64QImode>: For this mode assert
9172 !TARGET_AVX512BW || d->perm[0] rather than !TARGET_AVX2 || d->perm[0].
9174 2021-08-13 Michael Meissner <meissner@linux.ibm.com>
9177 * config/rs6000/altivec.md (xxeval): Use register_predicate
9178 instead of altivec_register_predicate.
9180 2021-08-13 Martin Sebor <msebor@redhat.com>
9182 PR middle-end/101734
9183 * tree-ssa-uninit.c (maybe_warn_read_write_only): New function.
9184 (maybe_warn_operand): Call it.
9186 2021-08-13 Martin Liska <mliska@suse.cz>
9189 * attribs.c (decl_attributes): Make naked functions "noipa"
9192 2021-08-13 Martin Liska <mliska@suse.cz>
9195 * symtab.c (symtab_node::noninterposable_alias): Do not create
9196 local aliases for target_clone functions as the clonning pass
9199 2021-08-13 Martin Liska <mliska@suse.cz>
9201 * opts.c (LIVE_PATCHING_OPTION): Define.
9202 (control_options_for_live_patching): Use it in error messages.
9204 2021-08-13 Jan Hubicka <hubicka@ucw.cz>
9206 * ipa-modref.c (dump_eaf_flags): Dump EAF_NOREAD.
9207 (implicit_const_eaf_flags, implicit_pure_eaf_flags,
9208 ignore_stores_eaf_flags): New constants.
9209 (remove_useless_eaf_flags): New function.
9210 (eaf_flags_useful_p): Use it.
9211 (deref_flags): Add EAF_NOT_RETURNED if flag is unused;
9213 (modref_lattice::init): Add EAF_NOREAD.
9214 (modref_lattice::add_escape_point): Do not reacord escape point if
9216 (modref_lattice::merge): EAF_NOESCAPE implies EAF_NODIRECTESCAPE;
9217 use remove_useless_eaf_flags.
9218 (modref_lattice::merge_deref): Use ignore_stores_eaf_flags.
9219 (modref_lattice::merge_direct_load): Add EAF_NOREAD
9220 (analyze_ssa_name_flags): Fix handling EAF_NOT_RETURNED
9221 (analyze_parms): Use remove_useless_eaf_flags.
9222 (ipa_merge_modref_summary_after_inlining): Use ignore_stores_eaf_flags.
9223 (modref_merge_call_site_flags): Add caller and ecf_flags parameter;
9224 use remove_useless_eaf_flags.
9225 (modref_propagate_flags_in_scc): Update.
9226 * ipa-modref.h: Turn eaf_flags_t back to char.
9227 * tree-core.h (EAF_NOT_RETURNED): Fix.
9228 (EAF_NOREAD): New constant
9229 * tree-ssa-alias.c: (ref_maybe_used_by_call_p_1): Check for
9231 * tree-ssa-structalias.c (handle_rhs_call): Handle new flags.
9232 (handle_pure_call): Likewise.
9234 2021-08-12 Jakub Jelinek <jakub@redhat.com>
9236 * tree.def (OMP_MASKED): New tree code.
9237 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_FILTER.
9238 * tree.h (OMP_MASKED_BODY, OMP_MASKED_CLAUSES, OMP_MASKED_COMBINED,
9239 OMP_CLAUSE_FILTER_EXPR): Define.
9240 * tree.c (omp_clause_num_ops): Add OMP_CLAUSE_FILTER entry.
9241 (omp_clause_code_name): Likewise.
9242 (walk_tree_1): Handle OMP_CLAUSE_FILTER.
9243 * tree-nested.c (convert_nonlocal_omp_clauses,
9244 convert_local_omp_clauses): Handle OMP_CLAUSE_FILTER.
9245 (convert_nonlocal_reference_stmt, convert_local_reference_stmt,
9246 convert_gimple_call): Handle GIMPLE_OMP_MASTER.
9247 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_FILTER.
9248 (dump_generic_node): Handle OMP_MASTER.
9249 * gimple.def (GIMPLE_OMP_MASKED): New gimple code.
9250 * gimple.c (gimple_build_omp_masked): New function.
9251 (gimple_copy): Handle GIMPLE_OMP_MASKED.
9252 * gimple.h (gimple_build_omp_masked): Declare.
9253 (gimple_has_substatements): Handle GIMPLE_OMP_MASKED.
9254 (gimple_omp_masked_clauses, gimple_omp_masked_clauses_ptr,
9255 gimple_omp_masked_set_clauses): New inline functions.
9256 (CASE_GIMPLE_OMP): Add GIMPLE_OMP_MASKED.
9257 * gimple-pretty-print.c (dump_gimple_omp_masked): New function.
9258 (pp_gimple_stmt_1): Handle GIMPLE_OMP_MASKED.
9259 * gimple-walk.c (walk_gimple_stmt): Likewise.
9260 * gimple-low.c (lower_stmt): Likewise.
9261 * gimplify.c (is_gimple_stmt): Handle OMP_MASTER.
9262 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_FILTER. For clauses
9263 that take one expression rather than decl or constant, force
9264 gimplification of that into a SSA_NAME or temporary unless min
9266 (gimplify_adjust_omp_clauses): Handle OMP_CLAUSE_FILTER.
9267 (gimplify_expr): Handle OMP_MASKED.
9268 * tree-inline.c (remap_gimple_stmt): Handle GIMPLE_OMP_MASKED.
9269 (estimate_num_insns): Likewise.
9270 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_FILTER.
9271 (check_omp_nesting_restrictions): Handle GIMPLE_OMP_MASKED. Adjust
9272 diagnostics for existence of masked construct.
9273 (scan_omp_1_stmt, lower_omp_master, lower_omp_1, diagnose_sb_1,
9274 diagnose_sb_2): Handle GIMPLE_OMP_MASKED.
9275 * omp-expand.c (expand_omp_synch, expand_omp, omp_make_gimple_edges):
9278 2021-08-12 Uroš Bizjak <ubizjak@gmail.com>
9281 * config/i386/i386.md (avx512f_scalef<mode>2): New insn pattern.
9282 (ldexp<mode>3): Use avx512f_scalef<mode>2.
9283 (UNSPEC_SCALEF): Move from ...
9284 * config/i386/sse.md (UNSPEC_SCALEF): ... here.
9286 2021-08-12 Jan Hubicka <hubicka@ucw.cz>
9288 * ipa-split.c (consider_split): Fix condition testing void functions.
9290 2021-08-12 Aldy Hernandez <aldyh@redhat.com>
9292 * doc/invoke.texi: Remove docs for threader-mode param.
9293 * flag-types.h (enum threader_mode): Remove.
9294 * params.opt: Remove threader-mode param.
9295 * tree-ssa-threadbackward.c (class back_threader): Remove
9296 path_is_unreachable_p.
9297 Make find_paths private.
9298 Add maybe_thread and thread_through_all_blocks.
9299 Remove reference marker for m_registry.
9300 Remove reference marker for m_profit.
9301 (back_threader::back_threader): Adjust for registry and profit not
9303 (dump_path): Move down.
9305 (class thread_jumps): Remove.
9306 (class back_threader_registry): Remove m_all_paths.
9308 (thread_jumps::thread_through_all_blocks): Move to back_threader
9310 (fsm_find_thread_path): Remove
9311 (back_threader::maybe_thread): New.
9312 (back_threader::thread_through_all_blocks): Move from
9314 (back_threader_registry::back_threader_registry): Remove
9316 (back_threader_registry::~back_threader_registry): Remove.
9317 (thread_jumps::find_taken_edge): Remove.
9318 (thread_jumps::check_subpath_and_update_thread_path): Remove.
9319 (thread_jumps::maybe_register_path): Remove.
9320 (thread_jumps::handle_phi): Remove.
9321 (handle_assignment_p): Remove.
9322 (thread_jumps::handle_assignment): Remove.
9323 (thread_jumps::fsm_find_control_statement_thread_paths): Remove.
9324 (thread_jumps::find_jump_threads_backwards): Remove.
9325 (thread_jumps::find_jump_threads_backwards_with_ranger): Remove.
9326 (try_thread_blocks): Rename find_jump_threads_backwards to
9328 (pass_early_thread_jumps::execute): Same.
9330 2021-08-12 Tobias Burnus <tobias@codesourcery.com>
9332 * tree-core.h (omp_clause_proc_bind_kind): Add
9333 OMP_CLAUSE_PROC_BIND_PRIMARY.
9334 * tree-pretty-print.c (dump_omp_clause): Add TODO comment to
9335 change 'master' to 'primary' in proc_bind for OpenMP 5.1.
9337 2021-08-12 Claudiu Zissulescu <claziss@synopsys.com>
9339 * common/config/arc/arc-common.c (arc_option_init_struct): Remove
9340 fno-common reference.
9341 * config/arc/arc.c (arc_override_options): Remove overriding of
9344 2021-08-12 Jakub Jelinek <jakub@redhat.com>
9347 * config/i386/i386-expand.c (ix86_expand_vec_one_operand_perm_avx512):
9348 If d->testing_p, return true after performing checks instead of
9349 actually expanding the insn.
9350 (expand_vec_perm_broadcast_1): Handle V32HImode - assert
9351 !TARGET_AVX512BW and return false.
9353 2021-08-12 Eric Botcazou <ebotcazou@gcc.gnu.org>
9355 * configure.ac (PE linker --disable-dynamicbase support): New check.
9356 * configure: Regenerate.
9357 * config.in: Likewise.
9358 * config/i386/mingw32.h (LINK_SPEC_DISABLE_DYNAMICBASE): New define.
9359 (LINK_SPEC): Use it.
9360 * config/i386/mingw-w64.h (LINK_SPEC_DISABLE_DYNAMICBASE): Likewise.
9361 (LINK_SPEC): Likewise.
9363 2021-08-12 liuhongt <hongtao.liu@intel.com>
9366 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_2): New
9367 post_reload define_insn_and_split.
9368 (*avx512bw_zero_extendv32qiv32hi2_2): Ditto.
9369 (*sse4_1_zero_extendv8qiv8hi2_4): Ditto.
9370 (*avx512f_zero_extendv16hiv16si2_2): Ditto.
9371 (*avx2_zero_extendv8hiv8si2_2): Ditto.
9372 (*sse4_1_zero_extendv4hiv4si2_4): Ditto.
9373 (*avx512f_zero_extendv8siv8di2_2): Ditto.
9374 (*avx2_zero_extendv4siv4di2_2): Ditto.
9375 (*sse4_1_zero_extendv2siv2di2_4): Ditto.
9376 (VI248_256, VI248_512, VI148_512, VI148_256, VI148_128): New
9379 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
9381 * config/rs6000/rs6000-builtin-new.def: Add always, power5, and
9384 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
9386 * config/rs6000/rs6000-builtin-new.def: Add vsx stanza.
9388 2021-08-11 Bill Schmidt <wschmidt@linux.ibm.com>
9390 * config/rs6000/rs6000-builtin-new.def: Finish altivec stanza.
9391 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Move
9392 initialization of pcvoid_type_node here...
9393 (altivec_init_builtins): ...from here.
9394 * config/rs6000/rs6000.h (rs6000_builtin_type_index): Add
9395 RS6000_BTI_const_ptr_void.
9396 (pcvoid_type_node): New macro.
9398 2021-08-11 Richard Biener <rguenther@suse.de>
9401 * tree-ssa-forwprop.c (pass_forwprop::execute): Do not decompose
9402 hard-register accesses.
9404 2021-08-11 Richard Biener <rguenther@suse.de>
9406 * tree-ssa-operands.c (operands_scanner::get_expr_operands):
9407 Do not look at COMPONENT_REF FIELD_DECLs TREE_THIS_VOLATILE
9408 to determine has_volatile_ops.
9410 2021-08-11 Eric Botcazou <ebotcazou@gcc.gnu.org>
9412 * cfgexpand.c (expand_used_vars): Reuse attribs local variable.
9414 2021-08-11 Jan Hubicka <hubicka@ucw.cz>
9415 Alexandre Oliva <oliva@adacore.com>
9417 * ipa-modref.c (modref_lattice::dump): Fix escape_point's min_flags
9419 (modref_lattice::merge_deref): Fix handling of indirect scape points.
9420 (update_escape_summary_1): Likewise.
9421 (update_escape_summary): Likewise.
9422 (ipa_merge_modref_summary_after_inlining): Likewise.
9424 2021-08-11 Richard Biener <rguenther@suse.de>
9426 PR middle-end/101858
9427 * fold-const.c (fold_binary_loc): Guard simplification
9428 of X < (cast) (1 << Y) to integer types.
9430 2021-08-11 Richard Biener <rguenther@suse.de>
9432 PR tree-optimization/101861
9433 * tree-vect-stmts.c (vectorizable_load): Fix error in
9434 previous change with regard to gather vectorization.
9436 2021-08-11 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
9439 * config/arm/arm_neon.h (vdup_n_s8): Replace call to builtin
9441 (vdup_n_s16): Likewise.
9442 (vdup_n_s32): Likewise.
9443 (vdup_n_s64): Likewise.
9444 (vdup_n_u8): Likewise.
9445 (vdup_n_u16): Likewise.
9446 (vdup_n_u32): Likewise.
9447 (vdup_n_u64): Likewise.
9448 (vdup_n_p8): Likewise.
9449 (vdup_n_p16): Likewise.
9450 (vdup_n_p64): Likewise.
9451 (vdup_n_f16): Likewise.
9452 (vdup_n_f32): Likewise.
9453 (vdupq_n_s8): Likewise.
9454 (vdupq_n_s16): Likewise.
9455 (vdupq_n_s32): Likewise.
9456 (vdupq_n_s64): Likewise.
9457 (vdupq_n_u8): Likewise.
9458 (vdupq_n_u16): Likewise.
9459 (vdupq_n_u32): Likewise.
9460 (vdupq_n_u64): Likewise.
9461 (vdupq_n_p8): Likewise.
9462 (vdupq_n_p16): Likewise.
9463 (vdupq_n_p64): Likewise.
9464 (vdupq_n_f16): Likewise.
9465 (vdupq_n_f32): Likewise.
9466 (vmov_n_s8): Replace call to builtin with call to corresponding
9468 (vmov_n_s16): Likewise.
9469 (vmov_n_s32): Likewise.
9470 (vmov_n_s64): Likewise.
9471 (vmov_n_u8): Likewise.
9472 (vmov_n_u16): Likewise.
9473 (vmov_n_u32): Likewise.
9474 (vmov_n_u64): Likewise.
9475 (vmov_n_p8): Likewise.
9476 (vmov_n_p16): Likewise.
9477 (vmov_n_f16): Likewise.
9478 (vmov_n_f32): Likewise.
9479 (vmovq_n_s8): Likewise.
9480 (vmovq_n_s16): Likewise.
9481 (vmovq_n_s32): Likewise.
9482 (vmovq_n_s64): Likewise.
9483 (vmovq_n_u8): Likewise.
9484 (vmovq_n_u16): Likewise.
9485 (vmovq_n_u32): Likewise.
9486 (vmovq_n_u64): Likewise.
9487 (vmovq_n_p8): Likewise.
9488 (vmovq_n_p16): Likewise.
9489 (vmovq_n_f16): Likewise.
9490 (vmovq_n_f32): Likewise.
9491 * config/arm/arm_neon_builtins.def: Remove entries for vdup_n.
9493 2021-08-11 liuhongt <hongtao.liu@intel.com>
9496 * config/i386/i386.md (ldexp<mode>3): Extend to vscalefs[sd]
9497 when TARGET_AVX512F and TARGET_SSE_MATH.
9499 2021-08-10 Jakub Jelinek <jakub@redhat.com>
9502 * config/i386/i386-expand.c (expand_vec_perm_even_odd): Return false
9503 for V32HImode if !TARGET_AVX512BW.
9504 (ix86_vectorize_vec_perm_const) <case E_V32HImode, case E_V64QImode>:
9505 If !TARGET_AVX512BW and TARGET_AVX512F and d.testing_p, don't fail
9506 early, but actually check the permutation.
9508 2021-08-10 Richard Biener <rguenther@suse.de>
9510 PR tree-optimization/101809
9511 * tree-vect-stmts.c (get_load_store_type): Allow emulated
9512 gathers with offset vector nunits being a constant multiple
9513 of the data vector nunits.
9514 (vect_get_gather_scatter_ops): Use the appropriate nunits
9515 for the offset vector defs.
9516 (vectorizable_store): Adjust call to
9517 vect_get_gather_scatter_ops.
9518 (vectorizable_load): Likewise. Handle the case of less
9519 offset vectors than data vectors.
9521 2021-08-10 Jakub Jelinek <jakub@redhat.com>
9524 * config/i386/sse.md (*avx512f_shuf_<shuffletype>64x2_1<mask_name>_1,
9525 *avx512f_shuf_<shuffletype>32x4_1<mask_name>_1): New define_insn
9528 2021-08-10 Richard Biener <rguenther@suse.de>
9530 PR tree-optimization/101801
9531 PR tree-optimization/101819
9532 * tree-vectorizer.h (vect_emulated_vector_p): Declare.
9533 * tree-vect-loop.c (vect_emulated_vector_p): New function.
9534 (vectorizable_reduction): Re-instantiate a check for emulated
9536 * tree-vect-stmts.c (vectorizable_shift): Likewise.
9537 (vectorizable_operation): Likewise. Cost emulated vector
9538 operations according to the scalar sequence synthesized by
9541 2021-08-10 Richard Biener <rguenther@suse.de>
9543 PR middle-end/101824
9544 * tree-nested.c (get_frame_field): Mark the COMPONENT_REF as
9545 volatile in case the variable was.
9547 2021-08-10 H.J. Lu <hjl.tools@gmail.com>
9550 * config/i386/constraints.md (BC): Document for integer SSE
9551 constant all bits set operand.
9552 (BF): New constraint for const floating-point all bits set
9554 * config/i386/i386.c (standard_sse_constant_p): Likewise.
9555 (standard_sse_constant_opcode): Likewise.
9556 * config/i386/sse.md (sseconstm1): New mode attribute.
9557 (mov<mode>_internal): Replace BC with <sseconstm1>.
9559 2021-08-10 liuhongt <hongtao.liu@intel.com>
9561 * config/i386/sse.md (cond_<insn><mode>): New expander.
9562 (VI248_AVX512VLBW): New mode iterator.
9563 * config/i386/predicates.md
9564 (nonimmediate_or_const_vec_dup_operand): New predicate.
9566 2021-08-09 Andrew MacLeod <amacleod@redhat.com>
9568 PR tree-optimization/101741
9569 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Check
9570 type of parameter for toupper/tolower.
9572 2021-08-09 Martin Jambor <mjambor@suse.cz>
9575 * ipa-prop.c (propagate_controlled_uses): Removed a spurious space.
9577 2021-08-09 Pat Haugen <pthaugen@linux.ibm.com>
9579 * config/rs6000/rs6000.c (is_load_insn1): Verify destination is a
9581 (is_store_insn1): Verify source is a register.
9583 2021-08-09 Uroš Bizjak <ubizjak@gmail.com>
9586 * config/i386/mmx.md (<any_logic:code>v2sf3):
9587 Rename from *mmx_<any_logic:code>v2sf3
9589 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9591 * config/nvptx/nvptx.c: Cross-reference parts adapted in
9592 'gcc/omp-oacc-neuter-broadcast.cc'.
9593 * omp-low.c: Likewise.
9594 * omp-oacc-neuter-broadcast.cc: Cross-reference parts adapted from
9597 2021-08-09 Julian Brown <julian@codesourcery.com>
9598 Kwok Cheung Yeung <kcy@codesourcery.com>
9599 Thomas Schwinge <thomas@codesourcery.com>
9601 * config/gcn/gcn.c (gcn_init_builtins): Override decls for
9602 BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START,
9603 BUILT_IN_GOACC_SINGLE_COPY_END and BUILT_IN_GOACC_BARRIER.
9604 (gcn_goacc_validate_dims): Turn on worker partitioning unconditionally.
9605 (gcn_fork_join): Update comment.
9606 * config/gcn/gcn.opt (flag_worker_partitioning): Remove.
9607 (macc_experimental_workers): Remove unused option.
9609 2021-08-09 Julian Brown <julian@codesourcery.com>
9610 Nathan Sidwell <nathan@codesourcery.com> (via 'gcc/config/nvptx/nvptx.c' master)
9611 Kwok Cheung Yeung <kcy@codesourcery.com>
9612 Thomas Schwinge <thomas@codesourcery.com>
9614 * Makefile.in (OBJS): Add omp-oacc-neuter-broadcast.o.
9615 * doc/tm.texi.in (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD):
9616 Add documentation hook.
9617 * doc/tm.texi: Regenerate.
9618 * omp-oacc-neuter-broadcast.cc: New file.
9619 * omp-builtins.def (BUILT_IN_GOACC_BARRIER)
9620 (BUILT_IN_GOACC_SINGLE_START, BUILT_IN_GOACC_SINGLE_COPY_START)
9621 (BUILT_IN_GOACC_SINGLE_COPY_END): New builtins.
9622 * passes.def (pass_omp_oacc_neuter_broadcast): Add pass.
9623 * target.def (goacc.create_worker_broadcast_record): Add target
9625 * tree-pass.h (make_pass_omp_oacc_neuter_broadcast): Add
9627 * config/gcn/gcn-protos.h (gcn_goacc_adjust_propagation_record):
9628 Rename prototype to...
9629 (gcn_goacc_create_worker_broadcast_record): ... this.
9630 * config/gcn/gcn-tree.c (gcn_goacc_adjust_propagation_record): Rename
9632 (gcn_goacc_create_worker_broadcast_record): ... this.
9633 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_PROPAGATION_RECORD):
9635 (TARGET_GOACC_CREATE_WORKER_BROADCAST_RECORD): ... this.
9637 2021-08-09 Tejas Belagod <tejas.belagod@arm.com>
9640 * config/aarch64/aarch64-simd.md (vlshr<mode>3, vashr<mode>3): Use
9643 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9645 * Makefile.in (GTFILES): Remove '$(srcdir)/omp-offload.c'.
9647 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9649 * builtins.def (DEF_GOACC_BUILTIN, DEF_GOMP_BUILTIN): Don't
9650 consider '-foffload-abi'.
9651 * common.opt (-foffload-abi): Remove 'Var', 'Init'.
9652 * opts.c (common_handle_option) <-foffload-abi> [ACCEL_COMPILER]:
9655 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9657 * optc-gen.awk: Sanity check that 'Init' doesn't appear without
9660 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9662 * omp-builtins.def (BUILT_IN_ACC_GET_DEVICE_TYPE): Remove.
9664 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9666 * doc/gty.texi (Files): Update.
9668 2021-08-09 Thomas Schwinge <thomas@codesourcery.com>
9670 * doc/gty.texi (Files): Fix GTY header file example.
9672 2021-08-09 Roger Sayle <roger@nextmovesoftware.com>
9674 * tree-ssa-ccp.c (value_mask_to_min_max): Helper function to
9675 determine the upper and lower bounds from a mask-value pair.
9676 (bit_value_unop) [ABS_EXPR, ABSU_EXPR]: Add support for
9677 absolute value and unsigned absolute value expressions.
9678 (bit_value_binop): Initialize *VAL's precision.
9679 [LT_EXPR, LE_EXPR]: Use value_mask_to_min_max to determine
9680 upper and lower bounds of operands. Add LE_EXPR/GE_EXPR
9681 support when the operands are unknown but potentially equal.
9682 [MIN_EXPR, MAX_EXPR]: Support minimum/maximum expressions.
9684 2021-08-09 Bin Cheng <bin.cheng@linux.alibaba.com>
9686 * config/aarch64/aarch64.md
9687 (*extend<SHORT:mode><GPI:mode>2_aarch64): Use %<GPI:w>0.
9689 2021-08-08 Sergei Trofimovich <siarheit@google.com>
9691 * lra-constraints.c: Fix s/otput/output/ typo.
9693 2021-08-06 Martin Sebor <msebor@redhat.com>
9695 * builtins.c (expand_builtin_memchr): Move to gimple-ssa-warn-access.cc.
9696 (expand_builtin_strcat): Same.
9697 (expand_builtin_stpncpy): Same.
9698 (expand_builtin_strncat): Same.
9699 (check_read_access): Same.
9700 (check_memop_access): Same.
9701 (expand_builtin_strlen): Move checks to gimple-ssa-warn-access.cc.
9702 (expand_builtin_strnlen): Same.
9703 (expand_builtin_memcpy): Same.
9704 (expand_builtin_memmove): Same.
9705 (expand_builtin_mempcpy): Same.
9706 (expand_builtin_strcpy): Same.
9707 (expand_builtin_strcpy_args): Same.
9708 (expand_builtin_stpcpy_1): Same.
9709 (expand_builtin_strncpy): Same.
9710 (expand_builtin_memset): Same.
9711 (expand_builtin_bzero): Same.
9712 (expand_builtin_strcmp): Same.
9713 (expand_builtin_strncmp): Same.
9714 (expand_builtin): Remove handlers.
9715 (fold_builtin_strlen): Add a comment.
9716 * builtins.h (check_access): Move to gimple-ssa-warn-access.cc.
9717 * calls.c (maybe_warn_nonstring_arg): Same.
9718 * diagnostic-spec.c (nowarn_spec_t::nowarn_spec_t): Add warning option.
9719 * gimple-fold.c (gimple_fold_builtin_strcpy): Pass argument to callee.
9720 (gimple_fold_builtin_stpcpy): Same.
9721 * gimple-ssa-warn-access.cc (has_location): New function.
9722 (get_location): Same.
9723 (get_callee_fndecl): Same.
9726 (warn_string_no_nul): Define.
9727 (unterminated_array): Same.
9728 (check_nul_terminated_array): Same.
9729 (maybe_warn_nonstring_arg): Same.
9730 (maybe_warn_for_bound): Same.
9731 (warn_for_access): Same.
9732 (check_access): Same.
9733 (check_memop_access): Same.
9734 (check_read_access): Same.
9735 (warn_dealloc_offset): Use helper functions.
9736 (maybe_emit_free_warning): Same.
9737 (class pass_waccess): Add members.
9738 (check_strcat): New function.
9739 (check_strncat): New function.
9740 (check_stxcpy): New function.
9741 (check_stxncpy): New function.
9742 (check_strncmp): New function.
9743 (pass_waccess::check_builtin): New function.
9744 (pass_waccess::check): Call it.
9745 * gimple-ssa-warn-access.h (warn_string_no_nul): Move here from
9747 (maybe_warn_for_bound): Same.
9748 (check_access): Same.
9749 (check_memop_access): Same.
9750 (check_read_access): Same.
9751 * pointer-query.h (struct access_data): Define a ctor overload.
9753 2021-08-06 Richard Biener <rguenther@suse.de>
9755 PR tree-optimization/101801
9756 * tree-vectorizer.h (vect_worthwhile_without_simd_p): Rename...
9757 (vect_can_vectorize_without_simd_p): ... to this.
9758 * tree-vect-loop.c (vect_worthwhile_without_simd_p): Rename...
9759 (vect_can_vectorize_without_simd_p): ... to this and fold
9760 in vect_min_worthwhile_factor.
9761 (vect_min_worthwhile_factor): Remove.
9762 (vectorizable_reduction): Adjust and remove the cost part.
9763 * tree-vect-stmts.c (vectorizable_shift): Likewise.
9764 (vectorizable_operation): Likewise.
9766 2021-08-06 Uroš Bizjak <ubizjak@gmail.com>
9769 * config/i386/i386.md (cmove reg-to-reg move elimination peephole2s):
9770 Add general_gr_operand predicate to operand 3.
9772 2021-08-06 Roger Sayle <roger@nextmovesoftware.com>
9774 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern): Use
9775 CFN_BUILT_IN_CLRSB* instead of BUILT_IN_CLRSB* for consistency.
9777 2021-08-06 Tamar Christina <tamar.christina@arm.com>
9779 * config/aarch64/aarch64-sve-builtins.cc (register_svpattern,
9780 register_svprfop): Pass vec<> by pointer.
9781 * langhooks-def.h (lhd_simulate_enum_decl): Likewise.
9782 * langhooks.c (lhd_simulate_enum_decl): Likewise.
9783 * langhooks.h (struct lang_hooks_for_types): Likewise.
9785 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
9787 * config/aarch64/arm_neon.h (vst1_bf16_x2): Use
9788 __builtin_memcpy instead of constructing an additional
9789 __builtin_aarch64_simd_oi one vector at a time.
9790 (vst1q_bf16_x2): Likewise.
9791 (vst1_bf16_x3): Use __builtin_memcpy instead of constructing
9792 an additional __builtin_aarch64_simd_ci one vector at a time.
9793 (vst1q_bf16_x3): Likewise.
9794 (vst1_bf16_x4): Use __builtin_memcpy instead of a union.
9795 (vst1q_bf16_x4): Likewise.
9796 (vst2_bf16): Use __builtin_memcpy instead of constructing an
9797 additional __builtin_aarch64_simd_oi one vector at a time.
9798 (vst2q_bf16): Likewise.
9799 (vst3_bf16): Use __builtin_memcpy instead of constructing an
9800 additional __builtin_aarch64_simd_ci mode one vector at a
9802 (vst3q_bf16): Likewise.
9803 (vst4_bf16): Use __builtin_memcpy instead of constructing an
9804 additional __builtin_aarch64_simd_xi one vector at a time.
9805 (vst4q_bf16): Likewise.
9807 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
9809 * config/aarch64/arm_neon.h (__ST2_LANE_FUNC): Delete.
9810 (__ST2Q_LANE_FUNC): Delete.
9811 (vst2_lane_f16): Use __builtin_memcpy to copy vector
9812 structure instead of constructing __builtin_aarch64_simd_oi
9813 one vector at a time.
9814 (vst2_lane_f32): Likewise.
9815 (vst2_lane_f64): Likewise.
9816 (vst2_lane_p8): Likewise.
9817 (vst2_lane_p16): Likewise.
9818 (vst2_lane_p64): Likewise.
9819 (vst2_lane_s8): Likewise.
9820 (vst2_lane_s16): Likewise.
9821 (vst2_lane_s32): Likewise.
9822 (vst2_lane_s64): Likewise.
9823 (vst2_lane_u8): Likewise.
9824 (vst2_lane_u16): Likewise.
9825 (vst2_lane_u32): Likewise.
9826 (vst2_lane_u64): Likewise.
9827 (vst2_lane_bf16): Likewise.
9828 (vst2q_lane_f16): Use __builtin_memcpy to copy vector
9829 structure instead of using a union.
9830 (vst2q_lane_f32): Likewise.
9831 (vst2q_lane_f64): Likewise.
9832 (vst2q_lane_p8): Likewise.
9833 (vst2q_lane_p16): Likewise.
9834 (vst2q_lane_p64): Likewise.
9835 (vst2q_lane_s8): Likewise.
9836 (vst2q_lane_s16): Likewise.
9837 (vst2q_lane_s32): Likewise.
9838 (vst2q_lane_s64): Likewise.
9839 (vst2q_lane_u8): Likewise.
9840 (vst2q_lane_u16): Likewise.
9841 (vst2q_lane_u32): Likewise.
9842 (vst2q_lane_u64): Likewise.
9843 (vst2q_lane_bf16): Likewise.
9845 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
9847 * config/aarch64/arm_neon.h (__ST3_LANE_FUNC): Delete.
9848 (__ST3Q_LANE_FUNC): Delete.
9849 (vst3_lane_f16): Use __builtin_memcpy to copy vector
9850 structure instead of constructing __builtin_aarch64_simd_ci
9851 one vector at a time.
9852 (vst3_lane_f32): Likewise.
9853 (vst3_lane_f64): Likewise.
9854 (vst3_lane_p8): Likewise.
9855 (vst3_lane_p16): Likewise.
9856 (vst3_lane_p64): Likewise.
9857 (vst3_lane_s8): Likewise.
9858 (vst3_lane_s16): Likewise.
9859 (vst3_lane_s32): Likewise.
9860 (vst3_lane_s64): Likewise.
9861 (vst3_lane_u8): Likewise.
9862 (vst3_lane_u16): Likewise.
9863 (vst3_lane_u32): Likewise.
9864 (vst3_lane_u64): Likewise.
9865 (vst3_lane_bf16): Likewise.
9866 (vst3q_lane_f16): Use __builtin_memcpy to copy vector
9867 structure instead of using a union.
9868 (vst3q_lane_f32): Likewise.
9869 (vst3q_lane_f64): Likewise.
9870 (vst3q_lane_p8): Likewise.
9871 (vst3q_lane_p16): Likewise.
9872 (vst3q_lane_p64): Likewise.
9873 (vst3q_lane_s8): Likewise.
9874 (vst3q_lane_s16): Likewise.
9875 (vst3q_lane_s32): Likewise.
9876 (vst3q_lane_s64): Likewise.
9877 (vst3q_lane_u8): Likewise.
9878 (vst3q_lane_u16): Likewise.
9879 (vst3q_lane_u32): Likewise.
9880 (vst3q_lane_u64): Likewise.
9881 (vst3q_lane_bf16): Likewise.
9883 2021-08-06 Jonathan Wright <jonathan.wright@arm.com>
9885 * config/aarch64/arm_neon.h (__ST4_LANE_FUNC): Delete.
9886 (__ST4Q_LANE_FUNC): Delete.
9887 (vst4_lane_f16): Use __builtin_memcpy to copy vector
9888 structure instead of constructing __builtin_aarch64_simd_xi
9889 one vector at a time.
9890 (vst4_lane_f32): Likewise.
9891 (vst4_lane_f64): Likewise.
9892 (vst4_lane_p8): Likewise.
9893 (vst4_lane_p16): Likewise.
9894 (vst4_lane_p64): Likewise.
9895 (vst4_lane_s8): Likewise.
9896 (vst4_lane_s16): Likewise.
9897 (vst4_lane_s32): Likewise.
9898 (vst4_lane_s64): Likewise.
9899 (vst4_lane_u8): Likewise.
9900 (vst4_lane_u16): Likewise.
9901 (vst4_lane_u32): Likewise.
9902 (vst4_lane_u64): Likewise.
9903 (vst4_lane_bf16): Likewise.
9904 (vst4q_lane_f16): Use __builtin_memcpy to copy vector
9905 structure instead of using a union.
9906 (vst4q_lane_f32): Likewise.
9907 (vst4q_lane_f64): Likewise.
9908 (vst4q_lane_p8): Likewise.
9909 (vst4q_lane_p16): Likewise.
9910 (vst4q_lane_p64): Likewise.
9911 (vst4q_lane_s8): Likewise.
9912 (vst4q_lane_s16): Likewise.
9913 (vst4q_lane_s32): Likewise.
9914 (vst4q_lane_s64): Likewise.
9915 (vst4q_lane_u8): Likewise.
9916 (vst4q_lane_u16): Likewise.
9917 (vst4q_lane_u32): Likewise.
9918 (vst4q_lane_u64): Likewise.
9919 (vst4q_lane_bf16): Likewise.
9921 2021-08-06 Martin Liska <mliska@suse.cz>
9923 * config/rs6000/rs6000.c (rs6000_option_override_internal): When
9924 a target option is restored, it can have
9925 rs6000_long_double_type_size set to FLOAT_PRECISION_TFmode
9926 and error should not be emitted.
9928 2021-08-06 Sebastian Huber <sebastian.huber@embedded-brains.de>
9930 * gcov-io.h (gcov_write): Declare.
9931 * gcov-io.c (gcov_write): New.
9932 (gcov_write_counter): Remove.
9933 (gcov_write_tag_length): Likewise.
9934 (gcov_write_summary): Replace gcov_write_tag_length() with calls to
9935 gcov_write_unsigned().
9936 * doc/invoke.texi (fprofile-info-section): Mention
9937 __gcov_info_to_gdca().
9939 2021-08-06 Martin Sebor <msebor@redhat.com>
9941 * dominance.c (prune_bbs_to_update_dominators): Adjust by-value vec
9942 arguments to by-reference.
9943 (iterate_fix_dominators): Same.
9944 * dominance.h (iterate_fix_dominators): Same.
9945 * ipa-prop.h: Call auto_vec::to_vec_legacy.
9946 * tree-data-ref.c (dump_data_dependence_relation): Adjust by-value vec
9947 arguments to by-reference.
9948 (debug_data_dependence_relation): Same.
9949 (dump_data_dependence_relations): Same.
9950 * tree-data-ref.h (debug_data_dependence_relation): Same.
9951 (dump_data_dependence_relations): Same.
9952 * tree-predcom.c (dump_chains): Same.
9953 (initialize_root_vars_lm): Same.
9954 (determine_unroll_factor): Same.
9955 (replace_phis_by_defined_names): Same.
9956 (insert_init_seqs): Same.
9957 (pcom_worker::tree_predictive_commoning_loop): Call
9958 auto_vec::to_vec_legacy.
9959 * tree-ssa-pre.c (insert_into_preds_of_block): Adjust by-value vec
9960 arguments to by-reference.
9961 * tree-ssa-threadbackward.c (populate_worklist): Same.
9962 (back_threader::resolve_def): Same.
9963 * tree-vect-data-refs.c (vect_check_nonzero_value): Same.
9964 (vect_enhance_data_refs_alignment): Same.
9965 (vect_check_lower_bound): Same.
9966 (vect_prune_runtime_alias_test_list): Same.
9967 (vect_permute_store_chain): Same.
9968 * tree-vect-slp-patterns.c (vect_normalize_conj_loc): Same.
9969 * tree-vect-stmts.c (vect_create_vectorized_demotion_stmts): Same.
9970 * tree-vectorizer.h (vect_permute_store_chain): Same.
9971 * vec.c (test_init): New function.
9972 (vec_c_tests): Call new function.
9973 * vec.h (vec): Declare ctors, dtor, and assignment.
9974 (auto_vec::vec_to_legacy): New function.
9975 (vec::copy): Adjust initialization.
9977 2021-08-05 H.J. Lu <hjl.tools@gmail.com>
9980 * config/i386/i386.c (ix86_can_inline_p): Ignore MASK_80387 if
9981 callee only uses GPRs.
9982 * config/i386/ia32intrin.h: Revert commit 5463cee2770.
9983 * config/i386/serializeintrin.h: Revert commit 71958f740f1.
9984 * config/i386/x86gprintrin.h: Add
9985 #pragma GCC target("general-regs-only") and #pragma GCC pop_options
9986 to disable non-GPR ISAs.
9988 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
9990 PR middle-end/101787
9991 * doc/md.texi (cond_ashl, cond_ashr, cond_lshr): Document.
9993 2021-08-05 Richard Sandiford <richard.sandiford@arm.com>
9995 * tree-vectorizer.h (vect_is_store_elt_extraction, vect_is_reduction)
9996 (vect_reduc_type, vect_embedded_comparison_type, vect_comparison_type)
9997 (vect_is_extending_load, vect_is_integer_truncation): New functions,
9998 moved from aarch64.c but given different names.
9999 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction)
10000 (aarch64_is_reduction, aarch64_reduc_type)
10001 (aarch64_embedded_comparison_type, aarch64_comparison_type)
10002 (aarch64_extending_load_p, aarch64_integer_truncation_p): Delete
10003 in favor of the above. Update callers accordingly.
10005 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
10008 * config/arm/arm-cpus.in (generic-armv7-a): Add quirk to suppress
10009 writing .cpu directive in asm output.
10010 * config/arm/arm.c (arm_identify_fpu_from_isa): New variable.
10011 (arm_last_printed_arch_string): Delete.
10012 (arm_last-printed_fpu_string): Delete.
10013 (arm_configure_build_target): If use of floating-point/SIMD is
10014 disabled, remove all fp/simd related features from the target ISA.
10015 (last_arm_targ_options): New variable.
10016 (arm_print_asm_arch_directives): Add new parameters. Change order
10017 of emitted directives and handle all cases here.
10018 (arm_file_start): Always call arm_print_asm_arch_directives, move
10019 all generation of .arch/.arch_extension here.
10020 (arm_file_end): Call arm_print_asm_arch.
10021 (arm_declare_function_name): Call arm_print_asm_arch_directives
10022 instead of printing .arch/.fpu directives directly.
10024 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
10026 * config/arm/arm.c (arm_configure_build_target): Don't call
10027 arm_option_reconfigure_globals.
10028 (arm_option_restore): Call arm_option_reconfigure_globals after
10029 reconfiguring the target.
10030 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
10032 2021-08-05 Richard Earnshaw <rearnsha@arm.com>
10034 * config/arm/arm.c (arm_configure_build_target): Ensure the target's
10035 arch_name is always set.
10037 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
10039 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
10040 of vec_select high-half from being added into Neon subtract
10043 2021-08-05 Jonathan Wright <jonathan.wright@arm.com>
10045 * config/aarch64/aarch64.c: Traverse RTL tree to prevent cost
10046 of vec_select high-half from being added into Neon add cost.
10048 2021-08-05 Kewen Lin <linkw@linux.ibm.com>
10050 * cfgloop.h (loops_list::loops_list): Add one optional argument
10051 root and adjust accordingly, update loop tree walking and factor
10053 * cfgloop.c (loops_list::walk_loop_tree): ... this. New function.
10055 2021-08-05 Eric Botcazou <ebotcazou@gcc.gnu.org>
10057 PR tree-optimization/101626
10058 * tree-sra.c (propagate_subaccesses_from_rhs): Do not set the
10059 reverse scalar storage order on a pointer or vector component.
10061 2021-08-05 liuhongt <hongtao.liu@intel.com>
10063 * config/i386/sse.md (cond_<code><mode>): New expander.
10065 2021-08-05 liuhongt <hongtao.liu@intel.com>
10067 * config/i386/sse.md (cond_<code><mode>): New expander.
10069 2021-08-05 liuhongt <hongtao.liu@intel.com>
10071 * config/i386/sse.md (cond_<code><mode>): New expander.
10073 2021-08-04 David Malcolm <dmalcolm@redhat.com>
10076 * Makefile.in (ANALYZER_OBJS): Add analyzer/region-model-asm.o.
10078 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
10081 * config/i386/i386.h (STORE_MAX_PIECES): Allow 16/32/64 bytes
10082 only if TARGET_INTER_UNIT_MOVES_TO_VEC is true.
10084 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
10087 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
10088 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
10089 data with SSE register from one memory location to another.
10091 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
10093 * config/s390/s390.c (expand_perm_with_vpdi): New function.
10094 (vectorize_vec_perm_const_1): Call expand_perm_with_vpdi.
10095 * config/s390/vector.md (*vpdi1<mode>, @vpdi1<mode>): Enable a
10096 parameterized expander.
10097 (*vpdi4<mode>, @vpdi4<mode>): Likewise.
10099 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
10101 * config/s390/s390.c (MAX_VECT_LEN): Define macro.
10102 (struct expand_vec_perm_d): Define struct.
10103 (expand_perm_with_merge): New function.
10104 (vectorize_vec_perm_const_1): New function.
10105 (s390_vectorize_vec_perm_const): New function.
10106 (TARGET_VECTORIZE_VEC_PERM_CONST): Define target macro.
10108 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
10110 * config/s390/vector.md (V_HW_64): Remove mode iterator.
10111 (*vec_load_pair<mode>): Use V_HW_2 instead of V_HW_64.
10112 * config/s390/vx-builtins.md
10113 (vec_scatter_element<V_HW_2:mode>_SI): Use V_HW_2 instead of
10116 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
10118 * config/s390/s390.md (UNSPEC_VEC_PERMI): Remove constant
10120 * config/s390/vector.md (*vpdi1<mode>, *vpdi4<mode>): New pattern
10122 * config/s390/vx-builtins.md (*vec_permi<mode>): Emit generic rtx
10123 instead of an unspec.
10125 2021-08-04 Andreas Krebbel <krebbel@linux.ibm.com>
10127 * config/s390/s390-modes.def: Add more vector modes to support
10128 concatenation of two vectors.
10129 * config/s390/s390-protos.h (s390_expand_merge_perm_const): Add
10131 (s390_expand_merge): Likewise.
10132 * config/s390/s390.c (s390_expand_merge_perm_const): New function.
10133 (s390_expand_merge): New function.
10134 * config/s390/s390.md (UNSPEC_VEC_MERGEH, UNSPEC_VEC_MERGEL):
10135 Remove constant definitions.
10136 * config/s390/vector.md (V_HW_2): Add mode iterators.
10137 (VI_HW_4, V_HW_4): Rename VI_HW_4 to V_HW_4.
10138 (vec_2x_nelts, vec_2x_wide): New mode attributes.
10139 (*vmrhb, *vmrlb, *vmrhh, *vmrlh, *vmrhf, *vmrlf, *vmrhg, *vmrlg):
10140 New pattern definitions.
10141 (vec_widen_umult_lo_<mode>, vec_widen_umult_hi_<mode>)
10142 (vec_widen_smult_lo_<mode>, vec_widen_smult_hi_<mode>)
10143 (vec_unpacks_lo_v4sf, vec_unpacks_hi_v4sf, vec_unpacks_lo_v2df)
10144 (vec_unpacks_hi_v2df): Adjust expanders to emit non-unspec RTX for
10146 * config/s390/vx-builtins.md (V_HW_4): Remove mode iterator. Now
10148 (vec_mergeh<mode>, vec_mergel<mode>): Use s390_expand_merge to
10149 emit vec merge pattern.
10151 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
10153 * config/aarch64/aarch64.c (aarch64_strip_extend_vec_half):
10155 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent cost of
10156 vec_select high-half from being added into Neon multiply
10158 * rtlanal.c (vec_series_highpart_p): Define.
10159 * rtlanal.h (vec_series_highpart_p): Declare.
10161 2021-08-04 Jonathan Wright <jonathan.wright@arm.com>
10163 * config/aarch64/aarch64.c (aarch64_strip_duplicate_vec_elt):
10165 (aarch64_rtx_mult_cost): Traverse RTL tree to prevent
10166 vec_select cost from being added into Neon multiply cost.
10168 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
10170 * tree-vect-loop.c (vect_better_loop_vinfo_p): Detect cases in
10171 which old_loop_vinfo is an epilogue loop that handles a constant
10172 number of iterations.
10174 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
10176 * tree-vect-loop.c (vect_analyze_loop): Print a dump message
10177 when a reanalyzed loop fails to be cheaper than the current
10180 2021-08-04 Richard Sandiford <richard.sandiford@arm.com>
10182 * config/aarch64/aarch64.c: Fix a typo.
10184 2021-08-04 Vincent Lefèvre <vincent-gcc@vinc17.net>
10186 PR gcov-profile/101773
10187 * gcov-io.c (gcov_close): Check return code of a fclose.
10189 2021-08-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
10192 * dwarf2out.c (dwarf2out_assembly_start): Emit a dummy
10193 .file statement when needed.
10195 2021-08-04 Richard Biener <rguenther@suse.de>
10197 * tree-vect-data-refs.c (vect_check_gather_scatter):
10198 Include widening conversions only when the result is
10199 still handed by native gather or the current offset
10200 size not already matches the data size.
10201 Also succeed analysis in case there's no native support,
10202 noted by a IFN_LAST ifn and a NULL decl.
10203 (vect_analyze_data_refs): Always consider gathers.
10204 * tree-vect-patterns.c (vect_recog_gather_scatter_pattern):
10205 Test for no IFN gather rather than decl gather.
10206 * tree-vect-stmts.c (vect_model_load_cost): Pass in the
10207 gather-scatter info and cost emulated gathers accordingly.
10208 (vect_truncate_gather_scatter_offset): Properly test for
10210 (vect_use_strided_gather_scatters_p): Likewise.
10211 (get_load_store_type): Handle emulated gathers and its
10213 (vectorizable_load): Likewise. Emulate them by extracting
10214 scalar offsets, doing scalar loads and a vector construct.
10216 2021-08-04 H.J. Lu <hjl.tools@gmail.com>
10219 * expr.c (op_by_pieces_d::op_by_pieces_d): Add a max_pieces
10220 argument to set m_max_size.
10221 (move_by_pieces_d): Pass MOVE_MAX_PIECES to op_by_pieces_d.
10222 (store_by_pieces_d): Pass STORE_MAX_PIECES to op_by_pieces_d.
10223 (compare_by_pieces_d): Pass COMPARE_MAX_PIECES to op_by_pieces_d.
10225 2021-08-04 Roger Sayle <roger@nextmovesoftware.com>
10226 Marc Glisse <marc.glisse@inria.fr>
10228 * match.pd (bit_ior, bit_xor): Canonicalize (X*C1)|(X*C2) and
10229 (X*C1)^(X*C2) as X*(C1+C2), and related variants, using
10230 tree_nonzero_bits to ensure that operands are bit-wise disjoint.
10232 2021-08-04 Richard Biener <rguenther@suse.de>
10234 * tree-ssa-forwprop.c (pass_forwprop::execute): Split
10235 out code to decompose vector loads ...
10236 (optimize_vector_load): ... here. Generalize it to
10237 handle intermediate widening and TARGET_MEM_REF loads
10238 and apply it to loads with a supported vector mode as well.
10240 2021-08-04 Richard Biener <rguenther@suse.de>
10242 PR tree-optimization/101756
10243 * tree-vect-slp.c (vectorizable_bb_reduc_epilogue): Make sure
10244 the result of the reduction epilogue is compatible to the original
10247 2021-08-04 liuhongt <hongtao.liu@intel.com>
10250 * config/i386/i386.md (peephole2): Refine predicate from
10251 register_operand to general_reg_operand.
10253 2021-08-04 Aldy Hernandez <aldyh@redhat.com>
10255 * gimple-range-path.h (path_range_query::dump): Mark override.
10257 2021-08-04 Richard Biener <rguenther@suse.de>
10259 PR tree-optimization/101769
10260 * tree-tailcall.c (eliminate_tail_call): Add the created loop
10261 for the first recursion and return it via the new output parameter.
10262 (optimize_tail_call): Pass through new output param.
10263 (tree_optimize_tail_calls_1): After creating all latches,
10264 add the created loop to the loop tree. Do not mark loops for fixup.
10266 2021-08-04 Martin Liska <mliska@suse.cz>
10268 * doc/invoke.texi: Document threader-mode param.
10270 2021-08-04 liuhongt <hongtao.liu@intel.com>
10272 * config/i386/sse.md (cond_fma<mode>): New expander.
10273 (cond_fms<mode>): Ditto.
10274 (cond_fnma<mode>): Ditto.
10275 (cond_fnms<mode>): Ditto.
10277 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
10279 * config/rs6000/vsx.md (*vsx_le_perm_store_<mode>): Use && instead of &.
10281 2021-08-03 Segher Boessenkool <segher@kernel.crashing.org>
10283 * config/rs6000/constraints.md: Remove "e" from the list of available
10284 constraint characters.
10286 2021-08-03 Eugene Rozenfeld <erozen@microsoft.com>
10288 PR gcov-profile/71672
10289 * auto-profile.c (afdo_indirect_call): Fix setup of the historgram value for indirect calls.
10291 2021-08-03 Paul A. Clarke <pc@us.ibm.com>
10293 * config/rs6000/smmintrin.h (_mm_minpos_epu16): New.
10295 2021-08-03 H.J. Lu <hjl.tools@gmail.com>
10297 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): In 64-bit mode,
10298 try XMM31 to avoid vzeroupper.
10300 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10302 * doc/invoke.texi: Document -mtune=neoverse-512tvb and
10303 -mcpu=neoverse-512tvb.
10304 * config/aarch64/aarch64-cores.def (neoverse-512tvb): New entry.
10305 * config/aarch64/aarch64-tune.md: Regenerate.
10306 * config/aarch64/aarch64.c (neoverse512tvb_sve_vector_cost)
10307 (neoverse512tvb_sve_issue_info, neoverse512tvb_vec_issue_info)
10308 (neoverse512tvb_vector_cost, neoverse512tvb_tunings): New structures.
10309 (aarch64_adjust_body_cost_sve): Handle -mtune=neoverse-512tvb.
10310 (aarch64_adjust_body_cost): Likewise.
10312 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10314 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Only
10315 record issue information for operations that occur in the
10318 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10320 * config/aarch64/aarch64.c (aarch64_multiply_add_p): Add a vec_flags
10321 parameter. Detect cases in which an Advanced SIMD MLA would almost
10322 certainly require a MOV.
10323 (aarch64_count_ops): Update accordingly.
10325 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10327 * config/aarch64/aarch64.c (aarch64_is_store_elt_extraction): New
10328 function, split out from...
10329 (aarch64_detect_vector_stmt_subtype): ...here.
10330 (aarch64_add_stmt_cost): Treat extracting element 0 as free.
10332 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10334 * config/aarch64/aarch64-protos.h (sve_vec_cost):
10335 Add gather_load_x32_cost and gather_load_x64_cost.
10336 * config/aarch64/aarch64.c (generic_sve_vector_cost)
10337 (a64fx_sve_vector_cost, neoversev1_sve_vector_cost): Update
10338 accordingly, using the values given by the scalar_load * number
10339 of elements calculation that we used previously.
10340 (aarch64_detect_vector_stmt_subtype): Use the new fields.
10342 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10344 * config/aarch64/aarch64.c (aarch64_adjust_body_cost_sve): New
10345 function, split out from...
10346 (aarch64_adjust_body_cost): ...here.
10348 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10350 * config/aarch64/fractional-cost.h: New file.
10351 * config/aarch64/aarch64.c: Include <algorithm> (indirectly)
10352 and cost_fraction.h.
10353 (vec_cost_fraction): New typedef.
10354 (aarch64_detect_scalar_stmt_subtype): Use it for statement costs.
10355 (aarch64_detect_vector_stmt_subtype): Likewise.
10356 (aarch64_sve_adjust_stmt_cost, aarch64_adjust_stmt_cost): Likewise.
10357 (aarch64_estimate_min_cycles_per_iter): Use vec_cost_fraction
10359 (aarch64_adjust_body_cost): Likewise.
10360 (aarch64_test_cost_fraction): New function.
10361 (aarch64_run_selftests): Call it.
10363 2021-08-03 Richard Sandiford <richard.sandiford@arm.com>
10365 * config/aarch64/aarch64-protos.h (tune_params::sve_width): Turn
10367 * config/aarch64/aarch64.c (aarch64_cmp_autovec_modes): Update
10369 (aarch64_estimated_poly_value): Likewise. Use the least significant
10370 set bit for the minimum and likely values. Use the most significant
10371 set bit for the maximum value.
10373 2021-08-03 liuhongt <hongtao.liu@intel.com>
10375 * config/i386/sse.md (cond_<insn><mode>): New expander.
10376 (cond_mul<mode>): Ditto.
10378 2021-08-03 Kewen Lin <linkw@linux.ibm.com>
10380 * tree-cfg.c (move_sese_region_to_fn): Fix typos on dloop.
10382 2021-08-03 liuhongt <hongtao.liu@intel.com>
10384 * config/i386/sse.md (cond_<insn><mode>):New expander.
10385 (cond_mul<mode>): Ditto.
10386 (cond_div<mode>): Ditto.
10388 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
10390 * config/i386/i386.c (ix86_finalize_stack_frame_flags): Also
10391 check stack_realign_needed for stack realignment.
10392 (ix86_legitimate_constant_p): Always allow CONST_WIDE_INT smaller
10393 than the largest integer supported by vector register.
10394 * config/i386/i386.h (MAX_MOVE_MAX): New. Set to 64.
10395 (MOVE_MAX): Set to bytes of the largest integer supported by
10397 (STORE_MAX_PIECES): New.
10399 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
10401 * config/i386/i386-expand.c (ix86_expand_vector_move): Call
10402 ix86_gen_scratch_sse_rtx to get a scratch SSE register to copy
10403 data from one memory location to another.
10405 2021-08-02 H.J. Lu <hjl.tools@gmail.com>
10407 PR middle-end/90773
10408 * config/i386/i386.c (TARGET_GEN_MEMSET_SCRATCH_RTX): New.
10410 2021-08-02 Aldy Hernandez <aldyh@redhat.com>
10412 PR tree-optimization/101724
10413 * params.opt: Remove --param=threader-iterative.
10414 * tree-ssa-threadbackward.c (pass_thread_jumps::execute): Remove
10417 2021-08-02 Tom de Vries <tdevries@suse.de>
10419 PR middle-end/101665
10420 * doc/extend.texi (nonnull attribute): Improve documentation.
10422 2021-08-02 Andrew Pinski <apinski@marvell.com>
10424 PR rtl-optimization/101683
10425 * rtlanal.c (may_trap_p_1): Handle UNSIGNED_FIX.
10427 2021-08-02 Roger Sayle <roger@nextmovesoftware.com>
10429 * tree-ssa-phiopt.c (cond_removal_in_builtin_zero_pattern):
10430 Renamed from cond_removal_in_popcount_clz_ctz_pattern.
10431 Add support for BSWAP, FFS, PARITY and CLRSB builtins.
10432 (tree_ssa_phiop_worker): Update call to function above.
10434 2021-08-01 H.J. Lu <hjl.tools@gmail.com>
10437 * config/i386/i386.md (bsr_rex64_1_zext): New.
10438 (combine splitter for constant - clzll): Replace gen_bsr_rex64_1
10439 with gen_bsr_rex64_1_zext.
10441 2021-07-31 Jakub Jelinek <jakub@redhat.com>
10444 * config/i386/i386.md (bsr_rex64_1, bsr_1, bsr_zext_1): New
10445 define_insn patterns.
10446 (*bsr_rex64_2, *bsr_2): New define_insn_and_split patterns.
10447 Add combine splitters for constant - clz.
10448 (clz<mode>2): Use a temporary pseudo for bsr result.
10450 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
10452 * config/rs6000/smmintrin.h (_mm_floor_pd, _mm_floor_ps,
10453 _mm_floor_sd, _mm_floor_ss): New.
10455 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
10457 * config/rs6000/smmintrin.h (_mm_ceil_pd, _mm_ceil_ps,
10458 _mm_ceil_sd, _mm_ceil_ss): New.
10460 2021-07-30 Paul A. Clarke <pc@us.ibm.com>
10462 * config/rs6000/smmintrin.h (_mm_blend_pd, _mm_blendv_pd,
10463 _mm_blend_ps, _mm_blendv_ps): New.
10465 2021-07-30 Roger Sayle <roger@nextmovesoftware.com>
10466 Uroš Bizjak <ubizjak@gmail.com>
10468 * config/i386/i386.md (*dec_cmov<mode>): New define_insn_and_split
10469 to generate a conditional move using the carry flag after sub $1.
10470 (peephole2): Eliminate a register-to-register move by inverting
10471 the condition of a conditional move.
10473 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
10475 * config/mmix/mmix.md ("call", "call_value", "*call_real")
10476 ("*call_value_real"): Don't generate rtx mentioning the generic
10477 operands 1 and 2 to "call", and similarly for "call_value".
10478 * config/mmix/mmix.c (mmix_print_operand_punct_valid_p)
10479 (mmix_print_operand): Use '!' instead of 'p'.
10481 2021-07-30 Hans-Peter Nilsson <hp@bitrange.com>
10483 * doc/md.texi (call): Correct information about operand 2.
10484 * config/mmix/mmix.md ("call", "call_value"): Remove fixed FIXMEs.
10486 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
10488 * range-op.cc (operator_trunc_mod::wi_fold): Fold constants.
10490 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
10492 * range-op.cc (operator_div::wi_fold): Return UNDEFINED for [0, 0] divisor.
10494 2021-07-30 Andrew MacLeod <amacleod@redhat.com>
10496 * gimple-range-cache.cc (*::set_bb_range): Change const basic_block to
10497 const_basic_block..
10498 (*::get_bb_range): Ditto.
10499 (*::bb_range_p): Ditto.
10500 * gimple-range-cache.h: Change prototypes.
10502 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
10504 PR middle-end/90773
10505 * builtins.c (builtin_memcpy_read_str): Change the mode argument
10506 from scalar_int_mode to fixed_size_mode.
10507 (builtin_strncpy_read_str): Likewise.
10508 (gen_memset_value_from_prev): New function.
10509 (builtin_memset_read_str): Change the mode argument from
10510 scalar_int_mode to fixed_size_mode. Use gen_memset_value_from_prev
10511 and support CONST_VECTOR.
10512 (builtin_memset_gen_str): Likewise.
10513 (try_store_by_multiple_pieces): Use by_pieces_constfn to declare
10515 * builtins.h (builtin_strncpy_read_str): Replace scalar_int_mode
10516 with fixed_size_mode.
10517 (builtin_memset_read_str): Likewise.
10518 * expr.c (widest_int_mode_for_size): Renamed to ...
10519 (widest_fixed_size_mode_for_size): Add a bool argument to
10520 indicate if QI vector mode can be used.
10521 (by_pieces_ninsns): Call widest_fixed_size_mode_for_size
10522 instead of widest_int_mode_for_size.
10523 (pieces_addr::adjust): Change the mode argument from
10524 scalar_int_mode to fixed_size_mode.
10525 (op_by_pieces_d): Make m_len read-only. Add a bool member,
10526 m_qi_vector_mode, to indicate that QI vector mode can be used.
10527 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
10528 initialize m_qi_vector_mode. Call widest_fixed_size_mode_for_size
10529 instead of widest_int_mode_for_size.
10530 (op_by_pieces_d::get_usable_mode): Change the mode argument from
10531 scalar_int_mode to fixed_size_mode. Call
10532 widest_fixed_size_mode_for_size instead of
10533 widest_int_mode_for_size.
10534 (op_by_pieces_d::smallest_fixed_size_mode_for_size): New member
10535 function to return the smallest integer or QI vector mode.
10536 (op_by_pieces_d::run): Call widest_fixed_size_mode_for_size
10537 instead of widest_int_mode_for_size. Call
10538 smallest_fixed_size_mode_for_size instead of
10539 smallest_int_mode_for_size.
10540 (store_by_pieces_d::store_by_pieces_d): Add a bool argument to
10541 indicate that QI vector mode can be used and pass it to
10542 op_by_pieces_d::op_by_pieces_d.
10543 (can_store_by_pieces): Call widest_fixed_size_mode_for_size
10544 instead of widest_int_mode_for_size. Pass memsetp to
10545 widest_fixed_size_mode_for_size to support QI vector mode.
10546 Allow all CONST_VECTORs for memset if vec_duplicate is supported.
10547 (store_by_pieces): Pass memsetp to
10548 store_by_pieces_d::store_by_pieces_d.
10549 (clear_by_pieces_1): Removed.
10550 (clear_by_pieces): Replace clear_by_pieces_1 with
10551 builtin_memset_read_str and pass true to store_by_pieces_d to
10552 support vector mode broadcast.
10553 (string_cst_read_str): Change the mode argument from
10554 scalar_int_mode to fixed_size_mode.
10555 * expr.h (by_pieces_constfn): Change scalar_int_mode to
10557 (by_pieces_prev): Likewise.
10558 * rtl.h (lowpart_subreg_regno): New.
10559 * rtlanal.c (lowpart_subreg_regno): New. A wrapper around
10560 simplify_subreg_regno.
10561 * target.def (gen_memset_scratch_rtx): New hook.
10562 * doc/tm.texi.in: Add TARGET_GEN_MEMSET_SCRATCH_RTX.
10563 * doc/tm.texi: Regenerated.
10565 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
10568 * config/mips/mips.c (mips_atomic_assign_expand_fenv): Use
10569 TARGET_EXPR instead of MODIFY_EXPR.
10571 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
10574 * config/mips/mips-protos.h (mips_expand_vec_cmp_expr): Declare.
10575 * config/mips/mips.c (mips_expand_vec_cmp_expr): New function.
10576 * config/mips/mips-msa.md (vec_cmp<MSA:mode><mode_i>): New
10578 (vec_cmpu<IMSA:mode><mode_i>): New expander.
10580 2021-07-30 H.J. Lu <hjl.tools@gmail.com>
10583 * config/i386/i386-options.c (ix86_option_override_internal):
10584 Don't enable LZCNT/POPCNT if they have been disabled explicitly.
10586 2021-07-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
10589 * config/arm/arm_neon.h (vld1_p64): Replace call to builtin by
10590 explicitly dereferencing __a.
10591 (vld1_s64): Likewise.
10592 (vld1_u64): Likewise.
10593 * config/arm/arm_neon_builtins.def (vld1): Remove entry for di
10594 and change to VAR13.
10596 2021-07-30 Aldy Hernandez <aldyh@redhat.com>
10598 * gimple-loop-versioning.cc (lv_dom_walker::lv_dom_walker): Remove
10599 use of m_range_analyzer.
10600 (loop_versioning::lv_dom_walker::before_dom_children): Same.
10601 (loop_versioning::lv_dom_walker::after_dom_children): Remove.
10602 (loop_versioning::prune_loop_conditions): Replace vr_values use
10603 with range_query interface.
10604 (pass_loop_versioning::execute): Use ranger.
10606 2021-07-30 Xi Ruoyao <xry111@mengyan1223.wang>
10609 * ipa-devirt.c (ipa_odr_read_section): Compare the precision of
10610 enum values, and emit a warning if they mismatch.
10612 2021-07-30 Kewen Lin <linkw@linux.ibm.com>
10614 * cfgloop.h (as_const): New function.
10615 (class loop_iterator): Rename to ...
10616 (class loops_list): ... this.
10617 (loop_iterator::next): Rename to ...
10618 (loops_list::Iter::fill_curr_loop): ... this and adjust.
10619 (loop_iterator::loop_iterator): Rename to ...
10620 (loops_list::loops_list): ... this and adjust.
10621 (loops_list::Iter): New class.
10622 (loops_list::iterator): New type.
10623 (loops_list::const_iterator): New type.
10624 (loops_list::begin): New function.
10625 (loops_list::end): Likewise.
10626 (loops_list::begin const): Likewise.
10627 (loops_list::end const): Likewise.
10628 (FOR_EACH_LOOP): Remove.
10629 (FOR_EACH_LOOP_FN): Remove.
10630 * cfgloop.c (flow_loops_dump): Adjust FOR_EACH_LOOP* with range-based
10631 for loop with loops_list instance.
10632 (sort_sibling_loops): Likewise.
10633 (disambiguate_loops_with_multiple_latches): Likewise.
10634 (verify_loop_structure): Likewise.
10635 * cfgloopmanip.c (create_preheaders): Likewise.
10636 (force_single_succ_latches): Likewise.
10637 * config/aarch64/falkor-tag-collision-avoidance.c
10638 (execute_tag_collision_avoidance): Likewise.
10639 * config/mn10300/mn10300.c (mn10300_scan_for_setlb_lcc): Likewise.
10640 * config/s390/s390.c (s390_adjust_loops): Likewise.
10641 * doc/loop.texi: Likewise.
10642 * gimple-loop-interchange.cc (pass_linterchange::execute): Likewise.
10643 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
10644 * gimple-loop-versioning.cc (loop_versioning::analyze_blocks): Likewise.
10645 (loop_versioning::make_versioning_decisions): Likewise.
10646 * gimple-ssa-split-paths.c (split_paths): Likewise.
10647 * graphite-isl-ast-to-gimple.c (graphite_regenerate_ast_isl): Likewise.
10648 * graphite.c (canonicalize_loop_form): Likewise.
10649 (graphite_transform_loops): Likewise.
10650 * ipa-fnsummary.c (analyze_function_body): Likewise.
10651 * ipa-pure-const.c (analyze_function): Likewise.
10652 * loop-doloop.c (doloop_optimize_loops): Likewise.
10653 * loop-init.c (loop_optimizer_finalize): Likewise.
10654 (fix_loop_structure): Likewise.
10655 * loop-invariant.c (calculate_loop_reg_pressure): Likewise.
10656 (move_loop_invariants): Likewise.
10657 * loop-unroll.c (decide_unrolling): Likewise.
10658 (unroll_loops): Likewise.
10659 * modulo-sched.c (sms_schedule): Likewise.
10660 * predict.c (predict_loops): Likewise.
10661 (pass_profile::execute): Likewise.
10662 * profile.c (branch_prob): Likewise.
10663 * sel-sched-ir.c (sel_finish_pipelining): Likewise.
10664 (sel_find_rgns): Likewise.
10665 * tree-cfg.c (replace_loop_annotate): Likewise.
10666 (replace_uses_by): Likewise.
10667 (move_sese_region_to_fn): Likewise.
10668 * tree-if-conv.c (pass_if_conversion::execute): Likewise.
10669 * tree-loop-distribution.c (loop_distribution::execute): Likewise.
10670 * tree-parloops.c (parallelize_loops): Likewise.
10671 * tree-predcom.c (tree_predictive_commoning): Likewise.
10672 * tree-scalar-evolution.c (scev_initialize): Likewise.
10673 (scev_reset): Likewise.
10674 * tree-ssa-dce.c (find_obviously_necessary_stmts): Likewise.
10675 * tree-ssa-live.c (remove_unused_locals): Likewise.
10676 * tree-ssa-loop-ch.c (ch_base::copy_headers): Likewise.
10677 * tree-ssa-loop-im.c (analyze_memory_references): Likewise.
10678 (tree_ssa_lim_initialize): Likewise.
10679 * tree-ssa-loop-ivcanon.c (canonicalize_induction_variables): Likewise.
10680 * tree-ssa-loop-ivopts.c (tree_ssa_iv_optimize): Likewise.
10681 * tree-ssa-loop-manip.c (get_loops_exits): Likewise.
10682 * tree-ssa-loop-niter.c (estimate_numbers_of_iterations): Likewise.
10683 (free_numbers_of_iterations_estimates): Likewise.
10684 * tree-ssa-loop-prefetch.c (tree_ssa_prefetch_arrays): Likewise.
10685 * tree-ssa-loop-split.c (tree_ssa_split_loops): Likewise.
10686 * tree-ssa-loop-unswitch.c (tree_ssa_unswitch_loops): Likewise.
10687 * tree-ssa-loop.c (gate_oacc_kernels): Likewise.
10688 (pass_scev_cprop::execute): Likewise.
10689 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
10690 * tree-ssa-sccvn.c (do_rpo_vn): Likewise.
10691 * tree-ssa-threadupdate.c
10692 (jump_thread_path_registry::thread_through_all_blocks): Likewise.
10693 * tree-vectorizer.c (vectorize_loops): Likewise.
10694 * tree-vrp.c (vrp_asserts::find_assert_locations): Likewise.
10696 2021-07-29 Hans-Peter Nilsson <hp@bitrange.com>
10698 * config/mmix/mmix.c (mmix_function_arg_1): Avoid
10699 generating a VOIDmode register for e.g the
10700 function_arg_info::end_marker.
10702 2021-07-29 Jeff Law <jeffreyalaw@gmail.com>
10704 * config/h8300/h8300-modes.def: Add CCZ, CCV and CCC, drop CCZNV.
10705 * config/h8300/h8300.md (H8cc mode iterator): Add CCZ.
10706 (cc mode_attr): Similarly.
10707 (ccz subst_attr): Similarly.
10708 * config/h8300/jumpcall.md: Add new patterns for branch-on-bit.
10709 * config/h8300/testcompare.md: Remove various cc0 based patterns
10710 that had been commented out. Add pattern to set CCZ from a bit
10713 2021-07-29 Thomas Schwinge <thomas@codesourcery.com>
10714 Julian Brown <julian@codesourcery.com>
10715 Kwok Cheung Yeung <kcy@codesourcery.com>
10717 * omp-offload.c (oacc_loop_xform_head_tail, oacc_loop_process):
10718 'update_stmt' after modification.
10719 (pass_oacc_loop_designation): New function, extracted out of...
10720 (pass_oacc_device_lower): ... this.
10721 (pass_data_oacc_loop_designation, pass_oacc_loop_designation)
10722 (make_pass_oacc_loop_designation): New
10723 * passes.def: Add it.
10724 * tree-parloops.c (create_parallel_loop): Adjust.
10725 * tree-pass.h (make_pass_oacc_loop_designation): New.
10727 2021-07-29 Aldy Hernandez <aldyh@redhat.com>
10729 * flag-types.h (enum threader_mode): New.
10730 * params.opt: Add entry for --param=threader-mode.
10731 * tree-ssa-threadbackward.c (THREADER_ITERATIVE_MODE): New.
10732 (class back_threader): New.
10733 (back_threader::back_threader): New.
10734 (back_threader::~back_threader): New.
10735 (back_threader::maybe_register_path): New.
10736 (back_threader::find_taken_edge): New.
10737 (back_threader::find_taken_edge_switch): New.
10738 (back_threader::find_taken_edge_cond): New.
10739 (back_threader::resolve_def): New.
10740 (back_threader::resolve_phi): New.
10741 (back_threader::find_paths_to_names): New.
10742 (back_threader::find_paths): New.
10745 (thread_jumps::find_jump_threads_backwards): Call ranger threader.
10746 (thread_jumps::find_jump_threads_backwards_with_ranger): New.
10747 (pass_thread_jumps::execute): Abstract out code...
10748 (try_thread_blocks): ...here.
10749 * tree-ssa-threadedge.c (jump_threader::thread_outgoing_edges):
10750 Abstract out threading candidate code to...
10751 (single_succ_to_potentially_threadable_block): ...here.
10752 * tree-ssa-threadedge.h (single_succ_to_potentially_threadable_block):
10754 * tree-ssa-threadupdate.c (register_jump_thread): Return boolean.
10755 * tree-ssa-threadupdate.h (class jump_thread_path_registry):
10756 Return bool from register_jump_thread.
10758 2021-07-29 Andreas Krebbel <krebbel@linux.ibm.com>
10760 * target.def: in0 and in1 do not need to be registers.
10761 * doc/tm.texi: Regenerate.
10763 2021-07-29 liuhongt <hongtao.liu@intel.com>
10766 * config/i386/i386.c (ix86_widen_mult_cost): New function.
10767 (ix86_add_stmt_cost): Use ix86_widen_mult_cost for
10770 2021-07-29 Jiufu Guo <guojiufu@linux.ibm.com>
10773 * config/rs6000/rs6000.c (TARGET_PREFERRED_DOLOOP_MODE): New hook.
10774 (rs6000_preferred_doloop_mode): New hook.
10775 * doc/tm.texi: Regenerate.
10776 * doc/tm.texi.in: Add hook preferred_doloop_mode.
10777 * target.def (preferred_doloop_mode): New hook.
10778 * targhooks.c (default_preferred_doloop_mode): New hook.
10779 * targhooks.h (default_preferred_doloop_mode): New hook.
10780 * tree-ssa-loop-ivopts.c (compute_doloop_base_on_mode): New function.
10781 (add_iv_candidate_for_doloop): Call targetm.preferred_doloop_mode
10782 and compute_doloop_base_on_mode.
10784 2021-07-28 Martin Sebor <msebor@redhat.com>
10786 PR middle-end/101494
10787 * tree-ssa-uninit.c (maybe_warn_operand): Correct object offset
10788 and size computation.
10790 2021-07-28 Martin Sebor <msebor@redhat.com>
10792 PR middle-end/101601
10793 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Remove
10795 Handle pointers to functions.
10797 2021-07-28 Martin Sebor <msebor@redhat.com>
10799 * Makefile.in (OBJS): Add gimple-ssa-warn-access.o and pointer-query.o.
10800 * attribs.h (fndecl_dealloc_argno): Move fndecl_dealloc_argno to tree.h.
10801 * builtins.c (compute_objsize_r): Move to pointer-query.cc.
10802 (access_ref::access_ref): Same.
10803 (access_ref::phi): Same.
10804 (access_ref::get_ref): Same.
10805 (access_ref::size_remaining): Same.
10806 (access_ref::offset_in_range): Same.
10807 (access_ref::add_offset): Same.
10808 (access_ref::inform_access): Same.
10809 (ssa_name_limit_t::visit_phi): Same.
10810 (ssa_name_limit_t::leave_phi): Same.
10811 (ssa_name_limit_t::next): Same.
10812 (ssa_name_limit_t::next_phi): Same.
10813 (ssa_name_limit_t::~ssa_name_limit_t): Same.
10814 (pointer_query::pointer_query): Same.
10815 (pointer_query::get_ref): Same.
10816 (pointer_query::put_ref): Same.
10817 (pointer_query::flush_cache): Same.
10818 (warn_string_no_nul): Move to gimple-ssa-warn-access.cc.
10819 (check_nul_terminated_array): Same.
10820 (unterminated_array): Same.
10821 (maybe_warn_for_bound): Same.
10822 (check_read_access): Same.
10823 (warn_for_access): Same.
10824 (get_size_range): Same.
10825 (check_access): Same.
10826 (gimple_call_alloc_size): Move to tree.c.
10827 (gimple_parm_array_size): Move to pointer-query.cc.
10828 (get_offset_range): Same.
10829 (gimple_call_return_array): Same.
10830 (handle_min_max_size): Same.
10831 (handle_array_ref): Same.
10832 (handle_mem_ref): Same.
10833 (compute_objsize): Same.
10834 (gimple_call_alloc_p): Move to gimple-ssa-warn-access.cc.
10835 (call_dealloc_argno): Same.
10836 (fndecl_dealloc_argno): Same.
10837 (new_delete_mismatch_p): Same.
10838 (matching_alloc_calls_p): Same.
10839 (warn_dealloc_offset): Same.
10840 (maybe_emit_free_warning): Same.
10841 * builtins.h (check_nul_terminated_array): Move to
10842 gimple-ssa-warn-access.h.
10843 (check_nul_terminated_array): Same.
10844 (warn_string_no_nul): Same.
10845 (unterminated_array): Same.
10846 (class ssa_name_limit_t): Same.
10847 (class pointer_query): Same.
10848 (struct access_ref): Same.
10849 (class range_query): Same.
10850 (struct access_data): Same.
10851 (gimple_call_alloc_size): Same.
10852 (gimple_parm_array_size): Same.
10853 (compute_objsize): Same.
10854 (class access_data): Same.
10855 (maybe_emit_free_warning): Same.
10856 * calls.c (initialize_argument_information): Remove call to
10857 maybe_emit_free_warning.
10858 * gimple-array-bounds.cc: Include new header..
10859 * gimple-fold.c: Same.
10860 * gimple-ssa-sprintf.c: Same.
10861 * gimple-ssa-warn-restrict.c: Same.
10862 * passes.def: Add pass_warn_access.
10863 * tree-pass.h (make_pass_warn_access): Declare.
10864 * tree-ssa-strlen.c: Include new headers.
10865 * tree.c (fndecl_dealloc_argno): Move here from builtins.c.
10866 * tree.h (fndecl_dealloc_argno): Move here from attribs.h.
10867 * gimple-ssa-warn-access.cc: New file.
10868 * gimple-ssa-warn-access.h: New file.
10869 * pointer-query.cc: New file.
10870 * pointer-query.h: New file.
10872 2021-07-28 Jakub Jelinek <jakub@redhat.com>
10874 PR middle-end/101624
10875 * ubsan.c (maybe_instrument_pointer_overflow,
10876 instrument_object_size): Only test DECL_REGISTER on VAR_DECLs,
10877 PARM_DECLs or RESULT_DECLs.
10878 * sanopt.c (maybe_optimize_ubsan_ptr_ifn): Likewise.
10880 2021-07-28 Jakub Jelinek <jakub@redhat.com>
10882 PR middle-end/101642
10883 * match.pd (bswap16 (x) == bswap16 (y)): Cast both operands
10884 to type of bswap16 for comparison.
10885 (bswap16 (x) == cst): Cast bswap16 operand to type of cst.
10887 2021-07-28 Richard Biener <rguenther@suse.de>
10889 PR tree-optimization/101615
10890 * tree-vect-slp.c (vect_optimize_slp): Materialize permutes
10891 at CTOR SLP graph entries.
10893 2021-07-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
10895 * config/aarch64/aarch64.md (*extend<SHORT:mode><GPI:mode>2_aarch64):
10896 Add "r,w" alternative.
10898 2021-07-28 H.J. Lu <hjl.tools@gmail.com>
10901 * config/i386/i386.c (ix86_avx_u128_mode_needed): Don't set
10902 AVX_U128_DIRTY when all bits are zero.
10904 2021-07-28 Richard Biener <rguenther@suse.de>
10906 PR tree-optimization/101615
10907 * tree-vect-slp.c (vect_optimize_slp): Pre-existing vector
10908 external nodes cannot be permuted so make them perm_out 0.
10910 2021-07-28 Andrew Stubbs <ams@codesourcery.com>
10913 * config.in: Regenerate.
10914 * config/gcn/gcn-hsa.h (A_FIJI): New define.
10915 (A_900): New define.
10916 (A_906): New define.
10917 (A_908): New define.
10918 (ASM_SPEC): Use A_FIJI, A_900, A_906 and A_908.
10919 * config/gcn/gcn.c (output_file_start): Adjust attributes according
10920 to the assembler capabilities.
10921 * config/gcn/mkoffload.c (main): Likewise.
10922 * configure: Regenerate.
10923 * configure.ac: Add tests for LLVM assembler attribute features.
10925 2021-07-28 Andrew MacLeod <amacleod@redhat.com>
10927 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Check for
10928 cond_false and cond_true on branches.
10930 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
10932 * config/aarch64/aarch64.c (aarch64_gen_adjusted_ldpstp): use
10935 2021-07-28 Bin Cheng <bin.cheng@linux.alibaba.com>
10937 * alias.c (init_alias_analysis): Don't skip prologue/epilogue.
10939 2021-07-28 Jakub Jelinek <jakub@redhat.com>
10942 * config/i386/sse.md (vashr<mode>3): Split into vashrv8di3 expander
10943 and vashrv4di3 expander, where the latter requires just TARGET_AVX2
10944 and has special !TARGET_AVX512VL expansion.
10945 (vashrv2di3<mask_name>): Rename to ...
10946 (vashrv2di3): ... this. Change condition to TARGET_XOP || TARGET_AVX2
10947 and add special !TARGET_XOP && !TARGET_AVX512VL expansion.
10949 2021-07-28 Martin Uecker <muecker@gwdg.de>
10951 * calls.c (maybe_warn_rdwr_sizes): Correct argument
10952 numbers in warning that were switched.
10954 2021-07-28 Kewen Lin <linkw@linux.ibm.com>
10956 PR tree-optimization/101596
10957 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Fix wrong check
10958 by using new_type's precision instead.
10960 2021-07-28 liuhongt <hongtao.liu@intel.com>
10963 * config/i386/i386.h (processor_costs): Add new member
10965 * config/i386/x86-tune-costs.h (ix86_size_cost, i386_cost,
10966 i486_cost, pentium_cost, lakemont_cost, pentiumpro_cost,
10967 geode_cost, k6_cost, athlon_cost, k8_cost, amdfam10_cost,
10968 bdver_cost, znver1_cost, znver2_cost, znver3_cost,
10969 btver1_cost, btver2_cost, btver3_cost, pentium4_cost,
10970 nocona_cost, atom_cost, atom_cost, slm_cost, intel_cost,
10971 generic_cost, core_cost): Initialize integer_to_sse same value
10973 (skylake_cost): Initialize integer_to_sse twice as much as sse_op.
10974 * config/i386/i386.c (ix86_builtin_vectorization_cost):
10975 Use integer_to_sse instead of sse_op to calculate the cost of
10978 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
10980 * config/rs6000/rs6000-gen-builtins.c (write_ovld_static_init): New
10982 (write_init_file): Call write_ovld_static_init.
10984 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
10986 * config/rs6000/rs6000-gen-builtins.c (write_bif_static_init): New
10988 (write_init_file): Call write_bif_static_init.
10990 2021-07-27 Bill Schmidt <wschmidt@linux.ibm.com>
10992 * config/rs6000/rs6000-gen-builtins.c (typemap): New struct.
10993 (TYPE_MAP_SIZE): New macro.
10994 (type_map): New initialized variable.
10995 (typemap_cmp): New function.
10996 (write_type_node): Likewise.
10997 (write_fntype_init): Implement.
10999 2021-07-27 Martin Sebor <msebor@redhat.com>
11001 PR tree-optimization/101584
11002 * tree-ssa-uninit.c (builtin_call_nomodifying_p): New function.
11003 (check_defs): Call it.
11005 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
11007 * tree-ssa-dom.c (dom_jump_threader_simplifier):
11008 Put avail_exprs_stack in the class, instead of passing it to
11009 jump_threader_simplifier.
11010 (dom_jump_threader_simplifier::simplify): Add state argument.
11011 (dom_opt_dom_walker): Add state.
11012 (pass_dominator::execute): Pass state to threader.
11013 (dom_opt_dom_walker::before_dom_children): Use state.
11014 * tree-ssa-threadedge.c (jump_threader::jump_threader): Replace
11015 arguments by state.
11016 (jump_threader::record_temporary_equivalences_from_phis):
11017 Register equivalences through the state variable.
11018 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
11019 Record ranges in a statement through the state variable.
11020 (jump_threader::simplify_control_stmt_condition): Pass state to
11022 (jump_threader::simplify_control_stmt_condition_1): Same.
11023 (jump_threader::thread_around_empty_blocks): Remove obsolete
11025 (jump_threader::thread_through_normal_block): Record equivalences
11026 on edge through the state variable.
11027 (jump_threader::thread_across_edge): Abstract state pushing.
11028 (jt_state::jt_state): New.
11029 (jt_state::push): New.
11030 (jt_state::pop): New.
11031 (jt_state::register_equiv): New.
11032 (jt_state::record_ranges_from_stmt): New.
11033 (jt_state::register_equivs_on_edge): New.
11034 (jump_threader_simplifier::jump_threader_simplifier): Move from
11036 (jump_threader_simplifier::simplify): Add state argument.
11037 * tree-ssa-threadedge.h (class jt_state): New.
11038 (class jump_threader): Add state to constructor.
11039 (class jump_threader_simplifier): Add state to simplify. Remove
11040 avail_exprs_stack from class.
11041 * tree-vrp.c (vrp_jump_threader_simplifier::simplify): Add state
11043 (vrp_jump_threader::vrp_jump_threader): Add state.
11044 (vrp_jump_threader::~vrp_jump_threader): Cleanup state.
11046 2021-07-27 Aldy Hernandez <aldyh@redhat.com>
11048 * Makefile.in (OBJS): Add gimple-range-path.o.
11049 * gimple-range-path.cc: New file.
11050 * gimple-range-path.h: New file.
11052 2021-07-27 Jonathan Wright <jonathan.wright@arm.com>
11054 * config/aarch64/aarch64-simd.md: Push sign/zero-extension
11055 inside vec_duplicate for all patterns.
11056 * simplify-rtx.c (simplify_context::simplify_unary_operation_1):
11057 Push sign/zero-extension inside vec_duplicate.
11059 2021-07-27 Richard Biener <rguenther@suse.de>
11061 PR tree-optimization/101573
11062 * tree-ssa-uninit.c (warn_uninit_phi_uses): New function
11063 looking at uninitialized PHI arg defs in some constrained cases.
11064 (warn_uninitialized_vars): Call it.
11065 (execute_early_warn_uninitialized): Calculate dominators.
11067 2021-07-27 Richard Biener <rguenther@suse.de>
11069 PR tree-optimization/39821
11070 * tree-vect-stmts.c (vect_model_promotion_demotion_cost): Use
11071 vector_stmt for widening arithmetic.
11072 (vectorizable_conversion): Adjust.
11074 2021-07-27 Martin Jambor <mjambor@suse.cz>
11076 * cgraph.h (ipa_replace_map): New field force_load_ref.
11077 * ipa-prop.h (ipa_param_descriptor): Reduce precision of move_cost,
11078 aded new flag load_dereferenced, adjusted comments.
11079 (ipa_get_param_dereferenced): New function.
11080 (ipa_set_param_dereferenced): Likewise.
11081 * cgraphclones.c (cgraph_node::create_virtual_clone): Follow it.
11082 * ipa-cp.c: Include gimple.h.
11083 (ipcp_discover_new_direct_edges): Take into account dereferenced flag.
11084 (get_replacement_map): New parameter force_load_ref, set the
11085 appropriate flag in ipa_replace_map if set.
11086 (struct symbol_and_index_together): New type.
11087 (adjust_refs_in_act_callers): New function.
11088 (adjust_references_in_caller): Likewise.
11089 (create_specialized_node): When appropriate, call
11090 adjust_references_in_caller and force only load references.
11091 * ipa-prop.c (load_from_dereferenced_name): New function.
11092 (ipa_analyze_controlled_uses): Also detect loads from a
11093 dereference, harden testing of call statements.
11094 (ipa_write_node_info): Stream the dereferenced flag.
11095 (ipa_read_node_info): Likewise.
11096 (ipa_set_jf_constant): Also create refdesc when jump function
11097 references a variable.
11098 (cgraph_node_for_jfunc): Rename to symtab_node_for_jfunc, work
11099 also on references of variables and return a symtab_node. Adjust
11101 (propagate_controlled_uses): Also remove references to VAR_DECLs.
11103 2021-07-27 Jakub Jelinek <jakub@redhat.com>
11105 PR middle-end/101586
11106 * gimple-fold.c (clear_padding_type): Ignore FIELD_DECLs with byte
11107 positions above or equal to sz except for diagnostics of flexible
11110 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
11112 PR tree-optimization/78888
11113 * gimple-range-fold.cc (get_letter_range): New.
11114 (fold_using_range::range_of_builtin_call): Call get_letter_range.
11116 2021-07-26 Andrew MacLeod <amacleod@redhat.com>
11118 PR tree-optimization/78888
11119 * gimple-range-fold.cc (fold_using_range::range_of_builtin_call): Add cases
11120 for CFN_BUILT_IN_TOUPPER and CFN_BUILT_IN_TOLOWER.
11122 2021-07-26 Roger Sayle <roger@nextmovesoftware.com>
11123 Marc Glisse <marc.glisse@inria.fr>
11125 * match.pd (rotate): Simplify equality/inequality of rotations.
11126 (bswap): Simplify equality/inequality tests of byte swapping.
11128 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
11130 * range-op.cc (operator_bitwise_xor::op1_op2_relation_effect):
11133 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
11135 * range-op.cc (operator_lshift::fold_range): Pass rel to
11136 base class fold_range.
11137 (operator_rshift::fold_range): Same.
11139 2021-07-26 Ashimida <ashimida@linux.alibaba.com>
11142 * toplev.h (min_align_loops_log): Remove declaration.
11143 (min_align_jumps_log, min_align_labels_log): Likewise.
11144 (min_align_functions_log): Likewise.
11146 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
11148 * tree-vrp.c (vrp_simplify_cond_using_ranges): Rename vr_values
11150 (execute_vrp): Abstract out simplification of conditionals...
11151 (simplify_casted_conds): ...here.
11153 2021-07-26 Aldy Hernandez <aldyh@redhat.com>
11155 * gimple-array-bounds.cc (array_bounds_checker::get_value_range):
11156 Add gimple argument.
11157 (array_bounds_checker::check_array_ref): Same.
11158 (array_bounds_checker::check_addr_expr): Same.
11159 (array_bounds_checker::check_array_bounds): Pass statement to
11160 check_array_bounds and check_addr_expr.
11161 * gimple-array-bounds.h (check_array_bounds): Add gimple argument.
11162 (check_addr_expr): Same.
11163 (get_value_range): Same.
11165 2021-07-26 Tamar Christina <tamar.christina@arm.com>
11167 * config/aarch64/aarch64-simd-builtins.def (sdot, udot): Rename to..
11168 (sdot_prod, udot_prod): ... This.
11169 * config/aarch64/aarch64-simd.md (aarch64_<sur>dot<vsi2qi>): Merged
11171 (<sur>dot_prod<vsi2qi>): ... this.
11172 (aarch64_<sur>dot_lane<vsi2qi>, aarch64_<sur>dot_laneq<vsi2qi>):
11173 Change operands order.
11174 (<sur>sadv16qi): Use new operands order.
11175 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32,
11176 vdotq_s32): Use new RTL ordering.
11178 2021-07-26 Tamar Christina <tamar.christina@arm.com>
11180 * config/aarch64/aarch64-builtins.c (TYPES_TERNOP_SUSS,
11181 aarch64_types_ternop_suss_qualifiers): New.
11182 * config/aarch64/aarch64-simd-builtins.def (usdot_prod): Use it.
11183 * config/aarch64/aarch64-simd.md (usdot_prod<vsi2qi>): Re-organize RTL.
11184 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Use it.
11186 2021-07-23 Jakub Jelinek <jakub@redhat.com>
11188 PR rtl-optimization/101562
11189 * expmed.c (store_integral_bit_field): Only use movstrict_optab
11190 if the operand isn't paradoxical.
11192 2021-07-23 Aldy Hernandez <aldyh@redhat.com>
11194 * gimple-array-bounds.h (class array_bounds_checker): Change
11195 ranges type to range_query.
11197 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11199 * config/aarch64/arm_neon.h (vst1_s64_x2): Use
11200 __builtin_memcpy instead of constructing
11201 __builtin_aarch64_simd_oi one vector at a time.
11202 (vst1_u64_x2): Likewise.
11203 (vst1_f64_x2): Likewise.
11204 (vst1_s8_x2): Likewise.
11205 (vst1_p8_x2): Likewise.
11206 (vst1_s16_x2): Likewise.
11207 (vst1_p16_x2): Likewise.
11208 (vst1_s32_x2): Likewise.
11209 (vst1_u8_x2): Likewise.
11210 (vst1_u16_x2): Likewise.
11211 (vst1_u32_x2): Likewise.
11212 (vst1_f16_x2): Likewise.
11213 (vst1_f32_x2): Likewise.
11214 (vst1_p64_x2): Likewise.
11215 (vst1q_s8_x2): Likewise.
11216 (vst1q_p8_x2): Likewise.
11217 (vst1q_s16_x2): Likewise.
11218 (vst1q_p16_x2): Likewise.
11219 (vst1q_s32_x2): Likewise.
11220 (vst1q_s64_x2): Likewise.
11221 (vst1q_u8_x2): Likewise.
11222 (vst1q_u16_x2): Likewise.
11223 (vst1q_u32_x2): Likewise.
11224 (vst1q_u64_x2): Likewise.
11225 (vst1q_f16_x2): Likewise.
11226 (vst1q_f32_x2): Likewise.
11227 (vst1q_f64_x2): Likewise.
11228 (vst1q_p64_x2): Likewise.
11230 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11232 * config/aarch64/arm_neon.h (vst1_s64_x3): Use
11233 __builtin_memcpy instead of constructing
11234 __builtin_aarch64_simd_ci one vector at a time.
11235 (vst1_u64_x3): Likewise.
11236 (vst1_f64_x3): Likewise.
11237 (vst1_s8_x3): Likewise.
11238 (vst1_p8_x3): Likewise.
11239 (vst1_s16_x3): Likewise.
11240 (vst1_p16_x3): Likewise.
11241 (vst1_s32_x3): Likewise.
11242 (vst1_u8_x3): Likewise.
11243 (vst1_u16_x3): Likewise.
11244 (vst1_u32_x3): Likewise.
11245 (vst1_f16_x3): Likewise.
11246 (vst1_f32_x3): Likewise.
11247 (vst1_p64_x3): Likewise.
11248 (vst1q_s8_x3): Likewise.
11249 (vst1q_p8_x3): Likewise.
11250 (vst1q_s16_x3): Likewise.
11251 (vst1q_p16_x3): Likewise.
11252 (vst1q_s32_x3): Likewise.
11253 (vst1q_s64_x3): Likewise.
11254 (vst1q_u8_x3): Likewise.
11255 (vst1q_u16_x3): Likewise.
11256 (vst1q_u32_x3): Likewise.
11257 (vst1q_u64_x3): Likewise.
11258 (vst1q_f16_x3): Likewise.
11259 (vst1q_f32_x3): Likewise.
11260 (vst1q_f64_x3): Likewise.
11261 (vst1q_p64_x3): Likewise.
11263 2021-07-23 H.J. Lu <hjl.tools@gmail.com>
11266 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): Don't return
11267 hard register when LRA is in progress.
11269 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11271 * config/aarch64/arm_neon.h (vst1_s8_x4): Use
11272 __builtin_memcpy instead of using a union.
11273 (vst1q_s8_x4): Likewise.
11274 (vst1_s16_x4): Likewise.
11275 (vst1q_s16_x4): Likewise.
11276 (vst1_s32_x4): Likewise.
11277 (vst1q_s32_x4): Likewise.
11278 (vst1_u8_x4): Likewise.
11279 (vst1q_u8_x4): Likewise.
11280 (vst1_u16_x4): Likewise.
11281 (vst1q_u16_x4): Likewise.
11282 (vst1_u32_x4): Likewise.
11283 (vst1q_u32_x4): Likewise.
11284 (vst1_f16_x4): Likewise.
11285 (vst1q_f16_x4): Likewise.
11286 (vst1_f32_x4): Likewise.
11287 (vst1q_f32_x4): Likewise.
11288 (vst1_p8_x4): Likewise.
11289 (vst1q_p8_x4): Likewise.
11290 (vst1_p16_x4): Likewise.
11291 (vst1q_p16_x4): Likewise.
11292 (vst1_s64_x4): Likewise.
11293 (vst1_u64_x4): Likewise.
11294 (vst1_p64_x4): Likewise.
11295 (vst1q_s64_x4): Likewise.
11296 (vst1q_u64_x4): Likewise.
11297 (vst1q_p64_x4): Likewise.
11298 (vst1_f64_x4): Likewise.
11299 (vst1q_f64_x4): Likewise.
11301 2021-07-23 Jonathan Wrightt <jonathan.wright@arm.com>
11303 * config/aarch64/arm_neon.h (vst2_s64): Use __builtin_memcpy
11304 instead of constructing __builtin_aarch64_simd_oi one vector
11306 (vst2_u64): Likewise.
11307 (vst2_f64): Likewise.
11308 (vst2_s8): Likewise.
11309 (vst2_p8): Likewise.
11310 (vst2_s16): Likewise.
11311 (vst2_p16): Likewise.
11312 (vst2_s32): Likewise.
11313 (vst2_u8): Likewise.
11314 (vst2_u16): Likewise.
11315 (vst2_u32): Likewise.
11316 (vst2_f16): Likewise.
11317 (vst2_f32): Likewise.
11318 (vst2_p64): Likewise.
11319 (vst2q_s8): Likewise.
11320 (vst2q_p8): Likewise.
11321 (vst2q_s16): Likewise.
11322 (vst2q_p16): Likewise.
11323 (vst2q_s32): Likewise.
11324 (vst2q_s64): Likewise.
11325 (vst2q_u8): Likewise.
11326 (vst2q_u16): Likewise.
11327 (vst2q_u32): Likewise.
11328 (vst2q_u64): Likewise.
11329 (vst2q_f16): Likewise.
11330 (vst2q_f32): Likewise.
11331 (vst2q_f64): Likewise.
11332 (vst2q_p64): Likewise.
11334 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11336 * config/aarch64/arm_neon.h (vst3_s64): Use __builtin_memcpy
11337 instead of constructing __builtin_aarch64_simd_ci one vector
11339 (vst3_u64): Likewise.
11340 (vst3_f64): Likewise.
11341 (vst3_s8): Likewise.
11342 (vst3_p8): Likewise.
11343 (vst3_s16): Likewise.
11344 (vst3_p16): Likewise.
11345 (vst3_s32): Likewise.
11346 (vst3_u8): Likewise.
11347 (vst3_u16): Likewise.
11348 (vst3_u32): Likewise.
11349 (vst3_f16): Likewise.
11350 (vst3_f32): Likewise.
11351 (vst3_p64): Likewise.
11352 (vst3q_s8): Likewise.
11353 (vst3q_p8): Likewise.
11354 (vst3q_s16): Likewise.
11355 (vst3q_p16): Likewise.
11356 (vst3q_s32): Likewise.
11357 (vst3q_s64): Likewise.
11358 (vst3q_u8): Likewise.
11359 (vst3q_u16): Likewise.
11360 (vst3q_u32): Likewise.
11361 (vst3q_u64): Likewise.
11362 (vst3q_f16): Likewise.
11363 (vst3q_f32): Likewise.
11364 (vst3q_f64): Likewise.
11365 (vst3q_p64): Likewise.
11367 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11369 * config/aarch64/arm_neon.h (vst4_s64): Use __builtin_memcpy
11370 instead of constructing __builtin_aarch64_simd_xi one vector
11372 (vst4_u64): Likewise.
11373 (vst4_f64): Likewise.
11374 (vst4_s8): Likewise.
11375 (vst4_p8): Likewise.
11376 (vst4_s16): Likewise.
11377 (vst4_p16): Likewise.
11378 (vst4_s32): Likewise.
11379 (vst4_u8): Likewise.
11380 (vst4_u16): Likewise.
11381 (vst4_u32): Likewise.
11382 (vst4_f16): Likewise.
11383 (vst4_f32): Likewise.
11384 (vst4_p64): Likewise.
11385 (vst4q_s8): Likewise.
11386 (vst4q_p8): Likewise.
11387 (vst4q_s16): Likewise.
11388 (vst4q_p16): Likewise.
11389 (vst4q_s32): Likewise.
11390 (vst4q_s64): Likewise.
11391 (vst4q_u8): Likewise.
11392 (vst4q_u16): Likewise.
11393 (vst4q_u32): Likewise.
11394 (vst4q_u64): Likewise.
11395 (vst4q_f16): Likewise.
11396 (vst4q_f32): Likewise.
11397 (vst4q_f64): Likewise.
11398 (vst4q_p64): Likewise.
11400 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11402 * config/aarch64/arm_neon.h (vtbx4_s8): Use __builtin_memcpy
11403 instead of constructing __builtin_aarch64_simd_oi one vector
11405 (vtbx4_u8): Likewise.
11406 (vtbx4_p8): Likewise.
11408 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11410 * config/aarch64/arm_neon.h (vtbl3_s8): Use __builtin_memcpy
11411 instead of constructing __builtin_aarch64_simd_oi one vector
11413 (vtbl3_u8): Likewise.
11414 (vtbl3_p8): Likewise.
11415 (vtbl4_s8): Likewise.
11416 (vtbl4_u8): Likewise.
11417 (vtbl4_p8): Likewise.
11419 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11421 * config/aarch64/arm_neon.h (vqtbx2_s8): Use __builtin_memcpy
11422 instead of constructing __builtin_aarch64_simd_oi one vector
11424 (vqtbx2_u8): Likewise.
11425 (vqtbx2_p8): Likewise.
11426 (vqtbx2q_s8): Likewise.
11427 (vqtbx2q_u8): Likewise.
11428 (vqtbx2q_p8): Likewise.
11429 (vqtbx3_s8): Use __builtin_memcpy instead of constructing
11430 __builtin_aarch64_simd_ci one vector at a time.
11431 (vqtbx3_u8): Likewise.
11432 (vqtbx3_p8): Likewise.
11433 (vqtbx3q_s8): Likewise.
11434 (vqtbx3q_u8): Likewise.
11435 (vqtbx3q_p8): Likewise.
11436 (vqtbx4_s8): Use __builtin_memcpy instead of constructing
11437 __builtin_aarch64_simd_xi one vector at a time.
11438 (vqtbx4_u8): Likewise.
11439 (vqtbx4_p8): Likewise.
11440 (vqtbx4q_s8): Likewise.
11441 (vqtbx4q_u8): Likewise.
11442 (vqtbx4q_p8): Likewise.
11444 2021-07-23 Jonathan Wright <jonathan.wright@arm.com>
11446 * config/aarch64/arm_neon.h (vqtbl2_s8): Use __builtin_memcpy
11447 instead of constructing __builtin_aarch64_simd_oi one vector
11449 (vqtbl2_u8): Likewise.
11450 (vqtbl2_p8): Likewise.
11451 (vqtbl2q_s8): Likewise.
11452 (vqtbl2q_u8): Likewise.
11453 (vqtbl2q_p8): Likewise.
11454 (vqtbl3_s8): Use __builtin_memcpy instead of constructing
11455 __builtin_aarch64_simd_ci one vector at a time.
11456 (vqtbl3_u8): Likewise.
11457 (vqtbl3_p8): Likewise.
11458 (vqtbl3q_s8): Likewise.
11459 (vqtbl3q_u8): Likewise.
11460 (vqtbl3q_p8): Likewise.
11461 (vqtbl4_s8): Use __builtin_memcpy instead of constructing
11462 __builtin_aarch64_simd_xi one vector at a time.
11463 (vqtbl4_u8): Likewise.
11464 (vqtbl4_p8): Likewise.
11465 (vqtbl4q_s8): Likewise.
11466 (vqtbl4q_u8): Likewise.
11467 (vqtbl4q_p8): Likewise.
11469 2021-07-23 Haochen Gui <guihaoc@gcc.gnu.org>
11472 * config/rs6000/rs6000.md (cstore<mode>4): Fix wrong fall through.
11474 2021-07-22 Andrew Pinski <apinski@marvell.com>
11476 PR tree-optimization/10153
11477 * tree-tailcall.c (create_tailcall_accumulator):
11478 Don't call fold_convert as the type should be correct already.
11479 (tree_optimize_tail_calls_1): Use build_{one,zero}_cst instead
11480 of integer_{one,zero}_node for the call of create_tailcall_accumulator.
11482 2021-07-22 Aldy Hernandez <aldyh@redhat.com>
11484 * gimple-range-cache.cc (non_null_ref::adjust_range): Replace
11485 varying_p check for null/non-null check.
11487 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
11489 PR tree-optimization/101511
11490 * value-relation.cc (relation_oracle::query_relation): Check if ssa1
11491 is in ssa2's equiv set, and don't trap if so.
11493 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
11495 PR tree-optimization/101497
11496 * gimple-range-fold.cc (fold_using_range::range_of_cond_expr): Check
11499 2021-07-22 Andrew MacLeod <amacleod@redhat.com>
11501 PR tree-optimization/101496
11502 * vr-values.c (simplify_using_ranges::fold_cond): Call range_of_stmt
11503 first, then vrp_visit_cond_Stmt.
11505 2021-07-22 liuhongt <hongtao.liu@intel.com>
11507 * config/i386/i386-expand.c
11508 (ix86_broadcast_from_integer_constant): Rename to ..
11509 (ix86_broadcast_from_constant): .. this, and extend it to
11511 (ix86_expand_vector_move): Extend to float mode.
11512 * config/i386/i386-features.c
11513 (replace_constant_pool_with_broadcast): Remove.
11514 (remove_partial_avx_dependency_gate): Ditto.
11515 (constant_pool_broadcast): Ditto.
11516 (class pass_constant_pool_broadcast): Ditto.
11517 (make_pass_constant_pool_broadcast): Ditto.
11518 (remove_partial_avx_dependency): Adjust gate.
11519 * config/i386/i386-passes.def: Remove pass_constant_pool_broadcast.
11520 * config/i386/i386-protos.h
11521 (make_pass_constant_pool_broadcast): Remove.
11523 2021-07-22 liuhongt <hongtao.liu@intel.com>
11525 * config/i386/constraints.md (Wb): New constraint.
11527 * config/i386/i386.md (*ashlhi3_1): Extend to avx512 mask
11529 (*ashlqi3_1): Ditto.
11530 (*<insn><mode>3_1): Split to ..
11531 (*ashr<mode>3_1): this, ...
11532 (*lshr<mode>3_1): and this, also extend this pattern to avx512
11534 (*<insn><mode>3_1): Split to ..
11535 (*ashr<mode>3_1): this, ...
11536 (*lshrqi3_1): and this, also extend this pattern to avx512
11538 (*lshrhi3_1): And this, also extend this pattern to avx512
11540 * config/i386/sse.md (k<code><mode>): New define_split after
11541 it to convert generic shift pattern to mask shift ones.
11543 2021-07-21 Thomas Schwinge <thomas@codesourcery.com>
11544 Joseph Myers <joseph@codesourcery.com>
11545 Cesar Philippidis <cesar@codesourcery.com>
11547 * tree-core.h (omp_clause_code): Add 'OMP_CLAUSE_NOHOST'.
11548 * tree.c (omp_clause_num_ops, omp_clause_code_name, walk_tree_1):
11550 * tree-pretty-print.c (dump_omp_clause): Likewise.
11551 * omp-general.c (oacc_verify_routine_clauses): Likewise.
11552 * gimplify.c (gimplify_scan_omp_clauses)
11553 (gimplify_adjust_omp_clauses): Likewise.
11554 * tree-nested.c (convert_nonlocal_omp_clauses)
11555 (convert_local_omp_clauses): Likewise.
11556 * omp-low.c (scan_sharing_clauses): Likewise.
11557 * omp-offload.c (execute_oacc_device_lower): Update.
11559 2021-07-21 Martin Sebor <msebor@redhat.com>
11561 * tree-ssa-alias.c (walk_aliased_vdefs_1): Fix typos in a comment.
11563 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11565 * config/rs6000/rs6000-gen-builtins.c (write_init_bif_table):
11568 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11570 * config/rs6000/rs6000-gen-builtins.c (write_fntype): New
11572 (write_fntype_init): New stub function.
11573 (write_init_bif_table): Likewise.
11574 (write_init_ovld_table): New function.
11575 (write_init_file): Implement.
11577 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11579 * config/rs6000/rs6000-gen-builtins.c
11580 (write_autogenerated_header): New function.
11581 (write_decls): Likewise.
11582 (write_extern_fntype): New callback function.
11583 (write_header_file): Implement.
11585 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11587 * config/rs6000/rs6000-gen-builtins.c (write_defines_file):
11590 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11592 * config/rs6000/rs6000-gen-builtins.c (complete_vector_type): New
11594 (complete_base_type): Likewise.
11595 (construct_fntype_id): Likewise.
11596 (parse_bif_entry): Call contruct_fntype_id.
11597 (parse_ovld_entry): Likewise.
11599 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11601 * config/rs6000/rs6000-gen-builtins.c (ovld_stanza): New struct.
11602 (MAXOVLDSTANZAS): New macro.
11603 (ovld_stanzas): New variable.
11604 (curr_ovld_stanza): Likewise.
11605 (MAXOVLDS): New macro.
11606 (ovlddata): New struct.
11607 (ovlds): New variable.
11608 (curr_ovld): Likewise.
11609 (max_ovld_args): Likewise.
11610 (parse_ovld_entry): New function.
11611 (parse_ovld_stanza): Likewise.
11612 (parse_ovld): Implement.
11614 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11616 * config/rs6000/rs6000-gen-builtins.c (parse_bif_attrs):
11619 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11621 * config/rs6000/rs6000-gen-builtins.c (parse_args): New function.
11622 (parse_prototype): Implement.
11624 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11626 * config/rs6000/rs6000-gen-builtins.c (bif_stanza): New enum.
11627 (curr_bif_stanza): New variable.
11628 (stanza_entry): New struct.
11629 (stanza_map): New initialized variable.
11630 (enable_string): Likewise.
11631 (fnkinds): New enum.
11632 (typelist): New struct.
11633 (attrinfo): Likewise.
11634 (MAXRESTROPNDS): New macro.
11635 (prototype): New struct.
11636 (MAXBIFS): New macro.
11637 (bifdata): New struct.
11638 (bifs): New variable.
11639 (curr_bif): Likewise.
11640 (bif_order): Likewise.
11641 (bif_index): Likewise.
11642 (fatal): New function.
11643 (stanza_name_to_stanza): Likewise.
11644 (parse_bif_attrs): New stub function.
11645 (parse_prototype): Likewise.
11646 (parse_bif_entry): New function.
11647 (parse_bif_stanza): Likewise.
11648 (parse_bif): Implement.
11649 (set_bif_order): New function.
11650 (create_bif_order): Implement.
11652 2021-07-21 Bill Schmidt <wschmidt@linux.ibm.com>
11654 * config/rs6000/rs6000-gen-builtins.c (rbtree.h): New #include.
11655 (num_bifs): New variable.
11656 (num_ovld_stanzas): Likewise.
11657 (num_ovlds): Likewise.
11658 (parse_codes): New enum.
11659 (bif_rbt): New variable.
11660 (ovld_rbt): Likewise.
11661 (fntype_rbt): Likewise.
11662 (bifo_rbt): Likewise.
11663 (parse_bif): New stub function.
11664 (create_bif_order): Likewise.
11665 (parse_ovld): Likewise.
11666 (write_header_file): Likewise.
11667 (write_init_file): Likewise.
11668 (write_defines_file): Likewise.
11669 (delete_output_files): New function.
11672 2021-07-21 H.J. Lu <hjl.tools@gmail.com>
11675 * config/i386/i386-builtin.def: Remove OPTION_MASK_ISA_SSE4_2
11676 from CRC32 _builtin functions.
11678 2021-07-21 Sebastian Huber <sebastian.huber@embedded-brains.de>
11680 * coverage.c (build_gcov_info_var_registration): Mark the object placed
11681 in the linker set as referenced so that it does not get optimized away.
11683 2021-07-21 Kito Cheng <kito.cheng@sifive.com>
11686 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
11688 * config.gcc (riscv*-*-*): Detect which python is available.
11690 2021-07-21 Jakub Jelinek <jakub@redhat.com>
11692 PR middle-end/101535
11693 * gimplify.c (omp_check_private): Properly skip ORT_TARGET_DATA
11694 contexts in which decl isn't privatized and for ORT_TARGET return
11695 false if decl is mapped.
11697 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
11699 * gimple-loop-jam.c: Include tree-ssa-sccvn.h.
11700 (tree_loop_unroll_and_jam): Run value-numbering on a loop that
11701 has been successfully unrolled.
11703 2021-07-21 Richard Sandiford <richard.sandiford@arm.com>
11705 * tree-ssa-loop-manip.c (determine_exit_conditions): Return a null
11706 exit condition if no tail loop is needed, and if the original exit
11707 condition should therefore be kept as-is.
11708 (tree_transform_and_unroll_loop): Handle that case here too.
11710 2021-07-21 Kewen Lin <linkw@linux.ibm.com>
11712 * tree-data-ref.c (free_dependence_relations): Adjust to pass vec
11714 (free_data_refs): Likewise.
11715 * tree-data-ref.h (free_dependence_relations): Likewise.
11716 (free_data_refs): Likewise.
11717 * tree-predcom.c (struct chain): Use auto_vec instead of vec for
11719 (struct component): Likewise.
11720 (pcom_worker::pcom_worker): Adjust for auto_vec and renaming changes.
11721 (pcom_worker::~pcom_worker): Likewise.
11722 (pcom_worker::release_chain): Adjust as auto_vec changes.
11723 (pcom_worker::loop): Rename to ...
11724 (pcom_worker::m_loop): ... this.
11725 (pcom_worker::datarefs): Rename to ...
11726 (pcom_worker::m_datarefs): ... this. Use auto_vec instead of vec.
11727 (pcom_worker::dependences): Rename to ...
11728 (pcom_worker::m_dependences): ... this. Use auto_vec instead of vec.
11729 (pcom_worker::chains): Rename to ...
11730 (pcom_worker::m_chains): ... this. Use auto_vec instead of vec.
11731 (pcom_worker::looparound_phis): Rename to ...
11732 (pcom_worker::m_looparound_phis): ... this. Use auto_vec instead of
11734 (pcom_worker::cache): Rename to ...
11735 (pcom_worker::m_cache): ... this. Use auto_vec instead of vec.
11736 (pcom_worker::release_chain): Adjust for auto_vec changes.
11737 (pcom_worker::release_chains): Adjust for auto_vec and renaming
11739 (release_component): Remove.
11740 (release_components): Adjust for release_component removal.
11741 (component_of): Adjust to use vec.
11742 (merge_comps): Likewise.
11743 (pcom_worker::aff_combination_dr_offset): Adjust for renaming changes.
11744 (pcom_worker::determine_offset): Likewise.
11745 (class comp_ptrs): Remove.
11746 (pcom_worker::split_data_refs_to_components): Adjust for renaming
11747 changes, for comp_ptrs removal with auto_vec.
11748 (pcom_worker::suitable_component_p): Adjust for renaming changes.
11749 (pcom_worker::filter_suitable_components): Adjust for release_component
11751 (pcom_worker::valid_initializer_p): Adjust for renaming changes.
11752 (pcom_worker::find_looparound_phi): Likewise.
11753 (pcom_worker::add_looparound_copies): Likewise.
11754 (pcom_worker::determine_roots_comp): Likewise.
11755 (pcom_worker::single_nonlooparound_use): Likewise.
11756 (pcom_worker::execute_pred_commoning_chain): Likewise.
11757 (pcom_worker::execute_pred_commoning): Likewise.
11758 (pcom_worker::try_combine_chains): Likewise.
11759 (pcom_worker::prepare_initializers_chain): Likewise.
11760 (pcom_worker::prepare_initializers): Likewise.
11761 (pcom_worker::prepare_finalizers_chain): Likewise.
11762 (pcom_worker::prepare_finalizers): Likewise.
11763 (pcom_worker::tree_predictive_commoning_loop): Likewise.
11765 2021-07-20 Martin Sebor <msebor@redhat.com>
11767 PR middle-end/101397
11768 * builtins.c (gimple_call_return_array): Add argument. Correct
11769 offsets for memchr, mempcpy, stpcpy, and stpncpy.
11770 (compute_objsize_r): Adjust offset computation for argument returning
11773 2021-07-20 Martin Sebor <msebor@redhat.com>
11775 PR middle-end/101300
11776 * tree-ssa-uninit.c (check_defs): Handle UBSAN built-ins.
11778 2021-07-20 Jeff Law <jlaw@localhost.localdomain>
11780 * function.c (assign_parm_setup_block): Use adjust_address instead
11781 of change_address to preserve MEM_EXPR and friends.
11783 2021-07-20 Martin Sebor <msebor@redhat.com>
11785 * cfgloop.h (single_likely_exit): Adjust by-value argument to
11786 by-const-reference.
11787 * cfgloopanal.c (single_likely_exit): Same.
11788 * cgraph.h (struct cgraph_node): Same.
11789 * cgraphclones.c (cgraph_node::create_virtual_clone): Same.
11790 * genautomata.c (merge_states): Same.
11791 * genextract.c (VEC_char_to_string): Same.
11792 * genmatch.c (dt_node::gen_kids_1): Same.
11793 (walk_captures): Adjust by-value argument to by-reference.
11794 * gimple-ssa-store-merging.c (check_no_overlap): Adjust by-value argument
11795 to by-const-reference.
11796 * gimple.c (gimple_build_call_vec): Same.
11797 (gimple_build_call_internal_vec): Same.
11798 (gimple_build_switch): Same.
11799 (sort_case_labels): Same.
11800 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
11802 * gimple.h (gimple_build_call_vec): Adjust by-value argument to
11803 by-const-reference.
11804 (gimple_build_call_internal_vec): Same.
11805 (gimple_build_switch): Same.
11806 (sort_case_labels): Same.
11807 (preprocess_case_label_vec_for_gimple): Adjust by-value argument to
11809 * haifa-sched.c (calc_priorities): Adjust by-value argument to
11810 by-const-reference.
11811 (sched_init_luids): Same.
11812 (haifa_init_h_i_d): Same.
11813 * ipa-cp.c (ipa_get_indirect_edge_target_1): Same.
11814 (adjust_callers_for_value_intersection): Adjust by-value argument to
11816 (find_more_scalar_values_for_callers_subset): Adjust by-value argument to
11817 by-const-reference.
11818 (find_more_contexts_for_caller_subset): Same.
11819 (find_aggregate_values_for_callers_subset): Same.
11820 (copy_useful_known_contexts): Same.
11821 * ipa-fnsummary.c (remap_edge_summaries): Same.
11822 (remap_freqcounting_predicate): Same.
11823 * ipa-inline.c (add_new_edges_to_heap): Adjust by-value argument to
11825 * ipa-predicate.c (predicate::remap_after_inlining): Adjust by-value argument
11826 to by-const-reference.
11827 * ipa-predicate.h (predicate::remap_after_inlining): Same.
11828 * ipa-prop.c (ipa_find_agg_cst_for_param): Same.
11829 * ipa-prop.h (ipa_find_agg_cst_for_param): Same.
11830 * ira-build.c (ira_loop_tree_body_rev_postorder): Same.
11831 * read-rtl.c (add_overload_instance): Same.
11832 * rtl.h (native_decode_rtx): Same.
11833 (native_decode_vector_rtx): Same.
11834 * sched-int.h (sched_init_luids): Same.
11835 (haifa_init_h_i_d): Same.
11836 * simplify-rtx.c (native_decode_vector_rtx): Same.
11837 (native_decode_rtx): Same.
11838 * tree-call-cdce.c (gen_shrink_wrap_conditions): Same.
11839 (shrink_wrap_one_built_in_call_with_conds): Same.
11840 (shrink_wrap_conditional_dead_built_in_calls): Same.
11841 * tree-data-ref.c (create_runtime_alias_checks): Same.
11842 (compute_all_dependences): Same.
11843 * tree-data-ref.h (compute_all_dependences): Same.
11844 (create_runtime_alias_checks): Same.
11845 (index_in_loop_nest): Same.
11846 * tree-if-conv.c (mask_exists): Same.
11847 * tree-loop-distribution.c (class loop_distribution): Same.
11848 (loop_distribution::create_rdg_vertices): Same.
11849 (dump_rdg_partitions): Same.
11850 (debug_rdg_partitions): Same.
11851 (partition_contains_all_rw): Same.
11852 (loop_distribution::distribute_loop): Same.
11853 * tree-parloops.c (oacc_entry_exit_ok_1): Same.
11854 (oacc_entry_exit_single_gang): Same.
11855 * tree-ssa-loop-im.c (hoist_memory_references): Same.
11856 (loop_suitable_for_sm): Same.
11857 * tree-ssa-loop-niter.c (bound_index): Same.
11858 * tree-ssa-reassoc.c (update_ops): Same.
11859 (swap_ops_for_binary_stmt): Same.
11860 (rewrite_expr_tree): Same.
11861 (rewrite_expr_tree_parallel): Same.
11862 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Same.
11863 * tree-ssa-sccvn.h (ao_ref_init_from_vn_reference): Same.
11864 * tree-ssa-structalias.c (process_all_all_constraints): Same.
11865 (make_constraints_to): Same.
11866 (handle_lhs_call): Same.
11867 (find_func_aliases_for_builtin_call): Same.
11868 (sort_fieldstack): Same.
11869 (check_for_overlaps): Same.
11870 * tree-vect-loop-manip.c (vect_create_cond_for_align_checks): Same.
11871 (vect_create_cond_for_unequal_addrs): Same.
11872 (vect_create_cond_for_lower_bounds): Same.
11873 (vect_create_cond_for_alias_checks): Same.
11874 * tree-vect-slp-patterns.c (vect_validate_multiplication): Same.
11875 * tree-vect-slp.c (vect_analyze_slp_instance): Same.
11876 (vect_make_slp_decision): Same.
11877 (vect_slp_bbs): Same.
11878 (duplicate_and_interleave): Same.
11879 (vect_transform_slp_perm_load): Same.
11880 (vect_schedule_slp): Same.
11881 * tree-vectorizer.h (vect_transform_slp_perm_load): Same.
11882 (vect_schedule_slp): Same.
11883 (duplicate_and_interleave): Same.
11884 * tree.c (build_vector_from_ctor): Same.
11885 (build_vector): Same.
11886 (check_vector_cst): Same.
11887 (check_vector_cst_duplicate): Same.
11888 (check_vector_cst_fill): Same.
11889 (check_vector_cst_stepped): Same.
11890 * tree.h (build_vector_from_ctor): Same.
11892 2021-07-20 Jakub Jelinek <jakub@redhat.com>
11895 * config/rs6000/rs6000-protos.h (easy_altivec_constant): Change return
11896 type from bool to int.
11897 * config/rs6000/rs6000.c (vspltis_constant): Fix up handling the
11898 EASY_VECTOR_MSB case if either step or copies is not 1.
11899 (vspltis_shifted): Fix comment typo.
11900 (easy_altivec_constant): Change return type from bool to int, instead
11901 of returning true return byte size of the element mode that should be
11902 used to synthetize the constant.
11903 * config/rs6000/predicates.md (easy_vector_constant_msb): Require
11904 that vspltis_shifted is 0, handle the case where easy_altivec_constant
11905 assumes using different vector mode from CONST_VECTOR's mode.
11906 * config/rs6000/altivec.md (easy_vector_constant_msb splitter): Use
11907 easy_altivec_constant to determine mode in which -1 >> -1 should be
11908 performed, use rs6000_expand_vector_init instead of gen_vec_initv4sisi.
11910 2021-07-20 Richard Biener <rguenther@suse.de>
11913 * dwarf2out.h (dwarf_file_data): Add key member.
11914 * dwarf2out.c (dwarf_file_hasher::equal): Compare key.
11915 (dwarf_file_hasher::hash): Hash key.
11916 (lookup_filename): Remap the filename and store it in the
11917 filename member of dwarf_file_data when creating a new
11919 (file_name_acquire): Do not remap the filename again.
11920 (maybe_emit_file): Likewise.
11922 2021-07-20 Jonathan Wright <jonathan.wright@arm.com>
11924 * config/aarch64/aarch64-simd-builtins.def: Use two variant
11925 generators for all TBL/TBX intrinsics and rename to
11926 consistent forms: qtbl[1234] or qtbx[1234].
11927 * config/aarch64/aarch64-simd.md (aarch64_tbl1<mode>):
11929 (aarch64_qtbl1<mode>): This.
11930 (aarch64_tbx1<mode>): Rename to...
11931 (aarch64_qtbx1<mode>): This.
11932 (aarch64_tbl2v16qi): Delete.
11933 (aarch64_tbl3<mode>): Rename to...
11934 (aarch64_qtbl2<mode>): This.
11935 (aarch64_tbx4<mode>): Rename to...
11936 (aarch64_qtbx2<mode>): This.
11937 * config/aarch64/aarch64.c (aarch64_expand_vec_perm_1): Use
11938 renamed qtbl1 and qtbl2 RTL patterns.
11939 * config/aarch64/arm_neon.h (vqtbl1_p8): Use renamed qtbl1
11941 (vqtbl1_s8): Likewise.
11942 (vqtbl1_u8): Likewise.
11943 (vqtbl1q_p8): Likewise.
11944 (vqtbl1q_s8): Likewise.
11945 (vqtbl1q_u8): Likewise.
11946 (vqtbx1_s8): Use renamed qtbx1 RTL pattern.
11947 (vqtbx1_u8): Likewise.
11948 (vqtbx1_p8): Likewise.
11949 (vqtbx1q_s8): Likewise.
11950 (vqtbx1q_u8): Likewise.
11951 (vqtbx1q_p8): Likewise.
11952 (vtbl1_s8): Use renamed qtbl1 RTL pattern.
11953 (vtbl1_u8): Likewise.
11954 (vtbl1_p8): Likewise.
11955 (vtbl2_s8): Likewise
11956 (vtbl2_u8): Likewise.
11957 (vtbl2_p8): Likewise.
11958 (vtbl3_s8): Use renamed qtbl2 RTL pattern.
11959 (vtbl3_u8): Likewise.
11960 (vtbl3_p8): Likewise.
11961 (vtbl4_s8): Likewise.
11962 (vtbl4_u8): Likewise.
11963 (vtbl4_p8): Likewise.
11964 (vtbx2_s8): Use renamed qtbx2 RTL pattern.
11965 (vtbx2_u8): Likewise.
11966 (vtbx2_p8): Likewise.
11967 (vqtbl2_s8): Use renamed qtbl2 RTL pattern.
11968 (vqtbl2_u8): Likewise.
11969 (vqtbl2_p8): Likewise.
11970 (vqtbl2q_s8): Likewise.
11971 (vqtbl2q_u8): Likewise.
11972 (vqtbl2q_p8): Likewise.
11973 (vqtbx2_s8): Use renamed qtbx2 RTL pattern.
11974 (vqtbx2_u8): Likewise.
11975 (vqtbx2_p8): Likewise.
11976 (vqtbx2q_s8): Likewise.
11977 (vqtbx2q_u8): Likewise.
11978 (vqtbx2q_p8): Likewise.
11979 (vtbx4_s8): Likewise.
11980 (vtbx4_u8): Likewise.
11981 (vtbx4_p8): Likewise.
11983 2021-07-20 Uroš Bizjak <ubizjak@gmail.com>
11986 * config/i386/sync.md (define_peephole2 atomic_storedi_fpu):
11988 (define_peephole2 atomic_loaddi_fpu): Ditto.
11990 2021-07-20 Kito Cheng <kito.cheng@sifive.com>
11992 * config.gcc (riscv*-*-*): Detect which python is available.
11994 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
11996 * config/rs6000/vsx.md (mulhs_<mode>): Rename to...
11997 (smul<mode>3_highpart): ... this.
11998 (mulhu_<mode>): Rename to...
11999 (umul<mode>3_highpart): ... this.
12000 * config/rs6000/rs6000-builtin.def (MULHS_V2DI, MULHS_V4SI,
12001 MULHU_V2DI, MULHU_V4SI): Adjust.
12003 2021-07-20 Kewen Lin <linkw@linux.ibm.com>
12005 PR tree-optimization/100696
12006 * internal-fn.c (first_commutative_argument): Add info for IFN_MULH.
12007 * internal-fn.def (IFN_MULH): New internal function.
12008 * tree-vect-patterns.c (vect_recog_mulhs_pattern): Add support to
12009 recog normal multiply highpart as IFN_MULH.
12010 * config/i386/i386.c (ix86_add_stmt_cost): Adjust for combined
12013 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
12015 * config/elfos.h (CTF_DEBUGGING_INFO): New definition.
12016 (BTF_DEBUGGING_INFO): Likewise.
12017 * doc/tm.texi.in: Document the new macros.
12018 * doc/tm.texi: Regenerated.
12019 * toplev.c: Guard initialization of debug hooks.
12021 2021-07-19 Indu Bhagat <indu.bhagat@oracle.com>
12023 * flags.h (ctf_debuginfo_p): New function declaration.
12024 * opts.c (ctf_debuginfo_p): New function definition.
12026 2021-07-19 Andrew Stubbs <ams@codesourcery.com>
12029 * config/gcn/gcn-hsa.h (DRIVER_SELF_SPECS): New.
12030 (ASM_SPEC): Set -mattr for xnack and sram-ecc.
12031 * config/gcn/gcn-opts.h (enum sram_ecc_type): New.
12032 * config/gcn/gcn-valu.md: Add a warning comment.
12033 * config/gcn/gcn.c (gcn_option_override): Add "sorry" for -mxnack.
12034 (output_file_start): Add xnack and sram-ecc state to ".amdgcn_target".
12035 * config/gcn/gcn.md: Add a warning comment.
12036 * config/gcn/gcn.opt: Add -mxnack and -msram-ecc.
12037 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): Remove
12039 (EF_AMDGPU_XNACK): New.
12040 (EF_AMDGPU_SRAM_ECC): New.
12042 (copy_early_debug_info): Use elf_flags.
12043 (main): Handle -mxnack and -msram-ecc options.
12044 * doc/invoke.texi: Document -mxnack and -msram-ecc.
12046 2021-07-19 Andrew Pinski <apinski@marvell.com>
12049 * config/aarch64/aarch64.md (csneg3_uxtw_insn): Rename to ...
12050 (*cs<neg_not_cs>3_uxtw_insn4): and extend to NEG_NOT.
12052 2021-07-19 Richard Biener <rguenther@suse.de>
12054 PR tree-optimization/101505
12055 * tree-vect-patterns.c (vect_determine_precisions): Walk
12056 PHIs also for loop vectorization.
12058 2021-07-19 Richard Biener <rguenther@suse.de>
12060 * gimple.h (gimple_expr_type): Remove.
12061 * doc/gimple.texi: Remove gimple_expr_type documentation.
12063 2021-07-19 Richard Biener <rguenther@suse.de>
12065 * tree-ssa-sccvn.c (vn_reference_eq): Handle NULL vr->type.
12066 (ao_ref_init_from_vn_reference): Likewise.
12067 (fully_constant_reference): Likewise.
12068 (vn_reference_lookup_call): Do not set vr->type to random
12070 * tree-ssa-pre.c (compute_avail): Do not try to PRE calls
12072 * tree-vect-generic.c (expand_vector_piecewise): Pass in
12073 whether we expanded parallel.
12074 (expand_vector_parallel): Adjust.
12075 (expand_vector_addition): Likewise.
12076 (expand_vector_comparison): Likewise.
12077 (expand_vector_operation): Likewise.
12078 (expand_vector_scalar_condition): Likewise.
12079 (expand_vector_conversion): Likewise.
12081 2021-07-19 Richard Biener <rguenther@suse.de>
12083 * tree-vrp.c (register_edge_assert_for_2): Use the
12085 (vrp_folder::fold_predicate_in): Likewise.
12086 * vr-values.c (gimple_assign_nonzero_p): Likewise.
12087 (vr_values::extract_range_from_comparison): Likewise.
12088 (vr_values::extract_range_from_ubsan_builtin): Use the
12089 type of the first operand.
12090 (vr_values::extract_range_basic): Push down type
12091 computation, use the appropriate LHS.
12092 (vr_values::extract_range_from_assignment): Use the
12095 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
12098 * common/config/i386/i386-common.c (ix86_handle_option): For
12099 -mgeneral-regs-only, enable the GPR only instructions which are
12100 enabled implicitly by SSE ISAs unless they have been disabled
12103 2021-07-18 H.J. Lu <hjl.tools@gmail.com>
12106 * config/i386/i386.c (ix86_check_avx_upper_stores): Moved before
12107 ix86_avx_u128_mode_needed.
12108 (ix86_avx_u128_mode_needed): Return AVX_U128_DIRTY if callee
12109 returns AVX register.
12111 2021-07-17 Jan Hubicka <hubicka@ucw.cz>
12113 * tree-ssa-structalias.c (handle_rhs_call): Support EAF_NOT_RETURNED.
12114 (handle_const_call): Liekise
12115 (handle_pure_call): Liekise
12117 2021-07-17 Andrew MacLeod <amacleod@redhat.com>
12119 PR tree-optimization/96542
12120 * range-op.cc (range_operator::wi_fold_in_parts): New.
12121 (range_operator::fold_range): Call wi_fold_in_parts.
12122 (operator_lshift::wi_fold): Fix broken lshift by [0,0].
12123 * range-op.h (wi_fold_in_parts): Add prototype.
12125 2021-07-16 David Malcolm <dmalcolm@redhat.com>
12127 * doc/analyzer.texi: Add __analyzer_dump_state.
12129 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12131 * config/rs6000/rbtree.c: New file.
12132 * config/rs6000/rbtree.h: New file.
12134 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12136 * config/rs6000/rs6000-gen-builtins.c (restriction): New enum.
12137 (typeinfo): Add restr field.
12138 (match_bracketed_pair): New function.
12139 (match_const_restriction): Implement.
12141 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12143 * config/rs6000/rs6000-gen-builtins.c (match_basetype): Implement.
12145 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12147 * config/rs6000/rs6000-gen-builtins.c (void_status): New enum.
12148 (basetype): Likewise.
12149 (typeinfo): Likewise.
12150 (handle_pointer): New function.
12151 (match_basetype): New stub function.
12152 (match_const_restriction): Likewise.
12153 (match_type): New function.
12155 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12157 * config/rs6000/rs6000-gen-builtins.c (consume_whitespace): New
12159 (advance_line): Likewise.
12160 (safe_inc_pos): Likewise.
12161 (match_identifier): Likewise.
12162 (match_integer): Likewise.
12163 (match_to_right_bracket): Likewise.
12165 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12167 * config/rs6000/rs6000-gen-builtins.c (bif_file): New variable.
12168 (ovld_file): Likewise.
12169 (header_file): Likewise.
12170 (init_file): Likewise.
12171 (defines_file): Likewise.
12172 (pgm_path): Likewise.
12173 (bif_path): Likewise.
12174 (ovld_path): Likewise.
12175 (header_path): Likewise.
12176 (init_path): Likewise.
12177 (defines_path): Likewise.
12178 (LINELEN): New macro.
12179 (linebuf): New variable.
12183 (bif_diag): New function.
12184 (ovld_diag): Likewise.
12186 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12188 * config/rs6000/rs6000-builtin-new.def: New.
12189 * config/rs6000/rs6000-overload.def: New.
12191 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12193 * config/rs6000/rs6000-gen-builtins.c: New.
12195 2021-07-16 Bill Schmidt <wschmidt@linux.ibm.com>
12197 * Makefile.in (EXTRA_GTYPE_DEPS): New variable.
12198 (s-gtype): Depend on EXTRA_GTYPE_DEPS.
12199 * gengtype-state.c (state_writer::write_state_file_list): Add a
12200 parameter to the fileslist expression for the number of build
12202 (read_state_files_list): Detect build headers and strip the
12203 initial "./" or ".\" from their names.
12204 * gengtype.c (build_headers): New global variable.
12205 (num_build_headers): Likewise.
12206 (open_base_files): Emit #include for each build header.
12207 (main): Detect and count build headers.
12208 * gengtype.h (build_headers): New extern variable.
12209 (num_build_headers): Likewise.
12211 2021-07-16 Richard Biener <rguenther@suse.de>
12213 * gimple-ssa-store-merging.c (verify_symbolic_number_p): Use
12214 the type of the LHS.
12215 (find_bswap_or_nop_1): Likewise.
12216 (find_bswap_or_nop): Likewise.
12217 * tree-vectorizer.h (vect_get_smallest_scalar_type): Adjust
12219 * tree-vect-data-refs.c (vect_get_smallest_scalar_type):
12220 Remove unused parameters, pass in the scalar type. Fix
12221 internal store function handling.
12222 * tree-vect-stmts.c (vect_analyze_stmt): Remove assert.
12223 (vect_get_vector_types_for_stmt): Move down check for
12224 existing vector stmt after we've determined a scalar type.
12225 Pass down the used scalar type to vect_get_smallest_scalar_type.
12226 * tree-vect-generic.c (expand_vector_condition): Use
12227 the type of the LHS.
12228 (expand_vector_scalar_condition): Likewise.
12229 (expand_vector_operations_1): Likewise.
12230 * tree-vect-patterns.c (vect_widened_op_tree): Likewise.
12231 (vect_recog_dot_prod_pattern): Likewise.
12232 (vect_recog_sad_pattern): Likewise.
12233 (vect_recog_widen_op_pattern): Likewise.
12234 (vect_recog_widen_sum_pattern): Likewise.
12235 (vect_recog_mixed_size_cond_pattern): Likewise.
12237 2021-07-16 Jan Hubicka <hubicka@ucw.cz>
12239 * ipa-modref.c (struct escape_entry): Use eaf_fleags_t.
12240 (dump_eaf_flags): Dump EAF_NOT_RETURNED
12241 (eaf_flags_useful_p): Use eaf_fleags_t; handle const functions
12242 and EAF_NOT_RETURNED.
12243 (modref_summary::useful_p): Likewise.
12244 (modref_summary_lto::useful_p): Likewise.
12245 (struct) modref_summary_lto: Use eaf_fleags_t.
12246 (deref_flags): Handle EAF_NOT_RETURNED.
12247 (struct escape_point): Use min_flags.
12248 (modref_lattice::init): Add EAF_NOT_RETURNED.
12249 (merge_call_lhs_flags): Ignore EAF_NOT_RETURNED functions
12250 (analyze_ssa_name_flags): Clear EAF_NOT_RETURNED on return;
12252 (analyze_parms): Also analyze const functions; update conition on
12254 (modref_write): Update streaming.
12255 (read_section): Update streaming.
12256 (remap_arg_flags): Use eaf_flags_t.
12257 (modref_merge_call_site_flags): Hanlde EAF_NOT_RETURNED.
12258 * ipa-modref.h: (eaf_flags_t): New typedef.
12259 (struct modref_summary): Use eaf_flags_t.
12260 * tree-core.h (EAF_NOT_RETURNED): New constant.
12262 2021-07-16 Richard Biener <rguenther@suse.de>
12264 * gimple-fold.c (gimple_fold_stmt_to_constant_1): Use
12265 the type of the LHS.
12266 (gimple_assign_nonnegative_warnv_p): Likewise.
12267 (gimple_call_nonnegative_warnv_p): Likewise. Return false
12268 if the call has no LHS.
12269 * gimple.c (gimple_could_trap_p_1): Use the type of the LHS.
12270 * tree-eh.c (stmt_could_throw_1_p): Likewise.
12271 * tree-inline.c (insert_init_stmt): Likewise.
12272 * tree-ssa-loop-niter.c (get_val_for): Likewise.
12273 * tree-outof-ssa.c (ssa_is_replaceable_p): Use the type of
12275 * tree-ssa-sccvn.c (init_vn_nary_op_from_stmt): Take a
12276 gassign *. Use the type of the lhs.
12277 (vn_nary_op_lookup_stmt): Adjust.
12278 (vn_nary_op_insert_stmt): Likewise.
12280 2021-07-16 Ilya Leoshkevich <iii@linux.ibm.com>
12282 * config/s390/predicates.md (bras_sym_operand): Accept all
12283 functions in 64-bit mode, use UNSPEC_PLT31.
12284 (larl_operand): Use UNSPEC_PLT31.
12285 * config/s390/s390.c (s390_loadrelative_operand_p): Likewise.
12286 (legitimize_pic_address): Likewise.
12287 (s390_emit_tls_call_insn): Mark __tls_get_offset as function,
12289 (s390_delegitimize_address): Use UNSPEC_PLT31.
12290 (s390_output_addr_const_extra): Likewise.
12291 (print_operand): Add @PLT to TLS calls, handle %K.
12292 (s390_function_profiler): Mark __fentry__/_mcount as function,
12293 use %K, use UNSPEC_PLT31.
12294 (s390_output_mi_thunk): Use only UNSPEC_GOT, use %K.
12295 (s390_emit_call): Use UNSPEC_PLT31.
12296 (s390_emit_tpf_eh_return): Mark __tpf_eh_return as function.
12297 * config/s390/s390.md (UNSPEC_PLT31): Rename from UNSPEC_PLT.
12298 (*movdi_64): Use %K.
12299 (reload_base_64): Likewise.
12300 (*sibcall_brc): Likewise.
12301 (*sibcall_brcl): Likewise.
12302 (*sibcall_value_brc): Likewise.
12303 (*sibcall_value_brcl): Likewise.
12305 (*brasl): Likewise.
12306 (*bras_r): Likewise.
12307 (*brasl_r): Likewise.
12308 (*bras_tls): Likewise.
12309 (*brasl_tls): Likewise.
12310 (main_base_64): Likewise.
12311 (reload_base_64): Likewise.
12312 (@split_stack_call<mode>): Likewise.
12314 2021-07-16 Richard Biener <rguenther@suse.de>
12316 PR tree-optimization/101467
12317 * tree-vect-stmts.c (vect_gen_while): Properly guard
12318 make_temp_ssa_name usage.
12320 2021-07-16 Cooper Qu <cooper.qu@linux.alibaba.com>
12322 * config.gcc: Don't use forked print-sysroot-suffix.sh and
12323 t-sysroot-suffix for C-SKY.
12324 * config/csky/print-sysroot-suffix.sh: Delete.
12325 * config/csky/t-csky-linux: Delete.
12326 * config/csky/t-sysroot-suffix: Define MULTILIB_DIRNAMES
12327 instead of CSKY_MULTILIB_DIRNAMES.
12329 2021-07-16 Richard Biener <rguenther@suse.de>
12331 * tree-vect-loop.c (vect_transform_cycle_phi): Correct sign
12332 conversion issues with the partial reduction of the reused
12333 vector accumulator.
12335 2021-07-16 Richard Biener <rguenther@suse.de>
12337 * config/i386/i386-options.c (ix86_option_override_internal): Set
12338 param_vect_partial_vector_usage to zero if not set.
12340 2021-07-15 Uroš Bizjak <ubizjak@gmail.com>
12343 * config/i386/i386.h (VALID_SSE_REG_MODE): Add TDmode.
12344 (VALID_INT_MODE_P): Add SDmode and DDmode.
12345 Add TDmode for TARGET_64BIT.
12346 (VALID_DFP_MODE_P): Remove.
12347 * config/i386/i386.c (ix86_hard_regno_mode_ok):
12348 Do not use VALID_DFP_MODE_P.
12350 2021-07-15 Andrew MacLeod <amacleod@redhat.com>
12352 * gimple-range-fold.cc (adjust_pointer_diff_expr): Use
12354 (fold_using_range::fold_stmt): Ditto.
12355 (fold_using_range::range_of_range_op): Ditto.
12356 (fold_using_range::range_of_phi): Ditto.
12357 (fold_using_range::range_of_call): Ditto.
12358 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
12359 (fold_using_range::range_of_builtin_call): Ditto.
12360 (fold_using_range::range_of_cond_expr): Ditto.
12361 * gimple-range-fold.h (gimple_range_type): New.
12363 2021-07-15 Martin Sebor <msebor@redhat.com>
12365 PR middle-end/97027
12366 * tree-ssa-strlen.c (handle_assign): New function.
12367 (maybe_warn_overflow): Add argument.
12368 (nonzero_bytes_for_type): New function.
12369 (count_nonzero_bytes): Handle more tree types. Call
12370 nonzero_bytes_for_tye.
12371 (count_nonzero_bytes): Handle types.
12372 (handle_store): Handle stores from function calls.
12373 (strlen_check_and_optimize_call): Move code to handle_assign. Call
12374 it for assignments from function calls.
12376 2021-07-15 David Malcolm <dmalcolm@redhat.com>
12381 * doc/invoke.texi: Add -Wanalyzer-use-of-uninitialized-value.
12383 2021-07-15 David Malcolm <dmalcolm@redhat.com>
12385 * doc/invoke.texi (-fdump-analyzer-exploded-paths): New.
12387 2021-07-15 Martin Sebor <msebor@redhat.com>
12391 * fold-const.c (operand_compare::operand_equal_p): Handle OEP_DECL_NAME.
12392 (operand_compare::verify_hash_value): Same.
12393 * tree-core.h (OEP_DECL_NAME): New.
12395 2021-07-15 Martin Jambor <mjambor@suse.cz>
12397 * profile-count.h (profile_count::value): Change the return type to
12399 * gimple-pretty-print.c (dump_gimple_bb_header): Adjust print
12401 * tree-cfg.c (dump_function_to_file): Likewise.
12403 2021-07-15 Bill Schmidt <wschmidt@linux.ibm.com>
12406 * config/rs6000/rs6000-p8swap.c (has_part_mult): New.
12407 (rs6000_analyze_swaps): Insns containing a subreg of a mult are
12410 2021-07-15 Richard Biener <rguenther@suse.de>
12412 * tree-vectorizer.h (vect_gen_while): Match up with
12413 vect_gen_while_not.
12414 * tree-vect-stmts.c (vect_gen_while): Adjust API to that
12415 of vect_gen_while_not.
12416 (vect_gen_while_not): Adjust.
12417 * tree-vect-loop-manip.c (vect_set_loop_controls_directly): Likewise.
12419 2021-07-15 Aldy Hernandez <aldyh@redhat.com>
12421 * gimple-range-cache.cc (non_null_ref::adjust_range): New.
12422 (ranger_cache::range_of_def): Call adjust_range.
12423 (ranger_cache::entry_range): Same.
12424 * gimple-range-cache.h (non_null_ref::adjust_range): New.
12425 * gimple-range.cc (gimple_ranger::range_of_expr): Call
12427 (gimple_ranger::range_on_entry): Same.
12429 2021-07-15 Tamar Christina <tamar.christina@arm.com>
12432 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12434 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
12436 2021-07-15 Tamar Christina <tamar.christina@arm.com>
12439 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12441 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
12442 (sdot_prod, udot_prod): ...These.
12443 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
12444 (aarch64_<sur>dot<vsi2qi>): Rename to...
12445 (<sur>dot_prod<vsi2qi>): ...This.
12446 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
12449 2021-07-15 Jakub Jelinek <jakub@redhat.com>
12451 PR middle-end/101437
12452 * gimplify.c (gimplify_expr): Throw away volatile reads from empty
12453 types even if they have non-BLKmode TYPE_MODE.
12455 2021-07-15 Richard Biener <rguenther@suse.de>
12458 * gcc.c (process_command): Process -gtoggle like process_options
12459 would after parsing options.
12461 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
12463 * cfgexpand.c (expand_asm_loc): Adjust.
12464 (expand_asm_stmt): Likewise.
12465 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Likewise.
12466 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
12467 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
12468 * config/avr/avr.c (avr_md_asm_adjust): Likewise.
12469 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
12470 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
12471 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
12472 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
12473 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
12474 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
12475 * config/s390/s390.c (s390_md_asm_adjust): Likewise.
12476 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
12477 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
12478 * doc/tm.texi: Regenerate.
12479 * target.def: Add location argument to md_asm_adjust.
12481 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
12483 * tree-diagnostic.c (diagnostic_report_current_function): Use the
12484 diagnostic's location, not input_location.
12486 2021-07-15 Trevor Saunders <tbsaunde@tbsaunde.org>
12488 * cfgexpand.c (tree_conflicts_with_clobbers_p): Pass location to
12490 (expand_asm_stmt): Likewise.
12492 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
12494 * config/rs6000/rs6000.c (adjacent_mem_locations): Return the lower
12495 addressed memory rtx, if any.
12496 (rs6000_split_multireg_move): Fix code formatting.
12497 Handle MMA build built-ins with operands in adjacent memory locations.
12499 2021-07-14 Peter Bergner <bergner@linux.ibm.com>
12501 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Move to later
12504 2021-07-14 Jason Merrill <jason@redhat.com>
12506 * sel-sched-ir.h (get_all_loop_exits): Use auto_vec.
12508 2021-07-14 Jason Merrill <jason@redhat.com>
12510 * doc/invoke.texi: -fdelete-dead-exceptions is on by default for
12513 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12515 * tree-vect-patterns.c (vect_recog_dot_prod_pattern):
12516 Remove erroneous line.
12518 2021-07-14 Andrew MacLeod <amacleod@redhat.com>
12520 * params.opt (param_evrp_mode): Change default.
12522 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12524 * config/aarch64/aarch64-simd-builtins.def (udot, sdot): Rename to...
12525 (sdot_prod, udot_prod): ...These.
12526 * config/aarch64/aarch64-simd.md (<sur>dot_prod<vsi2qi>): Remove.
12527 (aarch64_<sur>dot<vsi2qi>): Rename to...
12528 (<sur>dot_prod<vsi2qi>): ...This.
12529 * config/aarch64/arm_neon.h (vdot_u32, vdotq_u32, vdot_s32, vdotq_s32):
12532 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12534 * config/arm/neon.md (<sup>dot_prod<vsi2qi>): Drop statements.
12536 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12538 * doc/sourcebuild.texi (arm_v8_2a_i8mm_neon_hw): Document.
12540 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12542 * config/arm/neon.md (usdot_prod<vsi2qi>): New.
12544 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12546 * config/aarch64/aarch64-simd.md (aarch64_usdot<vsi2qi>): Rename to...
12547 (usdot_prod<vsi2qi>): ... This.
12548 * config/aarch64/aarch64-simd-builtins.def (usdot): Rename to...
12549 (usdot_prod): ...This.
12550 * config/aarch64/arm_neon.h (vusdot_s32, vusdotq_s32): Likewise.
12551 * config/aarch64/aarch64-sve.md (@aarch64_<sur>dot_prod<vsi2qi>):
12553 (@<sur>dot_prod<vsi2qi>): ...This.
12554 * config/aarch64/aarch64-sve-builtins-base.cc
12555 (svusdot_impl::expand): Use it.
12557 2021-07-14 Tamar Christina <tamar.christina@arm.com>
12559 * optabs.def (usdot_prod_optab): New.
12560 * doc/md.texi: Document it and clarify other dot prod optabs.
12561 * optabs-tree.h (enum optab_subtype): Add optab_vector_mixed_sign.
12562 * optabs-tree.c (optab_for_tree_code): Support usdot_prod_optab.
12563 * optabs.c (expand_widen_pattern_expr): Likewise.
12564 * tree-cfg.c (verify_gimple_assign_ternary): Likewise.
12565 * tree-vect-loop.c (vectorizable_reduction): Query dot-product kind.
12566 * tree-vect-patterns.c (vect_supportable_direct_optab_p): Take optional
12568 (vect_widened_op_tree): Optionally ignore
12570 (vect_recog_dot_prod_pattern): Support usdot_prod_optab.
12572 2021-07-14 H.J. Lu <hjl.tools@gmail.com>
12575 * config/i386/driver-i386.c (host_detect_local_cpu): Check
12576 "arch [32|64]" and "tune [32|64]" for 32-bit and 64-bit codegen.
12577 Enable UINTR only for 64-bit codegen.
12578 * config/i386/i386-options.c
12579 (ix86_option_override_internal::DEF_PTA): Skip PTA_UINTR if not
12581 * config/i386/i386.h (ARCH_ARG): New.
12582 (CC1_CPU_SPEC): Pass "[arch|tune] 32" for 32-bit codegen and
12583 "[arch|tune] 64" for 64-bit codegen.
12585 2021-07-14 Richard Biener <rguenther@suse.de>
12587 PR tree-optimization/101445
12588 * tree-vect-stmts.c (vectorizable_load): Do the gap adjustment
12589 of the IV in the correct direction for negative stride
12592 2021-07-14 Jakub Jelinek <jakub@redhat.com>
12595 * godump.c (godump_str_hash): New type.
12596 (godump_container::pot_dummy_types): Use string_hash instead of
12597 ptr_hash in the hash_set.
12599 2021-07-14 Richard Biener <rguenther@suse.de>
12601 * tree-vect-loop.c (vect_find_reusable_accumulator): Handle
12602 vector types where the old vector type has a multiple of
12603 the new vector type elements.
12604 (vect_create_partial_epilog): New function, split out from...
12605 (vect_create_epilog_for_reduction): ... here.
12606 (vect_transform_cycle_phi): Reduce the re-used accumulator
12607 to the new vector type.
12609 2021-07-14 Alexandre Oliva <oliva@adacore.com>
12611 * tree-ssa-alias.c (attr_fnspec::verify): Fix index in
12612 non-'t'-sized arg check.
12614 2021-07-14 Alexandre Oliva <oliva@adacore.com>
12616 * tree-cfg.c (cleanup_dead_labels_eh): Update
12617 post_landing_pad label upon change of landing pad block's
12619 (cleanup_dead_labels): Check that a removed label is not that
12622 2021-07-13 Jonathan Wright <jonathan.wright@arm.com>
12624 * combine.c (combine_simplify_rtx): Add vec_select -> subreg
12626 * config/aarch64/aarch64.md (*zero_extend<SHORT:mode><GPI:mode>2_aarch64):
12627 Add Neon to general purpose register case for zero-extend
12629 * config/arm/vfp.md (*arm_movsi_vfp): Remove "*" from *t -> r
12630 case to prevent some cases opting to go through memory.
12631 * cse.c (fold_rtx): Add vec_select -> subreg simplification.
12632 * rtl.c (rtvec_series_p): Define predicate to determine
12633 whether a vector contains a linear series of integers.
12634 * rtl.h (rtvec_series_p): Define.
12635 * rtlanal.c (vec_series_lowpart_p): Define predicate to
12636 determine if a vector selection is equivalent to the low part
12638 * rtlanal.h (vec_series_lowpart_p): Define.
12639 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
12640 Add vec_select -> subreg simplification.
12642 2021-07-13 Paul A. Clarke <pc@us.ibm.com>
12644 * config/rs6000/smmintrin.h (_mm_testz_si128, _mm_testc_si128,
12645 _mm_testnzc_si128, _mm_test_all_ones, _mm_test_all_zeros,
12646 _mm_test_mix_ones_zeros): New.
12648 2021-07-13 Roger Sayle <roger@nextmovesoftware.com>
12649 Richard Biener <rguenther@suse.de>
12651 * gimple.c (gimple_could_trap_p_1): Make S argument a
12652 "const gimple*". Preserve constness in call to
12653 gimple_asm_volatile_p.
12654 (gimple_could_trap_p): Make S argument a "const gimple*".
12655 * gimple.h (gimple_could_trap_p_1, gimple_could_trap_p):
12656 Update function prototypes.
12658 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12660 * tree-vectorizer.h (vect_reusable_accumulator): New structure.
12661 (_loop_vec_info::main_loop_edge): New field.
12662 (_loop_vec_info::skip_main_loop_edge): Likewise.
12663 (_loop_vec_info::skip_this_loop_edge): Likewise.
12664 (_loop_vec_info::reusable_accumulators): Likewise.
12665 (_stmt_vec_info::reduc_scalar_results): Likewise.
12666 (_stmt_vec_info::reused_accumulator): Likewise.
12667 (vect_get_main_loop_result): Declare.
12668 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize
12669 reduc_scalar_inputs.
12670 (vec_info::free_stmt_vec_info): Free reduc_scalar_inputs.
12671 * tree-vect-loop-manip.c (vect_get_main_loop_result): New function.
12672 (vect_do_peeling): Fill an epilogue loop's main_loop_edge,
12673 skip_main_loop_edge and skip_this_loop_edge fields.
12674 * tree-vect-loop.c (INCLUDE_ALGORITHM): Define.
12675 (vect_emit_reduction_init_stmts): New function.
12676 (get_initial_def_for_reduction): Use it.
12677 (get_initial_defs_for_reduction): Likewise. Change the vinfo
12678 parameter to a loop_vec_info.
12679 (vect_create_epilog_for_reduction): Store the scalar results
12680 in the reduc_info. If an epilogue loop is reusing an accumulator
12681 from the main loop, and if the epilogue loop can also be skipped,
12682 try to place the reduction code in the join block. Record
12683 accumulators that could potentially be reused by epilogue loops.
12684 (vect_transform_cycle_phi): When vectorizing epilogue loops,
12685 try to reuse accumulators from the main loop. Record the initial
12686 value in reduc_info for non-SLP reductions too.
12688 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12690 * tree-vect-loop.c (get_initial_def_for_reduction): Remove
12691 adjustment handling. Take the neutral value as an argument,
12692 in place of the code argument.
12693 (vect_transform_cycle_phi): Update accordingly. Handle the
12694 initial values of cond reductions separately from code reductions.
12695 Choose the adjustment here rather than in
12696 get_initial_def_for_reduction. Sink the splat of vec_initial_def.
12698 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12700 * tree-vect-loop.c (neutral_op_for_slp_reduction): Replace with...
12701 (neutral_op_for_reduction): ...this, providing a more general
12703 (vect_create_epilog_for_reduction): Update accordingly.
12704 (vectorizable_reduction): Likewise.
12705 (vect_transform_cycle_phi): Likewise.
12707 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12709 * tree-vect-loop.c (get_initial_def_for_reduction): Take the
12710 reduc_info instead of the original stmt_vec_info.
12711 (vect_transform_cycle_phi): Update accordingly.
12713 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12715 * tree-vect-loop.c (get_initial_defs_for_reduction): Take the
12716 reduc_info as an additional parameter.
12717 (vect_transform_cycle_phi): Update accordingly.
12719 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12721 * tree-vectorizer.h: Include tree-ssa-operands.h.
12722 (vect_phi_initial_value): New function.
12723 * tree-vect-loop.c (neutral_op_for_slp_reduction): Use it.
12724 (get_initial_defs_for_reduction, info_for_reduction): Likewise.
12725 (vect_create_epilog_for_reduction, vectorizable_reduction): Likewise.
12726 (vect_transform_cycle_phi, vectorizable_induction): Likewise.
12728 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12730 * tree-vect-loop.c (vect_create_epilog_for_reduction): Convert
12731 the phi results to vectype after creating them. Remove later
12732 conversion code that thus becomes redundant.
12734 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12736 * tree-vect-loop.c (vect_create_epilog_for_reduction): Replace
12737 the new_phis vector with a reduc_inputs vector. Combine handling
12738 of reduction chains and ncopies > 1.
12740 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12742 * tree-vect-loop.c (vect_create_epilog_for_reduction): Truncate
12743 scalar_results to group_size elements after reducing down from
12744 N*group_size elements. Construct an array_slice of the live-out
12745 stmts and assert that there is one stmt per scalar result.
12747 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12749 * tree-vect-loop.c (vect_create_epilog_for_reduction): Remove
12750 nested_in_vect_loop and use double_reduc everywhere. Remove dead
12751 assignment to "loop".
12753 2021-07-13 Richard Sandiford <richard.sandiford@arm.com>
12755 * internal-fn.c (vectorized_internal_fn_supported_p): Handle
12756 vector types first. For scalar types, consider both the preferred
12757 vector mode and the alternative vector modes.
12758 * optabs-query.c (can_vec_mask_load_store_p): Use the same
12759 structure as above, in particular using related_vector_mode
12760 for modes provided by autovectorize_vector_modes.
12762 2021-07-13 Jakub Jelinek <jakub@redhat.com>
12763 Richard Biener <rguenther@suse.de>
12765 PR tree-optimization/101419
12766 * tree-pass.h (PROP_objsz): Define.
12767 (make_pass_early_object_sizes): Declare.
12768 * passes.def (pass_all_early_optimizations): Rename pass_object_sizes
12769 there to pass_early_object_sizes, drop parameter.
12770 (pass_all_optimizations): Move pass_object_sizes right after pass_ccp,
12771 drop parameter, move pass_post_ipa_warn right after that.
12772 * tree-object-size.c (pass_object_sizes::execute): Rename to...
12773 (object_sizes_execute): ... this. Add insert_min_max_p argument.
12774 (pass_data_object_sizes): Move after object_sizes_execute.
12775 (pass_object_sizes): Likewise. In execute method call
12776 object_sizes_execute, drop set_pass_param method and insert_min_max_p
12777 non-static data member and its initializer in the ctor.
12778 (pass_data_early_object_sizes, pass_early_object_sizes,
12779 make_pass_early_object_sizes): New.
12780 * tree-ssa-sccvn.c (copy_reference_ops_from_ref): Use
12781 (cfun->curr_properties & PROP_objsz) instead of cfun->after_inlining.
12783 2021-07-13 Kito Cheng <kito.cheng@sifive.com>
12786 * config/riscv/constraints.md ("S"): Update description and remove
12788 * doc/md.texi (Machine Constraints): Document the 'S' constraints
12791 2021-07-13 Richard Biener <rguenther@suse.de>
12794 2021-07-12 Richard Biener <rguenther@suse.de>
12796 * tree-vect-slp.c (vect_slp_region): Show the number of
12797 SLP graph entries in the optimization message.
12799 2021-07-13 Michael Meissner <meissner@linux.ibm.com>
12801 * config/rs6000/altivec.md (xxspltiw_v4sf): Change local variable
12803 * config/rs6000/rs6000-protos.h (rs6000_const_f32_to_i32): Change
12804 return type to long.
12805 * config/rs6000/rs6000.c (rs6000_const_f32_to_i32): Change return
12808 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
12810 * gimple-range-fold.cc (fold_using_range::range_of_builtin_ubsan_call):
12811 Query relation between the 2 operands and use it.
12813 2021-07-12 Sergei Trofimovich <siarheit@google.com>
12815 * doc/cfg.texi: Fix s/ei_safe_safe/ei_safe_edge/ typo.
12817 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
12820 * config/i386/predicates.md (vec_setm_sse41_operand):
12821 Rename from vec_setm_operand.
12822 (vec_setm_avx2_operand): New predicate.
12823 * config/i386/sse.md (vec_set<V_128:mode>): Use V_128 mode iterator.
12824 Use vec_setm_sse41_operand as operand 2 predicate.
12825 (vec_set<V_256_512:mode): New expander.
12826 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_sse41_operand
12827 as operand 2 predicate.
12829 2021-07-12 Andrew MacLeod <amacleod@redhat.com>
12831 PR tree-optimization/101335
12832 * range-op.cc (operator_cast::lhs_op1_relation): Delete.
12834 2021-07-12 Andrew Pinski <apinski@marvell.com>
12836 * tree-ssa-phiopt.c (match_simplify_replacement): Move
12837 insert of the sequence before the movement of the
12838 statement. Check if to see if the statement is used
12839 outside of the original phi to see if we should move it.
12841 2021-07-12 Richard Biener <rguenther@suse.de>
12843 * dump-context.h (debug_dump_context::debug_dump_context):
12844 Add FILE * parameter defaulted to stderr.
12845 * dumpfile.c (debug_dump_context::debug_dump_context): Adjust.
12846 * tree-vect-slp.c (dot_slp_tree): New functions.
12848 2021-07-12 Richard Biener <rguenther@suse.de>
12850 PR tree-optimization/101373
12851 * tree-ssa-pre.c (prune_clobbered_mems): Also prune trapping
12852 references when the BB may not return.
12853 (compute_avail): Pass in the function we're working on and
12854 replace cfun references with it. Externally throwing
12855 const calls also possibly terminate the function.
12856 (pass_pre::execute): Pass down the function we're working on.
12857 * gcse.c (compute_hash_table_work): Externally throwing
12858 const/pure calls also need record_last_mem_set_info.
12859 * postreload-gcse.c (record_opr_changes): Looping or externally
12860 throwing const/pure calls also need record_last_mem_set_info.
12862 2021-07-12 Uroš Bizjak <ubizjak@gmail.com>
12864 * recog.c (memory_address_addr_space_p): Change the type to bool.
12865 Return true/false instead of 1/0.
12866 (offsettable_memref_p): Ditto.
12867 (offsettable_nonstrict_memref_p): Ditto.
12868 (offsettable_address_addr_space_p): Ditto.
12869 Change the type of addressp indirect function to bool.
12870 * recog.h (memory_address_addr_space_p): Change the type to bool.
12871 (strict_memory_address_addr_space_p): Ditto.
12872 (offsettable_memref_p): Ditto.
12873 (offsettable_nonstrict_memref_p): Ditto.
12874 (offsettable_address_addr_space_p): Ditto.
12875 * reload.c (maybe_memory_address_addr_space_p): Ditto.
12876 (strict_memory_address_addr_space_p): Change the type to bool.
12877 Return true/false instead of 1/0.
12878 (maybe_memory_address_addr_space_p): Change the type to bool.
12880 2021-07-12 Richard Biener <rguenther@suse.de>
12882 * tree-vect-slp.c (vect_slp_region): Show the number of
12883 SLP graph entries in the optimization message.
12885 2021-07-12 Richard Biener <rguenther@suse.de>
12887 PR tree-optimization/101394
12888 * tree-ssa-pre.c (do_pre_regular_insertion): Avoid inserting
12889 copies from abnormals for a full redundancy.
12891 2021-07-12 Richard Biener <rguenther@suse.de>
12893 PR middle-end/101423
12894 * gimple.c (gimple_could_trap_p_1): Internal function calls
12896 * tree-eh.c (tree_could_trap_p): Likewise.
12898 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
12901 * config/arm/arm_neon.h (vmul_n_u32): Replace call to builtin with
12903 (vmulq_n_u32): Likewise.
12904 (vmul_n_f32): Gate __a * __b on __FAST_MATH__.
12905 (vmulq_n_f32): Likewise.
12906 (vmul_n_f16): Likewise.
12907 (vmulq_n_f16): Likewise.
12909 2021-07-12 Martin Liska <mliska@suse.cz>
12911 PR sanitizer/101425
12912 * gcc.c (check_offload_target_name): Call
12913 candidates_list_and_hint only if we have a candidate.
12915 2021-07-12 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
12918 * config/arm/neon.md (vec_init): Move to ...
12919 * config/arm/vec-common.md (vec_init): ... here.
12920 Change the pattern's mode to VDQX and gate it on VALID_MVE_MODE.
12922 2021-07-12 Roger Sayle <roger@nextmovesoftware.com>
12924 PR tree-optimization/101403
12925 * match.pd ((T)bswap(X)>>C): Correctly handle cases where
12926 signedness of the shift is not the same as the signedness of
12927 the type extension.
12929 2021-07-09 Roger Sayle <roger@nextmovesoftware.com>
12930 Uroš Bizjak <ubizjak@gmail.com>
12932 * config/i386/i386.md (*divmodsi4_const): Optimize SImode
12933 divmod of a constant numerator with new define_insn_and_split.
12935 2021-07-09 Iain Sandoe <iain@sandoe.co.uk>
12938 * config/i386/i386-expand.c (ix86_expand_call): If a call is
12939 to a non-local-binding, or local but to a public symbol, then
12940 assume that it might be indirected via the lazy symbol binder.
12941 Mark R10 and R10 as clobbered in that case.
12943 2021-07-09 Eric Botcazou <ebotcazou@adacore.com>
12946 * gcc.c (ASM_DEBUG_DWARF_OPTION): Set again to --gdwarf2 in
12947 the case where HAVE_AS_WORKING_DWARF_N_FLAG is not defined
12948 and HAVE_LD_BROKEN_PE_DWARF5 is defined.
12950 2021-07-09 Uroš Bizjak <ubizjak@gmail.com>
12952 * config/i386/i386.md (*udivmodsi4_pow2_zext_1): Limit the
12953 log2 range of operands[3] to [1,31].
12954 (*udivmodsi4_pow2_zext_2): Ditto. Correct insn RTX pattern.
12956 2021-07-09 Sergei Trofimovich <siarheit@google.com>
12958 * doc/md.texi: Don't split @smallexample in multiple @groups.
12960 2021-07-09 Sergei Trofimovich <siarheit@google.com>
12962 * doc/md.texi: Add missing 'see' word.
12964 2021-07-09 Andrew Pinski <apinski@marvell.com>
12966 * tree-ssa-phiopt.c (phiopt_early_allow): Change arguments
12967 to take sequence and gimple_match_op. Accept the case where
12968 op is a SSA_NAME and one statement in the sequence.
12969 Also allow constants.
12970 (gimple_simplify_phiopt): Always pass a sequence to resimplify.
12971 Update call to phiopt_early_allow. Discard the sequence if not
12974 2021-07-09 Xi Ruoyao <xry111@mengyan1223.wang>
12979 * config/mips/mips.c (mips_const_insns): Use MSA_SUPPORTED_MODE_P
12980 instead of ISA_HAS_MSA.
12981 (mips_expand_vec_unpack): Likewise.
12982 (mips_expand_vector_init): Likewise.
12984 2021-07-09 Kewen Lin <linkw@linux.ibm.com>
12986 * config/rs6000/vsx.md (mods_<mode>): Rename to...
12987 (mod<mode>3): ... this.
12988 (modu_<mode>): Rename to...
12989 (umod<mode>3): ... this.
12990 * config/rs6000/rs6000-builtin.def (MODS_V2DI, MODS_V4SI, MODU_V2DI,
12991 MODU_V4SI): Adjust.
12993 2021-07-08 Jeff Law <jeffreyalaw@gmail.com>
12995 * config/h8300/shiftrotate.md (variable shifts): Expose condition
12996 code handling for the test before the loop.
12998 2021-07-08 Martin Jambor <mjambor@suse.cz>
13001 * ipa-sra.c (class isra_call_summary): New member
13002 m_before_any_store, initialize it in the constructor.
13003 (isra_call_summary::dump): Dump the new field.
13004 (ipa_sra_call_summaries::duplicate): Copy it.
13005 (process_scan_results): Set it.
13006 (isra_write_edge_summary): Stream it.
13007 (isra_read_edge_summary): Likewise.
13008 (param_splitting_across_edge): Only override
13009 safe_to_import_accesses if m_before_any_store is set.
13011 2021-07-08 Martin Sebor <msebor@redhat.com>
13013 PR bootstrap/101374
13014 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref):
13015 Use Object Size Type 0 instead of 1.
13017 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
13019 * tree-vect-loop.c (vectorizable_reduction): Remove always-true
13022 2021-07-08 Richard Sandiford <richard.sandiford@arm.com>
13024 * match.pd: Simplify an extend-operate-truncate sequence involving
13027 2021-07-08 Roger Sayle <roger@nextmovesoftware.com>
13028 Richard Biener <rguenther@suse.de>
13030 PR tree-optimization/40210
13031 * match.pd (bswap optimizations): Simplify (bswap(x)>>C1)&C2 as
13032 (x>>C3)&C2 when possible. Simplify bswap(x)>>C1 as ((T)x)>>C2
13033 when possible. Simplify bswap(x)&C1 as (x>>C2)&C1 when 0<=C1<=255.
13035 2021-07-08 Uroš Bizjak <ubizjak@gmail.com>
13038 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
13040 * config/i386/mmx.md (V_32): New mode iterator.
13041 (mov<V_32:mode>): Use V_32 mode iterator.
13042 (*mov<V_32:mode>_internal): Ditto.
13043 (*push<V_32:mode>2_rex64): Ditto.
13044 (*push<V_32:mode>2): Ditto.
13045 (movmisalign<V_32:mode>): Ditto.
13046 (mmx_<any_shiftrt:insn>v1si3): New insn pattern.
13047 (sse4_1_<any_extend:code>v2qiv2hi2): Ditto.
13048 (vec_unpacks_lo_v4qi): New expander.
13049 (vec_unpacks_hi_v4qi): Ditto.
13050 (vec_unpacku_lo_v4qi): Ditto.
13051 (vec_unpacku_hi_v4qi): Ditto.
13052 * config/i386/i386.h (VALID_SSE2_REG_MODE): Add V1SImode.
13053 (VALID_INT_MODE_P): Ditto.
13055 2021-07-08 Michael Meissner <meissner@linux.ibm.com>
13058 * config/rs6000/rs6000.md (udivti3): New insn.
13059 (divti3): New insn.
13060 (umodti3): New insn.
13061 (modti3): New insn.
13063 2021-07-07 Martin Sebor <msebor@redhat.com>
13065 PR tree-optimization/100137
13066 PR tree-optimization/99121
13067 PR tree-optimization/97027
13068 * builtins.c (access_ref::access_ref): Also set offmax.
13069 (access_ref::offset_in_range): Define new function.
13070 (access_ref::add_offset): Set offmax.
13071 (access_ref::inform_access): Handle access_none.
13072 (handle_mem_ref): Clear ostype.
13073 (compute_objsize_r): Handle ASSERT_EXPR.
13074 * builtins.h (struct access_ref): Add offmax member.
13075 * gimple-array-bounds.cc (array_bounds_checker::check_mem_ref): Use
13076 compute_objsize() and simplify.
13078 2021-07-07 Peter Bergner <bergner@linux.ibm.com>
13080 * config/rs6000/rs6000-call.c (mma_init_builtins): Use VSX_BUILTIN_LXVP
13081 and VSX_BUILTIN_STXVP.
13083 2021-07-07 Martin Sebor <msebor@redhat.com>
13086 * config/aarch64/aarch64.c (aarch64_simd_lane_bounds): Remove
13087 a stray %K from error_at() missed in r12-2088.
13089 2021-07-07 Richard Biener <rguenther@suse.de>
13091 PR tree-optimization/99728
13092 * tree-ssa-loop-im.c (gather_mem_refs_stmt): Record
13094 (mem_refs_may_alias_p): Add assert we handled aggregate
13096 (sm_seq_valid_bb): Give up when running into aggregate copies.
13097 (ref_indep_loop_p): Handle aggregate copies as never
13098 being invariant themselves but allow other refs to be
13099 disambiguated against them.
13100 (can_sm_ref_p): Do not try to apply store-motion to aggregate
13103 2021-07-06 Indu Bhagat <indu.bhagat@oracle.com>
13106 * dwarf2ctf.c (ctf_get_AT_data_member_location): Multiply by 8 to get
13109 2021-07-06 Martin Sebor <msebor@redhat.com>
13111 * gimple-pretty-print.c (percent_G_format): Remove.
13112 * tree-diagnostic.c (default_tree_printer): Remove calls.
13113 * tree-pretty-print.c (percent_K_format): Remove.
13114 * tree-pretty-print.h (percent_K_format): Remove.
13116 2021-07-06 Martin Sebor <msebor@redhat.com>
13118 * config/aarch64/aarch64-builtins.c (aarch64_simd_expand_builtin):
13119 Remove %K and use error_at.
13120 (aarch64_expand_fcmla_builtin): Same.
13121 (aarch64_expand_builtin_tme): Same.
13122 (aarch64_expand_builtin_memtag): Same.
13123 * config/arm/arm-builtins.c (arm_expand_acle_builtin): Same.
13124 (arm_expand_builtin): Same.
13125 * config/arm/arm.c (bounds_check): Same.
13127 2021-07-06 Martin Sebor <msebor@redhat.com>
13129 * builtins.c (warn_string_no_nul): Remove %G.
13130 (maybe_warn_for_bound): Same.
13131 (warn_for_access): Same.
13132 (check_access): Same.
13133 (check_strncat_sizes): Same.
13134 (expand_builtin_strncat): Same.
13135 (expand_builtin_strncmp): Same.
13136 (expand_builtin): Same.
13137 (expand_builtin_object_size): Same.
13138 (warn_dealloc_offset): Same.
13139 (maybe_emit_free_warning): Same.
13140 * calls.c (maybe_warn_alloc_args_overflow): Same.
13141 (maybe_warn_nonstring_arg): Same.
13142 (maybe_warn_rdwr_sizes): Same.
13143 * expr.c (expand_expr_real_1): Remove %K.
13144 * gimple-fold.c (gimple_fold_builtin_strncpy): Remove %G.
13145 (gimple_fold_builtin_strncat): Same.
13146 * gimple-ssa-sprintf.c (format_directive): Same.
13147 (handle_printf_call): Same.
13148 * gimple-ssa-warn-alloca.c (pass_walloca::execute): Same.
13149 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
13150 (maybe_diag_access_bounds): Same. Call gimple_location.
13151 (check_bounds_or_overlap): Same.
13152 * trans-mem.c (ipa_tm_scan_irr_block): Remove %K. Simplify.
13153 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Remove %G.
13154 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
13155 (maybe_diag_stxncpy_trunc): Same.
13156 (handle_builtin_stxncpy_strncat): Same.
13157 (maybe_warn_pointless_strcmp): Same.
13158 * tree-ssa-uninit.c (maybe_warn_operand): Same.
13160 2021-07-06 Uroš Bizjak <ubizjak@gmail.com>
13163 * config/i386/predicates.md (vec_setm_operand): Enable
13164 register_operand for TARGET_SSE4_1.
13165 * config/i386/mmx.md (vec_setv2hi): Use vec_setm_operand
13166 as operand 2 predicate. Call ix86_expand_vector_set_var
13167 for non-constant index operand.
13168 (vec_setv4qi): Use vec_setm_mmx_operand as operand 2 predicate.
13169 Call ix86_expand_vector_set_var for non-constant index operand.
13171 2021-07-06 Jeff Law <jeffreyalaw@gmail.com>
13173 * config/h8300/jumpcall.md (*branch): When possible, generate
13174 the comparison in CCZN mode.
13175 * config/h8300/predicates.md (simple_memory_operand): Reject all
13176 auto-increment addressing modes.
13178 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
13180 PR bootstrap/100246
13181 * config/i386/i386.h (struct stringop_algs): Define a CTOR for
13184 2021-07-06 Richard Biener <rguenther@suse.de>
13186 * doc/md.texi (vec_fmaddsub<mode>4): Document.
13187 (vec_fmsubadd<mode>4): Likewise.
13188 * optabs.def (vec_fmaddsub$a4): Add.
13189 (vec_fmsubadd$a4): Likewise.
13190 * internal-fn.def (IFN_VEC_FMADDSUB): Add.
13191 (IFN_VEC_FMSUBADD): Likewise.
13192 * tree-vect-slp-patterns.c (addsub_pattern::recognize):
13193 Refactor to handle IFN_VEC_FMADDSUB and IFN_VEC_FMSUBADD.
13194 (addsub_pattern::build): Likewise.
13195 * tree-vect-slp.c (vect_optimize_slp): CFN_VEC_FMADDSUB
13196 and CFN_VEC_FMSUBADD are not transparent for permutes.
13197 * config/i386/sse.md (vec_fmaddsub<mode>4): New expander.
13198 (vec_fmsubadd<mode>4): Likewise.
13200 2021-07-06 Richard Biener <rguenther@suse.de>
13202 * doc/invoke.texi (fmove-loop-stores): Document.
13203 * common.opt (fmove-loop-stores): New option.
13204 * opts.c (default_options_table): Enable -fmove-loop-stores
13205 at -O1 but not -Og.
13206 * tree-ssa-loop-im.c (pass_lim::execute): Pass
13207 flag_move_loop_stores instead of true to
13208 loop_invariant_motion_in_fun.
13210 2021-07-06 Iain Sandoe <iain@sandoe.co.uk>
13212 * doc/install.texi: Document --with-dsymutil.
13214 2021-07-06 Andrew Pinski <apinski@marvell.com>
13216 PR tree-optimization/101256
13217 * dbgcnt.def (phiopt_edge_range): New counter.
13218 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
13219 Check to make sure the new name is defined in the same
13220 bb as the conditional before duplicating range info.
13221 Also add debug counter.
13223 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
13225 PR rtl-optimization/100328
13226 * config/i386/i386-options.c (ix86_option_override_internal):
13227 Set param_ira_consider_dup_in_all_alts to 0.
13229 2021-07-06 Kewen Lin <linkw@linux.ibm.com>
13231 PR rtl-optimization/100328
13232 * doc/invoke.texi (ira-consider-dup-in-all-alts): Document new
13234 * ira.c (ira_get_dup_out_num): Adjust as parameter
13235 param_ira_consider_dup_in_all_alts.
13236 * params.opt (ira-consider-dup-in-all-alts): New.
13237 * ira-conflicts.c (process_regs_for_copy): Add one parameter
13238 single_input_op_has_cstr_p.
13239 (get_freq_for_shuffle_copy): New function.
13240 (add_insn_allocno_copies): Adjust as single_input_op_has_cstr_p.
13241 * ira-int.h (ira_get_dup_out_num): Add one bool parameter.
13243 2021-07-05 Jeff Law <jeffreyalaw@gmail.com>
13245 * config/h8300/shiftrotate.md (shift-by-variable patterns): Update to
13246 generate condition code aware RTL directly.
13248 2021-07-05 Andrew Pinski <apinski@marvell.com>
13250 PR tree-optimization/101039
13251 * match.pd (A CMP 0 ? A : -A): New patterns.
13252 * tree-ssa-phiopt.c (abs_replacement): Delete function.
13253 (tree_ssa_phiopt_worker): Don't call abs_replacement.
13254 Update comment about abs_replacement.
13256 2021-07-05 Andrew Pinski <apinski@marvell.com>
13258 * tree-ssa-phiopt.c (gimple_simplify_phiopt):
13259 If "A ? B : C" fails to simplify, try "(!A) ? C : B".
13261 2021-07-05 Andrew Pinski <apinski@marvell.com>
13263 * tree-ssa-phiopt.c (match_simplify_replacement):
13264 Add early_p argument. Call gimple_simplify_phiopt
13265 instead of gimple_simplify.
13266 (tree_ssa_phiopt_worker): Update call to
13267 match_simplify_replacement and allow unconditionally.
13268 (phiopt_early_allow): New function.
13269 (gimple_simplify_phiopt): New function.
13271 2021-07-05 Andrew Pinski <apinski@marvell.com>
13273 PR middle-end/101237
13274 * fold-const.c (negate_expr_p): Remove call to element_mode
13275 and TREE_MODE/TREE_TYPE when calling HONOR_SIGNED_ZEROS,
13276 HONOR_SIGN_DEPENDENT_ROUNDING, and HONOR_SNANS.
13277 (fold_negate_expr_1): Likewise.
13278 (const_unop): Likewise.
13279 (fold_cond_expr_with_comparison): Likewise.
13280 (fold_binary_loc): Likewise.
13281 (fold_ternary_loc): Likewise.
13282 (tree_call_nonnegative_warnv_p): Likewise.
13283 * match.pd (-(A + B) -> (-B) - A): Likewise.
13285 2021-07-05 Iain Sandoe <iain@sandoe.co.uk>
13287 * configure.ac: Handle --with-dsymutil in the same way as we
13288 do for the assembler and linker. (DEFAULT_DSYMUTIL): New.
13289 Extract the type and version for the dsymutil configured or
13290 found by the default searches.
13291 * config.in: Regenerated.
13292 * configure: Regenerated.
13293 * collect2.c (do_dsymutil): Handle locating dsymutil in the
13294 same way as for the assembler and linker.
13295 * config/darwin.h (DSYMUTIL): Delete.
13296 * gcc.c: Report a configured dsymutil correctly.
13297 * exec-tool.in: Allow for dsymutil.
13299 2021-07-05 Uroš Bizjak <ubizjak@gmail.com>
13301 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
13302 Handle V4QI and V2HI modes.
13303 (expand_vec_perm_blend): Allow 4-byte vector modes with TARGET_SSE4_1.
13304 Handle V4QI mode. Emit mmx_pblendvb32 for 4-byte modes.
13305 (expand_vec_perm_pshufb): Rewrite to use switch statemets.
13306 Handle 4-byte dual operands with TARGET_XOP and single operands
13307 with TARGET_SSSE3. Emit mmx_ppermv32 for TARGET_XOP and
13308 mmx_pshufbv4qi3 for TARGET_SSSE3.
13309 (expand_vec_perm_pblendv): Allow 4-byte vector modes with TARGET_SSE4_1.
13310 (expand_vec_perm_interleave2): Allow 4-byte vector modes.
13311 (expand_vec_perm_pshufb2): Allow 4-byte vector modes with TARGET_SSSE3.
13312 (expand_vec_perm_even_odd_1): Handle V4QI mode.
13313 (expand_vec_perm_broadcast_1): Handle V4QI mode.
13314 (ix86_vectorize_vec_perm_const): Handle V4QI mode.
13315 * config/i386/mmx.md (mmx_ppermv32): New insn pattern.
13316 (mmx_pshufbv4qi3): Ditto.
13317 (*mmx_pblendw32): Ditto.
13318 (*mmx_pblendw64): Rename from *mmx_pblendw.
13319 (mmx_punpckhbw_low): New insn_and_split pattern.
13320 (mmx_punpcklbw_low): Ditto.
13322 2021-07-05 Richard Biener <rguenther@suse.de>
13324 * tree-vect-loop-manip.c (vect_loop_versioning): Do not
13325 set LOOP_C_INFINITE on the vectorized loop.
13327 2021-07-05 Richard Biener <rguenther@suse.de>
13329 PR middle-end/101291
13330 * cfgloopmanip.c (loop_version): Set the loop copy of the
13331 versioned loop to the new loop.
13333 2021-07-04 Iain Sandoe <iain@sandoe.co.uk>
13336 * config.gcc: Ensure that Darwin biarch definitions are
13337 added before i386.h.
13338 * config/i386/darwin.h (TARGET_64BIT): Remove.
13339 (PR80556_WORKAROUND): New.
13340 (REAL_LIBGCC_SPEC): Amend to use PR80556_WORKAROUND.
13341 (DARWIN_SUBARCH_SPEC): New.
13342 * config/i386/darwin32-biarch.h (TARGET_64BIT_DEFAULT,
13343 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
13344 (REAL_LIBGCC_SPEC): Remove.
13345 * config/i386/darwin64-biarch.h (TARGET_64BIT_DEFAULT,
13346 TARGET_BI_ARCH, PR80556_WORKAROUND): New.
13347 (REAL_LIBGCC_SPEC): Remove.
13349 2021-07-03 H.J. Lu <hjl.tools@gmail.com>
13351 PR middle-end/101294
13352 * expr.c (store_constructor): Don't use vec_duplicate on vector.
13354 2021-07-02 Martin Sebor <msebor@redhat.com>
13356 PR middle-end/98871
13357 PR middle-end/98512
13358 * diagnostic.c (get_any_inlining_info): New.
13359 (update_effective_level_from_pragmas): Handle inlining context.
13360 (diagnostic_enabled): Same.
13361 (diagnostic_report_diagnostic): Same.
13362 * diagnostic.h (struct diagnostic_info): Add ctor.
13363 (struct diagnostic_context): Add new member.
13364 * tree-diagnostic.c (set_inlining_locations): New.
13365 (tree_diagnostics_defaults): Set new callback pointer.
13367 2021-07-02 Peter Bergner <bergner@linux.ibm.com>
13369 * config/rs6000/rs6000-builtin.def (BU_MMA_PAIR_LD, BU_MMA_PAIR_ST):
13371 (__builtin_vsx_lxvp, __builtin_vsx_stxvp): New built-ins.
13372 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Expand
13373 lxvp and stxvp built-ins.
13374 (mma_init_builtins): Handle lxvp and stxvp built-ins.
13375 (builtin_function_type): Likewise.
13376 * doc/extend.texi (__builtin_vsx_lxvp, __builtin_mma_stxvp): Document.
13378 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13380 * config/h8300/h8300-protos.h (compute_a_shift_cc): Accept
13381 additional argument for the code.
13382 * config/h8300/h8300.c (compute_a_shift_cc): Accept additional
13383 argument for the code. Just return if the ZN bits are useful or
13384 not rather than the old style CC_* enums.
13385 * config/h8300/shiftrotate.md (shiftqi_noscratch): Move before
13386 more generic shiftqi patterns.
13387 (shifthi_noscratch, shiftsi_noscratch): Similarly.
13388 (shiftqi_noscratch_set_flags): New pattern.
13389 (shifthi_noscratch_set_flags, shiftsi_noscratch_set_flags): Likewise.
13391 2021-07-02 Andrew MacLeod <amacleod@redhat.com>
13393 PR tree-optimization/101223
13394 * range-op.cc (build_lt): Add -1 for signed values.
13395 (built_gt): Subtract -1 for signed values.
13397 2021-07-02 David Faust <david.faust@oracle.com>
13399 * btfout.c (get_btf_kind): Support BTF_KIND_FLOAT.
13400 (btf_asm_type): Likewise.
13402 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13404 * config/h8300/h8300-protos.h (output_a_shift): Make first argument
13405 an array of rtx rather than a pointer to rtx. Add code argument.
13406 (compute_a_shift_length): Similarly.
13407 * config/h8300/h8300.c (h8300_shift_costs): Adjust now that the
13408 shift itself isn't an operand. Create dummy operand[0] to carry
13409 a mode and pass a suitable rtx code to compute_a_shift_length.
13410 (get_shift_alg): Adjust operand number of clobber in output templates.
13411 (output_a_shift): Make first argument an array of rtx rather than
13412 a pointer to rtx. Add code argument for the type of shift.
13413 Adjust now that the shift itself is no longer an operand.
13414 (compute_a_shift_length): Similarly.
13415 * config/h8300/shiftrotate.md (shiftqi, shifthi, shiftsi): Use an
13416 iterator rather than nshift_operator.
13417 (shiftqi_noscratch, shifthi_noscratch, shiftsi_noscratch): Likewise.
13418 (shiftqi_clobber_flags): Adjust to API changes in output_a_shift
13419 and compute_a_shift_length.
13420 (shiftqi_noscratch_clobber_flags): Likewise.
13421 (shifthi_noscratch_clobber_flags): Likewise.
13422 (shiftsi_noscratch_clobber_flags): Likewise.
13424 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
13427 * config/darwin.h (DSYMUTIL_SPEC): Do not try to run
13428 dsymutil for BTF/CTF.
13430 2021-07-02 Iain Sandoe <iain@sandoe.co.uk>
13433 * config/darwin.h (CTF_INFO_SECTION_NAME): Update the
13434 segment to include BTF.
13435 (BTF_INFO_SECTION_NAME): New.
13437 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13439 * config/m32r/m32r-protos.h (call_operand): Adjust return type.
13440 (small_data_operand, memreg_operand, small_insn_p): Likewise.
13441 * config/m32r/m32r.c (call_operand): Adjust return type.
13442 (small_data_operand, memreg_operand): Likewise.
13444 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13446 * config/frv/frv-protos.h (integer_register_operand): Adjust return
13448 (frv_load_operand, gpr_or_fpr_operand, gpr_no_subreg_operand): Likewise.
13449 (fpr_or_int6_operand, gpr_or_int_operand); Likewise.
13450 (gpr_or_int12_operand, gpr_or_int10_operand); Likewise.
13451 (move_source_operand, move_destination_operand): Likewise.
13452 (condexec_source_operand, condexec_dest_operand): Likewise.
13453 (lr_operand, gpr_or_memory_operand, fpr_or_memory_operand): Likewise.
13454 (reg_or_0_operand, fcc_operand, icc_operand, cc_operand): Likewise.
13455 (fcr_operand, icr_operand, cr_operand, call_operand): Likewise.
13456 (fpr_operand, even_reg_operand, odd_reg_operand): Likewise.
13457 (even_gpr_operand, odd_gpr_operand, quad_fpr_operand): Likewise.
13458 (even_fpr_operand, odd_fpr_operand): Likewise.
13459 (dbl_memory_one_insn_operand, dbl_memory_two_insn_operand): Likewise.
13460 (int12_operand, int6_operand, int5_operand, uint5_operand): Likewise.
13461 (uint4_operand, uint1_operand, int_2word_operand): Likewise
13462 (upper_int16_operand, uint16_operand, symbolic_operand): Likewise.
13463 (relational_operator, float_relational_operator): Likewise.
13464 (ccr_eqne_operator, minmax_operator): Likewise.
13465 (condexec_si_binary_operator, condexec_si_media_operator): Likewise.
13466 (condexec_si_divide_operator, condexec_si_unary_operator): Likewise.
13467 (condexec_sf_conv_operator, condexec_sf_add_operator): Likewise.
13468 (intop_compare_operator, acc_operand, even_acc_operand): Likewise.
13469 (quad_acc_operand, accg_operand): Likewise.
13471 2021-07-02 Jeff Law <jeffreyalaw@gmail.com>
13473 * config/stormy16/stormy16-protos.h (xstormy16_below_100_symbol): Change
13474 return type to a bool.
13475 (nonimmediate_nonstack_operand): Likewise.
13476 (xstormy16_splittable_below100_operand): Likewise.
13477 * config/stormy16/stormy16.c (xstormy16_below_100_symbol): Fix
13479 (xstormy16_splittable_below100_operand): Likewise.
13481 2021-07-02 Richard Biener <rguenther@suse.de>
13483 PR tree-optimization/101293
13484 * tree-ssa-loop-im.c (mem_ref_hasher::equal): Compare MEM_REF bases
13485 with combined offsets.
13486 (gather_mem_refs_stmt): Hash MEM_REFs as if their offset were
13487 combined with the rest of the offset.
13489 2021-07-02 Eric Botcazou <ebotcazou@adacore.com>
13491 * config/i386/i386.c (asm_preferred_eh_data_format): Always use the
13492 PIC encodings for PE-COFF targets.
13494 2021-07-02 Jakub Jelinek <jakub@redhat.com>
13497 * config/i386/i386-expand.c (ix86_broadcast_from_integer_constant):
13498 Return nullptr for TImode inner mode.
13500 2021-07-02 Richard Biener <rguenther@suse.de>
13502 PR tree-optimization/101280
13503 PR tree-optimization/101173
13504 * gimple-loop-interchange.cc
13505 (tree_loop_interchange::valid_data_dependences): Properly
13506 guard all dependence checks with DDR_REVERSED_P or its
13509 2021-07-02 Hongyu Wang <hongyu.wang@intel.com>
13511 * config/i386/i386-expand.c (ix86_expand_builtin):
13512 Add branch to clear odata when ZF is set for asedecenc_expand
13513 and wideaesdecenc_expand.
13515 2021-07-02 Eugene Rozenfeld <erozen@microsoft.com>
13517 * config/i386/gcc-auto-profile: regenerate
13519 2021-07-02 liuhongt <hongtao.liu@intel.com>
13521 * config/i386/sse.md (trunc<mode><pmov_dst_4>2): Refined to ..
13522 (trunc<mode><pmov_dst_4_lower>2): this.
13524 2021-07-01 David Malcolm <dmalcolm@redhat.com>
13526 * diagnostic.h (diagnostic_context::m_file_cache): New field.
13527 * input.c (class fcache): Rename to...
13528 (class file_cache_slot): ...this, making most members private and
13529 prefixing fields with "m_".
13530 (file_cache_slot::get_file_path): New accessor.
13531 (file_cache_slot::get_use_count): New accessor.
13532 (file_cache_slot::missing_trailing_newline_p): New accessor.
13533 (file_cache_slot::inc_use_count): New.
13534 (fcache_buffer_size): Move to...
13535 (file_cache_slot::buffer_size): ...here.
13536 (fcache_line_record_size): Move to...
13537 (file_cache_slot::line_record_size): ...here.
13538 (fcache_tab): Delete, in favor of global_dc->m_file_cache.
13539 (fcache_tab_size): Move to file_cache::num_file_slots.
13540 (diagnostic_file_cache_init): Update for move of fcache_tab
13541 to global_dc->m_file_cache.
13542 (diagnostic_file_cache_fini): Likewise.
13543 (lookup_file_in_cache_tab): Convert to...
13544 (file_cache::lookup_file): ...this.
13545 (diagnostics_file_cache_forcibly_evict_file): Update for move of
13546 fcache_tab to global_dc->m_file_cache, moving most of
13547 implementation to...
13548 (file_cache::forcibly_evict_file): ...this new function and...
13549 (file_cache_slot::evict): ...this new function.
13550 (evicted_cache_tab_entry): Convert to...
13551 (file_cache::evicted_cache_tab_entry): ...this.
13552 (add_file_to_cache_tab): Convert to...
13553 (file_cache::add_file): ...this, moving bulk of implementation
13555 (file_cache_slot::create): ..this new function.
13556 (file_cache::file_cache): New.
13557 (file_cache::~file_cache): New.
13558 (lookup_or_add_file_to_cache_tab): Convert to...
13559 (file_cache::lookup_or_add_file): ..this new function.
13560 (fcache::fcache): Rename to...
13561 (file_cache_slot::file_cache_slot): ...this, adding "m_" prefixes
13563 (fcache::~fcache): Rename to...
13564 (file_cache_slot::~file_cache_slot): ...this, adding "m_" prefixes
13566 (needs_read): Convert to...
13567 (file_cache_slot::needs_read_p): ...this.
13568 (needs_grow): Convert to...
13569 (file_cache_slot::needs_grow_p): ...this.
13570 (maybe_grow): Convert to...
13571 (file_cache_slot::maybe_grow): ...this.
13572 (read_data): Convert to...
13573 (file_cache_slot::read_data): ...this.
13574 (maybe_read_data): Convert to...
13575 (file_cache_slot::maybe_read_data): ...this.
13576 (get_next_line): Convert to...
13577 (file_cache_slot::get_next_line): ...this.
13578 (goto_next_line): Convert to...
13579 (file_cache_slot::goto_next_line): ...this.
13580 (read_line_num): Convert to...
13581 (file_cache_slot::read_line_num): ...this.
13582 (location_get_source_line): Update for moving of globals to
13583 global_dc->m_file_cache.
13584 (location_missing_trailing_newline): Likewise.
13585 * input.h (class file_cache_slot): New forward decl.
13586 (class file_cache): New.
13588 2021-07-01 Michael Meissner <meissner@linux.ibm.com>
13590 * config/rs6000/rs6000.c (rs6000_maybe_emit_fp_cmove): Add IEEE
13591 128-bit floating point conditional move support.
13592 (have_compare_and_set_mask): Add IEEE 128-bit floating point
13594 * config/rs6000/rs6000.md (mov<mode>cc, IEEE128 iterator): New insn.
13595 (mov<mode>cc_p10, IEEE128 iterator): New insn.
13596 (mov<mode>cc_invert_p10, IEEE128 iterator): New insn.
13597 (fpmask<mode>, IEEE128 iterator): New insn.
13598 (xxsel<mode>, IEEE128 iterator): New insn.
13600 2021-07-01 Iain Sandoe <iain@sandoe.co.uk>
13603 * config/darwin.h (CTF_INFO_SECTION_NAME): New.
13605 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
13607 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
13609 * config/i386/i386-protos.h (ix86_expand_vector_init_duplicate):
13611 * config/i386/sse.md (INT_BROADCAST_MODE): New mode iterator.
13612 (vec_duplicate<mode>): New expander.
13614 2021-07-01 H.J. Lu <hjl.tools@gmail.com>
13617 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
13619 (ix86_byte_broadcast): New function.
13620 (ix86_convert_const_wide_int_to_broadcast): Likewise.
13621 (ix86_expand_move): Convert CONST_WIDE_INT to broadcast if mode
13622 size is 16 bytes or bigger.
13623 (ix86_broadcast_from_integer_constant): New function.
13624 (ix86_expand_vector_move): Convert CONST_WIDE_INT and CONST_VECTOR
13625 to broadcast if mode size is 16 bytes or bigger.
13626 * config/i386/i386-protos.h (ix86_gen_scratch_sse_rtx): New
13628 * config/i386/i386.c (ix86_gen_scratch_sse_rtx): New function.
13630 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
13632 * config/i386/predicates.md (ix86_endbr_immediate_operand):
13633 Return true/false instead of 1/0.
13634 (movq_parallel): Ditto.
13636 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
13638 * recog.c (general_operand): Return true/false instead of 1/0.
13639 (register_operand): Ditto.
13640 (immediate_operand): Ditto.
13641 (const_int_operand): Ditto.
13642 (const_scalar_int_operand): Ditto.
13643 (const_double_operand): Ditto.
13644 (push_operand): Ditto.
13645 (pop_operand): Ditto.
13646 (memory_operand): Ditto.
13647 (indirect_operand): Ditto.
13649 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
13651 * genpreds.c (write_predicate_subfunction):
13652 Change the type of written subfunction to bool.
13653 (write_one_predicate_function):
13654 Change the type of written function to bool.
13655 (write_tm_preds_h): Ditto.
13656 * recog.h (*insn_operand_predicate_fn): Change the type to bool.
13657 * recog.c (general_operand): Change the type to bool.
13658 (address_operand): Ditto.
13659 (register_operand): Ditto.
13660 (pmode_register_operand): Ditto.
13661 (scratch_operand): Ditto.
13662 (immediate_operand): Ditto.
13663 (const_int_operand): Ditto.
13664 (const_scalar_int_operand): Ditto.
13665 (const_double_operand): Ditto.
13666 (nonimmediate_operand): Ditto.
13667 (nonmemory_operand): Ditto.
13668 (push_operand): Ditto.
13669 (pop_operand): Ditto.
13670 (memory_operand): Ditto.
13671 (indirect_operand): Ditto.
13672 (ordered_comparison_operator): Ditto.
13673 (comparison_operator): Ditto.
13674 * config/i386/i386-expand.c (ix86_expand_sse_cmp):
13675 Change the type of indirect predicate function to bool.
13676 * config/rs6000/rs6000.c (easy_vector_constant):
13677 Change the type to bool.
13678 * config/mips/mips-protos.h (m16_based_address_p):
13679 Change the type of operand 3 to bool.
13681 2021-07-01 Richard Biener <rguenther@suse.de>
13683 PR tree-optimization/101280
13684 PR tree-optimization/101173
13685 * gimple-loop-interchange.cc
13686 (tree_loop_interchange::valid_data_dependences): Revert
13687 previous change and instead correctly handle DDR_REVERSED_P
13690 2021-07-01 Richard Biener <rguenther@suse.de>
13692 PR tree-optimization/101278
13693 * tree-ssa-dse.c (dse_classify_store): First check for
13694 uses, then ignore stmt for chaining purposes.
13696 2021-07-01 Richard Biener <rguenther@suse.de>
13698 PR tree-optimization/100778
13699 * tree-vect-slp.c (vect_schedule_slp_node): Do not place trapping
13700 vectorized ops ahead of their scalar BB.
13702 2021-07-01 Uroš Bizjak <ubizjak@gmail.com>
13705 * config/i386/i386.md (*nabs<dwi>2_doubleword):
13706 New insn_and_split pattern.
13707 (*nabs<dwi>2_1): Ditto.
13708 * config/i386/i386-features.c
13709 (general_scalar_chain::compute_convert_gain):
13710 Handle (NEG (ABS (...))) RTX. Rewrite src code
13711 scanner as switch statement.
13712 (general_scalar_chain::convert_insn):
13713 Handle (NEG (ABS (...))) RTX.
13714 (general_scalar_to_vector_candidate_p):
13715 Detect (NEG (ABS (...))) RTX. Reorder case statements
13716 for (AND (NOT (...) ...)) fallthrough.
13718 2021-07-01 Richard Biener <rguenther@suse.de>
13720 PR tree-optimization/101178
13721 * tree-vect-slp.c (slpg_vertex::materialize): Remove.
13722 (slpg::perm_in): Add.
13723 (slpg::get_perm_in): Remove.
13724 (slpg::get_perm_materialized): Add.
13725 (vect_optimize_slp): Handle VEC_PERM nodes more optimally
13726 during permute propagation and materialization.
13728 2021-07-01 Jakub Jelinek <jakub@redhat.com>
13731 * dwarf2out.c (loc_list_from_tree_1): Handle COMPOUND_LITERAL_EXPR.
13733 2021-07-01 Jakub Jelinek <jakub@redhat.com>
13735 PR middle-end/94366
13736 * omp-low.c (lower_rec_input_clauses): Rename is_fp_and_or to
13737 is_truth_op, set it for TRUTH_*IF_EXPR regardless of new_var's type,
13738 use boolean_type_node instead of integer_type_node as NE_EXPR type.
13739 (lower_reduction_clauses): Likewise.
13741 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
13743 * config/gcn/gcn.c: Include dwarf2.h.
13744 (gcn_addr_space_debug): New function.
13745 (TARGET_ADDR_SPACE_DEBUG): New hook.
13747 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
13749 * common/config/gcn/gcn-common.c
13750 (gcn_option_optimization_table): Change OPT_fomit_frame_pointer to -O3.
13751 * config/gcn/gcn.c (gcn_expand_prologue): Prefer the frame pointer
13753 (gcn_expand_prologue): Prefer the frame pointer when emitting CFI.
13754 (gcn_frame_pointer_rqd): New function.
13755 (TARGET_FRAME_POINTER_REQUIRED): New hook.
13757 2021-06-30 Hafiz Abid Qadeer <abidh@codesourcery.com>
13759 * config/gcn/gcn.c (move_callee_saved_registers): Emit CFI notes for
13760 prologue register saves.
13761 (gcn_debug_unwind_info): Use UI_DWARF2.
13762 (gcn_dwarf_register_number): Map DWARF_LINK_REGISTER to DWARF PC.
13763 (gcn_dwarf_register_span): DWARF_LINK_REGISTER doesn't span.
13764 * config/gcn/gcn.h: (DWARF_FRAME_RETURN_COLUMN): New define.
13765 (DWARF_LINK_REGISTER): New define.
13766 (FIRST_PSEUDO_REGISTER): Increment.
13767 (FIXED_REGISTERS): Add entry for DWARF_LINK_REGISTER.
13768 (CALL_USED_REGISTERS): Likewise.
13769 (REGISTER_NAMES): Likewise.
13771 2021-06-30 Richard Biener <rguenther@suse.de>
13773 PR tree-optimization/101267
13774 * tree-vect-stmts.c (vect_check_scalar_mask): Adjust
13775 API and use SLP compatible interface of vect_is_simple_use.
13776 Reject not vectorized SLP defs for callers that do not support
13778 (vect_check_store_rhs): Handle masked stores and pass down
13779 the appropriate operator index.
13780 (vectorizable_call): Adjust.
13781 (vectorizable_store): Likewise.
13782 (vectorizable_load): Likewise. Handle SLP pecularity of
13784 (vect_is_simple_use): Remove special-casing of masked stores.
13786 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
13788 * common.opt (foffload): Remove help as Driver only.
13789 * gcc.c (display_help): Add -foffload.
13791 2021-06-30 Tobias Burnus <tobias@codesourcery.com>
13793 * gcc.c (close_at_file, execute): Replace alloca by XALLOCAVEC.
13794 (check_offload_target_name): Fix splitting OFFLOAD_TARGETS into
13795 a candidate list; better inform no offload target is configured
13796 and fix hint extraction when passed target is not '\0' at [len].
13797 * common.opt (foffload): Add tailing '.'.
13798 (foffload-options): Likewise; fix flag name in the help string.
13800 2021-06-30 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
13803 * config/arm/arm_neon.h: Move vabs intrinsics before vcage_f32.
13804 (vcage_f32): Gate comparison on __FAST_MATH__.
13805 (vcageq_f32): Likewise.
13806 (vcale_f32): Likewise.
13807 (vcaleq_f32): Likewise.
13808 (vcagt_f32): Likewise.
13809 (vcagtq_f32): Likewise.
13810 (vcalt_f32): Likewise.
13811 (vcaltq_f32): Likewise.
13812 (vcage_f16): Likewise.
13813 (vcageq_f16): Likewise.
13814 (vcale_f16): Likewise.
13815 (vcaleq_f16): Likewise.
13816 (vcagt_f16): Likewise.
13817 (vcagtq_f16): Likewise.
13818 (vcalt_f16): Likewise.
13819 (vcaltq_f16): Likewise.
13821 2021-06-30 Richard Biener <rguenther@suse.de>
13823 PR tree-optimization/101264
13824 * tree-vect-slp.c (vect_optimize_slp): Propagate the
13825 computed perm_in to all "any" permute successors
13826 we cannot de-duplicate immediately.
13828 2021-06-30 liuhongt <hongtao.liu@intel.com>
13831 * config/i386/sse.md
13832 (avx512f_sfixupimm<mode><sd_maskz_name><round_saeonly_name>):
13834 (avx512f_sfixupimm<mode><maskz_scalar_name><round_saeonly_name>):
13836 (avx512f_sfixupimm<mode>_mask<round_saeonly_name>"): Refined.
13837 * config/i386/subst.md (maskz_scalar): New define_subst.
13838 (maskz_scalar_name): New subst_attr.
13839 (maskz_scalar_op5): Ditto.
13840 (round_saeonly_maskz_scalar_op5): Ditto.
13841 (round_saeonly_maskz_scalar_operand5): Ditto.
13843 2021-06-30 David Edelsohn <dje.gcc@gmail.com>
13845 * config/rs6000/rs6000.c (rs6000_xcoff_section_type_flags):
13846 Increase code CSECT alignment to at least 32 bytes.
13847 * config/rs6000/xcoff.h (TEXT_SECTION_ASM_OP): Add 32 byte
13848 alignment designation.
13850 2021-06-29 Sergei Trofimovich <siarheit@google.com>
13852 * doc/generic.texi: Fix s/net yet/not yet/ typo.
13854 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
13856 PR tree-optimization/101254
13857 * range-op.cc (operator_minus::op1_op2_relation_effect): Check for
13858 wrapping/non-wrapping when setting the result range.
13860 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
13862 * value-query.cc (gimple_range_global): Allow phis.
13864 2021-06-29 Andrew MacLeod <amacleod@redhat.com>
13866 * vr-values.c (vr_values::vrp_stmt_computes_nonzero): Use stmt.
13867 (simplify_using_ranges::op_with_boolean_value_range_p): Add a
13868 statement for location context.
13869 (check_for_binary_op_overflow): Ditto.
13870 (simplify_using_ranges::get_vr_for_comparison): Ditto.
13871 (simplify_using_ranges::compare_name_with_value): Ditto.
13872 (simplify_using_ranges::compare_names): Ditto.
13873 (vrp_evaluate_conditional_warnv_with_ops_using_ranges): Ditto.
13874 (simplify_using_ranges::simplify_truth_ops_using_ranges): Ditto.
13875 (simplify_using_ranges::simplify_min_or_max_using_ranges): Ditto.
13876 (simplify_using_ranges::simplify_internal_call_using_ranges): Ditto.
13877 (simplify_using_ranges::two_valued_val_range_p): Ditto.
13878 (simplify_using_ranges::simplify): Ditto.
13879 * vr-values.h: Adjust prototypes.
13881 2021-06-29 Uroš Bizjak <ubizjak@gmail.com>
13884 * config/i386/mmx.md (vec_addsubv2sf3): New insn pattern.
13886 2021-06-29 Julian Brown <julian@codesourcery.com>
13888 * config/gcn/gcn.c (gcn_init_libfuncs): New function.
13889 (TARGET_INIT_LIBFUNCS): Define target hook using above function.
13890 * config/gcn/gcn.h (UNITS_PER_WORD): Define to 8 for IN_LIBGCC2, 4
13892 (LIBGCC2_UNITS_PER_WORD, BITS_PER_WORD): Remove definitions.
13893 (MAX_FIXED_MODE_SIZE): Change to 128.
13895 2021-06-29 Julian Brown <julian@codesourcery.com>
13897 * config/gcn/gcn.md (UNSPEC_FLBIT_INT): New unspec constant.
13898 (s_mnemonic): Add clrsb.
13899 (gcn_flbit<mode>_int): Add insn pattern for SImode/DImode.
13900 (clrsb<mode>2): Add expander for SImode/DImode.
13902 2021-06-29 Julian Brown <julian@codesourcery.com>
13904 * config/gcn/gcn.md (<su>mulsidi3, <su>mulsidi3_reg, <su>mulsidi3_imm,
13905 muldi3): Add patterns.
13907 2021-06-29 Julian Brown <julian@codesourcery.com>
13909 * config/gcn/gcn.md (<su>mulsi3_highpart): Change to expander.
13910 (<su>mulsi3_highpart_reg, <su>mulsi3_highpart_imm): New patterns.
13912 2021-06-29 Julian Brown <julian@codesourcery.com>
13914 * config/gcn/gcn.md (mulsi3): Make s_mulk_i32 variant clobber SCC.
13916 2021-06-29 Joseph Myers <joseph@codesourcery.com>
13918 * btfout.c, ctfout.c: Include "memmodel.h".
13920 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
13922 * gcc.c (check_offload_target_name): Cast len argument to
13923 %q.*s to 'int'; avoid -Wstringop-truncation warning.
13925 2021-06-29 Richard Biener <rguenther@suse.de>
13927 * tree-vect-slp.c (vect_optimize_slp): Forward propagate
13928 to "any" permute nodes and relax "any" permute proapgation
13929 during iterative backward propagation.
13931 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
13934 * common.opt (-foffload=): Update description.
13935 (-foffload-options=): New.
13936 * doc/invoke.texi (C Language Options): Document
13937 -foffload and -foffload-options.
13938 * gcc.c (check_offload_target_name): New, split off from
13939 handle_foffload_option.
13940 (check_foffload_target_names): New.
13941 (handle_foffload_option): Handle -foffload=default.
13942 (driver_handle_option): Update for -foffload-options.
13943 * lto-opts.c (lto_write_options): Use -foffload-options
13944 instead of -foffload.
13945 * lto-wrapper.c (merge_and_complain, append_offload_options):
13947 * opts.c (common_handle_option): Likewise.
13949 2021-06-29 Tobias Burnus <tobias@codesourcery.com>
13951 * doc/invoke.texi (C Language Options): Sort options
13952 alphabetically in optlist and also the description itself.
13953 Remove leftover -fallow-single-precision from and add missing
13954 -fgnu-tm to the optlist.
13956 2021-06-29 Richard Biener <rguenther@suse.de>
13958 * tree-vect-slp.c (slpg_vertex::visited): Remove.
13959 (vect_slp_perms_eq): Handle -1 permutes.
13960 (vect_optimize_slp): Rewrite permute propagation.
13962 2021-06-29 Jakub Jelinek <jakub@redhat.com>
13965 * match.pd ((intptr_t)x eq/ne CST to x eq/ne (typeof x) CST): Don't
13966 perform the optimization in GENERIC when sanitizing and x has a
13969 2021-06-29 Richard Biener <rguenther@suse.de>
13971 PR tree-optimization/101242
13972 * tree-vect-slp.c (vect_slp_build_vertices): Force-add
13973 PHIs with not represented initial values as leafs.
13975 2021-06-29 Jan-Benedict Glaw <jbglaw@getslash.de>
13977 * config/pdp11/pdp11.h (ASM_OUTPUT_SKIP): Fix signedness warning.
13978 * config/pdp11/pdp11.c (pdp11_asm_print_operand_punct_valid_p): Remove
13979 "register" keyword.
13980 (pdp11_initial_elimination_offset) Remove unused variable.
13981 (pdp11_cmp_length) Ditto.
13982 (pdp11_insn_cost): Ditto, and fix signedness warning.
13984 2021-06-29 David Edelsohn <dje.gcc@gmail.com>
13986 * btfout.c: Include tm_p.h.
13989 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
13991 * config/bpf/bpf.c (bpf_expand_prologue): Do not mark insns as
13993 (bpf_expand_epilogue): Likewise.
13994 * config/bpf/bpf.h (DWARF2_FRAME_INFO): Define to 0.
13995 Do not define DBX_DEBUGGING_INFO.
13997 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
13999 * doc/invoke.texi: Document the CTF and BTF debug info options.
14001 2021-06-28 Indu Bhagat <indu.bhagat@oracle.com>
14002 David Faust <david.faust@oracle.com>
14003 Jose E. Marchesi <jose.marchesi@oracle.com>
14004 Weimin Pan <weimin.pan@oracle.com>
14006 * Makefile.in: Add ctfc.*, ctfout.c and btfout.c files to
14007 GTFILES. Add new object files.
14008 * common.opt: Add CTF and BTF debug info options.
14009 * btfout.c: New file.
14010 * ctfc.c: Likewise.
14011 * ctfc.h: Likewise.
14012 * ctfout.c: Likewise.
14013 * dwarf2ctf.c: Likewise.
14014 * dwarf2ctf.h: Likewise.
14015 * dwarf2cfi.c (dwarf2out_do_frame): Acknowledge CTF_DEBUG and
14017 * dwarf2out.c (dwarf2out_source_line): Likewise.
14018 (dwarf2out_finish): Skip emitting DWARF if CTF or BTF are to
14020 (debug_format_do_cu): New function.
14021 (dwarf2out_early_finish): Traverse DIEs and emit CTF/BTF for
14023 Include dwarf2ctf.c.
14024 * final.c (dwarf2_debug_info_emitted_p): Acknowledge DWARF-based debug
14026 * flag-types.h (enum debug_info_type): Add CTF_DEBUG and BTF_DEBUG.
14027 (CTF_DEBUG): New bitmask.
14028 (BTF_DEBUG): Likewise.
14029 (enum ctf_debug_info_levels): New enum.
14030 * gengtype.c (open_base_files): Handle ctfc.h.
14031 (main): Handle uint32_t type.
14032 * flags.h (btf_debuginfo_p): New definition.
14033 (dwarf_based_debuginfo_p): Likewise.
14034 * opts.c (debug_type_names): Add entries for CTF and BTF.
14035 (btf_debuginfo_p): New function.
14036 (dwarf_based_debuginfo_p): Likewise.
14037 (common_handle_option): Handle -gctfN and -gbtf options.
14038 (set_debug_level): Set CTF_DEBUG, BTF_DEBUG whenever appropriate.
14039 * toplev.c (process_options): Inform the user and ignore -gctfLEVEL if
14042 2021-06-28 Jose E. Marchesi <jose.marchesi@oracle.com>
14044 * dwarf2out.c (AT_class): Function is no longer static.
14045 (AT_int): Likewise.
14046 (AT_unsigned): Likewise.
14047 (AT_loc): Likewise.
14048 (get_AT): Likewise.
14049 (get_AT_string): Likewise.
14050 (get_AT_flag): Likewise.
14051 (get_AT_unsigned): Likewise.
14052 (get_AT_ref): Likewise.
14053 (new_die_raw): Likewise.
14054 (lookup_decl_die): Likewise.
14055 (base_type_die): Likewise.
14056 (add_name_attribute): Likewise.
14057 (add_AT_int): Likewise.
14058 (add_AT_unsigned): Likewise.
14059 (add_AT_loc): Likewise.
14060 (dw_get_die_tag): New function.
14061 (dw_get_die_child): Likewise.
14062 (dw_get_die_sib): Likewise.
14063 (struct dwarf_file_data): Move from here to dwarf2out.h
14064 (struct dw_attr_struct): Likewise.
14065 * dwarf2out.h: Analogous changes.
14067 2021-06-28 Martin Jambor <mjambor@suse.cz>
14070 * ipa-param-manipulation.h (class ipa_param_body_adjustments): New
14071 members m_dead_stmts and m_dead_ssas.
14072 * ipa-param-manipulation.c
14073 (ipa_param_body_adjustments::mark_dead_statements): New function.
14074 (ipa_param_body_adjustments::common_initialization): Call it on
14075 all removed but not split parameters.
14076 (ipa_param_body_adjustments::ipa_param_body_adjustments): Initialize
14078 (ipa_param_body_adjustments::modify_call_stmt): Remove arguments that
14080 * tree-inline.c (remap_gimple_stmt): Do not copy dead statements, reset
14081 dead debug statements.
14082 (copy_phis_for_bb): Do not copy dead PHI nodes.
14084 2021-06-28 Martin Jambor <mjambor@suse.cz>
14087 * symtab-clones.h (clone_info): Removed member param_adjustments.
14088 * ipa-param-manipulation.h: Adjust initial comment to reflect how we
14089 deal with pass-through splits now.
14090 (ipa_param_performed_split): Removed.
14091 (ipa_param_adjustments::modify_call): Adjusted parameters.
14092 (class ipa_param_body_adjustments): Adjusted parameters of
14093 register_replacement, modify_gimple_stmt and modify_call_stmt.
14094 (ipa_verify_edge_has_no_modifications): Declare.
14095 (ipa_edge_modifications_finalize): Declare.
14096 * cgraph.c (cgraph_edge::redirect_call_stmt_to_callee): Remove
14097 performed_splits processing, pas only edge to padjs->modify_call,
14098 check that call arguments were not modified if they should not have
14100 * cgraphclones.c (cgraph_node::create_clone): Do not copy performed
14102 * ipa-param-manipulation.c (struct pass_through_split_map): New type.
14103 (ipa_edge_modification_info): Likewise.
14104 (ipa_edge_modification_sum): Likewise.
14105 (ipa_edge_modifications): New edge summary.
14106 (ipa_verify_edge_has_no_modifications): New function.
14107 (transitive_split_p): Removed.
14108 (transitive_split_map): Likewise.
14109 (init_transitive_splits): Likewise.
14110 (ipa_param_adjustments::modify_call): Adjusted to use the new edge
14111 summary instead of performed_splits.
14112 (ipa_param_body_adjustments::register_replacement): Drop dummy
14113 parameter, set base_index of the created ipa_param_body_replacement.
14114 (phi_arg_will_live_p): New function.
14115 (ipa_param_body_adjustments::common_initialization): Do not create
14116 IPA_SRA dummy decls.
14117 (simple_tree_swap_info): Removed.
14118 (remap_split_decl_to_dummy): Likewise.
14119 (record_argument_state_1): New function.
14120 (record_argument_state): Likewise.
14121 (ipa_param_body_adjustments::modify_call_stmt): New parameter
14122 orig_stmt. Do not work with dummy decls, save necessary info about
14123 changes to ipa_edge_modifications.
14124 (ipa_param_body_adjustments::modify_gimple_stmt): New parameter
14125 orig_stmt, pass it to modify_call_stmt.
14126 (ipa_param_body_adjustments::modify_cfun_body): Adjust call to
14127 modify_gimple_stmt.
14128 (ipa_edge_modifications_finalize): New function.
14129 * tree-inline.c (remap_gimple_stmt): Pass original statement to
14130 modify_gimple_stmt.
14131 (copy_phis_for_bb): Do not copy dead PHI nodes.
14132 (expand_call_inline): Do not remap performed_splits.
14133 (update_clone_info): Likewise.
14134 * toplev.c: Include ipa-param-manipulation.h.
14135 (toplev::finalize): Call ipa_edge_modifications_finalize.
14137 2021-06-28 Andrew Pinski <apinski@marvell.com>
14139 * tree-ssa-phiopt.c (replace_phi_edge_with_variable): Duplicate range
14140 info if we're the only things setting the target PHI.
14141 (value_replacement): Don't duplicate range here.
14142 (minmax_replacement): Likewise.
14144 2021-06-28 Richard Biener <rguenther@suse.de>
14146 PR tree-optimization/101229
14147 * gimple-walk.c (gimple_walk_op): Handle PHIs.
14149 2021-06-28 Martin Liska <mliska@suse.cz>
14151 * config/v850/v850.c (construct_dispose_instruction): Allocate
14153 (construct_prepare_instruction): Likewise.
14155 2021-06-28 Martin Liska <mliska@suse.cz>
14157 * config/v850/v850.c (v850_option_override): Build default
14159 (v850_can_inline_p): New. Allow MASK_PROLOG_FUNCTION to be
14160 ignored for inlining.
14161 (TARGET_CAN_INLINE_P): New.
14163 2021-06-28 Richard Biener <rguenther@suse.de>
14165 PR tree-optimization/101207
14166 * tree-vect-slp.c (vect_optimize_slp): Do BB reduction
14167 permute eliding for load permutations properly.
14169 2021-06-28 Richard Biener <rguenther@suse.de>
14171 PR tree-optimization/101173
14172 * gimple-loop-interchange.cc
14173 (tree_loop_interchange::valid_data_dependences): Disallow outer
14174 loop dependence distance of zero.
14176 2021-06-28 liuhongt <hongtao.liu@intel.com>
14179 * config/i386/sse.md (*avx_cmp<mode>3_lt): New
14180 define_insn_and_split.
14181 (*avx_cmp<mode>3_ltint): Ditto.
14182 (*avx2_pcmp<mode>3_3): Ditto.
14183 (*avx2_pcmp<mode>3_4): Ditto.
14184 (*avx2_pcmp<mode>3_5): Ditto.
14186 2021-06-28 liuhongt <hongtao.liu@intel.com>
14188 * config/i386/i386-builtin.def (IX86_BUILTIN_BLENDVPD256,
14189 IX86_BUILTIN_BLENDVPS256, IX86_BUILTIN_PBLENDVB256,
14190 IX86_BUILTIN_BLENDVPD, IX86_BUILTIN_BLENDVPS,
14191 IX86_BUILTIN_PBLENDVB128): Replace icode with
14193 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold blendv
14195 * config/i386/sse.md (*<sse4_1_avx2>_pblendvb_lt_subreg_not):
14196 New pre_reload splitter.
14198 2021-06-27 Andrew Pinski <apinski@marvell.com>
14200 PR middle-end/101230
14201 * fold-const.c (fold_ternary_loc): Check
14202 the return value of invert_tree_comparison.
14204 2021-06-27 David Edelsohn <dje.gcc@gmail.com>
14206 * config.gcc: Add SPDX License Identifier.
14207 (powerpc-ibm-aix789): Default to aix73.h.
14208 (powerpc-ibm-aix7.2.*.*): New stanza.
14209 * config/rs6000/aix72.h: Add SPDX License Identifier.
14210 * config/rs6000/aix73.h: New file.
14212 2021-06-26 Jason Merrill <jason@redhat.com>
14214 * except.c: #include "dwarf2.h" instead of "dwarf2out.h".
14216 2021-06-26 Andrew Pinski <apinski@marvell.com>
14218 * genmatch.c (lower_cond): Copy for_subst_vec
14219 for the simplify also.
14220 (lower): Swap the order for lower_for and lower_cond.
14222 2021-06-26 Andrew Pinski <apinski@marvell.com>
14224 * tree-ssa-phiopt.c (match_simplify_replacement): Reset
14225 flow senatitive info on the moved ssa set.
14227 2021-06-26 Andrew Pinski <apinski@marvell.com>
14229 * fold-const.c (fold_cond_expr_with_comparison):
14230 Exand arg0 into comp_code, arg00, and arg01.
14231 (fold_ternary_loc): Use invert_tree_comparison
14232 instead of fold_invert_truthvalue for the case
14233 where we have A CMP B ? C : A.
14235 2021-06-25 Martin Sebor <msebor@redhat.com>
14237 PR middle-end/101216
14238 * calls.c (maybe_warn_rdwr_sizes): Use the no_warning constant.
14240 2021-06-25 Jeff Law <jeffreyalaw@gmail.com>
14242 * config/h8300/h8300.c (select_cc_mode): Handle ASHIFTRT and LSHIFTRT.
14244 2021-06-25 Richard Biener <rguenther@suse.de>
14246 PR tree-optimization/101202
14247 * tree-vect-slp.c (vect_optimize_slp): Explicitely handle
14250 2021-06-25 Richard Biener <rguenther@suse.de>
14252 * tree-vect-slp-patterns.c (addsub_pattern::build): Copy
14253 STMT_VINFO_REDUC_DEF from the original representative.
14255 2021-06-25 Martin Sebor <msebor@redhat.com>
14257 * builtins.c (warn_string_no_nul): Replace uses of TREE_NO_WARNING,
14258 gimple_no_warning_p and gimple_set_no_warning with
14259 warning_suppressed_p, and suppress_warning.
14261 (maybe_warn_for_bound): Same.
14262 (warn_for_access): Same.
14263 (check_access): Same.
14264 (expand_builtin_strncmp): Same.
14265 (fold_builtin_varargs): Same.
14266 * calls.c (maybe_warn_nonstring_arg): Same.
14267 (maybe_warn_rdwr_sizes): Same.
14268 * cfgexpand.c (expand_call_stmt): Same.
14269 * cgraphunit.c (check_global_declaration): Same.
14270 * fold-const.c (fold_undefer_overflow_warnings): Same.
14271 (fold_truth_not_expr): Same.
14272 (fold_unary_loc): Same.
14273 (fold_checksum_tree): Same.
14274 * gimple-array-bounds.cc (array_bounds_checker::check_array_ref): Same.
14275 (array_bounds_checker::check_mem_ref): Same.
14276 (array_bounds_checker::check_addr_expr): Same.
14277 (array_bounds_checker::check_array_bounds): Same.
14278 * gimple-expr.c (copy_var_decl): Same.
14279 * gimple-fold.c (gimple_fold_builtin_strcpy): Same.
14280 (gimple_fold_builtin_strncat): Same.
14281 (gimple_fold_builtin_stxcpy_chk): Same.
14282 (gimple_fold_builtin_stpcpy): Same.
14283 (gimple_fold_builtin_sprintf): Same.
14284 (fold_stmt_1): Same.
14285 * gimple-ssa-isolate-paths.c (diag_returned_locals): Same.
14286 * gimple-ssa-nonnull-compare.c (do_warn_nonnull_compare): Same.
14287 * gimple-ssa-sprintf.c (handle_printf_call): Same.
14288 * gimple-ssa-store-merging.c (imm_store_chain_info::output_merged_store): Same.
14289 * gimple-ssa-warn-restrict.c (maybe_diag_overlap): Same.
14290 * gimple-ssa-warn-restrict.h: Adjust declarations.
14291 (maybe_diag_access_bounds): Replace uses of TREE_NO_WARNING,
14292 gimple_no_warning_p and gimple_set_no_warning with
14293 warning_suppressed_p, and suppress_warning.
14294 (check_call): Same.
14295 (check_bounds_or_overlap): Same.
14296 * gimple.c (gimple_build_call_from_tree): Same.
14297 * gimplify.c (gimplify_return_expr): Same.
14298 (gimplify_cond_expr): Same.
14299 (gimplify_modify_expr_complex_part): Same.
14300 (gimplify_modify_expr): Same.
14301 (gimple_push_cleanup): Same.
14302 (gimplify_expr): Same.
14303 * omp-expand.c (expand_omp_for_generic): Same.
14304 (expand_omp_taskloop_for_outer): Same.
14305 * omp-low.c (lower_rec_input_clauses): Same.
14306 (lower_lastprivate_clauses): Same.
14307 (lower_send_clauses): Same.
14308 (lower_omp_target): Same.
14309 * tree-cfg.c (pass_warn_function_return::execute): Same.
14310 * tree-complex.c (create_one_component_var): Same.
14311 * tree-inline.c (remap_gimple_op_r): Same.
14312 (copy_tree_body_r): Same.
14313 (declare_return_variable): Same.
14314 (expand_call_inline): Same.
14315 * tree-nested.c (lookup_field_for_decl): Same.
14316 * tree-sra.c (create_access_replacement): Same.
14317 (generate_subtree_copies): Same.
14318 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Same.
14319 * tree-ssa-forwprop.c (combine_cond_expr_cond): Same.
14320 * tree-ssa-loop-ch.c (ch_base::copy_headers): Same.
14321 * tree-ssa-loop-im.c (execute_sm): Same.
14322 * tree-ssa-phiopt.c (cond_store_replacement): Same.
14323 * tree-ssa-strlen.c (maybe_warn_overflow): Same.
14324 (handle_builtin_strcpy): Same.
14325 (maybe_diag_stxncpy_trunc): Same.
14326 (handle_builtin_stxncpy_strncat): Same.
14327 (handle_builtin_strcat): Same.
14328 * tree-ssa-uninit.c (get_no_uninit_warning): Same.
14329 (set_no_uninit_warning): Same.
14330 (uninit_undefined_value_p): Same.
14331 (warn_uninit): Same.
14332 (maybe_warn_operand): Same.
14333 * tree-vrp.c (compare_values_warnv): Same.
14334 * vr-values.c (vr_values::extract_range_for_var_from_comparison_expr): Same.
14335 (test_for_singularity): Same.
14336 * gimple.h (warning_suppressed_p): New function.
14337 (suppress_warning): Same.
14338 (copy_no_warning): Same.
14339 (gimple_set_block): Call gimple_set_location.
14340 (gimple_set_location): Call copy_warning.
14342 2021-06-25 Martin Sebor <msebor@redhat.com>
14344 * tree.h (warning_suppressed_at, copy_warning,
14345 warning_suppressed_p, suppress_warning): New functions.
14347 2021-06-25 Martin Sebor <msebor@redhat.com>
14349 * Makefile.in (OBJS-libcommon): Add diagnostic-spec.o.
14350 * gengtype.c (open_base_files): Add diagnostic-spec.h.
14351 * diagnostic-spec.c: New file.
14352 * diagnostic-spec.h: New file.
14353 * tree.h (no_warning, all_warnings, suppress_warning_at): New
14355 * warning-control.cc: New file.
14357 2021-06-25 liuhongt <hongtao.liu@intel.com>
14360 * config/i386/i386.c (x86_order_regs_for_local_alloc):
14363 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
14365 PR tree-optimization/101189
14366 * gimple-range-fold.cc (fold_using_range::range_of_range_op): Pass
14367 LHS range of condition to postfold routine.
14368 (fold_using_range::postfold_gcond_edges): Only process the TRUE or
14369 FALSE edge if the LHS range supports it being taken.
14370 * gimple-range-fold.h (postfold_gcond_edges): Add range parameter.
14372 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
14374 * value-relation.cc (equiv_oracle::dump): Do not dump NULL blocks.
14375 (relation_oracle::find_relation_block): Check correct bitmap.
14376 (relation_oracle::dump): Do not dump NULL blocks.
14378 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
14380 * gimple-range-cache.cc (ranger_cache::propagate_cache): Call
14381 range_on_edge instead of manually calculating.
14383 2021-06-24 Andrew MacLeod <amacleod@redhat.com>
14385 * range-op.cc: Fix comment.
14387 2021-06-24 Uroš Bizjak <ubizjak@gmail.com>
14390 * config/i386/i386-expand.c (ix86_expand_sse_unpack):
14391 Handle V8QI and V4HI modes.
14392 * config/i386/mmx.md (sse4_1_<any_extend:code>v4qiv4hi2):
14394 (sse4_1_<any_extend:code>v4qiv4hi2): Ditto.
14395 (mmxpackmode): New mode attribute.
14396 (vec_pack_trunc_<mmxpackmode:mode>): New expander.
14397 (mmxunpackmode): New mode attribute.
14398 (vec_unpacks_lo_<mmxunpackmode:mode>): New expander.
14399 (vec_unpacks_hi_<mmxunpackmode:mode>): Ditto.
14400 (vec_unpacku_lo_<mmxunpackmode:mode>): Ditto.
14401 (vec_unpacku_hi_<mmxunpackmode:mode>): Ditto.
14402 * config/i386/i386.md (extsuffix): Move from ...
14403 * config/i386/sse.md: ... here.
14405 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
14407 * dwarf2out.c (dwarf2out_assembly_start): Emit .file 0 marker here..
14408 (dwarf2out_finish): ...instead of here.
14410 2021-06-24 Eric Botcazou <ebotcazou@adacore.com>
14412 * configure.ac (--gdwarf-5 option): Use objdump instead of readelf.
14413 (working --gdwarf-4/--gdwarf-5 for all sources): Likewise.
14414 (--gdwarf-4 not refusing generated .debug_line): Adjust for Windows.
14415 * configure: Regenerate.
14417 2021-06-24 Richard Biener <rguenther@suse.de>
14419 * config/i386/sse.md (vec_addsubv4df3, vec_addsubv2df3,
14420 vec_addsubv8sf3, vec_addsubv4sf3): Merge into ...
14421 (vec_addsub<mode>3): ... using a new addsub_cst mode attribute.
14423 2021-06-24 Richard Biener <rguenther@suse.de>
14425 * config/i386/sse.md (avx_addsubv4df3): Rename to
14427 (avx_addsubv8sf3): Rename to vec_addsubv8sf3.
14428 (sse3_addsubv2df3): Rename to vec_addsubv2df3.
14429 (sse3_addsubv4sf3): Rename to vec_addsubv4sf3.
14430 * config/i386/i386-builtin.def: Adjust.
14431 * internal-fn.def (VEC_ADDSUB): New internal optab fn.
14432 * optabs.def (vec_addsub_optab): New optab.
14433 * tree-vect-slp-patterns.c (class addsub_pattern): New.
14434 (slp_patterns): Add addsub_pattern.
14435 * tree-vect-slp.c (vect_optimize_slp): Disable propagation
14436 across CFN_VEC_ADDSUB.
14437 * tree-vectorizer.h (vect_pattern::vect_pattern): Make
14439 * doc/md.texi (vec_addsub<mode>3): Document.
14441 2021-06-24 Jakub Jelinek <jakub@redhat.com>
14443 PR middle-end/101170
14444 * df-scan.c (df_ref_record): For paradoxical big-endian SUBREGs
14445 where regno + subreg_regno_offset wraps around use 0 as starting
14448 2021-06-24 Jakub Jelinek <jakub@redhat.com>
14450 PR middle-end/101172
14451 * stor-layout.c (finish_bitfield_representative): If nextf has
14452 error_mark_node type, set repr type to error_mark_node too.
14454 2021-06-24 Ilya Leoshkevich <iii@linux.ibm.com>
14456 * config/s390/s390.c (s390_function_profiler): Ignore labelno
14458 * config/s390/s390.h (NO_PROFILE_COUNTERS): Define.
14460 2021-06-24 Richard Biener <rguenther@suse.de>
14462 * tree-vect-slp.c (vect_optimize_slp): Do not propagate
14463 across operations that have different semantics on different
14466 2021-06-24 Jakub Jelinek <jakub@redhat.com>
14468 * tree.h (OMP_CLAUSE_MAP_IN_REDUCTION): Document meaning for OpenMP.
14469 * gimplify.c (gimplify_scan_omp_clauses): For OpenMP map clauses
14470 with OMP_CLAUSE_MAP_IN_REDUCTION flag partially defer gimplification
14471 of non-decl OMP_CLAUSE_DECL. For OMP_CLAUSE_IN_REDUCTION on
14472 OMP_TARGET user outer_ctx instead of ctx for placeholders and
14473 initializer/combiner gimplification.
14474 * omp-low.c (scan_sharing_clauses): Handle OMP_CLAUSE_MAP_IN_REDUCTION
14475 on target constructs.
14476 (lower_rec_input_clauses): Likewise.
14477 (lower_omp_target): Likewise.
14478 * omp-expand.c (expand_omp_target): Temporarily ignore nowait clause
14479 on target if in_reduction is present.
14481 2021-06-24 Kewen Lin <linkw@linux.ibm.com>
14483 * tree-predcom.c (class pcom_worker): New class.
14484 (release_chain): Renamed to...
14485 (pcom_worker::release_chain): ...this.
14486 (release_chains): Renamed to...
14487 (pcom_worker::release_chains): ...this.
14488 (aff_combination_dr_offset): Renamed to...
14489 (pcom_worker::aff_combination_dr_offset): ...this.
14490 (determine_offset): Renamed to...
14491 (pcom_worker::determine_offset): ...this.
14492 (class comp_ptrs): New class.
14493 (split_data_refs_to_components): Renamed to...
14494 (pcom_worker::split_data_refs_to_components): ...this,
14495 and update with class comp_ptrs.
14496 (suitable_component_p): Renamed to...
14497 (pcom_worker::suitable_component_p): ...this.
14498 (filter_suitable_components): Renamed to...
14499 (pcom_worker::filter_suitable_components): ...this.
14500 (valid_initializer_p): Renamed to...
14501 (pcom_worker::valid_initializer_p): ...this.
14502 (find_looparound_phi): Renamed to...
14503 (pcom_worker::find_looparound_phi): ...this.
14504 (add_looparound_copies): Renamed to...
14505 (pcom_worker::add_looparound_copies): ...this.
14506 (determine_roots_comp): Renamed to...
14507 (pcom_worker::determine_roots_comp): ...this.
14508 (determine_roots): Renamed to...
14509 (pcom_worker::determine_roots): ...this.
14510 (single_nonlooparound_use): Renamed to...
14511 (pcom_worker::single_nonlooparound_use): ...this.
14512 (remove_stmt): Renamed to...
14513 (pcom_worker::remove_stmt): ...this.
14514 (execute_pred_commoning_chain): Renamed to...
14515 (pcom_worker::execute_pred_commoning_chain): ...this.
14516 (execute_pred_commoning): Renamed to...
14517 (pcom_worker::execute_pred_commoning): ...this.
14518 (struct epcc_data): New member worker.
14519 (execute_pred_commoning_cbck): Call execute_pred_commoning
14520 with pcom_worker pointer.
14521 (find_use_stmt): Renamed to...
14522 (pcom_worker::find_use_stmt): ...this.
14523 (find_associative_operation_root): Renamed to...
14524 (pcom_worker::find_associative_operation_root): ...this.
14525 (find_common_use_stmt): Renamed to...
14526 (pcom_worker::find_common_use_stmt): ...this.
14527 (combinable_refs_p): Renamed to...
14528 (pcom_worker::combinable_refs_p): ...this.
14529 (reassociate_to_the_same_stmt): Renamed to...
14530 (pcom_worker::reassociate_to_the_same_stmt): ...this.
14531 (stmt_combining_refs): Renamed to...
14532 (pcom_worker::stmt_combining_refs): ...this.
14533 (combine_chains): Renamed to...
14534 (pcom_worker::combine_chains): ...this.
14535 (try_combine_chains): Renamed to...
14536 (pcom_worker::try_combine_chains): ...this.
14537 (prepare_initializers_chain): Renamed to...
14538 (pcom_worker::prepare_initializers_chain): ...this.
14539 (prepare_initializers): Renamed to...
14540 (pcom_worker::prepare_initializers): ...this.
14541 (prepare_finalizers_chain): Renamed to...
14542 (pcom_worker::prepare_finalizers_chain): ...this.
14543 (prepare_finalizers): Renamed to...
14544 (pcom_worker::prepare_finalizers): ...this.
14545 (tree_predictive_commoning_loop): Renamed to...
14546 (pcom_worker::tree_predictive_commoning_loop): ...this, adjust
14547 some calls and remove some cleanup code.
14548 (tree_predictive_commoning): Adjusted to use pcom_worker instance.
14549 (static variable looparound_phis): Remove.
14550 (static variable name_expansions): Remove.
14552 2021-06-24 Richard Biener <rguenther@suse.de>
14554 * tree-vect-slp.c (slpg_vertex): New struct.
14555 (vect_slp_build_vertices): Adjust.
14556 (vect_optimize_slp): Likewise. Maintain an outgoing permute
14557 and a materialized one.
14559 2021-06-24 Richard Biener <rguenther@suse.de>
14561 PR tree-optimization/101105
14562 * tree-vect-data-refs.c (vect_prune_runtime_alias_test_list):
14563 Only ignore steps when they are equal or scalar order is preserved.
14565 2021-06-24 liuhongt <hongtao.liu@intel.com>
14568 * config/i386/i386-expand.c (ix86_expand_vec_interleave):
14569 Adjust comments for ix86_expand_vecop_qihi2.
14570 (ix86_expand_vecmul_qihi): Renamed to ..
14571 (ix86_expand_vecop_qihi2): Adjust function prototype to
14572 support shift operation, add static to definition.
14573 (ix86_expand_vec_shift_qihi_constant): Add static to definition.
14574 (ix86_expand_vecop_qihi): Call ix86_expand_vecop_qihi2 and
14575 ix86_expand_vec_shift_qihi_constant.
14576 * config/i386/i386-protos.h (ix86_expand_vecmul_qihi): Deleted.
14577 (ix86_expand_vec_shift_qihi_constant): Deleted.
14578 * config/i386/sse.md (VI12_256_512_AVX512VL): New mode
14580 (mulv8qi3): Call ix86_expand_vecop_qihi directly, add
14581 condition TARGET_64BIT.
14582 (mul<mode>3): Ditto.
14583 (<insn><mode>3): Ditto.
14584 (vlshr<mode>3): Extend to support avx512 vlshr.
14585 (v<insn><mode>3): New expander for
14587 (v<insn>v8qi3): Ditto.
14588 (vashrv8hi3<mask_name>): Renamed to ..
14589 (vashr<mode>3): And extend to support V16QImode for avx512.
14590 (vashrv16qi3): Deleted.
14591 (vashrv2di3<mask_name>): Extend expander to support avx512
14594 2021-06-23 Dimitar Dimitrov <dimitar@dinux.eu>
14596 * doc/lto.texi (Design Overview): Update that slim objects are
14599 2021-06-23 Aaron Sawdey <acsawdey@linux.ibm.com>
14601 * config/rs6000/rs6000-cpus.def: Take OPTION_MASK_PCREL_OPT out
14602 of OTHER_POWER10_MASKS so it will not be enabled by default.
14604 2021-06-23 Richard Biener <rguenther@suse.de>
14605 Martin Jambor <mjambor@suse.cz>
14607 * tree-inline.c (setup_one_parameter): Set TREE_READONLY of the
14608 param replacement unconditionally. Adjust comment.
14610 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
14612 * Makefile.in (OBJS): Add gimple-range-fold.o
14613 * gimple-range-fold.cc: New.
14614 * gimple-range-fold.h: New.
14615 * gimple-range-gori.cc (gimple_range_calc_op1): Move to here.
14616 (gimple_range_calc_op2): Ditto.
14617 * gimple-range-gori.h: Move prototypes to here.
14618 * gimple-range.cc: Adjust include files.
14619 (fur_source:fur_source): Relocate to gimple-range-fold.cc.
14620 (fur_source::get_operand): Ditto.
14621 (fur_source::get_phi_operand): Ditto.
14622 (fur_source::query_relation): Ditto.
14623 (fur_source::register_relation): Ditto.
14624 (class fur_edge): Ditto.
14625 (fur_edge::fur_edge): Ditto.
14626 (fur_edge::get_operand): Ditto.
14627 (fur_edge::get_phi_operand): Ditto.
14628 (fur_stmt::fur_stmt): Ditto.
14629 (fur_stmt::get_operand): Ditto.
14630 (fur_stmt::get_phi_operand): Ditto.
14631 (fur_stmt::query_relation): Ditto.
14632 (class fur_depend): Relocate to gimple-range-fold.h.
14633 (fur_depend::fur_depend): Relocate to gimple-range-fold.cc.
14634 (fur_depend::register_relation): Ditto.
14635 (fur_depend::register_relation): Ditto.
14636 (class fur_list): Ditto.
14637 (fur_list::fur_list): Ditto.
14638 (fur_list::get_operand): Ditto.
14639 (fur_list::get_phi_operand): Ditto.
14640 (fold_range): Ditto.
14641 (adjust_pointer_diff_expr): Ditto.
14642 (gimple_range_adjustment): Ditto.
14643 (gimple_range_base_of_assignment): Ditto.
14644 (gimple_range_operand1): Ditto.
14645 (gimple_range_operand2): Ditto.
14646 (gimple_range_calc_op1): Relocate to gimple-range-gori.cc.
14647 (gimple_range_calc_op2): Ditto.
14648 (fold_using_range::fold_stmt): Relocate to gimple-range-fold.cc.
14649 (fold_using_range::range_of_range_op): Ditto.
14650 (fold_using_range::range_of_address): Ditto.
14651 (fold_using_range::range_of_phi): Ditto.
14652 (fold_using_range::range_of_call): Ditto.
14653 (fold_using_range::range_of_builtin_ubsan_call): Ditto.
14654 (fold_using_range::range_of_builtin_call): Ditto.
14655 (fold_using_range::range_of_cond_expr): Ditto.
14656 (fold_using_range::range_of_ssa_name_with_loop_info): Ditto.
14657 (fold_using_range::relation_fold_and_or): Ditto.
14658 (fold_using_range::postfold_gcond_edges): Ditto.
14659 * gimple-range.h: Add gimple-range-fold.h to include files. Change
14660 GIMPLE_RANGE_STMT_H to GIMPLE_RANGE_H.
14661 (gimple_range_handler): Relocate to gimple-range-fold.h.
14662 (gimple_range_ssa_p): Ditto.
14663 (range_compatible_p): Ditto.
14664 (class fur_source): Ditto.
14665 (class fur_stmt): Ditto.
14666 (class fold_using_range): Ditto.
14667 (gimple_range_calc_op1): Relocate to gimple-range-gori.h
14668 (gimple_range_calc_op2): Ditto.
14670 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
14672 PR tree-optimization/101148
14673 PR tree-optimization/101014
14674 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
14675 (ranger_cache::~ranger_cache): Adjust.
14676 (ranger_cache::block_range): Check if propagation disallowed.
14677 (ranger_cache::propagate_cache): Disallow propagation if new value
14678 can't be stored properly.
14679 * gimple-range-cache.h (ranger_cache::m_propfail): New member.
14681 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
14683 * gimple-range-cache.cc (class ssa_block_ranges): Adjust prototype.
14684 (sbr_vector::set_bb_range): Return true.
14685 (class sbr_sparse_bitmap): Adjust.
14686 (sbr_sparse_bitmap::set_bb_range): Return value.
14687 (block_range_cache::set_bb_range): Return value.
14688 (ranger_cache::propagate_cache): Use return value to print msg.
14689 * gimple-range-cache.h (class block_range_cache): Adjust.
14691 2021-06-23 Andrew MacLeod <amacleod@redhat.com>
14693 * gimple-range.cc (dump_bb): Use range_on_edge from the cache.
14695 2021-06-23 Jeff Law <jeffreyalaw@gmail.com>
14697 * config/h8300/logical.md (<code><mode>3<ccnz>): Use <cczn>
14698 so this pattern can be used for test/compare removal. Pass
14699 current insn to compute_logical_op_length and output_logical_op.
14700 * config/h8300/h8300.c (compute_logical_op_cc): Remove.
14701 (h8300_and_costs): Add argument to compute_logical_op_length.
14702 (output_logical_op): Add new argument. Use it to determine if the
14703 condition codes are used and adjust the output accordingly.
14704 (compute_logical_op_length): Add new argument and update length
14705 computations when condition codes are used.
14706 * config/h8300/h8300-protos.h (compute_logical_op_length): Update
14708 (output_logical_op): Likewise.
14710 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
14713 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
14714 Handle 64bit modes for TARGET_XOP. Use indirect gen_* functions.
14715 * config/i386/mmx.md (mmx_ppermv64): New insn pattern.
14716 * config/i386/i386.md (unspec): Move UNSPEC_XOP_PERMUTE from ...
14717 * config/i386/sse.md (unspec): ... here.
14719 2021-06-23 Martin Liska <mliska@suse.cz>
14722 * optc-save-gen.awk: Put back arm_fp16_format to
14725 2021-06-23 Uroš Bizjak <ubizjak@gmail.com>
14728 * config/i386/i386.md (bsr_rex64): Add zero-flag setting RTX.
14731 (clz<mode>2): Update RTX pattern for additions.
14733 2021-06-23 Jakub Jelinek <jakub@redhat.com>
14735 PR middle-end/101167
14736 * omp-low.c (lower_omp_regimplify_p): Regimplify also PARM_DECLs
14737 and RESULT_DECLs that have DECL_HAS_VALUE_EXPR_P set.
14739 2021-06-22 Sergei Trofimovich <siarheit@google.com>
14741 * doc/rtl.texi: drop unbalanced parenthesis.
14743 2021-06-22 Richard Biener <rguenther@suse.de>
14745 PR middle-end/101156
14746 * gimplify.c (gimplify_expr): Remove premature incorrect
14749 2021-06-22 Jakub Jelinek <jakub@redhat.com>
14751 PR tree-optimization/101159
14752 * tree-vect-patterns.c (vect_recog_popcount_pattern): Fix some
14755 2021-06-22 Jakub Jelinek <jakub@redhat.com>
14757 PR middle-end/101160
14758 * function.c (assign_parms): For decl_result with TYPE_EMPTY_P type
14759 clear crtl->return_rtx instead of keeping it referencing a pseudo.
14761 2021-06-22 Jakub Jelinek <jakub@redhat.com>
14762 Andrew Pinski <apinski@marvell.com>
14764 PR tree-optimization/101162
14765 * fold-const.c (range_check_type): Handle OFFSET_TYPE like pointer
14768 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14770 * range-op.cc (range_relational_tests): New.
14771 (range_op_tests): Call range_relational_tests.
14773 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14775 * range-op.cc (operator_cast::lhs_op1_relation): New.
14776 (operator_identity::lhs_op1_relation): Mew.
14778 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14780 * range-op.cc (operator_minus::op1_op2_relation_effect): New.
14782 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14784 * range-op.cc (operator_plus::lhs_op1_relation): New.
14785 (operator_plus::lhs_op2_relation): New.
14787 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14789 * gimple-range-cache.cc (ranger_cache::ranger_cache): Create a
14790 relation_oracle if dominators exist.
14791 (ranger_cache::~ranger_cache): Dispose of oracle.
14792 (ranger_cache::dump_bb): Dump oracle.
14793 * gimple-range.cc (fur_source::fur_source): New.
14794 (fur_source::get_operand): Use mmeber query.
14795 (fur_source::get_phi_operand): Use member_query.
14796 (fur_source::query_relation): New.
14797 (fur_source::register_dependency): Delete.
14798 (fur_source::register_relation): New.
14799 (fur_edge::fur_edge): Adjust.
14800 (fur_edge::get_phi_operand): Fix comment.
14801 (fur_edge::query): Delete.
14802 (fur_stmt::fur_stmt): Adjust.
14803 (fur_stmt::query): Delete.
14804 (fur_depend::fur_depend): Adjust.
14805 (fur_depend::register_relation): New.
14806 (fur_depend::register_relation): New.
14807 (fur_list::fur_list): Adjust.
14808 (fur_list::get_operand): Use member query.
14809 (fold_using_range::range_of_range_op): Process and query relations.
14810 (fold_using_range::range_of_address): Adjust dependency call.
14811 (fold_using_range::range_of_phi): Ditto.
14812 (gimple_ranger::gimple_ranger): New. Use ranger_ache oracle.
14813 (fold_using_range::relation_fold_and_or): New.
14814 (fold_using_range::postfold_gcond_edges): New.
14815 * gimple-range.h (class gimple_ranger): Adjust.
14816 (class fur_source): Adjust members.
14817 (class fur_stmt): Ditto.
14818 (class fold_using_range): Ditto.
14820 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14822 * range-op.cc (range_operator::wi_fold): Apply relation effect.
14823 (range_operator::fold_range): Adjust and apply relation effect.
14824 (*::fold_range): Add relation parameters.
14825 (*::op1_range): Ditto.
14826 (*::op2_range): Ditto.
14827 (range_operator::lhs_op1_relation): New.
14828 (range_operator::lhs_op2_relation): New.
14829 (range_operator::op1_op2_relation): New.
14830 (range_operator::op1_op2_relation_effect): New.
14831 (relop_early_resolve): New.
14832 (operator_equal::op1_op2_relation): New.
14833 (operator_equal::fold_range): Call relop_early_resolve.
14834 (operator_not_equal::op1_op2_relation): New.
14835 (operator_not_equal::fold_range): Call relop_early_resolve.
14836 (operator_lt::op1_op2_relation): New.
14837 (operator_lt::fold_range): Call relop_early_resolve.
14838 (operator_le::op1_op2_relation): New.
14839 (operator_le::fold_range): Call relop_early_resolve.
14840 (operator_gt::op1_op2_relation): New.
14841 (operator_gt::fold_range): Call relop_early_resolve.
14842 (operator_ge::op1_op2_relation): New.
14843 (operator_ge::fold_range): Call relop_early_resolve.
14844 * range-op.h (class range_operator): Adjust parameters and methods.
14846 2021-06-22 Andrew MacLeod <amacleod@redhat.com>
14848 * Makefile.in (OBJS): Add value-relation.o.
14849 * gimple-range.h: Adjust include files.
14850 * tree-data-ref.c: Adjust include file order.
14851 * value-query.cc (range_query::get_value_range): Default to no oracle.
14852 (range_query::query_relation): New.
14853 (range_query::query_relation): New.
14854 * value-query.h (class range_query): Adjust.
14855 * value-relation.cc: New.
14856 * value-relation.h: New.
14858 2021-06-22 Richard Biener <rguenther@suse.de>
14860 PR tree-optimization/101151
14861 * tree-ssa-sink.c (statement_sink_location): Expand irreducible
14864 2021-06-22 Jojo R <rjiejie@linux.alibaba.com>
14866 * config/riscv/riscv.c (thead_c906_tune_info): New.
14867 (riscv_tune_info_table): Use new tune.
14869 2021-06-22 Richard Biener <rguenther@suse.de>
14871 PR tree-optimization/101158
14872 * tree-vect-slp.c (vect_build_slp_tree_1): Move same operand
14873 checking after checking for matching operation.
14875 2021-06-22 Richard Biener <rguenther@suse.de>
14877 PR tree-optimization/101159
14878 * tree-vect-patterns.c (vect_recog_popcount_pattern): Add
14879 missing NULL vectype check.
14881 2021-06-22 Richard Biener <rguenther@suse.de>
14883 PR tree-optimization/101154
14884 * tree-vect-slp.c (vect_build_slp_tree_2): Fix out-of-bound access.
14886 2021-06-22 Jakub Jelinek <jakub@redhat.com>
14889 * config/i386/i386-protos.h (ix86_last_zero_store_uid): Declare.
14890 * config/i386/i386-expand.c (ix86_last_zero_store_uid): New variable.
14891 * config/i386/i386.c (ix86_expand_prologue): Clear it.
14892 * config/i386/i386.md (peephole2s for 1/2/4 stores of const0_rtx):
14893 Remove "" from match_operand. Emit new insns using emit_move_insn and
14894 set ix86_last_zero_store_uid to INSN_UID of the last store.
14895 Add peephole2s for 1/2/4 stores of const0_rtx following previous
14898 2021-06-22 Martin Liska <mliska@suse.cz>
14900 * auto-profile.c (AUTO_PROFILE_VERSION): Bump as string format
14903 2021-06-22 Martin Liska <mliska@suse.cz>
14905 * gcov-io.h: Remove padding entries.
14907 2021-06-22 liuhongt <hongtao.liu@intel.com>
14909 PR tree-optimization/97770
14910 * tree-vect-patterns.c (vect_recog_popcount_pattern):
14912 (vect_recog_func vect_vect_recog_func_ptrs): Add new pattern.
14914 2021-06-22 liuhongt <hongtao.liu@intel.com>
14917 * config/i386/i386-builtin.def (BDESC): Adjust builtin name.
14918 * config/i386/sse.md (<avx512>_expand<mode>_mask): Rename to ..
14919 (expand<mode>_mask): this ..
14920 (*expand<mode>_mask): New pre_reload splitter to transform
14921 v{,p}expand* to vmov* when mask is zero, all ones, or has all
14922 ones in it's lower part, otherwise still generate
14925 2021-06-22 liuhongt <hongtao.liu@intel.com>
14928 * config/i386/i386-expand.c
14929 (ix86_expand_special_args_builtin): Keep constm1_operand only
14930 if it satisfies insn's operand predicate.
14932 2021-06-21 Jason Merrill <jason@redhat.com>
14935 * df-scan.c (df_ref_record): Check that regno < endregno.
14936 * function.c (assign_parms, expand_function_end): Do nothing with a
14937 TYPE_EMPTY_P result.
14939 2021-06-21 Richard Biener <rguenther@suse.de>
14941 PR tree-optimization/101120
14942 * tree-vect-data-refs.c (bump_vector_ptr): Fold the
14944 * tree-vect-slp.c (vect_transform_slp_perm_load): Add
14945 DR chain DCE capability.
14946 * tree-vectorizer.h (vect_transform_slp_perm_load): Adjust.
14947 * tree-vect-stmts.c (vectorizable_load): Remove unused
14948 loads in the DR chain for SLP.
14950 2021-06-21 Jakub Jelinek <jakub@redhat.com>
14952 PR inline-asm/100785
14953 * gimplify.c (gimplify_asm_expr): Don't diagnose errors if
14954 output or input operands were already error_mark_node.
14955 * cfgexpand.c (expand_asm_stmt): If errors are emitted,
14956 remove all inputs, outputs and clobbers from the asm and
14957 set template to "".
14959 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
14961 * config/arm/arm_neon.h (vceq_s8): Replace builtin with __a == __b.
14962 (vceq_s16): Likewise.
14963 (vceq_s32): Likewise.
14964 (vceq_u8): Likewise.
14965 (vceq_u16): Likewise.
14966 (vceq_u32): Likewise.
14967 (vceq_p8): Likewise.
14968 (vceqq_s8): Likewise.
14969 (vceqq_s16): Likewise.
14970 (vceqq_s32): Likewise.
14971 (vceqq_u8): Likewise.
14972 (vceqq_u16): Likewise.
14973 (vceqq_u32): Likewise.
14974 (vceqq_p8): Likewise.
14975 (vceq_f32): Gate __a == __b on __FAST_MATH__.
14976 (vceqq_f32): Likewise.
14977 (vceq_f16): Likewise.
14978 (vceqq_f16): Likewise.
14980 2021-06-21 prathamesh.kulkarni <prathamesh.kulkarni@linaro.org>
14983 * config/arm/iterators.md (NEON_VACMP): Remove.
14984 * config/arm/neon.md (neon_vca<cmp_op><mode>): Use GLTE instead of GTGE
14986 (neon_vca<cmp_op><mode>_insn): Likewise.
14987 (neon_vca<cmp_op_unsp><mode>_insn_unspec): Use NEON_VAGLTE instead of
14990 2021-06-21 Richard Biener <rguenther@suse.de>
14992 PR tree-optimization/101121
14993 * tree-vect-slp.c (vect_build_slp_tree_2): To not fail fatally
14994 when we just lack a stmt with the desired op when doing permutation.
14995 (vect_build_slp_tree): When caching a failed SLP build attempt
14996 assert that at least one lane is marked as not matching.
14998 2021-06-21 liuhongt <hongtao.liu@intel.com>
15001 * config/i386/i386.md: (*anddi_1): Disparage slightly the mask
15002 register alternative.
15003 (*and<mode>_1): Ditto.
15005 (*andn<mode>_1): Ditto.
15006 (*<code><mode>_1): Ditto.
15007 (*<code>qi_1): Ditto.
15008 (*one_cmpl<mode>2_1): Ditto.
15009 (*one_cmplsi2_1_zext): Ditto.
15010 (*one_cmplqi2_1): Ditto.
15011 * config/i386/i386.c (x86_order_regs_for_local_alloc): Change
15012 the order of mask registers to be before general registers.
15014 2021-06-21 Roger Sayle <roger@nextmovesoftware.com>
15017 * config/i386/i386.md: New define_peephole2s to shrink writing
15018 1, 2 or 4 consecutive zeros to memory when optimizing for size.
15020 2021-06-18 Jeff Law <jeffreyalaw@gmail.com>
15022 * config/h8300/h8300.c (h8300_select_cc_mode): Handle SYMBOL_REF.
15023 * config/h8300/logical.md (<code><mode>3 logcial expander): Generate
15024 more efficient code when the source can be trivially simplified.
15026 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
15028 * gimple-range-cache.cc (ranger_cache::range_of_def): Calculate
15029 a range if global is not available.
15030 (ranger_cache::entry_range): Fallback to range_of_def.
15031 * gimple-range-cache.h (range_of_def): Adjust prototype.
15033 2021-06-18 Andrew MacLeod <amacleod@redhat.com>
15035 PR tree-optimization/101014
15036 * gimple-range-cache.cc (ranger_cache::ranger_cache): Remove poor
15038 (ranger_cache::~ranger_cache): Ditto.
15039 (ranger_cache::enable_new_values): Delete.
15040 (ranger_cache::push_poor_value): Delete.
15041 (ranger_cache::range_of_def): Remove poor value processing.
15042 (ranger_cache::entry_range): Ditto.
15043 (ranger_cache::fill_block_cache): Ditto.
15044 * gimple-range-cache.h (class ranger_cache): Remove poor value members.
15045 * gimple-range.cc (gimple_ranger::range_of_expr): Remove call.
15046 * gimple-range.h (class gimple_ranger): Adjust.
15048 2021-06-18 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
15051 * common/config/arm/arm-common.c (arm_canon_arch_option_1): New function
15052 derived from arm_canon_arch.
15053 (arm_canon_arch_option): Call it.
15054 (arm_canon_arch_multilib_option): New function.
15055 * config/arm/arm-cpus.in (IGNORE_FOR_MULTILIB): New fgroup.
15056 * config/arm/arm.h (arm_canon_arch_multilib_option): New prototype.
15057 (CANON_ARCH_MULTILIB_SPEC_FUNCTION): New macro.
15058 (MULTILIB_ARCH_CANONICAL_SPECS): New macro.
15059 (DRIVER_SELF_SPECS): Add MULTILIB_ARCH_CANONICAL_SPECS.
15060 * config/arm/arm.opt (mlibarch): New option.
15061 * config/arm/t-rmprofile (MULTILIB_MATCHES): For armv8*-m, replace use
15062 of march on RHS with mlibarch.
15064 2021-06-18 Marcel Vollweiler <marcel@codesourcery.com>
15066 * config.in: Regenerate.
15067 * config/gcn/gcn.c (print_operand_address): Fix for global_load assembler
15069 * configure: Regenerate.
15070 * configure.ac: Fix for global_load assembler functions.
15072 2021-06-18 Richard Biener <rguenther@suse.de>
15074 PR tree-optimization/101112
15075 * tree-vect-slp.c (vect_slp_linearize_chain): Fix condition
15076 to lookup a pattern stmt def.
15078 2021-06-18 Jakub Jelinek <jakub@redhat.com>
15080 PR middle-end/101062
15081 * stor-layout.c (finish_bitfield_layout): Don't add bitfield
15082 representatives in QUAL_UNION_TYPE.
15084 2021-06-18 Andrew Pinski <apinski@marvell.com>
15086 * tree-ssa-phiopt.c (replace_phi_edge_with_variable):
15087 Add counting of how many times it is done.
15088 (factor_out_conditional_conversion): Likewise.
15089 (match_simplify_replacement): Likewise.
15090 (value_replacement): Likewise.
15091 (spaceship_replacement): Likewise.
15092 (cond_store_replacement): Likewise.
15093 (cond_if_else_store_replacement_1): Likewise.
15094 (hoist_adjacent_loads): Likewise.
15096 2021-06-18 Andrew Pinski <apinski@marvell.com>
15098 * tree-cfg.c (verify_gimple_assign_unary): Reject point and offset
15099 types on NEGATE_EXPR, ABS_EXPR, BIT_NOT_EXPR, PAREN_EXPR and CNONJ_EXPR.
15100 (verify_gimple_assign_binary): Reject point and offset types on
15101 MULT_EXPR, MULT_HIGHPART_EXPR, TRUNC_DIV_EXPR, CEIL_DIV_EXPR,
15102 FLOOR_DIV_EXPR, ROUND_DIV_EXPR, TRUNC_MOD_EXPR, CEIL_MOD_EXPR,
15103 FLOOR_MOD_EXPR, ROUND_MOD_EXPR, RDIV_EXPR, and EXACT_DIV_EXPR.
15105 2021-06-18 Michael Meissner <meissner@linux.ibm.com>
15107 * config/rs6000/rs6000.c (rs6000_emit_minmax): Add support for ISA
15108 3.1 IEEE 128-bit floating point xsmaxcqp/xsmincqp instructions.
15109 * config/rs6000/rs6000.md (s<minmax><mode>3, IEEE128 iterator):
15112 2021-06-17 Aaron Sawdey <acsawdey@linux.ibm.com>
15114 * config/rs6000/genfusion.pl (gen_logical_addsubf): Add
15115 earlyclobber to alts 0/1.
15116 (gen_addadd): Add earlyclobber to alts 0/1.
15117 * config/rs6000/fusion.md: Regenerate file.
15119 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15121 * cfgloopanal.c (get_loop_hot_path): Make path an auto_vec.
15123 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
15125 * gimple-range-cache.cc: Comment cleanups.
15126 * gimple-range-gori.cc: Comment cleanups.
15127 * gimple-range.cc: Comment/spacing cleanups
15128 * value-range.h: Comment cleanups.
15130 2021-06-17 H.J. Lu <hjl.tools@gmail.com>
15133 * calls.c (expand_call): Replace PUSH_ARGS with
15134 targetm.calls.push_argument (0).
15135 (emit_library_call_value_1): Likewise.
15136 * defaults.h (PUSH_ARGS): Removed.
15137 (PUSH_ARGS_REVERSED): Replace PUSH_ARGS with
15138 targetm.calls.push_argument (0).
15139 * expr.c (block_move_libcall_safe_for_call_parm): Likewise.
15140 (emit_push_insn): Pass the number bytes to push to
15141 targetm.calls.push_argument and pass 0 if ARGS_ADDR is 0.
15142 * hooks.c (hook_bool_uint_true): New.
15143 * hooks.h (hook_bool_uint_true): Likewise.
15144 * rtlanal.c (nonzero_bits1): Replace PUSH_ARGS with
15145 targetm.calls.push_argument (0).
15146 * target.def (push_argument): Add a targetm.calls hook.
15147 * targhooks.c (default_push_argument): New.
15148 * targhooks.h (default_push_argument): Likewise.
15149 * config/bpf/bpf.h (PUSH_ARGS): Removed.
15150 * config/cr16/cr16.c (TARGET_PUSH_ARGUMENT): New.
15151 * config/cr16/cr16.h (PUSH_ARGS): Removed.
15152 * config/i386/i386.c (ix86_push_argument): New.
15153 (TARGET_PUSH_ARGUMENT): Likewise.
15154 * config/i386/i386.h (PUSH_ARGS): Removed.
15155 * config/m32c/m32c.c (TARGET_PUSH_ARGUMENT): New.
15156 * config/m32c/m32c.h (PUSH_ARGS): Removed.
15157 * config/nios2/nios2.h (PUSH_ARGS): Likewise.
15158 * config/pru/pru.h (PUSH_ARGS): Likewise.
15159 * doc/tm.texi.in: Remove PUSH_ARGS documentation. Add
15160 TARGET_PUSH_ARGUMENT hook.
15161 * doc/tm.texi: Regenerated.
15163 2021-06-17 Uroš Bizjak <ubizjak@gmail.com>
15166 * config/i386/i386-expand.c (expand_vector_set_var):
15167 Handle V2FS mode remapping. Pass TARGET_MMX_WITH_SSE to
15168 ix86_expand_vector_init_duplicate.
15169 (ix86_expand_vector_init_duplicate): Emit insv_1 for
15170 QImode for !TARGET_PARTIAL_REG_STALL.
15171 * config/i386/predicates.md (vec_setm_mmx_operand): New predicate.
15172 * config/i386/mmx.md (vec_setv2sf): Use vec_setm_mmx_operand
15173 as operand 2 predicate. Call ix86_expand_vector_set_var
15174 for non-constant index operand.
15175 (vec_setv2si): Ditto.
15176 (vec_setv4hi): Ditto.
15177 (vec_setv8qi): ditto.
15179 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
15181 PR tree-optimization/100790
15182 * gimple-range.cc (range_of_builtin_call): Cleanup clz and ctz
15185 2021-06-17 Martin Liska <mliska@suse.cz>
15187 * doc/invoke.texi: Use consistently -O1 instead of -O.
15189 2021-06-17 Martin Liska <mliska@suse.cz>
15191 * gcov-io.h: Update documentation entry about string format.
15193 2021-06-17 Marius Hillenbrand <mhillen@linux.ibm.com>
15196 * config/s390/vecintrin.h (vec_doublee): Fix to use
15197 __builtin_s390_vflls.
15198 (vec_floate): Fix to use __builtin_s390_vflrd.
15200 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15202 * dominance.c (get_dominated_to_depth): Return auto_vec<basic_block>.
15203 * dominance.h (get_dominated_to_depth): Likewise.
15204 (get_all_dominated_blocks): Likewise.
15205 * cfgcleanup.c (delete_unreachable_blocks): Adjust.
15206 * gcse.c (hoist_code): Likewise.
15207 * tree-cfg.c (remove_edge_and_dominated_blocks): Likewise.
15208 * tree-parloops.c (oacc_entry_exit_ok): Likewise.
15209 * tree-ssa-dce.c (eliminate_unnecessary_stmts): Likewise.
15210 * tree-ssa-phiprop.c (pass_phiprop::execute): Likewise.
15212 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15214 * dominance.c (get_dominated_by_region): Return auto_vec<basic_block>.
15215 * dominance.h (get_dominated_by_region): Likewise.
15216 * tree-cfg.c (gimple_duplicate_sese_region): Adjust.
15217 (gimple_duplicate_sese_tail): Likewise.
15218 (move_sese_region_to_fn): Likewise.
15220 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15222 * dominance.c (get_dominated_by): Return auto_vec<basic_block>.
15223 * dominance.h (get_dominated_by): Likewise.
15224 * auto-profile.c (afdo_find_equiv_class): Adjust.
15225 * cfgloopmanip.c (duplicate_loop_to_header_edge): Likewise.
15226 * loop-unroll.c (unroll_loop_runtime_iterations): Likewise.
15227 * tree-cfg.c (test_linear_chain): Likewise.
15228 (test_diamond): Likewise.
15230 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15232 * cfgloop.h (get_loop_hot_path): Return auto_vec<basic_block>.
15233 * cfgloopanal.c (get_loop_hot_path): Likewise.
15234 * tree-ssa-loop-ivcanon.c (tree_estimate_loop_size): Likewise.
15236 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15238 * cgraph.c (cgraph_node::collect_callers): Return
15239 auto_vec<cgraph_edge *>.
15240 * cgraph.h (cgraph_node::collect_callers): Likewise.
15241 * ipa-cp.c (create_specialized_node): Adjust.
15242 (decide_about_value): Likewise.
15243 (decide_whether_version_node): Likewise.
15244 * ipa-sra.c (process_isra_node_results): Likewise.
15246 2021-06-17 Trevor Saunders <tbsaunde@tbsaunde.org>
15248 * vec.h (vl_ptr>::using_auto_storage): Handle null m_vec.
15249 (auto_vec<T, 0>::auto_vec): Define move constructor, and delete copy
15251 (auto_vec<T, 0>::operator=): Define move assignment and delete copy
15254 2021-06-17 Aldy Hernandez <aldyh@redhat.com>
15256 * gimple-range.cc (debug_seed_ranger): New.
15257 (dump_ranger): New.
15258 (debug_ranger): New.
15260 2021-06-17 Richard Biener <rguenther@suse.de>
15262 PR tree-optimization/54400
15263 * tree-vectorizer.h (enum slp_instance_kind): Add
15264 slp_inst_kind_bb_reduc.
15265 (reduction_fn_for_scalar_code): Declare.
15266 * tree-vect-data-refs.c (vect_slp_analyze_instance_dependence):
15267 Check SLP_INSTANCE_KIND instead of looking at the
15269 (vect_slp_analyze_instance_alignment): Likewise.
15270 * tree-vect-loop.c (reduction_fn_for_scalar_code): Export.
15271 * tree-vect-slp.c (vect_slp_linearize_chain): Split out
15272 chain linearization from vect_build_slp_tree_2 and generalize
15273 for the use of BB reduction vectorization.
15274 (vect_build_slp_tree_2): Adjust accordingly.
15275 (vect_optimize_slp): Elide permutes at the root of BB reduction
15277 (vectorizable_bb_reduc_epilogue): New function.
15278 (vect_slp_prune_covered_roots): Likewise.
15279 (vect_slp_analyze_operations): Use them.
15280 (vect_slp_check_for_constructors): Recognize associatable
15281 chains for BB reduction vectorization.
15282 (vectorize_slp_instance_root_stmt): Generate code for the
15283 BB reduction epilogue.
15285 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
15287 * gimple-range-gori.cc (gori_compute::has_edge_range_p): Check with
15289 (gori_compute::may_recompute_p): New.
15290 (gori_compute::outgoing_edge_range_p): Perform recomputations.
15291 * gimple-range-gori.h (class gori_compute): Add prototype.
15293 2021-06-17 Andrew MacLeod <amacleod@redhat.com>
15295 * gimple-range-cache.cc (ranger_cache::range_on_edge): Always return
15296 true when a range can be calculated.
15297 * gimple-range.cc (gimple_ranger::dump_bb): Check has_edge_range_p.
15299 2021-06-16 Martin Sebor <msebor@redhat.com>
15301 * doc/invoke.texi (-Wmismatched-dealloc, -Wmismatched-new-delete):
15302 Correct documented defaults.
15304 2021-06-16 Andrew MacLeod <amacleod@redhat.com>
15306 * gimple-range-cache.cc (ranger_cache::ranger_cache): Initialize
15307 m_new_value_p directly.
15309 2021-06-16 Uroš Bizjak <ubizjak@gmail.com>
15312 * config/i386/i386-expand.c (expand_vec_perm_2perm_pblendv):
15313 Handle 64bit modes for TARGET_SSE4_1.
15314 (expand_vec_perm_pshufb2): Handle 64bit modes for TARGET_SSSE3.
15315 (expand_vec_perm_even_odd_pack): Handle V4HI mode.
15316 (expand_vec_perm_even_odd_1) <case E_V4HImode>: Expand via
15317 expand_vec_perm_pshufb2 for TARGET_SSSE3 and via
15318 expand_vec_perm_even_odd_pack for TARGET_SSE4_1.
15319 * config/i386/mmx.md (mmx_packusdw): New insn pattern.
15321 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
15323 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn<mode>):
15324 Change to an expander that emits the correct instruction
15325 depending on endianness.
15326 (aarch64_<sur><addsub>hn<mode>_insn_le): Define.
15327 (aarch64_<sur><addsub>hn<mode>_insn_be): Define.
15329 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
15331 * config/aarch64/aarch64-simd-builtins.def: Split generator
15332 for aarch64_<su>qmovn builtins into scalar and vector
15334 * config/aarch64/aarch64-simd.md (aarch64_<su>qmovn<mode>_insn_le):
15336 (aarch64_<su>qmovn<mode>_insn_be): Define.
15337 (aarch64_<su>qmovn<mode>): Split into scalar and vector
15338 variants. Change vector variant to an expander that emits the
15339 correct instruction depending on endianness.
15341 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
15343 * config/aarch64/aarch64-simd-builtins.def: Split generator
15344 for aarch64_sqmovun builtins into scalar and vector variants.
15345 * config/aarch64/aarch64-simd.md (aarch64_sqmovun<mode>):
15346 Split into scalar and vector variants. Change vector variant
15347 to an expander that emits the correct instruction depending
15349 (aarch64_sqmovun<mode>_insn_le): Define.
15350 (aarch64_sqmovun<mode>_insn_be): Define.
15352 2021-06-16 Jonathan Wright <jonathan.wright@arm.com>
15354 * config/aarch64/aarch64-simd.md (aarch64_xtn<mode>_insn_le):
15355 Define - modeling zero-high-half semantics.
15356 (aarch64_xtn<mode>): Change to an expander that emits the
15357 appropriate instruction depending on endianness.
15358 (aarch64_xtn<mode>_insn_be): Define - modeling zero-high-half
15360 (aarch64_xtn2<mode>_le): Rename to...
15361 (aarch64_xtn2<mode>_insn_le): This.
15362 (aarch64_xtn2<mode>_be): Rename to...
15363 (aarch64_xtn2<mode>_insn_be): This.
15364 (vec_pack_trunc_<mode>): Emit truncation instruction instead
15366 * config/aarch64/iterators.md (Vnarrowd): Add Vnarrowd mode
15367 attribute iterator.
15369 2021-06-16 Martin Jambor <mjambor@suse.cz>
15371 PR tree-optimization/100453
15372 * tree-sra.c (create_access): Disqualify any const candidates
15373 which are written to.
15374 (sra_modify_expr): Do not store sub-replacements back to a const base.
15375 (handle_unscalarized_data_in_subtree): Likewise.
15376 (sra_modify_assign): Likewise. Earlier, use TREE_READONLy test
15377 instead of constant_decl_p.
15379 2021-06-16 Jakub Jelinek <jakub@redhat.com>
15381 PR middle-end/101062
15382 * stor-layout.c (finish_bitfield_representative): For fields in unions
15383 assume nextf is always NULL.
15384 (finish_bitfield_layout): Compute bit field representatives also in
15385 unions, but handle it as if each bitfield was the only field in the
15388 2021-06-16 Richard Biener <rguenther@suse.de>
15390 PR tree-optimization/101088
15391 * tree-ssa-loop-im.c (sm_seq_valid_bb): Only look for
15392 supported refs on edges. Do not assert same ref but
15393 different kind stores are unsuported but mark them so.
15394 (hoist_memory_references): Only look for supported refs
15397 2021-06-16 Roger Sayle <roger@nextmovesoftware.com>
15399 PR rtl-optimization/46235
15400 * config/i386/i386.md: New define_split for bt followed by cmov.
15401 (*bt<mode>_setcqi): New define_insn_and_split for bt followed by setc.
15402 (*bt<mode>_setncqi): New define_insn_and_split for bt then setnc.
15403 (*bt<mode>_setnc<mode>): New define_insn_and_split for bt followed
15404 by setnc with zero extension.
15406 2021-06-16 Richard Biener <rguenther@suse.de>
15408 PR tree-optimization/101083
15409 * tree-vect-slp.c (vect_slp_build_two_operator_nodes): Get
15410 vectype as argument.
15411 (vect_build_slp_tree_2): Adjust.
15413 2021-06-15 Martin Sebor <msebor@redhat.com>
15415 PR middle-end/100876
15416 * builtins.c: (gimple_call_return_array): Account for size_t
15417 mangling as either unsigned int or unsigned long
15419 2021-06-15 Jeff Law <jeffreyalaw@gmail.com>
15421 * compare-elim.c (try_eliminate_compare): Run DCE to clean things
15422 up before eliminating comparisons.
15424 2021-06-15 Aldy Hernandez <aldyh@redhat.com>
15426 * range-op.cc (operator_bitwise_or::wi_fold): Make sure
15427 nonzero|X is nonzero.
15428 (range_op_bitwise_and_tests): Add tests for above.
15430 2021-06-15 Carl Love <cel@us.ibm.com>
15433 * config/rs6000/rs6000-builtin.def (VCMPEQUT): Fix the ICODE for the
15435 (VRLQ, VSLQ, VSRQ, VSRAQ): Remove unused BU_P10_OVERLOAD_2
15438 2021-06-15 Tobias Burnus <tobias@codesourcery.com>
15441 * gimplify.c (enum gimplify_defaultmap_kind): Add GDMK_SCALAR_TARGET.
15442 (struct gimplify_omp_ctx): Extend defaultmap array by one.
15443 (new_omp_context): Init defaultmap[GDMK_SCALAR_TARGET].
15444 (omp_notice_variable): Update type classification for Fortran.
15445 (gimplify_scan_omp_clauses): Update calls for new argument; handle
15446 GDMK_SCALAR_TARGET; for Fortran, GDMK_POINTER avoid GOVD_MAP_0LEN_ARRAY.
15447 * langhooks-def.h (lhd_omp_scalar_p): Add 'ptr_ok' argument.
15448 * langhooks.c (lhd_omp_scalar_p): Likewise.
15449 (LANG_HOOKS_OMP_ALLOCATABLE_P, LANG_HOOKS_OMP_SCALAR_TARGET_P): New.
15450 (LANG_HOOKS_DECLS): Add them.
15451 * langhooks.h (struct lang_hooks_for_decls): Add new hooks, update
15452 omp_scalar_p pointer type to include the new bool argument.
15454 2021-06-15 David Malcolm <dmalcolm@redhat.com>
15456 * doc/analyzer.texi
15457 (Special Functions for Debugging the Analyzer): Add
15458 __analyzer_dump_capacity.
15460 2021-06-15 Jakub Jelinek <jakub@redhat.com>
15463 * expr.c (expand_expr_real_2) <case VEC_PACK_FIX_TRUNC_EXPR,
15464 case VEC_PACK_TRUNC_EXPR>: Clear subtarget when changing mode.
15466 2021-06-15 Richard Biener <rguenther@suse.de>
15468 * cfgloopanal.c (mark_irreducible_loops): Use a dominance
15469 check to identify loop latches.
15470 * cfgloop.c (verify_loop_structure): Likewise.
15471 * loop-init.c (apply_loop_flags): Allow marked irreducible
15472 regions even with multiple latches.
15473 * predict.c (rebuild_frequencies): Simplify.
15475 2021-06-15 Richard Biener <rguenther@suse.de>
15477 * tree-ssa-threadupdate.c
15478 (jump_thread_path_registry::mark_threaded_blocks): Assert we
15479 have marked irreducible regions.
15481 2021-06-14 Martin Sebor <msebor@redhat.com>
15484 * builtins.c (gimple_call_return_array): Check for attribute fn spec.
15485 Handle calls to placement new.
15486 (ndecl_dealloc_argno): Avoid placement delete.
15488 2021-06-14 Peter Bergner <bergner@linux.ibm.com>
15491 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_mma_builtin): Use
15492 create_tmp_reg_or_ssa_name().
15494 2021-06-14 Andrew MacLeod <amacleod@redhat.com>
15496 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust.
15497 (ranger_cache::enable_new_values): Set to specified value and
15498 return the old value.
15499 (ranger_cache::disable_new_values): Delete.
15500 (ranger_cache::fill_block_cache): Disable non 1st order derived
15502 * gimple-range-cache.h (ranger_cache): Adjust prototypes.
15503 * gimple-range.cc (gimple_ranger::range_of_expr): Adjust.
15505 2021-06-14 Uroš Bizjak <ubizjak@gmail.com>
15508 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const):
15509 Return true early when testing with V2HImode.
15510 * config/i386/mmx.md (*punpckwd): Split to sse2_pshuflw_1.
15512 2021-06-14 Christophe Lyon <christophe.lyon@linaro.org>
15514 * config/arm/mve.md (mve_vec_unpack<US>_lo_<mode>): New pattern.
15515 (mve_vec_unpack<US>_hi_<mode>): New pattern.
15516 (@mve_vec_pack_trunc_lo_<mode>): New pattern.
15517 (mve_vmovntq_<supf><mode>): Prefix with '@'.
15518 * config/arm/neon.md (vec_unpack<US>_hi_<mode>): Move to
15520 (vec_unpack<US>_lo_<mode>): Likewise.
15521 (vec_pack_trunc_<mode>): Rename to
15522 neon_quad_vec_pack_trunc_<mode>.
15523 * config/arm/vec-common.md (vec_unpack<US>_hi_<mode>): New
15525 (vec_unpack<US>_lo_<mode>): New.
15526 (vec_pack_trunc_<mode>): New.
15528 2021-06-14 Richard Biener <rguenther@suse.de>
15530 PR tree-optimization/100934
15531 * tree-ssa-dom.c (pass_dominator::execute): Properly
15532 mark irreducible regions.
15534 2021-06-14 Martin Liska <mliska@suse.cz>
15536 * doc/invoke.texi: Put r{...} on the same line as @item.
15538 2021-06-14 Martin Liska <mliska@suse.cz>
15540 * doc/invoke.texi: Add missing newline.
15542 2021-06-14 Martin Liska <mliska@suse.cz>
15544 * doc/invoke.texi: Remove '+' charasters.
15546 2021-06-14 Claudiu Zissulescu <claziss@synopsys.com>
15548 * config.gcc (arc): Add support for with_cpu option.
15549 * config/arc/arc.h (OPTION_DEFAULT_SPECS): Add fpu.
15551 2021-06-14 Richard Biener <rguenther@suse.de>
15553 PR tree-optimization/101031
15554 * tree-ssa-strlen.c (maybe_invalidate): Increment max_size
15555 instead of size when accounting for a possibly string
15558 2021-06-14 Martin Liska <mliska@suse.cz>
15560 * gimple-ssa-evrp.c (pointer_equiv_analyzer::~pointer_equiv_analyzer): Use delete[].
15562 2021-06-14 Aldy Hernandez <aldyh@redhat.com>
15564 * value-query.cc (gimple_range_global): Call get_range_global
15565 if called after inlining.
15567 2021-06-13 Uroš Bizjak <ubizjak@gmail.com>
15570 * config/i386/i386-expand.c (expand_vec_perm_pshufb):
15571 Emit constant permutation insn directly from here.
15573 2021-06-13 Trevor Saunders <tbsaunde@tbsaunde.org>
15575 * attribs.c (find_attribute_namespace): Iterate over vec<> with
15577 * auto-profile.c (afdo_find_equiv_class): Likewise.
15578 * gcc.c (do_specs_vec): Likewise.
15579 (do_spec_1): Likewise.
15580 (driver::set_up_specs): Likewise.
15581 * gimple-loop-jam.c (any_access_function_variant_p): Likewise.
15582 * gimple-ssa-store-merging.c (compatible_load_p): Likewise.
15583 (imm_store_chain_info::try_coalesce_bswap): Likewise.
15584 (imm_store_chain_info::coalesce_immediate_stores): Likewise.
15585 (get_location_for_stmts): Likewise.
15586 * graphite-poly.c (print_iteration_domains): Likewise.
15587 (free_poly_bb): Likewise.
15588 (remove_gbbs_in_scop): Likewise.
15589 (free_scop): Likewise.
15590 (dump_gbb_cases): Likewise.
15591 (dump_gbb_conditions): Likewise.
15592 (print_pdrs): Likewise.
15593 (print_scop): Likewise.
15594 * ifcvt.c (cond_move_process_if_block): Likewise.
15595 * lower-subreg.c (decompose_multiword_subregs): Likewise.
15596 * regcprop.c (pass_cprop_hardreg::execute): Likewise.
15597 * sanopt.c (sanitize_rewrite_addressable_params): Likewise.
15598 * sel-sched-dump.c (dump_insn_vector): Likewise.
15599 * store-motion.c (store_ops_ok): Likewise.
15600 (store_killed_in_insn): Likewise.
15601 * timevar.c (timer::named_items::print): Likewise.
15602 * tree-cfgcleanup.c (cleanup_control_flow_pre): Likewise.
15603 (cleanup_tree_cfg_noloop): Likewise.
15604 * tree-data-ref.c (dump_data_references): Likewise.
15605 (print_dir_vectors): Likewise.
15606 (print_dist_vectors): Likewise.
15607 (dump_data_dependence_relations): Likewise.
15608 (dump_dist_dir_vectors): Likewise.
15609 (dump_ddrs): Likewise.
15610 (create_runtime_alias_checks): Likewise.
15611 (free_subscripts): Likewise.
15612 (save_dist_v): Likewise.
15613 (save_dir_v): Likewise.
15614 (invariant_access_functions): Likewise.
15615 (same_access_functions): Likewise.
15616 (access_functions_are_affine_or_constant_p): Likewise.
15617 (find_data_references_in_stmt): Likewise.
15618 (graphite_find_data_references_in_stmt): Likewise.
15619 (free_dependence_relations): Likewise.
15620 (free_data_refs): Likewise.
15621 * tree-inline.c (copy_debug_stmts): Likewise.
15622 * tree-into-ssa.c (dump_currdefs): Likewise.
15623 (rewrite_update_phi_arguments): Likewise.
15624 * tree-ssa-propagate.c (clean_up_loop_closed_phi): Likewise.
15625 * tree-vect-data-refs.c (vect_analyze_possibly_independent_ddr):
15627 (vect_slp_analyze_node_dependences): Likewise.
15628 (vect_slp_analyze_instance_dependence): Likewise.
15629 (vect_record_base_alignments): Likewise.
15630 (vect_get_peeling_costs_all_drs): Likewise.
15631 (vect_peeling_supportable): Likewise.
15632 * tree-vectorizer.c (vec_info::~vec_info): Likewise.
15633 (vec_info::free_stmt_vec_infos): Likewise.
15635 2021-06-13 Jeff Law <jeffreyalaw@gmail.com>
15637 * config/h8300/logical.md (<code>qi3_1<cczn>): New pattern.
15638 (andqi3_1<cczn>): Removed.
15639 (<ors>qi3_1): Do not split for IOR/XOR a single bit.
15640 (H8/SX bit logicals): Split out from other patterns.
15641 * config/h8300/multiply.md (mulqihi3_const<cczn>): Renamed from
15642 mulqihi3_const_clobber_flags.
15643 (mulqihi3<cczn>, mulhisi3_const<cczn>, mulhisi3<cczn>): Similarly
15645 2021-06-13 H.J. Lu <hjl.tools@gmail.com>
15648 * config/i386/i386.c (ix86_expand_prologue): Set red_zone_used
15649 to true if red zone is used.
15650 (ix86_output_indirect_jmp): Replace ix86_red_zone_size with
15651 ix86_red_zone_used.
15652 * config/i386/i386.h (machine_function): Add red_zone_used.
15653 (ix86_red_zone_size): Removed.
15654 (ix86_red_zone_used): New.
15655 * config/i386/i386.md (peephole2 patterns): Replace
15656 ix86_red_zone_size with ix86_red_zone_used.
15658 2021-06-12 Jason Merrill <jason@redhat.com>
15660 * doc/extend.texi (unused variable attribute): Applies to
15661 structure fields as well.
15663 2021-06-12 Eugene Rozenfeld <erozen@microsoft.com>
15665 * auto-profile.c (read_profile): fix a typo in an error string
15667 2021-06-11 Thomas Schwinge <thomas@codesourcery.com>
15669 * tree-pretty-print.h (dump_omp_clauses): Add 'bool = true'
15671 * tree-pretty-print.c (dump_omp_clauses): Update.
15672 (dump_generic_node) <OMP_CLAUSE>: Use it.
15674 2021-06-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
15677 * config/arm/arm_mve.h (__arm_vld1q): Change __ARM_mve_coerce(p0,
15678 int8_t const *) to __ARM_mve_coerce1(p0, int8_t *) in the argument for
15679 the polymorphic variants matching code.
15680 (__arm_vld1q_z): Likewise.
15681 (__arm_vld2q): Likewise.
15682 (__arm_vld4q): Likewise.
15683 (__arm_vldrbq_gather_offset): Likewise.
15684 (__arm_vldrbq_gather_offset_z): Likewise.
15686 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
15688 PR tree-optimization/96392
15689 * fold-const.h (tree_expr_maybe_real_minus_zero_p): Fix prototype.
15691 2021-06-11 Roger Sayle <roger@nextmovesoftware.com>
15693 PR tree-optimization/96392
15694 * fold-const.c (fold_real_zero_addition_p): Take both arguments
15695 of the addition or subtraction, not just the zero. Use this
15696 other argument in tests for signaling NaNs and signed zeros.
15697 (tree_expr_maybe_real_minus_zero_p): New predicate.
15698 * fold-const.h (fold_real_zero_addition_p): Update prototype.
15699 (tree_expr_maybe_real_minus_zero_p): New function prototype.
15700 * match.pd: Update calls to fold_real_zero_addition_p.
15701 Replace HONOR_NANS with tree_expr_maybe_nan_p.
15702 Replace HONOR_SIGNED_ZEROS with tree_expr_maybe_real_minus_zero_p.
15703 Replace HONOR_SNANS with tree_expr_maybe_signaling_nan_p.
15704 * tree-ssa-reassoc.c (eliminate_using_constants): Update
15705 call to fold_real_zero_addition_p.
15707 2021-06-11 Richard Biener <rguenther@suse.de>
15709 PR tree-optimization/101025
15710 * tree-ssa-loop-im.c (sm_seq_valid_bb): Make sure to process
15711 all refs that require dependence checking.
15713 2021-06-11 Richard Biener <rguenther@suse.de>
15715 PR tree-optimization/101028
15716 * tree-vect-slp.c (vect_build_slp_tree_2): When SLP
15717 reassoc discovery fails fatally, mark appropriate lanes
15720 2021-06-11 Richard Biener <rguenther@suse.de>
15722 PR tree-optimization/101026
15723 * tree-vect-slp.c (vect_build_slp_tree_2): Make sure we
15724 have a representative for the associated chain nodes.
15726 2021-06-11 Jakub Jelinek <jakub@redhat.com>
15728 PR rtl-optimization/101008
15729 * simplify-rtx.c (relational_result): New function.
15730 (simplify_logical_relational_operation,
15731 simplify_relational_operation): Use it.
15733 2021-06-11 Jakub Jelinek <jakub@redhat.com>
15736 * config/i386/sse.md (*vec_concat<mode>_0_1): Require TARGET_SSE2.
15738 2021-06-11 Uroš Bizjak <ubizjak@gmail.com>
15741 * config/i386/i386-expand.c (expand_vec_perm_pshufb): Return
15742 false if the permutation can be implemented with constant
15743 permutation instruction in wider mode.
15744 (canonicalize_vector_int_perm): Move above expand_vec_perm_pshufb.
15745 Handle V8QImode and V4HImode.
15747 2021-06-11 Martin Liska <mliska@suse.cz>
15749 PR gcov-profile/100788
15750 * common.opt: Add new option.
15751 * coverage.c (coverage_begin_function): Emit warning instead on
15752 the internal compiler error.
15753 * doc/invoke.texi: Document the option.
15754 * toplev.c (process_options): Enable it by default.
15756 2021-06-11 Richard Biener <rguenther@suse.de>
15758 PR middle-end/101009
15759 * tree-data-ref.c (build_classic_dist_vector_1): Make sure
15760 to set *init_b to true when we encounter a constant equal
15762 (compute_affine_dependence): Also dump the actual DR_REF.
15764 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
15766 PR tree-optimization/100984
15767 * gimple-ssa-evrp.c (ssa_equiv_stack): Use auto_vec for
15768 replacements table.
15769 (ssa_equiv_stack::~ssa_equiv_stack): Remove.
15771 2021-06-11 Kewen Lin <linkw@linux.ibm.com>
15773 * config/rs6000/rs6000.md
15774 (floatsi<SFDF:mode>2_lfiwax_<QHI:mode>_mem_zext): New
15775 define_insn_and_split.
15777 2021-06-11 Richard Biener <rguenther@suse.de>
15779 * tree-vect-slp.c (vect_build_slp_tree_2): Use stablesort
15780 to sort operands of the associative chain.
15782 2021-06-11 Richard Biener <rguenther@suse.de>
15784 * system.h (gcc_stablesort_r): Declare.
15785 * sort.cc (gcc_sort_r): Support stable sort.
15786 (gcc_stablesort_r): Define.
15787 * vec.h (vec<>::stablesort): Add.
15789 2021-06-10 Uroš Bizjak <ubizjak@gmail.com>
15792 * config/i386/i386-expand.c (ix86_split_mmx_punpck):
15793 Handle V2SF mode. Emit SHUFPS to fixup unpack-high for V2SF mode.
15794 (expand_vec_perm_blend): Handle 64bit modes for TARGET_SSE4_1.
15795 (expand_vec_perm_pshufb): Handle 64bit modes for TARGET_SSSE3.
15796 (expand_vec_perm_pblendv): Handle 64bit modes for TARGET_SSE4_1.
15797 (expand_vec_perm_interleave2): Handle 64bit modes.
15798 (expand_vec_perm_even_odd_pack): Handle V8QI mode.
15799 (expand_vec_perm_even_odd_1): Ditto.
15800 (ix86_vectorize_vec_perm_const): Ditto.
15801 * config/i386/i386.md (UNSPEC_PSHUFB): Move from ...
15802 * config/i386/sse.md: ... here.
15803 * config/i386/mmx.md (*vec_interleave_lowv2sf):
15804 New insn_and_split pattern.
15805 (*vec_interleave_highv2sf): Ditto.
15806 (mmx_pshufbv8qi3): New insn pattern.
15807 (*mmx_pblendw): Ditto.
15809 2021-06-10 Peter Bergner <bergner@linux.ibm.com>
15811 * config/rs6000/rs6000-builtin.def (build_pair): New built-in.
15812 (build_acc): Likewise.
15813 * config/rs6000/rs6000-call.c (mma_expand_builtin): Swap assemble
15814 source operands in little-endian mode.
15815 (rs6000_gimple_fold_mma_builtin): Handle VSX_BUILTIN_BUILD_PAIR.
15816 (mma_init_builtins): Likewise.
15817 * config/rs6000/rs6000.c (rs6000_split_multireg_move): Handle endianness
15818 ordering for the MMA assemble and build source operands.
15819 * doc/extend.texi (__builtin_vsx_build_acc, __builtin_mma_build_pair):
15821 (__builtin_mma_assemble_acc, __builtin_mma_assemble_pair): Remove
15824 2021-06-10 Jeff Law <jeffreyalaw@gmail.com>
15826 * config/h8300/h8300.c (select_cc_mode): Handle MEM. Use
15828 * config/h8300/extensions.md: Replace _clobber_flags patterns
15831 2021-06-10 Robin Dapp <rdapp@linux.ibm.com>
15833 * config/s390/vector.md (vcond_mask_<mode><mode>): Change to
15834 (vcond_mask_<mode><tointvec>): this.
15836 2021-06-10 Andrew Stubbs <ams@codesourcery.com>
15837 Thomas Schwinge <thomas@codesourcery.com>
15839 * omp-builtins.def (BUILT_IN_GOACC_ENTER_EXIT_DATA): Split into...
15840 (BUILT_IN_GOACC_ENTER_DATA, BUILT_IN_GOACC_EXIT_DATA): ... these.
15841 * gimple.h (enum gf_mask): Split
15842 'GF_OMP_TARGET_KIND_OACC_ENTER_EXIT_DATA' into
15843 'GF_OMP_TARGET_KIND_OACC_ENTER_DATA' and
15844 'GF_OMP_TARGET_KIND_OACC_EXIT_DATA'.
15845 (is_gimple_omp_oacc): Update.
15846 * gimple-pretty-print.c (dump_gimple_omp_target): Likewise.
15847 * gimplify.c (gimplify_omp_target_update): Likewise.
15848 * omp-expand.c (expand_omp_target, build_omp_regions_1)
15849 (omp_make_gimple_edges): Likewise.
15850 * omp-low.c (check_omp_nesting_restrictions, lower_omp_target):
15853 2021-06-10 Aldy Hernandez <aldyh@redhat.com>
15855 * value-query.cc (value_query::value_on_edge): Rename name to
15857 (range_query::range_on_edge): Same.
15858 (range_query::value_of_expr): Same.
15859 (range_query::value_on_edge): Same.
15860 * value-query.h (class value_query): Same.
15861 (class range_query): Same.
15863 2021-06-10 Richard Biener <rguenther@suse.de>
15865 PR tree-optimization/101003
15866 * tree-vect-slp.c (vect_build_slp_tree_2): Appropriately
15867 use the pattern stmt defs when linearizing a chain.
15869 2021-06-10 Jakub Jelinek <jakub@redhat.com>
15872 * ifcvt.c (noce_get_alt_condition, noce_try_abs): Use
15873 prev_nonnote_nondebug_insn instead of prev_nonnote_insn.
15875 2021-06-10 Clement Chigot <clement.chigot@atos.net>
15877 * config/rs6000/aix71.h (ASM_CPU_SPEC): Add Power10 directive.
15878 * config/rs6000/aix72.h (ASM_CPU_SPEC): Likewise.
15880 2021-06-09 Andrew Pinski <apinski@marvell.com>
15882 PR tree-optimization/100925
15883 * match.pd (a ? CST1 : CST2): Limit transformations
15884 that would produce a negative to integeral types only.
15885 Change !POINTER_TYPE_P to INTEGRAL_TYPE_P also.
15887 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
15890 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
15892 * doc/tm.texi: Correctly update.
15894 2021-06-09 Jeff Law <jeffreyalaw@gmail.com>
15896 * doc/tm.texi: Correctly update.
15898 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
15901 * doc/tm.texi.in (Trampolines): Add a missing blank line.
15903 2021-06-09 Paul Eggert <eggert@cs.ucla.edu>
15906 * doc/invoke.texi (Code Gen Options); Document that -fno-trampolines
15907 and -ftrampolines work only with Ada.
15908 * doc/tm.texi.in (Trampolines): Likewise.
15909 * doc/tm.texi: Regenerated.
15911 2021-06-09 Carl Love <cel@us.ibm.com>
15913 * config/rs6000/altivec.h (vec_signextll, vec_signexti, vec_signextq):
15914 Add define for new builtins.
15915 * config/rs6000/altivec.md(altivec_vreveti2): Add define_expand.
15916 * config/rs6000/rs6000-builtin.def (VSIGNEXTI, VSIGNEXTLL): Add
15917 overloaded builtin definitions.
15918 (VSIGNEXTSB2W, VSIGNEXTSH2W, VSIGNEXTSB2D, VSIGNEXTSH2D,VSIGNEXTSW2D,
15919 VSIGNEXTSD2Q): Add builtin expansions.
15920 (SIGNEXT): Add P10 overload definition.
15921 * config/rs6000/rs6000-call.c (P9V_BUILTIN_VEC_VSIGNEXTI, P9V_BUILTIN_VEC_VSIGNEXTLL,
15922 P10_BUILTIN_VEC_SIGNEXT): Add overloaded argument definitions.
15923 * config/rs6000/vsx.md (vsx_sign_extend_v2di_v1ti): Add define_insn.
15924 (vsignextend_v2di_v1ti, vsignextend_qi_<mode>, vsignextend_hi_<mode>,
15925 vsignextend_si_v2di)[VIlong]: Add define_expand.
15926 Make define_insn vsx_sign_extend_si_v2di visible.
15927 * doc/extend.texi: Add documentation for the vec_signexti,
15928 vec_signextll builtins and vec_signextq.
15930 2021-06-09 Carl Love <cel@us.ibm.com>
15932 * config/rs6000/rs6000.c (__fixkfti, __fixunskfti, __floattikf,
15933 __floatuntikf): Names changed to __fixkfti_sw, __fixunskfti_sw,
15934 __floattikf_sw, __floatuntikf_sw respectively.
15935 * config/rs6000/rs6000.md (floatti<mode>2, floatunsti<mode>2,
15936 fix_trunc<mode>ti2, fixuns_trunc<mode>ti2): Add
15937 define_insn for mode IEEE 128.
15939 2021-06-09 Carl Love <cel@us.ibm.com>
15941 * config/rs6000/altivec.md (altivec_vslq, altivec_vsrq):
15942 Rename to altivec_vslq_<mode>, altivec_vsrq_<mode>, mode VEC_TI.
15943 * config/rs6000/vector.md (VEC_TI): Was named VSX_TI in vsx.md.
15944 (vashlv1ti3): Change to vashl<mode>3, mode VEC_TI.
15945 (vlshrv1ti3): Change to vlshr<mode>3, mode VEC_TI.
15946 * config/rs6000/vsx.md (VSX_TI): Remove define_mode_iterator. Update
15947 uses of VSX_TI to VEC_TI.
15949 2021-06-09 Carl Love <cel@us.ibm.com>
15951 * config/rs6000/dfp.md (floattitd2, fixtdti2): New define_insns.
15953 2021-06-09 Carl Love <cel@us.ibm.com>
15955 * config/rs6000/altivec.h (vec_dive, vec_mod): Add define for new
15957 * config/rs6000/altivec.md (UNSPEC_VMULEUD, UNSPEC_VMULESD,
15958 UNSPEC_VMULOUD, UNSPEC_VMULOSD): New unspecs.
15959 (altivec_eqv1ti, altivec_gtv1ti, altivec_gtuv1ti, altivec_vmuleud,
15960 altivec_vmuloud, altivec_vmulesd, altivec_vmulosd, altivec_vrlq,
15961 altivec_vrlqmi, altivec_vrlqmi_inst, altivec_vrlqnm,
15962 altivec_vrlqnm_inst, altivec_vslq, altivec_vsrq, altivec_vsraq,
15963 altivec_vcmpequt_p, altivec_vcmpgtst_p, altivec_vcmpgtut_p): New
15965 (vec_widen_umult_even_v2di, vec_widen_smult_even_v2di,
15966 vec_widen_umult_odd_v2di, vec_widen_smult_odd_v2di, altivec_vrlqmi,
15967 altivec_vrlqnm): New define_expands.
15968 * config/rs6000/rs6000-builtin.def (VCMPEQUT_P, VCMPGTST_P,
15969 VCMPGTUT_P): Add macro expansions.
15970 (BU_P10V_AV_P): Add builtin predicate definition.
15971 (VCMPGTUT, VCMPGTST, VCMPEQUT, CMPNET, CMPGE_1TI,
15972 CMPGE_U1TI, CMPLE_1TI, CMPLE_U1TI, VNOR_V1TI_UNS, VNOR_V1TI, VCMPNET_P,
15973 VCMPAET_P, VMULEUD, VMULESD, VMULOUD, VMULOSD, VRLQ,
15974 VSLQ, VSRQ, VSRAQ, VRLQNM, DIV_V1TI, UDIV_V1TI, DIVES_V1TI, DIVEU_V1TI,
15975 MODS_V1TI, MODU_V1TI, VRLQMI): New macro expansions.
15976 (VRLQ, VSLQ, VSRQ, VSRAQ, DIVE, MOD): New overload expansions.
15977 * config/rs6000/rs6000-call.c (P10_BUILTIN_VCMPEQUT,
15978 P10V_BUILTIN_CMPGE_1TI, P10V_BUILTIN_CMPGE_U1TI,
15979 P10V_BUILTIN_VCMPGTUT, P10V_BUILTIN_VCMPGTST,
15980 P10V_BUILTIN_CMPLE_1TI, P10V_BUILTIN_VCMPLE_U1TI,
15981 P10V_BUILTIN_DIV_V1TI, P10V_BUILTIN_UDIV_V1TI,
15982 P10V_BUILTIN_VMULESD, P10V_BUILTIN_VMULEUD,
15983 P10V_BUILTIN_VMULOSD, P10V_BUILTIN_VMULOUD,
15984 P10V_BUILTIN_VNOR_V1TI, P10V_BUILTIN_VNOR_V1TI_UNS,
15985 P10V_BUILTIN_VRLQ, P10V_BUILTIN_VRLQMI,
15986 P10V_BUILTIN_VRLQNM, P10V_BUILTIN_VSLQ,
15987 P10V_BUILTIN_VSRQ, P10V_BUILTIN_VSRAQ,
15988 P10V_BUILTIN_VCMPGTUT_P, P10V_BUILTIN_VCMPGTST_P,
15989 P10V_BUILTIN_VCMPEQUT_P, P10V_BUILTIN_VCMPGTUT_P,
15990 P10V_BUILTIN_VCMPGTST_P, P10V_BUILTIN_CMPNET,
15991 P10V_BUILTIN_VCMPNET_P, P10V_BUILTIN_VCMPAET_P,
15992 P10V_BUILTIN_DIVES_V1TI, P10V_BUILTIN_MODS_V1TI,
15993 P10V_BUILTIN_MODU_V1TI):
15994 New overloaded definitions.
15995 (rs6000_gimple_fold_builtin) [P10V_BUILTIN_VCMPEQUT,
15996 P10V_BUILTIN_CMPNET, P10V_BUILTIN_CMPGE_1TI,
15997 P10V_BUILTIN_CMPGE_U1TI, P10V_BUILTIN_VCMPGTUT,
15998 P10V_BUILTIN_VCMPGTST, P10V_BUILTIN_CMPLE_1TI,
15999 P10V_BUILTIN_CMPLE_U1TI]: New case statements.
16000 (rs6000_init_builtins) [bool_V1TI_type_node, int_ftype_int_v1ti_v1ti]:
16002 (altivec_init_builtins): New E_V1TImode case statement.
16003 (builtin_function_type)[P10_BUILTIN_128BIT_VMULEUD,
16004 P10_BUILTIN_128BIT_VMULOUD, P10_BUILTIN_128BIT_DIVEU_V1TI,
16005 P10_BUILTIN_128BIT_MODU_V1TI, P10_BUILTIN_CMPGE_U1TI,
16006 P10_BUILTIN_VCMPGTUT, P10_BUILTIN_VCMPEQUT]: New case statements.
16007 * config/rs6000/rs6000.c (rs6000_handle_altivec_attribute) [E_TImode,
16008 E_V1TImode]: New case statements.
16009 * config/rs6000/rs6000.h (rs6000_builtin_type_index): New enum
16010 value RS6000_BTI_bool_V1TI.
16011 * config/rs6000/vector.md (vector_gtv1ti,vector_nltv1ti,
16012 vector_gtuv1ti, vector_nltuv1ti, vector_ngtv1ti, vector_ngtuv1ti,
16013 vector_eq_v1ti_p, vector_ne_v1ti_p, vector_ae_v1ti_p,
16014 vector_gt_v1ti_p, vector_gtu_v1ti_p, vrotlv1ti3, vashlv1ti3,
16015 vlshrv1ti3, vashrv1ti3): New define_expands.
16016 * config/rs6000/vsx.md (UNSPEC_VSX_DIVSQ, UNSPEC_VSX_DIVUQ,
16017 UNSPEC_VSX_DIVESQ, UNSPEC_VSX_DIVEUQ, UNSPEC_VSX_MODSQ,
16018 UNSPEC_VSX_MODUQ): New unspecs.
16019 (mulv2di3, vsx_div_v1ti, vsx_udiv_v1ti, vsx_dives_v1ti,
16020 vsx_diveu_v1ti, vsx_mods_v1ti, vsx_modu_v1ti, xxswapd_v1ti): New
16022 (vcmpnet): New define_expand.
16023 * doc/extend.texi: Add documentation for the new builtins vec_rl,
16024 vec_rlmi, vec_rlnm, vec_sl, vec_sr, vec_sra, vec_mule, vec_mulo,
16025 vec_div, vec_dive, vec_mod, vec_cmpeq, vec_cmpne, vec_cmpgt, vec_cmplt,
16026 vec_cmpge, vec_cmple, vec_all_eq, vec_all_ne, vec_all_gt, vec_all_lt,
16027 vec_all_ge, vec_all_le, vec_any_eq, vec_any_ne, vec_any_gt, vec_any_lt,
16028 vec_any_ge, vec_any_le.
16030 2021-06-09 Carl Love <cel@us.ibm.com>
16032 * config/rs6000/altivec.md (altivec_vrl<VI_char>mi): Fix
16033 bug in argument generation.
16035 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
16037 * config/arm/iterators.md (<supf>): Remove VCLZQ_U, VCLZQ_S.
16039 * config/arm/mve.md (mve_vclzq_<supf><mode>): Add '@' prefix,
16040 remove <supf> iterator.
16041 (mve_vclzq_u<mode>): New.
16042 * config/arm/neon.md (clz<mode>2): Rename to neon_vclz<mode>.
16043 (neon_vclz<mode): Move to ...
16044 * config/arm/unspecs.md (VCLZQ_U, VCLZQ_S): Remove.
16045 * config/arm/vec-common.md: ... here. Add support for MVE.
16047 2021-06-09 Christophe Lyon <christophe.lyon@linaro.org>
16049 * config/arm/mve.md (mve_vhaddq_<supf><mode>): Prefix with '@'.
16050 (@mve_vrhaddq_<supf><mode): Likewise.
16051 * config/arm/neon.md (neon_v<r>hadd<sup><mode>): Likewise.
16052 * config/arm/vec-common.md (avg<mode>3_floor, uavg<mode>3_floor)
16053 (avg<mode>3_ceil", uavg<mode>3_ceil): New patterns.
16055 2021-06-09 imba-tjd <109224573@qq.com>
16057 * doc/invoke.texi: Fix typo.
16059 2021-06-09 Roger Sayle <roger@nextmovesoftware.com>
16061 PR middle-end/53267
16062 * fold-const-call.c (fold_const_call_sss) [CASE_CFN_FMOD]:
16063 Support evaluation of fmod/fmodf/fmodl at compile-time.
16065 2021-06-09 Richard Biener <rguenther@suse.de>
16067 PR tree-optimization/100981
16068 * tree-vect-loop.c (vect_create_epilog_for_reduction): Use
16069 gimple_get_lhs to also handle calls.
16070 * tree-vect-slp-patterns.c (complex_pattern::build): Transfer
16073 2021-06-09 Richard Biener <rguenther@suse.de>
16075 PR tree-optimization/97832
16076 * tree-vectorizer.h (_slp_tree::failed): New.
16077 * tree-vect-slp.c (_slp_tree::_slp_tree): Initialize
16079 (_slp_tree::~_slp_tree): Free failed.
16080 (vect_build_slp_tree): Retain failed nodes and record
16081 matches in them, copying that back out when running
16082 into a cached fail. Dump start and end of discovery.
16083 (dt_sort_cmp): New.
16084 (vect_build_slp_tree_2): Handle associatable chains
16085 together doing more aggressive operand swapping.
16087 2021-06-09 H.J. Lu <hjl.tools@gmail.com>
16090 * config.gcc (gcc_cv_initfini_array): Set to yes for Linux and
16092 * doc/install.texi: Require glibc 2.1 and binutils 2.12 for
16093 Linux and GNU targets.
16095 2021-06-09 Richard Biener <rguenther@suse.de>
16097 * tree-vect-stmts.c (vect_is_simple_use): Always get dt
16100 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
16102 * config/arc/arc.md (loop_end): Change it to
16103 define_insn_and_split.
16105 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
16107 * config/arc/arc.md (maddhisi4): Use VMAC2H instruction.
16108 (machi): New pattern.
16109 (umaddhisi4): Use VMAC2HU instruction.
16110 (umachi): New pattern.
16112 2021-06-09 Claudiu Zissulescu <claziss@synopsys.com>
16114 * config/arc/arc-protos.h (arc_split_move_p): New prototype.
16115 * config/arc/arc.c (arc_split_move_p): New function.
16116 (arc_split_move): Clean up.
16117 * config/arc/arc.md (movdi_insn): Clean up, use arc_split_move_p.
16118 (movdf_insn): Likewise.
16119 * config/arc/simdext.md (mov<VWH>_insn): Likewise.
16121 2021-06-09 Uroš Bizjak <ubizjak@gmail.com>
16124 * config/i386/i386.c (print_operand_address_as): Rename "no_rip"
16125 argument to "raw". Do not emit segment overrides when "raw" is true.
16127 2021-06-09 Martin Liska <mliska@suse.cz>
16129 * doc/gcov.texi: Create a proper JSON files.
16130 * doc/invoke.texi: Remove dots in order to make it a valid
16133 2021-06-09 Xionghu Luo <luoxhu@linux.ibm.com>
16135 * config/rs6000/rs6000-p8swap.c (pattern_is_rotate64): New.
16136 (insn_is_load_p): Use pattern_is_rotate64.
16137 (insn_is_swap_p): Likewise.
16138 (quad_aligned_load_p): Likewise.
16139 (const_load_sequence_p): Likewise.
16140 (replace_swapped_aligned_load): Likewise.
16141 (recombine_lvx_pattern): Likewise.
16142 (recombine_stvx_pattern): Likewise.
16144 2021-06-09 Andrew MacLeod <amacleod@redhat.com>
16146 * gimple-range-gori.cc (gori_compute::outgoing_edge_range_p): Use a
16147 fur_stmt source record.
16148 * gimple-range.cc (fur_source::get_operand): Generic range query.
16149 (fur_source::get_phi_operand): New.
16150 (fur_source::register_dependency): New.
16151 (fur_source::query): New.
16152 (class fur_edge): New. Edge source for operands.
16153 (fur_edge::fur_edge): New.
16154 (fur_edge::get_operand): New.
16155 (fur_edge::get_phi_operand): New.
16156 (fur_edge::query): New.
16157 (fur_stmt::fur_stmt): New.
16158 (fur_stmt::get_operand): New.
16159 (fur_stmt::get_phi_operand): New.
16160 (fur_stmt::query): New.
16161 (class fur_depend): New. Statement source and process dependencies.
16162 (fur_depend::fur_depend): New.
16163 (fur_depend::register_dependency): New.
16164 (class fur_list): New. List source for operands.
16165 (fur_list::fur_list): New.
16166 (fur_list::get_operand): New.
16167 (fur_list::get_phi_operand): New.
16168 (fold_range): New. Instantiate appropriate fur_source class and fold.
16169 (fold_using_range::range_of_range_op): Use new API.
16170 (fold_using_range::range_of_address): Ditto.
16171 (fold_using_range::range_of_phi): Ditto.
16172 (imple_ranger::fold_range_internal): Use fur_depend class.
16173 (fold_using_range::range_of_ssa_name_with_loop_info): Use new API.
16174 * gimple-range.h (class fur_source): Now a base class.
16175 (class fur_stmt): New.
16176 (fold_range): New prototypes.
16177 (fur_source::fur_source): Delete.
16179 2021-06-08 Andrew Pinski <apinski@marvell.com>
16181 PR tree-optimization/25290
16182 * tree-ssa-phiopt.c (xor_replacement): Delete.
16183 (tree_ssa_phiopt_worker): Delete use of xor_replacement.
16184 (match_simplify_replacement): Allow one cheap preparation
16185 statement that can be moved to before the if.
16187 2021-06-08 Pat Haugen <pthaugen@linux.ibm.com>
16189 * config/rs6000/power10.md (power10-fused-load, power10-fused-store,
16190 power10-fused_alu, power10-fused-vec, power10-fused-branch): New.
16192 2021-06-08 Jeff Law <jeffreyalaw@gmail.com>
16194 * config/h8300/logical.md (andqi3_1): Move BCLR case into define_insn_and_split.
16195 Create length attribute on define_insn_and_split. Only split for cases which we
16197 (andqi3_1<cczn>): Renamed from andqi3_1_clobber_flags. Only handle AND here and
16198 fix length computation.
16199 (b<code><mode>msx): Combine QImode and HImode H8/SX patterns using iterator.
16201 2021-06-08 Richard Biener <rguenther@suse.de>
16203 PR tree-optimization/100923
16204 * tree-ssa-sccvn.c (valueize_refs_1): Take a pointer to
16205 the operand vector to be valueized.
16206 (valueize_refs): Likewise.
16207 (valueize_shared_reference_ops_from_ref): Adjust.
16208 (valueize_shared_reference_ops_from_call): Likewise.
16209 (vn_reference_lookup_3): Likewise.
16210 (vn_reference_lookup_pieces): Likewise. Re-valueize
16211 with honoring availability when we are about to create
16212 the ao_ref and valueized before.
16213 (vn_reference_lookup): Likewise.
16214 (vn_reference_insert_pieces): Adjust.
16216 2021-06-08 Richard Biener <rguenther@suse.de>
16218 * tree-vectorizer.h (_slp_instance::root_stmt): Change to...
16219 (_slp_instance::root_stmts): ... a vector.
16220 (SLP_INSTANCE_ROOT_STMT): Rename to ...
16221 (SLP_INSTANCE_ROOT_STMTS): ... this.
16222 (slp_root::root): Change to...
16223 (slp_root::roots): ... a vector.
16224 (slp_root::slp_root): Adjust.
16225 * tree-vect-slp.c (_slp_instance::location): Adjust.
16226 (vect_free_slp_instance): Release the root stmt vector.
16227 (vect_build_slp_instance): Adjust.
16228 (vect_analyze_slp): Likewise.
16229 (_bb_vec_info::~_bb_vec_info): Likewise.
16230 (vect_slp_analyze_operations): Likewise.
16231 (vect_bb_vectorization_profitable_p): Likewise. Adjust
16232 costs for the root stmt.
16233 (vect_slp_check_for_constructors): Gather all BIT_INSERT_EXPRs
16235 (vect_slp_analyze_bb_1): Simplify by marking all root stmts
16237 (vectorize_slp_instance_root_stmt): Adjust.
16238 (vect_schedule_slp): Likewise.
16240 2021-06-08 Aldy Hernandez <aldyh@redhat.com>
16242 * gimple-ssa-evrp.c (class ssa_equiv_stack): New.
16243 (ssa_equiv_stack::ssa_equiv_stack): New.
16244 (ssa_equiv_stack::~ssa_equiv_stack): New.
16245 (ssa_equiv_stack::enter): New.
16246 (ssa_equiv_stack::leave): New.
16247 (ssa_equiv_stack::push_replacement): New.
16248 (ssa_equiv_stack::get_replacement): New.
16249 (is_pointer_ssa): New.
16250 (class pointer_equiv_analyzer): New.
16251 (pointer_equiv_analyzer::pointer_equiv_analyzer): New.
16252 (pointer_equiv_analyzer::~pointer_equiv_analyzer): New.
16253 (pointer_equiv_analyzer::set_global_equiv): New.
16254 (pointer_equiv_analyzer::set_cond_equiv): New.
16255 (pointer_equiv_analyzer::get_equiv): New.
16256 (pointer_equiv_analyzer::enter): New.
16257 (pointer_equiv_analyzer::leave): New.
16258 (pointer_equiv_analyzer::get_equiv_expr): New.
16259 (pta_valueize): New.
16260 (pointer_equiv_analyzer::visit_stmt): New.
16261 (pointer_equiv_analyzer::visit_edge): New.
16262 (hybrid_folder::value_of_expr): Call PTA.
16263 (hybrid_folder::value_on_edge): Same.
16264 (hybrid_folder::pre_fold_bb): New.
16265 (hybrid_folder::post_fold_bb): New.
16266 (hybrid_folder::pre_fold_stmt): New.
16267 (rvrp_folder::pre_fold_bb): New.
16268 (rvrp_folder::post_fold_bb): New.
16269 (rvrp_folder::pre_fold_stmt): New.
16270 (rvrp_folder::value_of_expr): Call PTA.
16271 (rvrp_folder::value_on_edge): Same.
16273 2021-06-08 Jakub Jelinek <jakub@redhat.com>
16276 * tree-inline.c (copy_tree_body_r): For OMP_CLAUSE_DEPEND don't
16277 check TREE_CODE if OMP_CLAUSE_DECL is NULL.
16279 2021-06-08 Richard Biener <rguenther@suse.de>
16281 PR middle-end/100951
16282 * tree-vect-generic.c (expand_vector_piecewise): Build a
16283 VECTOR_CST if all elements are constant.
16284 (expand_vector_condition): Likewise.
16285 (lower_vec_perm): Likewise.
16286 (expand_vector_conversion): Likewise.
16288 2021-06-08 Martin Liska <mliska@suse.cz>
16290 * doc/invoke.texi: Document new param evrp-sparse-threshold.
16292 2021-06-08 Martin Liska <mliska@suse.cz>
16294 * genautomata.c (create_automata): Fix typo.
16296 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
16298 PR tree-optimization/100794
16299 * tree-predcom.c (tree_predictive_commoning_loop): Add parameter
16300 allow_unroll_p and only allow unrolling when it's true.
16301 (tree_predictive_commoning): Add parameter allow_unroll_p and
16303 (run_tree_predictive_commoning): Likewise.
16304 (pass_predcom::gate): Check flag_tree_loop_vectorize and
16305 global_options_set.x_flag_predictive_commoning.
16306 (pass_predcom::execute): Adjust for allow_unroll_p.
16308 2021-06-08 Kewen Lin <linkw@linux.ibm.com>
16310 * tree-predcom.c (execute_pred_commoning): Remove update_ssa call.
16311 (tree_predictive_commoning_loop): Factor some cleanup stuffs into
16312 lambda function cleanup, remove scev_reset call, and adjust return
16314 (tree_predictive_commoning): Adjust for different changed values,
16315 only set flag TODO_update_ssa_only_virtuals if changed.
16316 (pass_data pass_data_predcom): Remove TODO_update_ssa_only_virtuals
16317 from todo_flags_finish.
16319 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
16321 * gimple-range-cache.cc (class sbr_sparse_bitmap): New.
16322 (sbr_sparse_bitmap::sbr_sparse_bitmap): New.
16323 (sbr_sparse_bitmap::bitmap_set_quad): New.
16324 (sbr_sparse_bitmap::bitmap_get_quad): New.
16325 (sbr_sparse_bitmap::set_bb_range): New.
16326 (sbr_sparse_bitmap::get_bb_range): New.
16327 (sbr_sparse_bitmap::bb_range_p): New.
16328 (block_range_cache::block_range_cache): initialize bitmap obstack.
16329 (block_range_cache::~block_range_cache): Destruct obstack.
16330 (block_range_cache::set_bb_range): Decide when to utilze the
16331 sparse on entry cache.
16332 * gimple-range-cache.h (block_range_cache): Add bitmap obstack.
16333 * params.opt (-param=evrp-sparse-threshold): New.
16335 2021-06-07 Andrew MacLeod <amacleod@redhat.com>
16337 * bitmap.c (bitmap_set_aligned_chunk): New.
16338 (bitmap_get_aligned_chunk): New.
16339 (test_aligned_chunk): New.
16340 (bitmap_c_tests): Call test_aligned_chunk.
16341 * bitmap.h (bitmap_set_aligned_chunk, bitmap_get_aligned_chunk): New.
16343 2021-06-07 Uroš Bizjak <ubizjak@gmail.com>
16346 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
16348 (ix86_expand_vector_init_one_nonzero): Ditto.
16349 (ix86_expand_vector_init_one_var): Ditto.
16350 (ix86_expand_vector_init_general): Ditto.
16351 * config/i386/mmx.md (vec_initv4qiqi): New expander.
16353 2021-06-07 Jeff Law <jeffreyalaw@gmail.com>
16355 * config/h8300/movepush.md: Change most _clobber_flags
16356 patterns to instead use <cczn> subst.
16357 (movsi_cczn): New pattern with usable CC cases split out.
16358 (movsi_h8sx_cczn): Likewise.
16360 2021-06-07 Martin Liska <mliska@suse.cz>
16362 * common/common-target.def: Split long lines and replace them
16364 * target.def: Likewise.
16365 * doc/tm.texi: Re-generated.
16367 2021-06-07 Jakub Jelinek <jakub@redhat.com>
16370 * fold-const.c (fold_read_from_vector): Return NULL if trying to
16371 read from a CONSTRUCTOR with vector type elements.
16373 2021-06-07 Jakub Jelinek <jakub@redhat.com>
16375 PR middle-end/100898
16376 * tree-inline.c (copy_bb): Only use gimple_call_arg_ptr if memcpy
16377 should copy any arguments. Don't call gimple_call_num_args
16378 on id->call_stmt or call_stmt more than once.
16380 2021-06-07 liuhongt <hongtao.liu@intel.com>
16383 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): Refine
16385 (<insn>v4siv4di2): Delete constraints for define_expand.
16387 2021-06-07 liuhongt <hongtao.liu@intel.com>
16390 * config/i386/i386-expand.c (ix86_expand_builtin): Remove
16391 assignment of cfun->machine->has_explicit_vzeroupper.
16392 * config/i386/i386-features.c
16393 (ix86_add_reg_usage_to_vzerouppers): Delete.
16394 (ix86_add_reg_usage_to_vzeroupper): Ditto.
16395 (rest_of_handle_insert_vzeroupper): Remove
16396 ix86_add_reg_usage_to_vzerouppers, add df_analyze at the end
16398 (gate): Remove cfun->machine->has_explicit_vzeroupper.
16399 * config/i386/i386-protos.h (ix86_expand_avx_vzeroupper):
16401 * config/i386/i386.c (ix86_insn_callee_abi): New function.
16402 (ix86_initialize_callee_abi): Ditto.
16403 (ix86_expand_avx_vzeroupper): Ditto.
16404 (ix86_hard_regno_call_part_clobbered): Adjust for vzeroupper
16406 (TARGET_INSN_CALLEE_ABI): Define as ix86_insn_callee_abi.
16407 (ix86_emit_mode_set): Call ix86_expand_avx_vzeroupper
16409 * config/i386/i386.h (struct GTY(()) machine_function): Delete
16410 has_explicit_vzeroupper.
16411 * config/i386/i386.md (enum unspec): New member
16413 (ABI_DEFAULT,ABI_VZEROUPPER,ABI_UNKNOWN): New
16414 define_constants for insn callee abi index.
16415 * config/i386/predicates.md (vzeroupper_pattern): Adjust.
16416 * config/i386/sse.md (UNSPECV_VZEROUPPER): Deleted.
16417 (avx_vzeroupper): Call ix86_expand_avx_vzeroupper.
16418 (*avx_vzeroupper): Rename to ..
16419 (avx_vzeroupper_callee_abi): .. this, and adjust pattern as
16420 call_insn which has a special vzeroupper ABI.
16421 (*avx_vzeroupper_1): Deleted.
16423 2021-06-07 liuhongt <hongtao.liu@intel.com>
16426 * df-scan.c (df_get_call_refs): When call_insn is a fake call,
16427 it won't use stack pointer reg.
16428 * final.c (leaf_function_p): When call_insn is a fake call, it
16429 won't affect caller as a leaf function.
16430 * reg-stack.c (callee_clobbers_any_stack_reg): New.
16431 (subst_stack_regs): When call_insn doesn't clobber any stack
16432 reg, don't clear the arguments.
16433 * rtl.c (shallow_copy_rtx): Don't clear flag used when orig is
16435 * shrink-wrap.c (requires_stack_frame_p): No need for stack
16436 frame for a fake call.
16437 * rtl.h (FAKE_CALL_P): New macro.
16439 2021-06-06 Eric Botcazou <ebotcazou@adacore.com>
16441 * config/sparc/sparc-protos.h (order_regs_for_local_alloc): Rename
16443 (sparc_order_regs_for_local_alloc): ...this.
16444 (sparc_leaf_reg_remap): Declare.
16445 * config/sparc/sparc.h (ADJUST_REG_ALLOC_ORDER): Adjust.
16446 (LEAF_REG_REMAP): Reimplement as call to sparc_leaf_reg_remap.
16447 * config/sparc/sparc.c (leaf_reg_remap): Delete.
16448 (order_regs_for_local_alloc): Rename to...
16449 (sparc_order_regs_for_local_alloc): ...this.
16450 (sparc_leaf_reg_remap): New function.
16451 (sparc_conditional_register_usage): Do not modify leaf_reg_remap.
16453 2021-06-06 David Edelsohn <dje.gcc@gmail.com>
16455 * config/rs6000/rs6000.c (rs6000_xcoff_asm_output_aligned_decl_common):
16456 Use assemble_name to output BSS section name.
16458 2021-06-06 Uroš Bizjak <ubizjak@gmail.com>
16460 * config/i386/constraints.md (Bs):
16461 Remove boolean operators from match_test RTX.
16464 (M): Use "mode" variable instead of GET_MODE (op) in match_test RTX.
16467 2021-06-06 Martin Liska <mliska@suse.cz>
16469 * doc/extend.texi: Add missing @headitem.
16470 * doc/invoke.texi: Likewise.
16471 * doc/objc.texi: Likewise.
16473 2021-06-06 Martin Liska <mliska@suse.cz>
16475 * genhooks.c (emit_findices): Remove unused function.
16476 (emit_documentation): Do not call emit_findices
16477 and do not search for @Fcode directives.
16479 2021-06-06 Martin Liska <mliska@suse.cz>
16481 * doc/invoke.texi: Remove extra character.
16483 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16485 * config/sh/sh.md (doloop_end_split): Fix empty split condition.
16487 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16489 * config/sparc/sparc.md (*snedi<W:mode>_zero_vis3,
16490 *neg_snedi<W:mode>_zero_subxc, *plus_snedi<W:mode>_zero,
16491 *plus_plus_snedi<W:mode>_zero, *minus_snedi<W:mode>_zero,
16492 *minus_minus_snedi<W:mode>_zero): Fix empty split condition.
16494 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16496 * config/or1k/or1k.md (*movdi): Fix empty split condition.
16498 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16500 * config/mips/mips.md (<anonymous>, bswapsi2, bswapdi2): Fix empty
16503 2021-06-05 Kewen Lin <linkw@linux.ibm.com>
16505 * config/m68k/m68k.md (*zero_extend_inc, *zero_extend_dec,
16506 *zero_extendsidi2): Fix empty split condition.
16508 2021-06-05 Jeff Law <jeffreyalaw@gmail.com>
16510 * config/h8300/addsub.md: Fix split condition in define_insn_and_split
16512 * config/h8300/bitfield.md: Likewise.
16513 * config/h8300/combiner.md: Likewise.
16514 * config/h8300/divmod.md: Likewise.
16515 * config/h8300/extensions.md: Likewise.
16516 * config/h8300/jumpcall.md: Likewise.
16517 * config/h8300/movepush.md: Likewise.
16518 * config/h8300/multiply.md: Likewise.
16519 * config/h8300/other.md: Likewise.
16520 * config/h8300/shiftrotate.md: Likewise.
16521 * config/h8300/logical.md: Likewise. Fix split pattern to use
16522 code iterator that somehow slipped through.
16524 2021-06-04 Tobias Burnus <tobias@codesourcery.com>
16526 PR middle-end/100905
16527 * tree-nested.c (convert_nonlocal_omp_clauses,
16528 convert_local_omp_clauses): Handle OMP_CLAUSE_BIND.
16530 2021-06-04 Martin Sebor <msebor@redhat.com>
16532 PR middle-end/100732
16533 * gimple-fold.c (gimple_fold_builtin_sprintf): Avoid folding calls
16534 with either source or destination argument of invalid type.
16535 * tree-ssa-uninit.c (maybe_warn_pass_by_reference): Avoid checking
16536 calls with arguments of invalid type.
16538 2021-06-04 Martin Sebor <msebor@redhat.com>
16540 * attribs.c (init_attr_rdwr_indices): Use VLA bounds in the expected
16542 (attr_access::vla_bounds): Also handle VLA bounds.
16544 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
16546 * config/i386/predicates.md (GOT_memory_operand):
16547 Implement using match_code RTXes.
16548 (GOT32_symbol_operand): Ditto.
16550 2021-06-04 Uroš Bizjak <ubizjak@gmail.com>
16553 * config/i386/i386-expand.c (ix86_expand_vector_init_duplicate):
16555 (ix86_expand_vector_init_general): Ditto.
16556 Use SImode instead of word_mode for logic operations
16557 when GET_MODE_SIZE (mode) < UNITS_PER_WORD.
16558 (expand_vec_perm_even_odd_1): Assert that V2HI mode should be
16559 implemented by expand_vec_perm_1.
16560 (expand_vec_perm_broadcast_1): Assert that V2HI and V4HI modes
16561 should be implemented using standard shuffle patterns.
16562 (ix86_vectorize_vec_perm_const): Handle V2HImode. Add V4HI and
16563 V2HI modes to modes, implementable with shuffle for one operand.
16564 * config/i386/mmx.md (*punpckwd): New insn_and_split pattern.
16565 (*pshufw_1): New insn pattern.
16566 (*vec_dupv2hi): Ditto.
16567 (vec_initv2hihi): New expander.
16569 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
16571 * config/arm/vfp.md (no_literal_pool_df_immediate,
16572 no_literal_pool_sf_immediate): Fix empty split condition.
16574 2021-06-04 Kewen Lin <linkw@linux.ibm.com>
16576 * config/i386/i386.md (*load_tp_x32_zext, *add_tp_x32_zext,
16577 *tls_dynamic_gnu2_combine_32): Fix empty split condition.
16578 * config/i386/sse.md (*<sse2_avx2>_pmovmskb_lt,
16579 *<sse2_avx2>_pmovmskb_zext_lt, *sse2_pmovmskb_ext_lt,
16580 *<sse4_1_avx2>_pblendvb_lt): Likewise.
16582 2021-06-04 Jakub Jelinek <jakub@redhat.com>
16585 * config/i386/i386-expand.c (ix86_expand_vector_init): Handle
16586 concatenation from half-sized modes with TImode elements.
16588 2021-06-04 Claudiu Zissulescu <claziss@synopsys.com>
16590 * config/arc/arc.c (arc_override_options): Disable millicode
16591 thunks when RF16 is on.
16593 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
16595 * config/rs6000/rs6000.h (PROMOTE_MODE): Remove.
16597 2021-06-04 Haochen Gui <guihaoc@gcc.gnu.org>
16599 * config/rs6000/rs6000-call.c (rs6000_promote_function_mode):
16600 Replace PROMOTE_MODE marco with its content.
16602 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
16604 * config/cris/cris.md (*addi_reload): Fix empty split condition.
16606 2021-06-03 Jim Wilson <jimw@sifive.com>
16608 * config.gcc (riscv*-*-*): If --with-riscv-attribute not used,
16609 turn it on for all riscv targets.
16611 2021-06-03 Uroš Bizjak <ubizjak@gmail.com>
16614 * config/i386/i386-expand.c (ix86_expand_vector_set):
16615 Handle V2HI and V4QI modes.
16616 (ix86_expand_vector_extract): Ditto.
16617 * config/i386/mmx.md (*pinsrw): New insn pattern.
16620 (*pextrw_zext): Ditto.
16622 (*pextrb_zext): Ditto.
16623 (vec_setv2hi): New expander.
16624 (vec_extractv2hihi): Ditto.
16625 (vec_setv4qi): Ditto.
16626 (vec_extractv4qiqi): Ditto.
16627 (vec_setv8qi): Enable only for TARGET_SSE4_1.
16628 (vec_extractv8qiqi): Ditto.
16630 2021-06-03 Aaron Sawdey <acsawdey@linux.ibm.com>
16632 * config/rs6000/genfusion.pl (gen_logical_addsubf): Fix input
16633 order to subf instruction.
16634 * config/rs6000/fusion.md: Regenerate.
16636 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
16638 * calls.c (get_size_range): Use range_of_expr instead of
16639 determine_value_range.
16640 * tree-affine.c (expr_to_aff_combination): Same.
16641 * tree-data-ref.c (split_constant_offset): Same.
16642 * tree-vrp.c (determine_value_range_1): Remove.
16643 (determine_value_range): Remove.
16644 * tree-vrp.h (determine_value_range): Remove.
16646 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
16648 * function-tests.c (test_ranges): Call gimple_range_tests.
16649 * gimple-range-cache.cc (ranger_cache::range_of_expr): Pass stmt
16651 * gimple-range.cc (fur_source::get_operand): Do not call
16652 get_tree_range or gimple_range_global.
16654 (get_tree_range): Move to value-query.cc.
16655 Call get_arith_expr_range.
16656 (gimple_ranger::range_of_expr): Add argument to get_tree_range.
16657 Include gimple-range-tests.cc.
16658 * gimple-range.h (fold_range): Add argument.
16659 (get_tree_range): Remove.
16660 * selftest.h (gimple_range_tests): New.
16661 * value-query.cc (global_range_query::range_of_expr): Add
16663 (range_query::get_tree_range): Move from gimple-range.cc.
16664 * value-query.h (class range_query): Add get_tree_range and
16665 get_arith_expr_range. Make fur_source a friend.
16666 * vr-values.c (vr_values::range_of_expr): Pass stmt to
16668 * gimple-range-tests.cc: New file.
16670 2021-06-03 Aldy Hernandez <aldyh@redhat.com>
16672 * gimple-range.cc (gimple_ranger::export_global_ranges): Call
16673 update_global_range.
16674 * value-query.cc (update_global_range): New.
16675 * value-query.h (update_global_range): New.
16677 2021-06-03 David Malcolm <dmalcolm@redhat.com>
16679 * diagnostic-show-locus.c (diagnostic_show_locus): Don't reject
16680 printing the same location twice if there are fix-it hints,
16681 multiple locations, or a label.
16683 2021-06-03 Andre Vieira <andre.simoesdiasvieira@arm.com>
16685 * tree-vect-loop.c (vect_transform_loop): Use main loop's various'
16686 thresholds to narrow the upper bound on epilogue iterations.
16688 2021-06-03 Christophe Lyon <christophe.lyon@linaro.org>
16690 * config/arm/mve.md (mve_vabsq_f<mode>): Use 'abs' instead of unspec.
16691 (mve_vabsq_s<mode>): Likewise.
16692 * config/arm/neon.md (abs<mode>2): Rename to neon_abs<mode>2.
16693 * config/arm/unspecs.md (VABSQ_F, VABSQ_S): Delete.
16694 * config/arm/vec-common.md (neg<mode>2): Rename to
16695 <absneg_str><mode>2.
16697 2021-06-03 Claudiu Zissulescu <claziss@synopsys.com>
16699 * common/config/arc/arc-common.c (arc_option_optimization_table):
16700 Remove malign-call.
16701 * config/arc/arc.c (arc_unalign_branch_p): Remove unused function.
16702 * config/arc/arc.h (TARGET_MIXED_CODE): Remove macro.
16703 (INDEX_REG_CLASS): Only refer to GENERAL_REGS.
16704 * config/arc/arc.md (abssi2_mixed): Remove pattern.
16705 * config/arc/arc.opt (munalign-prob-threshold): Mark it obsolete.
16706 (malign-call): Likewise.
16707 (mmixed-code): Likewise.
16708 * doc/invoke.texi (ARC): Update doc.
16710 2021-06-03 Martin Liska <mliska@suse.cz>
16712 * common.opt: Use proper Enum values.
16713 * opts.c (COVERAGE_SANITIZER_OPT): Remove.
16714 (parse_sanitizer_options): Handle only sanitizer_opts.
16715 (common_handle_option): Just assign value.
16717 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
16720 * tree-inline.c (inline_forbidden_p): Remove test on return type.
16722 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
16724 * dwarf2out.c (loc_list_from_tree_1) <FUNCTION_DECL>: Also generate
16725 DW_OP_GNU_variable_value referencing an existing DIE at file scope.
16726 (type_byte_size): Inline into...
16727 (add_byte_size_attribute): ...this and call add_scalar_info.
16729 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
16731 * dwarf2out.c (mem_loc_descriptor) <UDIV>: Fix typo.
16732 (typed_binop_from_tree): New function.
16733 (loc_list_from_tree_1) <EXACT_DIV_EXPR>: For an unsigned type,
16734 turn a divide by a power of 2 into a shift.
16735 <CEIL_DIV_EXPR>: For an unsigned type, use a signed divide if the
16736 size of the mode is lower than DWARF2_ADDR_SIZE; otherwise, do a
16737 typed divide by calling typed_binop_from_tree.
16739 2021-06-03 Eric Botcazou <ebotcazou@adacore.com>
16741 * dwarf2out.c (scompare_loc_descriptor): Fix head comment.
16742 (is_handled_procedure_type): Likewise.
16743 (struct loc_descr_context): Add strict_signedness field.
16744 (resolve_args_picking_1): Deal with DW_OP_[GNU_]deref_type,
16745 DW_OP_[GNU_]convert and DW_OP_[GNU_]reinterpret.
16746 (resolve_args_picking): Minor tweak.
16747 (function_to_dwarf_procedure): Initialize strict_signedness field.
16748 (type_byte_size): Likewise.
16749 (field_byte_offset): Likewise.
16750 (gen_descr_array_type_die): Likewise.
16751 (gen_variant_part): Likewise.
16752 (loc_list_from_tree_1) <CALL_EXPR>: Tidy up and set strict_signedness
16753 to true when a context is present before evaluating the arguments.
16754 <COND_EXPR>: Do not generate a useless comparison with zero.
16755 When dereferencing an address, if strict_signedness is true and the
16756 type is small and signed, use DW_OP_deref_type to do the dereference
16757 and then DW_OP_convert to convert back to the generic type.
16759 2021-06-03 Jakub Jelinek <jakub@redhat.com>
16762 * tree-inline.c (copy_tree_body_r): Handle iterators on
16763 OMP_CLAUSE_AFFINITY or OMP_CLAUSE_DEPEND.
16765 2021-06-03 Kewen Lin <linkw@linux.ibm.com>
16767 * config/arc/arc.md (*bbit_di): Remove.
16769 2021-06-02 Christoph Muellner <cmuellner@gcc.gnu.org>
16771 PR rtl-optimization/100264
16772 * ree.c (get_sub_rtx): Ignore SET expressions without register
16773 destinations and remove assertion, as it is not valid anymore
16774 with this new behaviour.
16775 (merge_def_and_ext): Eliminate destination check for register
16776 as such SET expressions can't occur anymore.
16777 (combine_reaching_defs): Likewise.
16779 2021-06-02 Jakub Jelinek <jakub@redhat.com>
16782 * config/xtensa/xtensa.h (LEAF_REG_REMAP): Cast REGNO to int to avoid
16783 -Wtype-limits warnings.
16784 (DWARF_FRAME_REGISTER): Rewrite into ternary operator with addition
16785 in operands to avoid -Wsign-compare warnings.
16787 2021-06-02 Pat Haugen <pthaugen@linux.ibm.com>
16789 * config/rs6000/rs6000-logue.c (rs6000_emit_prologue): Use
16792 2021-06-02 Vineet Gupta <vgupta@synopsys.com>
16794 * config/arc/arc.h (TARGET_CPU_DEFAULT): Change to hs38_linux.
16796 2021-06-02 Ilya Leoshkevich <iii@linux.ibm.com>
16798 * config/s390/s390.md(*ashrdi3_31<setcc><cconly>): Use a single
16800 * config/s390/subst.md(cconly_subst): Use a single constraint
16801 in (match_scratch).
16803 2021-06-02 Martin Liska <mliska@suse.cz>
16805 * ipa-icf.h: Use auto_vec for memory_access_types.
16807 2021-06-02 Jeff Law <jeffreyalaw@gmail.com>
16809 * config/h8300/h8300-protos.h (compute_a_shift_length): Drop unused
16810 argument from prototype.
16811 (output_logical_op): Add rtx_code argument.
16812 (compute_logical_op_length): Likewise.
16813 * config/h8300/h8300.c (h8300_and_costs): Pass additional argument
16814 to compute_a_shift_length.
16815 (output_logical_op); New argument with the rtx code rather than
16816 extracting it from an operand. Handle QImode too.
16817 (compute_logical_op_length): Similary.
16818 (compute_a_shift_length): Drop unused argument.
16819 * config/h8300/h8300.md (logicals): New code iterator.
16820 * config/h8300/logical.md (<code><mode>3 expander): Combine
16821 the "and" expander with the "ior"/"xor" expander.
16822 (bclr<mode>msx): Combine the QI/HI mode patterns.
16823 (<logical><mode>3 insns): Use code iterator rather than match_operator.
16824 Handle QImode as well. Update call to output_logical_op and
16825 compute_logical_op_length to pass in rtx_code
16826 Fix split condition on all define_insn_and_split patterns.
16827 (one_cmpl<mode>2<cczn>): Use <cczn> to support both clobbering
16828 the flags and setting ZN via existing define_subst.
16829 * config/h8300/shiftrotate.md: Drop unused argument from
16830 calls to compute_a_shift_length.
16831 Signed-off-by: Jeff Law <jeffreyalaw@gmail.com>
16833 2021-06-01 Andrew Pinski <apinski@marvell.com>
16835 PR tree-optimization/25290
16836 * tree-ssa-phiopt.c (match_simplify_replacement):
16838 (tree_ssa_phiopt_worker): Use match_simplify_replacement.
16839 (two_value_replacement): Change the comment about
16840 conditional_replacement.
16841 (conditional_replacement): Delete.
16843 2021-06-01 Andrew Pinski <apinski@marvell.com>
16845 PR tree-optimization/95481
16846 * tree-tailcall.c (find_tail_calls): Handle empty typed
16849 2021-06-01 Andrew Pinski <apinski@marvell.com>
16851 * gimplify.c (zero_sized_field_decl): Delete
16852 (zero_sized_type): Delete
16853 (gimplify_init_ctor_eval): Use is_empty_type instead
16854 of zero_sized_field_decl.
16855 (gimplify_modify_expr): Use is_empty_type instead of
16858 2021-06-01 Jason Merrill <jason@redhat.com>
16861 * tree.h (CALL_FROM_NEW_OR_DELETE_P): Adjust comment.
16863 2021-06-01 Jason Merrill <jason@redhat.com>
16866 * diagnostic.h (warning_enabled_at): Declare.
16867 * diagnostic.c (diagnostic_enabled): Factor out from...
16868 (diagnostic_report_diagnostic): ...here.
16869 (warning_enabled_at): New.
16871 2021-06-01 Aldy Hernandez <aldyh@redhat.com>
16873 * gimple-ssa-evrp.c: Enable exporting of global ranges.
16875 2021-06-01 Martin Liska <mliska@suse.cz>
16878 * doc/invoke.texi: Mention that -fgcse-after-reload
16879 is enabled with -O3.
16881 2021-06-01 liuhongt <hongtao.liu@intel.com>
16883 PR tree-optimization/98365
16884 * tree-if-conv.c (strip_nop_cond_scalar_reduction): New function.
16885 (is_cond_scalar_reduction): Handle nop_expr in cond scalar reduction.
16886 (convert_scalar_cond_reduction): Ditto.
16887 (predicate_scalar_phi): Ditto.
16889 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
16891 PR tree-optimization/100781
16892 * gimple-range-cache.cc (ranger_cache::ranger_cache): Enable new
16893 value calculation by default.
16894 (ranger_cache::enable_new_values): New.
16895 (ranger_cache::disable_new_values): New.
16896 (ranger_cache::push_poor_value): Check if new values are allowed.
16897 * gimple-range-cache.h (class ranger_cache): New member/methods.
16898 * gimple-range.cc (gimple_ranger::range_of_expr): Check for debug
16899 statement, and disable/renable new value calculation.
16901 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
16903 * gimple-range-cache.cc (ranger_cache::ssa_range_in_bb): Delete.
16904 (ranger_cache::range_of_def): New.
16905 (ranger_cache::entry_range): New.
16906 (ranger_cache::exit_range): New.
16907 (ranger_cache::range_of_expr): Adjust.
16908 (ranger_cache::range_on_edge): Adjust.
16909 (ranger_cache::propagate_cache): Call exit_range directly.
16910 * gimple-range-cache.h (class ranger_cache): Adjust.
16912 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
16914 * gimple-range-cache.cc (ranger_cache::ranger_cache): Adjust for
16915 gori_compute being a member rather than base class.
16916 dervied call to member call.
16917 (ranger_cache::dump): No longer dump gori_map.
16918 (ranger_cache::dump_bb): New.
16919 (ranger_cache::get_non_stale_global_range): Adjust for gori_compute
16920 being a member rather than base class.
16921 (ranger_cache::set_global_range): Ditto.
16922 (ranger_cache::ssa_range_in_bb): Ditto.
16923 (ranger_cache::range_of_expr): New.
16924 (ranger_cache::range_on_edge): New.
16925 (ranger_cache::block_range): Adjust for gori_computes. Debug changes.
16926 (ranger_cache::propagate_cache): Adjust debugging output.
16927 (ranger_cache::fill_block_cache): Adjust for gori_computes. Debug
16929 * gimple-range-cache.h (class ranger_cache): Make gori_compute a
16930 member, and inherit from range_query instead.
16931 (ranger_cache::dump_bb): New. split from dump.
16932 * gimple-range-gori.cc (gori_compute::ssa_range_in_bb): Delete.
16933 (gori_compute::expr_range_at_stmt): Delete.
16934 (gori_compute::compute_name_range_op): Delete.
16935 (gori_compute::compute_operand_range_switch): Add fur_source.
16936 (gori_compute::compute_operand_range): Add fur_source param, inline
16937 old compute_name_range_op and optimize_logical_operands.
16938 (struct tf_range): Delete.
16939 (gori_compute::logical_combine): Adjust
16940 (gori_compute::optimize_logical_operands): Delete.
16941 (gori_compute::compute_logical_operands_in_chain): Delete.
16942 (gori_compute::compute_logical_operands): Adjust.
16943 (gori_compute::compute_operand1_range): Adjust to fur_source.
16944 (gori_compute::compute_operand2_range): Ditto.
16945 (gori_compute::compute_operand1_and_operand2_range): Ditto.
16946 (gori_compute::outgoing_edge_range_p): Add range_query parameter,
16947 and adjust to fur_source.
16948 * gimple-range-gori.h (class gori_compute): Simplify and adjust to
16949 range_query and fur_source.
16950 * gimple-range.cc (gimple_ranger::range_on_edge): Query range_on_edge
16951 from the ranger_cache..
16952 (gimple_ranger::fold_range_internal): Adjust to base class change of
16954 (gimple_ranger::dump_bb): Adjust dump.
16955 * gimple-range.h (gimple_ranger):export gori computes object.
16957 2021-06-01 Andrew MacLeod <amacleod@redhat.com>
16959 PR tree-optimization/100774
16960 * gimple-range-cache.cc (ranger_cache::get_non_stale_global_range):
16961 Constant values are also not stale.
16962 (ranger_cache::set_global_range): Range invariant values should also
16963 have the correct timestamp.
16965 2021-05-31 Martin Liska <mliska@suse.cz>
16967 * tree-streamer-in.c (unpack_ts_function_decl_value_fields):
16968 Unpack FUNCTION_DECL_DECL_TYPE.
16969 * tree-streamer-out.c (pack_ts_function_decl_value_fields):
16970 Stream FUNCTION_DECL_DECL_TYPE instead of
16971 DECL_IS_OPERATOR_NEW_P.
16972 * tree.h (set_function_decl_type): Use FUNCTION_DECL_DECL_TYPE
16974 (DECL_IS_OPERATOR_NEW_P): Likewise.
16975 (DECL_IS_OPERATOR_DELETE_P): Likewise.
16976 (DECL_LAMBDA_FUNCTION_P): Likewise.
16978 2021-05-31 Richard Biener <rguenther@suse.de>
16981 * internal-fn.c (expand_SHUFFLEVECTOR): Define.
16982 * internal-fn.def (SHUFFLEVECTOR): New.
16983 * internal-fn.h (expand_SHUFFLEVECTOR): Declare.
16984 * doc/extend.texi: Document __builtin_shufflevector.
16986 2021-05-31 Peter Bergner <bergner@linux.ibm.com>
16989 * config/rs6000/predicates.md(mma_assemble_input_operand): Allow
16990 indexed form addresses.
16992 2021-05-29 Jeff Law <jlaw@tachyum.com>
16994 * config/h8300/h8300.c (h8300_emit_stack_adjustment): Drop unused
16995 parameter. Call callers fixed.
16997 (output_plussi): Add FALLTHRU markers.
16998 (h8300_shift_needs_scratch_p): Add gcc_unreachable marker.
17000 2021-05-29 Jakub Jelinek <jakub@redhat.com>
17002 PR middle-end/99928
17003 * gimplify.c (gimplify_scan_omp_clauses): For taskloop simd
17004 combined with parallel, make sure to add shared clause to
17005 parallel for explicit linear clause.
17007 2021-05-29 Aldy Hernandez <aldyh@redhat.com>
17009 PR tree-optimization/100787
17010 * gimple-ssa-evrp.c: Disable exporting of global ranges.
17012 2021-05-28 Jason Merrill <jason@redhat.com>
17014 * tree-iterator.h (struct tree_stmt_iterator): Add operator++,
17015 operator--, operator*, operator==, and operator!=.
17016 (class tsi_range): New.
17018 2021-05-28 Richard Biener <rguenther@suse.de>
17020 PR tree-optimization/100778
17021 * tree-vect-slp.c (vect_build_slp_tree_1): Prevent possibly
17022 trapping ops in different BBs.
17024 2021-05-28 Richard Biener <rguenther@suse.de>
17027 * tree-inline.c (copy_bb): When processing __builtin_va_arg_pack
17028 copy fntype from original call.
17030 2021-05-28 Martin Liska <mliska@suse.cz>
17032 PR gcov-profile/100751
17033 * doc/gcov.texi: Revert partially a hunk that was wrong.
17035 2021-05-28 Cooper Qu <cooper.qu@linux.alibaba.com>
17037 * config/csky/csky-linux-elf.h (HAVE_sync_compare_and_swapqi):
17039 (HAVE_sync_compare_and_swaphi): Likewise.
17040 (HAVE_sync_compare_and_swapsi): Likewise.
17042 2021-05-28 Jakub Jelinek <jakub@redhat.com>
17044 PR middle-end/99928
17045 * tree.h (OMP_CLAUSE_MAP_IMPLICIT): Define.
17047 2021-05-28 Tobias Burnus <tobias@codesourcery.com>
17049 * gimplify.c (gimplify_omp_affinity): New.
17050 (gimplify_scan_omp_clauses): Call it; remove affinity clause afterwards.
17051 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_AFFINITY.
17052 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_AFFINITY.
17053 * tree.c (omp_clause_num_ops, omp_clause_code_name): Add clause.
17054 (walk_tree_1): Handle OMP_CLAUSE_AFFINITY.
17056 2021-05-28 Joern Rennecke <joern.rennecke@riscy-ip.com>
17057 Richard Biener <rguenther@suse.de>
17059 * match.pd <popcount & / + pattern matching>:
17060 When generating popcount directly fails, try doing it in two halves.
17062 2021-05-28 Bernd Edlinger <bernd.edlinger@hotmail.de>
17064 * Makefile.in (generated_files): Add gimple-match.c and
17067 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
17069 * gensupport.c (alter_predicate_for_insn): Handle MATCH_DUP.
17071 2021-05-28 Joern Rennecke <joern.rennecke@embecosm.com>
17073 * gensupport.c (alter_constraints): Add MATCH_SCRATCH case.
17075 2021-05-28 Kewen Lin <linkw@linux.ibm.com>
17077 PR tree-optimization/99398
17078 * tree-ssa-forwprop.c (simplify_permutation): Optimize some cases
17079 where the fed operands are CTOR/CST and propagated through
17080 VIEW_CONVERT_EXPR. Call vec_perm_indices::new_shrunk_vector.
17081 * vec-perm-indices.c (vec_perm_indices::new_shrunk_vector): New
17083 * vec-perm-indices.h (vec_perm_indices::new_shrunk_vector): New
17086 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
17088 * config/i386/mmx.md (addv2sf3): Do not call
17089 ix86_fixup_binary_operands_no_copy.
17092 (<smaxmin:code>v2sf3): Ditto.
17093 (<plusminus:insn><MMXMODEI:mode>3): Ditto.
17094 (<plusminus:insn><VI_32:mode>3): Remove expander.
17095 (<plusminus:insn><VI_32:mode>3): Rename from
17096 "*<plusminus:insn><VI_32:mode>3".
17097 (mulv4hi): Do not call ix86_fixup_binary_operands_no_copy.
17098 (mulv2hi3): Remove expander.
17099 (mulv2hi3): Rename from *mulv2hi3.
17100 (<s>mulv2hi3_highpart): Remove expander.
17101 (<s>mulv2hi3_highpart): Rename from *<s>mulv2hi3_highpart.
17102 (<smaxmin:code><MMXMODE14:mode>3): Rename from
17103 "*mmx_<smaxmin:code><MMXMODE14:mode>3".
17104 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Remove expander.
17105 (SMAXMIN_MMXMODEI): Remove mode iterator.
17106 (<smaxmin:code>v4hi3): New expander.
17107 (<smaxmin:code>v4qi3): Rename from *<smaxmin:code>v4qi3.
17108 (<smaxmin:code>v2hi3): Rename from *<smaxmin:code>v2hi3.
17109 (<smaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
17110 (SMAXMIN_VI_32): Remove mode iterator.
17111 (<umaxmin:code><MMXMODE24:mode>3): Rename from
17112 "*mmx_<umaxmin:code><MMXMODE24:mode>3".
17113 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Remove expander.
17114 (UMAXMIN_MMXMODEI): Remove mode iterator.
17115 (<umaxmin:code>v8qi3): New expander.
17116 (<umaxmin:code>v4qi3): Rename from *<umaxmin:code>v4qi3.
17117 (<umaxmin:code>v2hi3): Rename from *<umaxmin:code>v2hi3.
17118 (<umaxmin:code><SMAXMIN_VI_32:mode>3): Remove expander.
17119 (UMAXMIN_VI_32): Remove mode iterator.
17120 (<any_shift:insn>v2hi3): Remove expander.
17121 (<any_shift:insn>v2hi3): Rename from *<any_shift:insn>v2hi3.
17122 (<any_logic:code><MMXMODEI:mode>3): Do not call
17123 ix86_fixup_binary_operands_no_copy.
17124 (<any_logic:code><VI_32:mode>3): Remove expander.
17125 (<any_logic:code><VI_32:mode>3): Rename from
17126 "*<any_logic:code><VI_32:mode>3".
17127 (uavg<mode>3_ceil): Do not call ix86_fixup_binary_operands_no_copy.
17128 * config/i386/sse.md (div<VF2:mode>3): Do not call
17129 ix86_fixup_binary_operands_no_copy.
17130 (div<VF1:mode>3): Ditto.
17131 (<maxmin:code><VI8_AVX2_AVX512F:mode>3): Ditto.
17132 (smulhrsv4hi3): Ditto.
17133 (smulhrsv2hi3): Ditto.
17135 2021-05-27 Martin Sebor <msebor@redhat.com>
17137 * ggc.h (gt_ggc_mx): Add overloads for all integers.
17139 * hash-map.h (class hash_map): Add pch_nx_helper overloads for all
17141 (hash_map::operator==): New function.
17143 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
17146 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
17147 For TARGET_XOP bypass SSE comparisons for all supported vector modes.
17148 * config/i386/mmx.md (*xop_maskcmp<MMXMODEI:mode>3): New insn pattern.
17149 (*xop_maskcmp<VI_32:mode>3): Ditto.
17150 (*xop_maskcmp_uns<MMXMODEI:mode>3): Ditto.
17151 (*xop_maskcmp_uns<VI_32:mode>3): Ditto.
17153 2021-05-27 Richard Earnshaw <rearnsha@arm.com>
17156 * config/arm/arm.c (arm_configure_build_target): Remove parameter
17157 opts_set, directly check opts parameters for being non-null.
17158 (arm_option_restore): Update call to arm_configure_build_target.
17159 (arm_option_override): Likewise.
17160 (arm_can_inline_p): Likewise.
17161 (arm_valid_target_attribute_tree): Likewise.
17162 * config/arm/arm-c.c (arm_pragma_target_parse): Likewise.
17163 * config/arm/arm-protos.h (arm_configure_build_target): Adjust
17166 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
17168 * vr-values.c (simplify_conversion_using_ranges): Use
17169 get_range_query instead of get_global_range_query.
17171 2021-05-27 Aldy Hernandez <aldyh@redhat.com>
17173 * gimple-range.cc (get_range_global): Move to value-query.cc.
17174 (gimple_range_global): Same.
17175 (get_global_range_query): Same.
17176 (global_range_query::range_of_expr): Same.
17177 * gimple-range.h (class global_range_query): Move to
17179 (gimple_range_global): Same.
17180 * tree-ssanames.c (get_range_info): Move to value-query.cc.
17181 (get_ptr_nonnull): Same.
17182 * tree-ssanames.h (get_range_info): Remove.
17183 (get_ptr_nonnull): Remove.
17184 * value-query.cc (get_ssa_name_range_info): Move from
17186 (get_ssa_name_ptr_info_nonnull): Same.
17187 (get_range_global): Move from gimple-range.cc.
17188 (gimple_range_global): Same.
17189 (get_global_range_query): Same.
17190 (global_range_query::range_of_expr): Same.
17191 * value-query.h (class global_range_query): Move from
17193 (gimple_range_global): Same.
17195 2021-05-27 Uroš Bizjak <ubizjak@gmail.com>
17198 * config/i386/mmx.md (uavgv4qi3_ceil): New insn pattern.
17199 (uavgv2hi3_ceil): Ditto.
17201 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
17204 * doc/extend.texi (scalar_storage_order): Rephrase slightly.
17206 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17208 * tree-ssanames.c (get_range_info): Merge both copies of
17209 get_range_info into one that works with irange.
17210 * tree-ssanames.h (get_range_info): Remove version that works on
17213 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17215 * builtins.c (check_nul_terminated_array): Convert to get_range_query.
17216 (expand_builtin_strnlen): Same.
17217 (determine_block_size): Same.
17218 * fold-const.c (expr_not_equal_to): Same.
17219 * gimple-fold.c (size_must_be_zero_p): Same.
17220 * gimple-match-head.c: Include gimple-range.h.
17221 * gimple-pretty-print.c (dump_ssaname_info): Convert to get_range_query.
17222 * gimple-ssa-warn-restrict.c
17223 (builtin_memref::extend_offset_range): Same.
17224 * graphite-sese-to-poly.c (add_param_constraints): Same.
17225 * internal-fn.c (get_min_precision): Same.
17226 * ipa-fnsummary.c (set_switch_stmt_execution_predicate): Same.
17227 * ipa-prop.c (ipa_compute_jump_functions_for_edge): Same.
17229 * tree-data-ref.c (split_constant_offset): Same.
17230 (dr_step_indicator): Same.
17231 * tree-dfa.c (get_ref_base_and_extent): Same.
17232 * tree-scalar-evolution.c (iv_can_overflow_p): Same.
17233 * tree-ssa-loop-niter.c (refine_value_range_using_guard): Same.
17234 (determine_value_range): Same.
17235 (record_nonwrapping_iv): Same.
17236 (infer_loop_bounds_from_signedness): Same.
17237 (scev_var_range_cant_overflow): Same.
17238 * tree-ssa-phiopt.c (two_value_replacement): Same.
17239 * tree-ssa-pre.c (insert_into_preds_of_block): Same.
17240 * tree-ssa-reassoc.c (optimize_range_tests_to_bit_test): Same.
17241 * tree-ssa-strlen.c (handle_builtin_stxncpy_strncat): Same.
17243 (dump_strlen_info): Same.
17244 (set_strlen_range): Same.
17245 (maybe_diag_stxncpy_trunc): Same.
17246 (get_len_or_size): Same.
17247 (handle_integral_assign): Same.
17248 * tree-ssa-structalias.c (find_what_p_points_to): Same.
17249 * tree-ssa-uninit.c (find_var_cmp_const): Same.
17250 * tree-switch-conversion.c (bit_test_cluster::emit): Same.
17251 * tree-vect-patterns.c (vect_get_range_info): Same.
17252 (vect_recog_divmod_pattern): Same.
17253 * tree-vrp.c (intersect_range_with_nonzero_bits): Same.
17254 (register_edge_assert_for_2): Same.
17255 (determine_value_range_1): Same.
17256 * tree.c (get_range_pos_neg): Same.
17257 * vr-values.c (vr_values::get_lattice_entry): Same.
17258 (vr_values::update_value_range): Same.
17259 (simplify_conversion_using_ranges): Same.
17261 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17263 * gimple-ssa-warn-alloca.c (alloca_call_type): Use
17264 get_range_query instead of query argument.
17265 (pass_walloca::execute): Enable and disable global ranger.
17267 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17269 * gimple-ssa-evrp.c (rvrp_folder::rvrp_folder): Call
17271 (rvrp_folder::~rvrp_folder): Call disable_ranger.
17272 (hybrid_folder::hybrid_folder): Call enable_ranger.
17273 (hybrid_folder::~hybrid_folder): Call disable_ranger.
17275 2021-05-26 Aldy Hernandez <aldyh@redhat.com>
17277 * function.c (allocate_struct_function): Set cfun->x_range_query.
17278 * function.h (struct function): Declare x_range_query.
17279 (get_range_query): New.
17280 (get_global_range_query): New.
17281 * gimple-range-cache.cc (ssa_global_cache::ssa_global_cache):
17282 Remove call to safe_grow_cleared.
17283 * gimple-range.cc (get_range_global): New.
17284 (gimple_range_global): Move from gimple-range.h.
17285 (get_global_range_query): New.
17286 (global_range_query::range_of_expr): New.
17287 (enable_ranger): New.
17288 (disable_ranger): New.
17289 * gimple-range.h (gimple_range_global): Move to gimple-range.cc.
17290 (class global_range_query): New.
17291 (enable_ranger): New.
17292 (disable_ranger): New.
17293 * gimple-ssa-evrp.c (evrp_folder::~evrp_folder): Rename
17294 dump_all_value_ranges to dump.
17295 * tree-vrp.c (vrp_prop::finalize): Same.
17296 * value-query.cc (range_query::dump): New.
17297 * value-query.h (range_query::dump): New.
17298 * vr-values.c (vr_values::dump_all_value_ranges): Rename to...
17299 (vr_values::dump): ...this.
17300 * vr-values.h (class vr_values): Rename dump_all_value_ranges to
17301 dump and make virtual.
17303 2021-05-26 Uroš Bizjak <ubizjak@gmail.com>
17305 * config/i386/i386.c (ix86_autovectorize_vector_modes):
17306 Add V4QImode and V16QImode for TARGET_SSE2.
17307 * doc/sourcebuild.texi (Vector-specific attributes):
17308 Add vect64 and vect32 description.
17310 2021-05-26 Bernd Edlinger <bernd.edlinger@hotmail.de>
17312 * gimple-range-gori.cc (range_def_chain::register_dependency):
17313 Resize m_def_chain when needed.
17315 2021-05-26 Christophe Lyon <christophe.lyon@linaro.org>
17317 * config/arm/mve.md (mve_vaddvq_<supf><mode>): Prefix with '@'.
17318 * config/arm/neon.md (reduc_plus_scal_<mode>): Move to ..
17319 * config/arm/vec-common.md: .. here. Add support for MVE.
17321 2021-05-26 Jakub Jelinek <jakub@redhat.com>
17323 * config/epiphany/epiphany.c (epiphany_print_operand_address): Remove
17325 * config/microblaze/microblaze.c (microblaze_legitimize_address,
17327 microblaze_option_override, print_operand): Likewise.
17328 * config/microblaze/microblaze.md (call_internal_plt,
17329 call_value_intern_plt, call_value_intern): Likewise.
17330 * config/arm/aout.h (ASM_OUTPUT_ALIGN): Likewise.
17331 * config/iq2000/iq2000.md (call_internal1, call_value_internal1,
17332 call_value_multiple_internal1): Likewise.
17333 * config/bfin/bfin.c (symbolic_reference_mentioned_p): Likewise.
17335 2021-05-26 Jan-Benedict Glaw <jbglaw@lug-owl.de>
17337 * config/arc/arc.c (arc_address_cost, arc_print_operand_address,
17338 arc_ccfsm_advance, symbolic_reference_mentioned_p,
17339 arc_raw_symbolic_reference_mentioned_p): Remove register
17342 2021-05-26 Jakub Jelinek <jakub@redhat.com>
17345 * omp-low.c: Include omp-offload.h.
17346 (create_omp_child_function): If current_function_decl has
17347 "omp declare target" attribute and is_gimple_omp_offloaded,
17348 remove that attribute from the copy of attribute list and
17349 add "omp target entrypoint" attribute instead.
17350 (lower_omp_target): Mark .omp_data_sizes.* and .omp_data_kinds.*
17351 variables for offloading if in omp_maybe_offloaded_ctx.
17352 * omp-offload.c (pass_omp_target_link::execute): Nullify second
17353 argument to GOMP_target_data_ext in offloaded code.
17355 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17357 * config/csky/csky.c (csky_can_change_mode_class): Delete.
17358 For csky, HF/SF mode use the low bits of VREGS.
17360 2021-05-26 Eric Botcazou <ebotcazou@adacore.com>
17362 * gimplify.c (gimplify_decl_expr): Do not clear TREE_READONLY on a
17363 DECL which is a reference for OMP.
17365 2021-05-26 Martin Liska <mliska@suse.cz>
17367 PR gcov-profile/100751
17368 * doc/gcov.texi: Document that __gcov_dump can be called just
17369 once and that __gcov_reset resets run-time counters.
17371 2021-05-26 Martin Liska <mliska@suse.cz>
17373 * doc/install.texi: Port relevant part from install-old.texi
17374 and re-generate list of CPUs and systems.
17376 2021-05-26 Martin Liska <mliska@suse.cz>
17378 * Makefile.in: Remove it.
17379 * doc/include/fdl.texi: Update next/previous chapters.
17380 * doc/install.texi: Likewise.
17381 * doc/install-old.texi: Removed.
17383 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17385 * config/csky/csky.c (ck810_legitimate_index_p): Support
17386 "base + index" with DF mode.
17387 * config/csky/constraints.md ("Y"): New constraint for memory operands
17388 without index register.
17389 * config/csky/csky_insn_fpuv2.md (fpuv3_movdf): Use "Y" instead of "m"
17390 when mov between memory and general registers, and lower their priority.
17391 * config/csky/csky_insn_fpuv3.md (fpuv2_movdf): Likewise.
17393 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17395 * config/csky/csky.c (TARGET_PROMOTE_PROTOTYPES): Delete.
17397 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17399 * config/csky/csky.md (untyped_call): Emit clobber for return
17400 registers to mark them used.
17402 2021-05-26 Geng Qi <gengqi@linux.alibaba.com>
17404 * config/csky/csky.md (cskyv2_sextend_ldbs): New.
17406 2021-05-26 Andrew Pinski <apinski@marvell.com>
17408 * match.pd (x < 0 ? ~y : y): New patterns.
17410 2021-05-26 Andrew Pinski <apinski@marvell.com>
17412 * match.pd (A?CST1:CST2): Add simplifcations for A?0:+-1, A?+-1:0,
17413 A?POW2:0 and A?0:POW2.
17415 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17417 * gimple-range-gori.cc (class logical_stmt_cache): Delete
17418 (logical_stmt_cache::logical_stmt_cache ): Delete.
17419 (logical_stmt_cache::~logical_stmt_cache): Delete.
17420 (logical_stmt_cache::cache_entry::dump): Delete.
17421 (logical_stmt_cache::get_range): Delete.
17422 (logical_stmt_cache::cached_name ): Delete.
17423 (logical_stmt_cache::same_cached_name): Delete.
17424 (logical_stmt_cache::cacheable_p): Delete.
17425 (logical_stmt_cache::slot_diagnostics ): Delete.
17426 (logical_stmt_cache::dump): Delete.
17427 (gori_compute_cache::gori_compute_cache): Delete.
17428 (gori_compute_cache::~gori_compute_cache): Delete.
17429 (gori_compute_cache::compute_operand_range): Delete.
17430 (gori_compute_cache::cache_stmt): Delete.
17431 * gimple-range-gori.h (gori_compute::compute_operand_range): Remove
17433 (class gori_compute_cache): Delete.
17435 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17437 * gimple-range.cc (fold_using_range::range_of_range_op): Use m_gori
17439 (fold_using_range::range_of_address): Adjust.
17440 (fold_using_range::range_of_phi): Adjust.
17441 * gimple-range.h (class fur_source): Adjust.
17442 (fur_source::fur_source): Adjust.
17444 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17446 * gimple-range-gori.cc (gori_compute::expr_range_at_stmt): Rename
17447 from expr_range_in_bb and adjust.
17448 (gori_compute::compute_name_range_op): Adjust.
17449 (gori_compute::optimize_logical_operands): Adjust.
17450 (gori_compute::compute_logical_operands_in_chain): Adjust.
17451 (gori_compute::compute_operand1_range): Adjust.
17452 (gori_compute::compute_operand2_range): Adjust.
17453 (ori_compute_cache::cache_stmt): Adjust.
17454 * gimple-range-gori.h (gori_compute): Rename prototype.
17456 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17458 * gimple-range.cc (gimple_ranger::range_of_expr): Non-null should be
17459 checked only after range_of_stmt, not range_on_entry.
17460 (gimple_ranger::range_on_entry): Check for non-null in any
17461 predecessor block, if it is not already non-null.
17462 (gimple_ranger::range_on_exit): DOnt check for non-null after
17463 range on entry call.
17464 (gimple_ranger::dump_bb): New. Split from dump.
17465 (gimple_ranger::dump): Adjust.
17466 * gimple-range.h (class gimple_ranger): Adjust.
17468 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17470 * gimple-range-cache.cc (struct range_timestamp): Delete.
17471 (class temporal_cache): Adjust.
17472 (temporal_cache::get_timestamp): Delete.
17473 (temporal_cache::set_dependency): Delete.
17474 (temporal_cache::temporal_value): Adjust.
17475 (temporal_cache::current_p): Take dependencies as params.
17476 (temporal_cache::set_timestamp): Adjust.
17477 (temporal_cache::set_always_current): Adjust.
17478 (ranger_cache::get_non_stale_global_range): Adjust.
17479 (ranger_cache::register_dependency): Delete.
17480 * gimple-range-cache.h (class range_cache): Adjust.
17482 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17484 * gimple-range-gori.cc (range_def_chain::range_def_chain): init
17486 (range_def_chain::~range_def_chain): Dispose of obstack rather than
17487 each individual bitmap.
17488 (range_def_chain::set_import): New.
17489 (range_def_chain::get_imports): New.
17490 (range_def_chain::chain_import_p): New.
17491 (range_def_chain::register_dependency): Rename from build_def_chain
17493 (range_def_chain::def_chain_in_bitmap_p): New.
17494 (range_def_chain::add_def_chain_to_bitmap): New.
17495 (range_def_chain::has_def_chain): Just check first depenedence.
17496 (range_def_chain::get_def_chain): Process imports, use generic
17497 register_dependency routine.
17498 (range_def_chain::dump): New.
17499 (gori_map::gori_map): Allocate import list.
17500 (gori_map::~gori_map): Release imports.
17501 (gori_map::exports): Check for past allocated block size.
17502 (gori_map::imports): New.
17503 (gori_map::def_chain_in_export_p): Delete.
17504 (gori_map::is_import_p): New.
17505 (gori_map::maybe_add_gori): Handle imports.
17506 (gori_map::dump): Adjust output, add imports.
17507 (gori_compute::has_edge_range_p): Remove def_chain_in_export call.
17508 (gori_export_iterator::gori_export_iterator): New.
17509 (gori_export_iterator::next): New.
17510 (gori_export_iterator::get_name): New.
17511 * gimple-range-gori.h (range_def_chain): Add imports and direct
17512 dependecies via struct rdc.
17513 (range_def_chain::depend1): New.
17514 (range_def_chain::depend2): New.
17515 (class gori_map): Adjust.
17516 (FOR_EACH_GORI_IMPORT_NAME): New.
17517 (FOR_EACH_GORI_EXPORT_NAME): New.
17518 (class gori_export_iterator): New.
17520 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17522 * gimple-range-cache.cc (ranger_cache::ranger_cache): Move initial
17523 export cache filling to here.
17524 * gimple-range-gori.cc (gori_compute::gori_compute) : From Here.
17526 2021-05-25 Andrew MacLeod <amacleod@redhat.com>
17528 * gimple-range-gori.cc (range_def_chain): Move to gimple-range-gori.h.
17529 (gori_map): Move to gimple-range-gori.h.
17530 (gori_compute::gori_compute): Adjust.
17531 (gori_compute::~gori_compute): Delete.
17532 (gori_compute::compute_operand_range_switch): Adjust.
17533 (gori_compute::compute_operand_range): Adjust.
17534 (gori_compute::compute_logical_operands): Adjust.
17535 (gori_compute::has_edge_range_p ): Adjust.
17536 (gori_compute::set_range_invariant): Delete.
17537 (gori_compute::dump): Adjust.
17538 (gori_compute::outgoing_edge_range_p): Adjust.
17539 * gimple-range-gori.h (class range_def_chain): Relocate here.
17540 (class gori_map): Relocate here.
17541 (class gori_compute): Inherit from gori_map, and adjust.
17543 2021-05-25 Aldy Hernandez <aldyh@redhat.com>
17545 * value-range.cc (range_tests_legacy): Use
17546 build_nonstandard_integer_type instead of int and short.
17548 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
17550 * gimplify.c (gimplify_decl_expr): Clear TREE_READONLY on the DECL
17551 when really creating an initialization statement for it.
17553 2021-05-25 Eric Botcazou <ebotcazou@adacore.com>
17555 * tree-inline.c (setup_one_parameter): Fix thinko in new condition.
17557 2021-05-25 Kito Cheng <kito.cheng@sifive.com>
17559 * config/riscv/riscv.h (ASM_SPEC): Pass -mno-relax.
17561 2021-05-25 Martin Liska <mliska@suse.cz>
17563 PR tree-optimization/92860
17565 * optc-save-gen.awk: Remove exceptions.
17567 2021-05-25 Martin Liska <mliska@suse.cz>
17569 * asan.h (sanitize_coverage_p): New function.
17570 * doc/extend.texi: Document it.
17571 * fold-const.c (fold_range_test): Use sanitize_flags_p
17572 instead of flag_sanitize_coverage.
17573 (fold_truth_andor): Likewise.
17574 * sancov.c: Likewise.
17575 * tree-ssa-ifcombine.c (ifcombine_ifandif): Likewise.
17576 * ipa-inline.c (sanitize_attrs_match_for_inline_p): Handle
17577 -fsanitize-coverage when inlining.
17579 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
17581 * config/csky/csky-modes.def : Fix copyright.
17583 2021-05-25 Cooper Qu <cooper.qu@linux.alibaba.com>
17585 * config/csky/csky-modes.def : Amend copyright.
17586 * config/csky/csky_insn_fpuv2.md : Likewise.
17587 * config/csky/csky_insn_fpuv3.md : Likewise.
17589 2021-05-25 Richard Biener <rguenther@suse.de>
17591 PR middle-end/100727
17592 * calls.c (initialize_argument_information): Explicitely test
17593 for WITH_SIZE_EXPR.
17594 * gimple-expr.c (mark_addressable): Skip outer WITH_SIZE_EXPR.
17596 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
17598 * config/csky/csky.h (FRAME_POINTER_REGNUM): Use
17599 HARD_FRAME_POINTER_REGNUM and FRAME_POINTER_REGNUM instead of
17600 the signle definition. The signle definition may not work well
17601 at simplify_subreg_regno().
17602 (HARD_FRAME_POINTER_REGNUM): New.
17603 (ELIMINABLE_REGS): Add for HARD_FRAME_POINTER_REGNUM.
17604 * config/csky/csky.c (get_csky_live_regs, csky_can_eliminate,
17605 csky_initial_elimination_offset, csky_expand_prologue,
17606 csky_expand_epilogue): Add for HARD_FRAME_POINTER_REGNUM.
17608 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
17610 * config/csky/csky.c (csky_option_override):
17611 Init csky_arch_isa_features[] in advance, so TARGET_DSP
17612 and TARGET_DIV can be set well.
17614 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
17616 * config/csky/constraints.md ("l", "h"): Delete.
17617 * config/csky/csky.h (reg_class, REG_CLASS_NAMES,
17618 REG_CLASS_CONTENTS): Delete LO_REGS and HI_REGS.
17619 * config/csky/csky.c (regno_reg_classm,
17620 csky_secondary_reload, csky_register_move_cost):
17621 Use HILO_REGS instead of LO_REGS and HI_REGS.
17623 2021-05-25 Geng Qi <gengqi@linux.alibaba.com>
17625 * config/csky/constraints.md ("W"): New constriant for mem operand
17626 with base reg, index register.
17627 ("Q"): Renamed and modified "csky_valid_fpuv2_mem_operand" to
17628 "csky_valid_mem_constraint_operand" to deal with both "Q" and "W"
17630 ("Dv"): New constraint for const double value that can be used at
17632 * config/csky/csky-modes.def (HFmode): New mode.
17633 * config/csky/csky-protos.h (csky_valid_fpuv2_mem_operand): Rename
17634 to "csky_valid_mem_constraint_operand" and support new constraint
17636 (csky_get_movedouble_length): New.
17637 (fpuv3_output_move): New.
17638 (fpuv3_const_double): New.
17639 * config/csky/csky.c (csky_option_override): New arch CK860 with fpv3.
17640 (decompose_csky_address): Refine.
17641 (csky_print_operand): New "CONST_DOUBLE" operand.
17642 (csky_output_move): Support fpv3 instructions.
17643 (csky_get_movedouble_length): New.
17644 (fpuv3_output_move): New.
17645 (fpuv3_const_double): New.
17646 (csky_emit_compare): Cover float comparsion.
17647 (csky_emit_compare_float): Refine.
17648 (csky_vaild_fpuv2_mem_operand): Rename to
17649 "csky_valid_mem_constraint_operand" and support new constraint "W".
17650 (ck860_rtx_costs): New.
17651 (csky_rtx_costs): Add the cost calculation of CK860.
17652 (regno_reg_class): New vregs for fpuv3.
17653 (csky_dbx_regno): Likewise.
17654 (csky_cpu_cpp_builtins): New builtin macro for fpuv3.
17655 (csky_conditional_register_usage): Suporrot fpuv3.
17656 (csky_dwarf_register_span): Suporrot fpuv3.
17657 (csky_init_builtins, csky_mangle_type): Support "__fp16" type.
17658 (ck810_legitimate_index_p): Support fp16.
17659 * config/csky/csky.h (TARGET_TLS): ADD CK860.
17660 (CSKY_VREG_P, CSKY_VREG_LO_P, CSKY_VREG_HI_P): Support fpuv3.
17661 (TARGET_SINGLE_FPU): Support fpuv3.
17662 (TARGET_SUPPORT_FPV3): New.
17663 (FIRST_PSEUDO_REGISTER): Change to 202 to hold the new fpuv3 registers.
17664 (FIXED_REGISTERS, CALL_REALLY_USED_REGISTERS, REGISTER_NAMES,
17665 REG_CLASS_CONTENTS): Support fpuv3.
17666 * config/csky/csky.md (movsf): Move to cksy_insn_fpu.md and refine.
17667 (csky_movsf_fpv2): Likewise.
17668 (ck801_movsf): Likewise.
17669 (csky_movsf): Likewise.
17671 (csky_movdf_fpv2): Likewise.
17672 (ck801_movdf): Likewise.
17673 (csky_movdf): Likewise.
17674 (movsicc): Refine. Use "comparison_operatior" instead of
17675 "ordered_comparison_operatior".
17676 (addsicc): Likewise.
17677 (CSKY_FIRST_VFP3_REGNUM, CSKY_LAST_VFP3_REGNUM): New constant.
17678 (call_value_internal_vh): New.
17679 * config/csky/csky_cores.def (CK860): New arch and cpu.
17684 * config/csky/csky_insn_fpu.md: Refactor. Separate all float patterns
17685 into emit-patterns and match-patterns, remain the emit-patterns here,
17686 and move the match-patterns to csky_insn_fpuv2.md or
17687 csky_insn_fpuv3.md.
17688 * config/csky/csky_insn_fpuv2.md: New file for fpuv2 instructions.
17689 * config/csky/csky_insn_fpuv3.md: New file and new patterns for fpuv3
17691 * config/csky/csky_isa.def (fcr): New.
17696 (CK860): New definition for ck860.
17697 * config/csky/csky_tables.opt (ck860): New processors ck860,
17698 ck860f. And new arch ck860.
17703 * config/csky/predicates.md (csky_float_comparsion_operator): Delete
17704 "geu", "gtu", "leu", "ltu", which will never appear at float comparison.
17705 * config/csky/t-csky-elf: Support 860.
17706 * config/csky/t-csky-linux: Likewise.
17707 * doc/md.texi: Add "Q" and "W" constraints for C-SKY.
17709 2021-05-24 Aaron Sawdey <acsawdey@linux.ibm.com>
17711 * config/rs6000/genfusion.pl (gen_logical_addsubf): Refactor to
17712 add generation of logical-add and add-logical fusion pairs.
17713 * config/rs6000/rs6000-cpus.def: Add new fusion to ISA 3.1 mask
17715 * config/rs6000/rs6000.c (rs6000_option_override_internal): Turn on
17716 logical-add and add-logical fusion by default.
17717 * config/rs6000/rs6000.opt: Add -mpower10-fusion-logical-add and
17718 -mpower10-fusion-add-logical options.
17719 * config/rs6000/fusion.md: Regenerate file.
17721 2021-05-24 Aldy Hernandez <aldyh@redhat.com>
17723 * value-range.cc (irange::legacy_equal_p): Check type when
17724 comparing VR_VARYING types.
17725 (range_tests_legacy): Test comparing VARYING ranges of different
17728 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
17730 * config/aarch64/aarch64.c (neoversen1_tunings):
17731 Enable AARCH64_EXTRA_TUNE_CHEAP_SHIFT_EXTEND.
17733 2021-05-24 Wilco Dijkstra <wdijkstr@arm.com>
17735 * config/aarch64/aarch64.c (aarch64_classify_symbol): Use GOT for
17736 extern weak symbols. Limit symbol offsets for non-GOT symbols with
17739 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
17741 * config/arm/neon.md (vec_load_lanesxi<mode>)
17742 (vec_store_lanexoi<mode>): Move ...
17743 * config/arm/vec-common.md: here.
17745 2021-05-24 Christophe Lyon <christophe.lyon@linaro.org>
17747 * config/arm/neon.md (vec_load_lanesoi<mode>)
17748 (vec_store_lanesoi<mode>): Move ...
17749 * config/arm/vec-common.md: here.
17751 2021-05-24 liuhongt <hongtao.liu@intel.com>
17754 * config/i386/i386.c (ix86_gimple_fold_builtin): Replacing
17755 stmt with GIMPLE_NOP when lhs doesn't exist.
17757 2021-05-23 Uroš Bizjak <ubizjak@gmail.com>
17760 * config/i386/mmx.md (*push<VI_32:mode>2_rex64):
17761 New instruction pattern.
17762 (*push<VI_32:mode>2): Ditto.
17763 (push splitter for SSE registers): New splitter.
17765 2021-05-23 Andrew Pinski <apinski@marvell.com>
17767 * match.pd ((A & C) != 0 ? D : 0): Limit to non pointer types.
17769 2021-05-22 Aaron Sawdey <acsawdey@linux.ibm.com>
17771 * config/rs6000/genfusion.pl (gen_addadd): Fix incorrect attr types.
17772 * config/rs6000/fusion.md: Regenerate file.
17774 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
17776 * config/rs6000/genfusion.pl (gen_addadd): New function.
17777 * config/rs6000/fusion.md: Regenerate file.
17778 * config/rs6000/rs6000-cpus.def: Add
17779 OPTION_MASK_P10_FUSION_2ADD to masks.
17780 * config/rs6000/rs6000.c (rs6000_option_override_internal):
17781 Handle default value of OPTION_MASK_P10_FUSION_2ADD.
17782 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2add.
17784 2021-05-21 Jakub Jelinek <jakub@redhat.com>
17786 PR middle-end/99928
17787 * tree.h (OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET): Define.
17788 * gimplify.c (enum gimplify_omp_var_data): Fix up
17789 GOVD_MAP_HAS_ATTACHMENTS value, add GOVD_FIRSTPRIVATE_IMPLICIT.
17790 (omp_lastprivate_for_combined_outer_constructs): If combined target
17791 has GOVD_FIRSTPRIVATE_IMPLICIT set for the decl, change it to
17792 GOVD_MAP | GOVD_SEEN.
17793 (gimplify_scan_omp_clauses): Set GOVD_FIRSTPRIVATE_IMPLICIT for
17794 firstprivate clauses with OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT.
17795 (gimplify_adjust_omp_clauses): For firstprivate clauses with
17796 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT either clear that bit and
17797 OMP_CLAUSE_FIRSTPRIVATE_IMPLICIT_TARGET too, or remove it and
17798 let it be replaced by implicit map clause.
17800 2021-05-21 Jakub Jelinek <jakub@redhat.com>
17802 PR middle-end/99928
17803 * gimplify.c (omp_lastprivate_for_combined_outer_constructs): New
17805 (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LASTPRIVATE>: Use it.
17806 (gimplify_omp_for): Likewise.
17808 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17810 PR middle-end/90115
17811 * omp-low.c (oacc_privatization_candidate_p): Reject 'static',
17812 'external' in blocks.
17814 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17816 PR middle-end/90115
17817 * flag-types.h (enum openacc_privatization): New.
17818 * params.opt (-param=openacc-privatization): New.
17819 * doc/invoke.texi (openacc-privatization): Document it.
17820 * omp-general.h (get_openacc_privatization_dump_flags): New
17822 * omp-low.c (oacc_privatization_candidate_p): Add diagnostics.
17823 * omp-offload.c (execute_oacc_device_lower)
17824 <IFN_UNIQUE_OACC_PRIVATE>: Re-work diagnostics.
17825 * target.def (goacc.adjust_private_decl): Add 'location_t'
17827 * doc/tm.texi: Regenerate.
17828 * config/gcn/gcn-protos.h (gcn_goacc_adjust_private_decl): Adjust.
17829 * config/gcn/gcn-tree.c (gcn_goacc_adjust_private_decl): Likewise.
17830 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl):
17831 Likewise. Preserve it for...
17832 (nvptx_goacc_expand_var_decl): ... use here.
17834 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17836 * doc/sourcebuild.texi (Other attributes): Document '__OPTIMIZE__'
17839 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17841 PR middle-end/90115
17842 * omp-low.c (oacc_privatization_candidate_p): New function.
17843 (oacc_privatization_scan_clause_chain)
17844 (oacc_privatization_scan_decl_chain): Use it. Also
17845 'gcc_checking_assert' that we're not seeing duplicates.
17847 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17849 PR middle-end/90115
17850 * omp-offload.c (execute_oacc_device_lower): Skip processing if no
17853 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17855 PR middle-end/90115
17856 * omp-offload.c (execute_oacc_device_lower): Explain.
17858 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17860 PR middle-end/90115
17861 * omp-offload.c (execute_oacc_device_lower)
17862 <IFN_UNIQUE_OACC_PRIVATE>: Diagnose and handle for 'level == -1'
17864 * internal-fn.c (expand_UNIQUE): Don't expect
17865 'IFN_UNIQUE_OACC_PRIVATE'.
17867 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17869 PR middle-end/90115
17870 * omp-low.c (lower_omp_for): Don't evaluate OpenMP 'for' clauses.
17872 2021-05-21 Thomas Schwinge <thomas@codesourcery.com>
17874 PR middle-end/90115
17875 * config/nvptx/nvptx.c (nvptx_goacc_adjust_private_decl)
17876 (nvptx_goacc_expand_var_decl): Tighten.
17878 2021-05-21 Julian Brown <julian@codesourcery.com>
17879 Chung-Lin Tang <cltang@codesourcery.com>
17880 Thomas Schwinge <thomas@codesourcery.com>
17882 PR middle-end/90115
17883 * doc/tm.texi.in (TARGET_GOACC_EXPAND_VAR_DECL)
17884 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Add documentation hooks.
17885 * doc/tm.texi: Regenerate.
17886 * expr.c (expand_expr_real_1): Expand decls using the
17887 expand_var_decl OpenACC hook if defined.
17888 * internal-fn.c (expand_UNIQUE): Handle IFN_UNIQUE_OACC_PRIVATE.
17889 * internal-fn.h (IFN_UNIQUE_CODES): Add OACC_PRIVATE.
17890 * omp-low.c (omp_context): Add oacc_privatization_candidates
17892 (lower_oacc_reductions): Add PRIVATE_MARKER parameter. Insert
17894 (lower_oacc_head_tail): Add PRIVATE_MARKER parameter. Modify
17895 private marker's gimple call arguments, and pass it to
17896 lower_oacc_reductions.
17897 (oacc_privatization_scan_clause_chain)
17898 (oacc_privatization_scan_decl_chain, lower_oacc_private_marker):
17900 (lower_omp_for, lower_omp_target, lower_omp_1): Use these.
17901 * omp-offload.c (convert.h): Include.
17902 (oacc_loop_xform_head_tail): Treat private-variable markers like
17903 fork/join when transforming head/tail sequences.
17904 (struct var_decl_rewrite_info): Add struct.
17905 (oacc_rewrite_var_decl, is_sync_builtin_call): New functions.
17906 (execute_oacc_device_lower): Support rewriting gang-private
17907 variables using target hook, and fix up addr_expr and var_decl
17909 * target.def (adjust_private_decl, expand_var_decl): New hooks.
17910 * config/gcn/gcn-protos.h (gcn_goacc_adjust_gangprivate_decl):
17912 (gcn_goacc_adjust_private_decl): ...this.
17913 * config/gcn/gcn-tree.c (gcn_goacc_adjust_gangprivate_decl):
17915 (gcn_goacc_adjust_private_decl): ...this. Add LEVEL parameter.
17916 * config/gcn/gcn.c (TARGET_GOACC_ADJUST_GANGPRIVATE_DECL): Rename
17917 definition using gcn_goacc_adjust_gangprivate_decl...
17918 (TARGET_GOACC_ADJUST_PRIVATE_DECL): ...to this, using
17919 gcn_goacc_adjust_private_decl.
17920 * config/nvptx/nvptx.c (tree-pretty-print.h): Include.
17921 (gang_private_shared_size): New global variable.
17922 (gang_private_shared_align): Likewise.
17923 (gang_private_shared_sym): Likewise.
17924 (gang_private_shared_hmap): Likewise.
17925 (nvptx_option_override): Initialize these.
17926 (nvptx_file_end): Output gang_private_shared_sym.
17927 (nvptx_goacc_adjust_private_decl, nvptx_goacc_expand_var_decl):
17929 (nvptx_set_current_function): Clear gang_private_shared_hmap.
17930 (TARGET_GOACC_ADJUST_PRIVATE_DECL): Define hook.
17931 (TARGET_GOACC_EXPAND_VAR_DECL): Likewise.
17933 2021-05-21 H.J. Lu <hjl.tools@gmail.com>
17935 * config/i386/i386-modes.def (MAX_BITSIZE_MODE_ANY_INT): Removed.
17937 2021-05-21 Richard Biener <rguenther@suse.de>
17938 H.J. Lu <hjl.tools@gmail.com>
17940 PR middle-end/90773
17941 * expr.c (expand_constructor): Elide expand_constructor if
17942 move by pieces is preferred.
17944 2021-05-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
17946 * config/aarch64/aarch64-builtins.c (aarch64_call_properties):
17947 Take a flag and mode value as arguments.
17948 (aarch64_modifies_global_state_p): Likewise.
17949 (aarch64_reads_global_state_p): Likewise.
17950 (aarch64_could_trap_p): Likewise.
17951 (aarch64_get_attributes): Likewise.
17952 (aarch64_init_simd_builtins): Adjust callsite of above.
17953 (aarch64_init_fcmla_laneq_builtins): Use aarch64_get_attributes to get
17954 function attributes to apply to builtins.
17955 (aarch64_init_crc32_builtins): Likewise.
17956 (aarch64_init_builtin_rsqrt): Likewise.
17958 2021-05-21 Aaron Sawdey <acsawdey@linux.ibm.com>
17960 * config/rs6000/rs6000.md (define_attr "type"): Add types for fusion.
17961 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10): Use new fusion types.
17962 (gen_2logical): Use new fusion types.
17963 * config/rs6000/fusion.md: Regenerate.
17965 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
17968 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
17969 Handle V4QI and V2HI modes.
17970 (ix86_expand_sse_movcc): Ditto.
17971 * config/i386/mmx.md (*<sat_plusminus:insn><VI_32:mode>3):
17972 New instruction pattern.
17973 (*eq<VI_32:mode>3): Ditto.
17974 (*gt<VI_32:mode>3): Ditto.
17975 (*xop_pcmov_<VI_32:mode>): Ditto.
17976 (mmx_pblendvb32): Ditto.
17977 (mmx_pblendvb64): Rename from mmx_pblendvb.
17978 (vec_cmp<VI_32:mode><VI_32:mode>): New expander.
17979 (vec_cmpu<VI_32:mode><VI_32:mode>): Ditto.
17980 (vcond<VI_32:mode><VI_32:mode>): Ditto.
17981 (vcondu<VI_32:mode><VI_32:mode>): Ditto.
17982 (vcond_mask_<VI_32:mode><VI_32:mode>): Ditto.
17984 2021-05-21 Jakub Jelinek <jakub@redhat.com>
17986 PR tree-optimization/94589
17987 * tree-ssa-phiopt.c (spaceship_replacement): For integral rhs1 and
17988 rhs2, treat x <= 4 equivalently to x < 5 etc. In cmp1 and cmp2 (if
17989 not the same as cmp3) treat <= the same as < and >= the same as >.
17990 Don't require that cond2_phi_edge is true edge, instead take
17991 false/true edges into account based on cmp1/cmp2 comparison kinds.
17993 2021-05-21 Uroš Bizjak <ubizjak@gmail.com>
17996 * config/i386/mmx.md (SMAXMIN_MMXMODEI): New mode iterator.
17997 (<smaxmin:code><SMAXMIN_MMXMODEI:mode>3): Macroize expander
17998 from <smaxmin:code>v4hi3> and <smaxmin:code><MMXMODE14:mode>3
17999 using SMAXMIN_MMXMODEI mode iterator.
18000 (*<smaxmin:code>v4qi3): New insn pattern.
18001 (*<smaxmin:code>v2hi3): Ditto.
18002 (SMAXMIN_VI_32): New mode iterator.
18003 (<smaxmin:code><SMAXMIN_VI_32>mode3): New expander.
18004 (UMAXMIN_MMXMODEI): New mode iterator.
18005 (<umaxmin:code><UMAXMIN_MMXMODEI:mode>3): Macroize expander
18006 from <umaxmin:code>v8qi3> and <umaxmin:code><MMXMODE24:mode>3
18007 using UMAXMIN_MMXMODEI mode iterator.
18008 (*<umaxmin:code>v4qi3): New insn pattern.
18009 (*<umaxmin:code>v2hi3): Ditto.
18010 (UMAXMIN_VI_32): New mode iterator.
18011 (<umaxmin:code><UMAXMIN_VI_32>mode3): New expander.
18012 (abs<VI_32:mode>2): New insn pattern.
18013 (ssse3_abs<MMXMODEI:mode>2, abs<MMXMODEI:mode>2): Move from ...
18014 * config/i386/sse.md: ... here.
18016 2021-05-20 Clement Chigot <clement.chigot@atos.net>
18017 David Edelsohn <dje.gcc@gmail.com>
18019 * collect2.c (scan_prog_file): Issue non-fatal warning for
18022 2021-05-20 Jonathan Wakely <jwakely@redhat.com>
18024 * doc/invoke.texi (-Wno-c++11-extensions)
18025 (-Wno-c++14-extensions, -Wno-c++17-extensions)
18026 (-Wno-c++20-extensions, -Wno-c++23-extensions): Document
18029 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
18031 * config/c6x/c6x.c (c6x_output_file_unwind): Use dwarf_debuginfo_p.
18032 * config/darwin.c (darwin_override_options): Likewise.
18033 * config/i386/cygming.h (DBX_REGISTER_NUMBER): Likewise.
18034 * config/i386/darwin.h (DBX_REGISTER_NUMBER): Likewise.
18035 (DWARF2_FRAME_REG_OUT): Likewise.
18036 * config/mips/mips.c (mips_output_filename): Likewise.
18037 * config/rs6000/rs6000.c (rs6000_xcoff_declare_function_name):
18039 (rs6000_dbx_register_number): Likewise.
18040 * dbxout.c: Include flags.h.
18041 * dwarf2cfi.c (cfi_label_required_p): Likewise.
18042 (dwarf2out_do_frame): Likewise.
18043 * except.c: Include flags.h.
18044 * final.c (dwarf2_debug_info_emitted_p): Likewise.
18045 (final_scan_insn_1): Likewise.
18046 * flags.h (dwarf_debuginfo_p): New function declaration.
18047 * opts.c (dwarf_debuginfo_p): New function definition.
18048 * targhooks.c (default_debug_unwind_info): Use dwarf_debuginfo_p.
18049 * toplev.c (process_options): Likewise.
18051 2021-05-20 Indu Bhagat <indu.bhagat@oracle.com>
18053 * common.opt: Change type to support bitmasks.
18054 * flag-types.h (enum debug_info_type): Rename enumerator constants.
18055 (NO_DEBUG): New bitmask.
18056 (DBX_DEBUG): Likewise.
18057 (DWARF2_DEBUG): Likewise.
18058 (XCOFF_DEBUG): Likewise.
18059 (VMS_DEBUG): Likewise.
18060 (VMS_AND_DWARF2_DEBUG): Likewise.
18061 * flags.h (debug_set_to_format): New function declaration.
18062 (debug_set_count): Likewise.
18063 (debug_set_names): Likewise.
18064 * opts.c (debug_type_masks): Array of bitmasks for debug formats.
18065 (debug_set_to_format): New function definition.
18066 (debug_set_count): Likewise.
18067 (debug_set_names): Likewise.
18068 (set_debug_level): Update access to debug_type_names.
18069 * toplev.c: Likewise.
18071 2021-05-20 Martin Sebor <msebor@redhat.com>
18073 PR middle-end/100684
18074 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Handle C++ lambda.
18076 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
18079 * config/i386/i386.md (isa): Remove x64_bmi.
18080 (enabled): Remove x64_bmi.
18081 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
18082 Remove general register alternative.
18083 (*andnot<VI_32:mode>3): Ditto.
18084 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
18085 (*<any_logic:code><VI_32:mode>3): Ditto.
18087 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
18089 * config/arm/arm.c: Include head files tree-vectorizer.h and
18092 2021-05-20 Uroš Bizjak <ubizjak@gmail.com>
18095 * config/i386/mmx.md (Yv_Yw): Revert adding V4QI and V2HI modes.
18096 (*<plusminus:insn><VI_32:mode>3): Use Yw instad of <Yv_Yw> constrint.
18097 (<s>mulv4hi3_highpart): New expander.
18098 (*<s>mulv2hi3_highpart): New insn pattern.
18099 (<s>mulv2hi3_higpart): New expander.
18100 (*<any_shift:insn>v2hi3): New insn pattern.
18101 (<any_shift:insn>v2hi3): New expander.
18102 * config/i386/sse.md (smulhrsv2hi3): New expander.
18103 (*smulhrsv2hi3): New insn pattern.
18105 2021-05-20 Kewen Lin <linkw@linux.ibm.com>
18107 * doc/invoke.texi (vect-inner-loop-cost-factor): Document new
18109 * params.opt (vect-inner-loop-cost-factor): New.
18110 * targhooks.c (default_add_stmt_cost): Replace hardcoded factor
18111 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR, include head file
18112 tree-vectorizer.h and its required ones.
18113 * config/aarch64/aarch64.c (aarch64_add_stmt_cost): Replace
18114 hardcoded factor 50 with LOOP_VINFO_INNER_LOOP_COST_FACTOR.
18115 * config/arm/arm.c (arm_add_stmt_cost): Likewise.
18116 * config/i386/i386.c (ix86_add_stmt_cost): Likewise.
18117 * config/rs6000/rs6000.c (rs6000_add_stmt_cost): Likewise.
18118 * tree-vect-loop.c (vect_compute_single_scalar_iteration_cost):
18120 (_loop_vec_info::_loop_vec_info): Init inner_loop_cost_factor.
18121 * tree-vectorizer.h (_loop_vec_info): Add inner_loop_cost_factor.
18122 (LOOP_VINFO_INNER_LOOP_COST_FACTOR): New macro.
18124 2021-05-20 Christophe Lyon <christophe.lyon@linaro.org>
18125 Torbjörn Svensson <torbjorn.svensson@st.com>
18128 * doc/cpp.texi (Common Predefined Macros): Document __FILE_NAME__.
18130 2021-05-20 Jakub Jelinek <jakub@redhat.com>
18132 PR middle-end/99928
18133 * gimplify.c (gimplify_scan_omp_clauses) <case OMP_CLAUSE_LINEAR>: For
18134 explicit linear clause when combined with target, make it map(tofrom:)
18135 instead of no clause or firstprivate.
18137 2021-05-20 Jakub Jelinek <jakub@redhat.com>
18139 PR tree-optimization/94589
18140 * match.pd ((X & Y) == X -> (X & ~Y) == 0): Simplify even in presence
18141 of integral conversions.
18143 2021-05-19 Andrew MacLeod <amacleod@redhat.com>
18145 * gimple-range.cc (fur_source::get_operand): New.
18146 (gimple_range_fold): Delete.
18147 (fold_using_range::fold_stmt): Move from gimple_ranger::calc_stmt.
18148 (fold_using_range::range_of_range_op): Move from gimple_ranger.
18149 (fold_using_range::range_of_address): Ditto.
18150 (fold_using_range::range_of_phi): Ditto.
18151 (fold_using_range::range_of_call): Ditto.
18152 (fold_using_range::range_of_builtin_ubsan_call): Move from
18153 range_of_builtin_ubsan_call.
18154 (fold_using_range::range_of_builtin_call): Move from
18155 range_of_builtin_call.
18156 (gimple_ranger::range_of_builtin_call): Delete.
18157 (fold_using_range::range_of_cond_expr): Move from gimple_ranger.
18158 (gimple_ranger::fold_range_internal): New.
18159 (gimple_ranger::range_of_stmt): Use new fold_using_range API.
18160 (fold_using_range::range_of_ssa_name_with_loop_info): Move from
18161 gimple_ranger. Improve ranges of SSA_NAMES when possible.
18162 * gimple-range.h (gimple_ranger): Remove various range_of routines.
18163 (class fur_source): New.
18164 (class fold_using_range): New.
18165 (fur_source::fur_source): New.
18167 * vr-values.c (vr_values::extract_range_basic): Use fold_using_range
18168 instead of range_of_builtin_call.
18170 2021-05-19 Jonathan Wakely <jwakely@redhat.com>
18172 * doc/cpp.texi (Common Predefined Macros): Update documentation
18173 for the __GXX_EXPERIMENTAL_CXX0X__ macro.
18175 2021-05-19 Alex Coplan <alex.coplan@arm.com>
18178 * config/arm/arm.md (nonsecure_call_internal): Always ensure
18179 callee's address is in a register.
18181 2021-05-19 Geng Qi <gengqi@linux.alibaba.com>
18183 * common/config/riscv/riscv-common.c
18184 (riscv_subset_list::parsing_subset_version): Properly parse the letter
18186 (riscv_subset_list::parse_std_ext,
18187 riscv_subset_list::parse_multiletter_ext): To handle errors generated
18188 in riscv_subset_list::parsing_subset_version.
18190 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18192 * config/aarch64/aarch64-simd.md: Use "neon_move_narrow_q"
18193 type attribute in patterns generating XTN(2).
18195 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18197 * config/aarch64/aarch64-simd.md (aarch64_simd_vec_pack_trunc_<mode>):
18198 Remove as duplicate of...
18199 (aarch64_xtn<mode>): This.
18200 (aarch64_xtn2<mode>_le): Move position in file.
18201 (aarch64_xtn2<mode>_be): Move position in file.
18202 (aarch64_xtn2<mode>): Move position in file.
18203 (vec_pack_trunc_<mode>): Define as an expander.
18205 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18207 * config/aarch64/aarch64-simd-builtins.def: Split builtin
18208 generation for aarch64_<sur>q<r>shr<u>n_n<mode> pattern into
18209 separate scalar and vector generators.
18210 * config/aarch64/aarch64-simd.md
18211 (aarch64_<sur>q<r>shr<u>n_n<mode>): Define as an expander and
18213 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_le): This and...
18214 (aarch64_<sur>q<r>shr<u>n_n<mode>_insn_be): This.
18215 * config/aarch64/iterators.md: Define SD_HSDI iterator.
18217 2021-05-19 Jonathn Wright <jonathan.wright@arm.com>
18219 * config/aarch64/aarch64-simd.md: Use UNSPEC_SQXTUN instead
18221 * config/aarch64/iterators.md: Remove UNSPEC_SQXTUN2.
18223 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18225 * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
18226 Implement as an expand emitting a big/little endian
18227 instruction pattern.
18228 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
18229 (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.
18231 2021-05-19 Jonathan Wright <jonathan.wright@arm.com>
18233 * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
18234 Implement as an expand emitting a big/little endian
18235 instruction pattern.
18236 (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
18237 (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
18238 * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
18239 UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.
18241 2021-05-19 Richard Biener <rguenther@suse.de>
18243 PR middle-end/100672
18244 * fold-const.c (fold_negate_expr_1): Use element_precision.
18245 (negate_expr_p): Likewise.
18247 2021-05-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
18249 * config/aarch64/iterators.md (SVE_PRED_LOAD): New iterator.
18250 (pred_load): New int attribute.
18251 * config/aarch64/aarch64-sve.md
18252 (aarch64_load_<ANY_EXTEND:optab><SVE_HSDI:mode><SVE_PARTIAL_I:mode>): Use
18253 SVE_PRED_LOAD enum iterator and corresponding pred_load attribute.
18254 * config/aarch64/aarch64-sve-builtins-base.cc (expand): Update call to
18255 code_for_aarch64_load.
18257 2021-05-19 Richard Biener <rguenther@suse.de>
18259 * cfgexpand.c (discover_nonconstant_array_refs_r): Make
18260 sure TARGET_MEM_REF bases are expanded as memory.
18261 * tree-ssa-operands.c (operands_scanner::get_tmr_operands):
18262 Do not mark TARGET_MEM_REF bases addressable.
18263 * tree-ssa.c (non_rewritable_mem_ref_base): Handle
18264 TARGET_MEM_REF bases as never rewritable.
18265 * gimple-walk.c (walk_stmt_load_store_addr_ops): Do not
18266 walk TARGET_MEM_REF bases as address-takens.
18267 * tree-ssa-dce.c (ref_may_be_aliased): Handle TARGET_MEM_REF.
18269 2021-05-19 Richard Biener <rguenther@suse.de>
18271 * builtins.c (get_object_alignment_1): Strip outer
18273 * tree-dfa.c (get_ref_base_and_extent): Handle outer
18274 WITH_SIZE_EXPR for size processing and process the
18276 * tree-ssa-alias.c (ao_ref_base_alias_set): Strip
18277 outer WITH_SIZE_EXPR.
18278 (ao_ref_base_alias_ptr_type): Likewise.
18279 (refs_may_alias_p_2): Allow WITH_SIZE_EXPR in ref->ref
18280 and handle that accordingly, stripping it for the
18281 core alias workers.
18282 * tree.c (get_base_address): Handle WITH_SIZE_EXPR by
18283 looking through it instead of returning NULL.
18285 2021-05-19 Jakub Jelinek <jakub@redhat.com>
18287 PR middle-end/100576
18288 * builtins.c (check_read_access): Convert bound to size_type_node if
18291 2021-05-19 Richard Biener <rguenther@suse.de>
18293 * tree-cfg.c (verify_types_in_gimple_min_lval): Inline...
18294 (verify_types_in_gimple_reference): ... here. Sanitize.
18295 (verify_gimple_call): Verify references in LHS and arguments.
18296 (verify_gimple_assign_single): Reject WITH_SIZE_EXPR.
18298 2021-05-19 Uroš Bizjak <ubizjak@gmail.com>
18300 * config/i386/i386.h (VALID_INT_MODE_P):
18301 Add V8QI, V4HI and V2SI modes for TARGET_64BIT.
18302 * config/i386/i386.md (isa): Add x64_bmi.
18303 (enabled): Handle x64_bmi.
18304 * config/i386/mmx.md (mmx_andnot<MMXMODEI:mode>3):
18305 Add alternative using 64bit general registers.
18306 (*mmx_<any_logic:code><MMXMODEI:mode>3): Ditto.
18308 2021-05-19 Jakub Jelinek <jakub@redhat.com>
18310 PR middle-end/99928
18311 * tree.h (OMP_MASTER_COMBINED): Define.
18312 * gimplify.c (gimplify_scan_omp_clauses): Rewrite lastprivate
18313 handling for outer combined/composite constructs to a loop.
18314 Handle lastprivate on combined target.
18315 (gimplify_expr): Formatting fix.
18317 2021-05-19 Xionghu Luo <luoxhu@linux.ibm.com>
18319 * passes.def: Add sink_code pass before store_merging.
18320 * tree-ssa-sink.c (pass_sink_code:clone): New.
18322 2021-05-18 Bill Schmidt <wschmidt@linux.ibm.com>
18324 * config/rs6000/freebsd64.h (ADJUST_FIELD_ALIGN): Remove call to
18325 rs6000_special_adjust_field_align_p.
18326 * config/rs6000/linux64.h (ADJUST_FIELD_ALIGN): Likewise.
18327 * config/rs6000/rs6000-call.c (rs6000_function_arg_boundary):
18328 Remove ABI warning.
18329 (rs6000_function_arg): Likewise.
18330 * config/rs6000/rs6000-protos.h
18331 (rs6000_special_adjust_field_align_p): Remove prototype.
18332 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align_p):
18334 * config/rs6000/sysv4.h (ADJUST_FIELD_ALIGN): Remove call to
18335 rs6000_special_adjust_field_align_p.
18337 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
18340 * config/i386/i386.h (VALID_SSE2_REG_MODE):
18341 Add V4QI and V2HI modes.
18342 (VALID_INT_MODE_P): Ditto.
18343 * config/i386/mmx.md (VI_32): New mode iterator.
18344 (mmxvecsize): Handle V4QI and V2HI.
18346 (mov<VI_32:mode>): New expander.
18347 (*mov<mode>_internal): New insn pattern.
18348 (movmisalign<VI_32:mode>): New expander.
18349 (neg<VI_32:mode>): New expander.
18350 (<plusminus:insn><VI_32:mode>3): New expander.
18351 (*<plusminus:insn><VI_32:mode>3): New insn pattern.
18352 (mulv2hi3): New expander.
18353 (*mulv2hi3): New insn pattern.
18354 (one_cmpl<VI_32:mode>2): New expander.
18355 (*andnot<VI_32:mode>3): New insn pattern.
18356 (<any_logic:code><VI_32:mode>3): New expander.
18357 (*<any_logic:code><VI_32:mode>3): New insn pattern.
18359 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
18361 * config/i386/sse.md (<any_extend:insn>v4qiv4di2):
18362 Fix a mode mismatch with operand 1.
18364 2021-05-18 Uroš Bizjak <ubizjak@gmail.com>
18367 * config/i386/i386-expand.c (split_double_mode): Return
18368 temporary register when simplify_gen_subreg fails with
18369 the high half od the paradoxical subreg.
18371 2021-05-18 Richard Biener <rguenther@suse.de>
18373 * cfgexpand.c (expand_one_var): Pass in forced_stack_var
18374 and honor it when expanding.
18375 (expand_used_vars_for_block): Pass through forced_stack_var.
18376 (expand_used_vars): Likewise.
18377 (discover_nonconstant_array_refs_r): Set bits in
18378 forced_stack_vars instead of marking vars TREE_ADDRESSABLE.
18379 (avoid_type_punning_on_regs): Likewise.
18380 (discover_nonconstant_array_refs): Likewise.
18381 (pass_expand::execute): Create and pass down forced_stack_var
18382 bitmap. For parameters and returns temporarily set
18383 TREE_ADDRESSABLE when expand_function_start.
18385 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
18387 * doc/sourcebuild.texi: Document 'dg-note'.
18389 2021-05-18 Tobias Burnus <tobias@codesourcery.com>
18392 * configure: Regenerate.
18393 * configure.ac (BUILD_CFLAG, BUILD_CXXFLAGS): Add $(CFLAGS-$@).
18395 2021-05-18 Thomas Schwinge <thomas@codesourcery.com>
18397 * gimple.h (is_gimple_omp_oacc): Tighten.
18398 * omp-low.c (check_omp_nesting_restrictions): Adjust.
18400 2021-05-18 Richard Biener <rguenther@suse.de>
18402 * tree-ssa-operands.c (mark_address_taken): Simplify.
18404 2021-05-18 Martin Liska <mliska@suse.cz>
18406 * config/gcn/mkoffload.c (STR): Redefine.
18407 * config/i386/intelmic-mkoffload.c (STR): Likewise.
18408 * config/nvptx/mkoffload.c (STR): Likewise.
18410 2021-05-18 Martin Liska <mliska@suse.cz>
18412 * common/config/aarch64/aarch64-common.c (aarch64_parse_extension):
18413 Use startswith function instead of strncmp.
18414 * common/config/bfin/bfin-common.c (bfin_handle_option): Likewise.
18415 * common/config/riscv/riscv-common.c (riscv_subset_list::parse): Likewise.
18416 * config/aarch64/aarch64-sve-builtins-shapes.cc (parse_type): Likewise.
18417 * config/aarch64/aarch64.c (aarch64_process_one_target_attr): Likewise.
18418 * config/alpha/alpha.c (alpha_elf_section_type_flags): Likewise.
18419 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
18420 * config/arm/arm.c (arm_file_start): Likewise.
18421 (arm_valid_target_attribute_rec): Likewise.
18422 (thumb1_md_asm_adjust): Likewise.
18423 * config/arm/driver-arm.c (host_detect_local_cpu): Likewise.
18424 * config/avr/avr.c (STR_PREFIX_P): Likewise.
18425 (avr_set_current_function): Likewise.
18426 (avr_handle_addr_attribute): Likewise.
18427 (avr_asm_output_aligned_decl_common): Likewise.
18428 (avr_asm_named_section): Likewise.
18429 (avr_section_type_flags): Likewise.
18430 (avr_asm_select_section): Likewise.
18431 * config/c6x/c6x.c (c6x_in_small_data_p): Likewise.
18432 (c6x_section_type_flags): Likewise.
18433 * config/darwin-c.c (darwin_cfstring_ref_p): Likewise.
18434 (darwin_objc_declare_unresolved_class_reference): Likewise.
18435 (darwin_objc_declare_class_definition): Likewise.
18436 * config/darwin.c (indirect_data): Likewise.
18437 (darwin_encode_section_info): Likewise.
18438 (darwin_objc2_section): Likewise.
18439 (darwin_objc1_section): Likewise.
18440 (machopic_select_section): Likewise.
18441 (darwin_globalize_label): Likewise.
18442 (darwin_label_is_anonymous_local_objc_name): Likewise.
18443 (darwin_asm_named_section): Likewise.
18444 (darwin_asm_output_dwarf_offset): Likewise.
18445 * config/frv/frv.c (frv_string_begins_with): Likewise.
18446 (frv_in_small_data_p): Likewise.
18447 * config/gcn/mkoffload.c (STR): Likewise.
18449 * config/i386/i386-builtins.c (get_builtin_code_for_version): Likewise.
18450 * config/i386/i386-options.c (ix86_option_override_internal): Likewise.
18451 * config/i386/i386.c (x86_64_elf_section_type_flags): Likewise.
18452 (ix86_md_asm_adjust): Likewise.
18453 * config/i386/intelmic-mkoffload.c (STR): Likewise.
18454 * config/i386/winnt.c (i386_pe_asm_named_section): Likewise.
18455 (i386_pe_file_end): Likewise.
18456 * config/ia64/ia64.c (ia64_in_small_data_p): Likewise.
18457 (ia64_section_type_flags): Likewise.
18458 * config/mips/driver-native.c (host_detect_local_cpu): Likewise.
18459 * config/mips/mips.c (mips_handle_interrupt_attr): Likewise.
18460 (mips16_stub_function_p): Likewise.
18461 (mips_function_rodata_section): Likewise.
18462 * config/msp430/msp430.c (msp430_mcu_name): Likewise.
18463 (msp430_function_section): Likewise.
18464 (msp430_section_type_flags): Likewise.
18465 (msp430_expand_helper): Likewise.
18466 * config/nios2/nios2.c (nios2_small_section_name_p): Likewise.
18467 (nios2_valid_target_attribute_rec): Likewise.
18468 * config/nvptx/mkoffload.c (process): Likewise.
18470 * config/pa/som.h: Likewise.
18471 * config/pdp11/pdp11.c (pdp11_output_ident): Likewise.
18472 * config/riscv/riscv.c (riscv_elf_select_rtx_section): Likewise.
18473 * config/rs6000/rs6000.c (VTABLE_NAME_P): Likewise.
18474 (rs6000_inner_target_options): Likewise.
18475 * config/s390/driver-native.c (s390_host_detect_local_cpu): Likewise.
18476 * config/sparc/driver-sparc.c (host_detect_local_cpu): Likewise.
18477 * config/vax/vax.c (vax_output_int_move): Likewise.
18478 * config/vms/vms-ld.c (startswith): Likewise.
18479 (process_args): Likewise.
18481 * config/vms/vms.c: Likewise.
18483 2021-05-18 Jakub Jelinek <jakub@redhat.com>
18485 PR rtl-optimization/100590
18486 * regcprop.c (copyprop_hardreg_forward_1): Only DCE dead sets if
18487 they are NONJUMP_INSN_P.
18489 2021-05-18 Jakub Jelinek <jakub@redhat.com>
18492 * function.c (push_dummy_function): Set DECL_ARTIFICIAL and
18493 DECL_ASSEMBLER_NAME on the fn_decl.
18495 2021-05-18 Jakub Jelinek <jakub@redhat.com>
18497 PR tree-optimization/94589
18498 * tree-ssa-phiopt.c (spaceship_replacement): Pattern match
18499 phi result used in (res & ~1) == 0 comparison as res >= 0 as
18500 res == 2 would be UB with -ffinite-math-only.
18502 2021-05-18 Martin Liska <mliska@suse.cz>
18504 * Makefile.in: genversion.o should depend on DATESTAMP.
18506 2021-05-18 Claudiu Zissulescu <claziss@synopsys.com>
18508 * config/arc/simdext.md (negv2si2): Remove round bracket.
18510 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
18512 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Define
18513 _Bool as macro expanding to _Bool.
18515 2021-05-18 Andreas Krebbel <krebbel@linux.ibm.com>
18518 * tree.c (build_reference_type_for_mode)
18519 (build_pointer_type_for_mode): Pick pointer mode if MODE argument
18521 (build_reference_type, build_pointer_type): Invoke
18522 build_*_type_for_mode with VOIDmode.
18524 2021-05-17 Andrew MacLeod <amacleod@redhat.com>
18526 PR tree-optimization/100512
18527 * gimple-range-cache.cc (ranger_cache::set_global_range): Mark const
18528 and non-zero pointer ranges as invariant.
18529 * gimple-range.cc (gimple_ranger::range_of_stmt): Remove pointer
18530 processing from here.
18532 2021-05-17 Tom de Vries <tdevries@suse.de>
18535 * config/nvptx/nvptx-protos.h (nvptx_output_atomic_insn): Declare
18536 * config/nvptx/nvptx.c (nvptx_output_barrier)
18537 (nvptx_output_atomic_insn): New function.
18538 (nvptx_print_operand): Add support for 'B'.
18539 * config/nvptx/nvptx.md: Use nvptx_output_atomic_insn for atomic
18542 2021-05-17 Aldy Hernandez <aldyh@redhat.com>
18544 PR tree-optimization/100349
18545 * vr-values.c (bounds_of_var_in_loop): Bail if scev returns
18548 2021-05-17 Tamar Christina <tamar.christina@arm.com>
18550 * config/aarch64/driver-aarch64.c (DEFAULT_ARCH): New.
18551 (host_detect_local_cpu): Use it.
18553 2021-05-17 Martin Liska <mliska@suse.cz>
18555 * doc/invoke.texi: Add 2 missing dots.
18557 2021-05-17 Marius Hillenbrand <mhillen@linux.ibm.com>
18559 PR bootstrap/100552
18560 * configure.ac: Replace pattern substitution with call to sed.
18561 * configure: Regenerate.
18563 2021-05-17 Richard Biener <rguenther@suse.de>
18565 PR middle-end/100582
18566 * tree.c (array_at_struct_end_p): Get to the base of the
18567 reference before looking for the underlying decl.
18569 2021-05-17 Joern Rennecke <joern.rennecke@embecosm.com>
18571 * genoutput.c (validate_insn_alternatives) Make "wrong number of
18572 alternatives" message more specific, and remove assumption on where
18575 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
18577 * config/arm/iterators.md (V16): New iterator.
18578 (VH_cvtto): New iterator.
18579 (v_cmp_result): Added V4HF and V8HF support.
18580 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result>): Use VDQWH.
18581 (vcond<mode><mode>): Likewise.
18582 (vcond_mask_<mode><v_cmp_result>): Likewise.
18583 (vcond<VH_cvtto><mode>): New expander.
18585 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
18587 * config/arm/arm-protos.h (arm_expand_vector_compare): Update
18589 * config/arm/arm.c (arm_expand_vector_compare): Add support for
18591 (arm_expand_vcond): Likewise.
18592 * config/arm/iterators.md (supf): Remove VCMPNEQ_S, VCMPEQQ_S,
18593 VCMPEQQ_N_S, VCMPNEQ_N_S.
18594 (VCMPNEQ, VCMPEQQ, VCMPEQQ_N, VCMPNEQ_N): Remove.
18595 * config/arm/mve.md (@mve_vcmp<mve_cmp_op>q_<mode>): Add '@' prefix.
18596 (@mve_vcmp<mve_cmp_op>q_f<mode>): Likewise.
18597 (@mve_vcmp<mve_cmp_op>q_n_f<mode>): Likewise.
18598 (@mve_vpselq_<supf><mode>): Likewise.
18599 (@mve_vpselq_f<mode>"): Likewise.
18600 * config/arm/neon.md (vec_cmp<mode><v_cmp_result): Enable for MVE
18601 and move to vec-common.md.
18602 (vec_cmpu<mode><mode>): Likewise.
18603 (vcond<mode><mode>): Likewise.
18604 (vcond<V_cvtto><mode>): Likewise.
18605 (vcondu<mode><v_cmp_result>): Likewise.
18606 (vcond_mask_<mode><v_cmp_result>): Likewise.
18607 * config/arm/unspecs.md (VCMPNEQ_U, VCMPNEQ_S, VCMPEQQ_S)
18608 (VCMPEQQ_N_S, VCMPNEQ_N_S, VCMPEQQ_U, CMPEQQ_N_U, VCMPNEQ_N_U)
18609 (VCMPGEQ_N_S, VCMPGEQ_S, VCMPGTQ_N_S, VCMPGTQ_S, VCMPLEQ_N_S)
18610 (VCMPLEQ_S, VCMPLTQ_N_S, VCMPLTQ_S, VCMPCSQ_N_U, VCMPCSQ_U)
18611 (VCMPHIQ_N_U, VCMPHIQ_U): Remove.
18612 * config/arm/vec-common.md (vec_cmp<mode><v_cmp_result): Moved
18614 (vec_cmpu<mode><mode>): Likewise.
18615 (vcond<mode><mode>): Likewise.
18616 (vcond<V_cvtto><mode>): Likewise.
18617 (vcondu<mode><v_cmp_result>): Likewise.
18618 (vcond_mask_<mode><v_cmp_result>): Likewise. Added unsafe math
18621 2021-05-17 liuhongt <hongtao.liu@intel.com>
18624 * config/i386/i386.c (ix86_gimple_fold_builtin): Use
18625 gsi_insert_seq_before instead.
18627 2021-05-17 Christophe Lyon <christophe.lyon@linaro.org>
18629 * doc/sourcebuild.texi (arm_qbit_ok): Rename into...
18630 (arm_sat_ok): ...this.
18632 2021-05-17 Martin Liska <mliska@suse.cz>
18634 * lto-wrapper.c (merge_flto_options): Factor out a new function.
18635 (merge_and_complain): Use it.
18636 (run_gcc): Merge also linker command line -flto=foo argument
18639 2021-05-16 Christophe Lyon <christophe.lyon@linaro.org>
18641 * config/arm/arm.h (CPP_SPEC): Remove error message about
18642 -mlittle-endian/-mbig-endian conflict.
18644 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
18646 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
18647 __ROP_PROTECT__ if -mrop-protect is selected.
18649 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
18651 * config/rs6000/rs6000-internal.h (rs6000_stack): Add
18652 rop_hash_save_offset and rop_hash_size.
18653 * config/rs6000/rs6000-logue.c (rs6000_stack_info): Compute
18654 rop_hash_size and rop_hash_save_offset.
18655 (debug_stack_info): Dump rop_hash_save_offset and rop_hash_size.
18656 (rs6000_emit_prologue): Emit hashst[p] in prologue.
18657 (rs6000_emit_epilogue): Emit hashchk[p] in epilogue.
18658 * config/rs6000/rs6000.md (unspec): Add UNSPEC_HASHST and
18660 (hashst): New define_insn.
18661 (hashchk): Likewise.
18663 2021-05-15 Bill Schmidt <wschmidt@linux.ibm.com>
18665 * config/rs6000/rs6000.c (rs6000_option_override_internal):
18666 Disable shrink wrap when inserting ROP-protect instructions.
18667 * config/rs6000/rs6000.opt (mrop-protect): New option.
18668 (mprivileged): Likewise.
18669 * doc/invoke.texi: Document mrop-protect and mprivileged.
18671 2021-05-15 Hans-Peter Nilsson <hp@axis.com>
18673 * reorg.c (fill_slots_from_thread): Reinstate code typoed out in
18676 2021-05-15 Martin Jambor <mjambor@suse.cz>
18679 2021-05-13 Martin Jambor <mjambor@suse.cz>
18681 PR tree-optimization/100453
18682 * tree-sra.c (sra_modify_assign): All const base accesses do not
18683 need refreshing, not just those from decl_pool.
18684 (sra_modify_assign): Do not refresh into a const base decl.
18686 2021-05-15 Jakub Jelinek <jakub@redhat.com>
18688 PR rtl-optimization/100342
18689 * regcprop.c (copy_value): When copying a source reg in a wider
18690 mode than it has recorded for the value, adjust recorded destination
18691 mode too or punt if !REG_CAN_CHANGE_MODE_P.
18693 2021-05-14 Jason Merrill <jason@redhat.com>
18695 * intl.h: Add comments.
18697 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
18699 * config/aarch64/aarch64-simd.md
18700 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>_internal): Split into...
18701 (aarch64_sqdmlsl2_lane<mode>_internal): ... This...
18702 (aarch64_sqdmlal2_lane<mode>_internal): ... And this.
18703 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>_internal): Split into ...
18704 (aarch64_sqdmlsl2_laneq<mode>_internal): ... This...
18705 (aarch64_sqdmlal2_laneq<mode>_internal): ... And this.
18706 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>_internal): Split into...
18707 (aarch64_sqdmlsl2_n<mode>_internal): ... This...
18708 (aarch64_sqdmlal2_n<mode>_internal): ... And this.
18710 2021-05-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
18713 * config/arm/arm_neon.h (vtst_s8): Replace call to vtst builtin with it's
18714 boolean logic equivalent.
18715 (vtst_s16): Likewise.
18716 (vtst_s32): Likewise.
18717 (vtst_u8): Likewise.
18718 (vtst_u16): Likewise.
18719 (vtst_u32): Likewise.
18720 (vtst_p8): Likewise.
18721 (vtst_p16): Likewise.
18722 (vtstq_s8): Likewise.
18723 (vtstq_s16): Likewise.
18724 (vtstq_s32): Likewise.
18725 (vtstq_u8): Likewise.
18726 (vtstq_u16): Likewise.
18727 (vtstq_u32): Likewise.
18728 (vtstq_p8): Likewise.
18729 (vtstq_p16): Likewise.
18730 * config/arm/arm_neon_builtins.def: Remove entry for vtst.
18731 * config/arm/neon.md (neon_vtst<mode>): Remove pattern.
18733 2021-05-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
18735 * config/aarch64/aarch64-simd.md (aarch64_sqdmlal2<mode>): Merge into...
18736 (aarch64_sqdml<SBINQOPS:as>l2<mode>): ... This.
18737 (aarch64_sqdmlsl2<mode>): Delete.
18738 (aarch64_sqdmlal2_lane<mode>): Merge this...
18739 (aarch64_sqdmlsl2_lane<mode>): ... And this...
18740 (aarch64_sqdml<SBINQOPS:as>l2_lane<mode>): ... Into this.
18741 (aarch64_sqdmlal2_laneq<mode>): Merge this...
18742 (aarch64_sqdmlsl2_laneq<mode>): ... And this...
18743 (aarch64_sqdml<SBINQOPS:as>l2_laneq<mode>): ... Into this.
18744 (aarch64_sqdmlal2_n<mode>): Merge this...
18745 (aarch64_sqdmlsl2_n<mode>): ... And this...
18746 (aarch64_sqdml<SBINQOPS:as>l2_n<mode>): ... Into this.
18748 2021-05-13 Martin Sebor <msebor@redhat.com>
18750 PR middle-end/100574
18751 * builtins.c (access_ref::get_ref): Improve detection of PHIs with
18752 all null arguments.
18754 2021-05-13 Martin Sebor <msebor@redhat.com>
18756 PR tree-optimization/93100
18757 PR middle-end/98583
18758 * tree-ssa-uninit.c (check_defs): Exclude intrinsic functions that
18759 don't modify referenced objects.
18761 2021-05-13 Martin Jambor <mjambor@suse.cz>
18763 PR tree-optimization/100453
18764 * tree-sra.c (sra_modify_assign): All const base accesses do not
18765 need refreshing, not just those from decl_pool.
18766 (sra_modify_assign): Do not refresh into a const base decl.
18768 2021-05-13 Martin Liska <mliska@suse.cz>
18770 * tree-ssa-dom.c: Remove m_simplifier.
18772 2021-05-13 Richard Earnshaw <rearnsha@arm.com>
18775 * config/arm/arm.c (arm_canonicalize_comparison): Correctly
18776 canonicalize DImode inequality comparisons against the
18777 maximum integral value.
18779 2021-05-13 Jakub Jelinek <jakub@redhat.com>
18781 PR tree-optimization/98856
18782 * config/i386/i386.c (ix86_shift_rotate_cost): Add CODE argument.
18783 Expect V2DI and V4DI arithmetic right shifts to be emulated.
18784 (ix86_rtx_costs, ix86_add_stmt_cost): Adjust ix86_shift_rotate_cost
18786 * config/i386/i386-expand.c (expand_vec_perm_2perm_interleave,
18787 expand_vec_perm_2perm_pblendv): New functions.
18788 (ix86_expand_vec_perm_const_1): Use them.
18789 * config/i386/sse.md (ashr<mode>3<mask_name>): Rename to ...
18790 (<mask_codefor>ashr<mode>3<mask_name>): ... this.
18791 (ashr<mode>3): New define_expand with VI248_AVX512BW iterator.
18792 (ashrv4di3): New define_expand.
18793 (ashrv2di3): Change condition to TARGET_SSE2, handle !TARGET_XOP
18794 and !TARGET_AVX512VL expansion.
18796 2021-05-13 Uroš Bizjak <ubizjak@gmail.com>
18799 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Force mode
18800 sizes < 16 to a register when constructing vpcmov pattern.
18801 * config/i386/mmx.md (*xop_pcmov_<mode>): Use MMXMODE124 mode.
18803 2021-05-13 Martin Liska <mliska@suse.cz>
18805 * gcov-io.c (gcov_write_block): Remove.
18806 (gcov_write_words): Likewise.
18807 (gcov_read_words): Re-implement using gcov_read_bytes.
18808 (gcov_allocate): Remove.
18809 (GCOV_BLOCK_SIZE): Likewise.
18810 (struct gcov_var): Remove most of the fields.
18811 (gcov_position): Implement with ftell.
18812 (gcov_rewrite): Remove setting of start and offset fields.
18813 (from_file): Re-format.
18814 (gcov_open): Remove setbuf call. It should not be needed.
18815 (gcov_close): Remove internal buffer handling.
18816 (gcov_magic): Use __builtin_bswap32.
18817 (gcov_write_counter): Use directly gcov_write_unsigned.
18818 (gcov_write_string): Use direct fwrite and do not round
18820 (gcov_seek): Use directly fseek.
18821 (gcov_write_tag): Use gcov_write_unsigned directly.
18822 (gcov_write_length): Likewise.
18823 (gcov_write_tag_length): Likewise.
18824 (gcov_read_bytes): Use directly fread.
18825 (gcov_read_unsigned): Use gcov_read_words.
18826 (gcov_read_counter): Likewise.
18827 (gcov_read_string): Use gcov_read_bytes.
18828 * gcov-io.h (GCOV_WORD_SIZE): Adjust to reflect
18829 that size is not in bytes, but words (4B).
18830 (GCOV_TAG_FUNCTION_LENGTH): Likewise.
18831 (GCOV_TAG_ARCS_LENGTH): Likewise.
18832 (GCOV_TAG_ARCS_NUM): Likewise.
18833 (GCOV_TAG_COUNTER_LENGTH): Likewise.
18834 (GCOV_TAG_COUNTER_NUM): Likewise.
18835 (GCOV_TAG_SUMMARY_LENGTH): Likewise.
18837 2021-05-13 liuhongt <hongtao.liu@intel.com>
18840 * config/i386/sse.md (ssedoublevecmode): Add attribute for
18841 V64QI/V32HI/V16SI/V4DI.
18842 (ssehalfvecmode): Add attribute for V2DI/V2DF.
18843 (*vec_concatv4si_0): Extend to VI124_128.
18844 (*vec_concat<mode>_0): New pre-reload splitter.
18845 * config/i386/predicates.md (movq_parallel): New predicate.
18847 2021-05-13 Alexandre Oliva <oliva@adacore.com>
18849 * targhooks.c (default_zero_call_used_regs): Retry using
18850 successfully-zeroed registers as sources.
18852 2021-05-12 Tobias Burnus <tobias@codesourcery.com>
18854 * omp-low.c (finish_taskreg_scan): Use the proper detach decl.
18856 2021-05-12 Aldy Hernandez <aldyh@redhat.com>
18859 * gimple-range.cc (range_of_builtin_call): Skip out on
18860 processing __builtin_clz when varying.
18862 2021-05-12 Tom de Vries <tdevries@suse.de>
18865 * config/nvptx/nvptx-opts.h (enum ptx_version): New enum.
18866 * config/nvptx/nvptx.c (nvptx_file_start): Print .version according
18867 to ptx_version_option.
18868 * config/nvptx/nvptx.h (TARGET_PTX_6_3): Define.
18869 * config/nvptx/nvptx.md (define_insn "nvptx_shuffle<mode>")
18870 (define_insn "nvptx_vote_ballot"): Use sync variant for
18872 * config/nvptx/nvptx.opt (ptx_version): Add enum.
18873 (mptx): Add option.
18874 * doc/invoke.texi (Nvidia PTX Options): Add mptx item.
18876 2021-05-12 Richard Biener <rguenther@suse.de>
18878 PR tree-optimization/100566
18879 * tree-ssa-sccvn.c (dominated_by_p_w_unex): Properly handle
18880 allow_back for all edge queries.
18882 2021-05-12 liuhongt <hongtao.liu@intel.com>
18885 * config/i386/sse.md (<sse4_1_avx2>_pblendvb): Add
18886 splitters for pblendvb of NOT mask register.
18888 2021-05-12 Richard Biener <rguenther@suse.de>
18890 PR tree-optimization/100519
18891 * tree-ssa-reassoc.c (can_associate_p): Split into...
18892 (can_associate_op_p): ... this
18893 (can_associate_type_p): ... and this.
18894 (is_reassociable_op): Call can_associate_op_p.
18895 (break_up_subtract_bb): Call the appropriate predicates.
18896 (reassociate_bb): Likewise.
18898 2021-05-12 Martin Liska <mliska@suse.cz>
18900 * lto-wrapper.c (merge_and_complain): Merge -flto=arg options.
18901 (run_gcc): Use -flto argument detection for merged
18904 2021-05-12 Martin Liska <mliska@suse.cz>
18906 * lto-wrapper.c (print_lto_docs_link): New function.
18907 (run_gcc): Print warning about missing job server detection
18908 after we know NR of partitions. Do the same for -flto{,=1}.
18909 * opts.c (get_option_html_page): Support -flto option.
18911 2021-05-12 Martin Liska <mliska@suse.cz>
18913 * lto-wrapper.c (get_options_from_collect_gcc_options): Change
18915 (append_option): Remove.
18916 (find_option): Rework to use the vector type.
18917 (remove_option): Remove.
18918 (merge_and_complain): Use vectors for cl_decoded_option data
18920 (append_compiler_options): Likewise.
18921 (append_diag_options): Likewise.
18922 (append_linker_options): Likewise.
18923 (append_offload_options): Likewise.
18924 (compile_offload_image): Likewise.
18925 (compile_images_for_offload_targets): Likewise.
18926 (find_and_merge_options): Likewise.
18927 (run_gcc): Likewise.
18929 2021-05-12 Bernd Edlinger <bernd.edlinger@hotmail.de>
18932 * dwarf2out.c (dwarf2out_finish): Set
18933 have_multiple_function_sections with multi-range text_section.
18935 2021-05-12 Martin Liska <mliska@suse.cz>
18937 PR bootstrap/100560
18938 * Makefile.in: Remove version.h from linker command line.
18940 2021-05-12 Richard Biener <rguenther@suse.de>
18942 PR middle-end/100547
18943 * rtl.h (rtvec_alloc): Make argument size_t.
18944 * rtl.c (rtvec_alloc): Verify the count is less than INT_MAX.
18946 2021-05-12 Jakub Jelinek <jakub@redhat.com>
18948 PR middle-end/100508
18949 * cfgexpand.c (expand_debug_expr): For DEBUG_EXPR_DECL with vector
18950 type, don't reuse DECL_RTL if it has different mode, instead force
18951 creation of a new DEBUG_EXPR.
18953 2021-05-12 Jakub Jelinek <jakub@redhat.com>
18954 Marc Glisse <marc.glisse@inria.fr>
18956 PR tree-optimization/94589
18957 * match.pd ((X & Y) == X -> (X & ~Y) == 0,
18958 (X | Y) == Y -> (X & ~Y) == 0): New GIMPLE simplifications.
18960 2021-05-12 Uroš Bizjak <ubizjak@gmail.com>
18963 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Handle V2SF mode.
18964 * config/i386/mmx.md (MMXMODE124): New mode iterator.
18966 (mmxintvecmode): New mode attribute.
18967 (mmxintvecmodelower): Ditto.
18968 (*mmx_maskcmpv2sf3_comm): New insn pattern.
18969 (*mmx_maskcmpv2sf3): Ditto.
18970 (vec_cmpv2sfv2si): New expander.
18971 (vcond<V2FI:mode>v2si): Ditto.
18972 (mmx_vlendvps): New insn pattern.
18973 (vcond<MMXMODE124:mode><MMXMODEI:mode>): Also handle V2SFmode.
18974 (vcondu<MMXMODE124:mode><MMXMODEI:mode>): Ditto.
18975 (vcond_mask_<mode><mmxintvecmodelower>): Ditto.
18977 2021-05-11 Martin Sebor <msebor@redhat.com>
18979 PR middle-end/21433
18980 * expr.c (expand_expr_real_1): Replace unreachable code with an assert.
18982 2021-05-11 Richard Biener <rguenther@suse.de>
18984 * gimple-fold.c (gimple_fold_call): Do not call
18985 maybe_fold_reference on call arguments or the static chain.
18986 (fold_stmt_1): Do not call maybe_fold_reference on GIMPLE_ASM
18989 2021-05-11 Martin Liska <mliska@suse.cz>
18991 * builtins.def (DEF_HSAIL_BUILTIN): Remove.
18992 (DEF_HSAIL_ATOMIC_BUILTIN): Likewise.
18993 (DEF_HSAIL_SAT_BUILTIN): Likewise.
18994 (DEF_HSAIL_INTR_BUILTIN): Likewise.
18995 (DEF_HSAIL_CVT_ZEROI_SAT_BUILTIN): Likewise.
18996 * doc/frontends.texi: Remove BRIG.
18997 * doc/install.texi: Likewise.
18998 * doc/invoke.texi: Likewise.
18999 * doc/standards.texi: Likewise.
19000 * brig-builtins.def: Removed.
19001 * brig/ChangeLog: Removed.
19002 * brig/Make-lang.in: Removed.
19003 * brig/brig-builtins.h: Removed.
19004 * brig/brig-c.h: Removed.
19005 * brig/brig-lang.c: Removed.
19006 * brig/brigfrontend/brig-arg-block-handler.cc: Removed.
19007 * brig/brigfrontend/brig-atomic-inst-handler.cc: Removed.
19008 * brig/brigfrontend/brig-basic-inst-handler.cc: Removed.
19009 * brig/brigfrontend/brig-branch-inst-handler.cc: Removed.
19010 * brig/brigfrontend/brig-cmp-inst-handler.cc: Removed.
19011 * brig/brigfrontend/brig-code-entry-handler.cc: Removed.
19012 * brig/brigfrontend/brig-code-entry-handler.h: Removed.
19013 * brig/brigfrontend/brig-comment-handler.cc: Removed.
19014 * brig/brigfrontend/brig-control-handler.cc: Removed.
19015 * brig/brigfrontend/brig-copy-move-inst-handler.cc: Removed.
19016 * brig/brigfrontend/brig-cvt-inst-handler.cc: Removed.
19017 * brig/brigfrontend/brig-fbarrier-handler.cc: Removed.
19018 * brig/brigfrontend/brig-function-handler.cc: Removed.
19019 * brig/brigfrontend/brig-function.cc: Removed.
19020 * brig/brigfrontend/brig-function.h: Removed.
19021 * brig/brigfrontend/brig-inst-mod-handler.cc: Removed.
19022 * brig/brigfrontend/brig-label-handler.cc: Removed.
19023 * brig/brigfrontend/brig-lane-inst-handler.cc: Removed.
19024 * brig/brigfrontend/brig-machine.c: Removed.
19025 * brig/brigfrontend/brig-machine.h: Removed.
19026 * brig/brigfrontend/brig-mem-inst-handler.cc: Removed.
19027 * brig/brigfrontend/brig-module-handler.cc: Removed.
19028 * brig/brigfrontend/brig-queue-inst-handler.cc: Removed.
19029 * brig/brigfrontend/brig-seg-inst-handler.cc: Removed.
19030 * brig/brigfrontend/brig-signal-inst-handler.cc: Removed.
19031 * brig/brigfrontend/brig-to-generic.cc: Removed.
19032 * brig/brigfrontend/brig-to-generic.h: Removed.
19033 * brig/brigfrontend/brig-util.cc: Removed.
19034 * brig/brigfrontend/brig-util.h: Removed.
19035 * brig/brigfrontend/brig-variable-handler.cc: Removed.
19036 * brig/brigfrontend/hsa-brig-format.h: Removed.
19037 * brig/brigfrontend/phsa.h: Removed.
19038 * brig/brigspec.c: Removed.
19039 * brig/config-lang.in: Removed.
19040 * brig/gccbrig.texi: Removed.
19041 * brig/lang-specs.h: Removed.
19042 * brig/lang.opt: Removed.
19044 2021-05-11 Richard Biener <rguenther@suse.de>
19047 * ipa-param-manipulation.c
19048 (ipa_param_body_adjustments::modify_call_stmt): Avoid
19049 altering SSA_NAME_DEF_STMT by adjusting the calls LHS
19050 via gimple_call_lhs_ptr.
19052 2021-05-11 Alex Coplan <alex.coplan@arm.com>
19055 * config/arm/arm.c (cmse_nonsecure_call_inline_register_clear):
19056 Avoid emitting CFA adjusts on the sp if we have the fp.
19058 2021-05-11 Richard Sandiford <richard.sandiford@arm.com>
19060 * config/aarch64/iterators.md (VMUL_CHANGE_NLANES): Delete.
19061 (VMULD): New iterator.
19062 (VCOND): Handle V4HF and V8HF.
19063 (VCONQ): Fix entry for V2SF.
19064 * config/aarch64/aarch64-simd.md (mul_lane<mode>3): Use VMULD
19065 instead of VMUL. Use a 64-bit vector mode for the indexed operand.
19066 (*aarch64_mul3_elt_<vswap_width_name><mode>): Merge with...
19067 (mul_laneq<mode>3): ...this define_insn. Use VMUL instead of VDQSF.
19068 Use a 128-bit vector mode for the indexed operand. Use stype for
19069 the scheduling type.
19071 2021-05-11 Richard Biener <rguenther@suse.de>
19073 * gimple-fold.c (maybe_fold_reference): Only return
19074 is_gimple_min_invariant values.
19076 2021-05-11 Richard Biener <rguenther@suse.de>
19078 PR middle-end/100509
19079 * gimple-fold.c (fold_gimple_assign): Only call
19080 get_symbol_constant_value on register type symbols.
19082 2021-05-11 Srinath Parvathaneni <srinath.parvathaneni@arm.com>
19083 Joe Ramsay <joe.ramsay@arm.com>
19086 * config/arm/arm_mve.h (__arm_vstrwq_scatter_offset): Fix wrong arguments.
19087 (__arm_vcmpneq): Remove duplicate definition.
19088 (__arm_vstrwq_scatter_offset_p): Likewise.
19089 (__arm_vmaxq_x): Likewise.
19090 (__arm_vmlsdavaq): Likewise.
19091 (__arm_vmlsdavaxq): Likewise.
19092 (__arm_vmlsdavq_p): Likewise.
19093 (__arm_vmlsdavxq_p): Likewise.
19094 (__arm_vrmlaldavhaq): Likewise.
19095 (__arm_vstrbq_p): Likewise.
19096 (__arm_vstrbq_scatter_offset): Likewise.
19097 (__arm_vstrbq_scatter_offset_p): Likewise.
19098 (__arm_vstrdq_scatter_offset): Likewise.
19099 (__arm_vstrdq_scatter_offset_p): Likewise.
19100 (__arm_vstrdq_scatter_shifted_offset): Likewise.
19101 (__arm_vstrdq_scatter_shifted_offset_p): Likewise.
19103 2021-05-11 Jakub Jelinek <jakub@redhat.com>
19105 PR middle-end/100471
19106 * omp-low.c (lower_omp_task_reductions): For OMP_TASKLOOP, if data
19107 is 0, bypass the reduction loop including
19108 GOMP_taskgroup_reduction_unregister call.
19110 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
19112 * config/rs6000/rs6000.c (struct rs6000_cost_data): New member
19113 costing_for_scalar.
19114 (rs6000_density_test): Early return if costing_for_scalar is true.
19115 (rs6000_init_cost): Init costing_for_scalar of rs6000_cost_data.
19117 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
19119 * doc/tm.texi: Regenerated.
19120 * target.def (init_cost): Add new parameter costing_for_scalar.
19121 * targhooks.c (default_init_cost): Adjust for new parameter.
19122 * targhooks.h (default_init_cost): Likewise.
19123 * tree-vect-loop.c (_loop_vec_info::_loop_vec_info): Likewise.
19124 (vect_compute_single_scalar_iteration_cost): Likewise.
19125 (vect_analyze_loop_2): Likewise.
19126 * tree-vect-slp.c (_bb_vec_info::_bb_vec_info): Likewise.
19127 (vect_bb_vectorization_profitable_p): Likewise.
19128 * tree-vectorizer.h (init_cost): Likewise.
19129 * config/aarch64/aarch64.c (aarch64_init_cost): Likewise.
19130 * config/i386/i386.c (ix86_init_cost): Likewise.
19131 * config/rs6000/rs6000.c (rs6000_init_cost): Likewise.
19133 2021-05-11 Kewen Lin <linkw@linux.ibm.com>
19135 * config/rs6000/rs6000.c (rs6000_vect_nonmem): Renamed to
19136 vect_nonmem and moved into...
19137 (struct rs6000_cost_data): ...here.
19138 (rs6000_init_cost): Use vect_nonmem of cost_data instead.
19139 (rs6000_add_stmt_cost): Likewise.
19140 (rs6000_finish_cost): Likewise.
19142 2021-05-10 Eric Botcazou <ebotcazou@adacore.com>
19144 * range-op.cc (get_bool_state): Adjust head comment.
19145 (operator_not_equal::op1_range): Fix comment.
19146 (operator_bitwise_xor::op1_range): Remove call to gcc_unreachable.
19148 2021-05-10 Martin Sebor <msebor@redhat.com>
19150 PR middle-end/100425
19151 PR middle-end/100510
19152 * gimple-ssa-warn-alloca.c (pass_walloca::firast_time_p): Rename...
19153 (pass_walloca::xlimit_certain_p): ...to this.
19154 (pass_walloca::gate): Execute for any kind of handled warning.
19155 (pass_walloca::execute): Avoid issuing "maybe" and "unbounded"
19156 warnings when xlimit_certain_p is set.
19158 2021-05-10 Pat Haugen <pthaugen@linux.ibm.com>
19160 * config/rs6000/rs6000.c (rs6000_ira_change_pseudo_allocno_class):
19161 Return ALTIVEC_REGS if that is best_class.
19162 (rs6000_compute_pressure_classes): Add ALTIVEC_REGS.
19164 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19166 * config/arm/arm.h (CPP_SPEC): Remove error message about
19169 2021-05-10 Martin Jambor <mjambor@suse.cz>
19171 * ipa-prop.h (IPA_NODE_REF): Removed.
19172 (IPA_NODE_REF_GET_CREATE): Likewise.
19173 (IPA_EDGE_REF): Likewise.
19174 (IPA_EDGE_REF_GET_CREATE): Likewise.
19175 (IS_VALID_JUMP_FUNC_INDEX): Likewise.
19176 * ipa-cp.c (print_all_lattices): Replaced IPA_NODE_REF with a direct
19177 use of ipa_node_params_sum.
19178 (ipcp_versionable_function_p): Likewise.
19179 (push_node_to_stack): Likewise.
19180 (pop_node_from_stack): Likewise.
19181 (set_single_call_flag): Replaced two IPA_NODE_REF with one single
19182 direct use of ipa_node_params_sum.
19183 (initialize_node_lattices): Replaced IPA_NODE_REF with a direct use of
19184 ipa_node_params_sum.
19185 (ipa_context_from_jfunc): Replaced IPA_EDGE_REF with a direct use of
19187 (ipcp_verify_propagated_values): Replaced IPA_NODE_REF with a direct
19188 use of ipa_node_params_sum.
19189 (self_recursively_generated_p): Likewise.
19190 (propagate_scalar_across_jump_function): Likewise.
19191 (propagate_context_across_jump_function): Replaced IPA_EDGE_REF with a
19192 direct use of ipa_edge_args_sum, moved the lookup after the early
19193 exit. Replaced IPA_NODE_REF with a direct use of ipa_node_params_sum.
19194 (propagate_bits_across_jump_function): Replaced IPA_NODE_REF with
19195 direct uses of ipa_node_params_sum.
19196 (propagate_vr_across_jump_function): Likewise.
19197 (propagate_aggregate_lattice): Likewise.
19198 (propagate_aggs_across_jump_function): Likewise.
19199 (propagate_constants_across_call): Likewise, also replaced
19200 IPA_EDGE_REF with a direct use of ipa_edge_args_sum.
19201 (good_cloning_opportunity_p): Replaced IPA_NODE_REF with a direct use
19202 of ipa_node_params_sum.
19203 (estimate_local_effects): Likewise.
19204 (add_all_node_vals_to_toposort): Likewise.
19205 (propagate_constants_topo): Likewise.
19206 (ipcp_propagate_stage): Likewise.
19207 (ipcp_discover_new_direct_edges): Likewise.
19208 (calls_same_node_or_its_all_contexts_clone_p): Likewise.
19209 (cgraph_edge_brings_value_p): Likewise (in both overloaded functions).
19210 (get_info_about_necessary_edges): Likewise.
19211 (want_remove_some_param_p): Likewise.
19212 (create_specialized_node): Likewise.
19213 (self_recursive_pass_through_p): Likewise.
19214 (self_recursive_agg_pass_through_p): Likewise.
19215 (find_more_scalar_values_for_callers_subset): Likewise and also
19216 replaced IPA_EDGE_REF with direct uses of ipa_edge_args_sum, in one
19217 case replacing two of those with a single query.
19218 (find_more_contexts_for_caller_subset): Likewise for the
19219 ipa_polymorphic_call_context overload.
19220 (intersect_aggregates_with_edge): Replaced IPA_EDGE_REF with a direct
19221 use of ipa_edge_args_sum. Replaced IPA_NODE_REF with direct uses of
19222 ipa_node_params_sum.
19223 (find_aggregate_values_for_callers_subset): Likewise, also reusing
19224 results of ipa_edge_args_sum->get.
19225 (cgraph_edge_brings_all_scalars_for_node): Replaced IPA_NODE_REF with
19226 direct uses of ipa_node_params_sum, replaced IPA_EDGE_REF with a
19227 direct use of ipa_edge_args_sum.
19228 (cgraph_edge_brings_all_agg_vals_for_node): Likewise, moved node
19229 summary query after the early exit and reused the result later.
19230 (decide_about_value): Replaced IPA_NODE_REF with a direct use of
19231 ipa_node_params_sum.
19232 (decide_whether_version_node): Likewise. Removed re-querying for
19233 summaries after cloning.
19234 (spread_undeadness): Replaced IPA_NODE_REF with a direct use of
19235 ipa_node_params_sum.
19236 (has_undead_caller_from_outside_scc_p): Likewise, reusing results of
19238 (identify_dead_nodes): Likewise.
19239 (ipcp_store_bits_results): Replaced IPA_NODE_REF with direct uses of
19240 ipa_node_params_sum.
19241 (ipcp_store_vr_results): Likewise.
19242 * ipa-fnsummary.c (evaluate_properties_for_edge): Likewise.
19243 (ipa_fn_summary_t::duplicate): Likewise.
19244 (analyze_function_body): Likewise.
19245 (estimate_calls_size_and_time): Likewise.
19246 (ipa_cached_call_context::duplicate_from): Likewise.
19247 (ipa_call_context::equal_to): Likewise.
19248 (remap_edge_params): Likewise.
19249 (ipa_merge_fn_summary_after_inlining): Likewise.
19250 (inline_read_section): Likewise.
19251 * ipa-icf.c (sem_function::param_used_p): Likewise.
19252 * ipa-modref.c (compute_parm_map): Likewise.
19253 (compute_parm_map): Replaced IPA_EDGE_REF with a direct use of
19255 (get_access_for_fnspec): Replaced IPA_NODE_REF with a direct use of
19256 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
19258 * ipa-profile.c (check_argument_count): Likewise.
19259 * ipa-prop.c (ipa_alloc_node_params): Replaced IPA_NODE_REF_GET_CREATE
19260 with a direct use of ipa_node_params_sum.
19261 (ipa_initialize_node_params): Likewise.
19262 (ipa_print_node_jump_functions_for_edge): Replaced IPA_EDGE_REF with a
19263 direct use of ipa_edge_args_sum and reused the query result.
19264 (ipa_compute_jump_functions_for_edge): Replaced IPA_NODE_REF with a
19265 direct use of ipa_node_params_sum and replaced IPA_EDGE_REF with a
19266 direct use of ipa_edge_args_sum.
19267 (ipa_note_param_call): Replaced IPA_NODE_REF with a direct use of
19268 ipa_node_params_sum and reused the result of the query.
19269 (ipa_analyze_node): Likewise.
19270 (ipa_analyze_controlled_uses): Replaced IPA_NODE_REF with a direct use
19271 of ipa_node_params_sum.
19272 (update_jump_functions_after_inlining): Replaced IPA_EDGE_REF with
19273 direct uses of ipa_edge_args_sum.
19274 (update_indirect_edges_after_inlining): Replaced IPA_NODE_REF with
19275 direct uses of ipa_node_params_sum and replaced IPA_EDGE_REF with a
19276 direct use of ipa_edge_args_sum. Removed superficial re-querying the
19278 (propagate_controlled_uses): Replaced IPA_NODE_REF with direct uses of
19279 ipa_node_params_sum and replaced IPA_EDGE_REF with a direct use of
19281 (ipa_propagate_indirect_call_infos): Replaced IPA_EDGE_REF with a
19282 direct use of ipa_edge_args_sum.
19283 (ipa_edge_args_sum_t::duplicate): Replaced IPA_NODE_REF with a direct
19284 use of ipa_node_params_sum.
19285 (ipa_print_node_params): Likewise.
19286 (ipa_write_node_info): Likewise and also replaced IPA_EDGE_REF with
19287 direct uses of ipa_edge_args_sum.
19288 (ipa_read_edge_info): Replaced IPA_EDGE_REF with a direct use of
19290 (ipa_read_node_info): Replaced IPA_NODE_REF with a direct use of
19291 ipa_node_params_sum.
19292 (ipa_prop_write_jump_functions): Likewise. Move variable node to the
19293 scopes where it is used.
19295 2021-05-10 Uroš Bizjak <ubizjak@gmail.com>
19297 * config/i386/i386-expand.c (ix86_expand_sse_movcc)
19298 <case E_V2SImode>: Force op_true to register.
19300 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19302 * config/arm/iterators.md (MVE_FP_COMPARISONS): New.
19303 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_f<mode>)
19304 (mve_vcmp<mve_cmp_op>q_n_f<mode>): New, merge all vcmp_*f*
19306 (mve_vcmpeqq_f<mode>, mve_vcmpeqq_n_f<mode>, mve_vcmpgeq_f<mode>)
19307 (mve_vcmpgeq_n_f<mode>, mve_vcmpgtq_f<mode>)
19308 (mve_vcmpgtq_n_f<mode>, mve_vcmpleq_f<mode>)
19309 (mve_vcmpleq_n_f<mode>, mve_vcmpltq_f<mode>)
19310 (mve_vcmpltq_n_f<mode>, mve_vcmpneq_f<mode>)
19311 (mve_vcmpneq_n_f<mode>): Remove.
19312 * config/arm/unspecs.md (VCMPEQQ_F, VCMPEQQ_N_F, VCMPGEQ_F)
19313 (VCMPGEQ_N_F, VCMPGTQ_F, VCMPGTQ_N_F, VCMPLEQ_F, VCMPLEQ_N_F)
19314 (VCMPLTQ_F, VCMPLTQ_N_F, VCMPNEQ_F, VCMPNEQ_N_F): Remove.
19316 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19318 * config/arm/iterators.md (MVE_COMPARISONS): New.
19320 (mve_cmp_type): New.
19321 * config/arm/mve.md (mve_vcmp<mve_cmp_op>q_<mode>): New, merge all
19323 (mve_vcmpneq_<mode>, mve_vcmpcsq_n_<mode>, mve_vcmpcsq_<mode>)
19324 (mve_vcmpeqq_n_<mode>, mve_vcmpeqq_<mode>, mve_vcmpgeq_n_<mode>)
19325 (mve_vcmpgeq_<mode>, mve_vcmpgtq_n_<mode>, mve_vcmpgtq_<mode>)
19326 (mve_vcmphiq_n_<mode>, mve_vcmphiq_<mode>, mve_vcmpleq_n_<mode>)
19327 (mve_vcmpleq_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
19328 (mve_vcmpneq_n_<mode>, mve_vcmpltq_n_<mode>, mve_vcmpltq_<mode>)
19329 (mve_vcmpneq_n_<mode>): Remove.
19331 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19333 * config/arm/arm_mve.h (__arm_vcmp*): Remove 's' suffix.
19334 * config/arm/arm_mve_builtins.def (vcmp*): Remove 's' suffix.
19335 * config/arm/mve.md (mve_vcmp*): Remove 's' suffix in pattern
19338 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19340 * config/arm/arm_mve_builtins.def (vcmpneq_u): Remove.
19341 (vcmpneq_n_u): Likewise.
19342 (vcmpeqq_u,): Likewise.
19343 (vcmpeqq_n_u): Likewise.
19344 * config/arm/iterators.md (supf): Remove VCMPNEQ_U, VCMPEQQ_U,
19345 VCMPEQQ_N_U and VCMPNEQ_N_U.
19346 * config/arm/mve.md (mve_vcmpneq): Remove <supf> iteration.
19347 (mve_vcmpeqq_n): Likewise.
19348 (mve_vcmpeqq): Likewise.
19349 (mve_vcmpneq_n): Likewise.
19351 2021-05-10 Christophe Lyon <christophe.lyon@linaro.org>
19353 * config/arm/arm_mve.h (__arm_vcmpeq*u*, __arm_vcmpne*u*): Call
19354 the 's' version of the builtin.
19356 2021-05-10 Richard Biener <rguenther@suse.de>
19358 PR tree-optimization/100492
19359 * tree-loop-distribution.c (find_seed_stmts_for_distribution):
19360 Find nothing when the loop contains an irreducible region.
19362 2021-05-10 Richard Biener <rguenther@suse.de>
19364 PR middle-end/100464
19366 * gimple-fold.c (canonicalize_constructor_val): Do not set
19369 2021-05-10 Richard Biener <rguenther@suse.de>
19371 PR tree-optimization/100434
19372 * tree-ssa-dse.c (initialize_ao_ref_for_dse): Handle
19374 (dse_optimize_stmt): Handle call LHS by dropping the
19375 LHS or the whole call if it doesn't have other
19377 (pass_dse::execute): Adjust.
19379 2021-05-10 Martin Liska <mliska@suse.cz>
19381 * Makefile.in: Add missing genversion rule.
19383 2021-05-10 Alex Coplan <alex.coplan@arm.com>
19386 * config/arm/mve.md (*mve_mov<mode>): Simplify output code. Use
19387 vldrw.u32 and vstrw.32 for V2D[IF]mode loads and stores.
19389 2021-05-10 Martin Liska <mliska@suse.cz>
19391 * builtins.c (is_builtin_name): Use startswith
19392 function instead of strncmp.
19393 * collect2.c (main): Likewise.
19394 (has_lto_section): Likewise.
19395 (scan_libraries): Likewise.
19396 * coverage.c (coverage_checksum_string): Likewise.
19397 (coverage_init): Likewise.
19398 * dwarf2out.c (is_cxx): Likewise.
19399 (gen_compile_unit_die): Likewise.
19400 * gcc-ar.c (main): Likewise.
19401 * gcc.c (init_spec): Likewise.
19402 (read_specs): Likewise.
19403 (execute): Likewise.
19404 (check_live_switch): Likewise.
19405 * genattrtab.c (write_attr_case): Likewise.
19406 (IS_ATTR_GROUP): Likewise.
19407 * gencfn-macros.c (main): Likewise.
19408 * gengtype.c (type_for_name): Likewise.
19409 (gen_rtx_next): Likewise.
19410 (get_file_langdir): Likewise.
19411 (write_local): Likewise.
19412 * genmatch.c (get_operator): Likewise.
19413 (get_operand_type): Likewise.
19414 (expr::gen_transform): Likewise.
19415 * genoutput.c (validate_optab_operands): Likewise.
19416 * incpath.c (add_sysroot_to_chain): Likewise.
19417 * langhooks.c (lang_GNU_C): Likewise.
19418 (lang_GNU_CXX): Likewise.
19419 (lang_GNU_Fortran): Likewise.
19420 (lang_GNU_OBJC): Likewise.
19421 * lto-wrapper.c (run_gcc): Likewise.
19422 * omp-general.c (omp_max_simt_vf): Likewise.
19423 * omp-low.c (omp_runtime_api_call): Likewise.
19424 * opts-common.c (parse_options_from_collect_gcc_options): Likewise.
19425 * read-rtl-function.c (function_reader::read_rtx_operand_r): Likewise.
19426 * real.c (real_from_string): Likewise.
19427 * selftest.c (assert_str_startswith): Likewise.
19428 * timevar.c (timer::validate_phases): Likewise.
19429 * tree.c (get_file_function_name): Likewise.
19430 * ubsan.c (ubsan_use_new_style_p): Likewise.
19431 * varasm.c (default_function_rodata_section): Likewise.
19432 (incorporeal_function_p): Likewise.
19433 (default_section_type_flags): Likewise.
19434 * system.h (startswith): Define startswith.
19436 2021-05-10 Martin Liska <mliska@suse.cz>
19438 * bitmap.h (class auto_bitmap): Remove
19439 __cplusplus >= 201103.
19440 * config/aarch64/aarch64.c: Likewise.
19441 * gimple-ssa-store-merging.c (store_immediate_info::store_immediate_info):
19443 * sbitmap.h: Likewise.
19445 2021-05-10 Martin Liska <mliska@suse.cz>
19447 * Makefile.in: Rename gcov-iov to genversion and depend
19448 on version.h (instead of gcov-iov.h).
19449 * gcov-io.h: Include version.h instread of gcov-iov.h.
19450 * gengtype-state.c (read_state_version): Likewise.
19451 * gcov-iov.c: Moved to...
19452 * genversion.c: ...here.
19453 * lto-streamer.h (LTO_major_version): Define it with
19455 * version.c: Removed.
19456 * version.h: Removed.
19458 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19460 * config/arc/arc.md (UNSPEC_ARC_DMPYWH): Define.
19461 * config/arc/simdext.md (VCT): Add predicates for iterator
19464 (voptab): Likewise.
19465 (vec_widen_<V_US>mult_hi_v4hi): Change pattern predicate.
19466 (<voptab>v2si3): New patterns.
19468 (reduc_plus_scal_v4hi): Likewise.
19469 (reduc_plus_scal_v2si): Likewise.
19470 (vec_duplicatev2si): Likewise.
19471 (vec_duplicatev4hi): Likewise.
19473 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19475 * config/arc/simdext.md: Format and cleanup file.
19477 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19479 * config/arc/simdext.md (movmisalignv2hi): Allow misaligned access
19480 only when munaligned-access option is on.
19481 (movmisalign<mode>): Likewise.
19483 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19485 * common/config/arc/arc-common.c (arc_handle_option): Remove dot
19487 * config/arc/arc.c (arc_reorg): Remove underscore from string.
19489 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19491 * config/arc/arc.h (CLZ_DEFINED_VALUE_AT_ZERO): Define.
19492 (CTZ_DEFINED_VALUE_AT_ZERO): Likewise.
19493 * config/arc/arc.md (clrsbsi2): Cleanup pattern.
19494 (norm_f): Likewise.
19497 (clzsi2): Use fls instruction when available.
19498 (arc_clzsi2): Likewise.
19500 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19502 * config/arc/arc.h (ADDITIONAL_REGISTER_NAMES): Add r26 and r27.
19504 2021-05-10 Claudiu Zissulescu <claziss@synopsys.com>
19506 * doc/extend.texi (__builtin_arc_sr): Swap arguments.
19508 2021-05-10 Bernd Edlinger <bernd.edlinger@hotmail.de>
19510 PR middle-end/100467
19511 * toplev.c (compile_file): Call insn_locations_init before
19512 targetm.asm_out.code_end.
19514 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
19517 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
19519 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
19521 2021-05-07 Jakub Jelinek <jakub@redhat.com>
19522 Andrew Stubbs <amd@codesourcery.com>
19525 * builtins.c (try_store_by_multiple_pieces): Use force_operand for
19526 emit_move_insn operands.
19528 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
19530 * cfgexpand.c (expand_gimple_basic_block): Do not inherit a current
19531 location for the outgoing edges of an empty block.
19532 * dwarf2out.c (add_subscript_info): Retrieve the bounds and index
19533 type by means of the get_array_descr_info langhook, if it is set and
19534 returns true. Remove obsolete code dealing with unnamed subtypes.
19536 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19538 * gimple-range-cache.cc (ssa_block_ranges): Virtualize.
19539 (sbr_vector): Renamed from ssa_block_cache.
19540 (sbr_vector::sbr_vector): Allocate from obstack abd initialize.
19541 (ssa_block_ranges::~ssa_block_ranges): Remove.
19542 (sbr_vector::set_bb_range): Use varying and undefined cached values.
19543 (ssa_block_ranges::set_bb_varying): Remove.
19544 (sbr_vector::get_bb_range): Adjust assert.
19545 (sbr_vector::bb_range_p): Adjust assert.
19546 (~block_range_cache): No freeing loop required.
19547 (block_range_cache::get_block_ranges): Remove.
19548 (block_range_cache::set_bb_range): Inline get_block_ranges.
19549 (block_range_cache::set_bb_varying): Remove.
19550 * gimple-range-cache.h (set_bb_varying): Remove prototype.
19551 * value-range.h (irange_allocator::get_memory): New.
19553 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19555 * gimple-range-cache.cc (non_null_ref::non_null_deref_p): Search
19556 dominator tree is available and requested.
19557 (ranger_cache::ssa_range_in_bb): Don't search dom tree here.
19558 (ranger_cache::fill_block_cache): Don't search dom tree here either.
19559 * gimple-range-cache.h (non_null_deref_p): Add dom_search param.
19561 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19563 * gimple-range.cc (gimple_ranger::range_on_exit): Handle block with
19564 only PHI nodes better.
19566 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19568 * gimple-range-edge.h (gimple_outgoing_range): Rename from
19570 (gcond_edge_range): Export prototype.
19571 * gimple-range-edge.cc (gcond_edge_range): New.
19572 (gimple_outgoing_range::edge_range_p): Use gcond_edge_range.
19573 * gimple-range-gori.h (gori_compute): Use gimple_outgoing_range.
19575 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19577 * gimple-range-edge.cc (outgoing_range::calc_switch_ranges): Compute
19578 default range into a temp and allocate only what is needed.
19580 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19582 * range-op.cc (operator_trunc_mod::wi_fold): x % 0 is UNDEFINED.
19584 2021-05-07 Andrew MacLeod <amacleod@redhat.com>
19586 * gimple-range.h (gimple_range_global): Pick up parameter initial
19587 values, and use-before defined locals are UNDEFINED.
19589 2021-05-07 Eric Botcazou <ebotcazou@adacore.com>
19591 * doc/extend.texi (scalar_storage_order): Mention effect on pointer
19593 * tree.h (reverse_storage_order_for_component_p): Return false if
19594 the type is a pointer.
19596 2021-05-07 Andrew Stubbs <ams@codesourcery.com>
19598 * config/gcn/gcn.c (gcn_scalar_mode_supported_p): Disable TImode.
19600 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
19603 * config/i386/i386-expand.c (ix86_expand_sse_movcc):
19604 Handle V8QI, V4HI and V2SI modes.
19605 * config/i386/mmx.md (mmx_pblendvb): New insn pattern.
19606 * config/i386/sse.md (unspec): Move UNSPEC_BLENDV ...
19607 * config/i386/i386.md (unspec): ... here.
19609 2021-05-07 Tobias Burnus <tobias@codesourcery.com>
19610 Tom de Vries <tdevries@suse.de>
19612 * omp-low.c (lower_rec_simd_input_clauses): Set max_vf = 1 if
19613 a truth_value_p reduction variable is nonintegral.
19615 2021-05-07 Uroš Bizjak <ubizjak@gmail.com>
19618 * config/i386/i386-expand.c (ix86_use_mask_cmp_p):
19619 Return false for mode sizes < 16.
19621 2021-05-07 Jakub Jelinek <jakub@redhat.com>
19624 * config/i386/mmx.md (*xop_pcmov_<mode>): New define_insn.
19626 2021-05-06 Martin Jambor <mjambor@suse.cz>
19628 * ipa-sra.c (ipa_sra_dump_all_summaries): Dump edge summaries even
19629 when there is no function summary.
19630 (ipa_sra_summarize_function): produce edge summaries even when
19633 2021-05-06 Tom Tromey <tom@tromey.com>
19635 * godump.c (string_hash_eq): Remove.
19636 (go_finish): Use htab_eq_string.
19638 2021-05-06 Tom Tromey <tom@tromey.com>
19640 * gengtype-state.c (read_state): Use htab_eq_string.
19641 (string_eq): Remove.
19643 2021-05-06 Tom Tromey <tom@tromey.com>
19645 * gensupport.c (htab_eq_string): Remove.
19647 2021-05-06 Bernd Edlinger <bernd.edlinger@hotmail.de>
19650 * debug.h (gcc_debug_hooks): Add set_ignored_loc function pointer.
19651 * dwarf2out.h (dw_fde_node::ignored_debug): New data item.
19652 * dbxout.c (dbx_debug_hooks, xcoff_debug_hooks): Add dummy
19653 set_ignored_loc callbacks.
19654 * debug.c (do_nothing_debug_hooks): Likewise.
19655 * vmsdbgout.c (vmsdbg_debug_hooks): Likewise.
19656 * dwarf2out.c (text_section_used, cold_text_section_used): Remove.
19657 (in_text_section_p, last_text_label, last_cold_label,
19658 switch_text_ranges, switch_cold_ranges): New data items.
19659 (dwarf2out_note_section_used): Remove.
19660 (dwarf2out_begin_prologue): Set fde->ignored_debug and
19662 (mark_ignored_debug_section): New helper function.
19663 (dwarf2out_end_epilogue, dwarf2out_switch_text_section): Call
19664 mark_ignored_debug_section.
19665 (dwarf2_debug_hooks): Use dwarf2out_set_ignored_loc.
19666 (dwarf2_lineno_debug_hooks): Use dummy for set_ignored_loc.
19667 (size_of_aranges): Adjust formula for multi-part text ranges size.
19668 (output_aranges): Output multi-part text ranges.
19669 (dwarf2out_set_ignored_loc): New callback function.
19670 (dwarf2out_finish): Output multi-part text ranges.
19671 (dwarf2out_c_finalize): Clear new data items.
19672 * final.c (final_start_function_1): Call set_ignored_loc callback.
19673 (final_scan_insn_1): Likewise.
19674 * ggc-page.c (gt_ggc_mx): New helper function.
19675 * stringpool.c (gt_pch_nx): Likewise.
19677 2021-05-06 Richard Biener <rguenther@suse.de>
19679 * timevar.def (TV_TREE_INSERT_PHI_NODES): Remove.
19680 (TV_TREE_SSA_REWRITE_BLOCKS): Likewise.
19681 (TV_TREE_INTO_SSA): New.
19682 * tree-into-ssa.c (insert_phi_nodes): Do not account separately.
19683 (rewrite_blocks): Likewise.
19684 (pass_data_build_ssa): Account to TV_TREE_INTO_SSA.
19686 2021-05-06 Jakub Jelinek <jakub@redhat.com>
19688 * tree-ssa-phiopt.c (value_replacement, minmax_replacement,
19689 abs_replacement, xor_replacement,
19690 cond_removal_in_popcount_clz_ctz_pattern,
19691 replace_phi_edge_with_variable): Change type of phi argument from
19692 gimple * to gphi *.
19694 2021-05-06 Richard Biener <rguenther@suse.de>
19696 * tree-ssa-loop-split.c (split_loop): Delay updating SSA form.
19697 Output an opt-info message.
19698 (do_split_loop_on_cond): Likewise.
19699 (tree_ssa_split_loops): Update SSA form here.
19701 2021-05-06 Richard Biener <rguenther@suse.de>
19703 * tree-inline.c (tree_function_versioning): Fix DECL_BY_REFERENCE
19704 return variable removal.
19706 2021-05-06 Marius Hillenbrand <mhillen@linux.ibm.com>
19708 * config/s390/s390-builtins.def (O_M5, O1_M5, ...): Remove unused macros.
19709 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
19710 (s390_vec_permi_dbl, s390_vpdi): Use the O3_U2 type for the immediate
19712 * config/s390/s390.c (s390_const_operand_ok): Remove unused
19715 2021-05-06 Jakub Jelinek <jakub@redhat.com>
19717 PR tree-optimization/94589
19718 * tree-ssa-phiopt.c (tree_ssa_phiopt_worker): Call
19719 spaceship_replacement.
19720 (cond_only_block_p, spaceship_replacement): New functions.
19722 2021-05-06 Richard Biener <rguenther@suse.de>
19725 * tree-emutls.c (gen_emutls_addr): Pass in whether we're
19726 dealing with a debug use and only query existing addresses
19728 (lower_emutls_1): Avoid splitting out addresses for debug
19729 stmts, reset the debug stmt when we fail to find existing
19731 (lower_emutls_phi_arg): Set wi.stmt.
19733 2021-05-06 Christoph Muellner <cmuellner@gcc.gnu.org>
19736 * config/riscv/riscv.c (riscv_block_move_loop): Use cbranch helper.
19737 * config/riscv/riscv.md (cbranch<mode>4): Generate helpers.
19738 (stack_protect_test): Use cbranch helper.
19740 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
19743 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
19744 always return the establisher frame for __builtin_frame_address (0).
19746 2021-05-05 Ivan Sorokin <vanyacpp@gmail.com>
19749 * config/i386/i386-builtins.c (ix86_cpu_model_type_node): New.
19750 (ix86_cpu_model_var): Likewise.
19751 (ix86_cpu_features2_type_node): Likewise.
19752 (ix86_cpu_features2_var): Likewise.
19753 (fold_builtin_cpu): Cache __cpu_model and __cpu_features2 with
19756 2021-05-05 Martin Sebor <msebor@redhat.com>
19758 * passes.def (pass_warn_printf): Run after SSA.
19760 2021-05-05 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
19762 * config/arm/neon.md (neon_vtst_combine<mode>): New pattern.
19763 * config/arm/predicates.md (minus_one_operand): New predicate.
19765 2021-05-05 Jeff Law <jlaw@tachyum.com>
19767 * config/avr/avr.md: Remove references to CC_STATUS_INIT.
19769 2021-05-05 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
19771 PR rtl-optimization/100263
19772 * postreload.c (move2add_valid_value_p): Ensure register can
19775 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
19777 PR rtl-optimization/100411
19778 * cfgcleanup.c (try_crossjump_to_edge): Also skip end of prologue
19779 and beginning of function markers.
19781 2021-05-05 Jeff Law <jlaw@tachyum.com>
19783 * config/cr16/cr16.h (NOTICE_UPDATE_CC): Remove.
19784 * config/cr16/cr16.c (notice_update_cc): Remove.
19785 * config/cr16/cr16-protos.h (notice_update_cc): Remove.
19787 2021-05-05 Uroš Bizjak <ubizjak@gmail.com>
19790 * config/i386/i386-expand.c (ix86_expand_int_sse_cmp):
19791 Handle V8QI, V4HI and V2SI modes.
19792 * config/i386/i386.c (ix86_build_const_vector): Handle V2SImode.
19793 (ix86_build_signbit_mask): Ditto.
19794 * config/i386/mmx.md (MMXMODE14): New mode iterator.
19795 (<smaxmin:code><MMXMODE14:mode>3): New expander.
19796 (*mmx_<smaxmin:code><MMXMODE14:mode>3): New insn pattern.
19797 (<umaxmin:code><MMXMODE24:mode>3): New expander.
19798 (*mmx_<umaxmin:code><MMXMODE24:mode>3): New insn pattern.
19799 (vec_cmp<MMXMODEI:mode><MMXMODEI:mode>): New expander.
19800 (vec_cmpu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
19801 (vcond<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
19802 (vcondu<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
19803 (vcond_mask_<MMXMODEI:mode><MMXMODEI:mode>): Ditto.
19805 2021-05-05 Eric Botcazou <ebotcazou@adacore.com>
19807 * dwarf2out.c (loc_list_from_tree_1) <DECL>: During early DWARF, do
19808 not expand the VALUE_EXPR of variables put in the non-local frame.
19809 * gimplify.c (gimplify_type_sizes) <RECORD_TYPE>: If the type is not
19810 to be ignored for debug info, ensure its variable offsets are not.
19812 2021-05-05 Richard Biener <rguenther@suse.de>
19814 PR tree-optimization/79333
19815 * tree-ssa-sccvn.c (eliminate_dom_walker::eliminate_stmt):
19816 Fold stmt following SSA edges.
19818 2021-05-05 Richard Biener <rguenther@suse.de>
19820 PR middle-end/100394
19821 * calls.c (expand_call): Preserve possibly throwing calls.
19822 * cfgexpand.c (expand_call_stmt): When a call can throw signal
19823 RTL expansion there are side-effects.
19824 * tree-ssa-dce.c (mark_stmt_if_obviously_necessary): Simplify,
19825 mark all possibly throwing stmts necessary unless we can elide
19827 * tree-ssa-dse.c (pass_dse::execute): Preserve exceptions unless
19828 -fdelete-dead-exceptions.
19829 * tree.h (DECL_PURE_P): Add note about exceptions.
19831 2021-05-05 Alexandre Oliva <oliva@adacore.com>
19833 * config/i386/vxworks.h (DBX_REGISTER_NUMBER): Make it
19836 2021-05-04 David Edelsohn <dje.gcc@gmail.com>
19838 * config/rs6000/rs6000-call.c (rs6000_output_mi_thunk): Use
19839 get_fnname_from_decl for name of thunk.
19840 * config/rs6000/rs6000.c (rs6000_declare_alias): Use assemble_name
19841 and ASM_OUTPUT_LABEL.
19842 (rs6000_xcoff_declare_function_name): Use assemble_name and
19844 (rs6000_xcoff_declare_object_name): Use ASM_OUTPUT_LABEL.
19845 (rs6000_xcoff_encode_section_info): Don't add mapping class
19846 for aliases. Always add [DS] mapping class to primary
19848 (rs6000_asm_weaken_decl): Don't explicitly add [DS].
19850 2021-05-04 Martin Sebor <msebor@redhat.com>
19852 PR middle-end/100307
19853 * builtins.c (compute_objsize_r): Clear base0 for pointers.
19855 2021-05-04 Jeff Law <jlaw@tachyum.com>
19857 * config/bfin/bfin.h (NOTICE_UPDATE_CC): Remove.
19859 2021-05-04 Segher Boessenkool <segher@kernel.crashing.org>
19861 * caller-save.c: Remove CC0.
19862 * cfgcleanup.c: Remove CC0.
19863 * cfgrtl.c: Remove CC0.
19864 * combine.c: Remove CC0.
19865 * compare-elim.c: Remove CC0.
19866 * conditions.h: Remove CC0.
19867 * config/h8300/h8300.h: Remove CC0.
19868 * config/h8300/h8300-protos.h: Remove CC0.
19869 * config/h8300/peepholes.md: Remove CC0.
19870 * config/i386/x86-tune-sched.c: Remove CC0.
19871 * config/m68k/m68k.c: Remove CC0.
19872 * config/rl78/rl78.c: Remove CC0.
19873 * config/sparc/sparc.c: Remove CC0.
19874 * config/xtensa/xtensa.c: Remove CC0.
19875 (gen_conditional_move): Use pc_rtx instead of cc0_rtx in a piece of
19876 RTL where that is used as a placeholder only.
19877 * cprop.c: Remove CC0.
19878 * cse.c: Remove CC0.
19879 * cselib.c: Remove CC0.
19880 * df-problems.c: Remove CC0.
19881 * df-scan.c: Remove CC0.
19882 * doc/md.texi: Remove CC0. Adjust an example.
19883 * doc/rtl.texi: Remove CC0. Adjust an example.
19884 * doc/tm.texi: Regenerate.
19885 * doc/tm.texi.in: Remove CC0.
19886 * emit-rtl.c: Remove CC0.
19887 * final.c: Remove CC0.
19888 * fwprop.c: Remove CC0.
19889 * gcse-common.c: Remove CC0.
19890 * gcse.c: Remove CC0.
19891 * genattrtab.c: Remove CC0.
19892 * genconfig.c: Remove CC0.
19893 * genemit.c: Remove CC0.
19894 * genextract.c: Remove CC0.
19895 * gengenrtl.c: Remove CC0.
19896 * genrecog.c: Remove CC0.
19897 * haifa-sched.c: Remove CC0.
19898 * ifcvt.c: Remove CC0.
19899 * ira-costs.c: Remove CC0.
19900 * ira.c: Remove CC0.
19901 * jump.c: Remove CC0.
19902 * loop-invariant.c: Remove CC0.
19903 * lra-constraints.c: Remove CC0.
19904 * lra-eliminations.c: Remove CC0.
19905 * optabs.c: Remove CC0.
19906 * postreload-gcse.c: Remove CC0.
19907 * postreload.c: Remove CC0.
19908 * print-rtl.c: Remove CC0.
19909 * read-rtl-function.c: Remove CC0.
19910 * reg-notes.def: Remove CC0.
19911 * reg-stack.c: Remove CC0.
19912 * reginfo.c: Remove CC0.
19913 * regrename.c: Remove CC0.
19914 * reload.c: Remove CC0.
19915 * reload1.c: Remove CC0.
19916 * reorg.c: Remove CC0.
19917 * resource.c: Remove CC0.
19918 * rtl.c: Remove CC0.
19919 * rtl.def: Remove CC0.
19920 * rtl.h: Remove CC0.
19921 * rtlanal.c: Remove CC0.
19922 * sched-deps.c: Remove CC0.
19923 * sched-rgn.c: Remove CC0.
19924 * shrink-wrap.c: Remove CC0.
19925 * simplify-rtx.c: Remove CC0.
19926 * system.h: Remove CC0. Poison NOTICE_UPDATE_CC, CC_STATUS_MDEP_INIT,
19927 CC_STATUS_MDEP, and CC_STATUS.
19928 * target.def: Remove CC0.
19929 * valtrack.c: Remove CC0.
19930 * var-tracking.c: Remove CC0.
19932 2021-05-04 Richard Biener <rguenther@suse.de>
19934 PR tree-optimization/100414
19935 * tree-ssa-phiopt.c (get_non_trapping): Do not compute dominance
19937 (tree_ssa_phiopt_worker): But unconditionally here.
19939 2021-05-04 Tobias Burnus <tobias@codesourcery.com>
19941 * omp-low.c (lower_rec_input_clauses, lower_reduction_clauses): Handle
19942 && and || with floating-point and complex arguments.
19944 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
19946 * tree-inline.c (insert_debug_decl_map): Delete.
19947 (copy_debug_stmt): Minor tweak.
19948 (setup_one_parameter): Do not use a variable if the value is either
19949 a read-only DECL or a non-addressable local variable in the caller.
19950 In this case, insert the debug-only variable in the map manually.
19951 (expand_call_inline): Do not generate a CLOBBER for these values.
19952 * tree-inline.h (debug_map): Minor tweak.
19954 2021-05-04 Eric Botcazou <ebotcazou@adacore.com>
19956 * builtins.c (builtin_with_linkage_p): Return true for stp[n]cpy.
19957 * symtab.c (symtab_node::output_to_lto_symbol_table_p): Tidy up.
19959 2021-05-04 Richard Biener <rguenther@suse.de>
19961 PR tree-optimization/100329
19962 * tree-ssa-reassoc.c (can_reassociate_p): Do not reassociate
19964 (insert_stmt_after): Assert we're not running into asm goto.
19966 2021-05-04 Richard Biener <rguenther@suse.de>
19968 PR tree-optimization/100398
19969 * tree-ssa-dse.c (pass_dse::execute): Preserve control
19972 2021-05-04 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
19974 * builtins.c (try_store_by_multiple_pieces): Fix constfun's prototype.
19976 2021-05-04 Alexandre Oliva <oliva@adacore.com>
19978 * builtins.c (try_store_by_multiple_pieces): New.
19979 (expand_builtin_memset_args): Use it. If target_char_cast
19980 fails, proceed as for non-constant val. Pass len's ctz to...
19981 * expr.c (clear_storage_hints): ... this. Try store by
19982 multiple pieces after setmem.
19983 (clear_storage): Adjust.
19984 * expr.h (clear_storage_hints): Likewise.
19985 (try_store_by_multiple_pieces): Declare.
19986 * passes.def: Replace the last copy_prop with ccp.
19988 2021-05-03 Tom de Vries <tdevries@suse.de>
19991 * omp-low.c (lower_rec_input_clauses): Disable SIMT for user-defined
19994 2021-05-03 Richard Biener <rguenther@suse.de>
19996 * tree-ssa-dse.c (dse_classify_store): Track two PHI defs.
19998 2021-05-03 Richard Biener <rguenther@suse.de>
20000 * tree-ssa-dse.c: Do not include domwalk.h but cfganal.h.
20001 (dse_dom_walker): Remove.
20002 (dse_dom_walker::dse_optimize_stmt): Rename...
20003 (dse_optimize_stmt): ... to this, pass in live_bytes sbitmap.
20004 (dse_dom_walker::before_dom_children): Inline ...
20005 (pass_dse::execute): ... here. Perform a reverse program
20008 2021-05-03 H.J. Lu <hjl.tools@gmail.com>
20011 * configure: Regenerated.
20013 2021-05-03 Ilya Leoshkevich <iii@linux.ibm.com>
20016 * config/s390/s390.c (s390_hard_fp_reg_p): New function.
20017 (s390_md_asm_adjust): Handle hard registers.
20019 2021-05-03 Jakub Jelinek <jakub@redhat.com>
20021 PR tree-optimization/100382
20022 * tree-ssa-dse.c: Include tree-eh.h.
20023 (dse_dom_walker::before_dom_children): Don't remove stmts if
20024 stmt_unremovable_because_of_non_call_eh_p is true.
20026 2021-05-02 David Edelsohn <dje.gcc@gmail.com>
20028 * varasm.c (compute_reloc_for_var): Split out from...
20029 (get_variable_section): Use it.
20030 * output.h (compute_reloc_for_var): Declare.
20031 * config/rs6000/rs6000-protos.h
20032 (rs6000_xcoff_asm_output_aligned_decl_common): Change alignment to
20034 * config/rs6000/rs6000.c (rs6000_legitimize_tls_address_aix):
20035 Don't append storage mapping class to symbol.
20036 (rs6000_xcoff_asm_named_section): Add BS and UL mapping classes.
20037 Don't convert TLS BSS to common.
20038 (rs6000_xcoff_unique_section): Don't fall back to select_secton.
20039 (rs6000_xcoff_section_type_flags): Add SECTION_BSS if DECL is
20041 (rs6000_xcoff_asm_globalize_decl_name): Don't strip storage
20043 (rs6000_xcoff_asm_output_aligned_decl_common): Align is unsigned int.
20044 If align is 0 from TLS class, use the same rules as varasm.c
20045 If not common, switch to BSS section manually.
20046 If common, emit appropriate comm or lcomm directive.
20047 (rs6000_xcoff_encode_section_info): Add logic to append all
20048 storage mapping classes.
20049 (rs6000_asm_weaken_decl): Adjust for qualname symbols.
20050 * config/rs6000/xcoff.h (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
20051 rs6000_xcoff_asm_output_aligned_decl_common.
20052 (ASM_OUTPUT_ALIGNED_DECL_LOCAL): Use
20053 rs6000_xcoff_asm_output_aligned_decl_common.
20054 (ASM_OUTPUT_TLS_COMMON): Use
20055 rs6000_xcoff_asm_output_aligned_decl_common.
20057 2021-05-02 Jakub Jelinek <jakub@redhat.com>
20060 * config/nvptx/nvptx.c (nvptx_sese_pseudo): Use nullptr instead of 0
20061 as first argument of pseudo_node_t constructors.
20063 2021-05-02 Jakub Jelinek <jakub@redhat.com>
20066 * config/i386/t-i386 (TM_H): Add $(srcdir)/config/i386/i386-isa.def.
20068 2021-05-01 Aldy Hernandez <aldyh@redhat.com>
20070 * value-range.cc (DEFINE_INT_RANGE_GC_STUBS): Remove.
20071 (gt_pch_nx (int_range<1> *&)): New.
20072 (gt_ggc_mx (int_range<1> *&)): New.
20073 * value-range.h (class irange): Add GTY support for
20076 2021-05-01 Geng Qi <gengqi@linux.alibaba.com>
20078 * doc/options.texi (Negative): Change either or to both and.
20080 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
20082 * config/aarch64/aarch64-simd-builtins.def: Add
20083 float_ml[as][q]_laneq builtin generator macros.
20084 * config/aarch64/aarch64-simd.md (mul_laneq<mode>3): Define.
20085 (aarch64_float_mla_laneq<mode>): Define.
20086 (aarch64_float_mls_laneq<mode>): Define.
20087 * config/aarch64/arm_neon.h (vmla_laneq_f32): Use RTL builtin
20088 instead of GCC vector extensions.
20089 (vmlaq_laneq_f32): Likewise.
20090 (vmls_laneq_f32): Likewise.
20091 (vmlsq_laneq_f32): Likewise.
20093 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
20095 * config/aarch64/aarch64-simd-builtins.def: Add
20096 float_ml[as]_lane builtin generator macros.
20097 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt<mode>):
20099 (mul_lane<mode>3): This, and re-order arguments.
20100 (aarch64_float_mla_lane<mode>): Define.
20101 (aarch64_float_mls_lane<mode>): Define.
20102 * config/aarch64/arm_neon.h (vmla_lane_f32): Use RTL builtin
20103 instead of GCC vector extensions.
20104 (vmlaq_lane_f32): Likewise.
20105 (vmls_lane_f32): Likewise.
20106 (vmlsq_lane_f32): Likewise.
20108 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
20110 * config/aarch64/aarch64-simd-builtins.def: Add float_ml[as]
20111 builtin generator macros.
20112 * config/aarch64/aarch64-simd.md (aarch64_float_mla<mode>):
20114 (aarch64_float_mls<mode>): Define.
20115 * config/aarch64/arm_neon.h (vmla_f32): Use RTL builtin
20116 instead of relying on GCC vector extensions.
20117 (vmla_f64): Likewise.
20118 (vmlaq_f32): Likewise.
20119 (vmlaq_f64): Likewise.
20120 (vmls_f32): Likewise.
20121 (vmls_f64): Likewise.
20122 (vmlsq_f32): Likewise.
20123 (vmlsq_f64): Likewise.
20124 * config/aarch64/iterators.md: Define VDQF_DF mode iterator.
20126 2021-04-30 Jonathan Wright <jonathan.wright@arm.com>
20128 * config/aarch64/aarch64-simd-builtins.def: Add
20129 float_ml[as]_n_builtin generator macros.
20130 * config/aarch64/aarch64-simd.md (*aarch64_mul3_elt_from_dup<mode>):
20132 (mul_n<mode>3): This, and re-order arguments.
20133 (aarch64_float_mla_n<mode>): Define.
20134 (aarch64_float_mls_n<mode>): Define.
20135 * config/aarch64/arm_neon.h (vmla_n_f32): Use RTL builtin
20136 instead of inline asm.
20137 (vmlaq_n_f32): Likewise.
20138 (vmls_n_f32): Likewise.
20139 (vmlsq_n_f32): Likewise.
20141 2021-04-30 Jonathan Wright <joanthan.wright@arm.com>
20143 * config/aarch64/aarch64-simd-builtins.def: Add pmull[2]
20144 builtin generator macros.
20145 * config/aarch64/aarch64-simd.md (aarch64_pmullv8qi): Define.
20146 (aarch64_pmull_hiv16qi_insn): Define.
20147 (aarch64_pmull_hiv16qi): Define.
20148 * config/aarch64/arm_neon.h (vmull_high_p8): Use RTL builtin
20149 instead of inline asm.
20150 (vmull_p8): Likewise.
20152 2021-04-30 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
20154 * config/avr/avr.md: Adjust peepholes to match and
20155 generate parallels with clobber of REG_CC.
20156 (mov<mode>_insn): Rename to mov<mode>_insn_split.
20157 (*mov<mode>_insn): Rename to mov<mode>_insn.
20159 2021-04-30 David Edelsohn <dje.gcc@gmail.com>
20161 * varasm.c (use_blocks_for_decl_p): Don't use section anchors
20162 for VAR_DECLs if -fdata-sections enabled.
20164 2021-04-30 Michael Meissner <meissner@linux.ibm.com>
20166 PR bootstrap/100327
20167 * config/rs6000/rs6000.c
20168 (TARGET_LIBGCC_FLOATING_MODE_SUPPORTED_P): Define.
20169 (rs6000_libgcc_floating_mode_supported_p): New target hook.
20171 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
20173 * tree-ssa-threadbackward.c (class thread_jumps): Split out code
20175 (class back_threader_registry): ...to here...
20176 (class back_threader_profitability): ...and here...
20177 (thread_jumps::thread_through_all_blocks): Remove argument.
20178 (back_threader_registry::back_threader_registry): New.
20179 (back_threader_registry::~back_threader_registry): New.
20180 (back_threader_registry::thread_through_all_blocks): New.
20181 (thread_jumps::profitable_jump_thread_path): Move from here...
20182 (back_threader_profitability::profitable_path_p): ...to here.
20183 (thread_jumps::find_taken_edge): New.
20184 (thread_jumps::convert_and_register_current_path): Move...
20185 (back_threader_registry::register_path): ...to here.
20186 (thread_jumps::register_jump_thread_path_if_profitable): Move...
20187 (thread_jumps::maybe_register_path): ...to here.
20188 (thread_jumps::handle_phi): Call find_taken_edge and
20189 maybe_register_path.
20190 (thread_jumps::handle_assignment): Same.
20191 (thread_jumps::fsm_find_control_statement_thread_paths): Remove
20192 tree argument to handle_phi and handle_assignment.
20193 (thread_jumps::find_jump_threads_backwards): Set m_name. Remove
20194 set of m_speed_p and m_max_threaded_paths.
20195 (pass_thread_jumps::execute): Remove second argument from
20196 find_jump_threads_backwards.
20197 (pass_early_thread_jumps::execute): Same.
20199 2021-04-30 Aldy Hernandez <aldyh@redhat.com>
20201 * tree-ssa-dom.c (class dom_jump_threader_simplifier): New.
20202 (class dom_opt_dom_walker): Initialize some class variables.
20203 (pass_dominator::execute): Pass evrp_range_analyzer and
20204 dom_jump_threader_simplifier to dom_opt_dom_walker.
20205 Adjust for some functions moving into classes.
20206 (simplify_stmt_for_jump_threading): Adjust and move to...
20207 (jump_threader_simplifier::simplify): ...here.
20208 (dom_opt_dom_walker::before_dom_children): Adjust for
20209 m_evrp_range_analyzer.
20210 (dom_opt_dom_walker::after_dom_children): Remove x_vr_values hack.
20211 (test_for_singularity): Place in dom_opt_dom_walker class.
20212 (dom_opt_dom_walker::optimize_stmt): The argument
20213 evrp_range_analyzer is now a class field.
20214 * tree-ssa-threadbackward.c (class thread_jumps): Add m_registry.
20215 (thread_jumps::thread_through_all_blocks): New.
20216 (thread_jumps::convert_and_register_current_path): Use m_registry.
20217 (pass_thread_jumps::execute): Adjust for thread_through_all_blocks
20218 being in the threader class.
20219 (pass_early_thread_jumps::execute): Same.
20220 * tree-ssa-threadedge.c (threadedge_initialize_values): Move...
20221 (jump_threader::jump_threader): ...here.
20222 (threadedge_finalize_values): Move...
20223 (jump_threader::~jump_threader): ...here.
20224 (jump_threader::remove_jump_threads_including): New.
20225 (jump_threader::thread_through_all_blocks): New.
20226 (record_temporary_equivalences_from_phis): Move...
20227 (jump_threader::record_temporary_equivalences_from_phis): ...here.
20228 (record_temporary_equivalences_from_stmts_at_dest): Move...
20229 (jump_threader::record_temporary_equivalences_from_stmts_at_dest):
20231 (simplify_control_stmt_condition_1): Move to jump_threader class.
20232 (simplify_control_stmt_condition): Move...
20233 (jump_threader::simplify_control_stmt_condition): ...here.
20234 (thread_around_empty_blocks): Move...
20235 (jump_threader::thread_around_empty_blocks): ...here.
20236 (thread_through_normal_block): Move...
20237 (jump_threader::thread_through_normal_block): ...here.
20238 (thread_across_edge): Move...
20239 (jump_threader::thread_across_edge): ...here.
20240 (thread_outgoing_edges): Move...
20241 (jump_threader::thread_outgoing_edges): ...here.
20242 * tree-ssa-threadedge.h: Move externally facing functings...
20243 (class jump_threader): ...here...
20244 (class jump_threader_simplifier): ...and here.
20245 * tree-ssa-threadupdate.c (struct redirection_data): Remove comment.
20246 (jump_thread_path_allocator::jump_thread_path_allocator): New.
20247 (jump_thread_path_allocator::~jump_thread_path_allocator): New.
20248 (jump_thread_path_allocator::allocate_thread_edge): New.
20249 (jump_thread_path_allocator::allocate_thread_path): New.
20250 (jump_thread_path_registry::jump_thread_path_registry): New.
20251 (jump_thread_path_registry::~jump_thread_path_registry): New.
20252 (jump_thread_path_registry::allocate_thread_edge): New.
20253 (jump_thread_path_registry::allocate_thread_path): New.
20254 (dump_jump_thread_path): Make extern.
20255 (debug (const vec<jump_thread_edge *> &path)): New.
20256 (struct removed_edges): Move to tree-ssa-threadupdate.h.
20257 (struct thread_stats_d): Remove.
20258 (remove_ctrl_stmt_and_useless_edges): Make static.
20259 (lookup_redirection_data): Move...
20260 (jump_thread_path_registry::lookup_redirection_data): ...here.
20261 (ssa_redirect_edges): Make static.
20262 (thread_block_1): Move...
20263 (jump_thread_path_registry::thread_block_1): ...here.
20264 (thread_block): Move...
20265 (jump_thread_path_registry::thread_block): ...here.
20266 (thread_through_loop_header): Move...
20267 (jump_thread_path_registry::thread_through_loop_header): ...here.
20268 (mark_threaded_blocks): Move...
20269 (jump_thread_path_registry::mark_threaded_blocks): ...here.
20270 (debug_path): Move...
20271 (jump_thread_path_registry::debug_path): ...here.
20272 (debug_all_paths): Move...
20273 (jump_thread_path_registry::dump): ..here.
20274 (rewire_first_differing_edge): Move...
20275 (jump_thread_path_registry::rewire_first_differing_edge): ...here.
20276 (adjust_paths_after_duplication): Move...
20277 (jump_thread_path_registry::adjust_paths_after_duplication): ...here.
20278 (duplicate_thread_path): Move...
20279 (jump_thread_path_registry::duplicate_thread_path): ..here.
20280 (remove_jump_threads_including): Move...
20281 (jump_thread_path_registry::remove_jump_threads_including): ...here.
20282 (thread_through_all_blocks): Move to...
20283 (jump_thread_path_registry::thread_through_all_blocks): ...here.
20284 (delete_jump_thread_path): Remove.
20285 (register_jump_thread): Move...
20286 (jump_thread_path_registry::register_jump_thread): ...here.
20287 * tree-ssa-threadupdate.h: Move externally facing functions...
20288 (class jump_thread_path_allocator): ...here...
20289 (class jump_thread_path_registry): ...and here.
20290 (thread_through_all_blocks): Remove.
20291 (struct removed_edges): New.
20292 (register_jump_thread): Remove.
20293 (remove_jump_threads_including): Remove.
20294 (delete_jump_thread_path): Remove.
20295 (remove_ctrl_stmt_and_useless_edges): Remove.
20296 (free_dom_edge_info): New prototype.
20297 * tree-vrp.c: Remove x_vr_values hack.
20298 (class vrp_jump_threader_simplifier): New.
20299 (vrp_jump_threader_simplifier::simplify): New.
20300 (vrp_jump_threader::vrp_jump_threader): Adjust method signature.
20301 Remove m_dummy_cond.
20302 Instantiate m_simplifier and m_threader.
20303 (vrp_jump_threader::thread_through_all_blocks): New.
20304 (vrp_jump_threader::simplify_stmt): Remove.
20305 (vrp_jump_threader::after_dom_children): Do not set m_dummy_cond.
20306 Remove x_vr_values hack.
20307 (execute_vrp): Adjust for thread_through_all_blocks being in a
20310 2021-04-30 Christophe Lyon <christophe.lyon@linaro.org>
20312 * genflags.c (gen_insn): Print failed expansion string.
20314 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
20316 * expr.c (alignment_for_piecewise_move): Call mode_for_size
20317 without limit to MAX_FIXED_MODE_SIZE.
20319 2021-04-30 H.J. Lu <hjl.tools@gmail.com>
20321 PR middle-end/90773
20322 * builtins.c (builtin_memset_gen_str): Don't use return from
20323 simplify_gen_subreg.
20325 2021-04-30 Uroš Bizjak <ubizjak@gmail.com>
20328 * config/i386/i386.md (*add<mode>3_carry_0r): New insn pattern.
20329 (*addsi3_carry_zext_0r): Ditto.
20330 (*sub<mode>3_carry_0): Ditto.
20331 (*subsi3_carry_zext_0r): Ditto.
20332 * config/i386/predicates.md (ix86_carry_flag_unset_operator):
20334 * config/i386/i386.c (ix86_rtx_costs) <case PLUS, case MINUS>:
20335 Also consider ix86_carry_flag_unset_operator to calculate
20336 the cost of adc/sbb insn.
20338 2021-04-30 Roman Zhuykov <zhroma@ispras.ru>
20340 PR rtl-optimization/100225
20341 PR rtl-optimization/84878
20342 * modulo-sched.c (sms_schedule): Use note_stores to skip loops
20343 where we have an instruction which touches (writes) any hard
20344 register from df->regular_block_artificial_uses set.
20345 Allow not-single-set instruction only right before basic block
20348 2021-04-30 Geng Qi <gengqi@linux.alibaba.com>
20350 * config/riscv/riscv.opt (march=,mabi=): Negative itself.
20352 2021-04-30 LevyHsu <admin@levyhsu.com>
20354 * config/riscv/riscv.c (riscv_min_arithmetic_precision): New.
20355 * config/riscv/riscv.h (TARGET_MIN_ARITHMETIC_PRECISION): New.
20356 * config/riscv/riscv.md (addv<mode>4, uaddv<mode>4): New.
20357 (subv<mode>4, usubv<mode>4, mulv<mode>4, umulv<mode>4): New.
20359 2021-04-29 Alexandre Oliva <oliva@adacore.com>
20361 * config.gcc: Merged x86 and x86_64 cpu_type-setting cases.
20363 2021-04-29 Alexandre Oliva <oliva@adacore.com>
20365 * config/i386/i386.h (ASM_OUTPUT_MAX_SKIP_PAD): Rename to...
20366 (ASM_OUTPUT_MAX_SKIP_ALIGN): ... this. Enclose in do/while(0).
20367 * config/i386/i386.c: Adjust.
20368 * config/i386/i386.md: Adjust.
20369 * config/i386/darwin.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Drop.
20370 * config/i386/dragonfly.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20371 * config/i386/freebsd.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20372 * config/i386/gas.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20373 * config/i386/gnu-user.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20374 * config/i386/iamcu.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20375 * config/i386/lynx.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20376 * config/i386/netbsd-elf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20377 * config/i386/openbsdelf.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20378 * config/i386/x86-64.h (ASM_OUTPUT_MAX_SKIP_ALIGN): Likewise.
20379 (ASM_OUTPUT_MAX_SKIP_PAD): Likewise.
20381 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
20383 * config/i386/i386-expand.c (ix86_expand_int_compare):
20384 Swap operands of GTU and LEU comparison to emit carry flag comparison.
20385 * config/i386/i386.md (*add<mode>3_carry_0): Change insn
20386 predicate to allow more combine opportunities with memory operands.
20387 (*sub<mode>3_carry_0): Ditto.
20389 2021-04-29 Richard Sandiford <richard.sandiford@arm.com>
20391 PR rtl-optimization/100303
20392 * rtl-ssa/accesses.cc (function_info::make_use_available): Take a
20393 boolean that indicates whether the use will only be used in
20394 debug instructions. Treat it in the same way that existing
20395 cross-EBB debug references would be handled if so.
20396 (function_info::make_uses_available): Likewise.
20397 * rtl-ssa/functions.h (function_info::make_uses_available): Update
20398 prototype accordingly.
20399 (function_info::make_uses_available): Likewise.
20400 * fwprop.c (try_fwprop_subst): Update call accordingly.
20402 2021-04-29 Jeff Law <jlaw@tachyum.com>
20404 * config/nios2/nios2-protos.h (nios2_fpu_insn_enabled): Move outside
20407 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
20408 Richard Biener <rguenther@suse.de>
20411 * config/i386/i386-builtin.def (IX86_BUILTIN_MASKLOADPD)
20412 (IX86_BUILTIN_MASKLOADPS, IX86_BUILTIN_MASKLOADPD256)
20413 (IX86_BUILTIN_MASKLOADPS256, IX86_BUILTIN_MASKLOADD)
20414 (IX86_BUILTIN_MASKLOADQ, IX86_BUILTIN_MASKLOADD256)
20415 (IX86_BUILTIN_MASKLOADQ256): Move from SPECIAL_ARGS
20416 to PURE_ARGS category.
20417 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
20418 Handle PURE_ARGS category.
20419 * config/i386/i386-expand.c (ix86_expand_builtin): Ditto.
20421 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
20423 * configure.ac: Check for the presence of sys/locking.h header and
20424 for whether _LK_LOCK is supported by _locking.
20425 * configure: Regenerate.
20426 * config.in: Likewise.
20427 * gcov-io.h: Define GCOV_LOCKED_WITH_LOCKING if HOST_HAS_LK_LOCK.
20428 * gcov-io.c (gcov_open): Add support for GCOV_LOCKED_WITH_LOCKING.
20429 * system.h: Include <sys/locking.h> if HAVE_SYS_LOCKING_H.
20431 2021-04-29 Uroš Bizjak <ubizjak@gmail.com>
20433 * config/i386/predicates.md (fcmov_comparison_operator):
20434 Do not check for trivial FP comparison operator.
20435 <case GEU, case LTU>: Allow CCGZmode.
20436 <case GTU, case LEU>: Do not allow CCCmode.
20437 (ix86_comparison_operator) <case GTU, case LEU>: Allow only CCmode.
20438 (ix86_carry_flag_operator): Match only LTU and UNLT code.
20439 Do not check for trivial FP comparison operator. Allow CCGZmode.
20441 2021-04-29 Tom de Vries <tdevries@suse.de>
20443 * omp-expand.c (expand_omp_simd): Add step_orig, and replace uses of
20444 fd->loop.step by either step or orig_step.
20446 2021-04-29 Eric Botcazou <ebotcazou@adacore.com>
20448 * config/sparc/sparc.c (gen_load_pcrel_sym): Delete.
20449 (load_got_register): Do the PIC dance here.
20450 (sparc_legitimize_tls_address): Simplify.
20451 (sparc_emit_probe_stack_range): Likewise.
20452 (sparc32_initialize_trampoline): Likewise.
20453 (sparc64_initialize_trampoline): Likewise.
20454 * config/sparc/sparc.md (load_pcrel_sym<P:mode>): Add @ marker.
20455 (probe_stack_range<P:mode>): Likewise.
20456 (flush<P:mode>): Likewise.
20457 (tgd_hi22<P:mode>): Likewise.
20458 (tgd_lo10<P:mode>): Likewise.
20459 (tgd_add<P:mode>): Likewise.
20460 (tgd_call<P:mode>): Likewise.
20461 (tldm_hi22<P:mode>): Likewise.
20462 (tldm_lo10<P:mode>): Likewise.
20463 (tldm_add<P:mode>): Likewise.
20464 (tldm_call<P:mode>): Likewise.
20465 (tldo_hix22<P:mode>): Likewise.
20466 (tldo_lox10<P:mode>): Likewise.
20467 (tldo_add<P:mode>): Likewise.
20468 (tie_hi22<P:mode>): Likewise.
20469 (tie_lo10<P:mode>): Likewise.
20470 (tie_add<P:mode>): Likewise.
20471 (tle_hix22<P:mode>): Likewise.
20472 (tle_lox10<P:mode>): Likewise.
20473 (stack_protect_setsi): Rename to...
20474 (stack_protect_set32): ...this.
20475 (stack_protect_setdi): Rename to...
20476 (stack_protect_set64): ...this.
20477 (stack_protect_set): Adjust calls to above.
20478 (stack_protect_testsi): Rename to...
20479 (stack_protect_test32): ...this.
20480 (stack_protect_testdi): Rename to...
20481 (stack_protect_test64): ...this.
20482 (stack_protect_test): Adjust calls to above.
20484 2021-04-29 H.J. Lu <hjl.tools@gmail.com>
20486 PR middle-end/90773
20487 * builtins.c (builtin_memcpy_read_str): Add a dummy argument.
20488 (builtin_strncpy_read_str): Likewise.
20489 (builtin_memset_read_str): Add an argument for the previous RTL
20490 information and generate the new RTL from the previous RTL info.
20491 (builtin_memset_gen_str): Likewise.
20492 * builtins.h (builtin_strncpy_read_str): Update the prototype.
20493 (builtin_memset_read_str): Likewise.
20494 * expr.c (by_pieces_ninsns): If targetm.overlap_op_by_pieces_p()
20495 returns true, round up size and alignment to the widest integer
20496 mode for maximum size.
20497 (pieces_addr::adjust): Add a pointer to by_pieces_prev argument
20498 and pass it to m_constfn.
20499 (op_by_pieces_d): Add m_push and m_overlap_op_by_pieces.
20500 (op_by_pieces_d::op_by_pieces_d): Add a bool argument to
20501 initialize m_push. Initialize m_overlap_op_by_pieces with
20502 targetm.overlap_op_by_pieces_p ().
20503 (op_by_pieces_d::run): Pass the previous RTL information to
20504 pieces_addr::adjust and generate overlapping operations if
20505 m_overlap_op_by_pieces is true.
20507 (move_by_pieces_d::move_by_pieces_d): Updated for op_by_pieces_d
20509 (store_by_pieces_d::store_by_pieces_d): Updated for op_by_pieces_d
20511 (can_store_by_pieces): Use by_pieces_constfn on constfun.
20512 (store_by_pieces): Use by_pieces_constfn on constfun. Updated
20513 for op_by_pieces_d change.
20514 (clear_by_pieces_1): Add a dummy argument.
20515 (clear_by_pieces): Updated for op_by_pieces_d change.
20516 (compare_by_pieces_d::compare_by_pieces_d): Likewise.
20517 (string_cst_read_str): Add a dummy argument.
20518 * expr.h (by_pieces_constfn): Add a dummy argument.
20519 (by_pieces_prev): New.
20520 * target.def (overlap_op_by_pieces_p): New target hook.
20521 * config/i386/i386.c (TARGET_OVERLAP_OP_BY_PIECES_P): New.
20522 * doc/tm.texi.in: Add TARGET_OVERLAP_OP_BY_PIECES_P.
20523 * doc/tm.texi: Regenerated.
20525 2021-04-29 Richard Biener <rguenther@suse.de>
20527 PR tree-optimization/100253
20528 * tree-vect-stmts.c (vectorizable_load): Do not assume
20529 element alignment when DR_MISALIGNMENT is -1.
20530 (vectorizable_store): Likewise.
20532 2021-04-29 Jakub Jelinek <jakub@redhat.com>
20535 * config/aarch64/aarch64.c (aarch64_add_offset_1_temporaries): Use
20536 absu_hwi instead of abs_hwi.
20538 2021-04-29 Richard Biener <rguenther@suse.de>
20540 PR middle-end/38474
20541 * tree-ssa-structalias.c (add_graph_edge): Avoid direct
20542 forwarding when indirect forwarding through ESCAPED
20545 2021-04-29 Tom de Vries <tdevries@suse.de>
20548 * internal-fn.c (expand_GOMP_SIMT_ENTER_ALLOC)
20549 (expand_GOMP_SIMT_LAST_LANE, expand_GOMP_SIMT_ORDERED_PRED)
20550 (expand_GOMP_SIMT_VOTE_ANY, expand_GOMP_SIMT_XCHG_BFLY)
20551 (expand_GOMP_SIMT_XCHG_IDX): Ensure target is assigned to.
20553 2021-04-29 Richard Biener <rguenther@suse.de>
20555 PR tree-optimization/99912
20556 * tree-ssa-dse.c (dse_dom_walker::m_need_cfg_cleanup): New.
20557 (dse_dom_walker::todo): Likewise.
20558 (dse_dom_walker::dse_optimize_stmt): Move VDEF check to the
20560 (dse_dom_walker::before_dom_children): Remove trivially
20561 dead SSA defs and schedule CFG cleanup if we removed all
20563 (pass_dse::execute): Get TODO as computed by the DOM walker
20564 and return it. Wipe dominator info earlier.
20566 2021-04-29 Richard Biener <rguenther@suse.de>
20569 * ipa-prop.c (ipcp_modif_dom_walker::before_dom_children):
20570 Track blocks to cleanup EH in new m_need_eh_cleanup.
20571 (ipcp_modif_dom_walker::cleanup_eh): New.
20572 (ipcp_transform_function): Release dominator info before
20575 2021-04-29 Martin Sebor <msebor@redhat.com>
20577 PR middle-end/100250
20578 * attribs.c (attr_access::array_as_string): Avoid dereferencing
20579 a pointer when it's null.
20581 2021-04-29 Martin Sebor <msebor@redhat.com>
20583 * Makefile.in (OBJS): Add ipa-free-lang-data.o.
20584 * ipa-free-lang-data.cc: New file.
20585 * tree.c: Move pass free_lang_data to file above.
20586 (build_array_type_1): Declare extern.
20587 * tree.h (build_array_type_1): Declare.
20589 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20591 * config/aarch64/aarch64-simd-builtins.def: Modify comment to
20592 make consistent with updated RTL pattern.
20593 * config/aarch64/aarch64-simd.md (aarch64_<sur>qmovn<mode>):
20594 Implement using ss_truncate and us_truncate rather than
20596 * config/aarch64/iterators.md: Remove redundant unspecs and
20597 iterator: UNSPEC_[SU]QXTN and SUQMOVN respectively.
20599 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20601 * config/aarch64/arm_acle.h (__attribute__): Make intrinsic
20602 attributes consistent with those defined in arm_neon.h.
20604 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20606 * config/aarch64/arm_fp16.h (__attribute__): Make intrinsic
20607 attributes consistent with those defined in arm_neon.h.
20609 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20611 * config/aarch64/aarch64-simd-builtins.def: Add
20612 float_trunc_rodd builtin generator macros.
20613 * config/aarch64/aarch64-simd.md (aarch64_float_trunc_rodd_df):
20615 (aarch64_float_trunc_rodd_lo_v2sf): Define.
20616 (aarch64_float_trunc_rodd_hi_v4sf_le): Define.
20617 (aarch64_float_trunc_rodd_hi_v4sf_be): Define.
20618 (aarch64_float_trunc_rodd_hi_v4sf): Define.
20619 * config/aarch64/arm_neon.h (vcvtx_f32_f64): Use RTL builtin
20620 instead of inline asm.
20621 (vcvtx_high_f32_f64): Likewise.
20622 (vcvtxd_f32_f64): Likewise.
20623 * config/aarch64/iterators.md: Add FCVTXN unspec.
20625 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20627 * config/aarch64/aarch64-simd-builtins.def: Add tbx1 builtin
20629 * config/aarch64/aarch64-simd.md (aarch64_tbx1<mode>):
20631 * config/aarch64/arm_neon.h (vqtbx1_s8): USE RTL builtin
20632 instead of inline asm.
20633 (vqtbx1_u8): Likewise.
20634 (vqtbx1_p8): Likewise.
20635 (vqtbx1q_s8): Likewise.
20636 (vqtbx1q_u8): Likewise.
20637 (vqtbx1q_p8): Likewise.
20638 (vtbx2_s8): Likewise.
20639 (vtbx2_u8): Likewise.
20640 (vtbx2_p8): Likewise.
20642 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20644 * config/aarch64/aarch64-simd-builtins.def: Add tbl1 builtin
20646 * config/aarch64/arm_neon.h (vqtbl1_p8): Use RTL builtin
20647 instead of inline asm.
20648 (vqtbl1_s8): Likewise.
20649 (vqtbl1_u8): Likewise.
20650 (vqtbl1q_p8): Likewise.
20651 (vqtbl1q_s8): Likewise.
20652 (vqtbl1q_u8): Likewise.
20653 (vtbl1_s8): Likewise.
20654 (vtbl1_u8): Likewise.
20655 (vtbl1_p8): Likewise.
20656 (vtbl2_s8): Likewise.
20657 (vtbl2_u8): Likewise.
20658 (vtbl2_p8): Likewise.
20660 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20662 * config/aarch64/aarch64-simd-builtins.def: Add polynomial
20663 ssri_n buitin generator macro.
20664 * config/aarch64/arm_neon.h (vsri_n_p8): Use RTL builtin
20665 instead of inline asm.
20666 (vsri_n_p16): Likewise.
20667 (vsri_n_p64): Likewise.
20668 (vsriq_n_p8): Likewise.
20669 (vsriq_n_p16): Likewise.
20670 (vsriq_n_p64): Likewise.
20672 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20674 * config/aarch64/aarch64-simd-builtins.def: Use VALLP mode
20675 iterator for polynomial ssli_n builtin generator macro.
20676 * config/aarch64/arm_neon.h (vsli_n_p8): Use RTL builtin
20677 instead of inline asm.
20678 (vsli_n_p16): Likewise.
20679 (vsliq_n_p8): Likewise.
20680 (vsliq_n_p16): Likewise.
20681 * config/aarch64/iterators.md: Define VALLP mode iterator.
20683 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20685 * config/aarch64/aarch64-simd-builtins.def: Use VDQV_L
20686 iterator to generate [su]adalp RTL builtins.
20687 * config/aarch64/aarch64-simd.md: Use VDQV_L iterator in
20688 [su]adalp RTL pattern.
20689 * config/aarch64/arm_neon.h (vpadal_s32): Use RTL builtin
20690 instead of inline asm.
20691 (vpadal_u32): Likewise.
20693 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20695 * config/aarch64/aarch64-simd-builtins.def: Add [su]addlp
20696 builtin generator macros.
20697 * config/aarch64/aarch64-simd.md (aarch64_<su>addlp<mode>):
20699 * config/aarch64/arm_neon.h (vpaddl_s8): Use RTL builtin
20700 instead of inline asm.
20701 (vpaddl_s16): Likewise.
20702 (vpaddl_s32): Likewise.
20703 (vpaddl_u8): Likewise.
20704 (vpaddl_u16): Likewise.
20705 (vpaddl_u32): Likewise.
20706 (vpaddlq_s8): Likewise.
20707 (vpaddlq_s16): Likewise.
20708 (vpaddlq_s32): Likewise.
20709 (vpaddlq_u8): Likewise.
20710 (vpaddlq_u16): Likewise.
20711 (vpaddlq_u32): Liwewise.
20712 * config/aarch64/iterators.md: Define [SU]ADDLP unspecs with
20713 appropriate attributes.
20715 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20717 * config/aarch64/aarch64-simd-builtins.def: Use VDQ_I iterator
20718 for aarch64_addp<mode> builtin macro generator.
20719 * config/aarch64/aarch64-simd.md: Use VDQ_I iterator in
20720 aarch64_addp<mode> RTL pattern.
20721 * config/aarch64/arm_neon.h (vpaddq_s8): Use RTL builtin
20722 instead of inline asm.
20723 (vpaddq_s16): Likewise.
20724 (vpaddq_s32): Likewise.
20725 (vpaddq_s64): Likewise.
20726 (vpaddq_u8): Likewise.
20727 (vpaddq_u16): Likewise.
20728 (vpaddq_u32): Likewise.
20729 (vpaddq_u64): Likewise.
20731 2021-04-28 Jonathan Wright <jonathan.wright@arm.com>
20733 * config/aarch64/aarch64-simd-builtins.def: Add sq[r]dmulh_n
20734 builtin generator macros.
20735 * config/aarch64/aarch64-simd.md (aarch64_sq<r>dmulh_n<mode>):
20737 * config/aarch64/arm_neon.h (vqdmulh_n_s16): Use RTL builtin
20738 instead of inline asm.
20739 (vqdmulh_n_s32): Likewise.
20740 (vqdmulhq_n_s16): Likewise.
20741 (vqdmulhq_n_s32): Likewise.
20742 (vqrdmulh_n_s16): Likewise.
20743 (vqrdmulh_n_s32): Likewise.
20744 (vqrdmulhq_n_s16): Likewise.
20745 (vqrdmulhq_n_s32): Likewise.
20747 2021-04-28 Tobias Burnus <tobias@codesourcery.com>
20749 * doc/install.texi (--enable-offload-defaulted): Document.
20751 2021-04-28 Senthil Kumar Selvaraj <saaadhu@gcc.gnu.org>
20753 * config/avr/avr-dimode.md: Turn existing patterns into
20754 define_insn_and_split style patterns where the splitter
20755 adds a clobber of the condition code register. Drop "cc"
20756 attribute. Add new patterns to match output of
20758 * config/avr/avr-fixed.md: Likewise.
20759 * config/avr/avr.c (cc_reg_rtx): New.
20760 (avr_parallel_insn_from_insns): Adjust insn count
20761 for removal of set of cc0.
20762 (avr_is_casesi_sequence): Likewise.
20763 (avr_casei_sequence_check_operands): Likewise.
20764 (avr_optimize_casesi): Likewise. Also insert
20765 new insns after jump_insn.
20766 (avr_pass_casesi::avr_rest_of_handle_casesi): Adjust
20767 for removal of set of cc0.
20768 (avr_init_expanders): Initialize cc_reg_rtx.
20769 (avr_regno_reg_class): Handle REG_CC.
20770 (cond_string): Remove usage of CC_OVERFLOW_UNUSABLE.
20771 (avr_notice_update_cc): Remove function.
20772 (ret_cond_branch): Remove usage of CC_OVERFLOW_UNUSABLE.
20773 (compare_condition): Adjust for PARALLEL with
20775 (out_shift_with_cnt): Likewise.
20776 (ashlhi3_out): Likewise.
20777 (ashrhi3_out): Likewise.
20778 (lshrhi3_out): Likewise.
20779 (avr_class_max_nregs): Return single reg for REG_CC.
20780 (avr_compare_pattern): Check for REG_CC instead
20782 (avr_reorg_remove_redundant_compare): Likewise.
20783 (avr_reorg):Adjust for PARALLEL with REG_CC clobber.
20784 (avr_hard_regno_nregs): Return single reg for REG_CC.
20785 (avr_hard_regno_mode_ok): Allow only CCmode for REG_CC.
20786 (avr_md_asm_adjust): Clobber REG_CC.
20787 (TARGET_HARD_REGNO_NREGS): Define.
20788 (TARGET_CLASS_MAX_NREGS): Define.
20789 (TARGET_MD_ASM_ADJUST): Define.
20790 * config/avr/avr.h (FIRST_PSEUDO_REGISTER): Adjust
20792 (enum reg_class): Add CC_REG class.
20793 (NOTICE_UPDATE_CC): Remove.
20794 (CC_OVERFLOW_UNUSABLE): Remove.
20795 (CC_NO_CARRY): Remove.
20796 * config/avr/avr.md: Turn existing patterns into
20797 define_insn_and_split style patterns where the splitter
20798 adds a clobber of the condition code register. Drop "cc"
20799 attribute. Add new patterns to match output of
20801 (sez): Remove unused pattern.
20803 2021-04-28 Richard Earnshaw <rearnsha@arm.com>
20806 * config/arm/arm.c (arm_hard_regno_mode_ok): Only allow VPR to be
20809 2021-04-28 Richard Sandiford <richard.sandiford@arm.com>
20812 * config/aarch64/constraints.md (Utq): Require the address to
20813 be valid for both the element mode and for V2DImode.
20815 2021-04-28 Jakub Jelinek <jakub@redhat.com>
20816 Tobias Burnus <tobias@codesourcery.com>
20818 * configure.ac (OFFLOAD_DEFAULTED): AC_DEFINE if offload-defaulted.
20819 * gcc.c (process_command): New variable.
20820 (driver::maybe_putenv_OFFLOAD_TARGETS): If OFFLOAD_DEFAULTED,
20821 set it if -foffload is defaulted.
20822 * lto-wrapper.c (OFFLOAD_TARGET_DEFAULT_ENV): Define.
20823 (compile_offload_image): If OFFLOAD_DEFAULTED and
20824 OFFLOAD_TARGET_DEFAULT is in the environment, don't fail
20825 if corresponding mkoffload can't be found.
20826 (compile_images_for_offload_targets): Likewise. Free and clear
20827 offload_names if no valid offload is found.
20828 * config.in: Regenerate.
20829 * configure: Regenerate.
20831 2021-04-28 Richard Biener <rguenther@suse.de>
20833 PR tree-optimization/100292
20834 * tree-vect-generic.c (expand_vector_condition): Do not fold
20837 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
20839 * config/rs6000/aix.h (SUBTARGET_DRIVER_SELF_SPECS): New.
20840 * config/rs6000/aix64.opt (m64): New.
20843 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
20845 * config/vax/vax.c (print_operand_address, vax_address_cost_1)
20846 (index_term_p): Handle ASHIFT too.
20848 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
20850 * config/vax/builtins.md (jbb<ccss>i<mode>): Remove operand #3.
20851 (sync_lock_test_and_set<mode>): Adjust accordingly.
20852 (sync_lock_release<mode>): Likewise.
20854 2021-04-27 Maciej W. Rozycki <macro@orcam.me.uk>
20856 * config/vax/vax-protos.h (adjacent_operands_p): Remove
20858 * config/vax/vax.c (adjacent_operands_p): Remove.
20860 2021-04-27 Maciej W. Rozycki <macro@linux-mips.org>
20862 * ifcvt.c (dead_or_predicable) [!IFCVT_MODIFY_TESTS]: Fall
20863 through to the non-conditional execution case if getting the
20864 condition for conditional execution has failed.
20866 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
20868 PR middle-end/100284
20869 * gimple.c (gimple_could_trap_p_1): Remove VEC_COND_EXPR test.
20870 * tree-eh.c (operation_could_trap_p): Handle VEC_COND_EXPR rather
20871 than asserting on it.
20873 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
20875 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): Protect
20876 with TARGET_AIX_OS.
20878 2021-04-27 David Edelsohn <dje.gcc@gmail.com>
20881 * calls.c (precompute_register_parameters): Additionally test
20882 targetm.precompute_tls_p to pre-compute argument.
20883 * config/rs6000/aix.h (TARGET_PRECOMPUTE_TLS_P): Define.
20884 * config/rs6000/rs6000.c (rs6000_aix_precompute_tls_p): New.
20885 * target.def (precompute_tls_p): New.
20886 * doc/tm.texi.in (TARGET_PRECOMPUTE_TLS_P): Add hook documentation.
20887 * doc/tm.texi: Regenerated.
20889 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20892 * config/aarch64/aarch64.c (aarch64_print_operand): Cast -UINTVAL
20893 back to HOST_WIDE_INT.
20895 2021-04-27 Bernd Edlinger <bernd.edlinger@hotmail.de>
20898 * simplify-rtx.c (simplify_context::simplify_subreg): Check the
20899 memory alignment for the outer mode.
20901 2021-04-27 H.J. Lu <hjl.tools@gmail.com>
20903 PR middle-end/90773
20904 * expr.c (op_by_pieces_d::get_usable_mode): New member function.
20905 (op_by_pieces_d::run): Cange a while loop to a do-while loop.
20907 2021-04-27 Alex Coplan <alex.coplan@arm.com>
20910 * config/arm/arm.c (arm_split_compare_and_swap): Fix up codegen
20911 with negative immediates: ensure we expand cbranchsi4_scratch
20912 correctly and ensure we satisfy its constraints.
20913 * config/arm/sync.md
20914 (@atomic_compare_and_swap<CCSI:arch><NARROW:mode>_1): Don't
20915 attempt to tie two output operands together with constraints;
20916 collapse two alternatives.
20917 (@atomic_compare_and_swap<CCSI:arch><SIDI:mode>_1): Likewise.
20918 * config/arm/thumb1.md (cbranchsi4_neg_late): New.
20920 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20923 * config/aarch64/predicates.md (aarch64_sub_immediate,
20924 aarch64_plus_immediate): Use -UINTVAL instead of -INTVAL.
20925 * config/aarch64/aarch64.md (casesi, rotl<mode>3): Likewise.
20926 * config/aarch64/aarch64.c (aarch64_print_operand,
20927 aarch64_split_atomic_op, aarch64_expand_subvti): Likewise.
20929 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20931 PR tree-optimization/100239
20932 * tree-vect-generic.c (lower_vec_perm): Don't accept constant
20933 permutations with all indices from the first zero element as vec_shl.
20935 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20937 PR rtl-optimization/100254
20938 * cfgcleanup.c (outgoing_edges_match): Check REG_EH_REGION on
20939 last1 and last2 insns rather than BB_END (bb1) and BB_END (bb2) insns.
20941 2021-04-27 Richard Biener <rguenther@suse.de>
20943 PR tree-optimization/99912
20944 * passes.def: Add comment about new TODO_remove_unused_locals.
20945 * tree-stdarg.c (pass_data_stdarg): Run TODO_remove_unused_locals
20948 2021-04-27 Richard Biener <rguenther@suse.de>
20950 PR tree-optimization/99912
20951 * passes.def (pass_all_optimizations): Add pass_dse before
20952 the first pass_dce, move the first pass_dse before the
20953 pass_dce following pass_pre.
20955 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20957 PR tree-optimization/95527
20958 * generic-match-head.c: Include tm.h.
20959 * gimple-match-head.c: Include tm.h.
20960 * match.pd (CLZ == INTEGER_CST): Don't use
20961 #ifdef CLZ_DEFINED_VALUE_AT_ZERO, only test CLZ_DEFINED_VALUE_AT_ZERO
20962 if clz == CFN_CLZ. Add missing val declaration.
20963 (CTZ cmp CST): New simplifications.
20965 2021-04-27 Jakub Jelinek <jakub@redhat.com>
20967 PR tree-optimization/96696
20968 * expr.c (expand_expr_divmod): New function.
20969 (expand_expr_real_2) <case TRUNC_DIV_EXPR>: Use it for truncations and
20970 divisions. Formatting fixes.
20971 <case MULT_EXPR>: Optimize x / y * y as x - x % y if the latter is
20974 2021-04-27 Martin Jambor <mjambor@suse.cz>
20977 * ipa-param-manipulation.c (ipa_param_adjustments::modify_call):
20978 If removing a call statement LHS SSA name, release it.
20980 2021-04-27 Richard Earnshaw <rearnsha@arm.com>
20983 * config/arm/arm.c (THUMB2_WORK_REGS): Check PIC_OFFSET_TABLE_REGNUM
20984 is valid before including it in the mask.
20986 2021-04-27 Richard Sandiford <richard.sandiford@arm.com>
20989 * config/aarch64/aarch64.c (aarch64_comp_type_attributes): Handle
20992 2021-04-27 Richard Biener <rguenther@suse.de>
20994 PR tree-optimization/100051
20995 * tree-ssa-alias.c (indirect_ref_may_alias_decl_p): Add
20996 disambiguator based on access size vs. decl size.
20998 2021-04-27 Richard Biener <rguenther@suse.de>
21000 PR tree-optimization/100278
21001 * tree-ssa-pre.c (compute_avail): Give up when we cannot
21002 adjust TBAA beacuse of mismatching bases.
21004 2021-04-27 Jakub Jelinek <jakub@redhat.com>
21007 * config/i386/i386.md (*<insn><mode>3_mask, *<insn><mode>3_mask_1):
21008 For any_rotate define_insn_split and following splitters, use
21009 SWI iterator instead of SWI48.
21011 2021-04-27 Richard Biener <rguenther@suse.de>
21013 PR tree-optimization/99776
21014 * match.pd (bit_field_ref (ctor)): Relax element extract
21015 type compatibility checks.
21017 2021-04-27 Cui,Lili <lili.cui@intel.com>
21019 * common/config/i386/i386-common.c (processor_names):
21020 Sync processor_names with processor_type.
21021 * config/i386/i386-options.c (processor_cost_table):
21022 Sync processor_cost_table with processor_type.
21024 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
21026 * value-range.cc (irange::irange_set_1bit_anti_range): Add assert.
21027 (irange::set): Call irange_set_1bit_anti_range for handling all
21028 1-bit ranges. Fall through on ~[MIN,MAX].
21030 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
21032 * value-range.cc (irange::legacy_num_pairs): Remove.
21033 (irange::invert): Change gcc_assert to gcc_checking_assert.
21034 * value-range.h (irange::num_pairs): Adjust for a cached
21035 num_pairs(). Also, rename all gcc_assert's to
21036 gcc_checking_assert's.
21038 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
21040 * value-range.cc (irange::operator=): Set m_kind.
21041 (irange::copy_to_legacy): Handle varying and undefined sources
21042 as a legacy copy since they can be easily copied.
21043 (irange::irange_set): Set m_kind.
21044 (irange::irange_set_anti_range): Same.
21045 (irange::set): Rename normalize_min_max to normalize_kind.
21046 (irange::verify_range): Adjust for multi-ranges having the
21048 (irange::irange_union): Set m_kind.
21049 (irange::irange_intersect): Same.
21050 (irange::invert): Same.
21051 * value-range.h (irange::kind): Always return m_kind.
21052 (irange::varying_p): Rename to...
21053 (irange::varying_comptaible_p): ...this.
21054 (irange::undefined_p): Only look at m_kind.
21055 (irange::irange): Always set VR_UNDEFINED if applicable.
21056 (irange::set_undefined): Always set VR_UNDEFINED.
21057 (irange::set_varying): Always set m_kind to VR_VARYING.
21058 (irange::normalize_min_max): Rename to...
21059 (irange::normalize_kind): ...this.
21061 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
21063 * gimple-ssa-evrp-analyze.c (evrp_range_analyzer::set_ssa_range_info):
21064 Adjust for constant_p including varying_p.
21065 * tree-vrp.c (vrp_prop::finalize): Same.
21066 (determine_value_range): Same.
21067 * vr-values.c (vr_values::range_of_expr): Same.
21068 * value-range.cc (irange::symbolic_p): Do not check varying_p.
21069 (irange::constant_p): Same.
21071 2021-04-26 Aldy Hernandez <aldyh@redhat.com>
21073 * value-range.cc (irange::legacy_lower_bound): Replace
21074 !undefined_p check with num_ranges > 0.
21075 (irange::legacy_upper_bound): Same.
21076 * value-range.h (irange::type): Same.
21077 (irange::lower_bound): Same.
21078 (irange::upper_bound): Same.
21080 2021-04-26 Richard Biener <rguenther@suse.de>
21082 PR tree-optimization/99956
21083 * gimple-loop-interchange.cc (compute_access_stride):
21084 Try instantiating the access in a shallower loop nest
21085 if instantiating failed.
21086 (compute_access_strides): Pass adjustable loop_nest
21087 to compute_access_stride.
21089 2021-04-26 Christophe Lyon <christophe.lyon@linaro.org>
21091 * doc/sourcebuild.texi (arm_cmse_hw): Document.
21093 2021-04-26 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
21095 * config/aarch64/iterators.md (vwcore): Handle V4BF, V8BF.
21097 2021-04-26 Thomas Schwinge <thomas@codesourcery.com>
21098 Nathan Sidwell <nathan@codesourcery.com>
21099 Tom de Vries <vries@codesourcery.com>
21100 Julian Brown <julian@codesourcery.com>
21101 Kwok Cheung Yeung <kcy@codesourcery.com>
21103 * omp-offload.c (oacc_validate_dims): Implement
21104 '-Wopenacc-parallelism'.
21105 * doc/invoke.texi (-Wopenacc-parallelism): Document.
21107 2021-04-26 Richard Biener <rguenther@suse.de>
21109 * tree-cfg.h (gimplify_build1): Remove.
21110 (gimplify_build2): Likewise.
21111 (gimplify_build3): Likewise.
21112 * tree-cfg.c (gimplify_build1): Move to tree-vect-generic.c.
21113 (gimplify_build2): Likewise.
21114 (gimplify_build3): Likewise.
21115 * tree-vect-generic.c (gimplify_build1): Move from tree-cfg.c.
21117 (gimplify_build2): Likewise.
21118 (gimplify_build3): Likewise.
21119 (tree_vec_extract): Use resimplify with following SSA edges.
21120 (expand_vector_parallel): Avoid passing NULL size/bitpos
21121 to tree_vec_extract.
21122 * expr.c (store_constructor): Deal with zero-element CTORs.
21123 * match.pd (bit_field_ref <vector CTOR>): Make sure to
21124 produce vector constants when possible.
21126 2021-04-26 Richard Biener <rguenther@suse.de>
21128 * tree-complex.c: Include gimple-fold.h.
21129 (expand_complex_addition): Use gimple_build.
21130 (expand_complex_multiplication_components): Likewise.
21131 (expand_complex_multiplication): Likewise.
21132 (expand_complex_div_straight): Likewise.
21133 (expand_complex_div_wide): Likewise.
21134 (expand_complex_division): Likewise.
21135 (expand_complex_conjugate): Likewise.
21136 (expand_complex_comparison): Likewise.
21138 2021-04-26 Richard Biener <rguenther@suse.de>
21140 * tree-ssa-phiopt.c (two_value_replacement): Remove use
21141 of legacy gimplify_buildN API.
21143 2021-04-26 Richard Biener <rguenther@suse.de>
21145 PR tree-optimization/99473
21146 * tree-ssa-phiopt.c (cond_store_replacement): Handle all
21149 2021-04-26 Richard Biener <rguenther@suse.de>
21151 * config/rs6000/rs6000-call.c (rs6000_gimple_fold_builtin):
21152 Use replace_call_with_value.
21154 2021-04-26 Richard Biener <rguenther@suse.de>
21156 * tree-ssa-propagate.h (valid_gimple_rhs_p): Remove.
21157 (update_gimple_call): Likewise.
21158 (update_call_from_tree): Likewise.
21159 * tree-ssa-propagate.c (valid_gimple_rhs_p): Remove.
21160 (valid_gimple_call_p): Likewise.
21161 (move_ssa_defining_stmt_for_defs): Likewise.
21162 (finish_update_gimple_call): Likewise.
21163 (update_gimple_call): Likewise.
21164 (update_call_from_tree): Likewise.
21165 (propagate_tree_value_into_stmt): Use replace_call_with_value.
21166 * gimple-fold.h (update_gimple_call): Declare.
21167 * gimple-fold.c (valid_gimple_rhs_p): Move here from
21168 tree-ssa-propagate.c.
21169 (update_gimple_call): Likewise.
21170 (valid_gimple_call_p): Likewise.
21171 (finish_update_gimple_call): Likewise, and simplify.
21172 (gimplify_and_update_call_from_tree): Implement
21173 update_call_from_tree functionality, avoid excessive
21174 push/pop_gimplify_context.
21175 (gimple_fold_builtin): Use only gimplify_and_update_call_from_tree.
21176 (gimple_fold_call): Likewise.
21177 * gimple-ssa-sprintf.c (try_substitute_return_value): Likewise.
21178 * tree-ssa-ccp.c (ccp_folder::fold_stmt): Likewise.
21179 (pass_fold_builtins::execute): Likewise.
21180 (optimize_stack_restore): Use replace_call_with_value.
21181 * tree-cfg.c (fold_loop_internal_call): Likewise.
21182 * tree-ssa-dce.c (maybe_optimize_arith_overflow): Use
21183 only gimplify_and_update_call_from_tree.
21184 * tree-ssa-strlen.c (handle_builtin_strlen): Likewise.
21185 (handle_builtin_strchr): Likewise.
21186 * tsan.c: Include gimple-fold.h instead of tree-ssa-propagate.h.
21188 2021-04-26 Jakub Jelinek <jakub@redhat.com>
21191 * vmsdbgout.c (ASM_OUTPUT_DEBUG_STRING, vmsdbgout_begin_block,
21192 vmsdbgout_end_block, lookup_filename, vmsdbgout_source_line): Remove
21195 2021-04-25 liuhongt <hongtao.liu@intel.com>
21198 * config/i386/i386-builtin.def (BDESC): Change the icode of
21199 the following builtins to CODE_FOR_nothing.
21200 * config/i386/i386.c (ix86_gimple_fold_builtin): Fold
21201 IX86_BUILTIN_PCMPEQB128, IX86_BUILTIN_PCMPEQW128,
21202 IX86_BUILTIN_PCMPEQD128, IX86_BUILTIN_PCMPEQQ,
21203 IX86_BUILTIN_PCMPEQB256, IX86_BUILTIN_PCMPEQW256,
21204 IX86_BUILTIN_PCMPEQD256, IX86_BUILTIN_PCMPEQQ256,
21205 IX86_BUILTIN_PCMPGTB128, IX86_BUILTIN_PCMPGTW128,
21206 IX86_BUILTIN_PCMPGTD128, IX86_BUILTIN_PCMPGTQ,
21207 IX86_BUILTIN_PCMPGTB256, IX86_BUILTIN_PCMPGTW256,
21208 IX86_BUILTIN_PCMPGTD256, IX86_BUILTIN_PCMPGTQ256.
21209 * config/i386/sse.md (avx2_eq<mode>3): Deleted.
21210 (sse2_eq<mode>3): Ditto.
21211 (sse4_1_eqv2di3): Ditto.
21212 (sse2_gt<mode>3): Rename to ..
21213 (*sse2_gt<mode>3): .. this.
21215 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
21218 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
21221 * config/darwin.c (darwin_binds_local_p): Assume that any
21222 public symbol might be interposed for PIC code. Update function
21223 header comment to reflect current Darwin capability.
21225 2021-04-24 Iain Sandoe <iain@sandoe.co.uk>
21228 * config/darwin.c (darwin_binds_local_p): Assume that any
21229 public symbol might be interposed for PIC code. Update function
21230 header comment to reflect current Darwin capability.
21232 2021-04-24 Richard Sandiford <richard.sandiford@arm.com>
21234 * doc/sourcebuild.texi: Document no-opts and any-opts target
21237 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
21239 * config/bpf/bpf.h (ASM_OUTPUT_ALIGNED_BSS): Use .type and .lcomm.
21241 2021-04-23 YiFei Zhu <zhuyifei1999@gmail.com>
21243 * config/bpf/bpf.h (FUNCTION_BOUNDARY): Set to 64.
21245 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
21248 * config/i386/i386-options.c (ix86_option_override_internal):
21249 Error out when -m96bit-long-double is used with 64bit targets.
21250 * config/i386/i386.md (*pushxf_rounded): Remove pattern.
21252 2021-04-23 Martin Liska <mliska@suse.cz>
21254 * lto-wrapper.c: Remove FIXME about usage of
21255 hardware_concurrency. The function is not on par with
21258 2021-04-23 Uroš Bizjak <ubizjak@gmail.com>
21261 * config/i386/sync.md (FILD_ATOMIC/FIST_ATOMIC FP load peephole2):
21262 Copy operand 3 to operand 4. Use sse_reg_operand
21263 as operand 3 predicate.
21264 (FILD_ATOMIC/FIST_ATOMIC FP load peephole2 with mem blockage): Ditto.
21265 (LDX_ATOMIC/STX_ATOMIC FP load peephole2): Ditto.
21266 (LDX_ATOMIC/LDX_ATOMIC FP load peephole2 with mem blockage): Ditto.
21267 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2):
21268 Copy operand 1 to operand 0.
21269 (FILD_ATOMIC/FIST_ATOMIC FP store peephole2 with mem blockage): Ditto.
21270 (LDX_ATOMIC/STX_ATOMIC FP store peephole2): Ditto.
21271 (LDX_ATOMIC/LDX_ATOMIC FP store peephole2 with mem blockage): Ditto.
21273 2021-04-23 Alex Coplan <alex.coplan@arm.com>
21275 PR rtl-optimization/100230
21276 * early-remat.c (early_remat::sort_candidates): Use delete[]
21277 instead of delete for array allocated with new[].
21279 2021-04-23 Richard Biener <rguenther@suse.de>
21281 * genmatch.c (lower_cond): Remove VEC_COND_EXPR special-casing.
21282 (capture_info::capture_info): Likewise.
21283 (capture_info::walk_match): Likewise.
21284 (expr::gen_transform): Likewise.
21285 (dt_simplify::gen_1): Likewise.
21286 * gimple-match-head.c (maybe_resimplify_conditional_op):
21287 Remove VEC_COND_EXPR special-casing.
21288 (gimple_simplify): Likewise.
21289 * gimple.c (gimple_could_trap_p_1): Adjust.
21290 * tree-ssa-pre.c (compute_avail): Allow VEC_COND_EXPR
21291 to participate in PRE.
21293 2021-04-23 Richard Biener <rguenther@suse.de>
21295 * cfganal.c (connect_infinite_loops_to_exit): First call
21296 add_noreturn_fake_exit_edges.
21297 * ipa-sra.c (process_scan_results): Do not call the now redundant
21298 add_noreturn_fake_exit_edges.
21299 * predict.c (tree_estimate_probability): Likewise.
21300 (rebuild_frequencies): Likewise.
21301 * store-motion.c (one_store_motion_pass): Likewise.
21303 2021-04-23 Richard Biener <rguenther@suse.de>
21305 PR tree-optimization/100222
21306 * predict.c (pass_profile::execute): Remove redundant call to
21307 mark_irreducible_loops.
21308 (report_predictor_hitrates): Likewise.
21310 2021-04-23 Richard Biener <rguenther@suse.de>
21312 * tree-ssa-loop-ivopts.c (rewrite_use_nonlinear_expr): Avoid
21313 valid_gimple_rhs_p by instead gimplifying to one.
21315 2021-04-23 Richard Biener <rguenther@suse.de>
21317 PR tree-optimization/99971
21318 * tree-vect-data-refs.c (vect_slp_analyze_node_dependences):
21319 Always use TBAA for loads.
21321 2021-04-23 liuhongt <hongtao.liu@intel.com>
21324 * config/i386/i386-options.c (ix86_option_override_internal):
21325 Clear MASK_AVX256_SPLIT_UNALIGNED_LOAD/STORE in x_target_flags
21326 when X86_TUNE_AVX256_UNALIGNED_LOAD/STORE_OPTIMAL is enabled
21327 by target attribute.
21329 2021-04-23 David Edelsohn <dje.gcc@gmail.com>
21331 * config/rs6000/aix71.h (PREFERRED_DEBUGGING_TYPE): Change to
21333 * config/rs6000/aix72.h (PREFERRED_DEBUGGING_TYPE): Same.
21335 2021-04-22 David Edelsohn <dje.gcc@gmail.com>
21337 * config.gcc (powerpc-ibm-aix6.*): Remove.
21338 * config/rs6000/aix61.h: Delete.
21340 2021-04-22 Martin Liska <mliska@suse.cz>
21342 PR testsuite/100159
21343 PR testsuite/100192
21344 * builtins.c (expand_builtin): Fix typos and missing comments.
21345 * dwarf2out.c (gen_subprogram_die): Likewise.
21346 (gen_struct_or_union_type_die): Likewise.
21348 2021-04-22 Uroš Bizjak <ubizjak@gmail.com>
21351 * config/i386/i386-expand.c (ix86_expand_convert_uns_sidf_sse):
21352 Remove the sign with FE_DOWNWARD, where x - x = -0.0.
21354 2021-04-21 Iain Sandoe <iain@sandoe.co.uk>
21356 * config/i386/darwin.h (TARGET_64BIT): Remove definition
21357 based on TARGET_ISA_64BIT.
21358 (TARGET_64BIT_P): Remove definition based on
21359 TARGET_ISA_64BIT_P().
21361 2021-04-21 Martin Liska <mliska@suse.cz>
21364 2021-04-21 Martin Liska <mliska@suse.cz>
21366 * lto-wrapper.c (cpuset_popcount): Remove.
21367 (init_num_threads): Remove and use hardware_concurrency.
21369 2021-04-21 Martin Liska <mliska@suse.cz>
21372 * main.c (main): Call toplev::finalize in CHECKING_P mode.
21373 * ipa-modref.c (ipa_modref_c_finalize): summaries are NULL
21374 when incremental LTO linking happens.
21376 2021-04-21 Martin Liska <mliska@suse.cz>
21378 * lto-wrapper.c (run_gcc): When -flto=jobserver is used, but the
21379 makeserver cannot be detected, then use -flto=N fallback.
21381 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
21383 * acinclude.m4 (gcc_AC_INITFINI_ARRAY): When cross-compiling,
21384 default to yes for aarch64-linux-gnu.
21385 * configure: Regenerate.
21387 2021-04-21 Martin Liska <mliska@suse.cz>
21389 * lto-wrapper.c (cpuset_popcount): Remove.
21390 (init_num_threads): Remove and use hardware_concurrency.
21392 2021-04-21 Martin Liska <mliska@suse.cz>
21394 * config/i386/i386.c: Remove superfluous || TARGET_MACHO
21395 which remains to be '(... || 0)' and clang complains about it.
21396 * dwarf2out.c (AT_vms_delta): Declare conditionally.
21397 (add_AT_vms_delta): Likewise.
21398 * tree.c (fld_simplified_type): Use rather more common pattern
21399 for disabling of something (#if 0).
21400 (get_tree_code_name): Likewise.
21401 (verify_type_variant): Likewise.
21403 2021-04-21 Martin Liska <mliska@suse.cz>
21405 * config/i386/i386-expand.c (decide_alignment): Use newly named
21406 macro TARGET_CPU_P.
21407 * config/i386/i386.c (ix86_decompose_address): Likewise.
21408 (ix86_address_cost): Likewise.
21409 (ix86_lea_outperforms): Likewise.
21410 (ix86_avoid_lea_for_addr): Likewise.
21411 (ix86_add_stmt_cost): Likewise.
21412 * config/i386/i386.h (TARGET_*): Remove.
21413 (TARGET_CPU_P): New macro.
21414 * config/i386/i386.md: Use newly named macro TARGET_CPU_P.
21415 * config/i386/x86-tune-sched-atom.c (do_reorder_for_imul): Likewise.
21416 (swap_top_of_ready_list): Likewise.
21417 (ix86_atom_sched_reorder): Likewise.
21418 * config/i386/x86-tune-sched-bd.c (ix86_bd_has_dispatch): Likewise.
21419 * config/i386/x86-tune-sched.c (ix86_adjust_cost): Likewise.
21421 2021-04-21 Martin Liska <mliska@suse.cz>
21423 * config/i386/i386-options.c (TARGET_EXPLICIT_NO_SAHF_P):
21425 (SET_TARGET_NO_SAHF): Likewise.
21426 (TARGET_EXPLICIT_PREFETCH_SSE_P): Likewise.
21427 (SET_TARGET_PREFETCH_SSE): Likewise.
21428 (TARGET_EXPLICIT_NO_TUNE_P): Likewise.
21429 (SET_TARGET_NO_TUNE): Likewise.
21430 (TARGET_EXPLICIT_NO_80387_P): Likewise.
21431 (SET_TARGET_NO_80387): Likewise.
21433 * config/i386/i386.h (TARGET_*): Remove.
21434 * opth-gen.awk: Generate new used macros.
21436 2021-04-21 Martin Liska <mliska@suse.cz>
21438 * config/i386/i386.h (PTA_*): Remove.
21439 (enum pta_flag): New.
21440 (DEF_PTA): Generate PTA_* values from i386-isa.def.
21441 * config/i386/i386-isa.def: New file.
21443 2021-04-21 Alex Coplan <alex.coplan@arm.com>
21446 * config/aarch64/aarch64-bti-insert.c (aarch64_bti_j_insn_p): New.
21447 (rest_of_insert_bti): Avoid inserting duplicate bti j insns for
21448 jump table targets.
21450 2021-04-21 H.J. Lu <hjl.tools@gmail.com>
21452 * config.gcc: Install mwaitintrin.h for i[34567]86-*-* and
21453 x86_64-*-* targets.
21454 * common/config/i386/i386-common.c (OPTION_MASK_ISA2_MWAIT_SET):
21456 (OPTION_MASK_ISA2_MWAIT_UNSET): Likewise.
21457 (ix86_handle_option): Handle -mmwait.
21458 * config/i386/i386-builtins.c (ix86_init_mmx_sse_builtins):
21459 Replace OPTION_MASK_ISA_SSE3 with OPTION_MASK_ISA2_MWAIT on
21460 __builtin_ia32_monitor and __builtin_ia32_mwait.
21461 * config/i386/i386-options.c (isa2_opts): Add -mmwait.
21462 (ix86_valid_target_attribute_inner_p): Likewise.
21463 (ix86_option_override_internal): Enable mwait/monitor
21464 instructions for -msse3.
21465 * config/i386/i386.h (TARGET_MWAIT): New.
21466 (TARGET_MWAIT_P): Likewise.
21467 * config/i386/i386.opt: Add -mmwait.
21468 * config/i386/mwaitintrin.h: New file.
21469 * config/i386/pmmintrin.h: Include <mwaitintrin.h>.
21470 * config/i386/sse.md (sse3_mwait): Replace TARGET_SSE3 with
21472 (@sse3_monitor_<mode>): Likewise.
21473 * config/i386/x86gprintrin.h: Include <mwaitintrin.h>.
21474 * doc/extend.texi: Document mwait target attribute.
21475 * doc/invoke.texi: Document -mmwait.
21477 2021-04-21 Martin Liska <mliska@suse.cz>
21479 * config/i386/i386-options.c (DEF_ENUM): Remove it.
21480 * config/i386/i386-opts.h (DEF_ENUM): Likewise.
21481 * config/i386/stringop.def (DEF_ENUM): Likewise.
21483 2021-04-21 Martin Liska <mliska@suse.cz>
21485 * tree-cfg.c (gimple_verify_flow_info): Use qD instead
21486 of print_generic_expr.
21488 2021-04-21 Jakub Jelinek <jakub@redhat.com>
21490 PR rtl-optimization/100148
21491 * cprop.c (constprop_register): Use next_nondebug_insn instead of
21494 2021-04-21 Martin Liska <mliska@suse.cz>
21497 * cgraphunit.c (cgraph_node::analyze): Remove duplicate
21498 free_dominance_info calls.
21500 2021-04-21 Richard Biener <rguenther@suse.de>
21502 * gimple-fold.c (maybe_fold_reference): Remove is_lhs
21503 parameter (and assume it to be false).
21504 (fold_gimple_assign): Adjust, remove all callers of
21505 maybe_fold_reference calling it with is_lhs true.
21506 (gimple_fold_call): Likewise.
21507 (fold_stmt_1): Likewise.
21509 2021-04-21 Richard Biener <rguenther@suse.de>
21511 * fold-const.c (pedantic_non_lvalue_loc): Remove.
21512 (fold_binary_loc): Adjust.
21513 (fold_ternary_loc): Likewise.
21515 2021-04-21 Richard Sandiford <richard.sandiford@arm.com>
21517 PR middle-end/100130
21518 * varasm.c (get_block_for_decl): Make sure that any use of the
21519 retain attribute matches the section's retain flag.
21520 (switch_to_section): Check for retain mismatches even when
21521 changing sections, but do not warn if the given decl is the
21522 section's named.decl.
21523 (output_object_block): Pass the first decl in the block (if any)
21524 to switch_to_section.
21526 2021-04-20 H.J. Lu <hjl.tools@gmail.com>
21528 * config/i386/i386-c.c (ix86_target_macros_internal): Define
21529 __CRC32__ for -mcrc32.
21530 * config/i386/i386-options.c (ix86_option_override_internal):
21531 Enable crc32 instruction for -msse4.2.
21532 * config/i386/i386.md (sse4_2_crc32<mode>): Remove TARGET_SSE4_2
21534 (sse4_2_crc32di): Likewise.
21535 * config/i386/ia32intrin.h: Use crc32 target option for CRC32
21538 2021-04-20 Segher Boessenkool <segher@kernel.crashing.org>
21541 * config/rs6000/rs6000.c (rs6000_machine_from_flags): Do not consider
21544 2021-04-20 Martin Liska <mliska@suse.cz>
21546 * doc/invoke.texi: Fix typo.
21547 * params.opt: Likewise.
21549 2021-04-20 Martin Liska <mliska@suse.cz>
21551 * doc/invoke.texi: Document new param.
21553 2021-04-19 Andrew MacLeod <amacleod@redhat.com>
21555 PR tree-optimization/100081
21556 * gimple-range-cache.h (ranger_cache): Inherit from gori_compute
21557 rather than gori_compute_cache.
21558 * gimple-range-gori.cc (is_gimple_logical_p): Move to top of file.
21559 (range_def_chain::m_logical_depth): New member.
21560 (range_def_chain::range_def_chain): Initialize m_logical_depth.
21561 (range_def_chain::get_def_chain): Don't build defchains through more
21562 than LOGICAL_LIMIT logical expressions.
21563 * params.opt (param_ranger_logical_depth): New.
21565 2021-04-19 Richard Earnshaw <rearnsha@arm.com>
21568 * config/arm/arm.c (arm_configure_build_target): Do not strip
21569 extended FPU/SIMD feature bits from the target ISA when -mfpu
21570 is specified (partial revert of r11-8168).
21572 2021-04-19 Thomas Schwinge <thomas@codesourcery.com>
21574 * params.opt (-param=openacc-kernels=): Add.
21575 * omp-oacc-kernels-decompose.cc
21576 (pass_omp_oacc_kernels_decompose::gate): Use it.
21577 * doc/invoke.texi (-fopenacc-kernels=@var{mode}): Move...
21578 (--param): ... here, 'openacc-kernels'.
21580 2021-04-19 Martin Liska <mliska@suse.cz>
21583 * gengtype.c (finish_root_table): Align function arguments
21584 in between declaration and definition.
21586 2021-04-19 Eric Botcazou <ebotcazou@adacore.com>
21588 * config/i386/winnt.c (i386_pe_seh_cold_init): Properly deal with
21589 frames larger than the SEH maximum frame size.
21591 2021-04-18 Segher Boessenkool <segher@kernel.crashing.org>
21593 PR rtl-optimization/99927
21594 * combine.c (distribute_notes) [REG_UNUSED]: If the register already
21595 is dead, just drop it.
21597 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
21600 * config/i386/winnt-d.c (TARGET_D_TEMPLATES_ALWAYS_COMDAT): Define.
21601 * doc/tm.texi: Regenerate.
21602 * doc/tm.texi.in (D language and ABI): Add @hook for
21603 TARGET_D_TEMPLATES_ALWAYS_COMDAT.
21605 2021-04-17 Iain Buclaw <ibuclaw@gdcproject.org>
21607 * config/darwin-d.c (darwin_d_handle_target_object_format): New
21609 (darwin_d_register_target_info): New function.
21610 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21611 * config/dragonfly-d.c (dragonfly_d_handle_target_object_format): New
21613 (dragonfly_d_register_target_info): New function.
21614 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21615 * config/freebsd-d.c (freebsd_d_handle_target_object_format): New
21617 (freebsd_d_register_target_info): New function.
21618 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21619 * config/glibc-d.c (glibc_d_handle_target_object_format): New
21621 (glibc_d_register_target_info): New function.
21622 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21623 * config/i386/i386-d.c (ix86_d_handle_target_object_format): New
21625 (ix86_d_register_target_info): Add ix86_d_handle_target_object_format
21626 as handler for objectFormat key.
21627 * config/i386/winnt-d.c (winnt_d_handle_target_object_format): New
21629 (winnt_d_register_target_info): New function.
21630 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21631 * config/netbsd-d.c (netbsd_d_handle_target_object_format): New
21633 (netbsd_d_register_target_info): New function.
21634 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21635 * config/openbsd-d.c (openbsd_d_handle_target_object_format): New
21637 (openbsd_d_register_target_info): New function.
21638 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21639 * config/pa/pa-d.c (pa_d_handle_target_object_format): New function.
21640 (pa_d_register_target_info): Add pa_d_handle_target_object_format as
21641 handler for objectFormat key.
21642 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_object_format): New
21644 (rs6000_d_register_target_info): Add
21645 rs6000_d_handle_target_object_format as handler for objectFormat key.
21646 * config/sol2-d.c (solaris_d_handle_target_object_format): New
21648 (solaris_d_register_target_info): New function.
21649 (TARGET_D_REGISTER_OS_TARGET_INFO): Define.
21651 2021-04-16 Jakub Jelinek <jakub@redhat.com>
21654 * config/aarch64/aarch64.c (aarch64_function_arg_alignment): Change
21655 abi_break argument from bool * to unsigned *, store there the pre-GCC 9
21657 (aarch64_layout_arg, aarch64_gimplify_va_arg_expr): Adjust callers.
21658 (aarch64_function_arg_regno_p): Likewise. Only emit -Wpsabi note if
21659 the old and new alignment after applying MIN/MAX to it is different.
21661 2021-04-16 Tamar Christina <tamar.christina@arm.com>
21664 * config/aarch64/aarch64-sve.md (@aarch64_sve_trn1_conv<mode>): New.
21665 * config/aarch64/aarch64.c (aarch64_expand_sve_const_pred_trn): Use new
21667 * config/aarch64/iterators.md (UNSPEC_TRN1_CONV): New.
21669 2021-04-16 Bill Schmidt <wschmidt@linux.ibm.com>
21671 * doc/extend.texi (PowerPC AltiVec/VSX Built-in Functions): Revise
21672 this section and its subsections.
21674 2021-04-16 Jakub Jelinek <jakub@redhat.com>
21677 * config/aarch64/aarch64.md (*neg_asr_si2_extr, *extrsi5_insn_di): New
21678 define_insn patterns.
21680 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
21682 PR rtl-optimization/98689
21683 * reg-notes.def (UNTYPED_CALL): New note.
21684 * combine.c (distribute_notes): Handle it.
21685 * emit-rtl.c (try_split): Likewise.
21686 * rtlanal.c (rtx_properties::try_to_add_insn): Likewise. Assume
21687 that calls with the note implicitly set all return value registers.
21688 * builtins.c (expand_builtin_apply): Add a REG_UNTYPED_CALL
21691 2021-04-16 Richard Sandiford <richard.sandiford@arm.com>
21693 PR rtl-optimization/99596
21694 * rtlanal.c (rtx_properties::try_to_add_insn): Don't add global
21695 register accesses for const calls. Assume that pure functions
21696 can only read from global registers. Ignore cases in which
21697 the stack pointer has been marked global.
21699 2021-04-16 Jakub Jelinek <jakub@redhat.com>
21702 * tree-vect-loop.c (vect_transform_loop): Don't remove just
21703 dead scalar .MASK_LOAD calls, but also dead .COND_* calls - replace
21704 them by their last argument.
21706 2021-04-15 Martin Liska <mliska@suse.cz>
21708 * doc/invoke.texi: Other params don't use it, remove it.
21710 2021-04-15 Richard Biener <rguenther@suse.de>
21712 * gimple-builder.h: Add deprecation note.
21714 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
21717 * attribs.h (restrict_type_identity_attributes_to): Declare.
21718 * attribs.c (restrict_type_identity_attributes_to): New function.
21720 2021-04-15 Richard Sandiford <richard.sandiford@arm.com>
21723 * attribs.h (affects_type_identity_attributes): Declare.
21724 * attribs.c (remove_attributes_matching): New function.
21725 (affects_type_identity_attributes): Likewise.
21727 2021-04-15 Jakub Jelinek <jakub@redhat.com>
21730 * config/aarch64/aarch64.md (*<LOGICAL:optab>_<SHIFT:optab><mode>3):
21731 Add combine splitters for *<LOGICAL:optab>_ashl<mode>3 with
21732 ZERO_EXTEND, SIGN_EXTEND or AND.
21734 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
21736 PR rtl-optimization/99929
21737 * rtl.h (same_vector_encodings_p): New function.
21738 * cse.c (exp_equiv_p): Check that CONST_VECTORs have the same encoding.
21739 * cselib.c (rtx_equal_for_cselib_1): Likewise.
21740 * jump.c (rtx_renumbered_equal_p): Likewise.
21741 * lra-constraints.c (operands_match_p): Likewise.
21742 * reload.c (operands_match_p): Likewise.
21743 * rtl.c (rtx_equal_p_cb, rtx_equal_p): Likewise.
21745 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
21747 * print-rtl.c (rtx_writer::print_rtx_operand_codes_E_and_V): Print
21748 more information about variable-length CONST_VECTORs.
21750 2021-04-14 Vladimir N. Makarov <vmakarov@redhat.com>
21752 PR rtl-optimization/100066
21753 * lra-constraints.c (split_reg): Check paradoxical_subreg_p for
21754 ordered modes when choosing splitting mode for hard reg.
21756 2021-04-14 Richard Sandiford <richard.sandiford@arm.com>
21759 * config/aarch64/aarch64.c (aarch64_expand_sve_const_vector_sel):
21761 (aarch64_expand_sve_const_vector): Use it for nelts_per_pattern==2.
21763 2021-04-14 Andreas Krebbel <krebbel@linux.ibm.com>
21765 * config/s390/s390-builtins.def (O_M5, O_M12, ...): Add new macros
21766 for mask operand types.
21767 (s390_vec_permi_s64, s390_vec_permi_b64, s390_vec_permi_u64)
21768 (s390_vec_permi_dbl, s390_vpdi): Use the M5 type for the immediate
21770 (s390_vec_msum_u128, s390_vmslg): Use the M12 type for the
21772 * config/s390/s390.c (s390_const_operand_ok): Check the new
21773 operand types and generate a list of valid values.
21775 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
21777 * doc/tm.texi: Regenerate.
21778 * doc/tm.texi.in (D language and ABI): Add @hook for
21779 TARGET_D_REGISTER_OS_TARGET_INFO.
21781 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
21783 * config/aarch64/aarch64-d.c (aarch64_d_handle_target_float_abi): New
21785 (aarch64_d_register_target_info): New function.
21786 * config/aarch64/aarch64-protos.h (aarch64_d_register_target_info):
21788 * config/aarch64/aarch64.h (TARGET_D_REGISTER_CPU_TARGET_INFO):
21790 * config/arm/arm-d.c (arm_d_handle_target_float_abi): New function.
21791 (arm_d_register_target_info): New function.
21792 * config/arm/arm-protos.h (arm_d_register_target_info): Declare.
21793 * config/arm/arm.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21794 * config/i386/i386-d.c (ix86_d_handle_target_float_abi): New function.
21795 (ix86_d_register_target_info): New function.
21796 * config/i386/i386-protos.h (ix86_d_register_target_info): Declare.
21797 * config/i386/i386.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21798 * config/mips/mips-d.c (mips_d_handle_target_float_abi): New function.
21799 (mips_d_register_target_info): New function.
21800 * config/mips/mips-protos.h (mips_d_register_target_info): Declare.
21801 * config/mips/mips.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21802 * config/pa/pa-d.c (pa_d_handle_target_float_abi): New function.
21803 (pa_d_register_target_info): New function.
21804 * config/pa/pa-protos.h (pa_d_register_target_info): Declare.
21805 * config/pa/pa.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21806 * config/riscv/riscv-d.c (riscv_d_handle_target_float_abi): New
21808 (riscv_d_register_target_info): New function.
21809 * config/riscv/riscv-protos.h (riscv_d_register_target_info): Declare.
21810 * config/riscv/riscv.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21811 * config/rs6000/rs6000-d.c (rs6000_d_handle_target_float_abi): New
21813 (rs6000_d_register_target_info): New function.
21814 * config/rs6000/rs6000-protos.h (rs6000_d_register_target_info):
21816 * config/rs6000/rs6000.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21817 * config/s390/s390-d.c (s390_d_handle_target_float_abi): New function.
21818 (s390_d_register_target_info): New function.
21819 * config/s390/s390-protos.h (s390_d_register_target_info): Declare.
21820 * config/s390/s390.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21821 * config/sparc/sparc-d.c (sparc_d_handle_target_float_abi): New
21823 (sparc_d_register_target_info): New function.
21824 * config/sparc/sparc-protos.h (sparc_d_register_target_info): Declare.
21825 * config/sparc/sparc.h (TARGET_D_REGISTER_CPU_TARGET_INFO): Define.
21826 * doc/tm.texi: Regenerate.
21827 * doc/tm.texi.in (D language and ABI): Add @hook for
21828 TARGET_D_REGISTER_CPU_TARGET_INFO.
21830 2021-04-14 Iain Buclaw <ibuclaw@gdcproject.org>
21832 * config/i386/i386-d.c (ix86_d_has_stdcall_convention): New function.
21833 * config/i386/i386-protos.h (ix86_d_has_stdcall_convention): Declare.
21834 * config/i386/i386.h (TARGET_D_HAS_STDCALL_CONVENTION): Define.
21835 * doc/tm.texi: Regenerate.
21836 * doc/tm.texi.in (D language and ABI): Add @hook for
21837 TARGET_D_HAS_STDCALL_CONVENTION.
21839 2021-04-14 Richard Biener <rguenther@suse.de>
21841 * tree-cfg.c (verify_gimple_assign_ternary): Verify that
21842 VEC_COND_EXPRs have a gimple_val condition.
21843 * tree-ssa-propagate.c (valid_gimple_rhs_p): VEC_COND_EXPR
21844 can no longer have a GENERIC condition.
21846 2021-04-14 Richard Earnshaw <rearnsha@arm.com>
21849 * config/arm/arm.c (arm_configure_build_target): Strip isa_all_fpbits
21850 from the isa_delta when -mfpu has been used.
21851 (arm_options_perform_arch_sanity_checks): It's the architecture that
21852 lacks an FPU not the processor.
21854 2021-04-13 Richard Biener <rguenther@suse.de>
21856 PR tree-optimization/100053
21857 * tree-ssa-sccvn.c (vn_nary_op_get_predicated_value): Do
21858 not use optimistic dominance queries for backedges to validate
21860 (dominated_by_p_w_unex): Add parameter to ignore executable
21861 state on backedges.
21862 (rpo_elim::eliminate_avail): Adjust.
21864 2021-04-13 Jakub Jelinek <jakub@redhat.com>
21867 * config/aarch64/aarch64.md (*aarch64_bfxil<mode>_extr,
21868 *aarch64_bfxilsi_extrdi): New define_insn patterns.
21870 2021-04-13 Jakub Jelinek <jakub@redhat.com>
21873 * simplify-rtx.c (simplify_immed_subreg): For MODE_COMPOSITE_P
21874 outermode, return NULL if the result doesn't encode back to the
21875 original byte sequence.
21876 (simplify_gen_subreg): Don't create SUBREGs from constants to
21877 MODE_COMPOSITE_P outermode.
21879 2021-04-12 Jakub Jelinek <jakub@redhat.com>
21881 PR rtl-optimization/99905
21882 * combine.c (expand_compound_operation): If pos + len > modewidth,
21883 perform the right shift by pos in inner_mode and then convert to mode,
21884 instead of trying to simplify a shift of rtx with inner_mode by pos
21885 as if it was a shift in mode.
21887 2021-04-12 Jakub Jelinek <jakub@redhat.com>
21890 * combine.c (simplify_and_const_int_1): Don't optimize varop
21891 away if it has side-effects.
21893 2021-04-12 Martin Liska <mliska@suse.cz>
21895 * doc/extend.texi: Escape @smallexample content.
21897 2021-04-12 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
21899 * config/s390/s390.md ("*movdi_31", "*movdi_64"): Add
21900 alternative in order to load a DFP zero.
21902 2021-04-12 Martin Liska <mliska@suse.cz>
21904 * doc/extend.texi: Be more precise in documentation
21905 of symver attribute.
21907 2021-04-12 Martin Liska <mliska@suse.cz>
21910 * gimplify.c (gimplify_expr): Right now, we unpoison all
21911 variables before a goto <dest>. We should not do it if we are
21914 2021-04-12 Cui,Lili <lili.cui@intel.com>
21916 * common/config/i386/cpuinfo.h (get_intel_cpu): Handle
21918 * common/config/i386/i386-common.c (processor_names): Add
21920 (processor_alias_table): Add rocketlake.
21921 * common/config/i386/i386-cpuinfo.h (processor_subtypes): Add
21922 INTEL_COREI7_ROCKETLAKE.
21923 * config.gcc: Add -march=rocketlake.
21924 * config/i386/i386-c.c (ix86_target_macros_internal): Handle
21926 * config/i386/i386-options.c (m_ROCKETLAKE) : Define.
21927 (processor_cost_table): Add rocketlake cost.
21928 * config/i386/i386.h (ix86_size_cost) : Define
21930 (processor_type) : Add PROCESSOR_ROCKETLAKE.
21931 (PTA_ROCKETLAKE): Ditto.
21932 * doc/extend.texi: Add rocketlake.
21933 * doc/invoke.texi: Add rocketlake.
21935 2021-04-12 Cui,Lili <lili.cui@intel.com>
21937 * config/i386/i386.h (PTA_ALDERLAKE): Change alderlake ISA list.
21938 * config/i386/i386-options.c (m_CORE_AVX2): Add m_ALDERLAKE.
21939 * common/config/i386/cpuinfo.h (get_intel_cpu): Add AlderLake model.
21940 * doc/invoke.texi: Change alderlake ISA list.
21942 2021-04-11 Hafiz Abid Qadeer <abidh@codesourcery.com>
21944 PR middle-end/98088
21945 * omp-expand.c (expand_oacc_collapse_init): Update condition in
21948 2021-04-10 H.J. Lu <hjl.tools@gmail.com>
21951 * config/i386/serializeintrin.h (_serialize): Defined as macro.
21953 2021-04-10 Jakub Jelinek <jakub@redhat.com>
21956 * expr.c (expand_expr_addr_expr_1): Test is_global_var rather than
21957 just TREE_STATIC on COMPOUND_LITERAL_EXPR_DECLs.
21959 2021-04-10 Jakub Jelinek <jakub@redhat.com>
21961 PR middle-end/99989
21962 * gimple-ssa-warn-alloca.c
21963 (alloca_type_and_limit::alloca_type_and_limit): Initialize limit to
21964 0 with integer precision unconditionally.
21966 2021-04-10 Jakub Jelinek <jakub@redhat.com>
21968 PR rtl-optimization/98601
21969 * rtlanal.c (rtx_addr_can_trap_p_1): Allow in assert unknown size
21970 not just for BLKmode, but also for VOIDmode. For STRICT_ALIGNMENT
21971 unaligned_mems handle VOIDmode like BLKmode.
21973 2021-04-10 Jan Hubicka <hubicka@ucw.cz>
21976 * tree.c (free_lang_data_in_decl): Do not release body of
21977 declare_variant_alt.
21979 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
21981 * config/aarch64/aarch64.c (aarch64_option_restore): If the
21982 architecture was specified explicitly and the tuning wasn't,
21983 tune for the architecture rather than the configured default CPU.
21985 2021-04-09 Richard Sandiford <richard.sandiford@arm.com>
21987 * config/aarch64/aarch64.md (tlsdesc_small_sve_<mode>): Use X30
21988 as the temporary register.
21990 2021-04-09 Martin Liska <mliska@suse.cz>
21992 * doc/extend.texi: Move non-target attributes on the top level.
21994 2021-04-09 Martin Liska <mliska@suse.cz>
21996 * doc/invoke.texi: Document minimum and maximum value of the
21997 argument for both supported compression algorithms.
21999 2021-04-08 David Edelsohn <dje.gcc@gmail.com>
22001 * config/rs6000/rs6000.c (rs6000_xcoff_select_section): Select
22002 TLS BSS before TLS data.
22003 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .comm.
22005 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
22007 * doc/sourcebuild.texi (stdint_types_mbig_endian): Document.
22009 2021-04-08 Richard Sandiford <richard.sandiford@arm.com>
22011 * match.pd: Extend vec_cond folds to handle shifts.
22013 2021-04-08 Maciej W. Rozycki <macro@orcam.me.uk>
22015 * config/vax/vax.md: Fix comment for `*bit<mode>' pattern's
22018 2021-04-08 Alex Coplan <alex.coplan@arm.com>
22021 * config/arm/iterators.md (MVE_vecs): New.
22022 (V_elem): Also handle V2DF.
22023 * config/arm/mve.md (*mve_mov<mode>): Rename to ...
22024 (*mve_vdup<mode>): ... this. Remove second alternative since
22025 vec_duplicate of const_int is not canonical RTL, and we don't
22026 want to match symbol_refs.
22027 (*mve_vec_duplicate<mode>): Delete (pattern is redundant).
22029 2021-04-08 Xionghu Luo <luoxhu@linux.ibm.com>
22031 * fold-const.c (fold_single_bit_test): Fix typo.
22032 * print-rtl.c (print_rtx_insn_vec): Call print_rtl_single
22035 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
22037 PR tree-optimization/97513
22038 * tree-vect-slp.c (vect_add_slp_permutation): New function,
22040 (vectorizable_slp_permutation): ...here. Detect cases in which
22041 all VEC_PERM_EXPRs are guaranteed to have the same stepped
22042 permute vector and only generate one permute vector for that case.
22043 Extend that case to handle variable-length vectors.
22045 2021-04-07 Richard Sandiford <richard.sandiford@arm.com>
22047 PR tree-optimization/99873
22048 * tree-vect-slp.c (vect_slp_prefer_store_lanes_p): New function.
22049 (vect_build_slp_instance): Don't split store groups that could
22050 use IFN_STORE_LANES.
22052 2021-04-07 Jakub Jelinek <jakub@redhat.com>
22055 * varasm.c (output_constant_pool_contents): Don't strip name encoding
22056 from XSTR (desc->sym, 0) or from label before passing those to
22059 2021-04-07 Richard Biener <rguenther@suse.de>
22061 PR tree-optimization/99954
22062 * tree-loop-distribution.c: Include tree-affine.h.
22063 (generate_memcpy_builtin): Try using tree-affine to prove
22065 (loop_distribution::classify_builtin_ldst): Always classify
22068 2021-04-07 Richard Biener <rguenther@suse.de>
22070 PR tree-optimization/99947
22071 * tree-vect-loop.c (vectorizable_induction): Pre-allocate
22072 steps vector to avoid pushing elements from the reallocated
22075 2021-04-07 Richard Biener <rguenther@suse.de>
22077 * tree-ssa-sccvn.h (print_vn_reference_ops): Declare.
22078 * tree-ssa-pre.c (print_pre_expr): Factor out VN reference operand
22080 * tree-ssa-sccvn.c (print_vn_reference_ops): ... into this new
22082 (debug_vn_reference_ops): New.
22084 2021-04-07 Bin Cheng <bin.cheng@linux.alibaba.com>
22086 PR tree-optimization/98736
22087 * tree-loop-distribution.c
22088 * (loop_distribution::bb_top_order_init):
22089 Compute RPO with programing order preserved by calling function
22090 rev_post_order_and_mark_dfs_back_seme.
22092 2021-04-06 Vladimir N. Makarov <vmakarov@redhat.com>
22095 * lra-constraints.c (split_reg): Don't check paradoxical_subreg_p.
22096 * lra-lives.c (clear_sparseset_regnos, regnos_in_sparseset_p): New
22098 (process_bb_lives): Don't update biggest mode of hard reg for
22099 implicit in multi-register group. Use the new functions for
22100 updating dead_set and unused_set by register notes.
22102 2021-04-06 Xianmiao Qu <xianmiao_qu@c-sky.com>
22104 * config/csky/csky_pipeline_ck802.md : Use insn reservation name
22107 2021-04-06 H.J. Lu <hjl.tools@gmail.com>
22109 * config/i386/x86-tune-costs.h (skylake_memcpy): Updated.
22110 (skylake_memset): Likewise.
22111 (skylake_cost): Change CLEAR_RATIO to 17.
22112 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
22113 Replace m_CANNONLAKE, m_ICELAKE_CLIENT, m_ICELAKE_SERVER,
22114 m_TIGERLAKE and m_SAPPHIRERAPIDS with m_SKYLAKE and m_CORE_AVX512.
22116 2021-04-06 Richard Biener <rguenther@suse.de>
22118 PR tree-optimization/99880
22119 * tree-vect-loop.c (maybe_set_vectorized_backedge_value): Only
22120 set vectorized defs of relevant PHIs.
22122 2021-04-06 Richard Biener <rguenther@suse.de>
22124 PR tree-optimization/99924
22125 * tree-vect-slp.c (vect_bb_partition_graph_r): Do not mark
22126 nodes w/o scalar stmts as visited.
22128 2021-04-06 Alex Coplan <alex.coplan@arm.com>
22131 * config/arm/arm.c (arm_libcall_uses_aapcs_base): Also use base
22132 PCS for [su]fix_optab.
22134 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
22136 * config/darwin.c (machopic_legitimize_pic_address): Check
22137 that the current pic register is one of the hard reg set
22138 before setting liveness.
22140 2021-04-03 Iain Sandoe <iain@sandoe.co.uk>
22142 * config/darwin.c (machopic_legitimize_pic_address): Fix
22143 whitespace, remove unused code.
22145 2021-04-03 Jakub Jelinek <jakub@redhat.com>
22147 PR tree-optimization/99882
22148 * gimple-ssa-store-merging.c (bswap_view_convert): Handle val with
22151 2021-04-03 Jakub Jelinek <jakub@redhat.com>
22153 PR rtl-optimization/99863
22154 * dse.c (replace_read): Drop regs_live argument. Instead of
22155 regs_live, use store_insn->fixed_regs_live if non-NULL,
22156 otherwise punt if insns sequence clobbers or sets any hard
22159 2021-04-03 Jakub Jelinek <jakub@redhat.com>
22162 * targhooks.h (default_print_patchable_function_entry_1): Declare.
22163 * targhooks.c (default_print_patchable_function_entry_1): New function,
22164 copied from default_print_patchable_function_entry with an added flags
22166 (default_print_patchable_function_entry): Rewritten into a small
22167 wrapper around default_print_patchable_function_entry_1.
22168 * config/rs6000/rs6000.c (TARGET_ASM_PRINT_PATCHABLE_FUNCTION_ENTRY):
22170 (rs6000_print_patchable_function_entry): New function.
22172 2021-04-02 Eric Botcazou <ebotcazou@adacore.com>
22174 * doc/invoke.texi (fdelete-dead-exceptions): Minor tweak.
22176 2021-04-01 Jason Merrill <jason@redhat.com>
22179 * common.opt: Document v15 and v16.
22181 2021-04-01 Richard Biener <rguenther@suse.de>
22183 PR tree-optimization/99863
22184 * gimplify.c (gimplify_init_constructor): Recompute vector
22187 2021-04-01 Jakub Jelinek <jakub@redhat.com>
22189 * doc/extend.texi (symver attribute): Fix up syntax errors
22192 2021-04-01 Jakub Jelinek <jakub@redhat.com>
22194 PR tree-optimization/96573
22195 * gimple-ssa-store-merging.c (init_symbolic_number): Handle
22196 also pointer types.
22198 2021-04-01 Richard Biener <rguenther@suse.de>
22200 PR tree-optimization/99856
22201 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Promote
22202 precision to vector element precision.
22204 2021-04-01 Martin Jambor <mjambor@suse.cz>
22206 PR tree-optimization/97009
22207 * tree-sra.c (access_or_its_child_written): New function.
22208 (propagate_subaccesses_from_rhs): Use it instead of a simple grp_write
22211 2021-03-31 Jan Hubicka <hubicka@ucw.cz>
22214 * cif-code.def (USES_COMDAT_LOCAL): Make CIF_FINAL_NORMAL.
22216 2021-03-31 Pat Haugen <pthaugen@linux.ibm.com>
22219 * config/rs6000/altivec.md (xxspltiw_v4si, xxspltiw_v4sf_inst,
22220 xxspltidp_v2df_inst, xxsplti32dx_v4si_inst, xxsplti32dx_v4sf_inst,
22221 xxblend_<mode>, xxpermx_inst, xxeval): Mark prefixed.
22222 * config/rs6000/mma.md (mma_<vvi4i4i8>, mma_<avvi4i4i8>,
22223 mma_<vvi4i4i2>, mma_<avvi4i4i2>, mma_<vvi4i4>, mma_<avvi4i4>,
22224 mma_<pvi4i2>, mma_<apvi4i2>, mma_<vvi4i4i4>, mma_<avvi4i4i4>):
22226 * config/rs6000/rs6000.c (rs6000_final_prescan_insn): Adjust test.
22227 * config/rs6000/rs6000.md (define_attr "maybe_prefixed"): New.
22228 (define_attr "prefixed"): Update initializer.
22230 2021-03-31 Jakub Jelinek <jakub@redhat.com>
22233 * dwarf2out.c (debug_ranges_dwo_section): New variable.
22234 (DW_RANGES_IDX_SKELETON): Define.
22235 (struct dw_ranges): Add begin_entry and end_entry members.
22236 (DEBUG_DWO_RNGLISTS_SECTION): Define.
22237 (add_ranges_num): Adjust r initializer for addition of *_entry
22239 (add_ranges_by_labels): For -gsplit-dwarf and force_direct,
22240 set idx to DW_RANGES_IDX_SKELETON.
22241 (use_distinct_base_address_for_range): New function.
22242 (index_rnglists): Don't set r->idx if it is equal to
22243 DW_RANGES_IDX_SKELETON. Initialize r->begin_entry and
22244 r->end_entry for -gsplit-dwarf if those will be needed by
22246 (output_rnglists): Add DWO argument. If true, switch to
22247 debug_ranges_dwo_section rather than debug_ranges_section.
22248 Adjust l1/l2 label indexes. Only output the offset table when
22249 dwo is true and don't include in there the skeleton range
22250 entry if present. For -gsplit-dwarf, skip ranges that belong
22251 to the other rnglists section. Change return type from void
22252 to bool and return true if there are any range entries for
22253 the other section. For dwarf_split_debug_info use
22254 DW_RLE_startx_endx, DW_RLE_startx_length and DW_RLE_base_addressx
22255 entries instead of DW_RLE_start_end, DW_RLE_start_length and
22256 DW_RLE_base_address. Use use_distinct_base_address_for_range.
22257 (init_sections_and_labels): Initialize debug_ranges_dwo_section
22258 if -gsplit-dwarf and DWARF >= 5. Adjust ranges_section_label
22259 and range_base_label indexes.
22260 (dwarf2out_finish): Call index_rnglists earlier before finalizing
22261 .debug_addr. Never emit DW_AT_rnglists_base attribute. For
22262 -gsplit-dwarf and DWARF >= 5 call output_rnglists up to twice
22263 with different dwo arguments.
22264 (dwarf2out_c_finalize): Clear debug_ranges_dwo_section.
22266 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
22268 PR tree-optimization/98268
22269 * gimple-fold.c (maybe_canonicalize_mem_ref_addr): Call
22270 recompute_tree_invariant_for_addr_expr after successfully
22271 folding a TARGET_MEM_REF that occurs inside an ADDR_EXPR.
22273 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
22275 PR tree-optimization/99726
22276 * tree-data-ref.c (create_intersect_range_checks_index): Bail
22277 out if there is more than one access function SCEV for the loop
22280 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
22282 PR rtl-optimization/97141
22283 PR rtl-optimization/98726
22284 * emit-rtl.c (valid_for_const_vector_p): Return true for
22286 * rtx-vector-builder.h (rtx_vector_builder::step): Return a
22287 poly_wide_int instead of a wide_int.
22288 (rtx_vector_builder::apply_set): Take a poly_wide_int instead
22290 * rtx-vector-builder.c (rtx_vector_builder::apply_set): Likewise.
22291 * config/aarch64/aarch64.c (aarch64_legitimate_constant_p): Return
22292 false for CONST_VECTORs that cannot be forced to memory.
22293 * config/aarch64/aarch64-simd.md (mov<mode>): If a CONST_VECTOR
22294 is too complex to force to memory, build it up from individual
22297 2021-03-31 Jan Hubicka <jh@suse.cz>
22300 * cgraph.c (cgraph_node::release_body): Fix overactive check.
22302 2021-03-31 Christophe Lyon <christophe.lyon@linaro.org>
22305 * config/arm/vec-common.md (mul<mode>3): Disable on iwMMXT, expect
22308 2021-03-31 H.J. Lu <hjl.tools@gmail.com>
22310 * config/i386/i386-expand.c (expand_set_or_cpymem_via_rep):
22311 For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, don't convert QImode
22313 (decide_alg): For TARGET_PREFER_KNOWN_REP_MOVSB_STOSB, use
22314 "rep movsb/stosb" only for known sizes.
22315 * config/i386/i386-options.c (processor_cost_table): Use Ice
22316 Lake cost for Cannon Lake, Ice Lake, Tiger Lake, Sapphire
22317 Rapids and Alder Lake.
22318 * config/i386/i386.h (TARGET_PREFER_KNOWN_REP_MOVSB_STOSB): New.
22319 * config/i386/x86-tune-costs.h (icelake_memcpy): New.
22320 (icelake_memset): Likewise.
22321 (icelake_cost): Likewise.
22322 * config/i386/x86-tune.def (X86_TUNE_PREFER_KNOWN_REP_MOVSB_STOSB):
22325 2021-03-31 Richard Sandiford <richard.sandiford@arm.com>
22328 * config/aarch64/aarch64.c
22329 (aarch64_vectorize_preferred_vector_alignment): Query the size
22330 of the provided SVE vector; do not assume that all SVE vectors
22331 have the same size.
22333 2021-03-31 Jan Hubicka <jh@suse.cz>
22336 * cgraph.c (cgraph_node::release_body): Remove all callers and
22338 * cgraphclones.c (cgraph_node::materialize_clone): Do not do it here.
22339 * cgraphunit.c (cgraph_node::expand): And here.
22341 2021-03-31 Martin Liska <mliska@suse.cz>
22343 * ipa-modref.c (analyze_ssa_name_flags): Fix coding style
22344 and one negated condition.
22346 2021-03-31 Jakub Jelinek <jakub@redhat.com>
22347 Richard Sandiford <richard.sandiford@arm.com>
22350 * config/aarch64/aarch64.md (*add<mode>3_poly_1): Swap Uai and Uav
22351 constraints on operands[2] and similarly 0 and rk constraints
22352 on operands[1] corresponding to that.
22354 2021-03-31 Jakub Jelinek <jakub@redhat.com>
22357 * configure.ac (HAVE_LD_BROKEN_PE_DWARF5): New AC_DEFINE if PECOFF
22358 linker doesn't support DWARF sections new in DWARF5.
22359 * config/i386/i386-options.c (ix86_option_override_internal): Default
22360 to dwarf_version 4 if HAVE_LD_BROKEN_PE_DWARF5 for TARGET_PECOFF
22362 * config.in: Regenerated.
22363 * configure: Regenerated.
22365 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
22368 * config/aarch64/aarch64.c (aarch64_analyze_loop_vinfo): Check for
22369 available issue_info before using it.
22371 2021-03-30 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
22374 * config/aarch64/aarch64.md (sub<mode>3_compare1_imm): Do not allow zero
22377 2021-03-30 Xionghu Luo <luoxhu@linux.ibm.com>
22380 * config/rs6000/altivec.md (altivec_lvsl_reg): Change to ...
22381 (altivec_lvsl_reg_<mode>): ... this.
22382 (altivec_lvsr_reg): Change to ...
22383 (altivec_lvsr_reg_<mode>): ... this.
22384 * config/rs6000/predicates.md (vec_set_index_operand): New.
22385 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
22386 Enable 32bit variable vec_insert for all TARGET_VSX.
22387 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
22388 Enable 32bit variable vec_insert for p9 and above.
22389 (rs6000_expand_vector_set_var_p8): Rename to ...
22390 (rs6000_expand_vector_set_var_p7): ... this.
22391 (rs6000_expand_vector_set): Use TARGET_VSX and adjust assert
22393 * config/rs6000/vector.md (vec_set<mode>): Use vec_set_index_operand.
22394 * config/rs6000/vsx.md (xl_len_r): Use gen_altivec_lvsl_reg_di and
22395 gen_altivec_lvsr_reg_di.
22397 2021-03-30 H.J. Lu <hjl.tools@gmail.com>
22400 * config/i386/ia32intrin.h (__rdtsc): Defined as macro.
22401 (__rdtscp): Likewise.
22403 2021-03-30 Tamar Christina <tamar.christina@arm.com>
22405 PR tree-optimization/99825
22406 * tree-vect-slp-patterns.c (vect_check_evenodd_blend):
22407 Reject non-mult 2 lanes.
22409 2021-03-30 Richard Earnshaw <rearnsha@arm.com>
22412 * config/arm/arm.c (arm_file_start): Fix emission of
22413 Tag_ABI_VFP_args attribute.
22415 2021-03-30 Richard Biener <rguenther@suse.de>
22417 PR tree-optimization/99824
22418 * stor-layout.c (set_min_and_max_values_for_integral_type):
22419 Assert the precision is within the bounds of
22420 WIDE_INT_MAX_PRECISION.
22421 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Use
22422 the outermost component ref only to lower the access size
22423 and initialize that from the access type.
22425 2021-03-30 Richard Sandiford <richard.sandiford@arm.com>
22428 * config/aarch64/aarch64.md (mov<mode>): Pass multi-instruction
22429 CONST_INTs to aarch64_expand_mov_immediate when called after RA.
22431 2021-03-30 Mihailo Stojanovic <mihailo.stojanovic@typhoon-hil.com>
22433 * config/aarch64/aarch64.md
22434 (<optab>_trunc<fcvt_target><GPI:mode>2): Set the "arch"
22435 attribute to disambiguate between SIMD and FP variants of the
22438 2021-03-29 Jan Hubicka <hubicka@ucw.cz>
22440 * ipa-modref.c (merge_call_lhs_flags): Correct handling of deref.
22441 (analyze_ssa_name_flags): Fix typo in comment.
22443 2021-03-29 Alex Coplan <alex.coplan@arm.com>
22446 * config/aarch64/aarch64-sve-builtins.cc
22447 (function_builder::add_function): Add placeholder_p argument, use
22448 placeholder decls if this is set.
22449 (function_builder::add_unique_function): Instead of conditionally adding
22450 direct overloads, unconditionally add either a direct overload or a
22452 (function_builder::add_overloaded_function): Set placeholder_p if we're
22453 using C++ overloads. Use the obstack for string storage instead
22454 of relying on the tree nodes.
22455 (function_builder::add_overloaded_functions): Don't return early for
22456 m_direct_overloads: we need to add placeholders.
22457 * config/aarch64/aarch64-sve-builtins.h
22458 (function_builder::add_function): Add placeholder_p argument.
22460 2021-03-29 Richard Biener <rguenther@suse.de>
22462 PR tree-optimization/99807
22463 * tree-vect-slp.c (vect_slp_analyze_node_operations_1): Move
22464 assert below VEC_PERM handling.
22466 2021-03-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
22469 * config/aarch64/aarch64-simd.md (move_lo_quad_internal_<mode>): Use
22470 aarch64_simd_or_scalar_imm_zero to match zeroes. Remove pattern
22471 matching const_int 0.
22472 (move_lo_quad_internal_be_<mode>): Likewise.
22473 (move_lo_quad_<mode>): Update for the above.
22474 * config/aarch64/iterators.md (VQ_2E): Delete.
22476 2021-03-29 Jakub Jelinek <jakub@redhat.com>
22478 PR tree-optimization/99777
22479 * fold-const.c (extract_muldiv_1): For conversions, punt on casts from
22480 types other than scalar integral types.
22482 2021-03-28 David Edelsohn <dje.gcc@gmail.com>
22484 * config/rs6000/rs6000.c (rs6000_output_dwarf_dtprel): Do not add
22485 XCOFF TLS reloc decorations.
22487 2021-03-28 Gerald Pfeifer <gerald@pfeifer.com>
22489 * doc/analyzer.texi (Analyzer Internals): Update link to
22490 "A Memory Model for Static Analysis of C Programs".
22492 2021-03-26 David Edelsohn <dje.gcc@gmail.com>
22494 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Call function.
22495 * config/rs6000/rs6000-protos.h (rs6000_special_adjust_field_align):
22497 * config/rs6000/rs6000.c (rs6000_special_adjust_field_align): New.
22498 (rs6000_special_round_type_align): Recursively check innermost first
22501 2021-03-26 Jakub Jelinek <jakub@redhat.com>
22504 * dwarf2out.h (struct dw_fde_node): Add rule18 member.
22505 * dwarf2cfi.c (dwarf2out_frame_debug_expr): When handling (set hfp sp)
22506 assignment with drap_reg active, queue reg save for hfp with offset 0
22507 and flush queued reg saves. When handling a push with rule18,
22508 defer queueing reg save for hfp and just assert the offset is 0.
22509 (scan_trace): Assert that fde->rule18 is false.
22511 2021-03-26 Vladimir Makarov <vmakarov@redhat.com>
22514 * ira-costs.c (record_reg_classes): Put case with
22515 CT_RELAXED_MEMORY adjacent to one with CT_MEMORY.
22516 * ira.c (ira_setup_alts): Ditto.
22517 * lra-constraints.c (process_alt_operands): Ditto.
22518 * recog.c (asm_operand_ok): Ditto.
22519 * reload.c (find_reloads): Ditto.
22521 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22523 * config/aarch64/aarch64-protos.h
22524 (cpu_addrcost_table::post_modify_ld3_st3): New member variable.
22525 (cpu_addrcost_table::post_modify_ld4_st4): Likewise.
22526 * config/aarch64/aarch64.c (generic_addrcost_table): Update
22527 accordingly, using the same costs as for post_modify.
22528 (exynosm1_addrcost_table, xgene1_addrcost_table): Likewise.
22529 (thunderx2t99_addrcost_table, thunderx3t110_addrcost_table):
22530 (tsv110_addrcost_table, qdf24xx_addrcost_table): Likewise.
22531 (a64fx_addrcost_table): Likewise.
22532 (neoversev1_addrcost_table): New.
22533 (neoversev1_tunings): Use neoversev1_addrcost_table.
22534 (aarch64_address_cost): Use the new post_modify costs for CImode
22537 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22539 * config/aarch64/aarch64.opt
22540 (-param=aarch64-loop-vect-issue-rate-niters=): New parameter.
22541 * doc/invoke.texi: Document it.
22542 * config/aarch64/aarch64-protos.h (aarch64_base_vec_issue_info)
22543 (aarch64_scalar_vec_issue_info, aarch64_simd_vec_issue_info)
22544 (aarch64_advsimd_vec_issue_info, aarch64_sve_vec_issue_info)
22545 (aarch64_vec_issue_info): New structures.
22546 (cpu_vector_cost): Write comments above the variables rather
22548 (cpu_vector_cost::issue_info): New member variable.
22549 * config/aarch64/aarch64.c: Include gimple-pretty-print.h
22550 and tree-ssa-loop-niter.h.
22551 (generic_vector_cost, a64fx_vector_cost, qdf24xx_vector_cost)
22552 (thunderx_vector_cost, tsv110_vector_cost, cortexa57_vector_cost)
22553 (exynosm1_vector_cost, xgene1_vector_cost, thunderx2t99_vector_cost)
22554 (thunderx3t110_vector_cost): Initialize issue_info to null.
22555 (neoversev1_scalar_issue_info, neoversev1_advsimd_issue_info)
22556 (neoversev1_sve_issue_info, neoversev1_vec_issue_info): New structures.
22557 (neoversev1_vector_cost): Use them.
22558 (aarch64_vec_op_count, aarch64_sve_op_count): New structures.
22559 (aarch64_vector_costs::saw_sve_only_op): New member variable.
22560 (aarch64_vector_costs::num_vector_iterations): Likewise.
22561 (aarch64_vector_costs::scalar_ops): Likewise.
22562 (aarch64_vector_costs::advsimd_ops): Likewise.
22563 (aarch64_vector_costs::sve_ops): Likewise.
22564 (aarch64_vector_costs::seen_loads): Likewise.
22565 (aarch64_simd_vec_costs_for_flags): New function.
22566 (aarch64_analyze_loop_vinfo): Initialize num_vector_iterations.
22567 Count the number of predicate operations required by SVE WHILE
22569 (aarch64_comparison_type, aarch64_multiply_add_p): New functions.
22570 (aarch64_sve_only_stmt_p, aarch64_in_loop_reduction_latency): Likewise.
22571 (aarch64_count_ops): Likewise.
22572 (aarch64_add_stmt_cost): Record whether see an SVE operation
22573 that cannot currently be implementing using Advanced SIMD.
22574 Record issue information about the scalar, Advanced SIMD
22575 and (where relevant) SVE versions of a loop.
22576 (aarch64_vec_op_count::dump): New function.
22577 (aarch64_sve_op_count::dump): Likewise.
22578 (aarch64_estimate_min_cycles_per_iter): Likewise.
22579 (aarch64_adjust_body_cost): If issue information is available,
22580 try to compare the issue rates of the various loop implementations
22581 and increase or decrease the vector body cost accordingly.
22583 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22585 * config/aarch64/aarch64.c (aarch64_detect_vector_stmt_subtype):
22586 Assume a zero cost for induction phis.
22588 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22590 * config/aarch64/aarch64.c (aarch64_embedded_comparison_type): New
22592 (aarch64_adjust_stmt_cost): Add the costs of embedded scalar and
22593 vector comparisons.
22595 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22597 * config/aarch64/aarch64.c (aarch64_detect_scalar_stmt_subtype):
22599 (aarch64_add_stmt_cost): Call it.
22601 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22603 * config/aarch64/aarch64-tuning-flags.def (matched_vector_throughput):
22604 New tuning parameter.
22605 * config/aarch64/aarch64.c (neoversev1_tunings): Use it.
22606 (aarch64_estimated_sve_vq): New function.
22607 (aarch64_vector_costs::analyzed_vinfo): New member variable.
22608 (aarch64_vector_costs::is_loop): Likewise.
22609 (aarch64_vector_costs::unrolled_advsimd_niters): Likewise.
22610 (aarch64_vector_costs::unrolled_advsimd_stmts): Likewise.
22611 (aarch64_record_potential_advsimd_unrolling): New function.
22612 (aarch64_analyze_loop_vinfo, aarch64_analyze_bb_vinfo): Likewise.
22613 (aarch64_add_stmt_cost): Call aarch64_analyze_loop_vinfo or
22614 aarch64_analyze_bb_vinfo on the first use of a costs structure.
22615 Detect whether we're vectorizing a loop for SVE that might be
22616 completely unrolled if it used Advanced SIMD instead.
22617 (aarch64_adjust_body_cost_for_latency): New function.
22618 (aarch64_finish_cost): Call it.
22620 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22622 * config/aarch64/aarch64.c (aarch64_vector_costs): New structure.
22623 (aarch64_init_cost): New function.
22624 (aarch64_add_stmt_cost): Use aarch64_vector_costs instead of
22625 the default unsigned[3].
22626 (aarch64_finish_cost, aarch64_destroy_cost_data): New functions.
22627 (TARGET_VECTORIZE_INIT_COST): Override.
22628 (TARGET_VECTORIZE_FINISH_COST): Likewise.
22629 (TARGET_VECTORIZE_DESTROY_COST_DATA): Likewise.
22631 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22633 * config/aarch64/aarch64.c (neoversev1_advsimd_vector_cost)
22634 (neoversev1_sve_vector_cost): New cost structures.
22635 (neoversev1_vector_cost): Likewise.
22636 (neoversev1_tunings): Use them. Enable use_new_vector_costs.
22638 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22640 * config/aarch64/aarch64-protos.h
22641 (sve_vec_cost::scatter_store_elt_cost): New member variable.
22642 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
22643 accordingly, taking the cost from the cost of a scalar_store.
22644 (a64fx_sve_vector_cost): Likewise.
22645 (aarch64_detect_vector_stmt_subtype): Detect scatter stores.
22647 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22649 * config/aarch64/aarch64-protos.h
22650 (simd_vec_cost::store_elt_extra_cost): New member variable.
22651 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
22652 accordingly, using the vec_to_scalar cost for the new field.
22653 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
22654 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
22655 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
22656 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
22657 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
22658 (thunderx3t110_advsimd_vector_cost): Likewise.
22659 (aarch64_detect_vector_stmt_subtype): Detect single-element stores.
22661 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22663 * config/aarch64/aarch64-protos.h (simd_vec_cost::ld2_st2_permute_cost)
22664 (simd_vec_cost::ld3_st3_permute_cost): New member variables.
22665 (simd_vec_cost::ld4_st4_permute_cost): Likewise.
22666 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
22667 accordingly, using zero for the new costs.
22668 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
22669 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
22670 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
22671 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
22672 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
22673 (thunderx3t110_advsimd_vector_cost): Likewise.
22674 (aarch64_ld234_st234_vectors): New function.
22675 (aarch64_adjust_stmt_cost): Likewise.
22676 (aarch64_add_stmt_cost): Call aarch64_adjust_stmt_cost if using
22677 the new vector costs.
22679 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22681 * config/aarch64/aarch64-protos.h (sve_vec_cost): Turn into a
22682 derived class of simd_vec_cost. Add information about CLAST[AB]
22683 and FADDA instructions.
22684 * config/aarch64/aarch64.c (generic_sve_vector_cost): Update
22685 accordingly, using the vec_to_scalar costs for the new fields.
22686 (a64fx_sve_vector_cost): Likewise.
22687 (aarch64_reduc_type): New function.
22688 (aarch64_sve_in_loop_reduction_latency): Likewise.
22689 (aarch64_detect_vector_stmt_subtype): Take a vinfo parameter.
22690 Use aarch64_sve_in_loop_reduction_latency to handle SVE reductions
22691 that occur in the loop body.
22692 (aarch64_add_stmt_cost): Update call accordingly.
22694 2021-03-26 Richard Sandiford <richard.sandiford@arm.com>
22696 * config/aarch64/aarch64-tuning-flags.def (use_new_vector_costs):
22698 * config/aarch64/aarch64-protos.h (simd_vec_cost): Put comments
22699 above the fields rather than to the right.
22700 (simd_vec_cost::reduc_i8_cost): New member variable.
22701 (simd_vec_cost::reduc_i16_cost): Likewise.
22702 (simd_vec_cost::reduc_i32_cost): Likewise.
22703 (simd_vec_cost::reduc_i64_cost): Likewise.
22704 (simd_vec_cost::reduc_f16_cost): Likewise.
22705 (simd_vec_cost::reduc_f32_cost): Likewise.
22706 (simd_vec_cost::reduc_f64_cost): Likewise.
22707 * config/aarch64/aarch64.c (generic_advsimd_vector_cost): Update
22708 accordingly, using the vec_to_scalar_cost for the new fields.
22709 (generic_sve_vector_cost, a64fx_advsimd_vector_cost): Likewise.
22710 (a64fx_sve_vector_cost, qdf24xx_advsimd_vector_cost): Likewise.
22711 (thunderx_advsimd_vector_cost, tsv110_advsimd_vector_cost): Likewise.
22712 (cortexa57_advsimd_vector_cost, exynosm1_advsimd_vector_cost)
22713 (xgene1_advsimd_vector_cost, thunderx2t99_advsimd_vector_cost)
22714 (thunderx3t110_advsimd_vector_cost): Likewise.
22715 (aarch64_use_new_vector_costs_p): New function.
22716 (aarch64_simd_vec_costs): New function, split out from...
22717 (aarch64_builtin_vectorization_cost): ...here.
22718 (aarch64_is_reduction): New function.
22719 (aarch64_detect_vector_stmt_subtype): Likewise.
22720 (aarch64_add_stmt_cost): Call aarch64_detect_vector_stmt_subtype if
22721 using the new vector costs.
22723 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22726 * tree-emutls.c (get_emutls_init_templ_addr): Mark initializer of weak
22727 TLS declarations as public.
22729 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22731 * config/aarch64/aarch64-d.c (IN_TARGET_CODE): Define.
22732 * config/arm/arm-d.c (IN_TARGET_CODE): Likewise.
22733 * config/i386/i386-d.c (IN_TARGET_CODE): Likewise.
22734 * config/mips/mips-d.c (IN_TARGET_CODE): Likewise.
22735 * config/pa/pa-d.c (IN_TARGET_CODE): Likewise.
22736 * config/riscv/riscv-d.c (IN_TARGET_CODE): Likewise.
22737 * config/rs6000/rs6000-d.c (IN_TARGET_CODE): Likewise.
22738 * config/s390/s390-d.c (IN_TARGET_CODE): Likewise.
22739 * config/sparc/sparc-d.c (IN_TARGET_CODE): Likewise.
22741 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22744 * config.gcc (*-*-cygwin*): Add winnt-d.o
22745 (*-*-mingw*): Likewise.
22746 * config/i386/cygwin.h (EXTRA_TARGET_D_OS_VERSIONS): New macro.
22747 * config/i386/mingw32.h (EXTRA_TARGET_D_OS_VERSIONS): Likewise.
22748 * config/i386/t-cygming: Add winnt-d.o.
22749 * config/i386/winnt-d.c: New file.
22751 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22753 * config/freebsd-d.c: Include memmodel.h.
22755 2021-03-26 Iain Buclaw <ibuclaw@gdcproject.org>
22758 * config.gcc (*-*-openbsd*): Add openbsd-d.o.
22759 * config/t-openbsd: Add openbsd-d.o.
22760 * config/openbsd-d.c: New file.
22762 2021-03-25 Stam Markianos-Wright <stam.markianos-wright@arm.com>
22764 PR tree-optimization/96974
22765 * tree-vect-stmts.c (vect_get_vector_types_for_stmt): Replace assert
22766 with graceful exit.
22768 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
22771 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
22775 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
22776 always_inline in system headers.
22778 2021-03-25 Kewen Lin <linkw@linux.ibm.com>
22780 * tree-vect-loop.c (vect_model_reduction_cost): Init inside_cost.
22782 2021-03-25 Jakub Jelinek <jakub@redhat.com>
22785 * tree-core.h (enum operand_equal_flag): Add OEP_ADDRESS_OF_SAME_FIELD.
22786 * fold-const.c (operand_compare::operand_equal_p): Don't compare
22787 field offsets if OEP_ADDRESS_OF_SAME_FIELD.
22789 2021-03-25 H.J. Lu <hjl.tools@gmail.com>
22793 * config/i386/i386.c (ix86_can_inline_p): Don't check ISA for
22794 always_inline in system headers.
22796 2021-03-25 Richard Biener <rguenther@suse.de>
22798 PR tree-optimization/99746
22799 * tree-vect-slp-patterns.c (complex_pattern::build): Do not mark
22800 the scalar stmt as patterned. Instead set up required things
22803 2021-03-25 Xionghu Luo <luoxhu@linux.ibm.com>
22805 * config/rs6000/rs6000.c (power8_costs): Change l2 cache
22808 2021-03-24 Martin Liska <mliska@suse.cz>
22811 * common/config/i386/i386-common.c (ARRAY_SIZE): Fix off-by-one
22813 * config/i386/i386-options.c (ix86_option_override_internal):
22814 Add run-time assert.
22816 2021-03-24 Martin Jambor <mjambor@suse.cz>
22819 * ipa-cp.c (initialize_node_lattices): Mark as bottom all
22820 parameters with unknown type.
22821 (ipacp_value_safe_for_type): New function.
22822 (propagate_vals_across_arith_jfunc): Verify that the constant type
22823 can be used for a type of the formal parameter.
22824 (propagate_vals_across_ancestor): Likewise.
22825 (propagate_scalar_across_jump_function): Likewise. Pass the type
22826 also to propagate_vals_across_ancestor.
22828 2021-03-24 Christophe Lyon <christophe.lyon@linaro.org>
22831 * config/arm/mve.md (movmisalign<mode>_mve_store): Use Ux
22833 (movmisalign<mode>_mve_load): Likewise.
22835 2021-03-24 Jakub Jelinek <jakub@redhat.com>
22838 * config/arm/vec-common.md (one_cmpl<mode>2, neg<mode>2,
22839 movmisalign<mode>): Disable expanders for TARGET_REALLY_IWMMXT.
22841 2021-03-24 Alexandre Oliva <oliva@adacore.com>
22843 * doc/sourcebuild.texi (sysconf): New effective target.
22845 2021-03-24 Alexandre Oliva <oliva@adacore.com>
22847 * config/i386/predicates.md (reg_or_const_vec_operand): New.
22848 * config/i386/sse.md (ssse3_pshufbv8qi3): Add an expander for
22849 the now *-prefixed insn_and_split, turn the splitter const vec
22850 into an input for the insn, making it an ignored immediate for
22851 non-split cases, and loaded into the scratch register
22854 2021-03-23 Vladimir N. Makarov <vmakarov@redhat.com>
22857 * config/aarch64/constraints.md (Utq, UOb, UOh, UOw, UOd, UOty):
22858 Use define_relaxed_memory_constraint for them.
22860 2021-03-23 Iain Sandoe <iain@sandoe.co.uk>
22863 * config/host-darwin.c (darwin_gt_pch_use_address): Add a
22864 colon to the diagnostic message.
22866 2021-03-23 Ilya Leoshkevich <iii@linux.ibm.com>
22868 * fwprop.c (fwprop_propagation::fwprop_propagation): Look at
22870 (try_fwprop_subst_note): Use set_info instead of insn_info.
22871 (try_fwprop_subst_pattern): Likewise.
22872 (try_fwprop_subst_notes): Likewise.
22873 (try_fwprop_subst): Likewise.
22874 (forward_propagate_subreg): Likewise.
22875 (forward_propagate_and_simplify): Likewise.
22876 (forward_propagate_into): Likewise.
22877 * rtl-ssa/accesses.h (set_info::single_nondebug_use) New
22879 (set_info::single_nondebug_insn_use): Likewise.
22880 (set_info::single_phi_use): Likewise.
22881 * rtl-ssa/member-fns.inl (set_info::single_nondebug_use) New
22883 (set_info::single_nondebug_insn_use): Likewise.
22884 (set_info::single_phi_use): Likewise.
22886 2021-03-23 Christophe Lyon <christophe.lyon@linaro.org>
22888 * doc/sourcebuild.texi (arm_dsp_ok, arm_dsp): Document.
22890 2021-03-23 Jakub Jelinek <jakub@redhat.com>
22893 * config/aarch64/aarch64.c (aarch64_add_offset): Tell
22894 expand_mult to perform an unsigned rather than a signed
22897 2021-03-23 H.J. Lu <hjl.tools@gmail.com>
22900 * config/i386/cpuid.h (__cpuid): Add __volatile__.
22901 (__cpuid_count): Likewise.
22903 2021-03-23 Richard Biener <rguenther@suse.de>
22905 PR tree-optimization/99721
22906 * tree-vect-slp.c (vect_slp_analyze_node_operations):
22907 Make sure we can schedule the node.
22909 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22911 * config/riscv/riscv.c (riscv_subword): Take endianness into
22912 account when calculating the byte offset.
22914 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22916 * config/riscv/predicates.md (subreg_lowpart_operator): New predicate
22917 * config/riscv/riscv.md (*addsi3_extended2, *subsi3_extended2)
22918 (*negsi2_extended2, *mulsi3_extended2, *<optab>si3_mask)
22919 (*<optab>si3_mask_1, *<optab>di3_mask, *<optab>di3_mask_1)
22920 (*<optab>si3_extend_mask, *<optab>si3_extend_mask_1): Use
22921 new predicate "subreg_lowpart_operator"
22923 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22925 * config/riscv/riscv.c (riscv_swap_instruction): New function
22926 to byteswap an SImode rtx containing an instruction.
22927 (riscv_trampoline_init): Byteswap the generated instructions
22930 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22932 * common/config/riscv/riscv-common.c
22933 (TARGET_DEFAULT_TARGET_FLAGS): Set default endianness.
22934 * config.gcc (riscv32be-*, riscv64be-*): Set
22935 TARGET_BIG_ENDIAN_DEFAULT to 1.
22936 * config/riscv/elf.h (LINK_SPEC): Change -melf* value
22937 depending on default endianness.
22938 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
22939 * config/riscv/linux.h (LINK_SPEC): Likewise.
22940 * config/riscv/riscv.c (TARGET_DEFAULT_TARGET_FLAGS): Set
22941 default endianness.
22942 * config/riscv/riscv.h (DEFAULT_ENDIAN_SPEC): New macro.
22944 2021-03-23 Marcus Comstedt <marcus@mc.pp.se>
22946 * config/riscv/elf.h (LINK_SPEC): Pass linker endianness flag.
22947 * config/riscv/freebsd.h (LINK_SPEC): Likewise.
22948 * config/riscv/linux.h (LINK_SPEC): Likewise.
22949 * config/riscv/riscv.h (ASM_SPEC): Pass -mbig-endian and
22951 (BYTES_BIG_ENDIAN): Handle big endian.
22952 (WORDS_BIG_ENDIAN): Define to BYTES_BIG_ENDIAN.
22953 * config/riscv/riscv.opt (-mbig-endian, -mlittle-endian): New
22955 * doc/invoke.texi (-mbig-endian, -mlittle-endian): Document.
22957 2021-03-23 Stefan Schulze Frielinghaus <stefansf@linux.ibm.com>
22959 * regcprop.c (find_oldest_value_reg): Ask target whether
22960 different mode is fine for replacement register.
22962 2021-03-23 Aldy Hernandez <aldyh@redhat.com>
22964 PR tree-optimization/99296
22965 * value-range.cc (irange::irange_set_1bit_anti_range): New.
22966 (irange::irange_set_anti_range): Call irange_set_1bit_anti_range
22967 * value-range.h (irange::irange_set_1bit_anti_range): New.
22969 2021-03-22 Vladimir N. Makarov <vmakarov@redhat.com>
22972 * config/aarch64/constraints.md (UtQ): Use
22973 define_relaxed_memory_constraint for it.
22974 * doc/md.texi (define_relaxed_memory_constraint): Describe it.
22975 * genoutput.c (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
22976 * genpreds.c (constraint_data): Add bitfield is_relaxed_memory.
22977 (have_relaxed_memory_constraints): New static var.
22978 (relaxed_memory_start, relaxed_memory_end): Ditto.
22979 (add_constraint): Add arg is_relaxed_memory. Check name for
22980 relaxed memory. Set up is_relaxed_memory in constraint_data and
22981 have_relaxed_memory_constraints. Adjust calls.
22982 (choose_enum_order): Process relaxed memory.
22983 (write_tm_preds_h): Ditto.
22984 (main): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
22985 * gensupport.c (process_rtx): Process DEFINE_RELAXED_MEMORY_CONSTRAINT.
22986 * ira-costs.c (record_reg_classes): Process CT_RELAXED_MEMORY.
22987 * ira-lives.c (single_reg_class): Use
22988 insn_extra_relaxed_memory_constraint.
22989 * ira.c (ira_setup_alts): CT_RELAXED_MEMORY.
22990 * lra-constraints.c (valid_address_p): Use
22991 insn_extra_relaxed_memory_constraint instead of other memory
22993 (process_alt_operands): Process CT_RELAXED_MEMORY.
22994 (curr_insn_transform): Use insn_extra_relaxed_memory_constraint.
22995 * recog.c (asm_operand_ok, preprocess_constraints): Process
22997 * reload.c (find_reloads): Ditto.
22998 * rtl.def (DEFINE_RELAXED_MEMORY_CONSTRAINT): New.
22999 * stmt.c (parse_input_constraint): Use
23000 insn_extra_relaxed_memory_constraint.
23002 2021-03-22 Segher Boessenkool <segher@kernel.crashing.org>
23005 * ubsan.c (ubsan_instrument_float_cast): Don't test for unordered if
23008 2021-03-22 Alex Coplan <alex.coplan@arm.com>
23011 * config/arm/arm-protos.h (neon_make_constant): Add generate
23012 argument to guard emitting insns, default to true.
23013 * config/arm/arm.c (arm_legitimate_constant_p_1): Reject
23014 CONST_VECTORs which neon_make_constant can't handle.
23015 (neon_vdup_constant): Add generate argument, avoid emitting
23016 insns if it's not set.
23017 (neon_make_constant): Plumb new generate argument through.
23018 * config/arm/constraints.md (Ui): New. Use it...
23019 * config/arm/mve.md (*mve_mov<mode>): ... here.
23020 * config/arm/vec-common.md (movv8hf): Use neon_make_constant to
23021 synthesize constants.
23023 2021-03-22 Richard Biener <rguenther@suse.de>
23025 * debug.h: Add deprecation warning.
23027 2021-03-22 Richard Biener <rguenther@suse.de>
23029 PR tree-optimization/99694
23030 * tree-ssa-sccvn.c (visit_phi): Ignore edges with the
23033 2021-03-22 Kito Cheng <kito.cheng@sifive.com>
23036 * config/riscv/riscv.c (riscv_expand_block_move): Get RTL value
23037 after type checking.
23039 2021-03-22 Jakub Jelinek <jakub@redhat.com>
23043 * dwarf2out.c (get_full_len): Use get_precision rather than
23045 (add_const_value_attribute): Make sure add_AT_wide argument has
23046 precision prec rather than some very wide one.
23048 2021-03-22 Kewen Lin <linkw@linux.ibm.com>
23050 * config/rs6000/rs6000.md (*rotldi3_insert_sf,
23051 *mov<SFDF:mode><SFDF2:mode>cc_p9, floatsi<mode>2_lfiwax,
23052 floatsi<mode>2_lfiwax_mem, floatunssi<mode>2_lfiwzx,
23053 floatunssi<mode>2_lfiwzx_mem, *floatsidf2_internal,
23054 *floatunssidf2_internal, fix_trunc<mode>si2_stfiwx,
23055 fix_trunc<mode>si2_internal, fixuns_trunc<mode>si2_stfiwx,
23056 *round32<mode>2_fprs, *roundu32<mode>2_fprs,
23057 *fix_trunc<mode>si2_internal): Fix empty split condition.
23058 * config/rs6000/vsx.md (*vsx_le_undo_permute_<mode>,
23059 vsx_reduc_<VEC_reduc_name>_v2df, vsx_reduc_<VEC_reduc_name>_v4sf,
23060 *vsx_reduc_<VEC_reduc_name>_v2df_scalar,
23061 *vsx_reduc_<VEC_reduc_name>_v4sf_scalar): Likewise.
23063 2021-03-22 Xionghu Luo <luoxhu@linux.ibm.com>
23066 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var_p9):
23067 Convert idx to DImode.
23068 (rs6000_expand_vector_set_var_p8): Likewise.
23070 2021-03-21 Jakub Jelinek <jakub@redhat.com>
23073 * dwarf2out.c (insert_float): Change return type from void to
23074 unsigned, handle GET_MODE_SIZE (mode) == 2 and return element size.
23075 (mem_loc_descriptor, loc_descriptor, add_const_value_attribute):
23078 2021-03-20 H.J. Lu <hjl.tools@gmail.com>
23081 * config/i386/i386.c (construct_container): Check cfun != NULL
23082 before accessing silent_p.
23084 2021-03-20 Ahamed Husni <ahamedhusni73@gmail.com>
23086 * asan.c: Fix typos in comments.
23088 2021-03-20 Vladimir N. Makarov <vmakarov@redhat.com>
23090 PR rtl-optimization/99680
23091 * lra-constraints.c (skip_contraint_modifiers): Rename to skip_constraint_modifiers.
23092 (process_address_1): Check empty constraint before using
23095 2021-03-19 Pat Haugen <pthaugen@linux.ibm.com>
23097 * config/rs6000/rs6000.c (power10_cost): New.
23098 (rs6000_option_override_internal): Set Power10 costs.
23099 (rs6000_issue_rate): Set Power10 issue rate.
23100 * config/rs6000/power10.md: Rewrite for Power10.
23102 2021-03-19 Vladimir N. Makarov <vmakarov@redhat.com>
23105 * lra-constraints.c (process_address_1): Don't use unknown
23106 constraint for address constraint.
23108 2021-03-19 Iain Sandoe <iain@sandoe.co.uk>
23111 * config.gcc (powerpc-*-darwin8): Delete the reference to
23112 the now removed darwin8.h.
23114 2021-03-19 Olivier Hainque <hainque@adacore.com>
23117 * config/vxworksae.h (VX_CPU_PREFIX): Define.
23119 2021-03-19 John David Anglin <danglin@gcc.gnu.org>
23121 * config/pa/pa.c (import_milli): Use memcpy instead of strncpy.
23123 2021-03-19 Tamar Christina <tamar.christina@arm.com>
23125 PR tree-optimization/99656
23126 * tree-vect-slp-patterns.c (linear_loads_p,
23127 complex_add_pattern::matches, is_eq_or_top,
23128 vect_validate_multiplication, complex_mul_pattern::matches,
23129 complex_fms_pattern::matches): Remove complex_perm_kinds_t.
23130 * tree-vectorizer.h: (complex_load_perm_t): Removed.
23131 (slp_tree_to_load_perm_map_t): Use complex_perm_kinds_t instead of
23132 complex_load_perm_t.
23134 2021-03-19 H.J. Lu <hjl.tools@gmail.com>
23137 * config/i386/i386-options.c (ix86_init_machine_status): Set
23139 * config/i386/i386.c (init_cumulative_args): Set silent_p to
23141 (construct_container): Return early for return and argument
23142 errors if silent_p is true.
23143 * config/i386/i386.h (machine_function): Add silent_p.
23145 2021-03-19 Jakub Jelinek <jakub@redhat.com>
23148 * config/arm/constraints.md (Ds): New constraint.
23149 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): Use w,Ds
23150 constraint instead of w,Dm.
23152 2021-03-19 Andrew Stubbs <ams@codesourcery.com>
23154 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Fix quotes
23157 2021-03-19 Eric Botcazou <ebotcazou@adacore.com>
23159 PR middle-end/99641
23160 * fold-const.c (native_encode_initializer) <CONSTRUCTOR>: For an
23161 array type, do the computation of the current position in sizetype.
23163 2021-03-18 Vladimir N. Makarov <vmakarov@redhat.com>
23166 * lra-constraints.c (process_address_1): Use lookup_constraint
23167 only for a single constraint.
23169 2021-03-18 Martin Sebor <msebor@redhat.com>
23171 PR middle-end/99502
23172 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): Rename...
23173 (inbounds_memaccess_p): ...to this. Check the ending offset of
23174 the accessed member.
23176 2021-03-18 Andrew Stubbs <ams@codesourcery.com>
23178 * config/gcn/gcn.c (gcn_parse_amdgpu_hsa_kernel_attribute): Add %< and
23179 %> quote markers to error messages.
23180 (gcn_goacc_validate_dims): Likewise.
23181 (gcn_conditional_register_usage): Remove exclaimation mark from error
23183 (gcn_vectorize_vec_perm_const): Ensure perm is fully uninitialized.
23185 2021-03-18 Jan Hubicka <hubicka@ucw.cz>
23187 * config/i386/x86-tune-costs.h (struct processor_costs): Fix costs of
23190 2021-03-18 Sinan Lin <sinan@isrc.iscas.ac.cn>
23191 Kito Cheng <kito.cheng@sifive.com>
23193 * config/riscv/riscv.c (riscv_block_move_straight): Change type
23194 to unsigned HOST_WIDE_INT for parameter and local variable with
23195 HOST_WIDE_INT type.
23196 (riscv_adjust_block_mem): Ditto.
23197 (riscv_block_move_loop): Ditto.
23198 (riscv_expand_block_move): Ditto.
23200 2021-03-18 Nick Clifton <nickc@redhat.com>
23202 * config/v850/v850.c (construct_restore_jr): Increase static
23204 (construct_save_jarl): Likewise.
23205 * config/v850/v850.h (DWARF2_DEBUGGING_INFO): Define.
23207 2021-03-18 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23209 * config/aarch64/aarch64.c (aarch64_adjust_generic_arch_tuning): Define.
23210 (aarch64_override_options_internal): Use it.
23211 (generic_tunings): Add AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS to
23214 2021-03-17 Sandra Loosemore <sandra@codesourcery.com>
23216 * config/nios2/nios2.c (nios2_custom_check_insns): Clean up
23217 error message format issues.
23218 (nios2_option_override): Likewise.
23219 (nios2_expand_fpu_builtin): Likewise.
23220 (nios2_init_custom_builtins): Adjust to avoid bogus strncpy
23221 truncation warning.
23222 (nios2_expand_custom_builtin): More error message format fixes.
23223 (nios2_expand_rdwrctl_builtin): Likewise.
23224 (nios2_expand_rdprs_builtin): Likewise.
23225 (nios2_expand_eni_builtin): Likewise.
23226 (nios2_expand_builtin): Likewise.
23227 (nios2_register_custom_code): Likewise.
23228 (nios2_valid_target_attribute_rec): Likewise.
23229 (nios2_add_insn_asm): Fix uninitialized variable warning.
23231 2021-03-17 Jan Hubicka <jh@suse.cz>
23233 * config/i386/x86-tune-costs.h (struct processor_costs): Update costs
23234 of gather to match reality.
23235 * config/i386/x86-tune.def (X86_TUNE_USE_GATHER): Enable for znver3.
23237 2021-03-17 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23239 * config/aarch64/aarch64-builtins.c (aarch64_expand_rng_builtin): Use EQ
23240 to compare against CC_REG rather than NE.
23242 2021-03-17 H.J. Lu <hjl.tools@gmail.com>
23245 * config/i386/i386.c (ix86_force_load_from_GOT_p): Support
23246 inline assembly statements.
23247 (ix86_print_operand): Update 'P' handling for -fno-plt.
23249 2021-03-17 Tamar Christina <tamar.christina@arm.com>
23252 * config/aarch64/aarch64.c
23253 (aarch64_simd_clone_compute_vecsize_and_simdlen): Remove unused var.
23255 2021-03-16 Segher Boessenkool <segher@kernel.crashing.org>
23258 * config/rs6000/predicates.md (branch_comparison_operator): Allow
23259 ordered and unordered for CCFPmode, if flag_finite_math_only.
23261 2021-03-16 Jakub Jelinek <jakub@redhat.com>
23264 * config/i386/i386-expand.c (ix86_split_lea_for_addr): Emit a MULT
23265 rather than ASHIFT.
23266 * config/i386/i386.md (mult by 1248 into ashift): New splitter.
23268 2021-03-16 Martin Liska <mliska@suse.cz>
23271 * optc-save-gen.awk: Add flag_ipa_ra to exceptions for
23272 cl_optimization_compare function.
23274 2021-03-16 Ilya Leoshkevich <iii@linux.ibm.com>
23276 * config/s390/s390.c (f_constraint_p): Treat "fv" constraints
23279 2021-03-16 Jakub Jelinek <jakub@redhat.com>
23282 * config/i386/i386.h (struct machine_function): Add
23283 has_explicit_vzeroupper bitfield.
23284 * config/i386/i386-expand.c (ix86_expand_builtin): Set
23285 cfun->machine->has_explicit_vzeroupper when expanding
23286 IX86_BUILTIN_VZEROUPPER.
23287 * config/i386/i386-features.c (rest_of_handle_insert_vzeroupper):
23288 Do the mode switching only when TARGET_VZEROUPPER, expensive
23289 optimizations turned on and not optimizing for size.
23290 (pass_insert_vzeroupper::gate): Enable even when
23291 cfun->machine->has_explicit_vzeroupper is set.
23293 2021-03-16 Jakub Jelinek <jakub@redhat.com>
23296 * config/aarch64/aarch64.c
23297 (aarch64_simd_clone_compute_vecsize_and_simdlen): If not a function
23298 definition, walk TYPE_ARG_TYPES list if non-NULL for argument types
23299 instead of DECL_ARGUMENTS. Ignore types for uniform arguments.
23301 2021-03-15 Richard Biener <rguenther@suse.de>
23303 PR tree-optimization/98834
23304 * tree-ssa-sccvn.c (vn_reference_lookup_3): Handle missing
23305 subsetting by truncating the access size.
23307 2021-03-15 Jan Hubicka <hubicka@ucw.cz>
23309 * config/i386/i386-options.c (processor_cost_table): Add znver3_cost.
23310 * config/i386/x86-tune-costs.h (znver3_cost): New gobal variable; copy
23313 2021-03-15 Martin Liska <mliska@suse.cz>
23315 * spellcheck.c: Add missing comma in initialization.
23317 2021-03-14 Uroš Bizjak <ubizjak@gmail.com>
23319 * config/i386/sse.md (*vec_extract<mode>): Merge alternative 0 with
23320 alternative 2 and alternative 1 with alternative 3 using
23321 YW register constraint.
23322 (*vec_extract<PEXTR_MODE12:mode>_zext): Merge alternatives
23323 using YW register constraint.
23324 (*vec_extractv16qi_zext): Ditto.
23325 (*vec_extractv4si): Merge alternatives 4 and 5
23326 using Yw register constraint.
23327 (*ssse3_palignr<mode>_perm): Use Yw instead of v for alternative 3.
23329 2021-03-13 Martin Sebor <msebor@redhat.com>
23331 PR tree-optimization/99489
23332 * builtins.c (gimple_call_alloc_size): Fail gracefully when argument
23333 is not a call statement.
23335 2021-03-13 Jakub Jelinek <jakub@redhat.com>
23337 PR tree-optimization/99544
23338 * match.pd (X + (X << C) -> X * (1 + (1 << C))): Don't simplify
23339 if for vector types multiplication can't be done in type's mode.
23341 2021-03-12 Eric Botcazou <ebotcazou@adacore.com>
23344 * config/sparc/constraints.md (w): Rename to...
23345 (W): ... this and ditch previous implementation.
23346 * config/sparc/sparc.md (*movdi_insn_sp64): Replace W with m.
23347 (*movdf_insn_sp64): Likewise.
23348 (*mov<VM64:mode>_insn_sp64): Likewise.
23349 * config/sparc/sync.md (*atomic_compare_and_swap<mode>_1): Replace
23351 (atomic_compare_and_swap_leon3_1): Likewise.
23352 (*atomic_compare_and_swapdi_v8plus): Likewise.
23353 * config/sparc/sparc.c (memory_ok_for_ldd): Remove useless test on
23354 architecture and add missing address validity check during LRA.
23356 2021-03-12 Tobias Burnus <tobias@codesourcery.com>
23359 * gimplify.c (omp_add_variable): Handle NULL_TREE as size
23360 occuring for assumed-size arrays in use_device_{ptr,addr}.
23362 2021-03-12 Jakub Jelinek <jakub@redhat.com>
23365 * config/i386/constraints.md (YW): New internal constraint.
23366 * config/i386/sse.md (v_Yw): Add V4TI, V2TI, V1TI and TI cases.
23367 (*<sse2_avx2>_<insn><mode>3<mask_name>,
23368 *<sse2_avx2>_uavg<mode>3<mask_name>, *abs<mode>2,
23369 *<s>mul<mode>3_highpart<mask_name>): Use <v_Yw> instead of v in
23371 (<sse2_avx2>_psadbw): Use YW instead of v in constraints.
23372 (*avx2_pmaddwd, *sse2_pmaddwd, *<code>v8hi3, *<code>v16qi3,
23373 avx2_pmaddubsw256, ssse3_pmaddubsw128): Merge last two alternatives
23374 into one, use Yw instead of former x,v.
23375 (ashr<mode>3, <insn><mode>3): Use <v_Yw> instead of x in constraints of
23376 the last alternative.
23377 (<sse2_avx2>_packsswb<mask_name>, <sse2_avx2>_packssdw<mask_name>,
23378 <sse2_avx2>_packuswb<mask_name>, <sse4_1_avx2>_packusdw<mask_name>,
23379 *<ssse3_avx2>_pmulhrsw<mode>3<mask_name>, <ssse3_avx2>_palignr<mode>,
23380 <ssse3_avx2>_pshufb<mode>3<mask_name>): Merge last two alternatives
23381 into one, use <v_Yw> instead of former x,v.
23382 (avx2_interleave_highv32qi<mask_name>,
23383 vec_interleave_highv16qi<mask_name>): Use Yw instead of v in
23384 constraints. Add && <mask_avx512bw_condition> to condition.
23385 (avx2_interleave_lowv32qi<mask_name>,
23386 vec_interleave_lowv16qi<mask_name>,
23387 avx2_interleave_highv16hi<mask_name>,
23388 vec_interleave_highv8hi<mask_name>,
23389 avx2_interleave_lowv16hi<mask_name>, vec_interleave_lowv8hi<mask_name>,
23390 avx2_pshuflw_1<mask_name>, sse2_pshuflw_1<mask_name>,
23391 avx2_pshufhw_1<mask_name>, sse2_pshufhw_1<mask_name>,
23392 avx2_<code>v16qiv16hi2<mask_name>, sse4_1_<code>v8qiv8hi2<mask_name>,
23393 *sse4_1_<code>v8qiv8hi2<mask_name>_1, <sse2_avx2>_<insn><mode>3): Use
23394 Yw instead of v in constraints.
23395 * config/i386/mmx.md (Yv_Yw): New define_mode_attr.
23396 (*mmx_<insn><mode>3, mmx_ashr<mode>3, mmx_<insn><mode>3): Use <Yv_Yw>
23397 instead of Yv in constraints.
23398 (*mmx_<insn><mode>3, *mmx_mulv4hi3, *mmx_smulv4hi3_highpart,
23399 *mmx_umulv4hi3_highpart, *mmx_pmaddwd, *mmx_<code>v4hi3,
23400 *mmx_<code>v8qi3, mmx_pack<s_trunsuffix>swb, mmx_packssdw,
23401 mmx_punpckhbw, mmx_punpcklbw, mmx_punpckhwd, mmx_punpcklwd,
23402 *mmx_uavgv8qi3, *mmx_uavgv4hi3, mmx_psadbw): Use Yw instead of Yv in
23404 (*mmx_pinsrw, *mmx_pinsrb, *mmx_pextrw, *mmx_pextrw_zext, *mmx_pextrb,
23405 *mmx_pextrb_zext): Use YW instead of Yv in constraints.
23406 (*mmx_eq<mode>3, mmx_gt<mode>3): Use x instead of Yv in constraints.
23407 (mmx_andnot<mode>3, *mmx_<code><mode>3): Split last alternative into
23408 two, one with just x, another isa avx512vl with v.
23410 2021-03-12 Martin Liska <mliska@suse.cz>
23412 * doc/invoke.texi: Add missing param documentation.
23414 2021-03-11 David Malcolm <dmalcolm@redhat.com>
23417 * Makefile.in (ANALYZER_OBJS): Add analyzer/feasible-graph.o and
23418 analyzer/trimmed-graph.o.
23419 * doc/analyzer.texi (Analyzer Paths): Rewrite description of
23420 feasibility checking to reflect new implementation.
23421 * doc/invoke.texi (-fdump-analyzer-feasibility): Document new
23423 * shortest-paths.h (shortest_paths::get_shortest_distance): New.
23425 2021-03-11 David Malcolm <dmalcolm@redhat.com>
23427 * digraph.cc (selftest::test_shortest_paths): Update
23428 shortest_paths init for new param. Add test of
23429 SPS_TO_GIVEN_TARGET.
23430 * shortest-paths.h (enum shortest_path_sense): New.
23431 (shortest_paths::shortest_paths): Add "sense" param.
23432 Update for renamings. Generalize to use "sense" param.
23433 (shortest_paths::get_shortest_path): Rename param.
23434 (shortest_paths::m_sense): New field.
23435 (shortest_paths::m_prev): Rename...
23436 (shortest_paths::m_best_edge): ...to this.
23437 (shortest_paths::get_shortest_path): Update for renamings.
23438 Conditionalize flipping of path on sense of traversal.
23440 2021-03-11 David Malcolm <dmalcolm@redhat.com>
23442 * digraph.cc (selftest::test_shortest_paths): Add test coverage
23443 for paths from B and C.
23444 * shortest-paths.h (shortest_paths::shortest_paths): Handle
23445 unreachable nodes, rather than asserting.
23447 2021-03-11 David Edelsohn <dje.gcc@gmail.com>
23450 * config/rs6000/rs6000.c (rs6000_xcoff_file_start): Don't create
23451 xcoff_tbss_section_name.
23452 * config/rs6000/xcoff.h (ASM_OUTPUT_TLS_COMMON): Use .lcomm.
23453 * xcoffout.c (xcoff_tbss_section_name): Delete.
23454 * xcoffout.h (xcoff_tbss_section_name): Delete.
23456 2021-03-11 Richard Biener <rguenther@suse.de>
23458 PR tree-optimization/99523
23459 * tree-cfg.c (dump_function_to_file): Dump SSA names
23460 w/o identifier to the decls section as well, not only those
23461 without a VAR_DECL.
23463 2021-03-11 Jakub Jelinek <jakub@redhat.com>
23466 * ipa-icf-gimple.c (func_checker::compare_gimple_call): For internal
23467 function calls with lhs fail if the lhs don't have compatible types.
23469 2021-03-11 Hans-Peter Nilsson <hp@axis.com>
23471 * config/cris/cris.h (HARD_FRAME_POINTER_REGNUM): Define.
23472 Change FRAME_POINTER_REGNUM to correspond to a new faked
23473 register faked_fp, part of GENNONACR_REGS like faked_ap.
23474 (CRIS_FAKED_REGS_CONTENTS): New helper macro.
23475 (FIRST_PSEUDO_REGISTER, FIXED_REGISTERS, CALL_USED_REGISTERS):
23476 (REG_ALLOC_ORDER, REG_CLASS_CONTENTS, REGNO_OK_FOR_BASE_P)
23477 (ELIMINABLE_REGS, REGISTER_NAMES): Adjust accordingly.
23478 * config/cris/cris.md (CRIS_FP_REGNUM): Renumber to new faked
23480 (CRIS_REAL_FP_REGNUM): New constant.
23481 * config/cris/cris.c (cris_reg_saved_in_regsave_area): Check
23482 for HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM.
23483 (cris_initial_elimination_offset): Handle elimination changes
23484 to HARD_FRAME_POINTER_REGNUM instead of FRAME_POINTER_REGNUM
23485 and add one from FRAME_POINTER_REGNUM to
23486 HARD_FRAME_POINTER_REGNUM.
23487 (cris_expand_prologue, cris_expand_epilogue): Emit code for
23488 hard_frame_pointer_rtx instead of frame_pointer_rtx.
23490 2021-03-10 David Edelsohn <dje.gcc@gmail.com>
23493 * config/rs6000/aix.h (ADJUST_FIELD_ALIGN): Add check for DCmode.
23494 * config/rs6000/rs6000.c (rs6000_special_round_type_align): Same.
23496 2021-03-10 Vladimir N. Makarov <vmakarov@redhat.com>
23499 * lra-constraints.c (process_address_1): Don't check unknown
23500 constraint, use X for empty constraint.
23502 2021-03-10 Alex Coplan <alex.coplan@arm.com>
23504 * config/aarch64/aarch64.c (aarch64_vfp_is_call_or_return_candidate):
23505 Fix typo in comment describing "is_ha" argument.
23507 2021-03-10 John David Anglin <danglin@gcc.gnu.org>
23509 * doc/sourcebuild.texi: Document LRA target selector.
23511 2021-03-10 David Malcolm <dmalcolm@redhat.com>
23513 * doc/ux.texi: Add subsection contrasting interactive versus
23514 batch usage of GCC.
23516 2021-03-10 Joel Hutton <joel.hutton@arm.com>
23519 * tree-vect-stmts.c (vectorizable_store): Fix scatter store mask
23521 (vectorizable_load): Fix gather load mask check condition.
23523 2021-03-10 Richard Biener <rguenther@suse.de>
23525 PR tree-optimization/99510
23526 * tree.c (check_aligned_type): Check that the candidate
23527 has TYPE_USER_ALIGN set instead of matching with the
23530 2021-03-10 Eric Botcazou <ebotcazou@adacore.com>
23532 * config/sparc/sparc.c (sparc_regmode_natural_size): Return 4 for
23533 float and vector integer modes only if the mode is not larger.
23535 2021-03-10 Hans-Peter Nilsson <hp@axis.com>
23537 * config/cris/cris.h (DWARF_FRAME_REGISTERS): Define.
23539 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
23541 * ira.c (ira_setup_alts, ira_get_dup_out_num): Process digital
23543 * ira-lives.c (single_reg_class): Ditto.
23545 2021-03-09 Sebastian Huber <sebastian.huber@embedded-brains.de>
23547 * config.gcc (aarch64-*-rtems*): Include general rtems.h after
23548 the architecture-specific rtems.h.
23549 (aarch64-*-rtems*): Likewise.
23550 (arm*-*-rtems*): Likewise.
23551 (epiphany-*-rtems*): Likewise.
23552 (riscv*-*-rtems*): Likewise.
23554 2021-03-09 Jakub Jelinek <jakub@redhat.com>
23556 PR tree-optimization/99305
23557 * tree-ssa-phiopt.c (conditional_replacement): Test integer_pow2p
23558 before integer_all_onesp instead of vice versa.
23560 2021-03-09 Richard Earnshaw <rearnsha@arm.com>
23562 * common/config/arm/arm-common.c (arm_config_default): Change type
23563 of 'i' to unsigned.
23565 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
23568 * lra-constraints.c (process_address_1): Process constraint 'g'
23569 separately and digital constraints containing more one digit.
23571 2021-03-09 Nick Clifton <nickc@redhat.com>
23573 * config/rx/rx.h (DBX_DEBUGGING_INFO): Define.
23574 (DWARF"_DEBUGGING_INFO): Define.
23576 2021-03-09 Eric Botcazou <ebotcazou@adacore.com>
23579 * calls.c (initialize_argument_information): When the argument
23580 is passed by reference, do not make a copy in a thunk only if
23581 the argument is already in memory. Remove redundant test for
23582 the case of callee copy.
23584 2021-03-09 Vladimir N. Makarov <vmakarov@redhat.com>
23587 * lra-constraints.c (process_address_1): Process 0..9 constraints
23588 in process_address_1.
23590 2021-03-09 Andreas Krebbel <krebbel@linux.ibm.com>
23592 * config/s390/s390.c (struct s390_processor processor_table):
23593 Binutils name string must not be empty.
23595 2021-03-09 Claudiu Zissulescu <claziss@synopsys.com>
23597 * config/arc/arc.c (arc_attr_type): Remove function.
23599 2021-03-09 Martin Liska <mliska@suse.cz>
23602 * config/i386/i386-options.c (ix86_option_override_internal):
23603 Set isa_flags for OPTS argument and not for the global
23606 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
23608 * config/rs6000/predicates.md (ds_form_mem_operand): Check
23611 2021-03-09 Aaron Sawdey <acsawdey@linux.ibm.com>
23614 * config/rs6000/predicates.md (ds_form_mem_operand) New
23616 * config/rs6000/genfusion.pl (gen_ld_cmpi_p10) Use
23617 ds_form_mem_operand in ld/lwa patterns.
23618 * config/rs6000/fusion.md: Regenerate file.
23620 2021-03-08 Martin Sebor <msebor@redhat.com>
23622 PR middle-end/98266
23623 * gimple-array-bounds.cc (inbounds_vbase_memaccess_p): New function.
23624 (array_bounds_checker::check_array_bounds): Call it.
23626 2021-03-08 Martin Sebor <msebor@redhat.com>
23628 PR middle-end/97631
23629 * tree-ssa-strlen.c (maybe_warn_overflow): Test rawmem.
23630 (handle_builtin_stxncpy_strncat): Rename locals. Determine
23631 destination size from allocation calls. Issue a more appropriate
23633 (handle_builtin_memcpy): Pass true as rawmem to maybe_warn_overflow.
23634 (handle_builtin_memset): Same.
23636 2021-03-08 Peter Bergner <bergner@linux.ibm.com>
23639 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_permute): Add an assert
23640 to ensure we do not have an Altivec style address.
23641 * config/rs6000/vsx.md (*vsx_le_perm_load_<mode>): Disable if passed
23642 an Altivec style address.
23643 (*vsx_le_perm_store_<mode>): Likewise.
23644 (splitters after *vsx_le_perm_store_<mode>): Likewise.
23645 (vsx_load_<mode>): Disable special expander if passed an Altivec
23647 (vsx_store_<mode>): Likewise.
23649 2021-03-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
23652 * config/aarch64/predicates.md (aarch64_simd_shift_imm_vec_qi): Define.
23653 (aarch64_simd_shift_imm_vec_hi): Likewise.
23654 (aarch64_simd_shift_imm_vec_si): Likewise.
23655 (aarch64_simd_shift_imm_vec_di): Likewise.
23656 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le): Use
23657 predicate from above.
23658 (aarch64_shrn<mode>_insn_be): Likewise.
23659 (aarch64_rshrn<mode>_insn_le): Likewise.
23660 (aarch64_rshrn<mode>_insn_be): Likewise.
23661 (aarch64_shrn2<mode>_insn_le): Likewise.
23662 (aarch64_shrn2<mode>_insn_be): Likewise.
23663 (aarch64_rshrn2<mode>_insn_le): Likewise.
23664 (aarch64_rshrn2<mode>_insn_be): Likewise.
23666 2021-03-08 Vladimir N. Makarov <vmakarov@redhat.com>
23669 * lra-constraints.c (skip_contraint_modifiers): New function.
23670 (process_address_1): Use it before lookup_constraint call.
23672 2021-03-08 Martin Liska <mliska@suse.cz>
23675 * config/i386/i386-options.c (ix86_option_override_internal):
23676 Enable UINTR and HRESET for -march that supports it.
23678 2021-03-08 Ilya Leoshkevich <iii@linux.ibm.com>
23680 * config/s390/s390.c (f_constraint_p): New function.
23681 (s390_md_asm_adjust): Implement TARGET_MD_ASM_ADJUST.
23682 (TARGET_MD_ASM_ADJUST): Likewise.
23684 2021-03-08 Tobias Burnus <tobias@codesourcery.com>
23687 * tree-nested.c (convert_local_reference_stmt): Avoid calling
23688 lookup_field_for_decl for Fortran module (= namespace context).
23690 2021-03-08 Andreas Krebbel <krebbel@linux.ibm.com>
23692 * config/s390/s390.c (s390_expand_vec_compare): Implement <0
23693 comparison with arithmetic right shift.
23694 (s390_expand_vcond): No need for a force_reg anymore.
23695 s390_vec_compare will do it.
23696 * config/s390/vector.md ("vec_cmp<mode><tointvec>"): Accept also
23697 immediate operands.
23699 2021-03-07 Jakub Jelinek <jakub@redhat.com>
23702 * config/i386/constraints.md (Yw): Use SSE_REGS if TARGET_SSE
23703 but TARGET_AVX512BW or TARGET_AVX512VL is not set. Adjust description
23705 * config/i386/sse.md (v_Yw): New define_mode_attr.
23706 (*<insn><mode>3, *mul<mode>3<mask_name>, *avx2_<code><mode>3,
23707 *sse4_1_<code><mode>3<mask_name>): Use <v_Yw> instead of v
23709 * config/i386/mmx.md (mmx_pshufw_1, *vec_dupv4hi): Use Yw instead of
23710 xYw in constraints.
23712 2021-03-06 Julian Brown <julian@codesourcery.com>
23714 * tree-pretty-print.c (dump_generic_node): Emit non-generic
23715 address space info for aggregates.
23717 2021-03-06 Hans-Peter Nilsson <hp@axis.com>
23719 * config/cris/cris.h (MAX_FIXED_MODE_SIZE): Don't define.
23721 2021-03-05 Jakub Jelinek <jakub@redhat.com>
23723 PR middle-end/99322
23724 * tree-cfg.c (bb_to_omp_idx): New variable.
23725 (execute_build_cfg): Release the bb_to_omp_idx vector after
23726 cleanup_tree_cfg returns.
23727 (handle_abnormal_edges): Remove bb_to_omp_idx argument, adjust
23728 for bb_to_omp_idx being a vec<int> instead of pointer to array
23730 (make_edges): Remove bb_to_omp_idx local variable, don't pass
23731 it to handle_abnormal_edges, adjust for bb_to_omp_idx being a
23732 vec<int> instead of pointer to array of ints and don't free/release
23734 (remove_bb): When removing a bb and placing forced label somewhere
23735 else, ensure it is put into the same OpenMP region during cfg
23736 pass if possible or to entry successor as fallback. Unregister
23737 bb from bb_to_omp_idx.
23739 2021-03-05 Vladimir N. Makarov <vmakarov@redhat.com>
23742 * lra-constraints.c (process_address_1): Skip decomposing address
23743 for asm insn operand with unknown constraint.
23745 2021-03-05 Martin Jambor <mjambor@suse.cz>
23748 * cgraph.c (cgraph_edge::set_call_stmt): Do not update all
23749 corresponding speculative edges if we are about to resolve
23750 sepculation. Make edge direct (and so resolve speculations) before
23751 removing it from call_site_hash.
23752 (cgraph_edge::make_direct): Relax the initial assert to allow calling
23753 the function on speculative direct edges.
23755 2021-03-05 Eric Botcazou <ebotcazou@adacore.com>
23757 PR rtl-optimization/99376
23758 * rtlanal.c (nonzero_bits1) <arithmetic operators>: If the number
23759 of low-order zero bits is too large, set the result to 0 directly.
23761 2021-03-04 Jakub Jelinek <jakub@redhat.com>
23763 PR middle-end/93235
23764 * expmed.c (store_bit_field_using_insv): Return false of xop0 is a
23765 SUBREG and a SUBREG to op_mode can't be created.
23767 2021-03-04 Alex Coplan <alex.coplan@arm.com>
23770 * config/aarch64/aarch64-sve-builtins.cc
23771 (function_resolver::require_vector_type): Handle error_mark_node.
23773 2021-03-04 Ilya Leoshkevich <iii@linux.ibm.com>
23775 * cfgexpand.c (expand_asm_loc): Pass new parameter.
23776 (expand_asm_stmt): Likewise.
23777 * config/arm/aarch-common-protos.h (arm_md_asm_adjust): Add new
23779 * config/arm/aarch-common.c (arm_md_asm_adjust): Likewise.
23780 * config/arm/arm.c (thumb1_md_asm_adjust): Likewise.
23781 * config/cris/cris.c (cris_md_asm_adjust): Likewise.
23782 * config/i386/i386.c (ix86_md_asm_adjust): Likewise.
23783 * config/mn10300/mn10300.c (mn10300_md_asm_adjust): Likewise.
23784 * config/nds32/nds32.c (nds32_md_asm_adjust): Likewise.
23785 * config/pdp11/pdp11.c (pdp11_md_asm_adjust): Likewise.
23786 * config/rs6000/rs6000.c (rs6000_md_asm_adjust): Likewise.
23787 * config/vax/vax.c (vax_md_asm_adjust): Likewise.
23788 * config/visium/visium.c (visium_md_asm_adjust): Likewise.
23789 * doc/tm.texi (md_asm_adjust): Likewise.
23790 * target.def (md_asm_adjust): Likewise.
23792 2021-03-04 Richard Biener <rguenther@suse.de>
23794 PR middle-end/97855
23795 * tree-pretty-print.c: Poison pp_printf.
23796 (dump_decl_name): Avoid use of pp_printf.
23797 (dump_block_node): Likewise.
23798 (dump_generic_node): Likewise.
23800 2021-03-04 Martin Sebor <msebor@redhat.com>
23802 PR middle-end/96963
23803 PR middle-end/94655
23804 * builtins.c (handle_array_ref): New helper.
23805 (handle_mem_ref): New helper.
23806 (compute_objsize_r): Factor out ARRAY_REF and MEM_REF handling
23807 into new helper functions. Correct a workaround for vectorized
23810 2021-03-03 Pat Haugen <pthaugen@linux.ibm.com>
23812 * config/rs6000/dfp.md (extendddtd2, trunctddd2, *cmp<mode>_internal1,
23813 floatditd2, ftrunc<mode>2, fix<mode>di2, dfp_ddedpd_<mode>,
23814 dfp_denbcd_<mode>, dfp_dxex_<mode>, dfp_diex_<mode>,
23815 *dfp_sgnfcnc_<mode>, dfp_dscli_<mode>, dfp_dscri_<mode>): Update size
23816 attribute for Power10.
23817 * config/rs6000/mma.md (*movoo): Likewise.
23818 * config/rs6000/rs6000.md (define_attr "size"): Add 256.
23819 (define_mode_attr bits): Add DD/TD modes.
23820 * config/rs6000/sync.md (load_quadpti, store_quadpti, load_lockedpti,
23821 store_conditionalpti): Update size attribute for Power10.
23823 2021-03-03 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
23826 * config/sparc/t-sparc (tree-ssanames.o-warn): Don't error for
23827 -Wuninitialized, -Wmaybe-uninitialized.
23828 (wide-int.o-warn): Likewise.
23830 2021-03-03 Richard Earnshaw <rearnsha@arm.com>
23832 * common/config/arm/arm-common.c: Include configargs.h.
23833 (arm_config_default): New function.
23834 (arm_target_mode): Renamed from arm_target_thumb_only. Handle
23835 processors that do not support Thumb. Take into account the
23836 --with-mode configuration setting for selecting the default.
23837 * config/arm/arm.h (OPTION_DEFAULT_SPECS): Remove entry for 'mode'.
23838 (TARGET_MODE_SPEC_FUNCTIONS): Update for function name change.
23840 2021-03-03 Martin Liska <mliska@suse.cz>
23842 PR gcov-profile/97461
23843 * gcov-io.h (GCOV_PREALLOCATED_KVP): Remove.
23845 2021-03-03 Eric Botcazou <ebotcazou@adacore.com>
23848 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
23849 point back the hard frame pointer to its default location when the
23850 frame is larger than SEH_MAX_FRAME_SIZE.
23852 2021-03-03 Jakub Jelinek <jakub@redhat.com>
23855 * config/i386/predicates.md (logic_operator): New define_predicate.
23856 * config/i386/i386.md (mov + mem using comm arith peephole2):
23857 Punt if operands[1] is EXT_REX_SSE_REGNO_P, AVX512BW is not enabled
23858 and the inner mode is [QH]Imode.
23860 2021-03-03 Jakub Jelinek <jakub@redhat.com>
23863 * dwarf2out.c (dw_loc_list_struct): Add end_entry member.
23864 (new_loc_list): Clear end_entry.
23865 (output_loc_list): Only use DW_LLE_startx_length for -gsplit-dwarf
23866 if HAVE_AS_LEB128, otherwise use DW_LLE_startx_endx. Fix comment
23868 (index_location_lists): For dwarf_version >= 5 without HAVE_AS_LEB128,
23869 initialize also end_entry.
23871 2021-03-03 Jakub Jelinek <jakub@redhat.com>
23874 * cfgrtl.c (fixup_partitions): When changing some bbs from hot to cold
23875 partitions, if in non-layout mode after reorder_blocks also move
23876 affected blocks to ensure a single partition transition.
23878 2021-03-03 Jason Merrill <jason@redhat.com>
23881 * cgraphunit.c (process_function_and_variable_attributes): Don't
23882 warn about flatten on an alias if the target also has it.
23883 * cgraph.h (symtab_node::get_alias_target_tree): New.
23885 2021-03-02 David Edelsohn <dje.gcc@gmail.com>
23887 * config/rs6000/rs6000.md (tls_get_tpointer_internal): Prepend
23888 period to symbol name.
23889 (tls_get_addr_internal<mode>): Same.
23891 2021-03-02 David Malcolm <dmalcolm@redhat.com>
23894 * diagnostic-show-locus.c
23895 (selftest::test_one_liner_many_fixits_2): Fix accidental usage of
23898 2021-03-02 Martin Sebor <msebor@redhat.com>
23900 PR middle-end/99276
23901 * builtins.c (warn_for_access): Remove stray warning text.
23903 2021-03-02 Martin Sebor <msebor@redhat.com>
23905 PR middle-end/99295
23906 * doc/extend.texi (attribute malloc): Reword and clarify nonaliasing
23909 2021-03-02 Jakub Jelinek <jakub@redhat.com>
23912 * dwarf2out.c (output_macinfo_op): Use DW_MACRO_*_str* even with
23913 -gdwarf-5 -gstrict-dwarf. For -gsplit-dwarf -gdwarf-5 use
23914 DW_MACRO_*_strx instead of DW_MACRO_*_strp. Handle
23915 DW_MACRO_define_strx and DW_MACRO_undef_strx.
23916 (save_macinfo_strings): Use DW_MACRO_*_str* even with
23917 -gdwarf-5 -gstrict-dwarf. Handle DW_MACRO_define_strx and
23918 DW_MACRO_undef_strx.
23920 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
23922 * config/s390/s390-builtin-types.def (BT_FN_V4SF_V8HI_UINT): New
23924 (BT_FN_V8HI_V8HI_UINT): Likewise.
23925 (BT_FN_V8HI_V4SF_V4SF_UINT): Likewise.
23926 * config/s390/s390-builtins.def (B_NNPA): New macro definition.
23927 (s390_vclfnhs, s390_vclfnls, s390_vcrnfs, s390_vcfn, s390_vcnf):
23928 New builtin definitions.
23929 * config/s390/s390-c.c (s390_cpu_cpp_builtins_internal): Bump
23930 vector extension version.
23931 * config/s390/s390.c (s390_expand_builtin): Check if builtins are
23932 available with current -march level.
23933 * config/s390/s390.md (UNSPEC_NNPA_VCLFNHS_V8HI)
23934 (UNSPEC_NNPA_VCLFNLS_V8HI, UNSPEC_NNPA_VCRNFS_V8HI)
23935 (UNSPEC_NNPA_VCFN_V8HI, UNSPEC_NNPA_VCNF_V8HI): New constants.
23936 * config/s390/vecintrin.h (vec_extend_to_fp32_hi): New macro.
23937 (vec_extend_to_fp32_lo): Likewise.
23938 (vec_round_from_fp32): Likewise.
23939 (vec_convert_to_fp16): Likewise.
23940 (vec_convert_from_fp16): Likewise.
23941 * config/s390/vx-builtins.md (vclfnhs_v8hi): New insn pattern.
23942 (vclfnls_v8hi): Likewise.
23943 (vcrnfs_v8hi): Likewise.
23944 (vcfn_v8hi): Likewise.
23945 (vcnf_v8hi): Likewise.
23947 2021-03-02 Andreas Krebbel <krebbel@linux.ibm.com>
23949 * common/config/s390/s390-common.c (processor_flags_table): New entry.
23950 * config.gcc: Enable arch14 for --with-arch and --with-tune.
23951 * config/s390/driver-native.c (s390_host_detect_local_cpu): Pick
23952 arch14 for unknown CPU models.
23953 * config/s390/s390-opts.h (enum processor_type): Add PROCESSOR_ARCH14.
23954 * config/s390/s390.c (s390_issue_rate): Add case for PROCESSOR_ARCH14.
23955 (s390_get_sched_attrmask): Likewise.
23956 (s390_get_unit_mask): Likewise.
23957 * config/s390/s390.h (enum processor_flags): Add PF_NNPA and PF_ARCH14.
23958 (TARGET_CPU_ARCH14, TARGET_CPU_ARCH14_P, TARGET_CPU_NNPA)
23959 (TARGET_CPU_NNPA_P, TARGET_ARCH14, TARGET_ARCH14_P, TARGET_NNPA)
23960 (TARGET_NNPA_P): New macro definitions.
23961 * config/s390/s390.md ("cpu_facility", "enabled"): Add arch14 and nnpa.
23962 * config/s390/s390.opt: Add PROCESSOR_ARCH14.
23964 2021-03-02 Jakub Jelinek <jakub@redhat.com>
23966 PR middle-end/95757
23967 * tree-vrp.c (register_edge_assert_for): Remove superfluous ()s around
23968 condition. Call register_edge_assert_for_1 for == 0, != 0, == 1 and
23969 != 1 comparisons if name is lhs of a comparison.
23971 2021-03-01 Iain Sandoe <iain@sandoe.co.uk>
23975 * config/darwin-protos.h (darwin_should_restore_cfa_state): New.
23976 * config/darwin.c (darwin_should_restore_cfa_state): New.
23977 * config/darwin.h (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New.
23978 * doc/tm.texi: Regenerated.
23979 * doc/tm.texi.in: Document TARGET_ASM_SHOULD_RESTORE_CFA_STATE.
23980 * dwarf2cfi.c (connect_traces): If the target requests, restore
23981 the CFA expression after a DW_CFA_restore.
23982 * target.def (TARGET_ASM_SHOULD_RESTORE_CFA_STATE): New hook.
23984 2021-03-01 Martin Liska <mliska@suse.cz>
23987 * optc-save-gen.awk: Add 4 more exceptions.
23989 2021-03-01 Nathan Sidwell <nathan@acm.org>
23992 * tree.h (TYPE_ALIGN_RAW): New accessor.
23993 (TYPE_ALIGN): Use it.
23995 2021-03-01 Jan Hubicka <jh@suse.cz>
23998 * ipa-fnsummary.c (compute_fn_summary): Fix sanity check.
24000 2021-03-01 Eric Botcazou <ebotcazou@adacore.com>
24003 * config/i386/i386.c (ix86_compute_frame_layout): For a SEH target,
24004 point the hard frame pointer to the SSE register save area instead
24005 of the general register save area. Perform only minimal adjustment
24006 for small frames if it is initially not correctly aligned.
24007 (ix86_expand_prologue): Remove early saves for a SEH target.
24008 * config/i386/winnt.c (struct seh_frame_state): Document constraint.
24010 2021-02-28 Jakub Jelinek <jakub@redhat.com>
24013 * ipa.c (symbol_table::remove_unreachable_nodes): Fix a comment
24014 typo - referneced -> referenced.
24015 * tree.c (component_ref_size): Fix comment typo -
24016 refernce -> reference.
24017 * tree-ssa-alias.c (access_path_may_continue_p): Fix comment typo -
24018 traling -> trailing.
24019 (aliasing_component_refs_p): Fix comment typos -
24020 refernce -> reference and refernece -> reference and
24021 traling -> trailing.
24022 (nonoverlapping_refs_since_match_p): Fix comment typo -
24023 referneces -> references.
24024 * doc/invoke.texi (--param modref-max-bases): Fix a typo -
24025 referneces -> references.
24027 2021-02-27 Iain Sandoe <iain@sandoe.co.uk>
24029 * config/host-darwin.c (darwin_gt_pch_use_address): Modify
24030 diagnostic message to avoid use of a contraction and format
24033 2021-02-27 Jakub Jelinek <jakub@redhat.com>
24036 * gcse.c (gcse_or_cprop_is_too_expensive): Use %wu instead of
24037 HOST_WIDE_INT_PRINT_UNSIGNED in warning format string.
24038 * ipa-devirt.c (ipa_odr_read_section): Use %wd instead of
24039 HOST_WIDE_INT_PRINT_DEC in inform format string. Fix comment
24042 2021-02-26 Richard Biener <rguenther@suse.de>
24044 PR middle-end/99281
24045 * expr.c (store_field): For calls with return-slot optimization
24046 and addressable return type expand the store directly.
24048 2021-02-26 Richard Biener <rguenther@suse.de>
24051 * builtins.c (warn_string_no_nul): Fix diagnostic formatting.
24053 2021-02-26 Peter Bergner <bergner@linux.ibm.com>
24056 * config/rs6000/rs6000-call.c (rs6000_init_builtins): Replace assert
24059 2021-02-26 Aaron Sawdey <acsawdey@linux.ibm.com>
24061 * config.gcc: Add rs6000-pcrel-opt.o.
24062 * config/rs6000/rs6000-pcrel-opt.c: New file.
24063 * config/rs6000/pcrel-opt.md: New file.
24064 * config/rs6000/predicates.md: Add d_form_memory predicate.
24065 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_PCREL_OPT.
24066 * config/rs6000/rs6000-passes.def: Add pass_pcrel_opt.
24067 * config/rs6000/rs6000-protos.h: Add reg_to_non_prefixed(),
24068 pcrel_opt_valid_mem_p(), output_pcrel_opt_reloc(),
24069 and make_pass_pcrel_opt().
24070 * config/rs6000/rs6000.c (reg_to_non_prefixed): Make global.
24071 (rs6000_option_override_internal): Add pcrel-opt.
24072 (rs6000_delegitimize_address): Support pcrel-opt.
24073 (rs6000_opt_masks): Add pcrel-opt.
24074 (pcrel_opt_valid_mem_p): New function.
24075 (reg_to_non_prefixed): Make global.
24076 (rs6000_asm_output_opcode): Reset prepend_p_to_next_insn.
24077 (output_pcrel_opt_reloc): New function.
24078 * config/rs6000/rs6000.md (loads_extern_addr): New attr.
24079 (pcrel_extern_addr): Set loads_extern_addr.
24080 Add include for pcrel-opt.md.
24081 * config/rs6000/rs6000.opt: Add -mpcrel-opt.
24082 * config/rs6000/t-rs6000: Add rules for pcrel-opt.c and
24085 2021-02-26 YunQiang Su <yunqiang.su@cipunited.com>
24088 * config/mips/mips.c (mips_expand_ext_as_unaligned_load):
24089 If TARGET_64BIT and dest is SUBREG, we check the width, if it
24090 equal to SImode, we use SImode operation, just like what we are
24093 2021-02-26 Marek Polacek <polacek@redhat.com>
24095 * builtins.c (warn_for_access): Fix typos.
24097 2021-02-25 Iain Sandoe <iain@sandoe.co.uk>
24099 * config/aarch64/aarch64.md (<optab>_rol<mode>3): Add a '#'
24100 mark in front of the immediate quantity.
24101 (<optab>_rolsi3_uxtw): Likewise.
24103 2021-02-25 Richard Earnshaw <rearnsha@arm.com>
24106 * config/arm/thumb2.md (nonsecure_call_reg_thumb2_fpcxt): New pattern.
24107 (nonsecure_call_value_reg_thumb2_fpcxt): Likewise.
24108 (nonsecure_call_reg_thumb2): Restrict to using r4 for the callee
24109 address and disable when the FPCXT is not available.
24110 (nonsecure_call_value_reg_thumb2): Likewise.
24112 2021-02-25 Nathan Sidwell <nathan@acm.org>
24115 * doc/invoke.texi (flang-info-module-cmi): Renamed option.
24117 2021-02-25 Tamar Christina <tamar.christina@arm.com>
24119 * tree-vect-slp.c (optimize_load_redistribution_1): Abort on NULL nodes.
24121 2021-02-25 Richard Biener <rguenther@suse.de>
24123 PR tree-optimization/99253
24124 * tree-vect-loop.c (check_reduction_path): First compute
24125 code, then verify out-of-loop uses.
24127 2021-02-25 Jakub Jelinek <jakub@redhat.com>
24130 * match.pd ((T)(A) + CST -> (T)(A + CST)): Add :s to convert.
24132 2021-02-25 Jakub Jelinek <jakub@redhat.com>
24134 PR tree-optimization/80635
24135 * tree-vrp.c (vrp_simplify_cond_using_ranges): Also handle
24136 VIEW_CONVERT_EXPR if modes are the same, innerop is integral and
24137 has mode precision.
24139 2021-02-25 Richard Biener <rguenther@suse.de>
24141 * tree-vect-slp.c (optimize_load_redistribution_1): Delay
24142 load_map population.
24143 (vect_match_slp_patterns_2): Revert part of last change.
24144 (vect_analyze_slp): Do not interleave optimize_load_redistribution
24145 with pattern detection but do it afterwards. Dump the
24146 whole SLP graph after pattern recognition and load
24147 redistribution optimization finished.
24149 2021-02-24 Jakub Jelinek <jakub@redhat.com>
24152 * omp-low.c (struct omp_context): Add teams_nested_p and
24153 nonteams_nested_p members.
24154 (scan_omp_target): Diagnose teams nested inside of target with other
24155 directives strictly nested inside of the same target.
24156 (check_omp_nesting_restrictions): Set ctx->teams_nested_p or
24157 ctx->nonteams_nested_p as needed.
24159 2021-02-24 Vladimir N. Makarov <vmakarov@redhat.com>
24161 PR inline-asm/99123
24162 * lra-constraints.c (uses_hard_regs_p): Don't use decompose_mem_address.
24164 2021-02-24 Hans-Peter Nilsson <hp@axis.com>
24166 * config/cris/cris.c (cris_expand_prologue): Set
24167 current_function_static_stack_size, if flag_stack_usage_info.
24169 2021-02-24 Pat Haugen <pthaugen@linux.ibm.com>
24171 * config/rs6000/rs6000.c (next_insn_prefixed_p): Rename.
24172 (rs6000_final_prescan_insn): Adjust.
24173 (rs6000_asm_output_opcode): Likewise.
24175 2021-02-24 Martin Sebor <msebor@redhat.com>
24177 PR middle-end/97172
24178 * attribs.c (attr_access::free_lang_data): Clear attribute arg spec
24179 from function arguments.
24181 2021-02-24 Tamar Christina <tamar.christina@arm.com>
24183 PR tree-optimization/99220
24184 * tree-vect-slp.c (optimize_load_redistribution_1): Remove
24185 node from cache when it's about to be deleted.
24187 2021-02-24 Jakub Jelinek <jakub@redhat.com>
24189 PR tree-optimization/99225
24190 * fold-const.c (fold_binary_loc) <case NE_EXPR>: In (x & (1 << y)) != 0
24191 to ((x >> y) & 1) != 0 simplifications use build_one_cst instead of
24192 build_int_cst (..., 1). Formatting fixes.
24194 2021-02-24 Tamar Christina <tamar.christina@arm.com>
24196 PR tree-optimization/99149
24197 * tree-vect-slp-patterns.c (vect_detect_pair_op): Don't recreate the
24199 (vect_slp_reset_pattern): Remove.
24200 (complex_fma_pattern::matches): Remove call to vect_slp_reset_pattern.
24201 (complex_mul_pattern::build, complex_fma_pattern::build,
24202 complex_fms_pattern::build): Fix ref counts.
24203 * tree-vect-slp.c (vect_free_slp_tree): Undo SLP only pattern relevancy
24204 when node is being deleted.
24205 (vect_match_slp_patterns_2): Correct result of cache hit on patterns.
24206 (vect_schedule_slp): Invalidate SLP_TREE_REPRESENTATIVE of removed
24208 * tree-vectorizer.c (vec_info::new_stmt_vec_info): Initialize value.
24210 2021-02-24 Matthias Klose <doko@ubuntu.com>
24213 2020-12-07 Matthias Klose <doko@ubuntu.com>
24215 * genextract.c (print_header): Undefine ENABLE_RTL_CHECKING
24216 and ENABLE_RTL_FLAG_CHECKING.
24218 2021-02-24 Richard Biener <rguenther@suse.de>
24221 * builtins.c (fold_builtin_next_arg): Avoid NULL arg.
24223 2021-02-23 Peter Bergner <bergner@linux.ibm.com>
24225 * config/rs6000/mma.md (mma_assemble_pair): Rename from this...
24226 (vsx_assemble_pair): ...to this.
24227 (*mma_assemble_pair): Rename from this...
24228 (*vsx_assemble_pair): ...to this.
24229 (mma_disassemble_pair): Rename from this...
24230 (vsx_disassemble_pair): ...to this.
24231 (*mma_disassemble_pair): Rename from this...
24232 (*vsx_disassemble_pair): ...to this.
24233 * config/rs6000/rs6000-builtin.def (BU_MMA_V2, BU_MMA_V3,
24234 BU_COMPAT): New macros.
24235 (mma_assemble_pair): Rename from this...
24236 (vsx_assemble_pair): ...to this.
24237 (mma_disassemble_pair): Rename from this...
24238 (vsx_disassemble_pair): ...to this.
24239 (mma_assemble_pair): New compatibility built-in.
24240 (mma_disassemble_pair): Likewise.
24241 * config/rs6000/rs6000-call.c (struct builtin_compatibility): New.
24242 (RS6000_BUILTIN_COMPAT): Define.
24243 (bdesc_compat): New.
24244 (mma_expand_builtin): Use VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
24245 (rs6000_gimple_fold_mma_builtin): Use MMA_BUILTIN_DISASSEMBLE_PAIR
24246 and VSX_BUILTIN_ASSEMBLE_PAIR.
24247 (rs6000_init_builtins): Register compatibility built-ins.
24248 (mma_init_builtins): Use VSX_BUILTIN_ASSEMBLE_PAIR,
24249 VSX_BUILTIN_ASSEMBLE_PAIR_INTERNAL, VSX_BUILTIN_DISASSEMBLE_PAIR and
24250 VSX_BUILTIN_DISASSEMBLE_PAIR_INTERNAL.
24251 * doc/extend.texi (__builtin_mma_assemble_pair): Rename from this...
24252 (__builtin_vsx_assemble_pair): ...to this.
24253 (__builtin_mma_disassemble_pair): Rename from this...
24254 (__builtin_vsx_disassemble_pair): ...to this.
24256 2021-02-23 Martin Liska <mliska@suse.cz>
24259 * ipa-icf.c (sem_variable::merge): Do not merge 2 variables
24260 with different alignment. That leads to an invalid red zone
24261 size allocated in runtime.
24263 2021-02-23 Jakub Jelinek <jakub@redhat.com>
24265 PR tree-optimization/99204
24266 * fold-const.c (fold_read_from_constant_string): Check that
24267 tree_fits_uhwi_p (index) rather than just that index is INTEGER_CST.
24269 2021-02-23 Segher Boessenkool <segher@kernel.crashing.org>
24270 Kewen Lin <linkw@gcc.gnu.org>
24272 * config/rs6000/rs6000.md (*rotl<mode>3_insert_3): Renamed to...
24273 (rotl<mode>3_insert_3): ...this.
24274 (plus_ior_xor): New code_iterator.
24275 (define_split for GPR rl*imi): New splitter.
24276 * config/rs6000/vsx.md (vsx_init_v4si): Use gen_rotldi3_insert_3
24277 for integer merging.
24279 2021-02-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24281 * config/aarch64/aarch64-tuning-flags.def (cse_sve_vl_constants):
24283 * config/aarch64/aarch64.md (add<mode>3): Force CONST_POLY_INT immediates
24284 into a register when the above is enabled.
24285 * config/aarch64/aarch64.c (neoversev1_tunings):
24286 AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
24287 (aarch64_rtx_costs): Use AARCH64_EXTRA_TUNE_CSE_SVE_VL_CONSTANTS.
24289 2021-02-22 Hans-Peter Nilsson <hp@axis.com>
24291 * config/cris/cris.c (cris_print_operand) <'T'>: Change
24292 valid operand from is now an addi mult-value to shift-value.
24293 * config/cris/cris.md (*addi): Change expression of scaled
24294 operand from mult to ashift.
24295 * config/cris/cris.md (*addi_reload): New insn_and_split.
24297 2021-02-22 John David Anglin <danglin@gcc.gnu.org>
24300 * config/pa/pa.c (TARGET_ASM_CAN_OUTPUT_MI_THUNK): Define as
24301 hook_bool_const_tree_hwi_hwi_const_tree_true.
24302 (pa_asm_output_mi_thunk): Add support for nonzero vcall_offset.
24304 2021-02-22 Andre Vieira <andre.simoesdiasvieira@arm.com>
24306 PR rtl-optimization/98791
24307 * ira-conflicts.c (process_regs_for_copy): Don't create allocno copies
24308 for unordered modes.
24310 2021-02-22 Martin Liska <mliska@suse.cz>
24312 * tree-inline.c (inline_forbidden_p): Set
24313 inline_forbidden_reason.
24315 2021-02-22 Richard Biener <rguenther@suse.de>
24317 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Dump
24320 2021-02-22 Richard Biener <rguenther@suse.de>
24322 PR tree-optimization/99165
24323 * gimple-ssa-store-merging.c (pass_store_merging::process_store):
24324 Accumulate changed to ret.
24326 2021-02-21 Uros Bizjak <ubizjak@gmail.com>
24329 2020-12-09 Uroš Bizjak <ubizjak@gmail.com>
24331 * config/i386/i386.h (REG_ALLOC_ORDER): Remove
24333 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
24336 * config/s390/vector.md (trunctf<DFP_ALL:mode>2_vr): New
24338 (trunctf<DFP_ALL:mode>2): Likewise.
24339 (trunctdtf2_vr): Likewise.
24340 (trunctdtf2): Likewise.
24341 (extend<DFP_ALL:mode>tf2_vr): Likewise.
24342 (extend<DFP_ALL:mode>tf2): Likewise.
24343 (extendtftd2_vr): Likewise.
24344 (extendtftd2): Likewise.
24346 2021-02-20 Ilya Leoshkevich <iii@linux.ibm.com>
24348 * config/s390/vector.md (*fprx2_to_tf): Rename to fprx2_to_tf,
24349 add memory alternative.
24350 (tf_to_fprx2): New pattern.
24352 2021-02-19 Martin Sebor <msebor@redhat.com>
24355 * attribs.c (init_attr_rdwr_indices): Guard vblist use.
24356 (attr_access::free_lang_data): Remove a spurious test.
24358 2021-02-19 Nathan Sidwell <nathan@acm.org>
24360 * doc/invoke.texi (flang-info-module-read): Document.
24362 2021-02-19 Martin Liska <mliska@suse.cz>
24364 PR translation/99167
24365 * params.opt: Fix typo.
24367 2021-02-19 Richard Biener <rguenther@suse.de>
24369 PR middle-end/99122
24370 * tree-inline.c (inline_forbidden_p): Do not inline functions
24371 with VLA arguments or return value.
24373 2021-02-19 Jakub Jelinek <jakub@redhat.com>
24376 * config/arm/arm.md (*stack_protect_combined_set_insn,
24377 *stack_protect_combined_test_insn): If force_const_mem result
24378 is not valid general operand, force its address into the destination
24381 2021-02-19 Jakub Jelinek <jakub@redhat.com>
24384 * tree-cfg.c (gimple_merge_blocks): If bb a starts with eh landing
24385 pad or non-local label, put FORCED_LABELs from bb b after that label
24386 rather than before it.
24388 2021-02-19 Andre Vieira <andre.simoesdiasvieira@arm.com>
24391 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3): Use
24392 expand_vector_broadcast' to emit the vec_duplicate operand.
24394 2021-02-18 Vladimir N. Makarov <vmakarov@redhat.com>
24396 PR rtl-optimization/96264
24397 * lra-remat.c (reg_overlap_for_remat_p): Check also output insn
24400 2021-02-18 H.J. Lu <hjl.tools@gmail.com>
24403 * varasm.c (get_section): Replace SUPPORTS_SHF_GNU_RETAIN with
24404 looking up the retain attribute.
24405 (resolve_unique_section): Likewise.
24406 (get_variable_section): Likewise.
24407 (switch_to_section): Likewise. Warn when a symbol without the
24408 retain attribute and a symbol with the retain attribute are
24409 placed in the section with the same name, instead of the used
24411 * doc/extend.texi: Document the "retain" attribute.
24413 2021-02-18 Nathan Sidwell <nathan@acm.org>
24416 * doc/invoke.texi (flang-info-include-translate): Document header
24419 2021-02-18 Richard Biener <rguenther@suse.de>
24421 PR middle-end/99122
24422 * ipa-fnsummary.c (analyze_function_body): Set
24423 CIF_FUNCTION_NOT_INLINABLE for VLA parameter calls.
24424 * tree-inline.c (insert_init_debug_bind): Pass NULL for
24425 error_mark_node values.
24426 (force_value_to_type): Do not build V_C_Es for WITH_SIZE_EXPR
24428 (setup_one_parameter): Delay force_value_to_type until when
24431 2021-02-18 Hans-Peter Nilsson <hp@axis.com>
24433 PR tree-optimization/99142
24434 * match.pd (clz cmp 0): Gate replacement on single_use of clz result.
24436 2021-02-18 Jakub Jelinek <jakub@redhat.com>
24438 * wide-int-bitmask.h (wide_int_bitmask::wide_int_bitmask (),
24439 wide_int_bitmask::wide_int_bitmask (uint64_t),
24440 wide_int_bitmask::wide_int_bitmask (uint64_t, uint64_t),
24441 wide_int_bitmask::operator ~ () const,
24442 wide_int_bitmask::operator | (wide_int_bitmask) const,
24443 wide_int_bitmask::operator & (wide_int_bitmask) const): Use constexpr
24445 * config/i386/i386.h (PTA_3DNOW, PTA_3DNOW_A, PTA_64BIT, PTA_ABM,
24446 PTA_AES, PTA_AVX, PTA_BMI, PTA_CX16, PTA_F16C, PTA_FMA, PTA_FMA4,
24447 PTA_FSGSBASE, PTA_LWP, PTA_LZCNT, PTA_MMX, PTA_MOVBE, PTA_NO_SAHF,
24448 PTA_PCLMUL, PTA_POPCNT, PTA_PREFETCH_SSE, PTA_RDRND, PTA_SSE, PTA_SSE2,
24449 PTA_SSE3, PTA_SSE4_1, PTA_SSE4_2, PTA_SSE4A, PTA_SSSE3, PTA_TBM,
24450 PTA_XOP, PTA_AVX2, PTA_BMI2, PTA_RTM, PTA_HLE, PTA_PRFCHW, PTA_RDSEED,
24451 PTA_ADX, PTA_FXSR, PTA_XSAVE, PTA_XSAVEOPT, PTA_AVX512F, PTA_AVX512ER,
24452 PTA_AVX512PF, PTA_AVX512CD, PTA_NO_TUNE, PTA_SHA, PTA_PREFETCHWT1,
24453 PTA_CLFLUSHOPT, PTA_XSAVEC, PTA_XSAVES, PTA_AVX512DQ, PTA_AVX512BW,
24454 PTA_AVX512VL, PTA_AVX512IFMA, PTA_AVX512VBMI, PTA_CLWB, PTA_MWAITX,
24455 PTA_CLZERO, PTA_NO_80387, PTA_PKU, PTA_AVX5124VNNIW, PTA_AVX5124FMAPS,
24456 PTA_AVX512VPOPCNTDQ, PTA_SGX, PTA_AVX512VNNI, PTA_GFNI, PTA_VAES,
24457 PTA_AVX512VBMI2, PTA_VPCLMULQDQ, PTA_AVX512BITALG, PTA_RDPID,
24458 PTA_PCONFIG, PTA_WBNOINVD, PTA_AVX512VP2INTERSECT, PTA_PTWRITE,
24459 PTA_AVX512BF16, PTA_WAITPKG, PTA_MOVDIRI, PTA_MOVDIR64B, PTA_ENQCMD,
24460 PTA_CLDEMOTE, PTA_SERIALIZE, PTA_TSXLDTRK, PTA_AMX_TILE, PTA_AMX_INT8,
24461 PTA_AMX_BF16, PTA_UINTR, PTA_HRESET, PTA_KL, PTA_WIDEKL, PTA_AVXVNNI,
24462 PTA_X86_64_BASELINE, PTA_X86_64_V2, PTA_X86_64_V3, PTA_X86_64_V4,
24463 PTA_CORE2, PTA_NEHALEM, PTA_WESTMERE, PTA_SANDYBRIDGE, PTA_IVYBRIDGE,
24464 PTA_HASWELL, PTA_BROADWELL, PTA_SKYLAKE, PTA_SKYLAKE_AVX512,
24465 PTA_CASCADELAKE, PTA_COOPERLAKE, PTA_CANNONLAKE, PTA_ICELAKE_CLIENT,
24466 PTA_ICELAKE_SERVER, PTA_TIGERLAKE, PTA_SAPPHIRERAPIDS, PTA_ALDERLAKE,
24467 PTA_KNL, PTA_BONNELL, PTA_SILVERMONT, PTA_GOLDMONT, PTA_GOLDMONT_PLUS,
24468 PTA_TREMONT, PTA_KNM): Use constexpr instead of const.
24470 2021-02-18 Jakub Jelinek <jakub@redhat.com>
24472 PR middle-end/99109
24473 * gimple-array-bounds.cc (build_zero_elt_array_type): Rename to ...
24474 (build_printable_array_type): ... this. Add nelts argument. For
24475 overaligned eltype, use TYPE_MAIN_VARIANT (eltype) instead. If
24476 nelts, call build_array_type_nelts.
24477 (array_bounds_checker::check_mem_ref): Use build_printable_array_type
24478 instead of build_zero_elt_array_type and build_array_type_nelts.
24480 2021-02-18 Jakub Jelinek <jakub@redhat.com>
24483 * config/i386/i386.c (distance_non_agu_define): Don't call
24484 extract_insn_cached here.
24485 (ix86_lea_outperforms): Save and restore recog_data around call
24486 to distance_non_agu_define and distance_agu_use.
24487 (ix86_ok_to_clobber_flags): Remove.
24488 (ix86_avoid_lea_for_add): Don't call ix86_ok_to_clobber_flags.
24489 (ix86_avoid_lea_for_addr): Likewise. Adjust function comment.
24490 * config/i386/i386.md (*lea<mode>): Change from define_insn_and_split
24491 into define_insn. Move the splitting to define_peephole2 and
24492 check there using peep2_regno_dead_p if FLAGS_REG is dead.
24494 2021-02-17 Julian Brown <julian@codesourcery.com>
24496 * gimplify.c (gimplify_scan_omp_clauses): Handle ATTACH_DETACH
24499 2021-02-17 Xi Ruoyao <xry111@mengyan1223.wang>
24502 * config/mips/mips.c (mips_symbol_insns): Do not use
24503 MSA_SUPPORTED_MODE_P if mode is MAX_MACHINE_MODE.
24505 2021-02-16 Vladimir N. Makarov <vmakarov@redhat.com>
24507 PR inline-asm/98096
24508 * stmt.c (resolve_operand_name_1): Take inout operands into account
24509 for access to labels by names.
24510 * doc/extend.texi: Describe counting operands for accessing labels.
24512 2021-02-16 Richard Biener <rguenther@suse.de>
24514 PR tree-optimization/38474
24515 * tree-ssa-structalias.c (variable_info::address_taken): New.
24516 (new_var_info): Initialize address_taken.
24517 (process_constraint): Set address_taken.
24518 (solve_constraints): Use the new address_taken flag rather
24519 than is_reg_var for sorting variables.
24520 (dump_constraint): Dump the variable number if the name
24523 2021-02-16 Jakub Jelinek <jakub@redhat.com>
24526 * tree-vect-stmts.c (vectorizable_simd_clone_call): For num_calls != 1
24527 multiply by 4096 and for inbranch by 8192.
24528 * config/i386/i386.c (ix86_simd_clone_usable): For TARGET_AVX512F,
24529 return 3, 2 or 1 for mangle letters 'b', 'c' or 'd'.
24531 2021-02-15 Maya Rashish <coypu@sdf.org>
24533 * config/aarch64/aarch64.c (aarch64_init_builtins):
24534 Call SUBTARGET_INIT_BUILTINS.
24536 2021-02-15 Peter Bergner <bergner@linux.ibm.com>
24538 PR rtl-optimization/98872
24539 * init-regs.c (initialize_uninitialized_regs): Skip initialization
24540 if CONST0_RTX is NULL.
24542 2021-02-15 Richard Sandiford <richard.sandiford@arm.com>
24544 PR rtl-optimization/98863
24545 * rtl-ssa/functions.h (function_info::bb_live_out_info): Delete.
24546 (function_info::build_info): Turn into a declaration, moving the
24547 definition to internals.h.
24548 (function_info::bb_walker): Declare.
24549 (function_info::create_reg_use): Likewise.
24550 (function_info::calculate_potential_phi_regs): Take a build_info
24552 (function_info::place_phis, function_info::create_ebbs): Declare.
24553 (function_info::calculate_ebb_live_in_for_debug): Likewise.
24554 (function_info::populate_backedge_phis): Delete.
24555 (function_info::start_block, function_info::end_block): Declare.
24556 (function_info::populate_phi_inputs): Delete.
24557 (function_info::m_potential_phi_regs): Move information to build_info.
24558 * rtl-ssa/internals.h: New file.
24559 (function_info::bb_phi_info): New class.
24560 (function_info::build_info): Moved from functions.h.
24561 Add a constructor and destructor.
24562 (function_info::build_info::ebb_use): Delete.
24563 (function_info::build_info::ebb_def): Likewise.
24564 (function_info::build_info::bb_live_out): Likewise.
24565 (function_info::build_info::tmp_ebb_live_in_for_debug): New variable.
24566 (function_info::build_info::potential_phi_regs): Likewise.
24567 (function_info::build_info::potential_phi_regs_for_debug): Likewise.
24568 (function_info::build_info::ebb_def_regs): Likewise.
24569 (function_info::build_info::bb_phis): Likewise.
24570 (function_info::build_info::bb_mem_live_out): Likewise.
24571 (function_info::build_info::bb_to_rpo): Likewise.
24572 (function_info::build_info::def_stack): Likewise.
24573 (function_info::build_info::old_def_stack_limit): Likewise.
24574 * rtl-ssa/internals.inl (function_info::build_info::record_reg_def):
24575 Remove the regno argument. Push the previous definition onto the
24576 definition stack where necessary.
24577 * rtl-ssa/accesses.cc: Include internals.h.
24578 * rtl-ssa/changes.cc: Likewise.
24579 * rtl-ssa/blocks.cc: Likewise.
24580 (function_info::build_info::build_info): Define.
24581 (function_info::build_info::~build_info): Likewise.
24582 (function_info::bb_walker): New class.
24583 (function_info::bb_walker::bb_walker): Define.
24584 (function_info::add_live_out_use): Convert a logarithmic-complexity
24585 test into a linear one. Allow the same definition to be passed
24587 (function_info::calculate_potential_phi_regs): Moved from
24588 functions.cc. Take a build_info parameter and store the
24589 information there instead.
24590 (function_info::place_phis): New function.
24591 (function_info::add_entry_block_defs): Update call to record_reg_def.
24592 (function_info::calculate_ebb_live_in_for_debug): New function.
24593 (function_info::add_phi_nodes): Use bb_phis to decide which
24594 registers need phi nodes and initialize ebb_def_regs accordingly.
24595 Do not add degenerate phis here.
24596 (function_info::add_artificial_accesses): Use create_reg_use.
24597 Assert that all definitions are listed in the DF LR sets.
24598 Update call to record_reg_def.
24599 (function_info::record_block_live_out): Record live-out register
24600 values in the phis of successor blocks. Use the live-out set
24601 when processing the last block in an EBB, instead of always
24602 using the live-in sets of successor blocks. AND the live sets
24603 with the set of registers that have been defined in the EBB,
24604 rather than with all potential phi registers. Cope correctly
24605 with branches back to the start of the current EBB.
24606 (function_info::start_block): New function.
24607 (function_info::end_block): Likewise.
24608 (function_info::populate_phi_inputs): Likewise.
24609 (function_info::create_ebbs): Likewise.
24610 (function_info::process_all_blocks): Rewrite into a multi-phase
24612 * rtl-ssa/functions.cc: Include internals.h.
24613 (function_info::calculate_potential_phi_regs): Move to blocks.cc.
24614 (function_info::init_function_data): Remove caller.
24615 * rtl-ssa/insns.cc: Include internals.h
24616 (function_info::create_reg_use): New function. Lazily any
24617 degenerate phis needed by the linear RPO view.
24618 (function_info::record_use): Use create_reg_use. When processing
24619 debug uses, use potential_phi_regs and test it before checking
24620 whether the register is live on entry to the current EBB. Lazily
24621 calculate ebb_live_in_for_debug.
24622 (function_info::record_call_clobbers): Update call to record_reg_def.
24623 (function_info::record_def): Likewise.
24625 2021-02-15 Martin Liska <mliska@suse.cz>
24627 * toplev.c (init_asm_output): Free output of
24628 gen_command_line_string function.
24629 (process_options): Likewise.
24631 2021-02-15 Martin Liska <mliska@suse.cz>
24633 * params.opt: Add 2 missing Param keywords.
24635 2021-02-15 Eric Botcazou <ebotcazou@adacore.com>
24637 * df-core.c (df_worklist_dataflow_doublequeue): Use proper cast.
24639 2021-02-15 Jakub Jelinek <jakub@redhat.com>
24641 PR tree-optimization/99079
24642 * match.pd (A % (pow2pcst << N) -> A & ((pow2pcst << N) - 1)): Remove
24643 useless tree_nop_conversion_p (type, TREE_TYPE (@3)) check. Instead
24644 require both type and TREE_TYPE (@1) to be integral types and either
24645 type having smaller or equal precision, or TREE_TYPE (@1) being
24646 unsigned type, or type being signed type. If TREE_TYPE (@1)
24647 doesn't have wrapping overflow, perform the subtraction of one in
24650 2021-02-14 Jan Hubicka <hubicka@ucw.cz>
24651 Richard Biener <rguether@suse.de>
24654 * ipa-reference.c (ipa_init): Only conditinally initialize
24655 reference_vars_to_consider.
24656 (propagate): Conditionally deninitialize reference_vars_to_consider.
24657 (ipa_reference_write_optimization_summary): Sanity check that
24658 reference_vars_to_consider is not allocated.
24660 2021-02-13 Levy Hsu <admin@levyhsu.com>
24663 * config/riscv/riscv-shorten-memrefs.c (pass_shorten_memrefs): Add
24664 extend parameter to get_si_mem_base_reg declaration.
24665 (get_si_mem_base_reg): Add extend parameter. Set it.
24666 (analyze): Pass extend arg to get_si_mem_base_reg.
24667 (transform): Likewise. Use it when rewriting mems.
24668 * config/riscv/riscv.c (riscv_legitimize_move): Check for subword
24669 loads and emit sign/zero extending load followed by subreg move.
24671 2021-02-13 Jim Wilson <jimw@sifive.com>
24674 * config/riscv/riscv.c (riscv_compressed_lw_address_p): Drop early
24675 exit when !reload_completed. Only perform check for compressed reg
24676 if reload_completed.
24677 (riscv_rtx_costs): In MEM case, when optimizing for size and
24678 shorten memrefs, if not compressible, then increase cost.
24680 2021-02-13 Jakub Jelinek <jakub@redhat.com>
24682 PR rtl-optimization/98439
24683 * recog.c (pass_split_before_regstack::gate): Enable even when
24684 pass_split_before_sched2 is enabled if -fselective-scheduling2 is
24687 2021-02-13 Jakub Jelinek <jakub@redhat.com>
24690 * config/i386/mmx.md (*mmx_pshufd_1): Add a combine splitter for
24691 swap of V2SImode elements in memory into DImode memory rotate by 32.
24693 2021-02-12 Martin Sebor <msebor@redhat.com>
24695 * tree-pretty-print.c (print_generic_expr_to_str): Update comment.
24697 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
24699 * rtl-ssa/accesses.cc (function_info::make_use_available): Use
24700 m_temp_obstack rather than m_obstack to allocate the temporary use.
24702 2021-02-12 Richard Sandiford <richard.sandiford@arm.com>
24704 * df-problems.c (df_lr_bb_local_compute): Treat partial definitions
24705 as read-modify operations.
24707 2021-02-12 Richard Biener <rguenther@suse.de>
24709 PR middle-end/38474
24710 * ipa-fnsummary.c (unmodified_parm_1): Only walk when
24711 fbi->aa_walk_budget is bigger than zero. Update
24712 fbi->aa_walk_budget.
24713 (param_change_prob): Likewise.
24714 * ipa-prop.c (detect_type_change_from_memory_writes):
24715 Properly account walk_aliased_vdefs.
24716 (parm_preserved_before_stmt_p): Canonicalize updates.
24717 (parm_ref_data_preserved_p): Likewise.
24718 (parm_ref_data_pass_through_p): Likewise.
24719 (determine_known_aggregate_parts): Account own alias queries.
24721 2021-02-12 Martin Liska <mliska@suse.cz>
24723 * opts-common.c (decode_cmdline_option): Release werror_arg.
24724 * opts.c (gen_producer_string): Release output of
24725 gen_command_line_string.
24727 2021-02-12 Richard Biener <rguenther@suse.de>
24729 PR tree-optimization/38474
24730 * params.opt (-param=max-store-chains-to-track=): New param.
24731 (-param=max-stores-to-track=): Likewise.
24732 * doc/invoke.texi (max-store-chains-to-track): Document.
24733 (max-stores-to-track): Likewise.
24734 * gimple-ssa-store-merging.c (pass_store_merging::m_n_chains):
24736 (pass_store_merging::m_n_stores): Likewise.
24737 (pass_store_merging::terminate_and_process_chain): Update
24738 m_n_stores and m_n_chains.
24739 (pass_store_merging::process_store): Likewise. Terminate
24740 oldest chains if the number of stores or chains get too large.
24741 (imm_store_chain_info::terminate_and_process_chain): Dump
24744 2021-02-11 Eric Botcazou <ebotcazou@adacore.com>
24746 * config/i386/winnt.c (i386_pe_seh_unwind_emit): When switching to
24747 the cold section, emit a nop before the directive if the previous
24748 active instruction can throw.
24750 2021-02-11 Peter Bergner <bergner@linux.ibm.com>
24753 * config/rs6000/predicates.md (mma_assemble_input_operand): Restrict
24754 memory addresses that are legal for quad word accesses.
24756 2021-02-11 Andrea Corallo <andrea.corallo@arm.com>
24759 * config/arm/thumb2.md (*doloop_end_internal): Generate
24760 alternative sequence to handle long range branches.
24762 2021-02-11 Joel Hutton <joel.hutton@arm.com>
24764 PR tree-optimization/98772
24765 * optabs-tree.c (supportable_half_widening_operation): New function
24766 to check for supportable V8QI->V8HI widening patterns.
24767 * optabs-tree.h (supportable_half_widening_operation): New function.
24768 * tree-vect-stmts.c (vect_create_half_widening_stmts): New function
24769 to create promotion stmts for V8QI->V8HI widening patterns.
24770 (vectorizable_conversion): Add case for V8QI->V8HI.
24772 2021-02-11 Richard Biener <rguenther@suse.de>
24774 * sparseset.h (SPARSESET_ELT_BITS): Remove.
24775 (SPARSESET_ELT_TYPE): Use unsigned int.
24776 * fwprop.c: Do not include sparseset.h.
24778 2021-02-10 Jakub Jelinek <jakub@redhat.com>
24781 * varasm.c (declare_weak): For -fsyntax-only, allow even
24782 TREE_ASM_WRITTEN function decls.
24784 2021-02-10 Jakub Jelinek <jakub@redhat.com>
24787 * config/i386/sse.md (fix<fixunssuffix>_truncv2sfv2di2,
24788 <insn>v8qiv8hi2, <insn>v8qiv8si2, <insn>v4qiv4si2, <insn>v4hiv4si2,
24789 <insn>v8qiv8di2, <insn>v4qiv4di2, <insn>v2qiv2di2, <insn>v4hiv4di2,
24790 <insn>v2hiv2di2, <insn>v2siv2di2): Force operands[1] into REG before
24791 calling simplify_gen_subreg on it.
24793 2021-02-10 Martin Liska <mliska@suse.cz>
24795 * config/nvptx/nvptx.c (nvptx_option_override): Use
24796 flag_patchable_function_entry instead of the removed
24797 function_entry_patch_area_size.
24799 2021-02-10 Martin Liska <mliska@suse.cz>
24801 PR tree-optimization/99002
24802 PR tree-optimization/99026
24803 * gimple-if-to-switch.cc (if_chain::is_beneficial): Fix memory
24804 leak when adjacent cases are merged.
24805 * tree-switch-conversion.c (switch_decision_tree::analyze_switch_statement): Use
24807 (make_pass_lower_switch): Remove trailing whitespace.
24808 * tree-switch-conversion.h (release_clusters): New.
24810 2021-02-10 Richard Biener <rguenther@suse.de>
24812 PR rtl-optimization/99054
24813 * cfgrtl.c (rtl-optimization/99054): Return an auto_vec.
24814 (fixup_partitions): Adjust.
24815 (rtl_verify_edges): Likewise.
24817 2021-02-10 Jakub Jelinek <jakub@redhat.com>
24819 PR middle-end/99007
24820 * gimplify.c (gimplify_scan_omp_clauses): For MEM_REF on reductions,
24821 temporarily disable gimplify_ctxp->into_ssa around gimplify_expr
24824 2021-02-10 Richard Biener <rguenther@suse.de>
24827 * ipa-pure-const.c (propagate_malloc): Use an auto_vec<>
24830 2021-02-10 Richard Biener <rguenther@suse.de>
24832 PR tree-optimization/99024
24833 * tree-vect-loop.c (_loop_vec_info::~_loop_vec_info): Only
24834 clear loop->aux if it is associated with the destroyed loop_vinfo.
24836 2021-02-10 Martin Liska <mliska@suse.cz>
24838 PR tree-optimization/99002
24839 * gimple-if-to-switch.cc (find_conditions): Fix memory leak
24842 2021-02-10 Martin Liska <mliska@suse.cz>
24845 * ipa-icf.c (sem_item::add_reference): Fix memory leak when
24846 a reference exists.
24848 2021-02-10 Jakub Jelinek <jakub@redhat.com>
24851 * dwarf2out.c (prune_unused_types_walk): Mark DW_TAG_variable DIEs
24852 at class scope for DWARF5+.
24854 2021-02-09 Eric Botcazou <ebotcazou@adacore.com>
24856 PR rtl-optimization/96015
24857 * reorg.c (skip_consecutive_labels): Minor comment tweaks.
24858 (relax_delay_slots): When deleting a jump to the next active
24859 instruction over a barrier, first delete the barrier if the
24860 jump is the only way to reach the target label.
24862 2021-02-09 Andre Vieira <andre.simoesdiasvieira@arm.com>
24864 * config/aarch64/aarch64-cost-tables.h: Add entries for vect.mul.
24865 * config/aarch64/aarch64.c (aarch64_rtx_mult_cost): Use vect.mul for
24866 vector multiplies and vect.alu for SSRA.
24867 * config/arm/aarch-common-protos.h (struct vector_cost_table): Define
24868 vect.mul cost field.
24869 * config/arm/aarch-cost-tables.h: Add entries for vect.mul.
24870 * config/arm/arm.c: Likewise.
24872 2021-02-09 Richard Biener <rguenther@suse.de>
24874 PR tree-optimization/98863
24875 * tree-ssa-sccvn.h (vn_avail::next_undo): Add.
24876 * tree-ssa-sccvn.c (last_pushed_avail): New global.
24877 (rpo_elim::eliminate_push_avail): Chain pushed avails.
24878 (unwind_state::avail_top): Add.
24879 (do_unwind): Rewrite unwinding of avail entries.
24880 (do_rpo_vn): Initialize last_pushed_avail and
24881 avail_top of the undo state.
24883 2021-02-09 Jakub Jelinek <jakub@redhat.com>
24885 PR middle-end/99004
24886 * calls.c (maybe_warn_rdwr_sizes): Change s0 and s1 type from
24887 const char * to char * and free those pointers after use.
24889 2021-02-09 Richard Biener <rguenther@suse.de>
24891 PR tree-optimization/99017
24892 * tree-vect-slp.c (vect_bb_vectorization_profitable_p): Allow
24893 zero vector cost entries.
24895 2021-02-08 Andre Vieira <andre.simoesdiasvieira@arm.com>
24897 PR middle-end/98974
24898 * tree-vect-stmts.c (vectorizable_condition): Remove shadow vec_num
24899 parameter in vectorizable_condition.
24901 2021-02-08 Richard Biener <rguenther@suse.de>
24904 * tree.c (walk_tree_1): Walk VECTOR_CST elements.
24906 2021-02-08 Martin Liska <mliska@suse.cz>
24909 * cfgexpand.c (pass_expand::execute): Parse per-function option
24910 flag_patchable_function_entry and use it.
24911 * common.opt: Remove function_entry_patch_area_size and
24912 function_entry_patch_area_start global variables.
24913 * opts.c (parse_and_check_patch_area): New function.
24914 (common_handle_option): Use it.
24915 * opts.h (parse_and_check_patch_area): New function.
24916 * toplev.c (process_options): Parse and use
24917 function_entry_patch_area_size.
24919 2021-02-08 Martin Sebor <msebor@redhat.com>
24921 * doc/extend.texi (attribute malloc): Correct typos.
24923 2021-02-05 Nathan Sidwell <nathan@acm.org>
24926 * gcc.c (driver::maybe_run_linker): Check for input file
24927 accessibility if not linking.
24929 2021-02-05 Richard Biener <rguenther@suse.de>
24931 PR tree-optimization/98855
24932 * tree-vectorizer.h (add_stmt_cost): New overload.
24933 * tree-vect-slp.c (li_cost_vec_cmp): New.
24934 (vect_bb_slp_scalar_cost): Cost individual loop regions
24935 separately. Account for the scalar instance root stmt.
24937 2021-02-05 Tom de Vries <tdevries@suse.de>
24940 * tree-switch-conversion.c (jump_table_cluster::emit): Add loc
24942 (bit_test_cluster::emit): Reuse location_t for newly created
24944 (switch_decision_tree::try_switch_expansion): Preserve
24946 * tree-switch-conversion.h: Change function signatures.
24948 2021-02-05 Jakub Jelinek <jakub@redhat.com>
24951 * config/i386/i386-options.c (m_NONE, m_ALL): Define.
24952 * config/i386/x86-tune.def (X86_TUNE_BRANCH_PREDICTION_HINTS,
24953 X86_TUNE_PROMOTE_QI_REGS): Use m_NONE instead of 0U.
24954 (X86_TUNE_QIMODE_MATH): Use m_ALL instead of ~0U.
24956 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24958 * config/aarch64/aarch64-simd-builtins.def (get_high): Define builtin.
24959 * config/aarch64/aarch64-simd.md (aarch64_get_high<mode>): Define.
24960 * config/aarch64/arm_neon.h (__GET_HIGH): Delete.
24961 (vget_high_f16): Reimplement using new builtin.
24962 (vget_high_f32): Likewise.
24963 (vget_high_f64): Likewise.
24964 (vget_high_p8): Likewise.
24965 (vget_high_p16): Likewise.
24966 (vget_high_p64): Likewise.
24967 (vget_high_s8): Likewise.
24968 (vget_high_s16): Likewise.
24969 (vget_high_s32): Likewise.
24970 (vget_high_s64): Likewise.
24971 (vget_high_u8): Likewise.
24972 (vget_high_u16): Likewise.
24973 (vget_high_u32): Likewise.
24974 (vget_high_u64): Likewise.
24976 2021-02-05 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
24978 * config/aarch64/aarch64-simd-builtins.def (get_low): Define builtin.
24979 * config/aarch64/aarch64-simd.md (aarch64_get_low<mode>): Define.
24980 * config/aarch64/arm_neon.h (__GET_LOW): Delete.
24981 (vget_low_f16): Reimplement using new builtin.
24982 (vget_low_f32): Likewise.
24983 (vget_low_f64): Likewise.
24984 (vget_low_p8): Likewise.
24985 (vget_low_p16): Likewise.
24986 (vget_low_p64): Likewise.
24987 (vget_low_s8): Likewise.
24988 (vget_low_s16): Likewise.
24989 (vget_low_s32): Likewise.
24990 (vget_low_s64): Likewise.
24991 (vget_low_u8): Likewise.
24992 (vget_low_u16): Likewise.
24993 (vget_low_u32): Likewise.
24994 (vget_low_u64): Likewise.
24996 2021-02-05 Kito Cheng <kito.cheng@sifive.com>
24998 * gcc.c (print_multilib_info): Check all required argument is provided
25001 2021-02-05 liuhongt <hongtao.liu@intel.com>
25004 * config/i386/i386-expand.c (ix86_expand_sse_cmp): Don't
25005 generate integer mask comparison for 128/256-bits vector when
25006 op_true/op_false is NULL_RTX or CONSTM1_RTX/CONST0_RTX. Also
25007 delete redundant !maskcmp condition.
25008 (ix86_expand_int_vec_cmp): Ditto but no redundant deletion
25010 (ix86_expand_sse_movcc): Delete definition of maskcmp, add the
25011 condition directly to if (maskcmp), add extra check for
25012 cmpmode, it should be MODE_INT.
25013 (ix86_expand_fp_vec_cmp): Pass NULL to ix86_expand_sse_cmp's
25014 parameters op_true/op_false.
25015 (ix86_use_mask_cmp_p): New.
25017 2021-02-05 liuhongt <hongtao.liu@intel.com>
25020 * config/i386/x86-tune.def (X86_TUNE_AVX256_UNALIGNED_LOAD_OPTIMAL):
25021 Remove m_GENERIC from ~list.
25022 (X86_TUNE_AVX256_UNALIGNED_STORE_OPTIMAL): Ditto.
25024 2021-02-04 David Malcolm <dmalcolm@redhat.com>
25027 * diagnostic-show-locus.c (compatible_locations_p): Require
25028 locations in the same macro map to be either both from the
25029 macro definition, or both from the macro arguments.
25031 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
25033 * config/aarch64/aarch64-simd-builtins.def: Add
25034 [su]mull_hi_lane[q] builtin generator macros.
25035 * config/aarch64/aarch64-simd.md
25036 (aarch64_<su>mull_hi_lane<mode>_insn): Define.
25037 (aarch64_<su>mull_hi_lane<mode>): Define.
25038 (aarch64_<su>mull_hi_laneq<mode>_insn): Define.
25039 (aarch64_<su>mull_hi_laneq<mode>): Define.
25040 * config/aarch64/arm_neon.h (vmull_high_lane_s16): Use RTL
25041 builtin instead of inline asm.
25042 (vmull_high_lane_s32): Likewise.
25043 (vmull_high_lane_u16): Likewise.
25044 (vmull_high_lane_u32): Likewise.
25045 (vmull_high_laneq_s16): Likewise.
25046 (vmull_high_laneq_s32): Likewise.
25047 (vmull_high_laneq_u16): Likewise.
25048 (vmull_high_laneq_u32): Liekwise.
25050 2021-02-04 Jonathan Wright <jonathan.wright@arm.com>
25052 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_hi_n
25053 builtin generator macros.
25054 * config/aarch64/aarch64-simd.md
25055 (aarch64_<su>mull_hi_n<mode>_insn): Define.
25056 (aarch64_<su>mull_hi_n<mode>): Define.
25057 * config/aarch64/arm_neon.h (vmull_high_n_s16): Use RTL builtin
25058 instead of inline asm.
25059 (vmull_high_n_s32): Likewise.
25060 (vmull_high_n_u16): Likewise.
25061 (vmull_high_n_u32): Likewise.
25063 2021-02-04 Richard Biener <rguenther@suse.de>
25065 PR tree-optimization/98855
25066 * tree-vect-loop.c (vectorizable_phi): Do not cost
25067 single-argument PHIs.
25068 * tree-vect-slp.c (vect_bb_slp_scalar_cost): Likewise.
25069 * tree-vect-stmts.c (vectorizable_bswap): Also perform
25070 costing for SLP operation.
25072 2021-02-04 Martin Liska <mliska@suse.cz>
25074 * doc/extend.texi: Mention -mprefer-vector-width in target
25077 2021-02-03 Martin Sebor <msebor@redhat.com>
25079 PR tree-optimization/98937
25080 * tree-ssa-strlen.c (strlen_dom_walker::~strlen_dom_walker): Define.
25081 Flush pointer_query cache.
25083 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
25085 * config/rs6000/genfusion.pl (gen_2logical): Add missing
25086 fixes based on patch review.
25087 * config/rs6000/fusion.md: Regenerate file.
25089 2021-02-03 Aaron Sawdey <acsawdey@linux.ibm.com>
25091 * config/rs6000/t-rs6000: Comment out auto generation of
25094 2021-02-03 Andrew Stubbs <ams@codesourcery.com>
25096 * config/gcn/gcn-opts.h (enum processor_type): Add PROCESSOR_GFX908.
25097 * config/gcn/gcn.c (gcn_omp_device_kind_arch_isa): Add gfx908.
25098 (output_file_start): Add gfx908.
25099 * config/gcn/gcn.opt (gpu_type): Add gfx908.
25100 * config/gcn/t-gcn-hsa (MULTILIB_OPTIONS): Add march=gfx908.
25101 (MULTILIB_DIRNAMES): Add gfx908.
25102 * config/gcn/mkoffload.c (EF_AMDGPU_MACH_AMDGCN_GFX908): New define.
25103 (main): Recognize gfx908.
25104 * config/gcn/t-omp-device: Add gfx908.
25106 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
25108 * config/aarch64/aarch64-simd-builtins.def: Add
25109 [su]mlsl_hi_lane[q] builtin macro generators.
25110 * config/aarch64/aarch64-simd.md
25111 (aarch64_<su>mlsl_hi_lane<mode>_insn): Define.
25112 (aarch64_<su>mlsl_hi_lane<mode>): Define.
25113 (aarch64_<su>mlsl_hi_laneq<mode>_insn): Define.
25114 (aarch64_<su>mlsl_hi_laneq<mode>): Define.
25115 * config/aarch64/arm_neon.h (vmlsl_high_lane_s16): Use RTL
25116 builtin instead of inline asm.
25117 (vmlsl_high_lane_s32): Likewise.
25118 (vmlsl_high_lane_u16): Likewise.
25119 (vmlsl_high_lane_u32): Likewise.
25120 (vmlsl_high_laneq_s16): Likewise.
25121 (vmlsl_high_laneq_s32): Likewise.
25122 (vmlsl_high_laneq_u16): Likewise.
25123 (vmlsl_high_laneq_u32): Likewise.
25124 (vmlal_high_laneq_u32): Likewise.
25126 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
25128 * config/aarch64/aarch64-simd-builtins.def: Add
25129 [su]mlal_hi_lane[q] builtin generator macros.
25130 * config/aarch64/aarch64-simd.md
25131 (aarch64_<su>mlal_hi_lane<mode>_insn): Define.
25132 (aarch64_<su>mlal_hi_lane<mode>): Define.
25133 (aarch64_<su>mlal_hi_laneq<mode>_insn): Define.
25134 (aarch64_<su>mlal_hi_laneq<mode>): Define.
25135 * config/aarch64/arm_neon.h (vmlal_high_lane_s16): Use RTL
25136 builtin instead of inline asm.
25137 (vmlal_high_lane_s32): Likewise.
25138 (vmlal_high_lane_u16): Likewise.
25139 (vmlal_high_lane_u32): Likewise.
25140 (vmlal_high_laneq_s16): Likewise.
25141 (vmlal_high_laneq_s32): Likewise.
25142 (vmlal_high_laneq_u16): Likewise.
25143 (vmlal_high_laneq_u32): Likewise.
25145 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
25147 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_hi_n
25148 builtin generator macros.
25149 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_hi_n<mode>_insn):
25151 (aarch64_<su>mlsl_hi_n<mode>): Define.
25152 * config/aarch64/arm_neon.h (vmlsl_high_n_s16): Use RTL builtin
25153 instead of inline asm.
25154 (vmlsl_high_n_s32): Likewise.
25155 (vmlsl_high_n_u16): Likewise.
25156 (vmlsl_high_n_u32): Likewise.
25158 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
25160 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_hi_n
25161 builtin generator macros.
25162 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_hi_n<mode>_insn):
25164 (aarch64_<su>mlal_hi_n<mode>): Define.
25165 * config/aarch64/arm_neon.h (vmlal_high_n_s16): Use RTL builtin
25166 instead of inline asm.
25167 (vmlal_high_n_s32): Likewise.
25168 (vmlal_high_n_u16): Likewise.
25169 (vmlal_high_n_u32): Likewise.
25171 2021-02-03 Jonathan Wright <jonathan.wright@arm.com>
25173 * config/aarch64/aarch64-simd-builtins.def: Add RTL builtin
25175 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal_hi<mode>):
25177 (aarch64_<su>mlal_hi<mode>_insn): This.
25178 (aarch64_<su>mlal_hi<mode>): Define.
25179 * config/aarch64/arm_neon.h (vmlal_high_s8): Use RTL builtin
25180 instead of inline asm.
25181 (vmlal_high_s16): Likewise.
25182 (vmlal_high_s32): Likewise.
25183 (vmlal_high_u8): Likewise.
25184 (vmlal_high_u16): Likewise.
25185 (vmlal_high_u32): Likewise.
25187 2021-02-03 Ilya Leoshkevich <iii@linux.ibm.com>
25189 * lra-spills.c (remove_pseudos): Call lra_update_insn_recog_data()
25190 after calling alter_subreg() on a (mem).
25192 2021-02-03 Martin Liska <mliska@suse.cz>
25195 * lto-streamer-out.c (produce_lto_section): Fill up missing
25197 * lto-streamer.h (struct lto_section): Add _padding field.
25199 2021-02-03 Richard Biener <rguenther@suse.de>
25201 * lto-streamer.c (lto_get_section_name): Free temporary
25203 * tree-loop-distribution.c
25204 (loop_distribution::merge_dep_scc_partitions): Free edge data.
25206 2021-02-03 Jakub Jelinek <jakub@redhat.com>
25208 PR middle-end/97487
25209 * ifcvt.c (noce_can_force_operand): New function.
25210 (noce_emit_move_insn): Use it.
25211 (noce_try_sign_mask): Likewise. Formatting fix.
25213 2021-02-03 Jakub Jelinek <jakub@redhat.com>
25215 PR middle-end/97971
25216 * lra-constraints.c (process_alt_operands): For inline asm, don't call
25217 fatal_insn, but instead return false.
25219 2021-02-03 Jakub Jelinek <jakub@redhat.com>
25221 PR tree-optimization/98287
25222 * config/i386/mmx.md (<insn><mode>3): For shifts don't enable expander
25225 2021-02-03 Tamar Christina <tamar.christina@arm.com>
25227 PR tree-optimization/98928
25228 * tree-vect-loop.c (vect_analyze_loop_2): Change
25229 STMT_VINFO_SLP_VECT_ONLY to STMT_VINFO_SLP_VECT_ONLY_PATTERN.
25230 * tree-vect-slp-patterns.c (complex_pattern::build): Likewise.
25231 * tree-vectorizer.h (STMT_VINFO_SLP_VECT_ONLY_PATTERN): New.
25232 (class _stmt_vec_info): Add slp_vect_pattern_only_p.
25234 2021-02-02 Richard Biener <rguenther@suse.de>
25236 * gimple-loop-interchange.cc (prepare_data_references):
25238 * gimple-loop-jam.c (tree_loop_unroll_and_jam): Likewise.
25239 * tree-ssa-loop-im.c (hoist_memory_references): Likewise.
25240 * tree-vect-stmts.c (vectorizable_condition): Do not
25242 (vectorizable_comparison): Likewise.
25244 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25246 * config/aarch64/aarch64-simd-builtins.def (ursqrte): Define builtin.
25247 * config/aarch64/aarch64-simd.md (aarch64_ursqrte<mode>): New pattern.
25248 * config/aarch64/arm_neon.h (vrsqrte_u32): Reimplement using builtin.
25249 (vrsqrteq_u32): Likewise.
25251 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25253 * config/aarch64/aarch64-simd-builtins.def (sqxtun2): Define builtin.
25254 * config/aarch64/aarch64-simd.md (aarch64_sqxtun2<mode>_le): Define.
25255 (aarch64_sqxtun2<mode>_be): Likewise.
25256 (aarch64_sqxtun2<mode>): Likewise.
25257 * config/aarch64/arm_neon.h (vqmovun_high_s16): Reimplement using builtin.
25258 (vqmovun_high_s32): Likewise.
25259 (vqmovun_high_s64): Likewise.
25260 * config/aarch64/iterators.md (UNSPEC_SQXTUN2): Define.
25262 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25264 * config/aarch64/aarch64-simd-builtins.def (bfdot_lane, bfdot_laneq): Use
25266 (bfmlalb_lane, bfmlalt_lane, bfmlalb_lane_q, bfmlalt_lane_q): Use FP flags.
25268 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25270 * config/aarch64/aarch64-simd-builtins.def (fcmla_lane0, fcmla_lane90,
25271 fcmla_lane180, fcmla_lane270, fcmlaq_lane0, fcmlaq_lane90, fcmlaq_lane180,
25272 fcmlaq_lane270, scvtf, ucvtf, fcvtzs, fcvtzu, scvtfsi, scvtfdi, ucvtfsi,
25273 ucvtfdi, fcvtzshf, fcvtzuhf, fmlal_lane_low, fmlsl_lane_low,
25274 fmlal_laneq_low, fmlsl_laneq_low, fmlalq_lane_low, fmlslq_lane_low,
25275 fmlalq_laneq_low, fmlslq_laneq_low, fmlal_lane_high, fmlsl_lane_high,
25276 fmlal_laneq_high, fmlsl_laneq_high, fmlalq_lane_high, fmlslq_lane_high,
25277 fmlalq_laneq_high, fmlslq_laneq_high): Use FP flags.
25279 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25281 * config/aarch64/aarch64-builtins.c (FLAG_LOAD): Define.
25282 * config/aarch64/aarch64-simd-builtins.def (ld1x2, ld2, ld3, ld4, ld2r,
25283 ld3r, ld4r, ld1, ld1x3, ld1x4): Use LOAD flags.
25285 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25287 * config/aarch64/aarch64-simd-builtins.def (combine, zip1, zip2,
25288 uzp1, uzp2, trn1, trn2, simd_bsl): Use AUTO_FP flags.
25290 2021-02-02 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25292 * config/aarch64/aarch64-simd-builtins.def (clrsb, clz, ctz, popcount,
25293 vec_smult_lane_, vec_smlal_lane_, vec_smult_laneq_, vec_smlal_laneq_,
25294 vec_umult_lane_, vec_umlal_lane_, vec_umult_laneq_, vec_umlal_laneq_,
25295 ashl, sshl, ushl, srshl, urshl, sdot_lane, udot_lane, sdot_laneq,
25296 udot_laneq, usdot_lane, usdot_laneq, sudot_lane, sudot_laneq, ashr,
25297 ashr_simd, lshr, lshr_simd, srshr_n, urshr_n, ssra_n, usra_n, srsra_n,
25298 ursra_n, sshll_n, ushll_n, sshll2_n, ushll2_n, ssri_n, usri_n, ssli_n,
25299 ssli_n, usli_n, bswap, rbit, simd_bsl, eor3q, rax1q, xarq, bcaxq): Use
25300 NONE builtin flags.
25302 2021-02-02 Jakub Jelinek <jakub@redhat.com>
25304 PR tree-optimization/98848
25305 * tree-vect-patterns.c (vect_recog_over_widening_pattern): Punt if
25306 STMT_VINFO_DEF_TYPE (last_stmt_info) is vect_reduction_def.
25308 2021-02-02 Kito Cheng <kito.cheng@sifive.com>
25311 * expr.c: Check mode before calling store_expr.
25313 2021-02-02 Christophe Lyon <christophe.lyon@linaro.org>
25315 * config/arm/iterators.md (supf): Remove VORNQ_S and VORNQ_U.
25317 * config/arm/mve.md (mve_vornq_s<mode>): New entry for vorn
25318 instruction using expression ior.
25319 (mve_vornq_u<mode>): New expander.
25320 (mve_vornq_f<mode>): Use ior code instead of unspec.
25321 * config/arm/unspecs.md (VORNQ_S, VORNQ_U, VORNQ_F): Remove.
25323 2021-02-02 Alexandre Oliva <oliva@adacore.com>
25325 * tree-nested.c (convert_nonlocal_reference_op): Move
25326 current_function_decl restore after re-gimplification.
25327 (convert_local_reference_op): Likewise.
25329 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25331 * config/aarch64/aarch64-simd-builtins.def (rshrn, rshrn2):
25333 * config/aarch64/aarch64-simd.md (aarch64_rshrn<mode>_insn_le):
25335 (aarch64_rshrn<mode>_insn_be): Likewise.
25336 (aarch64_rshrn<mode>): Likewise.
25337 (aarch64_rshrn2<mode>_insn_le): Likewise.
25338 (aarch64_rshrn2<mode>_insn_be): Likewise.
25339 (aarch64_rshrn2<mode>): Likewise.
25340 * config/aarch64/aarch64.md (unspec): Add UNSPEC_RSHRN.
25341 * config/aarch64/arm_neon.h (vrshrn_high_n_s16): Reimplement
25343 (vrshrn_high_n_s32): Likewise.
25344 (vrshrn_high_n_s64): Likewise.
25345 (vrshrn_high_n_u16): Likewise.
25346 (vrshrn_high_n_u32): Likewise.
25347 (vrshrn_high_n_u64): Likewise.
25348 (vrshrn_n_s16): Likewise.
25349 (vrshrn_n_s32): Likewise.
25350 (vrshrn_n_s64): Likewise.
25351 (vrshrn_n_u16): Likewise.
25352 (vrshrn_n_u32): Likewise.
25353 (vrshrn_n_u64): Likewise.
25355 2021-02-01 Sergei Trofimovich <siarheit@google.com>
25357 PR tree-optimization/98499
25358 * ipa-modref.c (analyze_ssa_name_flags): treat RVO
25359 conservatively and assume all possible side-effects.
25361 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25363 * config/aarch64/aarch64-simd-builtins.def (vec_unpacks_hi,
25364 vec_unpacku_hi_): Define builtins.
25365 * config/aarch64/arm_neon.h (vmovl_high_s8): Reimplement using
25367 (vmovl_high_s16): Likewise.
25368 (vmovl_high_s32): Likewise.
25369 (vmovl_high_u8): Likewise.
25370 (vmovl_high_u16): Likewise.
25371 (vmovl_high_u32): Likewise.
25373 2021-02-01 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25375 * config/aarch64/aarch64-simd-builtins.def (sabdl, uabdl):
25377 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl<mode>): New
25379 * config/aarch64/aarch64.md (unspec): Define UNSPEC_SABDL,
25381 * config/aarch64/arm_neon.h (vabdl_s8): Reimplemet using
25383 (vabdl_s16): Likewise.
25384 (vabdl_s32): Likewise.
25385 (vabdl_u8): Likewise.
25386 (vabdl_u16): Likewise.
25387 (vabdl_u32): Likewise.
25388 * config/aarch64/iterators.md (ABDL): New int iterator.
25389 (sur): Handle UNSPEC_SABDL, UNSPEC_UABDL.
25391 2021-02-01 Martin Sebor <msebor@redhat.com>
25393 * tree.h (BLOCK_VARS): Add comment.
25394 (BLOCK_SUBBLOCKS): Same.
25395 (BLOCK_SUPERCONTEXT): Same.
25396 (BLOCK_ABSTRACT_ORIGIN): Same.
25397 (inlined_function_outer_scope_p): Same.
25399 2021-02-01 Martin Sebor <msebor@redhat.com>
25401 PR middle-end/97172
25402 * attribs.c (attr_access::free_lang_data): Define new function.
25403 * attribs.h (attr_access::free_lang_data): Declare new function.
25405 2021-02-01 Richard Biener <rguenther@suse.de>
25407 * vec.h (auto_vec::auto_vec): Add memory stat parameters
25409 * bitmap.h (auto_bitmap::auto_bitmap): Likewise.
25411 2021-02-01 Tamar Christina <tamar.christina@arm.com>
25413 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>,
25414 aarch64_<su>mlsl<mode>, aarch64_<su>mlsl_n<mode>): Flip mult operands.
25416 2021-02-01 Richard Biener <rguenther@suse.de>
25418 PR rtl-optimization/98863
25419 * config/i386/i386-features.c (convert_scalars_to_vector):
25420 Set DF_RD_PRUNE_DEAD_DEFS.
25422 2021-01-31 Eric Botcazou <ebotcazou@adacore.com>
25424 * system.h (SIZE_MAX): Define if not already defined.
25426 2021-01-30 Aaron Sawdey <acsawdey@linux.ibm.com>
25428 * config/rs6000/genfusion.pl (gen_2logical): New function to
25429 generate patterns for logical-logical fusion.
25430 * config/rs6000/fusion.md: Regenerated patterns.
25431 * config/rs6000/rs6000-cpus.def: Add
25432 OPTION_MASK_P10_FUSION_2LOGICAL.
25433 * config/rs6000/rs6000.c (rs6000_option_override_internal):
25434 Enable logical-logical fusion for p10.
25435 * config/rs6000/rs6000.opt: Add -mpower10-fusion-2logical.
25437 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
25439 * config/rs6000/rs6000.opt: Add periods to new AIX options.
25441 2021-01-30 David Edelsohn <dje.gcc@gmail.com>
25443 * config/rs6000/rs6000.opt (mabi=vec-extabi): New.
25444 (mabi=vec-default): New.
25445 * config/rs6000/rs6000-c.c (rs6000_target_modify_macros): Define
25446 __EXTABI__ for AIX Vector extended ABI.
25447 * config/rs6000/rs6000.c (rs6000_debug_reg_global): Print AIX Vector
25449 (conditional_register_usage): If AIX vec_extabi enabled, vs20-vs31
25451 * doc/invoke.texi (PowerPC mabi): Add AIX vec-extabi and vec-default.
25453 2021-01-30 Jakub Jelinek <jakub@redhat.com>
25455 * config/i386/i386-features.c (remove_partial_avx_dependency): Clear
25456 DF_DEFER_INSN_RESCAN after calling df_process_deferred_rescans.
25458 2021-01-29 Vladimir N. Makarov <vmakarov@redhat.com>
25461 * lra-constraints.c (in_class_p): Don't narrow class only for REG
25464 2021-01-29 Will Schmidt <will_schmidt@vnet.ibm.com>
25466 * config/rs6000/rs6000-call.c (rs6000_expand_binup_builtin): Add
25467 clauses for CODE_FOR_vsx_xvcvuxddp_scale and
25468 CODE_FOR_vsx_xvcvsxddp_scale to the parameter checking code.
25470 2021-01-29 Andrew MacLeod <amacleod@redhat.com>
25472 PR tree-optimization/98866
25473 * gimple-range-gori.h (gori_compute:set_range_invariant): New.
25474 * gimple-range-gori.cc (gori_map::set_range_invariant): New.
25475 (gori_map::m_maybe_invariant): Rename from all_outgoing.
25476 (gori_map::gori_map): Rename all_outgoing to m_maybe_invariant.
25477 (gori_map::is_export_p): Ditto.
25478 (gori_map::calculate_gori): Ditto.
25479 (gori_compute::set_range_invariant): New.
25480 * gimple-range.cc (gimple_ranger::range_of_stmt): Set range
25481 invariant for pointers evaluating to [1, +INF].
25483 2021-01-29 Richard Biener <rguenther@suse.de>
25485 PR rtl-optimization/98863
25486 * config/i386/i386-features.c (remove_partial_avx_dependency):
25487 Do not perform DF analysis.
25488 (pass_data_remove_partial_avx_dependency): Remove
25491 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
25493 * config/aarch64/aarch64-simd-builtins.def: Add [su]mull_n
25494 builtin generator macros.
25495 * config/aarch64/aarch64-simd.md (aarch64_<su>mull_n<mode>):
25497 * config/aarch64/arm_neon.h (vmull_n_s16): Use RTL builtin
25498 instead of inline asm.
25499 (vmull_n_s32): Likewise.
25500 (vmull_n_u16): Likewise.
25501 (vmull_n_u32): Likewise.
25503 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25505 * config/aarch64/aarch64-simd-builtins.def (sabdl2, uabdl2):
25507 * config/aarch64/aarch64-simd.md (aarch64_<sur>abdl2<mode>_3):
25509 (aarch64_<sur>abdl2<mode>): ... This.
25510 (<sur>sadv16qi): Adjust use of above.
25511 * config/aarch64/arm_neon.h (vabdl_high_s8): Reimplement using
25513 (vabdl_high_s16): Likewise.
25514 (vabdl_high_s32): Likewise.
25515 (vabdl_high_u8): Likewise.
25516 (vabdl_high_u16): Likewise.
25517 (vabdl_high_u32): Likewise.
25519 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25521 * config/aarch64/aarch64-simd-builtins.def (sabal2): Define
25523 (uabal2): Likewise.
25524 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal2<mode>): New
25526 * config/aarch64/aarch64.md (unspec): Add UNSPEC_SABAL2 and
25528 * config/aarch64/arm_neon.h (vabal_high_s8): Reimplement using
25530 (vabal_high_s16): Likewise.
25531 (vabal_high_s32): Likewise.
25532 (vabal_high_u8): Likewise.
25533 (vabal_high_u16): Likewise.
25534 (vabal_high_u32): Likewise.
25535 * config/aarch64/iterators.md (ABAL2): New mode iterator.
25536 (sur): Handle UNSPEC_SABAL2, UNSPEC_UABAL2.
25538 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25540 * config/aarch64/aarch64-simd-builtins.def (sabal): Define
25543 * config/aarch64/aarch64-simd.md (aarch64_<sur>abal<mode>_4):
25545 (aarch64_<sur>abal<mode>): ... This
25546 (<sur>sadv16qi): Adust use of the above.
25547 * config/aarch64/arm_neon.h (vabal_s8): Reimplement using
25549 (vabal_s16): Likewise.
25550 (vabal_s32): Likewise.
25551 (vabal_u8): Likewise.
25552 (vabal_u16): Likewise.
25553 (vabal_u32): Likewise.
25555 2021-01-29 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25557 * config/aarch64/aarch64-simd-builtins.def (saddlv, uaddlv):
25559 * config/aarch64/aarch64-simd.md (aarch64_<su>addlv<mode>):
25561 * config/aarch64/arm_neon.h (vaddlv_s8): Reimplement using
25563 (vaddlv_s16): Likewise.
25564 (vaddlv_u8): Likewise.
25565 (vaddlv_u16): Likewise.
25566 (vaddlvq_s8): Likewise.
25567 (vaddlvq_s16): Likewise.
25568 (vaddlvq_s32): Likewise.
25569 (vaddlvq_u8): Likewise.
25570 (vaddlvq_u16): Likewise.
25571 (vaddlvq_u32): Likewise.
25572 (vaddlv_s32): Likewise.
25573 (vaddlv_u32): Likewise.
25574 * config/aarch64/iterators.md (VDQV_L): New mode iterator.
25575 (unspec): Add UNSPEC_SADDLV, UNSPEC_UADDLV.
25576 (Vwstype): New mode attribute.
25578 (VWIDE_S): Likewise.
25579 (USADDLV): New int iterator.
25580 (su): Handle UNSPEC_SADDLV, UNSPEC_UADDLV.
25582 2021-01-29 Jonathan Wright <jonathan.wright@arm.com>
25584 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_lane[q]
25585 builtin generator macros.
25586 * config/aarch64/aarch64-simd.md (aarch64_vec_<su>mlsl_lane<Qlane>):
25588 * config/aarch64/arm_neon.h (vmlsl_lane_s16): Use RTL builtin
25589 instead of inline asm.
25590 (vmlsl_lane_s32): Likewise.
25591 (vmlsl_lane_u16): Likewise.
25592 (vmlsl_lane_u32): Likewise.
25593 (vmlsl_laneq_s16): Likewise.
25594 (vmlsl_laneq_s32): Likewise.
25595 (vmlsl_laneq_u16): Likewise.
25596 (vmlsl_laneq_u32): Likewise.
25598 2021-01-29 Richard Biener <rguenther@suse.de>
25600 * doc/invoke.texi (--param max-gcse-memory): Document unit
25602 * gcse.c (gcse_or_cprop_is_too_expensive): Adjust.
25603 * params.opt (--param max-gcse-memory): Adjust default and
25604 document unit of size.
25606 2021-01-29 Richard Biener <rguenther@suse.de>
25608 PR rtl-optimization/98863
25609 * gcse.c (gcse_or_cprop_is_too_expensive): Use unsigned
25610 HOST_WIDE_INT for the memory estimate.
25612 2021-01-29 Bin Cheng <bin.cheng@linux.alibaba.com>
25613 Richard Biener <rguenther@suse.de>
25615 PR tree-optimization/97627
25616 * tree-ssa-loop-niter.c (number_of_iterations_exit_assumptions):
25617 Do not analyze fake edges.
25619 2021-01-29 Richard Biener <rguenther@suse.de>
25621 PR rtl-optimization/98144
25622 * df.h (df_mir_bb_info): Add con_visited member.
25623 * df-problems.c (df_mir_alloc): Initialize con_visited,
25624 do not fully populate IN and OUT.
25625 (df_mir_reset): Likewise.
25626 (df_mir_confluence_0): Set con_visited.
25627 (df_mir_confluence_n): Properly handle implicitely
25628 fully populated IN and OUT as designated by con_visited
25629 and update con_visited accordingly.
25631 2021-01-29 Jakub Jelinek <jakub@redhat.com>
25634 * config/arm/vec-common.md (mve_vshlq_<supf><mode>,
25635 vashl<mode>3, vashr<mode>3, vlshr<mode>3): Add
25636 && !TARGET_REALLY_IWMMXT to conditions.
25638 2021-01-29 Jakub Jelinek <jakub@redhat.com>
25641 * cfgbuild.c (find_bb_boundaries): Reset debug_insn when seeing
25644 2021-01-28 Marek Polacek <polacek@redhat.com>
25647 * stor-layout.c (finalize_type_size): If we reset TYPE_USER_ALIGN in
25648 the main variant, maybe reset it in its variants too.
25649 * tree.c (check_base_type): Return true only if TYPE_USER_ALIGN match.
25650 (check_aligned_type): Check if TYPE_USER_ALIGN match.
25652 2021-01-28 Christophe Lyon <christophe.lyon@linaro.org>
25655 * config/arm/arm.c (arm_rtx_costs_internal): Adjust cost of vector
25656 of constant zero for comparisons.
25658 2021-01-28 Michael Meissner <meissner@linux.ibm.com>
25660 * config/rs6000/rs6000.c (rs6000_mangle_decl_assembler_name): Add
25661 support for mapping built-in function names for long double
25662 built-in functions if long double is IEEE 128-bit.
25664 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
25666 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlsl_n
25667 builtin generator macros.
25668 * config/aarch64/aarch64-simd.md (aarch64_<su>mlsl_n<mode>):
25670 * config/aarch64/arm_neon.h (vmlsl_n_s16): Use RTL builtin
25671 instead of inline asm.
25672 (vmlsl_n_s32): Likewise.
25673 (vmlsl_n_u16): Likewise.
25674 (vmlsl_n_u32): Likewise.
25676 2021-01-28 Jonathan Wright <jonathan.wright@arm.com>
25678 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal_n
25679 builtin generator macros.
25680 * config/aarch64/aarch64-simd.md (aarch64_<su>mlal_n<mode>):
25682 * config/aarch64/arm_neon.h (vmlal_n_s16): Use RTL builtin
25683 instead of inline asm.
25684 (vmlal_n_s32): Likewise.
25685 (vmlal_n_u16): Likewise.
25686 (vmlal_n_u32): Likewise.
25688 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25690 * config/aarch64/aarch64-simd-builtins.def (shrn2): Define
25692 * config/aarch64/aarch64-simd.md (aarch64_shrn2<mode>_insn_le):
25694 (aarch64_shrn2<mode>_insn_be): Likewise.
25695 (aarch64_shrn2<mode>): Likewise.
25696 * config/aarch64/arm_neon.h (vshrn_high_n_s16): Reimlplement
25698 (vshrn_high_n_s32): Likewise.
25699 (vshrn_high_n_s64): Likewise.
25700 (vshrn_high_n_u16): Likewise.
25701 (vshrn_high_n_u32): Likewise.
25702 (vshrn_high_n_u64): Likewise.
25704 2021-01-28 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
25706 * config/aarch64/aarch64-simd-builtins.def (shrn): Define
25708 * config/aarch64/aarch64-simd.md (aarch64_shrn<mode>_insn_le):
25710 (aarch64_shrn<mode>_insn_be): Likewise.
25711 (aarch64_shrn<mode>): Likewise.
25712 * config/aarch64/arm_neon.h (vshrn_n_s16): Reimplement using
25714 (vshrn_n_s32): Likewise.
25715 (vshrn_n_s64): Likewise.
25716 (vshrn_n_u16): Likewise.
25717 (vshrn_n_u32): Likewise.
25718 (vshrn_n_u64): Likewise.
25719 * config/aarch64/iterators.md (vn_mode): New mode attribute.
25721 2021-01-28 Richard Biener <rguenther@suse.de>
25723 PR rtl-optimization/80960
25724 * dse.c (check_mem_read_rtx): Call get_addr on the
25727 2021-01-28 Xionghu Luo <luoxhu@linux.ibm.com>
25728 David Edelsohn <dje.gcc@gmail.com>
25731 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
25732 Don't generate VIEW_CONVERT_EXPR for fcode ALTIVEC_BUILTIN_VEC_INSERT
25734 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
25736 * config/rs6000/rs6000.c (rs6000_expand_vector_set): Remove the
25737 wrapper call rs6000_expand_vector_set_var for cleanup. Call
25738 rs6000_expand_vector_set_var_p9 and rs6000_expand_vector_set_var_p8
25740 (rs6000_expand_vector_set_var): Delete.
25741 (rs6000_expand_vector_set_var_p9): Make static.
25742 (rs6000_expand_vector_set_var_p8): Make static.
25744 2021-01-28 Xing GUO <higuoxing@gmail.com>
25746 * common/config/riscv/riscv-common.c
25747 (riscv_subset_list::parsing_subset_version): Fix -march option parsing
25748 when `p` extension exists.
25750 2021-01-27 Vladimir N. Makarov <vmakarov@redhat.com>
25752 PR rtl-optimization/97684
25753 * ira.c (ira): Call ira_set_pseudo_classes before
25754 update_equiv_regs when it is necessary.
25756 2021-01-27 Jakub Jelinek <jakub@redhat.com>
25759 * config/aarch64/aarch64.md (*aarch64_bfxilsi_uxtw): Use
25760 %w0, %w1 and %2 instead of %0, %1 and %2.
25762 2021-01-27 Aaron Sawdey <acsawdey@linux.ibm.com>
25764 * config/rs6000/genfusion.pl: New script to generate
25765 define_insn_and_split patterns so combine can arrange fused
25766 instructions next to each other.
25767 * config/rs6000/fusion.md: New file, generated fused instruction
25768 patterns for combine.
25769 * config/rs6000/predicates.md (const_m1_to_1_operand): New predicate.
25770 (non_update_memory_operand): New predicate.
25771 * config/rs6000/rs6000-cpus.def: Add OPTION_MASK_P10_FUSION and
25772 OPTION_MASK_P10_FUSION_LD_CMPI to ISA_3_1_MASKS_SERVER and
25774 * config/rs6000/rs6000-protos.h (address_is_non_pfx_d_or_x): Add
25776 * config/rs6000/rs6000.c (rs6000_option_override_internal):
25777 Automatically set OPTION_MASK_P10_FUSION and
25778 OPTION_MASK_P10_FUSION_LD_CMPI if target is power10.
25779 (rs600_opt_masks): Allow -mpower10-fusion
25780 in function attributes.
25781 (address_is_non_pfx_d_or_x): New function.
25782 * config/rs6000/rs6000.h: Add MASK_P10_FUSION.
25783 * config/rs6000/rs6000.md: Include fusion.md.
25784 * config/rs6000/rs6000.opt: Add -mpower10-fusion
25785 and -mpower10-fusion-ld-cmpi.
25786 * config/rs6000/t-rs6000: Add dependencies involving fusion.md.
25788 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
25790 * config/aarch64/aarch64-simd-builtins.def: Add [su]mlal
25791 builtin generator macros.
25792 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlal<mode>):
25794 (aarch64_<su>mlal<mode>): This.
25795 * config/aarch64/arm_neon.h (vmlal_s8): Use RTL builtin
25796 instead of inline asm.
25797 (vmlal_s16): Likewise.
25798 (vmlal_s32): Likewise.
25799 (vmlal_u8): Likewise.
25800 (vmlal_u16): Likewise.
25801 (vmlal_u32): Likewise.
25803 2021-01-27 Richard Biener <rguenther@suse.de>
25805 PR tree-optimization/98854
25806 * tree-vect-slp.c (vect_build_slp_tree_2): Also build
25807 PHIs from scalars when the number of CTORs matches the
25808 number of children.
25810 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
25812 * config/aarch64/aarch64-simd-builtins.def: Add mls_n builtin
25814 * config/aarch64/aarch64-simd.md (*aarch64_mls_elt_merge<mode>):
25816 (aarch64_mls_n<mode>): This.
25817 * config/aarch64/arm_neon.h (vmls_n_s16): Use RTL builtin
25819 (vmls_n_s32): Likewise.
25820 (vmls_n_u16): Likewise.
25821 (vmls_n_u32): Likewise.
25822 (vmlsq_n_s16): Likewise.
25823 (vmlsq_n_s32): Likewise.
25824 (vmlsq_n_u16): Likewise.
25825 (vmlsq_n_u32): Likewise.
25827 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
25829 * config/aarch64/aarch64-simd-builtins.def: Add mls builtin
25831 * config/aarch64/arm_neon.h (vmls_s8): Use RTL builtin rather
25833 (vmls_s16): Likewise.
25834 (vmls_s32): Likewise.
25835 (vmls_u8): Likewise.
25836 (vmls_u16): Likewise.
25837 (vmls_u32): Likewise.
25838 (vmlsq_s8): Likewise.
25839 (vmlsq_s16): Likewise.
25840 (vmlsq_s32): Likewise.
25841 (vmlsq_u8): Likewise.
25842 (vmlsq_u16): Likewise.
25843 (vmlsq_u32): Likewise.
25845 2021-01-27 Jonathan Wright <jonathan.wright@arm.com>
25847 * config/aarch64/aarch64-simd-builtins.def: Add mla_n builtin
25849 * config/aarch64/aarch64-simd.md (*aarch64_mla_elt_merge<mode>):
25851 (aarch64_mla_n<mode>): This.
25852 * config/aarch64/arm_neon.h (vmla_n_s16): Use RTL builtin
25854 (vmla_n_s32): Likewise.
25855 (vmla_n_u16): Likewise.
25856 (vmla_n_u32): Likewise.
25857 (vmlaq_n_s16): Likewise.
25858 (vmlaq_n_s32): Likewise.
25859 (vmlaq_n_u16): Likewise.
25860 (vmlaq_n_u32): Likewise.
25862 2021-01-27 liuhongt <hongtao.liu@intel.com>
25865 * config/i386/sse.md (sse2_gt<mode>3): Drop !TARGET_XOP in condition.
25866 (*sse2_eq<mode>3): Ditto.
25868 2021-01-27 Jakub Jelinek <jakub@redhat.com>
25870 * tree-pass.h (PROP_trees): Rename to ...
25871 (PROP_gimple): ... this.
25872 * cfgexpand.c (pass_data_expand): Replace PROP_trees with PROP_gimple.
25873 * passes.c (execute_function_dump, execute_function_todo,
25874 execute_one_ipa_transform_pass, execute_one_pass): Likewise.
25875 * varpool.c (ctor_for_folding): Likewise.
25877 2021-01-27 Jakub Jelinek <jakub@redhat.com>
25879 PR tree-optimization/97260
25880 * varpool.c: Include tree-pass.h.
25881 (ctor_for_folding): In GENERIC return DECL_INITIAL for TREE_READONLY
25882 non-TREE_SIDE_EFFECTS automatic variables.
25884 2021-01-26 Paul Fee <paul.f.fee@gmail.com>
25886 * doc/cpp.texi (__cplusplus): Document value for -std=c++23
25888 * doc/invoke.texi: Document -std=c++23 and -std=gnu++23.
25889 * dwarf2out.c (highest_c_language): Recognise C++20 and C++23.
25890 (gen_compile_unit_die): Recognise C++23.
25892 2021-01-26 Jakub Jelinek <jakub@redhat.com>
25895 * dwarf2asm.c (dw2_assemble_integer): Cast DWARF2_ADDR_SIZE to int
25898 2021-01-26 Jakub Jelinek <jakub@redhat.com>
25901 * config/aarch64/aarch64.c (aarch64_mask_and_shift_for_ubfiz_p):
25902 Use UINTVAL (shft_amnt) and UINTVAL (mask) instead of INTVAL (shft_amnt)
25903 and INTVAL (mask). Add && INTVAL (mask) > 0 condition.
25905 2021-01-26 Richard Biener <rguenther@suse.de>
25907 * gimple-pretty-print.c (dump_binary_rhs): Handle
25908 VEC_WIDEN_{PLUS,MINUS}_{LO,HI}_EXPR.
25910 2021-01-26 Richard Biener <rguenther@suse.de>
25912 PR middle-end/98726
25913 * tree.h (vector_cst_int_elt): Remove.
25914 * tree.c (vector_cst_int_elt): Use poly_wide_int for computations,
25917 2021-01-26 Andrew Stubbs <ams@codesourcery.com>
25919 * config/gcn/gcn.c (gcn_expand_reduc_scalar): Use move instructions
25920 for V64DFmode min/max reductions.
25922 2021-01-26 Jakub Jelinek <jakub@redhat.com>
25924 * dwarf2asm.c (dw2_assemble_integer): Handle size twice as large
25925 as DWARF2_ADDR_SIZE if x is not a scalar int by emitting it as
25926 two halves, one with x and the other with const0_rtx, ordered
25927 depending on endianity.
25929 2021-01-26 Alexandre Oliva <oliva@adacore.com>
25931 * gimplify.c (gimplify_decl_expr): Skip asan marking calls for
25932 temporaries not seen in binding block, and not about to be
25933 added as gimple variables.
25935 2021-01-25 Martin Sebor <msebor@redhat.com>
25938 * tree-ssa-ccp.c (pass_post_ipa_warn::execute): Adjust warning text.
25940 2021-01-25 Martin Liska <mliska@suse.cz>
25942 * value-prof.c (get_nth_most_common_value): Use %s instead
25945 2021-01-25 Jakub Jelinek <jakub@redhat.com>
25948 * configure.ac (HAVE_AS_GDWARF_5_DEBUG_FLAG): Only define if
25949 readelf -wi is able to read the emitted .debug_info back.
25950 * configure: Regenerated.
25952 2021-01-25 Martin Liska <mliska@suse.cz>
25954 PR gcov-profile/98739
25955 * common.opt: Add missing sign symbol.
25956 * value-prof.c (get_nth_most_common_value): Restore handling
25957 of PROFILE_REPRODUCIBILITY_PARALLEL_RUNS and
25958 PROFILE_REPRODUCIBILITY_MULTITHREADED.
25960 2021-01-25 Richard Biener <rguenther@suse.de>
25962 PR middle-end/98807
25963 * tree.c (vector_element_bits): Always use precision of
25964 the element type for boolean vectors.
25966 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
25968 * config/rtems.h (STARTFILE_SPEC): Remove qnolinkcmds.
25969 (ENDFILE_SPEC): Evaluate qnolinkcmds.
25971 2021-01-25 Sebastian Huber <sebastian.huber@embedded-brains.de>
25973 * config/rtems.h (STARTFILE_SPEC): Remove nostdlib and
25974 nostartfiles handling since this is already done by
25975 LINK_COMMAND_SPEC. Evaluate qnolinkcmds.
25976 (ENDFILE_SPEC): Remove nostdlib and nostartfiles handling since this
25977 is already done by LINK_COMMAND_SPEC.
25978 (LIB_SPECS): Remove nostdlib and nodefaultlibs handling since
25979 this is already done by LINK_COMMAND_SPEC. Remove qnolinkcmds
25982 2021-01-25 Jakub Jelinek <jakub@redhat.com>
25985 * fold-const-call.c (host_size_t_cst_p): Renamed to ...
25986 (size_t_cst_p): ... this. Check and store unsigned HOST_WIDE_INT
25987 value rather than host size_t.
25988 (fold_const_call): Change type of s2 from size_t to
25989 unsigned HOST_WIDE_INT. Use size_t_cst_p instead of
25990 host_size_t_cst_p. For strncmp calls, pass MIN (s2, SIZE_MAX)
25991 instead of s2 as last argument.
25993 2021-01-25 Tamar Christina <tamar.christina@arm.com>
25995 * config/arm/iterators.md (rotsplit1, rotsplit2, conj_op, fcmac1,
25996 VCMLA_OP, VCMUL_OP): New.
25997 * config/arm/mve.md (mve_vcmlaq<mve_rot><mode>): Support vec_dup 0.
25998 * config/arm/neon.md (cmul<conj_op><mode>3): New.
25999 * config/arm/unspecs.md (UNSPEC_VCMLA_CONJ, UNSPEC_VCMLA180_CONJ,
26000 UNSPEC_VCMUL_CONJ): New.
26001 * config/arm/vec-common.md (cmul<conj_op><mode>3, arm_vcmla<rot><mode>,
26002 cml<fcmac1><conj_op><mode>4): New.
26004 2021-01-23 Jakub Jelinek <jakub@redhat.com>
26007 * config/rs6000/mmintrin.h (__m64): Add __may_alias__ attribute.
26009 2021-01-22 Jonathan Wright <jonathan.wright@arm.com>
26011 * config/aarch64/aarch64-simd-builtins.def: Add mla builtin
26013 * config/aarch64/arm_neon.h (vmla_s8): Use RTL builtin rather
26015 (vmla_s16): Likewise.
26016 (vmla_s32): Likewise.
26017 (vmla_u8): Likewise.
26018 (vmla_u16): Likewise.
26019 (vmla_u32): Likewise.
26020 (vmlaq_s8): Likewise.
26021 (vmlaq_s16): Likewise.
26022 (vmlaq_s32): Likewise.
26023 (vmlaq_u8): Likewise.
26024 (vmlaq_u16): Likewise.
26025 (vmlaq_u32): Likewise.
26027 2021-01-22 David Malcolm <dmalcolm@redhat.com>
26029 * doc/invoke.texi (GCC_EXTRA_DIAGNOSTIC_OUTPUT): Add @findex
26032 2021-01-22 Jakub Jelinek <jakub@redhat.com>
26035 * dwarf2out.c (output_file_names): For -gdwarf-5, if there are no
26036 filenames to emit, still emit the required 0 index directory and
26037 filename entries that match DW_AT_comp_dir and DW_AT_name of the
26040 2021-01-22 Marek Polacek <polacek@redhat.com>
26043 * doc/invoke.texi: Update C++ ABI Version 15 description.
26045 2021-01-22 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26047 PR tree-optimization/98766
26048 * tree-ssa-math-opts.c (convert_mult_to_fma): Use maybe_le when
26049 comparing against type size with param_avoid_fma_max_bits.
26051 2021-01-22 Richard Biener <rguenther@suse.de>
26053 PR middle-end/98793
26054 * tree.c (vector_element_bits): Key single-bit bool vector on
26055 integer mode rather than not vector mode.
26057 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
26060 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
26061 Generate ARRAY_REF(VIEW_CONVERT_EXPR) for P8 and later
26063 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): Update
26064 to call different path for P8 and P9.
26065 (rs6000_expand_vector_set_var_p9): New function.
26066 (rs6000_expand_vector_set_var_p8): New function.
26068 2021-01-22 Xionghu Luo <luoxhu@linux.ibm.com>
26072 * config/rs6000/rs6000-c.c (altivec_resolve_overloaded_builtin):
26073 Ajdust variable index vec_insert from address dereference to
26074 ARRAY_REF(VIEW_CONVERT_EXPR) tree expression.
26075 * config/rs6000/rs6000-protos.h (rs6000_expand_vector_set_var):
26077 * config/rs6000/rs6000.c (rs6000_expand_vector_set_var): New function.
26079 2021-01-22 Martin Liska <mliska@suse.cz>
26081 PR gcov-profile/98739
26082 * profile.c (compute_value_histograms): Drop time profile for
26083 -fprofile-reproducible=multithreaded.
26085 2021-01-22 Nathan Sidwell <nathan@acm.org>
26087 * gcc.c (process_command): Don't check OPT_SPECIAL_input_file
26090 2021-01-22 Richard Biener <rguenther@suse.de>
26092 PR middle-end/98773
26093 * tree-data-ref.c (initalize_matrix_A): Revert previous
26094 change, retaining failing on HOST_WIDE_INT_MIN CHREC_RIGHT.
26096 2021-01-22 Jakub Jelinek <jakub@redhat.com>
26098 PR tree-optimization/90248
26099 * match.pd (X cmp 0.0 ? 1.0 : -1.0 -> copysign(1, +-X),
26100 X cmp 0.0 ? -1.0 : +1.0 -> copysign(1, -+X)): Remove
26102 (X * (X cmp 0.0 ? 1.0 : -1.0) -> +-abs(X),
26103 X * (X cmp 0.0 ? -1.0 : 1.0) -> +-abs(X)): New simplifications.
26105 2021-01-22 Jakub Jelinek <jakub@redhat.com>
26107 PR tree-optimization/98255
26108 * tree-dfa.c (get_ref_base_and_extent): For ARRAY_REFs, sign
26109 extend index - low_bound from sizetype's precision rather than index
26111 (get_addr_base_and_unit_offset_1): Likewise.
26112 * tree-ssa-sccvn.c (ao_ref_init_from_vn_reference): Likewise.
26113 * gimple-fold.c (fold_const_aggregate_ref_1): Likewise.
26115 2021-01-22 Richard Biener <rguenther@suse.de>
26117 PR tree-optimization/98786
26118 * tree-ssa-phiopt.c (factor_out_conditional_conversion): Avoid
26119 adding new uses of abnormals. Verify we deal with a conditional
26122 2021-01-22 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
26125 * optc-save-gen.awk: Add arm_fp16_format to checked_options.
26127 2021-01-22 liuhongt <hongtao.liu@intel.com>
26131 * config/i386/sse.md (VI_128_256): New mode iterator.
26132 (*avx_cmp<mode>3_1, *avx_cmp<mode>3_2, *avx_cmp<mode>3_3,
26133 *avx_cmp<mode>3_4, *avx2_eq<mode>3, *avx2_pcmp<mode>3_1,
26134 *avx2_pcmp<mode>3_2, *avx2_gt<mode>3): New
26135 define_insn_and_split to lower avx512 vector comparison to avx
26136 version when dest is vector.
26137 (*<avx512>_cmp<mode>3,*<avx512>_cmp<mode>3,*<avx512>_ucmp<mode>3):
26138 define_insn_and_split for negating the comparison result.
26139 * config/i386/predicates.md (float_vector_all_ones_operand):
26141 * config/i386/i386-expand.c (ix86_expand_sse_movcc): Use
26142 general NOT operator without UNSPEC_MASKOP.
26144 2021-01-21 Vladimir N. Makarov <vmakarov@redhat.com>
26146 PR rtl-optimization/98777
26147 * lra-int.h (lra_pmode_pseudo): New extern.
26148 * lra.c (lra_pmode_pseudo): New global.
26150 * lra-eliminations.c (eliminate_regs_in_insn): Use it.
26152 2021-01-21 Ilya Leoshkevich <iii@linux.ibm.com>
26154 * fwprop.c (fwprop_propagation::classify_result): Allow
26155 (subreg (mem)) simplifications.
26157 2021-01-21 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26159 * config/aarch64/aarch64-simd.md (aarch64_sqdml<SBINQOPS:as>l<mode>):
26161 (aarch64_sqdmlal<mode>): ... This...
26162 (aarch64_sqdmlsl<mode>): ... And this.
26163 (aarch64_sqdml<SBINQOPS:as>l_lane<mode>): Split into...
26164 (aarch64_sqdmlal_lane<mode>): ... This...
26165 (aarch64_sqdmlsl_lane<mode>): ... And this.
26166 (aarch64_sqdml<SBINQOPS:as>l_laneq<mode>): Split into...
26167 (aarch64_sqdmlsl_laneq<mode>): ... This...
26168 (aarch64_sqdmlal_laneq<mode>): ... And this.
26169 (aarch64_sqdml<SBINQOPS:as>l_n<mode>): Split into...
26170 (aarch64_sqdmlsl_n<mode>): ... This...
26171 (aarch64_sqdmlal_n<mode>): ... And this.
26172 (aarch64_sqdml<SBINQOPS:as>l2<mode>_internal): Split into...
26173 (aarch64_sqdmlal2<mode>_internal): ... This...
26174 (aarch64_sqdmlsl2<mode>_internal): ... And this.
26176 2021-01-21 Christophe Lyon <christophe.lyon@linaro.org>
26178 * config/arm/arm_mve.h (__arm_vcmpneq_s8): Fix return type.
26180 2021-01-21 Andrea Corallo <andrea.corallo@arm.com>
26183 * doc/sourcebuild.texi (arm_thumb2_no_arm_v8_1_lob): Document.
26185 2021-01-21 liuhongt <hongtao.liu@intel.com>
26187 PR rtl-optimization/98694
26188 * regcprop.c (copy_value): If SRC had been assigned a mode
26189 narrower than the copy, we can't link DEST into the chain even
26190 they have same hard_regno_nregs(i.e. HImode/SImode in i386
26193 2021-01-20 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26195 * config/aarch64/aarch64-simd.md (aarch64_get_lane<mode>):
26196 Convert to define_insn_and_split. Split into simple move when moving
26199 2021-01-20 Segher Boessenkool <segher@kernel.crashing.org>
26201 * config/rs6000/rs6000.c (rs6000_emit_le_vsx_store): Change assert.
26202 Adjust comment. Simplify code.
26204 2021-01-20 Jakub Jelinek <jakub@redhat.com>
26207 * dwarf2out.c (reset_indirect_string): Also reset indirect strings
26208 with DW_FORM_line_strp form.
26209 (prune_unused_types_update_strings): Don't add into debug_str_hash
26210 indirect strings with DW_FORM_line_strp form.
26211 (adjust_name_comp_dir): New function.
26212 (dwarf2out_finish): Call it on CU DIEs after resetting
26213 debug_line_str_hash.
26215 2021-01-20 Vladimir N. Makarov <vmakarov@redhat.com>
26217 PR rtl-optimization/98722
26218 * lra-eliminations.c (eliminate_regs_in_insn): Check that target
26219 has no 3-op add insn to transform insns containing two pluses.
26221 2021-01-20 Richard Biener <rguenther@suse.de>
26223 * hwint.h (add_hwi): New function.
26224 (mul_hwi): Likewise.
26225 * tree-data-ref.c (initialize_matrix_A): Properly translate
26226 tree constants and avoid HOST_WIDE_INT_MIN.
26227 (lambda_matrix_row_add): Avoid undefined integer overflow
26228 and return true on such overflow.
26229 (lambda_matrix_right_hermite): Handle overflow from
26230 lambda_matrix_row_add gracefully. Simplify previous fix.
26231 (analyze_subscript_affine_affine): Likewise.
26233 2021-01-20 Eugene Rozenfeld <erozen@microsoft.com>
26235 PR tree-optimization/96674
26236 * match.pd: New patterns: x < y || y == XXX_MIN --> x <= y - 1
26237 x >= y && y != XXX_MIN --> x > y - 1
26239 2021-01-20 Richard Sandiford <richard.sandiford@arm.com>
26241 PR tree-optimization/98535
26242 * tree-vect-slp.c (duplicate_and_interleave): Use quick_grow_cleared.
26243 If the high and low permutes are the same, remove the high permutes
26244 from the working set and only continue with the low ones.
26246 2021-01-20 Jakub Jelinek <jakub@redhat.com>
26248 PR tree-optimization/98721
26249 * builtins.c (access_ref::inform_access): Don't assume
26250 SSA_NAME_IDENTIFIER must be non-NULL. Print messages about
26251 object whenever allocfn is NULL, rather than only when DECL_P
26252 is true. Use %qE instead of %qD for that. Formatting fixes.
26254 2021-01-20 Richard Biener <rguenther@suse.de>
26256 PR tree-optimization/98758
26257 * tree-data-ref.c (int_divides_p): Use lambda_int arguments.
26258 (lambda_matrix_right_hermite): Avoid undefinedness with
26259 signed integer abs and multiplication.
26260 (analyze_subscript_affine_affine): Use lambda_int.
26262 2021-01-20 David Malcolm <dmalcolm@redhat.com>
26265 * dwarf2out.c (output_line_info): Rename static variable
26266 "generation", moving it out of the function to...
26267 (output_line_info_generation): New.
26268 (init_sections_and_labels): Likewise, renaming the variable to...
26269 (init_sections_and_labels_generation): New.
26270 (dwarf2out_c_finalize): Reset the new variables.
26272 2021-01-19 Martin Sebor <msebor@redhat.com>
26274 PR middle-end/98664
26275 * tree-ssa-live.c (remove_unused_scope_block_p): Keep scopes for
26276 all functions, even if they're not declared artificial or inline.
26277 * tree.c (tree_inlined_location): Use macro expansion location
26278 only if scope traversal fails to expose one.
26280 2021-01-19 Richard Sandiford <richard.sandiford@arm.com>
26282 PR rtl-optimization/92294
26283 * alias.c (compare_base_symbol_refs): Take an extra parameter
26284 and add the distance between two symbols to it. Enshrine in
26285 comments that -1 means "either 0 or 1, but we can't tell
26286 which at compile time".
26287 (memrefs_conflict_p): Update call accordingly.
26288 (rtx_equal_for_memref_p): Likewise. Take the distance between symbols
26291 2021-01-19 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26293 * config/aarch64/aarch64-simd-builtins.def (sqshl, uqshl,
26294 sqrshl, uqrshl, sqadd, uqadd, sqsub, uqsub, suqadd, usqadd, sqmovn,
26295 uqmovn, sqxtn2, uqxtn2, sqabs, sqneg, sqdmlal, sqdmlsl, sqdmlal_lane,
26296 sqdmlsl_lane, sqdmlal_laneq, sqdmlsl_laneq, sqdmlal_n, sqdmlsl_n,
26297 sqdmlal2, sqdmlsl2, sqdmlal2_lane, sqdmlsl2_lane, sqdmlal2_laneq,
26298 sqdmlsl2_laneq, sqdmlal2_n, sqdmlsl2_n, sqdmull, sqdmull_lane,
26299 sqdmull_laneq, sqdmull_n, sqdmull2, sqdmull2_lane, sqdmull2_laneq,
26300 sqdmull2_n, sqdmulh, sqrdmulh, sqdmulh_lane, sqdmulh_laneq,
26301 sqrdmulh_lane, sqrdmulh_laneq, sqshrun_n, sqrshrun_n, sqshrn_n,
26302 uqshrn_n, sqrshrn_n, uqrshrn_n, sqshlu_n, sqshl_n, uqshl_n, sqrdmlah,
26303 sqrdmlsh, sqrdmlah_lane, sqrdmlsh_lane, sqrdmlah_laneq, sqrdmlsh_laneq,
26304 sqmovun): Use NONE flags.
26306 2021-01-19 Richard Biener <rguenther@suse.de>
26309 * ipa-modref.c (analyze_stmt): Only record a summary for a
26312 2021-01-19 Richard Biener <rguenther@suse.de>
26314 PR middle-end/98638
26315 * tree-ssanames.c (fini_ssanames): Zero SSA_NAME_DEF_STMT.
26317 2021-01-19 Daniel Hellstrom <daniel@gaisler.com>
26319 * config/sparc/rtemself.h (TARGET_OS_CPP_BUILTINS): Add
26320 built-in define __FIX_LEON3FT_TN0018.
26322 2021-01-19 Richard Biener <rguenther@suse.de>
26325 * tree-inline.c (tree_function_versioning): Set input_location
26326 to UNKNOWN_LOCATION throughout the function.
26328 2021-01-19 Tobias Burnus <tobias@codesourcery.com>
26331 * omp-low.c (lower_omp_target): Handle nonpointer is_device_ptr.
26333 2021-01-19 Martin Jambor <mjambor@suse.cz>
26336 * ipa-sra.c (ssa_name_only_returned_p): New parameter fun. Check
26337 whether non-call exceptions allow removal of a statement.
26338 (isra_analyze_call): Pass the appropriate function to
26339 ssa_name_only_returned_p.
26341 2021-01-19 Geng Qi <gengqi@linux.alibaba.com>
26343 * config/riscv/arch-canonicalize (longext_sort): New function for
26344 sorting 'multi-letter'.
26345 * config/riscv/multilib-generator: Adjusting the loop of 'alt' in
26346 'alts'. The 'arch' may not be the first of 'alts'.
26347 (_expand_combination): Add underline for the 'ext' without '*'.
26348 This is because, a single-letter extension can always be treated well
26349 with a '_' prefix, but it cannot be separated out if it is appended
26352 2021-01-18 Vladimir N. Makarov <vmakarov@redhat.com>
26355 * ira.c (ira): Skip abnormal critical edge splitting.
26357 2021-01-18 Jakub Jelinek <jakub@redhat.com>
26359 PR tree-optimization/98727
26360 * tree-ssa-math-opts.c (match_arith_overflow): Fix up computation of
26361 second .MUL_OVERFLOW operand for signed multiplication with overflow
26362 checking if the second operand of multiplication is not constant.
26364 2021-01-18 David Edelsohn <dje.gcc@gmail.com>
26366 * doc/invoke.texi (-gdwarf): TPF defaults to version 2 and AIX
26367 defaults to version 4.
26369 2021-01-18 David Malcolm <dmalcolm@redhat.com>
26371 * attribs.h (fndecl_dealloc_argno): New decl.
26372 * builtins.c (call_dealloc_argno): Split out second half of
26374 (fndecl_dealloc_argno): New.
26375 * doc/extend.texi (Common Function Attributes): Document the
26376 interaction between the analyzer and the malloc attribute.
26377 * doc/invoke.texi (Static Analyzer Options): Likewise.
26379 2021-01-17 David Edelsohn <dje.gcc@gmail.com>
26381 * config/rs6000/aix71.h (SUBTARGET_OVERRIDE_OPTIONS): Override
26382 dwarf_version to 4.
26383 * config/rs6000/aix72.h (SUBTARGET_OVERRIDE_OPTIONS): Same.
26385 2021-01-17 Martin Jambor <mjambor@suse.cz>
26388 * cgraph.c (clone_of_p): Check also former_clone_of as we climb
26391 2021-01-17 Mark Wielaard <mark@klomp.org>
26393 * common.opt (gdwarf-): Init(5).
26394 * doc/invoke.texi (-gdwarf): Document default to 5.
26396 2021-01-16 Kwok Cheung Yeung <kcy@codesourcery.com>
26398 * builtin-types.def
26399 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT): Rename
26401 (BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR):
26402 ...this. Add extra argument.
26403 * gimplify.c (omp_default_clause): Ensure that event handle is
26404 firstprivate in a task region.
26405 (gimplify_scan_omp_clauses): Handle OMP_CLAUSE_DETACH.
26406 (gimplify_adjust_omp_clauses): Likewise.
26407 * omp-builtins.def (BUILT_IN_GOMP_TASK): Change function type to
26408 BT_FN_VOID_OMPFN_PTR_OMPCPYFN_LONG_LONG_BOOL_UINT_PTR_INT_PTR.
26409 * omp-expand.c (expand_task_call): Add GOMP_TASK_FLAG_DETACH to flags
26410 if detach clause specified. Add detach argument when generating
26412 * omp-low.c (scan_sharing_clauses): Setup data environment for detach
26414 (finish_taskreg_scan): Move field for variable containing the event
26415 handle to the front of the struct.
26416 * tree-core.h (enum omp_clause_code): Add OMP_CLAUSE_DETACH. Fix
26418 * tree-nested.c (convert_nonlocal_omp_clauses): Handle
26419 OMP_CLAUSE_DETACH clause.
26420 (convert_local_omp_clauses): Handle OMP_CLAUSE_DETACH clause.
26421 * tree-pretty-print.c (dump_omp_clause): Handle OMP_CLAUSE_DETACH.
26422 * tree.c (omp_clause_num_ops): Add entry for OMP_CLAUSE_DETACH.
26424 (omp_clause_code_name): Add entry for OMP_CLAUSE_DETACH. Fix
26426 (walk_tree_1): Handle OMP_CLAUSE_DETACH.
26428 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
26430 * config/nios2/t-rtems: Reset all MULTILIB_* variables. Shorten
26431 multilib directory names. Use MULTILIB_REQUIRED instead of
26432 MULTILIB_EXCEPTIONS. Add -mhw-mul -mhw-mulx -mhw-div
26433 -mcustom-fpu-cfg=fph2 multilib.
26435 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
26437 * config/nios2/nios2.c (NIOS2_FPU_CONFIG_NUM): Adjust value.
26438 (nios2_init_fpu_configs): Provide register values for new
26439 -mcustom-fpu-cfg=fph2 option variant.
26440 * doc/invoke.texi (-mcustom-fpu-cfg=fph2): Document new option
26443 2021-01-16 Sebastian Huber <sebastian.huber@embedded-brains.de>
26445 * config/nios2/nios2.c (nios2_custom_check_insns): Remove
26446 custom instruction warnings.
26448 2021-01-16 Jakub Jelinek <jakub@redhat.com>
26450 PR tree-optimization/96669
26451 * match.pd ((CST << x) & 1 -> x == 0): New simplification.
26453 2021-01-16 Jakub Jelinek <jakub@redhat.com>
26455 PR tree-optimization/96271
26456 * passes.def: Pass false argument to first two pass_cd_dce
26457 instances and true to last instance. Add comment that
26458 last instance rewrites no longer addressed locals.
26459 * tree-ssa-dce.c (pass_cd_dce): Add update_address_taken_p member and
26461 (pass_cd_dce::set_pass_param): New method.
26462 (pass_cd_dce::execute): Return TODO_update_address_taken from
26463 last cd_dce instance.
26465 2021-01-15 Carl Love <cel@us.ibm.com>
26467 * config/rs6000/altivec.h (vec_mulh, vec_div, vec_dive, vec_mod):
26469 * config/rs6000/altivec.md (VIlong): Move define to file vsx.md.
26470 * config/rs6000/rs6000-builtin.def (DIVES_V4SI, DIVES_V2DI,
26471 DIVEU_V4SI, DIVEU_V2DI, DIVS_V4SI, DIVS_V2DI, DIVU_V4SI,
26472 DIVU_V2DI, MODS_V2DI, MODS_V4SI, MODU_V2DI, MODU_V4SI,
26473 MULHS_V2DI, MULHS_V4SI, MULHU_V2DI, MULHU_V4SI, MULLD_V2DI):
26474 Add builtin define.
26475 (MULH, DIVE, MOD): Add new BU_P10_OVERLOAD_2 definitions.
26476 * config/rs6000/rs6000-call.c (VSX_BUILTIN_VEC_DIV,
26477 VSX_BUILTIN_VEC_DIVE, P10_BUILTIN_VEC_MOD, P10_BUILTIN_VEC_MULH):
26478 New overloaded definitions.
26479 (builtin_function_type) [P10V_BUILTIN_DIVEU_V4SI,
26480 P10V_BUILTIN_DIVEU_V2DI, P10V_BUILTIN_DIVU_V4SI,
26481 P10V_BUILTIN_DIVU_V2DI, P10V_BUILTIN_MODU_V2DI,
26482 P10V_BUILTIN_MODU_V4SI, P10V_BUILTIN_MULHU_V2DI,
26483 P10V_BUILTIN_MULHU_V4SI]: Add case
26484 statement for builtins.
26485 * config/rs6000/rs6000.md (bits): Add new attribute sizes V4SI, V2DI.
26486 * config/rs6000/vsx.md (VIlong): Moved from config/rs6000/altivec.md.
26487 (UNSPEC_VDIVES, UNSPEC_VDIVEU): New unspec definitions.
26488 (vsx_mul_v2di): Add if TARGET_POWER10 statement.
26489 (vsx_udiv_v2di): Add if TARGET_POWER10 statement.
26490 (dives_<mode>, diveu_<mode>, div<mode>3, uvdiv<mode>3,
26491 mods_<mode>, modu_<mode>, mulhs_<mode>, mulhu_<mode>, mulv2di3):
26492 Add define_insn, mode is VIlong.
26493 * doc/extend.texi (vec_mulh, vec_mul, vec_div, vec_dive, vec_mod):
26494 Add builtin descriptions.
26496 2021-01-15 Eric Botcazou <ebotcazou@adacore.com>
26498 * final.c (final_start_function_1): Reset force_source_line.
26500 2021-01-15 Jakub Jelinek <jakub@redhat.com>
26502 PR tree-optimization/96669
26503 * match.pd (((1 << A) & 1) != 0 -> A == 0,
26504 ((1 << A) & 1) == 0 -> A != 0): Generalize for 1s replaced by
26505 possibly different power of two constants and to right shift too.
26507 2021-01-15 Jakub Jelinek <jakub@redhat.com>
26509 PR tree-optimization/96681
26510 * match.pd ((x < 0) ^ (y < 0) to (x ^ y) < 0): New simplification.
26511 ((x >= 0) ^ (y >= 0) to (x ^ y) < 0): Likewise.
26512 ((x < 0) ^ (y >= 0) to (x ^ y) >= 0): Likewise.
26513 ((x >= 0) ^ (y < 0) to (x ^ y) >= 0): Likewise.
26515 2021-01-15 Alexandre Oliva <oliva@adacore.com>
26517 * opts.c (gen_command_line_string): Exclude -dumpbase-ext.
26519 2021-01-15 Tamar Christina <tamar.christina@arm.com>
26521 * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
26522 cmul<conj_op><mode>3): New.
26523 * config/aarch64/iterators.md (UNSPEC_FCMUL,
26524 UNSPEC_FCMUL180, UNSPEC_FCMLA_CONJ, UNSPEC_FCMLA180_CONJ,
26525 UNSPEC_CMLA_CONJ, UNSPEC_CMLA180_CONJ, UNSPEC_CMUL, UNSPEC_CMUL180,
26526 FCMLA_OP, FCMUL_OP, conj_op, rotsplit1, rotsplit2, fcmac1, sve_rot1,
26527 sve_rot2, SVE2_INT_CMLA_OP, SVE2_INT_CMUL_OP, SVE2_INT_CADD_OP): New.
26528 (rot): Add UNSPEC_FCMUL, UNSPEC_FCMUL180.
26529 (rot_op): Renamed to conj_op.
26530 * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
26531 cmul<conj_op><mode>3): New.
26532 * config/aarch64/aarch64-sve2.md (cml<fcmac1><conj_op><mode>4,
26533 cmul<conj_op><mode>3): New.
26535 2021-01-15 David Malcolm <dmalcolm@redhat.com>
26539 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
26540 Escape the tempfile name when constructing the expected output.
26542 2021-01-15 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26544 * config/aarch64/aarch64-simd.md (*aarch64_<su>mlsl_hi<mode>):
26546 (aarch64_<su>mlsl_hi<mode>): ... This.
26547 (aarch64_<su>mlsl_hi<mode>): Define.
26548 (*aarch64_<su>mlsl<mode): Rename to...
26549 (aarch64_<su>mlsl<mode): ... This.
26550 * config/aarch64/aarch64-simd-builtins.def (smlsl, umlsl,
26551 smlsl_hi, umlsl_hi): Define builtins.
26552 * config/aarch64/arm_neon.h (vmlsl_high_s8, vmlsl_high_s16,
26553 vmlsl_high_s32, vmlsl_high_u8, vmlsl_high_u16, vmlsl_high_u32,
26554 vmlsl_s8, vmlsl_s16, vmlsl_s32, vmlsl_u8,
26555 vmlsl_u16, vmlsl_u32): Reimplement with builtins.
26557 2021-01-15 Uroš Bizjak <ubizjak@gmail.com>
26559 * config/i386/i386-c.c (ix86_target_macros):
26560 Use cpp_define_formatted for __SIZEOF_FLOAT80__ definition.
26562 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
26565 * config.gcc (aarch64*-*-*): Add aarch64-cc-fusion.o to extra_objs.
26566 * Makefile.in (RTL_SSA_H): New variable.
26567 * config/aarch64/t-aarch64 (aarch64-cc-fusion.o): New rule.
26568 * config/aarch64/aarch64-protos.h (make_pass_cc_fusion): Declare.
26569 * config/aarch64/aarch64-passes.def: Add pass_cc_fusion after
26571 * config/aarch64/aarch64-cc-fusion.cc: New file.
26573 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
26575 * recog.h (insn_change_watermark::~insn_change_watermark): Avoid
26576 calling cancel_changes for changes that no longer exist.
26578 2021-01-15 Richard Sandiford <richard.sandiford@arm.com>
26580 * rtl-ssa/functions.h (function_info::ref_defs): Rename to...
26581 (function_info::reg_defs): ...this.
26582 * rtl-ssa/member-fns.inl (function_info::ref_defs): Rename to...
26583 (function_info::reg_defs): ...this.
26585 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26588 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
26590 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26593 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26596 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
26598 2021-01-15 Richard Biener <rguenther@suse.de>
26600 PR tree-optimization/96376
26601 * tree-vect-stmts.c (get_load_store_type): Disregard alignment
26602 for VMAT_INVARIANT.
26604 2021-01-15 Martin Liska <mliska@suse.cz>
26606 * doc/install.texi: Document that some tests need pytest module.
26607 * doc/sourcebuild.texi: Likewise.
26609 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26612 * config/arm/arm_neon.h (vceqz_p64, vceqq_p64, vceqzq_p64): New.
26614 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26616 * config/arm/mve.md (mve_vshrq_n_s<mode>_imm): New entry.
26617 (mve_vshrq_n_u<mode>_imm): Likewise.
26618 * config/arm/neon.md (vashr<mode>3, vlshr<mode>3): Move to ...
26619 * config/arm/vec-common.md: ... here.
26621 2021-01-15 Christophe Lyon <christophe.lyon@linaro.org>
26623 * config/arm/mve.md (mve_vshlq_<supf><mode>): Move to
26625 * config/arm/neon.md (vashl<mode>3): Delete.
26626 * config/arm/vec-common.md (mve_vshlq_<supf><mode>): New.
26627 (vasl<mode>3): New expander.
26629 2021-01-15 Richard Biener <rguenther@suse.de>
26631 PR tree-optimization/98685
26632 * tree-vect-slp.c (vect_schedule_slp_node): Refactor handling
26633 of vector extern defs.
26635 2021-01-14 David Malcolm <dmalcolm@redhat.com>
26638 * diagnostic.c (diagnostic_kind_text): Break out this array
26640 (diagnostic_build_prefix): ...here.
26641 (fancy_abort): Detect when diagnostic_initialize has not yet been
26642 called and fall back to a minimal implementation of printing the
26643 ICE, rather than segfaulting in internal_error.
26645 2021-01-14 David Malcolm <dmalcolm@redhat.com>
26647 * diagnostic.c (diagnostic_initialize): Eliminate
26648 parseable_fixits_p in favor of initializing extra_output_kind from
26649 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
26650 (convert_column_unit): New function, split out from...
26651 (diagnostic_converted_column): ...this.
26652 (print_parseable_fixits): Add "column_unit" and "tabstop" params.
26653 Use them to call convert_column_unit on the column values.
26654 (diagnostic_report_diagnostic): Eliminate conditional on
26655 parseable_fixits_p in favor of a switch statement on
26656 extra_output_kind, passing the appropriate values to the new
26657 params of print_parseable_fixits.
26658 (selftest::test_print_parseable_fixits_none): Update for new
26659 params of print_parseable_fixits.
26660 (selftest::test_print_parseable_fixits_insert): Likewise.
26661 (selftest::test_print_parseable_fixits_remove): Likewise.
26662 (selftest::test_print_parseable_fixits_replace): Likewise.
26663 (selftest::test_print_parseable_fixits_bytes_vs_display_columns):
26665 (selftest::diagnostic_c_tests): Call it.
26666 * diagnostic.h (enum diagnostics_extra_output_kind): New.
26667 (diagnostic_context::parseable_fixits_p): Delete field in favor
26669 (diagnostic_context::extra_output_kind): ...this new field.
26670 * doc/invoke.texi (Environment Variables): Add
26671 GCC_EXTRA_DIAGNOSTIC_OUTPUT.
26672 * opts.c (common_handle_option): Update handling of
26673 OPT_fdiagnostics_parseable_fixits for change to diagnostic_context
26676 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26678 * tree-vect-slp-patterns.c (class complex_operations_pattern,
26679 complex_operations_pattern::matches,
26680 complex_operations_pattern::recognize,
26681 complex_operations_pattern::build): New.
26682 (slp_patterns): Use it.
26684 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26686 * internal-fn.def (COMPLEX_FMS, COMPLEX_FMS_CONJ): New.
26687 * optabs.def (cmls_optab, cmls_conj_optab): New.
26688 * doc/md.texi: Document them.
26689 * tree-vect-slp-patterns.c (class complex_fms_pattern,
26690 complex_fms_pattern::matches, complex_fms_pattern::recognize,
26691 complex_fms_pattern::build): New.
26693 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26695 * internal-fn.def (COMPLEX_FMA, COMPLEX_FMA_CONJ): New.
26696 * optabs.def (cmla_optab, cmla_conj_optab): New.
26697 * doc/md.texi: Document them.
26698 * tree-vect-slp-patterns.c (vect_match_call_p,
26699 class complex_fma_pattern, vect_slp_reset_pattern,
26700 complex_fma_pattern::matches, complex_fma_pattern::recognize,
26701 complex_fma_pattern::build): New.
26703 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26705 * internal-fn.def (COMPLEX_MUL, COMPLEX_MUL_CONJ): New.
26706 * optabs.def (cmul_optab, cmul_conj_optab): New.
26707 * doc/md.texi: Document them.
26708 * tree-vect-slp-patterns.c (vect_match_call_complex_mla,
26709 vect_normalize_conj_loc, is_eq_or_top, vect_validate_multiplication,
26710 vect_build_combine_node, class complex_mul_pattern,
26711 complex_mul_pattern::matches, complex_mul_pattern::recognize,
26712 complex_mul_pattern::build): New.
26714 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26716 * tree-vect-slp.c (optimize_load_redistribution_1): New.
26717 (optimize_load_redistribution, vect_is_slp_load_node): New.
26718 (vect_match_slp_patterns): Use it.
26720 2021-01-14 Tamar Christina <tamar.christina@arm.com>
26722 * tree-vect-slp-patterns.c (complex_add_pattern::build):
26725 2021-01-14 Thomas Schwinge <thomas@codesourcery.com>
26727 * config/gcn/mkoffload.c (main): Create an offload image only in
26728 64-bit configurations.
26730 2021-01-14 H.J. Lu <hjl.tools@gmail.com>
26733 * config/i386/i386-options.c (ix86_option_override_internal):
26734 Issue an error for -fcf-protection with CF_BRANCH when compiling
26735 for 32-bit non-TARGET_CMOV targets.
26737 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
26740 * config/i386/i386-options.c (ix86_valid_target_attribute_inner_p):
26741 Remove declaration and initialization of shadow variable "ret".
26742 (ix86_option_override_internal): Remove delcaration of
26743 shadow variable "i". Redeclare shadowed variable to unsigned.
26744 * common/config/i386/i386-common.c (pta_size): Redeclare to unsigned.
26745 * config/i386/i386-builtins.c (get_builtin_code_for_version):
26746 Update for redeclaration.
26747 * config/i386/i386.h (pta_size): Ditto.
26749 2021-01-14 Richard Biener <rguenther@suse.de>
26751 PR tree-optimization/98674
26752 * tree-data-ref.c (base_supports_access_fn_components_p): New.
26753 (initialize_data_dependence_relation): For two bases without
26754 possible access fns resort to type size equality when determining
26755 shape compatibility.
26757 2021-01-14 Prathamesh Kulkarni <prathamesh.kulkarni@linaro.org>
26760 * config/arm/arm_neon.h: Replace calls to __builtin_vcge* by
26761 <=, >= operators in vcle and vcge intrinsics respectively.
26762 * config/arm/arm_neon_builtins.def: Remove entry for
26765 2021-01-14 Uroš Bizjak <ubizjak@gmail.com>
26768 * config/i386/i386-options.c (ix86_function_specific_save):
26769 Remove redundant assignment to opts->x_ix86_branch_cost.
26770 * config/i386/i386.c (ix86_prefetch_sse):
26771 Rename from x86_prefetch_sse. Update all uses.
26772 * config/i386/i386.h: Update for rename.
26773 * config/i386/i386-options.h: Ditto.
26775 2021-01-14 Jakub Jelinek <jakub@redhat.com>
26778 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3,
26779 *sse4_1_zero_extendv4hiv4si2_3, *sse4_1_zero_extendv2siv2di2_3):
26780 Use Bm instead of m for non-avx. Add isa attribute.
26782 2021-01-14 Jakub Jelinek <jakub@redhat.com>
26784 PR tree-optimization/96688
26785 * match.pd (~(X >> Y) -> ~X >> Y): New simplification if
26786 ~X can be simplified.
26788 2021-01-14 Richard Sandiford <richard.sandiford@arm.com>
26790 * tree-vect-stmts.c (vect_model_load_cost): Account for unused
26791 IFN_LOAD_LANES results.
26793 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26795 * config/aarch64/aarch64-simd.md (aarch64_<su>xtl<mode>):
26797 (aarch64_xtn<mode>): Likewise.
26798 * config/aarch64/aarch64-simd-builtins.def (sxtl, uxtl, xtn):
26801 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
26803 (vmovl_s16): Likewise.
26804 (vmovl_s32): Likewise.
26805 (vmovl_u8): Likewise.
26806 (vmovl_u16): Likewise.
26807 (vmovl_u32): Likewise.
26808 (vmovn_s16): Likewise.
26809 (vmovn_s32): Likewise.
26810 (vmovn_s64): Likewise.
26811 (vmovn_u16): Likewise.
26812 (vmovn_u32): Likewise.
26813 (vmovn_u64): Likewise.
26815 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26817 * config/aarch64/aarch64-simd.md (aarch64_<su>qxtn2<mode>_le):
26819 (aarch64_<su>qxtn2<mode>_be): Likewise.
26820 (aarch64_<su>qxtn2<mode>): Likewise.
26821 * config/aarch64/aarch64-simd-builtins.def (sqxtn2, uqxtn2):
26823 * config/aarch64/iterators.md (SAT_TRUNC): Define code_iterator.
26824 (su): Handle ss_truncate and us_truncate.
26825 * config/aarch64/arm_neon.h (vqmovn_high_s16): Reimplement using
26827 (vqmovn_high_s32): Likewise.
26828 (vqmovn_high_s64): Likewise.
26829 (vqmovn_high_u16): Likewise.
26830 (vqmovn_high_u32): Likewise.
26831 (vqmovn_high_u64): Likewise.
26833 2021-01-14 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
26835 * config/aarch64/aarch64-simd.md (aarch64_xtn2<mode>_le):
26837 (aarch64_xtn2<mode>_be): Likewise.
26838 (aarch64_xtn2<mode>): Likewise.
26839 * config/aarch64/aarch64-simd-builtins.def (xtn2): Define
26841 * config/aarch64/arm_neon.h (vmovn_high_s16): Reimplement using
26843 (vmovn_high_s32): Likewise.
26844 (vmovn_high_s64): Likewise.
26845 (vmovn_high_u16): Likewise.
26846 (vmovn_high_u32): Likewise.
26847 (vmovn_high_u64): Likewise.
26849 2021-01-13 Stafford Horne <shorne@gmail.com>
26851 * config/or1k/or1k.h (ASM_PREFERRED_EH_DATA_FORMAT): New macro.
26853 2021-01-13 Stafford Horne <shorne@gmail.com>
26855 * config/or1k/linux.h (TARGET_ASM_FILE_END): Define macro.
26857 2021-01-13 Stafford Horne <shorne@gmail.com>
26859 * config/or1k/or1k.h (TARGET_CPU_CPP_BUILTINS): Add builtin
26860 define for __or1k_hard_float__.
26862 2021-01-13 Stafford Horne <shorne@gmail.com>
26864 * config/or1k/or1k.h (NO_PROFILE_COUNTERS): Define as 1.
26865 (PROFILE_HOOK): Define to call _mcount.
26866 (FUNCTION_PROFILER): Change from abort to no-op.
26868 2021-01-13 Jakub Jelinek <jakub@redhat.com>
26870 PR tree-optimization/96691
26871 * match.pd ((~X | C) ^ D -> (X | C) ^ (~D ^ C),
26872 (~X & C) ^ D -> (X & C) ^ (D ^ C)): New simplifications if
26873 (~D ^ C) or (D ^ C) can be simplified.
26875 2021-01-13 Richard Biener <rguenther@suse.de>
26877 PR tree-optimization/92645
26878 * match.pd (BIT_FIELD_REF to conversion): Delay canonicalization
26879 until after vector lowering.
26881 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
26883 * config/aarch64/aarch64-sve.md (fnma<mode>4): Extend from SVE_FULL_I
26885 (@aarch64_pred_fnma<mode>, cond_fnma<mode>, *cond_fnma<mode>_2)
26886 (*cond_fnma<mode>_4, *cond_fnma<mode>_any): Likewise.
26888 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
26890 * config/aarch64/aarch64-sve.md (fma<mode>4): Extend from SVE_FULL_I
26892 (@aarch64_pred_fma<mode>, cond_fma<mode>, *cond_fma<mode>_2)
26893 (*cond_fma<mode>_4, *cond_fma<mode>_any): Likewise.
26895 2021-01-13 Richard Biener <rguenther@suse.de>
26897 PR tree-optimization/92645
26898 * tree-vect-slp.c (vect_build_slp_tree_1): Relax supported
26899 BIT_FIELD_REF argument.
26900 (vect_build_slp_tree_2): Record the desired vector type
26901 on the external vector def.
26902 (vectorizable_slp_permutation): Handle required punning
26903 of existing vector defs.
26905 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
26907 * rtl-ssa/accesses.h (def_lookup): Fix order of comparison results.
26909 2021-01-13 Richard Sandiford <richard.sandiford@arm.com>
26911 * config/sh/sh.md (movsf_ie): Remove operands[2] test.
26913 2021-01-13 Samuel Thibault <samuel.thibault@ens-lyon.org>
26915 * config.gcc [$target == *-*-gnu*]: Enable
26916 'default_gnu_indirect_function'.
26918 2021-01-13 Jakub Jelinek <jakub@redhat.com>
26921 * optabs.c (expand_vec_perm_const): Don't force v0 and v1 into
26922 registers before calling targetm.vectorize.vec_perm_const, only after
26924 * config/i386/i386-expand.c (ix86_vectorize_vec_perm_const): Handle
26925 two argument permutation when one operand is zero vector and only
26926 after that force operands into registers.
26927 * config/i386/sse.md (*avx2_zero_extendv16qiv16hi2_1): New
26928 define_insn_and_split pattern.
26929 (*avx512bw_zero_extendv32qiv32hi2_1): Likewise.
26930 (*avx512f_zero_extendv16hiv16si2_1): Likewise.
26931 (*avx2_zero_extendv8hiv8si2_1): Likewise.
26932 (*avx512f_zero_extendv8siv8di2_1): Likewise.
26933 (*avx2_zero_extendv4siv4di2_1): Likewise.
26934 * config/mips/mips.c (mips_vectorize_vec_perm_const): Force operands
26936 * config/arm/arm.c (arm_vectorize_vec_perm_const): Likewise.
26937 * config/sparc/sparc.c (sparc_vectorize_vec_perm_const): Likewise.
26938 * config/ia64/ia64.c (ia64_vectorize_vec_perm_const): Likewise.
26939 * config/aarch64/aarch64.c (aarch64_vectorize_vec_perm_const): Likewise.
26940 * config/rs6000/rs6000.c (rs6000_vectorize_vec_perm_const): Likewise.
26941 * config/gcn/gcn.c (gcn_vectorize_vec_perm_const): Likewise. Use std::swap.
26943 2021-01-13 Martin Liska <mliska@suse.cz>
26945 PR tree-optimization/98455
26946 * gimple-if-to-switch.cc (condition_info::record_phi_mapping):
26947 Record also virtual PHIs.
26948 (pass_if_to_switch::execute): Return TODO_cleanup_cfg only
26951 2021-01-13 Jonathan Wakely <jwakely@redhat.com>
26953 * doc/invoke.texi (C++ Modules): Fix typos.
26955 2021-01-13 Richard Biener <rguenther@suse.de>
26957 PR tree-optimization/98640
26958 * tree-ssa-sccvn.c (visit_nary_op): Do not try to
26959 handle plus or minus from a truncated operand to be
26962 2021-01-13 Jakub Jelinek <jakub@redhat.com>
26965 * config/i386/i386.md (*btr<mode>_1, *btr<mode>_2): New
26966 define_insn_and_split patterns.
26967 (splitter after *btr<mode>_2): New splitter.
26969 2021-01-13 Martin Liska <mliska@suse.cz>
26972 * cgraphunit.c (analyze_functions): Remove dead code.
26974 2021-01-13 Qian Jianhua <qianjh@cn.fujitsu.com>
26976 * config/aarch64/aarch64-cost-tables.h (a64fx_extra_costs): New.
26977 * config/aarch64/aarch64.c (a64fx_addrcost_table): New.
26978 (a64fx_regmove_cost, a64fx_vector_cost): New.
26979 (a64fx_tunings): Use the new added cost tables.
26981 2021-01-13 Jakub Jelinek <jakub@redhat.com>
26984 * config/i386/predicates.md (pmovzx_parallel): New predicate.
26985 * config/i386/sse.md (*sse4_1_zero_extendv8qiv8hi2_3): New
26986 define_insn_and_split pattern.
26987 (*sse4_1_zero_extendv4hiv4si2_3): Likewise.
26988 (*sse4_1_zero_extendv2siv2di2_3): Likewise.
26990 2021-01-13 Julian Brown <julian@codesourcery.com>
26992 * config/gcn/gcn.c (gcn_conditional_register_usage): Remove dead code
26993 to fix v0 register.
26995 2021-01-13 Julian Brown <julian@codesourcery.com>
26997 * config/gcn/gcn.c (gcn_md_reorg): Fix case where EXEC reg is live
27000 2021-01-13 Julian Brown <julian@codesourcery.com>
27002 * config/gcn/gcn-valu.md (recip<mode>2<exec>, recip<mode>2): Use unspec
27003 for reciprocal-approximation instructions.
27004 (div<mode>3): Use fused multiply-accumulate operations for reciprocal
27005 refinement and division result.
27006 * config/gcn/gcn.md (UNSPEC_RCP): New unspec constant.
27008 2021-01-13 Julian Brown <julian@codesourcery.com>
27010 * config/gcn/gcn-valu.md (subdf): Rename to...
27013 2021-01-12 Martin Liska <mliska@suse.cz>
27015 * gcov.c (source_info::debug): Fix printf format for 32-bit hosts.
27017 2021-01-12 Andrea Corallo <andrea.corallo@arm.com>
27019 * function-abi.h: Fix typo.
27021 2021-01-12 Christophe Lyon <christophe.lyon@linaro.org>
27025 * config/arm/arm.h (ARM_HAVE_NEON_V8QI_LDST): New macro.
27026 (ARM_HAVE_NEON_V16QI_LDST, ARM_HAVE_NEON_V4HI_LDST): Likewise.
27027 (ARM_HAVE_NEON_V8HI_LDST, ARM_HAVE_NEON_V2SI_LDST): Likewise.
27028 (ARM_HAVE_NEON_V4SI_LDST, ARM_HAVE_NEON_V4HF_LDST): Likewise.
27029 (ARM_HAVE_NEON_V8HF_LDST, ARM_HAVE_NEON_V4BF_LDST): Likewise.
27030 (ARM_HAVE_NEON_V8BF_LDST, ARM_HAVE_NEON_V2SF_LDST): Likewise.
27031 (ARM_HAVE_NEON_V4SF_LDST, ARM_HAVE_NEON_DI_LDST): Likewise.
27032 (ARM_HAVE_NEON_V2DI_LDST): Likewise.
27033 (ARM_HAVE_V8QI_LDST, ARM_HAVE_V16QI_LDST): Likewise.
27034 (ARM_HAVE_V4HI_LDST, ARM_HAVE_V8HI_LDST): Likewise.
27035 (ARM_HAVE_V2SI_LDST, ARM_HAVE_V4SI_LDST, ARM_HAVE_V4HF_LDST): Likewise.
27036 (ARM_HAVE_V8HF_LDST, ARM_HAVE_V4BF_LDST, ARM_HAVE_V8BF_LDST): Likewise.
27037 (ARM_HAVE_V2SF_LDST, ARM_HAVE_V4SF_LDST, ARM_HAVE_DI_LDST): Likewise.
27038 (ARM_HAVE_V2DI_LDST): Likewise.
27039 * config/arm/mve.md (*movmisalign<mode>_mve_store): New pattern.
27040 (*movmisalign<mode>_mve_load): New pattern.
27041 * config/arm/neon.md (movmisalign<mode>): Move to ...
27042 * config/arm/vec-common.md: ... here.
27044 2021-01-12 Vladimir N. Makarov <vmakarov@redhat.com>
27047 * lra-eliminations.c (eliminate_regs_in_insn): Add transformation
27048 of pattern 'plus (plus (hard reg, const), pseudo)'.
27050 2021-01-12 Richard Biener <rguenther@suse.de>
27052 PR tree-optimization/98550
27053 * tree-vect-slp.c (vect_record_max_nunits): Check whether
27054 the group size is a multiple of the vector element count.
27055 (vect_build_slp_tree_1): When we need to fail because
27056 the vector type choosen causes unrolling do so lazily
27057 without affecting matches only at the end to guide group splitting.
27059 2021-01-12 Martin Liska <mliska@suse.cz>
27062 * optc-save-gen.awk: Compare also n_target_save vars with
27065 2021-01-12 Martin Liska <mliska@suse.cz>
27067 * gcov.c (source_info::debug): New.
27068 (print_usage): Add --debug (-D) option.
27069 (process_args): Likewise.
27070 (generate_results): Call src->debug after
27071 accumulate_line_counts.
27072 (read_graph_file): Properly assign id for EXIT_BLOCK.
27073 * profile.c (branch_prob): Dump function body before it is
27076 2021-01-12 Jakub Jelinek <jakub@redhat.com>
27078 PR tree-optimization/98629
27079 * tree-ssa-math-opts.c (arith_overflow_check_p): Don't update use_stmt
27080 unless returning non-zero.
27082 2021-01-12 Jakub Jelinek <jakub@redhat.com>
27084 PR tree-optimization/95731
27085 * tree-ssa-reassoc.c (optimize_range_tests_cmp_bitwise): Also optimize
27086 x < 0 && y < 0 && z < 0 into (x | y | z) < 0 for signed x, y, z.
27087 (optimize_range_tests): Call optimize_range_tests_cmp_bitwise
27088 only after optimize_range_tests_var_bound.
27090 2021-01-12 Jakub Jelinek <jakub@redhat.com>
27092 * configure.ac: Ensure c/Make-lang.in comes first in @all_lang_makefrags@.
27093 * configure: Regenerated.
27095 2021-01-12 liuhongt <hongtao.liu@intel.com>
27098 * config/i386/i386-builtins.h (BUILTIN_DESC_SWAP_OPERANDS):
27100 * config/i386/i386-expand.c (ix86_expand_sse_comi): Delete
27103 2021-01-12 Alexandre Oliva <oliva@adacore.com>
27105 * ssa-iterators.h (end_imm_use_stmt_traverse): Forward
27107 (auto_end_imm_use_stmt_traverse): New struct.
27108 (FOR_EACH_IMM_USE_STMT): Use it.
27109 (BREAK_FROM_IMM_USE_STMT, RETURN_FROM_IMM_USE_STMT): Remove,
27111 * gimple-ssa-strength-reduction.c: ... here, ...
27112 * graphite-scop-detection.c: ... here, ...
27113 * ipa-modref.c, ipa-pure-const.c, ipa-sra.c: ... here, ...
27114 * tree-predcom.c, tree-ssa-ccp.c: ... here, ...
27115 * tree-ssa-dce.c, tree-ssa-dse.c: ... here, ...
27116 * tree-ssa-loop-ivopts.c, tree-ssa-math-opts.c: ... here, ...
27117 * tree-ssa-phiprop.c, tree-ssa.c: ... here, ...
27118 * tree-vect-slp.c: ... and here, ...
27119 * doc/tree-ssa.texi: ... and the example here.
27121 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27123 * config/aarch64/aarch64-sve.md (sdiv_pow2<mode>3): Extend from
27124 SVE_FULL_I to SVE_I. Generate an UNSPEC_PRED_X.
27125 (*sdiv_pow2<mode>3): New pattern.
27126 (@cond_<sve_int_op><mode>): Extend from SVE_FULL_I to SVE_I.
27127 Wrap the ASRD in an UNSPEC_PRED_X.
27128 (*cond_<sve_int_op><mode>_2): Likewise. Replace the UNSPEC_PRED_X
27129 predicate with a constant PTRUE, if it isn't already.
27130 (*cond_<sve_int_op><mode>_z): Replace with...
27131 (*cond_<sve_int_op><mode>_any): ...this new pattern.
27133 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27135 * config/aarch64/aarch64-sve.md (*cond_bic<mode>_2): Extend from
27136 SVE_FULL_I to SVE_I.
27137 (*cond_bic<mode>_any): Likewise.
27139 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27141 * config/aarch64/aarch64-sve.md (<su>mul<mode>3_highpart)
27142 (@aarch64_pred_<MUL_HIGHPART:optab><mode>): Extend from SVE_FULL_I
27145 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27147 * config/aarch64/aarch64-sve.md (<su>abd<mode>_3): Extend from
27148 SVE_FULL_I to SVE_I.
27149 (*aarch64_cond_<su>abd<mode>_2): Likewise.
27150 (*aarch64_cond_<su>abd<mode>_any): Likewise.
27151 (@aarch64_pred_<su>abd<mode>): Likewise. Use UNSPEC_PRED_X
27152 for the max and min but not for the minus.
27153 (*aarch64_cond_<su>abd<mode>_3): New pattern.
27155 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27157 * config/aarch64/iterators.md (SVE_24I): New iterator.
27158 * config/aarch64/aarch64-sve.md (*aarch64_adr<mode>_shift): Extend from
27159 SVE_FULL_SDI to SVE_24I. Use containers rather than elements.
27161 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27163 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_BINARY:optab><mode>)
27164 (*cond_<SVE_INT_BINARY:optab><mode>_2): Extend from SVE_FULL_I
27166 (*cond_<SVE_INT_BINARY:optab><mode>_3): Likewise.
27167 (*cond_<SVE_INT_BINARY:optab><mode>_any): Likewise.
27168 (*cond_<SVE_INT_BINARY:optab><mode>_2_const): Likewise.
27169 (*cond_<SVE_INT_BINARY:optab><mode>_any_const): Likewise.
27171 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27173 * config/aarch64/aarch64-sve.md (<SVE_INT_BINARY_IMM:optab><mode>3)
27174 (@aarch64_pred_<SVE_INT_BINARY_IMM:optab><mode>)
27175 (*post_ra_<SVE_INT_BINARY_IMM:optab><mode>3): Extend from SVE_FULL_I
27178 2021-01-11 Richard Sandiford <richard.sandiford@arm.com>
27180 * config/aarch64/aarch64-sve.md (<ASHIFT:optab><mode>3)
27181 (v<ASHIFT:optab><mode>3, @aarch64_pred_<optab><mode>)
27182 (*post_ra_v<ASHIFT:optab><mode>3): Extend from SVE_FULL_I to SVE_I.
27184 2021-01-11 Martin Liska <mliska@suse.cz>
27187 * symtab-clones.h (clone_info::release): Release
27188 symtab::m_clones with ggc_delete as it's a GGC memory.
27190 2021-01-11 Matthias Klose <doko@ubuntu.com>
27192 * Makefile.in (LINK_PROGRESS): Show the link target.
27194 2021-01-11 Richard Biener <rguenther@suse.de>
27196 PR tree-optimization/91403
27197 * tree-vect-data-refs.c (vect_analyze_group_access_1): Cap
27198 single-element interleaving group size at 4096 elements.
27200 2021-01-11 Richard Biener <rguenther@suse.de>
27202 PR tree-optimization/98526
27203 * tree-vect-loop.c (vect_model_reduction_cost): Remove costing
27204 of the actual reduction op for the regular case.
27205 (vectorizable_reduction): Cost the stmts
27206 vect_transform_reduction produces here.
27208 2021-01-11 Andreas Krebbel <krebbel@linux.ibm.com>
27210 * tree-ssa-forwprop.c (simplify_vector_constructor): For
27211 big-endian, use UNPACK[_FLOAT]_HI.
27213 2021-01-11 Tamar Christina <tamar.christina@arm.com>
27215 * tree-vect-slp-patterns.c (class complex_pattern,
27216 class complex_add_pattern): Add parameters to matches.
27217 (complex_add_pattern::build): Free memory.
27218 (complex_add_pattern::matches): Move validation end of match.
27219 (complex_add_pattern::recognize): Likewise.
27221 2021-01-11 Tamar Christina <tamar.christina@arm.com>
27223 * tree-vect-slp-patterns.c (linear_loads_p): Fix externals.
27225 2021-01-11 Tamar Christina <tamar.christina@arm.com>
27227 * tree-vect-slp-patterns.c (is_linear_load_p): Fix ambiguity.
27229 2021-01-11 Jakub Jelinek <jakub@redhat.com>
27231 PR tree-optimization/95867
27232 * tree-ssa-math-opts.h: New header.
27233 * tree-ssa-math-opts.c: Include tree-ssa-math-opts.h.
27234 (powi_as_mults): No longer static. Use build_one_cst instead of
27235 build_real. Formatting fix.
27236 * tree-ssa-reassoc.c: Include tree-ssa-math-opts.h.
27237 (attempt_builtin_powi): Handle multiplication reassociation without
27238 powi_fndecl using powi_as_mults.
27239 (reassociate_bb): For integral types don't require
27240 -funsafe-math-optimizations to call attempt_builtin_powi.
27242 2021-01-11 Jakub Jelinek <jakub@redhat.com>
27244 PR tree-optimization/95852
27245 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): Change
27246 mul_stmts parameter type to vec<gimple *> &. Before cond_stmt
27247 allow in the bb any of the stmts in that vector, div_stmt and
27248 up to 3 cast stmts.
27249 (arith_cast_equal_p): New function.
27250 (arith_overflow_check_p): Add cast_stmt argument, handle signed
27251 multiply overflow checks.
27252 (match_arith_overflow): Adjust caller. Handle signed multiply
27255 2021-01-11 Jakub Jelinek <jakub@redhat.com>
27257 PR tree-optimization/95852
27258 * tree-ssa-math-opts.c (maybe_optimize_guarding_check): New function.
27259 (uaddsub_overflow_check_p): Renamed to ...
27260 (arith_overflow_check_p): ... this. Handle also multiplication
27261 with overflow check.
27262 (match_uaddsub_overflow): Renamed to ...
27263 (match_arith_overflow): ... this. Add cfg_changed argument. Handle
27264 also multiplication with overflow check. Adjust function comment.
27265 (math_opts_dom_walker::after_dom_children): Adjust callers. Call
27266 match_arith_overflow also for MULT_EXPR.
27268 2021-01-11 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27270 * config/aarch64/arm_neon.h (vmovl_s8): Reimplement using
27271 __builtin_convertvector.
27272 (vmovl_s16): Likewise.
27273 (vmovl_s32): Likewise.
27274 (vmovl_u8): Likewise.
27275 (vmovl_u16): Likewise.
27276 (vmovl_u32): Likewise.
27277 (vmovn_s16): Likewise.
27278 (vmovn_s32): Likewise.
27279 (vmovn_s64): Likewise.
27280 (vmovn_u16): Likewise.
27281 (vmovn_u32): Likewise.
27282 (vmovn_u64): Likewise.
27284 2021-01-11 Martin Liska <mliska@suse.cz>
27286 * gimple-if-to-switch.cc (struct condition_info): Use auto_var.
27287 (if_chain::is_beneficial): Delete clusters
27288 (find_conditions): Make second argument of conditions_in_bbs a
27289 pointer so that we control over it's lifetime.
27290 (pass_if_to_switch::execute): Delete them.
27292 2021-01-11 Kewen Lin <linkw@linux.ibm.com>
27294 * ira.c (move_unallocated_pseudos): Check other_reg and skip if
27297 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
27299 * config/vax/vax.md (cc): Remove mode attribute.
27300 (subst_<cc>, subst_f<cc>): Rename to...
27301 (subst_<mode>, subst_f<VAXccnz:mode>): ... these respectively.
27302 (*cbranch<VAXint:mode>4_<VAXcc:mode>): Update for `cc' removal.
27303 (*cbranch<VAXfp:mode>4_<VAXccnz:mode>): Likewise.
27304 (*branch_<mode>, *branch_<mode>_reversed): Likewise.
27306 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
27308 * config/vax/vax.md (subst_f<cc>): Add mode to operands and
27309 `const_double_zero'.
27311 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
27313 * config/pdp11/pdp11.md (PDPfp): New mode iterator.
27314 (fcc_cc, fcc_ccnz): Use it. Add mode to `const_double_zero' and
27317 2021-01-09 Maciej W. Rozycki <macro@linux-mips.org>
27319 * genemit.c (gen_exp) <CONST_DOUBLE>: Handle `const_double_zero'
27321 * read-rtl.c (rtx_reader::read_rtx_code): Handle machine mode
27322 with `const_double_zero'.
27323 * doc/rtl.texi (Constant Expression Types): Document it.
27325 2021-01-09 Jakub Jelinek <jakub@redhat.com>
27328 * tree-cfg.c (verify_gimple_assign_binary): Allow lhs of
27329 POINTER_DIFF_EXPR to be any integral type.
27331 2021-01-09 Jakub Jelinek <jakub@redhat.com>
27333 PR rtl-optimization/98603
27334 * function.c (instantiate_virtual_regs_in_insn): For asm goto
27335 with impossible constraints, drop all SETs, CLOBBERs, drop PARALLEL
27336 if any, set ASM_OPERANDS mode to VOIDmode and change
27337 ASM_OPERANDS_OUTPUT_CONSTRAINT and ASM_OPERANDS_OUTPUT_IDX.
27339 2021-01-09 Alexandre Oliva <oliva@gnu.org>
27342 * final.c (notice_source_line): Narrow down the condition to
27343 skip a line-0 marker.
27345 2021-01-08 Sergei Trofimovich <siarheit@google.com>
27347 * ipa-modref.c (merge_call_side_effects): Fix
27348 linebreak split by reordering two print calls.
27350 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
27352 * config/s390/vector.md (*tf_to_fprx2_0): Rename from
27353 "*mov_tf_to_fprx2_0" for consistency, fix constraint.
27354 (*tf_to_fprx2_1): Rename from "*mov_tf_to_fprx2_1" for
27355 consistency, fix constraint.
27357 2021-01-08 Ilya Leoshkevich <iii@linux.ibm.com>
27359 * config/s390/s390-c.c (s390_def_or_undef_macro): Accept
27360 callables instead of mask values.
27361 (struct target_flag_set_p): New predicate.
27362 (s390_cpu_cpp_builtins_internal): Define or undefine
27363 __LONG_DOUBLE_VX__ macro.
27365 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
27368 * config/i386/i386.c (x86_function_profiler): Use R10 and R11
27369 to call mcount in large model with PIC for NO_PROFILE_COUNTERS
27372 2021-01-08 Richard Biener <rguenther@suse.de>
27374 * tree-ssa-sccvn.c (pass_fre::execute): Reset the SCEV hash table.
27376 2021-01-08 Richard Biener <rguenther@suse.de>
27378 * tree-vect-slp.c (scalar_stmts_to_slp_tree_map_t): Fix.
27379 (vect_build_slp_tree): On cache hit release the matched
27380 scalar stmts vector.
27381 * tree-vect-stmts.c (vectorizable_store): Properly free
27382 vec_oprnds before possibly gathering them again.
27384 2021-01-08 Richard Biener <rguenther@suse.de>
27386 PR tree-optimization/98544
27387 * tree-vect-slp.c (vect_optimize_slp): Always materialize
27388 permutes at a permute node.
27390 2021-01-08 H.J. Lu <hjl.tools@gmail.com>
27393 * config/i386/i386.c (x86_function_profiler): Use R10 to call
27394 mcount in large model. Sorry for large model with PIC.
27396 2021-01-08 Jakub Jelinek <jakub@redhat.com>
27399 * config/i386/i386.opt (ix86_cmodel, ix86_incoming_stack_boundary_arg,
27400 ix86_pmode, ix86_preferred_stack_boundary_arg, ix86_regparm,
27401 ix86_veclibabi_type): Remove x_ prefix, use TargetVariable instead of
27402 TargetSave and initialize for variables with enum types.
27403 (mfentry, mstack-protector-guard-reg=, mstack-protector-guard-offset=,
27404 mstack-protector-guard-symbol=): Add Save.
27405 * config/i386/i386-options.c (ix86_function_specific_save,
27406 ix86_function_specific_restore): Don't save or restore x_ix86_cmodel,
27407 x_ix86_incoming_stack_boundary_arg, x_ix86_pmode,
27408 x_ix86_preferred_stack_boundary_arg, x_ix86_regparm,
27409 x_ix86_veclibabi_type.
27411 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
27413 * config/aarch64/aarch64-sve.md (*cnot<mode>): Extend from
27414 SVE_FULL_I to SVE_I.
27415 (*cond_cnot<mode>_2, *cond_cnot<mode>_any): Likewise.
27417 2021-01-08 Richard Sandiford <richard.sandiford@arm.com>
27419 * config/aarch64/aarch64-sve.md (*cond_uxt<mode>_2): Extend from
27420 SVE_FULL_I to SVE_I.
27421 (*cond_uxt<mode>_any): Likewise.
27423 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27425 * config/aarch64/iterators.md (Vwhalf): New iterator.
27426 * config/aarch64/aarch64-simd.md (aarch64_<sur>adalp<mode>_3):
27428 (aarch64_<sur>adalp<mode>): ... This. Make more
27430 (<sur>sadv16qi): Adjust callsite of the above.
27431 * config/aarch64/aarch64-simd-builtins.def (sadalp, uadalp): New
27433 * config/aarch64/arm_neon.h (vpadal_s8): Reimplement using
27435 (vpadal_s16): Likewise.
27436 (vpadal_u8): Likewise.
27437 (vpadal_u16): Likewise.
27438 (vpadalq_s8): Likewise.
27439 (vpadalq_s16): Likewise.
27440 (vpadalq_s32): Likewise.
27441 (vpadalq_u8): Likewise.
27442 (vpadalq_u16): Likewise.
27443 (vpadalq_u32): Likewise.
27445 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27447 * config/aarch64/aarch64-simd.md (aarch64_<su>abd<mode>_3):
27449 (aarch64_<su>abd<mode>): ... This.
27450 (<sur>sadv16qi): Adjust callsite of the above.
27451 * config/aarch64/aarch64-simd-builtins.def (sabd, uabd): Define
27453 * config/aarch64/arm_neon.h (vabd_s8): Reimplement using
27455 (vabd_s16): Likewise.
27456 (vabd_s32): Likewise.
27457 (vabd_u8): Likewise.
27458 (vabd_u16): Likewise.
27459 (vabd_u32): Likewise.
27460 (vabdq_s8): Likewise.
27461 (vabdq_s16): Likewise.
27462 (vabdq_s32): Likewise.
27463 (vabdq_u8): Likewise.
27464 (vabdq_u16): Likewise.
27465 (vabdq_u32): Likewise.
27467 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27469 * config/aarch64/aarch64-simd-builtins.def (saba, uaba): Define
27471 * config/aarch64/arm_neon.h (vaba_s8): Implement using builtin.
27472 (vaba_s16): Likewise.
27473 (vaba_s32): Likewise.
27474 (vaba_u8): Likewise.
27475 (vaba_u16): Likewise.
27476 (vaba_u32): Likewise.
27477 (vabaq_s8): Likewise.
27478 (vabaq_s16): Likewise.
27479 (vabaq_s32): Likewise.
27480 (vabaq_u8): Likewise.
27481 (vabaq_u16): Likewise.
27482 (vabaq_u32): Likewise.
27484 2021-01-08 Kyrylo Tkachov <kyrylo.tkachov@arm.com>
27486 * config/aarch64/aarch64-simd.md (aba<mode>_3): Rename to...
27487 (aarch64_<su>aba<mode>): ... This. Handle uaba as well.
27488 Change RTL pattern to match.
27490 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
27492 * common/config/riscv/riscv-common.c (riscv_current_subset_list): New.
27493 * config/riscv/riscv-c.c (riscv-subset.h): New.
27494 (INCLUDE_STRING): Define.
27495 (riscv_cpu_cpp_builtins): Add new style architecture extension
27497 * config/riscv/riscv-subset.h (riscv_subset_list::begin): New.
27498 (riscv_subset_list::end): New.
27499 (riscv_current_subset_list): New.
27501 2021-01-08 Kito Cheng <kito.cheng@sifive.com>
27503 * common/config/riscv/riscv-common.c (RISCV_DONT_CARE_VERSION):
27504 Move to riscv-subset.h.
27505 (struct riscv_subset_t): Ditto.
27506 (class riscv_subset_list): Ditto.
27507 * config/riscv/riscv-subset.h (RISCV_DONT_CARE_VERSION): Move
27508 from riscv-common.c.
27509 (struct riscv_subset_t): Ditto.
27510 (class riscv_subset_list): Ditto.
27511 * config/riscv/t-riscv ($(common_out_file)): Add file
27514 2021-01-07 Jakub Jelinek <jakub@redhat.com>
27517 * config/i386/i386.md (*bmi_blsi_<mode>_cmp, *bmi_blsi_<mode>_ccno):
27518 New define_insn patterns.
27520 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
27522 * config/aarch64/aarch64-sve.md (@cond_<SVE_INT_UNARY:optab><mode>)
27523 (*cond_<SVE_INT_UNARY:optab><mode>_2): Extend from SVE_FULL_I to SVE_I.
27524 (*cond_<SVE_INT_UNARY:optab><mode>_any): Likewise.
27526 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
27528 PR tree-optimization/98560
27529 * internal-fn.def (IFN_VCONDU, IFN_VCONDEQ): Use type vec_cond.
27530 * internal-fn.c (vec_cond_mask_direct): Get the data mode from
27532 (vec_cond_direct): Likewise argument 2.
27533 (vec_condu_direct, vec_condeq_direct): Delete.
27534 (expand_vect_cond_optab_fn): Rename to...
27535 (expand_vec_cond_optab_fn): ...this, replacing old macro.
27536 (expand_vec_condu_optab_fn, expand_vec_condeq_optab_fn): Delete.
27537 (expand_vect_cond_mask_optab_fn): Rename to...
27538 (expand_vec_cond_mask_optab_fn): ...this, replacing old macro.
27539 (direct_vec_cond_mask_optab_supported_p): Treat the optab as a
27541 (direct_vec_cond_optab_supported_p): Likewise.
27542 (direct_vec_condu_optab_supported_p): Delete.
27543 (direct_vec_condeq_optab_supported_p): Delete.
27544 * gimple-isel.cc: Include internal-fn.h.
27545 (gimple_expand_vec_cond_expr): Check that IFN_VCONDEQ is supported
27548 2021-01-07 Richard Sandiford <richard.sandiford@arm.com>
27550 PR tree-optimization/98560
27551 * gimple-isel.cc (gimple_expand_vec_cond_expr): If we fail to use
27552 IFN_VCOND{,U,EQ}, fall back on IFN_VCOND_MASK.
27554 2021-01-07 Uroš Bizjak <ubizjak@gmail.com>
27556 * config/i386/i386.md (insn): Merge from plusminus_insn, shift_insn,
27557 rotate_insn and optab code attributes.
27558 Update all uses to merged code attribute.
27559 * config/i386/sse.md: Update all uses to merged code attribute.
27560 * config/i386/mmx.md: Update all uses to merged code attribute.
27562 2021-01-07 Jakub Jelinek <jakub@redhat.com>
27564 PR tree-optimization/98568
27565 * gimple-ssa-store-merging.c (bswap_view_convert): New function.
27566 (bswap_replace): Use it.
27568 2021-01-06 Vladimir N. Makarov <vmakarov@redhat.com>
27570 PR rtl-optimization/97978
27571 * lra-int.h (lra_hard_reg_split_p): New external.
27572 * lra.c (lra_hard_reg_split_p): New global.
27573 (lra): Set up lra_hard_reg_split_p after splitting a hard reg.
27574 * lra-assigns.c (lra_assign): Don't check allocation correctness
27575 after hard reg splitting.
27577 2021-01-06 Martin Sebor <msebor@redhat.com>
27580 * builtins.c (new_delete_mismatch_p): New overload.
27581 (new_delete_mismatch_p (tree, tree)): Call it.
27583 2021-01-06 Alexandre Oliva <oliva@adacore.com>
27585 * Makefile.in (T_GLIMITS_H): New.
27586 (stmp-int-hdrs): Depend on it, use it.
27587 * config/t-vxworks (T_GLIMITS_H): Override it.
27588 (vxw-glimits.h): New.
27590 2021-01-06 Richard Biener <rguenther@suse.de>
27592 PR tree-optimization/98513
27593 * value-range.cc (intersect_ranges): Compare the upper bounds
27594 for the expected relation.
27596 2021-01-06 Gerald Pfeifer <gerald@pfeifer.com>
27599 2020-12-28 Gerald Pfeifer <gerald@pfeifer.com>
27601 * doc/standards.texi (HSAIL): Remove section.
27603 2021-01-05 Samuel Thibault <samuel.thibault@ens-lyon.org>
27605 * configure: Re-generate.
27607 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27609 * doc/invoke.texi (-std=c++20): Adjust for the publication of
27610 ISO 14882:2020 standard.
27611 * doc/standards.texi: Likewise.
27613 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27615 PR tree-optimization/94802
27616 * expr.h (maybe_optimize_sub_cmp_0): Declare.
27617 * expr.c: Include tree-pretty-print.h and flags.h.
27618 (maybe_optimize_sub_cmp_0): New function.
27619 (do_store_flag): Use it.
27620 * cfgexpand.c (expand_gimple_cond): Likewise.
27622 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27624 * mux-utils.h (pointer_mux::m_ptr): Tweak description of contents.
27625 * rtlanal.c (simple_regno_set): Tweak description to clarify the
27628 2021-01-05 Richard Biener <rguenther@suse.de>
27630 PR tree-optimization/98516
27631 * tree-vect-slp.c (vect_optimize_slp): Permute the incoming
27632 lanes when materializing on a VEC_PERM node.
27633 (vectorizable_slp_permutation): Dump the permute properly.
27635 2021-01-05 Richard Biener <rguenther@suse.de>
27637 * tree-vect-slp.c (vect_slp_region): Move debug counter
27638 to cover individual subgraphs.
27640 2021-01-05 Richard Biener <rguenther@suse.de>
27642 PR tree-optimization/98428
27643 * tree-vect-slp.c (vect_build_slp_tree_1): Properly reject
27644 vector lane extracts for loop vectorization.
27646 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27648 PR tree-optimization/98514
27649 * tree-ssa-reassoc.c (bb_rank): Change type from long * to
27651 (operand_rank): Change type from hash_map<tree, long> to
27652 hash_map<tree, int64_t>.
27653 (phi_rank): Change return type from long to int64_t.
27654 (loop_carried_phi): Change block_rank variable type from long to
27656 (propagate_rank): Change return type, rank parameter type and
27657 op_rank variable type from long to int64_t.
27658 (find_operand_rank): Change return type from long to int64_t
27659 and change slot variable type from long * to int64_t *.
27660 (insert_operand_rank): Change rank parameter type from long to
27662 (get_rank): Change return type and rank variable type from long to
27663 int64_t. Use PRId64 instead of ld to print the rank.
27664 (init_reassoc): Change rank variable type from long to int64_t
27665 and adjust correspondingly bb_rank and operand_rank initialization.
27667 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27669 PR tree-optimization/96928
27670 * tree-ssa-phiopt.c (xor_replacement): New function.
27671 (tree_ssa_phiopt_worker): Call it.
27673 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27675 PR tree-optimization/96930
27676 * match.pd ((A / (1 << B)) -> (A >> B)): If A is extended
27677 from narrower value which has the same type as 1 << B, perform
27678 the right shift on the narrower value followed by extension.
27680 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27682 PR tree-optimization/96239
27683 * gimple-ssa-store-merging.c (maybe_optimize_vector_constructor): New
27685 (get_status_for_store_merging): Don't return BB_INVALID for blocks
27686 with potential bswap optimizable CONSTRUCTORs.
27687 (pass_store_merging::execute): Optimize vector CONSTRUCTORs with bswap
27690 2021-01-05 Richard Biener <rguenther@suse.de>
27692 PR tree-optimization/98381
27693 * tree.c (vector_element_bits): Properly compute bool vector
27695 * tree-vect-loop.c (vectorizable_live_operation): Properly
27696 compute the last lane bit offset.
27698 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
27701 * config/i386/sse.md (sse_cvtps2pi): Redefine as define_insn_and_split.
27702 Clear the top 64 bytes of the input XMM register.
27703 (sse_cvttps2pi): Ditto.
27705 2021-01-05 Uroš Bizjak <ubizjak@gmail.com>
27708 * config/i386/xopintrin.h (_mm256_cmov_si256): New.
27710 2021-01-05 H.J. Lu <hjl.tools@gmail.com>
27713 * config/i386/xmmintrin.h (_mm_extract_pi16): Cast to unsigned
27716 2021-01-05 Claudiu Zissulescu <claziss@synopsys.com>
27718 * config/arc/arc.md (maddsidi4_split): Use ACC_REG_FIRST.
27719 (umaddsidi4_split): Likewise.
27721 2021-01-05 liuhongt <hongtao.liu@intel.com>
27724 * config/i386/sse.md (*sse2_pmovskb_zexthisi): New
27725 define_insn_and_split for zero_extend of subreg HI of pmovskb
27727 (*sse2_pmovskb_zexthisi): Add new combine splitters for
27728 zero_extend of not of subreg HI of pmovskb result.
27730 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27733 * explow.c (convert_memory_address_addr_space_1): Handle UNSPECs
27735 * config/aarch64/aarch64.c (aarch64_expand_mov_immediate): Use
27736 convert_memory_address to convert symbolic immediates to ptr_mode
27737 before forcing them to memory.
27739 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27741 PR rtl-optimization/97144
27742 * recog.c (constrain_operands): Initialize matching_operand
27743 for each alternative, rather than only doing it once.
27745 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27747 PR rtl-optimization/98403
27748 * rtl-ssa/changes.cc (function_info::finalize_new_accesses): Explain
27749 why we don't remove call clobbers.
27750 (function_info::apply_changes_to_insn): Don't attempt to add
27751 call clobbers here.
27753 2021-01-05 Richard Sandiford <richard.sandiford@arm.com>
27755 PR tree-optimization/98371
27756 * tree-vect-loop.c (vect_reanalyze_as_main_loop): New function.
27757 (vect_analyze_loop): If an epilogue loop appears to be cheaper
27758 than the main loop, re-analyze it as a main loop before adopting
27761 2021-01-05 Rainer Orth <ro@CeBiTec.Uni-Bielefeld.DE>
27764 * configure.ac (NETLIBS): Determine using AX_LIB_SOCKET_NSL.
27765 * aclocal.m4, configure: Regenerate.
27766 * Makefile.in (NETLIBS): Define.
27767 (BACKEND): Remove $(CODYLIB).
27769 2021-01-05 Jakub Jelinek <jakub@redhat.com>
27771 PR rtl-optimization/98334
27772 * simplify-rtx.c (simplify_context::simplify_binary_operation_1):
27773 Optimize (X - 1) * Y + Y to X * Y or (X + 1) * Y - Y to X * Y.
27775 2021-01-05 Bernd Edlinger <bernd.edlinger@hotmail.de>
27777 * tree-inline.c (expand_call_inline): Restore input_location.
27778 Return result from recursive call.
27780 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
27782 PR tree-optimization/95401
27783 * config/aarch64/aarch64-sve-builtins.cc
27784 (gimple_folder::load_store_cookie): Use bits rather than bytes
27785 for the alignment argument to IFN_MASK_LOAD and IFN_MASK_STORE.
27786 * gimple-fold.c (gimple_fold_mask_load_store_mem_ref): Likewise.
27787 * tree-vect-stmts.c (vectorizable_store): Likewise.
27788 (vectorizable_load): Likewise.
27790 2021-01-04 Richard Biener <rguenther@suse.de>
27792 PR tree-optimization/98308
27793 * tree-vect-stmts.c (vectorizable_load): Set invariant mask
27796 2021-01-04 Jakub Jelinek <jakub@redhat.com>
27798 PR tree-optimization/95771
27799 * tree-ssa-loop-niter.c (number_of_iterations_popcount): Handle types
27800 with precision smaller than int's precision and types with precision
27801 twice as large as long long. Formatting fixes.
27803 2021-01-04 Richard Biener <rguenther@suse.de>
27805 PR tree-optimization/98464
27806 * tree-ssa-sccvn.c (vn_valueize_for_srt): Rename from ...
27807 (vn_valueize_wrapper): ... this. Temporarily adjust vn_context_bb.
27808 (process_bb): Adjust.
27810 2021-01-04 Matthew Malcomson <matthew.malcomson@arm.com>
27813 * doc/invoke.texi (-fsanitize=address): Fix wording describing
27814 clash with -fsanitize=hwaddress.
27816 2021-01-04 Richard Biener <rguenther@suse.de>
27818 PR tree-optimization/98282
27819 * tree-ssa-sccvn.c (vn_get_stmt_kind): Classify tcc_reference on
27820 invariants as VN_NARY.
27822 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
27825 * config/aarch64/aarch64-simd.md (aarch64_combine<mode>): Accept
27826 aarch64_simd_reg_or_zero for operand 2. Use the combinez patterns
27827 to handle zero operands.
27829 2021-01-04 Richard Sandiford <richard.sandiford@arm.com>
27831 * config/aarch64/aarch64.c (offset_6bit_signed_scaled_p): New function.
27832 (offset_6bit_unsigned_scaled_p): Fix typo in comment.
27833 (aarch64_sve_prefetch_operand_p): Accept MUL VLs in the range
27836 2021-01-04 Richard Biener <rguenther@suse.de>
27838 PR tree-optimization/98393
27839 * tree-vect-slp.c (vect_build_slp_tree): Properly zero matches
27840 when hitting the limit.
27842 2021-01-04 Richard Biener <rguenther@suse.de>
27844 PR tree-optimization/98291
27845 * tree-vect-loop.c (vectorizable_reduction): Bypass
27846 associativity check for SLP reductions with VF 1.
27848 2021-01-04 Jakub Jelinek <jakub@redhat.com>
27850 PR tree-optimization/96782
27851 * match.pd (x == ~x -> false, x != ~x -> true): New simplifications.
27853 2021-01-04 Bernd Edlinger <bernd.edlinger@hotmail.de>
27855 * collect-utils.c (collect_execute): Check dumppfx.
27856 * collect2.c (maybe_run_lto_and_relink, do_link): Pass atsuffix
27857 to collect_execute.
27858 (do_link): Add new parameter atsuffix.
27859 (main): Handle -dumpdir option. Skip one argument for
27860 -o, -isystem and -B options.
27861 * gcc.c (make_at_file): New helper function.
27862 (close_at_file): Use it.
27864 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27866 * config/darwin.h (MIN_LD64_NO_COAL_SECTS): Adjust.
27867 Amend handling for LD64_VERSION fallback defaults.
27869 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27871 * config.gcc: Compute default version information
27872 from the configured target. Likewise defaults for
27874 * config/darwin10.h: Removed.
27875 * config/darwin12.h: Removed.
27876 * config/darwin9.h: Removed.
27877 * config/rs6000/darwin8.h: Removed.
27879 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27881 * config/darwin9.h (ASM_OUTPUT_ALIGNED_COMMON): Delete.
27883 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27885 * config/darwin9.h (STACK_CHECK_STATIC_BUILTIN): Move from here..
27886 * config/darwin.h (STACK_CHECK_STATIC_BUILTIN): .. to here.
27888 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27890 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move from
27892 * config/darwin.h (LINK_GCC_C_SEQUENCE_SPEC): ... to here.
27894 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27896 * config/darwin10.h (LINK_GCC_C_SEQUENCE_SPEC): Move the spec
27897 for the Darwin10 unwinder stub from here ...
27898 * config/darwin.h (LINK_COMMAND_SPEC_A): ... to here.
27900 2021-01-02 Iain Sandoe <iain@sandoe.co.uk>
27902 * config/darwin.h (DSYMUTIL_SPEC): Default to DWARF
27903 (ASM_DEBUG_SPEC):Only define if the assembler supports
27905 (PREFERRED_DEBUGGING_TYPE): Default to DWARF.
27906 (DARWIN_PREFER_DWARF): Define.
27907 * config/darwin9.h (PREFERRED_DEBUGGING_TYPE): Remove.
27908 (DARWIN_PREFER_DWARF): Likewise
27909 (DSYMUTIL_SPEC): Likewise.
27910 (COLLECT_RUN_DSYMUTIL): Likewise.
27911 (ASM_DEBUG_SPEC): Likewise.
27912 (ASM_DEBUG_OPTION_SPEC): Likewise.
27914 2021-01-02 Jan Hubicka <jh@suse.cz>
27916 * cfg.c (free_block): ggc_free bb.
27918 2021-01-01 Jakub Jelinek <jakub@redhat.com>
27920 * gcc.c (process_command): Update copyright notice dates.
27921 * gcov-dump.c (print_version): Ditto.
27922 * gcov.c (print_version): Ditto.
27923 * gcov-tool.c (print_version): Ditto.
27924 * gengtype.c (create_file): Ditto.
27925 * doc/cpp.texi: Bump @copying's copyright year.
27926 * doc/cppinternals.texi: Ditto.
27927 * doc/gcc.texi: Ditto.
27928 * doc/gccint.texi: Ditto.
27929 * doc/gcov.texi: Ditto.
27930 * doc/install.texi: Ditto.
27931 * doc/invoke.texi: Ditto.
27933 2021-01-01 Jakub Jelinek <jakub@redhat.com>
27935 * ChangeLog-2020: Rotate ChangeLog. New file.
27938 Copyright (C) 2021 Free Software Foundation, Inc.
27940 Copying and distribution of this file, with or without modification,
27941 are permitted in any medium without royalty provided the copyright
27942 notice and this notice are preserved.