fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-26 01:38:20 +02:00

Author	SHA1	Message	Date
Caio Oliveira	1ef6415d22	intel/compiler: Remove unused headers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26323>	2023-12-12 19:36:13 +00:00
Ian Romanick	87cdcbd7d7	intel/compiler: Verify that DO is alone in the block Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26439>	2023-12-08 20:21:28 +00:00
Ian Romanick	65237f8bbc	intel/fs: Don't add MOV instructions to DO blocks in combine constants There was a subtle bug related to CFG tracking. Namely, some branch instructions may point only to the block after the DO instruction for the loop. If the MOV instructions are in the DO block, the may not have liveness properly tracked. Like in !25132, having the MOV instructions in blocks that might contain other instructions helps scheduling. shader-db: All Broadwell and newer Intel GPUs had similar results (Ice Lake shown) total cycles in shared programs: 848577248 -> 848557268 (<.01%) cycles in affected programs: 78256396 -> 78236416 (-0.03%) helped: 361 / HURT: 18 fossil-db: All Skylake and newer Intel GPUs had similar results (Ice Lake shown) Totals: Cycles: 15021501924 -> 15021372904 (-0.00%); split: -0.00%, +0.00% Totals from 735 (0.11% of 656080) affected shaders: Cycles: 676429502 -> 676300482 (-0.02%); split: -0.02%, +0.00% Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26439>	2023-12-08 20:21:28 +00:00
Rohan Garg	db6aaa691d	intel/compiler: infer the number of operands using lsc_op_num_data_values nir_emit_global_atomic should utilize lsc_op_num_data_values to infer the number of operands for global atomic ops, following the same pattern as nir_emit_surface_atomic Fixes: `90a2137` ('intel/compiler: Use LSC opcode enum rather than legacy BRW_AOPs') Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26432>	2023-12-07 14:40:24 +00:00
Rohan Garg	46d98a71ef	intel/compiler: use the proper enum type to store the op Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26432>	2023-12-07 14:40:24 +00:00
Yonggang Luo	72e30c8853	treewide: Avoid use align as variable, replace it with other names align is a function and when we want use it, the align variable will shadow it So replace it with other names Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25997>	2023-12-07 02:30:53 +00:00
Faith Ekstrand	09fc5e1c4d	nir: Split has_[su]dot_4x8 bits into regular and _sat versions Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26533>	2023-12-06 23:15:33 +00:00
Faith Ekstrand	e3ff5a3b0e	intel/vec4: Use MESA_PRIM_* instead of GL_* Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24821>	2023-12-05 23:12:32 +00:00
Caio Oliveira	d9565a0e66	intel/compiler: Remove the linking step in intel_clc A previous patch already removed individual compilation of the inputs, by simply concatenating the files. This patch removes the linking of the remaining single object that's compiled. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26458>	2023-12-05 11:48:25 +00:00
Caio Oliveira	d9e49ce194	intel/compiler: Fix memory leaks in intel_clc Avoids failures when using Address Sanitizer. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26458>	2023-12-05 11:48:25 +00:00
Caio Oliveira	db9111bb87	intel/compiler: Use single variable instead of dynarray A previous change concatenated multiple SPIR-V inputs to be compiled together, so we have a single clc_binary to work on. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26458>	2023-12-05 11:48:25 +00:00
Caio Oliveira	73276c1ece	intel/compiler: Refactor program exit in intel_clc Move the clean up code (at the moment just ralloc_ctx) into a single place at the end. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26458>	2023-12-05 11:48:25 +00:00
Jordan Justen	064bdecb36	intel/compiler: Define XE2 compiler enum Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26390>	2023-12-01 02:36:12 +00:00
Caio Oliveira	bbb12dbbf9	intel/compiler: Add a few tests to opt_predicated_break v2 (idr): Fix expectations BottomBreakWithContinue. opt_predicated_break will remove the IF and make the CONTINUE predicated. v3 (idr): Temporarily disable the one test that fails. v4 (idr): Free strings allocated by open_memstream. Fixes gitlab CI failures in debian-testing-asan. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Caio Oliveira	0b072c5351	intel/compiler: Sort lists of succs and preds in CFG dump output Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Caio Oliveira	47c5656f0e	intel/compiler: Allow dumping CFG to a specific FILE* Add optional argument for both cfg and block dump() function to pass a FILE*. Default behavior remains dumping to stderr. v2 (idr): Don't add the new test framework in this commit. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Caio Oliveira	21cf9323f0	intel/compiler: Add a few more helpers to fs_builder Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Ian Romanick	c0ecc0d70b	intel/compiler: Don't promote CFG link types when removing a block Imagine 3 blocks A, B, and C. A has a physical link to B, and B has a logical link to C. Previous to this commit, if B were removed, A would get a logical link to C. This is not correct. This was specifically observed to occur when block A was a DO block and B was the WHILE block. The DO block would have two logical successors, and that is completely invalid. v2: Assert that the links from A-to-B and B-back-to-A are the same kind. Suggested by Caio. v3: Assume the successor and predecessor lists are well formed. Use this to simplify the logic. Suggested by Caio. Add checks to cfg_t::validate to ensure the lists are well formed. v4: Remove (now unused) bblock_link_invalid. Suggested by Curro. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Ian Romanick	77c0c1ce54	intel/compiler: Don't create extra CFG links when deleting a block The previous is_successor_of and is_predecessor_of checks prevented creating a physical link when a logical link already existed. However, a logical link could be added when a physical link already existed. This change causes an existing physical link to be "promoted" to a logical link. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Ian Romanick	7e842a75ac	intel/compiler: Don't create extra CFG links in opt_predicated_break Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Ian Romanick	bbd7729993	intel/compiler: Delete bidirectional block links in opt_predicated_break Previously when earlier_block->children.make_empty() was called, the child blocks would still have links back to earlier_block. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Ian Romanick	5842829380	intel/compiler: Limit scope of cur_endif variable Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Ian Romanick	02f9bbf6f3	intel/compiler: Add basic CFG validation v2: Use _mesa_shader_stage_to_abbrev(stage) instead of stage_abbrev. Noticed by Caio and GCC. That's what I get for not recompiling after rebasing. Wrap cfg_t::validate in NDEBUG magic. Suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Ian Romanick	19db6f1cd9	intel/vec4: Don't emit an empty ELSE This matches the behavior of fs_visitor::nir_emit_if. This is not technically wrong, but the cfg_t generates some invalid parent / child links in this case. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25216>	2023-11-30 20:58:05 +00:00
Caio Oliveira	5de5a0d475	intel/compiler: Don't use fs_visitor::bld in thread payload classes Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26301>	2023-11-28 19:53:51 +00:00
Caio Oliveira	2d6240ab14	intel/compiler: Don't use fs_visitor::bld in fs_reg_alloc Just set up the builder without relying on the pre-existing one. Moves one step close to remove bld from fs_visitor. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26301>	2023-11-28 19:53:51 +00:00
Caio Oliveira	f55867b56c	intel/compiler: Don't use fs_visitor::bld in tests Tests create their own fs_builder now. Moves one step closer to remove bld from fs_visitor. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26301>	2023-11-28 19:53:51 +00:00
Caio Oliveira	9540259e1c	intel/compiler: Prefer ctor/dtors in some Google Tests Per Google Test FAQ recommendation, prefer consutrctors and destructors unless there's a need to use SetUp/TearDown. We will take advantage of this later to initialize an fs_builder. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26301>	2023-11-28 19:53:51 +00:00
Lionel Landwerlin	e22e88f8ce	intel/fs: reuse set_predicate() Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26306>	2023-11-28 13:40:07 +00:00
Lionel Landwerlin	83a1657b6c	intel/fs: fix incorrect register flag interaction with dynamic interpolator mode Once NIR code is lowered and a few optimization passes have run, there might be flag register interactions between instructions quite far away from one another. In the following case : f0 = and r0, r1 ... fs_interpolate r2, r3 ... if f0 ... endif If we lower fs_inteporlate while using the f0 register, we completely garble the value meant for the if block. To fix this, emit the predication for fs_interpolate in brw_fs_nir.cpp when doing the NIR translation to the backend IR. This will guarantee that the flag register interactions are visible to the optimization passes, avoiding the problem above. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `68027bd38e` ("intel/fs: implement dynamic interpolation mode for dynamic persample shaders") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9757 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26306>	2023-11-28 13:40:07 +00:00
Daniel Schürmann	1179d83a89	nir: remove info.fs.needs_all_helper_invocations Use info.uses_wide_subgroup_intrinsics instead. Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26026>	2023-11-22 11:31:11 +01:00
Caio Oliveira	e8220b9319	intel/compiler: Simplify allocation of NIR related arrays Those are not reused, so this will be the first and only allocation, so no need to use the "realloc" variants. For the fs_reg arrays, there's currently no particular reason to keep them uninitialized, so zero-initialize them too -- not ideal but better than random values. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26302>	2023-11-21 18:31:05 +00:00
Lionel Landwerlin	4eb4197d27	intel/nir/rt: fix reportIntersection() hitT handling We're currently updating the hitT value in the traversal result with the hitT value from reportIntersection(), but this is not correct. First the hitT value of reportIntersection() should update the gl_RayTmaxEXT value (maps to brw_nir_rt_mem_ray_defs::t_far). Second the hitT determined by traversal should only be updated if the reportIntersection() hitT value has updated the gl_RayTmaxEXT and that the new gl_RayTmaxEXT is smaller than the determined hitT value from traversal. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Fixes: `303378e1dd` ("intel/rt: Add lowering for combined intersection/any-hit shaders") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25146>	2023-11-17 07:06:30 +00:00
Lionel Landwerlin	6dbb5f1e07	intel/fs: rerun divergence analysis prior to convert_from_ssa Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9964 Cc: mesa-stable Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26235>	2023-11-17 06:40:49 +00:00
Rhys Perry	f695a9fed2	intel/compiler: use nir_lower_fp16_casts Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25566>	2023-11-16 11:02:31 +00:00
Lionel Landwerlin	295734bf88	intel/fs: fix residency handling on Xe2 We're missing a few reg_unit() scaling when dealing with residency data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26208>	2023-11-15 20:06:12 +00:00
Faith Ekstrand	80376146ed	nak: Encode program headers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24998>	2023-11-14 00:48:06 +00:00
Caio Oliveira	dcb68de656	intel/compiler: Clear up block instructions before re-adding them Avoids fixing up list pointers that we don't care about anymore -- since all the instructions will be re-added in a different order anyway. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	a9f95bf687	intel/compiler: Reuse same scheduler for all pre-RA scheduling modes Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	0dd5378ffe	intel/compiler: Make scheduler classes take an external mem_ctx Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	04aa2df461	intel/compiler: Separate schedule_node temporary data Some fields in schedule_node will need to be reset each time they are used. The `cand_generation` needs to be back to zero, and both `unblocked_time` and `parent_count` need to be back to their initial values, which were pre-calculated. Rename the initial data fields and add new ones for the temporary data. Note the helper function is `per node` to allow it "tag along" with an existing loops. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	81594d0db1	intel/compiler: Move earlier scheduler code that is not mode-specific This will be useful later on when we reuse the same scheduler for multiple modes. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	73d4e4118a	intel/compiler: Tidy up code in scheduler related to reads_remaining - Just assert in functions we expect it to exist - Predicate usage with `!post_reg_alloc` to avoid suggest there are more combinations. - Reuse an existing loop to call the count function. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	4f246cf4e7	intel/compiler: Merge child/latency arrays in schedule_node Values are used together, saves one pointer in schedule_node, reduces amount of reallocations when children count grows. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	e59a054203	intel/compiler: Move FS specific fields to fs_instruction_scheduler Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	a6297d05ca	intel/compiler: Remove virtual calls from scheduler Pull run() and schedule_instructions() for fs, and pull a very simplified version of those into a run() for vec4. Because of the previous patches the duplication is small. Since we are touching these, change run() implementations to use the cfg from the existing reference to the visitor/shader instead of taking one as argument. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	d76d58cf50	intel/compiler: Cache issue_time information Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	ecd7ffcf78	intel/compiler: Extract scheduling related basic functions Those will be used in multiple places later. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	8a8dd2db0c	intel/compiler: Add only available instructions to scheduling list The list was used for iterating through all instructions and then later also to track the available ones. Now that the array iteration is used, change how we fill it and rename it to reflect its only job. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00
Caio Oliveira	ddff6428c5	intel/compiler: Use array to iterate the scheduler nodes For all the preparation data collection before the scheduling actually happens, it is possible to walk the schedule nodes in order by iterating on the range of the array dedicated to a given block. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25841>	2023-11-13 23:05:47 +00:00

... 9 10 11 12 13 ...

3369 commits