fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 13:58:04 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	08bfcc12d4	radv: rename radv_pipeline_stage to radv_shader_stage It's more generic and it will fit shader object just well. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	090d88247d	radv: cleanup pipeline compute emit helpers Merge both functions together and rename the function. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	fdec88bd7c	radv: rework determining the NGG stage without a graphics pipeline Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	174816019f	radv: simplify lowering NGG GS intrinsics The is_ngg field is already set correctly for GS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	70dbe011bb	radv: rename graphics pipeline linking helpers There is no pipeline dependency. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	697d4d4b03	radv: move removing all varyings when the FS is a noop This allows us to remove one more pipeline dependency. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	5da9f38c53	radv: stop passing radv_graphics_pipeline to radv_fill_shader_info() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	a7fdcc3b22	radv: rework considering force VRS without relying on graphics pipeline Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	9d89b29a80	radv: set next_stage to MESA_SHADER_NONE if there is no FS This follows the same convention as shader object where the last stage would have nextStage to 0. This will allow more refactoring. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Samuel Pitoiset	b250efa714	radv: initialize stage/next_stage earlier This will allow more refactoring. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24313>	2023-07-26 07:44:49 +00:00
Lionel Landwerlin	d62e494b37	intel/vec4: fix log_data pointer Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `3384f029be` ("intel/compiler: rework input parameters") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9421 Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24307>	2023-07-26 06:36:18 +00:00
Yonggang Luo	6e43618b82	ac: Switch to use nir_foreach_function_impl in function analyze_shader_before_culling Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23940>	2023-07-26 03:43:40 +00:00
Yonggang Luo	a606074a7a	radeonsi: Convert to use nir_foreach_function_impl Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23940>	2023-07-26 03:43:40 +00:00
Yonggang Luo	3f7a3a6698	microsoft/clc/compiler: Convert to use nir_foreach_function_impl when possible Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23940>	2023-07-26 03:43:40 +00:00
Yonggang Luo	d5baad2afa	microsoft/compiler: convert to use nir_foreach_function_with_impl in function emit_module Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23940>	2023-07-26 03:43:40 +00:00
Rebecca Mckeever	87109c3e1b	vulkan/runtime: Add helper functions for VK_EXT_host_image_copy Add helper functions vk_memory_to_image_copy_layout() and vk_image_to_memory_copy_layout(), which will be useful in VK_EXT_host_image_copy implementations. vk_memory_to_image_copy_layout() is similar to vk_image_buffer_copy_layout(), except the second parameter is VkMemoryToImageCopyEXT instead of VkBufferImageCopy2. vk_image_to_memory_copy_layout() is similar to vk_image_buffer_copy_layout(), except the second parameter is VkImageToMemoryCopyEXT instead of VkBufferImageCopy2. Signed-off-by: Rebecca Mckeever <rebecca.mckeever@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24290>	2023-07-25 23:34:02 +00:00
Karol Herbst	2388f22a5e	gm107/ir: fix SULDP for loads without a known format Signed-off-by: Karol Herbst <git@karolherbst.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24312>	2023-07-25 23:15:41 +00:00
Iván Briano	377c2a045f	intel/compiler: call brw_nir_adjust_payload from brw_postprocess_nir Calling anything after nir_trivialize_registers() risks undoing some of its work. In this case, brw_nir_adjust_payload() will do a constant folding pass if any payload adjusting happened, and that can turn a bunch of @store_regs into basically noops. Fixes dEQP-VK.subgroups.*task Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24325>	2023-07-25 22:48:09 +00:00
Ian Romanick	cb0de0a1d3	intel/fs: Constant fold OR and AND The path taken in fs_visitor::swizzle_nir_scratch_addr for DG2 generates some AND and OR instructions before the SHL. This commit folds those so the whold calculation becomes a constant (like on older platforms). v2: Fix return type of src_as_uint. Noticed by Marcin. shader-db results: DG2 total instructions in shared programs: 23190475 -> 23179540 (-0.05%) instructions in affected programs: 36026 -> 25091 (-30.35%) helped: 7 / HURT: 0 total cycles in shared programs: 841196807 -> 841142563 (<.01%) cycles in affected programs: 1660670 -> 1606426 (-3.27%) helped: 7 / HURT: 0 No shader-db changes on any older Intel platforms. fossil-db results: DG2 Totals: Instrs: 197780372 -> 197773966 (-0.00%) Cycles: 14066410782 -> 14066399378 (-0.00%); split: -0.00%, +0.00% Subgroup size: 8438104 -> 8438112 (+0.00%) Send messages: 8049445 -> 8049446 (+0.00%) Scratch Memory Size: 14263296 -> 14264320 (+0.01%) Totals from 9 (0.00% of 668055) affected shaders: Instrs: 24547 -> 18141 (-26.10%) Cycles: 1984791 -> 1973387 (-0.57%); split: -0.98%, +0.40% Subgroup size: 88 -> 96 (+9.09%) Send messages: 867 -> 868 (+0.12%) Scratch Memory Size: 69632 -> 70656 (+1.47%) No fossil-db changes on any older Intel platforms. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23884>	2023-07-25 22:11:21 +00:00
Ian Romanick	61c786bad5	intel/fs: Constant fold SHL This is a modified version of a commit originally in !7698. This version add the changes to brw_fs_copy_propagation. If the address passed to fs_visitor::swizzle_nir_scratch_addr is a constant, that function will generate SHL with two constant sources. DG2 uses a different path to generate those addresses, so the constant folding can't occur there yet. That will be addressed in the next commit. What follows is the commit change history from that older MR. v2: Previously this commit was after `intel/fs: Combine constants for integer instructions too`. However, this commit can create invalid instructions that are only cleaned up by `intel/fs: Combine constants for integer instructions too`. That would potentially affect the shader-db results of each commit, but I did not collect new data for the reordering. v3: Fix masking for W/UW and for Q/UQ types. Add an assertion for !saturate. Both suggested by Ken. Also add an assertion that B/UB types don't matically come back. v4: Fix sources count. See also `ed3c2f73db` ("intel/fs: fixup sources number from opt_algebraic"). v5: Fix typo in comment added in v3. Noticed by Marcin. Fix a typo in a comment added when pulling this commit out of !7698. Noticed by Ken. shader-db results: DG2 No changes. Tiger Lake, Ice Lake, and Skylake had similar results (Ice Lake shown) total instructions in shared programs: 20655696 -> 20651648 (-0.02%) instructions in affected programs: 23125 -> 19077 (-17.50%) helped: 7 / HURT: 0 total cycles in shared programs: 858436639 -> 858407749 (<.01%) cycles in affected programs: 8990532 -> 8961642 (-0.32%) helped: 7 / HURT: 0 Broadwell and Haswell had similar results. (Broadwell shown) total instructions in shared programs: 18500780 -> 18496630 (-0.02%) instructions in affected programs: 24715 -> 20565 (-16.79%) helped: 7 / HURT: 0 total cycles in shared programs: 946100660 -> 946087688 (<.01%) cycles in affected programs: 5838252 -> 5825280 (-0.22%) helped: 7 / HURT: 0 total spills in shared programs: 17588 -> 17572 (-0.09%) spills in affected programs: 1206 -> 1190 (-1.33%) helped: 2 / HURT: 0 total fills in shared programs: 25192 -> 25156 (-0.14%) fills in affected programs: 156 -> 120 (-23.08%) helped: 2 / HURT: 0 No shader-db changes on any older Intel platforms. fossil-db results: DG2 Totals: Instrs: 197780415 -> 197780372 (-0.00%); split: -0.00%, +0.00% Cycles: 14066412266 -> 14066410782 (-0.00%); split: -0.00%, +0.00% Totals from 16 (0.00% of 668055) affected shaders: Instrs: 16420 -> 16377 (-0.26%); split: -0.43%, +0.17% Cycles: 220133 -> 218649 (-0.67%); split: -0.69%, +0.01% Tiger Lake, Ice Lake and Skylake had similar results. (Ice Lake shown) Totals: Instrs: 153425977 -> 153423678 (-0.00%) Cycles: 14747928947 -> 14747929547 (+0.00%); split: -0.00%, +0.00% Subgroup size: 8535968 -> 8535976 (+0.00%) Send messages: 7697606 -> 7697607 (+0.00%) Scratch Memory Size: 4380672 -> 4381696 (+0.02%) Totals from 6 (0.00% of 662749) affected shaders: Instrs: 13893 -> 11594 (-16.55%) Cycles: 5386074 -> 5386674 (+0.01%); split: -0.42%, +0.43% Subgroup size: 80 -> 88 (+10.00%) Send messages: 675 -> 676 (+0.15%) Scratch Memory Size: 91136 -> 92160 (+1.12%) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23884>	2023-07-25 22:11:21 +00:00
Ian Romanick	56e6186dcf	intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress opt_copy_propagation can create invalid instructions like shl(8) vgrf96:UD, 2d, 8u These instructions will be cleaned up by opt_algebraic. The irony is opt_algebraic converts these to simple mov instructions that opt_copy_propagation should clean up. I don't think we want a loop like do { progress = false; if (OPT(opt_copy_propagation)) { OPT(opt_algebraic); OPT(dead_code_eliminate); } } while (progress); But maybe we do? Maybe this would be sufficient: while (OPT(opt_copy_propagation)) OPT(opt_algebraic); OPT(dead_code_eliminate); No shader-db or fossil-db changes (yet) on any Intel platform. This is expected. v2: Do opt_algebraic immediately after every call to opt_copy_propagation instead of being clever. Suggested by Lionel. Tested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23884>	2023-07-25 22:11:21 +00:00
Emma Anholt	d089272fc0	ci/a5xx: Add another GPU hanging piglit test to the skips. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23485>	2023-07-25 21:29:33 +00:00
Collabora's Gfx CI Team	2f834340a6	Uprev Piglit to ed58dfbd12be34fa3dab97a7a2987b890e0637f1 `5036601c43...ed58dfbd12` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23485>	2023-07-25 21:29:33 +00:00
Emma Anholt	65ff9f0a55	tu: Fix data race in userspace VMA management. The sequence was two threads A and B on a shared VkDevice: A: move a BO to zombie VMA list A: drop the BO VMA lock B: prepare to allocate a BO B: Lock BO VMA lock B: call tu_free_zombie_vma_locked() B: close the gem handle from the VMA list B: Drop BO VMA lock B: allocate a BO, getting the recently-closed handle back. B: initialize the BO struct for the new handle. A: memset the BO struct to 0. Multithreading in C is the worst. Closes: #9049, #9247 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24324>	2023-07-25 13:34:25 -07:00
José Roberto de Souza	3efba1e2e9	iris: Request Xe KMD to place BOs to CPU visible VRAM when required This is required to support discrete GPUs placed in systems with large PCI bar or resizeble PCI bar not available or disabled. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23781>	2023-07-25 19:33:16 +00:00
José Roberto de Souza	f59d272e93	anv: Request Xe KMD to place BOs to CPU visible VRAM when required This is required to support discrete GPUs placed in systems with large PCI bar or resizeble PCI bar not available or disabled. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23781>	2023-07-25 19:33:16 +00:00
José Roberto de Souza	f9fcd7168a	intel/dev/xe: Add support for small-bar setups This adds support for discrete GPUs placed in systems with large PCI bar or resizeble PCI bar not available or disabled. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23781>	2023-07-25 19:33:15 +00:00
José Roberto de Souza	a8279d37ec	intel: Sync xe_drm.h Sync with commit aef50195664a ("drm/xe/uapi: add the userspace bits for small-bar") Link: https://patchwork.freedesktop.org/series/115515/ Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23781>	2023-07-25 19:33:15 +00:00
Emma Anholt	a3e3609590	ci/tu: Drop some xfails for !24086 Fixes: `99e58460ef` ("tu: Fix zombie VMAs array not initialized when first BOs may be freed") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24322>	2023-07-25 18:53:16 +00:00
Emma Anholt	1d97838871	ci/tu: Mark descriptor_buffer.basic.limits as failing in gmem too. Noticed in a full run. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24322>	2023-07-25 18:53:16 +00:00
Emma Anholt	b05d640b95	ci/tu: Add more crash cases for the multithreading bugs caught on a630. Weirdly, we don't see this group on a618. Different CPU timings/core counts just getting unlucky? Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24322>	2023-07-25 18:53:16 +00:00
Alyssa Rosenzweig	6619317172	nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP Just drop the store. Written while debugging dEQP-VK.pipeline.monolithic.logic_op.r8_uint.no_op. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24252>	2023-07-25 18:03:57 +00:00
Alyssa Rosenzweig	9c0740211d	nir/lower_blend: Fix 32-bit logicops nir_const_value_for_int asserts signed bounds on the input, but we pass in an unsigned value that would be out-of-bounds for 32-bit channels, causing the assert to fail for 32-bit channel formats. Fixes dEQP-VK.pipeline.monolithic.logic_op.r32_uint.* on AGXV (and probably PanVK). Fixes: `dbd0615e7a` ("nir/lower_blend: Avoid useless iand with logic ops") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24252>	2023-07-25 18:03:57 +00:00
Alyssa Rosenzweig	b010b6f691	panfrost: Disable blending for no-op logic ops Prevents regression from the series, since we don't support empty blend shaders. This could be fixed more generically but I'm not inclined to compile more blend shaders than needed so shrug. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24252>	2023-07-25 18:03:57 +00:00
Karol Herbst	2d902dbf02	rusticl: fix warnings with newer rustc Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24315>	2023-07-25 17:12:40 +00:00
Faith Ekstrand	94f36cfaa3	intel/fs: Assume NIR is in SSA form Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24310>	2023-07-25 16:25:11 +00:00
Faith Ekstrand	965bbe5286	intel/fs: Rework the overlapping mov/vec case Now that we're using load/store_reg intrinsics, the previous checks for registers aren't what we want. Instead, we need to be looking for a mov or vec where both the destination and a source are load/store_reg with matching decl_reg. Fixes: `b8209d69ff` ("intel/fs: Add support for new-style registers") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24310>	2023-07-25 16:25:11 +00:00
Faith Ekstrand	45ee952efb	intel/fs: Use write masks from store_reg intrinsics Fixes: `b8209d69ff` ("intel/fs: Add support for new-style registers") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24310>	2023-07-25 16:25:10 +00:00
Faith Ekstrand	d89ca14e71	broadcom/compiler: Convert to new-style NIR registers Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Faith Ekstrand	355afc92d1	nir/schedule: Support load/store_reg These are tracked the same way as register reads and writes, allowing them to be re-arranged as long as they respect dependencies within the same reg. Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Faith Ekstrand	6908814d46	vc4: Convert to new-style NIR registers Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Iago Toral Quiroga	dff85b6163	nir/trivialize: Move decl_reg to the start of the block This makes it so we never find a reg_decl in between a reg_store and the def for its value, which helps avid inserting copy movs. Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Alyssa Rosenzweig	0655bada4b	nir/trivialize: Handle more RaW hazards Consider the snippet of NIR: div 32 %447 = @load_reg (%442) (base=0, legacy_fabs=0, legacy_fneg=0) div 32 %463 = @load_reg (%442) (base=0, legacy_fabs=0, legacy_fneg=0) con 32 %409 = iadd %17 (0x3), %447 @store_output (%182 (0x601), %463) (base=0, wrmask=x, component=0, src_type=invalid... @store_reg (%409, %442) (base=0, wrmask=x, legacy_fsat=0) The load_reg's are trivial, so the %442 read will get folded into store_output. But under the old definition, the store_reg is also trivial so it gets folded into the iadd... causing a read-after-write hazard and invalid code generation. The fix is to amend our definition of store_reg triviality to account for loads getting folded in. It's not good enough that there's no intervening load_reg, there can also be no intervening source that gets chased to a load_reg. Handle that case as well. Identified in dEQP-VK.geometry.input.basic_primitive.triangles_adjacency on V3DV. Fixes: `d313eba94e` ("nir: Add pass for trivializing register access") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reported-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Faith Ekstrand	f8b69abbd4	nir/trivialize: Trivialize cross-block loads In order for a register load to be trivial, it cannot be used in any block other than the one in which it is loaded. We're not currently explicitly doing anything to ensure this invariant holds. It may be that it holds regardless but I couldn't find any documented reason why it should so let's explicitly handle that case. Worst case, the newly added code does nothing. Fixes: `d313eba94e` ("nir: Add pass for trivializing register access") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Faith Ekstrand	f1f05cc7cf	nir/trivialize: Maintain divergence information Because this pass is intended to be run after out-of-SSA and directly before injesting the NIR into the back-end, it may come after divergence analysis and needs to preserve the divergence information. Fortunately, since all we ever do is insert nir_op_mov, this is easy. Fixes: `d313eba94e` ("nir: Add pass for trivializing register access") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Faith Ekstrand	4fd257d20f	nir: Properly handle divergence for load_reg This commit makes three changes: 1. Default all newly created registers divergent because this is the safer default. 2. Make divergence analysis do something sane with register divergence. It's not perfect because divergence analysis isn't able to prove registers divergent based on stores but at least if someone uses registers a bit they'll end up with safe defaults. This matches what they'd get with nir_ssa_def_init(). 3. Make the load_reg() helper automatically propagate divergence from the register. Because the defaults for both nir_ssa_def_init() and nir_decl_reg() are to mark everything divergent, this only means that nir_load_reg() of a uniform reg is now uniform. Putting all these together, nir_from_ssa should now be producing load_reg intrinsics with the proper uniform information. Fixes: `7229bffcb1` ("nir: Add intrinsics for register access") Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Alyssa Rosenzweig	91c3ee2412	pan/bi: Remove leftover include Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24153>	2023-07-25 15:36:52 +00:00
Marcin Ślusarz	4f1125e4ae	intel/compiler/test: fix crashes when TEST_DEBUG is set Dumping instructions requires that ISA info is not empty. Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24274>	2023-07-25 15:13:29 +00:00
Yonggang Luo	23a2b83639	lavapipe: fixes indent of function lvp_inline_uniforms The indent fixes are in separate patch is for easier to review Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24316>	2023-07-25 12:09:07 +00:00
Yonggang Luo	b4ed366d6b	lavapipe: Convert to use nir_foreach_function_impl Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/24316>	2023-07-25 12:09:07 +00:00

1 2 3 4 5 ...

174776 commits