fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-04 17:00:39 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	ffc798c364	anv: move CreateComputePipelines to common code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	2c816b4f2e	anv: move CreateGraphicsPipelines to common code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	1ba89d35ab	anv: rename internal function for consistency Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Jason Ekstrand	cb682a1cdd	anv: Don't use the wrong ARRAY_SIZE Even though this doesn't change anything, it's not good to use an ARRAY_SIZE for one array to iterate over another. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	f66192a4b3	anv: split graphics nir loading Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	36aa0f668f	anv: break up anv_pipeline_compile_graphics() This function is pretty overwhelming. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	c806d1e5ed	anv: simplify dynamic buffer count in pipeline layout anv_descriptor_set_layout already has the information we're gathering here. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Lionel Landwerlin	5b561b501a	anv: remove local computation of dynamic states This bit mask is already computed in anv_graphics_pipeline::dynamic_states in anv_graphics_pipeline_init(). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17601>	2022-07-19 02:36:09 +00:00
Emma Anholt	94bd06256a	intel/fs: Simplify brw_barycentric_mode() args. Reduce a bit of mode lookup noise I was tracing through trying to resolve the previous bug. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17381>	2022-07-19 01:25:47 +00:00
Lionel Landwerlin	2d1f021e16	intel/fs: Set NonPerspectiveBarycentricEnable when the interpolator needs it. [anholt: changed to make all drivers do the right thing by moving the payload barycentric check into the compiler] Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17381>	2022-07-19 01:25:47 +00:00
Jason Ekstrand	cb6375d40c	anv: Stop compacting surface state tables Instead of trying to compact the surface state table to get rid of any unused render targets, emit MAX(1, colorAttachmentCount) surface states always. This ensures that secondaries will always match with primaries when we go to do the copy since there's no rule requiring the secondary to have VK_FORMAT_UNDEFINED when the primary has a NULL image view. Fixes: `3501a3f9ed` ("anv: Convert to 100% dynamic rendering") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17543>	2022-07-14 20:36:24 +00:00
Lionel Landwerlin	2cac3b3817	anv: ensure tile flush before streamout writes Streamout is not L3 coherent so previous writes to the same address might be pending and overwrite the SO writes later when they get flushed from L3, even though the SO write happened later in the batch. v2: Use the right flag (not COUNTER) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6680 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17503>	2022-07-14 18:28:52 +00:00
Jordan Justen	4246a1ff47	intel/compiler: Don't create vec4 reg-set for gen8+ After `60e1d0f028`, we know that vec4 will never be used for gen >= 8. Ref: `60e1d0f028` ("intel/compiler: Remove INTEL_SCALAR_... env variables") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17437>	2022-07-14 17:49:01 +00:00
Jason Ekstrand	3346f6918f	intel/fs,anv: Rework handling of coarse and sample shading Now that this information is accurately gathered by spirv_to_nir, we no longer need the hack. We just need to fix up the way we handle some of the key bits. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	d0b154319d	intel/fs: Simplify persample_dispatch Thanks to the previous commit, we no longer need this check. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	fd17aaf430	intel/fs: Use nir_lower_single_sampled This lets us drop demote_sample_qualifiers as well as a back-end check for key->multisample_fbo. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	ca9f0f72db	intel/fs: Use shader_info::fs::uses_sample_shading NIR constructs this information for us as part of nir_gather_info these days so we can simplify our logic a bit. This will also let us be more correct once we move uses_sample_shading scraping earlier. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Jason Ekstrand	9d438799c8	intel/blorp: Set uses_sample_shading for MSAA blit shaders Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14020>	2022-07-13 20:28:42 +00:00
Adam Jackson	768238fdc0	glx: Fix drawable refcounting for naked Windows driFetchDrawable is only ever called from the MakeCurrent path, which means it has to handle the case of pre-GLX-1.3 Windows being named as the drawable. When it finds the drawable in the hash, it increments its refcount before returning it, so for a GLXWindow it would be 2 on first return, one from glXCreateWindow and one from glXMakeCurrent. But when it does not find the drawable and creates one for the naked Window, the reference count on first return would only be 1. As a result, if this context was then ever bound to a different drawable, the old Window's DRI drawable state (like the back buffer) would be destroyed. Fixes piglit's glx-multi-window-single-context and glx-make-current for a variety of drivers. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6713 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17479>	2022-07-13 12:25:30 -04:00
Marcin Ślusarz	585d81e3ec	intel/compiler: print shaders after nir_remove_unused_varyings Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17516>	2022-07-13 15:50:02 +00:00
Chuansheng Liu	39f8c61f32	iris,anv: correct the max thread number for DG2+ Correct the max thread number for DG2+ platforms according to below bspec. Ref: Bspec: 47202 Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Chuansheng Liu <chuansheng.liu@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17506>	2022-07-13 08:11:19 +00:00
Lionel Landwerlin	3a8ad28524	anv: skip flush/invalidate faster Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17481>	2022-07-13 01:33:27 +00:00
Lionel Landwerlin	1aeb11cde1	intel: protect against empty invalidate ranges It's legal for an application to call vkInvalidateMappedMemoryRanges() / vkFlushMappedMemoryRanges() with zero sized ranges. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b91971c240` ("anv: use the right helper to invalidate memory") Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6852 Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17481>	2022-07-13 01:33:27 +00:00
Lionel Landwerlin	af1ecbeb0a	anv: add a comment about handling buffer view swizzles on gfx7 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17385>	2022-07-13 00:45:36 +00:00
Lionel Landwerlin	a9edc268b9	anv: validate image view lowered storage formats for storage Ensure that if we have swizzle on the initial format, that the component bits are identical with the lowered format. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17385>	2022-07-13 00:45:36 +00:00
Lionel Landwerlin	57a8efa222	anv: deal with isl format swizzles for buffer views For some formats like VK_FORMAT_B5G6R5_UNORM_PACK16, we have no direct matching HW format. We can support it by swizzling. We already apply those swizzles for image views. We just forgot to deal with buffer views. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6235 Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17385>	2022-07-13 00:45:36 +00:00
Eric Engestrom	0225293a97	anv: use updated tokens from vk.xml Signed-off-by: Eric Engestrom <eric@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17342>	2022-07-12 15:53:11 +00:00
Jason Ekstrand	f336a4f9b3	anv: Init supported extensions before WSI Otherwise WSI extension support checks won't work. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17436>	2022-07-12 14:14:01 +00:00
Lionel Landwerlin	e082b26458	anv: move gfx12+ CCS_E clear color restriction Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17329>	2022-07-11 14:57:26 +00:00
Lionel Landwerlin	682383e5b3	anv: track if images can be fast clear with non-zero color Because clear colors are stored as 4 32bit component values, there is an issue if you, for instance: - clear in R16G16_UNORM - draw in R32_UINT. The clear will use 2 components of the clear color in dword0 & dword1, while the draw will use only one component, in dword0. This change uses the mutable format information to track whether clear colors can be non-zero for fast clears. With: - non mutable formats, we can fast clear with any color on Gfx > 8 - mutable formats with incompatible component sizes, we can only fast clear with 0 color Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5930 Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17329>	2022-07-11 14:57:26 +00:00
Lionel Landwerlin	ab4beaf3fb	isl: add new helper for format component compatibility Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17329>	2022-07-11 14:57:26 +00:00
Kenneth Graunke	c08f51d0cf	intel/compiler: Don't set SBID on EOT send messages The send message with EOT is the last instruction so there's no need to set any SWSB annotations for future instructions. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17202>	2022-07-09 05:26:25 +00:00
Jason Ekstrand	90114fb034	anv: Implement VK_EXT_shader_module_identifier Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Jason Ekstrand	530de844ef	intel,anv,iris,crocus: Drop subgroup size from the shader key Use nir->info.subgroup_size instead. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Jason Ekstrand	e9b2862c1a	anv: Use vk_pipeline_shader_stage_is_null() Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Jason Ekstrand	62915eb4fe	anv: Use vk_pipeline_shader_stage_to_nir Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Jason Ekstrand	c2b3d9ca2b	anv: Put a VkPipelineShaderStageCreateInfo* in anv_pipeline_stage It's an entirely temporary struct used by the compile process and never escapes vkCreate*Pipelines so it's safe to just stuff the pointer in there. This makes it easier to use some of our new helpers. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Jason Ekstrand	56b815e91d	anv: Drop unnecessary parameters to anv_pipeline_compile_cs Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Jason Ekstrand	e1ee201722	shader_info: Move subgroup_size out of cs and make it an enum Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17337>	2022-07-08 22:47:22 +00:00
Ian Romanick	bbcb881f46	intel/fs: Remove non-_LOGICAL URB messages The _LOGICAL versions are lowered direct to SEND, so nothing can ever generate these messages. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	bdc7668008	intel/fs: Lower URB messages to SEND Before rebasing on top of Ken's split-SEND optimization (see !17018), this commit just caused some scheduling changes in various tessellation and geometry shaders. These changes were caused by the addition of real latency information for the URB messages. With the addition of the split-SEND optimization, the changes are... staggering. All of the shaders helped for spills and fills are vertex shaders from Batman Arkham Origins. What surprises me is that these shaders account for such a high percentage of the spills and fills in fossil-db. 85%?!? v2: Use FIXED_GRF instead of BRW_GENERAL_REGISTER_FILE in an assertion. Suggested by Ken. Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) total instructions in shared programs: 20013625 -> 19954020 (-0.30%) instructions in affected programs: 4007157 -> 3947552 (-1.49%) helped: 31161 HURT: 0 helped stats (abs) min: 1 max: 400 x̄: 1.91 x̃: 2 helped stats (rel) min: 0.08% max: 59.70% x̄: 2.20% x̃: 1.83% 95% mean confidence interval for instructions value: -1.97 -1.86 95% mean confidence interval for instructions %-change: -2.22% -2.18% Instructions are helped. total cycles in shared programs: 859337569 -> 858636788 (-0.08%) cycles in affected programs: 74168298 -> 73467517 (-0.94%) helped: 13812 HURT: 16846 helped stats (abs) min: 1 max: 291078 x̄: 82.83 x̃: 4 helped stats (rel) min: <.01% max: 37.09% x̄: 3.47% x̃: 2.02% HURT stats (abs) min: 1 max: 1543 x̄: 26.31 x̃: 14 HURT stats (rel) min: <.01% max: 77.97% x̄: 4.11% x̃: 2.58% 95% mean confidence interval for cycles value: -55.10 9.39 95% mean confidence interval for cycles %-change: 0.62% 0.77% Inconclusive result (value mean confidence interval includes 0). 
Broadwell total cycles in shared programs: 904844939 -> 904832320 (<.01%) cycles in affected programs: 525360 -> 512741 (-2.40%) helped: 215 HURT: 4 helped stats (abs) min: 4 max: 1018 x̄: 60.16 x̃: 39 helped stats (rel) min: 0.14% max: 15.85% x̄: 2.16% x̃: 2.04% HURT stats (abs) min: 79 max: 79 x̄: 79.00 x̃: 79 HURT stats (rel) min: 1.31% max: 1.57% x̄: 1.43% x̃: 1.43% 95% mean confidence interval for cycles value: -75.02 -40.22 95% mean confidence interval for cycles %-change: -2.37% -1.81% Cycles are helped. No shader-db changes on any older Intel platforms. Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) Instructions in all programs: 142622800 -> 141461114 (-0.8%) Instructions helped: 197186 Cycles in all programs: 9101223846 -> 9099440025 (-0.0%) Cycles helped: 37963 Cycles hurt: 151233 Spills in all programs: 98829 -> 13695 (-86.1%) Spills helped: 2159 Fills in all programs: 128142 -> 18400 (-85.6%) Fills helped: 2159 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	a477587b4a	intel/fs: Add _LOGICAL versions of URB messages The lowering is currently fake. It just changes the opcode from the _LOGICAL version to the non-_LOGICAL version. v2: Remove some rebase cruft. 's/gfx8_//;s/simd8_/' in brw_instruction_name. Both suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	07b9bfacc7	intel/compiler: Move logical-send lowering to a separate file brw_fs.cpp was 10kloc. Now it's only 7.5kloc. Ugh. v2: Rebase on `9680e0e4a2`. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	c751ca769f	intel/eu: Validate some aspects of URB messages If these checks had been in place previously, some bugs that... eh-hem... practically took down the Intel CI would have been caught earlier. blush v2: Update to account for split sends. v3: Add some more Gfx version checks. Remove the redundant "src0 is a GRF" check. Both suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Ian Romanick	b909ac350f	intel/compiler: Rename vec4 state URB opcodes to have VEC4_ prefix An argument could be made that all stage-specific opcodes for vec4 stages should be prefixed with VEC4_ like the stage-agnostic opcodes. I'll leave those additional sed jobs for another day. egrep -lr '(VS|GS|TCS)_OPCODE_URB_WRITE' src | while read f; do sed --in-place 's/\(VS\|GS\|TCS\)_OPCODE_URB_WRITE/VEC4_\1_OPCODE_URB_WRITE/g' $f; done egrep -lr 'T.S_OPCODE[_A-Z]*URB_OFFSETS' src | while read f; do sed --in-place 's/\(T.S_OPCODE[_A-Z]*URB_OFFSETS\)/VEC4_\1/g' $f; done Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17379>	2022-07-08 19:45:34 +00:00
Lionel Landwerlin	9680e0e4a2	intel/fs: ray query fix for global address With stages dispatching with a mask, we can run into situations where we don't have the global address in all lanes. The existing code always assumed we had the address in at least lane0. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `bb40e999d1` ("intel/nir: use a single intel intrinsic to deal with ray traversal") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17330>	2022-07-08 00:36:04 +00:00
Kenneth Graunke	ec2e8bc33f	intel/compiler: Avoid copy propagating large registers into EOT messages EOT messages need to use g112-g127 for their sources. With the new opt_split_sends pass, we may be constructing an EOT message from two different registers, and be able to copy propagate the original values into those SENDs. This can cause problems if we copy propagate from a large register (say an RGBA value which is 4 GRFs in SIMD8 or 8 GRFs in SIMD16), in a situation where the SEND only read a subset of that (say the alpha value out of an RGBA texturing result). g112-127 can only hold 16 registers worth of data, and sometimes we can only use g112-126. So, we can't propagate if the GRFs in question are larger than 15 GRFs. Fixes a shader validation failure in Alan Wake. Thanks to Ian Romanick for catching this! shader-db on Icelake shows that only SIMD32 programs are affected, and the results are pretty negligible: total instructions in shared programs: 19615228 -> 19615269 (<.01%) instructions in affected programs: 10702 -> 10743 (0.38%) helped: 1 / HURT: 43 / largest change: +/- 2 instructions total cycles in shared programs: 852001706 -> 852001566 (<.01%) cycles in affected programs: 767098 -> 766958 (-0.02%) helped: 68 / HURT: 64 / largest change: +/- 774 cycles GAINED: 2 / LOST: 0 Fixes: `589b03d02f` ("intel/fs: Opportunistically split SEND message payloads") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6803 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17390>	2022-07-07 20:20:01 +00:00
Lionel Landwerlin	c67c9688c3	anv/utrace: use a bo pool for utrace buffers When utrace/perfetto is active, we allocate/free utrace buffers at the same rate as command buffers. It's useful to have a pool that avoids GEM_CREATE/GEM_CLOSE ioctls. v2: Use the pool more Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16613>	2022-07-07 18:49:05 +00:00
Pierre-Eric Pelloux-Prayer	8856379a03	mesa/st: don't guess the internal format if it's known This fixes tests using imageLoad/imageStore on texture created using glEGLImageTargetTexture2DOES. Before this change, the format was guessed as GL_RGBA, which would be rejected by _mesa_get_shader_image_format. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16662>	2022-07-07 12:25:05 +00:00
Lionel Landwerlin	f1dd487531	intel/nir: temporarily disable opt_uniform_atomics for RT/CL Not had time to investigate what is going on but it's definitely a contributor to failures. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16104>	2022-07-07 10:21:48 +00:00
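
The refcounting problem described in commit 768238fdc0 ("glx: Fix drawable refcounting for naked Windows") can be illustrated with a small, self-contained sketch. This is not Mesa's code: `struct drawable`, `table`, `lookup_drawable`, `create_drawable`, `fetch_drawable` and `release_drawable` are hypothetical names invented for illustration, and the fixed-size table stands in for the real hash. The only point shown is that the "found" path and the "created on demand" path should hand back the same number of references, so that unbinding the context does not destroy a drawable that is still known to the table.

```c
#include <stdlib.h>

#define MAX_DRAWABLES 16

/* Hypothetical stand-in for a DRI drawable; not Mesa's real struct. */
struct drawable {
    unsigned long xid;  /* X drawable ID */
    int refcount;       /* number of owners (table, bound contexts) */
};

/* Hypothetical stand-in for the drawable hash. */
static struct drawable *table[MAX_DRAWABLES];

static struct drawable *lookup_drawable(unsigned long xid)
{
    for (int i = 0; i < MAX_DRAWABLES; i++)
        if (table[i] != NULL && table[i]->xid == xid)
            return table[i];
    return NULL;
}

/* Creation path (analogous to an explicit create call): the table
 * itself holds the first reference. */
static struct drawable *create_drawable(unsigned long xid)
{
    for (int i = 0; i < MAX_DRAWABLES; i++) {
        if (table[i] == NULL) {
            struct drawable *d = malloc(sizeof(*d));
            if (d == NULL)
                return NULL;
            d->xid = xid;
            d->refcount = 1;
            table[i] = d;
            return d;
        }
    }
    return NULL; /* table full */
}

/* Fetch path used when binding a context. A drawable that was never
 * created explicitly (the "naked Window" case) is created on demand.
 * The context's reference is taken on BOTH paths, so the count on
 * first return is 2 either way. */
static struct drawable *fetch_drawable(unsigned long xid)
{
    struct drawable *d = lookup_drawable(xid);
    if (d == NULL) {
        d = create_drawable(xid);
        if (d == NULL)
            return NULL;
    }
    d->refcount++;
    return d;
}

/* Drop one reference; destroy the drawable when the last one goes. */
static void release_drawable(struct drawable *d)
{
    if (--d->refcount > 0)
        return;
    for (int i = 0; i < MAX_DRAWABLES; i++)
        if (table[i] == d)
            table[i] = NULL;
    free(d);
}
```

With this shape, calling `fetch_drawable()` on an ID that was never created and then `release_drawable()` on the result (as would happen when the context is bound to a different drawable) leaves the entry alive in the table with a count of 1. If the on-demand path skipped the increment, the same sequence would free the drawable, which matches the symptom the commit message describes.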

8128 commits