fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 18:18:06 +02:00

Author	SHA1	Message	Date
Isabella Basso	a553d3cd29	nir/algebraic: make patterns for float conversion lowerings imprecise As noted on [1], lowering patterns of the form floatS -> floatB -> floatS ==> floatS cannot require precision since this may cause flush denorming. [1] `3f779013` ("nir: Add an algebraic optimization for float->double->float") Fixes: `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	79c94ef52e	nir/algebraic: extend lowering patterns for conversions on smaller bit sizes Conversions on smaller bit sizes should also be collapsed when composed. This also adds more patterns on the intS -> intB -> floatB ==> intS -> floatB lowering so as to deal with any int size C > B instead of a fixed intB. Closes: #7776 Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	a27bcd63d0	nir/algebraic: extend mediump patterns Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Suggested-by: Italo Nicola <italonicola@collabora.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Isabella Basso	b3685f3ba7	nir/algebraic: insert patterns inside optimizations list Some patterns were outside the list of optimizations. Fixes: `b86305bb` ("nir/algebraic: collapse conversion opcodes (many patterns)") Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Signed-off-by: Isabella Basso <isabellabdoamaral@usp.br> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20965>	2023-03-11 17:21:37 +00:00
Alyssa Rosenzweig	2ba48eea88	nir/lower_point_size: Use shader_instructions_pass Sleepy code deletion mood. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21750>	2023-03-11 16:42:36 +00:00
Alyssa Rosenzweig	933b5c76f6	agx: Switch to scoped_barrier Rather than ingesting separate control and memory barriers, ingest only the combined and optimized scoped_barrier intrinsic. For barriers originating from GLSL, this makes it easier to ensure correctness. For barriers originating from SPIR-V, this is required for translation at all, as spirv_to_nir knows only scoped barriers. So this gets us closer to Vulkan and OpenCL. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21752>	2023-03-11 16:20:06 +00:00
David Heidelberg	84767a5160	ci/lava: every LAVA job doesn't want to run gles2 deqp, drop it Very annoying when adding new job and not getting failure due to missing `DEQP_VER: ` Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21702>	2023-03-11 14:48:20 +00:00
David Heidelberg	8cdbb894ca	ci/panfrost: correct the job name, as it runs on gles2 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21702>	2023-03-11 14:48:20 +00:00
David Heidelberg	e3660c2820	ci/amd: move skqp and va jobs on raven from XOrg to the XWayland Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21702>	2023-03-11 14:48:20 +00:00
David Heidelberg	1e262f129b	ci: add and utilize dalboz devices New 10 devices - asus-CM1400CXA-dalboz hosted on Collabora farm. 1x Move VA-API tests to the dalboz (more resources). One timeout dropped. 9x Run VKCTS on dalboz. Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21702>	2023-03-11 14:48:20 +00:00
Sil Vilerino	3067bda0f3	d3d12: Fix video decode for interlaced streams with reference only textures required Fixes: `d8206f6286` ("d3d12: Add video decode implementation of pipe_video_codec") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21832>	2023-03-11 14:31:32 +00:00
Alyssa Rosenzweig	b768a254f7	agx: Use nir_lower_mem_access_bit_sizes Lowers away 64-bit loads, which we'll create in the sysval lowering for dynamically indexed UBOs/VBOs. The lowering generates pack_64_2x32 instructions, so lower those too. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	8a53050d7d	agx: Implement extract_[ui]16 Instead of lowering to bitwise ops. Yet another way of subdividing in NIR. Probably insignificant but makes it easy to check that the pass ordering from the previous pass is right. It does let us get much better codegen for unpacksnorm2x16, whatever that's worth. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	706815488e	agx: Fix subdivision coalescing As intended. We can't CSE with partial null destinations in the way, so we shouldn't eliminate dead destinations until after CSE has run. But we should still eliminate dead instructions to ensure CSE doesn't move things around needlessly, hurting register pressure. Noticed while debugging live range splitting. No GLES3.0 shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	5ea9c2e634	agx: Make partial DCE optional Our dead code elimination pass does two things: 1. delete instructions that are entirely unnecessary 2. delete unnecessary destinations of necessary instructions To deal with pass ordering issues, we sometimes want to do #1 without #2. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Alyssa Rosenzweig	16f8bfb042	agx: Don't set lower_pack_split We should handle nir_op_unpack_32_2x16_split_* natively, since we can generate better code with agx_subdivide (coalescing the ops away) than the bitshift lowering. That said, we do need some extra instructions for the floating point conversions. No shader-db changes (which makes sense because we're targetting the GLES3.0 shader-db, which doesn't have the packing GLSL functions). The real motivation of this change isn't optimizing some GLSL pack functions, though, it's avoiding a code regression from using NIR's memory bit size lowering in a future MR. That lowering will turn things like "load i16vec4" into "load i32vec2 + unpack_32_2x16", so we need to be able to coalesce that unpack. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21674>	2023-03-11 14:15:50 +00:00
Daniel Stone	c1aa876747	ci: Disable Collabora LAVA farm Looks like a power or network issue. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21851>	2023-03-11 11:59:31 +00:00
Eric Engestrom	9cf636834c	ci: take valve farm offline It seems to be experiencing networking issues Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21851>	2023-03-11 11:59:18 +00:00
Daniel Stone	50378f59a7	ci: Actually run Piglit on LAVA At some point in a refactoring long ago, our 'Piglit' runs on arm64 started actually being dEQP-GLES2 runs. Oh dear. Surprisingly, there are a number of expectation changes; added every fail I saw from a long overnight stress test. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21851>	2023-03-11 11:58:30 +00:00
Alyssa Rosenzweig	b190d08a8a	pan/mdg: Remove reference to removed macro This will soon be more confusing than helpful. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	cc16e7322f	panfrost: Remove MALI_POSITIVE macro Now unused. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	131845eb84	panfrost: Inline the last MALI_POSITIVE use Big shrug on this one. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	14eb964e59	panfrost: Remove FBD tag enum from XML This was a hack to avoid modelling the full data structure. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	67cbbf9417	panfrost: Use framebuffer pointer XML Rather than manipulating the raw pointers. This is cleaner. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	1a5546293c	panfrost: Add XML for framebuffer pointers We shouldn't have to open-code these. They are real data structures, model them as such in the architecture XML files. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	35985be275	panfrost: Handle fixed-point packing in GenXML Minimum/maximum LOD and LOD bias are unsigned and signed fixed point formats respectively. They are not unsigned integers. Introduce fixed-point types into our GenXML and use them in the XML, rather than packing in sidebands. This makes the XML more correct and fixes pretty-printing of texture and sampler descriptors. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	17c55e0d12	panfrost: Don't use DECODE_FIXED16 for sample position Strictly this is a signed fixed-point, anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	e0752673be	docs/panfrost: Move description of instancing Connor Abbott wrote a nice explanation of how instance divisors work on Mali. Let's add it to the driver docs instead of letting it languish in a forgotten header file. This is mostly pasted from the existing header in tree, with a few local changes applied. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Alyssa Rosenzweig	07b43d6231	panfrost: Remove some unused definitions Nowadays, formats are defined with GenXML, not the old panfrost-job.h, so most of the format #defines in panfrost-job.h are unused. That said, a few are still in use as a backdoor for compressed format queries to avoid a GenXML dependency. That's not great but cleaning that up isn't the subject of this MR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20445>	2023-03-11 06:30:02 +00:00
Felix DeGrood	341f1011a6	intel/perf: Hide extended metrics by default XE architecture enables many more metrics, perhaps too many for the average user. Reduce reported metrics to smaller subset, known as non-extended metrics, by default. Can re-enable extended metrics with env var INTEL_EXTENDED_METRICS=1 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21841>	2023-03-11 05:05:06 +00:00
Alyssa Rosenzweig	6b22a02f90	asahi,agx: Implement buffer textures with gnarly NIR Implement buffer textures in full generality. There are a few issues here: * OpenGL requires buffer textures support a minimum size of 65536 elements, however 1D textures in AGX are (at most) 8192 elements. * OpenGL 4.0 (and OpenGL ES) require buffer textures to support the "RGB32" texture formats. These are 3 packed channels of 32-bits each. In general, non-power-of-two texel sizes are problematic. AGX does not support any such formats and we rely on the GL frontend to lower to a padded format (RGBX) if necessary. Such a lowering cannot work for buffer textures, however, so we need to find a way to implement RGB32 buffer textures. We solve these issues in the follow way: * Use 2D texture descriptors for buffer textures, with a large fixed power-of-two size along one axis. Then large texel indices may be accessed at a small vec2 texel coordinate, and since the fixed dimension is a power-of-two, that vector may be recovered by simply shifting and masking. This effectively avoids size restriction. We do need to clamp texel indices to the buffer size to avoid faulting on OOB reads, since we may read past the end of the buffer (if the app binds a non-page-aligned offset into the buffer). * Use a general purpose memory load for RGB32 buffer textures. Lower the texture load instruction to a memory load from the buffer and some address arithmetic. There's no format conversion needed for RGB32, other than maybe filling in a format-appropriate alpha, so this is straightforward. Again, we need to clamp the texel index for robustness with OOB reads. Each of these solutions brings its own problem. * Using 2D textures instead of 1D requires physically rounding up the buffer size when packing the descriptor, so we can no longer implement textureSize() by reading off the texture descriptor like normal. * We don't know at compile-time whether a given texture load will read from an RGB32 buffer texture or not, so we need to emit code for both. In Vulkan, we can't key the shader to this property, either, since it's descriptor set state and not pipeline state. And each of these problems in turn brings its own solution: * The texture descriptor is linear, so the "compression buffer address" field is ignored by the hardware. We stash the real buffer size there so that textureSize becomes a load from the texture descriptor like usual, without requiring a sideband (which would complicate bindless textures). * If we determine a texture descriptor contains RGB32 data, then it will never be interpreted by the hardware and hence does not need to be a valid texture descriptor. So, we extend the hardware's format enum to contain a software-defined RGB32 format enum. Then, when lowering texture buffer loads, we either read it as a typed RGB32 memory load or as a texture load depending on the value of the format field in the texture descriptor. All of this is accomplished with a big NIR pass generating a pile of strange looking code. But it should be good enough in practice for this silly feature. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21672>	2023-03-11 02:26:31 +00:00
Alyssa Rosenzweig	826649ba19	asahi, agx: Implement dummy samplers In NIR, texelFetch (txf) does not use a sampler, but in AGX, it does -- even though the contents of the sampler are semantically irrelevant. Rather than requiring the state tracker to bind a sampler anyway (indicated for texture buffers with PIPE_CAP_TEXTURE_BUFFER_SAMPLER), just add a dummy sampler ourselves if txf is used and there are otherwise no samplers. This is helpful because PIPE_CAP_TEXTURE_BUFFER_SAMPLER isn't honoured by Rusticl or seemingly mesa/st's PBO code, and after implementing this dummy sampler workaround in Panfrost for Rusticl, I realized this CAP is silly and shouldn't exist in the first place. (And I regret pushing for its reinclusion.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21672>	2023-03-11 02:26:31 +00:00
Guilherme Gallo	bc178c044e	ci/baremetal: Wrap artifact download curl with xtrace Setting `set -x`can be useful to known via trace which URL baremetal used to download artifacts. Today its only printed the command with the environment variables. Also, this commit fixes multiple `section_end` for the related Gitlab sections. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21804>	2023-03-10 21:40:23 +00:00
Guilherme Gallo	256e7888fd	ci: Fix release build use for performance jobs This commit ensures that we are using mesa release builds in performance jobs. To achieve that, some modifications were made on top of https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21492. - Append the `BUILDTYPE` variable into the S3 artifact name (MINIO_ARTIFACT_NAME environment variable) to allow for better artifact management. - The ./artifacts directory has been added to the list of artifact directories for build-common. This ensures that the debian-release and debian-arm64-release jobs are the only ones necessary for running performance jobs. These jobs only produce artifacts via prepare-artifacts.sh when we are under performance workflow. - Make lava-submit.sh behave similar to baremetal jobs regarding MINIO_ARTIFACT_NAME variable. For example, users can now easily differentiate between mesa-arm64.tar.zstd and mesa-arm64-release.tar.zstd by looking inside the `Downloading artifacts from s3` Gitlab section. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21804>	2023-03-10 21:40:23 +00:00
José Roberto de Souza	91a129b44a	iris: Move i915 submit_batch() to i915 backend No changes in behavior intented here. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21700>	2023-03-10 20:13:56 +00:00
José Roberto de Souza	21d5034edb	iris: Add batch_check_for_reset() to kmd backend Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21700>	2023-03-10 20:13:56 +00:00
José Roberto de Souza	e0ce31d7cf	iris: Add gem_mmap() to kmd backend Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21700>	2023-03-10 20:13:56 +00:00
José Roberto de Souza	c5888bf610	build: Block build of HASVK, Crocus and i915 in non-x86 architectures HASVK, Crocus and i915 drivers only supports integrated GPUs. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21773>	2023-03-10 19:41:14 +00:00
José Roberto de Souza	757e2dd692	intel/perf: Disable it for Xe KMD Xe still don't have support for performance metrics. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21773>	2023-03-10 19:41:14 +00:00
José Roberto de Souza	266d961fdc	iris: Don't mark protected bo as reusable The check in alloc_bo_from_cache() was skiping any try to get a bo from cache but after use a protected bo was still being put in some cache bucket and could be used for cases that don't require a protected bo. Using a protected bo in cases that don't require it can have performance implications. So here returning NULL when trying to get a cache bucket for a protected bo, this will cause bo->real.reusable to be set to false avoiding the bo to be reused. Fixes: `9402ac8023` ("iris: handle protected BO creation") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21824>	2023-03-10 18:59:59 +00:00
Alyssa Rosenzweig	e61d6540e6	asahi: Don't allow linear depth/stencil buffers We don't have a way to tell the ZLS hardware to use linear buffers, so if a buffer could be used for depth/stencil, we have to twiddle. This isn't a problem in practice, since depth/stencil buffers can't be shared across processes or mapped directly as linear. Fixes faults in depthstencil-render-miplevels, which was picking linear for one buffer because of a STAGING bind flag. But that won't work :-) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21753>	2023-03-10 18:29:52 +00:00
Daniel Stone	e61d022313	ci/android: Use a more aggressive timeout for the job This job sometimes - very, very, rarely - fails to start Cuttlefish, the Android VM environment. Given that we don't have any structural monitoring and restarting (unlike LAVA/BM/B2C) for this, just stick a more aggressive timeout on the job, so it'll be retried if it fails to start. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21837>	2023-03-10 16:39:36 +00:00
Ian Romanick	0cadc3830f	nir/lower_int64: Optionally lower ufind_msb using uadd_sat v2: Fix inverted condition for applying the optimization. Noticed by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	831f9d3f61	nir/algebraic: Optimize some ifind_msb to ufind_msb On Intel platforms, the uclz lowering if ufind_msb is either one instruction better (Gfx7 and newer) or two instructions better (all older platforms) than the ifind_msb implementations. On platforms that use lower_find_msb_to_reverse, there should be no difference. All Haswell and newer Intel platforms had similar results. (Ice Lake shown) total instructions in shared programs: 19938662 -> 19938634 (<.01%) instructions in affected programs: 850 -> 822 (-3.29%) helped: 2 / HURT: 0 total cycles in shared programs: 858467067 -> 858465538 (<.01%) cycles in affected programs: 10080 -> 8551 (-15.17%) helped: 2 / HURT: 0 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	db6d1edc1b	nir: Restrict ufind_msb and ufind_msb_rev to 32- or 64-bit sources `4d802df3aa` loosened the type restrictions on these opcodes to enable support for 64-bit ballot operations. In doing so, it enabled 8-bit and 16-bit sizes as well. It's impossible to get these sizes through GLSL or SPIR-V. None of the lowering in nir_opt_algebraic can handle non-32-bit sizes. Almost no drivers can handle non-32-bit sizes. It doesn't seem possible to enforce anything other than "one bit size" or "all bit sizes" in nir_opcodes.py. The only way it seems possible to enforce this is in nir_validate. This is not ideal, but it be what it be. v2: Remove restriction on find_lsb. It is acutally possible to get this via GLSL by doing findLSB() on a lowp value. findMSB declares its parameter as highp, so that path is still impossible. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	2d6f48f6ef	nir/algebraic: Do not generate 8- or 16-bit find_msb The next commit will add validation to restrict this instruction (and others) to only 32-bit or 64-bit sources. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	2119ab7319	nir/builder: Do not generate 8- or 16-bit find_msb Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	28311f9d02	nir: intel/compiler: Move ufind_msb lowering to NIR Fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Cycles in all programs: 9098346105 -> 9098333765 (-0.0%) Cycles helped: 6 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	a4052e70ea	nir/algebraic: Only lower ufind_msb with 32-bit sources The 31-ufind_msb_rev(x) lowering only produces the correct result for 32-bit sources. ufind_msb_rev can also have 64-bit sources, and most platforms are expected to lower this to 32-bit instructions with extra logic operations. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00
Ian Romanick	08ca862ef8	intel/compiler: Tighter src and dest size bounds checking for some opcodes Enforce the sizes listed in the Skylake PRM: BFREV: source types: D destination types: D CBIT: source types: UB, UW, UD destination types: UD FBH: source types: D, UD destination types: UD FBL: source types: UD destination types: UD LZD: source types: D, UD destination types: UD v2: Update BFREV commit message documentation. Suggested by Ken. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19042>	2023-03-10 15:27:17 +00:00

1 2 3 4 5 ...

168037 commits