fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 17:38:08 +02:00

Author	SHA1	Message	Date
Ian Romanick	8ab7ec0129	intel/compiler: Enable lower_bitfield_extract_to_shifts and lower_bitfield_insert_to_shifts for pre-Gfx7 GLSL IR opcodes generated for bitfieldExtract and bitfieldInsert are lowered by lower_instructions. `4dff3ff005` ("nir/opt_algebraic: Optimize open coded bfm.") adds an optimization that can rematerialize nir_op_bfm that was prevented by the GLSL IR lowering. It appears that every piece of hardware, except older Intel GPUS, that has real integers (i.e., lower_bitops is not set) also sets lower_bitfield_extract_to_shifts and lower_bitfield_insert_to_shifts. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Fixes: `4dff3ff005` ("nir/opt_algebraic: Optimize open coded bfm.") Closes: #7874 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20323>	2023-01-03 18:37:53 -08:00
Emma Anholt	0cff5d51ac	ci/intel: Switch skqp testing over to deqp-runner. The skqp runner gets us parallel execution, automatic caselist handling, nice reports, and the same xfail/flake handling you know and love from deqp and piglit. And, now that we have flake handling, we can turn the tests back on! Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20070>	2023-01-04 00:34:33 +00:00
Tapani Pälli	97f2b60833	anv: implement Wa_14015814527 for task shaders After using task shader, we need to emit a zero URB state and a nullprim (empty pipe control) before rendering with primitives. After this, a normal URB state needs to be returned, this will happen when pipeline batch is emitted during pipeline switch. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20334>	2023-01-03 12:44:08 +00:00
Sviatoslav Peleshko	261a334509	hasvk: Add layer with work-around for Doom 64 texture corruption Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7817 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19502>	2023-01-02 15:05:06 +00:00
Sviatoslav Peleshko	c2acd9f76a	anv: Add layer with work-around for Doom 64 texture corruption Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7817 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19502>	2023-01-02 15:05:06 +00:00
José Roberto de Souza	def474e916	intel/genxml/gen12.5: Pipe_Control::Remove Global Snapshot Count Reset It was not meant to be used(Iris have assert for it) and it was removed from Pipe_Control instruction in gen12.5 and newer. BSpec: 47112 Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20444>	2022-12-29 08:34:25 -08:00
José Roberto de Souza	c6d1f76da2	anv: Add and use emit_pipeline_select() To avoid the replication of code to properly emit PIPELINE_SELECT. init_compute_queue_state() had a different emit of PIPELINE_SELECT but as there is no compute engine in GFX VER 11 we are safe with the differences. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20444>	2022-12-29 08:34:15 -08:00
David Heidelberg	57f73d097e	ci/iris: add iris-kbl flake Ref: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7547 Signed-off-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20448>	2022-12-28 23:08:37 +00:00
Tapani Pälli	b9aa66d5d0	anv: disable preemption for 3DPRIMITIVE during streamout This is required by Wa_16013994831. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20438>	2022-12-27 15:53:42 +00:00
Tapani Pälli	910f5a18cf	intel/genxml: add disable preemption field for gen125 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20438>	2022-12-27 15:53:42 +00:00
Lionel Landwerlin	afdbed9e9c	anv: fix potential integer overflow The loop going from 0 to max_draw_count multiplies the value which could potentially overflow. Fixes Coverity CID 1517852 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `3596a8ea7a` ("anv: factor out some indirect draw count entry points") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20436>	2022-12-27 14:21:44 +00:00
Lionel Landwerlin	2024115b79	intel/ds: add missing generate draws perfetto glue Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c950fe97a0` ("anv: implement generated (indexed) indirect draws") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7956 Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Tested-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20433>	2022-12-26 14:11:44 +02:00
Lionel Landwerlin	c950fe97a0	anv: implement generated (indexed) indirect draws Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15642>	2022-12-23 22:52:50 +00:00
Lionel Landwerlin	3596a8ea7a	anv: factor out some indirect draw count entry points Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15642>	2022-12-23 22:52:50 +00:00
Lionel Landwerlin	61b730f1f4	anv: decouple util function from anv_cmd_buffer The issue we're addressing here is that we have 2 batches and the both grow at different rate. We want to keep doubling the main batch size as the application writes more and more commands to limit the number of GEM BOs. But we don't want to have the generation batch size to be linked to the main batch. v2: remove gfx7 code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15642>	2022-12-23 22:52:50 +00:00
José Roberto de Souza	3e28c5b9f9	anv: Pass anv_bo as parameter to anv_gem_mmap() anv_bo has information that will be needed by a future patch in anv_gem_mmap(), so here already preparing for that. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20423>	2022-12-23 18:22:29 +00:00
José Roberto de Souza	95ce9664d5	intel/common: Move i915 gem specific code to its own file Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20423>	2022-12-23 18:22:29 +00:00
José Roberto de Souza	f51bafc368	intel/common: Move i915 engine specific code to its own file Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20423>	2022-12-23 18:22:29 +00:00
Väinö Mäkelä	4c986c58b3	intel/blorp: Fix a hang caused by invalid dispatch enables on gfx7 Because commit `b9403b1c47` moved dispatch enable handling away from the compiler, the drivers must ensure correct dispatch enable values. This is handled by the intel_set_ps_dispatch_state function. v2: Fix gfx6 build and use brw_fs_get_dispatch_enables for gfx6 in crocus v3: Rebase, use intel_set_ps_dispatch_state, drop gfx6 handling Fixes: `b9403b1c47` ("intel: factor out dispatch PS enabling logic") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20267>	2022-12-23 10:17:04 +00:00
Tapani Pälli	7db1b94e07	intel/dev: setup 1024 GS urb entries for ADL-N v2: apply only for devices with less than 32 EUs (Lionel) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7942 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20414>	2022-12-23 09:51:01 +00:00
Lionel Landwerlin	25608659a0	intel/compiler: mark shader_record_ptr as uniform Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20413>	2022-12-23 09:22:13 +00:00
Lionel Landwerlin	739a08ad23	anv: handle null push descriptors in deferred optimization Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b49b18f0` ("anv: reduce BT emissions & surface state writes with push descriptors") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20410>	2022-12-22 14:07:21 +00:00
Rohan Garg	ad9c0e8cd9	anv: Ensure we clear ANV_PIPE_PSS_STALL_SYNC_BIT on flush Add the PSS stall bit to ANV_PIPE_STALL_BITS so that it get's cleared on flush. Fixes: `f3c62973` ("anv,iris: PSS Stall Sync around color fast clears") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20317>	2022-12-20 10:44:54 +00:00
Chad Versace	a5f9e59ce3	anv: Use vma_heap for descriptor pool host allocation Pre-patch, anv_descriptor_pool used a free list for host allocations that never merged adjacent free blocks. If the pool only allocated fixed-sized blocks, then this would not be a problem. But the pool allocations are variable-sized, and this caused over half of the pool's memory to be consumed by unusable free blocks in some workloads, causing unnecessary memory footprint. Replacing the free list with util_vma_heap, which does merge adjacent free blocks, fixes the memory explosion in the target workload. Disdavantges of util_vma_heap compared to the free list: - The heap calls malloc() when a new hole is created. - The heap calls free() when a hole disappears or is merged with an adjacent hole. - The Vulkan spec expects descriptor set creation/destruction to be thread-local lockless in the common case. For workloads that create/destroy with high frequency, malloc/free may cause overhead. Profiling is needed. Tested with a ChromeOS internal TensorFlow benchmark, provided by package 'tensorflow', running with its OpenCL backend on clvk. cmdline: benchmark_model --graph=mn2.tflite --use_gpu=true --min_secs=60 gpu: adl memory footprint from start of benchmark: before: init=132.691MB max=227.684MB after: init=134.988MB max=134.988MB Reported-by: Romaric Jodin <rjodin@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20289>	2022-12-16 07:18:38 +00:00
Iván Briano	766508f56a	Revert "anv: Refactor anv_pipeline to use the anv_pipeline_type" This reverts commit `b1126abb38`. This breaks all hell at least on DG2, as there are several cases left where current_pipeline gets checked against GPGPU to decide what to do, and the value doesn't match that of ANV_HW_PIPELINE_STATE_COMPUTE. On top of that, it also misses checking for ANV_HW_PIPELINE_STATE_RAYTRACING. Then there's the fact that in some cases, current_pipeline will be UINT32_MAX, because it's the original undefined state and also used after executing a secondary command buffer because we are not tracking on which pipeline did the secondary left us. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7910 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20349>	2022-12-16 06:39:32 +00:00
Tapani Pälli	77244e30b6	anv: remove some gen8 specifics handled now in hasvk Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20342>	2022-12-16 07:25:30 +02:00
Jordan Justen	5df50292d6	intel/isl: Disable CCS on MTL until B0 (Wa_14017353530) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20322>	2022-12-15 11:43:00 -08:00
Jianxun Zhang	6e33423a6f	intel/dev: Enable AUX map on MTL Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20322>	2022-12-15 11:43:00 -08:00
Jordan Justen	f81579628a	intel/aux_map: Ignore format bits when using tile-4 Based on Jianxun's ("iris: don't get format bits in AUX tables"). With gfx12.5+, the compression format is once again coming from the surface state programming. MTL once again uses an aux-map, but it ignores the format bits within the the aux-map metadata. Ref: Bspec 44930: "Compression format from AUX page walk is ignored. Instead compression format from Surface State is used." gfx12.5+ also uses tile-4 rather than y-tiling, so if we don't see y-tiling, we can return 0 from intel_aux_map_format_bits() for the ignored format bits. Rework: * Just return 0 if not using y-tiling as suggested by Nanley. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20322>	2022-12-15 11:43:00 -08:00
Lionel Landwerlin	b21cd1ee1b	anv: fixup another dirty issue with gpu_memcpy Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20335>	2022-12-15 17:30:55 +00:00
Rohan Garg	b1126abb38	anv: Refactor anv_pipeline to use the anv_pipeline_type Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20316>	2022-12-15 16:38:18 +00:00
Jordan Justen	78a75e0d25	intel/common/intel_genX_state.h: Add intel_set_ps_dispatch_state() This replaces brw_fs_get_dispatch_enables(), which was added in `b9403b1c47` ("intel: factor out dispatch PS enabling logic"), but this function will not work well for future changes to 3DSTATE_PS. So, instead, this moves the related code into a "genX" file which can directly update 3DSTATE_PS for the given platform. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20329>	2022-12-15 00:54:59 -08:00
Jordan Justen	f16e76d940	intel/common: Add intel_genX_state.h Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20329>	2022-12-15 00:54:59 -08:00
Paulo Zanoni	e930bff19e	anv: remove anv_reloc_list->array_length This is another field that, after the recent commits, became unused. It's either zero-initialized (by the memset) or copy-initialized (which means it's also zero). And it never even gets used anywhere anyway, so even if the value was non-zero it wouldn't matter. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20309>	2022-12-14 10:44:31 -08:00
Paulo Zanoni	1358622878	anv: remove anv_reloc_list->reloc_bos As a consequence of the last two commits, reloc_bos is always NULL and never used anywhere, so remove it. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20309>	2022-12-14 10:44:31 -08:00
Paulo Zanoni	f1c4c646b8	anv: remove anv_reloc_list_grow() The last commit made it clear that anv_reloc_list_grow() only ever gets called with zero as num_additional_relocs, which means it will always immediately return VK_SUCCESS without doing anything. That means we can remove it. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20309>	2022-12-14 10:44:31 -08:00
Paulo Zanoni	4168d3ef30	anv: remove anv_reloc_list->num_relocs There are only a few places in the code where num_relocs gets set: - During anv_reloc_list_init() where it gets memset() to 0. - At anv_reloc_list_init_clone() where it gets set with the value of another anv_reloc_list->num_relocs. - During anv_reloc_list_clear(), where it gets set to 0. - During anv_reloc_list_append(), where it gets added with the value of another anv_reloc_list->num_relocs. As you can see, either we explicitly set the value to 0 or we copy the value that's present in another anv_reloc_list, which should be 0. The one place where we used to increment num_relocs was in anv_reloc_list_add(), but that was deleted by: `7b7381e8d7` ("anv: Delete anv_reloc_list_add()") So in this commit we delete the num_relocs field from struct anv_reloc_list and we also delete some lines where, if the value is 0, nothing will happen. There's more we could be deleting here, but I wanted this commit to be minimal so it's very clear that num_relocs can't be non-zero. We were having some speculation that anv_reloc_list may still be important for actually adding BOs to the batch and building the validation list, so let's go slowly with the removal to make everything more easily reviewable. The one possibility I could be missing here is another situation like the memset() we have at anv_reloc_list_init() or some other crazy indirect overwrite, but as far as I have checked, that is not the case. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20309>	2022-12-14 10:44:31 -08:00
Paulo Zanoni	4b1c4925e7	anv: remove anv_execbuf->surface_states_relocs Now that we removed relocations, this is not being used anywhere. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20309>	2022-12-14 10:44:31 -08:00
Jianxun Zhang	c14857e915	intel/common: clean up AUX macros The hardcoded is either replaced with new interfaces or relocated to C file if it is private. Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20259>	2022-12-14 18:11:13 +00:00
Jianxun Zhang	9ff471fdc6	intel/vulkan: replace AUX macros with interfaces Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20259>	2022-12-14 18:11:13 +00:00
Jianxun Zhang	78a4b6deed	intel/isl: Support 1MB alignment for AUX mapping Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20259>	2022-12-14 18:11:13 +00:00
Jianxun Zhang	9698eee50d	intel/common: Support 1MB granularity AUX mapping format (Bspec 44930) Signed-off-by: Jianxun Zhang <jianxun.zhang@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20259>	2022-12-14 18:11:13 +00:00
Marcin Ślusarz	264a0cabd1	anv: assert when number of primitives is higher than max Such cases can lead to memory corruptions. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20279>	2022-12-14 09:55:11 +00:00
Marcin Ślusarz	d7a1916798	anv: handle mesh shaders with max primitives == 0 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20279>	2022-12-14 09:55:10 +00:00
Ian Romanick	eb76cee9f8	nir: Eliminate nir_op_i2b There are a lot of optimizations in opt_algebraic that match ('ine', a, 0), but there are almost none that match i2b. Instead of adding a huge pile of additional patterns (including variations that include both ine and i2b), always lower i2b to a != 0. At this point in the series, it should be impossible for anything to generate i2b, so there /should not/ be any changes. The failing test on d3d12 is a pre-existing bug that is triggered by this change. I talked to Jesse about it, and, after some analysis, he suggested just adding it to the list of known failures. v2: Don't rematerialize i2b instructions in dxil_nir_lower_x2b. v3: Don't rematerialize i2b instructions in zink_nir_algebraic.py. v4: Fix zink-on-TGL CI failures by calling nir_opt_algebraic after nir_lower_doubles makes progress. The latter can generate b2i instructions, but nir_lower_int64 can't handle them (anymore). v5: Add back most of the hunk at line 2125 of nir_opt_algebraic.py. I had accidentally removed the f2b(bf2(x)) optimization. v6: Just eliminate the i2b instruction. v7: Remove missed i2b32 in midgard_compile.c. Remove (now unused) emit_alu_i2orf2_b1 function from sfn_instr_alu.cpp. Previously this function was still used. 🤷 No shader-db changes on any Intel platform. All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141165875 -> 141165873 (-0.0%) Instructions helped: 2 Cycles in all programs: 9098956382 -> 9098956350 (-0.0%) Cycles helped: 2 The two Vulkan shaders are helped because of the "new" (('b2i32', ('ine', ('ubfe', a, b, 1), 0)), ('ubfe', a, b, 1)) algebraic pattern. Acked-by: Jesse Natalie <jenatali@microsoft.com> [earlier version] Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Tested-by: Daniel Schürmann <daniel@schuermann.dev> [earlier version] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Ian Romanick	edae161d98	intel/fs: Use nir_type_convert instead of nir_type_conversion_op In a future commit, nit_type_conversion_op won't be able to handle i2b (and in a much later commit f2b), so switch many users to the fully featured function. No shader-db or fossil-db changes on any Intel platform. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15121>	2022-12-14 06:23:21 +00:00
Nanley Chery	e4e4ba2304	intel: Allow CCS_E on R11G11B10_FLOAT for TGL+ We now support blorp_copy with this format. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19937>	2022-12-14 03:05:24 +00:00
Nanley Chery	e862626031	intel/isl: Bump format_info entries from 100 to 110 The new format support is only tested on Ice Lake and onward. Makes the next patch clearer. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19937>	2022-12-14 03:05:24 +00:00
Nanley Chery	2b2141d193	intel/isl: Lessen CCS_E-compatibility checks for TGL+ Tiger Lake and onward allow drivers to specify a compression format independently from the surface format. So, even if the surface format changes, hardware is still able to determine how to access the CCS. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19937>	2022-12-14 03:05:24 +00:00
Nanley Chery	2add57d0c2	intel: Hook up RENDER_SURFACE_STATE::DecompressInL3 The sampler's decompressor seems to lack support for some types of format re-interpretation. Use the more capable decompressor for these cases. This will be needed to avoid regressing piglit's arb_texture_view-rendering-formats in later commits. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jianxun Zhang <jianxun.zhang@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19937>	2022-12-14 03:05:24 +00:00

1 2 3 4 5 ...

8850 commits