fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-16 05:28:14 +02:00

Author	SHA1	Message	Date
Jason Ekstrand	f3969e2413	intel/fs: Rework dynamic coarse handling Use 2 flags for PI & RT messages. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:18 +00:00
Jason Ekstrand	964b878986	intel/fs: Break out yet another FB write helper This new helper, do_emit_fb_writes() does the actual walk over all the render targets to emit each of the different FB writes. We want this in a helper because we're about to go a bit crazy with coarse. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:18 +00:00
Jason Ekstrand	9c658b1fc8	intel/fs/validate: Assert SEND [extended] descriptors are uniform This is required by code-gen since it generates a 1-wide OR and it'll blow up if the register width > 1. It's also way better than the "your register is the wrong size" assert you get from the more generic validation check. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:18 +00:00
Jason Ekstrand	949b42c4dc	intel/compiler: Convert wm_prog_key::multisample_fbo to a tri-state This allows us to communicate to the back-end that we don't actually know if the framebuffer is multisampled or not. No drivers set anything but ALWAYS/NEVER and we still have a few ALWAYS/NEVER assumptions but those should be asserted. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:18 +00:00
Jason Ekstrand	5644011f06	intel/compiler: Convert wm_prog_key::persample_interp to a tri-state This allows for the possibility that we may not know at compile time if sample shading is enabled through the API. While we're here, also document exactly what this bit means so we don't confuse ourselves. v2: Fixup coarse pixel values (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:18 +00:00
Jason Ekstrand	d8dfd153c5	intel/fs: Make per-sample and coarse dispatch tri-state Whenever one of them is BRW_SOMETIMES, we depend on dynamic flag pushed in as a push constant. In this case, we have to often have to do the calculation both ways and SEL the result. It's a bit more code but decouples MSAA from the shader key. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:18 +00:00
Jason Ekstrand	43ca7f4178	intel/compiler: Convert brw_wm_aa_enable to brw_sometimes There are other cases where we want a tri-state logic like this. May as well have one enum for all the cases. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:17 +00:00
Jason Ekstrand	5d1c538449	intel/fs: Return early in a couple builtin setup helpers Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:17 +00:00
Jason Ekstrand	714a291673	intel/compiler: Use SHADER_OPCODE_SEND for PI messages Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:17 +00:00
Jason Ekstrand	d25e5310bc	intel/nir: Lower barycentrics to per-sample in a dedicated pass This is more similar to what we do for single-sample and it should be more clear going forward once our lowering gets more complex. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:17 +00:00
Jason Ekstrand	991d546102	intel/compiler: Document wm_prog_key::persample_interp Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21094>	2023-02-06 09:12:17 +00:00
Ian Romanick	ea413e826b	nir: Eliminate nir_op_f2b Builds on the work of !15121. This gets to delete even more code because many drivers shared a lot of code for i2b and f2b. No shader-db or fossil-db changes on any Intel platform. v2: Rebase on `1a35acd8d9`. v3: Update a comment in nir_opcodes_c.py. Suggested by Konstantin. v4: Another rebase. Remove f2b stuff from Midgard. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20509>	2023-02-03 22:39:57 +00:00
Constantine Shablya	9b3b7e5091	anv: handle ATTACHMENT_OPTIMAL layout Closes: #8216 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21066>	2023-02-03 09:17:03 +00:00
Juston Li	6f4b375c94	anv: check initial cmd_buffer is chainable Submitting a batch with the first command buffer with the simultaneous bit set followed by a command buffer without the bit set gets past the check and triggers this assert attempting to chain them: ../src/intel/vulkan/anv_batch_chain.c:1147: anv_cmd_buffer_chain_command_buffers: Assertion `num_cmd_buffers == 1' failed. Signed-off-by: Juston Li <justonli@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21056>	2023-02-02 21:17:33 +00:00
Erik Faye-Lund	c8994256a6	anv, hasvk: remove stale TODO-files This file hasn't really been updated since 2016, apart from a single search-replace two years ago. That's an eternity in ANV-land, so let's just remove these. While we're at it, also remove the duplicate in hasvk. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21044>	2023-02-02 19:25:15 +00:00
Kenneth Graunke	873dfb673b	anv: Perform load_constant address math in 32-bit rather than 64-bit We lower NIR's load_constant to load_global_constant, which uses A64 bindless messages. As such, we do the following math to produce the address for each load: base_lo@32 <- BRW_SHADER_RELOC_CONST_DATA_ADDR_LOW base_hi@32 <- BRW_SHADER_RELOC_CONST_DATA_ADDR_HIGH base@64 <- pack_64_2x32_split(base_lo, base_hi) addr@64 <- iadd(base@64, u2u64(offset@32)) On platforms that emulate 64-bit math, we have to emit additional code for the 64-bit iadd to handle the possibility of a carry happening and affecting the top bits. However, NIR constant data is always uploaded adjacent to the shader assembly, in the same buffer. These buffers are required to live in a 4GB region of memory starting at Instruction State Base Address. We always place the base address at a 4GB address. So the constant data always lives in a buffer entirely contained within a 4GB region, which means any offsets from the start of the buffer cannot possibly affect the high bits. So instead, we can simply do a 32-bit addition between the low bits of the base and the offset, then pack that with the unchanged high bits. On anv, INSTRUCTION_STATE_POOL_MIN_ADDRESS is 8GB, so the high bits are always 0x2. We don't even need to patch that portion of the address and can just use an immediate value. We do still need to pack, however. fossil-db on Icelake indicates the following for affected shaders: Instrs: 10830023 -> 10750080 (-0.74%) Cycles: 1048521282 -> 1046770379 (-0.17%); split: -0.33%, +0.16% Subgroup size: 103104 -> 103112 (+0.01%) Send messages: 570886 -> 570760 (-0.02%) Loop count: 14428 -> 14429 (+0.01%) Spill count: 14246 -> 14244 (-0.01%); split: -0.06%, +0.04% Fill count: 22802 -> 22794 (-0.04%); split: -0.04%, +0.01% Scratch Memory Size: 654336 -> 662528 (+1.25%) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20999>	2023-02-02 02:45:04 +00:00
Sagar Ghuge	0c083d29a5	intel/fs: Always stall between the fences on Gen11+ Be conservative in Gfx11+ and always stall in a fence. Since there are two different fences, and shader might want to synchronize between them. This change also brings back the original code block for the stall between the fence and comment from the commit `b390ff3517`. v2: (Caio) - Re-arrange code block. - Adjust comment. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6958 Fixes: `f7262462` ("intel/fs: Rework fence handling in brw_fs_nir.cpp") Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Tested-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20996>	2023-02-02 00:21:21 +00:00
Amber	ab4c2990ed	intel/compiler: use lower_image_samples_to_one Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Reviewer-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: Amber Amber <amber@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20813>	2023-02-01 19:52:49 +00:00
José Roberto de Souza	8092bc2158	intel/ds: Fix crash when allocating more intel_ds_queues than u_vector was initialized u_vector_add() don't keep the returned pointers valid. After the initial size allocated in u_vector_init() is reached it will allocate a bigger buffer and copy data from older buffer to the new one and free the old buffer, making all the previous pointers returned by u_vector_add() invalid and crashing the application when trying to access it. This is reproduced when running dEQP-VK.synchronization.signal_order.timeline_semaphore.* in DG2 SKUs that has 4 CCS engines, INTEL_COMPUTE_CLASS=1 is set and of course perfetto build is enabled. To fix this issue here I'm moving the storage/allocation of struct intel_ds_queue to struct anv_queue/iris_batch and using struct list_head to maintain a chain of intel_ds_queue of the intel_ds_device. This allows us to append or remove queues dynamically in future if necessary. Fixes: `e760c5b37b` ("anv: add perfetto source") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20977>	2023-02-01 18:31:29 +00:00
Faith Ekstrand	1b3c746eec	hasvk: Let spirv_to_nir() set UBO/SSBO base cast alignments Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21027>	2023-02-01 17:54:40 +00:00
Faith Ekstrand	85d44b0f97	anv: Let spirv_to_nir() set UBO/SSBO base cast alignments Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21027>	2023-02-01 17:54:40 +00:00
Lionel Landwerlin	6eb75dc74c	anv: expose EXT_load_store_op_none Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21018>	2023-02-01 12:53:29 +00:00
Tapani Pälli	6bdc29e2e6	intel: enable existing workaround for ICL platform Patch changes comment to refer to the lineage 14014097488, this workaround applies for ICL as well. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20952>	2023-02-01 11:09:19 +00:00
Lionel Landwerlin	13de23ea07	intel: add missing PS restriction on BDW+ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20949>	2023-02-01 00:28:53 +00:00
Marcin Ślusarz	af9e2b8bf1	intel/compiler/mesh: remove dead code path supporting >4 dword writes Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20858>	2023-01-31 18:28:21 +00:00
Marcin Ślusarz	be82ed28f0	intel/compiler/mesh: support longer write messages Allowing longer writes reduces the number of send messages needed to support unaligned 4-component writes. Note: nothing currently generates 8-component writes, so this change makes "second_mask" code path in emit_urb_direct_writes and emit_urb_indirect_writes_mod dead. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20858>	2023-01-31 18:28:21 +00:00
Marcin Ślusarz	3131c2fc7a	intel/compiler/mesh: optimize indirect writes Our hardware requires that we write to URB using full vec4s at aligned addresses. It gives us an ability to mask-off dwords within vec4 we don't want to write, but we have to know their positions at compile time. Let's assume that: - V represents one dword we want to write - ? is an unitinitialized value - "\|" is a vec4 boundary. When we want to write 2-dword value at offset 0 we generate 1 write message: \| V1 V2 ? ? \| with mask: \| 1 1 0 0 \| When we want to write 4-dword value at offset 2 we generate 2 write messages: \| ? ? V1 V2 \| V3 V4 ? ? \| with mask: \| 0 0 1 1 \| 1 1 0 0 \| However if we don't know the offset within vec4 at compile time we currently generate 4 write messages: \| V1 V1 V1 V1 \| \| 0 0 1 0 \| \| V2 V2 V2 V2 \| \| 0 0 0 1 \| \| V3 V3 V3 V3 \| \| 1 0 0 0 \| \| V4 V4 V4 V4 \| \| 0 1 0 0 \| where masks are determined at run time. This is quite wasteful and slow. However, if we could determine the offset modulo 4 statically at compile time, we could generate only 1 or 2 write messages (1 if modulo is 0) instead of 4. This is what this patch does: it analyzes the addressing expression for modulo 4 value and if it can determine it at compile time, we generate 1 or 2 writes, and if it can't we fallback to the old 4 writes method. In mesh shader, the value of offset modulo 4 should be known for all outputs, with an exception of primitive indices. The modulo value should be known because of MUE layout restrictions, which require that user per-primitive and per-vertex data start at address aligned to 8 dwords and we should statically always know the offset from this base. There can be some cases where the offset from the base is more dynamic (e.g. indirect array access inside a per-vertex value), so we always do the analysis. Primitive indices are an exception, because they form vec3s (for triangles), which means that the offset will not be easy to analyse. When U888X index format lands, primitive indices will use only one dword per triangle, which means that we'll always write them using one message. Task shaders don't have any predetermined structure of output memory, so always do the analysis. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20050>	2023-01-31 13:50:08 +00:00
Lionel Landwerlin	0d7f8aa249	anv: fix null descriptors When writing descriptor with a null buffer/image we expect that writing 0 will point to the null surface. For that to work the null surface has to be in the bindless surface heap. This fixes some new failures in dEQP-VK.robustness.* tests once rewritten from the NV_ray_tracing to KHR_ray_tracing extension. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4ceaed7839` ("anv: split internal surface states from descriptors") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7762 Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20953>	2023-01-27 21:38:32 +00:00
Paulo Zanoni	38a087c4ce	anv: there's no need to set exec_obj offsets twice The anv_execbuf_add_bo() function already sets the offsets for the exec_objects. Since we're always using softpin and never using relocations all these objects should have non-changing offsets, all set during anv_bo creation and never changed. Not only we should not change these offsets, we definitely don't change them between anv_execbuf_add_bo() and this loop we're removing. Previously, we'd have the offset set as -1 for BOs that had never been submitted when we were not using softpin. Notice that with games we can have several hundreds of BOs in this array. This loop was added by: `c5f7e1f5b4` ("anv: Delete relocation support from batch submission") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20885>	2023-01-27 18:53:11 +00:00
Erik Faye-Lund	b6a344f4ba	meson: do not reconstruct ICD paths Meson will already construct these paths for us, so let's reuse them instead of throwing away the result and recontstructing them. Reviewed-by: Eric Engestrom <eric@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20907>	2023-01-27 11:35:50 +00:00
Marcin Ślusarz	432e263284	intel/compiler: fine-grained control of dispatch widths Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> [v2] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20854>	2023-01-27 11:00:41 +00:00
Brian Paul	fbd32a04da	anv: add a third memory type for LLC configuration Commit `582bf4d9` turned on write-combining for most (all?) memory allocations. This caused a fairly large performance drop in some of our VMware tests (application traces, such as Windows Metro Paint). This patch adds a third memory type configuration: DEVICE_LOCAL, HOST_VISIBLE, HOST_COHERENT. This is uncached. Then, in anv_AllocateMemory() we only use write-combining for this uncached type. This memory type is found in the Intel Windows Vulkan driver. And according to https://asawicki.info/news_1740_vulkan_memory_types_on_pc_and_how_to_use_them uncached memory correlates to write-combined memory. This fixes our performance regression (and actually produced the fastest ever results for our test suite). Signed-off-by: Brian Paul <brianp@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20770>	2023-01-26 22:45:49 +00:00
Lionel Landwerlin	13cca48920	intel/fs: drop FS_OPCODE_UNIFORM_PULL_CONSTANT_LOAD_GFX7 We can lower FS_OPCODE_UNIFORM_PULL_CONSTANT_LOAD into other more generic sends and drop this internal opcode. The idea behind this change is to allow bindless surfaces to be used for UBO pulls and why it's interesting to be able to reuse setup_surface_descriptors(). But that will come in a later change. No shader-db changes on TGL & DG2. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20416>	2023-01-26 11:26:53 +00:00
Emma Anholt	f6c06ef2f6	ci: Add manual rules variations to disable irrelevant driver jobs. If you're only affecting one or a couple of drivers, it would be nice if your pipeline buttons on the web UI weren't full of manual run buttons for all the other drivers. This is a bunch of duplicated lines, but less than it could have been now that we have !references. In some of these cases (i915g, nouveau, etnaviv), we have no non-manual jobs for those drivers, so I could have just rewritten the original "driver-rules" to "driver-manual-rules". I decided to keep things consistent between drivers, though, because this is all esoteric enough to readers already without making different drivers' rules look different. Fixes: #4891 Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17445>	2023-01-26 00:48:19 +00:00
Emma Anholt	67547a04b6	ci: Move the performance jobs' allow_failure:true to the gl rules. This helps clarify what's going on with these magic manual jobs. Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17445>	2023-01-26 00:48:19 +00:00
Francisco Jerez	7b5e933629	intel/fs: Fix src and dst types of LOAD_PAYLOAD ACP entries during copy propagation. The ACP entries created by copy propagation to track the implied copies of LOAD_PAYLOAD instructions don't model the behavior of LOAD_PAYLOAD correctly, since (as of `41868bb682`) header moves are implicitly retyped to UD and the destination of non-header copies implicitly uses the same type as the corresponding source, even though the ACP entries created for such copies could incorrectly represent a type conversion, which can lead to mis-optimization of the program. According to Marcin, this fixes the func.mesh.ext.workgroup_id.task.q0 crucible test. Fixes: `41868bb682` ("i965/fs: Rework the fs_visitor LOAD_PAYLOAD instruction") Reported-by: Marcin Ślusarz <marcin.slusarz@intel.com> Tested-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18980>	2023-01-25 22:22:12 +00:00
José Roberto de Souza	5ccc2049fa	intel/ds: Nuke intel_ds_queue::queue_id queue_id is not used anywhere. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20888>	2023-01-25 17:43:57 +00:00
José Roberto de Souza	f2fdd14907	hasvk: Nuke anv_queue:index_in_family This value is also stored in vk_queue, so we can nuke from anv_queue. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20888>	2023-01-25 17:43:57 +00:00
José Roberto de Souza	013e849a2c	anv: Nuke anv_queue:index_in_family This value is also stored in vk_queue, so we can nuke from anv_queue. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20888>	2023-01-25 17:43:57 +00:00
José Roberto de Souza	80dd6ed074	intel: Add kmd_type parameter to necessary intel_gem.h functions Here adding kmd_type parameter to intel_gem_read_render_timestamp(), intel_gem_can_render_on_fd() and intel_gem_supports_protected_context(). Those 3 functions will have Xe implementations, the other functions in intel_gem.h will not be called by Xe code paths so not adding kernel_driver_type to it. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20773>	2023-01-25 09:17:15 -08:00
José Roberto de Souza	162695f0b8	intel: Add intel_kmd_type parameter to intel_engine_get_info() This will be needed to execute proper function to fetch engines from kernel mode driver. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20773>	2023-01-25 09:16:55 -08:00
José Roberto de Souza	2c42641994	intel/dev: Detect what is the kernel mode driver loaded Intel GPUs will have a new kernel mode driver, support for some platforms will overlap so we will need to detect in runtime what is the kernel mode driver loaded for each device. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20773>	2023-01-25 09:16:55 -08:00
Kenneth Graunke	a8108f1d44	anv: Add missing untyped data port flush on PIPELINE_SELECT See the comments in emit_apply_pipe_flushes(). Flushing HDC is not sufficient in GPGPU mode, and we need to set the untyped data port flush bit as well. Fixes many dEQP-VK failures with INTEL_COMPUTE_CLASS=1 on Alchemist. Fixes: `1067ec90a5` ("anv: Update PIPELINE_CONTROL flush when switching pipeline mode in TGL+") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20774>	2023-01-25 07:08:55 +00:00
Lionel Landwerlin	5b79745b04	intel/common: add a INTEL_DECODE variable to parameter decoder at runtime Sometimes you want to diff 2 runs with INTEL_DEBUG=bat, but a tiny allocation change can mess quite badly with offsets printed in the decoding, making it hard to look at the diff with meld. Fortunately our decoder can avoid printing offsets. We just need a variable to specify that. We still use the defaults specified by the driver but you can turn things on/off with : INTEL_DECODE=+color,-offsets,-floats INTEL_DEBUG=bat ./my_app Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20874>	2023-01-24 15:15:17 +00:00
Lionel Landwerlin	31931754bc	intel/decoder: print out compute push constants Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20874>	2023-01-24 15:15:17 +00:00
Marcin Ślusarz	536a2acfc2	intel/compiler/mesh: handle const data in task & mesh programs Started showing up when nir_opt_large_constants call was moved in `88756cee8d`. Fixes dEQP-VK.mesh_shader.ext.smoke.monolithic.fullscreen_gradient* Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `88756cee8d` ("intel/compiler: Run nir_opt_large_constants before scalarizing consts") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20876>	2023-01-24 14:47:21 +00:00
Marcin Ślusarz	9bb18a4f9e	intel/compiler: fix generation of vec8/vec16 alu instruction I stumbled on this when I inserted some suboptimal lowering code after all optimizations. Adding certain subset of optimizations after my lowering code actually avoided this bug, so I think it's not possible to hit this on upstream. Let's fix this for the next person generating suboptimal code... Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20857>	2023-01-24 13:15:58 +00:00
Tapani Pälli	0813c1a6fa	intel/genxml: set unused 3DSTATE_PS_EXTRA field as mbz Wa_14015360517 mentions situations where HW produces invalid occlusion query results when "Pixel Shader Does not write to RT" bit is set. "When Pixel Shader Kills Pixel is set, SW must perform a dummy render target write from the shader and not set this bit, so that Occlusion Query is correct." Another situation is when writing to UAV or to NULL render target. Patch sets field as 'must be zero' to discourage possible use of it. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20849>	2023-01-24 10:28:15 +00:00
Lionel Landwerlin	a50d2fdb46	intel/fs: avoid cmod optimization on instruction with different write_mask I've been running into failures with tests like : dEQP-VK.robustness.robustness2.bind.notemplate.rgba32i.unroll.nonvolatile.uniform_buffer_dynamic.no_fmt_qual.len_4.samples_1.1d.frag With the load_global_const_block_intel NIR intrinsic, you can load a vec8/vec16 with a predicate. The predicate is correctly uniformized to feed into the SEND instruction's flag register. The problem is that a series of optimization first remove the find_live_channel and then changes the broadcast into a simple MOV instruction, on the assumption that the first channel is always active if there is not control flow. This is correct. But after that the cmod optimzation will remove this instruction : mov.nz.f0.0(16) null:D, vgrf16+0.0<0>:D NoMask because it seems to be equivalent to : cmp.g.f0.0(16) vgrf16:D, vgrf12:D, 63d In this case vgrf16 is the predicate to the load block SEND instruction. Since the execution mask is different between both, some of the channels of the SEND instruction end up not being loaded or loaded with the wrong predication and we end up with incorrect UBO data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20852>	2023-01-24 07:35:42 +00:00
Emma Anholt	849af68dbd	ci/piglit: Add some common piglit skips for Mesa CI's testing of glx. Since our X servers don't have a compositor, and we run tests in parallel, various swap and frontbuffer tests won't ever be stable. Rather than having every driver have to track those flakes, make a general X11 skips list as a known issue of our CI rather than pointing fingers at drivers. Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by: Karol Herbst <kherbst@redhat.com> Acked-by: Martin Roukala <martin.roukala@mupuf.org> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20798>	2023-01-24 00:13:02 +00:00

1 2 3 4 5 ...

8993 commits