fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 11:08:24 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	03ab1d6aaa	intel/compiler: document units of brw_ubo_range fields Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17908>	2022-08-05 11:51:31 +00:00
Lionel Landwerlin	734384e8bc	intel/fs: fixup simd selection with shader calls Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17908>	2022-08-05 11:51:31 +00:00
Lionel Landwerlin	9cb9390962	intel/fs: store num of resume shaders in prog_data That way we can look at the SBT entries for debug purposes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17908>	2022-08-05 11:51:31 +00:00
Sagar Ghuge	845ab3d627	anv: Handle bits to flush data-port's Untyped L1 data cache v2: Drop ANV_PIPE_UNTYPED_DATAPORT_CACHE_FLUSH_BIT from invalidate bits (Lionel) Add utrace support Expand on comment about PIPE_CONTROL::UntypedDataPortCache Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16905>	2022-08-05 10:43:50 +03:00
Lionel Landwerlin	1f34ce7e8e	intel/ds: track untyped dataport flushes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16905>	2022-08-05 10:43:50 +03:00
Sagar Ghuge	79cd2c2759	anv: Specify Untyped L1 cache policy for stateless accesses Set write back L1 cache policy in STATE_BASE_ADDRESS instruction for A64 messages. v2: Also set the value in genX_state.c (Lionel) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16905>	2022-08-05 10:43:50 +03:00
Sagar Ghuge	d4b2b769d1	intel/isl: Setting L1 caching policy to Write-back mode For a RW L1 cache, both reads and writes are cached in the L1, at high priority (MRU position). For a RO L1 cache, reads are cached at higher priority and writes bypass the cache. v1: (Ken) - Set caching policy for buffer surfaces too Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16905>	2022-08-05 10:43:50 +03:00
Lionel Landwerlin	5e21f47428	anv: fixup PIPE_CONTROL restriction on gfx8 We're missing a condition that is currently papered over by having ANV_PIPE_HDC_PIPELINE_FLUSH_BIT in the invalidate bits. v2: rework with simplication (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16905>	2022-08-05 10:42:16 +03:00
Dave Airlie	a17635e988	gallivm/nir/st: lower image derefs in advance. This improves clover from crashing to just failing, but I mainly want it this to cleanup the nir code first It's also important the shaders coming from the state tracker for feedback get images lowered when they are draw shaders now. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10641>	2022-08-05 06:18:44 +00:00
Constantine Shablya	2af624706a	anv: use nir_opt_uniform_access Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17558>	2022-08-03 23:57:50 +00:00
Nanley Chery	e7419c11ae	anv: Make the D16 reg mode single-sampled Wa_14010455700 is dependent on the format and sample count, but our code to track whether or not it had been applied was only dependent on the format. As a result, we failed to enable the workaround when an app used a D16 2xMSAA buffer, then a D16 1xMSAA buffer right afterwards. Make the workaround tracking code sample-dependent to fix this. Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17859>	2022-08-03 15:31:10 +00:00
Mykhailo Skorokhodov	8b13acd715	anv: Move Wa_1806527549 and enable by default Move Wa_1806527549 into `init_render_queue_state` and set HIZ_CHICKEN (7018h) bit = 1 by default. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6717 Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17778>	2022-08-02 16:33:10 +03:00
Lionel Landwerlin	8c9dd9e783	intel/dev: remove INTEL_DEVID_OVERRIDE Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17831>	2022-08-02 11:17:58 +00:00
Lionel Landwerlin	7f82ab7104	intel/dev: add a test verifying that device override works Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17831>	2022-08-02 11:17:58 +00:00
Lionel Landwerlin	9d55c5237e	intel/tools/stub: fixup parsing of --platform= Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17831>	2022-08-02 11:17:58 +00:00
Lionel Landwerlin	f2bbc959a0	intel/tools/drm-shim: fixup eu_stride for topology Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17831>	2022-08-02 11:17:58 +00:00
Lionel Landwerlin	186ff4696a	intel/dev: move verification function to a header Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17831>	2022-08-02 11:17:58 +00:00
Lionel Landwerlin	6931ae83ce	anv: decode init batch with INTEL_DEBUG=bat Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17852>	2022-08-02 10:42:26 +00:00
Marcin Ślusarz	883acc4150	intel/compiler: use NIR_PASS more Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17619>	2022-08-02 10:07:05 +00:00
Marcin Ślusarz	7ebae85955	intel/compiler: insert URB fence before task/mesh termination Bspec 53421 says: "A URB fence memory is typically performed prior the thread exit message, so that the next thread dispatch that reads that URB memory will see it." Cc: 22.1 <mesa-stable> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16665>	2022-08-02 09:31:24 +00:00
Marcin Ślusarz	30c0f2bfbb	intel/compiler: there are 4 types of fences on gfx >= 12.5 Found by code inspection. There's an assert later checking that we haven't overflown this array, so this change probably doesn't matter for any workload. Cc: 22.1 <mesa-stable> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16665>	2022-08-02 09:31:24 +00:00
Marcin Ślusarz	2bd148c990	intel/compiler: emit URB fences for TASK/MESH Cc: 22.1 <mesa-stable> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16665>	2022-08-02 09:31:24 +00:00
Kenneth Graunke	9afd955353	intel/compiler: Delete unused Gfx8+ code in brw_find_live_channel() We now handle this in fs_visitor::lower_find_live_channel(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17530>	2022-08-02 08:41:43 +00:00
Kenneth Graunke	49ee3ae9e8	intel/compiler: Lower FIND_[LAST_]LIVE_CHANNEL in IR on Gfx8+ This allows the software scoreboarding pass, scheduler, and so on to handle the individual instructions and handle them, rather than trusting in the generator to do scoreboarding correctly when expanding the virtual instruction to multiple actual instructions. By using SHADER_OPCODE_READ_SR_REG, we also correctly handle the software scoreboarding workaround when reading DMask/VMask. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17530>	2022-08-02 08:41:43 +00:00
Mark Janes	6401d768b9	intel/dev: drop warning for unhandled hwconfig keys The hwconfig api may change unexpectedly prior to public release of new platforms. Also, public documentation of the hwconfig api sometimes lags the release. For these reasons, warnings about unhandled hwconfig keys are noisy, likely to occur, and unhelpful to most users. This commit drops those warnings, in favor of a separate internal process for tracking hwconfig api changes. Suggested-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17846>	2022-08-02 08:08:02 +00:00
Jordan Justen	68b88fae8c	intel/dev: Fill in system memory info when using INTEL_DEVID_OVERRIDE Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17828>	2022-08-01 23:54:55 +00:00
Sviatoslav Peleshko	cb99365403	intel/nullhw: Use correct macro to fix build regression Fixes: `b510ee0d` ("Use vk_foreach_struct_const where needed") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6950 Signed-off-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Tested-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17789>	2022-08-01 10:54:38 +00:00
Yiwei Zhang	71a0ae2796	anv: enable VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM for modifier support This is a missed format to properly support media interop for Android. Currently only used when layering GL atop Vulkan on Android, but will be used directly with Vulkan when the platform default renderer has switched to skiavk in modern Android. Test: CtsMediaTestCases and CtsVideoTestCases with angle on venus on anv Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chad Versace <chadversary@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17808>	2022-07-29 23:24:15 +00:00
Iván Briano	a05fcc94c2	anv: assert inheritance_info is not NULL Makes some static analysis tools happier. Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17809>	2022-07-29 22:50:21 +00:00
Mark Janes	dc8df485e9	intel/compiler: reorder shader cache keys to minimize padding Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17749>	2022-07-29 20:45:25 +00:00
Mark Janes	a4a4aefa03	intel/compiler: pad all data structures used by shader cache keys When the compiler pads a data structure, the padded bytes will not be initialized. Shader keys are compared with memcmp and unitialized bytes within the structure breaks this mechanism. Explicitly pad the structures with members, so the compiler is forced to initialize them. Add a warning to indicate if a change to alignment in any of the data structures requires additional padding. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17749>	2022-07-29 20:45:25 +00:00
Dylan Baker	42b89276e6	iris\|anv: gfx version 12.5 data cache flush is not a workaround This was not a workaround, it was simply missing from the documentation. So remove the workaround language. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17797>	2022-07-28 22:08:46 +00:00
Dylan Baker	180af73101	anv: add gfx version 12.5 flushes to CCS path This was already added to the MCS path in !17218, so this is just adding it in the CCS path as well. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17797>	2022-07-28 22:08:46 +00:00
Kenneth Graunke	fc02ce5713	intel/eu: Mark header present in URB memory fences on XeHP Fixes the following EU validation error: ERROR: Header must be present for all URB messages. The message header is ignored for URB fence messages, so I doubt that this actually matters in practice. But we should probably mark it as present, because you have to send something, and according to the documentation, there is a message header, it's just ignored. Fixes: `e6a9501aa2` ("intel/fs: Add the URB fence message") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17624>	2022-07-28 21:31:45 +00:00
Kenneth Graunke	986b49a56d	intel/eu: Clarify spec citations for XeHP region restrictions When this rule started causing issues, I looked it up in the documentation, and found the rule for 64-bit destinations and integer DWord multiplication, but there was no mention of floating point destinations, as the text in brackets suggested. The actual restriction text had been updated, so this led to some confusion where I thought the conditions had been changed in newer docs. However, what's actually going on is that there are two separate conditions, each listed in separate rows of the table. One lists 64-bit destinations or integer DWord multiplication, and the other mentions floating-point destinations. In both cases, the actual restrictions are identical, so we handle them together in the code. Try to update the comment to avoid future confusion. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17624>	2022-07-28 21:31:45 +00:00
Kenneth Graunke	5c88488a64	intel/eu: Fix XeHP register region validation for hstride == 0 Recently, we started using <1;1,0> register regions for consecutive channels, rather than the <8;8,1> we've traditionally used, as the <1;1,0> encoding can be compacted on XeHP. Since then, one of the EU validator rules has been flagging tons of instructions as errors: mov(16) g114<1>F g112<1,1,0>UD { align1 1H I@2 compacted }; ERROR: Register Regioning patterns where register data bit locations are changed between source and destination are not supported except for broadcast of a scalar. Our code for this restriction checked three things: #1: vstride != width * hstride \|\| #2: src_stride != dst_stride \|\| #3: subreg != dst_subreg Destination regions are always linear (no replicated values, nor any overlapping components), as they only have hstride. Rule #1 is requiring that the source region be linear as well. Rules #2-3 are straightforward: the subregister must match (for the first channel to line up), and the source/destination strides must match (for any subsequent channels to line up). Unfortunately, rules #1-2 weren't working when horizontal stride was 0. In that case, regions are linear if width == 1, and the stride between consecutive channels is given by vertical stride instead. So we adjust our src_stride calculation from src_stride = hstride * type_size; to: src_stride = (hstride ? hstride : vstride) * type_size; and adjust rule #1 to allow hstride == 0 as long as width == 1. While here, we also update the text of the rule to match the latest documentation, which apparently clarifies that it's the location of the LSB of the channel which matters. Fixes: `3f50dde8b3` ("intel/eu: Teach EU validator about FP/DP pipeline regioning restrictions.") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17624>	2022-07-28 21:31:45 +00:00
Jason Ekstrand	0772242feb	intel/eu: Don't throw validation errors on float MOV_INDIRECT Fixes: `3f50dde8b3` ("intel/eu: Teach EU validator about FP/DP pipeline regioning restrictions.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17624>	2022-07-28 21:31:45 +00:00
Kenneth Graunke	82ee30e558	intel/eu: Handle compaction when inserting validation errors When the EU validator encountered an error, it would add an annotation to the disassembly. Unfortunately, the code to insert an error assumed that the next instruction would start at (offset + sizeof(brw_inst)), which is not true if the instruction with an error is compacted. This could lead to cascading disassembly errors, where we started trying to decode the next instruction at the wrong offset, and getting lots of scary looking output: ERROR: Register Regioning patterns where [...] (-f0.1.any16h) illegal(* invalid execution size value 6 ) { align1 $7.src atomic }; (+f0.1.any16h) illegal.sat(* invalid execution size value 6 ) { align1 $9.src AccWrEnable }; illegal(* invalid execution size value 6 ) { align1 $11.src }; (+f0.1) illegal.sat(* invalid execution size value 6 ) { align1 F@2 AccWrEnable }; (+f0.1) illegal.sat(* invalid execution size value 6 ) { align1 F@2 AccWrEnable }; (+f0.1) illegal.sat(* invalid execution size value 6 ) { align1 $15.src AccWrEnable }; illegal(* invalid execution size value 6 ) { align1 $15.src }; (+f0.1) illegal.sat.g.f0.1(* invalid execution size value 6 ) { align1 $13.src AccWrEnable }; Only the first instruction was actually wrong - the rest are just a result of starting the disassembler at the wrong offset. Trash ensues! To fix this, just pass the instruction size in a few layers so we can record the next offset properly. Cc: mesa-stable Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17624>	2022-07-28 21:31:45 +00:00
Eric Engestrom	2c67457e5e	util/list: rename LIST_ENTRY() to list_entry() This follows the Linux kernel convention, and avoids collision with macOS header macro. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6751 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6840 Cc: mesa-stable Signed-off-by: Eric Engestrom <eric@igalia.com> Acked-by: David Heidelberg <david.heidelberg@collabora.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17772>	2022-07-28 10:10:44 +00:00
Jordan Justen	fa79020ba9	anv: Fix PHYSICAL_DEVICE_MEMORY_BUDGET_PROPERTIES with large BAR Reported-by: Dave Airlie <airlied@redhat.com> Fixes: `fae88d8791` ("anv: make use of the new smallbar uAPI") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6937 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17761>	2022-07-27 09:44:38 +00:00
Jordan Justen	2863e720f0	intel/dev: Determine the amount of free vram using small BAR uapi Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16739>	2022-07-26 20:34:02 +00:00
Jordan Justen	acc6457ff4	intel/dev: Use i915 region probed_cpu_visible_size when non-zero Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16739>	2022-07-26 20:34:02 +00:00
Lionel Landwerlin	fae88d8791	anv: make use of the new smallbar uAPI Instead of having 2 VkMemoryType pointing to the same VkMemoryHeap, we have each VkMemoryType with VK_MEMORY_PROPERTY_DEVICE_LOCAL_BIT (one host visible, the other not) point to its own VkMemoryHeap. For the local heap that is host visible, we'll use the I915_GEM_CREATE_EXT_FLAG_NEEDS_CPU_ACCESS flag at GEM BO creation. When the smallbar uAPI is not available we fallback to a single heap and do not use I915_GEM_CREATE_EXT_FLAG_NEEDS_CPU_ACCESS. v2: Handle probed_cpu_visible_size == probed_size (Matthew) v3: * Jordan: Use region info from devinfo v4: Also make the vram host visible heap as local (Ken) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16739>	2022-07-26 20:34:02 +00:00
Ian Romanick	f7f232385f	intel/fs: Use canonical form for "work around" tags Trivial. Also clean up some weird whitespace. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17605>	2022-07-26 17:25:19 +00:00
Ian Romanick	377246318a	intel/fs: Eliminate "masked" and "per slot offset" URB messages All of this information can be inferred from the sources. v2: Fix "error: unused variable 'opcode'" detected by marge-bot. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17605>	2022-07-26 17:25:19 +00:00
Ian Romanick	b21b901b46	intel/fs: Don't pass flags to lower_urb_read_logical_send or lower_urb_write_logical_send ...because the flags can be inferred from the sources. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17605>	2022-07-26 17:25:19 +00:00
Ian Romanick	1b17f8fc5a	intel/fs: Make logical URB read instructions more like other logical instructions No shader-db changes on any Intel platform Fossil-db results: Tiger Lake Instructions in all programs: 156926440 -> 156926470 (+0.0%) Instructions hurt: 15 Cycles in all programs: 7513099349 -> 7513099402 (+0.0%) Cycles hurt: 15 Ice Lake and Skylake had similar results. (Ice Lake shown) Cycles in all programs: 9099036492 -> 9099036489 (-0.0%) Cycles helped: 1 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17605>	2022-07-26 17:25:19 +00:00
Ian Romanick	349a040f68	intel/fs: Make logical URB write instructions more like other logical instructions The changes to fs_visitor::validate() helped track down a place where I initially forgot to convert a message to the new sources layout. This had caused a different validation failure in dEQP-GLES31.functional.tessellation.tesscoord.triangles_equal_spacing, but this were not detected until after SENDs were lowered. Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) total instructions in shared programs: 19951145 -> 19951133 (<.01%) instructions in affected programs: 2429 -> 2417 (-0.49%) helped: 8 / HURT: 0 total cycles in shared programs: 858904152 -> 858862331 (<.01%) cycles in affected programs: 5702652 -> 5660831 (-0.73%) helped: 2138 / HURT: 1255 Broadwell total cycles in shared programs: 904869459 -> 904835501 (<.01%) cycles in affected programs: 7686744 -> `7652786` (-0.44%) helped: 2861 / HURT: 2050 Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) Instructions in all programs: 141442369 -> 141442032 (-0.0%) Instructions helped: 337 Cycles in all programs: 9099270231 -> 9099036492 (-0.0%) Cycles helped: 40661 Cycles hurt: 28606 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17605>	2022-07-26 17:25:18 +00:00
Constantine Shablya	85c3cea96f	anv: set image_read_without_format NIR option on Vulkan 1.3 VK_KHR_format_feature_flags2 is core and implicitly enabled in 1.3. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17442>	2022-07-24 17:27:10 +00:00
Ian Romanick	95e50d198f	intel/vec4: Set lower_usub_sat Reviewed-by: Emma Anholt <emma@anholt.net> Closes: #6900 Fixes: `90a8fb03` ("nir/lower_io: Fix array length of buffers larger than INT32_MAX.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17637>	2022-07-22 17:54:28 +00:00

1 2 3 4 5 ...

8204 commits