fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 12:48:13 +02:00

Author	SHA1	Message	Date
Lionel Landwerlin	afeb0c3022	genxml: drop gen10 Finishing off the job started in !6899 v2: Remove remaining gen10_pack.h include (Sagar) v3: Forgot isl gen10 removal (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7185>	2020-10-20 07:56:40 +00:00
Caio Marcelo de Oliveira Filho	8dd03a7c12	anv: Advertise VK_KHR_shader_terminate_invocation Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7221>	2020-10-19 19:38:35 +00:00
Kenneth Graunke	aca31baafc	isl: Enable Tigerlake HDC:L1 caches via MOCS in various cases. Thanks to Felix Degrood for discovering that we missed enabling this additional caching on Tigerlake! Felix also benchmarked the changes. We now use MOCS 48 (HDC:L1 + L3 + LLC) for render targets, textures, and pull constant buffers. We leave storage buffers & images, as well as stateless messages, using the previous MOCS 2 value. We can't use HDC:L1 with atomics, and we don't know a priori whether storage buffers will be used with atomics or not. Similarly, the Vulkan buffer device address feature allows atomics to be performed on buffers via stateless messages, and we only can control MOCS at the base address level, so we can't do much there. This is closer to what the Windows Vulkan and OpenGL drivers do, though it isn't quite the same - they also disable LLC in some cases, but we observed this to have noticable performance regressions when we tried (though a couple titles benefited). We may try experiment with that in the future. Improves performance in a number of titles: - Unreal Engine 4 Shooter Demo [VK]: 11.8% - Witcher 3 [DXVK]: 3.9% - Rise of the Tomb Raider [VK]: 1.5% - Shadow of the Tomb Raider [VK]: 1.0% - Grand Theft Auto V [DXVK]: 0.8% We did not observe any performance regressions. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7104>	2020-10-19 19:18:11 +00:00
Kenneth Graunke	02fe825a61	isl, anv, iris: Add a centralized helper to select MOCS based on usage On Gen12+, we can enable additional caches in certain usage situations. This routes that decision making to a central place in ISL, based on surface usage flags, and updates both drivers to use it. (i965 doesn't need to change because it doesn't support Gen12.) We continue handling the "external" decision via an anv_mocs() wrapper for now, since we store that flag in anv_bo, which isl doesn't know about. (We could introduce an ISL_SURF_USAGE_EXTERNAL, but I'm not actually sure that would be cleaner.) This patch should not have any functional nor performance effects, as we continue selecting the exact same MOCS values for now. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7104>	2020-10-19 19:18:11 +00:00
Kenneth Graunke	103ad427bc	anv: Set only one ISL usage bit (RT/texture) for CopyBuffer sources Most uses of this function deal with destination buffers, but for copy_buffer_to_image, the buffer is the source, and isn't rendered to. We should avoid setting ISL_SURF_USAGE_RENDER_TARGET_BIT. Also, we should avoid setting ISL_SURF_USAGE_TEXTURE_BIT for the destination, which isn't sampled from. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7104>	2020-10-19 19:18:10 +00:00
Nanley Chery	3c87ac1f60	isl: Fix the aux-map encoding for D24_UNORM_X8 Bspec: 53911 now defines the encoding for this format. Cc: mesa-stable Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7198>	2020-10-19 15:58:43 +00:00
Jason Ekstrand	9a062383e3	anv: Implement VariableDescriptorCount Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7180>	2020-10-19 14:50:47 +00:00
Jason Ekstrand	215218f32f	anv: Add a descriptor_count to descriptor sets This is useful for asserting in-bounds descriptor set access. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7180>	2020-10-19 14:50:47 +00:00
Jason Ekstrand	dd1971c9d8	anv: Bump the number of update-after-bind descriptors to 1M It's a bit hard to exactly map our implementation to the limits described by Vulkan. The bindless surface handle in the extended message descriptors is 20 bits and it's an index into the table of RENDER_SURFACE_STATE structs that starts at bindless surface base address. This means that we can have at must 1M surface states allocated at any given time. Since most image views take two descriptors, this means we have a limit of about 500K image views. However, since we allocate surface states at vkCreateImageView time, this means our limit is actually something on the order of 500K image views allocated at any time. The actual limit describe by Vulkan, on the other hand, is a limit of how many you can have in a descriptor set. Assuming anyone using 1M descriptors will be using the same image view twice a bunch of times (or a bunch of null descriptors), we can safely advertise a larger limit. 1M is what's required by D3D12, so let's advertise that. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3335 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7180>	2020-10-19 14:50:47 +00:00
Ricardo Garcia	9d4999e7cf	anv: Ignore continue flag in primary cmd buffers According to the Vulkan specification, the VK_COMMAND_BUFFER_USAGE_RENDER_PASS_CONTINUE_BIT flag will be ignored if included in a VkCommandBufferBeginInfo for a primary command buffer. This also implies pBeginInfo->pInheritanceInfo should not be read even if the flag is present, and makes it legal to include the flag knowing it will be ignored. Signed-off-by: Ricardo Garcia <rgarcia@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7128>	2020-10-19 09:14:29 +02:00
Nanley Chery	5e27e04322	intel/isl: Drop redundant unpack of unorm channels Fixes: `09ced65420` ("intel/isl: Add format conversion code") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7168>	2020-10-16 18:35:57 +00:00
Caio Marcelo de Oliveira Filho	e7e24d5039	intel/fs: Handle nir_intrinsic_terminate For terminate operation, jump the invocation without predicating on the rest of the quad being disabled -- which is what is done for demote and discard. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7150>	2020-10-15 21:40:09 +00:00
Jason Ekstrand	a5529eb235	anv: Go back to using the sampler for UBO pulls This functionally reverts `b54d37a867`. This fixes a 12% performance regression in DOOM (2016) on Tigerlake. Fixes: `b54d37a867` "anv: Use the data cache for indirect UBO..." Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7173>	2020-10-15 20:53:05 +00:00
Ian Romanick	19aaafd4a6	intel: Remove Gen10-specific device entries The enables removal of gen_device_info::is_cannonlake. v2: Remove GEN10_FEATURES and GEN10_HW_INFO macros. Suggested by Lionel. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6899>	2020-10-15 09:29:54 -07:00
Ian Romanick	b0e3af4fb8	intel: Remove Gen10-speicific perf support v2: Also update Makefile.sources and Android build files. Noticed by Lionel. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6899>	2020-10-15 09:29:54 -07:00
Ian Romanick	fc04733f09	anv: Don't generate Gen10-specific functions v2: Re-wrap lines in meson.build. Suggested by Jason. v3: Also update Makefile.sources and Android build files. Noticed by Lionel. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> [v2] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6899>	2020-10-15 09:29:54 -07:00
Ian Romanick	495dc514ac	intel/isl: Don't generate Gen10-specific functions v2: Also update Makefile.sources and Android build files. Noticed by Lionel. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6899>	2020-10-15 09:29:54 -07:00
Ian Romanick	1f00b0f4ef	intel: Remove Gen10-specific cache config code Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6899>	2020-10-15 09:29:54 -07:00
Ian Romanick	262ca98b3a	intel/compiler: Remove Gen10-specific code Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6899>	2020-10-15 09:29:53 -07:00
Ian Romanick	e22f106e01	intel: Disable all support for Gen10 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6899>	2020-10-15 09:29:53 -07:00
Ian Romanick	b9c326bc4d	intel: Silence many unused parameter warnings in blorp_genX_exec.h I considered a couple other options (including adding #if / #endif around UNUSED and adding an UNUSED_ON_SOME_GEN), but this seemed the best. There was also at least one other case of having UNUSED on a paramter that is sometimes unused (params in blorp_emit_color_calc_state). This header gets included in a lot of places (esp. in files that get built per-Gen), so the warnings are repeated a lot. In file included from src/mesa/drivers/dri/i965/genX_blorp_exec.c:33: src/intel/blorp/blorp_genX_exec.h: In function ‘emit_urb_config’: src/intel/blorp/blorp_genX_exec.h:193:48: warning: unused parameter ‘deref_block_size’ [-Wunused-parameter] 193 \| enum gen_urb_deref_block_size deref_block_size) \| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~ src/intel/blorp/blorp_genX_exec.h: In function ‘blorp_fill_vertex_buffer_state’: src/intel/blorp/blorp_genX_exec.h:350:52: warning: unused parameter ‘batch’ [-Wunused-parameter] 350 \| blorp_fill_vertex_buffer_state(struct blorp_batch batch, \| ~~~~~~~~~~~~~~~~~~~~^~~~~ src/intel/blorp/blorp_genX_exec.h: In function ‘blorp_emit_surface_state’: src/intel/blorp/blorp_genX_exec.h:1403:42: warning: unused parameter ‘aux_op’ [-Wunused-parameter] 1403 \| enum isl_aux_op aux_op, \| ~~~~~~~~~~~~~~~~^~~~~~ src/intel/blorp/blorp_genX_exec.h: In function ‘blorp_update_clear_color’: src/intel/blorp/blorp_genX_exec.h:1867:46: warning: unused parameter ‘batch’ [-Wunused-parameter] 1867 \| blorp_update_clear_color(struct blorp_batch *batch, \| ~~~~~~~~~~~~~~~~~~~~^~~~~ Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6899>	2020-10-15 09:29:53 -07:00
Kenneth Graunke	341f5bffb7	intel/compiler, anv: Delete cs_prog_data->slm_size cs_prog_data->slm_size is basically redundant with prog_data->total_shared, which is the field that we actually use for controlling the shared local memory size in all drivers. We were still using it in one place for VK_EXT_pipeline_executable_properties, but we should just fix that and delete the field. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7152>	2020-10-14 23:13:41 +00:00
Jason Ekstrand	f8117f7051	intel/fs: Allow constant-propagation into SAMPLEINFO and IMAGE_SIZE Without this, we end up with indirect sampler messages all the time because we don't propagate the texture/image BTI. This makes debugging shaders with imageSize or textureSamples in them a pain. Shader-db results on Ice Lake: total instructions in shared programs: 19720612 -> 19720564 (<.01%) instructions in affected programs: 4998 -> 4950 (-0.96%) helped: 12 HURT: 0 All affected shaders were compute shaders in Deus Ex: Mankind Divided. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6794>	2020-10-14 21:35:30 +00:00
Nanley Chery	0d9216a7cb	isl: Allow CCS for 8bpp surfaces with 3+ miplevels I can't find a restriction for enabling CCS on these surfaces in recent versions of the Bspec. Since I didn't cite my source, I'm not even sure such a restriction existed in the first place. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7085>	2020-10-14 18:44:08 +00:00
Jason Ekstrand	5abac85177	intel/fs: Rework scratch handling on Gen9+ The current scratch mechanism uses an MRF hack where we reserve a few GRF registers to treat like the MRF and we collect the data into that MRF region before doing a scratch write. We also use that region for the header for scratch reads. This commit changes things and gets rid of the MRF hack. Instead, we reserve a single register (which RA is free to pick) for the scratch header and uses split sends for scratch writes to avoid having to do the copy. This should provide RA with more freedom in the presence of spilling as well as avoid some unnecessary data moves. In future, the new GEN9_SCRATCH_HEADER opcode gives us a place where we can do our own per-thread scratch base address calculations rather than depending on the scratch base address that gets pushed into g0. Having an opcode for this lets us do it once at the top of the shader rather than repeating it at every read/write. One other noticeable difference is the use of SHADER_OPCODE_SEND. We can get away with this thanks to the fact that we're now using a set to track which instructions are generated by spills and don't rely on the opcodes to find spill/fill instructions. This allows us to avoid adding more virtual opcodes and let the normal code paths handle things like scoreboard dependencies between header setup and the SEND. It also means that post-RA scheduling may be able to space out the header setup MOV and the SEND for better latency hiding. Shader-db results on Skylake: total spills in shared programs: 12137 -> 10604 (-12.63%) spills in affected programs: 6685 -> 5152 (-22.93%) helped: 274 HURT: 2 total fills in shared programs: 13065 -> 11515 (-11.86%) fills in affected programs: 9007 -> 7457 (-17.21%) helped: 275 HURT: 1 Shader-db results on Ice Lake: total spills in shared programs: 12482 -> 10953 (-12.25%) spills in affected programs: 6586 -> 5057 (-23.22%) helped: 275 HURT: 0 total fills in shared programs: 12819 -> 11234 (-12.36%) fills in affected programs: 7867 -> 6282 (-20.15%) helped: 274 HURT: 0 Shader-db results on Tigerlake: total spills in shared programs: 11689 -> 10233 (-12.46%) spills in affected programs: 4740 -> 3284 (-30.72%) helped: 259 HURT: 0 total fills in shared programs: 10840 -> 9443 (-12.89%) fills in affected programs: 6244 -> 4847 (-22.37%) helped: 259 HURT: 0 Fossil-db results on Ice Lake: Spills in all programs: 245249 -> 201633 (-17.8%) Fills in all programs: 366066 -> 314368 (-14.1%) More practically, this seems to give about a 0.5-1% perf boost in Witcher 3 (DXVK) and Shadow of the Tomb Raider (Vulkan native). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	e557af9781	intel/fs/ra: Use a set to track added spill/fill instructions Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	f650c4c0c6	intel/fs/ra: Sanity-check our IP counts Starting with `e99081e76d`, we don't re-construct liveness information every time we spill a register. Instead, we're very careful to track which instructions are spill instructions and not contribute those to the IP count so that we can continue to use the old liveness information even though instructions have been added. This commit adds an assert that sanity-checks that we count the same number of instructions as our liveness information is based on. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	d80d0a6ced	intel/fs/ra: Store the last non-spill VGRF node Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	2af6528c33	intel/fs/ra: Refactor handling of Gen7 scratch reads The attempt at de-duplication with the gen7_read Boolean wasn't actually saving us anything. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	74a1843ca0	intel/fs/ra: Increment spill_offset as part of the emit_spill loop This makes it consistent with our handling of src.offset and with our handling of spill_offset in emit_unspill. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	06ebf23283	intel/fs: Add a SCRATCH_HEADER opcode This opcode is responsible for setting up the buffer base address and per-thread scratch space fields of a scratch message header. For the most part, it's a copy of g0 but some messages need us to zero out g0.2 and the bottom bits of g0.5. This may actually fix a bug when nir_load/store_scratch is used. The docs say that the DWORD scattered messages respect the per-thread scratch size specified in gN.3[3:0] in the message header but we've been leaving it zero. This may mean that we've been ignoring any scratch reads/writes from a load/store_scratch intrinsic above the 1KB mark. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	24b64c8408	intel/fs: Copy the PTSS from g0 for scratch reads/writes In theory, this fixes a bug where we were dropping the PTSS bound on the floor. The hardware docs claim that the A32 DWORD and BYTE scattered read/write messages do a PTSS bounds check. However, in practice, it seems that the hardware ignores the bounds check so this doesn't actually matter. I verified this with the following couple of piglit tests: https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/399 In practice, this prevents the next commit from making a subtle behavioral change. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Jason Ekstrand	f103012fad	intel/batch_decoder: Don't clame vec4 vs/gs/tcs shaders on Gen11+ Because we hard-coded the default to vec4, any platform where it doesn't have a "Dispatch Mode" field gets vec4 by default. This includes Gen11+ where vec4 is no longer a thing. Change the default so it works on newer hardware. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7084>	2020-10-13 21:59:27 +00:00
Rhys Perry	8850a63161	radv/aco,nir/lower_subgroups: don't lower elect ACO can implement this better. fossil-db (Navi): Totals from 33 (0.02% of 135946) affected shaders: SGPRs: 1736 -> 1744 (+0.46%) VGPRs: 1680 -> 1656 (-1.43%) CodeSize: 246160 -> 245916 (-0.10%); split: -0.14%, +0.04% MaxWaves: 449 -> 461 (+2.67%) Instrs: 48301 -> 48266 (-0.07%); split: -0.12%, +0.05% Cycles: 469740 -> 469240 (-0.11%); split: -0.18%, +0.08% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6558>	2020-10-13 12:47:20 +00:00
Timur Kristóf	f11f4a2a4d	nir: Add ability to count primitives per stream. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Timur Kristóf	aac5adc3c2	nir: Count vertices per stream. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Timur Kristóf	2be99012e9	nir: Add ability to count emitted GS primitives. Add an option to nir_lower_gs_intrinsics which tells it to track the number of emitted primitives, not just vertices. Additionally, also make it per-stream. Also rename the set_vertex_count intrinsic to set_vertex_and_primitive_count. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Greg V	73dd86c421	radv,anv: use CLOCK_MONOTONIC_FAST when CLOCK_MONOTONIC_RAW is undefined CLOCK_MONOTONIC_FAST is a similar clock from FreeBSD. Acked-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6995>	2020-10-09 09:49:20 +00:00
Nanley Chery	290f3fe897	Revert "anv: Add driconf option to disable compression for 16bpp format" This reverts commit `bcfec61d1e`. The previous patch fixed the underlying issue that the above commit was actually working around. It turns out that the previously observed performance regression was due to invalid aux-map entries for multi-layer HiZ+CCS buffers. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7046>	2020-10-08 20:47:24 +00:00
Nanley Chery	cce6fc3b5c	anv: Enable multi-layer aux-map init for HIZ+CCS Fixes rendering corruption in the shadowmappingcascade Sascha Willems Vulkan demo. To see the corruption, I adjusted the demo options as follows: 1. Enable "Display depth map" 2. Set "Split lambda" to 0.100 3. Make "Cascade" non-zero. Fixes: `80ffbe915f` ("anv: Add support for HiZ+CCS") Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7046>	2020-10-08 20:47:24 +00:00
Jason Ekstrand	b54d37a867	anv: Use the data cache for indirect UBO pulls on Gen8+ On Gen7, the data cache is pretty terrible so we'd rather avoid it there. On Gen8+, it should be fine and is less likely to conflict with texturing so we should get less cache thrashing there. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3932>	2020-10-08 01:17:11 -05:00
Jason Ekstrand	89f3d116a8	anv: Plumb the device into *bits_for_access_flags Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3932>	2020-10-08 01:17:11 -05:00
Jason Ekstrand	3a33560681	anv: Use format_for_descriptor_type for descriptor buffers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3932>	2020-10-08 01:17:11 -05:00
Jason Ekstrand	d2185f0c3f	anv: Add a device parameter to format_for_descriptor_type Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3932>	2020-10-08 01:17:11 -05:00
Jason Ekstrand	3d22de05ca	intel/fs: Add an option to use dataport messages for UBOs Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3932>	2020-10-08 01:17:06 -05:00
Jason Ekstrand	0d462dbee5	intel/fs: Add an alignment to VARYING_PULL_CONSTANT_LOAD_LOGICAL Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3932>	2020-10-08 01:14:46 -05:00
Lionel Landwerlin	caea5a6a20	intel/dev: fix 32bit build issue Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7049>	2020-10-08 05:42:31 +00:00
Jason Ekstrand	dd9c34a907	intel/nir: Lower load_global_constant in lower_mem_access_bit_sizes It's identical to nir_intrinsic_load_global except that it works on data that's guaranteed to be constant throughout the shader invocation. Fixes: `ff2f44d865` "intel/fs: Implement nir_intrinsic_load_global_constant" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6872>	2020-10-08 03:56:01 +00:00
Jason Ekstrand	fd04f858b0	intel/nir: Don't try to emit vector load_scratch instructions In `53bfcdeecf`, we added load/store_scratch instructions which deviate a little bit from most memory load/store instructions in that we can't use the normal untyped read/write instructions which can read and write up to a vec4 at a time. Instead, we have to use the DWORD scattered read/write instructions which are scalar. To handle this, we added code to brw_nir_lower_mem_access_bit_sizes to cause them to be scalarized. However, one case was missing: the load-as-larger-vector case. In this case, we take small bit-sized constant-offset loads replace it with a 32-bit load and shuffle the result around as needed. For scratch, this case is much trickier to get right because it often emits vec2 or wider which we would then have to lower again. We did this for other load and store ops because, for lower bit-sizes we have to scalarize thanks to the byte scattered read/write instructions being scalar. However, for scratch we're not losing as much because we can't vectorize 32-bit loads and stores either. It's easier to just disallow it whenever we have to scalarize. Fixes: `53bfcdeecf` "intel/fs: Implement the new load/store_scratch..." Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6872>	2020-10-08 03:56:01 +00:00
Jason Ekstrand	9df9f940f0	iris: Add support for load_work_dim as a system value Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7047>	2020-10-07 16:01:31 -05:00

1 2 3 4 5 ...

5940 commits