fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-24 12:48:13 +02:00

Author	SHA1	Message	Date
Caio Oliveira	be102fede4	intel/compiler: Fix missing tie-breaker in brw_nir_analyze_ubo_ranges() ordering code Per Ken suggestion, use ascending order for the start offset. Fixes: `6d28c6e52c` ("i965: Select ranges of UBO data to be uploaded as push constants.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19731> (cherry picked from commit `494e2edb90`)	2022-11-17 14:05:03 +00:00
Caio Oliveira	6e4a46e2a8	intel/compiler: Fix dynarray usage in intel_clc The code builds up the dynamic array of objects (spirv_objs) and collect pointers to each of them into another dynamic array (spirv_ptr_objs). If the growth of the first array cause a reallocation, it is possible that the previous pointers end up invalid. Fixes: `77e929a527` ("intel/clc: allow multiple CL files to be compiled together") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19730> (cherry picked from commit `9fd1d47aa0`)	2022-11-17 14:05:03 +00:00
Lionel Landwerlin	97a017ed25	anv: bump pool bucket max allocation size Age of Empire IV generates a shader of ~2.3Mb on DG2 which is above the limit we currently have. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19669> (cherry picked from commit `ae76bba34a`)	2022-11-17 14:05:03 +00:00
Tapani Pälli	fc57b9ac44	anv: setup stage bitmask for Wa_22011440098 Fixes: `40b66a4499` ("anv, iris: Add Wa_22011440098 for DG2") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19636> (cherry picked from commit `ecd4517560`)	2022-11-17 14:05:03 +00:00
Lionel Landwerlin	d7ca6ccee2	anv: split internal surface states from descriptors On Intel HW we use the same mechanism for internal operations surfaces as well as application surfaces (VkDescriptor). This change splits the surface pool in 2, one part dedicated to internal allocations, the other to application VkDescriptors. To do so, the STATE_BASE_ADDRESS::SurfaceStateBaseAddress points to a 4Gb area, with the following layout : - 1Gb of binding table pool - 2Gb of internal surface states - 1Gb of bindless surface states That way any entry from the binding table can refer to both internal & bindless surface states but none of the driver allocations interfere with the allocation of the application. Based off a change from Sviatoslav Peleshko. v2: Allocate image view null surface state from bindless heap (Sviatoslav) Removed debug stuff (Sviatoslav) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7110 Cc: mesa-stable Tested-by: Sviatoslav Peleshko <sviatoslav.peleshko@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19275> (cherry picked from commit `4ceaed7839`)	2022-11-17 14:05:03 +00:00
Lionel Landwerlin	91dfc02570	anv: fixup invalid enum for nir environment Also switching away from PIPE_ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `8c4c4c3ee1` ("anv: Add softtp64 workaround") Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19638> (cherry picked from commit `68fd9d2829`)	2022-11-17 14:05:02 +00:00
Jason Ekstrand	7701bf1228	intel: Don't cross DWORD boundaries with byte scratch load/store The back-end swizzles dwords so that our indirect scratch messages match the memory layout of spill/fill messages for better cache coherency. The swizzle happens at a DWORD granularity. If a read or write crosses a DWORD boundary, the first bit will get correctly swizzled but whatever piece lands in the next dword will not because the scatter instructions assume sequential addresses for all bytes. For DWORD writes, this is handled naturally as part of scalarizing. For smaller writes, we need to be sure that a single write never escapes a dword. Fixes: `fd04f858b0` ("intel/nir: Don't try to emit vector load_scratch instructions") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7364 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19580> (cherry picked from commit `25c180b509`)	2022-11-09 21:22:06 +00:00
Jason Ekstrand	d580ab8898	intel/lower_mem_access_bit_sizes: Compute alignments automatically Because dup_mem_intrinsic() retains the SSA offset from the original intrinsic and only modifies it by adding a constant, we can compute the alignment based on the original alignment and the constant offset. This is both easier and more accurate. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19580> (cherry picked from commit `85685cf932`)	2022-11-09 21:22:06 +00:00
Lionel Landwerlin	e54150d6e1	anv: fix missing VkPhysicalDeviceExtendedDynamicState3PropertiesEXT handling Fixes: `13c422e1b2` ("anv: toggle on EXT_extended_dynamic_state3") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19573> (cherry picked from commit `97b3dd34c1`)	2022-11-09 21:22:06 +00:00
Ian Romanick	87e7794d7b	intel/fs: Fix constant propagation into 32x16 integer multiplication Don't copy propagate the constant in situations like mov(8) g8<1>D 0x7fffffffD mul(8) g16<1>D g8<8,8,1>D g15<16,8,2>W On platforms that only have a 32x16 multiplier, this will result in lowering the multiply to mul(8) g15<1>D g14<8,8,1>D 0xffffUW mul(8) g16<1>D g14<8,8,1>D 0x7fffUW add(8) g15.1<2>UW g15.1<16,8,2>UW g16<16,8,2>UW On Gfx8 and Gfx9, which have the full 32x32 multiplier, it results in mul(8) g16<1>D g15<16,8,2>W 0x7fffffffD Volume 2a of the Skylake PRM says: When multiplying a DW and any lower precision integer, the DW operand must on src0. See also https://gitlab.freedesktop.org/mesa/crucible/-/merge_requests/104. Previous to INTEL_shader_integer_functions2 (in Vulkan or OpenGL), I don't think it would be possible to create a situation where this could occur. I discovered this via some optimizations that can determine that the non-constant source must be able to fit in 16-bits. The case listed above came from piglit's "ext_transform_feedback-order arrays points" with those optimizations in place. No shader-db or fossil-db changes on any Intel platform. Fixes: `de6c0f8487` ("intel/fs: Implement support for NIR opcodes for INTEL_shader_integer_functions2") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17718> (cherry picked from commit `db20412168`)	2022-11-09 21:22:06 +00:00
Mauro Rossi	15aae04df5	hasvk: fix android build and reported API version anv_device.c for vulkan.intel_hasvk requires changes to be compiled and behave correctly for android target Fixes the following building error: FAILED: src/intel/vulkan_hasvk/libanv_hasvk_common.a.p/anv_device.c.o ... ../src/intel/vulkan_hasvk/anv_device.c:143:19: error: use of undeclared identifier 'ANV_API_VERSION_1_3' *pApiVersion = ANV_API_VERSION_1_3; ^ ../src/intel/vulkan_hasvk/anv_device.c:1822:44: error: use of undeclared identifier 'ANV_API_VERSION_1_3' .apiVersion = pdevice->use_softpin ? ANV_API_VERSION_1_3 : ANV_API_VERSION_1_2, ^ ../src/intel/vulkan_hasvk/anv_device.c:1822:66: error: use of undeclared identifier 'ANV_API_VERSION_1_2' .apiVersion = pdevice->use_softpin ? ANV_API_VERSION_1_3 : ANV_API_VERSION_1_2, ^ 3 errors generated. Cc: "22.3" mesa-stable Fixes: `00eefdc` ("hasvk: stop advertising Vk 1.3 on non-softpin") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19452> (cherry picked from commit `814b822fe0`)	2022-11-09 21:22:05 +00:00
Lionel Landwerlin	32a7d9b892	anv: Reduce RHWO optimization (Wa_1508744258) Implement Wa_1508744258: Disable RHWO by setting 0x7010[14] by default except during resolve pass. Disable the RCC RHWO optimization at all times except when resolving single sampled color surfaces. v2: Move stalling to genX(cmd_buffer_apply_pipe_flushes) for clarity (Mark) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19450> (cherry picked from commit `ba0336ab3f`)	2022-11-09 21:22:05 +00:00
Marcin Ślusarz	dcaaeb56ef	anv: program 3DSTATE_MESH_DISTRIB with the recommended values It improves performance of vk_meshlet_cadscene on A770. Fixes: `f083df8710` ("anv: update task/mesh distribution with the recommended values") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19412>	2022-11-02 08:56:53 +00:00
Marcin Ślusarz	d1d2dee970	anv: set 3DSTATE_[MESH\|TASK]_CONTROL.MaximumNumberofThreadGroups Documentation is worded in a confusing way, which may be understood that we don't have to set this field to get good results. MESH part of this commit improves performance of vk_meshlet_cadscene by a factor of 2 on A380. Fixes: `ef04caea9b` ("anv: Implement Mesh Shading pipeline") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19412>	2022-11-02 08:56:53 +00:00
Marcin Ślusarz	11612d81b7	intel/genxml: fix width of 3DSTATE_TASK_CONTROL.MaximumNumberofThreadGroups Fixes: `3567d47f3e` ("intel/genxml: Inline the BODY structs into the instructions") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19412>	2022-11-02 08:56:53 +00:00
Illia Abernikhin	aa4ac5ff8b	utils: Merge util/debug.* into util/u_debug.* and remove util/debug.* Rename env_var_as_unsigned() -> debug_get_num_option(), because duplicate Rename env_var_as_bool() -> debug_get_bool_option(), because duplicate Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7177 Signed-off-by: Illia Abernikhin <illia.abernikhin@globallogic.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19336>	2022-11-02 07:25:39 +00:00
Kenneth Graunke	fde99747e9	nir: Drop infer_non_readable option for nir_opt_access() Everybody sets it to true now, and the only reason for the option to exist was to work around a bug that's now been fixed. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19162>	2022-11-02 03:42:04 +00:00
Kenneth Graunke	88756cee8d	intel/compiler: Run nir_opt_large_constants before scalarizing consts nir_opt_large_constants balks at seeing a store_deref of a variable where the source is a vecN operation of multiple load_consts, and thinks that isn't a constant, so it should not bother promoting it. Unfortunately, we were running nir_lower_load_const_to_scalar before nir_opt_large_constants, so this prevented a ton of constant promotion. This commit /used to help/ some shaders in shader-db. Presumably since !16770 landed, those shaders were already helped. Currently ther are no shader-db changes on any Intel platform. Fossil-db results: All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 141998227 -> 141421756 (-0.4%) Instructions helped: 12515 Instructions hurt: 237 SENDs in all programs: 7437925 -> 7468033 (+0.4%) SENDs hurt: 12806 Cycles in all programs: 9161655753 -> 9132869800 (-0.3%) Cycles helped: 10163 Cycles hurt: 2637 Spills in all programs: 19977 -> 18678 (-6.5%) Spills helped: 384 Spills hurt: 40 Fills in all programs: 32863 -> 31396 (-4.5%) Fills helped: 385 Fills hurt: 42 Lost: 1 Lots of Shadow of the Tomb Raider fragment shaders and Batman Arkham Origins vertex shaders were hurt for SENDs in this commit. A couple Aztec Ruins compute shaders and Spaceship shaders (multiple stages) were also hurt. All of the shaders hurt for spills or fills were Spaceship compute shaders. Nearly all of the shaders helped were Shadow of the Tomb Raider fragmenet shaders. One Spaceship shader was reall, REALLY helped: Spills helped fossils/fossil-db/Spaceship.run.9f90a2a226fcc57f.1.foz/0b507d3abe2e3c28/compute: 321 -> 13 (-96.0%) Fills helped fossils/fossil-db/Spaceship.run.9f90a2a226fcc57f.1.foz/0b507d3abe2e3c28/compute: 279 -> 21 (-92.5%) Overall this seems like an improvement, but we may want to actually run these few benchmarks before landing. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16539>	2022-11-01 14:55:21 -07:00
Nanley Chery	0fa540ef61	iris: Reduce use of RHWO optimization (Wa_1508744258) Implement Wa_1508744258: Disable RHWO by setting 0x7010[14] by default except during resolve pass. Disable the RCC RHWO optimization at all times except when resolving single sampled color surfaces. MCS partial resolves are done via software (i.e., not via a HW bit) and so are not expected to need this workaround. Reviewed-by: Mark Janes <markjanes@swizzler.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19360>	2022-10-31 23:26:06 +00:00
Tapani Pälli	7cfd0e8d31	hasvk: remove some unused functions Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19368>	2022-10-31 06:59:36 +00:00
Tapani Pälli	f9176d9b2c	anv: remove some unused functions Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19368>	2022-10-31 06:59:36 +00:00
Lionel Landwerlin	6b52834ece	anv: remove shader fp64 inspection after parsing Unfortunately some crucible tests are using all floating point widths in a single shader and specializing a variable to select what code path to use for a particular supported floating point width. This is reporting errors in the validation layers. Remove the validation for now. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes `8c4c4c3ee1` ("anv: Add softtp64 workaround") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19401>	2022-10-30 22:16:52 +00:00
José Roberto de Souza	bb9f66800c	intel/perf: Use intel_device_info functions to compute subslice and eu totals Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19359>	2022-10-28 20:12:07 +00:00
Mykhailo Skorokhodov	a954933f4f	drirc: Add fp64_workaround_enabled option Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18854>	2022-10-28 10:08:50 +00:00
Mykhailo Skorokhodov	8c4c4c3ee1	anv: Add softtp64 workaround Pass float64.glsl into nir_lower_doubles() resolves the problem on ICL/TGL when the shader uses float64, but the device doesn't support that type. Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18854>	2022-10-28 10:08:50 +00:00
Mykhailo Skorokhodov	829d74b2f2	anv/meson: Add float64_spv_h custom target Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18854>	2022-10-28 10:08:50 +00:00
Lionel Landwerlin	920aed2121	intel/compiler: don't allocate compaction arrays on the stack Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7569 Cc: mesa-stable Reviewed-by: Luis Felipe Strano Moraes <luis.strano@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19339>	2022-10-28 07:10:58 +00:00
Matt Turner	3ef88cd0a2	intel/dev: Set display_ver = 13 on all ADL/RPL/DG2 display_ver doesn't seem to be used anywhere, but if that were to change, we'd want this to be consistent. Fixes: `c746bf4c5c` ("intel/dev: Add display_ver and set adl-p to 13") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19354>	2022-10-28 01:31:44 +00:00
Lionel Landwerlin	e59c4a912b	intel/fs: use fs implementation of dump_instructions This specialized version prints out the liveness count as well as the maximum liveness count. It was eye opening when seeing the max liveness jump after lowering of packing instructions which should not have changed the count. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18657>	2022-10-27 21:05:00 +00:00
Lionel Landwerlin	e5dfff0946	intel/fs: reduce liveness of variables in lowering passes When lowering a single instruction with a destination VGRF to 2 or more, the VGRF is now considered partially written by each generated instruction and that increases its liveness especially in loops. Thus potentially increasing the number of spills/fills due to register allocation. Putting an UNDEF instruction in front of the lowered instructions allows the IR to limit the liveness of the VGRF, reducing register pressure. This has a pretty dramatic effect on spills/fills for RT shaders. Here the stats on Q2RTX shaders on DG2 (wipping out any spills/fills due to register allocation) : Instructions in all programs: 26150 -> 24955 (-4.6%) SENDs in all programs: 1148 -> 1148 (+0.0%) Loops in all programs: 4 -> 4 (+0.0%) Cycles in all programs: 392179 -> 332787 (-15.1%) Spills in all programs: 132 -> 116 (-12.1%) Fills in all programs: 262 -> 154 (-41.2%) Shader-db results on TGL : total instructions in shared programs: 21158140 -> 21158377 (<.01%) instructions in affected programs: 76629 -> 76866 (0.31%) helped: 18 HURT: 20 helped stats (abs) min: 1 max: 60 x̄: 18.89 x̃: 12 helped stats (rel) min: 0.21% max: 3.61% x̄: 1.02% x̃: 0.77% HURT stats (abs) min: 1 max: 79 x̄: 28.85 x̃: 18 HURT stats (rel) min: 0.04% max: 2.81% x̄: 1.13% x̃: 0.79% 95% mean confidence interval for instructions value: -4.82 17.30 95% mean confidence interval for instructions %-change: -0.34% 0.57% Inconclusive result (value mean confidence interval includes 0). total loops in shared programs: 5753 -> 5753 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total cycles in shared programs: 798856834 -> 798870688 (<.01%) cycles in affected programs: 6208395 -> 6222249 (0.22%) helped: 22 HURT: 17 helped stats (abs) min: 2 max: 8794 x̄: 1438.18 x̃: 782 helped stats (rel) min: 0.05% max: 2.28% x̄: 0.63% x̃: 0.44% HURT stats (abs) min: 2 max: 19178 x̄: 2676.12 x̃: 1358 HURT stats (rel) min: 0.04% max: 23.49% x̄: 2.25% x̃: 0.71% 95% mean confidence interval for cycles value: -952.19 1662.65 95% mean confidence interval for cycles %-change: -0.64% 1.90% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 4078 -> 4066 (-0.29%) spills in affected programs: 40 -> 28 (-30.00%) helped: 2 HURT: 0 total fills in shared programs: 2856 -> 2832 (-0.84%) fills in affected programs: 127 -> 103 (-18.90%) helped: 2 HURT: 0 total sends in shared programs: 998554 -> 998554 (0.00%) sends in affected programs: 0 -> 0 helped: 0 HURT: 0 LOST: 0 GAINED: 0 Total CPU time (seconds): 2346.06 -> 2304.80 (-1.76%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18657>	2022-10-27 21:05:00 +00:00
Lionel Landwerlin	dd6d40429b	intel/fs: make split_virtual_grfs deal with partial undefs v2: fix up UNDEFs instructions (Curro) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18657>	2022-10-27 21:05:00 +00:00
Lionel Landwerlin	14b99df7d9	intel/fs: require UNDEFs register offsets to be aligned to REG_SIZE Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18657>	2022-10-27 21:05:00 +00:00
Alyssa Rosenzweig	941c37c085	nir/lower_idiv: Remove imprecise_32bit_lowering NIR has two implementations of lower_idiv, keyed on the imprecise_32bit_lowering flag. This flag is misleading: the results when setting this flag "imprecise", they're completely wrong for some values. If a backend has a native implementation of umul_high, the correct path isn't that much more expensive. If it doesn't, it's substantially slower for highp integer divison... but in practice, non-constant highp integer division is pretty rare. After a painful migration of the tree, this code path has no more users. Remove it so nobody else gets the bright idea of using it again. Closes: #6555 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19303>	2022-10-27 19:37:14 +00:00
José Roberto de Souza	5bbe3271e6	hasvk: Fix build around intel_measure_state_changed() call Fixes: `2bc82581ad` ("anv: add support for mesh shading in INTEL_MEASURE") Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19355>	2022-10-27 10:31:30 -07:00
Marcin Ślusarz	ea7e331fb8	anv: add mesh shading tracepoints Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19344>	2022-10-27 15:03:28 +00:00
Marcin Ślusarz	63ad8aed41	intel/ds: add new category/stage for draw mesh events Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19344>	2022-10-27 15:03:28 +00:00
Marcin Ślusarz	2bc82581ad	anv: add support for mesh shading in INTEL_MEASURE Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19344>	2022-10-27 15:03:28 +00:00
Lionel Landwerlin	ed7d64962e	intel/common: add detection of protected context support v2: Add anv bits Fix missing i915 extension chaining helper Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8092>	2022-10-27 10:53:18 +00:00
Lionel Landwerlin	4172596382	isl: add new MOCS field for protected buffers Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rodrigo Vivi <rodrigo.vivi@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8092>	2022-10-27 10:53:18 +00:00
Jordan Justen	c238699afa	intel/compiler: Broadcast lower code should check 64-bit int support This will affect MTL which will have fp64 support without int64 support. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Iván Briano <ivan.briano@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19284>	2022-10-27 09:22:09 +00:00
Lionel Landwerlin	2da7ec0db9	intel/clc: assert when libclc shader is not found Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7483 Reviewed-by: Luis Felipe Strano Moraes <luis.strano@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19091>	2022-10-27 08:53:55 +00:00
Iago Toral Quiroga	9deef4cde6	vulkan/runtime: include robustness info when hashing a shader stage Suggested by Jason Ekstrand. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18883>	2022-10-27 08:17:11 +00:00
Lionel Landwerlin	93dbd14ed7	anv: init major/minor before WSI So that we can provide that information to WSI if it asks for it immediately. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19224>	2022-10-26 20:34:15 +00:00
Lionel Landwerlin	324d945589	anv: disable mesh in memcpy We can't have streamout and mesh enabled at the same time. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `ef04caea9b` ("anv: Implement Mesh Shading pipeline") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19323>	2022-10-26 19:55:11 +00:00
Lionel Landwerlin	d766093199	anv: enable localized loads for lower_shader_calls On Q2RTX shaders : Instructions in all programs: 31039 -> 26150 (-15.8%) SENDs in all programs: 1587 -> 1148 (-27.7%) Loops in all programs: 4 -> 4 (+0.0%) Cycles in all programs: 420218 -> 392179 (-6.7%) Spills in all programs: 157 -> 132 (-15.9%) Fills in all programs: 337 -> 262 (-22.3%) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16556>	2022-10-26 12:53:26 +00:00
Lionel Landwerlin	1d10d17817	nir/lower_shader_calls: add an option structure for future optimizations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16556>	2022-10-26 12:53:25 +00:00
José Roberto de Souza	4096c15f4f	hasvk: Nuke code around local memory None of the platforms supported by this driver supports local memory. Signed-off-by: José Roberto de Souza <jose.souza@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19240>	2022-10-23 16:33:20 -07:00
Tapani Pälli	1e51383258	intel/compiler: run nir_opt_idiv_const before nir_lower_idiv Integer div lowering can potentially create a lot of code that is not removed later on. Running const lowering pass first can be used to eliminate that code. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19157>	2022-10-20 15:35:48 +03:00
Luis Felipe Strano Moraes	eb6668ee86	anv: adding parsetab.py to the .gitignore for grl This file is autogenerated by python's ply package and is used during compilation to cache tables built by it for quicker parsing. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19153>	2022-10-20 02:24:39 +00:00
Luis Felipe Strano Moraes	eff1517cd7	anv: added proper handling for input argument in intel_clc That was previously listed on the getopt_long struct but not actually being used. This makes intel_clc argument processing easier as now all of its arguments are handled with getopt and anything after the special argument '--' is passed along to clang to form the final build command. Thanks to Dylan Baker for help with changes to the meson file. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19153>	2022-10-20 02:24:39 +00:00

1 2 3 4 5 ...

8598 commits