fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 07:08:04 +02:00

Author	SHA1	Message	Date
Kenneth Graunke	dff3ab5c04	iris: Avoid unnecessary resolves on transfer maps We were always resolving the buffer as if we were accessing it via CPU maps, which don't understand any auxiliary surfaces. But we often copy to a temporary using BLORP, which understands compression just fine. So we can avoid the resolve, and accelerate the copy as well. Fixes: `9d1334d2a0` ("iris: Use copy_region and staging resources to avoid transfer stalls") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> (cherry picked from commit `2d79925034`)	2019-09-04 11:54:10 -07:00
Kenneth Graunke	d78f39eba0	iris: Drop copy format hacks from copy region based transfer path. This doesn't work for compressed formats, as the source texture and temporary texture would have different block sizes. (Forcing the driver to always take the GPU path would expose the bug.) Instead, just use the source format for the temporary, and let blorp_copy deal with overrides. The one case where we can't do this is ASTC, because isl won't let us create a linear ASTC surface. Fall back to the CPU paths there for now. Fixes: `9d1334d2a0` ("iris: Use copy_region and staging resources to avoid transfer stalls") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> (cherry picked from commit `136629a1e3`)	2019-09-04 11:54:05 -07:00
Kenneth Graunke	1be5f26cfb	iris: Update fast clear colors on Gen9 with direct immediate writes. Gen11 stores the fast clear color in an "indirect clear buffer", as a packed pixel value. Gen9 hardware stores it as a float or integer value, which is interpreted via the format. We were trying to store that in a buffer, for similarity with Icelake, and MI_COPY_MEM_MEM it from there to the actual SURFACE_STATE bytes where it's stored. This unfortunately doesn't work for blorp_copy(), which does bit-for-bit copies, and overrides the format to a CCS-compatible UINT format. This causes the clear color to be interpreted in the overridden format. Normally, we provide the clear color on the CPU, and blorp_blit.c:2611 converts it to a packed pixel value in the original format, then unpacks it in the overridden format, so the clear color we use expands to the bits we originally desired. However, BLORP doesn't support this pack/unpack with an indirect clear buffer, as it would need to do the math on the GPU. On Gen11+, it isn't necessary, as the hardware does the right thing. This patch changes Gen9 to stop using an indirect clear buffer and simply do PIPE_CONTROLs with post-sync write immediate operations to store the new color over the surface states for regular drawing. BLORP continues streaming out surface states, and handles fast clear colors on the CPU. Fixes: `53c484ba8a` ("iris: blorp using resolve hooks") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> (cherry picked from commit `1cd13ccee7`)	2019-09-04 11:52:53 -07:00
Kenneth Graunke	14588c0727	iris: Fix broken aux.possible/sampler_usages bitmask handling For renderable surfaces, we allocate SURFACE_STATEs for each bit in res->aux.possible_usages. Sampler views use res->aux.sampler_usages. When pinning buffers, we call surf_state_offset_for_aux() to calculate the offset to the desired surface state. surf_state_offset_for_aux() took an aux_modes parameter, which should be one of those two fields. However...it was not using that parameter. It always used the broader res->aux.possible_usages field directly. One of the callers, update_clear_value(), was passing incorrect masks for this parameter. It iterated through the bits in order, using u_bit_scan(), which destructively modifies the mask. So each time we called it, the count of bits before our selected mode was 0, which would cause us to always update the SURFACE_STATE for ISL_AUX_USAGE_NONE, rather than updating each in turn. This was hidden by the earlier bug where surf_state_offset_for_aux() ignored the parameter. Fixes: `7339660e80` ("iris: Add aux.sampler_usages.") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> (cherry picked from commit `117a0368b0`)	2019-09-04 11:52:46 -07:00
Kenneth Graunke	973d58e9b3	iris: Replace devinfo->gen with GEN_GEN This is genxml, we can compile out this code. Fixes: `2660667284` ("iris/gen8: Re-emit the SURFACE_STATE if the clear color changed.") Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> (cherry picked from commit `f6c44549ee`)	2019-09-04 11:52:41 -07:00
Alyssa Rosenzweig	58acce6dd9	pan/midgard: Fix writeout combining shader-db regression in the scheduler. Fixes: `dff4986b1a` ("pan/midgard: Emit store_output branch just-in-time") total bundles in shared programs: 2055 -> 2019 (-1.75%) bundles in affected programs: 1055 -> 1019 (-3.41%) helped: 36 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.35% max: 20.00% x̄: 6.71% x̃: 5.16% 95% mean confidence interval for bundles value: -1.00 -1.00 95% mean confidence interval for bundles %-change: -8.45% -4.97% Bundles are helped. total quadwords in shared programs: 3444 -> 3408 (-1.05%) quadwords in affected programs: 1897 -> 1861 (-1.90%) helped: 36 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.19% max: 14.29% x̄: 3.97% x̃: 2.99% 95% mean confidence interval for quadwords value: -1.00 -1.00 95% mean confidence interval for quadwords %-change: -5.08% -2.86% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> (cherry picked from commit `272ce6f5a7`)	2019-09-04 11:52:36 -07:00
Bas Nieuwenhuizen	bd0300f8ef	radv: Disable NGG for geometry shaders. A bunch of remaining issues including some that affect users. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111248 Fixes: `ee21bd7440` "radv/gfx10: implement NGG support (VS only)" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `c037fe5ad1`)	2019-09-04 11:52:30 -07:00
Lionel Landwerlin	4385e6cf02	util/timespec: use unsigned 64 bit integers for nsec values We added this utility for vulkan where all timeouts are given as uint64_t values. We can switch from signed to unsigned as this is the only user and if we ever deal with signed integers somewhere else we'll have to be careful to use the corresponding timespec_(add\|sub)_msec and always pass absolute values. v2: Forgot to drop the test calling add_nsec() with a negative number Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Juan A. Suarez Romero <jasuarez@igalia.com> Fixes: `d2d70c3bb5` ("util: add a timespec helper") Acked-by: Daniel Stone <daniels@collabora.com> (cherry picked from commit `5833f43305`)	2019-09-04 11:52:26 -07:00
Tapani Pälli	18511e3f5b	iris/android: fix build and link with libmesa_intel_perf Fixes: `0fd4359733` "iris/perf: implement routines to return counter info" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (cherry picked from commit `728ebcdec2`)	2019-09-04 11:52:09 -07:00
Samuel Pitoiset	7c615873e5	ac: fix exclusive scans on GFX8-GFX9 This fixes a regression introduced with scan&reduce operations on GFX10. Note that some subgroups CTS still fail on GFX10 but I assume it's a different issue. This fixes dEQP-VK.subgroups.arithmetic..subgroupexclusive. Fixes: `227c29a80d` "amd/common/gfx10: implement scan & reduce operations" Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `2d9f401a83`)	2019-09-04 11:52:02 -07:00
Tapani Pälli	6af303f6fc	util: fix os_create_anonymous_file on android Commit fixes current crashes with Vulkan applications on Android. Fixes: `c0376a1234` "util: add anon_file.h for all memfd/temp file usage" Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> (cherry picked from commit `ce8fd042a5`)	2019-09-04 11:51:55 -07:00
Kenneth Graunke	844fbc5c42	gallium/noop: Implement resource_get_param v2: Pass through to oscreen rather than faking it (review from Marek). Fixes: `0346b70083` ("gallium/screen: Add pipe_screen::resource_get_param") Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `bc844d92ce`)	2019-09-04 11:51:50 -07:00
Kenneth Graunke	813ed8629e	gallium/rbug: Wrap resource_get_param if available Fixes: `0346b70083` ("gallium/screen: Add pipe_screen::resource_get_param") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `f02d1a0b75`)	2019-09-04 11:51:44 -07:00
Kenneth Graunke	6e6f137a4e	gallium/trace: Wrap resource_get_param if available Fixes: `0346b70083` ("gallium/screen: Add pipe_screen::resource_get_param") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `c43a44791b`)	2019-09-04 11:51:39 -07:00
Kenneth Graunke	07760c1c9e	gallium/ddebug: Wrap resource_get_param if available Fixes: `0346b70083` ("gallium/screen: Add pipe_screen::resource_get_param") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `0e6b573ae5`)	2019-09-04 11:51:31 -07:00
Jose Maria Casanova Crespo	0504bff354	mesa: recover target_check before get_current_tex_objects At compressed_tex_sub_image we only can obtain the tex_object after compressed_subtexture_target_check is validated for TEX_MODE_CURRENT. So if the target is wrong the error is raised to the user. This completes the fix for the regression introduced on "mesa: refactor compressed_tex_sub_image function" of the pending failing tests: dEQP-GLES3.functional.negative_api.texture.compressedtexsubimage3d dEQP-GLES31.functional.debug.negative_coverage.get_error.texture.compressedtexsubimage3d v2: Fix warning that texObj might be used uninitialized (Gert Wollny) Fixes: `7df233d68d` ("mesa: refactor compressed_tex_sub_image function") Reviewed-By: Gert Wollny <gert.wollny@collabora.com> (cherry picked from commit `74a7e3ed3b`)	2019-09-04 11:51:25 -07:00
Samuel Pitoiset	637a9cbd3b	radv: force enable VK_AMD_shader_ballot for Wolfenstein Youngblood This gives a nice boost, +20% at this time on my Vega 56. Shader ballot should be enabled by default at some point but it reduces performance a bit (-6%) with Wolfeinstein II. Enable it only for Youngblood at the moment, like what we did for Talos in the past. As a bonus point, it gets rid of some minor artifacts that only happens when ballot is disabled for some reasons. Cc: 19.2 <mesa-stable@lists.freedesktop.org Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `a6ad9e8ccf`)	2019-09-04 11:51:15 -07:00
Samuel Pitoiset	690f050608	radv: add a new debug option called RADV_DEBUG=noshaderballot Shader ballot will be enabled by default for Wolfenstein Youngblood. This follows what we did for sisched. Cc: 19.2 <mesa-stable@lists.freedesktop.org Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `f202ac27a9`)	2019-09-04 11:50:59 -07:00
Samuel Pitoiset	3ab1368c4f	radv: allow to enable VK_AMD_shader_ballot only on GFX8+ Scans aren't implemented on SI/CIK. Cc: 19.2 <mesa-stable@lists.freedesktop.org> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `e73d863a66`)	2019-09-04 11:50:53 -07:00
Danylo Piliaiev	71daf2ef67	nir/loop_unroll: Prepare loop for unrolling in wrapper_unroll Without loop_prepare_for_unroll loops are losing phis. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111411 Fixes: `5db98195` "nir: add loop unroll support for wrapper loops" Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> (cherry picked from commit `84b3ef6a96`)	2019-09-04 11:50:48 -07:00
Bas Nieuwenhuizen	614def1a89	radv: Emit VGT_GS_ONCHIP_CNTL for tess on GFX10. Otherwise hangs are possible. This register was already set for GS and NGG. Fixes: `5eaed7ecfc` "radv/gfx10: enable support for NAVI10, NAVI12 and NAVI14" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `e04761d0f9`)	2019-09-04 11:50:42 -07:00
Bas Nieuwenhuizen	55334521f7	radv: Use correct vgpr_comp_cnt for VS if both prim_id and instance_id are needed. Should take the max of the 2. Fixes: `ea337c8b7e` "radv/gfx10: fix VS input VGPRs with the legacy path" Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> (cherry picked from commit `2e763f7c87`)	2019-09-04 11:50:37 -07:00
Ilia Mirkin	8ee40f6b63	gallium/vl: use compute preference for all multimedia, not just blit The compute paths in vl are a bit AMD-specific. For example, they (on nouveau), try to use a BGRX8 image format, which is not supported. Fixing all this is probably possible, but since the compute paths aren't in any way better, it's difficult to care. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111213 Fixes: `9364d66cb7` (gallium/auxiliary/vl: Add video compositor compute shader render) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `958390a9bf`)	2019-09-04 11:50:32 -07:00
Marek Olšák	25de459644	radeonsi: consolidate determining VGPR_COMP_CNT for API VS	2019-08-27 16:10:40 -04:00
Marek Olšák	5d7754017c	radeonsi/gfx10: set PA_CL_VS_OUT_CNTL with CONTEXT_REG_RMW to fix edge flags We need two different values of the register, one for NGG and one for legacy, in order to fix edge flags for the legacy pipeline. Passing the ngg flag to emit_clip_regs would be too complicated, so CONTEXT_REG_RMW is used for partial register updates.	2019-08-27 16:10:40 -04:00
Marek Olšák	d23bf14d44	radeonsi/gfx10: remove incorrect ngg/pos_writes_edgeflag variables It varies depending on si_shader_key::as_ngg.	2019-08-27 16:10:40 -04:00
Marek Olšák	514eb1587e	radeonsi: add PKT3_CONTEXT_REG_RMW	2019-08-27 16:10:40 -04:00
Marek Olšák	a935da7cef	winsys/amdgpu+radeon: process AMD_DEBUG in addition to R600_DEBUG	2019-08-27 16:10:40 -04:00
Marek Olšák	b9330a6189	radeonsi/gfx10: add AMD_DEBUG=nongg	2019-08-27 16:10:40 -04:00
Marek Olšák	0207c318e0	radeonsi/gfx10: finish up Navi14, add PCI ID	2019-08-27 16:10:40 -04:00
Marek Olšák	e09d469622	radeonsi/gfx10: always use the legacy pipeline for streamout The best way to prevent GDS hangs is not to use GDS.	2019-08-27 16:10:40 -04:00
Marek Olšák	f208b04dba	radeonsi/gfx10: don't initialize VGT_INSTANCE_STEP_RATE_0 Only gfx9 and older use it to get InstanceID in VGPR1.	2019-08-27 16:10:40 -04:00
Marek Olšák	c0716446a4	radeonsi/gfx10: fix InstanceID for legacy VS+GS	2019-08-27 16:10:40 -04:00
Marek Olšák	4d3097f36a	radeonsi/gfx10: add as_ngg variant for VS as ES to select Wave32/64 Legacy GS only works with Wave64.	2019-08-27 16:10:40 -04:00
Marek Olšák	a3a266807e	radeonsi/gfx10: create the GS copy shader if using legacy streamout	2019-08-27 16:10:40 -04:00
Marek Olšák	beea2dee8a	radeonsi/gfx10: fix the PRIMITIVES_GENERATED query if using legacy streamout	2019-08-27 16:10:40 -04:00
Marek Olšák	78c603ebf5	radeonsi/gfx10: fix tessellation for the legacy pipeline ported from PAL	2019-08-27 16:10:40 -04:00
Marek Olšák	3dec21a8aa	radeonsi: move some global shader cache flags to per-binary flags	2019-08-27 16:10:40 -04:00
Marek Olšák	6e07ac3343	radeonsi/gfx10: fix the legacy pipeline by storing as_ngg in the shader cache It could load an NGG shader when we want a legacy shader and vice versa.	2019-08-27 16:10:40 -04:00
Emil Velikov	c0b9399d9d	Update version to 19.2.0-rc1 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2019-08-20 23:18:37 +01:00
Erico Nunes	71fb721ca5	lima/ppir: use ra_get_best_spill_node to select spill node ra_get_best_spill_node is what other users of the mesa register allocator use. Switching to it now also fixes an infinite loop issue with ppir regalloc with the ppir control flow patchset, and also provides a small gain over the previous herusitic on number of spilled nodes testing with shader-db. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-08-20 21:16:02 +00:00
Eric Anholt	c1dc84e71d	tgsi: Remove unused tgsi_check_soa_dependencies(). Acked-by: Eric Engestrom <eric@engestrom.ch> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2019-08-20 13:31:13 -07:00
Eric Anholt	4ebe6b2e72	tgsi: Drop the SSE2 constants setup that's been dead code since 2011. The SSE2 executor was removed in `4eb3225b38` ("Remove tgsi_sse2.") Acked-by: Eric Engestrom <eric@engestrom.ch> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2019-08-20 13:31:13 -07:00
Eric Anholt	98c58355d3	tgsi: drop a stale comment This was fixed in `912ed84f83` ("tgsi: move to using vector for system values.") Acked-by: Eric Engestrom <eric@engestrom.ch> Reviewed-By: Gert Wollny <gert.wollny@collabora.com>	2019-08-20 13:31:13 -07:00
Eric Anholt	553cd82d64	gitlab-ci: Enable the GLES2/3 CTS on softpipe. The GLES2 CTS takes about 8 minutes of total runtime (at parallel 4 is ~2 minutes in the test stage if runners are free), while GLES3 takes about 25. Since the GLES3 run is pretty expensive, just do a cheap touch test of 1 out of every 10 tests in the test list on MRs, until we can get the runtime down. v2: Drop the full run for now until we can bring runtime down or bring up a dedicated mesa runner. Reviewed-by: Eric Engestrom <eric@engestrom.ch> (v1) Reviewed-By: Gert Wollny <gert.wollny@collabora.com> (v1)	2019-08-20 13:31:13 -07:00
Jose Maria Casanova Crespo	6c904773fe	mesa: reverse no_error on compressed_tex_sub_image for TEX_MODE_CURRENT This fixes the regression introduced on "mesa: refactor compressed_tex_sub_image function" that started to crash KHR-GLES2.texture_3d.compressed_texture.negative_compressed_tex_sub_image Fixes: `7df233d68d` ("mesa: refactor compressed_tex_sub_image function") Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-20 20:45:21 +01:00
Adam Jackson	b283919398	glx: Eliminate glx_config::{rgb,float,colorIndex}Mode These are redundant with glx_config::renderType, let's just use that consistently.	2019-08-20 14:05:07 -04:00
Adam Jackson	74ca87e4bc	glx: Remove unused glx_config::pixmapMode Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-20 14:05:03 -04:00
Adam Jackson	35fc7bdf0e	glx: convert glx_config_create_list to one big calloc Simpler, less failure prone, less malloc overhead, what's not to like. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-20 14:05:01 -04:00
Adam Jackson	97d58eabcc	glx: convert a malloc+memset to calloc Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-08-20 14:04:59 -04:00

1 2 3 4 5 ...

114611 commits