fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-04-16 12:50:38 +02:00

Author	SHA1	Message	Date
Jordan Justen	acce7d3460	intel/genxml: Handle field names with different spacing/hyphen If a field name differs slightly between two generations then this change will still add the fields into the same group. For example, these will be treated as equal: * "Software Exception" and "Software  Exception" * "Per Thread" and "Per-Thread" Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-08-28 13:38:28 -07:00
Eric Anholt	973b49386c	freedreno/a6xx: Fix non-mipmap filtering selection. We were clamping the LOD to force non-mipmap filtering, but that means that the HW doesn't get to select between the min and mag filters. Setting MIPFILTER_LINEAR_FAR appears to force non-mipmap filtering. Fixes all failures in dEQP-GLES2.functional.texture.filtering.2d.* Reviewed-by: Rob Clark <robdclark@chromium.org>	2019-08-28 13:14:41 -07:00
Ian Romanick	b418269d7d	intel/compiler: Request bitfield_reverse lowering on pre-Gen7 hardware See the previous commit for the explanation of the Fixes tag. Hurts 21 shaders in shader-db. All of the hurt shaders are in Unreal Engine 4 tech demos. Reviewed-by: Matt Turner <mattst88@gmail.com> Fixes: `7afa26d4e3` ("nir: Add lowering for nir_op_bitfield_reverse.")	2019-08-28 11:39:29 -07:00
Ian Romanick	d3fd1c761a	nir/algrbraic: Don't optimize open-coded bitfield reverse when lowering is enabled This caused a problem on Sandybridge where an open-coded bitfieldReverse() function could be optimized to a nir_op_bitfield_reverse that would generate an unsupported BFREV instruction in the backend. This was encountered in some Unreal4 tech demos in shader-db. The bug was not previously noticed because we don't actually try to run those demos on Sandybridge. The fixes tag is a bit a lie. The actual bug was introduced about 26,000 commits earlier in `371c4b3c48` ("nir: Recognize open-coded bitfield_reverse."). Without the NIR lowering pass, the flag needed to avoid the optimization does not exist. Hopefully nobody will care to fix this on an earlier Mesa release. Reviewed-by: Matt Turner <mattst88@gmail.com> Fixes: `7afa26d4e3` ("nir: Add lowering for nir_op_bitfield_reverse.")	2019-08-28 11:38:51 -07:00
Eric Anholt	4662b70d23	gallium: Don't emit identical endian-dependent pack/unpack code. Reduces the size of the u_format_table.c file by 140k (out of 1.64M) and makes me less confused about endianness in gallium. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-28 10:39:36 -07:00
Eric Anholt	d17ff2f7f1	gallium: Fix big-endian addressing of non-bitmask array formats. The formats affected are: - LA x (16_FLOAT, 32_FLOAT, 32_UINT, 32_SINT) - R8G8B8 x (UNORM, SNORM, SRGB, USCALED, SSCALED, UINT, SINT) - RG/RGB/RGBA x (64_FLOAT, 32_FLOAT, 16_FLOAT, 32_UNORM, 32_SNORM, 32_USCALED, 32_SSCALED, 32_FIXED, 32_UINT, 32_SINT) - RGB/RGBA x (16_UNORM, 16_SNORM, 16_USCALED, 16_SSCALED, 16_UINT, 16_SINT) - RGBx16 x (UNORM, SNORM, FLOAT, UINT, SINT) - RGBx32 x (FLOAT, UINT, SINT) - RA x (16_FLOAT, 32_FLOAT, 32_UINT, 32_SINT) The updated st_formats.c unit test checks that the formats affected by this change are all array formats in the equivalent Mesa format (if any). Mesa's array format definition is clear: the value stored is an array (increasing memory address) of values of the channel's type. It's also the only thing that makes sense for the RGB types, or very large types like RGBA64_FLOAT (A should not move to the low address because the cpu is BE). Acked-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Adam Jackson <ajax@redhat.com> Tested-by: Matt Turner <mattst88@gmail.com> (unit tests on BE) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-28 10:39:36 -07:00
Eric Anholt	0547fdd7ee	gallium: Drop a bit of dead code from the pack/unpack python. Nothing used this var. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-28 10:39:36 -07:00
Eric Anholt	309ef968cd	gallium: Drop the useless union wrapper on pack/unpack. Nothing accessed the .value field, just the .chan. Unwrap all the code from the union, for clarity (and 13k less generated code). Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-28 10:39:36 -07:00
Eric Anholt	174240c5e4	gallium: Skip generating the pack/unpack union if we don't use it. Shaves 30k off of the 1.6M .c file, and makes for less noise for me trying to understand how gallium formats actually work. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-28 10:39:36 -07:00
Eric Anholt	7c8cdee0b2	gallium: Fix mesa format name in unit test failure path. We clearly wanted the mesa format here. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-28 10:39:36 -07:00
Boris Brezillon	8709b865ce	panfrost: Reset the damage area on imported resources Reset the damage area in the resource_from_handle() path (as done in panfrost_resource_create()). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-28 17:50:44 +02:00
Boris Brezillon	938c5b0148	panfrost: Use ralloc() to allocate instructions to avoid leaking those objs Instructions attached to blocks are never explicitly freed. Let's use ralloc() to attach those objects to the compiler context so that they are automatically freed when the ctx object is freed. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-28 17:50:01 +02:00
Jose Fonseca	6e01575b68	scons: Make GCC builds stricter. Uses some of the same -Werror options used by Meson, as suggested by Michel Dänzer. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Michel Dänzer <michel@daenzer.net> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-08-28 15:52:07 +01:00
Jose Fonseca	6b2bc8f25e	util: Prevent strcasecmp macro redefinion. MinGW headers already define it. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-08-28 15:52:07 +01:00
Jose Fonseca	46f7b3662f	util: Prevent implicit declaration of function getenv. With MinGW cross compilation. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Acked-by: Eric Engestrom <eric@engestrom.ch>	2019-08-28 15:52:07 +01:00
Jose Fonseca	7029556398	glx: Fix incompatible function pointer types. I don't know how Meson didn't hit this issue, when it too already uses -Werror=incompatible-pointer-types Fixes: `3dd299c3d5` ("glx: Sync <GL/glxext.h> with Khronos") Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-08-28 15:52:07 +01:00
Vasily Khoruzhick	200859f45c	lima: fix texture descriptor issues Looks like initial RE was wrong and some fields have different purpose. I.e. there's no "disable_mipmap" field, it's actually part of another field that selects mipmap filtering. Also fix layout position. Reviewed-by: Qiang Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-08-28 00:28:38 +00:00
Kenneth Graunke	7e095a4fbf	iris: Drop swizzling parameter from s8_offset. This is always false on Gen8+, no need for dead code and parameters.	2019-08-27 17:11:32 -07:00
Kenneth Graunke	e18cd5452a	mesa: Fix _mesa_float_to_unorm() on 32-bit systems. This fixes the following CTS test on 32-bit systems: GTF-GL46.gtf30.GL3Tests.packed_depth_stencil.packed_depth_stencil_init It does glGetTexImage of a 16-bit SNORM image, requesting 32-bit UNORM data. In get_tex_rgba_uncompressed, we round trip through float to handle image transfer ops for clamping. _mesa_format_convert does: _mesa_float_to_unorm(0.571428597f, 32) which translated to: _mesa_lroundevenf(0.571428597f * 0xffffffffu) which produced different results on 64-bit and 32-bit systems: 64-bit: result = 0x92492500 32-bit: result = 0x80000000 This is because the size of "long" varies between the two systems, and 0x92492500 is too large to fit in a signed 32-bit integer. To fix this, we switch to the new _mesa_i64roundevenf function which always does the 64-bit operation. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104395 Fixes: `594fc0f859` ("mesa: Replace F_TO_I() with _mesa_lroundevenf().") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-08-27 23:57:02 +00:00
Kenneth Graunke	b59914e179	util: Add a _mesa_i64roundevenf() helper. This always returns a int64_t, translating to _mesa_lroundevenf on systems where long is 64-bit, and llrintf where "long long" is needed. Fixes: `594fc0f859` ("mesa: Replace F_TO_I() with _mesa_lroundevenf().") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-08-27 23:57:02 +00:00
Adam Jackson	163fc11f27	glx: Unset the direct_support bit for GLX_EXT_import_context GLX_EXT_import_context operates only on indirect contexts, a direct context cannot possibly support it. Without this change the extension will appear in the combined GLX extension string even if it is missing from the server string, indicating a lack of required server support.	2019-08-27 22:34:46 +00:00
Daniel Kolesa	1b9fce56c4	util: add auxv based PowerPC AltiVec/VSX detection At least on Linux, we can use the ELF auxiliary vector to detect the presence of AltiVec, VSX and other CPU features without having to go through handling SIGILL, which has various problems of its own. A similar thing is already being done for ARM to detect NEON. Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Daniel Kolesa <daniel@octaforge.org>	2019-08-27 14:55:37 -07:00
Kenneth Graunke	23f42f8dcf	intel/compiler: Use new Gen11 headerless RT writes for MRT cases Gen11 adds support for specifying the render target index and src0 alpha present bits in the extended message descriptor. Previously, we had to use a message header for this, requiring extra instructions to write the fields, and two registers of extra payload. Improves performance on my ICL 8x8 frequency locked to 700Mhz, on iris: GfxBench5 Manhattan 3.0: 2.13635% +/- 0.159859% (n=5) GfxBench5 Aztec Ruins: 1.57173% +/- 0.128749% (n=5) Synmark2 OglDeferred: 2.86914% +/- 0.191211% (n=10) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-27 14:20:07 -07:00
Kenneth Graunke	0d96484165	intel/compiler: Use generic SEND for Gen7+ FB writes This takes care of generate_fb_write/fire_fb_write/brw_fb_WRITE's stuff earlier in the visitor. It will also make it easier to generate SENDSC messages with indirect extended descriptors in a few patches. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-27 14:20:07 -07:00
Kenneth Graunke	86a63b1098	intel/compiler: Refactor FB write message control setup into a helper. This will be used by visitor code to convert directly to SEND in a bit. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-27 14:20:07 -07:00
Kenneth Graunke	b6fe25c7f5	intel/compiler: Handle bits 15:12 in brw_send_indirect_split_message() Annoyingly, these bits exist in some extended message descriptors (in particular render target writes), but they don't have any corresponding bits in the ISA encoding. So we can't use an immediate and have to fall back to an indirect extended descriptor. Thanks to Jason Ekstrand for reminding me that you can still set these bits via an indirect descriptor, even if they don't exist in the ISA. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-27 14:20:07 -07:00
Kenneth Graunke	c8c9c48684	intel/compiler: Fix src0/desc setter ordering src0 vstride and type overlap with bits of the extended descriptor. brw_set_desc() also sets the extended descriptor to 0. So by setting the descriptor, then setting src0, we were accidentally setting a bunch of extended descriptor bits unintentionally. When using this infrastructure for framebuffer writes (in a future patch), this ended up setting the extended descriptor bit 20, which is "Null Render Target" on Icelake, causing nothing to be written to the framebuffer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-08-27 14:20:07 -07:00
Marek Olšák	360cf3c4b0	radeonsi: fix scratch buffer WAVESIZE setting leading to corruption Cc: 19.2 19.1 <mesa-stable@lists.freedesktop.org> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:52:32 -04:00
Marek Olšák	f95a28d361	radeonsi: unbind blend/DSA/rasterizer state correctly in delete functions Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=111414 Fixes: `b758eed9c3` ("radeonsi: make sure that blend state != NULL and remove all NULL checking") Cc: 19.2 <mesa-stable@lists.freedesktop.org> Tested-by: Edmondo Tommasina <edmondo.tommasina@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:52:30 -04:00
Marek Olšák	40e5ac45ae	radeonsi: align scratch and ring buffer allocations for faster memory access Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:52:28 -04:00
Marek Olšák	d8f27552f4	radeonsi: consolidate determining VGPR_COMP_CNT for API VS Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	4dde40908f	radeonsi/gfx10: set PA_CL_VS_OUT_CNTL with CONTEXT_REG_RMW to fix edge flags We need two different values of the register, one for NGG and one for legacy, in order to fix edge flags for the legacy pipeline. Passing the ngg flag to emit_clip_regs would be too complicated, so CONTEXT_REG_RMW is used for partial register updates. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	1426acf9e7	radeonsi/gfx10: remove incorrect ngg/pos_writes_edgeflag variables It varies depending on si_shader_key::as_ngg. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	2e94cb6693	radeonsi: add PKT3_CONTEXT_REG_RMW Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	d9a453c747	winsys/amdgpu+radeon: process AMD_DEBUG in addition to R600_DEBUG Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	467df4b90a	radeonsi/gfx10: add AMD_DEBUG=nongg Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	6229b5a058	radeonsi/gfx10: finish up Navi14, add PCI ID Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	73bde2b029	radeonsi/gfx10: always use the legacy pipeline for streamout The best way to prevent GDS hangs is not to use GDS. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	f251fd7bf5	radeonsi/gfx10: don't initialize VGT_INSTANCE_STEP_RATE_0 Only gfx9 and older use it to get InstanceID in VGPR1. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	28f44ee533	radeonsi/gfx10: fix InstanceID for legacy VS+GS Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	e121d75de9	radeonsi/gfx10: add as_ngg variant for VS as ES to select Wave32/64 Legacy GS only works with Wave64. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	f34d023f1a	radeonsi/gfx10: create the GS copy shader if using legacy streamout Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	776f05a307	radeonsi/gfx10: fix the PRIMITIVES_GENERATED query if using legacy streamout Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	cab5b3861d	radeonsi/gfx10: fix tessellation for the legacy pipeline ported from PAL Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	a9bb566955	radeonsi: move some global shader cache flags to per-binary flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Marek Olšák	810846e157	radeonsi/gfx10: fix the legacy pipeline by storing as_ngg in the shader cache It could load an NGG shader when we want a legacy shader and vice versa. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>	2019-08-27 16:16:08 -04:00
Kenneth Graunke	6342d43ae9	iris: Delete dead prototype	2019-08-27 13:15:02 -07:00
Boris Brezillon	2734a4951e	Revert "panfrost: Free all block/instruction objects before leaving midgard_compile_shader_nir()" This reverts commit `5882e0def9`. This commit causes a segfault. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com>	2019-08-27 20:07:28 +02:00
Boris Brezillon	0142dcb990	panfrost: Make sure bundle.instructions[] contains valid instructions Add an assert() in schedule_bundle() to make sure all instruction pointers in bundle.instructions[] are valid. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-27 16:50:52 +02:00
Boris Brezillon	5882e0def9	panfrost: Free all block/instruction objects before leaving midgard_compile_shader_nir() Right now we're leaking all block and instruction objects allocated by the compiler. Let's clean things up before leaving midgard_compile_shader_nir(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-27 16:50:52 +02:00
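
Commit acce7d3460 in the log above groups genxml field names that differ only in spacing or hyphenation. The sketch below shows one way such a comparison could work; it is not the code from that commit. The helper names `normalize_field_name` and `group_fields` are hypothetical, and collapsing hyphens and whitespace runs into a single space is an assumed rule chosen to match the two examples in the commit message.

```python
import re


def normalize_field_name(name: str) -> str:
    """Hypothetical helper: map spacing/hyphen variants to one key."""
    # Assumption: hyphens and runs of whitespace are interchangeable,
    # so replace any mix of them with a single space.
    return re.sub(r"[\s\-]+", " ", name).strip()


def group_fields(names):
    """Group raw field names by their normalized key."""
    groups = {}
    for name in names:
        groups.setdefault(normalize_field_name(name), []).append(name)
    return groups


if __name__ == "__main__":
    fields = ["Per Thread", "Per-Thread", "Software Exception"]
    for key, variants in group_fields(fields).items():
        print(key, "<-", variants)
```

Keeping the original spellings in each group means a generator could still emit the name that a given hardware generation's XML used.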


115447 commits
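
Commit e18cd5452a in the log above reports that `_mesa_float_to_unorm(0.571428597f, 32)` should yield `0x92492500`, a value too large for a signed 32-bit `long`. The sketch below reproduces only that arithmetic in Python as an illustration; it is not Mesa's implementation. `to_f32` and `float_to_unorm32` are hypothetical names, single-precision rounding is emulated with `struct`, and Python's `round()` (round half to even) stands in for the C rounding helper. What a 32-bit system returns for the out-of-range conversion is not modeled.

```python
import struct

INT32_MAX = 2**31 - 1
INT64_MAX = 2**63 - 1


def to_f32(x: float) -> float:
    """Round a Python float (double) to the nearest float32 value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def float_to_unorm32(x: float) -> int:
    """Hypothetical model of scaling a float in [0, 1] to 32-bit UNORM."""
    # 0xffffffff is not representable in float32; it rounds up to 2**32.
    scale = to_f32(float(0xFFFFFFFF))
    # Single-precision multiply, emulated by rounding the product to float32.
    product = to_f32(to_f32(x) * scale)
    # Python's round() rounds half to even, like a round-even helper.
    return round(product)


if __name__ == "__main__":
    result = float_to_unorm32(0.571428597)
    print(hex(result))
    print("fits in signed 32-bit:", result <= INT32_MAX)
    print("fits in signed 64-bit:", result <= INT64_MAX)
```

The product fits comfortably in a 64-bit integer but not in a signed 32-bit one, which is consistent with the commit's switch to a helper that always performs the 64-bit conversion.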