fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-27 01:28:12 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	60f8224171	radv/gfx10: fix storing/loading NGG stream outputs for VS and TES The LDS storage allocated for stream outputs is 4 * N, where N is the number of outputs. So, we have to store/load with N as index and not with the output location as index. This doesn't fix anything known but it should fix out-of-bounds access and it also reduces the number of outputs written to the LDS storage. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-02 18:09:27 +02:00
Samuel Pitoiset	56e1b1ff0c	radv/gfx10: add missing counter buffer to the BO list The buffer isn't necessarily used before. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-02 18:09:25 +02:00
Samuel Pitoiset	683c5e27c7	radv/gfx10: add radv_device::use_ngg Trivial. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-10-02 18:06:01 +02:00
Gert Wollny	c5da8230de	etnaviv: enable triangle strips only when the hardware supports it Some hardware has a bug with triangle strips and it is signalled by the flag BUG_FIXED8 whether this bug has been fixed. So only enable triangle strips when this flag is set. Thanks: Jonathan Marek and Christian Gmeiner for the pointers v2: Add TODO to indicate that the handling should be refined (Jonathan & Christian) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com>	2019-10-02 07:34:36 +00:00
Dylan Baker	d855e19b87	meson: remove -DGALLIUM_SOFTPIPE from st/osmesa It's unused here, and undefined in scons. It is used in targets/osmesa, but it's properly defined there already. Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-10-01 12:34:27 -07:00
Lionel Landwerlin	2208d79dde	mesa: don't forget to clear _Layer field on texture unit On the Android Antutu benchmark we ran into an assert in ISL where the (base layer + num layers) > total layers. It turns out the core of mesa forgot to clear the _Layer variable, potentially leaving an inconsistent value. v2: Pull setting u->_Layer out of the conditional blocks (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-10-01 21:49:13 +03:00
Robin Murphy	563f8974d8	egl/gbm: Fix config validation In converting to shift/size-based validation, we lost a condition from the ARGB/XRGB equivalence check, which left it working one way round but not the other, and broke applications like glmark2-es2-drm on some platforms. Restore the equivalent check that both configs actually have an alpha channel before considering a mismatch. Fixes: `7b4ed2b513` ("egl: Convert configs to use shifts and sizes instead of masks") Signed-off-by: Robin Murphy <robin.murphy@arm.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-10-01 14:45:15 +01:00
Ken Mays	4943c89d6d	haiku: fix Mesa build 1. The hgl.c file is a read-only file versus read-write. Ref: src/gallium/state_trackers/hgl/hgl.c 2. I've included the Haiku-specific patches I used to get a successful build of Mesa 19.1.7 on Haiku using the meson/ninja build procedure. Shows "[764/764] linking target ... libswpipe.so" at build completion. v2: Remove autotools files (Eric) v3: Update the patch Reported-by: Ken Mays <kmays2000@gmail.com> Tested-by: Ken Mays <kmays2000@gmail.com> CC: mesa-stable@lists.freedesktop.org Reviewed-by: Alexander von Gluck IV <kallisti5@unixzen.com>	2019-10-01 10:31:02 +00:00
Kevin Strasser	641320ce02	egl: Fix implicit declaration of ffs Found when building for Android in C99 mode. Include bitscan.h to ensure ffs is available. Fixes: `7b4ed2b5` ("egl: Convert configs to use shifts and sizes instead of masks") Signed-off-by: Kevin Strasser <kevin.strasser@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-30 14:33:43 -07:00
Rafael Antognolli	b9994cb8d5	intel/tools: Fix aubinator usage of rb_tree. The order of comparison has changed, so we need to invert the logic of "insert_left" when using rb_tree_insert_at(). Fixes: `dae33052db` (util/rb_tree: Reverse the order of comparison functions). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-09-30 13:43:23 -07:00
Caio Marcelo de Oliveira Filho	54f1de1c5c	i965: Enable EXT_demote_to_helper_invocation Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-09-30 12:44:30 -07:00
Caio Marcelo de Oliveira Filho	a3776df7b1	iris: Enable EXT_demote_to_helper_invocation Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-09-30 12:44:30 -07:00
Caio Marcelo de Oliveira Filho	008de52305	gallium: Add PIPE_CAP_DEMOTE_TO_HELPER_INVOCATION To enable EXT_demote_to_helper_invocation: This extension adds a "demote" keyword that is similar to "discard" but only suppresses subsequent writes and outputs to the framebuffer, and does not terminate the execution of the invocation. For the remainder of the execution, the invocation is "demoted" to act like a helper invocation. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-09-30 12:44:30 -07:00
Caio Marcelo de Oliveira Filho	61fa4b5707	glsl: Add helperInvocationEXT() builtin From EXT_demote_to_helper_invocation, implemented with the existing nir_intrinsic_is_helper_invocation. Such builtin is necessary when using `demote` because we can't redefine the value of gl_HelperInvocation (since it is an input variable). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-09-30 12:44:30 -07:00
Caio Marcelo de Oliveira Filho	3439956377	glsl: Parse `demote` statement When the EXT_demote_to_helper_invocation extension is enabled, `demote` is treated as a keyword, and produces an ir_demote. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-09-30 12:44:30 -07:00
Caio Marcelo de Oliveira Filho	af1a6f0f77	glsl: Add ir_demote To represent the new `demote` keyword when using EXT_demote_to_helper_invocation extension. Most of the changes are to include it in the visitors. Demote is not considered a control flow, so also include an empty visit member function in ir_control_flow_visitor. Only NIR actually supports `demote`, so assert the translations for TGSI and Mesa's gl_program -- since the demote is not expected to appear for those. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-09-30 12:44:30 -07:00
Caio Marcelo de Oliveira Filho	c81b912eb7	mesa: Extension boilerplate for EXT_demote_to_helper_invocation Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-09-30 12:44:30 -07:00
Kenneth Graunke	309924c3c9	iris: Fix iris_rebind_buffer() for VBOs with non-zero offsets. We can't just check for the BO base address, we need to check for the full address including any offset we may have applied. When updating the address, we need to include the offset again. Fixes: `5ad0c88dbe` ("iris: Replace buffer backing storage and rebind to update addresses.")	2019-09-30 12:41:03 -07:00
Marek Olšák	a1545af079	ac/nir: fix GLSL imageSamples() Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-30 14:21:42 -04:00
Marek Olšák	0cc233e3dc	ac: add ac_build_image_get_sample_count from radeonsi Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-30 14:21:42 -04:00
Marek Olšák	39e638c14e	ac/surface: don't allocate FMASK if there is no graphics Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2019-09-30 14:21:42 -04:00
Marek Olšák	f704fb7f0b	tgsi_to_nir: handle PIPE_FORMAT_NONE in image opcodes radeonsi doesn't use the format and internal shaders don't set it. Reviewed-By: Timur Kristóf <timur.kristof@gmail.com>	2019-09-30 14:20:48 -04:00
Dylan Baker	3b265f61f5	meson: gallium media state trackers require libdrm with x11 v2: - update copyright year in all changed files - rebase on master Cc: 19.1 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Eric Engestrom <eric.engestrom@intel.com>	2019-09-30 18:06:56 +00:00
Kenneth Graunke	a0a93763fb	iris: Disable CCS_E for 32-bit floating point textures. A while back, Michael Larabel noticed that Paraview's Wavelet Volume case runs significantly slower on iris than i965. It turns out this is because we enable CCS_E for 32-bit floating point formats, while i965 disables it, with an oblique comment saying that we benchmarked it (on what exactly?) and determined that it was a loss. Paraview uses both R32_FLOAT and R32G32B32A32_FLOAT, and I observed large framerate drops when enabling CCS_E for either format. However, several other benchmarks (Aztec Ruins, many Synmark cases) use 16-bit floating point formats, with no apparent ill effects. So, disable compression for 32-bit float formats for now, but leave it enabled for 16-bit float formats as they seem to be working fine. Improves performance in Paraview's Wavelet Volume test by 62% on a Skylake GT4e. Fixes: `3cfc6a207b` ("iris: Fill out res->aux.possible_usages")	2019-09-30 10:44:52 -07:00
Marek Olšák	4a0d2e2880	ac: reorder and print all radeon_info fields Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:21 -04:00
Marek Olšák	e8b1538587	ac: set the number of SDPs same as the number of TCCs Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:21 -04:00
Marek Olšák	b7c2f7c5a6	ac: fix num_good_cu_per_sh for harvested chips Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Marek Olšák	235ebe9163	radeonsi/gfx10: fix corruption for chips with harvested TCCs Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Marek Olšák	8cbe83445b	ac: add radeon_info::tcc_harvested Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Marek Olšák	7d97013294	ac: fix incorrect vram_size reported by the kernel Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Marek Olšák	3c0938bece	radeonsi/gfx10: fix L2 cache rinse programming Cc: 19.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-09-30 13:36:20 -04:00
Eric Engestrom	0efc253f02	etnaviv: fix bitmask typo Fixes: `d92689c46f` ("etnaviv: nir: add native integers (HALTI2+)") Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-09-30 17:54:33 +01:00
Adam Jackson	855dc17fcf	glx: Log the filename of the drm device if we fail to open it Helps point the user to the specific device that's having issues, since you're increasingly likely to have more than one. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/issues/107 Reviewed-by: Eric Anholt <eric@anholt.net>	2019-09-30 15:30:16 +00:00
Alyssa Rosenzweig	7be00b2a06	pan/midgard: Allow scheduling conditions with constants Now that we have constant adjustment logic abstracted, we can do this safely. Along with the csel inversion patch, this allows many more common csel ops to inline their condition in the bundle. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	c20063aa4a	pan/midgard: Add csel invert optimization Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	f0f4b39548	pan/midgard: Add mir_flip helper Useful for various operations on both commutative and anticommutative ops. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	10037ce523	pan/midgard: Tightly pack 32-bit constants If we can reuse constant slots from other instructions, we would like to do so to include more instructions per bundle. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	a3ca283bc1	pan/midgard: Allow writeout to see into the future If an instruction could be scheduled to vmul to satisfy the writeout conditions, let's do that and save an instruction+cycle per fragment shader. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	12a70ccd9e	pan/midgard: Allow 6 instructions per bundle We never had a scheduler good enough to hit this case before! :) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	34ff50cadd	pan/midgard: Only one conditional per bundle allowed There's no r32 to save ya after you use up r31 :) Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	2715bd02ee	pan/midgard: Schedule to smul/sadd Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	57bac68fff	pan/midgard: Extend choose_instruction for scalar units Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	e9edae3ecb	pan/midgard: Don't double check SCALAR units Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	d3b3daa9d3	pan/midgard: Use new scheduler We still emit in-order but we switch to using the bundles created from the new scheduler, which will allow greater flexibility and room for out-of-order optimization. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	1409af9fc7	pan/midgard: Add distance metric to choose_instruction We require chosen instructions to be "close", to avoid ballooning register pressure. This is a kludge that will go away once we have proper liveness tracking in the scheduler, but for now it prevents a lot of needless spilling. v2: Lower threshold to 6 (from 8). Schedule is hurt, but a few shaders that spilled excessively are fixed. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Derp	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	e9571b53e1	pan/midgard: Add mir_choose_alu helper Based on a given unit. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	8462e82467	pan/midgard: Implement load/store pairing We can bundle two load/store together. This eliminates the need for explicit load/store pairing in a prepass, as well. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	7cf4932410	pan/midgard: Extend csel_swizzle to branches Conditions for branches don't have a swizzle explicitly in the emitted binary, but they do implicitly get swizzled in whatever instruction wrote r31, so we need to handle that. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	c9ce5a92a0	pan/midgard: Add helpers for scheduling conditionals Conditional instructions (csel and conditional branches) require their condition to be written to a special condition pipeline register (r31.w for scalar, r31.xyzw for vector). However, pipeline registers are live only for the duration of a single bundle. As such, the logic to schedule conditionals correct is surprisingly complex. Essentially, we see if we could stuff the conditional within the same bundle as the csel/branch without breaking anything; if we can, we do that. If we can't, we add a dummy move to make room. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
Alyssa Rosenzweig	6f92288e85	pan/midgard: Implement predicate->unit This allows ALUs to select for each unit of the bundle separately. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-09-30 08:40:13 -04:00
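The indexing change described in commit 60f8224171 (LDS storage for stream outputs is 4 * N for N outputs, so stores and loads should use a running output index rather than the output location) can be illustrated with a minimal sketch. This is not the radv code: `num_outputs`, `store_outputs`, `MAX_OUTPUT_LOCATIONS`, the 64-bit output mask, and the flat `values` layout are all hypothetical names and assumptions chosen for illustration.

```c
#include <assert.h>
#include <stdint.h>

/* Assumption for this sketch: at most 64 output locations, tracked in a bitmask. */
#define MAX_OUTPUT_LOCATIONS 64

/* Hypothetical helper: count the enabled outputs (N) in the mask. */
static unsigned num_outputs(uint64_t output_mask)
{
    unsigned n = 0;
    for (; output_mask; output_mask &= output_mask - 1)
        n++;
    return n;
}

/*
 * Hypothetical helper: copy every enabled output (4 dwords each) into
 * `lds`, packing them at 4 * slot, where `slot` counts enabled outputs
 * seen so far. `values` holds 4 floats per location, indexed by location.
 * Returns the number of dwords written, which is 4 * N.
 *
 * Indexing `lds` with 4 * loc instead would run past a 4 * N buffer as
 * soon as the enabled locations are sparse.
 */
static unsigned store_outputs(uint64_t output_mask, const float *values,
                              float *lds, unsigned lds_dwords)
{
    unsigned slot = 0;

    for (unsigned loc = 0; loc < MAX_OUTPUT_LOCATIONS; loc++) {
        if (!(output_mask & (UINT64_C(1) << loc)))
            continue;

        /* The compact slot always fits in a buffer sized 4 * N. */
        assert(4 * slot + 4 <= lds_dwords);

        for (unsigned c = 0; c < 4; c++)
            lds[4 * slot + c] = values[4 * loc + c];
        slot++;
    }
    return 4 * slot;
}
```

With outputs enabled at locations 0 and 31, the sketch writes 8 dwords in total, and the data for location 31 lands at dword offset 4 rather than 124.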

106950 commits