fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 13:48:06 +02:00

Author	SHA1	Message	Date
Ilia Mirkin	ae2cb72804	nv50: disable compute It causes more trouble than it's worth. Now vl tries to create compute shaders without all the proper checking. Since there's really no (current) way to use compute on nv50, just mark it disabled. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109742 Fixes: `f6ac0b5d71` ("gallium/auxiliary/vl: Add compute shader to support video compositor render") Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-02-22 09:42:41 -05:00
Lionel Landwerlin	1d626fc028	intel: fix urb size for CFL GT1 Same 192Kb amount as SKL/KBL GT1 applies. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> Fixes: `de7ed0ba55` ("i965/CFL: Add PCI Ids for Coffee Lake.")	2019-02-22 11:53:49 +00:00
Samuel Iglesias Gonsálvez	bd2c5a8203	isl: the display engine requires 64B alignment for linear surfaces v2: Add PRM quote (Lionel) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2019-02-22 11:45:45 +00:00
Gert Wollny	2ee197d6e8	virgl: Enable mixed color FBO attachemnets only when the host supports it Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Elie Tournier <elie.tournier@collabora.com>	2019-02-22 10:44:08 +01:00
Mauro Rossi	338dacc341	android: intel/isl: remove redundant building rules Fixes the following building error: including ./external/mesa/Android.mk ... build/core/base_rules.mk:183: * external/mesa/src/intel: MODULE.TARGET.STATIC_LIBRARIES.libmesa_isl_tiled_memcpy already defined by external/mesa/src/intel. make: * [build/core/ninja.mk:164: out/build-android_x86_64.ninja] Error 1 ISL_TILED_MEMCPY_FILES is isl/isl_tiled_memcpy_normal.c and that source file includes isl_tiled_memcpy.c source Fixes: `96bb328` ("iris: add Android build") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com>	2019-02-22 07:56:11 +02:00
Kenneth Graunke	b21de090d6	Revert "iris: Enable auxiliary buffer support" This reverts commit `cd0ced49e7`. It breaks glxgears rendering.	2019-02-21 15:50:46 -08:00
Kenneth Graunke	e2cb0c5e0e	iris: Enable -msse2 and -mstackrealign This is needed for gen_clflush.h intrinsics to work on 32-bit builds. i965 and anv both set these, and iris needs to as well. Tested-by: Mark Janes <mark.a.janes@intel.com>	2019-02-21 14:51:15 -08:00
Francisco Jerez	7272fe9c08	intel/fs: Rely on undocumented unrestricted regioning for 32x16-bit integer multiply. Even though the hardware spec claims that any "integer DWord multiply" operation is affected by the regioning restrictions of CHV/BXT/GLK, this is inconsistent with the behavior of the simulator and with empirical evidence -- Return false from has_dst_aligned_region_restriction() for such instructions as a micro-optimization. Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-21 14:07:25 -08:00
Francisco Jerez	e03be78252	intel/fs: Implement extended strides greater than 4 for IR source regions. Strides up to 32B can be implemented for the source regions of most instructions by leveraging either the vertical or the horizontal stride of the hardware Align1 region. The main motivation for this is that currently the lower_integer_multiplication() pass will happily double the stride of one of the 32-bit sources, which can blow up if the stride of the original source was already the maximum value allowed by the hardware. An alternative would be to use the regioning legalization pass in order to lower such strides into the composition of multiple legal strides, but that would be somewhat less efficient. This showed up as a regression from my commit `cbea91eb57` in Vulkan 1.1 CTS tests on CHV/BXT platforms, however it was really a pre-existing problem that had affected conformance on other platforms without native support for integer multiplication. CHV/BXT were getting around it because the code I removed in that commit had the "fortunate" side effect of emitting narrower regions that didn't hit the hardware stride limit after lowering. Beyond fixing the regression this fixes ~90 additional Vulkan 1.1 subgroup CTS tests on ICL (that's why this patch is marked for inclusion in mesa-stable even though the original regressing patch was not). According to Jason, a nearly equivalent change had been committed previously as `e8c9e65185` and then (mistakenly?) reverted as `a31d038208`. Cc: mesa-stable@lists.freedesktop.org Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109328 Reported-by: Mark Janes <mark.a.janes@intel.com> Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-21 14:07:25 -08:00
Francisco Jerez	7f9f6263c1	intel/fs: Cap dst-aligned region stride to maximum representable hstride value. This is required in combination with the following commit, because otherwise if a source region with an extended 8+ stride is present in the instruction (which we're about to declare legal) we'll end up emitting code that attempts to write to such a region, even though strides greater than four are still illegal for the destination. Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-21 14:07:25 -08:00
Francisco Jerez	e2f475ddff	intel/fs: Lower integer multiply correctly when destination stride equals 4. Because the "low" temporary needs to be accessed with word type and twice the original stride, attempting to preserve the alignment of the original destination can potentially lead to instructions with illegal destination stride greater than four. Because the CHV/BXT alignment restrictions are now being enforced by the regioning lowering pass run after lower_integer_multiplication(), there is no real need to preserve the original strides anymore. Note that this bug can be reproduced on stable branches, but back-porting would be non-trivial, because the fix relies on the regioning lowering pass recently introduced. Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-21 14:07:25 -08:00
Francisco Jerez	c3c27762f7	intel/fs: Exclude control sources from execution type and region alignment calculations. Currently the execution type calculation will return a bogus value in cases like: mov_indirect(8) vgrf0:w, vgrf1:w, vgrf2:ud, 32u Which will be considered to have a 32-bit integer execution type even though the actual indirect move operation will be carried out with 16-bit precision. Similarly there's no need to apply the CHV/BXT double-precision region alignment restrictions to such control sources, since they aren't directly involved in the double-precision arithmetic operations emitted by these virtual instructions. Applying the CHV/BXT restrictions to control sources was expected to be harmless if mildly inefficient, but unfortunately it exposed problems at codegen level for virtual instructions (namely the SHUFFLE instruction used for the Vulkan 1.1 subgroup feature) that weren't prepared to accept control sources with an arbitrary strided region. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=109328 Reported-by: Mark Janes <mark.a.janes@intel.com> Fixes: `efa4e4bc5f` "intel/fs: Introduce regioning lowering pass." Tested-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-21 14:07:25 -08:00
Timothy Arceri	d9e08e753b	nir: clone instruction set rather than removing individual entries This reduces the time spent in nir_opt_cse() by almost a half. The massif tool from callgrind reported no change in peak memory use with the large doliphin uber shaders I used for testing. Reviewed-by: Thomas Helland<thomashelland90@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-22 08:36:36 +11:00
Jordan Justen	cd0ac3a6af	genxml: Remove extra space in gen4/45/5 field name Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-21 13:17:10 -08:00
Jordan Justen	a9b0b72a78	genxml/gen_bits_header.py: Use regex to strip no alphanum chars Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-02-21 13:15:59 -08:00
Kenneth Graunke	cd0ced49e7	iris: Enable auxiliary buffer support This currently regresses KHR-GL4x.compute_shader.resource-texture, but that's a pre-existing bug (https://bugs.freedesktop.org/109113) which should be fixed up once we have fast clear support.	2019-02-21 10:26:12 -08:00
Rafael Antognolli	db81445837	iris: Flag ALL_DIRTY_BINDINGS on aux state change. If we change the aux state for a given resource, we need to re-emit the binding table pointers for any stage that has such resource bound. Since we don't track that, flag IRIS_ALL_DIRTY_BINDINGS and emit all of them.	2019-02-21 10:26:12 -08:00
Rafael Antognolli	95589652a1	iris: Skip resolve if there's no context. If iris_resource_get_handle() gets called without a context, we can't resolve the resource. Hopefully it shouldn't be compressed anyway, so let's just add an assert to ensure it's correct.	2019-02-21 10:26:12 -08:00
Rafael Antognolli	36138bb7fc	iris/clear: Pass on render_condition_enabled.	2019-02-21 10:26:12 -08:00
Rafael Antognolli	8190165d13	iris: Avoid leaking if we fail to allocate the aux buffer. Otherwise we could leak the aux state map or the aux BO.	2019-02-21 10:26:12 -08:00
Kenneth Graunke	7da53d7188	iris: Only resolve compute resources for compute shaders	2019-02-21 10:26:12 -08:00
Kenneth Graunke	95a36bd55c	iris: Fix aux usage in render resolve code	2019-02-21 10:26:12 -08:00
Rafael Antognolli	4f191feb0c	iris: Pin HiZ buffers when rendering.	2019-02-21 10:26:12 -08:00
Rafael Antognolli	dfd54f9954	iris: Flush before hiz_exec.	2019-02-21 10:26:12 -08:00
Kenneth Graunke	f3f7d45a63	iris: Allow disabling aux via INTEL_DEBUG options	2019-02-21 10:26:12 -08:00
Kenneth Graunke	4634b754f4	iris: do flush for buffers still	2019-02-21 10:26:12 -08:00
Kenneth Graunke	15822f33ad	iris: make surface states for CCS_D too CCS_E can fall back to CCS_D with incompatible format views CCS_D is pretty useless without fast clears and we may as well use NONE, but we're surely going to hook those up at some point, so may as well just go ahead and do it now...	2019-02-21 10:26:12 -08:00
Rafael Antognolli	689b590069	iris: Skip msaa16 on gen < 9. Also needed to add gen information to KEY_INIT.	2019-02-21 10:26:12 -08:00
Kenneth Graunke	fd2038b22a	iris: Set program key fields for MCS	2019-02-21 10:26:12 -08:00
Kenneth Graunke	92c310fd3f	iris: don't use hiz for MSAA buffers	2019-02-21 10:26:12 -08:00
Kenneth Graunke	2cddc953cd	iris: some initial HiZ bits	2019-02-21 10:26:12 -08:00
Kenneth Graunke	9b1126c990	iris: disable aux for external things	2019-02-21 10:26:12 -08:00
Kenneth Graunke	45f4dab62b	iris: Resolves for compute	2019-02-21 10:26:12 -08:00
Kenneth Graunke	ecc897b8ad	iris: consider framebuffer parameter for aux usages	2019-02-21 10:26:12 -08:00
Kenneth Graunke	b77d2dc71b	iris: Make blit code use actual aux usages	2019-02-21 10:26:12 -08:00
Kenneth Graunke	bfc76d3525	iris: store modifier info in res	2019-02-21 10:26:12 -08:00
Kenneth Graunke	56f1fe3eac	iris: pin the buffers	2019-02-21 10:26:12 -08:00
Kenneth Graunke	f8aa9aa353	iris: resolve before transfer maps	2019-02-21 10:26:12 -08:00
Kenneth Graunke	c53a67d469	iris: be sure to skip buffers in resolve code Buffers don't have ISL surfaces, and this can get us into trouble.	2019-02-21 10:26:12 -08:00
Kenneth Graunke	5eb75345b8	iris: try to fix copyimage vs copybuffers	2019-02-21 10:26:12 -08:00
Kenneth Graunke	d8f3bc1c4c	iris: actually use the multiple surf states for aux modes	2019-02-21 10:26:12 -08:00
Kenneth Graunke	3c979b0e6d	iris: add some draw resolve hooks	2019-02-21 10:26:12 -08:00
Kenneth Graunke	53c484ba8a	iris: blorp using resolve hooks	2019-02-21 10:26:12 -08:00
Kenneth Graunke	77a1070d36	iris: Initial import of resolve code	2019-02-21 10:26:12 -08:00
Kenneth Graunke	f879349398	iris: create aux surface if needed	2019-02-21 10:26:12 -08:00
Kenneth Graunke	3efd5299af	iris: Fill out SURFACE_STATE entries for each possible aux usage	2019-02-21 10:26:12 -08:00
Kenneth Graunke	3cfc6a207b	iris: Fill out res->aux.possible_usages	2019-02-21 10:26:12 -08:00
Kenneth Graunke	a7bc4d6074	iris: Add iris_resource fields for aux surfaces But without fast clears or HiZ per-level tracking just yet.	2019-02-21 10:26:12 -08:00
Jordan Justen	d0996d5fab	iris: Emit default L3 config for the render pipeline Signed-off-by: Jordan Justen <jordan.l.justen@intel.com>	2019-02-21 10:26:12 -08:00
Kenneth Graunke	51ddc40084	iris: Always emit at least one BLEND_STATE	2019-02-21 10:26:12 -08:00

1 2 3 4 5 ...

108504 commits