fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-28 09:58:22 +02:00

Author	SHA1	Message	Date
Alyssa Rosenzweig	871c02b12e	panfrost: Invoke compute shader according to grid info We already have helpers for packing invocations (due to its role in instanced vertex shaders), so we can reuse this drop in for compute shaders. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	748ccbc808	panfrost: Explain and include compute FBD Squint at it hard enough and you realize it's the beginning of an SFBD... I guess... A compute shader with register spilling would be able to confirm this, but we would expect to see the first field | 1 and an address splattered later, setting up TLS. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	3113be3127	panfrost: Unify-driven cleanup Again, now that stages are unified some logic goes away. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	ac6aa93f9e	panfrost: Unify ctx->vs and ctx->fs It's a little verbose, but this way we can support other shader stages without too much contortion. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:03 -07:00
Alyssa Rosenzweig	4b93152c29	panfrost: Flesh out launch_grid stub It's still incomplete, but we're able to hook into launch_grid to create a stub COMPUTE job. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:02 -07:00
Alyssa Rosenzweig	cd1be4605c	panfrost: Cleanup via payload unification Since these are now indexable, quite a bit of code cleans up. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:02 -07:00
Alyssa Rosenzweig	0da52015a1	panfrost: Unify payload_vertex/payload_tiler Rather than disparate variables, let's use an array of payloads indexed by the shader stage. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:02 -07:00
Alyssa Rosenzweig	902115f94f	panfrost: Only wallpaper if we drew something last_tiler.gpu may be NULL at flush time despite no clear and existing jobs -- if we executed a compute-only workload. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:02 -07:00
Alyssa Rosenzweig	2d86828243	panfrost: Adjust shader CAPs to expose dEQP compute Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	39fe9f5e2f	panfrost: Expose NIR as our PIPE_SHADER_CAP_SUPPORTED_IRS We could expose TGSI as well -- we pipe it through tgsi_to_nir for Gallium-internal shaders anyway -- but we'd rather not. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	1697760e05	panfrost: Copy freedreno's panfrost_get_compute_param Values reported here aren't remotely correct, but it's a start to just get the entrypoint stubbed out. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	c8bc664447	panfrost: Expose COMPUTE-related caps for GLES3.1 Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	5a8b83ca0b	panfrost: Stub out launch_grid Just dumps some information about the invocation for later debug. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	a8fc40aaf5	panfrost: Stub out compute CSO Doesn't do anything, just gets the functions there. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:23:01 -07:00
Alyssa Rosenzweig	e913986868	panfrost: Implement gl_FrontFacing Interestingly, this requires no compiler changes. It's just exposed as a special varying. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-08-01 16:15:03 -07:00
Timothy Arceri	2afedfaf9a	iris: add support for gl_ClipVertex in tess eval shaders Required for OpenGL compat support. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-01 16:12:37 -07:00
Timothy Arceri	00b5bf2d72	iris: add support for gl_ClipVertex in geometry shaders This will enable us to support the OpenGL compat profile. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-08-01 16:12:27 -07:00
Jason Ekstrand	70dc017aec	nir: Stop whacking gl_FrontFacing to a system value We have a cap bit for gallium and a GLSL compiler flag to control this. Just trust what GLSL gives us and stop forcing it. In order for this to be safe, we have to advertise another cap in some of the gallium drivers. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-01 21:59:37 +00:00
Alyssa Rosenzweig	4e736b88f3	panfrost: Implement panfrost_set_shader_buffers callback Just copy over the passed SSBO for now. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-01 14:32:08 -07:00
Alyssa Rosenzweig	898a18ea89	gallium/util: Add util_set_shader_buffers_mask helper Conceptually follows util_set_vertex_buffers_mask but for SSBOs. v2: Fix missing ~ when clearing mask. Adjust mask behaviour to match freedreno/v3d when buffer == NULL. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-01 14:31:56 -07:00
Jonathan Marek	3e33173200	kmsro: move entry points from etnaviv to kmsro These drivers are kmsro drivers so they should be part of the kmsro #if This fixes missing imx_drm driver when building with only freedreno+kmsro Signed-off-by: Jonathan Marek <jonathan@marek.ca> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-08-01 16:31:51 -04:00
Gert Wollny	9de00e74fe	virgl: Enable depth_clamp by lowering if the host is new enough. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 05:58:53 +00:00
Gert Wollny	b2e92c45ce	gallium: Make PIPE_CAP_DEPTH_CLIP_DISABLE a tri-state value and use it Use value "2" to signal that lowering is needed and supported and enable it accordingly. v2: - Note in CAP description that this lowering currently requires TGSI - use "true" instead of GL_TRUE (both Erik) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-08-01 05:58:53 +00:00
Gert Wollny	7fb47195d8	Revert "softpipe: Don't draw when rasterizer_discard is set" This was too aggressive and breaks TF (Ilia) This reverts commit `4ee638cd78`. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2019-08-01 05:57:41 +00:00
Kenneth Graunke	b61f17d362	iris: Skip emitting 3DSTATE_INDEX_BUFFER if possible We were emitting 3DSTATE_INDEX_BUFFER on every indexed draw, even if back-to-back draws referred to the same index buffer. This improves drawoverhead scores in the DrawElements cases by about 10%, by giving us even more minimal batches.	2019-07-31 15:14:10 -07:00
Mike Blumenkrantz	8af1990ad7	st/dri: simplify dri_get_egl_image by reusing dri2_format_table this makes dri2_get_mapping_by_fourcc accessible from dri_helpers.h and does a direct lookup on the fourcc id to match the pipe format v2 (Ken): Allow map to be NULL, use img->texture->format. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net>	2019-07-31 15:11:15 -07:00
Erico Nunes	82bf5a8aac	lima: enable lower_bitops in ppir The mali pp doesn't support integers and some nir_algebraic optimizations may result in ops that are not easily lowerable to floats, so disable optimizations resulting in bitops. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-07-31 23:06:26 +02:00
Erico Nunes	b3676a6548	nir/algebraic: rename lower_bitshift to lower_bitops Optimizations that insert bitshift or bitwise operations should not be applied on GPUs that don't support integer operations. The .lower_bitshift could be used to control the bitshift related ones, but there was also one bitwise optimization uncovered. Since only lima and freedreno use this option and the use case is that no bit operations are wanted, let's rename it to .lower_bitops and use it to control all bitops related optimizations. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Jonathan Marek <jonathan@marek.ca>	2019-07-31 23:06:04 +02:00
Erico Nunes	99c956fb47	lima/ppir: lower fdot in nir_opt_algebraic Now that we have fsum in nir, we can move fdot lowering there. This helps reduce ppir complexity and enables the lowered ops to be part of other nir optimizations in the optimization loop. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-07-31 21:35:58 +02:00
Erico Nunes	7f8ff686b7	lima/ppir: refactor texture code to simplify scheduler The 'varying fetch' pp instruction deals only with coordinates, and 'texture fetch' deals only with the sampler index. Previously it was not possible to clearly map ppir_op_load_coords and ppir_op_load_texture to pp instructions as the source coordinates were kept in the ppir_op_load_texture node, making this harder to maintain. The refactor is made with the attempt to clearly map ppir_op_load_coords to the 'varying fetch' and ppir_op_load_texture to the 'texture fetch'. The coordinates are still temporarily kept in the ppir_op_load_texture node as nir has both sampler and coordinates in a single instruction and it is only possible to output one ppir node during emit. But now after lowering, the sources are transferred to the (always) created ppir_op_load_coords node, and it should be possible to directly map them to their pp instructions from there onwards. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-07-31 21:22:41 +02:00
Erico Nunes	d2901de09e	lima/ppir: lower texture projection Lower texture projection in ppir using nir_lower_tex and nir_lower_tex. This will insert a mul with the coordinate division before the load varying. Even though the lima pp supports projection in the load varying instruction while loading the coordinates (from a register or a varying), it requires that both the coordinates and projector be components in a single register. nir currently handles them in separate ssa, and attempting to merge them manually may end up in worse code than just doing the coordinate division manually. So for now let's just lower the projection to add support for it in lima. In the future, an optimization pass may be implemented in lima to ensure that both coords and projector come in the same register, then this lowering may be disabled and in this case lima may use the built-in projection and save the mul instruction from lowering. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com>	2019-07-31 21:22:41 +02:00
Kenneth Graunke	3f9012839e	Revert "st/dri: simplify dri_get_egl_image by reusing dri2_format_table" This reverts commit `c47af8b95f`. It causes dEQP-EGL regressions. (I think there is an easy fix, but we'll have it go through review again.)	2019-07-31 11:06:32 -07:00
Alyssa Rosenzweig	3e47a1181b	panfrost: Add MALI_SAMP_NORM_COORDS flag Corresponds to the normalized coordinates? flag on images in OpenCL and evidently also shows up in GL, so let's wire it in. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:56:11 -07:00
Alyssa Rosenzweig	cf6cad3922	panfrost: Simplify filter_mode definition It's just a bit field containing some flags; there's no need for all the macro magic. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-07-31 10:56:11 -07:00
Mike Blumenkrantz	c47af8b95f	st/dri: simplify dri_get_egl_image by reusing dri2_format_table this makes dri2_get_mapping_by_fourcc accessible from dri_helpers.h and does a direct lookup on the fourcc id to match the pipe format Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-31 09:50:06 -07:00
Mike Blumenkrantz	7404833c2e	gallium: add handling for YUV planar surfaces st/dri: this adds a table (similar to the one in i965) which provides mappings for turning various planar formats into multiple sampler views. whereas only NV12 and IYUV were supported, now many more formats are supported here: * P0XX * YUV4XX * YVU4XX * AYUV * XYUV * YUYV * UYVY the table is used directly to handle image creation, simplifying a lot of code and resolving related TODO/FIXME items where workarounds were previously in place to manage NV12 and IYUV formats exclusively st/mesa: the changes here relate to setting up samplers for the planar formats. this requires: * checking for driver support for all the sampler formats * creating the samplers with the corresponding formats and swizzling * running nir_lower_tex with the appropriate options to trigger the lowering for each plane->sampler fixes kwg/mesa#36 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-31 09:50:06 -07:00
Mike Blumenkrantz	338a29b08f	gallium: add AYUV and XYUV formats this only adds the PIPE_FORMAT members, not any direct handling for them Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2019-07-31 09:50:06 -07:00
Eric Engestrom	53b98b0185	virgl: make use of local variable Otherwise that variable is only used in an assert() and would need an ASSERTED to avoid the warning. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Eric Engestrom	abc226cf41	tree-wide: replace MAYBE_UNUSED with ASSERTED Suggested-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Eric Engestrom	ab9c76769a	r600: replace MAYBE_UNUSED with specific #ifdef Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Eric Engestrom	745bae40ad	gallium/aux: replace MAYBE_UNUSED with UNUSED MAYBE_UNUSED is going away, so let's replace legitimate uses of it with UNUSED, which the former aliased to so far anyway. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Eric Engestrom	c8a453a770	v3d: replace MAYBE_UNUSED with UNUSED MAYBE_UNUSED is going away, so let's replace legitimate uses of it with UNUSED, which the former aliased to so far anyway. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Eric Engestrom	d470f1acce	v3d: drop incorrect MAYBE_UNUSED While at it, use that `screen` variable everywhere. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Eric Engestrom	21196ec927	r600: move variable to proper scope It helps show when it's actually used. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-07-31 09:41:05 +01:00
Kenneth Graunke	3a22a8bf49	iris: Skip repeated depth buffer disables. Often times, the depth buffer is entirely disabled, but color render targets change. For example, GenerateMipmaps will change the color render target for each miplevel, but there is no depth buffer. In the Civilization VI benchmark, this drops the median number of 3DSTATE_DEPTH_BUFFER etc. packets emitted per frame from 472 to 34.	2019-07-30 19:47:41 -07:00
Marek Olšák	665989d98b	radeonsi: release NIR in the right place to fix crashes	2019-07-30 22:06:23 -04:00
Marek Olšák	9ac7d0a0e2	radeonsi: fix packing of key.mono.u.ps	2019-07-30 22:06:23 -04:00
Marek Olšák	33a8eab7a9	radeonsi: don't use lp_build_if for the prim discard compute shader	2019-07-30 22:06:23 -04:00
Marek Olšák	5562b6b067	radeonsi: don't use lp_build_if for the wrapping if block in the VS prolog	2019-07-30 22:06:23 -04:00
Marek Olšák	0ef4c1c04d	radeonsi: don't use lp_build_if for the wrapping if block in merged shaders	2019-07-30 22:06:23 -04:00

39979 commits