fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 07:08:04 +02:00

Author	SHA1	Message	Date
Samuel Pitoiset	31dc03e21e	radv: link primitive ID/clip distance shader info from the new helper No functional changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	96b9d9f081	radv: add a helper that links shader info between stages Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	8c6a252c74	radv: remove redundant VS output parameter assignments assign_outinfo_params() should already assign them. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	2d0500d24a	radv: fill radv_vs_output_info unconditionally for vertex related stages That shouldn't change anything for VS as LS (or as ES) and for TES as ES because radv_vs_output_info is only used by the last vertex stage. So, if we have TES+GS, radv_vs_output_info for TES will be overwritten by GS. This allows to decouple the shader info pass from other stages. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	ee5b9bcc57	radv: stop duplicating radv_vs_output_info Only the last vertex stage needs to access this. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	45a0276cd1	radv/llvm: remove unused parameter in handle_vs_outputs_post() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	20ebdc3c2b	radv: replace cs.uses_task_rings by ms.has_task Task shaders always use a ring, so this field was useless somehow. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	03d2af30f6	radv: remove dead code about task ring when binding a compute pipeline This is probably a leftover when task shader has been reworked, but it has no effect. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	38ae5b6da6	radv: compute the ESGS itemsize outside of radv_nir_shader_info_pass() radv_nir_shader_info_pass() should run on individual shaders only, and "linked" shader info should be done separately for better design. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	dbf175f255	radv: use esgs_itemsize when calling ac_nir_lower_es_outputs_to_mem Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	0df2d5e318	radv: stop duplicating radv_es_output_info This structure isn't really useful and it contains only one field. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	a04fd5c61f	ac: constify ac_compute_cs_workgroup_size() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18210>	2022-08-26 14:07:09 +00:00
Samuel Pitoiset	8cd1683944	aco: fix wrong size for 1D images and A16 on GFX9 Size is in bytes, not bits. Fixes plenty of crashes in CI, like dEQP-VK.synchronization.op.single_queue.event.write_image_fragment_read_image_tess_eval.image_128_r32_uint. Fixes: `46f6e2ddbb` ("aco: Implement storage image A16.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18266>	2022-08-26 13:30:46 +00:00
Samuel Pitoiset	0250925f07	radv: destroy the pipeline layout if creating a library failed It should be properly cleaned. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18252>	2022-08-26 13:08:29 +00:00
Samuel Pitoiset	39bebff1ac	radv: fix missing initialization of the pipeline layout when creating a lib The base object won't be initialized otherwise. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18252>	2022-08-26 13:08:29 +00:00
Samuel Pitoiset	e6e8c092ff	radv: remove bogus assertion about independent set layouts with GPL layout->independent_sets can't be TRUE here. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18252>	2022-08-26 13:08:29 +00:00
Samuel Pitoiset	64045fcf7c	radv: re-emit viewports if negative one to one or depth clamp mode changed The following sequence would be broken if we don't re-emit viewports. vkCmdSetViewport() VkCmdBindPipeline(negative_one_to_one = false) vkCmdDraw() VkCmdBindPipeline(negative_one_to_one = true) vkCmdDraw() Found by inspection. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18245>	2022-08-26 12:46:41 +00:00
Danylo Piliaiev	1eb7a85b55	tu: Update HS_WAVE_INPUT_SIZE formula A better explanation for SP_HS_WAVE_INPUT_SIZE is that it is the size of local memory to allocate per wave (which can be more than one patch), in 256B units. Then the maximum of 64 makes sense because only 16KB of local memory is reserved for VS<->HS linkage. The resulting formula matches the blob behaviour, even when patch_control_points and tcs_vertices_out have different values, while the past formula gave wrong answers on gen3+. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Suggested-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17957>	2022-08-26 15:18:42 +03:00
Danylo Piliaiev	a7db1da37d	tu: Fix streamout with tess_use_shared Mirrors `31835ac3b8` change in freedreno. Together with "tu: Fix HS input size formula for gen3+" fixes following tests from GL CTS running via Zink: dEQP-GLES31.functional.tessellation.invariance.inner_triangle_set.quads_fractional_odd_spacing dEQP-GLES31.functional.tessellation.invariance.inner_triangle_set.triangles_fractional_odd_spacing dEQP-GLES31.functional.tessellation.invariance.primitive_set.triangles_fractional_odd_spacing_ccw dEQP-GLES31.functional.tessellation.invariance.primitive_set.triangles_fractional_odd_spacing_cw dEQP-GLES31.functional.tessellation.invariance.triangle_set.triangles_fractional_odd_spacing dEQP-GLES31.functional.tessellation.primitive_discard.quads_fractional_odd_spacing_ccw dEQP-GLES31.functional.tessellation.primitive_discard.quads_fractional_odd_spacing_cw dEQP-GLES31.functional.tessellation.primitive_discard.triangles_fractional_odd_spacing_ccw dEQP-GLES31.functional.tessellation.primitive_discard.triangles_fractional_odd_spacing_cw Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17957>	2022-08-26 15:14:10 +03:00
Danylo Piliaiev	0120e7b9d9	freedreno: PC_SO_STREAM_CNTL_STREAM_ENABLE has per-stream enable bits PC_SO_STREAM_CNTL.STREAM_ENABLE mirrors VPC_SO_STREAM_CNTL.STREAM_ENABLE Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17957>	2022-08-26 15:14:10 +03:00
Danylo Piliaiev	0bf2033e0d	tu: Implement VK_EXT_attachment_feedback_loop_layout Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18064>	2022-08-26 10:29:00 +00:00
Erik Faye-Lund	b7601dd27e	zink: wrap discard in a function This makes discard less weird, and allows us to treat it as control-flow. This makes things less bizarre for drivers. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7070 Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18244>	2022-08-26 10:05:03 +00:00
Erik Faye-Lund	47d67912bd	zink: add spirv_builder_function_call It can be useful not just to create functions, but also being able to call them. This adds the spirv_builder-helper for this. Cc: mesa-stable Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18244>	2022-08-26 10:05:03 +00:00
Erik Faye-Lund	41dfed6e12	zink: type_main -> type_void_func This type will be reused later on, so let's have the name describe what is is, not what it's used for. Cc: mesa-stable Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18244>	2022-08-26 10:05:03 +00:00
Jordan Justen	f4c44444ad	intel/pci_ids: Add 0x468b ADL-S PCI-id Ref: bspec 53655 Fixes: `d399c3e861` ("intel/dev: Add device info for ADL-S") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17569>	2022-08-26 08:50:42 +00:00
Jordan Justen	6ca37aabfb	intel/pci_ids: Update ADL-S strings Ref: bspec 53655 Fixes: `d399c3e861` ("intel/dev: Add device info for ADL-S") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17569>	2022-08-26 08:50:42 +00:00
Gert Wollny	bf4234d088	r600/sfn: Use a low number for unused target register This reduces the number of registers reserved by the shader units and makes more threads possible. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6856 Fixes: `79ca456b48` r600/sfn: rewrite NIR backend Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18212>	2022-08-26 08:27:42 +00:00
Gert Wollny	90f99369ae	r600: Fix reporting TGSI IR support When NIR is not explicitely enabled we still support TGSI. Fixes: `33765aa92a` r600/sfn: Enable NIR for pre RG hardware Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18212>	2022-08-26 08:27:42 +00:00
Gert Wollny	c81fe5b235	r600/sfn: Use a heuristic to keep SSBO setup and store close When SSBO instructions use constant address values the address loading is immediately ready, scheduling the address loads early increases the register pressure, so force a new instruction block to work around this problem. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6975 Fixes: `79ca456b48` r600/sfn: rewrite NIR backend v2: do handling in shader block to be thread save (hinted to by Filip) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18212>	2022-08-26 08:27:42 +00:00
Gert Wollny	1f5dccb760	r600/sfn: Don't scan the whole block for ready instructions Limit the number of tested instructions and the number of ready instructions that might be taken into account. This reduces the time needed to run the scheduler significantly. Fixes: `79ca456b48` r600/sfn: rewrite NIR backend Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18212>	2022-08-26 08:27:42 +00:00
Gert Wollny	79eabb8130	r600/sfn: Don't schedule GDS instructions early Atomic GDS instructions like inc, dec, or read will increase the register pressure, therefore we shouldn't prioritize scheduling them. Related: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6975 Fixes: `79ca456b48` r600/sfn: rewrite NIR backend Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18212>	2022-08-26 08:27:41 +00:00
Gert Wollny	fd71cd0b6a	r600/sfn: Don't tag mem-ring and stream instructions as exports Export instructions allow burst writes, so it makes send to try to allocate consecutive registers, but for ring writes we don't schedule the outputs correctly to exploit this, so for now don't mark these instructions as export to let the RA restart picking colors. When the scheduler starts to emit the ring writes in the right order to allow for bust writes we might revisit this. This fixes spec@glsl-1.50@execution@variable-indexing@gs-output-array-vec4-index-wr Fixes: `79ca456b48` r600/sfn: rewrite NIR backend Related: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6975 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18212>	2022-08-26 08:27:41 +00:00
Gert Wollny	3a0f085837	r600/sfn: Handle color0 writes all on R700 like on EG Fixes: `069f3869ac` r600/sfn: Fix color outputs when color0 writes all Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Filip Gawin <filip@gawin.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18212>	2022-08-26 08:27:41 +00:00
Lucas Stach	43eb5e777e	etnaviv: add debug option to disable linear PE feature Linear PE has already shown to have some rough corner cases in the hardware and also has performance implications. Add a debug option to allow to disable the feature, so users can more easily check if some issue is caused by this feature. CC: mesa-stable #22.2 Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18232>	2022-08-26 07:47:09 +00:00
Lucas Stach	ea8fc9592c	etnaviv: use linear PE rendering only on properly aligned surfaces When linear rendering is used together with TS, the color tiles must be fully contained in a single row of pixels. When wrapping around to the next row TS gets confused and records wrong tile status information, leading to visual corruption when the surface is resolved/decompressed. The corruption can be fixed by increasing the stride alignment for linear render targets, but that would break some existing use-cases, as some display engines used together with Vivante GPUs currently don't support strides that don't match the horizontal display resolution. For now only enable linear PE rendering when the surface is properly aligned already. This allows to use the optimization in a lot of common use-cases, but falls back to the proven tiled rendering with subsequent resolve into linear for the problematic cases. CC: mesa-stable #22.2 Fixes: `53445284a4` ("etnaviv: add linear PE support") Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Tested-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18232>	2022-08-26 07:47:09 +00:00
Lucas Stach	09953d7b75	etnaviv: move checking for MC2.0 for TS into screen init The decision whether to use fast clear aka TS currently checks for two feature bits: FAST_CEAR and MC20. We check for MC20, as TS on MC1.0 bypasses the memory offset and we don't have any way to fixup the GPU address to account for that. It could be done with some support of the kernel driver, but then GPUs with MC1.0 are very rare to find these days, so not sure if we are ever going to bother with that. Instead of checking two separate feature bits to determine if TS can be used, mask out the FAST_CLEAR bit from the features when MC20 isn't present. This way we only have to check for a single feature bit. CC: mesa-stable #22.2 Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Tested-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18232>	2022-08-26 07:47:09 +00:00
Samuel Pitoiset	68e69d002f	radv: stop emitting RMW context registers for updating sample locations RMW context registers have been removed in RadeonSI a while ago because they don't seem good for performance. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18234>	2022-08-26 06:33:05 +00:00
Samuel Pitoiset	2f5891108a	radv: cleanup dynamic states in radv_emit_graphics_pipeline() Some dynamic states always need to be emitted when the first pipeline is emitted, some others depend on pipeline state. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18234>	2022-08-26 06:33:05 +00:00
Samuel Pitoiset	85a55009be	radv: stop clearing bitfields for registers that are emitted dynamically These fields aren't set at pipeline creation, so clearing them is just useless. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18234>	2022-08-26 06:33:05 +00:00
Samuel Pitoiset	7aaa016b23	radv: stop setting CB_COLOR_CONTROL.ROP3 from the pipeline This is useless because logic op is a dynamic state and it's already emitted from the cmdbuf. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18234>	2022-08-26 06:33:05 +00:00
Qiang Yu	b5c10a9028	ac/llvm: cast tes_u/v_replaced to float Otherwise LLVM float ops fail to operate on them. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	f75452918b	ac/nir/ngg: support clipdist culling Port from radeonsi. Besides vertex position based primitive culling, clipdist attribute can also be used to cull a primitive. Normally it's used by fixed-pipeline, but when NGG we can treate it as a culling condition to filter out invisible primitive before fixed-pipeline. There are two kinds of clipdist: 1. user define a clip plane explicitly by glClipPlane(), fixed-pipeline calculate with vertex position to get clipdist, then cull. This is the legacy way. 2. Now GLSL define gl_ClipDistance/gl_CullDiatance so that user can calculate clipdist in any way he like. This implementation support both way. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	620e62bb39	ac/nir/ngg: support component position store Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	1bdeb961bd	ac/nir/ngg: add gs culling Port from radeonsi. Cull primitive after GS thread and before final vertex/primitive export. GS culling is like VS/TES culling which read out saved vertex positions of a primitive from LDS then call the primitive culling algorithm to check whether it's visiable or not, only passed primitives will be exported. Unlike the VS/TES culling that read vertex index of a primitive from VGPRs as shader args, GS will set a primitive complete flag for each last vertex of a primitive in LDS, so that vertex thread know the previous 1/2/3 vertex can form a primitive and do primitive culling. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	b212fd4b1e	ac/nir/ngg: save and restore position output base for nogs radeonsi has different driver_location and io location. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	7e17e01973	ac/nir/ngg: save and restore output bit size for gs radeonsi does not have io nir variables, so need to save output bit size when lower store_output intrinsic. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	93a635c2c8	ac/nir/ngg: use same driver location for gs output driver_location and io location are different for radeonsi, and radeonsi llvm rely on the correct driver_location to index output variables. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	347a94666c	ac/nir/ngg: fix and simplify gs store output lower Simplify: 64bit IO has been lowered by nir_lower_io with nir_lower_io_lower_64bit_to_32, so no need to handle in the ngg lower. Fix: we need to increase io_sem.location by base_offset for correct gs_output_info. radeonsi has different driver_location and io location, so also change the output variable index to io location. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	db0e9d3cab	ac/nir/ngg: support line culling Port from ac_llvm_cull.c Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00
Qiang Yu	f1f2c931a7	ac/nir/cull: support caller react when primitive is rejected Make accept_func optional, and return accpect result for caller react when primitive is rejected. This is for GS culling. Acked-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17651>	2022-08-26 05:50:30 +00:00

1 2 3 4 5 ...

158643 commits