fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 15:30:14 +01:00

Author	SHA1	Message	Date
Marek Olšák	fe35a8b00e	nir: change "user_data_amd" sysval from 4 to 8 components so that we can pass more fast constants to compute shaders (without reading memory in the shader). Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28606>	2024-04-13 16:45:08 +00:00
Marek Olšák	c1f750eed9	nir: add nir_intrinsic_optimization_barrier_sgpr_amd for radeonsi Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28606>	2024-04-13 16:45:08 +00:00
Konstantin Seurer	85e840786c	nir: Add lavapipe ray tracing intrinsics Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28187>	2024-04-09 07:13:01 +00:00
Rhys Perry	543ca160a5	nir,aco: add test intrinsics These don't really do anything. They're just a source and user of SSA defs. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28301>	2024-04-08 18:38:39 +00:00
Alyssa Rosenzweig	57fa9a2b8e	nir: add intrinsics for non-monolithic agx shaders Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:19 +00:00
Alyssa Rosenzweig	e536b4973f	nir: add export/load_exported_agx intrinsics for lowering non-monolithic ABI Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:19 +00:00
Alyssa Rosenzweig	df8e52a795	nir: add samples_log2_agx sysval Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:19 +00:00
Alyssa Rosenzweig	70395c1ac1	asahi: delete layer id code Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:19 +00:00
Alyssa Rosenzweig	5c7ce24896	asahi: make point size replacement dynamic I'm not measuring a significant perf difference in -bshading:shading=phong:model=bunny -bideas -brefract so this seems Good Enough For Me. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:19 +00:00
Alyssa Rosenzweig	499d091208	nir: add intrinsics for lowered VS outputs handling VS indirects will require some driver cooperation. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:19 +00:00
Alyssa Rosenzweig	1773eb329c	nir: add offset to load_coefficients_agx for indirect varyings Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:19 +00:00
Alyssa Rosenzweig	72ef80dfc8	asahi: stop merging VS and TCS unfortunately, shader stage merging is bogus when coherent images are used, so we need an unmerged path. i'd rather not maintain two paths, so let's just stop merging. as a bonus this makes ESO a lot easier, and lets us reuse the same VS for both VS->GS and VS->TCS. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28483>	2024-03-30 00:26:17 +00:00
Ian Romanick	a8115221e5	nir: intel/brw: Change the order of sources for nir_dpas_intel It was by pure luck that all sources (and the result) of nir_dpas_intel had the same number of components. It is possible to support matrix sizes where the accumlator matrix and the result matrix are larger (e.g., 16x8 * 8x16 = 16x16). This breaks all of the assumptions of NIR's infrastructure for code generating intrinsics. Fix the by making the accumulator matrix be the first source. The accumulator and the result will always have the same dimensions (due to rules of matrix multiplication) and the same type (due to restructions of the cooperative matrix extension). This forces them to have the same number of components. This doesn't fix all the potential problems. NIR expects that all 0-sized sources will have the same number of components. This just ensures that the result has the correct number of components. Fixes: `6b14da33ad` ("intel/fs: nir: Add nir_intrinsic_dpas_intel") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28404>	2024-03-29 21:12:32 +00:00
Timur Kristóf	411de8488c	nir: Add two new AMD specific tess intrinsics. These will be needed to implement some tessellation dynamic states within the TCS as opposed to using an epilog. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28408>	2024-03-28 23:44:03 +00:00
Faith Ekstrand	d4ac4ce112	nak/nir: Use nir_io_semantics for FS outputs We also add a new nir_intrinsic_fs_out_nv to which is a lot simpler than store_output to pass to the NAK back-end. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28377>	2024-03-26 05:57:12 +00:00
Faith Ekstrand	879c5c1dda	nak: Add a condition to bar_break_nv Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28300>	2024-03-25 15:55:49 +00:00
Faith Ekstrand	4fcbf558dd	nak: Add a copy_fs_outputs_nv intrinsic This is just a little handle to tell the back-end where to do the copy. Ideally, we'd have a NIR intrinsic that does the copy but we need to be able to copy any number of registers up to 34 and NIR intrinsics just aren't that flexible. Reviewed-by: M Henning <drawoc@darkrefraction.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28300>	2024-03-25 15:55:49 +00:00
Marek Olšák	1585a5cc6d	nir,amd: add nir_intrinsic_load_debug_log_desc_amd and its use for shader debugging Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27952>	2024-03-22 21:58:02 +00:00
Marek Olšák	6773595ed0	nir: rename AMD XFB intrinsics to *_gfx11_amd to indicate it's only for gfx11. Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27952>	2024-03-22 21:58:02 +00:00
Faith Ekstrand	b68f2e747c	zink: Rework sparse texture lowering Instead of the previous fragile attempt to handle sparse_resident_and by crawling deref chains, we now insert an is_sparse_resident_zink intrinsic immediately after the tex or sparse_load intrinsic and define Zink's sparse resident codes to always be 0/1. Then sparse_resident_and becomes iand and is_sparse_texels_resident becomes != 0 and everything is well-defined and robust. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28123>	2024-03-14 22:37:51 +00:00
Juan A. Suarez Romero	62e1dff256	v3d: add load_fep_w_v3d intrinsic This intrinsic helps to read the W coordinate stored in the QPU register when initializing the input data for the fragment shaders. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/28072>	2024-03-11 12:42:49 +00:00
Georg Lehmann	230743da2e	nir: remove rotate scope All other subgroup operations do not have a scope in NIR, so for consistency rotate shouldn't have one either. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27964>	2024-03-05 14:12:21 +00:00
Lionel Landwerlin	259cdc5496	nir: add additional flag to resource_intel for embedded samplers This will enable specific lowering of embedded samplers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22151>	2024-02-29 07:05:06 +00:00
Ian Romanick	5da5106727	nir: Add documentation for subgroup_.._mask v2: Fix reference to GL_ARB_shader_ballot. Noticed by Lionel. Suggested-by: Lionel Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27044>	2024-02-27 08:36:09 -08:00
Bas Nieuwenhuizen	c7b2ac3377	radv: Remove ray_launch_size_addr_amd system value. Not used anymore, so clean it up. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27664>	2024-02-17 11:08:16 +00:00
Caio Oliveira	a88084f8be	intel/compiler: Rename brw_image_param to isl_image_param And move them to ISL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27475>	2024-02-14 22:31:23 -08:00
Alyssa Rosenzweig	cb0b027c59	asahi: make clip_halfz dynamic we could move this to the linker but meh, this is good enough for now Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:32 +00:00
Alyssa Rosenzweig	6673924b7e	asahi: make gs topology dynamic even with shobjs, we know the class of topology statically, so we just need to select between the (up to) 3 compatible topologies, and luckily there are common subexpressions we can factor out when calculating all 3 at once. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:32 +00:00
Alyssa Rosenzweig	17896f1699	nir: rm load_vert_id_in_prim_agx now unused since we separate vs/gs Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:31 +00:00
Alyssa Rosenzweig	c6c8262ce1	asahi: implement pipeline stats as a checkbox real impl is blocked on uapi to plumb thru hw perf counters. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:30 +00:00
Asahi Lina	b89da92a5e	agx: compiler: Add fence_helper_exit_agx barrier This is used by the helper program on exit. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:29 +00:00
Asahi Lina	b07dbf7b0f	nir: Add AGX-specific helper opcodes These opcodes are used by the helper program to fetch the current operation info and core ID. Signed-off-by: Asahi Lina <lina@asahilina.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:29 +00:00
Alyssa Rosenzweig	311070f7af	nir: add active_subgroup_invocation_agx sysval Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:29 +00:00
Alyssa Rosenzweig	5dc0f5ccba	asahi: implement VBO robustness GL semantics. GLES (weaker) and VK (stronger) semantics are left as a todo, with explanations given. Enabled always to deal with null VBOs, this should be optimized once we have soft fault. This necessitates a rework of VBO keys, but hopefully for the best. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:29 +00:00
Alyssa Rosenzweig	9753cd44f7	asahi: Implement skeleton for tessellation This implements a rough skeleton of what's needed for tessellation. It contains the relevant lowerings to merge the VS and TCS, running them as a compute kernel, and to lower the TES to a new VS (possibly merged in with a subsequent GS). This is sufficient for both standalone tessellation and tess + geom/xfb together. It does not yet contain a GPU accellerated tessellator, simply falling back to the CPU for that for now. Nevertheless the data structures are engineered with that end goal in mind, in particular to be able to tessellate all patches in parallel without needing any prefix sums etc (using simple watermark allocation for the heap). Work on fleshing out the skeleton continues in parallel. For now, this does pass the tests and lets the harder stuff get regression tested more easily. And merging early will ease rebase. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:28 +00:00
Alyssa Rosenzweig	2d37d1b704	asahi: lower poly stipple Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27616>	2024-02-14 21:02:28 +00:00
Connor Abbott	6a744ddebc	ir3: Initial support for pushing globals with ldg.k Add a separate pass which uses the analyze_ubo_ranges machinery to construct ranges of readonly globals accessed in the shader and push them to constants in the preamble, using ldg.k if possible. This is enough to handle inline uniforms in turnip but also provides a base for OpenCL, although the pass would need further work for that. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26934>	2024-02-12 22:05:13 +00:00
Connor Abbott	45c71803f9	tu: Add more info to ldg inline uniform path This will let us push the ldg into the preamble. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26934>	2024-02-12 22:05:13 +00:00
Job Noorman	60413e11c2	ir3: optimize subgroup operations using brcst.active Follow the blob and optimize subgroup operation using brcst.active and getlast when supported. The transformation consists of two parts. First, a NIR transform replaces subgroup operations with a sequence of new brcst_active_ir3 intrinsics followed by a new [type]_clusters_ir3 intrinsic (where type can be reduce, inclusive_scan, or exclusive_scan). The brcst_active_ir3 intrinsic is lowered directly to a brcst.active instruction. The other intrinsics get lowered to a new macro (OPC_SCAN_CLUSTERS_MACRO) which later gets emitted as a loop (using getlast/getone) that iterates all clusters and produces the requested scan result. OPC_SCAN_CLUSTERS_MACRO has a number of optional arguments. First, since the exclusive scan result is not a natural by-product of the loop but has to be calculated explicitly, its destination is optional. This is necessary since adding it unconditionally will produce unused instructions that won't be DCE'd anymore at this point. Second, when performing 32b MUL_U reductions (that expand to multiple instructions), an extra scratch register is necessary. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6387 Signed-off-by: Job Noorman <jnoorman@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26950>	2024-02-02 19:49:22 +00:00
Faith Ekstrand	48ebfeba34	nak: Add a source barrier intrinsic This just inserts a GPU stall until the given source is available. We need this in order to properly implement shader clock. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27303>	2024-01-26 16:55:50 +00:00
Georg Lehmann	1cb5bf7009	nir: add ballot_relaxed and as_uniform intrinsics Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27116>	2024-01-19 20:13:33 +00:00
Faith Ekstrand	82fe981e35	nir,spirv: Add support for SPV_NV_shader_sm_builtins Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27154>	2024-01-18 20:20:06 +00:00
Alyssa Rosenzweig	8ddd89ffa5	nir,zink: Redefine flat_mask in terms of I/O locations Robust against separable shaders, and still makes sense for lowered I/O drivers, whereas just counting FS variables and expecting them to match with the VS is... questionable. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Signed-off-by: antonino <antonino.maniscalco@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26888>	2024-01-10 14:30:14 +00:00
Alyssa Rosenzweig	97f9f7ab0a	asahi: implement point sprites w/o shader key we can replace varyings with point sprites, we just need to fix up .zw appropriately. do that with some bcsels, ALU is cheap. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26963>	2024-01-10 08:44:38 -04:00
Ian Romanick	6b14da33ad	intel/fs: nir: Add nir_intrinsic_dpas_intel v2: Fix parameter order in nir_intrinsic_dpas_intel to DPAS conversion. v3: Fix float16 destination DPAS on DG2. v4: Use nir_component_mask(...) instead of 0xffff. Suggested by Caio. v5: Rebase on !26323. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25994>	2023-12-29 20:28:43 -08:00
Lionel Landwerlin	f53748c481	nir: fixup nir_printf intrinsic description Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26505>	2023-12-12 11:11:10 +00:00
Alyssa Rosenzweig	c43c90a5fa	asahi: rewrite pointsize handling In the wise words of Mike Blumenkrantz, "I hate gl_PointSize and so can you". The mesa/st lowering won't mesh well with vertex shader epilogues, and it falls over in various circumstances. I am too tired to go against the grain, so let's just pretend to be a normal gallium driver and trust in the rasterizer CSO, lowering point size internally. This properly handles transform feedback without any hacks, both GL and GLES behaviours, etc. Fixes: KHR-GL31.transform_feedback.capture_vertex_separate_test gl-2.0-large-point-fs Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26614>	2023-12-09 12:08:39 -04:00
Alyssa Rosenzweig	5987e47a29	asahi: rework GS input assembly in prep for tessellation (which will share the IA lowering), and for multidraw indirect (which greatly complicates IA lowering with geom/tess). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26614>	2023-12-09 12:08:39 -04:00
Marek Olšák	7d2faa88ab	nir,radeonsi: add FLAGS into load_vector_arg_amd to record color input usage This will be needed for gathering color usage from lowered PS. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26307>	2023-12-09 00:05:27 +00:00
Faith Ekstrand	eda940c855	nak: Make barriers SSA-friendly The NIR intrinsics now take and return a barrier whenever one is modified instead of modifying in-place. In NAK, we give the internal instructions the same treatment and convert everything to use barrier SSA values and RegRefs. In nak_from_nir, we move all barriers to/from GPRs. We'll clean up the massive pile of OpBMov later. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26463>	2023-12-05 18:59:40 +00:00

1 2 3 4 5 ...

435 commits