fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 15:50:11 +01:00

Author	SHA1	Message	Date
Qiang Yu	5351209632	nir,ac/llvm,radeonsi: replace nir_buffer_atomic_add_amd with ssbo atomic Now that radeonsi support pass desc to ssbo atomic ops, we can use ssbo atomic instead. aco does not implement nir_buffer_atomic_add either. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23096>	2023-06-02 17:51:02 +08:00
Lionel Landwerlin	54dfc08b89	nir: add a new intrinsic to describe resources accessed on intel Intel HW has multiple ways to access resources like UBO/SSBO/images : - binding tables : a small ~240 heap of surfaces - bindless surfaces : a 64Mb heap of surfaces up to Gfx12+, 4Gb on Gfx12.5+ - surfaces : a 4Gb heap on Gfx12.5+ (mostly unused at the moment, only available through the LSC) For samplers, we have 2 options since Gfx11+ : - samplers indexed from the Dynamic State Heap (4Gb) - samplers indexed from the Bindless Sampler Heap (4Gb) Additionally our whole push constant promotion mechanism is based around binding table indices. This is problematic if you want to also promote to push constants things that would be accessed through the bindless heap. To solve this issue, we introduce a new intrinsic that will cary a block index that is not based off the binding table index nor the bindless table offset. We will also use this intrinsic to identify whether the buffer/surface index in load_ubo/load_ssbo/store_ssbo/etc... is relative to the binding table or the bindless heap. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21645>	2023-05-30 06:36:37 +00:00
Rhys Perry	0d26d9d9b6	ac/nir: add fix_derivs_in_divergent_cf Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22636>	2023-05-25 16:29:16 +00:00
Alyssa Rosenzweig	66656822e3	nir: Add image_texel_address intrinsics Some hardware has an instruction to load the address of a texel in a writeable image, given the coordinates ("LEA_IMAGE"). This operation is defined only for uncompressed images, but it is well-defined regardless of the underlying twiddling. As such, it is not expected to be produced by APIs but is useful for internal lowering when it is known that images will be uncompressed (e.g. because image_store does not support compression on the hardware). Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23120>	2023-05-22 14:33:13 +00:00
Samuel Pitoiset	f023ab01e9	nir: add nir_intrinsic_load_poly_line_smooth_enabled To lower smooth lines conditionally in fragment shaders for RADV because the line rasterization mode in Vulkan can be dynamic. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21587>	2023-05-22 07:58:34 +00:00
Alyssa Rosenzweig	59e73674c3	nir: Drop legacy atomics in simple cases This commit drops legacy atomic support from core passes where we can simply delete switch cases with no other changes. As such it's separated from the more complex pass-specific commits for ease of review. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/23036>	2023-05-16 22:36:21 +00:00
Alyssa Rosenzweig	d51bc95837	nir: Add unified atomics Currently, we have an atomic intrinsic for each combination of memory type (global, shared, image, etc) and atomic operation (add, sub, etc). So for m types of memory supported by the driver and n atomic opcodes, the driver has to handle O(mn) intrinsics. This makes a total mess in every single backend I've looked at, without fail. It would be a lot nicer to unify the intrinsics. There are two obvious ways: 1. Make the memory type a constant index, keep different intrinsics for different operations. The problem with this is that different memory types imply different intrinsic signatures (number of sources, etc). As an example, it doesn't make sense to unify global_atomic_amd with global_atomic_2x32, as an example. The first takes 3 scalar sources, the second takes 1 vector and 1 scalar. Also, in any single backend, there are a lot more operations than there are memory types. 2. Make the opcode a constant index, keep different intrinsics for different operations. This works well, with one exception: compswap and fcompswap take an extra argument that other atomics don't, so there's an extra axis of variation for the intrinsic signatures. So, the solution is to have 2 intrinsics for each memory type -- for atomics taking 1 argument and atomics taking 2 respectively. Both of these intrinsics take an nir_atomic_op enum to describe its operation. We don't use a nir_op for this purpose, as there are some atomics (cmpxchg, inc_wrap, etc) that don't cleanly map to any ALU op and it would be weird to force it. The plan is to transition to these new opcodes gradually. This series adds a lowering pass producing these opcodes from the existing opcodes, so that backends can opt-in to the new forms one-by-one. Then we can convert backends separately without any cross-tree flag day. Once everything is converted, we can convert the producers and core NIR as a flag day, but we have far fewer producers than backends so this should be fine. Finally we can drop the old stuff. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rob Clark <robclark@freedesktop.org> Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22914>	2023-05-12 20:39:46 +00:00
Alyssa Rosenzweig	aa6bdbd54a	nir: Use nir_foreach_phi(_safe) The pattern shows up all the time open-coded. Use the macro instead. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22967>	2023-05-12 14:02:23 +00:00
Timur Kristóf	f66281c7fb	amd: Add and implement gs_wave_id sysval. Contains a global wave ID of legacy GS waves. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22690>	2023-05-04 19:08:58 +00:00
Lionel Landwerlin	1e0e4657f9	spirv/nir: wire ray interection triangle position fetch Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <f{merge_request.web_url}>	2023-05-04 11:25:41 +00:00
Lionel Landwerlin	d6e9479d4b	nir/divergence: add missing load_global_constant_* intrinsics Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22624>	2023-04-27 09:08:03 +00:00
Qiang Yu	f7f0d31fcc	nir,ac/llvm,radeonsi: replace nir_load_smem_buffer_amd with nir_load_ubo They use same instruction. Just because when the time nir_load_smem_buffer_amd was introduced, radeonsi didn't support pass buffer descriptor to nir_load_ubo directly. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22523>	2023-04-19 01:59:02 +00:00
Lionel Landwerlin	0b8a2de2a1	anv: add dynamic buffer offsets support with independent sets With independent sets, we're not able to compute immediate values for the index at which to read anv_push_constants::dynamic_offsets to get the offset of a dynamic buffer. This is because the pipeline layout may not have all the descriptor set layouts when we compile the shader. To solve that issue, we insert a layer of indirection. This reworks the dynamic buffer offset storage with a 2D array in anv_cmd_pipeline_state : dynamic_offsets[MAX_SETS][MAX_DYN_BUFFERS] When the pipeline or the dynamic buffer offsets are updated, we flatten that array into the anv_push_constants::dynamic_offsets[MAX_DYN_BUFFERS] array. For shaders compiled with independent sets, the bottom 6 bits of element X in anv_push_constants::desc_sets[] is used to specify the base offsets into the anv_push_constants::dynamic_offsets[] for the set X. The computation in the shader is now something like : base_dyn_buffer_set_idx = anv_push_constants::desc_sets[set_idx] & 0x3f dyn_buffer_offset = anv_push_constants::dynamic_offsets[base_dyn_buffer_set_idx + dynamic_buffer_idx] It was suggested by Faith to use a different push constant buffer with dynamic_offsets prepared for each stage when using independent sets instead, but it feels easier to understand this way. And there is some room for optimization if you are set X and that you know all the sets in the range [0, X], then you can still avoid the indirection. Separate push constant allocations per stage do have a CPU cost. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15637>	2023-04-17 22:43:37 +00:00
Qiang Yu	7fcc5aa9c0	nir: add nir_load_barycentric_optimize_amd intrinsic Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21683>	2023-04-17 02:11:55 +00:00
Lionel Landwerlin	2cf93f7632	nir: add 2 new intel intrinsics for uniform ssbo/shared loads Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21853>	2023-04-05 12:32:56 +00:00
Timur Kristóf	b688a6d227	nir: Remove IB address and stride intrinsics. RADV used these to emulate firstTask for NV_mesh_shader. They are no longer needed. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22139>	2023-03-29 15:08:55 +00:00
Qiang Yu	c9d60547ef	nir,radeonsi: add and implement nir_load_alpha_reference_amd Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21552>	2023-03-28 19:57:11 +00:00
Samuel Pitoiset	bb7e0c4280	spirv,nir: add support for SpvBuiltInFullyCoveredEXT Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21497>	2023-03-21 08:44:09 +00:00
Timur Kristóf	022e55557b	nir: Add load_typed_buffer_amd intrinsic. This new intrinsic maps to the MTBUF instruction format on AMD GPUs and represents a typed buffer load in NIR. Also add an unsigned upper bound for the new intrinsic. Code for that ported from aco_instruction_selection_setup. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16805>	2023-03-15 14:54:27 +00:00
Marek Olšák	f7076d129d	amd: add nir_intrinsic_xfb_counter_sub_amd and fix overflowed streamout offsets Fixes: `5ec79f9899` - ac/nir/ngg: nogs support streamout Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21584>	2023-03-07 22:08:47 +00:00
Marek Olšák	9f1e6d8f70	nir,amd: add and use nir_intrinsic_load_esgs_vertex_stride_amd This will emulate VGT_ESGS_RING_ITEMSIZE, which does the multiplication for us. It's beneficial to stop setting VGT_ESGS_RING_ITEMSIZE to reduce context rolls, and also the register will be removed in the future. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21525>	2023-02-24 21:27:24 +00:00
Caio Oliveira	e40b1df432	nir: Add nir_intrinsic_rotate Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19797>	2023-02-24 06:33:51 +00:00
Georg Lehmann	ee47cc8256	amd,nir: remove byte_permute_amd intrinsic It's unused and if we ever want to use it again we should make it an alu opcode instead. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/21445>	2023-02-22 20:13:52 +00:00
Daniel Schürmann	2bb369dd8d	nir: add assertions that loops don't have a Continue Construct Hoping that I didn't miss any, this should add assertions to all functions and passes which explicitly handle 'nir_loop'. Acked-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13962>	2023-02-21 10:41:11 +00:00
Lionel Landwerlin	b82d9b1a3d	nir/divergence: add missing RT intrinsinc handling Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20763>	2023-01-18 22:32:43 +00:00
Lionel Landwerlin	3af08b9c30	nir/divergence: handle shader_record_ptr intrinsic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes `6b8fd65e84` ("spirv: Implement the new ray-tracing storage classes") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20413>	2022-12-23 09:22:13 +00:00
Qiang Yu	e85c5d8779	nir/divergence_analysis: add missing intrinsics Reviewed-by: Marek Olšák <marek.olsak@amd.com> Singed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18666>	2022-12-19 09:22:24 +08:00
Qiang Yu	1461b5f61b	nir: add image fragment mask load intrinsic Like nir_texop_fragment_mask_fetch_amd, this is used to load multi sample image fmask data for AMD GPU. We will lower multi sample image load and samples_identical intrinsics to use it latter for radeonsi. RADV does not need this because it always expand fmask images before dispatch compute shader. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18666>	2022-12-19 09:22:11 +08:00
Qiang Yu	796a150196	nir: add nir_load_ring_gs2vs_offset_amd Used by legacy GS output lowering. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20158>	2022-12-13 11:42:33 +08:00
Qiang Yu	bb837bf6ef	nir,ac/llvm: add nir_buffer_atomic_add_amd Used by radeonsi for lower nir_atomic_add_gen/xfb_prim_count_amd. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18010>	2022-12-02 07:34:31 +00:00
Qiang Yu	8030fbcf16	nir,ac/llvm: add nir_load_smem_buffer_amd Used by radeonsi to load const buffer. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18010>	2022-12-02 07:34:31 +00:00
Jason Ekstrand	4fb33124c3	nir/divergence: Handle base_workgroup_id and workgrpu_id_zero_base Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20068>	2022-12-01 04:56:48 +00:00
Lionel Landwerlin	99dcdf4d64	nir/divergence: add missing btd_shader_type_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `6d9ae6ec1e` ("intel: add a new intrinsic to get the shader stage from bindless shaders") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19948>	2022-11-23 15:04:22 +00:00
Qiang Yu	533b39bfcb	nir,ac/llvm,radeonsi: add nir_load_clamp_vertex_color_amd Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19429>	2022-11-11 04:22:20 +00:00
Rhys Perry	e6d26cb288	nir,ac/nir,aco,radv: replace has_input_*_amd with more general intrinsics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19228>	2022-10-31 14:33:43 +00:00
Marek Olšák	0ac37b595a	nir: add nir_intrinsic_optimization_barrier_vgpr_amd for LLVM We need this for the MSAA resolve shader. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19243>	2022-10-29 18:38:33 +00:00
Qiang Yu	3d6cce2e4c	nir: add two amd ngg lds base load intrinsics These two values are not known when compile for radeonsi. They are relocated when link/upload time. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18832>	2022-10-27 07:35:01 +00:00
Lionel Landwerlin	117b32a594	nir/divergence_analysis: add missing desc_set_address_intel Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19320>	2022-10-26 21:09:20 +00:00
Lionel Landwerlin	edda5731c0	nir/divergence_analysis: add some missing RT intrinsics Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19320>	2022-10-26 21:09:20 +00:00
Lionel Landwerlin	5a9f8d21d0	nir/lower_shader_calls: lower scratch access to format internally For a follow up optimization, we would like to track scratch loads. This isn't possible with global load/store intrinsics. So use a couple of special intrinsic in the pass and only lower it to global intrinsics at the end. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/16556>	2022-10-26 12:53:25 +00:00
Rhys Perry	382831c986	radv,nir: add intrinsics for streamout and GS copy shaders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19302>	2022-10-25 17:35:08 +00:00
Qiang Yu	7fb506d068	nir: add nir_load_prim_xfb_query_enabled_amd Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17457>	2022-10-25 12:58:43 +00:00
Qiang Yu	83643e4dc8	nir,ac/nir/ngg,radv: split shader_query_enabled_amd For used by different counter. Vulkan: 1. VK_QUERY_PIPELINE_STATISTIC_GEOMETRY_SHADER_PRIMITIVES_BIT, sum generated primitives of all 4 streams when GS. 2. VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT, count generated primitives for all 4 streams when VS/TES/GS. 3. VK_QUERY_TYPE_TRANSFORM_FEEDBACK_STREAM_EXT, count generated and streamout primitives for all 4 streams when VS/TES/GS. OpenGL: 1. GL_GEOMETRY_SHADER_PRIMITIVES_EMITTED_ARB, sum generated primitives for all 4 streams when GS. 2. GL_PRIMITIVES_GENERATED, count generated primitives for all 4 streams when VS/TES/GS. 3. GL_TRANSFORM_FEEDBACK_PRIMITIVES_WRITTEN, count streamout primitives for all 4 streams when VS/TES/GS. pipeline_stat_query_enabled_amd is for Vulkan 1 and OpenGL 1. xfb_query_enabled_amd is for Vulkan 2/3 and OpenGL 2/3. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19015>	2022-10-25 02:42:52 +00:00
Samuel Pitoiset	09033c7b22	nir: add nir_intrinsic_load_ring_attr_{offset}_amd These intrinsics will be used to lower NGG attributes to memory on GFX11. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19173>	2022-10-20 15:59:44 +00:00
Qiang Yu	58e006b174	nir,ac/llvm,radv: add nir_intrinsic_load_provoking_vtx_in_prim_amd For radeonsi which load this from arg. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/19166>	2022-10-20 06:53:56 +00:00
Samuel Pitoiset	dd30e7bfa0	nir: add nir_load_rasterization_samples_amd This will be used to load the number of rasterization samples when a fragment shader is compiled inside a library without the MSAA state. RADV needs to know the number of samples for loading sample positions with interpolateAtSample(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18677>	2022-09-21 10:30:33 +00:00
Samuel Pitoiset	7f444fc72c	nir: add nir_intrinsic_load_sample_positions_amd This will be used to lower barycentric_at_sample in NIR for RADV. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18615>	2022-09-20 09:52:37 +00:00
Rhys Perry	7d26fafacf	radv: fix dynamic RT stack size with VGPR spilling VGPR spilling might cause VGPRs to be spilled at scratch offset 0, so we can't use that. fossil-db (Sienna Cichlid, Q2RTX and Control): Totals from 4 (0.26% of 1524) affected shaders: Instrs: 8734 -> 8737 (+0.03%) CodeSize: 48492 -> 48504 (+0.02%) Latency: 384375 -> 384369 (-0.00%) InvThroughput: 256250 -> 256246 (-0.00%) Copies: 1312 -> 1313 (+0.08%) Branches: 256 -> 258 (+0.78%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18541>	2022-09-20 01:39:20 +00:00
Qiang Yu	4e06a8f15e	nir: add nir_intrinsic_ordered_xfb_counter_add_amd Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17654>	2022-09-16 08:51:28 +00:00
Qiang Yu	1119e06a45	nir,ac/llvm: add nir_intrinsic_load_ordered_id_amd Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/17654>	2022-09-16 08:51:28 +00:00

... 2 3 4 5 6

288 commits