fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 07:08:05 +02:00

Author	SHA1	Message	Date
Rob Clark	790144e65a	util+treewide: container_of() cleanup Replace mesa's slightly different container_of() with one more aligned to the linux kernel's version which takes a type as the 2nd param. This avoids warnings like: freedreno_context.c:396:44: warning: variable 'batch' is uninitialized when used within its own initialization [-Wuninitialized] At the same time, we can add additional build-time type-checking asserts Signed-off-by: Rob Clark <robdclark@chromium.org> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7941>	2020-12-10 16:48:36 +00:00
Jason Ekstrand	630e54a08b	nir: Add a halt instruction type Halt is like a return for the entire shader or exit() if you prefer to think of it that way. Once an invocation hits a halt, it's 100% dead. Any writes to output variables which happened before the halt do, however, still apply. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:09 +00:00
Alejandro Piñeiro	429c336412	broadcom/compiler: separate texture/sampler info from v3d_key So far the v3d compiler has them combined, as for OpenGL both are the same. This change is intended to fit the v3d compiler better with Vulkan, where they are separate concepts. Note that NIR has them separate for a long time, both on nir_variable and on some NIR lowerings. v2: (from Iago feedback) * Use key->num_tex/sampler_used to iterate through the array * Fill up num_samplers_used on v3d, assert that is the same that num_tex_used if possible. v3: (Iago) * Assert num_tex/samplers_used is smaller that tex/sampler array size. v4: Update assert mentioned on v3 to use <= instead of < (detected by CI) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> squash! broadcom/compiler: separate texture/sampler info from v3d_key Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7545>	2020-11-14 15:59:02 +00:00
Arcady Goldmints-Orlov	a1a365e818	broadcom/compiler: Allow spills of temporaries from TMU reads Since spills and fills use the TMU, special care has to be taken to avoid putting one between a TMU setup instruction and the corresponding reads or writes. This change adds logic to move fills up and move spills down to avoid interrupting such sequences. This allows compiling 6 more programs from shader-db. Other stats: total spills in shared programs: 446 -> 446 (0.00%) spills in affected programs: 0 -> 0 helped: 0 HURT: 0 total fills in shared programs: 606 -> 610 (0.66%) fills in affected programs: 38 -> 42 (10.53%) helped: 0 HURT: 2 total instructions in shared programs: 19330 -> 19363 (0.17%) instructions in affected programs: 3299 -> 3332 (1.00%) helped: 0 HURT: 5 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6606>	2020-11-09 20:45:58 +00:00
Juan A. Suarez Romero	1e723745dd	v3d/compiler: extend swapping R/B support to all vertex attributes So far the support for R/B swapping in vertex attributes were for the generic attributes. But there are cases like glSecondaryColorPointer() supporting BGRA formats that require the R/B swapping to be also allowed in the non-generic vertex attributes (in this case, in the COLOR1 attribute). v2: - Don't split line (Iago) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7196>	2020-11-05 12:15:28 +00:00
Arcady Goldmints-Orlov	0b30336906	broadcom/compiler: Handle non-SSA destinations for tex instructions The NIR that is given to the VIR compiler is not in SSA form, and so the v3d*_vir_emit_tex() functions must be able to handle both SSA and register destinations. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7318>	2020-11-05 09:03:46 +00:00
Alejandro Piñeiro	09b2bd1df9	broadcom/compiler: remove v3d_fs_key depth_enabled field. It is not used right now, so keeping it adds some noise/confusion. So far, configuring the Z test is done through the CFG_BITS. See v3dX(emit_state) at v3dx_emit.c for v3d, and pack_cfg_bits at v3dv_pipeline.c for v3dv. There, flags like z_updates_enable and others are filled in. That key field seems to be a leftover from using vc4 as a reference, as that driver defines and uses a field with the same name. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7421>	2020-11-03 10:55:08 +00:00
Iago Toral Quiroga	40788be134	v3d/compiler: fix BGRA vertex attributes for vec2/float size. We don't natively support BGRA format, instead we handle these as RGBA and we lower the loads to swap components R and B. However, the driver emits VPM loads based on the size of the input variables so when we have a vec2 or float BGRA input, it would only emit VPM loads for components 0 and 1, which is not correct since we emit a load of component 2 to swap with component 0. v2: handle GL legacy vertex inputs gracefully. Fixes: dEQP-VK.draw.output_location.array.b8g8r8a8-unorm-highp-output-vec2 dEQP-VK.draw.output_location.array.b8g8r8a8-unorm-mediump-output-vec2 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7271>	2020-10-23 09:19:02 +02:00
Erik Faye-Lund	8ad931808e	v3d: do not report alpha-test as supported This triggers lowering in the state-tracker, which makes things a bit simpler. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7251>	2020-10-21 16:33:43 +00:00
Iago Toral Quiroga	442f48f27b	v3d/compiler: implement load interpolated input intrinsics We will lower GLSL interpolateAt functions to these. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7155>	2020-10-15 02:04:04 +02:00
Iago Toral Quiroga	3ec165bce9	broadcom/compiler: track partially interpolated fragment inputs We will need these to implement GLSL's interpolateAt*() functions where we are required to perform interpolation in the shader at arbitrary offsets. Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7155>	2020-10-15 02:04:04 +02:00
Arcady Goldmints-Orlov	e881290979	broadcom/compiler: use nir io semantics This allows to clean up some code. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6721>	2020-10-14 22:54:58 +00:00
Arcady Goldmints-Orlov	ac5f0ee19c	broadcom/compiler: support varyings with struct types This adds support for using structs as outputs from vertex shaders and inputs to fragment shaders. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6721>	2020-10-14 22:54:58 +00:00
Iago Toral Quiroga	7eb8eb10f6	v3d/compiler: allow to batch spills Some shaders that need to spill hundreds of registers can take very long times to compile as each allocation attempt spills a single register and restarts the allocation process. We can significantly cut down these times if we allow the compiler to spill in batches, which should be possible if we are spilling uniforms, which is in fact the kind of spills that we do first because they have lower cost than TMU spills. Doing this could cause us to slightly over spill in some cases (depending on the chosen batch size) leading to slightly worse performance, so we only enable this behavior after we have started to spill over a certain threshold, at which point we assume that performance won't be good and we want to favor compilation speed instead. v2: - Keep it simple and just try to spill a fixed amount of registers in a batch instead of trying to compute this dynamically based on accumulated spills and current register pressure. (Eric). v3: - Check if the node is valid before doing anything with it. - Drop the environment variable to select batch size and just fix it to 20. With this we can take this CTS test from 35 minutes down to about 3 minutes: dEQP-VK.ssbo.layout.random.all_shared_buffer.5 Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	23c727dd67	v3d/compiler: add a lowering pass for robust buffer access Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	4401dde0e9	broadcom/compiler: rename QUNIFORM_GET_BUFFER_SIZE to QUNIFORM_GET_SSBO_SIZE Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	d93d903a37	v3d/compiler: implement nir_intrinsic_get_ubo_size Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Alejandro Piñeiro	02b9670611	broadcom/compiler: allow GLSL_SAMPLER_DIM_BUF on txs emission Although we don't support texture buffers on the OpenGL driver, we are already doing that for the Vulkan driver. This would be needed for the OpenGL driver in any case. Fixes following tests on v3dv: dEQP-VK.memory.pipeline_barrier.host_write_uniform_texel_buffer.* dEQP-VK.memory.pipeline_barrier.transfer_dst_uniform_texel_buffer.* Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:33 +00:00
Iago Toral Quiroga	644a15e69e	v3dv: implement nir_texop_texture_samples Fixes: dEQP-VK.glsl.texture_functions.query.texturesamples.* Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	1c4c7d95f7	broadcom/compiler: track if the fragment shader forces per-sample MSAA For example, regarding gl_SampleID, the GLSL spec states: "Any static use of this variable in a fragment shader causes the entire shader to be evaluated per-sample." So we need to track if the fragment shader does anything that implicitly enables per-sample shading in the compiler for the driver to auto-enable sample rate shading if needed. v2: - Instead of tracking reads of gl_SampleID, check SYSTEM_BIT_SAMPLE_ID and SYSTEM_BIT_SAMPLE_POS as well as the sample layout qualifier like other drivers are doing to activate this behavior (Eric). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (v1) Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	531ea3596d	broadcom/compiler: implement nir_intrinsic_load_sample_pos This is intended to return the sample location within the pixel. Fixes: dEQP-VK.pipeline.multisample_shader_builtin.sample_position.* Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	14d74c07aa	broadcom/compiler: handle gl_SampleMask writes in fragment shaders We didn't need this until now, since this was included with GLES 3.2, but we need it for Vulkan. Eric had already done the plumbing for it though, we just need to actually emit the mask. Fixes some tests in: dEQP-VK.renderpass.suballocation.multisample_resolve.* Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:32 +00:00
Iago Toral Quiroga	5a2ef59963	v3d/compiler: support swapping R/B channels in vertex attributes. We will need this in Vulkan to support vertex format VK_FORMAT_B8G8R8A8_UNORM. The hardware doesn't allow to swizzle vertex attribute components, so we need to do it in the shader. v2: - Use nir_intrinsic_io_semantics() to retrieve the location instead of looping through the shader input variables (Eric). - Assert that we only have one component (Eric). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (v1) Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Alejandro Piñeiro	b9dd7e30a6	v3d/tex: avoid to ask back for a sampler state if not needed So far we were not asking the driver for the sampler state if we could just use the default P1 values. But even if we need to fill P1 (for example to fill up the output type of the format), if the texture operation doesn't need a sampler, we can let that field as NULL (so default values) and avoid calling back the driver for a sampler. This is not mandatory for OpenGL (as we always have a sampler object), although still a good to have. For Vulkan this is needed, as we don't have a sampler object in that case. v2: reword comment (Eric) Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:31 +00:00
Iago Toral Quiroga	f41857eb48	v3d/compiler: implement nir_intrinsic_load_base_instance Vulkan lowers gl_InstanceIndex to load_base_instance + load_instance_id, so we need to implement loading the base instance in the compiler. The base instance is set by the BASE_VERTEX_BASE_INSTANCE command right before the instanced draw call and it is included in the VPM payload together with the InstanceID and VertexID if this is requested by the shader record. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:29 +00:00
Iago Toral Quiroga	1f41a128e0	v3d/compiler: implement nir_op_fquantize2f16 Reviewd-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:28 +00:00
Alejandro Piñeiro	c8212731e7	v3d/compiler: handle GL/Vulkan differences in uniform handling This also adds a v3d_execution_environment, so compiler could know if it is generating code for OpenGL or Vulkan needs. Reviewed-by: Iago Toral <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:27 +00:00
Alejandro Piñeiro	1c8226c682	v3d/compiler: update uses_vid/uses_iid check In order to take into account the vulkan specific system values SYSTEM_VALUE_INSTANCE_INDEX and SYSTEM_VALUE_VERTEX_ID_ZERO_BASE. Reviewed-by: Iago Toral <itoral@igalia.com> Reviewed-by: Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:26 +00:00
Alejandro Piñeiro	62ca997476	v3d/compiler: num_tex_used on v3d_key We would need on OpenGL to update values for all the textures used. On OpenGL that value can be always took from the context or the nir shader, but there are cases on Vulkan that it is not the case, or would force up to recompute it. Acked-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6766>	2020-10-13 21:21:25 +00:00
Alejandro Piñeiro	8de380d26a	broadcom/compiler: add V3D_DEBUG_RA option To ask to debug a register allocation failure (V3D_DEBUG_REGISTER_ALLOCATION seemed too long to me). When a fallback register allocation algorithm was added, if the register allocation fails, it only dumps the current vir with the register pressure info with the failed fallback. But if we want to debug the problem, we would be interested in both. Additionally, it was strange that we got the full vir dump with the failure even if no debug option was set. Additionally we add shaderdb-like stats for those failures, to make it easier to compare one and the other. v2: keep a small warning message in case both register allocation algorithms fail (Neil) Reviewed-by: Neil Roberts <nroberts@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6999>	2020-10-07 20:21:17 +00:00
Jason Ekstrand	0aa08ae2f6	nir: Split NIR_INTRINSIC_TYPE into separate src/dest indices We're about to introduce conversion ops which are going to want two different types. We may as well just split the one we have rather than end up with three. There are a couple places where this is mildly inconvenient but most of the time I find it to actually be nicer. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Kenneth Graunke	140f53e646	Revert "nir: replace lower_ffma and fuse_ffma with has_ffma" This reverts commit `939ddf3f67`. Intel has a separate pass for fusing FFMAs selectively. We split these flags in commit `1b72c31e1f` and the reasoning still stands. The patch being reverted was just a cleanup, so there should be no issue with reverting it. Acked-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6849>	2020-09-24 13:11:50 -07:00
Marek Olšák	939ddf3f67	nir: replace lower_ffma and fuse_ffma with has_ffma Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Marek Olšák	771aad3027	nir: split lower_ffma into lower_ffma16/32/64 AMD wants different behavior for each bit size Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Jason Ekstrand	9750164c09	nir: Rename get_buffer_size to get_ssbo_size This makes it explicit that this intrinsic is only for SSBOs. For the v3dv driver, we'll be adding a get_ubo_size intrinsic and we want to be able to distinguish between the two. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6812>	2020-09-22 13:34:12 +00:00
Iago Toral Quiroga	3182209673	v3d/compiler: fix V3D double-rounding of .8 fixed-point XY coordinates Pre-V3D 4.3 hardware has a quirk where it expects XY coordinates in .8 fixed-point format, but then it will internally round it to .6 fixed-point, introducing a double rounding. The double rounding can cause very slight differences in triangle rasterization coverage that can actually be noticed by some CTS tests. The correct fix for this as recommended by Broadcom is to convert to .8 fixed-point with ffloor(). Fixes: dEQP-VK.renderpass.suballocation.subpass_dependencies.late_fragment_tests.* Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6677>	2020-09-11 09:51:33 +02:00
Marek Olšák	ac55b1a9a6	nir: get ffma support from NIR options for nir_lower_flrp This also fixes the inverted last parameter of nir_lower_flrp in most drivers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6599>	2020-09-04 17:06:22 +00:00
Alejandro Piñeiro	7059708dcd	broadcom/qpu_instr: wait is not a read or write vpm instruction For several schedule restrictions, we are checking if the instruction is using the vpm. So far it was implemented as being a read or a write of the vpm. But VPM wait (vpmwt) is not a read or a write (it is a wait until all pending writes finishes). This is relevant to implement peripheral accesses restrictions, as for some cases where vpm read\|writes are allowed, vpmwt is not. Fixes: dEQP-VK.binding_model.descriptorset_random.sets8.constant.ubolimitlow.sbolimitlow.sampledimglow.outimgtexlow.noiub.nouab.vert.noia.0 On the sim, as it was raising an assert for wrong peripheral access. v2: simplify v3d_qpu_waits_vpm (Iago) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6498>	2020-08-31 15:02:42 +02:00
Jesse Natalie	d3faac7a15	nir: Add options to nir_lower_compute_system_values to control compute ID base lowering If no options are provided, existing intrinsics are used. If the lowering pass indicates there should be offsets used for global invocation ID or work group ID, then those instructions are lowered to include the offset. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5891>	2020-08-21 22:07:05 +00:00
Jesse Natalie	2e1df6a17f	nir: Move compute system value lowering to a separate pass The actual variable -> intrinsic lowering stays where it is, but ops which convert one intrinsic to be implemented in terms of another have moved. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5891>	2020-08-21 22:07:05 +00:00
Karol Herbst	e5899c1e88	nir: rename nir_op_fne to nir_op_fneu It was always fneu but naming it fne causes confusion from time to time. So lets rename it. Later we also want to add other unordered and fne, this is a smaller preparation for that. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6377>	2020-08-21 17:26:21 +00:00
Jason Ekstrand	1ccd681109	nir: Add an LOD parameter to image_*_size The OpenCL image_width/height/depth functions have variants which can take an LOD parameter. More importantly, LLVM-SPIRV-Translator always generates OpImageQuerySizeLod even if the LOD is guaranteed to be zero. Given that over half the hardware out there has an LOD field for image size queries (based on a rudimentary scan through their NIR -> whatever code), we may as well just add the source to the NIR intrinsic. If this is ever a problem for anyone, the lowering is pretty trivial. I've also added asserts to everyone's drivers that should alert them if they ever see an LOD other than zero. This will never happen with GL or Vulkan so there's no need for panic. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6396>	2020-08-20 20:48:10 +00:00
Arcady Goldmints-Orlov	a104902590	broadcom/compiler: Enable PER_QUAD for UBO and SSBO loads. Helper invocations need to be able to read from UBOs since those values can be used for flow control, but writes from helper invocations need to be dropped. Fixes CTS tests: dEQP-VK.glsl.derivate..uniform_loop. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6356>	2020-08-20 20:14:14 +00:00
Arcady Goldmints-Orlov	c3258f927c	broadcom/compiler: Add a constant folding pass after nir_lower_io The nir_lower_io pass produces a bunch of constant arithmetic, and assumes that constant folding will simplify it away. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6356>	2020-08-20 20:14:14 +00:00
Arcady Goldmints-Orlov	bd87cdad18	broadcom/compiler: support nir_intrinsic_load_sample_id This adds support for the intrinsic as well as the vir_SAMPID instruction that corresponds to it in vir. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6356>	2020-08-20 20:14:14 +00:00
Alejandro Piñeiro	bd38ea77e8	v3d/compiler: add v3dv_prog_data_size helper Main use case is to help to implement Vulkan PipelineCache, as we are serializing/deserializing the prog_data too. Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6078>	2020-08-19 22:50:21 +02:00
Karol Herbst	025bdbac3e	nir: Add goto_if jump instruction Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2401>	2020-08-14 20:35:36 +00:00
Jason Ekstrand	feb32f898c	nir: Add a nir_foreach_uniform_variable helper This one's a bit more complex because it filters off only those variables with mode == nir_var_uniform. As such, it's not exactly a drop-in replacement for nir_foreach_variable(var, &nir->uniforms). Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5966>	2020-07-29 17:38:58 +00:00
Jason Ekstrand	2956d53400	nir: Add nir_foreach_shader_in/out_variable helpers Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5966>	2020-07-29 17:38:57 +00:00
Jason Ekstrand	5746af4446	nir: Take a mode in remove_unused_io_vars Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5966>	2020-07-29 17:38:57 +00:00
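
Commit 790144e65a in the log above describes moving to a container_of() that takes a type as its second parameter, as the Linux kernel's version does. The following is a minimal sketch of that calling convention, not mesa's actual macro: `struct list_node`, `struct batch`, and `batch_from_link()` are hypothetical names chosen for illustration, and the build-time type-checking asserts mentioned in the commit are left out.

```c
#include <assert.h>
#include <stddef.h>

/* Kernel-style container_of(): the second argument is a type, not a
 * sample pointer, so the macro never reads the variable being
 * initialized. Sketch only; no type checking of ptr against member. */
#define container_of(ptr, type, member) \
   ((type *)((char *)(ptr) - offsetof(type, member)))

/* Hypothetical intrusive list link. */
struct list_node {
   struct list_node *next;
};

/* Hypothetical object that embeds the link. */
struct batch {
   int id;
   struct list_node link;
};

/* Recover the enclosing struct batch from a pointer to its link member. */
static struct batch *batch_from_link(struct list_node *node)
{
   return container_of(node, struct batch, link);
}
```

Because the type is passed explicitly, a declaration such as `struct batch *b = container_of(node, struct batch, link);` does not mention `b` on the right-hand side, which is the pattern that avoids the -Wuninitialized warning quoted in the commit message.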


401 commits
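
Commit 3182209673 in the log above describes a double rounding when coordinates are first converted to .8 fixed point and then rounded again to .6 fixed point. The arithmetic can be sketched as follows. This is an illustration under stated assumptions, not the driver's code: all function names are hypothetical, coordinates are assumed non-negative, and the hardware's internal .8 to .6 step is modelled as round-half-up, which the commit message does not specify.

```c
#include <assert.h>

/* Assumed model of the internal .8 -> .6 step: round half up.
 * One .6 unit equals four .8 units. Non-negative inputs only. */
static long fx8_to_fx6(long fx8)
{
   return (fx8 + 2) / 4;
}

/* Convert to .8 fixed point by rounding to nearest (non-negative x). */
static long to_fx8_round(double x)
{
   return (long)(x * 256.0 + 0.5);
}

/* Convert to .8 fixed point by flooring (non-negative x). */
static long to_fx8_floor(double x)
{
   return (long)(x * 256.0);
}

/* Reference result: a single rounding straight to .6 fixed point. */
static long to_fx6_direct(double x)
{
   return (long)(x * 64.0 + 0.5);
}
```

For x = 0.0078, a single rounding to .6 gives 0. Rounding to .8 first gives 2, which the second step rounds up to 1, so the result differs. Flooring to .8 gives 1, which the second step rounds to 0, matching the single rounding.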