fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-21 21:58:10 +02:00

Author	SHA1	Message	Date
Rhys Perry	1070bba19e	android: fix SPIR-V -> NIR build Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Mauro Rossi <issor.oruam@gmail.com> Fixes: `18f9fc919e` ('spirv: add and use a generator id enum') Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7097>	2020-10-12 22:26:05 +00:00
Rhys Perry	037d9fb278	spirv: replace discard with demote for incorrect HLSL->SPIR-V translations Fixes artifacts on decals in Path of Exile. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3610 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7062>	2020-10-12 11:07:38 +00:00
Rhys Perry	18f9fc919e	spirv: add and use a generator id enum Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7062>	2020-10-12 11:07:38 +00:00
Jason Ekstrand	181d5f59b8	nir: Allow more deref modes in phis In particular, OpenCL needs to allow shader_temp and function_temp through because they're 100% real pointers. Fixes piglit CL calls.cl Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7092>	2020-10-11 21:50:23 +00:00
Mauro Rossi	002a23efb4	android: util: Move xxd.py to util Android porting of gen rules as per `22ffc05266` ("util: Move xxd.py to util") Fixes the following building error: ninja: error: 'external/mesa/src/compiler/glsl/xxd.py', needed by 'out/target/product/x86_64/gen/STATIC_LIBRARIES/libmesa_glsl_intermediates/glsl/float64_glsl.h', missing and no known rule to make it Fixes: `22ffc05266` ("util: Move xxd.py to util") Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7087>	2020-10-11 23:22:34 +02:00
Jose Maria Casanova Crespo	e7127b3468	nir/algebraic: optimize iand/ior of (n)eq zero when umax/umin not available Before `8e1b75b330` ("nir/algebraic: optimize iand/ior of (n)eq zero") this optimization didn't need the use of umax/umin. VC4 HW supports only signed integer max/min operations. lower_umin and lower_umax are added to allow enabling previous optimizations behaviour for this cases. Fixes: `8e1b75b330` ("nir/algebraic: optimize iand/ior of (n)eq zero") Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7083>	2020-10-10 13:16:37 +02:00
John Bates	5de56937a3	disk_cache: build option for disabled-by-default On some systems it is problematic to have the shader cache enabled by default. This adds a build option to support the disk cache but keep it disabled unless the environment variable MESA_GLSL_CACHE_DISABLE=false. For example, on Chrome OS, Chrome already has its own shader disk cache implementation so it disables the mesa feature. Tests do not want the shader disk cache enabled because it can cause inconsistent performance results and the default 1GB for the disk cache could lead to problems that require more effort to work around. The Mesa shader disk cache is useful for VMs though, where it is easy to configure the feature with environment variables. With the current version of Mesa, Chrome OS would need to have a system-wide environment variable to disable the disk cache everywhere except where needed. More elegant to just build Mesa with the cache feature disabled by default. Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6967>	2020-10-09 16:52:49 +00:00
Rhys Perry	5f2671bcc5	nir: return progress from nir_lower_io_to_scalar_early Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6891>	2020-10-09 15:47:59 +00:00
Timur Kristóf	f11f4a2a4d	nir: Add ability to count primitives per stream. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Timur Kristóf	aac5adc3c2	nir: Count vertices per stream. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Timur Kristóf	70b94adddb	nir: Add ability to overwrite incomplete GS primitives. After each end_primitive and at the end of the shader before emitting set_vertex_and_primitive_count, we check if the primitive that is being emitted has enough vertices or not, and we adjust the vertex and primitive counters accordingly. As a result, if the backend uses this option, the backend compiler will not have to worry about discarding the unneeded vertices and primitives. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Timur Kristóf	c977c369d3	nir: Add ability to count emitted GS vertices per primitive. Add an option to nir_lower_gs_intrinsics so that it can also track the number of emitted vertices per primitive, not just the total vertex count. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Timur Kristóf	2be99012e9	nir: Add ability to count emitted GS primitives. Add an option to nir_lower_gs_intrinsics which tells it to track the number of emitted primitives, not just vertices. Additionally, also make it per-stream. Also rename the set_vertex_count intrinsic to set_vertex_and_primitive_count. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6964>	2020-10-09 15:26:14 +02:00
Jason Ekstrand	06a5edf247	nir/opt_deref: Fix the vector bitcast optimization It assumes the parent is a vector or scalar so we need to fail if it isn't. Fixes: `9190f82d57` "nir/opt_deref: Add an optimization for bitcasts" Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7064>	2020-10-08 12:22:45 -05:00
Kristian H. Kristensen	826a10255f	st/mesa: Add NV12 lowering to PIPE_FORMAT_R8_G8B8_420_UNORM Some GPUs can sample biplanar formats like NV12 natively, returning the YUV values. Add a lowering type that uses that for sampling and relies on existing colorspace conversions. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6693>	2020-10-08 09:37:14 +00:00
Jason Ekstrand	2fa7c79045	spirv: Move nir_lower_libclc to src/compiler/spirv This puts it in a shared place where everyone can get at it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7034>	2020-10-07 21:52:04 +00:00
Jason Ekstrand	ef453f5439	spirv: Add a shared libclc loader Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7034>	2020-10-07 21:52:04 +00:00
Jesse Natalie	22ffc05266	util: Move xxd.py to util Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7034>	2020-10-07 21:52:04 +00:00
Dylan Baker	3ff513ee5d	glsl/xxd.py: fix imports sys and string are unused, os is needed but not imported fixes: `412472da5c` ("glsl: Add utility to convert text files to C strings") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7034>	2020-10-07 21:52:04 +00:00
Tony Wasserka	0ef2f1d4a0	nir: Fix unaligned pointer access This was observed with the intel vulkan driver when running dEQP-VK.spirv_assembly.instruction.compute.float32.comparison_1.modfstruct with ubsan enabled. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6728>	2020-10-07 19:50:01 +00:00
Tony Wasserka	6a9dc75cc2	nir: Fix undefined behavior due to signed integer multiplication overflows Notably this happened when applying constant folding on the intermediate computations generated from nir_lower_idiv. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6728>	2020-10-07 19:50:01 +00:00
Marek Olšák	3f1b35a2f0	nir: add new helper passes that lower uniforms to literals Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6955>	2020-10-07 17:30:12 +00:00
Marek Olšák	1e7d82c881	nir/algebraic: always lower idiv to shifts if bitops are allowed why would you want anything else The only platform significantly affected by this is Intel where `lower_idiv` is not set today but neither is `lower_bitops`. There it seems to still be a boon over-all. Shader-db results on Ice Lake: total instructions in shared programs: 19719051 -> 19735766 (0.08%) instructions in affected programs: 106992 -> 123707 (15.62%) helped: 0 HURT: 445 HURT stats (abs) min: 3 max: 295 x̄: 37.56 x̃: 44 HURT stats (rel) min: 0.16% max: 33.33% x̄: 19.60% x̃: 19.38% 95% mean confidence interval for instructions value: 33.60 41.53 95% mean confidence interval for instructions %-change: 18.97% 20.23% Instructions are HURT. total loops in shared programs: 5973 -> 5973 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total cycles in shared programs: 489405810 -> 486917482 (-0.51%) cycles in affected programs: 4759097 -> 2270769 (-52.29%) helped: 406 HURT: 34 helped stats (abs) min: 2 max: 64661 x̄: 6291.95 x̃: 3126 helped stats (rel) min: 0.02% max: 79.42% x̄: 43.32% x̃: 55.83% HURT stats (abs) min: 2 max: 29376 x̄: 1947.12 x̃: 30 HURT stats (rel) min: 0.04% max: 23.82% x̄: 4.66% x̃: 1.33% 95% mean confidence interval for cycles value: -6753.06 -4557.52 95% mean confidence interval for cycles %-change: -42.60% -36.63% Cycles are helped. total spills in shared programs: 12481 -> 12482 (<.01%) spills in affected programs: 47 -> 48 (2.13%) helped: 0 HURT: 1 total fills in shared programs: 12816 -> 12819 (0.02%) fills in affected programs: 71 -> 74 (4.23%) helped: 0 HURT: 1 total sends in shared programs: 1010124 -> 1010124 (0.00%) sends in affected programs: 0 -> 0 helped: 0 HURT: 0 LOST: 1 GAINED: 0 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6963>	2020-10-07 10:50:53 -04:00
Samuel Pitoiset	4c54f05915	nir/constant_folding: init nir_const_value to zero To avoid NIR validation failures. Fixes: `9df1ff3678` ("nir/constant_folding: Use the builder") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7035>	2020-10-06 20:27:39 +00:00
Jason Ekstrand	60825a542d	nir/constant_folding: Fold load_deref of nir_var_mem_constant Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6974>	2020-10-06 15:42:03 +00:00
Jason Ekstrand	481b7538ab	nir: Validate constant initializers Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6974>	2020-10-06 15:42:03 +00:00
Jason Ekstrand	1ada83504f	nir/constant_folding: Use nir_shader_instruction_pass This gets rid of so much boilerplate... Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6974>	2020-10-06 15:42:03 +00:00
Jason Ekstrand	9df1ff3678	nir/constant_folding: Use the builder Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6974>	2020-10-06 15:42:03 +00:00
Jason Ekstrand	a0c13c9de9	spirv: Make the clc_shader const Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7016>	2020-10-06 04:26:22 +00:00
Vinson Lee	3b3a3af9c7	glsl: Initialize ast_node member field location.path in constructor. Fix defect reported by Coverity Scan. Uninitialized pointer field (UNINIT_CTOR) uninit_member: Non-static class member field location.path is not initialized in this constructor nor in any functions that it calls. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6905>	2020-10-03 10:45:46 +00:00
Jason Ekstrand	b2e1fc8976	nir: Add a pass to lower vec3s to vec4s LLVM loves to take advantage of the fact that vec3s in OpenCL are 16B aligned and so it can just read/write them as vec4s. This results in a LOT of vec4->vec3 casts on loads and stores. One solution to this problem is to get rid of all vec3 variables. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	9190f82d57	nir/opt_deref: Add an optimization for bitcasts LLVM loves to take advantage of the fact that vec3s in OpenCL are 16B aligned so it can just read/write them as vec4s. This is questionably legal except that it uses a xyz write-mask when it does it. The result is a LOT of vec4->vec3 casts on loads and stores. This optimization detects this case as well as other bit-cast cases and rewrites them to get rid of the cast. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	80e6ac3341	nir/opt_deref: Add an instruction type switch Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	769ede2de4	nir: Add component mask re-interpret helpers These are based on the ones which already existed in the load/store vectorization pass but I made some improvements while moving them. In particular, 1. They're both faster if the bit sizes are equal 2. The check is faster if old_bit_size > new_bit_size 3. The check now fails if it would use more than NIR_MAX_VEC_COMPONENTS Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	57e7c5f05e	nir/opt_load_store_vectorize: Use bit sizes when checking mask compatibility Without this, it was checking bit size compatibility with bit sizes such as 96 which is clearly invalid. No shader-db changes on Ice Lake Fixes: `ce9205c03b` "nir: add a load/store vectorization pass" Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	f6667cb0ce	nir: Add a memcpy optimization pass This pass attempts to optimize three broad categories of memcpy: 1. Self-copies: These we can discard out-of-hand. 2. Vector copies: It doesn't matter what the vector size is or if the source and destination have different vector types, it's still easy enough to emit a load/store pair. 3. Tightly packed copies: In the case where a type is tightly packed (no padding bits), we can replace the memcpy with a copy_deref instruction which the optimizer is far better at handling. This has proven capable of getting rid of many of the memcpy instances in some rather gnarly OpenCL C kernels I've been looking at, even after coming out of LLVM's optimizer. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	e363da3bdd	nir: Handle memcpy in copy_prop_vars and combine_stores Fixes: `b2899f7265` "nir: Add a new memcpy intrinsic" Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	100a5ace63	nir/find_array_copies: Properly discard copies for casts In `9f3c595dfc`, we attempted to handle casts in opt_find_array_copies but missed a critical case. In particular, in the case where we begin finding a copy but then encounter a cast, we need to discard everything which might alias that cast. Fixes: `9f3c595dfc` "nir/find_array_copies: Handle cast derefs" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	98bb74b67d	nir: Fix a misspelling Fixes: `cb95065dd1` "nir: Add lowering from regular ALU conversions..." Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6975>	2020-10-01 20:44:04 -05:00
Timothy Arceri	038fcbcaed	glsl: don't duplicate state vars as uniforms in the NIR linker The linker was adding all state vars as uniforms, doubling the storage size for shaders using only builtin uniforms, which increased CPU overhead for constant buffer uploads. When this code was originally ported from the GLSL IR linker we forgot to exclude builtins because the check was not done in the add_uniform_to_shader class but rather a check was done when passing variables to this class for processing. Fixes: `664e4a610d` ("glsl/nir: Fill in the Parameters in NIR linker") Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Tested-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6958>	2020-10-02 00:57:00 +00:00
Jason Ekstrand	cb95065dd1	nir: Add lowering from regular ALU conversions to the intrinsic Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jesse Natalie	7d97f3dfdc	spirv: Implement vload[a]_half[n] and vstore[a]_half[n][_r] Note, the aligned versions aren't handled specially yet. The float16buffer capability is now at least partially supported after this patch, so move it to be supported when kernels are supported. v2 (Jason Ekstrand): - A few cosmetic cleanups around type/base_type - Rebased on top of the big SPIR-V SSA value rework - Use the new version of the conversion helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	a85afb797e	spirv/opencl: Drop dest_type from handle_v_load_store At that point in the function, we don't know if it's a load or a store so calling it dest_type isn't really helpful. Also, we don't really want the glsl_type; we want the base_type. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	8610af12b6	spirv: Handle all OpenCL conversion ops with full rounding This is done for kernels via the new convert_alu_types intrinsic. For Vulkan and OpenGL, we maintain the old path so that drivers don't have to add that lowering pass. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	8e8458218c	spirv: Add some conversion handling helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	383ecfbc70	nir: Add passes for nir_intrinsic_convert_alu_types This adds primarily two passes: One is a lowering pass which turns these conversion intrinsics into a series of ALU ops. The other is an optimization pass which attempts to simplify the conversion whenever possible in the hopes that we can turn it into a "normal" conversion op which doesn't need special treatment. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	d5cb51e2b9	nir: Add builder helpers for OpenCL type conversions Most of these were originally written by Daniel Stone in the Microsoft ClOn12 branch, reworked by Jesse Natalie, fixed by Boris Brezillon, and possibly touched by others along the way. Unfortunately, none of that is in the commit history thanks to living in the CLOn12 branch. I ported them to mesa master and further reworked things for better cosmetics. In particular, 1. They now live in a builder helper rather than in vtn_alu.c. 2. Instead of looping inside each builder helper, we just trust NIR vector instructions to handle vectors. 3. Lots of re-arranging of the helpers for clarity, better asserting, and better re-use with the upcoming lowering pass. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	588bb6686b	nir: Add a conversion and rounding intrinsic This new intrinsic is capable of handling the full range of conversions from OpenCL including rounding modes and possible saturation. The intention is that we'll emit this intrinsic directly from spirv_to_nir and then lower it to ALU ops later. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	0aa08ae2f6	nir: Split NIR_INTRINSIC_TYPE into separate src/dest indices We're about to introduce conversion ops which are going to want two different types. We may as well just split the one we have rather than end up with three. There are a couple places where this is mildly inconvenient but most of the time I find it to actually be nicer. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Eric Anholt	e3f4655805	nir: Make nir_lower_ubo_vec4() handle non-vec4-aligned loads. It turns out I had missed a case in my enumeration of why everything currently was vec4-aligned. Fixes a simple testcase of loading from a vec3[2] array in freedreno with IR3_SHADER_DEBUG=nouboopt. Initial shader-db results look devastating: total instructions in shared programs: 8019997 -> 12829370 (59.97%) total cat6 in shared programs: 87683 -> 145840 (66.33%) Hopefully this will recover once we introduce the i/o vectorizer, but that was blocked on getting the vec3 case fixed. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00

5486 commits