fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 22:20:14 +01:00

Author	SHA1	Message	Date
Rhys Perry	f199b7188b	nir/load_store_vectorize: add data as callback args Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Rhys Perry	00c8bec47b	nir: add nir_load_store_vectorize_options Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4202>	2021-01-07 16:34:53 +00:00
Christian Gmeiner	c5a9270109	intel/compiler: use intrinsic builders Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8295>	2021-01-06 14:34:41 +00:00
Yevhenii Kolesnikov	5ad54d498c	intel/fs: don't spill a register, set by undef Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3941 Signed-off-by: Yevhenii Kolesnikov <yevhenii.kolesnikov@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8185>	2020-12-21 21:18:01 +00:00
Jason Ekstrand	a1976e1cb2	intel/fs: Implement nir_jump_halt Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5071>	2020-12-01 16:19:18 -06:00
Jason Ekstrand	6992d2f625	intel/fs: Emit HALT_TARGET in emit_nir_code() Instead of making it a fragment-specific thing based on uses_kill, track whether or not we need one in fs_visitor and emit HALT_TARGET at the end of emit_nir_code() if needed. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5071>	2020-12-01 16:19:14 -06:00
Jason Ekstrand	4a7f0aa2e0	intel/fs: Remove unnecessary HALT_TARGET in opt_redundant_halt() This means the pass has to walk all the instructions but it was doing that in a bunch of cases anyway when it didn't have a HALT_TARGET. However, removing HALT_TARGET frees up the scheduler a bit because HALT_TARGET is considered a scheduling barrier. The shader-db results are kind-of a wash but we're about to add HALT_TARGET unconditionally so we want to be able to get rid of it. Shader-db results on Ice Lake: total instructions in shared programs: 19935623 -> 19935623 (0.00%) instructions in affected programs: 0 -> 0 helped: 0 HURT: 0 total cycles in shared programs: 976758472 -> 976766135 (<.01%) cycles in affected programs: 11097707 -> 11105370 (0.07%) helped: 1750 HURT: 875 helped stats (abs) min: 1 max: 866 x̄: 26.39 x̃: 4 helped stats (rel) min: <.01% max: 39.24% x̄: 1.25% x̃: 0.46% HURT stats (abs) min: 1 max: 1678 x̄: 61.54 x̃: 10 HURT stats (rel) min: <.01% max: 65.69% x̄: 1.86% x̃: 0.42% 95% mean confidence interval for cycles value: -2.48 8.32 95% mean confidence interval for cycles %-change: -0.40% -0.03% Inconclusive result (value mean confidence interval includes 0). LOST: 62 GAINED: 46 All of the lost/gained programs are SIMD32 fragment shaders. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5071>	2020-12-01 16:19:10 -06:00
Jason Ekstrand	f9d549b2bf	intel/fs: Use BRW_OPCODE_HALT for discards We're about to start using it to implement nir_jump_halt which has nothing inherently to do with fragment shaders or discards. May as well name it for the HW instruction it generates. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5071>	2020-12-01 16:19:08 -06:00
Jason Ekstrand	e76e359007	intel/fs: Rename PLACEHOLDER_HALT to HALT_TARGET It's a bit more explicit and will play more nicely with what we're about to do. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5071>	2020-12-01 16:18:50 -06:00
Jordan Justen	071fd55381	intel/compiler: Add GEN125 to enum gen Recommended-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7757>	2020-12-01 19:06:22 +00:00
Jason Ekstrand	73c6899285	intel/fs: DISCARD_JUMP does not have side-effects This accidentally snuck into `75209d5bd1` due to a rebase fail. Fixes: `75209d5bd1` "intel/fs: Add and implement intel-specific..." Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3897 Tested-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7839>	2020-11-30 20:33:48 +00:00
Jason Ekstrand	b70847a389	nir: Use the right argument order for load_scratch_base_ptr Fixes: `c9bcad2573` "nir: add generated intrinsic builders" Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7836>	2020-11-30 12:03:33 -06:00
Rhys Perry	c9bcad2573	nir: add generated intrinsic builders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6587>	2020-11-26 17:50:38 +00:00
Caio Marcelo de Oliveira Filho	ff9ea469f6	intel/disasm: Don't rely on FALLTHROUGHTs to print unsupported SFID The code works but is a bit fragile if we ever add a case that has a less strict requirement (a smaller gen) than the case above. To avoid having to reason about this, refactor code to use a variable to indicate whether the SFID is supported or not. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7742>	2020-11-25 16:46:16 +00:00
Jason Ekstrand	9fa1cdfe7f	intel/rt: Implement push constants as global memory reads They're not really "push" anymore but that's because there is no such thing as push constants in bindless shaders on Intel. They should be fast enough, though. There is some room for debate here as to whether we want to do the pull in NIR or push it into the back-end. The advantage of doing it in the back-end is that it'd be easier to use MOV_INDIRECT for indirect push constant access rather than falling back to a dataport message. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	f7e24e559f	intel/rt: Add support for hit attributes For triangle geometry, the hit attributes are always two floats which contain the barycentric coordinates of the hit. For procedural geometry, they're an arbitrary blob of data passed from the intersection shader to the hit shaders. In our implementation, we stash that data right after the HW RayQuery in the ray stack. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	96fde5518b	intel/rt: Add a helper to create the raygen trampoline shader Unlike graphics and compute pipelines, Vulkan ray-tracing pipelines do not have a single entrypoint. Instead, the raygen shader is specified as a one-element shader binding table in the vkCmdTraceRay call. This means that raygen shaders have to be bindless shaders just like any other ray tracing shader. To launch them, we have a tiny compute shader that acts as a trampoline and sets up the hotzone and uses btd_spawn to fire off the raygen shader. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	303378e1dd	intel/rt: Add lowering for combined intersection/any-hit shaders Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	cb261b03e5	intel/rt: Add lowering for ray-walk intrinsics in any-hit shaders Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	c3ddefa000	intel/rt: Add support for shader buffer record memory Most of the work for this is done for us by spirv_to_nir which gives us a load_global from a memory address based on the shader_record_ptr system values. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	9ba7d459a3	intel/rt: Implement the new ray-tracing system values Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	7ce7c93755	intel/rt: Implement traceRay() This is a little bit more work than executeCallable() because we also have to set up the MemRay data structure which the ray traversal hardware uses to keep its state. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	75209d5bd1	intel/fs: Add and implement intel-specific ray-tracing intrinsics Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	1f6ae809ef	intel/rt: Implement support for shader call payloads Both traceRay() and executeCallable() take a payload parameter which gets passed from the caller to the callee and which the callee can write to pass data back to the caller. We implement these by passing a pointer to the data structure in the callee to the caller as the second QWord on its stack. Coming out of spirv_to_nir, the incoming call payloads get the nir_var_shader_call_data variable mode allowing us to easily identify them. Outgoing call payloads get assigned the nir_var_shader_temp mode and will have been turned into function_temp by nir_lower_global_vars_to_local. All we have to do is crawl the shader looking for references to the nir_var_shader_call_data variable and rewrite those to use the passed in pointer. nir_lower_explicit_io will do the rest for us. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	72354b0e9d	intel/rt: Add a helper to create a trivial return shader These are required for ray-tracing. There are many cases where the ray-tracing hardware may decide to execute some but not all of our shaders. In these cases, it needs a shader to execute at the end which will pop the stack back to the shader which called traceRay(). Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	fad81a3968	intel/rt: Add a pass to lower shader call instructions Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	ca88cd8e5a	intel/rt: Add return instructions at the end of ray-tracing shaders Each callable ray-tracing shader shader stage has to perform a return operation at the end. In the case of raygen shaders, it retires the bindless thread because the raygen shader is always the root of the call tree. In the case of any-hit shaders, the default action is accep the hit. For callable, miss, and closest-hit shaders, it does a return operation. The assumption is that the calling shader has placed a BINDLESS_SHADER_RECORD address for the return in the first QWord of the callee's scratch space. The return operation simply loads this value and calls a btd_spawn intrinsic to jump to it. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	49778a7253	intel/rt: Add support for scratch in ray-tracing shaders In ray-tracing shader stages, we have a real call stack and so we can't use the normal scratch mechanism. Instead, the invocation's stack lives in a memory region of the RT scratch buffer that sits after the HW ray stacks. We handle this by asking nir_lower_io to lower local variables to 64-bit global memory access. Unlike nir_lower_io for 32-bit offset scratch, when 64-bit global access is requested, nir_lower_io generates an address calculation which starts from a load_scratch_base_ptr. We then lower this intrinsic to the appropriate address calculation in brw_nir_lower_rt_intrinsics. When a COMPUTE_WALKER command is sent to the hardware with the BTD Mode bit set to true, the hardware generates a set of stack IDs, one for each invocation. These then get passed along from one shader invocation to the next as we trace the ray. We can use those stack IDs to figure out which stack our invocation needs to access. Because we may not be the first shader in the stack, there's a per-stack offset that gets stored in the "hotzone". Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	2b3f6cdc6c	intel/rt: Add lowering functions for each ray-tracing stage These will eventually contain per-stage lowering for various ray-tracing things. This is separate from brw_nir_lower_rt_intrinsics because, for reasons that will become apparent later, brw_nir_lower_rt_intrinsics has to be run very late in the compile process, right before brw_compile_bs. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	c7660918d7	intel/rt: Add a pass to lower the new ray-tracing intrinsics The new intrinsics we added for doing address calculations are all things we fetch from the RT_DISPATCH_GLOBALS struct. We could emit an RT_DISPATCH_GLOBALS load at every point we want it and trust NIR to CSE it for us but it's easier to use intermediate intrinsics. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	6e50db4eda	intel/rt: Add builder helpers for accessing RT data structures Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:10 +00:00
Jason Ekstrand	1f6e70c85a	intel/fs: Add and implement a load_global_const_block intrinsic Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:09 +00:00
Jason Ekstrand	6d5b57aeb7	intel/rt: Add a brw_rt.h header with #defines for basic RT data structures Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:09 +00:00
Jason Ekstrand	7280b0911d	intel/compiler: Add support for bindless shaders The Intel bindless thread dispatch model is very simple. When a compute shader is to be used for bindless dispatch, it can request a set of stack IDs. These are allocated per-dual-subslice by the hardware and recycled automatically when the stack ID is returned. Passed to the bindless dispatch are a global argument address, a stack ID, and an address of the BINDLESS_SHADER_RECORD to invoke. When the bindless shader is dispatched, it is passed its stack ID as well as the global and local argument pointers. The local argument pointer is the address of the BINDLESS_SHADER_RECORD plus some offset which is specified as part of the BINDLESS_SHADER_RECORD. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7356>	2020-11-25 05:37:09 +00:00
Rob Clark	53f7d539cd	util: Add helgrind support for simple_mtx Annoyingly mtypes.h pulls in simple_mtx, which means we end up needing to sprinkle a lot of idep_mesautil around. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3773 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7644>	2020-11-24 21:03:34 +00:00
Ian Romanick	50fef61fa5	intel/fs: Add support for printing half-float immediate values v2: Remove offensive, extraneous 0 in hex constant. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7650>	2020-11-19 21:23:53 +00:00
Ian Romanick	91f7e262e1	intel/fs: Silence unused parameter warning in filter_simd src/intel/compiler/brw_fs.cpp: In function ‘bool filter_simd(const nir_instr, const void)’: src/intel/compiler/brw_fs.cpp:8870:50: warning: unused parameter ‘_options’ [-Wunused-parameter] 8870 \| filter_simd(const nir_instr instr, const void _options) \| ~~~~~~~~~~~~~^~~~~~~~ Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7650>	2020-11-19 21:23:53 +00:00
Ian Romanick	9f0907a4f5	intel/compiler: Delete redundant MAC declaration It already appears about 20 lines earlier near MACH. Trivial. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7650>	2020-11-19 21:23:53 +00:00
Ian Romanick	6edc2a3d40	intel/compiler: Rotate instructions ROR and ROL cannot have source modifiers I checked the Bspec for both Gen11 and Gen12, and it appears that rotate instructions cannot have source modifiers or saturate modifer. Saturate was already handled. Fixes: `1e92e83856` ("intel/compiler: Emit ROR and ROL instruction") Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7650>	2020-11-19 21:23:53 +00:00
Kenneth Graunke	31290f9806	intel/fs: Fix sampler message headers on Gen11+ when using scratch Icelake's sampler message header introduces a field in m0.3 bit 0 which controls whether the sampler state pointer should be relative to bindless sampler state base address or dynamic state base address. g0.3 bit 0 is part of the per-thread scratch space field. On older hardware, we were able to copy that along because the sampler ignored bits 4:0. Now, however, we need to mask them out. Fixes various textureGatherOffsets piglit tests when forcing the FS to run with 2048 bytes of per-thread scratch space (which is a per-thread scratch space encoding of 1, meaning bit 0 will be set). Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6735>	2020-11-18 23:32:09 +00:00
Kenneth Graunke	97ebb896af	intel/compiler: Do interpolateAtOffset coordinate scaling in NIR In our source languages, interpolateAtOffset() takes a floating point offset in the range [-0.5, +0.5]. However, the hardware takes integer valued offsets in the range [-8, 7], in units of 1/16th of a pixel. So, we need to multiply and clamp the coordinates. We were doing this in the FS backend, but with the advent of IBC, I'd like to avoid doing it twice. This patch instead moves the lowering to NIR so we can reuse it across both backends. v2: Use nir_shader_instructions_pass (suggested by Eric Anholt). Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6193>	2020-11-18 23:26:53 +00:00
Caio Marcelo de Oliveira Filho	b3daf341d4	intel/fs: Add assert on the brw_STAGE_prog_data downcasts Motivation is to detect earlier certain bugs that can occur when missing a check for the stage before using the downcast. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7540>	2020-11-16 12:40:59 -09:00
Kenneth Graunke	2009258796	intel/compiler: Fix passthrough TCS regressions from program rename In commit `eda3e4e055`, Eric added names to various programs. In that patch, he also renamed our passthrough TCS shader from "passthrough" to "passthrough TCS". The passthrough TCS directly supplies the VUE headers rather than doing the whole "patch parameters are in backwards order" reswizzling dance. We failed to detect this and started trying to supply vec4s starting at component 3, leading to a stack smash on an array of 7 sources, not to mention the values were being put in the wrong place. Easy fix: update the code for the new name. Fixes: `eda3e4e055` ("nir/builder: Add a name format arg to nir_builder_init_simple_shader().") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3777 Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7564>	2020-11-11 21:19:40 -08:00
Jason Ekstrand	e9caba6ce5	intel/fs: Fix use of undefined value in fixup_nomask_control_flow Fixes: `a8ac0bd759` "intel/fs/gen12: Workaround unwanted SEND execution..." Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7536>	2020-11-11 17:42:47 +00:00
Eric Anholt	eda3e4e055	nir/builder: Add a name format arg to nir_builder_init_simple_shader(). This cleans up a bunch of gross sprintfs and keeps the caller from needing to remember to ralloc_strdup. I added a couple of '"%s", name ? name : ""' to radv where I didn't fully trace through whether a non-null name was being passed in. I also took the liberty of adding a basic name to a few shaders (pan_blit, unit tests) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7323>	2020-11-11 08:50:29 -08:00
Eric Anholt	5f992802f5	nir/builder: Drop the mem_ctx arg from nir_builder_init_simple_shader(). This looks a lot more simple now! Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7323>	2020-11-11 08:50:29 -08:00
Eric Anholt	ef5bce9253	intel: Drop the last uses of a mem_ctx in nir_builder_init_simple_shader(). These two consumers were the only ones out of the ~65 calls to init_simple_shader, so there's a pretty clear consensus on how to allocate simple shaders. I suspect that actually these would be just fine with b.shader being the mem_ctx, but that would take a bit more rework. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7323>	2020-11-11 08:50:27 -08:00
Eric Anholt	4e9328e3b6	nir_builder: Return a new builder from nir_builder_init_simple_shader(). It's a little inline function, so we can just RAII it for better ergonomics. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7323>	2020-11-11 08:49:49 -08:00
Jason Ekstrand	68092df8d8	intel/nir: Lower 8-bit ops to 16-bit in NIR on Gen11+ Intel hardware supports 8-bit arithmetic but it's tricky and annoying: - Byte operations don't actually execute with a byte type. The execution type for byte operations is actually word. (I don't know if this has implications for the HW implementation. Probably?) - Destinations are required to be strided out to at least the execution type size. This means that B-type operations always have a stride of at least 2. This means wreaks havoc on the back-end in multiple ways. - Thanks to the strided destination, we don't actually save register space by storing things in bytes. We could, in theory, interleave two byte values into a single 2B-strided register but that's both a pain for RA and would lead to piles of false dependencies pre-Gen12 and on Gen12+, we'd need some significant improvements to the SWSB pass. - Also thanks to the strided destination, all byte writes are treated as partial writes by the back-end and we don't know how to copy-prop them. - On Gen11, they added a new hardware restriction that byte types aren't allowed in the 2nd and 3rd sources of instructions. This means that we have to emit B->W conversions all over to resolve things. If we emit said conversions in NIR, instead, there's a chance NIR can get rid of some of them for us. We can get rid of a lot of this pain by just asking NIR to get rid of 8-bit arithmetic for us. It may lead to a few more conversions in some cases but having back-end copy-prop actually work is probably a bigger bonus. There is still a bit we have to handle in the back-end. In particular, basic MOVs and conversions because 8-bit load/store ops still require 8-bit types. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7482>	2020-11-09 18:58:51 +00:00
Jason Ekstrand	b98f0d3d7c	intel/nir: Lower 8-bit scan/reduce ops to 16-bit We can't really support these directly on any platform. May as well let NIR lower them. The NIR lowering is potentially one more instruction for scan/reduce ops thanks to not being able to do the B->W conversion as part of SEL_EXEC. For imax/imin exclusive scan, it's yet another instruction thanks to the extra imax/imin NIR has to insert to deal with the fact that the first live channel will contain the identity value which, when signed, will cast wrong. However, it does let us drop some complexity from our back-end so it's probably worth it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7482>	2020-11-09 18:58:51 +00:00

1 2 3 4 5 ...

1573 commits