fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-21 18:00:13 +01:00

Author	SHA1	Message	Date
Anuj Phogat	926d343acf	intel: Rename files with gen_debug prefix export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" find $SEARCH_PATH -type f -name "gen_debug.[cph]" -exec sh -c 'f="{}"; mv -- "$f" "${f/gen_debug/intel_debug}"' \; grep -E "gen_debug" -rIl $SEARCH_PATH | xargs sed -ie "s/gen_debug\./intel_debug\./g" grep -E "GEN_DEBUG" -rIl $SEARCH_PATH | xargs sed -ie "s/GEN_DEBUG_H/INTEL_DEBUG_H/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10241>	2021-04-20 20:06:33 +00:00
Matt Turner	566dc4d740	intel/eu: Add instruction compaction support on XeHP. This patch includes a number of reworks and fixes squashed in by Nanley Chery, Sagar Ghuge, Jordan Justen and Francisco Jerez. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	a2572a9da4	intel/fs: Add more efficient fragment coordinate calculation. The PIXEL_X/Y opcodes used by the current implementation are broken on XeHP due to the new regioning restrictions of the floating-point pipe. We could have the regioning lowering pass fix it in theory by lowering the conversions into separate MOV instructions, but that would be more costly than this implementation that only needs a pair of pipelined ADDs and a pair of pipelined MOVs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	a0e0dfe174	intel/fs: Introduce lowering pass to implement derivatives in terms of quad swizzles. Unfortunately the funky Align1 regions used by the code generator in order to implement derivatives efficiently aren't available to the floating-point pipeline on XeHP. We need to lower them into a number of pipelined integer shuffle instructions followed by the floating-point difference computation. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Jordan Justen	635ed58e52	intel/compiler: Lower txd for 3D samplers on XeHP. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Jordan Justen	515ee73b4e	intel/fs: End compute shader with message gateway on XeHP. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Jordan Justen	262cb08557	intel/fs: Disable 3-src immediates on XeHP. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> [ Francisco Jerez: Add TODO comment explaining why this is helpful and how we could better fix it. ] Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Jordan Justen	02ce55d2b1	intel/eu: Allow 64-bit registers on XeHP. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	262b647b25	intel/compiler: Lower integer division on XeHP. It has been removed from the hardware. [jordan.l.justen@intel.com: Move to brw_postprocess_nir] v2: Switch to nir_lower_idiv_precise (Rhys). v3: Fix for interface changes of nir_lower_idiv. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Rafael Antognolli	49b2d9f428	intel/fs: Lower dword integer multiplies on XeHP. From the BSpec: "When multiplying DW X DW, resulting dst can only be QW precision. If DW precision is required at output than MUL/MACH macro must be used." So for now simply lower it. We might want to revisit it later. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	3f50dde8b3	intel/eu: Teach EU validator about FP/DP pipeline regioning restrictions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	f3e5cd813a	intel/fs: Handle regioning restrictions of split FP/DP pipelines. The floating-point and double-precision FPU pipelines of XeHP platforms don't support arbitrary regioning modes, corresponding channels of sources and destination are required to be aligned to the same sub-register offset, similar to the restriction FP64 instructions had on CHV/BXT platforms. Most violations of this restriction can be fixed easily by teaching has_dst_aligned_region_restriction() about the change so the regioning lowering pass gets rid of any unsupported regioning. For cases where this is not sufficient (e.g. because a virtual instruction internally uses some regioning mode not supported by the floating-point pipeline) the regioning lowering pass is extended with an additional lower_exec_type() codepath that bit-casts sources and destination to an integer type whenever the execution type is not supported by the instruction. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	0dc16965a9	intel/fs: Fix repclear assembly for XeHP+ regioning restrictions. The regioning mode used here is no longer supported by the floating-point pipeline. We could run the regioning lowering pass in order to fix it with some extra copies, but it's more efficient to change the instruction to use integer types. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	05cce1f97d	intel/fs: Use CHV/BXT implementation of 64-bit MOV_INDIRECT on XeHP+. According to the hardware spec "Vx1 and VxH indirect addressing for Float, Half-Float, Double-Float and Quad-Word data must not be used." Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	d57f3ced6c	intel/fs: Calculate SWSB cross-pipeline synchronization information. In combination with the previous changes we can just check whether an instruction has any potentially unsatisfied dependencies on more than one pipeline, and if so use TGL_PIPE_ALL synchronization with an appropriate RegDist counter, otherwise synchronize with the single pipeline it has a dependency on, if any. Only minor difficulty is caused by the fact that the hardware doesn't have any way to encode pipeline information when a RegDist and an SBID dependency need to be provided simultaneously, in which case the synchronization pipeline is inferred by the hardware. We need to verify that the hardware's inference will give the correct result (which may not be the case if e.g. some data was bit-cast from a different type), and if not emit separate SYNC instructions instead of baking the RegDist dependency into the instruction (Note that SET SBID dependencies must always be baked into the corresponding out-of-order instruction). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	3f063334fc	intel/fs: Represent SWSB in-order dependency addresses as vectors. This extends the current ordered_address instruction counter to a vector with one component per asynchronous ALU pipeline, allowing us to track the last instruction that accessed a register separately for each ALU pipeline of the XeHP EU, making it straightforward to infer the right cross-pipeline synchronization annotations. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> v2: Make unit tests happy (with ubsan as run by GitLab automation). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Jordan Justen	78b643fb7f	Revert "intel/compiler: Silence unused parameter warning in update_inst_scoreboard" This was a placeholder for the XeHP cross-pipeline synchronization code, bring it back. This reverts commit `a80e44902f`. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	d4537770bb	intel/fs: Add helper functions inferring sync and exec pipeline of an instruction. Define two helper functions local to the software scoreboard lowering pass describing the behavior of the hardware and code generator: inferred_sync_pipe() calculates the ALU pipeline the hardware will implicitly synchronize with when a RegDist SWSB annotation is used without providing explicit pipeline synchronization information, inferred_exec_pipe() infers the ALU pipeline that will execute the instruction. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:35 +00:00
Francisco Jerez	12479abded	intel/fs: Implement representation of SWSB cross-pipeline synchronization annotations. The execution units of XeHP platforms have multiple asynchronous ALU pipelines instead of (as far as software is concerned) the single in-order pipeline that handled most ALU instructions except for extended math in the original Xe. It's now the compiler's responsibility to identify cross-pipeline dependencies and insert synchronization annotations whenever necessary, which are encoded as some additional bits of the SWSB instruction field. This commit represents the cross-pipeline synchronization annotations as part of the existing tgl_swsb structure used for codegen. The existing tgl_swsb_*() helpers used by hand-crafted assembly are extended to default to TGL_PIPE_ALL big-hammer synchronization in order to ensure backwards compatibility with the existing assembly. The following commits will extend the software scoreboard lowering pass in order to keep track of cross-pipeline dependencies across IR instructions, and insert more specific pipeline annotations in the SWSB field. The disassembler is also extended here to print out any existing pipeline sync annotations. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10000>	2021-04-16 08:27:34 +00:00
Michel Dänzer	d200f45875	Use explicit break instead of fall-through to break-only case clang generates a warning if there's no explicit break or fall-through annotation. The latter would be kind of silly in this case, and not robust against any future changes turning the fall-through invalid. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10220>	2021-04-15 16:01:22 +00:00
Michel Dänzer	2928c21eb7	Convert most remaining free-form fall-through comments to FALLTHROUGH One exception is src/amd/addrlib/, for which -Wimplicit-fallthrough is explicitly disabled. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10220>	2021-04-15 16:01:22 +00:00
Iván Briano	8328989130	intel, anv: propagate robustness setting to nir_opt_load_store_vectorize Closes #4309 Fixes dEQP-VK-robustness.robustness2..readonly. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10147>	2021-04-13 13:30:09 -07:00
Jason Ekstrand	e6c79329dd	intel: fix querying mip levels on null surfaces on SKL and prior When a surface of type SURFTYPE_NULL is accessed by resinfo, the MIPCount returned is undefined instead of 0. Closes #4309 Fixes dEQP-VK.robustness.robustness2..sampled_image..null_descriptor.* Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10147>	2021-04-13 13:30:09 -07:00
Yevhenii Kharchenko	edd12acbec	intel/compiler: remove unused member 'input_vue_map' v2: Instead of fixing uninitialized member 'fs_visitor::input_vue_map' (as reported by Coverity Scan in defect CID 1474559), remove unused members 'vec4_tcs_visitor::input_vue_map' and 'fs_visitor::input_vue_map'. Also fixed 'debug_enabled' argument skipped in a fs_visitor constructor call from brw_compile_tes(). Signed-off-by: Yevhenii Kharchenko <yevhenii.kharchenko@globallogic.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10040>	2021-04-08 18:20:10 +00:00
Bas Nieuwenhuizen	580f1ac473	nir: Extract shader_info->cs.shared_size out of union. It is valid for all stages, just 0 for most of them. In particular mesh/task shaders might be using it. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10094>	2021-04-08 14:39:28 +00:00
Lionel Landwerlin	49be175a4b	intel/fs: limit OW reads to 8 owords on XeHP+ We can only use 16 OW reads/writes on SLM. v2: Update comment (Curro) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> BSpec: 47652 Fixes: `369eab9420` ("intel/fs: Emit code for Gen12-HP indirect compute data") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10082>	2021-04-08 09:25:38 +00:00
Lionel Landwerlin	33b2daab1a	intel/compiler: lower bit sizes in NIR postprocessing It appears that between preprocess & postprocess some descriptor lowering introduces 8bit types in the shader, so run the lower bit size again to make sure we don't have any unsupported types in our shader. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e06144a818` ("anv: Use 64bit_global_32bit_offset for SSBOs") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4478 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9705>	2021-04-06 23:21:30 +03:00
Anuj Phogat	f96c3b8b63	intel: Rename GEN:BUG:### to Wa_### Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "GEN:BUG:" -rIl $SEARCH_PATH | xargs sed -ie "s/GEN\(:BUG:\)/Wa_/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	69c3578c8b	intel: Rename GEN_{ALL, LT, ..} macros to GFX_{ALL, LT, ..} Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "GEN_" -rIl $SEARCH_PATH | xargs sed -ie "s/GEN_\(ALL\|GE\|GT\|LT\|LE\)\([^[:alnum:]]\)/GFX_\1\2/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	e7e55af4d6	intel: Rename GENx keyword to GFXx Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "GEN[[:digit:]]+" -rIl $SEARCH_PATH | xargs sed -ie "s/GEN\([[:digit:]]\+\)/GFX\1/g" Exclude the changes to modifiers: grep -E "I915_.GFX" -rIl $SEARCH_PATH | xargs sed -ie "s/\(I915_.\)GFX/\1GEN/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	1d296484b4	intel: Rename Genx keyword to Gfxx Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "Gen[[:digit:]]+" -rIl $SEARCH_PATH | xargs sed -ie "s/Gen\([[:digit:]]\+\)/Gfx\1/g" Exclude changes in src/intel/perf/oa-.xml: find src/intel/perf -type f \( -name ".xml" \) | xargs sed -ie "s/Gfx/Gen/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	b75f095bc7	intel: Rename genx keyword to gfxx in source files Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "gen[[:digit:]]+" -rIl $SEARCH_PATH | xargs sed -ie "s/gen\([[:digit:]]\+\)/gfx\1/g" Exclude pack.h and xml changes in this patch: grep -E "gfx[[:digit:]]+_pack\.h" -rIl $SEARCH_PATH | xargs sed -ie "s/gfx\([[:digit:]]\+_pack\.h\)/gen\1/g" grep -E "gfx[[:digit:]]+\.xml" -rIl $SEARCH_PATH | xargs sed -ie "s/gfx\([[:digit:]]\+\.xml\)/gen\1/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	c1f3a778de	intel: Rename GENx prefix in macros to GFXx in source files Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "GEN" -rIl src/intel/genxml | grep -E ".py" | xargs sed -ie "s/GEN\([%{]\)/GFX\1/g" grep -E "[^_]GEN[[:digit:]]+" -rIl $SEARCH_PATH | grep -E ".(\.c|\.h|\.y|\.l)" | xargs sed -ie "s/\([^_]\)GEN\([[:digit:]]\+\)/\1GFX\2/g" Leave out renaming GFX12_CCS_E macros. They fall under renaming pattern like "_GEN[[:digit:]]+": grep -E "GFX12_CCS_E" -rIl $SEARCH_PATH | xargs sed -ie "s/GFX12_CCS_E/GEN12_CCS_E/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	dc28390e3c	intel: Rename genx keyword in filenames to gfxx Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" find $SEARCH_PATH -type f -name "gen[[:digit:]].[cph]" -exec sh -c 'f="{}"; mv -- "$f" "${f/gen/gfx}"' \; grep -E "gen[[:digit:]]+_[[:alnum:]_]\.(c|h|cpp)" -rIl $SEARCH_PATH | xargs sed -ie "s/gen\([[:digit:]]\+_[[:alnum:]_]\.\)\(c\|h\|cpp\)/gfx\1\2/g" grep -E "_gen[[:digit:]]+[[:alnum:]_]\.(c|h|cpp)" -rIl $SEARCH_PATH | xargs sed -ie "s/\(_\)gen\([[:digit:]]\+[[:alnum:]_]\.\)\(c\|h\|cpp\)/\1gfx\2\3/g" grep -E "GEN[[:digit:]]+[[:alnum:]_]_H( |$)" -rIl $SEARCH_PATH | xargs sed -ie "s/GEN\([[:digit:]]\+[[:alnum:]_]*_H\)\( \|$\)/GFX\1\2/g" Exclude the "_pack.h" changes: grep -E "gfx[[:digit:]]+_pack\.h" -rIl $SEARCH_PATH | xargs sed -ie "s/gfx\([[:digit:]]\+_pack\.h\)/gen\1/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	abe9a71a09	intel: Rename gen field in gen_device_info struct to ver Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "info\)(.|->)gen" -rIl $SEARCH_PATH | xargs sed -ie "s/info\()\)\(\.\|->\)gen/info\1\2ver/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Anuj Phogat	99331f6deb	intel: Rename genx10 field in gen_device_info struct to verx10 Commands used to do the changes: export SEARCH_PATH="src/intel src/gallium/drivers/iris src/mesa/drivers/dri/i965" grep -E "info\)(.|->)genx10" -rIl $SEARCH_PATH | xargs sed -ie "s/info\()\)\(\.\|->\)genx10/info\1\2verx10/g" Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9936>	2021-04-02 18:33:07 +00:00
Lionel Landwerlin	200e56f84d	intel/fs: implement another copy propagation restriction We are missing an additional restriction on CHV & upcoming Xe-Hp. v2: Quote BSW PRMs (Curro) Check source is not a scalar (Curro) Fix comment (Marcin) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9929>	2021-04-01 07:48:06 +00:00
Lionel Landwerlin	aa53665fda	intel/fs/copy_prop: check stride constraints with actual final type In some cases we will change the type of the destination register of an instruction. This is the type we should use to verify that we're allowed to do the replacement. Otherwise we can hit restrictions on CHV and upcoming Xe-Hp for instance where the copy propagation transforms this: send(16) (mlen: 2) vgrf10:UD, 0u, 0u, vgrf35:D, null:UD mov(16) vgrf11:UW, vgrf10<2>:UW mov(16) vgrf12:UW, vgrf10+0.2<2>:UW mov(16) vgrf15:HF, |vgrf11|:HF mov(16) vgrf16:HF, |vgrf12|:HF mov(8) vgrf41<2>:UW, vgrf15+0.0:UW group0 mov(8) vgrf42<2>:UW, vgrf15+0.16:UW group8 mov(8) vgrf45<2>:UW, vgrf16+0.0:UW group0 mov(8) vgrf46<2>:UW, vgrf16+0.16:UW group8 into this: send(16) (mlen: 2) vgrf10:UD, 0u, 0u, vgrf35:D, null:UD mov(8) vgrf41<2>:HF, |vgrf10+0.0|<2>:HF group0 mov(8) vgrf42<2>:HF, |vgrf10+1.0|<2>:HF group8 mov(8) vgrf45<2>:HF, |vgrf10+0.2|<2>:HF group0 mov(8) vgrf46<2>:HF, |vgrf10+1.2|<2>:HF group8 Because of the floating point use, stride and offsets should be the same. v2: Fix final destination type selection (Curro) v3: constify (Curro) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9832>	2021-03-29 22:14:45 +00:00
Kenneth Graunke	5ae276f7e0	intel: Fix release build breakage We missed changing one instance of debug_flag to debug_enabled in a release-only ifdef branch. Fixes: `758eb18c6f` ("intel/compiler: Make vec4 generator take debug_enabled as a parameter") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9839>	2021-03-25 11:36:58 -07:00
Caio Marcelo de Oliveira Filho	e93c8ab023	intel/compiler: Use a struct for brw_compile_cs parameters Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Caio Marcelo de Oliveira Filho	05933fb0f7	intel/compiler: Use INTEL_DEBUG=blorp to dump blorp shaders Make INTEL_DEBUG=blorp dump the blorp shaders instead of using the general INTEL_DEBUG=fs,vs, which is now reserved to the actual FS and VS shaders used by the pipeline. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Caio Marcelo de Oliveira Filho	7fb1e58651	intel/compiler: Make visitors take debug_enabled as a parameter The callers already have this value, and we would like to make it follow different rules other than stage that might not be visible to the helper function, so just pass explicitly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Caio Marcelo de Oliveira Filho	758eb18c6f	intel/compiler: Make vec4 generator take debug_enabled as a parameter The callers already have this value, and we would like to make it follow different rules other than stage that might not be visible to the helper function, so just pass explicitly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Caio Marcelo de Oliveira Filho	244d2daa00	intel/compiler: Make brw_postprocess_nir take debug_enabled as a parameter The callers already have this value, and we would like to make it follow different rules other than stage that might not be visible to the helper function, so just pass explicitly. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Caio Marcelo de Oliveira Filho	82d77f0ea8	intel/compiler: Refactor the shader INTEL_DEBUG checks Make the check once in a variable, that can be reused for other parts. Also add `unlikely` to the various conditionals depending on it Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Caio Marcelo de Oliveira Filho	57d664245e	intel/compiler: Use a struct for brw_compile_vs parameters Makes calling code more explicit about what is being set, and allows taking advantage of zero initialization for the ones the call site doesn't care about. Besides moving to the struct, two extra "ergonomic" changes were done: - Add a new shader_time boolean, so shader_time_index is ignored when unused -- this allows taking advantage of the zero initialization of unset fields. - Since we have a struct, provide space for the error_str pointer. Both iris and i965 were using it, and the extra rstrdup in case of failure shouldn't be a burden for the others. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Caio Marcelo de Oliveira Filho	f5e1765f98	intel/compiler: Use a struct for brw_compile_fs parameters Makes calling code more explicit about what is being set, and allows taking advantage of zero initialization for the ones the call site doesn't care about. Besides moving to the struct, two extra "ergonomic" changes were done: - Add a new shader_time boolean, so shader_time_index is ignored when unused -- this allows taking advantage of the zero initialization of unset fields. - Since we have a struct, provide space for the error_str pointer. Both iris and i965 were using it, and the extra rstrdup in case of failure shouldn't be a burden for the others. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Caio Marcelo de Oliveira Filho	84c3d68344	intel/compiler: Make vue_map parameter const for brw_compile_fs Just a documentation hint that the VUE map is not modified. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9779>	2021-03-24 23:18:46 +00:00
Jason Ekstrand	91192696e6	intel/fs: Add support for 16-bit A64 float and integer atomics The messages for those 16-bit operations still use 32-bit sources and destinations, so expand them accordingly when building the payload. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8750>	2021-03-18 00:13:40 +00:00
Lionel Landwerlin	8b6d22109f	intel/fs/vec4: add missing dependency in write-on-write fixed GRFs If we load constant data using pull constant SENDS, and we later load that register with some other data, we can end up in a situation where we don't track the initial fixed register write and therefore end up using uninitialized registers. This tracks write-on-write of fixed GRFs like we do for normal virtual GRFs. v2: Fix post_alloc_reg case (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9667>	2021-03-17 23:25:02 +00:00

1665 commits