fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-23 22:00:13 +01:00

Author	SHA1	Message	Date
Marek Olšák	939ddf3f67	nir: replace lower_ffma and fuse_ffma with has_ffma Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Marek Olšák	771aad3027	nir: split lower_ffma into lower_ffma16/32/64 AMD wants different behavior for each bit size Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6756>	2020-09-24 12:29:11 +00:00
Jason Ekstrand	9750164c09	nir: Rename get_buffer_size to get_ssbo_size This makes it explicit that this intrinsic is only for SSBOs. For the v3dv driver, we'll be adding a get_ubo_size intrinsic and we want to be able to distinguish between the two. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6812>	2020-09-22 13:34:12 +00:00
Lionel Landwerlin	cc3bf00cc2	intel/compiler: fixup Gen12 workaround for array sizes We didn't handle the case of NULL images/textures for which we should return 0. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `397ff2976b` ("intel: Implement Gen12 workaround for array textures of size 1") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3522 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6729>	2020-09-21 21:20:09 +00:00
Jason Ekstrand	f63ffc18e7	intel/fs/swsb: SCHEDULING_FENCE only emits SYNC_NOP It's not really unordered in the sense that it can still stall on ordered things and we don't need a SYNC_NOP for that because it is a SYNC_NOP. However, it also doesn't count when computing instruction distances. Fixes: `18e72ee210` "intel/fs: Add FS_OPCODE_SCHEDULING_FENCE" Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6781>	2020-09-20 14:43:40 +00:00
Gert Wollny	80cde3ad55	intel/compiler: Set lower_uniform_to_ubo compiler flag Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6316>	2020-09-16 10:07:42 +00:00
Marcin Ślusarz	18eb853ac8	intel/compiler: quiet Coverity warnings Coverity complains about possible out-of-bounds write & read, because it thinks that "loc + i" can be bigger than sizes of the 2 used arrays. It's not obvious from the code it cannot happen, so add asserts here. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Marcin Ślusarz	5ea0b6a9c6	intel/compiler: initialize remaining fields of various classes These variables seem to be initialized before being used, so this patch is not fixing any bug, but leaving them unitialized may become a bug after some refactoring. These classes were affected: fs_reg_alloc, fs_visitor, fs_generator, instruction_scheduler. Found by Coverity. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Marcin Ślusarz	40b964dc8f	intel/compiler: remove unused fs_validator::param_size Found by Coverity as unitialized variable. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6667>	2020-09-10 12:16:58 +00:00
Jason Ekstrand	3bd7c3c9db	intel/nir: Call validate_ssa_dominance at both ends of the NIR compile This invokes it before we go into the optimization/lowering pass and then right before we go out of SSA. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5288>	2020-09-08 19:44:01 +00:00
Marcin Ślusarz	64b0b7c274	intel/compiler: fix typo in a comment Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marcin Ślusarz	95ce619680	intel/compiler: print dispatch width when shader fails to compile Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marcin Ślusarz	e5f735a986	intel/compiler: move extern C functions out of namespace brw brw_compile_gs and brw_compile_tcs are extern C functions, but are defined inside of brw namespace, which somehow works but confuses Eclipse CDT's code analysis. Move these functions out of brw namespace and fix references to objects from brw namespace. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marcin Ślusarz	d4c6e3f196	intel/compiler: use the same name for nir shaders in brw_compile_* functions Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marcin Ślusarz	0dda209406	intel/compiler: match brw_compile_* declarations with their definitions Current state confuses Eclipse CDT's code analysis. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6602>	2020-09-04 17:38:25 +00:00
Marek Olšák	ac55b1a9a6	nir: get ffma support from NIR options for nir_lower_flrp This also fixes the inverted last parameter of nir_lower_flrp in most drivers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6599>	2020-09-04 17:06:22 +00:00
Marcin Ślusarz	663c4d5377	intel/fs: add hint how to get more info when shader validation fails Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6559>	2020-09-04 12:09:22 +00:00
Jason Ekstrand	a7a0315d7f	intel/nir: Stop using nir_lower_vars_to_scratch Instead, we do a limited indirect deref lowering and then use nir_lower_vars_to_explicit_types and nir_lower_explicit_io to lower it as if it were SSBO or global memory access. Among other things, this should enable pointer arithmetic on local variables. Fun! The only shader-db change from this change on ICL was a few tiny cycle count changes in 7 Aztec Ruins compute shaders. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5909>	2020-09-03 14:26:49 +00:00
Jason Ekstrand	38a83a3048	nir/lower_indirect_derefs: Add a threshold Instead of always lowering everything, we add a threshold such that if the total indirected array size (AoA size) is above that threshold, it won't lower. It's assumed that the driver will sort things out somehow by, for instance, lowering to scratch. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5909>	2020-09-03 14:26:49 +00:00
Jason Ekstrand	c897cd0278	intel/compiler: Handle all indirect lowering choices in brw_nir.c Since everything flows through NIR and we're doing all of our indirect deref lowering there now, there's no reason to keep making those decisions in brw_compiler and stuffing them in the GLSL compiler structs. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5909>	2020-09-03 14:26:49 +00:00
Jason Ekstrand	fe18a0fd45	intel/nir: Lower load_num_work_groups to 32-bit if needed For OpenCL-style kernels, this builtin is 64-bit. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6570>	2020-09-02 20:38:22 +00:00
Jason Ekstrand	5799da47c7	intel/fs: Use a single untyped surface read for load_num_work_groups There's no good reason to split this into three. Sure, CS indirects are only guaranteed by the spec to be DWORD aligned, but that's all untyped surface reads require anyway. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6570>	2020-09-02 20:38:22 +00:00
Jason Ekstrand	8e8701b43a	intel/fs: Don't copy-propagate stride=0 sources into ddx/ddy This can come up if, for instance, the shader does a derivative of a uniform or flat input. Ideally, NIR would use divergence analysis to get rid of the derivative in this case but it doesn't right now. This fixes a crash in F1 2017. Cc: mesa-stable@lists.freedesktop.org Reported-by: Marcin Ślusarz <marcin.slusarz@intel.com> Tested-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6564>	2020-09-02 20:31:32 +00:00
Jason Ekstrand	91becd84ae	intel/fs: Add support for a new load_reloc_const intrinsic Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Jason Ekstrand	8d8a3815ef	intel/eu: Add a mechanism for emitting relocatable constant MOVs Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Jason Ekstrand	272ab2823d	intel/eu: Include brw_compiler.h in brw_eu.h Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Jason Ekstrand	90b6745bc8	intel/fs,vec4: Stuff the constant data from NIR in the end of the program Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Jason Ekstrand	91348d125d	intel/eu: Add some new helpers Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Jason Ekstrand	54ba0daa28	intel/compiler: Get rid of the global compaction table pointers With discrete GPUs, it's going to be possible to have GPUs from two different hardware generations in the machine at the same time. Global singletons like this aren't going to fly. Have a struct containing the pointers which gets initialized once per shader disassemble instead. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6244>	2020-09-02 19:48:44 +00:00
Matt Turner	e4dadb545f	intel/tools: Disassemble WAIT's argument as a destination WAIT takes a notification register as a destination and a src0 argument. Since the same notification register is specified in both fields, we treat it as a special case and disassemble it only once. If we disassemble it as if it is a source register, its scalar region will be printed as <0,1,0>. This causes difficulties round-tripping through the assembler <-> disassembler because that is not an acceptable destination region. If we instead disassemble the destination, we instead get a <1> region which is an acceptable and equivalent region for source and destination. The test .asm files are regenerated by round-tripping them through the assembler/disassembler. Note that the <0> region in the tests was a harmless mistake: the compiler translated it to a <0,1,0> source region and a <1> destination region, since <0> isn't valid. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6543>	2020-09-02 17:18:18 +00:00
Marcin Ślusarz	ed9ac3d60c	intel/fs,vec4: remove unused assignments Reported by Coverity. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6126>	2020-09-02 15:08:01 +00:00
Marcin Ślusarz	8e8356e3dc	intel/compiler: mark debug constant as const Should quiet Coverity's "'Constant' variable guards dead code". Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6126>	2020-09-02 15:08:01 +00:00
Marcin Ślusarz	c7a9dc76dc	intel/compiler/test: use TEST_DEBUG env var consistently Other tests use the same environment variable to decide whether they should print debugging information. Will quiet Coverity's "'Constant' variable guards dead code". Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6126>	2020-09-02 15:08:01 +00:00
Danylo Piliaiev	87fa645b94	intel/compiler: Fix pointer arithmetic when reading shader assembly start_offset is a byte offset. Fixes: `04a9951580` Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6557>	2020-09-02 13:23:14 +00:00
Danylo Piliaiev	bc4a127d6e	intel/disasm: Label support in shader disassembly for UIP/JIP Shader instructions which use UIP/JIP now get formatted with a label in addition with immediate value, labels have "LABEL%d" format. v2: - Consider brw_jump_scale when calculating label's offset From: "Lonnberg, Toni" <toni.lonnberg@intel.com> Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4245>	2020-09-02 10:33:29 +00:00
Danylo Piliaiev	6cbd4764cd	intel/disasm: brw_label and support functions Pre-work for shader disassembly label support. Introduction of the structures and functions used by the shader disassembly jump target labeling. From: "Lonnberg, Toni" <toni.lonnberg@intel.com> Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4245>	2020-09-02 10:33:29 +00:00
Danylo Piliaiev	afa39d07e4	intel/disasm: Change visibility of has_uip and has_jip Pre-work for shader disassembly label support. From: "Lonnberg, Toni" <toni.lonnberg@intel.com> Signed-off-by: Danylo Piliaiev <danylo.piliaiev@globallogic.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4245>	2020-09-02 10:33:29 +00:00
Jason Ekstrand	ff2f44d865	intel/fs: Implement nir_intrinsic_load_global_constant Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6379>	2020-09-01 20:50:04 +00:00
Jason Ekstrand	cccb497d3c	intel/fs: Fix MOV_INDIRECT and BROADCAST of Q types on Gen11+ The immediate case is pretty uncommon to see but it can happen, in theory. BROADCAST is typically used to uniformize values and those are usually 32-bit. However, it does come up in some subgroup ops. Fixes: `49c21802cb` "intel/compiler: Split has_64bit_types into float/int" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6211>	2020-09-01 13:25:20 -05:00
Karol Herbst	70cbddc4a7	nir: use enum operator helper for nir_variable_mode and nir_metadata those are used quite a bit Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6520>	2020-09-01 17:45:08 +00:00
Jason Ekstrand	4d18e71fea	nir: Rename num_shared to shared_size This one is always a size in bytes. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6524>	2020-09-01 17:30:51 +00:00
Jason Ekstrand	e8b3bc1d55	intel/nir: Lower things with > 4 components in lower_mem_access_bit_sizes Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6502>	2020-08-31 17:04:40 +00:00
Jason Ekstrand	55ae704513	intel/fs: Add support for vec8 and vec16 ops Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6502>	2020-08-31 17:04:40 +00:00
Jason Ekstrand	b7db9ee320	intel/nir: Clean up lower_alpha_to_coverage a bit Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6233>	2020-08-29 16:41:05 +00:00
Jason Ekstrand	b6fdb1405e	intel/nir: Rewrite the guts of lower_alpha_to_coverage I have no idea how this pass ever worked. I guess it worked ok on the one or two piglit tests but the whole thing seemed very fragile. It makes a number of undocumented and unasserted assumptions and they aren't always valid. This rewrite makes a number of changes: 1. It now properly handles the case where the gl_SampleMask write comes before the gl_FragColor or gl_FragData[0] write. 2. It should early-exit faster because it now looks at bits in shader_info::outputs_written instead of looking for variables. 3. Instead of the fragile variable lookup where we try to look the variable up by both location and driver_location and match, we just use the driver_location calculations used by brw_fs_nir. 4. It asserts that the index parameter to store_output is a constant instead of silently failing if it isn't. 5. We now actually assert the implicit assumption that the two writes are in the same block. We go even further and assert that they are in the last block in the shader. 6. In the case where 3 or fewer components of the output are written, we explicitly choose to leave the sample mask alone. Fixes: `7ecfbd4f6d` "nir: Add alpha_to_coverage lowering pass" Closes: #3166 Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6233>	2020-08-29 16:41:05 +00:00
Jason Ekstrand	72dc06e07e	intel/nir: Pass the nir_builder by reference in lower_alpha_to_coverage I'm honestly not sure how passing a builder by-value ever worked. I guess the struct is mostly copyable. In any case, that's the wrong way to use it and it's causing issues. Fixes: `7ecfbd4f6d` "nir: Add alpha_to_coverage lowering pass" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6233>	2020-08-29 16:41:05 +00:00
Jason Ekstrand	c84e2784eb	intel/nir: Allow splitting a single load into up to 32 loads Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6405>	2020-08-21 22:49:54 +00:00
Jason Ekstrand	febe762246	intel/fs: Fix an assert in load_scratch Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6405>	2020-08-21 22:49:54 +00:00
Jesse Natalie	d3faac7a15	nir: Add options to nir_lower_compute_system_values to control compute ID base lowering If no options are provided, existing intrinsics are used. If the lowering pass indicates there should be offsets used for global invocation ID or work group ID, then those instructions are lowered to include the offset. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5891>	2020-08-21 22:07:05 +00:00
Jesse Natalie	2e1df6a17f	nir: Move compute system value lowering to a separate pass The actual variable -> intrinsic lowering stays where it is, but ops which convert one intrinsic to be implemented in terms of another have moved. Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5891>	2020-08-21 22:07:05 +00:00

... 20 21 22 23 24 ...

2524 commits