fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-19 15:48:19 +02:00

Author	SHA1	Message	Date
Jason Ekstrand	76d2772472	anv: Allow all clear colors for texturing on Gen11+ Starting with Gen11, we have two indirect clear colors: An unconverted float/int version which is used for rendering and a converted pixel value version which is used for texturing. Because the one used for texturing is stored as a single pixel of that color, it works no matter what format is being used. Because it's a simple HW indirect and doesn't involve copying surface states around, we can use it in the sampler without having to worry about surface states having out-of-date clear values. The result is that we can now allow any clear color when texturing. This cuts the number of resolves in a RenderDoc trace of Dota2 by 95% on Gen11+ (you read that right) and improves perf by 3.5%. It improves perf in a handful of other workloads by < 1%. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	e63c662c26	anv: Use anv_layout_to_aux_usage for color during render passes Previously, we tried to treat color image layouts as a special case during render passes. This is largely an artifact of history as our initial understanding of Vulkan placed much more emphasis on render passes than our current understanding. The only real practical use for magic layouts in the middle of a render pass, as far as I can tell, is to allow more clear colors to get passed through to input attachments. However, most apps aren't very creative with their clear colors and very few of them (none coming from DXVK) actually use render passes in any interesting way. Therefore, the risk of being able to pass fewer clear colors through to input attachments should be minimal. There are, however, three very big advantages to this change: 1. We are now consistent in our handling of aux usage and layouts between color and depth/stencil. 2. We are now actually following the layout guidelines from the app and aren't nearly as likely to see strange behavior due to us overriding the image layouts manually. 3. It's more obviously correct. While I think our old render pass code was probably correct, it was full of corner cases and it's very possible that it was behaving badly in weird ways. This follows the Vulkan API much more blindly and, as such, is more likely to be correct and behave the same as other implementations. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	30016f6e82	anv: Split color_attachment_compute_aux_usage in two In particular, we split out an anv_can_fast_clear_color_view helper which only cares about fast-clear and not aux_usage itself. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	3fe45a9b6c	anv: Rework depth_stencil_attachment_compute_aux_usage Instead of making it a function that pretends to choose aux usage (which isn't what it does at all), make it a function which returns whether or not we want to do a fast clear. This is far more accurate to the purpose of the function. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	26e6da90ab	anv: Refactor cmd_buffer_setup_attachments This commit just renames some things so that we use names for temporary variables which are more consistent with other places in the code-base. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	36a74835df	anv: Stop allowing non-zero clear colors in input attachments Previously, we bent over backwards to allow non-zero clear colors in input attachments whenever we could. However, very few apps use input attachments and very few use non-zero clear colors. Getting rid of support for non-zero clear colors in input attachments will allow us to treat them identically to textures which should help us simplify things a good bit. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	bf92e96d9c	anv: Disallow fast-clears which require format-reinterpretation In order to actually hit this case you have to be using a very odd color/view combination. The common cases of clear-to-zero and 0/1 clear colors with an sRGB view don't require any re-interpretation. This is probably better than always resolving whenever we have a format mismatch like we are today because that hits the sRGB case every time. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	20e72e435c	intel: Move swizzle_color_value from blorp to ISL Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	814dc66935	anv: Allocate surface states per-subpass Instead of allocating surface states for attachments in BeginRenderPass, we now allocate them in begin_subpass. Also, since we're zeroing things, we can be a bit cleaner about our implementation and just fill out all those passes for which we have allocated surface states. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	a3d185d091	anv: Split command buffer attachment setup in three This commit splits genX(cmd_buffer_setup_attachments)() into three functions: one which sets up cmd_buffer->state.attachments, one which allocates surface states, and one which fills out the surface states. While we're here, we make both functions take the framebuffer (if any) as an argument instead of pulling it from the command buffer so it's more clear what things are inputs to the functions. We also make the render pass and framebuffer parameters const as those are immutable objects. The only functional change here should be that we now vk_zalloc the attachments which should be a bit safer. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	c195d55161	anv: Mark images written in end_subpass This makes a lot more sense than marking them written in begin_subpass since, at that point, we haven't written them yet. This should reduce the chances of accidental extra resolves. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	d5e30872ca	anv: Use ANV_FROM_HANDLE for pInheritanceInfo fields Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	7cbc5fde13	anv: Assert surface states are valid Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	eaa8f043cd	anv: Stop filling out the clear color in compute_aux_usage It's a pointless micro-optimization that just makes compute_aux_usage unnecessarily entangled with setting up surface states. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	5808efdf40	anv: Add TRANSFER_SRC to pass usage not subpass usage The subpass usage flags are supposed to always be one bit and never multiple bits. However, when adding in TRANSFER_SRC usage for resolve attachments we were adding it to the subpass bits and not the render pass bits. This potentially is causing issues where images aren't getting marked written properly. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	513ed7542a	anv: Return an error if allocating attachment memory fails Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4393>	2020-04-28 22:45:39 +00:00
Jason Ekstrand	81ac741f89	anv: Expose CS workgroup sizes based on a maximum of 64 threads Otherwise, we'll hit asserts in brw_compile_cs. Fixes: `cf12faef61` "intel/compiler: Restrict cs_threads to 64" Closes: #2835 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4746>	2020-04-28 14:51:08 +00:00
Jason Ekstrand	86f67952d3	intel/devinfo: Compute the correct L3$ size for Gen12 Fixes: `8125d7960b` "intel/dev: Add preliminary device info for Tigerlake" Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Tested-by: Clayton Craft <clayton.a.craft@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4782>	2020-04-28 14:34:17 +00:00
Jason Ekstrand	80ffbe915f	anv: Add support for HiZ+CCS Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:45 +00:00
Jason Ekstrand	752eefdb3d	intel/isl: Refactor isl_surf_get_ccs_surf This refactor breaks out a new isl_surf_supports_ccs function which does most of the validity checking. The isl_surf_get_ccs_surf function calls this function and then dives right into constructing the CCS aux_surf. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:45 +00:00
Jason Ekstrand	3eb1993625	intel/isl: Delete a misleading comment Untyped messages are only used on Gen9+ for UBOs and SSBOs. They will never be used on anything using an isl_surf. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:45 +00:00
Jason Ekstrand	483a1d5e6c	anv/cmd_buffer: Move anv_image_init_aux_tt higher Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:45 +00:00
Jason Ekstrand	65e541ab16	anv: Simplify a case in layout_to_aux_usage If it's depth, the only possible value of planes[plane].aux_usage is ISL_AUX_USAGE_HIZ at least right now. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:45 +00:00
Jason Ekstrand	5cb6c5d11d	intel/blorp: Allow more HiZ usages in hiz_clear_depth_stencil Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:45 +00:00
Jason Ekstrand	0d91dae7f0	anv: Generalize some aux usage checks For the checks dealing with fast-clear values, we change them to check for the depth aspect because the distinction there really is between color and depth more than between HiZ and CCS. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:45 +00:00
Jason Ekstrand	86ded00c40	anv/blorp: Do less hard-coding of aux usages Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:44 +00:00
Jason Ekstrand	54b525caf0	anv: Rework anv_layout_to_aux_state Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:44 +00:00
Jason Ekstrand	eb0cede586	anv: Be more conservative about image view usage We were ORing together the image and stencil usage rather than actually following the formula in the spec. This can lead to assertions in other parts of the driver if we're not careful. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:44 +00:00
Jason Ekstrand	d2f3576d33	anv: Move vk_image_layout_is_read_only higher While we're at it, we drop some _KHR suffixes Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:44 +00:00
Jason Ekstrand	5de9f4409a	anv: Add a vk_image_layout_to_usage_flags helper Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4100>	2020-04-24 16:17:44 +00:00
Rafael Antognolli	e3ab86c599	anv: Enable HiZ on multi-layer depth buffers. Improves The Witcher 3 fps by 2-10% on ICL (depending on the configs and system). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4661>	2020-04-24 15:14:59 +00:00
Jason Ekstrand	9c009da208	anv: Drop an assert Ever since Vulkan 1.2, this feature has been in core so enabling the extension is no longer required. Fixes: `4ef3f7e3d3` "anv: Enable Vulkan 1.2 support" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4694>	2020-04-24 11:37:03 +00:00
Jason Ekstrand	f4addfdde3	spirv: Use nir_const_value for spec constants When we originally wrote spirv_to_nir we didn't have a good scalar value union to handily use so we rolled our own thing for spec constants. Now that we have nir_const_value, we can use that and simplify a bunch of the spec constant logic. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4675>	2020-04-24 09:23:59 +00:00
Jason Ekstrand	a44e63398b	anv: Properly handle all sizes of specialization constants Closes: #2812 cc: mesa-stable@lists.freedesktop.org Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4675>	2020-04-24 09:23:59 +00:00
Eduardo Lima Mitev	4dc7b76276	anv/radv: Resolving 'GetInstanceProcAddr' should not require a valid instance Since vk_icdGetInstanceProcAddr() is wired through vkGetInstanceProcAddr() in both drivers, we lost the ability for 'GetInstanceProcAddr' to resolve itself prior to having a valid instance. An upcoming spec change will fix that and allow vkGetInstanceProcAddr() to resolve itself passing NULL as instance. See https://gitlab.khronos.org/vulkan/vulkan/issues/2057 for details. This patch implements the change in both radv and anvil. CTS changes have already landed: https://gitlab.khronos.org/Tracker/vk-gl-cts/issues/2278 vulkan-loader changes have also landed: https://gitlab.khronos.org/Tracker/vk-gl-cts/issues/2278 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4273>	2020-04-24 09:09:14 +00:00
Lionel Landwerlin	efdb7fa9a8	anv: force whole EU array to be powered for perf queries Because of functional requirements for Gen11, when perf is enabled we only power half the EU array. This change forces it to enable everything. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4021>	2020-04-23 15:55:59 +00:00
Lionel Landwerlin	a7998371ed	intel/perf: specify sseu configuration when supported Because of functional requirements for Gen11, when perf is enabled we only power half the EU array. This change forces it to enable everything. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4021>	2020-04-23 15:55:59 +00:00
Lionel Landwerlin	8f152ed101	intel/perf: store default sseu configuration This is the powergating configuration of the EU array. The default is everything powered. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com> Reviewed-by: Mark Janes <mark.a.janes@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4021>	2020-04-23 15:55:59 +00:00
Rhys Perry	32d871b48f	nir/algebraic: don't undo lowering of 8/16-bit comparisons to 32-bit Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4387>	2020-04-23 10:57:38 +00:00
Kenneth Graunke	259cae4442	intel/compiler: Don't create 64-bit src1 immediates in opt_peephole_sel 64-bit immediates are only allowed as src0. Long ago, we decided to avoid constructing such illegal situations in the IR, rather than allowing them in the IR but then promoting bogus immediates to GRFs later. So, we need to fix opt_peephole_sel to not put 64-bit immediates as src1 of the new SEL instruction. Fixes: `a4b36cd3dd` ("intel/fs: Coalesce when the src live range is contained in the dst") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2816 Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4692>	2020-04-23 00:53:14 +00:00
Kenneth Graunke	4459a70a6e	intel/compiler: Delete abs/neg handling in fsign code This should have gone away when removing source modifiers. They won't be set any longer, so this is simply dead code. Fixes: `b7c47c4f7c` ("intel/compiler: Drop nir_lower_to_source_mods() and related handling.") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4691>	2020-04-22 17:04:37 -07:00
Kenneth Graunke	220f0e10d8	intel/compiler: Don't copy prop source mods into PICK_HIGH_32BIT VEC4_OPCODE_PICK_HIGH_32BIT performs 32-bit UD access on a 64-bit DF value. abs and negate make sense on DF, but break entirely when trying to access pieces of the value as unsigned integer dwords. Fixes an fsign Piglit test on Ivybridge: tests/spec/arb_gpu_shader_fp64/execution/built-in-functions/vs-sign-neg-abs It had regressed when I removed nir_lower_to_source_modifiers, as that caused us to start generating different code which provoked this bug. Fixes: `b7c47c4f7c` ("intel/compiler: Drop nir_lower_to_source_mods() and related handling.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2817 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4691>	2020-04-22 17:03:18 -07:00
Kenneth Graunke	902c8731f4	intel/compiler: Put back saturate on [iu]add_sat opcodes I deleted one too many inst->saturate = ... lines. This one must stay. Fixes: `b7c47c4f7c` ("intel/compiler: Drop nir_lower_to_source_mods() and related handling.") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4669>	2020-04-22 00:47:40 +00:00
Kenneth Graunke	b7c47c4f7c	intel/compiler: Drop nir_lower_to_source_mods() and related handling. I think we're unanimous in wanting to drop nir_lower_to_source_mods. It's a bit of complexity to handle in the backend, but perhaps more importantly, would be even more complexity to handle in nir_search. And, it turns out that since we made other compiler improvements in the last few years, they no longer appear to buy us anything of value. Summarizing the results from shader-db from this patch: - Icelake (scalar mode) Instruction counts: - 411 helped, 598 hurt (out of 139,470 shaders) - 99.2% of shaders remain unaffected. The average increase in instruction count in hurt programs is 1.78 instructions. - total instructions in shared programs: 17214951 -> 17215206 (<.01%) - instructions in affected programs: 1143879 -> 1144134 (0.02%) Cycles: - 1042 helped, 1357 hurt - total cycles in shared programs: 365613294 -> 365882263 (0.07%) - cycles in affected programs: 138155497 -> 138424466 (0.19%) - Haswell (both scalar and vector modes) Instruction counts: - 73 helped, 1680 hurt (out of 139,470 shaders) - 98.7% of shaders remain unaffected. The average increase in instruction count in hurt programs is 1.9 instructions. - total instructions in shared programs: 14199527 -> 14202262 (0.02%) - instructions in affected programs: 446499 -> 449234 (0.61%) Cycles: - 5253 helped, 5559 hurt - total cycles in shared programs: 359996545 -> 360038731 (0.01%) - cycles in affected programs: 155897127 -> 155939313 (0.03%) Given that ~99% of shader-db remains unaffected, and the affected programs are hurt by about 1-2 instructions - which are all cheap ALU instructions - this is unlikely to be measurable in terms of any real performance impact that would affect users. So, drop them and simplify the backend, and hopefully enable other future simplifications in NIR. 
Reviewed-by: Eric Anholt <eric@anholt.net> [v1] Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4616>	2020-04-21 21:42:21 +00:00
Dylan Baker	8e3696137f	remove final imports.h and imports.c bits This moves the fi_types to a new mesa_private.h and removes the imports.c file. The vast majority of this patch is just removing pound includes of imports.h and fixing up the recursive includes. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3024>	2020-04-21 11:09:04 -07:00
Dylan Baker	f8e4542bad	replace _mesa_logbase2 with util_logbase2 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3024>	2020-04-21 11:09:03 -07:00
Dylan Baker	e533fad182	replace _mesa_is_pow_two with util_is_power_of_two_* Mostly this uses util_is_power_of_two_or_zero, which has the same behavior as _mesa_is_pow_two when the input is zero. In cases where the value is known to be != 0 ahead of time I used the _nonzero variant as it may be faster on some platforms. Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3024>	2020-04-21 11:09:03 -07:00
Abhishek Kumar	f06e4ab319	anv/android: fix assert in anv_import_ahw_memory Commit fixes assert that triggers when running dEQP-VK.api.external.memory.android_hardware_buffer.dedicated.buffer#bind_export_import_bind on a debug build of Mesa. Fixes: `c79a528d` ("anv/android: support import/export of AHardwareBuffer objects") Signed-off-by: Abhishek Kumar <abhishek4.kumar@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4655>	2020-04-21 11:50:15 +00:00
Jason Ekstrand	7c43b8ce1b	nir: Delete the fnoise opcodes As of the previous commit, they are never used. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Eric Anholt <eric@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4624>	2020-04-21 06:16:13 +00:00
Jason Ekstrand	a4b36cd3dd	intel/fs: Coalesce when the src live range is contained in the dst Consider the following case: // g119-123 are written somewhere above mul.sat(16) g67<1>F g6.4<0,1,0>F g125<8,8,1>F mul.sat(16) g69<1>F g6.5<0,1,0>F g125<8,8,1>F mul.sat(16) g71<1>F g6.6<0,1,0>F g125<8,8,1>F mov(16) g119<1>F g67<8,8,1>F mov(16) g121<1>F g69<8,8,1>F mov(16) g123<1>F g71<8,8,1>F We should be able to coalesce it into mul.sat(16) g119<1>F g6.4<0,1,0>F g125<8,8,1>F mul.sat(16) g121<1>F g6.5<0,1,0>F g125<8,8,1>F mul.sat(16) g123<1>F g6.6<0,1,0>F g125<8,8,1>F What's stopping us is an overly conservative check for writes to the two registers being coalesced. The check walks over the intersection of their live ranges and checks for no writes to either one. However, because the register which starts the live range (the mul.sat in this case) is inside that intersection, we flag it as a write in the intersection and don't coalesce. However, this case is safe because the destination register of the copy is never read after the source is written. Shader-db changes on ICL: total instructions in shared programs: 16043613 -> 16042610 (<.01%) instructions in affected programs: 43036 -> 42033 (-2.33%) helped: 226 HURT: 0 helped stats (abs) min: 1 max: 30 x̄: 4.44 x̃: 4 helped stats (rel) min: 0.09% max: 26.67% x̄: 4.89% x̃: 3.43% 95% mean confidence interval for instructions value: -4.86 -4.02 95% mean confidence interval for instructions %-change: -5.57% -4.22% Instructions are helped. 
total cycles in shared programs: 334766372 -> 334710124 (-0.02%) cycles in affected programs: 617548 -> 561300 (-9.11%) helped: 214 HURT: 2 helped stats (abs) min: 15 max: 1512 x̄: 263.21 x̃: 212 helped stats (rel) min: 0.30% max: 75.36% x̄: 25.30% x̃: 21.58% HURT stats (abs) min: 40 max: 40 x̄: 40.00 x̃: 40 HURT stats (rel) min: 0.15% max: 0.15% x̄: 0.15% x̃: 0.15% 95% mean confidence interval for cycles value: -277.91 -242.90 95% mean confidence interval for cycles %-change: -27.58% -22.55% Cycles are helped. No spill/fill changes or gained/lost Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4627>	2020-04-21 01:00:24 +00:00
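
Note on commit e533fad182 above: its message distinguishes a power-of-two test that accepts zero from a variant meant for inputs already known to be nonzero. The following is a minimal sketch of that distinction in plain C. The function names are hypothetical stand-ins, not Mesa's util_is_power_of_two_* helpers, and the choice to return false for zero in the nonzero variant is an assumption of this sketch, not something the commit message states.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for an "or zero" power-of-two test.
 * Clearing the lowest set bit with v & (v - 1) leaves 0 exactly when
 * v has at most one bit set, so 0 is accepted along with 1, 2, 4, ... */
static bool sketch_is_pow2_or_zero(uint32_t v)
{
    return (v & (v - 1)) == 0;
}

/* Hypothetical stand-in for a "nonzero" variant. This sketch rejects 0
 * explicitly (an assumption); a caller that already knows v != 0 gets
 * the same answer from either function. */
static bool sketch_is_pow2_nonzero(uint32_t v)
{
    return v != 0 && (v & (v - 1)) == 0;
}
```

The two sketches disagree only for an input of zero, which is why the commit message can describe the "or zero" form as the behavior-preserving replacement and reserve the other form for call sites where the value is known to be nonzero.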

5456 commits