fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-24 11:00:11 +01:00

Author	SHA1	Message	Date
Dylan Baker	8396043f30	Replace uses of _mesa_bitcount with util_bitcount and _mesa_bitcount_64 with util_bitcount_64. This fixes a build problem in nir for platforms that don't have popcount or popcountll, such as 32bit msvc. v2: - Fix additional uses of _mesa_bitcount added after this was originally written Acked-by: Eric Engestrom <eric.engestrom@intel.com> (v1) Acked-by: Eric Anholt <eric@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2018-09-07 10:21:26 -07:00
Jason Ekstrand	152fdeddbb	nir/format_convert: Rename nir_format_bitcast_uint_vec We have a name for that, it's called a uvec. This just makes the function name a bit shorter. While we're here, we also add an assert for one of the assumptions this function makes. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-08-29 14:04:02 -05:00
Kenneth Graunke	de57926dc9	blorp: Properly handle Z24X8 blits. One of the reasons we didn't notice that R24_UNORM_X8_TYPELESS destinations were broken was that an earlier layer was swapping it out for B8G8R8A8_UNORM. That made Z24X8 -> Z24X8 blits work. However, R32_FLOAT -> R24_UNORM_X8_TYPELESS was still totally broken. The old code only considered one format at a time, without thinking that format conversion may need to occur. This patch moves the translation out to a place where it can consider both formats. If both are Z24X8, we continue using B8G8R8A8_UNORM to avoid having to do shader math workarounds. If we have a Z24X8 destination, but a non-matching source, we use our shader hacks to actually render to it properly. Fixes: `804856fa57` (intel/blorp: Handle more exotic destination formats) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-11 12:34:01 -07:00
Kenneth Graunke	8a29086285	blorp: Don't try to use R32_UNORM for R24_UNORM_X8_TYPELESS rendering. The hardware doesn't support rendering to R24_UNORM_X8_TYPELESS, so Jason decided to fake it with a bit of shader math and R32_UNORM RTs. The only problem is that R32_UNORM isn't renderable either...so we've just traded one bad format for another. This patch makes us use R32_UINT instead. Fixes: `804856fa57` (intel/blorp: Handle more exotic destination formats) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-08-11 12:33:27 -07:00
Jason Ekstrand	a9f7bcfdf9	intel: Switch the order of the 2x MSAA sample positions The Vulkan 1.1.82 spec flipped the order to better match D3D. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com>	2018-08-11 10:58:12 -05:00
Jason Ekstrand	d0ee0a0a5d	intel/blorp: Fix blits to R8G8B8_UNORM_SRGB sRGB harder The first fix attempt contained a nasty typo which somehow didn't get caught in review. It also didn't work as intended because the sRGB conversion was happening but then throwing away all but the red channel because it dind't know it was RGB. Really, it's my fault for trying to fix a bug without first writing tests. I've now written tests and they pass with this change. :) Fixes: `11712b9ca1` "intel/blorp: Fix blits to R8G8B8_UNORM_SRGB" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-07-23 00:36:39 -07:00
Jason Ekstrand	aaa6fac8f6	intel/blorp: Take an explicit filter parameter in blorp_blit This lets us move the glBlitFramebuffer nonsense into the GL driver and make the usage of BLORP mutch more explicit and obvious as to what it's doing. Reviewed-by: Chad Versace <chadversary@chromium.org>	2018-07-18 09:47:28 -07:00
Jason Ekstrand	9fbe2a2007	intel/blorp: Add a blorp_filter enum for use in blorp_blit At the moment, this is entirely internal but we'll expose it to clients of the BLORP API in the next commit. Reviewed-by: Chad Versace <chadversary@chromium.org>	2018-07-18 09:47:28 -07:00
Jason Ekstrand	daa78f30b6	intel/blorp: Handle 3-component formats in clears This fixes a nasty hang in Batman: Arkham City which apparently calls vkCmdClearColorImage on a linear RGB image. cc: mesa-stable@lists.freedesktop.org Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-07-13 20:57:46 -07:00
Jason Ekstrand	11712b9ca1	intel/blorp: Fix blits to R8G8B8_UNORM_SRGB In this case, the surface faking will give us a R8_UNORM surface and we need to do an sRGB conversion in the shader. Found by inspection. cc: mesa-stable@lists.freedesktop.org Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com>	2018-07-13 20:57:46 -07:00
Jason Ekstrand	3891c1906f	intel/blorp: Stop setting tex->texture/sampler nir_tex_instr_create uses rzalloc so it's already NULL Acked-by: Rob Clark <robdclark@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Acked-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-06-22 20:54:00 -07:00
Jason Ekstrand	ae514ca695	intel/blorp: Support blits and clears on surfaces with offsets For certain EGLImage cases, we represent a single slice or LOD of an image with a byte offset to a tile and X/Y intratile offsets to the given slice. Most of i965 is fine with this but it breaks blorp. This is a terrible way to represent slices of a surface in EGL and we should stop some day but that's a very scary and thorny path. This gets blorp to start working with those surfaces and fixes some dEQP EGL test bugs. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=106629 Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2018-05-25 14:01:44 -07:00
Jason Ekstrand	18f8200a99	intel/blorp: Use linear formats for CCS_E clear colors in copies It's clear that the original code meant to do this and there is even a 10-line comment explaining why. Originally, we had a simple function for packing the clear colors which was unaware of sRGB. However, in `a6b66a7b26`, when we started using ISL to do the packing, the wrong format was used. Fixes: `a6b66a7b26` "intel/blorp: Use ISL instead of bitcast_color..." Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-14 10:41:26 -07:00
Jason Ekstrand	ccb44b8a94	intel/blorp: Allow CCS copies of 1010102 formats Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	1978de66f7	intel/blorp: Add support for more format bitcasting nir_format_bitcast_uint_vec_unmasked can only be used to cast between formats with uniform channel sizes. In particular, it cannot handle 10_10_10_2 formats. By making use of the NIR helper for uint vector casts, we should now be able to bitcast between any two uint formats so long as their channels are in RGBA order (possibly with channels missing). In order to do this we need to rework the key a bit to pass the actual formats instead of just the number of bits in each. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	7998fe268e	intel/blorp: Use nir_format_bitcast_uint_vec_unmasked Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	a6b66a7b26	intel/blorp: Use ISL instead of bitcast_color_value_to_uint Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	8ce31c9cc5	intel/blorp: Support the RGB workaround on more formats Previously we only supported UINT formats because that's what blorp_copy required. If we want to use it in blorp_blit, however, we need to support everything. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	4e26e3dea9	intel/blorp: Silently convert RGBX destination formats to RGBA Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	804856fa57	intel/blorp: Handle more exotic destination formats This commit adds support for the following formats as destination formats even though the hardware does not support rendering to them: - ISL_FORMAT_R24_UNORM_X8_TYPELESS - ISL_FORMAT_A4B4G4R4_UNORM - ISL_FORMAT_L8_UNORM_SRGB - ISL_FORMAT_R9G9B9E5_SHAREDEXP This is done by using a different format and emitting shader code to fake it the rest of the way. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	9e492bb92e	intel/blorp: Include nir_format_convert.h in blorp_blit.c nir_mask_shift_or is now defined in nir_format_convert.h so we can delete the copy in blorp_blit.c. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	906c32ce87	intel/blorp: Add swizzle support for all hardware This commit makes blorp capable of swizzling anything even on hardware that doesn't support texture swizzle. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Jason Ekstrand	293b8de161	blorp: Handle the RGB workaround more like other workarounds The previous version was sort-of strapped on in that it just adjusted the blit rectangle and trusted in the fact that we would use texelFetch and round to the nearest integer to ensure that the component positions matched. This new version, while slightly more complicated, is more accurate because all three components end up with exactly the same dst_pos and so they will get interpolated and sampled at the same texture coordinate. This makes the workaround suitable for using with scaled blits. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2018-05-09 11:16:33 -07:00
Vadym Shovkoplias	cdb3eb7174	intel/blorp: Fix possible NULL pointer dereferencing Fix incomplete check of input params in blorp_surf_convert_to_uncompressed() which can lead to NULL pointer dereferencing. Fixes: `5ae8043fed` ("intel/blorp: Add an entrypoint for doing bit-for-bit copies") Fixes: `f395d0abc8` ("intel/blorp: Internally expose surf_convert_to_uncompressed") Reviewed-by: Emil Velikov <emli.velikov@collabora.com> Reviewed-by: Andres Gomez <agomez@igalia.com>	2017-11-30 16:20:05 +02:00
Jason Ekstrand	86becfd2de	intel/blorp: Add fast-clear to the special case in MSAA resolves This doesn't go all the way of avoiding the txf_ms if it's fast-cleared, however it does at least make us only do it once. This should improve performance of MSAA resolves in the presence of lots of clear color. Without the patch, enabling fast-clears in the multisampling Sascha demo drops the framerate by about 10%. With this patch, enabling fast-clears increases the demo's framerate by 25%. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-11-27 16:22:11 -08:00
Jason Ekstrand	dc21c3937c	intel/blorp/blit: Rename blorp_nir_txf_ms_mcs That name is already taken by one of the helpers in blorp_nir_builder.h and, while we haven't moved the guts of blorp_blit.c there yet, we'd like to start using some things from that header. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com>	2017-11-27 16:19:38 -08:00
Jordan Justen	3dcbc5cdaa	intel/compiler: Remove final_program_size from brw_compile_* The caller can now use brw_stage_prog_data::program_size which is set by the brw_compile_* functions. Cc: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-10-31 23:36:54 -07:00
Lionel Landwerlin	0c92651a3b	blorp: enable R32G32B32X32 blorp ccs copies Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-10-21 02:37:33 +01:00
Jason Ekstrand	f395d0abc8	intel/blorp: Internally expose surf_convert_to_uncompressed Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-09-20 17:21:06 -07:00
Kenneth Graunke	fc20df830c	blorp: Make blorp_buffer_copy work on Gen4-6. Gen4-6 can only handle surfaces up to 8192. Only Gen7+ can do 16384. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-08-30 16:59:19 -07:00
Kenneth Graunke	81d5b61a19	blorp: Turn anv_CmdCopyBuffer into a blorp_buffer_copy() helper. I want to be able to copy between buffer objects using BLORP in the i965 driver. Anvil already had code to do this, in a reasonably efficient manner - first using large bpp copies, then smaller bpp copies. This patch moves that logic into BLORP as blorp_buffer_copy(), so we can use it in both drivers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-08-30 16:59:07 -07:00
Topi Pohjolainen	393ec1a507	intel/blorp: Adjust intra-tile x when faking rgb with red-only v2 (Jason): Adjust directly in surf_fake_rgb_with_red() Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101910 CC: mesa-stable@lists.freedesktop.org Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-08-21 09:55:08 +03:00
Jason Ekstrand	5de4209f91	intel/isl: Add a helper to get a subimage surface We already have a helper for doing this in BLORP, this just moves the logic into ISL where we can share it with other components. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2017-07-22 21:41:12 -07:00
Jason Ekstrand	b26b2490e5	intel/blorp: Allow blorp_copy on sRGB formats Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-07-22 20:59:22 -07:00
Topi Pohjolainen	0926fb69a4	intel/blorp/gen4: Drop cube map flag for single face copy This will falsely trigger an assert on number of layers once isl is used for 3D layouts of Gen4 cube maps. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-07-18 21:36:13 +03:00
Ian Romanick	1b101ca809	blorp: Use normalized coordinates on Gen6 Apparently, the sampler has some sort of precision issues for non-normalized texture coordinates with linear filtering. This caused some small precision issues in scaled blits. Work around this by using normalized coordinates. There is some extra work necessary because Gen6 uses TEX (instead of TXF) for some multisample resolve blits. Fixes piglit.spec.arb_framebuffer_object.fbo-blit-stretch on SNB. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=68365 Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2017-06-26 13:41:11 -07:00
Ian Romanick	cbb941cdec	intel/blorp: Apply source offset in the TEX case Previously the offset was only applied in the TXF case. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Suggested-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-06-20 11:07:02 -07:00
Ian Romanick	990f2be139	intel/blorp: Apply Gen4 coord. normalization after cubemap sizes are adjusted Otherwise the values used for coordinate normalization use the wrong sizes. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Suggested-by: Jason Ekstrand <jason.ekstrand@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-06-20 11:07:02 -07:00
Jason Ekstrand	b2dd61196e	intel/blorp: Set needs_(dst\|src)_offset for Gen4 cubemaps We call convert_to_single_slice so they may end up with a non-trivial offset that needs to be taken into account. v2 (idr): Also set needs_src_offset. Suggested by Jason. Fixes ES2-CTS.functional.texture.specification.basic_copyteximage2d.cube_rgba and ES2-CTS.functional.texture.specification.basic_copytexsubimage2d.cube_rgba on G45. Signed-off-by: Ian Romanick <ian.d.romanick@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101284 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-06-20 11:07:02 -07:00
Jason Ekstrand	d065a9540c	intel/isl: Add a helper for getting the byte/tile offset of a subimage Frequently, get_image_offset_sa is combined with get_intratile_offset_sa so it makes sense to have a single helper to do both. If the caller doesn't want the intratile offsets, it can simply pass NULL and ISL will assert that they are 0. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-06-01 15:33:58 -07:00
Jason Ekstrand	c1a70165be	intel/isl: Remove the device parameter from isl_tiling_get_info We were only using it for validating that we don't use Ys/Yf on gen8 and earlier. Removing it from isl_tiling_get_info lets us remove it from a bunch of other things that had no business needing a hardware generation. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-06-01 15:33:31 -07:00
Jason Ekstrand	fa13ef285d	intel/blorp: Assert that no one tries to blit combined depth stencil Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-05-26 07:58:01 -07:00
Jason Ekstrand	752d7af77a	i965: Add blorp support for gen4-5 Due to complications with things such as URB setup on gen4-5, it's easier to keep gen4 support in blorp completely internal to i965. This makes things a bit awkward because that means there's a file in i965 that includes blorp_priv.h but it's either that or have a file in blorp that includes brw_context.h. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-05-26 07:58:01 -07:00
Jason Ekstrand	0ed6f196fc	intel/blorp: Add support for gen4-5 SF programs As part of enabling support for SF programs, we plumb the SF URB size through to emit_urb_config. For now, it's always zero but, on gen4, it may be something larger. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-05-26 07:58:01 -07:00
Jason Ekstrand	8bce7bda45	intel/blorp: Make convert_to_single_slice available outside blorp_blit Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-05-26 07:58:01 -07:00
Jason Ekstrand	302c0488cf	intel/blorp: Don't use ffma directly It isn't supported prior to gen6 and, on gen6+, NIR will fuse the fmul and fadd into an ffma automatically for us anyway. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-05-26 07:58:01 -07:00
Jason Ekstrand	3d35e5a51e	intel/blorp/blit: Add support for normalized coordinates Gen5 and earlier can't do non-normalized coordinates so we need to compensate in the shader. Fortunately, it's pretty easy plumb through. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-05-26 07:58:01 -07:00
Jason Ekstrand	554a1731a5	intel/blorp: Move the gen7 stencil format workaround to blorp_blit It's not needed for blorp_copy because it already overrides formats. It's also not needed for blorp_clear because it clears stencil as stencil. Reviewed-by: Topi Pohjolainen <topi.pohjolainen@intel.com>	2017-05-26 07:58:01 -07:00
Jason Ekstrand	b86dba8a0e	nir: Embed the shader_info in the nir_shader again Commit `e1af20f18a` changed the shader_info from being embedded into being just a pointer. The idea was that sharing the shader_info between NIR and GLSL would be easier if it were a pointer pointing to the same shader_info struct. This, however, has caused a few problems: 1) There are many things which generate NIR without GLSL. This means we have to support both NIR shaders which come from GLSL and ones that don't and need to have an info elsewhere. 2) The solution to (1) raises all sorts of ownership issues which have to be resolved with ralloc_parent checks. 3) Ever since `00620782c9`, we've been using nir_gather_info to fill out the final shader_info. Thanks to cloning and the above ownership issues, the nir_shader::info may not point back to the gl_shader anymore and so we have to do a copy of the shader_info from NIR back to GLSL anyway. All of these issues go away if we just embed the shader_info in the nir_shader. There's a little downside of having to copy it back after calling nir_gather_info but, as explained above, we have to do that anyway. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-09 15:07:47 -07:00
Chad Versace	6cbc13d94c	intel: Fix requests for exact surface row pitch (v2) All callers of isl_surf_init() that set 'min_row_pitch' wanted to request an exact row pitch, as evidenced by nearby asserts, but isl lacked API for doing so. Now that isl has an API for that, update the code to use it. v2: Assert that isl_surf_init() succeeds because the callers assume it. [for jekstrand] Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> (v1) Reviewed-by: Anuj Phogat <anuj.phogat@gmail.com> (v1) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (v2)	2017-03-28 09:44:44 -07:00

1 2

94 commits