fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-07 15:48:36 +02:00

Author	SHA1	Message	Date
Roland Scheidegger	0be1dc25cf	r600: increase number of ubos by one to 14 Ideally we'd support 16 (d3d11 requires 15, and mesa subtracts one for non-ubo constants), but that's kind of impossible (it would be only doable if either we'd somehow merge the mesa non-ubo constants with the driver constants, or only use the driver constants with vtx fetch instead of through the kcache mechanism - the latter probably wouldn't be too bad). For now just do as the comment already said, place the gs ring (not really a const buffer in any case) which is only ever referred to through vc fetch clauses at index 16. Throw in a couple asserts for good measure to make sure the hw limit isn't exceeded. Tested-by: Konstantin Kharlamov <hi-angel@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-10 04:59:00 +01:00
Roland Scheidegger	43292c78b7	r600: set up constants needed for txq for buffers and cube maps with tes We only did this for the other stages, but obviously tess eval/ctrl need it too. This fixes the (newly modified) piglit texturing/textureSize test when run with tes stage and bufferSampler. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-10 04:59:00 +01:00
Roland Scheidegger	22ba4ebb18	r600: don't emit reloc for ring buffer out into the blue It looks like this reloc belongs to setting the constant reg, which is skipped for gs ring. Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-10 04:59:00 +01:00
Roland Scheidegger	76baf99737	r600: hack up num_render_backends on Juniper to 8 Juniper really has a maximum of 4 RBEs (16 pixels). However, predication always locks up on my HD 5750, and through experiments it looks like if we're pretending it has a maximum of 8, with 4 disabled, it works correctly. My conclusion would be that there's a bug (likely firmware, not hw) which causes the predication logic to try to read 8 results out of the query buffer instead of just 4, and since of course no one ever writes the upper 4, the status bit is never set and hence it will wait for it forever. Ideally this would be fixed in firmware, but I'd guess chances of that happening are slim. This will double the size of (occlusion) query result buffers, write the status bit for the disabled rbs in these buffers, and will also add 8 results together instead of just 4 when reading them back. The latter is unnecessary, but it's probably not worth bothering - luckily num_render_backends isn't used outside of occlusion queries, so don't need separate value for the "real" maximum. Also print out the enabled_rb_mask if it changed from the pre-fixed value (which is already printed out), just in case there's some more problems with chips which have some rbs disabled... This fixes all the lockups with piglit nv_conditional_render tests on my HD 5750 (all pass). Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-10 04:59:00 +01:00
Roland Scheidegger	f0dd1b3612	winsys/radeon: fix up default enabled_rb_mask for r600 The logic had two fatal flaws which completely killed the default value. 1) drm will overwrite the value anyway even if the chip can't be handled 2) the default value logic is relying on num_render_backends, which was filled in later. Luckily no one is relying on it, but it's a bit confusing seeing the chip clock printed out there (as hex) with R600_DEBUG=info... (Albeit radeonsi does not appear to fix up the value. If kernels which don't handle this query are still supported, radeonsi will still end up with a broken enabled_rb_mask, I have no idea of the potential results of this there.) Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-10 04:59:00 +01:00
Roland Scheidegger	7c0bc495f1	r600: fix enabled_rb_mask on eg/cm For eg/cm, the r600_gb_backend_map will always be 0. This is a bug in the drm kernel driver, as it just never fills the information in (it is now being fixed - the history shows it was being filled in when the query was brand new but got lost shortly thereafter with backend_map fixes). This causes r600_query_hw_prepare_buffer to write the "status bit" (just the highest bit of the occlusion query result) even for active rbes (all but the first). This doesn't make much sense, albeit I suppose it's mostly safe. According to the commit history, it's necessary to set these bits for inactive rbes since otherwise predication will lock up - presumably the hw just is waiting for the status bit to appear, which will never happen with inactive rbes. I'd guess potentially predication could be wrong (due to not waiting for the actual result if the status bit is already there) if this is set for active rbes. Discovered while trying to fix predication lockups on Juniper (needs another patch). Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-10 04:59:00 +01:00
Roland Scheidegger	762ccf483a	r600: fix sampler indexing with texture buffers sampling This fixes the new piglit test. While here also fix up the logic for early exit of setting up driver consts. Tested-by: Konstantin Kharlamov <hi-angel@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-10 04:59:00 +01:00
Roland Scheidegger	6c8d6ce982	r600: don't use vtx offset for load_sample_position The offset looks bogus to me. Albeit in the end it doesn't matter, by the looks of it offsets smaller than 4 get ignored there (not sure of the rules, I suppose either non-dword aligned offsets never work there or the offset must be at least aligned to the size of a single element). Tested-by: Konstantin Kharlamov <hi-angel@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com>	2018-01-10 04:59:00 +01:00
Dave Airlie	f4b1ec2972	r600: drop l2 related queries radeonsi only. Signed-off-by: Dave Airlie <airlied@redhat.com>	2018-01-10 00:56:09 +00:00
Dave Airlie	e836fb2002	r600/shader: only read back the necessary tess factor components. This just reduces the lds reads for the tess factor emission. Signed-off-by: Dave Airlie <airlied@redhat.com>	2018-01-10 00:54:32 +00:00
Jon Turney	adfb9c5c7b	Fix use of alloca() without #include <c99_alloca.h> ../../../src/mesa/main/shaderapi.c: In function ‘_mesa_ShaderBinary’: ../../../src/mesa/main/shaderapi.c:2188:9: error: implicit declaration of function ‘alloca’ [-Werror=implicit-function-declaration]	2018-01-09 22:07:52 +00:00
Kenneth Graunke	28c2d0d80b	genxml: Add missing INSTDONE_1 bits on Gen7.5+. This will make aubinator_error_decode decode them properly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-09 10:13:53 -08:00
Kenneth Graunke	8eadc2fb8f	intel: Apply Geminilake "Barrier Mode" workaround. Apparently, Geminilake requires you to whack a chicken bit to select either compute or tessellation mode for barriers. The recommendation is to switch between them at PIPELINE_SELECT time. We may not need to do this all the time, but I don't know that it hurts either. PIPELINE_SELECT is already a pretty giant stall. This appears to fix hangs in tessellation control shaders with barriers on Geminilake. Note that this requires a corresponding kernel change, drm/i915: Whitelist SLICE_COMMON_ECO_CHICKEN1 on Geminilake. in order for the register write to actually happen. Without an updated kernel, this register write will be noop'd and the fix will not work. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2018-01-09 10:13:33 -08:00
Emil Velikov	5e7d06fcb0	docs: update calendar, add news and link release notes for 17.3.2 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2018-01-09 16:13:31 +00:00
Emil Velikov	ea9c548494	docs: add sha256 checksums for 17.3.2 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `3a67ca681b`)	2018-01-09 16:10:12 +00:00
Emil Velikov	6c73767596	docs: add release notes for 17.3.2 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> (cherry picked from commit `0f27052e32`)	2018-01-09 16:10:10 +00:00
Indrajit Das	e05d5b0cf3	st/omx_bellagio: Update default intra matrix per MPEG2 spec Signed-off-by: Indrajit Das <indrajit-kumar.das@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2018-01-09 09:10:24 -05:00
Scott D Phillips	42f421cbbf	aubinator: add support for aubinating memtrace aubs Memtrace aubs are similar to classic aubs, with the major difference being how command submission is serialized (as register writes instead of a high-level submit message). Some internal tools generate or consume only memtrace aubs. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-01-08 21:11:11 -08:00
Scott D Phillips	8cdf5bd292	aubinator: extract aubinator_init() out of the header handler function A later patch will use the aubinator_init() function from the memtrace aub header handler. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-01-08 21:11:11 -08:00
Scott D Phillips	4f0a2ff4c1	aubinator: honor --color option when printing the header Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-01-08 21:11:11 -08:00
Scott D Phillips	161a97c3d5	.gitignore: Ignore new generated files New generated files from: `bb1e6ff161` ("spirv: Add a prepass to set types on vtn_values") `65fc16c974` ("autotools: set XA versions in configure.ac and configure header file") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2018-01-08 21:11:11 -08:00
Dylan Baker	73ce7cb474	Meson: ensure variable defined A gallium driver is undefined if passing -Dgallium-drivers='' Fixes: `e0b037d697` ("meson: Build SWR driver") Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Acked-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@intel.com>	2018-01-08 17:43:45 -08:00
Dylan Baker	21bca27349	meson: Fix typo in clover build The leading space breaks things. fixes: `42ea0631f1` ("meson: build clover") Signed-off-by: Dylan Baker <dylan.c.baker@intel.com>	2018-01-08 17:31:55 -08:00
Dylan Baker	eab0316d10	meson: set opencl flags for r600 Signed-off-by: Dylan Baker <dylan.c.baker@intel.com>	2018-01-08 16:39:48 -08:00
Dylan Baker	42ea0631f1	meson: build clover This has only been compile tested. v2: - Have a single option for opencl (Eric E) - fix typo "tgis" -> "tgsi" (Curro) - Don't add "lib" to pipe loader libraries, which matches the autotools behavior v3: - Remove trailing whitespace - Make PIPE_SEARCH_DIR an absolute path v4: - add trailing / to LIBCLC defines Acked-by: Curro Jerez <currojerez@riseup.net> Tested-by: Jan Vesely <jan.vesely@rutgers.edu> cc: Aaron Watry <awatry@gmail.com> Signed-off-by: Dylan Baker <dylan.c.baker@intel.com>	2018-01-08 16:39:42 -08:00
Dylan Baker	425fcbde3f	meson: Turn on swr for relevant targets Currently that's dri, libgl-xlib, and osmesa. v2: - put drivers on a separate line from normal dependencies (Eric E) cc: George Kyriazis <george.kyriazis@intel.com> cc: Tim Rowley <timothy.o.rowley@intel.com> cc: Bruce Cherniak <bruce.cherniak@intel.com> Signed-off-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2018-01-08 16:39:37 -08:00
Dylan Baker	e0b037d697	meson: Build SWR driver This enables the SWR driver, but doesn't actually hook it up to any of the targets yet. I felt like this patch was big and complicated enough without adding that. v2: - Fix typo 'delemeited' -> 'delimited' (Eric E) - Fix type 'errror' -> 'error' (Eric E) - Use variables to hold files instead of looking above the current meson build (Eric E) - Use foreach loops to reduce the number of unique generators - Add comment about why some generators have names and some are just added to a list v3: - Remove trailing whitespace Signed-off-by: Dylan Baker <dylan.c.baker@intel.com>	2018-01-08 16:39:30 -08:00
Timothy Arceri	f04d2ca0d9	ac: rework emit_barrier() to not segfault on radeonsi nir_to_llvm_context will always be NULL for radeonsi so we need to work around this. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com>	2018-01-09 10:21:32 +11:00
Timothy Arceri	19f3141e6a	ac: add load_tess_level() to the abi Fixes the following piglit tests in radeonsi: vs-tcs-tes-tessinner-tessouter-inputs-quads.shader_test vs-tcs-tes-tessinner-tessouter-inputs-tris.shader_test vs-tes-tessinner-tessouter-inputs-quads.shader_test vs-tes-tessinner-tessouter-inputs-tris.shader_test v2: make use of si_shader_io_get_unique_index_patch() via the helper in the previous patch rather than shader_io_get_unique_index() Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v1) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-09 10:21:32 +11:00
Timothy Arceri	2bd7ab32cf	radeonsi: add load_tess_level() helper This will be shared by the tgsi and nir backends. v2: move si_shader_io_get_unique_index_patch() call inside the helper. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (v1) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2018-01-09 10:21:32 +11:00
Jason Ekstrand	9e5aaa93cb	spirv: Do implicit conversions of uint to bool in OpStore Technically, the GLSLang bug related to this can also affect SSBO writes where the bool -> uint conversion is missing. However, the only known shipping application with an old enough version of GLSLang to cause issues with this is the new DOOM game so we keep the workaround as small as possible. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104424 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	154668e79c	spirv: Loosen the validation for load/store type matching Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104338 Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=104424 Tested-by: Eero Tamminen <eero.t.tamminen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	986303cb92	spirv: Require a storage type for OpStore destinations This rules out things such as trying to store a pointer to a local variable. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	70f588778c	spirv: Add a vtn_types_compatible helper Tested-by: Eero Tamminen <eero.t.tamminen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	8bad7f33c6	spirv: Store the id of the type in vtn_type Previously, we were storing a pointer to the vtn_value because we use it to look up decorations when we create input/output variables. This works, but it also may be useful to have the id itself so we may as well store that instead. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	53265c8798	spirv: Add a mechanism for dumping failing shaders Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	819adfdfb4	spirv: Rework asserts in var_decoration_cb Now that higher levels are enforcing decoration sanity, we don't need the vtn_asserts here. This function should be safe but we still want a few well-placed regular asserts in case something goes awry. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	71ea4dded5	spirv: Rework error checking for decorations This reworks the error checking on our generic handling of decorations. The objective is to validate all of the SPIR-V assumptions we make up-front and convert redundant checks to compiled-out asserts. The most important part of this is to ensure that member decorations only occur on OpTypeStruct and that the member is never out-of-bounds. This way later code can assume that the member is sane and not have to worry about OOB array access due to a misplaced OpMemberDecorate. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	d6a4099303	spirv: Add better type validation to OpTypeImage Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	03c543d041	spirv: Switch on vtn_base_type in OpComposite(Extract\|Insert) This is a bit simpler since we have fewer enum values in the case. It's also a bit more efficient because we're making fewer glsl_get_* calls. While we're at it, add better type validation. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	936f49268e	spirv: Refactor Op[Spec]ConstantComposite and add better validation Now that vtn_base_type is a real and full base type, we can switch on that instead of the GLSL base type which is a lot fewer cases in our switch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	dabce5061d	spirv: Add better validation to Op[Spec]Constant Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	6cf965751a	spirv: Remove a pointless assignment in SpvOpSpecConstant We re-assign later inside the bit_size switch Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	f13a5cff72	spirv: Unify boolean constants and add better validation Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	0bb18858fb	spirv/info: Add spirv_op_to_string Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	ab85fd02d5	spirv: Make 'info' a local array spirv_info_c.py Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Jason Ekstrand	296046556a	spirv: Add better error messages in vtn_value helpers Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>	2018-01-08 14:57:44 -08:00
Caio Marcelo de Oliveira Filho	22980f941e	spirv: Import 1.2 rev 3 headers and grammar from Khronos Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2018-01-08 13:22:17 -08:00
Samuel Pitoiset	08a5f4412a	radv: get InstanceID from VGPR1 (or VGPR2 for tess) instead of VGPR3 VGPR1 = InstanceID / StepRate0; // StepRate0 can be set to 1 Ported from RadeonSI. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-08 21:30:01 +01:00
Samuel Pitoiset	be16bbe1d3	radv: avoid PS partial flushes when viewports/scissors don't change For Vega10 and Raven that need a special workaround for the scissor bug. This seems to give a minor boost for Talos and Dota 2, at least. To reduce the cost of memcmp, the driver checks if it's really useful to do the comparison. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2018-01-08 21:24:58 +01:00
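
The two r600 occlusion query commits in this listing (76baf99737 and 7c0bc495f1) describe pre-setting the "status bit" (the highest bit of each occlusion query result) for disabled render backends so that nothing waits on them forever, and summing the results over all backends on readback. A minimal sketch of that idea follows. It is not the driver's code: the function names, the `MAX_RB` constant, and the begin/end pair layout of the buffer are assumptions made for illustration only.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

#define MAX_RB     8                    /* hypothetical upper bound, as in the Juniper hack */
#define STATUS_BIT (UINT64_C(1) << 63)  /* highest bit of a 64-bit query result */

/* Hypothetical helper: zero the buffer, then mark the slots of disabled
 * render backends as "already written" by setting their status bit.
 * Assumed layout: one (begin, end) pair of 64-bit values per backend. */
static void prepare_query_buffer(uint64_t *results, unsigned num_rb,
                                 uint32_t enabled_rb_mask)
{
    assert(num_rb <= MAX_RB);
    memset(results, 0, sizeof(uint64_t) * 2 * num_rb);

    for (unsigned i = 0; i < num_rb; i++) {
        if (!(enabled_rb_mask & (1u << i))) {
            results[2 * i]     = STATUS_BIT;  /* begin */
            results[2 * i + 1] = STATUS_BIT;  /* end   */
        }
    }
}

/* Hypothetical helper: add up (end - begin) for every backend whose two
 * slots both carry the status bit. Disabled backends contribute zero;
 * slots that were never written are skipped. */
static uint64_t sum_query_results(const uint64_t *results, unsigned num_rb)
{
    uint64_t total = 0;

    for (unsigned i = 0; i < num_rb; i++) {
        uint64_t begin = results[2 * i];
        uint64_t end   = results[2 * i + 1];

        if ((begin & STATUS_BIT) && (end & STATUS_BIT))
            total += (end & ~STATUS_BIT) - (begin & ~STATUS_BIT);
    }
    return total;
}
```

With `num_rb` set to 8 and an `enabled_rb_mask` of `0x0f`, the upper four backends are pre-marked, which mirrors the "pretend there are 8, with 4 disabled" arrangement the Juniper commit describes.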


99639 commits