fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-01-31 15:40:24 +01:00

Author	SHA1	Message	Date
Marek Olšák	837f74aa51	mesa: implement GL_ATI_meminfo (v2) v2: rebase Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2016-02-05 17:31:20 +01:00
Marek Olšák	1d79b99580	mesa: implement GL_NVX_gpu_memory_info (v2) v2: implement eviction queries properly add gl_memory_info structure Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2016-02-05 17:30:07 +01:00
Marek Olšák	d2e4c9e737	gallium: add interface for querying memory usage and sizes (v2) If you're worried about the duplication of some CAPs, we can remove them later. v2: add fields for memory eviction stats Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2016-02-05 17:29:38 +01:00
Marek Olšák	c577f2843a	gallium/radeon: remove radeon_info::r600_tiling_config Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:29:19 +01:00
Marek Olšák	4f96846d9d	gallium/radeon: get pipe_interleave_bytes AKA group_bytes from the winsys Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:59 +01:00
Marek Olšák	276621da45	gallium/radeon: set num_banks in the winsys amdgpu doesn't have to set this, because radeonsi gets it from tile mode arrays by default. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:40 +01:00
Marek Olšák	294ec530c9	gallium/radeon: just get num_tile_pipes from the winsys Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:24 +01:00
Marek Olšák	0f3556d308	winsys/amdgpu: add an assertion to cik_get_num_tile_pipes (v2) v2: print an error to stderr Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:18 +01:00
Marek Olšák	a2291f7b57	winsys/amdgpu: remove an r600-only setting Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:12 +01:00
Marek Olšák	1e864d7379	gallium/radeon: rename & reorder members of radeon_info Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2016-02-05 17:28:00 +01:00
Steinar H. Gunderson	feb53912f8	mesa: Fix locking of GLsync objects. GLsync objects had a race condition when used from multiple threads (which is the main point of the extension, really); it could be validated as a sync object at the beginning of the function, and then deleted by another thread before use, causing crashes. Fix this by changing all casts from GLsync to struct gl_sync_object to a new function _mesa_get_and_ref_sync() that validates and increases the refcount. In a similar vein, validation itself uses _mesa_set_search(), which requires synchronization -- it was called without a mutex held, causing spurious error returns and other issues. Since _mesa_get_and_ref_sync() now takes the shared context mutex, this problem is also resolved. Fixes bug #92757, found while developing Nageru, my live video mixer (due for release at FOSDEM 2016). v2: Marek: silence warnings, fix declaration after code Signed-off-by: Steinar H. Gunderson <sesse@google.com> Cc: "11.0 11.1" <mesa-stable@lists.freedesktop.org> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 17:18:17 +01:00
Nicolai Hähnle	156e81f305	radeonsi: add placeholder MC and SRBM performance counter groups Yet another change motivated by AMD GPUPerfStudio compatibility. These groups are not directly accessible from userspace, and AMD GPUPerfStudio does not actually query them - it just requires them to be there. Hence, adding a placeholder for now. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Acked-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:25:33 -05:00
Nicolai Hähnle	988f4b31f3	radeonsi: re-order the SQ_xx performance counter blocks This is yet another change motivated by appeasing AMD GPUPerfStudio's hardcoding of performance counter group numbers. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Acked-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:25:30 -05:00
Nicolai Hähnle	75affd73b0	radeonsi: re-order the perfcounter hardware blocks As documented in the comment, AMD GPUPerfStudio unfortunately hardcodes the order of performance counter groups. Let's do the pragmatic thing and present the same order as Catalyst/Crimson. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Acked-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:25:27 -05:00
Nicolai Hähnle	b0e32548c8	gallium/radeon: add GPIN driver query group This group was used by older versions of AMD GPUPerfStudio (via AMD_performance_monitor) to identify the GPU family, and GPUPerfStudio still complains when it isn't available. Reviewed-by: Edward O'Callaghan <eocallaghan@alterapraxis.com> Acked-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:24:59 -05:00
Nicolai Hähnle	4b672b8310	radeonsi: Allow dumping LLVM IR before optimization passes Set R600_DEBUG=preoptir to dump the LLVM IR before optimization passes, to allow diagnosing problems caused by optimization passes. Note that in order to compile the resulting IR with llc, you will first have to run at least the mem2reg pass, e.g. opt -mem2reg -S < shader.ll | llc -march=amdgcn -mcpu=bonaire Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> (original patch) Signed-off-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (w/ debug flag) Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:22:04 -05:00
Nicolai Hähnle	5aafc169ca	gallium/radeon: emit LLVM `ret void` before radeon_llvm_finalize_module This allows dumping a consumable LLVM module before the initial optimization passes are run. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:21:54 -05:00
Nicolai Hähnle	7e9670c8bc	st/mesa: bail out of try_pbo_upload_common when constant upload fails Also fixes a resource leak when an upload_mgr is used for constants. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:21:51 -05:00
Nicolai Hähnle	a01e44adcc	st/mesa: bail out of try_pbo_upload_common when vertex upload fails At the same time, fix a memory leak noticed by Ilia Mirkin. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:21:48 -05:00
Nicolai Hähnle	b27c79bd81	st/mesa: reduce the scope of sampler_view in try_pbo_upload_common We can get rid of our reference immediately, since the driver will hold onto it for us. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:21:44 -05:00
Nicolai Hähnle	13e21e3ec5	st/mesa: do uploads earlier in try_pbo_upload_common While rather unlikely, uploads _can_ fail. Doing them earlier means we'll have to restore less state when they do fail, and it's slightly easier to check the restore code. Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-05 09:21:27 -05:00
Neil Roberts	eb9cf3cfc9	main: Use a derived value for the default sample count Previously the framebuffer default sample count was taken directly from the value given by the application. On the i965 driver on HSW if the value wasn't one that is supported by the hardware it would hit an assert when it tried to program the state for it. This patch fixes it by adding a derived sample count to the state for the default framebuffer. The driver can then quantize this to one of the valid values in its UpdateState handler when the _NEW_BUFFERS state changes. _mesa_geometric_samples is changed to use the new derived value. Fixes the piglit test arb_framebuffer_no_attachments-query Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93957 Cc: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-02-05 11:05:10 +00:00
Neil Roberts	5fd848f6c9	program: Use _mesa_geometric_samples to calculate gl_NumSamples Otherwise it won't take into account the default samples for framebuffers with no attachments. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-02-05 11:05:06 +00:00
Neil Roberts	4995d9c9a0	main: Use _mesa_geometric_samples to calculate GL_SAMPLE_BUFFERS Otherwise it won't take into account the default samples for framebuffers with no attachments. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-02-05 11:05:01 +00:00
Neil Roberts	d8d4661ddb	main: Use _mesa_geometric_samples to calculate the value of GL_SAMPLES Otherwise it won't take into account the default samples for framebuffers with no attachments. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-02-05 11:04:44 +00:00
Ilia Mirkin	2065e380b2	nvc0: avoid negatives in PUSH_SPACE argument Fixup to commit `03b3eb90d` - the number of buffers could be larger than the number of elements, in which case we'd pass a negative argument to PUSH_SPACE, which would be bad. While we're at it, merge it with the other PUSH_SPACE at the top of the function. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2016-02-05 00:49:51 -05:00
Ilia Mirkin	03b3eb90d7	nvc0: add some missing PUSH_SPACE's nvc0_vbo has explicit push space checking enabled, so we must run PUSH_SPACE by hand. A few spots missed that. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2016-02-05 00:41:43 -05:00
Ilia Mirkin	1a0fde1f52	nvc0/ir: fix converting between predicate and gpr The spill logic will insert convert ops when moving between files. It seems like the emission logic wasn't quite ready for these converts. Tested on fermi, and visually looked at nvdisasm output for maxwell. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: mesa-stable@lists.freedesktop.org	2016-02-05 00:41:33 -05:00
Ilia Mirkin	2fed18b8a5	nvc0: add support for ARB_query_buffer_object Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu>	2016-02-04 21:21:30 -05:00
Ilia Mirkin	9cd5bb9f9f	st/mesa: add query buffer support Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Ilia Mirkin	f9e6f46335	gallium: add PIPE_CAP_QUERY_BUFFER_OBJECT Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Ilia Mirkin	40d7f02c67	gallium: add a way to store query result into buffer Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Ilia Mirkin	386a9ec77b	mesa: add core implementation of ARB_query_buffer_object Forwards query result writes to drivers. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Ilia Mirkin	7c3f4b2fd8	mesa: add driver interface for writing query results to buffers Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Rafal Mielniczuk	3efcd4df01	mesa: Handle QUERY_BUFFER_BINDING in GetIntegerv Signed-off-by: Rafal Mielniczuk <rafal.mielniczuk2@gmail.com> [imirkin: move to GL/GL_CORE section] Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Rafal Mielniczuk	2d0ec0c272	mesa: Add QueryBuffer to context Add QueryBuffer and initialise it to NullBufferObj on start Signed-off-by: Rafal Mielniczuk <rafal.mielniczuk2@gmail.com> [imirkin: also release QueryBuffer on free] Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Rafal Mielniczuk	c5bab061da	mesa: Add ARB_query_buffer_object extension flag Signed-off-by: Rafal Mielniczuk <rafal.mielniczuk2@gmail.com> [imirkin: add string to extensions.c] Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Rafal Mielniczuk	4913d381a0	glapi: Add xml infrastructure for ARB_query_buffer_object Signed-off-by: Rafal Mielniczuk <rafal.mielniczuk2@gmail.com> [imirkin: move definition to gl_API.xml as it is very short] Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-02-04 21:21:30 -05:00
Timothy Arceri	23e24e27ac	glsl: simplify setting of image access qualifiers Cc: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-02-05 10:05:40 +11:00
Timothy Arceri	815929bd15	mesa: remove dead program parameter functions Reviewed-by: Iago Toral Quiroga <itoral@igalia.com>	2016-02-05 09:11:00 +11:00
Axel Davy	94d91c6707	st/nine: Use align_free when needed Use align_free to free memory allocated with align_malloc. Signed-off-by: Axel Davy <axel.davy@ens.fr> Reviewed-by: Patrick Rudolph <siro@das-labor.org>	2016-02-04 22:12:17 +01:00
Axel Davy	6b12fe77ea	st/nine: Disallow non-argb8888 cursors Only argb8888 cursors are allowed. Signed-off-by: Axel Davy <axel.davy@ens.fr> Reviewed-by: Patrick Rudolph <siro@das-labor.org>	2016-02-04 22:12:17 +01:00
Axel Davy	24ddadbba9	st/nine: Enforce centroid for color input when multisampling is on The color inputs must automatically use centroid whether multisampling is used or not. Signed-off-by: Axel Davy <axel.davy@ens.fr> Reviewed-by: Patrick Rudolph <siro@das-labor.org>	2016-02-04 22:12:17 +01:00
Axel Davy	d5389bb92d	st/nine: Fix centroid flag sem.reg.mod & NINED3DSPDM_CENTROID is worth 4 when centroid is requested, whereas TGSI_INTERPOLATE_LOC_CENTROID is worth 1. Signed-off-by: Axel Davy <axel.davy@ens.fr>	2016-02-04 22:12:17 +01:00
Axel Davy	ee31f0fed4	st/nine: Use fast clears more often for MRTs This makes it possible to use fast clears in the following case: the pixel shader renders to 1 RT; 4 RTs are bound; a clear is issued; a new pixel shader that renders to 4 RTs is bound. Previously the fast clear path wouldn't be hit, because when trying the fast clear path, the framebuffer state would be configured for 1 RT, instead of 4. Signed-off-by: Axel Davy <axel.davy@ens.fr> Reviewed-by: Patrick Rudolph <siro@das-labor.org>	2016-02-04 22:12:17 +01:00
Axel Davy	e85ef7d8e5	st/nine: Use linear filtering for shadow mapping Some docs say linear filtering is always used when app does shadow mapping. Signed-off-by: Axel Davy <axel.davy@ens.fr> Reviewed-by: Patrick Rudolph <siro@das-labor.org>	2016-02-04 22:12:17 +01:00
Patrick Rudolph	0b35da59de	st/nine: Respect block alignment on surface lock Respect block alignment for ATI1/ATI2 format when trying to lock a surface using LockRect(). Fixes failing WINE tests device.c test_surface_blocks() tests. Signed-off-by: Patrick Rudolph <siro@das-labor.org> Reviewed-by: Axel Davy <axel.davy@ens.fr>	2016-02-04 22:12:17 +01:00
Axel Davy	56b4222b29	st/nine: Add Render state validation layer Testing Win behaviour seems to show wrong states are accepted, but then depending on the states some specific 'good' behaviours happen. This adds some validation to catch invalid states and have these 'good' behaviours when it happens. Also reorders SetRenderState to match the expected optimisation: (Value == previous Value) => return immediately, which affects D3D9 hacks too. Signed-off-by: Axel Davy <axel.davy@ens.fr> Signed-off-by: Patrick Rudolph <siro@das-labor.org>	2016-02-04 22:12:17 +01:00
Patrick Rudolph	7132617436	DRI_CONFIG: Add option to override vendor id Add config option override_vendorid to report a fake card in d3dadapter9 drm. Signed-off-by: Patrick Rudolph <siro@das-labor.org> Reviewed-by: Axel Davy <axel.davy@ens.fr>	2016-02-04 22:12:17 +01:00
Patrick Rudolph	1a893ac886	st/nine: Implement NineDevice9_GetAvailableTextureMem Implement a device private memory counter similar to Win 7. Only textures and surfaces increment vidmem and may return ERR_OUTOFVIDEOMEMORY. Vertex buffer and index buffer creation always succeeds, even when out of video memory. Fixes "Vampire: The Masquerade - Bloodlines" allocating resources until crash. Fixes "Age of Conan" allocating resources until crash. Fixes failing WINE test device.c test_vidmem_accounting(). Signed-off-by: Patrick Rudolph <siro@das-labor.org> Reviewed-by: Axel Davy <axel.davy@ens.fr>	2016-02-04 22:12:17 +01:00
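
The GLsync locking fix listed above (feb53912f8) describes replacing a "validate, then use" sequence with a single lookup that validates the handle and takes a reference while a shared mutex is held. The following is a minimal sketch of that pattern, not the Mesa code: the types, the fixed-size table, the integer handles, and every function name here (`shared_init`, `create_sync`, `get_and_ref_sync`, `unref_sync`, `delete_sync`) are hypothetical stand-ins chosen for illustration.

```c
#include <assert.h>
#include <pthread.h>
#include <stdlib.h>

#define MAX_SYNCS 16

/* Hypothetical stand-in for a reference-counted sync object. */
struct sync_object {
    unsigned id;     /* handle handed out to the application */
    int refcount;    /* protected by shared_state.mutex */
};

/* Hypothetical stand-in for state shared between contexts. */
struct shared_state {
    pthread_mutex_t mutex;
    unsigned next_id;
    struct sync_object *live[MAX_SYNCS]; /* set of valid objects */
};

static void shared_init(struct shared_state *sh)
{
    pthread_mutex_init(&sh->mutex, NULL);
    sh->next_id = 1;
    for (int i = 0; i < MAX_SYNCS; i++)
        sh->live[i] = NULL;
}

/* Create an object holding one reference; returns 0 on failure. */
static unsigned create_sync(struct shared_state *sh)
{
    unsigned id = 0;
    struct sync_object *obj = malloc(sizeof(*obj));
    if (!obj)
        return 0;
    pthread_mutex_lock(&sh->mutex);
    for (int i = 0; i < MAX_SYNCS; i++) {
        if (!sh->live[i]) {
            obj->id = id = sh->next_id++;
            obj->refcount = 1;
            sh->live[i] = obj;
            break;
        }
    }
    pthread_mutex_unlock(&sh->mutex);
    if (id == 0)
        free(obj); /* table full */
    return id;
}

/* Validate the handle and take a reference in one critical section,
 * so no other thread can delete the object between the two steps. */
static struct sync_object *get_and_ref_sync(struct shared_state *sh, unsigned id)
{
    struct sync_object *found = NULL;
    pthread_mutex_lock(&sh->mutex);
    for (int i = 0; i < MAX_SYNCS; i++) {
        if (sh->live[i] && sh->live[i]->id == id) {
            found = sh->live[i];
            found->refcount++;
            break;
        }
    }
    pthread_mutex_unlock(&sh->mutex);
    return found;
}

/* Drop one reference; free the object when the last one goes away. */
static void unref_sync(struct shared_state *sh, struct sync_object *obj)
{
    int destroy;
    pthread_mutex_lock(&sh->mutex);
    assert(obj->refcount > 0);
    destroy = (--obj->refcount == 0);
    pthread_mutex_unlock(&sh->mutex);
    if (destroy)
        free(obj);
}

/* Remove the handle from the valid set and drop the creation reference.
 * Callers that already hold a reference keep the object alive. */
static int delete_sync(struct shared_state *sh, unsigned id)
{
    struct sync_object *obj = NULL;
    pthread_mutex_lock(&sh->mutex);
    for (int i = 0; i < MAX_SYNCS; i++) {
        if (sh->live[i] && sh->live[i]->id == id) {
            obj = sh->live[i];
            sh->live[i] = NULL;
            break;
        }
    }
    pthread_mutex_unlock(&sh->mutex);
    if (!obj)
        return 0;
    unref_sync(sh, obj);
    return 1;
}
```

In this sketch a caller pairs each successful `get_and_ref_sync()` with one `unref_sync()`. A concurrent `delete_sync()` only makes later lookups fail; it does not free an object that another thread is still using.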


85652 commits