fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-06 13:48:06 +02:00

Author	SHA1	Message	Date
Rob Clark	27a97097e1	freedreno/ir3: fix coverity warning CID 1362453 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	374ad2e2bd	freedreno/ir3: use nir_shader_get_entrypoint() helper Should also fix coverity warning: CID 1362454 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	df64cd6814	freedreno/a4xx: fix incorrect enum type a4xx has it's own enum, different from a2xx/a3xx. Spotted by coverity: CID 1362458, 1362459 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	1632b0eac0	freedreno: fix coverity negative array index warning Never can happen, since query would not have been created in the first place if pidx(query_type) return negative. Lets let coverity realize this. CID 1362460 Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	ba452d43e0	freedreno: fix dereference before null check ptr can actually never be null so just drop the check. CID 1362464 (#1 of 1): Dereference before null check (REVERSE_INULL) check_after_deref: Null-checking ptr suggests that it may be null, but it has already been dereferenced on all paths leading to the check. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	228b2b36f4	gallium/util: remove u_staging Unused, and fixes a couple of coverity warnings: CID `1362171`, `1362170` Signed-off-by: Rob Clark <robclark@freedesktop.org> Acked-by: Marek Olšák <marek.olsak@amd.com>	2016-06-02 15:44:07 -04:00
Rob Clark	18fb922faa	freedreno/a3xx: only update/emit bordercolor state when needed Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Rob Clark	11f0652404	freedreno/a4xx: only update/emit bordercolor state when needed I noticed in stk that it was contributing to a lot of overhead. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2016-06-02 15:44:07 -04:00
Matt Turner	0d81a684c1	i965: Add missing types to type_sz(). Coverity warns in multiple places about the potential for division by zero, caused by this function's default case. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-06-02 11:34:09 -07:00
Nanley Chery	c06cef7f9b	mesa/extensions: Fix ES1 extension reporting Commit `eda15abd84` , unintentionally advertised these extensions in ES1 contexts. Undo this error. Signed-off-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-02 10:46:59 -07:00
Plamena Manolova	e8b38ca202	egl: Check if API is supported when using eglBindAPI. According to the EGL specifications before binding an API we must check whether it's supported first. If not eglBindAPI should return EGL_FALSE and generate a EGL_BAD_PARAMETER error. Signed-off-by: Plamena Manolova <plamena.manolova@intel.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2016-06-02 07:45:19 -07:00
Eric Engestrom	17f4c723eb	st/osmesa: remove double-write (overwriting) These two lines have been here since the file was created. I'm guessing the second one was just for testing during dev, so it's the one that's going away. CoverityID: 1296205 Signed-off-by: Eric Engestrom <eric@engestrom.ch> Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Brian Paul <brianp@vmware.com>	2016-06-02 07:05:05 -06:00
Nayan Deshmukh	6c9a352d79	st/vdpau: check for null pointer in get/put bits. Check for null pointer before accessing arrays in get/put bits native/YCbCr/Indexed in VdpOutputSurface and VdpVideoSurface. Signed-off-by: Nayan Deshmukh <nayan26deshmukh@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2016-06-02 09:28:48 +02:00
Christian König	b3e75c3997	radeon/uvd: fix the H264 level for Tonga v2 We support 5.2 for a while now. v2: we even support 5.2 for H264, 5.1 is for HEVC. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com> Cc: <mesa-stable@lists.freedesktop.org>	2016-06-02 09:27:57 +02:00
Alejandro Piñeiro	b48c42cd1f	mesa/formatquery: add a comment to clarify INTERNALFORMAT_PREFERRED The comment clarifies that the driver is called only to try to get a preferred internalformat, and that it was already checked if the format is supported or not. Acked-by: Eduardo Lima <elima@igalia.com> Acked-by: Antia Puentes <apuentes@igalia.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-06-02 08:54:17 +02:00
Alejandro Piñeiro	c1ceee6cc9	i965/formatquery: remove INTERNALFORMAT_PREFERRED implementation Right now the implementation only checks if the internalformat is supported or not. But that implementation is wrong, returning unsupported for some internalformats. Additionally, checking if the internalformat is supported or not is already done at mesa/main before calling the driver hook, so this new check is not needed. Acked-by: Eduardo Lima <elima@igalia.com> Acked-by: Antia Puentes <apuentes@igalia.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2016-06-02 08:54:10 +02:00
Alejandro Piñeiro	58617bcebe	i965/eu: use simd8 when exec_size != EXECUTE_16 Among other thigs, fix a gpu hang when using INTEL_DEBUG=shader_time for any shader. Signed-off-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2016-06-02 08:08:10 +02:00
Jordan Justen	0a3acff5b5	i965: Remove old CS local ID handling The old method pushed data for each channels uvec3 data of gl_LocalInvocationID. The new method pushes 1 dword of data that is a 'thread local ID' value. Based on that value, we can generate gl_LocalInvocationIndex and gl_LocalInvocationID with some calculations. Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	b1f22c6317	i965: Enable cross-thread constants and compact local IDs for hsw+ The cross thread constant support appears on Haswell. It allows us to upload a set of uniform data for all threads without duplicating it per thread. One complication is that cross-thread constants are loaded into registers before per-thread constants. Previously, our local IDs were loaded before the uniform data and treated as 'payload' data, even though they were actually pushed into the registers like the other uniform data. Therefore, in this patch we simultaneously enable a newer layout where each thread now uses a single uniform slot for a unique local ID for the thread. This uniform is handled specially to make sure it is added last into the uniform push constant registers. This minimizes our usage of push constant registers, and maximizes our ability to use cross-thread constants for registers. To swap from the old to the new layout, we also need to flip some lowering pass switches to let our driver handle the lowering instead. We also no longer force thread_local_id_index to -1. v4: * Minimize size of patch that switches from the old local ID layout to the new layout (Jason) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	3ba9594f32	anv: Support new local ID generation & cross-thread constants The cross thread constant support appears on Haswell. It allows us to upload a set of uniform data for all threads without duplicating it per thread. We also support per-thread data which allows us to store a per-thread ID in one of the uniforms that can be used to calculate the gl_LocalInvocationIndex and gl_LocalInvocationID variables. v4: * Support the old local ID push constant layout as well (Jason) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	30685392e0	i965: Support new local ID push constant & cross-thread constants The cross thread constant support appears on Haswell. It allows us to upload a set of uniform data for all threads without duplicating it per thread. We also support per-thread data which allows us to store a per-thread ID in one of the uniforms that can be used to calculate the gl_LocalInvocationIndex and gl_LocalInvocationID variables. v4: * Support the old local ID push constant layout as well (Jason) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	d437798ace	i965: Add CS push constant info to brw_cs_prog_data We need information about push constants in a few places for the GL driver, and another couple places for the vulkan driver. When we add support for uploading both a common (cross-thread) set of push constants, combined with the previous per-thread push constant data, things are going to get even more complicated. To simplify things, we add push constant info into the cs prog_data struct. The cross-thread constant support is added as of Haswell. To support it we need to make sure all push constants with uniform values are added to earlier registers. The register that varies per thread and holds the thread invocation's unique local ID needs to be added last. For now we add the code that would calculate cross-thread constatn information for hsw+, but we force it (cross_thread_supported) off until the other parts of the driver support it. v4: * Support older local ID push constant layout as well. (Jason) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	1b79e7ebbd	i965: Store number of threads in brw_cs_prog_data Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	3ef0957dac	i965: Add nir based intrinsic lowering and thread ID uniform We add a lowering pass for nir intrinsics. This pass can replace nir intrinsics with driver specific nir lower code. We lower the gl_LocalInvocationIndex intrinsic based on a uniform which is loaded with a thread specific ID. We also lower the gl_LocalInvocationID based on gl_LocalInvocationIndex. v2: * Create variable during lowering pass. (Ken) v3: * Don't create a variable, but instead just insert an intrisic call to load a uniform from the allocated location. (Jason) v4: * Don't run this pass if thread_local_id_index < 0 Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	04fc72501a	i965: Put CS local thread ID uniform in last push register This thread ID uniform will be used to compute the gl_LocalInvocationIndex and gl_LocalInvocationID values. It is important for this uniform to be added in the last push constant register. fs_visitor::assign_constant_locations is updated to make sure this happens. The reason this is important is that the cross-thread push constant registers are loaded first, and the per-thread push constant registers are loaded after that. (Broadwell adds another push constant upload mechanism which reverses this order, but we are ignoring this for now.) v2: * Add variable in intrinsics lowering pass * Make sure the ID is pushed last in assign_constant_locations, and that we save a spot for the ID in the push constants v3: * Simplify code based with Jason's suggestions. Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	fa279dfbf0	i965: Add uniform for a CS thread local base ID v4: * Force thread_local_id_index to -1 for now, and have fs_visitor::setup_cs_payload look at thread_local_id_index. This enables us to more easily cut over from the old local ID layout to the new layout, as suggested by Jason. Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	8f48d23e0f	i965: Add nir channel_num system value v2: * simd16/32 fixes (curro) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	6f316c9d86	nir: Make lowering gl_LocalInvocationIndex optional Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jordan Justen	7b9def3583	glsl: Add glsl LowerCsDerivedVariables option v2: * Move lower flag to context constants. (Ken) Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> (v1) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2016-06-01 19:29:02 -07:00
Jason Ekstrand	1205999c22	i965/fs: Copy the offset when lowering logical pull constant sends This fixes 64 Vulkan CTS tests per gen Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=96299 Reviewed-by: Francisco Jerez <currojerez@riseup.net> Cc: "12.0" <mesa-stable@lists.freedesktop.org>	2016-06-01 16:00:44 -07:00
Dave Airlie	8d4f4adfbd	glsl/distance: make sure we use clip dist varying slot for lowered var. When lowering, we always want to use the clip dist varying. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "12.0" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com>	2016-06-02 07:09:21 +10:00
Nicolai Hähnle	c7877b9dab	winsys/amdgpu: decay max_ib_size over time So that memory use will eventually decrease again after a temporary peak. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:20 +02:00
Nicolai Hähnle	6aff6377b1	winsys/amdgpu: implement IB chaining on the gfx ring As a consequence, CE IB size never triggers a flush anymore. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:20 +02:00
Nicolai Hähnle	45be461f55	winsys/amdgpu: consolidate IB size management in amdgpu_ib_finalize Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:20 +02:00
Nicolai Hähnle	89ba076de4	radeon/winsys: introduce radeon_winsys_cs_chunk We will chain multiple chunks together and will keep pointers to the older chunks to support IB dumping. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:20 +02:00
Nicolai Hähnle	a7c26bfc0c	radeonsi/sid: add packet definitions for IB chaining While we're at it, add packet printing in si_debug. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	83a01cb498	winsys/amdgpu: start with smaller IBs, growing as necessary This avoids allocating giant IBs from the outset, especially for CE and DMA. Since we now limit max_dw only by the size that the buffer happens to be (which, due to the buffer cache, can be even larger than the rounded-up size we request), the new function amdgpu_ib_max_submit_dwords controls when we submit an IB. With this change, we effectively never flush prematurely due to the CE IB, after an initial warm-up phase. v2: - clean up buffer_size calculation Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	f80c6abb9e	winsys/amdgpu: add amdgpu_ib and amdgpu_cs_from_ib helper functions The latter function allows getting the containing amdgpu_cs from any IB (including non-main ones). Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	9e5ed559ba	winsys/amdgpu: extract IB big buffer allocation for re-use Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	9db851b5ee	winsys/amdgpu: add IB buffer in amdgpu_get_new_ib Adding the buffer when we start using it for the IB makes the logic for chaining a bit simpler. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:19 +02:00
Nicolai Hähnle	d6211a61b0	gallium/radeon: use cs_check_space throughout Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:18 +02:00
Nicolai Hähnle	46ad3561be	radeon/winsys: add cs_check_space Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:18 +02:00
Nicolai Hähnle	92d5d97b10	winsys/amdgpu: simplify interface of amdgpu_get_new_ib We'll want to have an amdgpu_cs pointer for future changes. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:18 +02:00
Nicolai Hähnle	8396ab4241	winsys/amdgpu: add amdgpu_cs_has_user_fence v2: style change Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:52:18 +02:00
Kenneth Graunke	25e1b8d366	i965: Fix isoline reads in scalar TES. Isolines aren't reversed. commit `5b2d8c2273` fixed this for the vec4 TES backend, but not the scalar one. Found while debugging GL45-CTS.tessellation_shader. tessellation_control_to_tessellation_evaluation.gl_tessLevel. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Cc: mesa-stable@lists.freedesktop.org	2016-06-01 13:46:09 -07:00
Nicolai Hähnle	ed0e9862c5	st/mesa: implement PBO downloads for ReadPixels v2: require PIPE_CAP_SAMPLER_VIEW_TARGET; technically only needed for some of the texture targets, but all hardware that has shader images should also have this cap. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:37:51 +02:00
Nicolai Hähnle	f3b62d4c74	st/mesa: hook up a no-op try_pbo_readpixels For better bisectability given that the order of some of the fallback tests in the blit path are rearranged. Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:37:48 +02:00
Nicolai Hähnle	1cb4be94ae	st/mesa: add layer_offset to PBO fragment shader This will be used to select a slice of a 3D texture. v2: fix a comment (Marek) Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:37:43 +02:00
Nicolai Hähnle	2bf6dfac8a	st/mesa: create PBO download fragment shaders Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:37:40 +02:00
Nicolai Hähnle	852d3fcd3b	st/mesa: add PBO download enable bit and fragment shaders For downloads, the fragment shader must know the source texture target, hence we may cache multiple fragment shaders. v2: break long line (Marek) Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2016-06-01 22:37:34 +02:00

... 2 3 4 5 6 ...

82384 commits