fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-28 18:38:21 +02:00

Author	SHA1	Message	Date
Tom Stellard	f8ba0f55d3	configure.ac: Use AX_GCC_BUILTIN to check availability of __builtin_bswap32 v2 v2: - Remove unnecessary AC_SUBST Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-24 12:56:26 -08:00
Emil Velikov	73b46136b0	targets/opencl: resolve undefined symbols at link time Current automake build does not try to resolve undefined symbols thus we could end up with a broken library. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2014-02-24 14:59:39 +00:00
Emil Velikov	1ad9534337	gallium/targets: resolve undefined reference to pipe_loader_sw_probe_dri With the introduction of the pipe_loader_sw_probe_dri helper we require the sw/dri winsys during linking stage despite it being unused by any of the targets. This will cause a minor increase in the resulting library which will be cleaned up via linker options with upcoming patches. v2: Link with libswdri.la only when available. Reported-and-tested-by: Tom Stellard <thomas.stellard@amd.com> (v1) Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-02-24 14:59:34 +00:00
Emil Velikov	3445e8bb92	pipe-loader: wrap pipe_loader_sw_probe_xlib within HAVE_PIPE_LOADER_XLIB The above function implies using the the xlib winsys, which has additional library dependencies that should not be forced. Make the software xlib pipe loader optional thus avoid all the dependency hell. A user that wishes to use the particular pipe-loader would need to set the following within configure.ac. enable_gallium_xlib_loader=yes v2: - Wrap sw/xlib/xlib_sw_winsys.h to handle compilation on systems lacking X11 headers. Spotted by Christian Prochaska. Tested-by: Tom Stellard <thomas.stellard@amd.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=75356 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-02-24 14:52:27 +00:00
Emil Velikov	0e7c30233f	targets/gbm: exit gracefully if pipe_loader_drm_probe_fd is not available When one builds without gallium_drm_loader, the above function will not be available, thus we'll segfault in gallium_screen_create due to memory access violation. Tested-by: Tom Stellard <thomas.stellard@amd.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=75335 Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-02-24 14:51:45 +00:00
Rob Clark	3f7239ca0e	freedreno/a3xx/compiler: half-precision output Using generic shaders caused a measurable fps drop, which was isolated to use of full precision (vs half precision) output. This is an attempt to regain that lost performance by using half precision solid/blit shaders (when the output format is not float32). Note: for the built-in shaders, I would not expect them to be register starved. And in fact it is the solid frag shader that seems to have the biggest impact. So I suspect you get double the pixel pipe units (or half the cycles) when the output is half precision. So there may be some gain to using half precision output for application shaders as well, even though the rest of register usage is still full precision. But for half precision to work for more complex shaders, we need to deal with some constraints, like cat2 needing same precision for it's two src registers. So for now it is not enabled by default except for the built-in shaders. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-23 14:58:24 -05:00
Rob Clark	141ae71671	freedreno/a3xx: add shader variants Start putting in place infrastructure to deal with multiple shader variants. Initially we'll use this for two sided color (frag) and binning pass (vert) shaders. Possibly need for others later (such as YUV vs RGB eglImage?). Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-23 14:58:23 -05:00
Rob Clark	9bbfae6265	freedreno/a3xx/compiler: collapse nop's with repeat Easier than making more extensive use of rpt, and the more compact shaders seem to bring some bit of performance boost. (Perhaps repeat flag benefits are more than just instruction cache, possibly it saves on instruction decode as well?) Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-23 14:58:23 -05:00
Rob Clark	bb255fdf06	freedreno/a3xx: drop hand-coded blit/solid shaders Instead in the common code, construct these shaders from TGSI. For now we let a2xx keep it's hand coded shaders, as it's compiler isn't quite up to the job yet. All the same it is a net drop in code size and gets rid of special cases. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-23 14:58:23 -05:00
Rob Clark	1c953b7cda	freedreno/lowering: cleanup api Make things configurable, and tweak the API a bit to avoid an extra tgsi_shader_scan(). Getting closer to something generic which can be moved out of freedreno and shaderd by other drivers. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-23 14:58:23 -05:00
Rob Clark	67cea4b32a	freedreno/a3xx: add float 16 and 32bit formats Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-23 14:58:23 -05:00
Rob Clark	e819885b99	freedreno: resync generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-23 14:58:23 -05:00
Ilia Mirkin	6152ba0894	nv50: make sure to clear _all_ layers of all attachments Unfortunately there's only one RT_ARRAY_MODE setting for all attachments, so clears were previously truncated to the minimum number of layers any attachment had. Instead set the RT_ARRAY_MODE to 512 (the max number of layers) before doing the clear. This fixes gl-3.2-layered-rendering-clear-color-mismatched-layer-count. Also fix clears of individual layered rt/zeta, in case it ever happens. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Christoph Bumiller <e0425955@student.tuwien.ac.at> Cc: 10.1 <mesa-stable@lists.freedesktop.org>	2014-02-22 18:42:31 -05:00
Chia-I Wu	d5cbd73d21	ilo: fix and enable fast depth clear Use tex->bo_format instead of zs->format in ilo_blitter_rectlist_clear_zs() because the latter may be combined depth/stencil format. hiz_can_clear_zs() is no-op for GEN7+, but move the GEN check so that the assertions are tested. Finally, call the fast depth clear function from ilo_clear().	2014-02-22 22:45:13 +08:00
Chia-I Wu	f57bddc7e4	ilo: add slice clear value It is needed for 3DSTATE_CLEAR_PARAMS, and can also be used to track what value the slice has been cleared to.	2014-02-22 22:45:13 +08:00
Chia-I Wu	4afb8a7fb5	ilo: better readability and doc for texture flags Improve comments for the flags, and explicitly separate their uses in slice flags and resolve flags.	2014-02-22 22:45:13 +08:00
Chia-I Wu	cb8a0d2be1	ilo: fix for stencil only rectlist ops 3DSTATE_STENCIL_BUFFER inherits some states from 3DSTATE_DEPTH_BUFFER. We need to emit both even the surface is stencil only.	2014-02-22 22:45:13 +08:00
Chia-I Wu	409add30b3	ilo: fix a false assertion failure on GEN6 Layer offsetting is possible when it is level 0, layer 0.	2014-02-22 22:45:12 +08:00
Chia-I Wu	e7307fe708	ilo: pipe_texture::usage is not a bitfield It happens to work because PIPE_USAGE_STAGING is 0x100.	2014-02-22 22:45:12 +08:00
Chia-I Wu	f8d19a58dc	ilo: set ILO_TEXTURE_CPU_WRITE for imported textures Assume the bo has been written by another process, which will trigger a HiZ resolve.	2014-02-22 22:45:12 +08:00
Christoph Bumiller	1f4bfb8797	nv50/ir/ra: fix SpillCodeInserter::offsetSlot usage We were turning non-memory spill slots into NULL. Cc: 10.1 <mesa-stable@lists.freedesktop.org>	2014-02-22 13:17:23 +01:00
Vinson Lee	079773d1cb	libgl-xlib: Fix xlib_sw_winsys.h include path. This patch fixes this SCons build error introduced with commit `4f37e52f37`. Compiling src/gallium/targets/libgl-xlib/xlib.c ... src/gallium/targets/libgl-xlib/xlib.c:35:42: fatal error: state_tracker/xlib_sw_winsys.h: No such file or directory #include "state_tracker/xlib_sw_winsys.h" ^ Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=75347 Signed-off-by: Vinson Lee <vlee@freedesktop.org>	2014-02-21 19:56:17 -08:00
Emil Velikov	dcbf404c0d	pipe-loader: introduce pipe_loader_sw_probe_null helper function v2: Handle null_sw_create failure, add missing function return type Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> (v1)	2014-02-22 03:26:29 +00:00
Emil Velikov	969e8d15b7	pipe-loader: introduce pipe_loader_sw_probe_dri helper Will be used in the following commits. v2: Link gallium tests against the library. v3: Handle dri_create_sw_winsys failure v4: Rebase on top of the targets/xa changes Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> (v2)	2014-02-22 03:26:29 +00:00
Emil Velikov	cc3aeacab6	pipe-loader: introduce pipe_loader_sw_probe_xlib helper Will be used in the upcoming patches. v2: handle xlib_create_sw_winsys failure, drop unneeded header Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> (v1)	2014-02-22 03:26:29 +00:00
Emil Velikov	6325fdd6cf	pipe-loader: use bool type for pipe_loader_drm_probe_fd() v2: Rebase on top of the rendernode changes. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> (v1) Reviewed-by: Francisco Jerez <currojerez@riseup.net> (v1)	2014-02-22 03:26:29 +00:00
Emil Velikov	4f37e52f37	winsys/xlib: move xlib_create_sw_winsys within the winsys v2: Rebase on top of vl_winsys_xsp.c removal Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> (v1)	2014-02-22 03:26:28 +00:00
Emil Velikov	b4e8572bca	pipe-loader: handle memory allocation failure Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com>	2014-02-22 03:26:28 +00:00
Emil Velikov	1fb750f7f7	pipe-loader: build pipe_loader_drm_x_auth whenever HAVE_PIPE_LOADER_XCB is defined Currently HAVE_PIPE_LOADER_XCB is defined, rather than being set to 1/0. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2014-02-22 03:26:28 +00:00
Emil Velikov	ed092a8e1f	pipe-loader: destroy sw_winsys on sw_release The sw pipe-loader implicitly handles winsys_create, thus we it would make sense to implicitly destroy it upon releasing the loader. Currently we leak the sw_winsys when releasing the pipe-loader. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2014-02-22 03:26:28 +00:00
Emil Velikov	636ac989b2	vl/winsys_dri: cleanup vl_screen_create error path Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2014-02-22 03:26:27 +00:00
Emil Velikov	0c9912b266	targets/pipe-loader: link pipe-nouveau against libdrm Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Jakob Bornecrantz <jakob@vmware.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net>	2014-02-22 03:26:27 +00:00
Francisco Jerez	9b2fe7cf96	clover: Unabbreviate a few data accessor names for consistency. Tested-by: Tom Stellard <thomas.stellard@amd.com>	2014-02-21 12:51:23 +01:00
Francisco Jerez	a0d99937a0	clover: Replace the transfer(new ...) idiom with a safer create(...) helper function. Tested-by: Tom Stellard <thomas.stellard@amd.com>	2014-02-21 12:51:22 +01:00
Francisco Jerez	c4578d2277	clover: Migrate a bunch of pointers and references in the object tree to smart references. Tested-by: Tom Stellard <thomas.stellard@amd.com>	2014-02-21 12:51:22 +01:00
Francisco Jerez	d82b39ce38	clover: Allow storing a range into a container of different (but compatible) element type. Tested-by: Tom Stellard <thomas.stellard@amd.com>	2014-02-21 12:51:22 +01:00
Francisco Jerez	1b9fb2fd91	clover: Define an intrusive smart reference class. Tested-by: Tom Stellard <thomas.stellard@amd.com>	2014-02-21 12:51:22 +01:00
Francisco Jerez	9ae0bd3829	clover: Some improvements for the intrusive pointer class. Define some additional convenience operators, clean up the implementation slightly, and rename it to 'intrusive_ptr' for reasons that will be obvious in the next commit. Tested-by: Tom Stellard <thomas.stellard@amd.com>	2014-02-21 12:51:22 +01:00
Francisco Jerez	198cd136b9	clover: Fix up NULL constant pointer arguments. Tested-by: Tom Stellard <thomas.stellard@amd.com>	2014-02-21 12:29:05 +01:00
Jordan Justen	c97763ca2d	tgsi_ureg: add property_gs_invocations Fixes a build break in state_tracker/st_program.c Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=75278 Reviewed-by: Dave Airlie <airlied@redhat.com>	2014-02-20 16:41:01 -08:00
Roland Scheidegger	b2b2a2c06c	gallivm: add smallfloat to float conversion not relying on cpu denorm handling The previous code relied on cpu denorm support for converting small float formats (such r11g11b10_float and r16_float) to floats, otherwise denorms are flushed to zero. We worked around that in llvmpipe blend code by reenabling denorms, but this did nothing for texture sampling. Now it would be possible to reenable it there too but I'm not really a fan of messing with fpu flags (and it seems we can't actually do it reliably with llvm in any case looking at some bug reports). (Not to mention if you actually have a lot of denorms in there, you can expect some order-of-magnitude slowdown with x86 cpus.) So instead use code which adjusts exponents etc. directly hence not relying on cpu denorm support for the rescaling mul. (We still need the fpu flag handling as we can't do float-to-smallfloat without using cpu denorms at least for now - I actually wanted to keep both the old and new code and using one or the other depending on from where it's called but that didn't work out as the parameter would have to be passed through too many layers than I'd like.) Reviewed-by: Zack Rusin <zackr@vmware.com> Reviewed-by: Si Chen <sichen@vmware.com>	2014-02-20 18:41:42 +01:00
Leo Liu	0206f0b3d4	st/omx/enc: add multi scaling buffers for performance improvement Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2014-02-20 13:34:16 +01:00
Christian König	754fa3a0d2	st/omx/dec/h264: fix prevFrameNumOffset handling Signed-off-by: Christian König <christian.koenig@amd.com>	2014-02-20 13:34:06 +01:00
Rob Clark	9186cd39d4	freedreno: tweak ringbuffer sizes/count Since we are now consuming two ringbuffers at a time, we probably want a pool larger than 4.. but we don't need each individual ringbuffer to be so large, so offset the pool size increase by reducing rb size. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-19 12:02:57 -05:00
Rob Clark	5993723471	freedreno/a3xx/compiler: scheduling/legalize fixes It seems the write-after-read hazard that applies to texture fetch instructions, also applies to sfu instructions. Also, cat5/cat6 instructions do not have a (ss) bit, so in these cases we need to insert a dummy nop instruction with (ss) bit set. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-02-19 12:01:26 -05:00
Michel Dänzer	cf0172d46a	r600g,radeonsi: Consolidate logic for short-circuiting flushes Fixes radeonsi emitting command streams to the kernel even when there have been no draw calls before a flush, potentially powering up the GPU needlessly. Incidentally, this also cuts the runtime of piglit gpu.py in about half on my Kaveri system, probably because an X11 client going away no longer always results in a command stream being submitted to the kernel via glamor. Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=65761 Cc: "10.1" mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-02-18 10:46:23 +09:00
Emil Velikov	adad8fb2e9	st/dri: remove #ifdef DRM_CAP_PRIME guard Required for libdrm 2.4.37 and earlier. Both scons and automake require version 2.4.38 now so that guard is not longer needed. Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2014-02-18 00:08:26 +00:00
Kusanagi Kouichi	d23f9e3390	targets/vdpau: Don't link unused libraries libvdpau, libselinux and libexpat are not used. Signed-off-by: Kusanagi Kouichi <slash@ac.auone-net.jp>	2014-02-17 21:14:17 +00:00
Kusanagi Kouichi	61f6cddef7	targets/vdpau: Always use c++ to link If built without llvm, the following error occurs with mplayer: Failed to open VDPAU backend .../libvdpau_r600.so: undefined symbol: _ZTVN10__cxxabiv117__class_type_infoE [vo/vdpau] Error when calling vdp_device_create_x11: 1 Cc: <mesa-stable@lists.freedesktop.org> Signed-off-by: Kusanagi Kouichi <slash@ac.auone-net.jp> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com>	2014-02-17 21:14:16 +00:00
Ilia Mirkin	6958fb341f	st/xvmc: fix tests so that they pass Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Christian König <christian.koenig@amd.com>	2014-02-16 23:21:57 -05:00

1 2 3 4 5 ...

20249 commits