fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 05:18:08 +02:00

Author	SHA1	Message	Date
Marek Olšák	8b587ee701	gallium: add interface and state tracker support for GL_AMD_pinned_memory v2: add alignment restrictions to docs, fix indentation in headers Reviewed-by: Christian König <christian.koenig@amd.com>	2015-02-17 17:31:48 +01:00
Marek Olšák	11ebb03c26	mesa: implement GL_AMD_pinned_memory It's not possible to query the current buffer binding, because the extension doesn't define GL_..._BUFFER__BINDING_AMD. Drivers should check the target parameter of Drivers.BufferData. If it's equal to GL_EXTERNAL_VIRTUAL_MEMORY_BUFFER_AMD, the memory should be pinned. That's all there is to it. A piglit test is on the piglit mailing list. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Christian König <christian.koenig@amd.com>	2015-02-17 17:31:48 +01:00
Christian König	4fa61b1a23	winsys/radeon: add user pointer support Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2015-02-17 17:31:48 +01:00
Marek Olšák	e8625a29fe	mesa: fix AtomicBuffer typo in _mesa_DeleteBuffers Cc: 10.5 10.4 10.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com>	2015-02-17 17:31:48 +01:00
Marek Olšák	218b15715e	radeonsi: initialize TC_L2_dirty to false after buffer allocation I forgot to do this, though "true" should have no effect on correctness. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-02-17 17:31:48 +01:00
Marek Olšák	a27b74819a	radeonsi: small fix in SPI state Cc: 10.5 10.4 <mesa-stable@lists.freedesktop.org> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-02-17 17:31:48 +01:00
Marek Olšák	5f1cef76f9	r600g,radeonsi: use fences to implement PIPE_QUERY_GPU_FINISHED Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=89014 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-02-17 17:31:48 +01:00
Marek Olšák	f1103f6a1e	r600g,radeonsi: demote TIMESTAMP_DISJOINT query to be a software query The query result is always constant. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2015-02-17 17:31:48 +01:00
Dave Airlie	59292b38eb	st/glsl_to_tgsi: fix whitespace everytime I open this file in emacs with show trailing whitespace or git add from it my screen flares with red. Just do a general cleanup, makes working on fp64 support not as jarring. I'm not saying this is perfect, its just better than before. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-02-17 14:49:19 +10:00
Ilia Mirkin	b53fbec01d	glsl/tests: add IMAGE type. This fixes a warning when running make check. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Dave Airlie <airlied@redhat.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2015-02-17 11:26:06 +10:00
Chia-I Wu	faaf13f6bf	ilo: always set up BLEND_STATE on Gen8 There is now an DW0 that seems to be always referenced.	2015-02-17 04:59:33 +08:00
Chia-I Wu	6d4475d7bf	ilo: fix alpha test on Gen8 Shoudl use GEN8_BLEND_DW0_ALPHA_TEST_ENABLE instead of GEN6_RT_DW1_ALPHA_TEST_ENABLE (and others).	2015-02-17 04:59:33 +08:00
Ben Widawsky	d9cd982d55	i965/simd8vs: Fix SIMD8 atomics The short version: we need to set bits in R0.7 which provide a mask to be used for PS kill samples/pixels. Since the VS has no such concept, we just need to set all 1. The longer version... Execution for SIMD8 atomics is defined as follows: SIMD8: The low 8 bits of the execution mask are ANDed with 8 bits of the Pixel/Sample Mask from the message header. For the typed messages, the Slot Group in the message descriptor selects either the low or high 8 bits. For the untyped messages, the low 8 bits are always selected. The resulting mask is used to determine which slots are read into the destination GRF register (for read), or which slots are written to the surface (for write). If the header is not present, only the low 8 bits of the execution mask are used. The message header for untyped messages is defined in R0.7 "This field contains the 16-bit pixel/sample mask to be used for SIMD16 and SIMD8 messages. All 16 bits are used for SIMD16 messages. For typed SIMD8 messages, Slot Group selects which 8 bits of this field are used. For untyped SIMD8 messages, the low 8 bits of this field are used." Furthermore, "The message header for the untyped messages only needs to be delivered for pixel shader threads, where the execution mask may indicate pixels/samples that are enabled only due to derivative (LOD) calculations, but the corresponding slot on the surface must not be accessed." We're not using a pixel shader here, but AFAICT, this mask is used for all stages. This leaves two options, Remove the header, or make the VS code emit the correct thing for the header. I believe one of the goals of using SIMD8 VS was to get as much code reuse as possible, and so I chose the latter. Since the VS has no such thing as kill instructions, the mask is derived simple as all 1's. v2: Add a comment to the code (stolen from Curro on the mailing list) Change the control flow style (Curro + Jason) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=87258 Cc: Kristian Høgsberg <krh@bitplanet.net> Signed-off-by: Ben Widawsky <ben@bwidawsk.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com>	2015-02-16 12:22:44 -08:00
Brian Paul	9ac3700146	mesa: move assertion after declarations in texstore.c To fix MSVC build.	2015-02-16 08:39:25 -07:00
Brian Paul	4d2cee4d5e	mesa: silence uninitialized var warning in get_tex_rgba_uncompressed() Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-02-16 08:33:28 -07:00
Neil Roberts	bb77745681	meta: Fix saving the results of the current occlusion query When restoring the current state in _mesa_meta_end it was previously trying to copy the on-going sample count of the current occlusion query into the new query after restarting it so that the driver will continue adding to the previous value. This wouldn't work for two reasons. Firstly, the query might not be ready yet so the Result member will usually be zero. Secondly the saved query is stored as a pointer to the query object, not a copy of the struct, so it is actually restarting the exact same object. Copying the result value is just copying between identical addresses with no effect. The call to _mesa_BeginQuery will have always reset it back to zero. This patch fixes it by making it actually wait for the query object to be ready before grabbing the previous result. The downside of doing this is that it could introduce a stall but I think this situation is unlikely so it might not matter too much. A better solution might be to introduce a real suspend/resume mechanism to the driver interface. This could be implemented in the i965 driver by saving the depth count multiple times like it does in the i945 driver. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=88248 Reviewed-by: Carl Worth <cworth@cworth.org> Cc: "10.5" <mesa-stable@lists.freedesktop.org>	2015-02-16 12:09:17 +00:00
Francisco Jerez	946e29847b	i965/vec4: Override destination register writemask in sampler message send. This line was removed by accident in commit `16b9112574` causing a regression in the ES3-CTS.gtf.GL3Tests.shadow.shadow_execution_vert Khronos conformance test. It's necessary because the swizzle_result() code below expects all four components of the vector to be valid. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=89094 Tested-by: Lu Hua <huax.lu@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-02-16 13:51:08 +02:00
Iago Toral Quiroga	0a811e1d1e	i965: Fix a crash in the texture gradient lowering pass with cube samplers We need to swizzle the rhs to match the number of components in the writemask, otherwise we'll hit an assertion in ir_assignment. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-02-16 10:53:48 +01:00
Iago Toral Quiroga	ba426522dd	mesa: Fix element count for byte-swaps in texstore, readpix and texgetimage Some old format conversion code in pack.c implemented byte-swapping like this: GLint comps = _mesa_components_in_format(dstFormat); GLint swapSize = _mesa_sizeof_packed_type(dstType); if (swapSize == 2) _mesa_swap2((GLushort ) dstAddr, n comps); else if (swapSize == 4) _mesa_swap4((GLuint ) dstAddr, n comps); where n is the pixel count. But this is incorrect for packed formats, where _mesa_sizeof_packed_type is already returning the size of a pixel instead of the size of a single component, so multiplying this by the number of components in the format results in a larger element count for _mesa_swap than we want. Unfortunately, we followed the same implementation for byte-swapping in the rewrite of the format conversion code for texstore, readpixels and texgetimage. This patch computes the correct element counts for _mesa_swap calls by computing the bytes per pixel in the image and dividing that by the swap size to obtain the number of swaps required per pixel. Then multiplies that by the number of pixels in the image to obtain the swap count that we need to use. Also, when handling byte-swapping in texstore_rgba, we were ignoring the image's depth. This patch fixes this too. Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Cc: "10.5" <mesa-stable@lists.freedesktop.org>	2015-02-16 10:51:18 +01:00
Iago Toral Quiroga	4b249d2eed	mesa: Handle transferOps in texstore_rgba In the recent rewrite of the format conversion code we did not handle this. This patch adds the missing support. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=89068 Reviewed-by: Jason Ekstrand <jason.ekstrand@intel.com> Cc: "10.5" <mesa-stable@lists.freedesktop.org>	2015-02-16 10:49:41 +01:00
Matt Turner	a2299bfbbd	i965/fs: Handle U/UW-type immediates in the generator.	2015-02-15 14:29:08 -08:00
Matt Turner	7a83f7d481	i965/fs: Handle W/UW-type immediates in dump_instructions().	2015-02-15 14:29:08 -08:00
Matt Turner	74ef90acd7	i965: Let dump_instructions() work before calculate_cfg(). Reviewed-by: Ben Widawsky <ben@bwidawsk.net>	2015-02-15 12:24:11 -08:00
Matt Turner	fa124a337c	i965/fs: Call calculate_cfg() before optimize(). The CFG is fundamental to the FS IR, not merely a piece of optimization. Reviewed-by: Ben Widawsky <ben@bwidawsk.net>	2015-02-15 12:24:11 -08:00
Matt Turner	eb47d0efd3	i965: Optimize multiplication by -1 into a negated MOV. instructions in affected programs: 968 -> 942 (-2.69%) helped: 4 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-02-15 12:24:10 -08:00
Matt Turner	e8a6f2ad65	i965: Add an is_negative_one() method. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2015-02-15 12:24:10 -08:00
Matt Turner	72b9f8db2a	i965/vec4/vp: Use vec4_visitor::CMP. ... instead of emit(BRW_OPCODE_CMP, ...). In commit `6b3a301f` I changed vec4_visitor::CMP to set the destination's type to that of src0. In the following commit (`2335153f`) I removed an apparently now unnecessary work around for Gen8 that did the same thing. But there was a single place that emitted a CMP instruction without using the vec4_visitor::CMP function. Use it there. And change dst_null_d to dst_null_f for good measure, since ARB vp doesn't have integers. Cc: "10.5" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=89032 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-02-15 12:24:10 -08:00
Chia-I Wu	69b1693ef3	ilo: fix some state pointer commands on Gen8 3DSTATE_CC_STATE_POINTERS seems to be ignored when bit 0 of DW1 is not set. Follow i965 and set the bit for 3DSTATE_CC_STATE_POINTERS and 3DSTATE_BLEND_STATE_POINTERS. Add gen checks for all state pointer commands.	2015-02-15 13:32:41 +08:00
Ilia Mirkin	854eb06bee	nvc0: allow holes in xfb target lists Tested with a modified xfb-streams test which outputs to streams 0, 2, and 3. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.4 10.5" <mesa-stable@lists.freedesktop.org>	2015-02-14 17:15:54 -05:00
Ilia Mirkin	80d373ed5b	st/mesa: treat resource-less xfb buffers as if they weren't there If a transform feedback buffer's size is 0, st_bufferobj_data doesn't end up creating a buffer for it. There's no point in trying to write to such a buffer, so just pretend as if it's not really there. This fixes arb_gpu_shader5-xfb-streams-without-invocations on nvc0. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: "10.4 10.5" <mesa-stable@lists.freedesktop.org>	2015-02-14 17:15:54 -05:00
Ilia Mirkin	68e4f3f572	nvc0: bail out of 2d blits with non-A8_UNORM alpha formats This fixes the teximage-colors uploads with GL_ALPHA format and non-GL_UNSIGNED_BYTE type. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Cc: "10.4 10.5" <mesa-stable@lists.freedesktop.org>	2015-02-14 17:15:54 -05:00
Jason Ekstrand	3c57a59527	i965/nir: Don't support gl_FrontFacing as an input variable Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-02-14 13:47:16 -08:00
Jason Ekstrand	dd110cdfd8	nir: Make gl_FrontFacing a system_value GLSL IR labels gl_FrontFacing as an input variable and not a system value. This commit makes NIR silently translate gl_FrontFacing to a system value so that it properly gets translated into a load_system_value intrinsic. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-02-14 13:47:16 -08:00
Jason Ekstrand	785b22caee	i965/nir: Add support for nir_intrinsic_load_front_face Reviewed-by: Matt Turner <mattst88@gmail.com>	2015-02-14 13:47:16 -08:00
Jason Ekstrand	929f43851e	nir/lower_phis_to_scalar: Fix some logic in is_phi_scalarizable Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2015-02-14 13:46:59 -08:00
Shawn Starr	7df256add2	clover: Use Legacy PassManager for LLVM trunk (3.7) Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Shawn Starr <shawn.starr@rogers.com>	2015-02-14 01:31:57 +00:00
Chia-I Wu	8323796840	ilo: fix JIP/UIP on Gen8 UIP is in DW2 and JIP is in DW3 on Gen8. Also, the units are in bytes.	2015-02-14 06:52:36 +08:00
Chia-I Wu	c62507f42c	ilo: do not set GEN6_THREADCTRL_SWITCH It is not needed on Gen6+, and it appears to be broken on Gen8.	2015-02-14 06:52:36 +08:00
Chia-I Wu	7504b357d4	ilo: correct ISA UIP/JIP decoding for Gen8 JIP is int32_t and UIP is in DW2 on Gen8.	2015-02-14 06:52:36 +08:00
Chia-I Wu	f8126fed95	ilo: prepare for 64-bit immediates decoding Replace imm32 by imm64. Add more ways (UD, D, etc) to access the immediate.	2015-02-14 06:52:36 +08:00
Chia-I Wu	9ed376a76c	ilo: cleanup ISA DW1 decoding Decode the higher and lower 16 bits separately.	2015-02-14 06:52:36 +08:00
Chia-I Wu	db362983d1	ilo: cleanup ISA DW0 decoding Add disasm_inst_decode_dw0_opcode_gen6() to decode the opcode. Simplify branch_ctrl/acc_wr_ctrl decoding.	2015-02-14 06:52:36 +08:00
Chia-I Wu	5fc0dd8953	ilo: update some outdated gen checks Update gen checks for 3DSTATE_POLY_STIPPLE_OFFSET, 3DSTATE_POLY_STIPPLE_PATTERN, 3DSTATE_LINE_STIPPLE, and 3DSTATE_AA_LINE_PARAMETERS.	2015-02-14 06:52:36 +08:00
Chia-I Wu	8b9446dbeb	ilo: fix rectlist length on Gen8 5 PIPE_CONTROLs, 2 3DSTATE_WM_HZ_OP, and depth buffer setup require 65 DWords.	2015-02-14 06:52:36 +08:00
Chia-I Wu	baba8b2745	ilo: fix 3DSTATE_VF_TOPOLOGY The pipe primitive type was wrongly translated twice.	2015-02-14 06:52:36 +08:00
Jose Fonseca	c944b91190	os,llvmpipe: Set rasterizer thread names on Linux. To help identify llvmpipe rasterizer threads -- especially when there can be so many. We can eventually generalize this to other OSes, but for that we must restrict the function to be called from the current thread. See also http://stackoverflow.com/a/7989973 Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-02-13 19:42:21 +00:00
Jose Fonseca	b09f25428f	uti/u_atomic: Don't test p_atomic_add with booleans. Add another class of tests. Fixes https://bugs.freedesktop.org/show_bug.cgi?id=89112 I failed to spot this in my previous change, because bool was a typedef for char on the system I tested. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2015-02-13 19:39:27 +00:00
Tapani Pälli	e333035c47	mesa: fix OES_texture_float texture render target behavior Current implementation allowed usage of unsized type texture GL_FLOAT and GL_HALF_FLOAT as a render target as this was 'expected behavior' by WEBGL_oes_texture_float and is also allowed by the oes-texture-float WebGL test. However this broke some ES3 conformance tests that do not accept such behavior. Patch sets such an fbo incomplete as expected by the ES3 conformance tests. Textures with sized types like RGBA32F will still continue to work as render targets. v2: code style cleanups (Ian Romanick, Matt Turner) Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=88905 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Cc: "10.5" <mesa-stable@lists.freedesktop.org>	2015-02-13 07:51:13 +02:00
Eric Anholt	3f1e1287fd	vc4: Make SF be a flag on the QIR instructions. Right now the places that used to emit a mov.sf just put the SF on the previous instruction when it generated the source of the SF value. Even without optimization to push the sf up further (and kill thus potentially kill more MOVs), this gets us: total uniforms in shared programs: 13455 -> 13457 (0.01%) uniforms in affected programs: 3 -> 5 (66.67%) total instructions in shared programs: 40296 -> 40198 (-0.24%) instructions in affected programs: 12595 -> 12497 (-0.78%)	2015-02-12 16:33:16 -08:00
Eric Anholt	4413861dd8	r200: Drop unused variable. Quiets compiler warning since `e7f2f2dea5`. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2015-02-12 16:33:16 -08:00

1 2 3 4 5 ...

68111 commits