fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2025-12-22 07:00:12 +01:00

Author	SHA1	Message	Date
Andreas Hartmetz	3a4b87511e	radeonsi: Rename r600->si remaining identifiers in si_resource.c. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	5d068f734c	radeonsi: Rename r600->si remaining identifiers in si_query.c. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	eb0ddb6d5b	radeonsi: Rename r600->si remaining identifiers in si_pipe.c. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	238427625f	radeonsi: Rename r600->si remaining identifier in si_hw_context.c. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	3160aa4877	radeonsi: Rename radeonsi->si remaining identifiers in si_compute.c. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	7b7eb4dd1f	radeonsi: Rename r600->si remaining identifiers in si_blit.c. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	45578def71	radeonsi: Rename r600->si for functions in si_pipe.h. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	280c360c02	radeonsi: Rename r600->si for functions in si.h. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	f2a21ed8b9	radeonsi: Rename r600->si for functions in si_resource.h. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	a88f46bc9b	radeonsi: Rename r600->si for structs in si_resource.h. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	3e81883a42	radeonsi: Rename r600->si for structs in si.h. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	238aeabce0	radeonsi: Rename r600->si for structs in si_pipe.h. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Andreas Hartmetz	786af2f963	radeonsi: Apply si_* file naming scheme. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2014-01-14 00:07:13 +01:00
Marek Olšák	df918b5b90	r600g: fix glClearBuffer by handling PIPE_CLEAR_COLORi flags correctly also restructure the code	2014-01-13 15:48:08 +01:00
Marek Olšák	6e98a17551	r600g: handle NULL colorbuffers correctly on R600-R700	2014-01-13 15:48:08 +01:00
Marek Olšák	07032d4068	r600g: handle NULL colorbuffers correctly on Evergreen	2014-01-13 15:48:08 +01:00
Marek Olšák	a86de9a72f	radeonsi: handle NULL colorbuffers correctly Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-01-13 15:48:08 +01:00
José Fonseca	9b96be595b	llvmpipe: Honour pipe_rasterizer::point_quad_rasterization. Commit `eda21d2a30` fixed the rasterization of points for Direct3D but ended up breaking the rasterization of OpenGL non-sprite points, in particular conform's pntrast.c test. The only way to get both working is to properly honour pipe_rasterizer::point_quad_rasterization, and follow the weird OpenGL rule when it is false. Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-01-09 12:35:11 +00:00
Thomas Sondergaard	8fcddd325c	mesa: Work around internal compiler error This small rearrangement avoids MSVC 2013 ICE. Also, this should be a better memory access order. Cc: "10.0" <mesa-stable@lists.freedesktop.org> Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com>	2014-01-08 17:33:06 -07:00
Rob Clark	646c16af6e	freedreno: add basic query support Add for now some simple/basic query support (ie. things not actually requiring the GPU). Might change around a bit when I actually add GPU queries, but for now this enables some useful performance info in the GALLIUM_HUD. For example: GALLIUM_HUD=fps+batches+batches-sysmem+batches-gmem+restores,draw-calls The driver specific specific queries are: + draw-calls + batches - number of batches per second, sum of batches-sysmem plus batches-gmem + batches-gmem - render a set of tiles in GMEM, for each tile (optionally) system mem -> gmem (restore), plus N draws, plus gmem -> system mem (resolve) per second + batches-sysmem - N draws to system memory (GMEM bypass) per second + restores - number of GMEM batches that required restore per second Ideally for GMEM rendering, you want batches-gmem to equal fps. If the app is doing something that triggers multiple passes (ie. requires extra round trip gmem <-> system memory) then the # of batches per second will go up relative to fps. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-01-08 16:30:18 -05:00
Rob Clark	725d736f6a	freedreno/a3xx: use cs patch instead of RFI+RMW Since we now have the cmdstream patch mechanism needed for hw binning, might as well also use it for RB_RENDER_CONTROL updates. This avoids the need to use RMW (and associated WFI) to update RB_RENDER_CONTROL. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-01-08 16:30:18 -05:00
Rob Clark	c0766528ba	freedreno/a3xx: support for hw binning pass The binning pass sorts vertices into which bins/tiles they apply to. The visibility information generated during the binning pass can be used to speed up the rendering pass by filtering out vertices which do not apply to the current tile. See: https://github.com/freedreno/freedreno/wiki/Adreno-tiling#optimized-approach This brings a significant fps boost. A rough assortment of tests (supertuxkart, etracer, tremulous, glmark2 'build' test, etc) seems to yield a ~35-45% fps improvement. For now, to be conservative, the binning pass is not enabled yet by default. To enable it use: FD_MESA_DEBUG=binning So far I haven't found anything that breaks with binning enabled, but I'd like a bit more testing before I enable it as default. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-01-08 16:30:18 -05:00
Rob Clark	bfb44c24bc	freedreno: be more clever about gmem usage Only need to leave room for depth/stencil if it is actually used, etc. Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-01-08 16:30:18 -05:00
Rob Clark	42c5e2a2ed	freedreno: resync generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2014-01-08 16:30:18 -05:00
José Fonseca	eda21d2a30	llvmpipe: Fix the bottom_edge_rule adjustment for points. The adjustment needs to be applied to the y coordinates and not the x coordinates, just like the equivalent code for lines and triangles in lp_setup_line.c and lp_setup_tri.c. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Zack Rusin <zackr@vmware.com>	2014-01-08 12:18:17 +00:00
José Fonseca	37de6b0682	llvmpipe: Respect bottom_edge_rule when computing the rasterization bounding boxes. This was inadvertently forgotten when replacing gl_rasterization_rules with lower_left_origin and half_pixel_center (commit `2737abb44e`). This makes a difference when lower_left_origin != half_pixel_center, e.g, D3D10. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Zack Rusin <zackr@vmware.com>	2014-01-08 12:18:17 +00:00
Chia-I Wu	76edf44f9e	ilo: enable HiZ The support is still early. Fast depth buffer clear is not enabled yet. HiZ can be forced off with ILO_DEBUG=nohiz.	2014-01-08 18:11:36 +08:00
Chia-I Wu	e7b4219e22	ilo: resolve Z/HiZ correctly When the depth buffer is to be read, perform a Depth Buffer Resolve if it has been rendered. When the depth buffer is to be rendered, perform a HiZ Buffer Resolve when the depth buffer is modified externally.	2014-01-08 18:11:35 +08:00
Chia-I Wu	77e3db464f	ilo: add flags to texture slices The flags are used to mark who (CPU, BLT, or RENDER) has accessed the resource and how (READ or WRITE).	2014-01-08 18:11:35 +08:00
Chia-I Wu	846f70a6ef	ilo: rename and add an accessor for texture slices Rename ilo_texture::slice_offsets to ilo_texture::slices and add an accessor, ilo_texture_get_slice().	2014-01-08 18:11:35 +08:00
Chia-I Wu	127fbc086b	ilo: add HiZ op support to the pipelines Add blitter functions to perform Depth Buffer Clear, Depth Buffer Resolve, and Hierarchical Depth Buffer Resolve. Those functions set ilo_blitter up and pass it to the pipelines to emit the commands.	2014-01-08 18:11:35 +08:00
Chia-I Wu	546416d495	ilo: add support for HiZ allocation Add tex_create_hiz() to create HiZ bo. It is not really called yet.	2014-01-08 18:11:35 +08:00
Chia-I Wu	e372819589	ilo: refactor separate stencil allocation Move separate stencil allocation code to tex_create_separate_stencil to keep tex_create sane.	2014-01-08 18:11:35 +08:00
Chia-I Wu	82676f5d34	ilo: assorted GPE fixes for HiZ Allow HiZ op to be specified in 3DSTATE_WM. Pass depth format directly in gen7_emit_3DSTATE_SF. Use tex->hiz.bo to determine if HiZ exists. Fix 3DSTATE_SF for the case when there is no ilo_rasterizer_state. Fix 3DSTATE_PS for the case when there is no ilo_shader_state.	2014-01-08 18:11:35 +08:00
Chia-I Wu	6642381e75	ilo: no layer offsetting on GEN7+ Even though the Ivy Bridge PRM lists some restrictions that require layer offsetting as the Sandy Bridge PRM does, it seems they are actually lifted.	2014-01-08 18:11:34 +08:00
Chia-I Wu	011fde4bf2	ilo: offset to layers only when necessary GEN6 has several requirements regarding the LOD/Depth/Width/Height of the render targets and the depth buffer. We used to offset to the layers in question unconditionally to meet the requirements. With this commit, offseting is done only when the requirements are not met.	2014-01-08 18:11:34 +08:00
Chia-I Wu	0a2a221d01	ilo: allow ilo_zs_surface to skip layer offsetting Make offset to layer optional in ilo_gpe_init_zs_surface.	2014-01-08 18:11:34 +08:00
Chia-I Wu	8d9f5d57e2	ilo: allow ilo_view_surface to skip layer offsetting Make offset to layer optional in ilo_gpe_init_view_surface_for_texture. render_cache_rw is always the same as is_rt and is replaced.	2014-01-08 18:11:34 +08:00
José Fonseca	2d368b982a	llvmpipe: Basic implementation of pipe_context::set_sample_mask. We don't support MSAA (ie, number of samples is always one) therefore sample_mask boils down to a synonym of the rasterizer_discard flag. Also, this change makes setup actually use the value received in lp_setup_set_rasterizer_discard instead of reaching out to llvmpipe upper layers to re-fetch it. Based on Si Chen's draft. With this patch `wgf11multisample Coverage passes 100%` on the UMD D3D10 state tracker. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Si Chen <sichen@vmware.com>	2014-01-07 16:04:42 +00:00
Si Chen	72c6d0e506	llvmpipe: Implement alpha_to_coverage for non-MSAA framebuffers. Implement Alpha to Coverage by discarding a fragment alpha component is less than 0.5. This is a joint work of Jose and Si. Reviewed-by: José Fonseca <jfonseca@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com>	2014-01-07 16:04:42 +00:00
Marek Olšák	346b6abab9	radeonsi: calculate NUM_BANKS for DB correctly on CIK NUM_BANKS is not constant on CIK. Reviewed-by: Alex Deucher <alexander.deucher@amd.com>	2014-01-06 18:40:42 +01:00
Marek Olšák	bf3c361113	radeonsi: set correct pipe config for Hawaii in DB Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-01-06 18:40:42 +01:00
Marek Olšák	2748b7da7e	radeonsi: disable HTILE for 1D-tiled depth-stencil buffers Reviewed-by: Michel Dänzer <michel.daenzer@amd.com>	2014-01-06 18:40:41 +01:00
Rob Clark	8ab47b4353	freedreno/a3xx: fix blend state corruption issue Using RMW on banked context registers is not safe. The value read could be the wrong one. So if there has been a DRAW_IDX launched, the RMW must be preceded by a WAIT_FOR_IDLE to ensure the read part of RMW sees the correct value. To avoid unnecessary WFI's, keep track if there is a need for WFI, and only emit one if needed. Furthermore, keep track if we even need to update the register in the first place. And to cut down on the amount of RMW to avoid excessive WFI's, at the tiling/GMEM level we can always overwrite RB_RENDER_CONTROL, as the state at beginning of draw/clear cmds (which we IB to) is always undefined. In the draw/clear commands, we always still use RMW (with WFI if needed), but only if the register value actually changes. (At points where the current value cannot be known, the saved value is reset to ~0, which includes bits outside of RBRC_DRAW_STATE, so there never is chance for confusion.) Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-12-26 12:13:42 -05:00
Rob Clark	be01d7a905	freedreno: prepare for hw binning Actually assign VSC_PIPE's properly, which will be needed for tiling. And introduce fd_tile for per-tile state (including the assignment of tile to VSC_PIPE). This gives us the proper pipe setup that we'll need for hw binning pass, and also cleans things up a bit by not having to pass so many parameters around. And will also make it easier to introduce different tiling patterns (since we may no longer render tiles in a simple left-to-right top-to-bottom pattern). Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-12-26 12:06:29 -05:00
Rob Clark	64fe067066	freedreno: resync generated headers Signed-off-by: Rob Clark <robclark@freedesktop.org>	2013-12-26 12:06:29 -05:00
Aaron Watry	3ddabe0d52	r600/pipe: Stop leaking context->start_compute_cs_cmd.buf on EG/CM Found while tracking down memory leaks in VDPAU playback Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-12-23 07:24:50 -06:00
Aaron Watry	767b0f82c3	radeon/llvm: Free target data at end of optimization Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-12-23 07:24:50 -06:00
Aaron Watry	0bd858d7ff	r600/compute: Use the correct FREE macro when deleting compute state Reviewed-by: Tom Stellard <thomas.stellard@amd.com> CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-12-23 07:24:50 -06:00
Aaron Watry	e19717d075	r600/compute: Free compiled kernels when deleting compute state v2: Remove unnecessary null pointer check CC: "10.0" <mesa-stable@lists.freedesktop.org>	2013-12-23 07:24:50 -06:00

... 5 6 7 8 9 ...

11465 commits