fdo-mirrors/mesa

mirror of https://gitlab.freedesktop.org/mesa/mesa.git synced 2026-05-05 09:38:07 +02:00

Author	SHA1	Message	Date
Caio Oliveira	f5149e025e	intel/blorp: Explicitly include brw_compiler.h header Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	4eb63a0137	intel/blorp: Move brw specific code to a separate file Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	4866649052	intel/elk: Create separate header for opcodes This allow us to not include elk_eu_defines.h (mostly internal information) in elk_isa_info.h (that's used by other modules). This will help when using both brw and elk in Iris later. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	77f52417c5	intel/elk: Don't include elk_eu_defines.h in elk_nir.h Those definitions are not needed by the drivers (client code), so reduce the exposure. This will help when using both brw and elk in Iris later. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	6648e0ebd3	intel/elk: Rename symbols Either replace the BRW prefix with ELK or add an extra ELK prefix. Used the following sed script to perform the renames in this patch: ``` # Simple prefix changes. s/\<BRW_/ELK_/g s/\<brw_/elk_/g s/nir_to_brw/nir_to_elk/g s/\<as_brw_reg\>/as_elk_reg/g s/\<_brw_/_elk_/g # Add prefix to various symbols. # # Initially I've considered using C++ namespaces here, but in various # cases the structs or functions had to be also visible from C code. # So added explicit prefix instead. s/\<backend_instruction/elk_\0/g s/\<backend_reg/elk_\0/g s/\<backend_shader/elk_\0/g s/\<bblock_t\>/elk_\0/g s/\<bblock_link\>/elk_\0/g s/\<cfg_t\>/elk_\0/g s/\<fs_visitor\>/elk_\0/g s/\<fs_reg\>/elk_\0/g s/\<fs_instruction_scheduler\>/elk_\0/g s/\<vec4_instruction_scheduler\>/elk_\0/g s/\<instruction_scheduler\>/elk_\0/g s/\<schedule_node\>/elk_\0/g s/\<schedule_node_child\>/elk_\0/g s/\<\([a-z]*_\)\?thread_payload\>/elk_\1thread_payload/g s/\<fs_generator\>/elk_\0/g s/\<fs_inst\>/elk_\0/g s/\<fs_reg_alloc\>/elk_\0/g s/\<disasm_info\>/elk_\0/g s/\<gfx._math\>/elk_\0/g s/\<gfx7_block_read_scratch\>/elk_\0/g s/\<gfx6_IF\>/elk_\0/g s/\<gfx9_fb_READ\>/elk_\0/g s/\<gfx6_resolve_implied_move\>/elk_\0/g # Opcodes. s/\<opcode op\>/elk_\0/g s/\<opcode mov_op\>/elk_\0/g s/\<opcode opcode\>/elk_\0/g s/enum opcode\>/enum elk_opcode/g s/static opcode\>/static elk_opcode/g s/\<opcode elk_op/elk_opcode elk_op/g s/struct opcode_desc/struct elk_opcode_desc/g s/NUM_BRW_OPCODES/NUM_ELK_OPCODES/g s/\<.._OPCODE_/ELK_\0/g s/\<T.._OPCODE_/ELK_\0/g s/\<VEC4_OPCODE_/ELK_\0/g s/\<VEC4_...\?_OPCODE_/ELK_\0/g s/\<SHADER_OPCODE_/ELK_\0/g # Remaining specific cases. s/\<wm_prog_data_barycentric_modes\>/elk_\0/g s/\<encode_slm_size\>/elk_\0/g s/\<intel_calculate_slm_size\>/elk_\0/g s/\<gfx6_gather_sampler_wa\>/elk_\0/g s/\<is_3src\>/elk_\0/g s/\<WA_/ELK_\0/g s/\<conditional_modifier\>/elk_\0/g s/\<pred_ctrl_align16\>/elk_\0/g s/\<shuffle_from_32bit_read\>/elk_\0/g s/\<shuffle_src_to_dst\>/elk_\0/g s/\<setup_imm_..\?\>/elk_\0/g s/\<opt_predicated_break\>/elk_\0/g s/\<has_bank_conflict\>/elk_\0/g s/\<dead_control_flow_eliminate\>/elk_\0/g s/\<disasm_new_inst_group\>/elk_\0/g s/\<disasm_initialize\>/elk_\0/g s/\<dump_assembly\>/elk_\0/g s/\<disasm_insert_error\>/elk_\0/g s/\<disasm_annotate\>/elk_\0/g s/\<enum lsc_opcode\>/enum elk_lsc_opcode/g s/\<lsc_opcode_/elk_lsc_opcode_/g s/\<lsc_aop_[a-z_]\+\>/elk_\0/g s/\<type_size_vec4\>/elk_\0/g s/\<type_size_dvec4\>/elk_\0/g s/\<type_size_xvec4\>/elk_\0/g s/\<type_size_[a-z4]\+_bytes\>/elk_\0/g s/\<gfx12_systolic_depth\>/elk_\0/g ``` Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	f677485ad4	intel/elk: Rename C++ namespace Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	ed89a42f7a	intel/elk: Update doxygen-like file comments Not sure if we use them, but updated. Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	783f3d0b4f	intel/elk: Rename header guards Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	e6022281f2	intel/elk: Rename files to use elk prefix Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	b02712efc2	intel/elk: Remove DPAS lowering This is meant for Gfx9+. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	b743ab7acc	intel/elk: Remove stages not used in Gfx8- OpenCL, Mesh and RayTracing stages are not supported, so removing them and related code. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:31 +00:00
Caio Oliveira	8a57012ff4	intel/elk: Use common code in intel/compiler Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:30 +00:00
Caio Oliveira	dcf29202d4	intel/elk: Remove a bunch of files that don't apply for Gfx8- Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:30 +00:00
Caio Oliveira	06b553f02c	intel/elk: Remove compiler specific devinfo hash This more coarse-grained hash information for compiler (vs. full devinfo), used only by Iris and Anv, and relevant for more recent platforms. Remove it from elk. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:30 +00:00
Caio Oliveira	0083585fc5	intel/elk: Compile ELK library, tests and tools For now is not linked to any driver. The tools were renamed to use elk prefix to avoid conflicting with the brw ones. The run-test.py script was also updated due to that change. Before the new compiler can be linked together with the old (going to be done for Iris and other tools), the symbol conflicts need to be fixed first. This will happen in a later commit. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:30 +00:00
Caio Oliveira	d44462c08d	intel/elk: Fork Gfx8- compiler by copying existing code Based on code from commit `c3ceec6cd8`. Acked-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27563>	2024-02-24 00:24:30 +00:00
Faith Ekstrand	a9214460ee	nvk: Expose a host-visible VRAM type when we have REBAR We use host maps of VRAM all over the driver on Maxwell+ so we know they work. This exposes it to applications for use for data upload. To avoid thrashing, we only expose this on systems with properly a configured PCI controllwer with resizable BAR support. We already choose whether or not to set the MMAP bit when creating the BO based on memory type properties so we just need to add the type. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26622>	2024-02-23 16:27:21 -06:00
Faith Ekstrand	f5cb1eed26	nvk: Upload shaders on the CPU when we have ReBAR Without without resizable BAR, we're limited as to how much VRAM we can map and we sometimes run out of maps for games with large numbers of shaders. We keep using the DMA engine fallback in that case. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26622>	2024-02-23 16:25:49 -06:00
Faith Ekstrand	9ed7f8c54e	nvk/heap: Upload shaders on the CPU when we have a map Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26622>	2024-02-23 16:25:49 -06:00
Faith Ekstrand	d4623da9a3	nouveau/winsys: Getch the BAR size from the kernel Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/26622>	2024-02-23 16:25:49 -06:00
Faith Ekstrand	85849ed9cb	nvk: implement EXT_memory_budget Reviewed-by: Thomas H.P. Andersen <phomes@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27662>	2024-02-23 21:27:13 +00:00
Faith Ekstrand	db2162efa6	nvk: Add an available query to nvk_memory_heap Reviewed-by: Thomas H.P. Andersen <phomes@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27662>	2024-02-23 21:27:13 +00:00
Faith Ekstrand	bbb458be8d	nvk: Use 3/4 of total system memory for the VRAM heap Reviewed-by: Thomas H.P. Andersen <phomes@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27662>	2024-02-23 21:27:13 +00:00
Faith Ekstrand	e9bd005c44	nvk: Add a hand-rolled nvk_memory_heap struct Reviewed-by: Thomas H.P. Andersen <phomes@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27662>	2024-02-23 21:27:13 +00:00
Faith Ekstrand	6a8c60d674	nvk: Add a nouveau_ws_device to nvk_physical_device We don't want to use it for anything because we want VM isolation per-device but we need the device open for VK_EXT_memory_budget. Reviewed-by: Thomas H.P. Andersen <phomes@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27662>	2024-02-23 21:27:13 +00:00
Faith Ekstrand	b887c9ba2c	nouveau/winsys: Add a vram_used query Reviewed-by: Thomas H.P. Andersen <phomes@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27662>	2024-02-23 21:27:13 +00:00
Faith Ekstrand	6e08a84936	drm-uapi: Sync nouveau_drm.h From https://cgit.freedesktop.org/drm/ commit 72fa02fdf83306c52bc1eede28359e3fa32a151a Author: Dave Airlie <airlied@redhat.com> Date: Wed Jan 24 14:24:25 2024 +1000 nouveau: add an ioctl to report vram usage Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27662>	2024-02-23 21:27:12 +00:00
Christian Gmeiner	028080c716	isaspec: encode.py: Include util/log.h Generated encode functions are making use of mesa_loge(..). Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27714>	2024-02-23 20:29:57 +00:00
Marek Olšák	c9abb7ff6e	glthread: generate packed versions of glPointer/Offset calls The pointer/offset parameter is often NULL or a small number with VBOs. The idea is: - If the pointer/offset parameter is NULL/0, use a different cmd structure and unmarshal function that doesn't contain the pointer/offset parameter to save 8 bytes per call. - If the cmd structure has a hole and the pointer/offset parameter is a small number that fits into the hole, use a different cmd structure and unmarshal function that stores the value within the hole using a smaller type to save 8 bytes per call. This implements those ideas. It will continue generating the most optimal code even if we change the packing of other parameters. This decreases the size of 1 frame in glthread batches by 21% in Viewperf2020/Catia1. Example of generated code for glVertexPointer with and without the pointer parameter if it's NULL. See the arrows for comments. / VertexPointer: marshalled asynchronously / struct marshal_cmd_VertexPointer { struct marshal_cmd_base cmd_base; GLpacked16i size; GLenum16 type; GLclamped16i stride; const GLvoid pointer; }; struct marshal_cmd_VertexPointer_packed { struct marshal_cmd_base cmd_base; GLpacked16i size; GLenum16 type; GLclamped16i stride; // <------- no "pointer" }; uint32_t _mesa_unmarshal_VertexPointer(struct gl_context ctx, const struct marshal_cmd_VertexPointer restrict cmd) { GLpacked16i size = cmd->size; GLenum16 type = cmd->type; GLclamped16i stride = cmd->stride; const GLvoid * pointer = cmd->pointer; CALL_VertexPointer(ctx->Dispatch.Current, (size, type, stride, pointer)); return align(sizeof(struct marshal_cmd_VertexPointer), 8) / 8; } uint32_t _mesa_unmarshal_VertexPointer_packed(struct gl_context ctx, const struct marshal_cmd_VertexPointer_packed restrict cmd) { GLpacked16i size = cmd->size; GLenum16 type = cmd->type; GLclamped16i stride = cmd->stride; const GLvoid * pointer = (const GLvoid )(uintptr_t)0; // <------- using NULL CALL_VertexPointer(ctx->Dispatch.Current, (size, type, stride, pointer)); return align(sizeof(struct marshal_cmd_VertexPointer_packed), 8) / 8; } static void GLAPIENTRY _mesa_marshal_VertexPointer(GLint size, GLenum type, GLsizei stride, const GLvoid pointer) { GET_CURRENT_CONTEXT(ctx); if (!pointer) { // <------- the condition int cmd_size = sizeof(struct marshal_cmd_VertexPointer_packed); struct marshal_cmd_VertexPointer_packed cmd = _mesa_glthread_allocate_command(ctx, DISPATCH_CMD_VertexPointer_packed, cmd_size); cmd->size = size < 0 ? UINT16_MAX : MIN2(size, UINT16_MAX); cmd->type = MIN2(type, 0xffff); / clamped to 0xffff (invalid enum) / cmd->stride = CLAMP(stride, INT16_MIN, INT16_MAX); } else { int cmd_size = sizeof(struct marshal_cmd_VertexPointer); struct marshal_cmd_VertexPointer cmd = _mesa_glthread_allocate_command(ctx, DISPATCH_CMD_VertexPointer, cmd_size); cmd->size = size < 0 ? UINT16_MAX : MIN2(size, UINT16_MAX); cmd->type = MIN2(type, 0xffff); /* clamped to 0xffff (invalid enum) / cmd->stride = CLAMP(stride, INT16_MIN, INT16_MAX); cmd->pointer = pointer; } _mesa_glthread_AttribPointer(ctx, VERT_ATTRIB_POS, MESA_PACK_VFORMAT(type, size, 0, 0, 0), stride, pointer); } Example of generated code for glNormalPointer using a smaller type: / NormalPointer: marshalled asynchronously / struct marshal_cmd_NormalPointer { struct marshal_cmd_base cmd_base; GLenum16 type; GLclamped16i stride; const GLvoid pointer; }; struct marshal_cmd_NormalPointer_packed { struct marshal_cmd_base cmd_base; GLenum16 type; GLclamped16i stride; GLushort pointer; // <-------- truncated "pointer" }; uint32_t _mesa_unmarshal_NormalPointer(struct gl_context ctx, const struct marshal_cmd_NormalPointer restrict cmd) { GLenum16 type = cmd->type; GLclamped16i stride = cmd->stride; const GLvoid * pointer = cmd->pointer; CALL_NormalPointer(ctx->Dispatch.Current, (type, stride, pointer)); return align(sizeof(struct marshal_cmd_NormalPointer), 8) / 8; } uint32_t _mesa_unmarshal_NormalPointer_packed(struct gl_context ctx, const struct marshal_cmd_NormalPointer_packed restrict cmd) { GLenum16 type = cmd->type; GLclamped16i stride = cmd->stride; const GLvoid * pointer = (const GLvoid )(uintptr_t)cmd->pointer; // <-------- upcasting CALL_NormalPointer(ctx->Dispatch.Current, (type, stride, pointer)); return align(sizeof(struct marshal_cmd_NormalPointer_packed), 8) / 8; } static void GLAPIENTRY _mesa_marshal_NormalPointer(GLenum type, GLsizei stride, const GLvoid pointer) { GET_CURRENT_CONTEXT(ctx); if (((uintptr_t)pointer & 0xffff) == (uintptr_t)pointer) { // <-------- the condition int cmd_size = sizeof(struct marshal_cmd_NormalPointer_packed); struct marshal_cmd_NormalPointer_packed cmd = _mesa_glthread_allocate_command(ctx, DISPATCH_CMD_NormalPointer_packed, cmd_size); cmd->type = MIN2(type, 0xffff); / clamped to 0xffff (invalid enum) / cmd->stride = CLAMP(stride, INT16_MIN, INT16_MAX); cmd->pointer = (uintptr_t)pointer; / truncated / // <-------- the truncation } else { int cmd_size = sizeof(struct marshal_cmd_NormalPointer); struct marshal_cmd_NormalPointer cmd = _mesa_glthread_allocate_command(ctx, DISPATCH_CMD_NormalPointer, cmd_size); cmd->type = MIN2(type, 0xffff); /* clamped to 0xffff (invalid enum) */ cmd->stride = CLAMP(stride, INT16_MIN, INT16_MAX); cmd->pointer = pointer; } _mesa_glthread_AttribPointer(ctx, VERT_ATTRIB_NORMAL, MESA_PACK_VFORMAT(type, 3, 1, 0, 0), stride, pointer); } Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	24f14f8daa	glthread: add a packed version of DrawElementsUserBuf The reduces the call size by 24 bytes. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	c566df8b39	glthread: add a packed variant of glDrawElements with 16-bit count and indices This is just to decrease the size of glDrawElements by 8 more bytes. This packed glDrawElements call occupies only 1 slot in glthread_batch. This decreases the size of 1 frame in glthread batches by 13% in Viewperf2020/Catia1. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	5925a864b5	glthread: rewrite glBindBuffer packing We always reserved space for a doubled glBindBuffer call, occupying 2 slots. Thanks to the removal of cmd_size, we can finally represent glBindBuffer in only 1 slot, so do that. This saves space if there is only 1 glBindBuffer call. The combining of back-to-back BindBuffer calls is preserved by keeping track of 2 last BindBuffer calls. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	1f9b554839	glthread: use marshal_count instead of count for more functions Same as the previous commit, just applied to more functions. This removes safe_mul and checking whether cmd_size is too large because the size is always small with these functions. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	e8721fefcd	glthread: don't check cmd_size for small variable-sized calls This removes the size checking, syncing, and direct execution if the variable-sized call is always small. Don't use safe_mul in that case either. Only calls already using marshal_count are affected. Example: Before: static void GLAPIENTRY _mesa_marshal_PointParameterfv(GLenum pname, const GLfloat params) { GET_CURRENT_CONTEXT(ctx); int params_size = safe_mul(_mesa_point_param_enum_to_count(pname), 1 sizeof(GLfloat)); int cmd_size = sizeof(struct marshal_cmd_PointParameterfv) + params_size; if (unlikely(params_size < 0 \|\| (params_size > 0 && !params) \|\| (unsigned)cmd_size > MARSHAL_MAX_CMD_SIZE)) { _mesa_glthread_finish_before(ctx, "PointParameterfv"); CALL_PointParameterfv(ctx->Dispatch.Current, (pname, params)); return; } struct marshal_cmd_PointParameterfv cmd = _mesa_glthread_allocate_command(ctx, DISPATCH_CMD_PointParameterfv, cmd_size); cmd->num_slots = align(cmd_size, 8) / 8; cmd->pname = MIN2(pname, 0xffff); / clamped to 0xffff (invalid enum) / char variable_data = (char ) (cmd + 1); memcpy(variable_data, params, params_size); } After: static void GLAPIENTRY _mesa_marshal_PointParameterfv(GLenum pname, const GLfloat params) { GET_CURRENT_CONTEXT(ctx); int params_size = _mesa_point_param_enum_to_count(pname) * 1 * sizeof(GLfloat); int cmd_size = sizeof(struct marshal_cmd_PointParameterfv) + params_size; assert(cmd_size >= 0 && cmd_size <= MARSHAL_MAX_CMD_SIZE); struct marshal_cmd_PointParameterfv cmd = _mesa_glthread_allocate_command(ctx, DISPATCH_CMD_PointParameterfv, cmd_size); cmd->num_slots = align(cmd_size, 8) / 8; cmd->pname = MIN2(pname, 0xffff); / clamped to 0xffff (invalid enum) / char variable_data = (char *) (cmd + 1); memcpy(variable_data, params, params_size); } Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	798f430777	glthread: deduplicate batch finalization code Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	5d70c21d82	glthread: pack uploaded user vertex buffers and offsets better glthread_attrib_binding has 2 fields and 4 bytes of padding, which is arranged in array. This removes the padding by splitting the structure into 2 arrays, one for each field. This also fixes the pointer alignment. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	97532db988	glthread: fix multi draws with a negative draw count This fixes the invalid pointers when draw_count is invalid. I don't know if it had any adverse affect. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	eda0b9f8d4	glthread: pack glVertexAttribPointer calls better These parameters can use 8 bits. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	ad34c932cd	glapi: pass pointer size to python for glthread from meson glthread (the python generator) needs to know the pointer size at compile time to sort structure fields of calls for optimal structure packing based on the CPU. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	610bc4d115	glthread: remove "if True" from print_marshal_async_code This only changes indentation. No functional change. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	e299473968	glthread: separate marshal code generation into print_marshal_async_code I added "if True" to make the diff readable. No functional change. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	2d796de157	glthread: separate unmarshal function generation into print_unmarshal_func No functional change. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	dd6b0ea015	gltrhead: merge 3 blocks conditional on marshal_sync in print_async_body There are 3 blocks next to each other that check marshal_sync. Merge them. No functional change. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	e9819744b3	glthread: precompute fixed_params and variable_params lists This removes functions get_fixed_params and get_variable_params. No functional change. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:59 +00:00
Marek Olšák	6eef0c60f8	glthread: move global marshal_XML.py functions into class marshal_function No functional change. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:58 +00:00
Marek Olšák	aea16b5ae9	glthread: sort fixed-sized parameters before returning them No functional change. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:58 +00:00
Marek Olšák	ee0d869d66	glapi: fix type names for glthread and handle all types glthread will compare the whole type string, so the string must not have trailing spaces. No functional change. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:58 +00:00
Marek Olšák	12754aec1a	glthread: rewrite glDrawArrays call packing Since changing 1 field to 8 bits and the removal of cmd_size, call sizes have decreased, so we have 4 unused bytes in 2 DrawArrays structures So far we use: - DrawArrays - DrawArraysInstancedBaseInstance - DrawArraysInstancedBaseInstanceDrawID Change them to these by either removing 4 more bytes or adding 4 bytes, so that we don't waste space, which drops the number of used calls by 1: - DrawArraysInstanced - DrawArraysInstancedBaseInstanceDrawID Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:58 +00:00
Marek Olšák	5e5d91671e	glthread: rewrite glDrawElements call packing Since changing 2 fields to 8 bits and the removal of cmd_size, call sizes have decreased by 4 bytes, so we have 4 unused bytes in most DrawElements structures. So far we have used these calls for all DrawElements variants: - DrawElementsBaseVertex - DrawElementsInstanced - DrawElementsInstancedBaseVertexBaseInstance - DrawElementsInstancedBaseVertexBaseInstanceDrawID Change them to these by either removing 4 more bytes or adding 4 bytes, so that we don't waste space. - DrawElements - DrawElementsInstancedBaseVertex - DrawElementsInstancedBaseInstance - DrawElementsInstancedBaseVertexBaseInstanceDrawID This decreases the size of 1 frame in glthread batches by 12% in Viewperf2020/Catia1. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:58 +00:00
Marek Olšák	99f8f01dfb	glthread: pack the index type to 8 bits Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27350>	2024-02-23 18:03:58 +00:00

1 2 3 4 5 ...

185119 commits