mesa/src/intel
Alyssa Rosenzweig bccaeb28bb
Some checks are pending
macOS-CI / macOS-CI (dri) (push) Waiting to run
macOS-CI / macOS-CI (xlib) (push) Waiting to run
brw/nir_lower_cs_intrinsics: do some math at 16-bit
There are less than 2^16 lanes within a threadgroup, so it is safe to do
all math at 16-bit. This allows us to use 16-bit integer division which is
much faster than 32-bit integer division (in terms of the lowerings).

In a "hello world" kernel with variable wg size, simd32 goes 72 inst -> 57
inst on jay and 82 -> 67 inst on brw.

OTOH it's a loss for non-variable wg size, so do it only there to avoid
unwelcome stats regresions on Vulkan.

Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@intel.com>
Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/41084>
2026-04-24 17:13:24 +00:00
..
blorp intel/blorp: remove always-true #if 2026-04-14 18:26:09 +00:00
ci ci/intel: Switch over to the new tool for restricted traces. 2026-04-23 22:54:12 +00:00
common genxml/mi: add additional bit to FF_MODE and autostrip helper 2026-03-30 11:02:27 +00:00
compiler brw/nir_lower_cs_intrinsics: do some math at 16-bit 2026-04-24 17:13:24 +00:00
decoder intel/decoder: update warning message when buildtype=release 2026-03-09 20:01:01 +00:00
dev intel/dev: fixup intel_needs_workaround() macro 2026-04-20 21:53:35 +03:00
ds intel/ds: report when OA metrics are unavailable 2026-04-13 21:31:51 +00:00
executor meson: make dep_lua a disabler 2025-11-21 21:48:57 +00:00
genxml intel/genxml: Added dispatch timeout counter extended field 2026-04-24 01:38:20 +00:00
isl isl: speedup buffer fills by dropping swizzle programming 2026-03-30 12:05:28 +00:00
mda intel/mda: Use -W for color words diff and -U for regular unified diff 2026-01-28 22:11:11 +00:00
nullhw-layer build: avoid redefining unreachable() which is standard in C23 2025-07-31 17:49:42 +00:00
perf intel/ds: report when OA metrics are unavailable 2026-04-13 21:31:51 +00:00
shaders intel/shaders: Build for Xe3P (GFX_VERx10 == 350) 2026-03-04 11:10:34 -08:00
tools intel/tools: Add xe3p format for intel_monitor 2026-03-26 07:31:09 +00:00
vulkan anv: Update values for DispatchTimeoutCounter 2026-04-24 01:38:20 +00:00
vulkan_hasvk hasvk: unbreak assert format != ISL_FORMAT_UNSUPPORTED 2026-04-22 20:35:25 +00:00
meson.build brw: Move into a new src/intel/compiler/brw subdirectory 2025-10-09 07:01:47 +00:00