From 72b83894167e38a4e7e63802e186e9ea0114ea1b Mon Sep 17 00:00:00 2001 From: Eric Engestrom Date: Wed, 7 May 2025 18:35:09 +0200 Subject: [PATCH] docs: add release notes for 25.1.0 --- docs/relnotes.rst | 2 + docs/relnotes/25.1.0.rst | 4108 ++++++++++++++++++++++++++++++++ docs/relnotes/new_features.txt | 48 - 3 files changed, 4110 insertions(+), 48 deletions(-) create mode 100644 docs/relnotes/25.1.0.rst delete mode 100644 docs/relnotes/new_features.txt diff --git a/docs/relnotes.rst b/docs/relnotes.rst index 932eead0dfd..137fa488558 100644 --- a/docs/relnotes.rst +++ b/docs/relnotes.rst @@ -3,6 +3,7 @@ Release Notes The release notes summarize what's new or changed in each Mesa release. +- :doc:`25.1.0 release notes ` - :doc:`25.0.3 release notes ` - :doc:`25.0.2 release notes ` - :doc:`25.0.1 release notes ` @@ -446,6 +447,7 @@ The release notes summarize what's new or changed in each Mesa release. :maxdepth: 1 :hidden: + 25.1.0 25.0.3 25.0.2 25.0.1 diff --git a/docs/relnotes/25.1.0.rst b/docs/relnotes/25.1.0.rst new file mode 100644 index 00000000000..65d18fe5143 --- /dev/null +++ b/docs/relnotes/25.1.0.rst @@ -0,0 +1,4108 @@ +Mesa 25.1.0 Release Notes / 2025-05-07 +====================================== + +Mesa 25.1.0 is a new development release. People who are concerned +with stability and reliability should stick with a previous release or +wait for Mesa 25.1.1. + +Mesa 25.1.0 implements the OpenGL 4.6 API, but the version reported by +glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) / +glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used. +Some drivers don't support all the features required in OpenGL 4.6. OpenGL +4.6 is **only** available if requested at context creation. +Compatibility contexts may report a lower version depending on each driver. 
+ +Mesa 25.1.0 implements the Vulkan 1.4 API, but the version reported by +the apiVersion property of the VkPhysicalDeviceProperties struct +depends on the particular driver being used. + +SHA checksums +------------- + +:: + + TBD. + + +New features +------------ + +- cl_khr_spirv_linkonce_odr in rusticl +- storagePushConstant16 on panvk +- storageInputOutput16 on panvk +- VK_KHR_depth_stencil_resolve on panvk +- VK_KHR_separate_depth_stencil_layouts on panvk +- VK_EXT_separate_stencil_usage on panvk +- VK_KHR_sampler_ycbcr_conversion on panvk/v10+ +- VK_EXT_ycbcr_2plane_444_formats on panvk/v10+ +- VK_EXT_ycbcr_image_arrays on panvk/v10+ +- VK_KHR_imageless_framebuffer on panvk +- VK_KHR_uniform_buffer_standard_layout on panvk +- VK_EXT_border_color_swizzle on panvk +- VK_MESA_image_alignment_control on NVK +- shaderFloat16 on panvk +- VK_KHR_shader_subgroup_uniform_control_flow on panvk/v10+ +- VK_KHR_shader_maximal_reconvergence on panvk/v10+ +- VK_EXT_device_memory_report on RADV +- VK_KHR_shader_subgroup_extended_types on panvk/v10+ +- shaderStorageImageExtendedFormats on panvk +- VK_KHR_display on panvk +- VK_EXT_display_control on panvk +- EXT_shader_framebuffer_image_fetch on v3d +- EXT_shader_framebuffer_image_fetch_coherent on v3d +- KHR_blend_equation_advanced on v3d +- KHR_blend_equation_advanced_coherent on v3d +- KHR_partial_update on etnaviv +- VK_KHR_line_rasterization on panvk +- VK_EXT_line_rasterization on panvk +- shaderImageGatherExtended on panvk +- textureCompressionBC on panvk +- VK_EXT_sample_locations on RADV for GFX10+ +- GL_ARB_shader_clock on panvk +- VK_KHR_shader_float_controls on panvk +- VK_KHR_shader_float_controls2 on panvk/v10+ +- storageBuffer8BitAccess on panvk +- storagePushConstant8 on panvk +- uniformAndStorageBuffer8BitAccess on panvk +- MSAA with 8 and 16 sample counts on panvk +- VK_KHR_spirv_1_4 on panvk/v10+ +- Mali G720 and G725 on Panfrost and panvk +- dualSrcBlend on panvk +- VK_KHR_dynamic_rendering_local_read on 
panvk +- VK_EXT_subgroup_size_control on panvk/v10+ +- VK_KHR_format_feature_flags2 on panvk +- shaderStorageImageReadWithoutFormat on panvk +- shaderStorageImageWriteWithoutFormat on panvk +- VK_EXT_direct_mode_display on panvk +- Vulkan 1.2 on panvk/v10+ + + +Bug fixes +--------- + +- RADV:RX 9070:Mesa-25.0.5 GTA 5 Enhanced GPU HANG +- [ANV/DG2] nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_reflections renders not all reflections +- radv: Flickering in Kingdom Come: Deliverance II +- RADV regression causes severe glitches in Hunt Showdown 1896 on Polaris +- Z-Fighting in Tomb Raider IV - VI Remastered Linux +- [anv] VK_ERROR_DEVICE_LOST on Linux 6.13.8 while playing Dota 2 on Intel Graphics +- Variable Rate Shading (VRS) produces very aliased results on RADV with an AMD gpu +- WWE 2k23 small "artifacts" +- RadeonSI - ACO does not spill enough SGPRs despite low VGPR pressure in some scenarios +- RADV: Performance Regression (~18%) on Vega 64 (GFX9) in Cyberpunk 2077 caused by commit 08918f08805f (MR 34361) +- radeonsi: CL conformance test \`vector_swizzle` fails since 177427877bb50ad7ba24abfa13e55a2684d804df +- [macOS] glapi_gentable.c:46:10: fatal error: 'glapitable.h' file not found +- commit 90faadae regression - Warhammer 40,000: Space Marine 2 crashing +- RADV: GPU hangs always at certain places in the game "Sniper Elite: Resistance" +- Random mesa crashes in kwin_wayland on a 6600XT +- Patch to fix clinfo on rusticl +- radv/aco: Ghost of Tsushima hangs and causes gpu resets on RDNA 3 GPU +- mesa-vulkan-driver-git.x86_64 causes strange colored rectangle artifacts in Final Fantasy XIV +- NVK - Out of video memory error when starting Marvel Rivals +- NVK - Out of video memory error when starting Marvel Rivals +- RADV: Performance regression in Elden Ring on GFX8/Polaris +- RADV: Performance regression in Elden Ring on GFX8/Polaris +- Vulkan issues after sleeping on 9070 XT +- ring gfx_0.0.0 timeout after waking from sleep - RX 9070 +- Confidential issue 
#12324 +- Confidential issue #12946 +- svga: how do I test this driver? +- brw: new Xe2 CTS failures +- [NVK] NAK assert in The Last of Us Part 2 shader +- anv: S.T.A.L.K.E.R. 2: Heart of Chornobyl crashes after starting a new game +- [ANV][LNL] - Lost Records: Bloom & Rage (1902960) - Title hangs on launch and subsequently crashes to desktop. +- [BMG] Intel b580 battlemage: Fort Solis (Unreal Engine game) boots to menu, hangs while loading after hitting continue from the main menu +- [ANV][LNL] - NINJA GAIDEN 2 Black (3287520) - Environment assets are incorrectly rendered or missing. +- [ANV][LNL] - The Headliners (3059070) - Title hangs a few minutes after launch. +- anv, regression: Invisibly blinking cliffs & rocks in Satisfactory DX12 on BMG +- [Feature request] Add HK support for VK_EXT_queue_family_foreign vulkan extension +- vk/overlay: output_file option failing +- The Last of Us Part I GPU hang on gfx1201 +- glsl: IR validation fails on some shaders after lower_precision (breaks GLSL-to-NIR) +- [bisected, LNL] brw: 341e5117ecbc ("brw/nir: Treat load_const as convergent") regresses arb_gpu_shader5-interpolateAtOffset on LNL +- vulkan regression mesa 24.3.4 to 25.0.0.rc3 with broadcom +- AMD: broken Minetest trace on Vega iGPU +- All OpenGL applications segfault +- radv: nir_opt_varyings.c:2766: deduplicate_outputs: Assertion \`list_index == 0' failed. 
+- vulkan/wsi: memory leak from wsi_CreateSwapchainKHR +- [RADV][RDNA3][Phoenix3][APU] NARAKA: BLADEPOINT (1203220) gpu hang reproducible (ice/water regression mesa 24.1 bisected SAMPLE_MASK_TRACKER_WATERMARK=15) random (maybe other apps/games) +- GPU hangs running Octopath Traveler II with 780M +- GPU crash on Radeon 780M with Tales of Arise +- The Last of Us - shadows flickering on gfx1201 without nohiz flag +- brw: Hit unreachable nir_op_fsign case that brw_nir_lower_fsign missed +- Panvk:Add Support for BCn (BC1–BC7) Texture Compression +- anv: Dark pattern overlayed on objects in Eve Online DX11 mode on BMG +- radv/aco: Strobing artifacts in Pacific Drive +- aco: Distorted light halos, Star Citizen +- Mesa 25 removes VA-API encoding for R9 390 +- debian-android build is huge +- isl: CPCBCompressionEnable is now LosslessCompressionEnable +- Video stuttering / anv: extend implicit fencing support +- anv, bmg: Visual issues in AC Origins, Odyssey and Fenyx Rising when dxvk doesn't export PointSize +- [ANV][LNL] - A Game About Digging A Hole (3244220) - Title throws an assertion failure on launch. +- [ANV][BMG] - A Plague Tale: Requiem - Severe shadow flicker +- [ANV][LNL] - Hogwarts Legacy (990080) - Flickering artifacts visible on magical barrier. +- brw: regression on Gfx9 dEQP-VK +- HTML Docs fail to build from source with Sphinx 8.2.1 +- libglapi.dll ain't built anymore under Windows +- anv/video: Timestamps are exposed in video encode queue, but it crashes +- Getting a crash with manually built llvmpipe (OpenGL) +- [RadeonSI] Blender assetshelf icons are borken in mesa >= 25.0.0 +- radeonsi regression after 24.3.4 +- misc OpenGL CTS failures +- glBindVertexBuffer regression due to ID reuse +- RADV: logic used to avoid running on CDNA is faulty +- [ANV][LNL] - Company of Heroes 3 (1677280) - Circular banding is present on screen during gameplay. 
+- video hardware acceleration issue with "VA" +- [LNL/BMG] Assassin's Creed Valhalla trace replay hang +- Vulkan conformanceVersion is reported as 0.0.0.0 in Mesa 25.0.0 +- X11 + Zink on NVK flickers older frames in Firefox based browsers +- VRAM Abnormal use on mesa 25.0 +- [radv][regression] Multiple games detect the wrong amount of vram +- Resident Evil 2 Remake flickers +- OpConstantNull not supported for OpTypeCooperativeMatrixKHR +- v3dv: vkcube-wayland crashes on raspberry pi 5 kernel 6.12 and latest mesa +- GMSH Visualization Fails with radeonsi:can't compile a main shader part, Fedora 41 AMD 7900xt +- AMD VDPAU deinterlacing SIGSEGV +- radv: vkd3d-proton test_primitive_restart_list_topology_stream_output randomly fails on NAVI2X +- Mesa 24.1 introduced a Vulkan problem with DOOM 2016 on AMD 780M GPU +- radeonsi: Firefox fails assertion requiring binding vertex elements before vertex_buffers +- nouveau & zink+nvk: Flashing in Firefox and Thunderbird on Hyprland +- Zink: Kopper's present thread causes Wayland protocol races +- GLmatrix needs aligned malloc +- Lavapipe crashes if no Position is output in mesh shader +- radeonsi/video: Allocate video buffers with modifiers 7f7206f1a9d brake 'mplayer -vo vdpau...' on GFX8 (Polaris 20) +- [RADV/aco][regression][bisected] - Avowed (2457220) - GPU hangs near Watermill outside of Dawnshore +- radv/sqtt: assertion "layout transition marker should be only emitted inside a barrier marker" +- mesa_shader_cache directory is created even if mesa_shader_cache_db is used. +- nak: Fold i2b(b2i(x)) +- [radv] Glitchy ground geometry regression in Total War Warhammer III on RX 7600 +- NVK: Implement shaderSharedInt64Atomics +- nvk: Implement VK_MESA_image_alignment_control +- radeonsi: regression with running DaVinci Resolve under rusticl since 666a6eb871d5dec79362bdc5d16f15915eb52f96 +- [ANV][LNL] - Black Myth: Wukong (2358720) - Corruption is visible near the edge of water. 
+- [ANV][LNL] - Hogwarts Legacy (990080) - Pixelated corruption is visible when looking out at the water. +- radv/video/h265: pps.flags.transform_skip_enabled_flag = 1 randomly hangs GPU +- turnip falls with \`assertion "errno == ETIME" failed` +- [ANV][LNL] - Steel Rats (619700) - Game crashes after opening logos play before reaching main menu +- nvk: Implement host-only descriptors +- Gnome-shell Wayland fails to start with segfault at modifier-less driver +- [ANV][LNL] - DYNASTY WARRIORS: ORIGINS (2384580) - Dithered transparency has vertical bands. +- AMD Radeon R9 270 randomly causes video playback applications to crash with "amdgpu: The CS has been rejected" +- ci: a306 jobs use a307 files +- Rendering issues on GravityMark with RadeonSI ACO +- Expose low latency encoding for radv vulkan video encode +- i915: multiple tests assert with tgsi_ureg.h:893: ureg_swizzle: Assertion \`reg.File != TGSI_FILE_NULL' failed. +- ci: debian-build-testing fails in nightly +- shaders/closed/steam/deus-ex-mankind-divided/260.shader_test fails NIR validation +- shaders/closed/steam/deus-ex-mankind-divided/260.shader_test fails NIR validation +- panvk : vk_pipeline_cache_object_deserialize: Assertion \`reader.current == reader.end && !reader.overrun' failed. +- 46a8d5e7ef61735416d0c54886a7a9930621ae2c causes a permission denied spam +- [BUILD] Build Failure: Implicit Function Declaration 'timespec_sub_saturate' (loader_wayland_helper.c) +- anv: \`MESA: warning: INTEL_HWCONFIG_MIN_GS_URB_ENTRIES (2) != devinfo->urb.min_entries[MESA_SHADER_GEOMETRY] (0)` +- intel genX_acceleration_structure: missing dependency to bvh/header.spv.h +- ci: what's going on with zink-venus-lavapipe ? 
+ + +Changes +------- + +Aaron Ruby (17): + +- gfxstream: Replace pre-processor (LINUX && !ANDROID) checks with LINUX_GUEST_BUILD +- gfxstream: Make the virtgpu device discovery for LinuxVirtGpu more robust +- gfxstream: Change "mesaOnly" nomenclature to be "guestOnly" +- gfxstream: Add common interfaces in the VirtGpuDevice to query DrmInfo and PciBusInfo +- gfxstream: Clean up the gfxstream_vk device and instance init +- gfxstream: Fix precedence and ownership issues on Linux for imported FD and the VirtGpuResource +- gfxstream: Add reference counting for GEM handles in LinuxVirtGpuBlob +- gfxstream: Downgrade log severity when enabling params in LinuxVirtGpu +- gfxstream: Remove vkGetImageSubresourceLayout ResourceTracker entry +- gfxstream: Full emulation support for VK_EXT_image_drm_format_modifier +- gfxstream: Resolve/clean-up inconsistencies with advertising emulated extensions +- gfxstream: No VIRGL_BIND_LINEAR for ColorBuffers +- gfxstream: Move virtgpu_gfxstream_protocols.h to the common location for house protocols +- egl: Remove FallbackZink config item entirely +- gbm/dri: Match zink autoloading from egl +- drm-uapi: Sync virtgpu header +- virtio: Remove virglrenderer_hw.h entirely + +Adam Jackson (8): + +- zink: Enable KHR_shader_subgroup +- lvp: set subgroupQuadOperationsInAllStages to true +- glx: Make #undef GLX_INDIRECT_RENDERING do something +- mapi/glx: Remove xserver code generation +- glx: Remove (almost) all usage of _X_HIDDEN / _X_INTERNAL +- mapi/glx: Remove FASTCALL/PURE +- loader: Stop looking in ${libdir}/tls/ +- meson: Simplify the power8 optimization logic + +Aditya Kumar (1): + +- gfxstream: Fix compiling gfxstream for musl libs + +Alejandro Piñeiro (1): + +- nir: aliasing checks should be also done with index != 0 + +Aleksi Sapon (3): + +- draw: fix gl_PrimitiveID in tessellation +- llvmpipe: improve aniso filtering +- lp: fix gnu-empty-initializer warning + +Alyssa Rosenzweig (226): + +- meson: factor out with_driver_using_cl +- 
nir: add nir_function_intrinsics_pass +- nir/lower_scratch_to_var: handle KERNELs +- nir/lower_scratch_to_var: handle multi-function shaders +- nir/print: extract nir_print_function_body +- nir/serialize: add specialized function serialization +- nir: introduce bindgen_return +- nir: add nir_call_serialized helper +- vtn: add vtn_bindgen2 tool +- libcl: add MIN3/MAX3 macros like on the host +- libcl: define GLSL-style compute built-ins +- libagx: port to glsl-style compute builtins +- asahi: port to vtn_bindgen2 +- nir: include __LINE__ in NIR_PASS validation results +- libcl: add unreachable() macro +- nir: add image_min_lod_agx +- nir: add lod_bias_min_agx tex src +- agx: lower min LOD for txf +- agx: switch to nir_tex_src_lod_bias_min_agx +- hk: pack has_border with clamp_0_sampler_index +- hk: rearrange sampler image desc +- hk: emulate EXT_image_view_min_lod +- hk: advertise EXT_image_view_min_lod +- Revert "hk: Stop using strings or common key types for meta keys" +- intel: drop nir_lower_printf calls +- intel/nir_lower_printf: modernize nir +- intel: port to u_printf context + singleton +- nir/lower_printf: hash format strings in nir_printf_fmt +- nir: drop printf_base_identifier +- nir: add nir_intrinsic_has_semantic helper +- nir: switch intrinsic semantics to BIT +- nir: mark subgroup/quadgroup ops +- nir/gather_info: use subgroup/quadgroup flags +- nir/opt_move_discards_to_top: use semantic +- nir: mark a few more subgroup ops +- libcl/vk: add common query copy write routine +- nir: default-initialize next_stage +- asahi: clang-format +- asahi: fix libwrap.dylib +- asahi: fix cull distance with GS +- hk: implement calibrated timestamps +- hk: fix increment CS invs +- hk: do not incorrectly offset host-image-copy sources +- asahi: bind zero-page +- libagx: use zero page +- asahi: use zero sink for vbuf +- hk: use zero sink for null index buffer +- hk: don't allocate zero sink +- libagx: add missing null pointer check +- asahi: perf debug indirect tess +- 
asahi: use NIR_PASS to validate more +- asahi: switch tib lower to intrinsic pass +- libagx: fix subgroup id confusion +- libagx: fix wraparound issue with robust draw kernel +- libagx: use assert instead of 0xdeadbeef writes +- asahi: add more alignment asserts +- hk: reject non-2D modifiers +- hk: unstub UnmapMemory2KHR +- hk: unbind VAs +- hk: fix buffer binding +- hk: enable bufferDeviceAddressCaptureReplay +- docs/asahi: update varying info +- docs/asahi: add some section headers +- docs/asahi: describe sparse page tables +- asahi: rename Null layout +- asahi: identify image mode enum +- asahi: add sparse block XML +- ail: model sparse page tables +- ail: unit test sparse_table_size_B +- ail: report mip_tail_first_lod for sparse +- ail: report miptail stride +- ail: move helpers to layout.h for sharing +- ail: extract a blocksize helper +- ail: expose ail_get_max_tile_size +- ail: add ail_bytes_to_pages helper +- ail: add ail_get_twiddled_block_B helper +- ail: unit test ail_get_twiddled_block_B +- asahi: assert page alignment in vm_bind +- asahi: support unbinding VA in agx_va_free +- agx: implement sparse residency queries +- hk: ban sparse host-image-copy +- hk: implement sparse +- asahi: drop cargo culted disk cache disable +- agx: fix uniform atomic opts +- hk: ensure space with the dispatch +- hk: pass cmdbuf, not control stream, into precomp dispatch +- hk: assert more +- hk: accelerate buffer copies with CL +- hk: pass cmdbuf to perf_debug +- libagx: fix ia_primitives with tessellation +- libagx: vectorize tess level loads +- hk: perf debug sparse binding +- asahi,libagx,hk: don't set custom_target names +- asahi: drop trivial depends +- agx: assert shift bounds +- agx: fix botched address fuses +- hk: do not clamp txf for copy shaders +- libagx: use 64-bit multiply for image atomic calculation +- asahi: fix null deref in error path +- hk: fake min/max filtering for proton +- agx: handle rgb32 residency queries +- hk: ban sparse RGB32 +- agx: handle 
sparse buffer images +- asahi: support sparse in virtgpu +- nir/builder: add nir_shader_tex_pass +- treewide: use nir_shader_tex_pass +- radv/nir_lower_viewport_to_zero: intrinsic pass +- radv/nir_lower_view_index: intrinsic pass +- radv/nir_lower_fs_barycentric: intrinsic pass +- radv/nir_lower_intrinsics_early: intrinsic pass +- radv/nir_lower_fs_intrinsics: intrinsic pass +- nvk: rewrite query copy shader in CL C +- docs/asahi: clarify twiddled vs GPU-tiled +- ail: rename twiddled -> gpu tiled +- ail: split compression up +- asahi: rename wip modifier +- asahi: drop silly +- hk: drop silly +- nir: add nir_progress/nir_no_progress helpers +- asahi: clang-format +- nir: clang-format +- treewide: Switch to nir_progress +- treewide: port remaining nir_metadata_preserve users +- nir,nak: update comments referencing nir_metadata_preserve +- nir: eliminate nir_metadata_preserve +- nir: clean up progress +- ir3: clean up progress manually +- asahi: clang-format +- vulkan: add common VK_PRINT_STR/VK_COPY_STR macros +- hk: switch to common VK_COPY/PRINT_STR +- anv,hasvk: switch to common VK_COPY/PRINT_STR +- v3dv: switch to common VK_COPY/PRINT_STR +- radv: use VK_COPY_STR +- panfrost,panvk: fix clang warnings +- vulkan: add helpers to work with executable statistics +- util: add shader statistic framework +- asahi: port to common stats framework +- v3dv: fix clang warning +- meson: make CL args common +- meson: set NDEBUG appropriately for OpenCL +- intel: use common CL args +- meson,clc: set CL standard from meson +- panfrost: clean up CL meson rules +- panfrost: port to common stats framework +- agx: call nir_lower_is_helper_invocation +- pan/mdg: call nir_lower_is_helper_invocation +- nir/lower_helper_writes: fix stores after discard +- pan/lower_helper_invocation: clean up +- bin: add script for applying review trailers +- bin: add list of Mesa contributors +- libcl: add u_foreach_bit +- hk: fix cull distance confusion +- asahi: integrate printf/abort support +- 
libagx: do not use prefix sums for GS index buffer +- asahi/gs: drop non-XFB prefix sums +- asahi/gs: factor out output info +- asahi/gs: avoid recalculating +- asahi/gs: only prefix sum with XFB +- libagx,asahi: hoist GS draw generation +- asahi/gs: drop unused params +- asahi/gs: report whether xfb is needed +- asahi: do not dispatch count/pre-GS unless needed +- hk: do not dispatch count/pre-GS unless needed +- libagx: use indirect draw struct +- libagx: clean up +- tu: use the stats framework +- v3dv: use the stats framework +- drm-uapi: update drm_fourcc.h +- ail: drop fake modifiers +- libagx: reduce # of variants of unroll restart +- asahi: add assert for max variant count +- agx: fix ctz of zero with address calcs +- agx: fix tg4 offset residency +- hk: advertise semaphore extensions +- hk: advertise bufferDeviceAddressCaptureReplayEXT +- hk: fix unaligned copies +- libagx: clean up query copy; bug fix +- libagx: assert alignment for copies +- libagx: use common heap allocs +- asahi: fix printf without result buffer +- asahi: fix depth buffer feedback loops +- asahi: clang-format +- glsl_to_nir: upcast array indices +- nir/lower_blend: refactor logicop variables +- nir/lower_blend: disable logic ops for unsupported formats +- nir/lower_tex: use vector_insert_imm +- vk/meta: generalize 3D blit code +- asahi,hk: disentangle logicop_enable +- agx: optimize nonuniform offset +- hk: drop dead todo +- hk: fix last VGT output component limits +- hk: support colour <--> z/s copies +- hk: advertise VK_EXT_queue_family_foreign +- hk: advertise maintenance7 +- hk: advertise maintenance8 +- bin: add rebase mode +- drm-uapi: add asahi uapi +- asahi: disable virtio gpu for upstream +- asahi: port to stable uAPI +- asahi: remove unstable uapi header +- gallium: wire up asahi driver +- asahi: fix i/a queries with tess +- panfrost: invert and rename no_ubo_to_push flag +- panfrost: do not push "true" UBOs +- asahi: shrink VA space for sparse emulation +- asahi: add sparse 
emu helpers +- asahi: fix zero bo leak +- hk: bind for sparse emulation +- hk: use ro maps +- hk: advertise sparseResidencyBuffer +- hk: drop soft fault assumption in hk_buffer_addr_range +- util/simple_mtx: fix duplicate definition +- nir: unvendor lod_bias(_agx) +- nir: handle mismatched bias/lod bitsizes +- nir: add sampler LOD bias lowering +- hk: handle lod/min_lod size mismatch +- agx: switch to common LOD bias lowering +- hk: fix underbinding scratch +- hk: fix tessellation + clipper queries +- hk: fix null FS corner cases +- agx: model sources as late-kill in demand calcs +- agx: late-kill sources +- agx: early-kill sources only if it won't shuffle +- hk: fix patch count = 0 handling +- asahi: fix possible null deref +- asahi: do not use "Null" layout + +Andrew Wolfers (1): + +- vulkan: Add BGRA format support + +Antonino Maniscalco (2): + +- freedreno/crashdec: Avoid crashing on some traces +- freedreno/crashdec: Add check for \`CP_BV_SQE_STAT_ADDR` + +Antonio Ospite (28): + +- ci/android: fix building deqp on Android +- ci/android: add an android-angle-lavapipe job +- ci/android: don't do unnecessary cleanup +- ci/android: don't set HOME globally in cuttlefish-runner.sh +- ci/android: disable audio in cuttlefish, it is not needed +- ci/android: disable modem simulator in cuttlefish, it is not needed +- ci/android: stop cuttlefish before copying the logs, to log everything +- ci/android: increase the waiting time to stop the cuttlefish launcher +- ci/android: bump CUTTLEFISH_BUILD_NUMBER +- ci/android: use an x86_64_only cuttlefish image +- ci/android: push /deqp-gles/mustpass/egl-main.txt.zst +- ci/android: pass -vsock_guest_cid to launch_cvd +- ci/android: move all dEQP handling in one place in cuttlefish-runner.sh +- ci/android: add an android-deqp-runner.sh script +- ci/android: add a job using android-cts instead of deqp-runner +- ci/android: don't set EGL_PLATFORM on the host before launching cuttlefish +- ci/android: increase the waiting time to 
stop the cuttlefish launcher +- ci/android: factor out a generic android-runner.sh from cuttlefish-runner.sh +- ci/android: remove some unnecessary adb commands from android-runner.sh +- ci/android: check for ANGLE_TAG in android-deqp-runner.sh +- ci/android: remove old mesa and ANGLE libraries before pushing new ones +- ci/android: always push ANGLE libraries +- ci/android: push also the intel vulkan driver +- ci/android: handle ANGLE being installed under /system on Android 15+ +- ci/android: factor out GLES runtime version retrieval to a function +- ci/android: only ship x86_64 artifacts in debian-android build job +- ci/android: strip the artifacts of the debian-android job +- meson: pass --no-pager to \`meson configure` command + +Ashley Smith (2): + +- panfrost: Reset syncobj after use to avoid kernel warnings +- pan/bi: Enable ARB_shader_clock extension support + +Assadian, Navid (1): + +- amd/vpelib: More parameters to the segmentation process and introduce validation hook + +Autumn Ashton (5): + +- radv: Fix compute resolve rounding +- radv: Enable fragmentShadingRateWithCustomSampleLocations +- radv: Expose EXT_sample_locations everywhere +- radv: Expose VK_SAMPLE_COUNT_1_BIT for sample position on GFX10+ +- ci/zink+radv: Add sample locations fails + +Bas Nieuwenhuizen (1): + +- radv: Move support check out of winsys. 
+ +Benjamin Cheng (1): + +- d3d12/video_enc: Fix AV1 tile_info() coding + +Benjamin Lee (41): + +- panfrost: minor refactors in preparation for panvk 16-bit IO +- panfrost: move handling for bifrost mediump lowering to pan_collect_varyings +- panfrost: support 16-bit vertex attributes +- panfrost: support 16-bit varyings +- panvk: advertise storagePushConstant16 and storageInputOutput16 +- panvk: enable VK_KHR_depth_stencil_resolve +- panvk: enable KHR_separate_depth_stencil_layouts +- panvk: document missed extensions in new_features +- panvk: implement VK_EXT_separate_stencil_usage +- panfrost: remove NIR_PASS_V usage for noperspective lowering +- meson: update wayland-protocols source_hash +- panfrost: fix large int32->float16 conversions +- panfrost: reorder lower_bit_size pass +- panfrost: fix condition in bi_nir_is_replicated +- panfrost/va: remove swizzle mod from LDEXP +- panfrost: implement 16-bit ldexp +- panvk: advertise shaderFloat16 +- panfrost: implement 16-bit pack/unpack intrinsics +- panvk/csf: fix uninitialized read in utrace_clone_init_builder +- panfrost/pps: fix omitting several counters +- panfrost: fix libpan_v4 header include +- panfrost/va: remove dead code for packing BRANCHZI.i16 lane mod +- driconf: add uint64 type +- panvk: add core mask driconf options +- panfrost: add core mask driconf options +- pan/va: add roundmode modifier to additional instructions +- panfrost: implement float controls rounding mode +- pan/va: preserve signed zero in f32->f16 conversions +- pan/bi: refactor bi_instr_ftz to allow dontcare FTZ states +- pan/bi: ignore ftz mode when scheduling int instructions +- pan/bi: implement denorm behavior float controls +- panvk/va: don't advertise independent denorm behavior +- panvk: advertise VK_KHR_shader_float_controls +- panvk: advertise VK_KHR_float_controls2 +- panfrost: define bi_swizzle alias values +- panfrost: add bi_swizzle values for unused swizzles +- panfrost/va: use 'lane' modifier for MKVEC.v2i8 +- 
panfrost: use bifrost instruction table for bi_lower_swizzle +- panfrost/va: valhall-specific swizzle lowering +- panvk: enable uniformAndStorageBuffer8BitAccess and storagePushConstant8 +- panvk: don't advertise VK_KHR_float_controls2 on bifrost + +Benjamin Otte (1): + +- lavapipe: Don't advertise support for multiplane drm formats + +Benjamin ROBIN (2): + +- util/disk_cache: Do not try to delete old cache if cache is disabled +- docs: Update MESA_SHADER_CACHE_DIR env variable behavior + +Bo Hu (2): + +- gfxstream: Adding support for VK_KHR_global_priority extension +- gfxstream: update code generator for simplified dep graph + +Boris Brezillon (29): + +- panvk: Don't clobber registers if the render pass was suspended +- pan/decode: Fix the blend_count mask +- panvk/csf: Don't free the resources twice when init_render_desc_ringbuf() fails +- panvk: Initialize device virtual address space after the VM creation +- vulkan/state: Fix input attachment map state initialization/copy +- vk/pass: Add input attachment location info +- vulkan/state: Fix default input attachment map values +- panvk/jm: Don't force a preload if the previous batch didn't have draws +- pan/bi: Allow depth/stencil tile buffer access using LD_TILE +- pan/bi: Pass load_converted_output_pan target through a source +- pan/bi: Pass an explicit sampleid to load_converted_output_pan +- panvk/csf: Set invalidate_inherited_ctx only if the render pass is inherited +- panvk: Re-order things in panvk_deserialize_shader() +- panvk: Isolate CS specific bits in panvk_shader +- panvk: Generate the earlyzs LUT at shader creation time +- pan/earlyzs: Support the shader ZS read-only case and its optimization on v10+ +- panvk/jm: Move cmd_prepare_draw_sysvals() out of the layer loop +- panvk: Support color attachment remapping +- pan/bi: Introduce two intrinsics to support input attachment remapping +- panvk: Optimize input attachment loads when we can +- panvk: Skip BY_REGION barriers if we're in a render pass +- 
panvk: Advertise KHR_dynamic_rendering_local_read support +- pan/va: Support relaxed waits on read-only render targets +- panvk/csf: Optimize read-only tile buffer access +- panfrost: also consider z/s for tile-size +- pan/earlyzs: Fix the read-only ZS optimization +- panvk/csf: Pass less arguments to emit_vs_attrib() +- panvk/csf: Fix instance attribute offseting +- pan/format: Disable image storage on A8_UNORM + +Caio Oliveira (112): + +- intel/brw: Move fs_inst implementation code together +- intel/brw: Rename fs_inst to brw_inst +- intel/brw: Rename fs_inst_box to brw_inst_box +- intel/brw: Move some larger functions from brw_inst.h to brw_inst.cpp +- intel/brw: Remove brw_gs_compile struct +- intel/brw: Rename file brw_fs_nir.cpp to brw_from_nir.cpp +- intel/brw: Remove 'fs' prefix from brw_from_nir functions +- anv: Add statistic for 'GRF registers' for Xe3+ +- intel/brw: Add brw_analysis.h +- intel/brw: Use brw_analysis prefix for liveness analysis files +- intel/brw: Use brw_analysis prefix for def analysis file +- intel/brw: Use brw_analysis prefix for performance analysis files +- intel/brw: Move idom_tree declaration to brw_analysis.h +- intel/brw: Move analysis passes without own file to brw_analysis.cpp +- intel/brw: Merge brw_ir_analysis.h into brw_analysis.h +- intel/brw: Use brw prefix instead of namespace for dep analysis enum +- intel/brw: Use brw prefix instead of namespace for analysis implementations +- intel/brw: Remove 'using namespace brw' directives +- intel/brw: Remove offsets and total_size from VGRF allocator +- intel/brw: Pass fs_visitor around instead of the simple_allocator +- intel/brw: Add functions to allocate VGRF space +- intel/brw: Fold simple_allocator into the shader +- intel/brw: Rename and move thread_payload types to own header +- intel/brw: Merge brw_fs_visitor.cpp into brw_fs.cpp +- intel/brw: Rename files brw_fs.cpp/h to brw_shader.cpp/h +- intel/brw: Rename fs_visitor to brw_shader +- intel/brw: Rename 
fs_copy_prop_dataflow to brw_copy_prop_dataflow +- intel/brw: Rename a few remaining functions to remove fs prefix +- intel/brw: Update outdated comments +- intel/compiler: Use -Werror=vla +- intel/brw: Use the builder DO() function in all places +- intel/brw: Always have a (non-DO) block after a DO in the CFG +- intel/brw: Don't need to repair CFG in brw_opt_combine_constants +- brw: Reformat brw_gram.y and brw_lex.l +- brw: Fix size in assembler when compacting +- brw: Always verify EU compaction in debug mode +- brw: Remove unused function +- brw: Add block pointer in brw_inst +- brw: Use brw_inst::block in Def analysis +- brw: Use brw_inst::block in Combine Constants +- brw: Use brw_inst::block in CSE +- brw: Remove block parameter from brw_inst::remove() +- brw: Simplify brw_builder "insert before inst" constructor +- brw: Add explicit way to get an empty brw_builder +- brw: Embed at_end() inside brw_builder(brw_shader \*) constructor +- brw: Remove bblock_t parameters from various passes +- brw: Make assembler strict about JIP and UIP order +- brw: Add support for GOTO/JOIN in the assembler +- brw: Rework label tracking in assembler +- brw: Remove extra interface in brw_cfg types +- brw: Remove dead code from control flow +- brw: Add missing dependency classes to various passes +- brw: Get the reference to brw_def_analysis only once in saturate propagation +- brw: Move defs analysis back to its place in saturate propagation +- brw: Simplify the test code for brw passes +- brw: Add assembler support for DPAS +- brw: Remove extra SHADER_OPCODE_FLOW emitted during NIR conversion +- spirv: Update headers and metadata from latest Khronos commit +- vulkan: Update XML and headers to 1.4.311 +- spirv: In SpecConstantOp handling don't adjust convert ops bit_size twice +- brw: Fix decoding of 3-src destination stride in EU validation +- brw: Allow generating destination with stride 2 in 3-src instructions +- brw: Remove prefix gfx10 from enum types +- brw: Make some 
integer check more explicit +- brw: Add BRW_TYPE_BF for bfloat16 +- brw: Add BRW_TYPE_BF validation +- brw: Consider bfloat16 in scoreboard +- brw: Add EU assembler support for bfloat16 +- intel/executor: Add small example with bfloat +- brw: Expand EU validation for DPAS +- brw: Fix disassembler trying to decode 3src_hstride in Gfx9 +- brw: Remove brw_cfg::dump() +- brw: Track num_instructions in a block +- brw: Use block's num_instructions in scoreboard tests +- brw: Track total_instructions in a shader +- brw: Add analysis for block IP ranges +- brw: Use brw_ip_ranges in passes +- brw: Use brw_ip_ranges in scheduling / regalloc +- brw: Use brw_ip_ranges in liveness analysis +- brw: Remove adjust_block_ips and brw_inst::remove() with defer +- intel/executor: Use getopt for command line arguments +- intel/executor: Allow selecting a device to use +- brw: Use control flow helpers in scoreboard tests +- brw: Use SIMD16 shaders in scoreboard tests for Xe2+ +- brw: Stop setting SFID in scoreboard tests +- brw: Return actual progress in brw_lower_scoreboard +- brw: Add parser of SWSB annotations to use in tests +- brw: Use new bld/exp style in scoreboard tests +- brw: Remove HSW specific code from brw_compile_cs.cpp +- brw: Add brw_builder::uniform() +- brw: Fix invalid memory access in scoreboard test +- brw: Fix memory leak in EU validation tests +- intel/executor: Fix check for open() failure +- brw: Properly handle cooperative matrices created with constants +- spirv: Take a separate data_type when creating a new vtn_variable +- brw: Add a few basic tests for register coalesce +- brw: Clean up saturate propagation after non-defs version removal +- brw: Add brw_range struct +- brw: Use brw_range in IP ranges analysis +- brw: Use brw_range when operating with live ranges +- brw: Use brw_range to store VGRF ranges +- brw: Use brw_range to store Vars ranges +- brw: Use brw_range::last() to explicit get the last valid IP +- brw: Make brw_range use half-open ranges +- 
broadcom/ci: Skip test due to timeout +- nir/load_store_vectorize: Skip new bit-sizes that are unaligned with high_offset +- intel: Add intel_device_info::has_systolic +- intel: Disable has_bfloat16 for MTL +- brw: Allow DPAS with BF on Gfx125 +- brw: Update EU validation to allow packed BF mixed with packed F +- intel/executor: Update bfloat example +- intel/executor: Fix bfloat example for converting F to packed BF + +Caleb Callaway (3): + +- anv: add INTEL_DEBUG=rt_notrace +- intel/compiler: fix lingering i965 references +- intel/tools: fix 32b build for EU stall tool + +Casey Bowman (2): + +- vulkan/screenshot-layer: Add RGBA surface format support +- vulkan/screenshot-layer: Correct queueFamilyIndex source + +Caterina Shablia (11): + +- panvk: enable imagelessFramebuffer +- panvk: enable uniformBufferStandardLayout +- nir: teach nir_lower_bit_size to handle ballot and ballot_relaxed +- pan/bi: ensure src bit sizes of ballot{,_relaxed} and read_invocation +- pan/bi: lower non-32-bit ballot{,_relaxed} +- pan/bi: lower subgroups before lowering int64 +- panvk: enable subgroupExtendedTypes +- panfrost: don't overwrite push uniforms and sysvals UBO with user's UBO +- panfrost: update nr_uniform_buffers before dispatching XFB +- panfrost: require buffer_count and pushed_words to be passed to panfrost_emit_const_buf +- panfrost: move the comment closer to what it's about + +Charmaine Lee (1): + +- svga: remove tgsi semantic in shader compiler key + +Christian Gmeiner (20): + +- docs: Update perfetto with the latest status +- docs: Update the list of drivers with CPU tracepoints +- docs: Add perfetto driver specifics for V3D +- etnaviv: isa: Add swizzle instruction +- nir: Add bool return value to nir_lower_clip_halfz(..) +- nir: Add bool return value to nir_lower_texcoord_replace(..) +- etnaviv: nir: Return progress for etna_lower_alu(..) +- etnaviv: nir: Return progress for etna_lower_io(..) +- etnaviv: nir: Don't use deprecated NIR_PASS_V macro anymore. 
+- zink/ntv: Only emit GeometryStreams cap if multiple streams are used +- etnaviv/ci: Bring back GC7000 +- etnaviv/ci: Start using the revision number for GPU_VERSION +- etnaviv/ci: Update flakes for gc7000-r6214 +- etnaviv/ci: Add gles2 run for GC3000 +- etnaviv/ci: Add missing rev to GC3000 +- etnaviv: rs: Factor out box alignment function +- etnaviv: rs: Change param etna_get_rs_alignment_mask(..) +- etnaviv: Add support for KHR_partial_update +- mesa/formats: Add MESA_FORMAT_NV12 +- etnaviv: Add multi-planar YUV support + +Colin Marc (1): + +- vulkan/wsi: implement the Wayland color management protocol + +Collabora's Gfx CI Team (7): + +- Uprev Piglit to fc8179d319046f45346bcbcc5aaeabebdf151f03 +- Uprev Piglit to 04d901e49de6b650f9dceaf73220371273d87f73 +- Uprev Piglit to 708a9e365b18fdd881af989f75e1a6c1409cae8c +- Uprev ANGLE to 1b34d2a18af12cc55a3bc74dd679c2937d10cc5c +- Uprev ANGLE to 3818d37d5e94317f01810053b8f28c1f1e8b98e6 +- Uprev Piglit to 0ecdebb0f592 +- Uprev ANGLE to a3f2545f6bb3 + +Connor Abbott (52): + +- tu: Constify frag_areas argument of tu_fdm_bin_apply_t +- tu: Refactor fragment density map sampling +- freedreno: Add VSC mask parameter to CP_SET_BIN_DATA5 +- tu: Implement bin merging for fragment density map +- freedreno: Add a750+ "absolute" VSC bin mask +- tu: Implement "absolute" bin mask on a750 +- tu: Make sure tiles being merged are adjacent +- tu: Fix vertical tile merging check +- tu: Fix static blend bandwidth calculation +- tu: Remove useless prim_order state merging +- ir3: Fix const allocation when parsing assembly +- freedreno/decode: Push hostaddr->gpuaddr conversion into highlight_gpuaddr() +- freedreno/crashdec: Use register for RB rptr +- freedreno/crashdec: Handle hangs where the SQE is in RB +- freedreno/crashdec: Fix and extend control reg dumping on a750 +- freedreno/crashdec: Dump CP_BV_SQE_UCODE_DBG +- freedreno: Misc control registers updates +- tu: Implement VK_KHR_maintenance7 +- tu: Plumb through 
VkMemoryBarrierAccessFlags3KHR +- tu: Handle D32S8 -> R32 and R32 -> D32S8 copies +- tu: Handle R8->D24S8 and D24S8->R8 copies +- tu: Handle a pipelineStageCreationFeedbackCount of 0 +- ir3: Remove ir3_info::data +- tu: Expose VK_KHR_maintenance8 +- compiler/shader_info: Better document require_full_quads +- nir, compiler: Rename needs_quad_helper_invocations +- compiler,nir: Gather needs_full_quad_helper_invocations info +- ir3: Use needs_full_helper_invocations +- tu: Fix binning_possible detection with bin merging +- tu: Fix GMEM offset for multisample layered separate stencil +- tu: Enable GMEM with layered rendering +- tu: Fix size of frag_size_ir3 and frag_offset_ir3 driver params +- tu: Fix reported FDM fragment size with multiview +- tu: Fix layer_count with dynamic rendering + multiview +- ir3: Split mad with scalar ALU +- tu: Only allow power-of-two fragment areas +- tu: Split out part of tiling config to vsc config +- tu: Fix CmdClearAttachments with fragment density map +- tu/fdm: Skip some patchpoints when binning +- tu: Implement VK_QCOM_fragment_density_map_offset +- vk/runtime: Use vk_command_buffer in renderpass wrappers +- vk/runtime: Add common CmdEndRendering +- tu: Implement VK_EXT_fragment_density_map_offset +- ir3: Pass through access flags when lowering global accesses +- nir/opt_preamble: Handle load_global_ir3 +- ir3: Move load/store vectorization to finalize +- ir3: Vectorize shared memory loads/stores +- tu: Fix flushing when using a staging buffer for copies +- freedreno: Add compute_lb_size device info +- freedreno/a6xx: Define CONSTANTRAMMODE +- freedreno/a6xx, turnip: Set CONSTANTRAMMODE correctly +- ir3: Take LB restriction on constlen into account on a7xx + +Corentin Noël (12): + +- venus/ci: Skip flaky test due to intermittent timeouts +- ci: Update CrosVM and Virglrenderer +- lavapipe: Remove doubly initialized features +- lavapipe: Change lvp_cmd_type to anonymous enum +- ci: Update CrosVM and Virglrenderer +- virgl: nir: Don't 
use deprecated NIR_PASS_V macro anymore +- nir: Add bool return value to nir_legacy_trivialize(..) +- ntt: nir: Don't use deprecated NIR_PASS_V macro anymore +- tnn: nir: Don't use deprecated NIR_PASS_V macro anymore +- ci: Uprev virglrenderer to latest version on time +- virgl: Close handle on resource info failure +- virgl: Use drmCloseBufferHandle instead of calling dmIoctl directly + +Daniel Schürmann (43): + +- aco/isel: fix empty exec tracking for uniform branches +- aco/isel: move cf_info into separate struct cf_context +- aco/isel: rename cf_context::has_divergent_branch +- aco/isel: use cf_context in if_context to restore cf information +- aco/isel: use cf_context in loop_context to restore cf information +- aco/isel: add and use exec_info::empty() helper +- aco/isel: fix assumptions about potential empty exec mask in nested control flow +- aco/isel: remove loop nest information from exec_info +- aco/isel: track control flow divergence in loops more accurately +- nir: make divergence information metadata +- nir: require nir_metadata_divergence if needed +- amd: switch to nir_metadata_divergence +- intel: switch to nir_metadata_divergence +- nouveau: switch to nir_metadata_divergence +- nir: only print divergence information if metadata is valid +- nir/serialize: don't serialize divergence information +- nir/validate: validate divergence metadata +- nir/print: print phi sources sorted by predecessor blocks +- aco/scheduler: always respect min_waves on GFX10+ +- aco/scheduler: stop rounding down the target number of waves on GFX10+ +- aco: unify get_addr_sgpr_from_waves() and get_addr_vgpr_from_waves() into one function +- aco/scheduler: keep track of RegisterDemand at UpwardsCursor::insert_idx +- aco/scheduler: keep track of RegisterDemand at DownwardsCursor::insert_idx{_clause} +- aco/scheduler: remove unused include of unordered_set +- aco/insert_exec_mask: Don't immediately set exec to zero in break/continue blocks +- aco/insert_exec_mask: don't restore 
exec in continue_or_break blocks +- aco/lower_branches: allow other instructions after s_andn2 in break blocks +- aco/ssa_elimination: insert parallelcopies for p_phi immediately before branch +- aco/ssa_elimination: refactor scratch_sgpr handling +- aco/ssa_elimination: create a single parallelcopy instruction for linear and logical phis +- aco/assembler: Fix short jumps over chained branches +- aco/assembler: Don't insert chained branches into otherwise empty blocks +- aco/tests: add more tests for chained branches +- zink: lower {demote|terminate}_if to CF after lowering point smooth +- nir: remove nir_lower_discard_if_to_cf option +- nir: set SYSTEM_VALUE_HELPER_INVOCATION read for nir_intrinsic_is_helper_invocation +- zink: clean up HelperInvocation code emission +- zink: enable CapabilityDemoteToHelperInvocation and extension only if required +- aco/lower_branches: properly consider exec mask needs of branch targets +- aco: don't assume that demote doesn't cause an empty exec mask +- aco/insert_exec_mask: if applicable, use s_wqm to restore exec after divergent CF +- aco/insert_exec_mask: don't disable dead quads on demote in divergent CF +- aco: Remove empty exec skipping after demote + +Daniel Stone (29): + +- ci/vvl: Use appropriate level of parallelism +- ci: Move common testing packages to test-base +- ci: Move apitrace to test-base container +- ci: Add debian/arm32_test-* images +- ci: Build libdrm for Android host builds +- ci/angle: Use native toolchain for builds +- ci/zink: Expand flake definition on radv +- ci: Update kernel for Panthor scheduling fixes +- ci/panfrost: Move G610 testing to pre-merge +- ci: Re-enable Collabora CI +- docs/ci: Fix nginx caching instructions +- ci/lava: Fail faster when getting results +- etnaviv: Add missing build dependency on generated header +- ci/amd: Add new radeonsi fail seen in the wild +- ci/zink: Flake out sparse tests +- ci: Add daniels to restricted-trace users +- ci: Re-enable trace jobs with updated 
Piglit +- ci: Move softpipe issue from fail to flake +- ci/amd: Disable radv-fossils +- ci/windows: Don't copy non-existent libglapi.dll +- ci/panfrost: Split inherit definitions into -inc +- ci: Reduce build job timeouts +- ci/softpipe: Add timeouts to softpipe jobs +- ci/llvmpipe: Add llvmpipe and lavapipe timeouts +- ci/virgl: Add timeout to software jobs +- ci/docs: Add timeout to doc build +- ci/microsoft: Add job timeouts and pin Piglit to GSt +- ci: Make all job timeouts explicit +- doc/ci: Update nginx caching snippets + +Danylo Piliaiev (28): + +- u_trace: print tracepoint params in csv output +- util: Make debug_dump_flags thread safe +- util: Add dump_debug_control_string to dump debug_control +- tu: Add gmem disable reason to renderpass tracepoint +- tu: Add info about debug options to command buffer tracepoint +- tu: Get correct src view when storing gmem attachment +- freedreno/fdl: Log mutability when dumping layout +- tu: Handle mismatched mutability when resolving from GMEM +- tu/a6xx: Emit VSC addresses for each bin to restore after preemption +- nir/nir_lower_multiview: Don't assert if load_deref doesn't have var +- ir3: Set need_full_quad depending on info.fs.require_full_quads +- tu: Be more granular in calculating whether blend state reads color +- tu: Fix NULL deref in trace_end_render_pass +- freedreno/regs: Rename BINNING bit to FS_DISABLE in a few regs +- ir3: Detect empty fragment shaders +- tu/a7xx: Bypass invoking empty FS for D/S-only draw calls +- ir3: Detect FS that write only color without other side effects +- tu: Disable FS in certain cases even if FS is not empty +- tu/lrz: Improve LRZ around stencil tests and reads_dest cases +- tu: Use EARLY_Z also for stencil tests +- ir3: Fix shaders that write only color classified as empty +- freedreno: Bump kernel uapi (linux 6.14) +- freedreno,tu: Read and pass to compiler uche_trap_base +- tu: Implement VK_KHR_shader_clock +- freedreno/a6xx: Implement ARB_shader_clock +- tu,freedreno: 
Don't fallback to LINEAR with DRM_FORMAT_MOD_QCOM_COMPRESSED +- ir3: VARYING_SLOT_LAYER output is used for binning +- tu: Fix disable_fs state update condition + +Dave Airlie (24): + +- radv/video: move encoder to using a buffer instead of an image +- radv/video: calculate colloc buffer size for h264 B frames. +- radv/video: add h264 b frame encoding support. +- vulkan/wsi/x11: don't use update_region for damage if not created +- nak: adjust latencies on fp16/64 instructions on Turing +- nvk: enable float16 on turing. +- loader/nouveau: load zink as the GL driver for turing and above. +- radv/video: don't try and send events on UVD devices. +- vulkan/video: add simple parameter retrieval wrappers +- radv/video: convert to using common parameter wrappers. +- anv/video: convert to common parameters retrieval code +- anv/video: don't write to params if not set. +- vulkan/video: add support for inline session parameters. +- radv: expose VK_KHR_video_maintenance2 +- anv: expose VK_KHR_video_maintenance2 +- gallivm: check for avx512vbmi and tell LLVM the correct answer. +- nak: add reads after setting writes +- nak: Add an a_has_pred parameter to waw_latency +- nak: Add Turing latency information +- nak: Add Ampere and Ada latency information +- nvk: update nvidia class header files. 
+- nvk: add ADA compute class to nv_push_dump +- nvk: add hopper support to nv_push_dump +- nak: add F2FP to sm75 instr latencies + +David Rosca (64): + +- ac/vcn_dec: Fix AV1 film grain on VCN5 +- radeonsi/video: Avoid stream handle duplicates in PID namespace +- frontends/va: Don't try to switch to protected buffer in EndPicture +- frontends/va: Add CreateContext flag to enable protected context +- frontends/va: Require protected context for VAProtectedSliceDataBuffer +- frontends/va: Switch to graphics context when creating protected surface +- radeonsi/vcn: Use texture instead of video buffer for DPB buffers +- radeonsi/video: Fix creating video buffers with AMD_DEBUG=tmz +- frontends/vdpau: Set H264 chroma_format_idc +- radeonsi/vcn: Set correct chroma format for H264 decode +- radeonsi/uvd: Set correct chroma format for H264 decode +- ci/amd: Remove VAAPI skips +- frontends/va: Use transfer stride and offset in DeriveImage +- radv/video: Fix setting balanced preset for HEVC encode with SAO enabled +- radv/video: Move IB header from begin/end to encode_video +- radeonsi: Use gfx for TMZ buffer clears +- winsys/amdgpu: Add assert for secure submissions on compute ring +- frontends/vdpau: Use extra reference buffer for AV1 film grain +- ac/surface: Only allow linear modifier for subsampled 422 formats +- ac/surface: Allow DCC for multi-plane formats on GFX12 +- radeonsi/vcn: Fix chroma pitch for JPEG decode +- radeonsi/video: Allocate video buffers with modifiers +- radeonsi/vcn: Add UDT support for VCN5 +- radeonsi/vcn: Rework decode ref handling +- radeonsi/video: Fix crash when creating buffers without modifiers support +- frontends/va: Set AV1 max_width/height to surface size +- frontends/vdpau: Set AV1 max_width/height to surface size +- Revert "radeonsi/vcn: Limit size to target size in AV1 decode" +- pipe: Remove PIPE_AV1_ENC_FRAME_TYPE_SHOW_EXISTING +- radeonsi/vcn: Set all pic params for H264 encode references +- radeonsi/vcn: Add 
radeon_enc_av1_picture_type +- radeonsi/vcn: Support H264 encode weighted_bipred_idc +- radeonsi/video: Remove mpeg12 shader decoder support +- gallium/vl: Fix video buffer supported format check +- Revert "frontends/vdpau: Alloc interlaced surface for interlaced pics" +- frontends/vdpau: Fix creating deinterlace filter for interleaved buffers +- frontends/va: Support A8R8G8B8 format for processing +- frontends/va: Use ARGB as default fourcc for RGB32 RT format +- frontends/va: Don't filter supported formats according to config RT format +- gallium/vl: Return YUV plane order for single plane formats +- radeonsi/video: Only allow 64K_S swizzle mode for VCN < 2.2 +- radeonsi/vce: Support old VCE firmware +- radeonsi/video: Allow DCC 256B block size with drm minor >= 63 +- gallium/vl: Fix rotation with scaling for compute shaders +- gallium/vl: Fix mirror with rotation for compute shaders +- frontends/va: Don't ignore rotation and mirror for conversions to RGB +- ac/parse_ib: Fix parsing output format on VCN5 +- ac/parse_ib: Parse VCN DYNAMIC_REFLIST_BUFFER +- radv: Use radv_format_to_pipe_format instead of vk_format_to_pipe_format +- radv: Add radv_format_description to remap 10/12bit formats to 16bit +- radeonsi/vcn: Disable AV1 unidir compound with rate control +- egl/x11: Fix swap interval setup +- radv/video: Fix msg header total size +- radv/video: Fix encode session info for VCN3+ +- radv/video: Use ac_vcn_enc_init_cmds +- radv/video: Always enable B pictures for H264 encode +- radeonsi/vpe: Don't try to flush cs from buffer_map +- radeonsi/vcn: Don't try to flush cs from buffer_map +- radeonsi/uvd,vce: Don't try to flush cs from buffer_map +- radeonsi/video: Remove cs argument from si_video_resize_buffer +- radeonsi/vpe: Use float division to get scaling ratio +- radeonsi/vpe: Fix process_frame return value +- radeonsi/vpe: Use studio range for YUV and full for RGB by default +- radeonsi/vcn: Fix decode target index for H264 interlaced streams + +David Tobolik 
(1): + +- rusticl/feat: LinkOnce ODR + +Dmitry Baryshkov (4): + +- freedreno/registers: allow skipping the validation +- meson: add freedreno (turnip) Vulkan to arm64 defaults +- meson: disable SIMD blake optimisations on x32 host +- mesa-clc: add an option to force inclusion of OpenCL headers + +Dmitry Osipenko (1): + +- virtio/vpipe: Correct vdrm_vpipe_connect() definition + +Dudemanguy (1): + +- treewide: remove unneeded executable bit in non-scripting files + +Dylan Baker (6): + +- intel: output a depfile with mesa_clc +- iris: Correctly set NOS for geometry shader state changes +- iris: fix handling of GL_*_VERTEX_CONVENTION +- intel/tools: deduplicate zlib_inflate function +- intel/tools: move ascii85_decode to common code +- intel/decoder: free memory in error case + +Ella Stanforth (4): + +- v3d/compiler: Implement load_output +- v3d: enable framebuffer fetch +- v3d: Fix fbfetch with discards. +- v3d/compiler: Fixup output types for all 8 outputs + +Emma Anholt (6): + +- mesa/ffvs: Skip doing redundant stores of .xyz when doing lighting calculation. +- ci/anv: Enable testing with Vulkan video encode/decode. +- docs: Drop some weird unhelpful text about DRI2. +- egl: Apply autopep8. +- egl: Retire NOK_swap_region support. +- egl: Retire NV_post_sub_buffer support. 
+ +Emmanuel Gil Peyrot (1): + +- panvk: Initialize out array with the correct length + +Eric Engestrom (159): + +- VERSION: bump to 25.1 +- docs: reset new_features.txt +- docs: update calendar for 25.0.0-rc1 +- docs/android: drop libglapi.so now that it's gone +- ci/cuttlefish: drop \`rm libglapi.so` now that it's no longer loaded +- gfxstream: drop unnecessary semi-colons +- gfxstream: fix signedness of shifts +- gfxstream: drop dead variables +- gfxstream: use \`range` variable for its intended purpose +- gfxstream: mark unused variables as such +- docs: update calendar for 25.0.0-rc2 +- llvmpipe/tests: include math.h for INFINITY +- ci: don't run on tag pipelines +- ci: drop redundant condition +- ci: only trigger the CI for release managers when pushing to staging branch +- ci: run containers builds on staging branches +- ci/yaml-toml-shell-py-test: don't run on post-merge pipelines +- ci/yaml-toml-shell-py-test: run on direct push pipelines +- ci: finish sorting vars +- ci: rename generate-env.sh to export-gitlab-job-env-for-dut.sh +- docs: update calendar for 25.0.0-rc3 +- ci: debian-testing-ubsan is used by tests +- llvmpipe/ci: drop fraction for asan tests that takes 1.5 min without fraction +- docs: add release notes for 25.0.0 +- docs: add sha sum for 25.0.0 +- docs: update calendar for 25.0.0 +- docs: add release calendar for 25.0.x cycle +- docs/releasing: fix "release schedule" subsections nesting +- docs/release-calendar: add 25.1 branchpoint & rc dates +- ci/alpine: pin the release to avoid random unexpected changes +- ci/alpine: control wayland & wayland-protocols versions +- ci: move shader-db test job from build jobs yaml to test jobs yaml +- ci/build: remove a couple of unnecessary "override needs: to the same value" +- ci/build: move .use-debian/x86_64_build out of the generic .meson-build and into the debian/x86_64 jobs +- ci/build: split meson-build into build-for-tests and build-only +- ci/build: add explicit build-for-tests or build-only to 
all jobs +- ci/build: lower the delay to start build-only jobs +- docs: update gitlab docs urls +- vtn_bindgen2: keep the printf blob local to avoid LTO issues +- lavapipe/ci: add vkd3d job +- ci/build: build-test the dri2 code +- ci/b2c: fix comment location +- ci/b2c: explain better why we don't clone mesa +- ci/b2c: use B2C_JOB_TEMPLATE directly +- ci/b2c: set default value for B2C_MACHINE_REGISTRATION_IMAGE in the job +- ci/b2c: set default value for B2C_TELEGRAF_IMAGE in the job +- ci/b2c: set default value for B2C_KERNEL_CMDLINE_EXTRAS in the job +- ci/b2c: split B2C_JOB_VOLUME_EXCLUSIONS in the jinja template +- ci/b2c: rename IMAGE_UNDER_TEST to B2C_IMAGE_UNDER_TEST +- ci/b2c: pass through all the B2C_* variable without renaming them +- ci/b2c: pass through all the CI_* variables as well +- ci/b2c: set the registry proxy from the job +- ci/b2c: use more readable "long" argument names +- v3d/ci: mark traces humus/AmbientAperture and humus/DynamicBranching3 as flaky +- docs: update calendar for 25.0.1 +- docs: add release notes for 25.0.1 +- docs: add sha sum for 25.0.1 +- wsi/x11: drop misleading reference to anv in var names +- meson: simplify video-codecs option parsing +- ci/deqp: backport fix for dEQP-VK.binding_model.buffer_device_address.* +- meson: announce that clover is deprecated (slated for removal) +- lvp/ci: document fixed tests +- lvp/ci: skip tests that are timing out (>1 min) +- lvp/ci: fix sorting of flakes +- lvp/ci: remove duplicate flakes (noticed after sorting) +- lvp/ci: document flakes seen over the last week +- freedreno/ci: document fixed test +- freedreno/ci: document flakes seen over the last week +- nvk/ci: fix sorting of flakes +- nvk/ci: document flakes seen over the last week +- zink+nvk/ci: document new failures +- zink+nvk/ci: fix sorting of flakes +- zink+nvk/ci: document flakes seen over the last week +- radv/ci: document flakes seen over the last week +- zink+radv/ci: fix sorting of flakes +- zink+radv/ci: document flakes 
seen over the last week +- ci: document http proxy bug & disable farms relying on it +- ci/container: fix image tags comment in trampoline script +- ci/init-stage2: document that only lava jobs upload results to s3 +- ci/build: no need to list all the files that will go into the uploaded artifact +- ci/piglit: drop usage of s3cp for a simple download +- ci: always abort if the curl download fails +- ci/baremetal: make sure we can follow redirects on s3 downloads +- ci: do a regular GET request for /done files, instead of HEAD +- ci: replace broken s3cp command with a simple curl call +- ci: bump image tags +- ci: drop placeholder-job tags to allow jobs to run +- ci: document new llvmpipe & softpipe failures since the migration +- ci: run shader-db & zink-lvp on kvm runners +- ci: disable llvmpipe & virgl traces jobs +- meson: do not compile libisaspec unless used +- meson: do not compile libblake3 unless used +- meson: do not compile libmesa_util_sse41 unless used +- pick-ui: clean up formatting +- pick-ui: fix enum value in test expectation +- pick-ui: add missing field in test expectation +- pick-ui: rename s/out/commit_message/ variable to make its contents clearer +- pick-ui: fix parsing of multiple \`backport-to:` lines +- docs: update calendar for 25.0.2 +- docs: add release notes for 25.0.2 +- docs: add sha sum for 25.0.2 +- rpi/ci: disable traces jobs +- ci: re-enable igalia (rpi) farm +- ci: document who are the farm admins +- docs: fix last references to gallium-drivers=swrast +- ci: remove last uses of deprecated \`swrast` alias for softpipe+llvmpipe +- meson: drop deprecated \`swrast` alias for softpipe+llvmpipe +- meson: move special value \`all` out of the middle of the list +- rpi5/ci: sort flakes +- rpi5/ci: drop duplicate flakes +- ci: bump apitrace version +- ci: drop packet.net tag on git archive job +- ci: move aarch64 tag to .use-debian/arm64_build +- ci: move android's kvm requirement to .use-debian/x86_64_test-android +- ci: add 
FDO_RUNNER_JOB_PRIORITY_TAG_* to control priority of generic freedesktop runners +- docs: remove the last 24.3 releases +- virgl: fix typo inverting a condition +- docs: update calendar for 25.0.3 +- docs: add release notes for 25.0.3 +- docs: add sha sum for 25.0.3 +- ci/build: drop LTO from fedora build +- ci: rename ci-tron priority tag to avoid conflict with the generic fdo runners +- ci/fluster: use http proxy when checking for the vector files +- ci/fluster: don't overwrite FLUSTER_VECTORS_HOST_PATH to a different meaning +- radeonsi/ci: update expectations +- radv/ci: update expectations +- freedreno/ci: update expectations +- nvk/ci: update expectations +- lvp/ci: update expectations +- zink+nvk/ci: update expectations +- zink+radv/ci: update expectations +- zink+tu/ci: update expectations +- zink+anv/ci: document a bunch of flaky glx tests that have been preventing merges all day +- docs/ci: replace deprecated \`pages` job with \`pages: true` +- docs/ci: add link to the website preview for convenience +- docs/ci: add comment explaining what that long rule actually does +- docs/ci: split pre-merge & merge pipeline rules +- docs/ci: follow convention of only running jobs by default for Marge +- ci: fix image tags indentation +- ci/deqp: fix vulkan video build +- VERSION: bump for 25.1.0-rc1 +- .pick_status.json: Update to d5ad7981401c2393cac38fc3215e8bbb97de06b9 +- .pick_status.json: Update to 1bf8542490679fa2a244e27fb2b04bbc3d122f74 +- meson: remove duplicate \`deprecated` for \`gallium-xa` option +- meson: remove duplicate \`deprecated` for \`power8` option +- .pick_status.json: Update to 5f3a3740dcc6d243f2ef14138fb1c09bcbb9b5fd +- pick-ui: make \`Backport-to: 25.0` backport to 25.0 \*and more recent release branches* +- .pick_status.json: Update to dd3e1190a2bdbc6b996152510407adb9a8cb5618 +- pick-ui: add missing dependency +- [25.1 only] ci: don't treat misleading-indentation warnings are error on alpine +- VERSION: bump for 25.1.0-rc2 +- 
.pick_status.json: Update to 3493500abb78a4dc22aba14840bba5c777fde745 +- .pick_status.json: Update to eeffb4e674d10db9aefebeca91c2d87c1676b81e +- VERSION: bump for 25.1.0-rc3 +- .pick_status.json: Update to 615d0c9669595adf114a705f5b8ee88277aa99f2 +- .pick_status.json: Update to 7f0de1a51212881c9a7614327bf3e1fbc9784ddb +- ci: drop tracking of removed folder +- .pick_status.json: Update to c434050a0088ec3f07d63fd1019aea541632faed +- .pick_status.json: Update to 84b9c281fe82dd66f2552687cecb61a8e22809d0 + +Eric R. Smith (7): + +- panfrost: avoid potential divide by 0 calculating timer_resolution +- panfrost: fix YUV center information for 422 +- panfrost: fix backward propagation of values in loops +- panfrost: use an accessor function to read from bi_opcode_props +- panfrost: consider xfb shader when calculating thread local storage size +- panfrost,lima: use index size in panfrost minmax_cache +- panfrost: fix transaction elimination crc valid calculation + +Erico Nunes (2): + +- ci: re-enable lima farm +- panvk: disable VK_EXT_image_drm_format_modifier for arch < 10 + +Erik Faye-Lund (57): + +- panvk: fix line-rasterization of bifrost +- panvk: report strictLines as true +- panvk/ci: add back incorrectly removed crash +- pan/ci: add flaky tests to the flake-list +- pan/ci: add fail from llvm 19 upgrade +- pan/ci: add a couple of common flakes +- panvk: correct number of read bytes for dynamic buffers +- meson: rename meson_options.txt +- panvk: report passing the VK CTS +- panvk: rename helper +- mesa/main: wire up glapi bits for EXT_multi_draw_indirect +- pan/bi: use unreachable instead of DBG + assert +- pan/bi: remove unused debug output +- pan/genxml: rename field +- panfrost: respect pipe_rasterizer_state::line_rectangular +- panvk: disable shaderFloat16 on bifrost +- docs/features: add missing panvk feature +- docs/features: add VK_EXT_hdr_metadata +- panvk: fix extension requirement +- panvk: rework how we deal with extension-reqs +- panvk: expose 
VK_KHR_display +- panvk: expose EXT_display_control +- panvk: correct VkPhysicalDeviceProperties::deviceName +- panvk: enable KHR_line_rasterization support +- panvk: add basic driconf infrastructure +- panfrost: avoid accidental aliasing +- panfrost: fix overflow-debugging +- panfrost: use real array for panfrost_emit_plane +- panvk: check for texture-compression support +- panvk: expose textureCompressionBC when supported +- mesa/main: fix regression in extension-checking +- panvk/ci: disable some more slow tests +- docs/features: update panvk support +- panfrost: fixup typo in 16x sample-pattern +- panfrost: correct tile-buffer size for some v7 GPUs +- panvk/ci: move timeouts to crash +- panfrost: properly align value +- panfrost: allocate tile-buffer for dummy render-targets +- panfrost: disable tile-pipelining when needed +- panfrost: add color-attachment and msaa helpers +- panvk: enable 8x and 16x msaa when supported +- panvk: enable sampledImageIntegerSampleCounts for all MSAA formats +- nir/lower_tex: use texture_mask instead of shifting on use +- nir/lower_tex: avoid undefined-behavior +- panvk: set shared_addr_format +- panvk: enable KHR_spirv_1_4 on v10+ +- panvk: claim official conformance on v10 +- docs/panfrost: use anonymous hyperlinks +- panvk: enable dualSrcBlend +- docs/panvk: fixup extension support +- docs/panvk: remove disabled extension +- docs/panvk: fixup docs around float controls +- docs/panvk: add VK\_-prefix for extension name +- docs/panvk: document ycbr in terms of extensions +- docs/panvk: document EXT extension aswell +- docs/panvk: add missing new features +- panvk: support vulkan 1.2 on v10+ + +Ernst Persson (1): + +- intel/vulkan: Add bvh build dependency + +Faith Ekstrand (165): + +- nvk,nak: Only use u64 texture handles with codegen +- nvk: Only pass sampler handles when needed +- nak: Add support for bound and cbuf textures +- nak: Optimize bindless to cbuf textures on Volta+ +- nak: Fix cbuf textures +- nak: Stop setting .EF 
on tex ops +- nak: Rename MemEvictionPriority::Unchanged to LastUse +- nak: Add more MemEvictionPriorities +- nak: Print .dc for OpTld4::z_cmpr +- nak: Add MemEvictionPriorities to tex ops +- nvk: Fix scissor bounds +- nvk: Fix a typo in a comment +- nak/repair_ssa: Use a worklist for get_ssa_or_phi() +- nvk: Rename nvk_descriptor_set::mapped_ptr +- nvk: Respect VK_DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT_EXT +- nvk: Implement descriptorBufferPushDescriptors +- nvk: Pull shaders from the state command buffer in nvk_cmd_process_cmds() +- nvk: Handle shader==NULL in nvk_cmd_upload_qmd() +- nvk: Allow sparse loads on EDB buffers +- nak: Handle sparse texops with unused color destinations +- nvk: Use suld for EDB uniform texel buffers +- nvk: Align UBO/SSBO addresses down rather than up +- nak: Use suld.constant when ACCESS_CAN_REORDER is set +- nvk: Use suld.constant for EDB uniform texel buffers +- nvk: Constify instance and pdev pointers +- ci: Remove some NVK vkd3d fails +- nak: Only use suld.constant on Ampere+ +- nak: Use MemScope::GPU instead of MemScop::System +- zink: Use the correct array size for signal_values[] +- zink: Use persistent semaphores for PIPE_FD_TYPE_SYNCOBJ +- nvk/nvkmd: Fix logging of VA bind addresses +- nvk: Don't bind a fragment shading rate image pre-Turing +- nvk: Do not set INVALIDATE_SKED_CACHES pre-MaxwellB +- nak: Handle tex ops with only one source +- nak/nir: Don't provide dummy backend2 tex srcs +- nvk: Fix indentation in begin_end_query() +- nouveau/class_parser: Make strided element functions const +- nak/qmd: Drop some unnecessary .try_into().unwrap() +- nak/qmd: Add a nak_get_qmd_cbuf_desc_layout() helper +- nvk: Handle pre-Turing dispatch indirect commands +- nvk: Only support deviceGeneratedCommandsMultiDrawIndirectCount on Turing+ +- nvk: Only support compute shader derivatives on Turing+ +- nak/nir: Re-materialize load_const instructions in use blocks +- nvk/image: Prefer vk_image values over pCreateInfo +- nvk/image: Drop some 
unneeded initializers +- nvk: Fix capitalization of statistics +- nak: Fix NAK_DEBUG=spill for large FS outputs +- nak: Handle any->Mem parallel copies +- nak: Add a new ConstTracker struct +- nak: Don't spill/fill const values +- compiler/rust: Add u_printf_info to the rust bindings +- nir: Add a get_io_index_src() helper +- nir: Add a nir_opt_tex_skip_helpers optimization +- nak: Set .NODEP on tex ops based on nir_opt_tex_skip_helpers() +- zink: Don't present to Wayland surfaces asynchronously +- zink: Revert "zink: enable single-plane modifiers for generic 2D exports" +- egl/dri2: Rework get_wl_surface_proxy() +- egl/wayland: Pass the original wl_surface to kopper +- util/box: Add a intersect_2d helper +- iris: Use pipe_box helpers for damage calculations +- zink: Use pipe_box helpers for damage calculations +- vulkan: Add device address helpers to vk_buffer +- nvk: Use the new buffer device address infrastructure +- panvk: Use the new buffer device address infrastructure +- hk: Use the new buffer device address infrastructure +- vulkan/meta: Use vk_buffer.device_address directly +- zink: Set needs_barrier after transitioning to QUEUE_FAMILY_FOREIGN +- zink: Check queue families when binding image resources +- spirv: OpAsmTargetINTEL is untyped +- spirv: Update the JSON and headers +- vulkan: Update XML and headers to 1.4.309 +- nouveau/winsys: Stop asserting that imported BOs are aligned +- nvk: Allow rendering to linear images with unaligned strides +- nil: Relax alignment requirements for linear images +- nil: Split linear and tiled image creation +- nvk,nil: Stop panicing in image creation +- vtn: Support cooperative matrices in OpConstantNull +- loader/nouveau: Fix the comment in nouveau_zink_predicate() +- egl/x11: Re-order an if statement +- egl/kopper: Update the EGLSurface size after kopperSwapBuffers() +- nak: Insert the annotation in the right spot in assign_regs +- nak: Don't insert empty OpParCopy in assign_regs +- nak: Always copy sources when 
handling vec/pack/mov ops +- nak: Fix a SM check for OpPCnt +- nak: Check num_regs(UGPR) instead of SM version +- nak: Turing starts at SM73 +- nouveau/headers: Refactor class_parser +- nouveau/headers: Drop unused Rust constants +- nouveau/headers: Drop double-underscore from Rust names +- nouveau/headers: Re-use Rust method types when possible +- nvk: Reduce the size of nvk_image_view_capture +- nvk: Free owned_gart_mem correctly +- nvk: Fix a Volta check +- nvk: Disable VK_EXT_post_depth_coverage on Maxwell A and earlier +- nvk: Allocate QMDs from a heap on Maxwell A and earlier +- nvk: Disable VK_EXT_device_generated_commands on Maxwell A and earlier +- nvk: Don't set filterMinmax properties prior to Maxwell B +- nvk: Disable sparse buffer binding prior to Maxwell B +- nouveau/mme/fermi: Don't allow STATE and EMIT on the same op +- nvk: Use the right sample mask for 8x/4pass on Maxwell A +- nvk/nvkmd: Add a concept of incomplete pushes +- nvk: Mark the push before an indirect push as incomplete +- nak: hsetp2 and dsetp are slower on Volta +- nvk: Bump the conformance version to 1.4.1.3 +- vulkan/wsi: Signal buffer memory object when blitting +- venus: Assume wsi_mem->base_bo != NULL +- venus: Don't report global priorities if globalPriorityQuery is unsupported +- venus: Only claim modifiers in WSI if the host driver supports it +- venus: Set wsi_device::supports_scanout = false +- compiler/rust: Add a nir_alu_type wrapper +- compiler/rust: Add more NIR intrinsic getters +- nak: Implement nir_intrinsic_convert_alu_types +- nak/nir: Use correct rounding for fp64 -> fp16 conversions +- nak,nir: Generalize nak_nir_split_64bit_conversions and move it to NIR +- nak: Move some calc_instr_deps items to a new file +- nak: Box our RegTrackers +- nak: Improve WS abstractions in hw_runner +- nak: Add a QMD heap to hw_runner +- nak: Disable lea64 and f2fp.pack_ab tests pre-Volta +- nvk: Disable vulkanMemoryModel on Kepler and earlier +- nvk: Use max_image_dimension for 
maxFramebufferWidth/Height +- nvk: Disable 32k images on Pascal A +- nak: Move has_fixed_latency to Op +- nak: Add and use a ShaderModel::needs_scoreboard() helper +- nak: Add latency helpers to ShaderModel and use them +- nak: Move SM70 encoding and legalize to a separate file +- nak: Move exec_latency into the per-SM files +- nak: Move latency information into the per-SM files +- nak: Move sched_common.rs to reg_tracker.rs +- nak: Add GPU generation helpers +- nak: Use is_volta() instead of sm == 70 +- nak: Put the cycle count assert behind a debug flag +- nak: Handle delays > 15 +- nak: Add an Op::no_scoreboard() helper +- intel/compiler: Use nir_split_conversions() +- nak: Add a ChannelMask type +- nak: Add support for suld/st.b +- nak,nir: Add an image_load_raw_nv intrinsic +- nak: Use suld.b on Kepler if we have a format +- nak: Allow predicates in nir_intrinsic_as_uniform +- nak: Add a NAK_DEBUG=panic option +- nvk: Call vk_device_finish() last in nvk_DestroyDevice() +- nvk/nvkmd: Check the correct flag for the Kepler GART workaround +- nil: Multiply by array_stride_B instead of adding +- nak/hw_tests: Feed predicate/carry sources with 0/1 data +- nak: Add a plop2 test +- nak: Add False and True to IntCmpOp +- nak: Lower texture inputs for Kepler B +- nak/legalize: Add a helper for lowering ineg +- nak: Add stubs for Kepler B +- nak: Add stubs for Fermi and Kepler A +- nak: Move some legalization helpers from sm50 to common code +- nak/sm50: Add zero_reg() and true_reg() helpers +- nak/sm70: Add zero_reg() and true_reg() helpers +- nak: Get rid of RegRef::zero +- nvk: Disable VK_EXT_descriptor_buffer pre-Maxwell +- nak/qmd: Rework cbuf size suffix handling +- nak/qmd: Add support for shifted cbuf addresses +- nak/qmd: Add QMD version 4.0 for Hopper +- nvk: Handle shifted QMD cbuf addrs in indirect command processing +- nak/legalize: Take a RegFile in copy_alu_src_and_lower_fmod +- nak/legalize: Take a RegFile in copy_alu_src_and_lower_ineg() +- nak/sm70: Fix 
the bit74_75_ar_mod assert +- nvk: Maxwell+ is now conformant +- nak: Set lower_pack_64_4x16 + +Felix DeGrood (9): + +- vk/overlay-layer: fix regression in non-control pathway +- intel/brw: support for dumping shader line numbers +- anv: add INTEL_DEBUG=shaders-lineno +- iris: add INTEL_DEBUG=shaders-lineno +- drm-uapi: add eu_stall uapi +- intel/perf: remove unnused argument from xe_perf_stream_read_error +- intel/perf: add eu stall sampling support +- util: add hash functions for u64 data type +- intel/tools: create intel_monitor for sampling eu stalls + +GKraats (2): + +- x11: give error messages if Xorg only supports DRI2 and mesa only DRI3 +- EGL: legacy-x11=dri2 should support hardware driver + +Ganesh Belgur Ramachandra (1): + +- amd: use 128B compression for scanout images when drm.minor <63 + +Georg Lehmann (83): + +- nir/lower_poly_line_smooth: don't emit control flow +- nir/lower_poly_line_smooth: only smooth first color target +- nir/lower_poly_line_smooth: support partial store_output +- radv: remove radv_should_lower_poly_line_smooth +- radv: inline radv_nir_lower_poly_line_smooth +- nir/lower_poly_line_smooth: don't reject fp16 +- nir/lower_poly_line_smooth: use intrinsics_pass +- nir/opt_move: don't move into critical sections +- ac/nir/lower_ps: move exports after packing alu +- nir/print: print large floats as mantissa + exponent +- nir: range analysis for ffract +- nir: fix range analysis for frcp +- nir: fix frsq range analysis +- nir: improve fsqrt range analysis +- nir/opt_algebraic: optimize ffract(ffract(a)) +- nir/peephole_select: support demote for non CF HW +- nir/peephole_select: handle demote and terminate in nir_opt_collapse_if +- nir/peephole_select: don't special case nir_opt_collapse_if + limit = ~0 +- nir/peephole_select: don't include nir_search_helpers.h +- nir/peephole_select: add options struct +- nir/peephole_select: add option to allow discard without ~0 limit +- nir/peephole_select: don't completely ignore ifs with 
dont_flatten +- nir: replace nir_opt_conditional_discard with nir_opt_peephole_select +- radv: remove separate discard peephole select +- nir/opt_algebraic: optimize b2f(a != 0) * a +- nir/search_helpers: look through vecs in is_only_used_as_float +- nir/search_helpers: check tex source type in is_only_used_as_float +- nir/builder: add nir_shader_phi_pass +- nir/opt_phi_precision: use nir_shader_phi_pass +- nir/opt_remove_phis: use nir_shader_phi_pass +- aco/insert_exec: fix continue_or_break on gfx6-7 +- nir: add a pass to optimize phis to 1bit +- nir/opt_algebraic: optimize ineg(a) == ineg(b) +- nir/opt_algebraic: optimize ineg(a) == #b +- nir/opt_algebraic: 0 >= a -> 0 == a +- nir/opt_algebraic: optimize DXBC boolean bcsel +- nir/opt_algebraic: optimize more boolean bcsel with constants +- nir/opt_algebraic: optimize dxbc boolean not +- nir/opt_algebraic: optimize constant shift of DXBC booleans +- nir/opt_algebraic: optimize b2i(a) != -b2i(b) +- radv: use nir_opt_phi_to_bool +- nir/opt_varyings: clean up nir_progress usage +- radv/nir_apply_pipeline_layout: clean up progress handling +- radv/nir_lower_ray_queries: use nir_foreach_function_impl +- nir/opt_algebraic: optimize bit_count(a) != 0 +- nir/opt_algebraic: optimize bcsel of b2f and constants +- nir/opt_algebraic: optimize b2i/b2f comparision with non 0/1 constants +- nir/opt_algebraic: optimize ~a == ~b and ~a == #b +- nir/opt_algebraic: push comparisons with constants into bcsel with constant +- nir/opt_algebraic: optimize more ine/ieq(umin(b2i, ), 0) +- nir/opt_algebraic: optimize d3d a ? 
b : 0 +- aco/optimizer: delete combine_and_subbrev +- radv: remove outdated vectorize TODO +- ac/nir/mem_access_bit_sizes: split unaligned vec3 lds access to allow more read2/write2 +- aco/opt_postRA: split try_optimize_scc_nocompare in two functions +- aco/opt_postRA: allow try_optimize_scc_nocompare for all instructions +- aco/opt_postRA: remove scc != 0 with multiple uses +- aco/opt_postRA: remove scc == 0 for more opcodes +- aco/isel: use s_mul_i32 instead of s_cselect_b32 for a ? b : 0 +- radv: enable invariant geom for DOOM(2016) +- radv: add dcc_decompress_gfx11 in radv_graphics_state_key +- ac/nir/lower_ps_late: consider dcc decompression for null exports +- radv/gfx10+: remove null exports if discard isn't used +- aco: don't assume that v_interp_mov_f32 flushes denorms +- aco/gfx11.5: remove vinterp ddx/ddy path +- aco/validate: fix scalar source validation for DPP and gfx11+ VINTERP +- nir/opt_algebraic: create ubfe from (a & mask) >> c +- aco/ra: disallow vcc definitions for pseudo scalar trans instrs +- nir: add option to keep mul24_relaxed +- aco: implement mul24_relaxed +- ac/llvm: support mul24_relaxed +- ac/nir: set has_mul24_relaxed +- aco/insert_exec: reset exec temporary after combined p_demote + p_end_wqm +- spirv: clamp/sign-extend non 32bit ldexp exponents +- aco/gfx9+: use d16 global/scratch/buffer loads +- spirv: fix cooperative matrix by value function params +- aco/gfx10: simpler solution to avoid store instructions in clauses +- aco: form mixed MTBUF/MUBUF clauses +- nir/opt_algebraic: optimize open coded ffract +- nir/opt_algebraic: disable fsat(a + 1.0) opt if a can be NaN +- aco: set opsel_hi to 1 for WMMA +- aco/insert_exec: only restore wqm mask after control flow if necessary +- aco/insert_exec: reset temporary when recreating wqm mask from exact mask + +Gert Wollny (1): + +- r600/sfn: gather info and set lowering 64 bit after nir_lower_io + +Giovanni Mascellani (2): + +- llvmpipe: Remove an outdated comment about subclassing 
pipe_screen. +- lvp: Remove some dead code. + +Guilherme Gallo (30): + +- ci: Properly clean up rustup +- ci: Remove cargo symlink workaround +- ci/android: add missing pre/post build scripts +- ci: setup-test-env: Prefer functions over aliases +- ci: add _error_msg for internal messaging +- ci: add support for structural tagging +- ci: copy structural tag files to rootfs +- ci/angle: add structured tag check to ANGLE build time +- ci/angle: test-time structured tag checks +- ci/angle: condense angle variables in one job +- ci/angle: remove USE_ANGLE variable +- ci/docs: add structured tagging documentation +- ci/lava: Drop the repeating quotes on lava-test-case +- ci/lava: Propagate errors in SSH tests +- ci/lava: xtrace the lava_job_submitter call +- ci/lava: Add U-Boot action timeout for rockchip DUTs +- ci/lava: Properly detect VMWARE farm +- ci: Specify the FARM variable for DUT jobs +- ci: Simplify LAVA farm detection +- ci/lava: Remove depthcharge-start timeout +- ci/lava: Split boot action into deploy and boot +- ci/lava: Tweak timeouts +- ci/lava: Don't print empty lines when changing sections +- Revert "ci: setup-test-env: Prefer functions over aliases" +- ci/bin: update_tag: improve tag load +- ci/update_tag: fix linter errors +- ci/lava: Fix LAVA lima jobs +- ci/lava: Fix LAVA lima jobs +- ci/lava: Fetch kernel modules from overlay +- ci: Add some unit tests for the duration field + +Gurchetan Singh (5): + +- gallium: drop const qualifier on return type +- lavapipe: use quotes instead of angle bracket +- gfxstream: check device exists before using it +- gfxstream: refactor device initialization +- gfxstream: follow the semantics desired by distro VK loader + +Hans-Kristian Arntzen (3): + +- radv: Always allow sparse on normal GFX/COMPUTE/DMA queues. +- radv: Repurpose radv_legacy_sparse_binding drirc +- radv: Always set 0 dispatch offset for indirect CS. 
+ +Hyunjun Ko (12): + +- anv: Fix to set CDEF flter flag correctly for AV1 decoding +- anv/video: clean-up duplicated code. +- dri: fix a build error +- kopper: implement to get sync values. +- anv: fix maxDpbSlots and maxActiveReferencePictures for AV1 decoding. +- anv: Add one more flag of VideoCapability for encoding. +- anv: Do not support the tiling of DRM modifier if DECODE_DST +- anv/ci: remove some expected failures of dEQP-VK.video.formats.* +- vulkan/video: Do byte-alignment when building a h264 slice header +- anv: Add stdSyntaxFlag values for h264/5 encoders +- anv: Move rateControlMode to the video session. +- anv: Use vk_video_derive_h265_scaling_list + +Iago Toral Quiroga (9): + +- v3dv: implement sync debug option +- v3dv: serialize jobs after any barrier when debug sync is set +- v3dv: fix missing access bit flag when checking for texel buffer reads +- mesa: fix RGBA_SIGNED_COMPONENTS for lowered signed luminance +- v3dv: fix crash on 32-bit builds +- v3dv: rename v3dv_cmd_buffer_merge_barrier_state +- v3dv: make cmd_buffer_serialize_job_if_needed take a barrier state +- v3dv: improve handling of trailing barriers +- pan/va: fix FAU validation + +Ian Romanick (38): + +- iris: Add missing nir_metadata_preserve in iris_lower_storage_image_derefs +- crocus: Add missing nir_metadata_preserve in crocus_lower_storage_image_derefs +- iris: Use nir_shader_intrinsics_pass in iris_lower_storage_image_derefs +- crocus: Use nir_shader_intrinsics_pass in crocus_lower_storage_image_derefs +- brw/copy: Fix handling of offset in extract_imm +- brw/copy: Use extract_imm in try_constant_propagate_value +- brw/copy: Allow constant propagation of some 64-bit integers +- nir/algebraic: More (a == 0 || a == 1 || ...) 
patterns +- nir/algebraic: Optimize zero comparisons of umax or umin +- nir/algebraic: Simplify equality comparisons of b2T with 1 or 0 +- nir/algebraic: Undistribute b2i from logic-ops +- brw/print: Don't let SHADER_OPCODE_FLOW affect indentation +- brw: Fix typo in comment +- brw/nir: Lower fsign again after last call to brw_nir_optimize +- brw/opt: Move non-SSA register accounting after first brw_opt_split_virtual_grfs +- brw: Add basic infrastructure for load_reg pseudo op +- brw/copy: Prepare copy_propagation for load_reg +- brw/coalesce: Prepare brw_opt_register_coalesce for load_reg +- brw/algebraic: Constant folding for BROADCAST and SHUFFLE +- brw: Add passes to generate and lower load_reg +- brw/sat: Convert tests to use load_reg +- brw/sat: Eliminate non-defs saturate propagation +- brw/opt: Don't call brw_opt_copy_propagation before brw_lower_load_reg +- brw/nir: Fix source handling of nir_intrinsic_load_barycentric_at_offset +- brw/nir: Eliminate default parameter to get_nir_src +- brw/algebraic: Optimize derivative of convergent value +- brw/copy: Refactor source modifier type checking +- brw/copy: Copy prop -X into Y&1 +- brw/nir: Optimize b2f(not(X)) using logical operations instead of arithmetic +- brw/nir: Allow b2f(not(X)) optimization on Gfx12.5+ +- brw/nir: Use offset() for all uses of offs in emit_pixel_interpolater_alu_at_offset +- nir/algebraic: Allow fmin(a,a) optimization when flush denorm to zero is not set +- brw/algebraic: Clear condition modifier on optimized SEL instruction +- brw/algebraic: Don't optimize float SEL.CMOD to MOV +- elk/algebraic: Clear condition modifier on optimized SEL instruction +- elk/algebraic: Don't optimize float SEL.CMOD to MOV +- brw/cmod: Fix some errors when propagating from CMP to ADD.SAT +- brw/cmod: Don't propagate from CMP to possible Inf + (-Inf) + +Ivan A. 
Melnikov (1): + +- gallium/radeon: Make sure radeonsi PCI IDs are also included + +Ivan Avdeev (2): + +- radv: add experimental support for AMD BC-250 board +- radv,radeonsi: disable compute queue for BC250 + +Iván Briano (2): + +- anv: handle REMAINING_LAYERS in host image copy cases +- brw: make HALT instruction act as barrier in new CSE pass + +James Hogan (8): + +- glsl: Expose gl_ViewID_OVR back to GLSL 1.30 +- mesa: Fix multiview attachment completeness check +- mesa: Fix FramebufferTextureMultiviewOVR num_views check +- mesa: Consider NumViews to reuse FBO attachments +- mesa: Handle GL_FRAMEBUFFER_INCOMPLETE_VIEW_TARGETS_OVR +- mesa: Check views don't exceed GL_MAX_ARRAY_TEXTURE_LAYERS +- mesa: OVR_multiview framebuffer attachment parameters +- mesa: Handle getting GL_MAX_VIEWS_OVR + +Jan Alexander Steffens (heftig) (1): + +- gfxstream: Use proper log format for 32-bit Vulkan + +Janne Grunau (8): + +- hk: Replace alloca with malloc in queue_submit +- hk: Use rowPitch from VkImageDrmFormatModifierExplicitCreateInfoEXT +- venus: Do not use instance pointer before NULL check +- venus: virtgpu: Require stable wire format +- asahi: build asahi_clc for -Dtools=asahi +- asahi: Drop unnecessary idep_mesaclc dependency +- panfrost: build panfrost_compile for -Dtools=panfrost +- ci: Switch cross-builds to '-D tools=panfrost' + +Jason Macnak (6): + +- gfxstream: Move snapshot decoder replay into VkDecoderGlobalState +- gfxstream: Remove unused handling mappers +- gfxstream: Move the handle replay buffer into BoxedHandleManager +- gfxstream: Remove duplicated boxed handle func declarations +- gfxstream: Update variable names to avoid -Wshadow error +- gfxstream: Remove extra dispatch variable + +Jeongik Cha (1): + +- gfxstream: Add AHARDWAREBUFFER_FORMAT_B8G8R8A8_UNORM in android_format_is_yuv + +Jesse Natalie (2): + +- meson: Enable /Zc:preprocessor for MSVC +- CI/Windows: Update container deps + +Job Noorman (46): + +- freedreno/drm-shim: enable raytracing +- ir3: 
fix emitting descriptor prefetches at end of preamble +- ir3: add braces around complex if/else block +- ir3/ra: handle phis with preferred regs first +- ir3/parser: add helper to generate syntax errors based on gen +- ir3/isa: fix (dis)asm of ldg.a/stg.a on a6xx +- ir3: don't create SRC2 for isam without .v +- ir3/legalize: use (sy) for ray_intersection WAR hazards +- ir3/lower_tess: make all NIR passes report progress +- ir3: don't use deprecated NIR_PASS_V anymore +- ir3: reformat after previous commit +- ir3/opt_prefetch_descriptors: fix crash after nir_progress rewrite +- ir3: add reformatting commits to .git-blame-ignore-revs +- nir/lower_phis_to_scalar: remove unused mem_ctx +- nir/lower_phis_to_scalar: use nir_builder API where possible +- nir/lower_phis_to_scalar: don't create moves for undef sources +- nir/lower_subgroups: use build_cluster_mask for quad mask +- ir3/ra: prevent reusing parent interval of reloaded sources +- ir3: clear instruction uses when cloned +- ir3/sched: unblock a0.x/a1.x after last use +- ir3: add ir3_cursor_current_block helper +- ir3/cse: add support for mov a0.x/a1.x +- ir3: remove hash table for a1.x +- ir3: add helper to create STC +- ir3: fix false dependencies of rpt instructions +- ir3/sched: handle dependencies between stc and const reads +- ir3: split immediate state from rest of const state +- ir3: make const_imm_index_to_reg helper public +- ir3: fix max const size calculation for the binning pass +- ir3: lower immediates to const regs in preamble on a7xx +- ir3: keep inputs at start block when creating empty preamble +- ir3/legalize: fix off-by-one error in kill_sched +- ir3/legalize: take wrmask into account for delay updates +- ir3: don't sync every TCS/GEOM block +- ir3: run opt_if after opt_vectorize +- ir3: make shpe a terminator +- ir3/ra: assign interval offsets to new defs after shared RA +- ir3: add ir3_aggressive_coalesce helper +- ir3/ra: create merge sets for splits/collects inserted for shared RA +- 
ir3/opt_preamble: take alias.rt into account for rewrite cost +- ir3: remove spaces in shader stats +- ir3/cp: add option to disable immediate to const lowering +- ir3/cp: ignore alias sources for sam.s2en +- ir3: run cp after ir3_imm_const_to_preamble +- ir3/ra: add helper for getting a dst interval +- ir3/ra: ignore phis handled by shared RA + +John Anthony (2): + +- panvk: Avoid division by zero for vkCmdCopyQueryPoolResults +- panvk: Enable VK_EXT_direct_mode_display + +Jon Hunter (1): + +- freedreno/registers: Fix gen_header.py for older python3 versions + +Jordan Justen (5): + +- intel/dev: Add BMG PCI IDs (0xe210, 0xe215, 0xe216) +- intel/dev: Stop checking hwconfig values at driver runtime +- tools/intel_dev_info: Print hwconfig discrepancies +- intel/dev: Ignore hwconfig difference due to WA 18040209780 +- intel/dev: Add BMG 0xe211 PCI ID + +Jose Fonseca (1): + +- glapi: Make _GLAPI_EXPORT a no-op on Windows. + +Jose Maria Casanova Crespo (3): + +- v3dv/ci: add new flakes +- glapi: import noop_array and public stubs earlier. 
+- v3dv: avoid TFU reading unmapped pages beyond the end of the buffers + +José Roberto de Souza (20): + +- intel: Initialize upper 32bits of drm_xe_sync.handle +- intel/dev: Improve max_cs_threads documentation +- intel/dev: Call intel_device_info_update_after_hwconfig() from common code +- intel/common: Retry GEM_CONTEXT_CREATE when PXP have not finished initialization +- anv: Remove protected memory support from compute queue +- intel: Sync xe_drm.h +- anv: Move code adding protected memory type to common code +- anv: Add support to create protected bo and protected exec_queue in Xe KMD +- iris: Add support to create protected bo and protected exec_queue in Xe KMD +- intel: Add function to check if PXP is supported in Xe KMD +- iris: Replace BO_ALLOC_* macros by a enum +- intel/hwconfig: Sync hwconfig with IGT +- intel/hwconfig: Remove ignored intel_hwconfigs from apply_hwconfig_item() +- intel/dev/xe3: Set max_slices and max_subslices_per_slice using hwconfig +- intel/perf: Update intel_perf to match xe_drm.h +- drm-uapi: Sync xe_drm.h +- intel: Program XY_FAST_COLOR_BLT::Destination Mocs for gfx12 +- intel: Fix the MOCS values in XY_FAST_COLOR_BLT for Xe2+ +- intel: Fix the MOCS values in XY_BLOCK_COPY_BLT for Xe2+ +- intel/tools: Fix batch buffer decoder + +Juan A. 
Suarez Romero (31): + +- broadcom/compiler: move stores to the end of shader +- Revert "st/mesa: move VS & TES output stores to the end before unlowering IO" +- broadcom/ci: add new failures/flakes +- v3dv: take into account GS when enabling line smooth +- v3dv/ci: disable rpi5 job +- vulkan: don't leak debug utils label name +- v3dv: duplicate key for texel_buffer cache +- vc4/ci: update expected results +- broadcom/simulator: use string copy instead of memcpy +- vc4/ci: update expected results +- v3dv/ci: Skip tests causing OOM +- Revert "v3dv/ci: disable rpi5 job" +- v3d/v3dv/vc4: review all expected timeouts +- v3dv: remove src_format from blit render pass creation +- v3dv: don't batch regions with different depth offsets +- v3dv: include depth offset on image view creation +- vc4: check instruction before setting flags +- v3dv: asserts struct is always non null +- v3dv: check dynamic offset output +- v3dv: asserts push constants data is valid +- vc4: initialize register +- vc4: add assertion on constant_fold +- vc4: assert there are sources when emitting texture +- broadcom/cle: assert attribute has a value +- vc4: use safe iterator to remove instructions +- broadcom/compiler: use safe iterator to remove instructions +- broadcom/compiler: don't use VLA on emit alu +- broadcom/compiler: initialize register +- v3dv: don't check if DRM device is master +- v3d(v)/ci: update expected results +- ci: include duration in the CustomLogger + +Julia Zhang (5): + +- vulkan: handle device memory report requests +- radv: add import and export handle_type in radv_alloc_memory +- radv: add obj_id to radeon_winsys_bo +- radv: emit device memory report for device memory events +- radv: advertise VK_EXT_device_memory_report + +Jung-uk Kim (1): + +- FreeBSD: Disable support for "-mtls-dialect" for FreeBSD + +Juston Li (3): + +- anv: xe: fully initialize drm_xe_sync addr/handle union +- iris: xe: fully initialize drm_xe_sync addr/handle union +- wsi/common: android: disable 
KHR_present_[wait/id] + +K900 (1): + +- meson: support building with system libgbm + +Karmjit Mahil (3): + +- loader/wayland: Fix missing timespec.h include +- tu: Fix Perfetto build error with vk_buffer +- tu: Fix segfault in fail_submit KGSL path + +Karol Herbst (49): + +- ci/windows: Bump Vulkan SDK for SPIRV-Tools +- clc: use SetUseHighestVersion when linking spirvs +- mesa_clc: drop spirv version workaround +- rusticl/mem: set bind flags for gl imports +- rusticl/mesa: add PipeContext::device_reset_status +- rusticl/queue: check device error status +- clc: bump SPIR-V target to 1.6 +- rusticl/kernel: call nir_lower_variable_initializers earlier +- rusticl: support SPIR-V 1.5 and 1.6 +- rusticl/mem: do not apply offset with in copy_image_to_buffer +- rusticl/mesa: add buffer and texture variant for resource_copy_region +- rusticl/mem: Buffer::copy_to_image layering +- rusticl/mem: Image::copy_to_buffer layering +- rusticl/mem: Image::copy_to_image layering +- rusticl/mem: Image::write layering +- rusticl/mem: accelerate Buffer::copy_rect +- rusticl/mem: accelerate Buffer::write_rect +- rusticl/mem: set num_samples and num_mip_levels to 0 when importing from GL +- rusticl/platform: advertise all extensions supported by all devices +- rusticl/util: add missing comment and assert to char_arr_to_cstr +- intel/brw, lp: enable lower_pack_64_4x16 +- nir: Do not eliminate dead writes to shared memory in called functions. 
+- rusticl/program: implement CL_INVALID_PROGRAM_EXECUTABLE check in clGetProgramInfo +- rusticl/program: pass options by reference +- rusticl/program: loop over all devices inside Program::build +- rusticl/program: rework build_nirs so it only touches devices we care about +- rusticl/program: fix building kernels +- rusticl/program: simplify active_kernels check +- rusticl/kernel: rename CSOWrapper to SharedCSOWrapper +- rusticl/queue: make it unncessary to keep QueueContext Send +- rusticl/queue: cache bound CSO +- rusticl/mesa: remove Sync from PipeContext +- nir/serialize: fix decoding of is_return and is_uniform +- vtn: Support the UniformDecoration capability. +- zink: don't apply the map_offset when mapping a staging resource in zink_buffer_map +- iris: remove all clover support code +- freedreno: remove all clover support code +- llvmpipe: remove all clover support code +- gallium: stop filling ir_target in various drivers +- gallium: stop using PIPE_BIND_COMPUTE_RESOURCE in drivers +- gallium: stop implementing set_compute_resources in various drivers +- nouveau: ignore req_input_mem +- rusticl/device: fix panic when disabling 3D image write support +- nir_lower_mem_access_bit_sizes: fix negative chunk offsets +- nak: fix handling of delays > 15 +- r600: fix r600_buffer_from_user_memory for rusticl +- iris: parse global bindings for every gen +- iris/xe: fix compute shader start address +- iris/xe: take the grids variable_shared_mem into account + +Kenneth Graunke (37): + +- brw: Drop unused defines +- brw: Eliminate fs_inst::shadow_compare +- brw: Replace fs_inst::pi_noperspective with a logical control source +- brw: Drop FB_WRITE_LOGICAL_SRC_DST_DEPTH source +- brw: Replace fs_inst::last_rt with a logical control source +- brw: Replace fs_inst::target field with logical FB read/write sources +- brw: Use correct builder size for MEMORY_FENCE/INTERLOCK virtual opcodes +- brw: Change destination of memory fences to UD type +- brw: Eliminate the BTI source 
from MEMORY_FENCE/INTERLOCK opcodes +- brw: Add latencies for HDC/RC memory fences +- brw: Lower MEMORY_FENCE and INTERLOCK in lower_logical_sends +- brw: Drop INTERPOLATE_AT mlen handling from size_read() +- brw: Drop unnecessary mlen/header_size on virtual GET_BUFFER_SIZE op +- nir: Eliminate dead writes to shared memory at the end of the program +- brw: Rename shared function enums for clarity +- isl: Delete redundant "use separate stencil?" check +- isl: Drop compile time "use separate stencil" checks. +- intel: Delete devinfo->must_use_separate_stencil +- intel: Delete devinfo->has_surface_tile_offset +- intel: Move devinfo->has_negative_rhw_bug into the elk compiler +- intel: Move devinfo->has_compr4 into the elk compiler +- intel/dev: Set minimum HS URB entries to 0. +- intel/dev: Set max_wm_threads to 0 in the Gfx9+ devinfo structs +- intel/dev: Rework device info macros for Gfx8+ +- intel/dev: Set a higher minimum number of URB entries for GS +- intel: Use devinfo->urb.min_entries[GS and TCS] for setting URB configs +- intel: Move unlit centroid workaround into the elk compiler +- intel/decoder: Decode compute shaders in EXECUTE_INDIRECT_DISPATCH +- brw: Make a helper to emit UNDEF for temporaries containing small types +- brw: Emit UNDEF as needed in SSA-style builder helpers +- brw: Skip unnecessary UNDEFs for comparisons +- brw: Use a smaller type for masked sub-32-bit shift values +- brw: Avoid regioning restrictions for u2u16/i2i16 narrowing conversions +- brw: Track the largest VGRF size in liveness analysis +- brw: Use live->max_vgrf_size in register coalescing +- brw: Use live->max_vgrf_size in pre-RA scheduling +- brw: Don't assert about MAX_VGRF_SIZE in brw_opt_split_virtual_grfs() + +Kenny Levinsen (1): + +- device-select: Support linux-dmabuf feedback + +Kevin Chuang (2): + +- anv/bvh: Fix encoder handling sparse buffer +- anv/bvh: Fix copy shader handling sparse buffer + +Konstantin (1): + +- nir/tests: Do not rely on __LINE__ + +Konstantin 
Seurer (68): + +- nir: Stop using instructions for debug info +- spirv: Handle NonSemantic.Shader.DebugInfo.100 +- nir: Add variable debug info to instructions +- nir/lower_vars_to_ssa: Annotate defs with variable names +- vulkan: Stop using strings for BVH build pipeline keys +- vulkan/meta: Remove object types from vk_meta_object_key_type +- vulkan/meta: Stop using strings for meta keys +- hk: Stop using strings or common key types for meta keys +- radv/meta: Stop using strings for meta keys +- lavapipe: Fix maintainance7 descriptor set limits +- vulkan/cmd_queue: Simplify freeing cmd_queue entries +- vulkan/cmd_queue: Add VK_CMD_TYPE_COUNT +- vulkan/radix_sort: Stop force-unrolling loops +- gallivm: Remove loop limiting +- lavapipe: Implement some functions required by the common BVH framework +- lavapipe: Use the common BVH framework +- radv: Optimize fs builtins using static gfx state +- gallivm: Split nir prepasses into aos/soa +- gallivm/nir/aos: Remove the dependency on lp_bld_nir.c +- gallivm/nir/soa: Remove the dependency on lp_bld_nir.c +- gallivm/nir/soa: Lower bools to i1 +- gallivm/nir/soa: Implement robusst uniform loads without controlflow +- gallivm/nir/soa: Select more IO to gather/scatter intrinsics +- lavapipe: Move nir passes to a new directory +- lavapipe: Lower descriptor sets in NIR +- lavapipe: Initialize the compiler options of the noop fs +- llvmpipe: Do not use coroutines when they are unnecessary +- nir: Rename in-bounds-agx to in-bounds +- nir: Do not emit amul if it is unsupported +- lavapipe: Optimize buffer robustness +- gallivm/nir/soa: Do not lower vectors to llvm arrays +- nir/divergence_analysis: Handle load_const_buf_base_addr_lvp +- gallivm/nir/soa: Use divergence analysis +- lavapipe: Lower push constants in NIR +- gallivm: Only guard tex/image ops if the exec mask can be zero +- gallivm/nir/soa: Skip bounds checking for in-bounds access +- gallivm/nir/soa: Properly skip empty else branches +- lavapipe: Remove uniform 
inlining +- vulkan: Add utilities for triggering renderdoc captures +- radv: Lower ray query vars to structs +- radv: Implement multidimensional ray query arrays +- llvmpipe: Skip draw_mesh if the ms did not write gl_Position +- nir: Test nir_minimize_call_live_states +- nir/sweep: Fix handling instructions with debug info +- nir/print: Do not print debug information when gathering it +- gallivm: Create a debug builder and add GALLIVM_DEBUG=symbols +- llvmpipe: Annotate functions with debug information +- gallivm: Handle nir_instr_debug_info +- gallivm: Emit debug info for definitions +- gallivm: Add a debug variable for the exec mask +- gallivm: Run nir_lower_load_const_to_scalar +- lavapipe: Enable debug information if GALLIVM_DEBUG=symbols is set +- radv/meta: Change the return type of get_r32g32b32_format to VkFormat +- ci: Do not build hk on alpine +- asahi: Only require IOKit for tooling +- hk: Fix building without the gallium driver +- spirv: Emit code for NonSemantic.DebugPrintf if supported +- nir: Turn the format string index into a const index +- radv: Handle nir_intrinsic_printf +- clc: Print errors when initializing clang fails +- clc: Allow bitfields +- clc,libcl: Clean up CL includes +- radv: Fix rayTracingPositionFetch with multiple geometies +- lavapipe: Prefetch 56 bytes of node data during ray traversal +- lavapipe: Run nir optimizations on ray tracing pipelines +- lavapipe: pre-load tmax +- lavapipe: Do not emit aabb handling if no isec shader is used +- radv: Return VK_ERROR_INCOMPATIBLE_DRIVER for unsupported devices + +Lakshman Chandu Kondreddy (1): + +- freedreno: Add support for Adreno623 GPU + +Lars-Ivar Hesselberg Simonsen (9): + +- panfrost: Do not evaluate_per_sample for non-MSAA +- Revert "panfrost: remove is_blit flag" +- Revert "panfrost: fix hang by using MALI_PIXEL_KILL_WEAK_EARLY in color preload" +- panvk: Set missing shader_modifies_coverage flag +- panfrost: Use RUN_COMPUTE over RUN_COMPUTE_INDIRECT +- panvk: Use RUN_COMPUTE 
over RUN_COMPUTE_INDIRECT +- vulkan/wsi/wayland: Avoid duplicate colorspace entry +- panvk: Add barrier for interleaved ZS copy cmds +- vk/sync: Fix execution only barriers + +Leder, Brendan Steve (1): + +- amd/vpelib: Reformat index variables and update enum + +Leonard Göhrs (2): + +- etnaviv/ci: add pengutronix LAVA lab with one i.MX8MP device +- ci: re-enable pengutronix farm + +LingMan (2): + +- meson: Update pest subproject family +- meson: Sync subproject version numbers in \`packagefiles` with their \`.wrap` equivalents + +Lionel Landwerlin (137): + +- anv: fixup missing compiler dependency on tests +- intel: move internal shader compile to vtn_bindgen2 +- compiler: drop vtn_bindgen +- intel_clc: remove NIR output support +- spirv: remove spirv_library_to_nir_builder +- brw: fixup scoreboarding for find_live_channels +- anv: reuse helper for compute push constants +- anv: increase general state pool +- anv: track the first 2MB of unused VA +- anv: use heap size to program generate state heap +- anv: add a helper for getting gfx push constant addresses +- brw/anv: rework push constants for mesh/task shaders +- brw: enable A64 pulling of push constants +- anv: use A64 messages for push constants loads on Gfx12.5+ +- nir: add a high precision conversion unorm->float +- anv: add source hashes for BVH building shaders +- vulkan/wsi: propagate protected swapchain to images +- vulkan: allow support for protected surfaces +- anv: support protected surfaces with display platform +- anv,driconf: Add sampler coordinate precision workaround for Dynasty Warriors +- genxml: make component packing an array +- genxml: add convenience dwords for packing components +- brw: fix indentation +- brw: remove nr_attribute_slots from vs_prog_data +- brw: port vs input to lower_64bit_to_32_new +- brw: update vulkan max attribute limit +- brw: add a max HW vertices attribute limit +- brw: enable vertex fetching component packing +- blorp: emit 3DSTATE_VF +- anv: disable VF statistics 
for memcpy +- anv: enable vertex fetching component packing +- anv: ensure Wa_16012775297 interacts correctly with Wa_18020335297 +- brw: use meaningful io locations for system values +- brw: add support for no VF input slot compaction +- brw: add documentation about slot compaction & component packing +- brw: fix component packing starting index +- anv: move RT stage bits to main header +- anv: move reg_mask push constant field to gfx +- anv: hold a prepacked COMPUTE_WALKER instruction on CS pipelines +- anv: make gfx state flushing available externally +- anv: make compute state flush helper visible +- runtime: sort push constant layouts +- anv: avoid memory type changes with INTEL_DEBUG=noccs +- anv/ci/adl: update fail expectation for video +- anv: fixup compute walker storage length +- nir: add options to lower only some image atomics to global +- brw: factor out base prog_data setting +- brw: store source_hash in prog_data +- anv: switch to use brw's prog_data source_hash +- isl: report tiling address swizzles +- isl: add usage for software detiling +- isl: centralize supported tilings in a single function +- isl: select a tiling for shader detiling +- isl: add support for R64 storage image lowering +- isl: add a helper to report what dimensions a tiling supports +- nir: track lowered image intrinsics to globals +- brw: include UGM fence when TGM + lowered image->global +- brw: add support for 64bit storage images load/store +- brw: add support for texel address lowering +- anv: rename compressed format emulation helpers +- anv: add mapping for VBO formats in format mapping +- anv: add a is_sparse for image format support checks +- u_trace: pass tracepoint flags to the read_timestamp callback +- intel/ds: rework RT tracepoints +- anv: fix missing 3DSTATE_PS:Kernel0MaximumPolysperThread programming +- brw: optimize load payload with immediate headers +- brw: avoid setting up the sampler header bits when unused +- spirv: fix racy build +- vulkan/runtime: ensure 
robustness state is fully initialized +- vulkan/runtime: pass robustness state to preprocess vfunc +- vulkan/runtime: add a multialloc vk_shader allocator +- vulkan/runtime: store flags on descriptor set layouts +- anv/iris: centralize TBIMR drirc +- iris: remove duplicate TessellationDistributionMode programming +- anv/iris: add drirc keys to disable VF/TE distribution +- anv/apply_layout: split binding table building into its own function +- intel/genxml: add a genX RT include header +- genxml: simplify genX_rt_pack.h +- brw: make intel_shader_enums.h opencl importable +- anv/brw: move INTEL_MSAA_* flag computation to the compiler +- anv: break down Wa_16014912113 in need/apply parts +- anv: fixup indentation around Wa_16014912113 +- brw: avoid calling lower_indirect_derefs multiple times +- anv: fill runtime buffer device_address field +- anv: move index buffer entry point out of genX code +- anv: move vertex buffer storage to 64bit address + mocs +- anv: move xfb buffer storage to 64bit address + mocs +- anv: track protection on anv_address +- anv: use addresses for buffer<->image copies +- anv: simplify internal blorp helper +- anv: fix non page aligned descriptor bindings on color copy restriction +- vulkan: add helper for color/depth-stencil capable formats +- anv: relax depth/stencil<->color copy restrictions +- anv: fix end of pipe timestamp query writes +- anv: disable replication when we don't have both VS/FS stages +- brw: always write the VUE header +- anv: limit implict write with drirc +- nir: add support for lowering non uniform texture offsets +- elk: stop using intel_nir_lower_texture +- brw: don't lower tg4 offsets without LOD +- intel: move lower_texture to brw +- brw: move texture offset packing to NIR +- anv: enable non uniform texture offset lowering +- anv: wire VkAccessFlagBits3KHR flags in internal helpers +- anv: expose VK_KHR_maintenance8 support +- intel/genxml: remove ISA fields +- intel/genxml: add more engine tagging on instructions 
+- intel/genxml: fixup engine filtering +- intel/genxml: define post-sync operations for MI_FLUSH_DW +- intel/genxml: add MI_FLUSH_DW to blitter engine +- anv/genxml: use special genX video pack files +- anv/hasvk: consider timeline semaphore support stable +- docs: remove unused env variable +- anv/hasvk: sort out debug options +- anv: consolidate environment variables +- anv: add shader-hash debug option +- brw: fix shuffle with scalar/uniform index +- anv: relax restriction on variable count descriptors +- anv: fix self dependency computation +- brw: fix Wa_22013689345 emission +- iris: update Wa_1607156449 to use WA infrastructure +- anv: update Wa_1607156449 to use WA infrastructure +- intel/dev: remove ADLN references +- anv: remove ALWAYS_INLINE from globally visible functions +- anv/iris: implement Wa_18040903259 +- anv: use companion batch for operations with HIZ/STC_CCS destination +- anv: force fragment shader execution when occlusion queries are active +- intel: fix null render target setup logic +- brw: add pre ray trace intrinsic moves + +Lorenzo Rossi (8): + +- nvk: fix preprocess buffer alignment +- nvk: Fix MSAA sparse residency lowering crash +- nir: support shared atomics in nir_lower_atomics +- nvk, nak: Implement shaderSharedInt64Atomics +- nak: Fold bool-int-bool conversions +- nak: Flatten AttrAccess into instructions +- nak: Fix SM50 rounding-mode encoding edge-case +- nak: Simplify shl64 lowering on Maxwell + +Loïc Minier (1): + +- freedreno: check if GPU supported in fd_pipe_new2 + +Loïc Molinari (5): + +- perfetto: Let MESA_TRACE_FUNC() take printf-like format arguments +- panfrost: Add CPU traces +- pan/kmod: Add drmIoctl() wrapper pan_kmod_ioctl() with CPU trace +- docs: Add Panfrost to the list of drivers with CPU traces +- mesa: Add CPU traces + +Lucas Fryzek (4): + +- anv: Release correct bo for RT scratch +- vulkan/runtime: Add object type to DMR API +- anv: Implement VK_EXT_device_memory_report +- anv: Expose 
VK_EXT_device_memory_report + +Lucas Stach (19): + +- etnaviv: drm: fix instruction limit for cores with instruction cache +- etnaviv: drop double masking in etna_emit_load_state +- etnaviv: split large multi-state updates into multiple batches +- etnaviv: emit PS start and end PC states only on shader changes +- etnaviv: correct and rename shader range register check +- etnaviv: replace open-coded shifts in shader range registers with macros +- etnaviv: place FS right behind VS in unified instruction memory +- etnaviv: allow larger shaders with unified instruction memory +- kmsro: look for graphics capable screen as renderonly device +- etnaviv: rs: fix slow/fast clear transitions +- etnaviv/ci: remove dEQP-GLES2.functional.polygon_offset.default_enable fail +- etnaviv: fix ETNA_MESA_DEBUG=no_early_z +- etnaviv: Update headers from rnndb +- etnaviv: enable forwarding ZW fragcoord components from RA to SH +- etnaviv/ci: remove fragcoord related fails on GC7000 +- etnaviv: use pipe_resource in etna_resource_status +- etnaviv: split PIPE_BUFFER resources from other types of resources +- include: update GL headers from the registry +- etnaviv: add context flush sw query + +Ludvig Lindau (1): + +- panvk: Get flush_id once per submit + +Maaz Mombasawala (3): + +- svga: Add all tgsi double instructions for shader codegen checks +- nir: Add option to preserve double immediates in tgsi shader. 
+- svga: Check float type when emitting atomic instructions + +MaciejDziuban (3): + +- vulkan: handle use_default_scaling_matrix_mask in h264 decoder +- vulkan: Add default scaling lists for H265 +- radv: Use vk_video_derive_h265_scaling_list + +Marek Olšák (128): + +- amd: drop support for LLVM 15, 16, 17 +- ci/debian-ppc64el: don't build AMD drivers due to having only LLVM 15 +- gallium,st/mesa: allow reporting compile failures from create_vs/fs/.._state +- nine: change the vtxbuf dirty mask to bool +- nine: always update vertex buffers after updating vertex elements +- nine: remove unused last_vtxbuf_count +- nine: bind exactly the number of vertex buffers as vertex elements need +- cso_context: add cso_get_vertex_elements_for_bind, letting the caller bind it +- gallium/u_threaded,st/mesa: add a merged set_vertex_elements_and_buffers call +- radeonsi: add assertion requiring binding vertex elements before vertex_buffers +- radeonsi: require that count in set_vertex_buffers matches vertex elements state +- radeonsi: don't set num_vertex_buffers and don't unbind in set_vertex_buffers +- radeonsi: simplify bind_vertex_elements due to being before set_vertex_buffers +- radeonsi: print why draws are rejected +- util: remove glthread enablement from app profiles +- mesa: don't build st_format_test on Windows +- glapi: remove dead code +- glapi: inline functions or use equivalent alternatives +- glapi: don't export always-private functions +- mesa: remove a glapi workaround for Mesa 10.5 and older +- glapi: fix build dependencies by putting all xml/py files into a single list +- glapi: remove support for dynamically-registered functions +- glapi: use an assertion in SET_by_offset instead of doing nothing +- glapi: clarify the meaning of static_data.functions +- glapi: remove check_table tests +- glapi: generate static offsets from the list of GL functions automatically +- glapi: remove unused functions from dispatch tables +- glapi: move legacy imaging functions to the 
end of dispatch tables +- glapi: verify that aliased functions don't have entries in dispatch tables +- glapi: just use _gloffset_COUNT_ everywhere, which is always the function count +- glthread: handle glFlush with HasExternallySharedImages more efficiently +- glthread: rename scripts to match the names of generated files +- glapi: disable python escape sequences in strings that use invalid ones +- glapi: use static_data.libgl_public_functions directly +- glapi: remove is_static_entry_point wrapper +- glapi: inline entry_current_get() +- glx: don't call GL functions directly, use the current dispatch instead +- glx: fix hardcoded use of dispatch table index in glAreTexturesResidentEXT +- glx: stop exporting GL functions from libGLX_mesa.so +- glx: make it more obvious what libglapi_bridge is +- glapi: remove unused python code +- mesa: allocate GLmatrix aligned to 16 bytes +- Remove osmesa +- gallium/u_blitter: remove UTIL_BLITTER_ATTRIB_COLOR, use a constant buffer +- gallium/u_blitter: change blitter_attrib from union to struct +- meson: never use static glapi because shared glapi is also static and better +- glx/xlib: switch glapi from static to shared (which is also static) +- meson: remove with_shared_glapi variable (it's always true) +- glapi: remove static glapi (it's dead code now) +- glapi: remove duplicated generated header glapitable.h +- glx: fix build with -Dlegacy-x11=dri2 +- egl: use the current dispatch to execute glFlush instead of get_proc_address +- glapi: remove extension definitions that will likely never be implemented +- ac/gpu_info: use max_good_cu_per_sa for computation of max_scratch_waves +- ac,radeonsi: don't set num_slots/src/dest_type/write_mask when they're set automatically +- Revert "ac/nir: clamp vertex color outputs in the right place" +- ac/nir/cull: extract a helper calling accept_func +- ac/nir/ngg: add an option to skip viewport-based culling +- ac/cmdbuf: split meta_*_policy to dcc and htile variables +- ac/cmdbuf: rework 
CB/DB cache controls for better perf +- winsys/amdgpu: don't use 32-bit address space for IBs +- radeonsi: use si_is_buffer_idle everywhere +- radeonsi: move buffer reallocation to a separate function +- radeonsi: enable NGG culling when the shader writes the viewport index +- radeonsi: enable Z/S caching in GL2 by default except FurMark +- radeonsi: use the restrict keyword for draw parameters +- radeonsi: lower IO only if io_lowered == false +- radeonsi: reflect blitter VS in si_context::num_vertex_elements +- radeonsi: don't cull front/back faces in the hw if the shader culls them +- ac,radeonsi: define all SDMA DCC fields & use them, enable compressed writes +- radeonsi: remove clover support +- glx: don't generate indirect GLX dispatch for ARB_imaging functions +- glapi: make a separate copy of entry.c for each lib +- glapi: replace mapi_func type with identical _glapi_proc +- glapi: simplify codegen macros +- glapi: merge all shared-glapi source files into one .c file +- glapi: remove unused _mesa_glapi_Dispatch +- glx: add a test to verify exported symbols +- glapi: get the list of public GL functions from libgl-symbols.txt +- glx: remove the hack that forced exporting GL functions from libGL +- glapi: remove the GLAPI_PREFIX macro, just use gl as the function prefix +- glapi: simplify headers of generated files +- glapi: simplify mapi_abi.py +- glapi: rename dispatch stubs to use function names instead of numbers +- glapi: remove noop_generic callbacks +- glapi: remove the option to set the nop handler +- mesa: move dispatch.h and marshal_generated.h generation to glapi/gen +- mesa: inline main/meson.build +- glx: stop exporting EXT_texture_object functions from libGLX_mesa.so +- glapi: fix x86 32-bit asm dispatch regression +- amd: update addrlib +- amd/addrlib: remove the DCC page fault workaround +- ac/nir/cull: cull small prims using a point-triangle intersection test +- glsl: return failure from 
gl_nir_validate_first_and_last_interface_explicit_locations +- glsl: return failure from varying_matches_assign_locations +- glsl: return failure from remove_unused_io_vars +- glsl: stop calling nir_opt_combine_stores (redundant with nir_opt_vectorize_io) +- glsl: remove a deprecated comment about nir_compact_varyings +- glsl: always return true at the end of link_varyings +- glsl: move code after link_varyings into link_varyings +- st/mesa: don't do nir_remove_dead_variables on in/out twice +- nir/opt_algebraic: lower 16-bit imul_high & umul_high +- ac: define physical VGPRs for fake hw overrides +- ac/nir/cull: rename skip_viewport_culling -> skip_viewport_state_culling +- ac/nir/cull: always do frustum culling, skip only small prim culling +- ac/surface: remove 64K_2D modifier with 64B max compressed blocks for gfx12 +- radeonsi: work around a primitive restart bug on gfx10-10.3 +- radeonsi: always use ACO callbacks to scalarize/vectorize 16-bit ALU +- radeonsi: expose 16-bit NIR types for ALU, MEM, and LDS (no inputs/outputs) +- radeonsi/ci: update gfx11 failures +- radeonsi/ci: don't run GTF tests (they have been removed from glcts) +- radeonsi: lower load/store bit sizes before load/store vectorization +- radeonsi: add a VOP3P swizzle requirement for 16-bit packed math +- radeonsi/ci: add gfx12 failures and flakes +- radeonsi: make si_shader_selector::main_shader_part_* an iterable union +- radeonsi: add ACO-specific main shader parts +- winsys/amdgpu: don't add VM_ALWAYS_VALID buffers into the BO list +- radeonsi: determine VM_ALWAYS_VALID accurately +- ac/gpu_info: increase the attribute ring size for gfx12 +- ac/gpu_info: remove has_tmz_support function +- ac/gpu_info: add payload_entry_size into ac_task_info +- ac/gpu_info: add 256 to payload_entry_size to increase future task shader perf +- ac/nir: set X=0 for task->mesh shader dispatch when Y or Z is 0 +- ac/surface: make gfx12_estimate_size reusable by gfx6 +- ac/surface: select 3D tile mode without 
overallocating too much for gfx6-8 +- radeonsi: always scalarize shared memory instructions +- radeonsi: use nir_opt_shrink_vectors +- radv: fix incorrect patch_outputs_read for TCS with dynamic state + +Mark Collins (4): + +- tu/kgsl: Remove zero CB queue submission fast path +- tu/kgsl: Revert "Remove zero CB queue submission fast path" +- tu/kgsl: Fix KGSL syncobj lifetime in no CB submit +- ir3/a7xx: Add post-RA pass to track liveness and insert (last) + +Martin Krastev (7): + +- svga/ci: disable vmware farm +- svga/ci: enable vmware farm +- svga/ci: enable vmware farm +- svga/ci: enable vmware farm +- svga/ci: update EXTERNAL_KERNEL_TAG to new kernel +- svga/ci: disable vmware farm +- docs/svga: Add steps how to get VMware Workstation Pro on Linux + +Martin Roukala (né Peres) (42): + +- ci: fix the artifact name +- ci: be explicit about the fact HW jobs do not need linter artifacts +- ci/test/b2c: do not download the debian/\*_test-(gl|vk) artifacts +- ci-templates: update to the latest upstream version +- ci/build-deqp: delegate the deqp main commit check to github +- ci/test: add arm32 b2c jobs +- ci/debian/test-vk: remove software-properties-common +- ci/debian/test-vk: trim down the container on a per-arch basis +- ci/image-tags: bump the debian base tag +- ci/b2c: add support for diskless DUTs +- ci/b2c: de-duplicate the download of install.tar +- etnaviv/ci: convert from baremetal to CI-Tron +- ci: re-enable austriancoder's farm +- zink/ci: use the debian-built-testing for nvk +- zink/ci: update the radv expectations +- radv/ci: bump tahiti's cpu cores +- radv/ci: update expectations +- freedreno/ci: update expectations +- ci/b2c: fix the S3 artifact for amd64 manual vk/gl +- radv/ci: document more Tahiti VKCTS flakes +- turnip/ci: re-introduce the \`multiviewport` flakes +- zink/ci: mark query-rgba-signed-components as fixed on more platforms +- zink/ci: document more RADV flakes +- radv/ci: add testing on RAPHAEL +- radv/ci: reduce Renoir concurrency 
in vkcts +- radv/ci: set a tight timeout on vkcts-navi31 +- radv/ci: document more flakes +- zink/ci: document more radv flakes +- zink/ci: document more GA106 flakes +- zink/ci: increase the a750 job's timeout to 18 minutes +- radv/ci: mark a whole subset of tests as flaky on tahiti +- zink/ci: document more RADV flakes +- zink/ci: document more NVK GA106 flakes +- ci/b2c: allow jobs to opt-in into the new mars setup command +- radv/ci: opt-in the new mars setup command +- etnaviv/ci: opt-in the new mars setup command +- nvk/ci: opt-in the new mars setup command +- freedreno/ci: opt-in the new mars setup command +- i915g/ci: opt-in the new mars setup command +- r300g/ci: opt-in the new mars setup command +- radv/ci: reduce the timeout of vkcts-tahiti to a more sensible time +- radv/ci: add hawaii to CI + +Mary Guillemard (95): + +- panfrost: Fix group priorities in drm-shim +- panfrost: Fix PROGRESS_LOAD destination register +- pan/bi: Properly encode LEA_BUF_IMM +- pan/bi: Remove shift lanes invalid encodings +- pan/bi: Fix invalid CLPER encoding +- pan/bi: Remove b3210 from valid swizzle +- pan/bi: Use 2D dimension with TEX_FETCH with CUBE on Valhall +- pan/decode: Fix indirect branch calculation for 64-bit +- panfrost: Properly name CSF instruction UMIN32 source 0 +- panvk: Disallow unknown GPU models early in physical device init +- pan/genxml: Stop using "constant" for struct fields in xml defs +- pan/genxml: Rework gen_pack.py to support OpenCL +- panfrost: Add base of OpenCL C infrastructure +- bi: Optimize scratch access +- panvk: Integrate libpan +- panfrost: Integrate libpan in gallium driver +- panvk: Add create_shader_from_binary +- panvk: Move TLS preparation logic to cmd_dispatch_prepare_tls +- panvk: Expose calculate_task_axis_and_increment on CSF +- panvk: Implement precomp dispatch +- panfrost: Implement precomp dispatch on Gallium +- panfrost,panvk: Wire printf and abort support +- ci: Transition to precomp-compiler +- ci: Enable mesa-clc and 
precomp-compiler on debian-arm32 +- pan/bi: Fix scratch access optimization +- panvk: Switch JM copy queries to CLC +- panvk: Switch JM clear queries to CLC +- pan/bi: Document removed instructions on v11 +- pan/bi: Do not run bi_fuse_small_int_to_f32 on v11+ +- pan/bi: Disallow dst swizzle optimization in case of conversion +- pan/bi: Add round mode modifier to FADD +- pan/bi: Stop using V2F32_TO_V2F16 on Valhall +- pan/bi: Lower SWZ.v4i8 to multiple MKVEC.v2i8 on v11+ +- pan/bi: Stop using S16_TO_F32 in nir_texop_lod computation on v11+ +- pan/bi: Stop using V2F16_TO_V2S16 for barycentric_at_offset on v11+ +- pan/bi: Lower removed instructions in algebraic on v11+ +- pan/bi: Lower FROUND.v2s16 in nir_lower_bit_size on v11+ +- pan/bi: Lower hadd on v11+ +- pan/bi: Handle LD_VAR_BUF_IMM encoding changes on v11+ +- pan/bi: Properly extract primitive facing on v11+ +- pan/genxml: Define RUN_IDVS staging registers in an enum +- panfrost: Remove write to TSD_3 in Gallium driver +- pan/genxml: Define RUN_COMPUTE staging registers in an enum +- pan/genxml: Define RUN_FRAGMENT staging registers in an enum +- panfrost: Rework cs_sr_regXX to be a macro +- panfrost: Switch Gallium driver to use cs_sr_regXX +- panfrost: Use CSIF info for CSF registers count +- panfrost: Avoid hard crash when major arch is unknown +- panfrost: Rename CSF MOVE into MOVE48 +- panfrost: Fix FLUSH_CACHE2 other definition +- panfrost: Rename CS ADD_IMMEDIATEXX to ADD_IMMXX +- pan/genxml: Use DCD Flags in Draw struct on v9+ +- pan/bi: Disallow FAU special page 3 and WARP_ID on message instructions +- pan/bi: Add unit tests for FAU special page 3 and WARP_ID +- pan/bi: Run nir_lower_bit_size after algebraic +- pan/bi: Run bifrost_nir_lower_algebraic_late while there is progress +- pan/bi: Lower FREXPE.v2f16 and FREXPM.v2f16 on v11+ +- pan/bi: Fix out of range access in bi_instr_replicates +- pan/bi: Ensure we select b0 with halfswizzle in va_lower_constants +- nir: Add Panfrost specific shader_output 
intrinsic +- pan/bi: Use shader_output intrinsic for IDVS +- pan/bi: Add support for IDVS2 on Avalon +- pan/bi: Lower IADD.v4s8 in algebraic on v11+ +- pan/bi: Allow no_psiz variant with IDVS2 +- pan/genxml: Rename UMIN32 opcode to COMPARE_SELECT32 +- pan/genxml: Add v12 definition +- pan/genxml: Add v13 definition +- pan/genxml: Build libpanfrost_decode for v12 +- pan/genxml: Build libpanfrost_decode for v13 +- pan/lib: Build for v12 +- pan/lib: Build for v13 +- pan/clc: Build for v12 +- pan/clc: Build for v13 +- panvk: Support 64x64 meta tile size for v12 in cmd_preload_render_area_border +- pan/lib: Adapt CRC calculation to align to 64x64 on v12+ +- panfrost: Disable hierarchy levels based on multiple of tile_size +- panvk: Use spd variant instead of pos_points when checking for vs shader presence +- pan/earlyzs: Default to FORCE_EARLY for ZS update on v11+ +- panvk: Implement Z/S dependency tracking on v11+ +- panvk: Add v12 support +- panvk: Add v13 support +- panfrost: Add v12 support to the Gallium driver +- panfrost: Add v13 support to the Gallium driver +- panfrost/ci: Add Mali-G720 current expectations +- panfrost/ci: Add Mali-G725 current expectations +- panfrost: Advertize Mali-G720 support +- panfrost: Advertize Mali-G725 support +- panfrost: Update the release note to mention G720/G725 addition +- panvk: Fix inverted ZS read flags in DCD2 on v11+ +- panfrost: Take tiler memory budget into account in pan_select_tiler_hierarchy_mask +- panvk: reset dyn_bufs map count to 0 in create_copy_table +- panvk: Take rasterization sample into account in indirect draw on v10+ +- panvk: Take resource index in valhall_lower_get_ssbo_size +- pan/bi: Properly lower add/sub with saturation on v11+ +- pan/genxml: Fix inverted logic on nr_regs + +Matt Turner (22): + +- gallium: Fix typos +- glsl: Fix typos +- glx: Fix typos +- intel: Fix typos +- mapi: Fix typos +- vulkan: Fix typos +- gallium: Return NULL, not false, from functions returning pointers +- nir: Return 
NULL, not false, from functions returning pointers +- iris: Initialize pointer with NULL, not false +- intel/compiler: Use FALLTHROUGH +- glsl: Use FALLTHROUGH +- glsl: Add missing break +- intel/compiler: Use unreachable instead of assert(!"...") +- intel/isl: Use unreachable instead of assert(!"...") +- anv: Use unreachable instead of assert(!"...") +- hasvk: Use unreachable instead of assert(!"...") +- intel/decoder: Remove assert(!"...") in recoverable condition +- intel/compiler: Add missing breaks +- intel/compiler: Use correct enum type +- iris: Remove ignored qualifier +- intel/isl: Remove ignored qualifier +- anv: Remove ignored qualifier + +Maíra Canal (4): + +- v3dv: VK_EXT_acquire_drm_display doesn't require a DRM master fd +- v3dv: don't overwrite the primary fd if it's already set +- broadcom/simulator: Expose V3D revision number in the simulator interface +- broadcom/simulator: Fix Indirect CSD jobs for V3D 7.1.6+ + +Mel Henning (31): + +- nvk: Use hw support for instancing on PASCAL_B+ +- nak/opt_copy_prop: Fix IAdd3 overflow check +- nak/opt_copy_prop: Add force_alu_src_type +- nak/opt_copy_prop: Force alu src for IAdd2X/IAdd3X +- driconf: force_vk_vendor on Deep Rock Galactic+NVK +- nak: Add OpLea +- nak,nir: Add 32-bit nir_op_lea_nv and use it +- nak: Add OpLeaX +- nak,nir: Add 64-bit lea_nv +- nouveau/mme: Print ip in mme_tu104_dump +- nak: Add static cycle count statistic +- nak: Add an occupancy statistic +- nak/spill_values: Make Spill take &mut self +- nak: Add spill/fill statistics +- nvk: Don't zero imported memory +- nvk: Add NVK_DEBUG=trash_memory +- vulkan: Relax bda assert for zero-size buffers +- nak: Add a simple postpass instruction scheduler +- nak: Calc static cycle count in instr_sched +- nak: Assert instr_sched matches calc_instr_deps +- nak/instr_sched: Barriers activate after 2 cycles +- nvk: SET_PS_{REGISTER,WARP}_WATERMARKS +- nvk: Support blackwell in max_warps_per_mp_for_sm +- nak/spill_values: Spill constants across edges 
if needed +- nak: Handle idp4 ureg latencies +- nvk: SET_STATISTICS_COUNTER at start of meta_begin +- nvk: Override render enable for blits and resolves +- wsi/headless: Override finish_create +- nak: Remove hfma2 src 1 modifiers +- nak: Add Src::is_unmodified() helper +- nak: Check that swizzles are none + +Mi, Yanfeng (2): + +- anv: Support putting image base address and image params in surface state +- anv: add emulated 64bit integer storage support + +Michael Cheng (5): + +- anv: Fix missing Perfetto trace for as build +- Revert "anv: Fix missing Perfetto trace for as build" +- vulkan : Pass in number of tlas/blas being built +- anv: Change as_build to show num tlas/blas +- vulkan: add leaves and ir_leaves sizes to encode for utrace. + +Michel Dänzer (1): + +- egl/glx/sw: Check xcb_query_extension_reply return value for MIT-SHM + +Mike Blumenkrantz (97): + +- zink: emit SpvCapabilityDemoteToHelperInvocation for IsHelperInvocation +- zink: implement ops for KHR_shader_subgroup +- aux/trace: trace tex2d_from_buf for samplers/images +- gallium: add a pipe_tex2d_from_buf struct +- rusticl: stop clearing shader images after every dispatch +- zink: also refcount needs_present from frontbuffer flush +- zink: delete some old DGC remnants +- zink: guard rebar check against fallback heap detection +- ac/surface: always allow LINEAR modifier for color formats +- radv: fix error reporting for VkExternalMemoryTypeFlagBitsKHR +- radv: stop blocking non-2D import/export ops +- radv: print stringname for VkExternalMemoryHandleTypeFlagBits error +- driconf: move a glthread viewperf option to global +- driconf: add GL_EXT_shader_image_load_store for viewperf to fix crashing +- zink: only enable unsynchronized_texture_subdata with HIC +- zink: never try to oom flush during unsync texture upload +- zink: apply layer/depth to clear handling +- zink: wait on tc fence before checking for fd semaphore +- zink: handle buffer import/export +- zink: verify that adding a dmabuf bind 
actually chooses a modifier +- zink: support cl_gl_sharing if dmabuf is supported +- egl/x11: delete some dri2 remnants +- zink: force cached mem for streaming uploads +- zink: always fully unwrap contexts +- zink: use a separate ralloc ctx for zink_program objects +- zink: put zink_program::reference on separate cacheline +- dri: delete INVALIDATE extension +- zink: enable single-plane modifiers for generic 2D exports +- zink: clamp UBO sizes instead of asserting +- anv: support all dimensions of image for LINEAR dmabufs +- aux/trace: truncate descriptor unbinds +- zink: always check submit_count to disambiguate when checking/waiting +- zink: don't reset all batch states when stalling +- zink: alloc bo ids for non-slab bos too +- zink: explicitly check usage in buffer barriers +- svga/ci: disable vmware farm +- llvmpipe: pass layer count to rast clear +- gallium: fix pipe_framebuffer_state::view_mask +- mesa: add error handling for OVR_multiview +- mesa: avoid creating incomplete surfaces when multiview goes out of range +- zink: improve precision on changes to depth bias between draws +- zink: delete zink_batch_state::ref_lock +- zink: split set_vertex_buffers to avoid conditionals +- zink: start using ctx->vertex_state_changed again +- zink: use VKCTX for vertex buffer binds and delete unused screen local +- zink: only add fb surf refs on unbind +- radv: rewrite radv_get_line_mode() conditional +- radv: store num_attributes to shader info +- radv: store vertex prolog simple input check to cmdbuf on vs bind +- radv: roll line topology dynamic state changes into existing rast samples flag +- radv: eliminate a memset in radv_get_vbo_info() +- radv: inline some vertex descriptor functions +- radv: split out dynamic vertex input descriptor writing +- radv: move non_trivial_format calc to dynamic VI bind +- radv: get vbo info directly into dgc upload +- radv: don't unnecessarily flag prolog recalc when binding VBOs +- zink: disable reordering on compute contexts +- 
zink: extract some shader image resource binding code +- zink: extract some samplerview bind/unbind code +- zink: support crazy CL buffer-to-texture extension +- st/drawpixels: move sv unref out to callers +- gallium: eliminate frontend refcounting from samplerviews +- gallium: make pipe_sampler_view::reference non-atomic +- mesa: remove st_sampler_view::private_refcount +- aux/trace: set sampler_view_release pointer +- gallium: delete tests +- util/tests: move u_half_test into half_float_test +- gallium: delete u_cache +- meson: add i915 to 'all' build for gallium-drivers +- egl/x11: always expose EXT_swap_buffers_with_damage +- egl: delete invalidate_available flag +- egl: move _EGLDisplay DriverData association into dri2_display_create +- egl: hoist dri2 display creation up a level +- egl/x11: split out dri2 init entirely +- egl: move kopper detection to display creation +- egl/x11: hoist up dri2_get_xcb_connection call +- egl/x11: simplify a kopper check on init +- egl/x11: hoist up swrast/zink driver_name setting +- egl/x11: hoist and document dri3_x11_connect() during init +- egl: delete dri2_egl_driver_fail +- egl/x11: consolidate LIBGL_DRI3_DISABLE use on init +- egl/x11: unify swrast/kopper/dri3 paths a bit +- gallium: delete pipe_surface::width and pipe_surface::height +- zink: fix refcounting of zink_surface objects +- zink: block streaming cached uploads on unsynchronized/persistent maps +- zink: implement unsynchronized staging uploads for buffers +- zink: block inferred loading for swrast +- gallium/util: check nr_samples in pipe_surface_equal() +- tu: check for valid descriptor set when binding descriptors +- meson: deprecate gallium-nine +- meson: deprecate gallium-xa +- zink: don't set shared block stride without KHR_workgroup_memory_explicit_layout +- zink: stop setting ArrayStride on image arrays +- zink: use implicit stride in ntv for temp vars +- ci: update VVL to current week +- zink: verify that surface exists when adding implicit feedback 
loop +- egl: fix sw fallback rejection in non-sw EGL_PLATFORM=device + +Mohamed Ahmed (7): + +- nvk: Add NVK_MAX_IMAGE_PLANES for images +- nvk: Add NVK_MAX_SAMPLER_PLANES for samplers +- nil: Use multiplanar init_info during image creation +- nil: Force smallest block size for images meant for Vulkan Video +- nvk: Set NIL_IMAGE_USAGE_VIDEO_BIT for images meant for video usage +- nil: Add an API to clamp max image alignment +- nvk: Implement VK_MESA_image_alignment_control + +Mykhailo Skorokhodov (1): + +- drirc/anv: force_vk_vendor=-1 for Drive Beyond Horizons + +Natalie Vock (30): + +- vulkan/bvh: Move leaf builder code to header +- vulkan/bvh: Add option to override leaf builder SPIR-Vs +- vulkan/bvh: Add optional fine-grained instance node bounds calculation +- radv/bvh, vulkan/bvh: Move AccelerationStructureInstance to vk_build_helpers +- radv/bvh: Remove unused build_instance helper +- radv/bvh: Prefix RADV-specific node functions with radv\_ +- radv/bvh: Add custom leaf node builder +- mailmap: Update my name +- radv/rt: Don't allocate the traversal shader in a capture/replay range +- aco/ra: Use iterators for linear VGPR copy extraction +- aco/ra: Use struct for parallelcopies +- aco/ra: Add option to skip renaming for parallelcopies +- aco/ra: Handle temps fixed to different regs in different operands +- aco/tests: Add tests for precolored operands in different regs +- aco/ra: Assert operands only clear their own id +- radv/rt: Limit monolithic pipelines to 50 stages +- radv/rt: Guard leaf encoding by leaf node count +- radv/rt: Flush L2 after writing internal node offset on GFX12 +- ac/llvm: Don't use getTriple() on LLVM21+ +- aco: Add get_temp_reg_changes helper +- aco/spill: Invert reloads map +- aco: Add RegisterDemand(Temp) constructor +- aco/spill: Allow spilling live-through operands +- radv/rt: Flush CP writes from the common BVH framework with INV_L2 on GFX12 +- vulkan/bvh: Move first PLOC task_count fetch inside PHASE +- aco: Fix RT VGPR limit on 
Navi31/32, GFX11.5, GFX12 +- aco: Make private_segment_buffer/scratch_offset per-resume +- radv: Add radv_emulate_rt drirc and enable for Indiana Jones TGC +- radv: Add radv_enable_float16_gfx8 drirc and enable for Indiana Jones TGC +- radv/ci: Test FP16 for GFX8 + +Nikita Popov (2): + +- clover: Don't include libclc headers +- clover: Fix MSVC build + +Nikolas Zimmermann (1): + +- etnaviv: drm: Add DRM_RDWR permissions to drmPrimeHandleToFD() call. + +Olivia Lee (1): + +- panfrost: allow promoting sysval UBO to push constants + +Patrick Lerda (27): + +- r600: fix r600_init_shader_caps() has_atomics issue +- r600: fix evergreen_emit_vertex_buffers() related cl regression +- r600: fix cayman sfn_nir_legalize_image_load_store ssa dominance +- r600: prepare the lds constant buffer to be shared +- r600: implement a conformant gl_VertexID with base offset +- r600: fix the indirect draw 8-bits path +- r600: fix emit_image_size() range base compatibility +- r600: update evergreen_convert_border_color() +- r600: update cayman_convert_border_color() +- r600: fix cayman main non-deterministic behavior problem +- r600: update the software fp64 support +- radeonsi: fix clear_depth_stencil refcnt imbalance +- r600: move stores to the end of shader when required +- r600: fix clear_depth_stencil refcnt imbalance +- r600: fix textures with swizzles limited to zero and one +- r600: implement EXT_window_rectangles +- r600: remove deprecated NIR_PASS_V +- r600: fallback to util_blitter_draw_rectangle when required +- r600: fix pa_su_vtx_cntl rounding mode +- r600: fix points clipping +- r600: enable ARB_compute_variable_group_size +- r600: clean up not used fields detected by clang +- i915: fix i915_set_vertex_buffers() related refcnt imbalance and remove redundancies +- i915: fix slab_create() related memory leaks +- i915: fix nir_to_tgsi() related memory leak +- i915: fix draw_create_fragment_shader() related memory leak +- mesa_interface: fix legacy dri2 compatibility + +Patrick 
Nicolas (1): + +- radv/video: Add low latency encoding + +Paul Gofman (1): + +- radv/amdgpu: Fix hash key in radv_amdgpu_winsys_destroy(). + +Paulo Zanoni (10): + +- brw: increase brw_reg::subnr size to 6 bits +- brw: don't always set cond_modifier on parsed assembly instructions +- brw: don't mark instructions read from text assembly as compacted +- brw: add instructions missing from is_control_flow() +- brw: extend the NOP+WHILE workaround +- drirc/anv: DiggingGame.exe needs force_vk_vendor=-1 +- intel/i915: restrict the RAM size restrictions to Anv +- anv/xe: detect the newer xe.ko memory reporting model and act accordingly +- anv: restore the old behavior of up to 75% of RAM for the system heap +- anv: add ANV_SYS_MEM_LIMIT for debugging system memory restrictions + +Pavel Ondračka (22): + +- i915/ci: use debian-build-testing instead of debian-testing +- r300: remove some dead code in redeon_program_alu +- r300: do not limit maximum TEX group for R300/R400 +- r300: copy propagate constant swizzles +- ci: fix debian-build-testing BUILDTYPE +- ci: disable LTO for nightly debian-build-testing +- i915/ci: update expectations +- i915: rework shader compile failures reporting +- mesa: properly signal report_compile_error to drivers +- nine: set pipe_shader_state.report_compile_errors +- r300: stop reporting compile failures in finalize_nir +- r300: forward all compile failures to state tracker +- r300: do not include newline in the error messages +- r300,i915: update CI expectations +- r300: fix temps counting for shader-db stats +- r300/ci: add dEQP job with RADEON_DEBUG=notcl +- r300: remove usage of NIR_PASS_V +- r300: fix INV and BIAS presubtract on R300/R400 +- r300: remove finalize_nir +- r300: remove support for tgsi_texcoord +- r300: fix crash when creating surfaces +- r300/ci: update expectations + +Peyton Lee (6): + +- radeonsi/vpe: check reduction ratio +- radeonsi/vpe: support geometric scaling +- amd/gmlib: add gmlib for radeonsi +- radeonsi/vpe: vpe 
support tonemapping +- radeonsi/vpe: vpe support hdr input +- radeonsi/vpe: fix background issue + +Philipp Zabel (3): + +- etnaviv/ml: Drop duplicated function reorder_for_hw_depthwise() +- etnaviv/ml: Fix padding input/output tensor zero points +- teflon: Log (un)supported operations + +Pierre-Eric Pelloux-Prayer (16): + +- radeonsi: update si_need_gfx_cs_space upper bound +- winsys/amdgpu: treat cs overflow as context lost +- radeonsi: disable dcc when external shader stores are used +- tc: add missing TC_SENTINEL for TC_END_BATCH +- mesa/st: call _mesa_glthread_finish before _mesa_make_current +- tc: flag closed batches +- st/mesa: fix nir_load_per_vertex_input parameter +- radeonsi: cache u_trace_perfetto_active value +- radeonsi: tag perfetto conditions as unlikely +- radeonsi: add pid/tid to the vk_queue_submit event +- radeonsi: guard perfetto support with ifdef in si_draw +- ac/nir: fix nir_metadata value of ac_nir_lower_image_opcodes +- radeonsi/video: require has_image_opcodes for full modifier support +- radeonsi: use composed swizzle in cdna_emu_make_image_descriptor +- winsys/amdgpu: disable VM_ALWAYS_VALID +- radeonsi: fix potential use after free in si_set_debug_callback + +Pohsiang (John) Hsu (7): + +- d3d12: fix incorrect IsRefUsedByCurrentPic marking for P Frame +- d3d12: log all the field of dpb buffer for better diagnostic, cosmetic space removal +- d3d12: initialize ReconstructedPicTexture +- d3d12: fix start code prevention in write_sei_nalu() +- d3d12: fix d3d12_video_nalu_writer_h264::write_slice_svc_prefix +- d3d12: use log2_max_pic_order_cnt_lsb_minus from upper layer +- d3d12: use log2_max_pic_order_cnt_lsb_minus4 from upper layer for h264 + +Qiang Yu (78): + +- lavapipe: fix min_vertex_pipeline_param +- gallium: fix ddebug and noop screen caps init +- gallium,clover: add compute caps used by clover only +- r600: init compute caps without ir_type param +- radeonsi: init compute caps without ir_type param +- gallium,mesa: remove ir_type 
param when get_compute_param +- gallium: add pipe_shader_caps and pipe_compute_caps +- gallium: copy shader and compute caps for ddebug/noop/trace +- tgsi: add tgsi_exec_init_shader_caps +- gallivm: add gallivm_init_shader_caps +- draw: add draw_init_shader_caps +- agx: init shader and compute caps +- crocus: init shader and compute caps +- d3d12: init shader and compute caps +- etnaviv: init shader caps +- freedreno: init shader and compute caps +- i915: init shader caps +- iris: init shader and compute caps +- lima: init shader caps +- llvmpipe: init shader and compute caps +- nouveau/nv30: init shader caps +- nouveau/nv50: init shader and compute caps +- nouveau/nvc0: init shader and compute caps +- panfrost: init shader and compute caps +- r300: init shader caps +- r600: init shader and compute caps +- radeonsi: init shader and compute caps +- softpipe: init shader and compute caps +- svga: init shader and compute caps +- tegra: init shader and compute caps +- v3d: init shader and compute caps +- vc4: init shader caps +- virgl: init shader and compute caps +- zink: init shader and compute caps +- nine: GET_SHADER_CAP use pipe_shader_caps +- lavapipe: change min shader param to use pipe_shader_caps +- gallium,mesa: replace get_shader_param with pipe_shader_caps access +- clover: replace get_compute_param with pipe_compute_caps +- gallium,mesa: replace get_compute_param with pipe_compute_caps +- rusticl: replace get_shader_param with pipe_shader_caps +- rusticl: replace get_compute_param with pipe_compute_caps +- asahi: remove shader and compute get param +- crocus: remove shader and compute get param +- d3d12: remove shader and compute get param +- etnaviv: remove shader get param +- freedreno: remove shader and compute get param +- i915: remove shader get param +- iris: remove shader and compute get param +- lima: remove shader get param +- llvmpipe: remove shader and compute get param +- nouveau/nv30: remove shader get param +- nouveau/nv50: remove shader and 
compute get param +- nouveau/nvc0: remove shader and compute get param +- panfrost: remove shader and compute get param +- r300: remove shader get param +- r600: remove shader and compute get param +- radeonsi: remove shader and compute get param +- softpipe: remove shader and compute get param +- svga: remove shader and compute get param +- tegra: remove shader and compute get param +- v3d: remove shader and compute get param +- vc4: remove shader get param +- virgl: remove shader and compute get param +- zink: remove shader and compute get param +- gallium/aux: remove aux shader param get function +- gallium: remove screen shader and compute get param callback +- doc,src: replace doc and comments for shader and compute cap +- ac/surface: fix radv import dmabuf from radeonsi +- gallium/ddebug: add missing modifier callback +- radeonsi: fix has_non_uniform_tex_access info +- radeonsi,util: add more usage for AMD_FORCE_SHADER_USE_ACO +- radeonsi: fix GravityMark corruption when use aco +- egl: decouple dmabuf import/export cap from xserver support +- dri: dmabuf cap does not rely on winsys multibuffer support +- glx/egl/x11: fix x11_dri3_check_multibuffer +- kopper: remove unused function definition +- dri,egl,glx: remove redundant usage of HAVE_X11_DRM +- llvmpipe/ci: change for oes_egl_image_external_essl3 + +Rebecca Mckeever (27): + +- pan/format: Use HW version to determine siting for YUV 422 formats +- pan/texture: Only use plane_chroma_2p for chroma planes +- vk/meta: Extend copy/fill/update helpers to support YCbCr +- util/hash_table: Add _mesa_hash_table_u64_replace() +- panvk: Allow a 32-bit binding value in desc id key and use 64-bit keys +- panvk: Move single-plane views of multiplane formats to pview.planes[0] +- panvk: Change immutable_samplers to panvk_sampler ** +- panvk: Create helper function for sampler descriptor emission +- panvk: Update panvk_get_desc_stride prototype +- panvk: Move mali_texture_packed structs in panvk_image_view to a union +- 
panvk: Use multiple sampler planes and one texture descriptor per plane +- panvk: Fix assertion in is_disjoint() +- panvk: Add YCbCr sampler NIR lowering pass +- panvk: Split get_format_properties into format features helper functions +- panvk: Report formats not supported by HW as unsupported +- panvk: Enable YCbCr support for v10+ +- vk/image: Add vk_image_can_be_aliased_to_yuv_plane() helper +- panvk: Use vk_image_can_be_aliased_to_yuv_plane() helper +- panvk: Allow 3-byte formats +- panfrost: Add BI_OPCODE_WMASK to bi_instr_uses_helpers +- panvk: Enable VK_KHR_shader_subgroup_uniform_control_flow for v10+ +- panvk: Enable VK_KHR_shader_maximal_reconvergence for v10+ +- panvk: Enable VK_KHR_shader_quad_control for v10+ +- panvk: Add STORAGE_IMAGE_BIT feature for formats supporting sampled images +- panvk: Enable shaderStorageImageExtendedFormats +- panvk: Remove lower_tg4_broadcom_swizzle from panvk_preprocess_nir() +- panvk: Support shaderImageGatherExtended + +Renato Pereyra (1): + +- perfetto/android: align datasource names with tooling expectations + +Rhys Perry (41): + +- nir,aco,radv: add align_mul/offset to buffer_amd intrinsics +- nir/opt_offsets: don't check NUW for unswizzled buffer_amd +- ac/nir/ngg: update bit_sizes_int +- nir/load_store_vectorize: handle load_buffer_amd/store_buffer_amd +- ac/nir: set memory_modes for lowered TES input loads +- radv: don't use bit_sizes_int to skip nir_lower_bit_size +- radv: move nir_opt_shrink_vectors later +- radv: DCE before nir_opt_shrink_vectors +- nir/load_store_vectorize: fix sorting of vectors in add_to_entry_key +- nir/algebraic: optimize ishl(iadd(iadd(a, #b), c), #d) +- radv: vectorize descriptor loads +- radv: vectorize lowered shader IO +- nir/use_dominance: invalidate nir_metadata_instr_index +- nir/opt_move: invalidate nir_metadata_instr_index +- nir/lower_io_arrays_to_elements: invalidate metadata +- nir/find_array_copies: invalidate nir_metadata_instr_index +- nir/linking_helpers: invalidate 
metadata in nir_link_opt_varyings +- radv/rt: correctly preserve metadata in move_rt_instructions +- nir: add NIR_DEBUG=extended_validation +- nir: add NIR_DEBUG=invalidate_metadata +- nir/validate: assert that if condition dominates use +- aco: validate operands fixed to definitions +- aco/ra: reverse renaming of operands outside update_renames +- aco/ra: unconditionally call undo_renames +- radeonsi: fix invalidation of metadata in si_nir_lower_abi +- ac/nir: don't cross swizzle elements when vectorizing buffer_amd intrinsic +- ac/nir: set higher alignment for some swizzled store_buffer_amd +- ac/nir: fix tess factor optimization when workgroup barriers are reduced +- aco: insert dependency waits in certain situations +- radv: improve radv_get_max_waves for multi-wave workgroups with LDS +- radv: don't assume WGP mode in radv_get_max_waves +- amd/drm-shim: add gfx1201 +- aco/ra: fix free register counting when moving variables +- radv/winsys: increase gfx12 vgprs for null winsys +- radv/winsys: set gart_page_size for null winsys +- radv/winsys: set has_distributed_tess for null winsys +- aco/gfx12: don't use second VALU for VOPD's OPX if there is a WaR +- aco: combine VALU lanemask hazard into VALUMaskWriteHazard +- aco/gfx11: create waitcnt for workgroup vmem barriers +- aco: fix get_temp_reg_changes with clobbered operands +- aco: swap the correct v_mov_b32 if there are two of them + +Rob Clark (17): + +- freedreno+tu: Add new virtgpu caps +- tu: Suballoc VkEvent BOs +- tu: Don't emit SP_PS_2D_WINDOW_OFFSET on a6xx +- tc: Add missing tc_set_driver_thread() +- freedreno: Wait for imported syncobj fences to be available +- ci: Re enable fd-farm +- freedreno/ci: Disable traces job until piglit is fixed +- ci: Re enable fd-farm +- ir3: Comment re-indent +- tu/vdrm: Fix userspace fence cmds +- tu: Avoid extraneous set_iova +- tu: Add some func traces +- util: Add drmSyncobj shim +- panvk: Remove dependency on vk_device::drm_fd +- vulkan: Use syncobj shim +- 
virtio/vdrm: Add vtest backend +- tu: vdrm vtest support + +Robert Mader (4): + +- llvmpipe: Take offset into account when importing dmabufs +- llvmpipe: Free dummy_dmabuf on shutdown +- gallivm: Re-add check for passmgr before disposing it +- meson: Bump minimum version to 1.3.0 + +Rohan Garg (7): + +- anv: refactor add_aux_surface_if_supported to use a common variable +- anv,blorp,isl: handle compressed CPS surfaces through the depth stencil hw +- anv: CPB surfaces that are used as color attachments or for stores cannot be compressed +- anv: no need to handle transitions for CPB surfaces +- anv: separate fast clear handling for compressed CPS +- isl: enable CPB compression +- anv: re enable compression for CPS surfaces on platforms other than Xe + +Roland Scheidegger (3): + +- llvmpipe: don't assert on exceeding if_stack size +- llvmpipe: Fix overflow issues calculating loop iterations for aniso +- llvmpipe: Fix alpha-to-coverage without dithering + +Ruijing Dong (1): + +- radeonsi/vcn: vcn5 roi fix + +Ryan Mckeever (5): + +- pan/format: Update format flags to follow HW spec +- pan/format: Add PAN_BIND_STORAGE_IMAGE flag +- panvk: Enable KHR_format_feature_flags2 and use them +- panvk: shaderStorageImageReadWithoutFormat support +- panvk: shaderStorageImageWriteWithoutFormat support + +Sagar Ghuge (12): + +- intel/genxml: Update BLEND_STATE_ENTRY structure +- anv: Enable simpleFloatBlendEnable on Xe3+ +- iris: Enable simpleFloatBlendEnable on Xe3+ +- blorp: Enable SimpleFloatBlendEnable on Xe3+ +- intel/compiler: Drop primitive leaf desc load code +- anv: Exclude non-standard block shapes on Xe2+ +- intel/compiler: Zero out the header for texel fetch +- anv: CPS LOD Compensation Enable is deprecated on Xe2+ +- intel/compiler: Add support for MSAA typed load/store messages +- intel/compiler: Lower sample index into coord for MSRT messages +- intel/compiler: Don't lower 64bit data memory access on LSC +- intel/compiler: Fix stackIDs on Xe2+ + +Samuel Pitoiset 
(227): + +- ac,radv,radeonsi: add new GFX12_DCC_WRITE_COMPRESS_DISABLE tiling flag +- radv/meta: add missing pipeline lookups +- radv/nir: update radv_nir_opt_tid for derivative group quads +- radv: implement derivative group quads on GFX12 +- radv: advertise computeDerivativeGroupQuads on GFX12 +- radv/meta: stop using string keys also for DGC and query objects +- util/disk_cache: add a new helper to create a disk cache +- vulkan/runtime: allow to use a different disk cache +- radv: fix caching on-demand meta shaders +- radv: fix adding the BO to cmdbuf list when starting conditional rendering +- radv: fix fetching draw vertex data from counter buffers with transform feedback +- radv: remove redundant drawCount == 0 for indirect mesh/task draws +- radv: use radv_indirect_dispatch() more +- radv: rework passing dispatch info via radv_dispatch_info +- radv: rework passing draw info via radv_draw_info +- radv: do not keep track of the streamout binding buffer +- nir: adjust number of components for cmat_muladd_amd +- radv/nir: add a struct for parameters to cooperative matrix lowering +- radv/nir: add cooperative matrix lowering for GFX12 +- radv: advertise VK_KHR_cooperative_matrix on GFX12 +- radv/meta: use BDA for query resolves +- radv/meta: compute the destination addr earlier for query resolves +- radv/meta: simplify creating buffers for R32G32B32 operations +- radv: remove unused device memory init/finish helpers +- radv/video: pass addr to send_cmd() +- radv/rmv: pass addr to log_resource_bind_locked() +- radv: pass addr to radv_copy_buffer() +- radv: rename radv_buffer::bo_va to addr +- radv: compute radv_buffer::addr at bind time +- radv: use radv_buffer::addr more +- radv/video: fix missing image offset when computing VA +- radv/meta: use BDA for clear HTILE mask +- radv/meta: use BDA for copying VRS rates to HTILE +- radv: use BDA for the uploaded parameters with DGC +- radv: simplify determining VBO size +- radv/meta: remove the heuristic that prefers CP 
DMA for GTT BOs +- radv/meta: disable conditional rendering for fill/update buffer operations +- radv/meta: inline copy_buffer() +- radv: rename fill/copy memory helpers +- radv/meta: add radv_{copy,fill,update}_memory() helpers +- radv: compute VBO addr at bind time +- radv: stop relying on VkBuffer for VBO +- ac/gpu_info: add gfx12_supports_dcc_write_compress_disable +- radv: add initial DCC support on GFX12 +- vulkan: add descriptor buffer support to compute astc decoder +- radv/meta: switch to descriptor buffers +- radv: remove radv_buffer_{init,finish}() helpers +- radv: remove radv_buffer_view_{init,finish}() helpers +- radv: fix adding the VRS image BO to the cmdbuf list on GFX11 +- radv/meta: add BOs to cmdbuf list earlier for image copy operations +- radv/meta: pass the buffer addr to SDMA image buffer copy operations +- radv/meta: use radv_copy_memory() instead of radv_copy_buffer() +- radv/meta: remove the buffer dependency for image copy operations +- radv: stop using image binding offset when exporting BO metadata +- radv: remove redundant zero initialization when creating images +- radv: rename radv_image::bindings::bo_va to addr +- radv: compute radv_image::bindings::addr at bind time +- radv: use radv_image::bindings::addr more +- radv: fix adding the BO for unaligned SDMA copies to the cmdbuf list +- vulkan: constify vk_acceleration_structure_get_va() +- radv: use radv_CmdDispatchIndirect() in the accel struct path +- radv/meta: use radv_copy_memory() for the FMASK copy +- radv: use radv_copy_memory() for accel structure updates +- radv/meta: inline radv_copy_buffer() +- radv/meta: rename image<->buffer copies helpers +- radv/video: fix adding the query pool BO to the cmdbuf list +- radv: stop computing the UUID using the physical device cache key +- radv: fix missing SQTT barriers for fbfetch color/depth decompressions +- radv: reserve bits explicitly for cache key structs +- vulkan: filter duplicate pNext struct at device creation +- radv: 
replace radv_image::shareable by vk_image::external_handle_types +- radv: remove meaningless TODOs in radv_GetDeviceImageMemoryRequirements() +- radv: handle OOM error properly when selecting image modifier +- radv/meta: inline radv_meta_get_view_layer() +- radv/meta: remove useless assertion in when copy VRS to HTILE +- radv: check HTILE compression for depth/stencil images per level +- radv: enable DCC fast clears for 8bpp/16bpp on GFX11 +- radv: fix re-emitting fragment output state when resetting gfx pipeline state +- docs: add missing RADV_PERFTEST=video_encode description +- radv: fix trap handler exception options +- radv: use radv_emulate_rt() more +- radv: remove redundant radv_instance::drirc::rt_wave64 +- docs: adjust NGG culling options description +- spirv: move workarounds to an inner struct in spirv_to_nir_options +- spirv/tests: initialize compiler options in constructor +- spirv/tests: add a test for NonSemantic.DebugBreak +- spirv/tests: add a test for force_ssbo_non_uniform +- spirv/tests: add a test for force_tex_non_uniform +- spirv/tests: add a test for lower_terminate_to_discard +- ci: update VKCTS main to ba86fb95004331f2cf571dd9adefe2458290ee11 +- radv: switch to device address from vk_buffer +- ci/b2c: fix passing B2C_* variables +- radv/ci: re-enable ET2C emulation testing on non-native GPUs +- radv: add RADV_DEBUG=pso_history +- ac,radv: add a workaround for a hw bug with primitive restart on GFX10-GFX10.3 +- radv/ci: enable RADV_PERFTEST=video_{decode,encode} on few GFX9+ GPUs +- aco/tests: use GFX1201 instead of GFX1200 +- radv/winsys: use real info for GFX12 in the null winsys +- radv: fix a GPU hang with inherited rendering and HiZ/HiS on GFX1201 +- radv/amdgpu: fix device deduplication +- meson: add build-radv-tests option +- ci: enable build-radv-tests for debian-clang and debian-vulkan +- radv/winsys: enable has_timeline_syncobj for the null winsys +- radv: add a small framework for RADV specific tests +- radv/tests: add a test to 
verify that pipelineCacheUUID is invariant +- radv/tests: add a test to verify that pipeline hash matches RGP<->Fossilize +- radv/tests: add few tests that verify drirc options +- radv: update conformance version +- aco: do not apply OMOD/CLAMP for pseudo scalar trans instrs +- radv/ci: enable RADV_PERFTEST=video_decode,video_encode on TAHITI,HAWAII and POLARIS10 +- ac/rgp: few fixes for GFX11.5 +- ac/rgp: add GFX12 definitions +- ac/rgp: bump version to 1.6 +- ac/rgp: bump instrumentation API version to 1.5 +- ac/sqtt: fix registers programming for GFX12 +- radv: enable RGP on GFX12 +- radv: emit a dummy PS state for noop FS on GFX12 +- radv: track redundant register writes for PA_SC_HISZ_CONTROL on GFX12 +- radv: fix creating pipeline binary from the traversal shader +- radv: use radv_sdma_emit_nop() more +- radv: add more SDMA emit helpers +- radv: add a helper to know whether compute queue is enabled +- radv: remove meaningless comment when resetting SQTT trace +- radv: add a helpers to know whether video decode/encode queues are enabled +- docs: update documentation for RADV_PERFTEST=video_decode,video_encode +- radv: fix bpe for the stencil aspect of depth/stencil copies on transfer queue +- radv: fix compresed depth/stencil copies on transfer queue +- radv: remove useless parameter to radv_sdma_get_buf_surf() +- radv: cleanup passing the aspect mask for SDMA operations +- radv: remove radv_force_pstate_peak_gfx11_dgpu=true for Helldivers 2 +- radv: use PM4 for setting specific graphics registers in the preamble +- radv/meta: fix color<->depth/stencil image copies +- radv/video: use a pointer to write the total task size +- radv/video: rework command buffer emission +- ci: uprev vkd3d-proton to 078f07f588c849c52fa21c8cfdd1c201465b1932 +- radv/ci: remove vkcts-navi21-llvm-valve completely +- radv/ci: remove radv-stoney-flakes.txt +- radv/ci: drop aco suffix for CI files +- radv/ci: rename radv-gfx1200 to radv-gfx1201 +- radv/ci: delete empty 
radv-hawaii-skips.txt +- radv: replace radeon_set_reg_seq by a macro +- radv: switch all emit helpers to macros +- radv: move the optimized context reg macros with other similar ones +- radv: rework radeon_set_uconfig_perfctr_reg_seq to use amd_ip_type +- radv: add new helper to emit PKT3_EVENT_WRITE for sampling queries +- radv: add radeon_event_write() macros +- radv/ci: fix renaming the VKCTS job for tahiti +- radv: rework the shader pointer emit as macros +- radv: apply some cosmetic changes for future begin/end CS sequences +- radv: add a helper to emit indirect buffer for draws/dispatches +- radv: add a helper to emit SPM muxsel +- radv: slightly change the COND_EXEC for sampling performance counters +- radv: use radv_cs_write_data_imm() more +- radv: add more helpers to start/stop perfcounters +- radv/video: slightly change radv_vcn_sq_header() +- radv: add a helper to emit a PKT3_COPY_DATA with an immediate +- radv: add a helper to emit PM4 commands to a CS +- radv: skip FCE for comp-to-single fast clears with DCC MSAA +- radv: rework radv_fast_clear_flush_image_inplace() +- radv: add radv_fmask_color_expand() +- radv: rework radv_handle_color_image_transition() +- radv: inline radv_fast_clear_flush_image_inplace() +- radv: make sure to always decompress FMASK before expanding it +- radv: disable TC-compatible CMASK with {FMASK,DCC}_DECOMPRESS +- ac,radv,radeonsi: use PM4 for shadowed registers +- radv: do not trigger FCE or FMASK decompress on compute queue +- radv: add queue family assertions when doing decompression passes +- radv: remove useless use of radv_image_use_comp_to_single() +- radv: only enable HTILE for depth/stencil attachment images +- radv: rework command buffer emission with begin/end sequences +- radv: switch back radeon_cmdbuf to use 32-bit counters +- radv/ci: stop skipping one memory test due to timeouts +- radv/ci: remove all skips for STONEY +- ac/surface: fix selecting preferred alignments for HiZ/HiS on GFX12 +- Revert "radv: 
program SAMPLE_MASK_TRACKER_WATERMARK optimally for GFX11 APUs" +- Revert "radeonsi/gfx11: program SAMPLE_MASK_TRACKER_WATERMARK optimally for APUs" +- radv: fix ignoring conditional rendering with vkCmdResolveImage() +- radv: add new helper to suspend/resume user conditional rendering +- radv: rework suspend/resume user conditional rendering +- ac,radv: remove has_scheduled_fence_dependency +- ac/gpu_info: bump required DRM minor version to 3.42.0 (kernel 5.15+) +- radv: remove useless FDCC_ENABLE bitfield clear on GFX12 +- radv: tidy up radv_emit_raster_state() +- radv: restore DB_DFSM_CONTROL properly when POPS isn't used +- radv: tidy up radv_emit_db_shader_control() +- radv: split framebuffer color state emission for GFX12 +- radv: split framebuffer depth/stencil state emission for GFX12 +- radv: split null framebuffer state emission for GFX12 +- radv: stop emitting CB_FDCC_CONTROL to zero on GFX11-GFX11.5 +- radv: do not emit the VRS surface VA when it's not enabled +- radv: add a fuction to emit the VRS surface on GFX11 +- radv: determine if HiZ/HiS is enabled earlier on GFX12 +- radv: add a workaround for buggy HiZ/HiS on GFX12 +- radv: apply the workaround for buggy HiZ/HiS on GFX12 for DGC +- radv: add before/after draw functions for DGC +- radv: move emitting raster and depth/stencil state slightly earlier +- radv: emit conservative raster mode as part of the MSAA state +- radv: configure COVERAGE_TO_SHADER_SELECT only if conservative rast is enabled +- radv: track more MSAA related register writes +- radv: regroup emitting all MSAA states in one function +- radv: add clip rects state bit for emitting discard rectangles +- radv: remove an old workaround for D3D9 with DXVK 2.3.0 and older +- radv: move the disable_trunc_coord drirc at instance/pdev level +- radv: move emitting more fb registers when rendering begins +- radv: use consecutive registers for PA_SC_WINDOW_SCISSOR_{TL,BR} +- radv: track redundant DB_RENDER_OVERRRIDE register writes on GFX12 +- 
radv: reduce the number of emitted DWORDS for MSAA 8x user sample locs +- radv: tidy up radv_emit_raster_state() +- radv: tidy up radv_emit_hw_ngg() +- radv: add macros for paired context registers on GFX12 +- radv: use paired context regs when optimal on GFX12 +- radv: only enable DCC for invisible VRAM on GFX12 +- ci: update VKCTS main to 76c1572eaba42d7ddd9bb8eb5788e52dd932068e +- radv: allocate the SPM BO in GTT for faster readback +- radv: print more error messages during SPM initialization +- ac/perfcounter: add support for GFX12 +- radv/sdma: simplify configuring the number of uncompressed DCC blocks +- radv/sdma: use SDMA5_DCC_xxx bitfields +- radv/sdma: remove redundant check for compression when getting metadata +- radv/sdma: use the correct helper to get the number type field +- radv/sdma: add a new flag to know if the surface is compressed +- radv/sdma: add support for compression on GFX12 +- radv: set radv_disable_dcc=true for WWE 2k23 +- radv: fix re-emitting VRS state when rendering begins +- radv: do not clear unwritten color attachments with dual-source blending +- radv: disable SINGLE clear codes to workaround a hw bug with DCC on GFX11 +- radv: fix GPU hangs with image copies for ASTC/ETC2 formats on transfer queue + +Saroj Kumar (2): + +- mesa: Add GL_EXT_protected_textures support +- radeonsi: Move buffer descriptor slot to the beginning + +Sasha Finkelstein (1): + +- vtn_bindgen2: Fix memory corruption + +Serdar Kocdemir (6): + +- gfxstream: Track more fence functions on host +- gfxstream: Add VK_KHR_multiview support +- gfxstream: track pipeline layouts on decoder +- gfxstream: wrap semaphore functions on the host +- gfxstream: Add dispatcher validity checks +- gfxstream: Wrap vkEnumerateInstanceExtensionProperties for host + +Sergi Blanch Torne (8): + +- ci: disable Collabora's farm due to maintenance +- Revert "ci: disable Collabora's farm due to maintenance" +- ci: typo in debian-android in .build-for-tests-jobs +- ci: disable Collabora's 
farm due to maintenance +- Revert "ci: disable Collabora's farm due to maintenance" +- ci: disable Collabora's farm due to maintenance +- Revert "ci: disable Collabora's farm due to maintenance" +- Uprev Piglit to ebdf60e0d4b0dc23e79373cb923fdee023eb3b2b + +Sergii Ushakov (1): + +- gfxstream: Emulate DMABUF with OPAQUE_FD + +Seán de Búrca (8): + +- rusticl/mem: don't write more supported image formats than requested +- rusticl/mem: don't create svm_pointers slice from null raw pointer +- rusticl/mem: use cl_slice::from_raw_parts in place of std::slice +- rusticl: rename CheckedPtr::copy_checked to match primitive method +- rusticl: mark CheckedPtr::write_checked as unsafe +- rusticl: add debug assertions to avoid truncating casts +- rusticl: correct calculation of maximum allocation size +- rusticl: cap max alloc size to i32 to avoid overflowing gallium + +Sil Vilerino (13): + +- d3d12: Fix HEVC range extension pic params validation +- pipe: Add profiles for HEVC 422 8/10b and 444 10b +- d3d12: Add support for Y210, Y410, YUY2 and HEVC 422 8/10b, HEVC 444 10b profiles +- d3d12: Add NULL initialization for d3d12_video_enc::m_pVideoTexArrayDPBPool +- d3d12: Add some missing members initialization for d3d12_video_buffer +- d3d12: Increase DPB video texture array pool size for async queue depth +- d3d12: Fix array of texture DPB cap detection +- d3d12: Fix warning 4305 truncation from type1 to type2 +- nir.h: Fix warning C4800 forcing value to bool 'true' or 'false' +- u_thread.h: Fix warning C4800 forcing value to bool 'true' or 'false' +- d3d12: Fix warning C4800 forcing value to bool 'true' or 'false' +- d3d12: Enable warnings C4056, C4305, C4351, C4756, C4800, C4291, C4020, C4624, C4309, C5105, C4024, C4189 +- d3d12: Cache the texture array cap requirement in encoder creation for calls to d3d12_video_create_dpb_buffer + +Simon Ser (9): + +- gbm: fix get_back_bo() failure with gbm_surface and implicit modifiers +- pvr: replace dup() with os_dupfd_cloexec() +- 
freedreno: replace dup() with os_dupfd_cloexec() +- iris: replace dup() with os_dupfd_cloexec() +- lavapipe: replace dup() with os_dupfd_cloexec() +- venus: replace dup() with os_dupfd_cloexec() +- panvk: replace dup() with os_dupfd_cloexec() +- libsync: replace dup() with os_dupfd_cloexec() +- vulkan/wsi/x11: replace dup() with os_dupfd_cloexec() + +Stéphane Cerveau (2): + +- anv: fix error code in GetPhysicalDeviceVideoFormatProperties +- radv: video: rework maxActiveReferenceSlot/MaxDpbSlots + +Sushma Venkatesh Reddy (1): + +- intel/tools: Improve memory allocation failure handling in aubinator_error_decode_xe + +Sviatoslav Peleshko (3): + +- anv: Add full subgroups workaround for the shaders that use shared memory +- drirc: Apply assume_full_subgroups_with_shared_memory to Resident Evil 2 +- vulkan/wsi/headless: Remove unnecessary wsi_configure_image() + +Tapani Pälli (25): + +- intel/genxml/anv: fix the layout of call stack handler struct +- intel/dev: reduce warning noise from urb settings +- intel/common: fix mi_builder_test issue +- anv: handle non-wsi images in anv_layout_to_aux_state +- anv: tighten condition for changing barrier layouts +- anv: apply cache flushes on pipeline select with gfx20 +- iris: wait for imported fences to be available in iris_fence_await +- intel/compiler: add a spec note about L1WT types being uncached +- iris: remove dead code that cannot get hit anymore +- intel/dev: update mesa_defs.json from internal database +- anv: restrict TessellationDistributionLevel for Wa_16025857284 +- iris: restrict TessellationDistributionLevel for Wa_16025857284 +- intel/dev: reduce warning noise from urb settings II +- isl: add usage field to isl_buffer_fill_state_info +- iris: pass down buffer usage for isl_buffer_fill_state +- anv: pass down buffer usage for isl_buffer_fill_state +- isl/iris/anv: setup L1CacheControl based on surface and buffer usage +- compiler/glsl: check that bias is not used outside fragment stage +- intel/dev: update 
mesa_defs.json from internal database +- mesa: clamp texbuf query size to MAX_TEXTURE_BUFFER_SIZE +- mesa: various fixes for ClearTexImage/ClearTexSubImage +- iris: force reallocate on eglCreateImage with GFX >= 20 +- iris: make sure to not mix compressed vs non-compressed +- anv: put parenthesis to the set_sampler_size equation +- intel/dev: update mesa_defs.json from internal database + +Taras Pisetskyi (1): + +- anv,driconf: Add sampler coordinate precision workaround for EVE Online + +Thomas H.P. Andersen (1): + +- nvk: use a valid allocation scope + +Tim Keller (1): + +- dril: Check for null config in dril_target.c + +Timothy Arceri (14): + +- util/disk_cache: dont create multidisk cache dir if unused +- util/u_idalloc: fix util_idalloc_sparse_alloc_range() +- ci: move llvmpipe fails to flakes +- mesa: fix reuse of deleted buffer object +- mesa: fix reuse of deleted texture object +- mesa: fix potential race condition in with TexObjects +- mesa: fix reuse of deleted sampler object +- mesa: fix potential race conditions in with FrameBuffers +- mesa: fix potential race condition in with RenderBuffers +- mesa: fix potential race condition in with ATIShaders +- mesa: fix potential race condition in with Programs +- nir: fix uniform cloning helper +- glsl: fix regression in ubo cloning +- util/driconf: add force_gl_depth_component_type_int workaround + +Timur Kristóf (94): + +- radv: Move buffer related NIR meta shaders to radv_meta_nir.c +- radv: Move blit NIR shaders to radv_meta_nir.c +- radv: Move blit2d NIR shaders to radv_meta_nir.c +- radv: Move buffer-image copy and clear NIR shaders to radv_meta_nir.c +- radv: Move clear NIR shaders to radv_meta_nir.c +- radv: Move VRS HTILE copy NIR shader to radv_meta_nir.c +- radv: Move DCC retile NIR shader to radv_meta_nir.c +- radv: Move expand depth stencil NIR shader to radv_meta_nir.c +- radv: Move DCC decompress NIR shader to radv_meta_nir.c +- radv: Move FMASK copy NIR shader to radv_meta_nir.c +- radv: Move 
FMASK expand NIR shader to radv_nir_meta.c +- radv: Move resolve NIR compute shaders to radv_meta_nir.c +- radv: Move resolve NIR fragment shaders to radv_meta_nir.c +- radv: Move resolve NIR fs to radv_meta_nir.c +- radv: Move NIR helpers from radv_meta.c to radv_meta_nir.c +- radv: Move NIR specific function declarations to radv_meta_nir.h +- radv: Rename get_global_ids to radv_meta_nir_get_global_ids. +- radv: Add missing copyright info to radv_meta_buffer.c +- util/enum_operators: Don't define anything for OpenCL +- nir: Add struct names where they were missing. +- nir: Add missing extern "C" to nir_defines.h +- nir: Add forward declarations of relevant structs to nir_defines.h +- nir: Move some enums and structs to nir_defines.h +- nir: Move nir_tcs_info to separate file. +- nir: Move nir_shader_compiler_options to separate file. +- nir: Don't include full nir.h in nir_xfb_info.h +- nir: Don't include the full nir.h when not necessary. +- spirv: Don't include full nir.h in nir_spirv.h +- vk: Don't include full nir.h in headers. +- glsl: Don't include full nir.h where not necessary. +- ac/nir: Don't include nir.h in headers anymore. +- ac/nir: Move surface related NIR functions to separate file. +- aco: Don't include nir.h in aco_interface.h anymore. +- ac: Don't include full nir.h anymore. +- radv: Add missing includes and remove unnecessary NIR includes. +- hk: Don't include full nir.h in hk_shader.h +- zink: Don't include full nir.h where not necessary. +- ttn: Don't include full nir.h where not necessary. +- nak: Don't include full nir.h in nak.h +- nvk: Don't include full nir.h in nvk_shader.h +- nir: Don't include xxhash.h in nir.h, only where it is used. +- nir: Don't include bitscan.h in nir.h, it's not actually used. +- nir: Don't include u_printf.h in nir.h, only where necessary. +- nir: Don't include u_format.h in nir.h, it's not actually used. +- nir: Remove struct keyword from nir.h where possible. 
+- nir: Add comment to indicate that NIR_PASS_V is deprecated. +- nir: Add bool return value to nir_fixup_deref_modes. +- nir: Add bool return value to nir_fixup_deref_types. +- nir: Don't use deprecated NIR_PASS_V macro anymore. +- radv: Stop using deprecated NIR_PASS_V with core NIR passes. +- radv: Add bool return value to radv_nir_lower_abi. +- radv: Add bool return value to radv_nir_apply_pipeline_layout. +- radv: Add bool return value to ray tracing NIR lowerings. +- ac/nir: Add bool return value to ac_nir_lower_ls_outputs_to_mem. +- ac/nir: Add bool return value to ac_nir_lower_hs_inputs_to_mem. +- ac/nir: Add bool return value to ac_nir_lower_hs_outputs_to_mem. +- ac/nir: Add bool return value to ac_nir_lower_tes_inputs_to_mem. +- ac/nir: Add bool return value to ac_nir_lower_es_outputs_to_mem. +- ac/nir: Add bool return value to ac_nir_lower_gs_inputs_to_mem. +- ac/nir: Add bool return value to ac_nir_lower_task_outputs_to_mem. +- ac/nir: Add bool return value to ac_nir_lower_mesh_inputs_to_mem. +- ac/nir: Add bool return value to ac_nir_lower_legacy_vs. +- ac/nir: Add bool return value to ac_nir_lower_legacy_gs. +- ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_nogs. +- ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_gs. +- ac/nir/ngg: Add bool return value to ac_nir_lower_ngg_mesh. +- radv: Don't use deprecated NIR_PASS_V macro for AMD common NIR passes. +- compiler/clc: Stop using deprecated NIR_PASS_V macro. +- vulkan/runtime: Don't use deprecated NIR_PASS_V macro. +- hk: Don't use deprecated NIR_PASS_V macro anymore. +- glsl: Don't use deprecated NIR_PASS_V macro anymore. +- nak: Don't use deprecated NIR_PASS_V macro anymore. +- nvk: Don't use deprecated NIR_PASS_V macro anymore. +- radv: Use flush postamble on GFX7 with different flags. +- ac/nir/ngg: Run copy propagation. +- ac/nir/ngg: Improve reuse of position value. +- ac/nir/ngg: Remove inputs_needed_by_* +- ac/nir/ngg: Prepare deferred shader part before adding culling code. 
+- ac/nir/ngg: Gather info about what the deferred shader part uses. +- ac/nir/ngg: Use deferred info for compacted arguments. +- ac/nir/ngg: Remove cleanup_culling_shader_after_dce. +- nir/xfb: Preserve some xfb information when gathering from intrinsics. +- nir/opt_varyings: Fix assertion when deduplicating TCS outputs. +- radv: Use buffers_written mask when gathering XFB info. +- radv: Call nir_opt_undef too after nir_opt_varyings. +- radv: Remove radv_streamout_info::num_outputs. +- nir/print: Fix variable mode for arrayed output load intrinsics. +- radv: Add radv_foreach_stage to ForEachMacros again. +- radv: Inline radv_graphics_shaders_link_varyings_{first/second}. +- radv: Refactor loops in radv_graphics_shaders_link_varyings. +- radv: Move preparation and fixup to separate loops in varying optimization. +- radv: Don't call nir_opt_varyings a second time when unnecessary. +- radv: Clear dirty flag for MSAA state after emitting it. +- radv: Clear dirty flag for clip rects state after emitting it. 
+ +Tomeu Vizoso (4): + +- egl/surfaceless: Only choose drivers that expose the graphics capability +- kopper: Explicitly choose zink +- etnaviv/ml: Use etna_buffer_resource instead of etna_resource +- etnaviv: Release screen->dummy_desc_reloc.bo + +Trigger Huang (2): + +- radeonsi: Fix perfcounter start event in si_pc_emit_start +- radeonsi: Change program seqnece for perf counters + +Valentine Burley (84): + +- amd/ci: Revert to 6.6 kernel on Raven +- zink/ci: Add a fraction for zink-venus-lvp +- khronos-update: Update ANDROID guards in vk_android_native_buffer.h +- zink/ci: Make zink-venus-lvp-full a nightly job +- ci/intel-gpu-freq: Add Xe support +- ci: Use new kernel with Intel Xe driver +- ci/lava: Allow passing extra cmdline arguments +- intel/ci: Drop redundant BOOT_METHOD variables +- intel/ci: Add newer i915/ADL-P firmware to rootfs +- intel/ci: Load Xe instead of i915 on ADL +- intel/dev: Provide a toggle to avoid warnings about unsupported devices +- intel/ci: Use INTEL_XE_IGNORE_EXPERIMENTAL_WARNING to reduce warnings +- intel/ci: Update expectations for Xe +- amd/ci: Fix fraction for radv-stoney-angle-full +- ci/android: Don't build desktop GL CTS +- ci/lava: Don't build VK-main for arm64 +- ci: Allow building ANGLE for multiple platforms +- ci/android: Build and use ANGLE +- ci/android: Check ANGLE version +- ci: Move debian-android up to the build-for-tests stage +- freedreno/ci: Unify naming for a306 jobs +- zink/ci: Rename a618 suite and expectation files +- ci/angle: Rework building ANGLE (again) +- ci/lava: Build ANGLE for arm64 +- ci/android: Add build section for Cuttlefish +- ci/angle: Use lld-19 for linking ANGLE +- intel/ci: Fix manual rules for ANGLE jobs +- ci/angle: Uprev ANGLE +- ci: Update expectations from latest nightly +- turnip/ci: Add nightly ANGLE jobs on a618 and a660 +- ci/container: Include ANGLE in the arm64 test-gl container +- turnip/ci: Rename valve-freedreno-turnip-manual-rules +- turnip/ci: Add a nightly ANGLE job on 
a750 +- radv/ci: Don't start X11 for ANGLE +- anv/ci: Don't start X11 for ANGLE +- ci: Simplify downloading kernel for crosvm +- ci: Don't download the kernel image in lava_build.sh +- intel/ci: Update GuC firmware for ADL-S and ADL-N +- ci: Use new kernel that supports more Mediatek devices +- anv/ci: Update expectations from latest nightly +- intel/ci: Honor device-specific FDO_CI_CONCURRENT variables +- intel/ci: Add brask and nissa +- anv/ci: Migrate anv-adl-angle job to brask +- intel/ci: Migrate intel-adl-cl and intel-adl-skqp to nissa +- anv/ci: Increase parallelism of zink-anv-adl +- lavapipe: Update driverVersion +- anv/ci: Remove fixed test from xfails +- zink/ci: Run more traces on Tiger Lake +- zink/ci: Add trace testing on Alder Lake +- anv/ci: Append -vk suffix to VKCTS job names +- ci: Update kernel to include i.MX8MP dtb +- panforst/ci: Migrate the G57 GL job to MT8195 +- panfrost/ci: Add a Piglit job on G57 +- panfrost/ci: Shorten Piglit job names +- panfrost/ci: Pin g610-gl job to 4GB DUTs +- tu: Switch to device address from vk_buffer +- ci: Enable Perfetto in debian-no-libdrm for Turnip build testing +- zink/ci: Work around recent OOM issues in zink-anv-adl +- ci/deqp: Delete more uncompressed caselist files +- ci: Add missing kvm runner tags +- ci/android: Don't delete ninja after building LLVM +- ci/android: Keep the LLVM install when rebuilding +- ci/android: Update to LLVM 19 for Android +- ci/android: Temporarily disable building llvmpipe +- radv/ci: Delete obsolete vkcts-stoney-valve job +- amd/ci: Rename AMD jobs to follow unified naming convention +- ci: Disable the Google freedreno farm +- panvk/ci: Migrate the G52 VK job to MT8186 +- ci: Make it possible to use ANGLE traces on other architectures +- radv/ci: Update ANGLE version used for traces +- ci/android: Remove platform-tools from test-android container +- ci/android: Add section for downloading Android CTS +- ci/container: Disable debian/arm32_test-vk container +- ci/container: 
Remove double build sections in test-* containers +- ci/container: Move calling strip-rootfs.sh to common scripts +- ci/android: Add build section for uninstalling build software +- ci/container: Drop unnecessary variables for image paths +- intel/ci: Convert iris-kbl-piglit to deqp-runner suite +- ci/piglit: Drop redundant PIGLIT_PROFILES variable +- ci/piglit: Consolidate HWCI_TEST_SCRIPT for piglit traces +- ci/piglit: Remove piglit-runner.sh script +- ci/piglit: Consolidate identical skip lists for X11 and gbm +- ci/lava: Consolidate piglit trace job definitions +- zink/ci: Work around recent OOM issues in zink-anv-tgl + +Vasily Khoruzhick (20): + +- lima: ppir: handle ffma in the backend +- lima: ppir: improve readability of ppir represantation dump +- lima: ppir: fixup src node when replacing src for select and load_reg +- lima: ppir: print index of the node that breaks node_to_instr +- lima: ppir: fix diassembling atan and combiner codegen definition +- lima: ppir: assert on unexpected pipeline dest for fmul and vmul +- lima: ppir: add codegen for mov and mul on combiner unit +- lima: ppir: use combiner unit for mul +- lima: ppir: duplicate fneg and fabs if its source is an intrinsic +- lima: ppir: fix regalloc bugs +- lima: ppir: assign an index for discard block +- lima: ppir: introduce an optimizer +- lima: ppir: try scheduling root nodes into the same instruction +- lima: ci: update deqp CI expectations +- lima: ppir: add compactification pass +- lima: ppir: reuse uniform load in instruction if possible +- lima: ppir: reuse load_temp/store_temp nodes if possible +- lima: ppir: assign actual index to discard block +- lima: ppir: optimize branches +- lima: ppir: try inserting nodes into successor instr for uncond branch + +Vignesh Raman (2): + +- s3_upload: improve url validation and error message +- ci: Uprev kernel to 6.14 + +Visan, Tiberiu (2): + +- amd/vpelib: Fix studio range +- amd/vpelib: Apply normalization for full range + +Vlad Zahorodnii (3): + 
+- egl/wayland: Damage whole surface using wl_surface_damage_buffer() +- vulkan/wsi/wayland: Damage whole surface using wl_surface_damage_buffer() +- vulkan/wsi/wayland: Document why wl_surface_damage() code path ignores provided damage + +Xaver Hugl (6): + +- increase required wayland-protocols version to 1.41 +- vulkan/wsi: implement support for VK_EXT_hdr_metadata on Wayland +- vulkan/wsi: handle the compositor not supporting extended target volume better +- vulkan/wsi: don't use sRGB if the compositor doesn't support it +- vulkan/wsi: validate HDR metadata to not cause protocol errors +- vulkan/wsi: warn once when HDR metadata is skipped because of protocol errors + +Yinjie Yao (2): + +- gallium/pipe: Increase hevc max slice to 600 +- frontends/va: Handle properly when decoding more slices than limit + +Yiwei Zhang (124): + +- venus: scrub disallowed ycbcr features for rgba10x6 +- venus: further sanitize image props for rgba10x6 +- venus: refactor more to image format props sanitization +- venus: enable VK_EXT_external_memory_acquire_unmodified if needed +- venus: use dedicated allocation for ANB image memory import +- venus: emulate a second graphics queue on Android +- venus: sync venus protocol for below extensions +- venus: group private data together with other 1.3 exts +- venus: fix to handle pipeline flags2 from maint5 +- venus: support VK_EXT_blend_operation_advanced +- venus: refactor to share more codes between pipeline state fillings +- venus: support VK_EXT_sample_locations +- venus: fix sampler locations feats and props scrub +- venus: sync protocol to v1.3.302 +- Support 5 more promoted extensions +- venus: fix maintenance5 props init and create flags2 +- venus: drop unused codes +- venus: sync protocol for v1.4.307 release and update promoted entries +- venus: updated to use core types for promoted extensions +- venus: support VK_KHR_shader_subgroup_rotate +- venus: support VK_KHR_shader_float_controls2 +- venus: support 
VK_EXT_pipeline_protected_access +- venus: support VK_EXT_pipeline_robustness +- venus: support VK_KHR_map_memory2 +- venus: support VK_KHR_global_priority +- venus: support VK_KHR_dynamic_rendering_local_read +- venus: a few tiny naming fixes +- venus: support VK_KHR_maintenance6 +- venus: update second queue emulation for 1.4 requirement +- venus: deprecate a few useless micros +- venus: advertise 1.4 support +- venus: fix 2 entry points from maint6 +- venus: fix sample location info validity +- venus: limit second queue emulation to android framework +- venus: temporarily disable 1.4 support +- venus: suppress a few -Wmaybe-uninitialized +- venus: drop unnecessary struct +- venus: use sharing_mode from common vk_image +- venus: fix image format cache miss with AHB usage query +- venus: sync latest protocol v3 support for host copy +- venus: extend image format cache for host copy props +- venus: implement host image copy commands +- vulkan/util: clean up copy_property and fix for setter +- venus: enable VK_EXT_host_image_copy support +- venus: re-enable 1.4 support +- venus: align on wsi frontends support +- venus: support wsi maintenance1 extensions +- venus: sync protocol for the passthrough extensions +- venus: added passthrough extension support - Part I +- venus: added passthrough extension support - Part II +- venus: added passthrough extension support - Part III +- venus: added passthrough extension support - Part IV +- venus: added passthrough extension support - Part V +- venus: relax the requirement for sync2 +- venus: fix an obsolete protocol sync earlier +- venus: fix a memory corruption in query records recycle +- venus: sync protocol for ray tracing support +- venus: add a debug option for ray tracing support +- venus: support VK_KHR_deferred_host_operations +- venus: add stubs for accel struct host commands +- venus: implement VK_KHR_acceleration_structure - Part I +- venus: implement VK_KHR_acceleration_structure - Part II +- venus: implement 
VK_KHR_acceleration_structure - Part III +- venus: enable VK_KHR_acceleration_structure +- venus: support VK_KHR_ray_query +- venus: prepare push template for ray tracing pipeline +- venus: implement VK_KHR_ray_tracing_pipeline commands +- venus: enable VK_KHR_ray_tracing_pipeline +- venus: support VK_KHR_ray_tracing_position_fetch +- venus: support VK_KHR_ray_tracing_maintenance1 +- lavapipe: set availability bit for accel struct host queries +- lavapipe: fix accel struct device query copy +- venus: use common memory report implementation +- venus: fix to ignore dstSet for push descriptor +- venus: extend async descriptor set alloc coverage +- venus: relax 2 assertions for prime blit path +- venus: sync protocol support for maint7 +- venus: sync protocol for accel struct indirect build encoding fix +- pan/kmod: set DRM_RDWR for exported dma-bufs +- venus: drop vn_call usage on apis without any returns +- venus: sync latest protocol for more extension support +- venus: add a few more trivial extensions +- venus: support VK_EXT_filter_cubic +- venus: support VK_EXT_legacy_dithering +- venus: support VK_EXT_depth_bias_control +- venus: support VK_EXT_depth_clamp_control +- venus: support VK_EXT_attachment_feedback_loop_dynamic_state +- venus: support VK_EXT_nested_command_buffer +- venus: default to passthrough ray tracing support +- venus: add a new debug option to revive memory budget support +- venus: properly enable display platform extensions +- venus: back out display control +- venus: support VK_EXT_debug_report +- venus: sync latest protocol for 2 more extensions +- venus: support VK_EXT_pipeline_library_group_handles +- venus: support VK_EXT_image_sliced_view_of_3d +- panvk: disable VK_KHR_shader_quad_control +- panvk: fix dependency for EXT_display_control +- vulkan: update ALLOWED_ANDROID_VERSION for api level 34 +- vulkan: update ALLOWED_ANDROID_VERSION for api level 35 +- venus: rename common vk object base member to vk +- venus: explicitly get vn_device 
from vk_device +- venus: use common vk_command_pool +- venus: use vk_command_pool internals +- venus: use common vk_command_buffer +- venus: use vk_command_buffer internals +- venus: use common cmd pool_link +- docs: demote VK_KHR_shader_relaxed_extended_instruction +- venus: fix unexpected ring alive status expire upon owner thread switch +- venus: fix ahb usage caching +- venus: fix maint4 multi-planar memory requirements +- venus: improve image memory requirement cache for image aliasing +- venus: enable VK_EXT_debug_utils +- virgl/venus/vtest: align capset and protocol with virglrenderer +- panvk/csf: rework cache flush reduction +- panvk: fix memory requirement query for aliased disjoint image +- venus: support VK_KHR_maintenance7 +- venus: sync latest protocol +- venus: support VK_EXT_buffer_device_address +- venus: support VK_KHR_shader_relaxed_extended_instruction +- venus: refactor format properties cache to be extensible +- venus: support VK_EXT_multisampled_render_to_single_sampled +- docs: update venus driver page +- venus: fix missing renderer destructions + +Yogesh Mohan Marimuthu (7): + +- winsys/amdgpu: do not use rcs->csc +- winsys/amdgpu: make csc context as array +- winsys/amdgpu: amdgpu_cs_context is csc, amdgpu_cs is acs +- winsys/radeon: struct radeon_cmdbuf is rcs instead of cs for consistency +- winsys/amdgpu: same_queue variable should be set if there is only one queue +- winsys/amdgpu: userqueue multi ctx jobs are guaranteed to be in sequence +- winsys/amdgpu: userq non imported fence can be ignored for same ip_type + +Yurii Kolesnykov (1): + +- Get rid of 5 remaining references to glapitable.h + +Zan Dobersek (11): + +- tu/a750: invalidate vertex state before CP_DRAW_INDIRECT_MULTI +- tu: make tu_debug_flags enum 64-bit +- tu: use query index when retrieving performance query iovas +- tu: performance query result writes must use dedicated union type +- tu/a7xx: disable preemption during performance query measurement +- freedreno: add 
common implementation of perfcntr-based derived counters +- tu: support exposing derived counters through VK_KHR_performance_query +- tu: fix zero-index perfcntr collection for derived counters +- tu: disable logic operations for float and sRGB formats +- freedreno/registers: add useful A6XX_SP_TP_MODE_CNTL bitfields +- tu: allow D3D-compatible texture coordinate rounding + +Zhao, Jiali (1): + +- amd/vpelib: Fix studio output CSC + +forbiddenlake (2): + +- docs: Fix HTML build with Sphinx 8.2 +- ci/alpine: upgrade sphinx and hawkmoth to the latest versions + +irql-notlessorequal (7): + +- hasvk: Fix non-functioning version override. +- elk: ensure VUE header writes in HS/DS/GS stages +- elk: always write the VUE header +- hasvk: Pre-plumbing needed for VK_KHR_maintenance5 +- hasvk: Implement VkPipelineCreateFlags2KHR support +- hasvk: Implement VK calls and formats. +- hasvk: Advertise VK_KHR_maintenance5 + +lcagustini (1): + +- panvk: Advertise support for VK_EXT_border_color_swizzle + +liuqiang (1): + +- intel/brw: Remove redundant condition in components_read() + +llyyr (1): + +- vulkan/wsi/wayland: initialize surface colorspace with PASS_THROUGH_EXT + +sarbes (1): + +- lima: add genxml for texture descriptor diff --git a/docs/relnotes/new_features.txt b/docs/relnotes/new_features.txt deleted file mode 100644 index a55b7ec2760..00000000000 --- a/docs/relnotes/new_features.txt +++ /dev/null @@ -1,48 +0,0 @@ -cl_khr_spirv_linkonce_odr in rusticl -storagePushConstant16 on panvk -storageInputOutput16 on panvk -VK_KHR_depth_stencil_resolve on panvk -VK_KHR_separate_depth_stencil_layouts on panvk -VK_EXT_separate_stencil_usage on panvk -VK_KHR_sampler_ycbcr_conversion on panvk/v10+ -VK_EXT_ycbcr_2plane_444_formats on panvk/v10+ -VK_EXT_ycbcr_image_arrays on panvk/v10+ -VK_KHR_imageless_framebuffer on panvk -VK_KHR_uniform_buffer_standard_layout on panvk -VK_EXT_border_color_swizzle on panvk -VK_MESA_image_alignment_control on NVK -shaderFloat16 on panvk 
-VK_KHR_shader_subgroup_uniform_control_flow on panvk/v10+ -VK_KHR_shader_maximal_reconvergence on panvk/v10+ -VK_EXT_device_memory_report on RADV -VK_KHR_shader_subgroup_extended_types on panvk/v10+ -shaderStorageImageExtendedFormats on panvk -VK_KHR_display on panvk -VK_EXT_display_control on panvk -EXT_shader_framebuffer_image_fetch on v3d -EXT_shader_framebuffer_image_fetch_coherent on v3d -KHR_blend_equation_advanced on v3d -KHR_blend_equation_advanced_coherent on v3d -KHR_partial_update on etnaviv -VK_KHR_line_rasterization on panvk -VK_EXT_line_rasterization on panvk -shaderImageGatherExtended on panvk -textureCompressionBC on panvk -VK_EXT_sample_locations on RADV for GFX10+ -GL_ARB_shader_clock on panvk -VK_KHR_shader_float_controls on panvk -VK_KHR_shader_float_controls2 on panvk/v10+ -storageBuffer8BitAccess on panvk -storagePushConstant8 on panvk -uniformAndStorageBuffer8BitAccess on panvk -MSAA with 8 and 16 sample counts on panvk -VK_KHR_spirv_1_4 on panvk/v10+ -Mali G720 and G725 on Panfrost and panvk -dualSrcBlend on panvk -VK_KHR_dynamic_rendering_local_read on panvk -VK_EXT_subgroup_size_control on panvk/v10+ -VK_KHR_format_feature_flags2 on panvk -shaderStorageImageReadWithoutFormat on panvk -shaderStorageImageWriteWithoutFormat on panvk -VK_EXT_direct_mode_display on panvk -Vulkan 1.2 on panvk/v10+