4749 lines
		
	
	
		
			221 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
			
		
		
	
	
			4749 lines
		
	
	
		
			221 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
| Mesa 20.2.0 Release Notes / 2020-09-28
 | ||
| ======================================
 | ||
| 
 | ||
| Mesa 20.2.0 is a new development release. People who are concerned
 | ||
| with stability and reliability should stick with a previous release or
 | ||
| wait for Mesa 20.2.1.
 | ||
| 
 | ||
| Mesa 20.2.0 implements the OpenGL 4.6 API, but the version reported by
 | ||
| glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
 | ||
| glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
 | ||
| Some drivers don't support all the features required in OpenGL 4.6. OpenGL
 | ||
| 4.6 is **only** available if requested at context creation.
 | ||
| Compatibility contexts may report a lower version depending on each driver.
 | ||
| 
 | ||
| Mesa 20.2.0 implements the Vulkan 1.2 API, but the version reported by
 | ||
| the apiVersion property of the VkPhysicalDeviceProperties struct
 | ||
| depends on the particular driver being used.
 | ||
| 
 | ||
| SHA256 checksum
 | ||
| ---------------
 | ||
| 
 | ||
| ::
 | ||
| 
 | ||
|     63f0359575d558ef98dd78adffc0df4c66b76964ebf603b778b7004964191d30  mesa-20.2.0.tar.xz
 | ||
| 
 | ||
| 
 | ||
| New features
 | ||
| ------------
 | ||
| 
 | ||
| - GL_ARB_compute_variable_group_size on Iris.
 | ||
| 
 | ||
| - GL_ARB_gpu_shader5 on llvmpipe
 | ||
| 
 | ||
| - GL_ARB_post_depth_coverage on llvmpipe
 | ||
| 
 | ||
| - GLES 3.2 on llvmpipe
 | ||
| 
 | ||
| - GL_EXT_shader_group_vote on GLES3.
 | ||
| 
 | ||
| - GL_EXT_texture_shadow_lod on llvmpipe
 | ||
| 
 | ||
| - VK_AMD_texture_gather_bias_lod on RADV.
 | ||
| 
 | ||
| - VK_AMD_gpu_shader_half_float on RADV/ACO.
 | ||
| 
 | ||
| - VK_AMD_gpu_shader_int16 on RADV/ACO.
 | ||
| 
 | ||
| - VK_EXT_extended_dynamic_state on ANV and RADV.
 | ||
| 
 | ||
| - VK_EXT_image_robustness on RADV.
 | ||
| 
 | ||
| - VK_EXT_private_data on ANV and RADV.
 | ||
| 
 | ||
| - VK_EXT_custom_border_color on ANV and RADV.
 | ||
| 
 | ||
| - VK_EXT_pipeline_creation_cache_control on ANV and RADV.
 | ||
| 
 | ||
| - VK_EXT_shader_demote_to_helper_invocation on RADV/LLVM.
 | ||
| 
 | ||
| - VK_EXT_subgroup_size_control on RADV/ACO.
 | ||
| 
 | ||
| - VK_GOOGLE_user_type on ANV and RADV.
 | ||
| 
 | ||
| - VK_KHR_shader_subgroup_extended_types on RADV/ACO.
 | ||
| 
 | ||
| - GL_ARB_gl_spirv on nvc0/nir.
 | ||
| 
 | ||
| - GL_ARB_spirv_extensions on nvc0/nir.
 | ||
| 
 | ||
| - RADV now uses ACO per default as backend
 | ||
| 
 | ||
| - RADV_DEBUG=llvm option to enable LLVM backend for RADV
 | ||
| 
 | ||
| - VK_EXT_image_robustness for ANV
 | ||
| 
 | ||
| - VK_EXT_shader_atomic_float on ANV
 | ||
| 
 | ||
| - VK_EXT_4444_formats on ANV and RADV.
 | ||
| 
 | ||
| - VK_KHR_memory_model on RADV.
 | ||
| 
 | ||
| - GL 4.5 on llvmpipe
 | ||
| 
 | ||
| - EGL_KHR_swap_buffers_with_damage on X11 (DRI3)
 | ||
| 
 | ||
| 
 | ||
| Bug fixes
 | ||
| ---------
 | ||
| 
 | ||
| - [Regression][Bisected][20.2][radeonsi] American Truck Simulator continually allocates memory until OOM
 | ||
| - anv: dEQP-VK.robustness.robustness2.* failures on gen12
 | ||
| - [RADV] Problems reading primitive ID in fragment shader after tessellation
 | ||
| - Massive memory leak (at least AMD, others unknown)
 | ||
| - Substance Painter 6.1.3 black glitches on Radeon RX570
 | ||
| - vkCmdCopyImage broadcasts subsample 0 of MSAA src into all subsamples of dst on RADV
 | ||
| - Crash in ruvd_end_frame when calling vaBeginPicture/vaEndPicture without rendering anything
 | ||
| - X-Plane 11 Installer crashes on startup since `glsl: declare gl_Layer/gl_ViewportIndex/gl_ViewportMask as vs builtins`
 | ||
| - Horizon Zero Dawn graphics corruption with with radv
 | ||
| - Amber test opt_peel_loop_initial_if: Assertion failed
 | ||
| - Dirt Rally: Flickering glitches on certain foliage since Mesa 20.1.0 caused by MSAA
 | ||
| - [BRW] WRC 5 asserts with gallium nine and iris.
 | ||
| - radv: Corruption in "The Surge 2"
 | ||
| - [RADV] Detroit: Become Human Demo game lock-ups with RADV
 | ||
| - Road Redemption certain  graphic effects rendered white color
 | ||
| - vulkan/wsi/x11: deadlock with Xwayland when compositor holds multiple buffers
 | ||
| - [RADV/ACO] Death Stranding cause a GPU hung (*ERROR* Waiting for fences timed out!)
 | ||
| - lp_bld_init.c:172:7: error: implicit declaration of function ‘LLVMAddConstantPropagationPass’; did you mean ‘LLVMAddCorrelatedValuePropagationPass’? [-Werror=implicit-function-declaration]
 | ||
| - Intel Vulkan driver crash with alpha-to-coverage
 | ||
| - EGL_KHR_swap_buffers_with_damage support on X11
 | ||
| - radv: blitting 3D images with linear filter
 | ||
| - [ACO] Compiling pipelines from RPCS3's shader interpreter spins forever in ACO code
 | ||
| - Intel Vulkan driver assertion with small xfb buffer
 | ||
| - [spirv-fuzz] SPIR-V parsing failed "src->type->type == dest->type->type"
 | ||
| - radeonsi: radeonsi crashes in Chrome on chromeos
 | ||
| - [RADV] commit d19bc94e4eb94 broke gamescope with Navi
 | ||
| - 4e3a7dcf6ee4946c46ae8b35e7883a49859ef6fb breaks Gamescope showing windows properly.
 | ||
| - anv: crashes in CTS test dEQP-VK.subgroups.*.framebuffer.*_tess_eval
 | ||
| - Intel Vuikan (anv) crash in copy_non_dynamic_state() when using validation layer
 | ||
| - Mafia 3: Trees get rendered incorrectly
 | ||
| - radv: dEQP-VK.synchronization.op.multi_queue.timeline_semaphore.write_clear_attachments_*_concurrent fail when forcing DCC.
 | ||
| - Crash on GTA 5 through proton 5.0.9 and GE versions
 | ||
| - Mesa 20.2.0-rc1 fails to build for AMD
 | ||
| - Assertion failure compiling shader from Zigguart
 | ||
| - Panfrost locks for waiting fence when running Source engine games
 | ||
| - ci: `-Dtools=panfrost` should be build-tested
 | ||
| - panfrost: Register allocation fails for Firefox WebRender shaders
 | ||
| - VRAM leak with vuilkan external memory + opengl memory objects
 | ||
| - [vulkan/build] Recent build system changes made VK_EXT_acquire_xlib_display unnecessarily depend on GBM
 | ||
| - ci: Capture devcoredumps on chezas
 | ||
| - Possible array out of bounds in brw_vec4_nir.cpp
 | ||
| - freedreno/a6xx: incorrect rendering in asphalt 9
 | ||
| - [tgl][bisected][regression][iris] failure on dEQP-EGL.functional.wide_color.pbuffer_8888_colorspace_default
 | ||
| - Multiply defined symbols compiling with gcc@10.1.0
 | ||
| - shrinking descriptor pool on intel+vulkan
 | ||
| - dEQP-VK.renderpass2.dedicated_allocation.attachment.1.12 fails on NAVI14
 | ||
| - turnip: binning and indirect dependency
 | ||
| - Amber test leads to NIR validation failed after nir_opt_if (on spirv-fuzz shader)
 | ||
| - Unable to compile mesa-git from b559d26c
 | ||
| - Ambient light too bright with ACO in AC: Odyssey
 | ||
| - Multiple issues with Detroit Become Human
 | ||
| - ci: Capture artifacts in baremetal mode
 | ||
| - turnip/ir3: fine derivatives
 | ||
| - panfrost: regression: Major stuttering and low compositor FPS with glmark2
 | ||
| - khr_debug-push-pop-group_gl: ../src/util/simple_mtx.h:86: simple_mtx_lock: Assertion \`c != _SIMPLE_MTX_INVALID_VALUE' failed.
 | ||
| - freedreno/a6xx: skai/skqp fails
 | ||
| - SPIR-V parsing fails in src/compiler/spirv/spirv_to_nir.c
 | ||
| - SPIR-V parsing fails in src/compiler/spirv/vtn_cfg.c
 | ||
| - Weird GLSL bug
 | ||
| - iris driver is broken in Freedesktop 19.08
 | ||
| - LLVM not properly shutdown in `si_pipe.c`?
 | ||
| - Panfrost: add current status to docs/features.txt
 | ||
| - Opengl incorrect rendering on yuzu Amd
 | ||
| - RADV: VK_ACCESS_MEMORY_READ/WRITE_BIT is not implemented
 | ||
| - [bisected][regression][all platforms] multiple deqp-gles31/glescts/piglit failures
 | ||
| - 7406ea37, "ac/surface: require that gfx8 doesn't have DCC in order to be displayable", breaks Gamescope being able to launch games on RX580, and possibly other gfx8 cards
 | ||
| - vkGetSemaphoreCounterValue doesn't update without vkWaitSemaphores calls on Intel UHD 620
 | ||
| - [RADV] System crash when playing XCOM Chimera Squad because of commit #7a5e6fd2
 | ||
| - [RADV] Non-precise occlusion queries return non-zero when all fragments are discarded
 | ||
| - [DXVK] Project Cars rendering problems
 | ||
| - ADDRLIB ODR Violation
 | ||
| - Build fails with current mesa from git "undefinierter Verweis auf »nir_lower_clip_disable«"
 | ||
| - KDE Compositor stuttering after Check for window destruction in dri3_wait_for_event_locked
 | ||
| - Add fallthrough to prevent errors caused by missing break
 | ||
| - i965/20.1: gray rendering with torcs racing
 | ||
| - glBindBufferRange call seems to be ignored by one of two shader-programs on radeon cards
 | ||
| - [bisected][g33] piglit.spec.ext_framebuffer_object.fbo-cubemap failure
 | ||
| - Increase GL_MAX_COMPUTE_SHADER_STORAGE_BLOCKS to greater value.
 | ||
| - nir: st_nir_lower_builtin fails for gl_LightSource[i]
 | ||
| - Sometimes VLC player process gets stuck in memory after closure if video output used is Auto or OpenGL
 | ||
| - Double unlock in rbug_context.c
 | ||
| - Double copy for TexSubImage
 | ||
| - [v3d] corruption when GS omits some vertices
 | ||
| - Iris crashes when reading from multisampled front buffer on platforms without front buffer
 | ||
| - freedreno: subway surfers crash when repeatedly toggling fullscreen
 | ||
| - [RADV/GFX8] Performance drop in DOOM Eternal when "Present from compute" is enabled
 | ||
| - freedreno: multiple applications crash on a5xx
 | ||
| - Use-after-free crash innv50_ir::GCRA::RIG_Node::init()
 | ||
| - intel: Sample mask writes need to be honored in Vulkan
 | ||
| - [RADV] - Path of Exile (238960) - Map outline, landscape and markers are missing with the Vulkan renderer.
 | ||
| - ASTC texture decompression fails when using software fallback
 | ||
| - [i965][iris][regression][bisected] multiple piglit and glcts failures on all platforms
 | ||
| - please publish GPG keyring used to sign new releases
 | ||
| - [BISECTED] compiling shader causes crash
 | ||
| - Missing render Information on Stellaris
 | ||
| - freedreno/ir3: allow copy-propagate from array
 | ||
| - Zink + GALLIUM_HUD SIGSEGV
 | ||
| - piglit spec@egl_ext_device_base@conformance fails LLVM 11 Git assertion since "llvmpipe/fs: add caching support"
 | ||
| - llvmpipe: 1x1 framebuffer with a 2x2 viewport
 | ||
| - [regression] nir build failure
 | ||
| - ci: need to end baremetal tests after kernel panic/instaboot
 | ||
| - If-statement body is executed for false condition
 | ||
| - freedreno/a6xx: broken rendering in playcanvas "after the flood"
 | ||
| - [regression] performance drop on Dota 2, CS:GO, and gfxbench GL benchmarks on ICL/Iris
 | ||
| - [amd] C++ ODR violatation for union GB_ADDR_CONFIG
 | ||
| - Zink reports incorrect amount of video memory
 | ||
| - [RADV/LLVM]: void llvm::ICmpInst::AssertOK(): Assertion \`getOperand(0)->getType() == getOperand(1)->getType() && "Both operands to ICmp instruction are not of the same type!"' failed.
 | ||
| - glsl-1.50-gs-max-output hangs on Navi10 + NGG
 | ||
| - anv: Runs out of binding tables with PPSSPP during long runs
 | ||
| - Segfault in Panfrost with waypipe
 | ||
| - ci: Use rsync instead of rm -rf ; cp for baremetal rootfs
 | ||
| - i965: Rendering problems replaying a trace of "Refunct" after mesa-20.1.0-rc1 release [bisected]
 | ||
| - Panfrost (rk3399 NanoPi M4) hang/crash on playing video on Kodi/X11
 | ||
| - gallium/winsys/radeon/drm fails assertion on 32bit
 | ||
| - NIR validation failed after glsl to nir, before function inline, wrong {src,dst}->type ?
 | ||
| - nir/spirv asin() function not precise enough
 | ||
| - Mesa 20.0.7 / 20.1.0-rc4 regression, extremally long shader compilation time in NIR
 | ||
| - Android build error after 689acc73
 | ||
| - freedreno/a6xx: gpu hangs in google earth
 | ||
| - Mesa-git build fails on Fedora Rawhide
 | ||
| - Doom Eternal 1.1 performs very poorly on RADV
 | ||
| - iris/i965: possible regression in 20.0.5 due to changes in buffer manager sharing across screens (firefox/mozilla#1634213)
 | ||
| - iris/i965: possible regression in 20.0.5 due to changes in buffer manager sharing across screens (firefox/mozilla#1634213)
 | ||
| - Incorrect _NetBSD__ macro inside execmem.c
 | ||
| - Possible invalid sizeof in device.c
 | ||
| - YUV FP16 lowering validation failing
 | ||
| - GLSL compiler assertion is_float() failed in glsl/ir_validate.cpp, visit_leave on specific WebGL shader
 | ||
| - [RADV] - Doom Eternal (782330) & Metro Exodus (412020) - Title requires 'RADV_DEBUG=zerovram' to eliminate colorful graphical aberrations.
 | ||
| - [RADV] - Doom Eternal (782330) & Metro Exodus (412020) - Title requires 'RADV_DEBUG=zerovram' to eliminate colorful graphical aberrations.
 | ||
| - mesa trunk master vulkan overlay-layer meson.build warning empty configuration_data() object
 | ||
| - [meson] increase minimum required version
 | ||
| - Kicad fails to render 3D PCB models.
 | ||
| - freedreno: minetest: alpha channel issue on a6xx
 | ||
| - Reproduceable i915 gpu hang Intel Iris Plus Graphics (Ice Lake 8x8 GT2)
 | ||
| - 7 Days to Die - "Reflection Quality" setting broken, results in environment rendered black
 | ||
| - glsl: regression affecting shader compilation time
 | ||
| - freedreno: glamor issue with x11 desktops
 | ||
| - finish converting from fnv1a to xxhash
 | ||
| - Hang in iris_dri in kitty
 | ||
| - Setting twice value to output_stream in radv_nir_to_llvm.c
 | ||
| - Overwriting value of `jit_tex->sample_stride` in lp_setup.c
 | ||
| - [AMDGPU][OpenGL] apitrace of kernel/firmware crash that requires a reboot
 | ||
| - Flickering in Superposition benchmark
 | ||
| - Double lock in fbobject.c
 | ||
| - Possible typo in aco_insert_waitcnt.cpp
 | ||
| - [bisected] Steam crashes when newest Iris built with LTO
 | ||
| - Freeing null pointer inside radv_amdgpu_cs.c
 | ||
| - Duplicated sub expression in radv_nir_to_llvm.c
 | ||
| - i965/vec4: opt_cse_local cause the out of bound array access
 | ||
| - NIR: Regression on shader using 8/16-bit integers
 | ||
| - ACO: Compiler segfault on 8/16-bit integers.
 | ||
| - lp_bld_intr.c:70:16: error: use of undeclared identifier 'LLVMFixedVectorTypeKind'; did you mean 'LLVMVectorTypeKind'?
 | ||
| - recent seqno changes causing surfaceflinger crash
 | ||
| - [radeonsi] [glthread] Crash with glthread enabled
 | ||
| - Deadlock in anv_timelines_wait()
 | ||
| - [gles3] supertuxkart: some textures are incorrect
 | ||
| - post_version.py does not work with release candidates
 | ||
| - post_version.py does not work with release candidates
 | ||
| - radv regression on android
 | ||
| - ogl: Set mesa_glthread=true as default on the RPCS3 emulator
 | ||
| - [iris] android deqp dEQP-EGL.functional.robustness.negative_context#invalid_notification_strategy_enum fails
 | ||
| - zink: conditional rendering
 | ||
| - [RadeonSI] Glitches on VEGA8 + RX 560X after MR 4863
 | ||
| - RadeonSI OpenGL broken for GFX8 after unify code for overriding offset
 | ||
| - freedreno/turnip: Don't request fragcoord components we don't use
 | ||
| - Make check fails in ANV
 | ||
| - src\util\meson.build:294:4: ERROR: Program or command 'winepath' not found or not executable
 | ||
| - Please add Zink to features.txt
 | ||
| - llvmpipe: assert triggers in LLVM
 | ||
| - debug builds are massively broken on Windows
 | ||
| - ci: Report flakes on IRC from baremetal tests
 | ||
| - heavy glitches on amd ryzen 5 since version 20.x
 | ||
| - zink asserts with 32-bit boolean
 | ||
| - OpenGL: Surviving Mars black screen late-game (possible shader problem)
 | ||
| - Kerbal Space Program (KSP) hangs entire Navi system
 | ||
| - Dirt: Showdown bad performance and broken rendering with enabled advanced lightning
 | ||
| - gravit & Firefox WebGL broken since 3dc2ccc14c0e035368fea6ae3cce8c481f3c4ad2 "ac/surface: replace RADEON_SURF_OPTIMIZE_FOR_SPACE with !FORCE_SWIZZLE_MODE"
 | ||
| - mesa 20.0.5 causing kitty to crash
 | ||
| - radeonsi: "Torchlight II" trace showing regression on mesa-20.0.6 [bisected]
 | ||
| - [RADV/LLVM/ACO/Regression] After mesa commit a3dc7fffbb7be0f1b2ac478b16d3acc5662dff66 all games stucks at start
 | ||
| - Android building error after commit 2ab45f41
 | ||
| - freedreno/a6xx: pubg rendering glitches
 | ||
| - iris: Crash when trying to capture window in OBS Studio
 | ||
| - lp_test_format failure with llvm-11
 | ||
| 
 | ||
| 
 | ||
| Changes
 | ||
| -------
 | ||
| 
 | ||
| Abhishek Kumar (1):
 | ||
| 
 | ||
| - egl: Limit the EGL ver for android
 | ||
| 
 | ||
| Adam Jackson (1):
 | ||
| 
 | ||
| - glx: Fix build and warnings with -Dglx=dri -Dglx-direct=false
 | ||
| 
 | ||
| Alejandro Piñeiro (9):
 | ||
| 
 | ||
| - v3d/tex: only look up the 2nd texture gather offset for 1d non-arrays
 | ||
| - v3d/tex: set up default values for Configuration Parameter 1 if possible
 | ||
| - v3d/tex: use TMUSLOD register if possible
 | ||
| - v3d: moving v3d simulator to src/broadcom
 | ||
| - v3d/tex: handle correctly coordinates for cube/cubearrays images
 | ||
| - vulkan/util: add struct vk_pipeline_cache_header
 | ||
| - nir/lower_tex: handle query lod with nir_lower_tex_packing_16 at lower_tex_packing
 | ||
| - v3d/packet: fix typo on Set InstanceID/PrimitiveID packet
 | ||
| - v3d: set instance id to 0 at start of tile
 | ||
| 
 | ||
| Alyssa Rosenzweig (475):
 | ||
| 
 | ||
| - pan/mdg: Track more types
 | ||
| - pan/mdg: Be a bit more pedantic in invert passes
 | ||
| - panfrost: Enumify bifrost blend types
 | ||
| - pan/bi: Add texture indices to IR
 | ||
| - pan/bi: Pipe multiple textures through
 | ||
| - pan/bi: Pack round opcodes (FMA, either 16 or 32)
 | ||
| - pan/bit: Add framework forinterpreting double vs float
 | ||
| - pan/bit: Interpret ROUND
 | ||
| - pan/bit: Add round tests
 | ||
| - panfrost: Fix texture field size
 | ||
| - panfrost: Fix size of bifrost sampler descriptor
 | ||
| - panfrost: Fix sampler wrap/filter field orders
 | ||
| - panfrost: Fix norm coords on bifrost sampler
 | ||
| - panfrost: Fix tiled texture "stride"s on Bifrost
 | ||
| - pan/decode: Don't crash on missing payload
 | ||
| - pan/bi: Enable lower_mediump_outputs NIR pass
 | ||
| - panfrost: Update Bifrost fields in mali_shader_meta
 | ||
| - pan/bi: Lower for now sincos
 | ||
| - pan/mdg: Ingest actual isub ops
 | ||
| - pan/mdg: Rename .one to .sat_signed
 | ||
| - pan/mdg: Move constant switch opts to algebraic pass
 | ||
| - pan/mdg: Drop forever todo
 | ||
| - pan/mdg: Drop `opt` in name of midgard_opt_cull_dead_branch
 | ||
| - pan/mdg: Enable nir_opt_algebraic_distribute_src_mods
 | ||
| - panfrost: Update dEQP expectation list
 | ||
| - panfrost: Setup gl_FragCoord as sysval on Bifrost
 | ||
| - pan/bi: Add clause type for gl_FragCoord.zw load
 | ||
| - pan/bi: Abort on unknown op packing
 | ||
| - pan/bi: Abort on unhandled intrinsics
 | ||
| - pan/bi: Futureproof COMBINE lowering against non-u32
 | ||
| - pan/bi: Print bad instruction on src packing fail
 | ||
| - pan/bi: Passthrough direct ld_var addresses
 | ||
| - pan/bi: Lower gl_FragCoord
 | ||
| - pan/bi: Set clause type for gl_FragCoord.z
 | ||
| - pan/bi: Fix double-abs flipping
 | ||
| - pan/bi: Fix missing swizzle
 | ||
| - pan/bi: Fix incorrectly flipped swizzle
 | ||
| - pan/bi: Disable CSEL4 emit for now
 | ||
| - pan/bi: Fix DISCARD ops in disasm
 | ||
| - pan/bi: Structify DISCARD
 | ||
| - pan/bi: Remove BI_GENERIC
 | ||
| - pan/bi: Unwrap BRANCH into CONDITIONAL class
 | ||
| - pan/bi: Handle discard_if in NIR->BIR naively
 | ||
| - pan/bi: Emit discard (not if)
 | ||
| - pan/bi: Add float-only mode to condition fusing
 | ||
| - pan/bi: Fuse conditions into discard_if
 | ||
| - pan/bi: Handle discard/branch in get_component_count
 | ||
| - pan/bi: Pack ADD.DISCARD
 | ||
| - pan/bi: Structify ADD ICMP 16
 | ||
| - pan/bi: Pack ADD ICMP 32
 | ||
| - pan/bi: Pack ADD ICMP 16
 | ||
| - pan/bi: Don't pack ICMP on FMA
 | ||
| - pan/bit: Add swizzles to round tests
 | ||
| - pan/bit: Add more 16-bit fmod tests
 | ||
| - pan/bit: Add ICMP tests
 | ||
| - pan/bi: Rename BI_ISUB to BI_IMATH
 | ||
| - pan/bi: Use IMATH for nir_op_iadd
 | ||
| - pan/bi: Pack FMA IADD/ISUB 32
 | ||
| - pan/bi: Pack ADD IADD/ISUB for 8/16/32
 | ||
| - pan/bi: Add SUB.v2i16/SUB.v4i8 opcodes to disasm
 | ||
| - pan/bi: Don't schedule <32-bit IMATH to FMA
 | ||
| - pan/bit: Interpret IMATH
 | ||
| - pan/bit: Interpret v4i8 ops
 | ||
| - pan/bit: Remove test names
 | ||
| - pan/bit: Use swizzle helper for round
 | ||
| - pan/bit: Factor out identity swizzle helper
 | ||
| - pan/bit: Add IMATH packing tests
 | ||
| - pan/decode: Fix flags_hi printing
 | ||
| - pan/mdg: Explain helper invocations dataflow theory
 | ||
| - pan/mdg: Analyze helper invocation termination
 | ||
| - pan/mdg: Analyze helper execution requirements
 | ||
| - pan/mdg: Use the helper invo analyze passes
 | ||
| - pan/mdg: Use analysis to set .cont/.last flags
 | ||
| - pan/mdg: Remove texture_op_count
 | ||
| - pan/mdg: Set types for derivatives
 | ||
| - pan/mdg: Fix derivative swizzle
 | ||
| - panfrost: Run dEQP-GLES3.functional.shaders.derivate.* on CI
 | ||
| - pan/decode: Use a page table for tracking mmaps
 | ||
| - pan/decode: Fix min/max_tile_coord mixup
 | ||
| - pan/mfbd: Add format codes for PIPE_FORMAT_B5G5R5A1_UNORM
 | ||
| - panfrost: Switch formats to table
 | ||
| - panfrost: Fix Z24 vs Z32 mixup
 | ||
| - panfrost: Enable AFBC for Z24X8
 | ||
| - nir: Add fsat_signed opcode
 | ||
| - nir: Add fclamp_pos opcode
 | ||
| - panfrost: Add modifier detection helpers
 | ||
| - pan/mdg: Remove .pos propagation pass
 | ||
| - pan/mdg: Drop nir_lower_to_source_mods
 | ||
| - pan/mdg: Prepare for modifier helpers
 | ||
| - pan/mdg: Ingest fsat_signed/fclamp_pos
 | ||
| - pan/mdg: Apply abs/neg modifiers
 | ||
| - pan/mdg: Treat inot as a modifier
 | ||
| - pan/mdg: Remove invert optimizations
 | ||
| - pan/mdg: Use helpers for branch/discard inversion
 | ||
| - pan/mdg: Apply outmods
 | ||
| - pan/mdg: Emit fcsel when beneficial
 | ||
| - pan/mdg: Optimize pipelining logic
 | ||
| - pan/mdg: Precompute mir_special_index
 | ||
| - pan/mdg: Optimize liveness computation in DCE
 | ||
| - pan/mdg: Handle comparisons in fp16 path
 | ||
| - pan/mdg: Fix constant combining crash
 | ||
| - pan/mdg: Remove mir_*size routines
 | ||
| - pan/mdg: Remove mir_get_alu_src
 | ||
| - pan/mdg: Include more types
 | ||
| - pan/mdg: Handle dest up/lower correctly with swizzles
 | ||
| - pan/mdg: Respect !32-bit sizes in RA
 | ||
| - pan/mdg: Explain ld/st sign/zero extension
 | ||
| - pan/mdg: Add abs/neg/shift modifiers to IR
 | ||
| - pan/mdg: Use src_types to determine size in scheduling
 | ||
| - pan/mdg: Use type to determine triviality of a move
 | ||
| - pan/mdg: Identify scalar integer mods
 | ||
| - pan/mdg: Promote imov to fmov on a NIR level
 | ||
| - pan/mdg: Remove promote_float pass
 | ||
| - pan/mdg: Defer modifier packing until emit time
 | ||
| - pan/mdg: Remove redundant redundancy
 | ||
| - pan/mdg: Streamline dest_override handling
 | ||
| - pan/mdg: Implement b2f16
 | ||
| - pan/mdg: Don't generate conversions for fp16 LUTs
 | ||
| - pan/mdg: Ignore dest.type when offseting load swizzle
 | ||
| - pan/lcra: Remove unused alignment parameters
 | ||
| - pan/lcra: Allow per-variable bounds to be set
 | ||
| - pan/mdg: Use type size to determine alignment
 | ||
| - pan/mdg: Eliminate load_64
 | ||
| - pan/mdg: Set RA bounds for fp16
 | ||
| - pan/mdg: Print mask when dest=0
 | ||
| - pan/mdg: Round up bytemasks when spilling
 | ||
| - pan/mdg: Print constant vectors less wrong
 | ||
| - pan/mdg: Factor out mir_adjust_constant
 | ||
| - pan/mdg: Only combine 16-bit constants to lower half
 | ||
| - pan/mdg: Separately pack constants to the upper half
 | ||
| - pan/mdg: Fix type checking issues with compute
 | ||
| - pan/mdg: Pack barriers correctly
 | ||
| - pan/mdg: Use shifts instead of division for RA sizes
 | ||
| - pan/mdg: Implement vector constant printing for 8-bit
 | ||
| - pan/mdg: Implement condense_writemask for 8-bit
 | ||
| - pan/mdg: Pack 8-bit swizzles in 16-bit ops
 | ||
| - panfrost: Guard experimental fp16 behind debug flag
 | ||
| - panfrost: Keep cached BOs mmap'd
 | ||
| - panfrost: Remove deadcode
 | ||
| - panfrost: Fill in SCALED formats to format table
 | ||
| - panfrost: Don't set PIPE_CAP_VERTEX_BUFFER_STRIDE_4BYTE_ALIGNED_ONLY
 | ||
| - panfrost: Don't zero staging buffer for tiling
 | ||
| - panfrost: Allow bpp24 tiling
 | ||
| - panfrost: Allow tiling on RECT textures
 | ||
| - panfrost: Limit blend shader work count
 | ||
| - panfrost: Remove dated comment about leaks
 | ||
| - panfrost: Disable tib read/write when colourmask = 0x0
 | ||
| - panfrost: Avoid redundant shader executions with mask=0x0
 | ||
| - panfrost: Don't set CAN_DISCARD for MFBD
 | ||
| - panfrost: Fix transform feedback types
 | ||
| - pan/mdg: Cleanup comments that look like division
 | ||
| - pan/mdg: Eliminate expand_writemask division
 | ||
| - pan/mdg: Eliminate 64-bit swizzle packing division
 | ||
| - pan/mdg: Avoid division in printing helpers
 | ||
| - pan/mdg: Eliminate remaining divisions from compiler
 | ||
| - panfrost: Fix dated comment
 | ||
| - panfrost: Use _mesa_roundevenf when packing clear colours
 | ||
| - panfrost: Handle !independent_blend for blend shaders
 | ||
| - pan/mdg: Add pack_colour_32 opcode
 | ||
| - pan/mdg: Lower shifts to 32-bit
 | ||
| - pan/mdg: Ensure we don't DCE into impossible masks
 | ||
| - pan/mdg: Allow DCE on ld_color_buffer masks
 | ||
| - panfrost: Add debug print before query flushes
 | ||
| - panfrost: Only run batch debug when specifically asked
 | ||
| - nir: Add un/pack_32_4x8 opcodes
 | ||
| - util: Add SATURATE macro
 | ||
| - util/format: Use SATURATE
 | ||
| - mesa: Use SATURATE
 | ||
| - mesa/swrast: Use SATURATE
 | ||
| - gallium/draw: Use SATURATE
 | ||
| - glsl: Use SATURATE
 | ||
| - panfrost: Use SATURATE
 | ||
| - softpipe: Use SATURATE
 | ||
| - intel: Use SATURATE
 | ||
| - i965: Use SATURATE
 | ||
| - iris: Use SATURATE
 | ||
| - etnaviv: Use SATURATE
 | ||
| - nouveau: Use SATURATE
 | ||
| - pan/decode: Fix unused variable warning
 | ||
| - pan/decode: Fix tiler warning
 | ||
| - pan/decode: Dump missing field on Bifrost
 | ||
| - pan/decode: Dump unknown2
 | ||
| - panfrost: Fix Bifrost blending with depth-only FBO
 | ||
| - panfrost: Adjust null_rt for Bifrost
 | ||
| - panfrost: Tweak zsbuf magic numbers for Bifrost
 | ||
| - panfrost: Tweak Bifrost colour buffer magic
 | ||
| - panfrost: Force Z/S tiling on Bifrost
 | ||
| - panfrost: Share MRT blend flag calculation with Bifrost
 | ||
| - panfrost: Set unk2 to accomodate blending
 | ||
| - panfrost: Identify Bifrost texture format swizzle
 | ||
| - panfrost: Ensure nonlinear strides are 16-aligned
 | ||
| - panfrost: Document Midgard Inf/NaN suppress bit
 | ||
| - panfrost: Add defines for bifrost unk1 flags
 | ||
| - panfrost: Identify MALI_BIFROST_EARLY_Z flag
 | ||
| - panfrost: Set MALI_BIFROST_EARLY_Z as necessary
 | ||
| - pan/decode: Decode Bifrost shader flags
 | ||
| - pan/bi: Add TEX.vtx opcode for vertex texturing
 | ||
| - pan/bi: Also add compact vertex texturing
 | ||
| - pan/bi: Document compute_lod bit for compact tex
 | ||
| - pan/bi: Allow vertex txl with lod=0 as compact
 | ||
| - pan/bi: Add f16 TEXC.vtx op
 | ||
| - pan/bi: Pack compact vertex texturing
 | ||
| - pan/bi: Add CSEL.16 packing tests
 | ||
| - pan/bi: Suppress inf/nan for now
 | ||
| - panfrost: Don't generate gl_FragCoord varying on Bifrost
 | ||
| - panfrost: Set reads_frag_coord as a sysval
 | ||
| - panfrost: Preload gl_FragCoord on Bifrost
 | ||
| - pan/bi: Remove FMA? parameter from get_src
 | ||
| - pan/bi: Remove comment about old scheduler design
 | ||
| - pan/bi: Move bi_registers to common IR structures
 | ||
| - pan/bi: Move bi_registers to bi_bundle
 | ||
| - pan/bi: Drop `struct` from bi_registers
 | ||
| - pan/bi: Add FILE* argument to bi_print_registers
 | ||
| - pan/bi: Move bi_flip_ports out of port assignment
 | ||
| - pan/bi: Document constant count invariant
 | ||
| - pan/bi: Disassemble pos=0xe
 | ||
| - pan/bi: Add MUL.i32 to disasm
 | ||
| - pan/bi: Remove more artefacts of 2-pass scheduling
 | ||
| - pan/bi: Add bi_layout.c for clause layout helpers
 | ||
| - pan/bi: Add helper to measure clause size
 | ||
| - pan/bi: Remove schedule_barrier
 | ||
| - pan/bi: Allow printing branches without targets
 | ||
| - pan/bi: Fix emit_if successor assignment
 | ||
| - pan/bi: Only rewrite COMBINE dest if not SSA
 | ||
| - pan/bi: Fix CONVERT component counting
 | ||
| - pan/bi: Fix branch condition typesize
 | ||
| - pan/bi: Passthrough ZERO in branch packing
 | ||
| - pan/bi: Add branch constant field to IR
 | ||
| - pan/bi: Pack branch offset constants
 | ||
| - pan/bi: Set branch_constant if there is a branch
 | ||
| - pan/bi: Assign constant port for branch offsets
 | ||
| - pan/bi: Preliminary branch packing
 | ||
| - pan/bi: Link clauses back to their blocks
 | ||
| - pan/bi: Add bi_foreach_clause_in_block_from{_rev} helpers
 | ||
| - pan/bi: Measure distance between blocks
 | ||
| - pan/bi: Pack proper clause offsets
 | ||
| - pan/bi: Set branch_conditional if b2b is set
 | ||
| - pan/bi: Set back-to-back bit more accurately
 | ||
| - pan/bi: Set branch conditional bit
 | ||
| - pan/bi: Pack unconditional branch
 | ||
| - pan/bi: Defer block naming until after emit
 | ||
| - pan/bi: Add bi_foreach_block_from_rev helper
 | ||
| - pan/bi: Measure backwards branches as well
 | ||
| - pan/bi: Allow two successors in header packing
 | ||
| - pan/bi: Passthrough deps of the branch target
 | ||
| - panfrost: Disable QUAD_STRIP/POLYGON on Bifrost
 | ||
| - panfrost: Add GPU IDs for G31/G52
 | ||
| - panfrost: Probe G31/G52 if PAN_MESA_DEBUG=bifrost
 | ||
| - pan/mdg: Handle un/pack opcodes as moves
 | ||
| - pan/mdg: Add pack_unorm_4x8 via 8-bit
 | ||
| - pan/mdg: Treat packs "specially"
 | ||
| - pan/mdg: Handle bitsize for packs
 | ||
| - pan/mdg: Print 8-bit constants
 | ||
| - pan/mdg: Drop the u8 from the colorbuf op names
 | ||
| - pan/mdg: Implement raw colourbuf loads on T720
 | ||
| - panfrost: Add theory for new framebuffer lowering
 | ||
| - panfrost: Determine unpacked type for formats
 | ||
| - panfrost: Add quirks for blend shader types
 | ||
| - panfrost: Determine load classes for formats
 | ||
| - panfrost: Determine classes for stores
 | ||
| - panfrost: Stub out lowering boilerplate
 | ||
| - panfrost: Un/pack pure 32-bit
 | ||
| - panfrost: Un/pack pure 16-bit
 | ||
| - panfrost: Un/pack pure 8-bit
 | ||
| - panfrost: Un/pack 8-bit UNORM
 | ||
| - panfrost: Flesh out dispatch
 | ||
| - panfrost: Un/pack UNORM 4
 | ||
| - panfrost: Un/pack RGB565 and RGB5A1
 | ||
| - panfrost: Un/pack RGB10_A2_UNORM
 | ||
| - panfrost: Un/pack RGB10_A2_UINT
 | ||
| - panfrost: Un/pack R11G11B10
 | ||
| - panfrost: Un/pack sRGB via NIR
 | ||
| - panfrost: Switch to pan_lower_framebuffer
 | ||
| - panfrost: Conditionally allow fp16 blending
 | ||
| - panfrost: Account for differing types in blend lower
 | ||
| - panfrost: Let Gallium pack colours
 | ||
| - panfrost: Check for large tilebuffer requirements
 | ||
| - panfrost: Add separate_stencil BO to batch
 | ||
| - panfrost: Use internal_format throughout
 | ||
| - panfrost: Update fails list
 | ||
| - pan/mdg: Handle 16-bit ld_vary
 | ||
| - pan/mdg: Fuse f2f16 into load_interpolated_input
 | ||
| - panfrost: Fix PRESENT flag mix-up
 | ||
| - panfrost: Permit AFBC of RGB8
 | ||
| - panfrost: Use VTX tag for vertex texturing
 | ||
| - panfrost: Don't flush explicitly when mipmapping
 | ||
| - panfrost: Remove unused nir_lower_framebuffer pass
 | ||
| - pan/mdg: Disassemble out-of-order bits
 | ||
| - pan/mdg: Add quirk for missing out-of-order support
 | ||
| - pan/mdg: Enable out-of-order execution after texture ops
 | ||
| - nir: Fold f2f16(b2f32(x)) to b2f16(x)
 | ||
| - pan/mdg: Don't double-replicate blend on T720
 | ||
| - pan/mdg: Distinguish blend shaders in internal shader-db
 | ||
| - pan/mdg: Add roundmode enum
 | ||
| - pan/mdg: Add opcode roundmode property
 | ||
| - pan/mdg: Lower roundmodes
 | ||
| - pan/mdg: Implement \*_rtz conversions with roundmode
 | ||
| - pan/mdg: Fold roundmode into applicable instructions
 | ||
| - pan/mdg: Handle f2u8
 | ||
| - pan/mdg: Allow f2u8 and friends thru
 | ||
| - pan/mdg: Handle regular nir_intrinsic_load_output
 | ||
| - panfrost: Passthrough NATIVE loads/stores
 | ||
| - pan/bi: Handle SEL with vec3 16-bit
 | ||
| - pan/bi: Fix SEL.16 swizzle
 | ||
| - pan/bi: Pack second argument of F32_TO_F16
 | ||
| - pan/bi: Passthrough second argument of F32_TO_F16
 | ||
| - pan/bi: Handle vectorized load_const
 | ||
| - panfrost: Update MALI_EARLY_Z description
 | ||
| - panfrost: Document MALI_WRITES_GLOBAL bit
 | ||
| - panfrost: Handle writes_memory correctly
 | ||
| - panfrost: Readd MIDGARD_SHADERLESS quirk to t760
 | ||
| - panfrost: Explicitly convert to 32-bit for logic-ops
 | ||
| - pan/bi: Disassemble gl_PointCoord reads.
 | ||
| - panfrost: Prefer sysval for gl_PointCoord on Bifrost
 | ||
| - panfrost: Fix gl_PointSize out of GL_POINTS
 | ||
| - panfrost: Mark point sprites as todo on Bifrost
 | ||
| - pan/mdg: Legalize inverts with constants
 | ||
| - pan/mdg: Ensure ld_vary_16 is aligned
 | ||
| - panfrost: Ensure we have ro before using it
 | ||
| - nir: Remove nir_intrinsic_output_u8_as_fp16_pan
 | ||
| - pan/mdg: Avoid fusing ld_vary_16 with non-zero component
 | ||
| - panfrost: Calculate varying size by format
 | ||
| - panfrost: Add panfrost_streamout_offset helper
 | ||
| - panfrost: Introduce bitfields for tracking varyings
 | ||
| - panfrost: Determine varying buffer presence
 | ||
| - panfrost: Emit unlinked varyings
 | ||
| - panfrost: Emit special varyings
 | ||
| - panfrost: Emit xfb records
 | ||
| - panfrost: Add helper to determine if we are capturing
 | ||
| - panfrost: Add high-level varying emit
 | ||
| - panfrost: Use new varying linking
 | ||
| - panfrost: Remove unused routines
 | ||
| - panfrost: Allow R/RG/RGB varyings
 | ||
| - panfrost: Only store varying formats
 | ||
| - panfrost: Use shader_info harder
 | ||
| - panfrost: Override varying format to minimal precision
 | ||
| - panfrost: Demote mediump varyings to fp16
 | ||
| - pan/mdg: Explicitly type 64-bit uniform moves
 | ||
| - pan/mdg: Analyze types for 64-bitness in RA
 | ||
| - pan/mdg: Prefer type over regmode for schedule constraints
 | ||
| - pan/mdg: Precolour blend inputs
 | ||
| - panfrost: Merge bifrost_bo/midgard_bo
 | ||
| - panfrost: Update sampler view in Bifrost path
 | ||
| - panfrost: Fix level_2
 | ||
| - panfrost: Correctly calculate tiled stride
 | ||
| - panfrost: Enable AFBC for RGB565
 | ||
| - panfrost: Simplify AFBC format check
 | ||
| - pan/mdg: Factor out unit check
 | ||
| - pan/mdg: Allow scheduling "x + x" to multipliers
 | ||
| - pan/mdg: Canonicalize (x * 2.0) to (x + x)
 | ||
| - pan/mdg: Reassociate adds for multiply-by-two
 | ||
| - nir: Propagate \*2*16 conversions into vectors
 | ||
| - panfrost: Specify stack_shift on SFBD
 | ||
| - pan/mdg: Defer nir_fuse_io_16 until after opts
 | ||
| - pan/mdg: Don't assign destination in writeout block to r1
 | ||
| - pan/mdg: Remove bundle interference code
 | ||
| - pan/mdg: Schedule writeout to VLUT
 | ||
| - pan/mdg: Defer smul, vlut until after writeout moves
 | ||
| - pan/mdg: Allow Z/S writes to use any 2nd stage unit
 | ||
| - pan/mdg: Prioritize non-moves on VADD/VLUT
 | ||
| - pan/mdg: Skip r1.w write where possible
 | ||
| - pan/mdg: Schedule based on liveness
 | ||
| - pan/mdg: Respect type/mask in mir_lower_special_reads
 | ||
| - pan/mdg: Fix indirect UBO swizzles
 | ||
| - pan/decode: Fix MSAA texture decoding
 | ||
| - pan/decode: Identify layered MSAA flag
 | ||
| - pan/mdg: Allow ignoring move mode
 | ||
| - pan/mdg: Handle GLSL_SAMPLER_DIM_MS
 | ||
| - pan/mdg: Handle nir_tex_src_ms_index
 | ||
| - pan/mdg: Handle nir_texop_txf_ms
 | ||
| - pan/mdg: Use _VTX tag for texelFetch in frag shaders
 | ||
| - panfrost: Set depth to sample_count for MSAA 2D
 | ||
| - panfrost: Identify layer_stride
 | ||
| - panfrost: Allocate space for multisampling
 | ||
| - panfrost: Index texture by sample
 | ||
| - panfrost: Include pointer for each sample
 | ||
| - panfrost: Set layer_stride for multisampled rendering
 | ||
| - panfrost: Don't advertise MSAA 2x
 | ||
| - panfrost: Identify coverage_mask
 | ||
| - panfrost: Pass sample_mask to the hardware
 | ||
| - panfrost: Implement alpha-to-coverage
 | ||
| - panfrost: Identify depth/stencil layer strides
 | ||
| - panfrost: Set depth/stencil_layer_stride accordingly
 | ||
| - panfrost: Enable MSAA if we render to such a surface
 | ||
| - panfrost: Save sample_mask before blitting
 | ||
| - panfrost: Expose MSAA 4x
 | ||
| - glsl: Handle 16-bit types in loop analysis
 | ||
| - docs/features: Track Panfrost
 | ||
| - panfrost: Introduce pan_pool struct
 | ||
| - panfrost: Allocate pool BOs against the pool
 | ||
| - panfrost: Track the device through the pool
 | ||
| - panfrost: Expose pool-based allocation API
 | ||
| - panfrost: Move debug flags into the device
 | ||
| - panfrost: Drop Gallium-local pan_bo_create wrapper
 | ||
| - panfrost: Move pool routines to common code
 | ||
| - panfrost: Factor out scoreboarding state
 | ||
| - panfrost: Pass polygon_list to tiler init function
 | ||
| - panfrost: Drop batch from scoreboard routines
 | ||
| - panfrost: Move scoreboarding routines to common
 | ||
| - panfrost: Handle PIPE_FORMAT_X24S8_UINT
 | ||
| - panfrost: Handle PIPE_FORMAT_S8_UINT
 | ||
| - panfrost: Move panfrost_translate_texture_type
 | ||
| - panfrost: Report blend shader work count
 | ||
| - panfrost: Clamp pure int pixels
 | ||
| - panfrost: Generate shader variants on framebuffer bind
 | ||
| - panfrost: Always use SOFTWARE for pure formats
 | ||
| - panfrost: Extend fetched framebuffer results
 | ||
| - panfrost: Fix fence leak
 | ||
| - panfrost: Fix write to free'd memory
 | ||
| - panfrost: Add a sparse array to map GEM handles to BOs
 | ||
| - panfrost: Index BOs from the BO map sparse array
 | ||
| - panfrost: Merge PAN_BO_IMPORTED/PAN_BO_EXPORTED
 | ||
| - panfrost: Remove PAN_BO_COHERENT_LOCAL
 | ||
| - panfrost: Remove PAN_BO_DONT_REUSE
 | ||
| - panfrost: Remove panfrost_bo_access type
 | ||
| - panfrost: Compact unused BO flag bits
 | ||
| - panfrost: Add format codes for new compressed textures
 | ||
| - panfrost: Pipe in compressed texture feature mask
 | ||
| - panfrost: Filter compressed texture formats
 | ||
| - panfrost: Map PIPE_{DXT, RGTC, BPTC} to MALI_BCn
 | ||
| - docs/features: Update ASTC entries for Panfrost
 | ||
| - pan/mdg: Bump compiler RT maximum
 | ||
| - pan/mdg: Identify per-sample interpolation mode
 | ||
| - pan/mdg: Implement gl_SampleID
 | ||
| - panfrost: Force Z/S writeback
 | ||
| - panfrost: Expose panfrost_get_blend_shader
 | ||
| - panfrost: Add MALI_PER_SAMPLE bit
 | ||
| - panfrost: Include sample count in payload estimates
 | ||
| - panfrost: Identify zs_samples field
 | ||
| - panfrost: Add rectangle subtraction algorithm
 | ||
| - panfrost: Handle per-sample shading
 | ||
| - panfrost: Set zs_samples as necessary
 | ||
| - panfrost: Track surfaces drawn per-batch
 | ||
| - panfrost: Extract panfrost_batch_reserve_framebuffer
 | ||
| - panfrost: Use Midgard-specific reloads
 | ||
| - panfrost: Call util_blitter_save_fragment_constant_buffer_slot
 | ||
| - panfrost: Overhaul tilebuffer allocations
 | ||
| - panfrost: Set PIPE_CAP_MIXED_COLORBUFFER_FORMATS
 | ||
| - panfrost: Fix sRGB clear colour packing
 | ||
| - panfrost: Implement Z32F_S8 blits
 | ||
| - panfrost: Abort on unsupported blit
 | ||
| - panfrost: Avoid integer underflow in rt_count_1
 | ||
| - panfrost: Honour cso->compare_mode
 | ||
| - panfrost: Fix faults with RASTERIZER_DISCARD
 | ||
| - panfrost: Report CAPs more honestly
 | ||
| - panfrost: Enable Chromium
 | ||
| - panfrost: Revert "Disable frame throttling"
 | ||
| - docs/features: Mark trivial missed feature
 | ||
| - panfrost: Enable FP16 by default
 | ||
| - panfrost: Avoid wait=true flushing all batches
 | ||
| - panfrost: Remove wait parameter to flush_all_batches
 | ||
| - panfrost: Skip specifying in_syncs
 | ||
| - panfrost: Allocate syncobjs in panfrost_flush
 | ||
| - panfrost: Remove unused batch_fence->signaled
 | ||
| - panfrost: Remove unused batch_fence->ctx
 | ||
| - pan/bit: Update f32->f16 convert test
 | ||
| - pan/bit: Remove BI_SHIFT stub
 | ||
| - pan/mdg: Mask spills from texture write
 | ||
| - pan/mdg: Test for SSA before chasing addresses
 | ||
| - docs/features: Add GL_EXT_multisampled_render_to_texture
 | ||
| - panfrost: Add MSAA mode selection field
 | ||
| - panfrost: Implement EXT_multisampled_render_to_texture
 | ||
| - panfrost: Set STRIDE_4BYTE_ALIGNED_ONLY
 | ||
| - panfrost: Fix WRITES_GLOBAL bit
 | ||
| - pan/mdg: Ensure barrier op is set on texture
 | ||
| - panfrost: Fix blend leak for render targets 5-8
 | ||
| - panfrost: Free cloned NIR shader
 | ||
| - panfrost: Free NIR of blit shaders
 | ||
| - panfrost: Free hash_to_temp map
 | ||
| - pan/mdg: Free previous liveness
 | ||
| - panfrost: Use memctx for sysvals
 | ||
| - panfrost: Free batch->dependencies
 | ||
| - pan/mdg: Fix discard encoding
 | ||
| - pan/mdg: Fix perspective combination
 | ||
| - pan/bit: Set d3d=true for CMP tests
 | ||
| 
 | ||
| Andreas Baierl (1):
 | ||
| 
 | ||
| - nir/ lower_int_to_float: Handle umax and umin
 | ||
| 
 | ||
| Andres Gomez (10):
 | ||
| 
 | ||
| - .mailmap: add an alias for Iago Toral Quiroga
 | ||
| - .mailmap: add an alias for Andres Gomez
 | ||
| - gitlab-ci: update tracie README after changes in main script
 | ||
| - scripts: remove unittest.mock dependency when not used
 | ||
| - gitlab-ci: create always the "results" directory with tracie
 | ||
| - gitlab-ci: correct tracie behavior with replay errors
 | ||
| - gitlab-ci: build gfxreconstruct from the "dev" branch
 | ||
| - gitlab-ci: get the last frame from a gfxr trace using gfxrecon-info
 | ||
| - gitlab-ci/traces: updated paths and checksums for POLARIS10 traces
 | ||
| - gitlab-ci: Test AMD's Raven with traces
 | ||
| 
 | ||
| Andrey Vostrikov (1):
 | ||
| 
 | ||
| - egl/x11: Free memory allocated for reply structures on error
 | ||
| 
 | ||
| Andrii Simiklit (3):
 | ||
| 
 | ||
| - glsl_type: don't serialize padding bytes from glsl_struct_field
 | ||
| - i965/vec4: Ignore swizzle of VGRF for use by var_range_end()
 | ||
| - glsl: fix crash on glsl macro redefinition
 | ||
| 
 | ||
| Ani (1):
 | ||
| 
 | ||
| - drirc: Enable glthread for rpcs3
 | ||
| 
 | ||
| Anuj Phogat (6):
 | ||
| 
 | ||
| - intel/devinfo: Add is_dg1 to device info
 | ||
| - intel/l3: Add DG1 L3 configuration
 | ||
| - intel/ehl: Use GEN11_URB_MIN_MAX_ENTRIES in device info
 | ||
| - intel/ehl: Use macro GEN11_LP_FEATURES in device info
 | ||
| - intel/ehl: Rename gen_device_info struct
 | ||
| - intel/ehl: Add new PCI-IDs
 | ||
| 
 | ||
| Arcady Goldmints-Orlov (4):
 | ||
| 
 | ||
| - anv: increase minUniformBufferOffsetAlignment to 64
 | ||
| - intel/compiler: fix alignment assert in nir_emit_intrinsic
 | ||
| - nir/spirv/glsl450: increase asin(x) precision
 | ||
| - intel/compiler: Always apply sample mask on Vulkan.
 | ||
| 
 | ||
| Axel Davy (19):
 | ||
| 
 | ||
| - st/nine: Set correctly blend max_rt
 | ||
| - gallium/util: Fix leak in the live shader cache
 | ||
| - ttn: Add new allow_disk_cache parameter
 | ||
| - ttn: Implement disk cache
 | ||
| - st/nine: Enable ttn cache
 | ||
| - radeonsi: Enable tgsi to nir disk cache
 | ||
| - st/nine: Add checks for pure device
 | ||
| - st/nine: Return error when setting invalid depth buffer
 | ||
| - st/nine: Do not return invalidcall on getrenderstate
 | ||
| - st/nine: Pass more adapter formats for CheckDepthStencilMatch
 | ||
| - st/nine: Improve return error code in CheckDeviceFormat
 | ||
| - st/nine: Fix uninitialized variable in BEM()
 | ||
| - st/nine: Fix a crash if the state is not initialized
 | ||
| - st/nine: Add missing NULL checks
 | ||
| - st/nine: Increase available GPU memory
 | ||
| - st/nine: Retry allocations after freeing some space
 | ||
| - st/nine: Improve pDestRect handling
 | ||
| - st/nine: Ignore pDirtyRegion
 | ||
| - st/nine: Handle full pSourceRect better
 | ||
| 
 | ||
| Bas Nieuwenhuizen (80):
 | ||
| 
 | ||
| - radv: Fix implicit sync with recent allocation changes.
 | ||
| - radv: Extend tiling flags to 64-bit.
 | ||
| - radv: Provide a better error for permission issues with priorities.
 | ||
| - radv: Support VK_PIPELINE_COMPILE_REQUIRED_EXT.
 | ||
| - radv: Support VK_PIPELINE_CREATE_EARLY_RETURN_ON_FAILURE_BIT_EXT.
 | ||
| - radv: Support VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT.
 | ||
| - radv: Expose VK_EXT_pipeline_creation_cache_control.
 | ||
| - radv/winsys: Finish mapping for sparse residency.
 | ||
| - radv/winsys:  Remove extra sizeof multiply.
 | ||
| - radv: Handle failing to create .cache dir.
 | ||
| - radv: Remove dead code.
 | ||
| - radv: Do not close fd -1 when NULL-winsys creation fails.
 | ||
| - radv: Implement vkGetSwapchainGrallocUsage2ANDROID.
 | ||
| - frontend/dri: Implement mapping individual planes.
 | ||
| - util/format: Add VK_FORMAT_D16_UNORM_S8_UINT.
 | ||
| - util/format: Use correct pipe format for VK_FORMAT_G8_B8_R8_3PLANE_420_UNORM.
 | ||
| - util/format: Add more multi-planar formats.
 | ||
| - gallium/dri: Remove lowered_yuv tracking for plane mapping.
 | ||
| - radeonsi: Explicitly map Z16_UNORM_S8_UINT to None for GFX10.
 | ||
| - amd/common,radeonsi: Move gfx10_format_table to common.
 | ||
| - radeonsi: Define gfx10_format in the common header.
 | ||
| - radv: Include gfx10_format_table.h only from a single source file.
 | ||
| - radv: Use common gfx10_format_table.h
 | ||
| - radv: Use ac_surface to determine fmask enable.
 | ||
| - radv: Pass no_metadata_planes info in to ac_surface.
 | ||
| - radv: Enforce the contiguous memory for DCC layers in ac_surface.
 | ||
| - radv: Rely on ac_surface for avoiding cmask for linear images.
 | ||
| - radv: Use offsets in surface struct.
 | ||
| - radv: Disable DCC in ac_surface.
 | ||
| - radv: Disable HTILE in ac_surface.
 | ||
| - radv: Allocate values/predicates at the end of the image.
 | ||
| - amd/common: Add total alignment calculation.
 | ||
| - radv: Use ac_surface to allocate aux surfaces.
 | ||
| - vulkan/wsi/x11: Ensure we create at least minImageCount images.
 | ||
| - radv/winsys: Deal with realloc failures in BO lists.
 | ||
| - radv: Handle mmap failures.
 | ||
| - radv/winsys: Distinguish device/host memory errors.
 | ||
| - radv: Make radv_alloc_shader_memory static.
 | ||
| - turnip: semaphore support.
 | ||
| - meson: Do not require shader cache for radv.
 | ||
| - amd/addrlib: fix another C++ one definition rule violation
 | ||
| - radv: Set handle types in Android semaphore/fence import.
 | ||
| - radv: Always enable PERFECT_ZPASS_COUNTS.
 | ||
| - Revert "radv: add support for MRTs compaction to avoid holes"
 | ||
| - radv: Use correct semaphore handle type for Android import.
 | ||
| - amd/llvm: Mark pointer function arguments as 32-byte aligned.
 | ||
| - amd/common: Cache intra-tile addresses for retile map.
 | ||
| - amd/addrlib: Clean up unused colorFlags argument
 | ||
| - amd/registers: add RLC_PERFMON_CLK_CNTL for pre-GFX10
 | ||
| - radeonsi: Inhibit clock-gating for perf counters.
 | ||
| - meson: Add mising git_sha1.h dependency.
 | ||
| - amd: Add detection of timeline semaphore support.
 | ||
| - radv/winsys: Add binary syncobj ABI changes for timeline semaphores.
 | ||
| - radv: Add thread for timeline syncobj submission.
 | ||
| - radv: Add winsys support for submitting timeline syncobj.
 | ||
| - radv: Add winsys functions for timeline syncobj.
 | ||
| - radv: Add timeline syncobj for timeline semaphores.
 | ||
| - radv: Fix uninitialized variable in renderpass.
 | ||
| - vulkan/wsi/x11: report device-group present rectangles with prime.
 | ||
| - vulkan/wsi: Convert usage of -1 to UINT32_MAX.
 | ||
| - radv: Fix host->host signalling with legacy timeline semaphores.
 | ||
| - mesa/st: Actually free the driver part of memory objects on destruction.
 | ||
| - radv: Don't use both DCC and CMASK for single sample images.
 | ||
| - radv: Fix assert that is too strict.
 | ||
| - radv: Do not consider layouts fast-clearable on compute queue.
 | ||
| - radv: When importing an image, redo the layout based on the metadata.
 | ||
| - radv: Use getter instead of setter to extract value.
 | ||
| - driconf: Support selection by Vulkan applicationName.
 | ||
| - radv: Override the uniform buffer offset alignment for World War Z.
 | ||
| - radv: Fix handling of attribs 16-31.
 | ||
| - radv: Remove conformance warnings with ACO.
 | ||
| - radv: Update CTS version.
 | ||
| - radv: Fix 3d blits.
 | ||
| - radv: Fix threading issue with submission refcounts.
 | ||
| - radv: Avoid deadlock on bo_list.
 | ||
| - spirv: Deal with glslang not setting NonUniform on constructors.
 | ||
| - radeonsi: Work around Wasteland 2 bug.
 | ||
| - spirv: Deal with glslang bug not setting the decoration for stores.
 | ||
| - ac/surface: Fix depth import on GFX6-GFX8.
 | ||
| - st/mesa: Deal with empty textures/buffers in semaphore wait/signal.
 | ||
| 
 | ||
| Ben Skeggs (38):
 | ||
| 
 | ||
| - nir: use bitfield_insert instead of bfi in nir_lower_double_ops
 | ||
| - nvir: bump max encoding size of instructions
 | ||
| - nvir: introduce OP_LOP3_LUT
 | ||
| - nvir: introduce OP_WARPSYNC
 | ||
| - nvir: introduce OP_BREV with lowering to EXTBF_REV for current GPUs
 | ||
| - nvir: introduce OP_SHF
 | ||
| - nvir: introduce OP_BMSK
 | ||
| - nvir: introduce OP_SGXT
 | ||
| - nvir: introduce OP_FINAL
 | ||
| - nvir: add constant folding for OP_PERMT
 | ||
| - nvir: run replaceZero() before replaceCvt()
 | ||
| - nvir/nir: fix fragment program output when using MRT
 | ||
| - nvir/nir: move nir options to codegen
 | ||
| - nvir/nir: flesh out options
 | ||
| - nvir/nir: turn on lower_rotate
 | ||
| - nvir/nir: implement nir_op_extract_u8
 | ||
| - nvir/nir: implement nir_op_extract_i8
 | ||
| - nvir/nir: implement nir_op_extract_u16
 | ||
| - nvir/nir: implement nir_op_extract_i16
 | ||
| - nvir/nir: implement nir_op_urol
 | ||
| - nvir/nir: implement nir_op_uror
 | ||
| - nvir/nir: nir expects the shift amount to wrap, rather than clamp
 | ||
| - nvir/nir: use nir_lower_idiv
 | ||
| - nvir/gm107: implement OP_PERMT
 | ||
| - nvir/gm107: replace SHR+AND+AND with PRMT+PRMT in PFETCH lowering
 | ||
| - nvir/gm107: separate out header for sched data calculator
 | ||
| - nvir/nir/gm107: split nir shader compiler options from gf100
 | ||
| - nvir/nir/gm107: turn on nir_lower_extract64
 | ||
| - nvir/nir/gm107: switch off lower_extract_byte
 | ||
| - nvir/nir/gm107: switch off lower_extract_word
 | ||
| - nvir/gv100: initial support
 | ||
| - nvir/gv100: enable support for tu1xx
 | ||
| - nvc0: use NVIDIA headers for GK104->GM2xx compute QMD
 | ||
| - nvc0: use NVIDIA headers for GP100- compute QMD
 | ||
| - nvc0: move setting of entrypoint for a shader stage to a function
 | ||
| - nvc0: remove hardcoded blitter vertprog
 | ||
| - nvc0: initial support for gv100
 | ||
| - nvc0: initial support for tu1xx
 | ||
| 
 | ||
| Benjamin Cheng (1):
 | ||
| 
 | ||
| - drirc: Add picom to adaptive_sync exclusion list
 | ||
| 
 | ||
| Benjamin Tissoires (3):
 | ||
| 
 | ||
| - CI: reduce bandwidth for git pull
 | ||
| - gitlab-ci: update ci-fairy minio to latest upstream
 | ||
| - gitlab-ci: do not run full CI on scheduled pipelines
 | ||
| 
 | ||
| Blaž Tomažič (1):
 | ||
| 
 | ||
| - radeonsi: Fix omitted flush when moving suballocated texture
 | ||
| 
 | ||
| Boris Brezillon (14):
 | ||
| 
 | ||
| - spirv: Split the vtn_emit_scoped_memory_barrier() logic
 | ||
| - nir: Replace the scoped_memory barrier by a scoped_barrier
 | ||
| - intel/compiler: Extract control barriers from scoped barriers
 | ||
| - spirv: Use scoped barriers for SpvOpControlBarrier
 | ||
| - nir: Add new rules to optimize NOOP pack/unpack pairs
 | ||
| - nir: Use a switch in build_deref_offset()/deref_instr_get_const_offset()
 | ||
| - nir: Allow casts in nir_deref_instr_get[_const]_offset()
 | ||
| - freedreno: Initialize lower_int64_options to a proper value
 | ||
| - nir: Stop passing an options arg to nir_lower_int64()
 | ||
| - nir: Extend nir_lower_int64() to support i2f/f2i lowering
 | ||
| - intel: Set int64_options to ~0 when lowering 64b ops
 | ||
| - nir: Get rid of __[u]int64_to_fp32() and __fp32_to_[u]int64()
 | ||
| - nir: Fix i64tof32 lowering
 | ||
| - spirv: Add a vtn_get_mem_operands() helper
 | ||
| 
 | ||
| Boyuan Zhang (2):
 | ||
| 
 | ||
| - radeon/vcn/enc: Re-write PPS encoding for HEVC
 | ||
| - radeon/vcn: bump vcn3.0 encode major version to 1
 | ||
| 
 | ||
| Brian Ho (14):
 | ||
| 
 | ||
| - turnip: Execute ir3_nir_lower_gs pass again
 | ||
| - turnip: Fill out VkPhysicalDeviceSubgroupProperties
 | ||
| - nir: Support sysval tess levels in SPIR-V to NIR
 | ||
| - nir: Add an option for lowering TessLevelInner/Outer to vecs
 | ||
| - turnip: Lower shaders for tessellation
 | ||
| - turnip: Offset by component when lowering gl_TessLevel*
 | ||
| - turnip: Parse tess state and support PATCH primtype
 | ||
| - turnip: Allocate tess BOs as a function of draw size
 | ||
| - turnip: Update VFD_CONTROL with tess system values
 | ||
| - turnip: Emit HS/DS user consts as draw states
 | ||
| - turnip: Support tess for draws
 | ||
| - turnip: Force sysmem for tessellation
 | ||
| - ir3: Unconditionally enable MERGEDREGS on a6xx
 | ||
| - turnip: Enable tessellationShader physical device feature
 | ||
| 
 | ||
| Caio Marcelo de Oliveira Filho (32):
 | ||
| 
 | ||
| - intel/dev: Bail when INTEL_DEVID_OVERRIDE is not valid
 | ||
| - intel/fs: Clean up variable group size handling in backend
 | ||
| - intel/fs: Add an option to lower variable group size in backend
 | ||
| - intel/fs: Add and use a new load_simd_width_intel intrinsic
 | ||
| - intel: Let drivers call brw_nir_lower_cs_intrinsics()
 | ||
| - iris: Implement ARB_compute_variable_group_size
 | ||
| - util/list: Add list_foreach_entry_from_safe
 | ||
| - nir: Use deref intrinsics to set writes_memory when gathering info
 | ||
| - intel/fs: Use writes_memory from shader_info
 | ||
| - nir: Consider atomic counter intrinsics when setting writes_memory
 | ||
| - intel/fs: Remove unused emission of load_simd_with_intel
 | ||
| - intel/fs: Remove unused state from brw_nir_lower_cs_intrinsics
 | ||
| - intel/fs: Early return when can't satisfy explicit group size
 | ||
| - intel/fs: Remove redundant assert()
 | ||
| - intel/fs: Remove min_dispatch_width spilling decision from RA
 | ||
| - intel/fs: Support INTEL_DEBUG=no8,no32 in compute shaders
 | ||
| - intel/fs: Add helper to get prog_offset and simd_size
 | ||
| - i965: Use new helper functions to pick SIMD variant for CS
 | ||
| - iris: Set CS KernelStatePointer at dispatch
 | ||
| - iris: Use new helper functions to pick SIMD variant for CS
 | ||
| - anv: Use new helper functions to pick SIMD variant for CS
 | ||
| - intel/fs: Generate multiple CS SIMD variants for variable group size
 | ||
| - iris, i965: Drop max_variable_local_size
 | ||
| - iris, i965: Update limits for ARB_compute_variable_group_size
 | ||
| - intel: Add helper to calculate GPGPU_WALKER::RightExecutionMask
 | ||
| - nir: Fix printing execution scope of a scoped barrier
 | ||
| - spirv: Memory semantics is optional for OpControlBarrier
 | ||
| - intel/fs: Add Fall-through comment
 | ||
| - nir: Fix logic that ends combine barrier sequence
 | ||
| - spirv: Handle most execution modes earlier
 | ||
| - nir: Filter modes of scoped memory barrier in nir_opt_load_store_vectorize
 | ||
| - spirv: Propagate explicit layout only in types that need it
 | ||
| 
 | ||
| Charmaine Lee (1):
 | ||
| 
 | ||
| - llvmpipe: do not enable tessellation shader without llvm coroutines support
 | ||
| 
 | ||
| Chris Forbes (12):
 | ||
| 
 | ||
| - bifrost: Set RTZ rounding mode for f2i conversion
 | ||
| - bifrost: Lower x->bool conversions to != 0
 | ||
| - bifrost: Emit "d3d" variant of comparison instructions
 | ||
| - bifrost: Document d3d/gl comparison control bit
 | ||
| - bifrost: Add lowering for b2i32
 | ||
| - bifrost: Add support for nir_op_inot
 | ||
| - bifrost: Add support for nir_op_ishl
 | ||
| - bifrost: Add support for nir_op_uge
 | ||
| - bifrost: Add support for nir_op_imul
 | ||
| - bifrost: Add support for nir_op_iabs
 | ||
| - bifrost: Honor src swizzle in special math ops
 | ||
| - bifrost: Fix packing of ADD_FEXP2_FAST
 | ||
| 
 | ||
| Chris Wilson (6):
 | ||
| 
 | ||
| - iris: Place a seqno at the end of every batch
 | ||
| - iris: Convert fences to using lightweight seqno
 | ||
| - iris: Store a seqno for each batch in the fence
 | ||
| - iris: Initialise stub iris_seqno to 0
 | ||
| - iris: Rename iris_seqno to iris_fine_fence
 | ||
| - iris: Fixup copy'n'paste mistake in Makefile.sources
 | ||
| 
 | ||
| Christian Gmeiner (31):
 | ||
| 
 | ||
| - etnaviv: fix SAMP_ANISOTROPY register value
 | ||
| - etnaviv: do not use int filter when anisotropic filtering is used
 | ||
| - ci: bare-metal: make it possible to use a script for serial
 | ||
| - ci: extend expect-output.sh
 | ||
| - ci: add U-Boot specific fetch strings
 | ||
| - etnaviv: drop translate_blend(..)
 | ||
| - ci: add arm_test-base docker image
 | ||
| - ci: use separate docker images for baremetal builds
 | ||
| - ci: fix possible spuriously run of jobs
 | ||
| - etnaviv: delete not used struct
 | ||
| - etnaviv: convert enums
 | ||
| - etnaviv: move etna_lower_io(..) to etnaviv_nir.c
 | ||
| - etnaviv: get rid of etna_compile dependency
 | ||
| - etnaviv: move etna_lower_alu(..) to etnaviv_nir.c
 | ||
| - etnaviv: drop OPT_V define
 | ||
| - etnaviv: make more use of compile_error(..)
 | ||
| - etnaviv: move liveness related stuff into own file
 | ||
| - etnaviv: merge struct etna_compile and etna_state
 | ||
| - etnaviv: drop emit macro
 | ||
| - etnaviv: move functions that generate asm to own file
 | ||
| - etnaviv: move nir compiler related stuff into .c file
 | ||
| - etnaviv: move ra into own file
 | ||
| - etnaviv: replace prims-emitted query
 | ||
| - ci: bare-metal: use nginx to get results from DUT
 | ||
| - etnaviv: explicitly set nir_variable_mode
 | ||
| - etnaviv: introduce struct etna_compiler
 | ||
| - etnaviv: move shader_count to etna_compiler
 | ||
| - etnaviv: do register setup only once
 | ||
| - etnaviv: fix nir validation problem
 | ||
| - etnaviv: call nir_lower_bool_to_bitsize
 | ||
| - etnaviv: completely turn off MSAA
 | ||
| 
 | ||
| Christopher Egert (2):
 | ||
| 
 | ||
| - radv: use util_float_to_half_rtz
 | ||
| - r600: Use TRUNC_COORD on samplers
 | ||
| 
 | ||
| Clément Guérin (1):
 | ||
| 
 | ||
| - radv: Always expose non-visible local memory type on dedicated GPUs
 | ||
| 
 | ||
| Con Kolivas (1):
 | ||
| 
 | ||
| - Linux: Change minimum priority threads from SCHED_IDLE to nice 19 SCHED_BATCH.
 | ||
| 
 | ||
| Connor Abbott (88):
 | ||
| 
 | ||
| - tu: Support pipelines without a fragment shader
 | ||
| - tu: Add a "scratch bo" allocation mechanism
 | ||
| - tu: Add noubwc debug flag to disable UBWC
 | ||
| - tu: Implement fallback linear staging blit for CopyImage
 | ||
| - freedreno/a6xx: Document dual-src blending enable bits
 | ||
| - ir3: Fixup dual-source blending slot
 | ||
| - tu: Move RENDER_COMPONENTS setting to pipeline state
 | ||
| - tu: Implement dual-src blending
 | ||
| - tu: Advertise COLOR_ATTACHMENT_BLEND_BIT for blendable formats
 | ||
| - tu: Always initialize image_view fields for blit sources
 | ||
| - tu: Fall back to 3d blit path for BC1_RGB_* formats
 | ||
| - tu: Fix buffer compressed pitch calculation with unaligned sizes
 | ||
| - tu: Support VK_FORMAT_FEATURE_BLIT_SRC_BIT for texture-only formats
 | ||
| - tu: Fix IBO descriptor for cubes
 | ||
| - tu: Respect VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT
 | ||
| - tu: Add missing storage image/texel buffer bits
 | ||
| - tu: Remove useless post-binning flushes
 | ||
| - tu: Don't actually track seqno's for events
 | ||
| - tu: Remove useless event_write helpers
 | ||
| - tu: Rewrite flushing to use barriers
 | ||
| - tu: Fix context faults loading unused descriptor sets
 | ||
| - ir3: Pass reserved_user_consts to ir3_shader_from_nir()
 | ||
| - tu: Remove num_samp hack
 | ||
| - tu: Use the ir3 shader API
 | ||
| - tu: Remove tu_shader_compile_options
 | ||
| - tu: Set num_components to 0 when building bindless intrinsics
 | ||
| - ir3: Don't calculate num_samp ourselves
 | ||
| - tu: Actually remove dead variables after io lowering
 | ||
| - ir3: Split out variant-specific lowering and optimizations
 | ||
| - ir3, freedreno: Round up constlen earlier
 | ||
| - ir3: Include ir3_compiler from ir3_shader
 | ||
| - ir3: Support variants with different constlen's
 | ||
| - ir3: Add ir3_trim_constlen()
 | ||
| - tu: Share constlen between different stages properly
 | ||
| - freedreno: Refactor ir3_cache shader compilation
 | ||
| - freedreno: Share constlen between different stages properly
 | ||
| - freedreno: On a5xx+ INDX_SIZE is MAX_INDICES
 | ||
| - freedreno/registers: Label firstIndex field in CP_DRAW_INDX_OFFSET
 | ||
| - tu: Pass firstIndex directly to CP_DRAW_INDX_OFFSET
 | ||
| - freedreno/a6xx: use firstIndex field
 | ||
| - nir: Refactor load/store intrinsic helper
 | ||
| - nir: add vec2_index_32bit_offset address format
 | ||
| - tu: Rewrite variable lowering
 | ||
| - tu: Enable KHR_variable_pointers
 | ||
| - ir3: Add layer_zero variant bit
 | ||
| - tu: Force gl_Layer to 0 when necessary
 | ||
| - freedreno/a6xx: Force gl_Layer to 0 when necessary
 | ||
| - freedreno: Include adreno_pm4.xml.h before adreno_a6xx.xml.h
 | ||
| - freedreno: Sync registers with envytools
 | ||
| - freedreno/a6xx: Rename and document HLSQ_UPDATE_CNTL
 | ||
| - freedreno/a6xx: Add some documentation for shared consts
 | ||
| - tu: Don't invalidate irrelevant state when changing pipeline
 | ||
| - freedreno/a6xx: Add stencilref register info
 | ||
| - ir3: Handle gl_FragStencilRefARB
 | ||
| - tu: Enable VK_EXT_shader_stencil_export
 | ||
| - freedreno: Add a helper for computing guardband sizes
 | ||
| - tu: Use common guardband helper
 | ||
| - freedreno: Use common guardband helper
 | ||
| - freedreno/ir3: Fix SSBO size for bindless SSBO's
 | ||
| - tu: Enable VK_EXT_depth_clip_enable
 | ||
| - freedreno: Clean up CP_DRAW_MULTI_INDIRECT definition
 | ||
| - freedreno: Add INDIRECT_COUNT CP_DRAW_INDIRECT_MULTI variants
 | ||
| - tu: Integrate WFI/WAIT_FOR_ME/WAIT_MEM_WRITES with cache tracking
 | ||
| - tu: Add missing wfi to tu6_emit_hw()
 | ||
| - tu: Implement VK_KHR_draw_indirect_count
 | ||
| - tu: Fix empty blit scissor case
 | ||
| - tu: Fix hangs for DS with no output
 | ||
| - tu: Detect invalid-for-binning renderpass dependencies
 | ||
| - tu: Enable vertex & fragment stores & atomics
 | ||
| - tu: Fix descriptor update templates with input attachments
 | ||
| - ir3: Validate bindless samp_tex correctly
 | ||
| - ir3: Remove redundant samp_tex validation
 | ||
| - ir3: Fix incorrect src flags for samp_tex
 | ||
| - tu: Enable resource dynamic indexing
 | ||
| - freedreno/rnn: Return success when parsing addvariant
 | ||
| - tu: Dump CP_DRAW_INDIRECT_MULTI draw BO's
 | ||
| - freedreno/rnn: Support stripes in rnndec_decodereg
 | ||
| - freedreno/cffdec: Handle CP_DRAW_INDIRECT_MULTI like other draws
 | ||
| - freedreno: Add trace for CP_DRAW_INDIRECT_MULTI
 | ||
| - freedreno/a6xx: Fix CP_BIN_SIZE_ADDRESS name
 | ||
| - freedreno/rnn: Make rnn_decode_enum() respect variants
 | ||
| - freedreno/cffdec: Stop open-coding enum parsing
 | ||
| - freedreno/afuc: Add missing rnn_prepdb()
 | ||
| - freedreno/afuc: Fix PM4 enum parsing
 | ||
| - tu: Fix DST_INCOHERENT_FLUSH copy/paste error
 | ||
| - freedreno: Document draw predication packets
 | ||
| - tu: Reset has_tess after renderpass
 | ||
| - tu: Implement VK_EXT_conditional_rendering
 | ||
| 
 | ||
| D Scott Phillips (4):
 | ||
| 
 | ||
| - intel/fs: Update location of Render Target Array Index for gen12
 | ||
| - anv,iris: Fix input vertex max for tcs on gen12
 | ||
| - intel/dump_gpu: Fix name of LD_PRELOAD in env append logic
 | ||
| - anv/gen11+: Disable object level preemption
 | ||
| 
 | ||
| Daniel Schürmann (54):
 | ||
| 
 | ||
| - aco: either copy-propagate or inline create_vector operands
 | ||
| - aco: coalesce parallelcopies during register allocation
 | ||
| - nir: add nir_intrinsic_elect to divergence analysis
 | ||
| - nir: refactor divergence analysis state
 | ||
| - nir: rework phi handling in divergence analysis
 | ||
| - nir: simplify phi handling in divergence analysis
 | ||
| - nir: reset ssa-defs as non-divergent during divergence analysis instead of upfront
 | ||
| - aco: fix WQM coalescing
 | ||
| - aco: restrict copying of create_vector operands to GFX9+
 | ||
| - aco: don't move create_vector subdword operands to unsupported register offsets
 | ||
| - aco: fix corner case in register allocation
 | ||
| - aco: don't allow unaligned subdword accesses on GFX6/7
 | ||
| - aco: fix register assignment for p_create_vector on GFX6/7
 | ||
| - aco: simplify statistics collection for copies
 | ||
| - aco: use full-register instructions to implement subdword packing on GFX6/7
 | ||
| - aco: Workarounds subdword lowering on GFX6/7
 | ||
| - aco: adjust GFX6 subdword lowering workarounds for 8bit
 | ||
| - aco: add and use scratch SGPR to lower subdword p_create_vector on GFX6/7
 | ||
| - aco: coalesce copies more aggressively when lowering to hw
 | ||
| - aco: skip partial copies on first iteration when lowering to hw
 | ||
| - aco: optimize packing of 16bit subdword registers on GFX6/7
 | ||
| - aco: remove unnecessary split- and create_vector instructions for subdword loads
 | ||
| - aco: fix shared subdword loads
 | ||
| - aco: reorder calls to aco_validate() and cleanup aco_compile_shader()
 | ||
| - aco: don't allow SGPRs on logical phis
 | ||
| - aco: fix WQM handling in nested loops
 | ||
| - radv/aco: implement logic64 instead of lowering
 | ||
| - aco: align swap operations to 4 bytes on GFX6/7
 | ||
| - aco: don't allow partial copies on GFX6/7
 | ||
| - radv: introduce RADV_DEBUG=llvm option
 | ||
| - radv: change use_aco -> use_llvm
 | ||
| - radv: enable ACO by default
 | ||
| - aco: fix partial copies on GFX6/7
 | ||
| - aco: remove superflous (bool & exec) if the result comes from VOPC
 | ||
| - nir: also move vecN in case of nir_move_copies
 | ||
| - nir: refactor nir_can_move_instr
 | ||
| - nir/algebraic: optimize bcsel(a, 0, 1) to b2i
 | ||
| - nir: also move b2i in case of nir_move_copies
 | ||
| - nir/algebraic: optimize iand/ior of (n)eq zero
 | ||
| - nir/algebraic: add optimizations for fsign/isign
 | ||
| - nir/algebraic: add some more unop + bcsel optimizations
 | ||
| - nir/algebraic: optimize fmul(x, bcsel(c, -1.0, 1.0)) -> bcsel(c, -x, x)
 | ||
| - nir/algebraic: optimize (a < 0.0) ? -a : a -> fabs(a)
 | ||
| - nir/algebraic: add distributive rules for ior/iand
 | ||
| - nir/algebraic: propagate b2i out of ior/iand
 | ||
| - nir/algebraic: fold some nested bcsel
 | ||
| - aco: fix scratch loads which cross element_size boundaries
 | ||
| - aco: ensure to not extract more components than have been fetched
 | ||
| - aco: don't split store data if it was already split into more elements
 | ||
| - aco: prevent infinite recursion in RA for subdword variables
 | ||
| - aco: ensure readfirstlane subdword operands are always dword aligned
 | ||
| - radv: call radv_nir_lower_ycbcr_textures after first optimizations
 | ||
| - aco: add GFX6/7 subdword lowering tests
 | ||
| - aco: execute branch instructions in WQM if necessary
 | ||
| 
 | ||
| Daniel Stone (13):
 | ||
| 
 | ||
| - CI: Disable Panfrost T7x0 jobs
 | ||
| - CI: Re-enable Panfrost T7x0 jobs
 | ||
| - llvmpipe: Expect increased exp precision on Windows
 | ||
| - CI: Windows: Build LLVM and llvmpipe
 | ||
| - CI: Disable Panfrost T720/T760
 | ||
| - Revert "CI: Disable Panfrost T720/T760"
 | ||
| - CI: Enable assertions on Windows
 | ||
| - CI: Try shared libraries on Windows
 | ||
| - CI: Correct build-directory path on Windows, and keep it
 | ||
| - CI: Re-enable the Windows VS2019 build job
 | ||
| - CI: Temporarily disable Panfrost T860 jobs
 | ||
| - CI: Re-enable Panfrost T860 jobs
 | ||
| - CI: Disable Windows build due to unstable infrastructure
 | ||
| 
 | ||
| Danylo Piliaiev (25):
 | ||
| 
 | ||
| - glsl: rename has_implicit_uint_to_int_conversion to *_int_to_uint_*
 | ||
| - i965: Fix out-of-bounds access to brw_stage_state::surf_offset
 | ||
| - anv: Translate relative timeout to absolute when calling anv_timelines_wait
 | ||
| - anv: Fix deadlock in anv_timelines_wait
 | ||
| - meson: Disable GCC's dead store elimination for memory zeroing custom new
 | ||
| - mesa: Fix double-lock of Shared->FrameBuffers and usage of wrong mutex
 | ||
| - st/mesa: Clear texture's views when texture is removed from Shared->TexObjects
 | ||
| - intel/fs: Work around dual-source blending hangs in combination with SIMD16
 | ||
| - glsl: Don't replace lrp pattern with lrp if arguments are not floats
 | ||
| - glsl: inline functions with unsupported return type before converting to nir
 | ||
| - i965: Work around incorrect usage of glDrawRangeElements in UE4
 | ||
| - st/mesa: account for "loose", per-mipmap level textures in CopyImageSubData
 | ||
| - iris: Honor scanout requirement from DRI
 | ||
| - iris: Fix fast-clearing of depth via glClearTex(Sub)Image
 | ||
| - nir/opt_if: Fix opt_if_simplification when else branch has jump
 | ||
| - nir/tests: Add tests for opt_if_simplification
 | ||
| - st/mesa: Treat vertex outputs absent in outputMapping as zero in mesa_to_tgsi
 | ||
| - anv/nir: Unify inputs_read/outputs_written between geometry stages
 | ||
| - spirv: Only require bare types to match when copying variables
 | ||
| - glsl: Eliminate out-of-bounds triop_vector_insert
 | ||
| - intel/compiler: Fix pointer arithmetic when reading shader assembly
 | ||
| - glsl: Eliminate assigments to out-of-bounds elements of vector
 | ||
| - nir/lower_io: Eliminate oob writes and return zero for oob reads
 | ||
| - nir/large_constants: Eliminate out-of-bounds writes to large constants
 | ||
| - nir/lower_samplers: Clamp out-of-bounds access to array of samplers
 | ||
| 
 | ||
| Daryl W. Grunau (1):
 | ||
| 
 | ||
| - prevent multiply defined symbols
 | ||
| 
 | ||
| Dave Airlie (199):
 | ||
| 
 | ||
| - i965: add support for gen 5 pipelined pointers to dump
 | ||
| - i965: disable shadow batches when batch debugging.
 | ||
| - draw/tess: free tessellation control shader i/o memory.
 | ||
| - llvmpipo/nir: free compute shader NIR
 | ||
| - llvmpipe: simple texture barrier implementation.
 | ||
| - gallivm/sample: add multisample support for texel fetch
 | ||
| - gallivm/sample: add multisample image operation support
 | ||
| - gallivm/nir/tgsi: add multisample texture sampling.
 | ||
| - gallivm/nir: add multisample support to image size
 | ||
| - gallivm/nir: add multisample image operations
 | ||
| - draw: introduce sampler num samples + stride members
 | ||
| - draw: add support for num_samples + sample_stride to the image paths
 | ||
| - llvmpipe: add num_samples/sample_stride support to jit textures
 | ||
| - llvmpipe: add samples support to image jit
 | ||
| - util: add a resource wrapper to get resource samples
 | ||
| - llvmpipe: add multisample support to texture allocator.
 | ||
| - llvmpipe: add a max samples define set to 4.
 | ||
| - gallium/util: split out zstencil clearing code.
 | ||
| - llvmpipe: fix race between draw and setting fragment shader.
 | ||
| - llvmpipe: add get_sample_position support (v2)
 | ||
| - llvmpipe/jit: pass fragment sample mask via jit context.
 | ||
| - llvmpipe: pass incoming sample_mask into fragment shader context.
 | ||
| - llvmpipe: add internal multisample texture mapping path.
 | ||
| - llvmpipe: add multisample resource copy region support.
 | ||
| - llvmpipe: add clear texture support for multisample textures.
 | ||
| - llvmpipe: handle multisample render target clears
 | ||
| - draw: disable point/line smoothing for multisample (v2)
 | ||
| - llvmpipe: pass color and depth sample strides into fragment shader.
 | ||
| - llvmpipe: record sample info for color/depth buffers in scene
 | ||
| - llvmpipe/rast: fix tile clearing for multisample color and depth tiles
 | ||
| - llvmpipe: plumb multisample state bit into setup code.
 | ||
| - llvmpipe: add multisample bit to fragment shader key.
 | ||
| - llvmpipe: change mask input to fragment shader to 64-bit.
 | ||
| - llvmpipe: add cbuf/zsbuf + coverage samples to the fragment shader key.
 | ||
| - gallivm: add sample id/pos intrinsic support
 | ||
| - gallivm: add mask api to force mask
 | ||
| - nir/tgsi: translate the interp location
 | ||
| - llvmpipe: pass interp location into interpolation code.
 | ||
| - llvmpipe: add centroid interpolation support.
 | ||
| - llvmpipe: add per-sample interpolation.
 | ||
| - llvmpipe: move getting mask value out of depth code. (v2)
 | ||
| - llvmpipe: add per-sample depth/stencil test
 | ||
| - llvmpipe: move some fs code around
 | ||
| - llvmpipe: multisample sample mask + early/late depth pass
 | ||
| - llvmpipe: handle multisample early depth test/late depth write
 | ||
| - llvmpipe: interpolate Z at sample points for early depth test.
 | ||
| - llvmpipe: handle multisample color stores.
 | ||
| - llvmpipe: hook up sample position system value
 | ||
| - llvmpipe: add multisample alpha to coverage support.
 | ||
| - llvmpipe: add multisample alpha to one support
 | ||
| - llvmpipe: handle gl_SampleMask writing.
 | ||
| - llvmpipe: don't allow branch to end for early Z with multisample
 | ||
| - llvmpipe: pass mask store into interp for centroid interpolation
 | ||
| - llvmpipe: move color storing earlier in frag shader
 | ||
| - llvmpipe: fix multisample occlusion queries.
 | ||
| - llvmpipe: disable opaque variant for multisample
 | ||
| - llvmpipe: add new rast api to pass full 64-bit mask.
 | ||
| - llvmpipe: add fixed point sample positions to scene.
 | ||
| - llvmpipe: build 64-bit coverage mask in rasterizer
 | ||
| - llvmpipe: fixup multisample coverage masks for covered tiles
 | ||
| - llvmpipe: generate multisample triangle rasterizer functions (v2)
 | ||
| - llvmpipe: choose multisample rasterizer functions per triangle (v2)
 | ||
| - llvmpipe: choose correct position for multisample
 | ||
| - llvmpipe: don't choose pixel centers for multisample
 | ||
| - drisw: add multisample support to sw dri layer.
 | ||
| - llvmpipe: enable 4x sample MSAA + texture multisample
 | ||
| - gallivm/sample: add num samples query for txqs (v2)
 | ||
| - gallivm/nir: hooks up texture samples queries
 | ||
| - llvmpipe: enable GL_ARB_shader_texture_image_samples
 | ||
| - llvmpipe: add min samples support to the fragment shader.
 | ||
| - llvmpipe: enable ARB_sample_shading
 | ||
| - llvmpipe: make sample position a global array.
 | ||
| - zink: enable conditional rendering if available
 | ||
| - r600: enable TEXCOORD semantic for TGSI.
 | ||
| - r600/sfn: plumb the chip class into the instruction emission
 | ||
| - r600/sfn: fix cayman float instruction emission.
 | ||
| - r600/sfn: cayman fix int trans op2
 | ||
| - r600/sfn: add callstack non-evergreen support
 | ||
| - r600/sfn: add emit if start cayman support
 | ||
| - llvmpipe: don't use sample mask with 0 samples
 | ||
| - llvmpipe: use per-sample position not sample id for interp
 | ||
| - llvmpipe/interp: fix interpolating frag pos for sample shading
 | ||
| - llvmpipe: remove non-simple interpolation paths.
 | ||
| - gallivm/nir: add an interpolation interface.
 | ||
| - llvmpipe/interp: refactor out use of pixel center offset
 | ||
| - llvmpipe/interp: refactor out centroid calculations
 | ||
| - llvmpipe: add interp instruction support
 | ||
| - llvmpipe/fs: hook up the interpolation APIs.
 | ||
| - gallivm/nir: add sample_mask_in support
 | ||
| - llvmpipe: add gl_SampleMaskIn support.
 | ||
| - r600/sfn: fix nop channel assignment.
 | ||
| - llvmpipe: compute shaders work better with all the threads.
 | ||
| - llvmpipe: move coroutines out of noopt case
 | ||
| - ci: bump virglrenderer to latest version
 | ||
| - util/disk_cache: add fallback for disk_cache_get_function_identifier
 | ||
| - llvmpipe/cs: overhaul cs variant key state.
 | ||
| - llvmpipe/draw: drop variant number from function names.
 | ||
| - gallivm: rework coroutine malloc/free callouts.
 | ||
| - gallivm: rework debug printf hook to use global mapping.
 | ||
| - gallivm: add support for a cache object
 | ||
| - gallivm: skip operations if we have a cached object.
 | ||
| - gallivm: add cache interface to mcjit
 | ||
| - llvmpipe: add infrastructure for disk cache support
 | ||
| - gallivm: don't cache shaders that use fetch functions.
 | ||
| - llvmpipe/fs: add caching support
 | ||
| - llvmpipe/cs: add shader caching
 | ||
| - draw: add disk cache callbacks for draw shaders
 | ||
| - llvmpipe: hook draw disk cache up
 | ||
| - draw: add disk caching for draw shaders
 | ||
| - draw/gs: fix emitting inactive primitives crash
 | ||
| - draw/gs: add more info to debugging.
 | ||
| - gallivm/nir: add group barrier support
 | ||
| - llvmpipe: fix subpixel bits reporting.
 | ||
| - gallivm/format: convert unsigned values to float properly.
 | ||
| - gallivm/conv: enable conversion min code. (v2)
 | ||
| - gallivm/sample: fix texel type for stencil 8-bit
 | ||
| - llvmpipe/setup: add planes for draw regions if no scissor.
 | ||
| - gallivm/cache: don't require a null terminator for cache data.
 | ||
| - mesa/gles3: add support for GL_EXT_shader_group_vote
 | ||
| - virgl: change vendor id to reflect reality more.
 | ||
| - llvmpipe: change vendor to be more generic.
 | ||
| - softpipe: change vendor name to something more generic.
 | ||
| - gallivm/nir: fix const loading on big endian systems
 | ||
| - glsl: fix constant packing for 64-bit big endian.
 | ||
| - gallivm/nir: fix big-endian 64-bit splitting/merging.
 | ||
| - llvmpipe: fix occlusion queries on big-endian.
 | ||
| - mesa/get: fix enum16 big-endian getting.
 | ||
| - draw/llvm: fix big-endian mask adjusting
 | ||
| - draw: pass nr_samplers into llvm sample state creation.
 | ||
| - llvmpipe: pass number of samplers into llvm sampler code.
 | ||
| - gallivm/sample: change texture function generator api
 | ||
| - gallivm: add indirect texture switch statement builder.
 | ||
| - draw: add support for indirect texture access
 | ||
| - llvmpipe: add support for indirect texture access.
 | ||
| - gallivm/nir: add texture unit indexing
 | ||
| - gallivm/nir: handle non-uniform texture offsets
 | ||
| - gallivm/sample: pass indirect offset into texture/image units
 | ||
| - llvmpipe/draw: wire up indirect offset
 | ||
| - gallivm/sample: handle size unit offset
 | ||
| - llvmpipe: enable ARB_gpu_shader5
 | ||
| - draw: pass number of images to image soa create
 | ||
| - llvmpipe: pass number of images into image soa create
 | ||
| - gallivm/nir: support passing image index into image code.
 | ||
| - gallivm/nir: refactor image operations for indirect support.
 | ||
| - gallivm/img: refactor out the texel return type (v2)
 | ||
| - gallivm/nir: add support for indirect image loading
 | ||
| - draw/sample: add support for indirect images
 | ||
| - llvmpipe: handle indirect images properly
 | ||
| - ci: fixup tests after all indirect images fixes.
 | ||
| - docs: update llvmpipe GL 4.0 status
 | ||
| - draw/clip: cleanup viewport index handling code.
 | ||
| - draw/clip: fix viewport index for geometry shaders
 | ||
| - mesa/version: only enable GL4.1 with correct limits.
 | ||
| - llvmpipe: bump texture/scene limits to enable GL 4.1
 | ||
| - llvmpipe: bump to GL support to GL 4.1
 | ||
| - llvmpipe: enable GL 4.2
 | ||
| - gallivm/nir: call end prim at end on all GS streams.
 | ||
| - draw: emit so primitives before ending empty pipeline.
 | ||
| - draw/gs: fix up current verts in output fetching.
 | ||
| - gallivm/draw/gs: pass vertex stream count into shader build
 | ||
| - draw/gs: only allocate memory for streams needed.
 | ||
| - gallivm/gs_iface: pass stream into end primitive interface.
 | ||
| - gallivm/nir: don't access stream var outside bounds
 | ||
| - gallivm/nir: end primitive for all streams.
 | ||
| - draw: account primitive lengths for all streams.
 | ||
| - draw/gs: reverse the polarity of the invocation/prims execution
 | ||
| - draw: use common exit path in pipeline finish.
 | ||
| - draw: free vertex info from geometry streams.
 | ||
| - draw/gs: use mask to limit vertex emission.
 | ||
| - ci/virgl: update results after streams fixes.
 | ||
| - llvmpipe: add ARB_post_depth_coverage support.
 | ||
| - llvmpipe: denote NEW fs when images change.
 | ||
| - llvmpipe: flush resources on sampler view binding
 | ||
| - llvmpipe/cs: fix image/sampler binding for compute
 | ||
| - nouveau: avoid LTO ODR warning (v2)
 | ||
| - gallivm/sample: always square rho before fast log2
 | ||
| - llvmpipe/format: fix snorm conversion
 | ||
| - mesa: change dsa texture error codes for GL 4.6
 | ||
| - ci: bump piglit checkout for dsa tests
 | ||
| - llvmpipe: fix stencil only formats.
 | ||
| - llvmpipe: fix position offset interpolation
 | ||
| - llvmpipe/cs: respect render condition
 | ||
| - llvmpipe: add framebuffer fetching support (v1.1)
 | ||
| - ci/llvmpipe: reenable gpu shader5 tests
 | ||
| - llvmpipe: enable EXT_texture_shadow_lod
 | ||
| - llvmpipe/draw: handle constant buffer limits and robustness (v1.1)
 | ||
| - drisw: add robustness extension support.
 | ||
| - glx/drisw: add robustness support
 | ||
| - llvmpipe: add device reset query context hook.
 | ||
| - llvmpipe: enable robust buffer access + GL 4.3, GLES 3.2 and robust buffer access behaviour
 | ||
| - llvmpipe/ms: fix sign extension bug in rasterizer.
 | ||
| - Revert "llvmpipe: Use the default behavior of ALLOW_MAPPED_BUFFERS."
 | ||
| - radv: cleanup locking around timeline waiting.
 | ||
| - llvmpipe: only read 0 for channels being read
 | ||
| - llvmpipe/blit: for 32-bit unorm depth blits just copy 32-bit
 | ||
| - llvmpipe: enable GL 4.5
 | ||
| - llvmpipe/cs: update compute counters not fragment shader.
 | ||
| - llvmpipe: include gallivm perf flags in shader cache.
 | ||
| - gallivm: disable brilinear for lod bias and explicit lod.
 | ||
| 
 | ||
| David McFarland (1):
 | ||
| 
 | ||
| - radv: link with ld_args_build_id
 | ||
| 
 | ||
| David Stevens (2):
 | ||
| 
 | ||
| - nir: Add colorspace support to YUV lowering pass
 | ||
| - i965/i915: Add colorspace support to YUV sampling
 | ||
| 
 | ||
| Denys (1):
 | ||
| 
 | ||
| - gitlab: Ask about reproduction rate in the issue template
 | ||
| 
 | ||
| Dmitriy Nester (8):
 | ||
| 
 | ||
| - mesa: check draw buffer completeness on glClearBufferfv/glClearBufferuiv
 | ||
| - nir: replace fnv1a hash function with xxhash
 | ||
| - freedreno: replace fnv1a hash function with xxhash
 | ||
| - i965: replace fnv1a hash function with xxhash
 | ||
| - util/hash_table: replace fnv1a hash function with xxhash
 | ||
| - r600: replace fnv1a hash function with xxhash
 | ||
| - zink: replace fnv1a hash function with xxhash
 | ||
| - util: delete fnv1a hash function
 | ||
| 
 | ||
| Duncan Hopkins (1):
 | ||
| 
 | ||
| - zink. Changed sampler default name.
 | ||
| 
 | ||
| Dylan Baker (41):
 | ||
| 
 | ||
| - docs: Add release notes for 20.0.6
 | ||
| - docs: Add SHA256 sums for 20.0.6
 | ||
| - docs: update calendar, add news item, and link releases notes for 20.0.6
 | ||
| - docs: Add release notes for 20.0.7
 | ||
| - docs/relnotes Add sha256 sums to 20.0.7
 | ||
| - docs: update calendar, add news item, and link releases notes for 20.0.7
 | ||
| - tests: Make tests aware of meson test wrapper
 | ||
| - meson: Bump required version to 0.52.0
 | ||
| - meson: Use the check_header function
 | ||
| - meson: Use build_always_stale instead of build_always
 | ||
| - meson: Use builtins for checking gnu __attributes__
 | ||
| - drm-shim/meson: The name of the target is a string not a list
 | ||
| - drm-shim/meson: Use portable override_options for setting C standard
 | ||
| - meson: use gnu_symbol_visibility argument
 | ||
| - meson: use 2 space not 3 space indent
 | ||
| - meson: deprecated 'true' and 'false' in combo options for 'enabled' and 'disabled'
 | ||
| - vulkan-overlay/meson: use install_data instead of configure_file
 | ||
| - docs: Add release notes for 20.0.8
 | ||
| - docs: Add sha256sums for 20.0.8
 | ||
| - docs: update calendar, add news item, and link releases notes for 20.0.8
 | ||
| - mesa/swrast: use logf2 instead of util_fast_log2
 | ||
| - VERSION: bump for 20.2.0-rc1
 | ||
| - .pick_status.json: Update to 9333a8570d2174b73da63c3ee6f1a740ae487ab8
 | ||
| - .pick_status.json: Update to 1e28745bc0d3528c1dfc25459456849feb58d407
 | ||
| - meson/freedreno: Fix lua requirement
 | ||
| - .pick_status.json: Update to fdb97d3d2914c8f887a7968432db4fdbd35d8376
 | ||
| - bump version for 20.2.0-rc2
 | ||
| - .pick_status.json: Update to 61042b1bdb199f98dd34085ed29a8c492ed9b2a3
 | ||
| - .pick_status.json: Update to 6d28270968e0728bf8bdf48a6abd261c50d9ef07
 | ||
| - .pick_status.json: Update to ca7d66e847d08914cec0a5e003b400da9c0a2695
 | ||
| - VERSION: bump for 20.2.0-rc3
 | ||
| - .pick_status.json: Update to 7fbded8b5821a47c26245b181446f972f920a96e
 | ||
| - .pick_status.json: Mark e93979ba599355c42df01a89073362b970489a3a as denominated
 | ||
| - .pick_status.json: Update to b9927c8c8d0c105699306a68773c015930ff9509
 | ||
| - VERSION: bump for 20.2.0-rc4
 | ||
| - .pick_status.json: Update to ef980ac0c1cd65993ba0c1d20e1c09b45bfef99d
 | ||
| - fix: gallivm: disable brilenear for lod bias and explicit lod.
 | ||
| - .pick_status.json: Update to a1f46d7b6943699e5efb60fbcfdd1450db85adb1
 | ||
| - amd/ac_surface: convert tabs to 3 spaces
 | ||
| - .pick_status.json: Update to 90b98c06493f8a9759e5496d5ec91fb60edf7b92
 | ||
| - .pick_status.json: Update to 472a20c5fc0feda0f074b4ff95fd7c7a6305c8cd
 | ||
| 
 | ||
| Eduardo Lima Mitev (2):
 | ||
| 
 | ||
| - freedreno: Centralize UUID generation into new files freedreno_uuid.c/h
 | ||
| - freedreno/uuid: Generate meaningful device and driver UUID
 | ||
| 
 | ||
| Elie Tournier (12):
 | ||
| 
 | ||
| - virgl: implement ARB_clear_texture
 | ||
| - virgl: Enable CAP_CLEAR_TEXTURE if host supports it
 | ||
| - docs/features: Add ARB_clear_texture to virgl
 | ||
| - gallium: add TGSI_PROPERTY_FS_BLEND_EQUATION_ADVANCED
 | ||
| - glsl_to_tgsi: Set TGSI_PROPERTY_FS_BLEND_EQUATION_ADVANCED
 | ||
| - virgl: Reserved last caps of capability_bits
 | ||
| - gallium: Add PIPE_CAP_BLEND_EQUATION_ADVANCED
 | ||
| - st: expose KHR_blend_equation_advanced if PIPE_CAP_BLEND_EQUATION_ADVANCED
 | ||
| - glsl_to_ir: do lower_blend_equation if PIPE_CAP_FBFETCH
 | ||
| - virgl: Use alpha_src_factor to store blend_equation_advenced value
 | ||
| - virgl: Encode barrier for blend_equation_advanced
 | ||
| - virgl: set PIPE_CAP_BLEND_EQUATION_ADVANCED
 | ||
| 
 | ||
| Emmanuel (3):
 | ||
| 
 | ||
| - meson: Do not enable USE_ELF_TLS for FreeBSD
 | ||
| - iris: Explicitly cast value to uint64_t
 | ||
| - i965: Explicitly cast value to uint64_t
 | ||
| 
 | ||
| Emmanuel Gil Peyrot (2):
 | ||
| 
 | ||
| - util/rand_xor: use getrandom() when available
 | ||
| - Expose EGL_KHR_platform_* when EXT is supported
 | ||
| 
 | ||
| Emmanuel Vadot (1):
 | ||
| 
 | ||
| - meson: Add versioning for xvmc tracker
 | ||
| 
 | ||
| Eric Anholt (228):
 | ||
| 
 | ||
| - freedreno/ir3: Initialize the unused dwords of the immediates consts.
 | ||
| - freedreno/ir3: Drop redundant IR3_REG_HALF setup in ALU ops.
 | ||
| - freedreno/ir3: Leave bools as 1-bit, storing them in full regs.
 | ||
| - freedreno/ir3: Set up the block predecessors for a3xx TF
 | ||
| - freedreno/ir3: Fix the a3xx TF outputs stores.
 | ||
| - freedreno/ir3: Fix register allocation assertion failures.
 | ||
| - freedreno: Stop doing binning shaders other than the VS in shader-db.
 | ||
| - freedreno/ir3: Skip tess epilogue if the program is missing stores.
 | ||
| - freedreno: Fix assertion failures on GS/tess shaders with shader-db enabled.
 | ||
| - freedreno/ir3: Remove unused half precision shader key flag.
 | ||
| - freedreno: Emit debug messages when doing draw-time recompiles of shaders.
 | ||
| - freedreno/ir3: Improve shader key normalization.
 | ||
| - freedreno/ir3: Stop initializing regid of so->outputs during setup.
 | ||
| - freedreno/ir3: Set up outputs for multi-slot varyings.
 | ||
| - freedreno: Immediately compile a default variant of shaders.
 | ||
| - freedreno/ir3: Set the FS .msaa flag to true during precompiles.
 | ||
| - freedreno/ir3: Add some more tests of cat6 disasm.
 | ||
| - freedreno/ir3: Sync some new changes from envytools.
 | ||
| - freedreno/ir3: Define the bindful uniform/nonuniform desc modes for cat6 a6xx.
 | ||
| - freedreno/ir3: Disable sin/cos range reduction for mediump.
 | ||
| - ci: Clean up setup of the job-specific env vars in baremetal testing.
 | ||
| - ci: Enable IRC flake reporting on freedreno baremetal boards.
 | ||
| - ci: Improve the flakes reports on IRC.
 | ||
| - ci: Fix the nick used in IRC reporting.
 | ||
| - freedreno: Deduplicate ringbuffer macros with computerator/fdperf
 | ||
| - freedreno: Clean up tests around ORing in the reloc flags.
 | ||
| - freedreno: Rename append_bo() in case it doesn't get inlined.
 | ||
| - freedreno: Initialize the bo's iova at creation time.
 | ||
| - freedreno: Start moving relocs flags into the BOs.
 | ||
| - freedreno: Replace OUT_RELOCD with permanently flagging shader BOs for it.
 | ||
| - freedreno: Mark all ringbuffer BOs as to be dumped on crash.
 | ||
| - freedreno: Tell the kernel that all BOs are for writing.
 | ||
| - freedreno: Replace OUT_RELOCW with OUT_RELOC.
 | ||
| - freedreno: Drop the "write" arg to emit_const_bo now relocs don't care.
 | ||
| - nir: Fix count when we didn't lower load_uniforms but did shift load_ubos.
 | ||
| - freedreno: Fix non-constbuf-upload UBO block indices and count.
 | ||
| - freedreno: Add a nohw flag to skip submitting to the kernel.
 | ||
| - freedreno: Split the fd_batch_resource_used by read vs write.
 | ||
| - freedreno: Add an early out for preparing to read a resource.
 | ||
| - freedreno: Move the resource_read early out to an inline.
 | ||
| - freedreno: Skip taking the lock for resource usage if it's already flagged.
 | ||
| - freedreno/a4xx+: Increase max texture size to 16384.
 | ||
| - freedreno/a6xx: Improve layout testcase logging for UBWC fails.
 | ||
| - freedreno/a6xx: Add a testcase for UBWC buffer sharing.
 | ||
| - freedreno: Pull the tile_alignment lookup for a layout to a helper.
 | ||
| - freedreno/a6xx: Fix UBWC blockheight for RG8.
 | ||
| - freedreno/a6xx: Fix UBWC mipmap sizing.
 | ||
| - freedreno/a6xx: Fix UBWC mipmapping height alignment.
 | ||
| - nir: Include num_ubos in the printed shader (if nonzero).
 | ||
| - freedreno/ir3: Clean up a silly nir_src_for_ssa(src.ssa).
 | ||
| - freedreno/ir3: Leave the cursor alone during ir3_nir_try_propagate_bit_shift.
 | ||
| - freedreno/ir3: Move i/o offset lowering after analyze_ubo_ranges.
 | ||
| - freedreno: Trim num_ubos to just the ones we haven't lowered to constbuf.
 | ||
| - freedreno/a6xx: Use LDC for UBO loads.
 | ||
| - freedreno: Drop the noubo fails list for CI, since there aren't any now.
 | ||
| - freedreno: Fix attempts to push UBO contents past the constlen on pre-a6xx.
 | ||
| - freedreno: Fix resource layout dump loop.
 | ||
| - freedreno: Avoid duplicate BO relocs in FD_RINGBUFFER_OBJECTs.
 | ||
| - ci: Move cross file generation to a shared script.
 | ||
| - ci: Autodetect whether we need cross setup in lava_arm builds.
 | ||
| - ci: Make cmake toolchain file for deqp cross build setup.
 | ||
| - ci: Make the create-rootfs more resilient.
 | ||
| - ci: Update versions of packages to remove from rootfses.
 | ||
| - ci: Switch the baremetal runner to be an x86 docker image.
 | ||
| - ci: Disable SMP on the a5xx boards.
 | ||
| - ci: Make a530's GLES3/31 fractional runs much more complete.
 | ||
| - freedreno/a5xx: Move resource layout to fdl.
 | ||
| - freedreno/fdl: Separate the list of a6xx testcases from the the test code.
 | ||
| - freedreno/a5xx: Add the outline of a unit test for a5xx layout.
 | ||
| - freedreno/a5xx: Set MIN_LAYERSZ on 3D textures like we do on a6xx.
 | ||
| - freedreno/a5xx: Define the 2D blit UBWC pitch fields
 | ||
| - ci: Fix DEQP_CASELIST_FILTER (used by a630 noubo run)
 | ||
| - ci: Do an explicit NIR validation-enabled pass on freedreno a630.
 | ||
| - ci: Don't forget to set NIR_VALIDATE in baremetal runs.
 | ||
| - ci: Enable a fractional run with UBO-to-constbuf disabled on a3xx.
 | ||
| - ci: Improve baremetal's logging of the job env var passthrough.
 | ||
| - freedreno/a6xx: Fix the size of buffer image views.
 | ||
| - freedreno: Fix printing of unused src in disasm of cat6 RESINFO.
 | ||
| - freedreno: Add more resinfo/ldgb testcases.
 | ||
| - freedreno: Fix resinfo asm, which doesn't have srcs besides IBO number.
 | ||
| - freedreno: Set the immediate flag in a4/a5xx resinfos.
 | ||
| - freedreno/ir3: Refactor out IBO source references.
 | ||
| - freedreno/ir3: Move handle_bindless_cat6 to compiler_nir and reuse.
 | ||
| - freedreno/ir3: Use RESINFO for a6xx image size queries.
 | ||
| - ci: Drop double ".txt" suffix on the unexpected results file.
 | ||
| - ci: Drop old comment about enabling --deqp-watchdog.
 | ||
| - ci: Auto-detect the architecture for VK ICD filenames.
 | ||
| - ci: Add DEQP_EXPECTED_RENDERER support for VK tests.
 | ||
| - ci: Move baremetal DEQP_NO_SAVE_RESULTS setup to the yml.
 | ||
| - ci: Quick exit qpa extraction for non-matching qpas.
 | ||
| - ci: Disable the firmware loader user helper option in arm64 kernels.
 | ||
| - ci: Build a cheza kernel.
 | ||
| - ci: Add scripts for controlling bare-metal chezas.
 | ||
| - ci: Switch cheza (freedreno a630) testing to baremetal.
 | ||
| - ci: Don't build an arm_test container now that the last user is gone.
 | ||
| - ci: Rename x86_cross_arm_test to just arm_test.
 | ||
| - turnip: Move vertex buffer bindings to SET_DRAW_STATE.
 | ||
| - turnip: Don't bother clamping VB size.
 | ||
| - turnip: Simplify vertex buffer bindings.
 | ||
| - turnip: Use tu_cs_emit_regs() for BLEND_CONTROL.
 | ||
| - turnip: Add support for alphaToOne.
 | ||
| - freedreno/a6xx: Add support for ALPHA_TO_ONE.
 | ||
| - freedreno: Upload gallium constbufs as needed when referenced as a UBO.
 | ||
| - freedreno/ir3: Refactor ir3_cp's lower_immed().
 | ||
| - freedreno/ir3: Stop pushing immediates once we've filled the constbuf.
 | ||
| - freedreno/ir3: Drop unnecessary alignment of pushed UBO size.
 | ||
| - freedreno/ir3: Stop shifting UBO 1 down to be UBO 0.
 | ||
| - freedreno/ir3: Account for driver params in UBO max const upload.
 | ||
| - freedreno/ir3: Drop the max_const on a6xx to 512.
 | ||
| - freedreno/ir3: Handle cases where we decide not to lower UBO 0 loads.
 | ||
| - turnip: Fix crashes in compute with no descriptors to load.
 | ||
| - ci: Bump up to the current version of the VK CTS.
 | ||
| - ci: Disable shader cache on vulkan CI runs.
 | ||
| - ci: Build the full VK CTS for baremetal testing.
 | ||
| - ci: Enable pre-merge fractional vulkan CTS runs on the turnip driver.
 | ||
| - ci: Use rsync for initial nfsroot population on cheza.
 | ||
| - turnip: Expose robustBufferAccess.
 | ||
| - freedreno/a6xx: Fix clip_halfz support.
 | ||
| - ci: Leave a note as to what might be going on with a test.
 | ||
| - ci: Fix weird filesystem globs appearing in failed test .qpa files.
 | ||
| - ci: Disable some flaky tests on turnip.
 | ||
| - ci/bare-metal: Reword the final output of the init script on the board.
 | ||
| - ci/bare-metal: Make which test to run configurable.
 | ||
| - ci/bare-metal: Use the deqp-runner bits straight out of the artifacts.
 | ||
| - ci/bare-metal: Stop fetching the git tree.
 | ||
| - ci/bare-metal: Terminate the job with an error on kernel panic.
 | ||
| - docs: Replace ancient swrast conformance docs with more current information.
 | ||
| - docs: Add dri-devel to the mailing lists and drop the DRI wiki link.
 | ||
| - ci: disable the windows tests until the runner can be stabilized again
 | ||
| - ci: Bump vulkan CTS to 1.2.3.0.
 | ||
| - ci: Enable NIR validation on a630 GLES2 and VK tests.
 | ||
| - ci/bare-metal: Skip setting of unset variables at startup.
 | ||
| - ci/bare-metal: Don't include dev packages in arm*test.
 | ||
| - ci/tracie: Print the path if the trace isn't found.
 | ||
| - ci/tracie: Fix apitrace dump using "less" which isn't in the ARM rootfs.
 | ||
| - ci: Add a freedreno a630 tracie run.
 | ||
| - freedreno/a6xx: Define the register fields for polygon fill mode.
 | ||
| - turnip: Add support for polygon fill modes.
 | ||
| - freedreno/a6xx: Add support for polygon fill mode (as long as front==back).
 | ||
| - ci: Remove a stray "always" on the freedreno traces job.
 | ||
| - ci/bare-metal: Fail early when we get stuck powering on a cheza.
 | ||
| - ci/baremetal: Bump the kernel to a recent drm-msm-fixes for msm semaphores.
 | ||
| - turnip: Do better TU_DEBUG=startup logging of drmGetDevices2() failure.
 | ||
| - turnip: Fix error handling of DRM_MSM_GEM_INFO ioctls.
 | ||
| - turnip: Properly return VK_DEVICE_LOST on queuesubmit failures.
 | ||
| - gallium/util: Add a helper function for point sprite handling.
 | ||
| - vc4: Enable PIPE_CAP_TGSI_TEXCOORD.
 | ||
| - v3d: Enable PIPE_CAP_TGSI_TEXCOORD.
 | ||
| - v3d: Fix -Wmaybe-uninitialized compiler warning in the v33 code.
 | ||
| - ci: Disable pixmark-piano trace on a630 due to GPU hangs.
 | ||
| - util: Avoid strict aliasing bugs in xxhash.
 | ||
| - util: Mark util_format_description() as a const function.
 | ||
| - softpipe: Clean up softpipe's SSBO load/store interpreting instructions.
 | ||
| - util: Remove unused util_format_planar_is_supported().
 | ||
| - etnaviv: Use the util_pack_color_union() helper.
 | ||
| - gallium/util: Fix location of the comment about S8_UINT handling.
 | ||
| - gallium/util: Clean up the Z/S tile write path.
 | ||
| - gallium/util: Move the Z/S handling to the outside of get_tile().
 | ||
| - svga: Reuse util_format_unpack_rgba().
 | ||
| - util: Merge util_format_write_4* functions.
 | ||
| - util: Merge util_format_read_4* functions.
 | ||
| - util: Use designated initializers to clean up the format tables' pack/unpack.
 | ||
| - llvmpipe: Generalize "could llvmpipe fetch this format" check in unit testing.
 | ||
| - util: Remove the stub pack/unpack functions for YUV formats.
 | ||
| - util: Share a single function pointer for the 4-byte rgba unpack function.
 | ||
| - docs: Move the current CI .rst doc to docs/ci/ and link to it from .gitlab-ci.
 | ||
| - docs: Move the conformance and the CI docs to a top level Testing section.
 | ||
| - docs: Move the gitlab-ci docs to RST.
 | ||
| - docs: Relax the expectations of HW CI farms.
 | ||
| - docs: Document how to interact with docker containers.
 | ||
| - freedreno/ir3_cmdline: Fix an uninit var warning.
 | ||
| - freedreno/ir3: Fix uninit var warning.
 | ||
| - intel: Fix release-build warnings about sf_entry_size.
 | ||
| - intel/perf: Fix unused var warning in release builds.
 | ||
| - intel/perf: Move perf query register programming to static tables.
 | ||
| - freedreno/a2xx: Fix compiler warning in disasm.
 | ||
| - meson: Enable GCing of functions and data from compilation units by default.
 | ||
| - freedreno/ir3: Fix duplicated fine derivatives instructions.
 | ||
| - freedreno/ir3: Add unit tests for derivatives disasm.
 | ||
| - ci: Use FDO_CI_CONCURRENT as our -j flags when present in the runner env.
 | ||
| - freedreno/ir3: Add a note about the instructions in the disasm test.
 | ||
| - freedreno/ir3: Add a bunch more tests for cat6 opcodes.
 | ||
| - freedreno/ir3: Refactor cat6 general dst printing.
 | ||
| - freedreno/ir3: Fix disasm of register offsets in ldp/stp.
 | ||
| - freedreno/ir3: Add missing ld_args_build_id to the ir3_delay unit test.
 | ||
| - ci: Set XDG_CACHE_HOME to tmpfs for bare-metal runners to avoid NFS.
 | ||
| - ci: Update checksums for freedreno traces.
 | ||
| - llvmpipe: Remove a bunch of default handling of pipe caps.
 | ||
| - llvmpipe: Use the default behavior of ALLOW_MAPPED_BUFFERS.
 | ||
| - softpipe: Remove a bunch of default handling of pipe caps.
 | ||
| - softpipe: Use the default behavior of ALLOW_MAPPED_BUFFERS.
 | ||
| - virgl: Remove a bunch of default handling of pipe caps.
 | ||
| - swr: Remove a bunch of default handling of pipe caps.
 | ||
| - swr: Use the default behavior of ALLOW_MAPPED_BUFFERS.
 | ||
| - svga: Remove a bunch of default handling of pipe caps.
 | ||
| - i915: Remove a bunch of default handling of pipe caps.
 | ||
| - softpipe: Refactor pipe_shader_state setup.
 | ||
| - softpipe: Convert to comma-separated SOFTPIPE_DEBUG for debug options.
 | ||
| - softpipe: Add support for reporting shader-db output.
 | ||
| - softpipe: Enable PIPE_CAP_TGSI_TEXCOORD.
 | ||
| - softpipe: Enable PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS;
 | ||
| - ci/bare-metal: Capture the first devcoredump a job produces.
 | ||
| - drm-shim: Return -EINVAL instead of abort()ing on unknown ioctls.
 | ||
| - docs: Explain how to set up a personal gitlab runner.
 | ||
| - nir: Add a pass to cut the trailing ends of vectors.
 | ||
| - i965: Enable vector shrinking in the vec4 backend.
 | ||
| - amd: Swap from nir_opt_shrink_load() to nir_opt_shrink_vectors().
 | ||
| - nir: Remove the old nir_opt_shrink_load.
 | ||
| - freedreno: Fix "Offset of packed bitfield changed" warnings:
 | ||
| - nir/lower_amul: Use num_ubos/ssbos instead of recomputing it.
 | ||
| - nir: Add a little more docs about NIR's constant_data.
 | ||
| - nir: Print the constant data size associated with a shader.
 | ||
| - freedreno/ir3: Fix the type of half-float indirect uniform loads.
 | ||
| - freedreno/a6xx: Document the bit for the magic 32bit-uniforms-as-16b mode.
 | ||
| - freedreno/computerator: Set SP_MODE_CONTROL to the same value as vulkan/GL
 | ||
| - freedreno/ir3: Merge the redundant immediate_idx/immediates_count fields
 | ||
| - freedreno/ir3: Simpify the immediates from an array of vec4 to array of dwords.
 | ||
| - freedreno: Rename emit_const_bo() to emit_const_ptrs().
 | ||
| - freedreno: Split ir3_const's user buffer and indirect upload APIs.
 | ||
| - freedreno/ir3: Clean up instrlen setup.
 | ||
| - freedreno: Increase the NUM_UNIT on compute's consts in indirect dispatch.
 | ||
| - freedreno: Add more asserts for DST_OFF/NUM_UNIT in indirect const uploads.
 | ||
| - freedreno/ir3: Fix assertion failures dumping CS high full regs.
 | ||
| - turnip: Make sure we include the build id.
 | ||
| - gallium/tgsi_exec: Fix up NumOutputs counting
 | ||
| - freedreno: Make the pack struct have a .qword for wide addresses.
 | ||
| - turnip: Fix truncation of CS shader iovas to 32 bits.
 | ||
| - turnip: Fix truncation of iovas to 32 bits in queries.
 | ||
| 
 | ||
| Eric Engestrom (146):
 | ||
| 
 | ||
| - cut 20.1 branch
 | ||
| - docs: update calendar for 20.1.0-rc2
 | ||
| - post_version.py: fix branch name construction for release candidates
 | ||
| - post_version.py: invert `is_point` into `is_first_release` to make its purpose clearer
 | ||
| - post_version.py: stop adding release candidates to the index and relnotes
 | ||
| - docs: update calendar for 20.1.0-rc3
 | ||
| - gitlab-ci: exclude scripts that don't affect the build
 | ||
| - util/rand_xor: make it clear that {,s_}rand_xorshift128plus take *exactly 2* uint64_t
 | ||
| - util/rand_xor: drop unused header
 | ||
| - util/rand_xor: fallback Linux to time-based instead of fixed seed
 | ||
| - util/rand_xor: extend the urandom path to all non-Windows platforms
 | ||
| - docs: update calendar for 20.1.0-rc4
 | ||
| - anv: pass the fd directly to anv_gem_reg_read()
 | ||
| - anv: replace magic `| 1` with already #define'd name
 | ||
| - anv: disable VK_EXT_calibrated_timestamps when the timestamp register is unreadable
 | ||
| - git_sha1_gen.py: fix out-of-date comment
 | ||
| - git_sha1_gen.py: fix code style
 | ||
| - git_sha1_gen.py: fix whitespace
 | ||
| - compiler: delete leftover autotools test wrapper
 | ||
| - no_extern_c.h: fix typo in comment
 | ||
| - tree-wide: fix deprecated GitLab URLs
 | ||
| - docs: drop no-longer-relevant comment about bugzilla
 | ||
| - docs: Add release notes for 20.1.0
 | ||
| - docs: update calendar, add news item, and link releases notes for 20.1.0
 | ||
| - meson: remove "empty array"/"array of an empty string" confusion
 | ||
| - glapi: remove deprecated .getchildren() that has been replace with an iterator
 | ||
| - intel/genxml: drop sort_xml.sh and move the loop directly in gen_sort_tags.py
 | ||
| - intel: fix gen_sort_tags.py
 | ||
| - docs: Add release notes for 20.1.1
 | ||
| - docs: update calendar, add news item, and link releases notes for 20.1.1
 | ||
| - v3d: add missing unlock() in error path
 | ||
| - intel/genxml: drop python 2 support for gen_sort_tags.py
 | ||
| - intel/genxml: replace gen_sort_tags.py MIT licence with SPDX equivalent
 | ||
| - docs: update the blocks of unused EGL enums assigned to us
 | ||
| - i965: drop dead #include "config.h"
 | ||
| - iris: drop dead #include "config.h"
 | ||
| - gen_release_notes.py: update script to the new rST way of things
 | ||
| - post_version.py: update script to the new rST way of things
 | ||
| - intel/tools: rewrite run-test.sh in python
 | ||
| - intel/tools: make test aware of the meson test wrapper
 | ||
| - khronos-update.py: add script to simplify update of Khronos headers & xml files
 | ||
| - docs: remove plain-text copy of versions.rst
 | ||
| - util/os_file: replace broken windows-detection code with detect_os.h
 | ||
| - util: introduce os_dupfd_cloexec() helper
 | ||
| - replace all F_DUPFD_CLOEXEC with os_dupfd_cloexec()
 | ||
| - vulkan/wsi: replace all dup() with os_dupfd_cloexec()
 | ||
| - radv: replace all dup() with os_dupfd_cloexec()
 | ||
| - anv: replace all dup() with os_dupfd_cloexec()
 | ||
| - iris: replace all dup() with os_dupfd_cloexec()
 | ||
| - i965: replace all dup() with os_dupfd_cloexec()
 | ||
| - egl: replace all dup() with os_dupfd_cloexec()
 | ||
| - etnaviv: replace all dup() with os_dupfd_cloexec()
 | ||
| - freedreno: replace all dup() with os_dupfd_cloexec()
 | ||
| - svga: replace all dup() with os_dupfd_cloexec()
 | ||
| - virgl: replace all dup() with os_dupfd_cloexec()
 | ||
| - docs: publish our release maintainers' keys
 | ||
| - docs: remind release maintainers to sign the tarballs and publish their key
 | ||
| - docs: suggest alternative installation methods for meson
 | ||
| - docs: stop considering `Cc: mesa-stable` as an email address
 | ||
| - docs: reword "sending a patch revision" to "updating a merge request"
 | ||
| - docs: drop `git sendemail` instructions
 | ||
| - docs: prefer `Fixes:` over `Cc: mesa-stable`
 | ||
| - docs: add some formatting to the "backport merge request" option
 | ||
| - docs: reword a sentence a bit
 | ||
| - docs: make it clear that the tags needs to be in the commit message
 | ||
| - docs: move `Fixes:` tag explanation to its own section
 | ||
| - docs: move "stable" tag explanation next to `Fixes:`
 | ||
| - driconf: drop 28% catalan translation
 | ||
| - driconf: drop 15% german translation
 | ||
| - driconf: drop 26% spanish translation
 | ||
| - driconf: drop 6% french translation
 | ||
| - driconf: drop 8% dutch translation
 | ||
| - driconf: drop 9% swedish translation
 | ||
| - driconf: drop now unused translation facility
 | ||
| - util: rename xmlpool.h to driconf.h
 | ||
| - gitlab-ci: drop gettext from the build images
 | ||
| - docs: drop deleted file from extra sphinx files
 | ||
| - docs: cat maintainer keys to a single file
 | ||
| - docs: add some padding to the release calendar
 | ||
| - docs: add planning for 20.2
 | ||
| - bin/symbols-check: explain C++ symbols workaround
 | ||
| - docs: Add release notes for 20.1.2
 | ||
| - docs: update calendar and link releases notes for 20.1.2
 | ||
| - docs: fix 20.1.2 relnotes
 | ||
| - docs: add a page explaining the GitLab CI and the Intel CI
 | ||
| - mesa/glformats: make _mesa_gles_error_check_format_and_type() more consistent
 | ||
| - docs: add release notes for 20.1.3
 | ||
| - docs: update calendar and link releases notes for 20.1.3
 | ||
| - docs: fix a bunch of typos
 | ||
| - egl: always compile surfaceless
 | ||
| - vulkan: automatically compile the `display` platform when available
 | ||
| - meson: move xlib-lease block further down
 | ||
| - egl: automatically compile the `drm` platform when available
 | ||
| - introduce `commit_in_branch.py` script to help devs figure this out
 | ||
| - bin/gen_release_notes.py: drop new_features.txt when we release XX.Y.0
 | ||
| - egl/wayland: add missing newline between functions
 | ||
| - glx: drop always-true #ifdef
 | ||
| - docs/submittingpatches: add more than one `Cc: mesa-stable` example to the examples list
 | ||
| - meson/intel: add missing dep on git_sha1.h
 | ||
| - meson: fix android vulkan build
 | ||
| - egl: inline fallback for create_pixmap_surface
 | ||
| - egl: inline fallback for create_pbuffer_surface
 | ||
| - egl: drop unused fallback function
 | ||
| - egl: inline fallback for swap_buffers_with_damage
 | ||
| - egl: inline fallback for swap_buffers_region
 | ||
| - egl: inline fallback for post_sub_buffer
 | ||
| - egl: inline fallback for copy_buffers
 | ||
| - egl: inline fallback for query_buffer_age
 | ||
| - egl: inline fallback for create_wayland_buffer_from_image
 | ||
| - egl: inline fallback for get_sync_values
 | ||
| - egl: drop now empty egl_dri2_fallbacks.h
 | ||
| - egl: mark the rest of the callbacks as mandatory or optional
 | ||
| - egl: inline _EGLAPI into _EGLDriver
 | ||
| - docs: add release notes for 20.1.4
 | ||
| - docs: update calendar and link releases notes for 20.1.4
 | ||
| - post_version.py: don't generate relnotes twice
 | ||
| - post_version.py: drop incorrect conf.py changes
 | ||
| - post_version.py: stop using non-existent functions and fix commit message
 | ||
| - post_version.py: update the files in the current worktree, not the one with the script that we run
 | ||
| - post_version.py: fix relnotes links
 | ||
| - bin/gen_release_notes: automatically commit release notes
 | ||
| - docs/releasing: improve wording
 | ||
| - bin/khronos-update: having a folder in include/ is not a requirement
 | ||
| - bin/khronos-update: add support for the SPIRV files
 | ||
| - bin/khronos-update: add workaround for python bug 9625
 | ||
| - egl: replace _eglInitDriver() with a simple variable
 | ||
| - egl: drop unnecessary _eglGetDriver()
 | ||
| - egl: fix _eglMatchDriver() return type
 | ||
| - egl: inline _eglMatchAndInitialize() and refactor _eglMatchDriver()
 | ||
| - egl: rename _eglMatchDriver() to _eglInitializeDisplay()
 | ||
| - egl: drop left-over function prototype
 | ||
| - egl: const _eglDriver
 | ||
| - egl/haiku: drop overwritten preset of EGL version
 | ||
| - egl: consistently use dri2_egl_display() helper macro
 | ||
| - meson: fix `-D xlib-lease=auto` detection
 | ||
| - docs: add release notes for 20.1.5
 | ||
| - docs: update calendar and link releases notes for 20.1.5
 | ||
| - pick-ui: specify git commands in "resolve cherry pick" message
 | ||
| - egl/entrypoint-check: split sort-check into a function
 | ||
| - egl/entrypoint-check: add check that GLVND and plain EGL have the same entrypoints
 | ||
| - driconf: fix force_gl_vendor description
 | ||
| - meson: bump required glvnd version
 | ||
| - egl/x11_dri3: enable & require xfixes 2.0
 | ||
| - egl/x11_dri3: implement EGL_KHR_swap_buffers_with_damage
 | ||
| - meson: don't advertise TLS support if glx wasn't build with it
 | ||
| - meson: drop leftover PTHREAD_SETAFFINITY_IN_NP_HEADER
 | ||
| 
 | ||
| Erico Nunes (16):
 | ||
| 
 | ||
| - lima/ppir: introduce liveness internal live set
 | ||
| - lima/ppir: fix lod bias register codegen
 | ||
| - lima/ppir: do not assume single src for pipeline outputs
 | ||
| - lima/ppir: combine varying loads in node_to_instr
 | ||
| - lima/ppir: duplicate intrinsics in nir
 | ||
| - lima/ppir: duplicate consts in nir
 | ||
| - lima/ppir: remove unused clone functions
 | ||
| - lima/ppir: rework emit nir to ppir
 | ||
| - lima/ppir: rework store output
 | ||
| - lima/ppir: add fallback mov option for const scheduler
 | ||
| - lima/ppir: rework select conditions
 | ||
| - lima/ppir: handle failures on all ppir_emit_cf_list paths
 | ||
| - lima/ppir: improve handling for successors in other blocks
 | ||
| - lima/ppir: rework tex lowering
 | ||
| - lima/ppir: optimize tex loads with single successor
 | ||
| - lima/ppir: use a ready list in node_to_instr
 | ||
| 
 | ||
| Erik Faye-Lund (124):
 | ||
| 
 | ||
| - compiler/nir: move tan-calculation to helper
 | ||
| - vtn/opencl: add native_tan-support
 | ||
| - vtn/opencl: native variants of sin/cos
 | ||
| - vtn/opencl: native divide support
 | ||
| - vtn/opencl: native powr support
 | ||
| - vtn/opencl: native recip support
 | ||
| - vtn/opencl: native rsqrt support
 | ||
| - vtn/opencl: native sqrt support
 | ||
| - compiler/glsl: explicitly store NumUniformBlocks
 | ||
| - mesa/st: consider NumUniformBlocks instead of num_ubos when binding
 | ||
| - zink: use nir_lower_uniforms_to_ubo
 | ||
| - zink: lower b2b to b2i
 | ||
| - util/os_memory: never use os_memory_debug.h
 | ||
| - st/wgl: pass st_context_iface into stw_st_framebuffer_present_locked
 | ||
| - st/wgl: allocate and resolve msaa-textures
 | ||
| - docs/features: add zink features
 | ||
| - zink: load vk_GetMemoryFdKHR while creating screen
 | ||
| - zink: add a GET_PROC_ADDR macro to simplify load_device_extensions
 | ||
| - docs/features: mark GL_NV_conditional_render as done for zink
 | ||
| - zink: disable vkCmdResolveImage when respecting render-condition
 | ||
| - zink: do not expose real value for PIPE_CAP_MAX_VIEWPORTS
 | ||
| - zink: correct PIPE_SHADER_CAP_MAX_SHADER_IMAGES
 | ||
| - zink: mark depth-component cube-maps as done
 | ||
| - zink: implement i2b1
 | ||
| - docs: fix broken release-calendar
 | ||
| - zink: hammer in an explicit wait when retrieving buffer contents for reading
 | ||
| - zink: use samples from state
 | ||
| - zink: do not dig into resource for nr_samples
 | ||
| - zink: pass batch instead of context for queries
 | ||
| - zink: implement nir_texop_txf_ms
 | ||
| - zink: expose PIPE_CAP_TEXTURE_MULTISAMPLE
 | ||
| - docs/features: mark GL_ARB_texture_multisample as done for zink
 | ||
| - zink: use general-layout when blitting to/from same resource
 | ||
| - zink: Use store_dest_raw instead of storing an uint
 | ||
| - nir: reuse existing psiz-variable
 | ||
| - zink: emulate B8G8R8X8_SRGB with B8G8R8A8_SRGB
 | ||
| - zink: assert that image-view format isn't undefined
 | ||
| - zink: only report device-local memory as video-memory
 | ||
| - gallium/hud: do not specify potentially invalid depth-range
 | ||
| - TEMP: add rst-conversion scripts
 | ||
| - docs: convert articles to reructuredtext
 | ||
| - TEMP: remove rst-conversion scripts
 | ||
| - docs: delete no longer needed file
 | ||
| - docs: fixup botched table
 | ||
| - docs: escape double colons
 | ||
| - docs: escape asterisks
 | ||
| - docs: escape trailing underscores properly
 | ||
| - docs: fixup broken rst
 | ||
| - docs: fixup heading-levels
 | ||
| - docs: use sphinx
 | ||
| - docs: disable syntax-highlighting by default
 | ||
| - docs: use code-block with caption instead of table
 | ||
| - docs: format notes as rst-notes
 | ||
| - docs: use code-blocks
 | ||
| - docs: drop open-coded toc for articles
 | ||
| - docs: add xlibdriver to table-of-contents
 | ||
| - docs: do not copy source-files to site
 | ||
| - docs: use rst footnotes instead of manual ones
 | ||
| - docs: reformat license table as rst table
 | ||
| - docs: use rst-note for highlighted text
 | ||
| - docs: bundle extra files
 | ||
| - docs: include specs into the generated docs
 | ||
| - gitlab-ci: build and deploy docs
 | ||
| - docs: drop news in favour of the introduction as index-page
 | ||
| - README: update references to internal docs
 | ||
| - docs: update internal references
 | ||
| - docs/relnotes: update internal references
 | ||
| - radv: update internal reference
 | ||
| - bin/perf-annotate-jit.py: update internal reference
 | ||
| - docs/release-calendar: restore missing id
 | ||
| - nir: do not try to merge xfb-outputs
 | ||
| - Revert "gallium/hud: don't use user vertex buffers"
 | ||
| - gallium/hud: don't use user vertex buffers
 | ||
| - zink: enable cull-distance if supported
 | ||
| - zink: expose GLSL 1.30
 | ||
| - docs: update internal references
 | ||
| - docs/relnotes: update internal references
 | ||
| - docs: fixup relnotes after rst-conversion
 | ||
| - docs/features: mark GL3 as complete for zink
 | ||
| - docs/features: update ARB_texture_buffer_object line
 | ||
| - docs/features: remove driver-list for forward-compatible context
 | ||
| - mesa/main: fix inverted condition
 | ||
| - gallium/os: call "ANSI" version of GetCommandLine
 | ||
| - graw/gdi: do not depend on UNICODE macro
 | ||
| - gallium/util: limit STACK_LEN on Windows
 | ||
| - gallium/util: add missing include
 | ||
| - docs: update favicon
 | ||
| - docs: remove non-existent reference
 | ||
| - docs: restore accidentally dropped labels
 | ||
| - docs: fix internal references
 | ||
| - docs: use ref-links for internal references
 | ||
| - gallium/docs: update to recent sphinx
 | ||
| - gallium/docs: fixup formatting of numbered lists
 | ||
| - gallium/docs: remove reference to non-existent label
 | ||
| - gallium/docs: use none for highlight_language
 | ||
| - gallium/docs: prefix exts dir with underscore
 | ||
| - gallium/docs: remove non-existent static dir
 | ||
| - gallium/docs: remove unused imgmath extension
 | ||
| - ci: only build docs in the upstream-repo
 | ||
| - ci: only build docs if any docs changed
 | ||
| - ci: test docs for non-master builds
 | ||
| - ci: move deploy-stage later in the pipeline
 | ||
| - ci: move test-docs to container stage
 | ||
| - ci: add graphviz to the .docs-base template
 | ||
| - merge gallium docs into main docs
 | ||
| - docs: clean up gallium index-file
 | ||
| - docs: add an extension to generate redirects
 | ||
| - docs: move gallium specific docs into gallium folder
 | ||
| - docs: use svg for graphviz output
 | ||
| - docs: fixup envvar output
 | ||
| - zink: expose depth-clip if supported
 | ||
| - mesa/main: factor out one-time-init into a helper
 | ||
| - mesa/main: use call_once instead of open-coding
 | ||
| - gallium/util: do not use _MTX_INITIALIZER_NP on Windows
 | ||
| - mesa/main: use p_atomic_inc_return instead of locking
 | ||
| - mesa: do not use bitfields for advanced-blend state
 | ||
| - mesa: treat Color._AdvancedBlendMode as enum
 | ||
| - zink: use ralloc in nir-to-spirv
 | ||
| - zink: use ralloc for plain malloc-calls
 | ||
| - zink: pass mem_ctx to ralloc_size-call
 | ||
| - zink: use ralloc for spirv_builder as well
 | ||
| - mesa/program: fix shadow property for samplers
 | ||
| - docs: add some very basic documentation about zink
 | ||
| - mesa: handle GL_FRONT after translating to it
 | ||
| 
 | ||
| Francisco Jerez (23):
 | ||
| 
 | ||
| - intel/ir: Update performance analysis parameters for memory fence codegen changes.
 | ||
| - iris: Simplify iris_batch_prepare_noop().
 | ||
| - iris: Extend iris_context dirty state flags to 128 bits.
 | ||
| - iris: Add batch-local synchronization book-keeping to iris_bo.
 | ||
| - iris: Add infrastructure to partition batch into sync boundaries.
 | ||
| - iris: Bracket batch operations which access memory within sync regions.
 | ||
| - iris: Annotate all BO uses with domain and sequence number information.
 | ||
| - iris: Drop redundant iris_address::write flag.
 | ||
| - iris: Report use of any in-flight buffers on first draw call after sync boundary.
 | ||
| - iris: Introduce cache coherency matrix for batch-local memory ordering.
 | ||
| - iris: Update cache coherency matrix on PIPE_CONTROL.
 | ||
| - iris: Implement buffer-local memory barrier based on cache coherency matrix.
 | ||
| - iris: Insert buffer barrier in existing cache flush helpers.
 | ||
| - iris: Remove batch argument of iris_resource_prepare_access() and friends.
 | ||
| - iris: Perform compute predraw flushes from compute batch.
 | ||
| - iris: Remove depth cache set tracking and synchronization.
 | ||
| - iris: Remove render cache hash table-based synchronization.
 | ||
| - iris: Open-code iris_cache_flush_for_read() and iris_cache_flush_for_depth().
 | ||
| - iris: Emit single render target flush PIPE_CONTROL on format mismatch.
 | ||
| - iris: Remove iris_flush_depth_and_render_caches().
 | ||
| - OPTIONAL: iris: Perform BLORP buffer barriers outside of iris_blorp_exec() hook.
 | ||
| - iris/icl+: Report same caching domain as main surface for clear color BO.
 | ||
| - intel/ir/gen12+: Work around FS performance regressions due to SIMD32 discard divergence.
 | ||
| 
 | ||
| Frank Binns (2):
 | ||
| 
 | ||
| - docs: change "Fixes:" tag example to match git fixes output
 | ||
| - egl/dri2: only take a dri2_dpy reference when binding a new context/surfaces
 | ||
| 
 | ||
| Frédéric Bonnard (2):
 | ||
| 
 | ||
| - clover: Fix types collision between c++ and altivec
 | ||
| - meson: Revert commit overriding C++ standard with gnu++11 on ppc64el
 | ||
| 
 | ||
| Gert Wollny (66):
 | ||
| 
 | ||
| - r600: Annotate some case fallthroughs
 | ||
| - r600: remove unused static functions
 | ||
| - r600/sb: replace memset by using member initialization/assignment
 | ||
| - r600: remove some unused variables to silence warnings
 | ||
| - r600: Fix warning regarding mixing enums and unsigned in ?: expression
 | ||
| - r600: Fix nir compiler options, i.e. don't lower IO to temps for TESS
 | ||
| - r600/sfn: Unify semantic name and index query and use TEXCOORD semantic
 | ||
| - r600/sfn: Fix printing vertex fetch instruction flags
 | ||
| - r600: Lower int64 ops from TGSI-to-NIR shaders too
 | ||
| - r600: Lower lerp after tgsi_to_nir
 | ||
| - r600: Add support for loading index register from other than chan X
 | ||
| - r600/sfn: Handle CF index loading from non-X channel
 | ||
| - r600/sfn: rework getting a vector and uniforms from the value pool
 | ||
| - r600/sfn: Skip move instructions if they are only ssa and without modifiers
 | ||
| - r600/sfn: re-use an allocated register in lookup
 | ||
| - r600/sfn: skip copying LOD if the target register is is the same
 | ||
| - r600/sfn: Fix memring print output
 | ||
| - r600/sfn: Fix RING instruction assembly emission
 | ||
| - r600/sfn: Fix GDS assembly emission
 | ||
| - r600/sfn: Fix RAT instruction assembly emission
 | ||
| - r600/sfn: Make allocate_reserved_registers forward to a virtual function
 | ||
| - r600/sfn: Fix handling of output register index
 | ||
| - r600/sfn: Make 3vec loads skip possible moves
 | ||
| - r600/sfn: Add support for viewport index output
 | ||
| - r600/sfn: Take FOGC, and backcolors into account im GS outputs
 | ||
| - r600/sfn: Handle loading sample_pos
 | ||
| - r600/sfn: Add FS output sample_mask
 | ||
| - r600/sfn: Don't reject VARYING_SLOT_PCNT
 | ||
| - r600/sfn: remove pointless check
 | ||
| - r600/sfn: assert when alu dest is missing
 | ||
| - r600/sfn: support indirect sampler buffer reads.
 | ||
| - r600/sfn: Add support for texture_samples
 | ||
| - r600/sfn: use the per shader atomic base
 | ||
| - r600/sfn: SSBO: Fix query of dest components
 | ||
| - r600/sfn: Fix clip vertex output as possible stream variable
 | ||
| - r600/sfn: Fix splitting constants that come from different kcache banks.
 | ||
| - r600/sfn: Don't reorder outputs by location
 | ||
| - r600/sfn: Fix printing ALU op without dest
 | ||
| - r600: Fix duplicated subexpression in r600_asm.c
 | ||
| - r600/sfn: Fix mapping for f32tof64 and f64tof32
 | ||
| - r600/sfn: use modern c++ in printing LDS read instruction
 | ||
| - r600/sfn: Correctly update the number of literals when forcing a new group
 | ||
| - r600/sfn: remove debug output leftover
 | ||
| - nir: lower_tex: Don't normalize coordinates for TXF with RECT
 | ||
| - r600/sfn: lower image derefs
 | ||
| - r600/sfn: Add imageio support
 | ||
| - r600/sfn: Add support for image_size
 | ||
| - r600/sfn: Add support for reading cube image array dim.
 | ||
| - r600/sfn: Take SSBO buffer ID offset into account
 | ||
| - r600/sfn: Handle memory_barrier
 | ||
| - r600/sfn: Add lowering pass for shared IO
 | ||
| - r600/sfn: Add support for shared atomics
 | ||
| - r600/sfn: Don't set num_components on TESS sysvalue intrinsics
 | ||
| - r600/sfn: lower rotate ALU ops
 | ||
| - r600/sfn: Pipe through requesting a register at a given channel
 | ||
| - r600/sfn: emit texture instructions in one block
 | ||
| - r600/sfn: Add option to get a temp value for a specific channel
 | ||
| - r600/sfn: correct handling of loading vec4 with fetching constants
 | ||
| - r600/sfn: Add a forced output swizzle for depth write
 | ||
| - r600/sfn: Fix Ring output swizzle masks
 | ||
| - r600/sfn: Fix default z swizzle for GDS instructions
 | ||
| - r600: Add shader key item to identify when the sample mask should be used
 | ||
| - r600/sfn: Only use sample mask if the according shader key is set
 | ||
| - r600/sfn: Make the pin_to_channel generic
 | ||
| - d600/sfn: write stream outputs to correct mem ring
 | ||
| - gallivm/nir: Lower uniforms to UBOs in llvm draw if the driver didn't request this already
 | ||
| 
 | ||
| Greg V (1):
 | ||
| 
 | ||
| - gallium,util: undef ALIGN on FreeBSD to prevent name clash
 | ||
| 
 | ||
| Guido Günther (2):
 | ||
| 
 | ||
| - etnaviv: drm: Use NSEC_PER_SEC
 | ||
| - etnaviv: drm: Normalize nano seconds
 | ||
| 
 | ||
| Gurchetan Singh (1):
 | ||
| 
 | ||
| - virgl: apply bgra dest swizzle and add Portal 2
 | ||
| 
 | ||
| Hanno Böck (1):
 | ||
| 
 | ||
| - Properly check mmap return value
 | ||
| 
 | ||
| Hyunjun Ko (6):
 | ||
| 
 | ||
| - freedreno,tu: Don't request fragcoord components not being read.
 | ||
| - tu,radv: fix potentially wrong offset of flexible array.
 | ||
| - vulkan: Adds helpers for vk_object (de)alloation and (de)initialization.
 | ||
| - tu: Fix wrong copies of sampler descriptor.
 | ||
| - turnip: Use the common base object type and struct.
 | ||
| - turnip: implement VK_EXT_private_data
 | ||
| 
 | ||
| Iago Toral Quiroga (7):
 | ||
| 
 | ||
| - v3d/compiler: don't rewrite unused temporaries to point to NOP register
 | ||
| - v3d/compiler: fix spill offset
 | ||
| - v3d/compiler: fix image size for 1D arrays
 | ||
| - nir/lower_clip: make the pass compatible with Vulkan semantics
 | ||
| - v3d/compiler: handle compact varyings
 | ||
| - v3d/compiler: request fragment shader clip lowering to be vulkan compatible.
 | ||
| - nir/lower_tex: skip lower_tex_packing for the texture samples query
 | ||
| 
 | ||
| Ian Romanick (24):
 | ||
| 
 | ||
| - nir/algebraic: Recognize open-coded byte or word extract from bfe
 | ||
| - nir/algebraic: Split ibfe and ubfe with two constant sources
 | ||
| - nir/algebraic: Optimize some bfe patterns
 | ||
| - nir/algebraic: Optimize ushr of pack_half, not ishr
 | ||
| - nir/algebraic: Add some half packing optimizations for pack_half_2x16_split
 | ||
| - nir/algebraic: Eliminate useless extract before unpack
 | ||
| - i965: Assert that blorp always handles color blits
 | ||
| - meta: Make _mesa_meta_texture_object_from_renderbuffer static
 | ||
| - meta: Make _mesa_meta_setup_sampler static
 | ||
| - meta: Remove support for clearing integer buffers
 | ||
| - mesa: Add matrix utility functions to load matrices
 | ||
| - mesa: Add function to calculate an orthographic projection
 | ||
| - meta: Stop frobbing MatrixMode
 | ||
| - meta: Use same vertex coordinates for GLSL and FF clears
 | ||
| - meta: Coalesce the GLSL and FF paths in meta_clear
 | ||
| - meta: Remove support for multisample blits
 | ||
| - anv/tests: Don't rely on assert or changing NDEBUG in tests
 | ||
| - anv/tests: Silence unused parameter warnings in main
 | ||
| - anv: Silence unused parameter warning in anv_image_get_clear_color_addr
 | ||
| - intel: Silence unused parameter warning in __intel_log_use_args
 | ||
| - intel/drm-shim: Add noop ioctl handler for set_tiling
 | ||
| - intel/drm-shim: Return correct values for I915_PARAM_HAS_ALIASING_PPGTT
 | ||
| - glsl: Remove integer matrix support from ir_dereference_array::constant_expression_value
 | ||
| - nir/algebraic: Don't distrubte absolute-value into dot-products
 | ||
| 
 | ||
| Icecream95 (78):
 | ||
| 
 | ||
| - pan/midgard: Fix old style shadows
 | ||
| - panfrost: Fix background showing when using discard
 | ||
| - panfrost: Enable PIPE_CAP_VERTEX_COLOR_UNCLAMPED
 | ||
| - panfrost: Decode AFBC flag bits
 | ||
| - panfrost: Only use AFBC YTR with RGB and RGBA
 | ||
| - pan/midgard: Use a signed value for checking inline constants
 | ||
| - Revert "panfrost: Keep cached BOs mmap'd"
 | ||
| - panfrost: Mark PIPE_BUFFER BOs as not renderable
 | ||
| - pan/mdg: Add a macro for printing instruction source information
 | ||
| - pan/mdg: Move r1.w writeout to branch->dest
 | ||
| - pan/mdg: Remove old zs store lowering
 | ||
| - pan/mdg: Remove old depth writeout code
 | ||
| - pan/mdg: Remove writeout case from bytemask_of_read_components
 | ||
| - nir: Replace the zs_output_pan intrinsic with combined_output_pan
 | ||
| - pan/mdg: Replace writeout booleans with a single value
 | ||
| - pan/mdg: Add new depth writeout code
 | ||
| - pan/mdg: Move search_var to earlier in midgard_compile.c
 | ||
| - pan/mdg: Add depth/stencil support to emit_fragment_store
 | ||
| - pan/mdg: Add new depth store lowering
 | ||
| - pan/mdg: Print writeout sources in mir_print_instruction
 | ||
| - panfrost: Add writes_stencil to the EARLY_Z disable list
 | ||
| - panfrost: Move sampler view bo creation to a separate function
 | ||
| - panfrost: Create a new sampler view bo when the layout changes
 | ||
| - panfrost: Tiled to linear layout conversion
 | ||
| - panfrost: Clean up panfrost_frag_meta_rasterizer_update
 | ||
| - panfrost: Implement ARB_depth_clamp
 | ||
| - pan/decode: Fix helper invocations when tracing
 | ||
| - pan/decode: Add missing wrap modes
 | ||
| - pan/mdg: Fix max_comp calculation for constant printing
 | ||
| - panfrost: RGBA4 and RGB5_A1 framebuffer support
 | ||
| - panfrost: Update sampler views when the texture bo changes
 | ||
| - panfrost: Copy resources when mapping to avoid waiting for readers
 | ||
| - panfrost: Only copy resources when they are in a pending batch
 | ||
| - panfrost: Add PAN_MESA_DEBUG=gl3 flag
 | ||
| - panfrost: Do fine-grained flushing for occlusion query results
 | ||
| - pan/mdg: Vectorize vlut operations
 | ||
| - pan/decode: Make mapped memory read-only while decoding
 | ||
| - nir: Add a base value to load_raw_output_pan
 | ||
| - panfrost: Fix MALI_READS_TILEBUFFER
 | ||
| - pan/mdg: Handle tilebuffer wait loops
 | ||
| - pan/mdg: Use the writeout tag for tilebuffer wait loops
 | ||
| - panfrost: Add rt formats to shader state
 | ||
| - panfrost: Add a bitset of render targets read by shaders
 | ||
| - pan/mdg: Do the pan_lower_framebuffer pass later
 | ||
| - pan/mdg: Emit a tilebuffer wait loop when needed
 | ||
| - pan/mdg: Handle non-blend framebuffer lowering
 | ||
| - pan/mdg: Support MRT in output load lowering
 | ||
| - pan/mdg: Set the z/s store intrinsic base correctly
 | ||
| - pan/mdg: Use a 32-bit ld_color_buffer op when needed
 | ||
| - panfrost: Implement texture_barrier
 | ||
| - panfrost: Stop keying on rt format when using native loads
 | ||
| - panfrost: Use f2fmp for framebuffer lowering conversions
 | ||
| - panfrost: Enable framebuffer fetch
 | ||
| - pan/mdg: Fix non-debug compiliation
 | ||
| - compiler: Add dual-source factors to blend_factor
 | ||
| - gallium: Dual source support in blend_factor_to_shader
 | ||
| - pan/mdg: Add a nir pass to reorder store_output intrinsics
 | ||
| - pan/mdg: Dual source blend input/writeout support
 | ||
| - pan/mdg: Skip z/s combining for dual-source writes
 | ||
| - panfrost: Dual source blend support
 | ||
| - pan/decode: Open the dump file later
 | ||
| - pan/mdg: Don't disassemble blit shaders
 | ||
| - panfrost: Rename lower_store to is_blend in pan_lower_framebuffer
 | ||
| - pan/mdg: Do per-sample framebuffer loads
 | ||
| - panfrost: Do per-sample shading when outputs are read
 | ||
| - nir: Add a face_sysval argument to nir_lower_two_sided_color
 | ||
| - nir: Fix lower_two_sided_color when the face is an input
 | ||
| - panfrost: Report TEXTURE_BUFFER_OBJECTS cap when gl3 flag set
 | ||
| - panfrost: Set depth_enabled when stencil is enabled
 | ||
| - nir: Set the alignment for SSBO lowering
 | ||
| - panfrost: Make panfrost_bo_wait take a wait_readers bool
 | ||
| - panfrost: Fix calls to panfrost_flush_batches_accessing_bo
 | ||
| - panfrost: Fake RGTC support
 | ||
| - panfrost: Use more tilebuffer sizes
 | ||
| - panfrost: 8x MRT support
 | ||
| - pan/mdg: Use the blend RT for blend shader framebuffer fetches
 | ||
| - panfrost: Allow PIPE_TEXTURE_1D_ARRAY textures
 | ||
| - pan/mdg: Fix spilling of non-32-bit types
 | ||
| 
 | ||
| Icenowy Zheng (1):
 | ||
| 
 | ||
| - panfrost: signal syncobj if nothing is going to be flushed
 | ||
| 
 | ||
| Ilia Mirkin (14):
 | ||
| 
 | ||
| - freedreno/a3xx: there's no r8i/ui rb format, only rg8i/rg8ui
 | ||
| - freedreno/a3xx: reinstate rgb10_a2ui texture format
 | ||
| - freedreno/ir3: avoid applying (sat) on bary.f
 | ||
| - freedreno/a3xx: fix const footprint
 | ||
| - freedreno: fix off-by-one in assertions checking for const sizes
 | ||
| - freedreno/a3xx: parameterize ubo optimization
 | ||
| - freedreno/a3xx: fix rasterizer discard
 | ||
| - nouveau: allow invalidating coherent/persistent buffer backings
 | ||
| - st/mesa: allow R8 to not be exposed as renderable by driver
 | ||
| - a4xx: add noperspective interpolation support
 | ||
| - a4xx: add polygon offset clamp, fix units
 | ||
| - ir3: mark ucp_enables as allowed values on all keys
 | ||
| - a4xx: hook up centroid ij coords
 | ||
| - ir3: use empirical size for params as used by the shader
 | ||
| 
 | ||
| Indrajit Kumar Das (2):
 | ||
| 
 | ||
| - st/mesa: use fragment shader to copy stencil buffer
 | ||
| - st/mesa: optimize DEPTH_STENCIL copies using fragment shader
 | ||
| 
 | ||
| Italo Nicola (17):
 | ||
| 
 | ||
| - panfrost: Fix outmods on int to float conversions
 | ||
| - pan/mdg: fix src_type in instructions that need a implicit zero
 | ||
| - pan/mdg: prepare effective_writemask()
 | ||
| - pan/mdg: eliminate references to ins->alu.op
 | ||
| - pan/mdg: eliminate references to ins->alu.reg_mode
 | ||
| - pan/mdg: fix comment
 | ||
| - pan/mdg: eliminate references to ins->alu.outmod
 | ||
| - pan/mdg: apply float outmods to textures
 | ||
| - pan/mdg: eliminate references to ins->texture.op
 | ||
| - pan/mdg: eliminate references to ins->load_store.op
 | ||
| - pan/mdg: defer register packing
 | ||
| - pan/mdg: externalize mir_pack_mod
 | ||
| - pan/mdg: remove ins->alu
 | ||
| - pan/mdg: refactor emit_alu_bundle
 | ||
| - pan/mdg: defer branch packing
 | ||
| - pan/mdg: remove ins->br_compact and ins->branch_extended
 | ||
| - pan/mdg: emit REGISTER_UNUSED on unused ALU src2
 | ||
| 
 | ||
| Iván Briano (9):
 | ||
| 
 | ||
| - anv: use the correct format on Android
 | ||
| - anv: Disable B5G6R5_UNORM_PACK16
 | ||
| - anv: Add a way to reserve states from a pool
 | ||
| - anv: Implement VK_EXT_custom_border_color
 | ||
| - anv: support externally synchronized pipeline caches
 | ||
| - anv: implement VK_PIPELINE_CREATE_FAIL_ON_PIPELINE_COMPILE_REQUIRED_BIT_EXT
 | ||
| - anv: enable VK_EXT_pipeline_creation_cache_control
 | ||
| - anv: Add VK_EXT_custom_border_color to relnotes
 | ||
| - anv: fix allocation of custom border color pool
 | ||
| 
 | ||
| James Park (1):
 | ||
| 
 | ||
| - amd/llvm: Reorder LLVM headers
 | ||
| 
 | ||
| James Zhu (1):
 | ||
| 
 | ||
| - ac/gpu_info: Correct Acturus cu bitmap
 | ||
| 
 | ||
| Jan Beich (5):
 | ||
| 
 | ||
| - drm-uapi: Add sync_file.h
 | ||
| - anv,iris: unbreak on BSDs after 812cf5f522ab,abf8aed68047
 | ||
| - util: enable futex usage on BSDs after 7dc2f4788288
 | ||
| - meson: unbreak sysctl.h detection on BSDs
 | ||
| - anv: disable i915_perf warning on non-Linux
 | ||
| 
 | ||
| Jan Palus (1):
 | ||
| 
 | ||
| - targets/opencl: fix build against LLVM>=10 with Polly support
 | ||
| 
 | ||
| Jan Zielinski (1):
 | ||
| 
 | ||
| - gallium/swr: Fix crashes in sampling code
 | ||
| 
 | ||
| Jason Ekstrand (167):
 | ||
| 
 | ||
| - intel/eu: Use non-coherent mode (BTI=253) for stateless A64 messages
 | ||
| - Revert "anv/gen12: Temporarily disable VK_KHR_buffer_device_address (and EXT)"
 | ||
| - vulkan: Allow destroying NULL debug report callbacks
 | ||
| - vulkan,anv: Add a common base object type for VkDevice
 | ||
| - anv: Stop clflushing events
 | ||
| - anv: Allocate CPU-side memory for events
 | ||
| - vulkan,anv: Add a base object struct type
 | ||
| - vulkan,anv: Move the DEFINE_HANDLE_CASTS macros to vk_object.h
 | ||
| - anv: Refactor setting descriptors with immutable sampler
 | ||
| - vulkan: Add run-time object type asserts in handle casts
 | ||
| - vulkan/wsi: Make wsi_swapchain inherit from vk_object_base
 | ||
| - anv/allocator: Add a start_offset to anv_state_pool
 | ||
| - vulkan/object: Always include the type
 | ||
| - anv,vulkan: Implement VK_EXT_private_data
 | ||
| - vulkan: Handle vkGet/SetPrivateDataEXT on Android swapchains
 | ||
| - nir: Make "divergent" a property of an SSA value
 | ||
| - util/list: Add a list pair iterator
 | ||
| - util/vma: Add an option to configure high/low preference
 | ||
| - util/vma: Add a debug print helper
 | ||
| - util/ra: Add [de]serialization support
 | ||
| - anv: Set 3DSTATE_VF_INSTANCING on the SVGS element
 | ||
| - anv: Set MOCS in 3DSTATE_CONSTANT_* on Gen9+
 | ||
| - nir: Add some docs to the metadata types
 | ||
| - anv: Call vk_object_base_finish for image views
 | ||
| - anv: Fix descriptor set clean-up on BO allocation failure
 | ||
| - nir: Use 8-bit types for most info fields
 | ||
| - anv:gpu_memcpy: Emit 3DSTATE_VF_INDEXING on Gen8+
 | ||
| - nir: Validate jump instructions as an instruction type
 | ||
| - nir: Use a switch statement in nir_handle_add_jump
 | ||
| - nir: Add documentation for each jump instruction type
 | ||
| - nir/clone: Re-use clone_alu for nir_alu_instr_clone
 | ||
| - nir: Add a new helper for iterating phi sources leaving a block
 | ||
| - nir: Add a store_reg helper and use the builder in phis_to_regs
 | ||
| - nir: Add const to nir_intrinsic_src_components
 | ||
| - nir/lower_double_ops: Rework the if (progress) tree
 | ||
| - nir/opt_deref: Report progress if we remove a deref
 | ||
| - nir/copy_prop_vars: Record progress in more places
 | ||
| - nir: Fix sources for image atomic fadd
 | ||
| - intel/vec4: Stomp the return type of RESINFO to UINT32
 | ||
| - intel/fs: Fix unused texture coordinate zeroing on Gen4-5
 | ||
| - intel/fs: Emit HALT for discard on Gen4-5
 | ||
| - anv/allocator: Compare to start_offset in state_pool_free_no_vg
 | ||
| - nir: Add a nir_metadata_all enum value
 | ||
| - nir: Add a nir_shader_preserve_all_metadata helper
 | ||
| - nir: Call nir_metadata_preserve on !progress
 | ||
| - nir: Properly preserve metadata in more cases
 | ||
| - intel/nir: Call nir_metadata_preserve on !progress
 | ||
| - iris: Better handle metadata in NIR passes
 | ||
| - anv: Add an anv_batch_set_storage helper
 | ||
| - anv: Add anv_pipeline_init/finish helpers
 | ||
| - nir/intrinsics: Put the _intel intrinsics together at the end
 | ||
| - anv: Use resolve_device_entrypoint for dispatch init
 | ||
| - vulkan: Update Vulkan XML and headers to 1.2.145
 | ||
| - anv: Bump the advertised patch version to 145
 | ||
| - intel/fs: Expose a couple of NIR lowering helpers
 | ||
| - intel/fs: Break wm_prog_data setup into a helper
 | ||
| - intel/fs: Move more prog_data setup into populate_wm_prog_data
 | ||
| - intel/compiler: Expose brw_texture_offset to C
 | ||
| - intel/eu: Add a brw_urb_dest_msg_type helper
 | ||
| - intel/eu: Set the right subnr for ALIGN16 destinations
 | ||
| - intel/eu: Add the RNDU opcode
 | ||
| - vulkan/wsi: Don't consider VK_SUBOPTIMAL_KHR to be an error condition
 | ||
| - wsi/x11: Log swapchain status changes
 | ||
| - freedreno: Only call nir_lower_io on shader_in/out
 | ||
| - lima: Only call nir_lower_io on shader_in/out
 | ||
| - nouveau: Only call nir_lower_io on shader_in/out
 | ||
| - vc4: Only call nir_lower_io on shader_in/out
 | ||
| - v3d: Only call nir_lower_io on shader_in/out
 | ||
| - panfrost: Only call nir_lower_io on shader_in/out
 | ||
| - nir: Assert that nir_lower_io is only called with allowed modes
 | ||
| - nir: Remove shared support from lower_io
 | ||
| - nir: Add docs to nir_lower[_explicit]_io
 | ||
| - anv: Handle clamping of inverted depth ranges
 | ||
| - nir/validate: Don't abort() until after the shader has printed
 | ||
| - spirv: Skip phis in unreachable blocks in the second phi pass
 | ||
| - spirv: Allow block-decorated struct types for constants
 | ||
| - vulkan: Update Vulkan XML and headers to 1.2.148
 | ||
| - anv: Advertise VK_EXT_image_robustness
 | ||
| - spirv: Update headers and grammar json
 | ||
| - spirv: Add support for SPV_EXT_shader_atomic_float
 | ||
| - intel/fs: Use the correct logical op for global float atomics
 | ||
| - anv: Advertise support for VK_EXT_shader_atomic_float
 | ||
| - nir: Allow for system values with variable numbers of destination components
 | ||
| - nir/lower_io: Choose to set access based on intrinsic metadata
 | ||
| - nir/lower_io: Use b2b for shader and function temporaries
 | ||
| - nir/lower_io: Add support for global scratch addressing
 | ||
| - spirv: Simplify our handling of NonUniform
 | ||
| - spirv: Drop the void \*ptr from vtn_value
 | ||
| - spirv: Fix indentation in vtn_handle_ptr
 | ||
| - spirv: Clean up OpSignBitSet
 | ||
| - spirv: Use nir_bany/ball for OpAny/All
 | ||
| - spirv: Add a helpers for getting types of values
 | ||
| - spirv: Rename push_value_pointer to push_pointer
 | ||
| - spirv: Add a vtn_push_nir_ssa helper
 | ||
| - spirv/amd: Use vtn_push_nir_ssa
 | ||
| - spirv: Add a vtn_get_nir_ssa helper
 | ||
| - spirv: Use the new helpers in OpConvertUToPtr/PtrToU
 | ||
| - spirv: Refactor vtn_push_ssa
 | ||
| - spirv/alu: Use vtn_push_ssa_value
 | ||
| - spirv/glsl450: Use vtn_push_ssa_value
 | ||
| - spirv/subgroups: Stop incrementing w
 | ||
| - spirv/subgroups: Refactor to use vtn_push_ssa
 | ||
| - spirv: Simplify vtn_ssa_value creation
 | ||
| - spirv: Hand-roll fewer vtn_ssa_value creations
 | ||
| - spirv: Add better checks for SSA value types
 | ||
| - spirv: Drop the sampled boolean from vtn_type
 | ||
| - spirv: Give atomic counters their own variable mode
 | ||
| - spirv: Add a helper for getting the NIR type of a vtn_type
 | ||
| - spirv: Remove a dead case in function parameter handling
 | ||
| - spirv: More heavily use vtn_ssa_value in function parameter handling
 | ||
| - anv,turnip,radv,clover,glspirv: Run nir_copy_prop before nir_opt_deref
 | ||
| - spirv: Rework our handling of images and samplers
 | ||
| - spirv: Also copy over binding information for atomic counters
 | ||
| - nir: Take a mode in remove_unused_io_vars
 | ||
| - nir/dead_variables: Respect the modes passed to remove_dead_vars
 | ||
| - nir: Add nir_foreach_shader_in/out_variable helpers
 | ||
| - nir: Add a nir_foreach_function_temp_variable helper
 | ||
| - nir: Add a nir_foreach_uniform_variable helper
 | ||
| - nir: Add a nir_foreach_gl_uniform_variable helper for GL linking
 | ||
| - nir: Add and use a nir_variable_list_for_mode helper
 | ||
| - nir: Take a nir_shader and variable mode in assign_var_locations
 | ||
| - nir: Take a shader and variable mode in nir_assign_io_var_locations
 | ||
| - nir/linking: Rework some internal helpers
 | ||
| - st/nir: Rework fixup_varying_slots
 | ||
| - nir/split_vars: Add mode checks to list walks
 | ||
| - nir: Split nir_index_vars into two functions
 | ||
| - nir/lower_amul: Add a variable mode check
 | ||
| - nir: Use a nir_shader and mode in lower_clip_cull_distance_arrays
 | ||
| - nir/lower_io_to_temporaries: Use a separate list for new inputs
 | ||
| - nir/io_to_vector: Use nir_foreach_variable_with_modes
 | ||
| - nir/lower_two_sided_color: Use nir_variable_create
 | ||
| - nir/lower_uniforms_to_ubo: Use nir_foreach_variable_with_modes
 | ||
| - nir/split_per_member_structs: Use nir_variable_with_modes_safe
 | ||
| - nir/lower_variable_initializers: Restrict the modes we lower
 | ||
| - nir/gl_nir_linker: Use nir_foreach_variable_with_modes
 | ||
| - freedreno/ir3_lower_tess: Rework var list helpers
 | ||
| - lima/standalone: Rework i/o variable fixup
 | ||
| - freedreno/ir3_cmdline: Rework i/o variable fixup
 | ||
| - r600/sfn/lower_tess_io: Rework get_tcs_varying_offset
 | ||
| - r600/sfn/lower_tex: Get rid of the lower_sampler vector
 | ||
| - r600/sfn: Use nir_foreach_variable_with_modes in IO vectorization
 | ||
| - panfrost/midgard: Make search_var take a nir_shader and mode
 | ||
| - panfrost: Use nir_foreach_variable_with_modes in pan_compile
 | ||
| - aco: Use nir_foreach_variable_with_modes to walk SSBOs
 | ||
| - mesa/ptn: Use nir_variable_create
 | ||
| - gallium/ttn: Use variable create/add helpers
 | ||
| - nir: Use a single list for all shader variables
 | ||
| - nir/split_per_member_structs: Inline split_variables_in_list
 | ||
| - nir/gl_nir_linker: Call add_vars_with_modes once for GL_PROGRAM_INPUT
 | ||
| - nir: Add a find_variable_with\_[driver\_]location helper
 | ||
| - vulkan: Update Vulkan XML and headers to 1.2.149
 | ||
| - anv: Implement VK_EXT_4444_formats
 | ||
| - nir/deref: Don't try to compare derefs containing casts
 | ||
| - compiler/types: Add a struct_type_is_packed wrapper
 | ||
| - spirv: Do more complex unwrapping in get_nir_type
 | ||
| - anv: Advertise shaderIntegerFunctions2
 | ||
| - spirv: Don't emit RMW for vector indexing in shared or global
 | ||
| - clover/spirv: Don't call llvm::regularizeLlvmForSpirv
 | ||
| - intel/nir: Pass the nir_builder by reference in lower_alpha_to_coverage
 | ||
| - intel/nir: Rewrite the guts of lower_alpha_to_coverage
 | ||
| - intel/fs: Fix MOV_INDIRECT and BROADCAST of Q types on Gen11+
 | ||
| - intel/fs: Don't copy-propagate stride=0 sources into ddx/ddy
 | ||
| - iris: Re-emit push constants if we have a varying workgroup size
 | ||
| - spirv: Run repair_ssa if there are discard instructions
 | ||
| - nir: More NIR_MAX_VEC_COMPONENTS fixes
 | ||
| - intel/fs/swsb: SCHEDULING_FENCE only emits SYNC_NOP
 | ||
| - radeonsi: Only call nir_lower_var_copies at the end of the opt loop
 | ||
| 
 | ||
| Jesse Natalie (10):
 | ||
| 
 | ||
| - nir_lower_io: Add addr_format_is_offset helper
 | ||
| - nir: When nir_lower_vars_to_explicit_types is run on temps, update scratch_size
 | ||
| - nir: Support load/store of temps as scratch in nir_lower_explicit_io
 | ||
| - nir: Support vec8/vec16 in nir_lower_bit_size
 | ||
| - nir: Support algebraic opts on vectors larger than 4
 | ||
| - nir: Support 8 and 16 component vectors for reduceable intrinsics
 | ||
| - nir/vtn: Add support for 8 and 16 vector ball/bany
 | ||
| - u_debug_stack_test: Fix MSVC compiling by using ATTRIBUTE_NOINLINE
 | ||
| - nir: More NIR_MAX_VEC_COMPONENTS fixes
 | ||
| - glsl_type: Add packed to structure type comparison for hash map
 | ||
| 
 | ||
| JibbityJobbity (1):
 | ||
| 
 | ||
| - drirc: Enable glthread for PCSX2
 | ||
| 
 | ||
| Jon Turney (1):
 | ||
| 
 | ||
| - glthread: Fix use of alloca() without #include "c99_alloca.h"
 | ||
| 
 | ||
| Jonathan Gray (13):
 | ||
| 
 | ||
| - util: unbreak endian detection on OpenBSD
 | ||
| - util/anon_file: add OpenBSD shm_mkstemp() path
 | ||
| - meson: build with _ISOC11_SOURCE on OpenBSD
 | ||
| - meson: don't build with USE_ELF_TLS on OpenBSD
 | ||
| - meson: conditionally include -ldl in gbm pkg-config file
 | ||
| - util: futex fixes for OpenBSD
 | ||
| - util/u_thread: include pthread_np.h if found
 | ||
| - anv: use os_get_total_physical_memory()
 | ||
| - util/os_misc: add os_get_available_system_memory()
 | ||
| - anv: use os_get_available_system_memory()
 | ||
| - util/os_misc: os_get_available_system_memory() for OpenBSD
 | ||
| - radv: remove seccomp includes
 | ||
| - vulkan: make VK_TIME_DOMAIN_CLOCK_MONOTONIC_RAW_EXT conditional
 | ||
| 
 | ||
| Jonathan Marek (135):
 | ||
| 
 | ||
| - turnip: update "fetchsize" value to match fdl6_layout changes
 | ||
| - turnip: enable tiling for compressed formats
 | ||
| - util/format: translate 422_UNORM and 420_UNORM vulkan formats
 | ||
| - freedreno/registers: document 422_UNORM and 420_UNORM formats
 | ||
| - turnip: implement VK_KHR_sampler_ycbcr_conversion
 | ||
| - turnip: enable 422_UNORM formats
 | ||
| - freedreno: move a4xx specific layout code to a4xx code
 | ||
| - freedreno/a5xx: remove unused reference to gmem_alignw in layout code
 | ||
| - freedreno/a6xx: don't use gmem_alignw for imported buffers
 | ||
| - freedreno/a6xx: split up gmem/tile alignment requirements
 | ||
| - freedreno: reduce extra height alignment in a6xx layout
 | ||
| - freedreno/a6xx: use RESOLVE_TS event
 | ||
| - freedreno: add adreno 650
 | ||
| - freedreno/layout: add explicit offset/pitch argument to fdl6_layout
 | ||
| - turnip: support VkImageDrmFormatModifierExplicitCreateInfoEXT
 | ||
| - turnip: fix RENDER_COMPONENTS value
 | ||
| - turnip: move HLSQ_UPDATE_CNTL write to before xs config writes
 | ||
| - turnip: update some properties based on blob driver
 | ||
| - turnip: clamp sampler minLod/maxLod
 | ||
| - freedreno/a6xx: use nonbinning VS when GS is used
 | ||
| - turnip: correctly emit non-binning vs in transform feedback case
 | ||
| - turnip: fix HW binning with geometry shader
 | ||
| - turnip: use common emit_xs_cntl to fill a6xx_sp_xs_ctrl_reg0
 | ||
| - turnip: fix VFD_CONTROL for binning pass
 | ||
| - turnip: pipeline program state refactor
 | ||
| - turnip: share code between 3D blit/clear path and tu_pipeline
 | ||
| - turnip: add layered 3D path clear for CmdClearAttachments
 | ||
| - turnip: add emit renderpass cache flushes for sysmem 3D CmdClearAttachments
 | ||
| - turnip: remove some dead/redundant code
 | ||
| - freedreno/ir3: fix ir3_nir_move_varying_inputs
 | ||
| - turnip: remove duplicated stage2opcode and stage2shaderdb
 | ||
| - turnip: simplify stage2 helpers
 | ||
| - turnip: set VFD_INDEX_OFFSET in 3D clear/blit path
 | ||
| - turnip: fix 3D path always being used for CmdBlitImage
 | ||
| - turnip: fix cubic filtering with CmdBlitImage
 | ||
| - turnip: compute and graphics have completely separate state
 | ||
| - turnip: move descriptor set BO tracking to CmdBindDescriptorSets
 | ||
| - turnip: improve dirty bit handling a bit
 | ||
| - turnip: delete dead dynamic state code
 | ||
| - turnip: refactor draw states and dynamic states
 | ||
| - turnip: input attachment descriptor set rework
 | ||
| - turnip: use draw states for input attachments
 | ||
| - turnip: use u_format for packing gmem clear values
 | ||
| - freedreno/a6xx: FETCHSIZE is PITCHALIGN
 | ||
| - freedreno/fdl6: rework layout code a bit (reduce linear align to 64 bytes)
 | ||
| - turnip: fix a crash when rasterizerDiscardEnable is set
 | ||
| - turnip: fix a sample shading case
 | ||
| - turnip: fix renderpass gmem configs when there are too many attachments
 | ||
| - turnip: set the API version
 | ||
| - turnip: move enum translation functions to a common header
 | ||
| - freedreno/a6xx: VSC "STRM_ARRAY_PITCH" is "STRM_LIMIT"
 | ||
| - freedreno/a6xx: remove unnecessary OVERFLOW_FLAG_REG check
 | ||
| - turnip: remove unnecessary OVERFLOW_FLAG_REG check
 | ||
| - freedreno/a4xx: restore pitch to bytes change to layout code
 | ||
| - freedreno/a4xx: simplify setup_slices
 | ||
| - turnip: rework streamout state and add missing counter buffer read/writes
 | ||
| - turnip: refactor CmdDraw* functions (and a few fixes)
 | ||
| - turnip: enable VK_EXT_index_type_uint8
 | ||
| - turnip: implement CmdDrawIndirectByteCountEXT
 | ||
| - turnip: fix ts_cs_memory typo
 | ||
| - turnip: use pipeline cs for shader programs instead of separate bo
 | ||
| - freedreno/registers: a6xx depth bounds test registers
 | ||
| - turnip: implement depthBounds
 | ||
| - turnip: translate CreateRenderPass to CreateRenderPass2
 | ||
| - turnip: replace a memset(0) with zalloc in CreateRenderPass
 | ||
| - turnip: use RenderPassCreateInfo for render_pass_add_implicit_deps
 | ||
| - turnip: move some logic out of create_render_pass_common
 | ||
| - turnip: implement VK_EXT_vertex_attribute_divisor
 | ||
| - turnip: fix empty scissor case
 | ||
| - turnip: fix update_stencil_mask
 | ||
| - turnip: disable early_z for VK_FORMAT_S8_UINT
 | ||
| - freedreno/registers: add CP_DRAW_INDIRECT_MULTI
 | ||
| - freedreno/ir3: add support for load_draw_id
 | ||
| - turnip: implement VK_KHR_shader_draw_parameters
 | ||
| - turnip: fix VK_STRUCTURE_TYPE_PHYSICAL_DEVICE_VULKAN_1_1_FEATURES
 | ||
| - turnip: fix huge scissor min/max case
 | ||
| - freedreno/ir3: fix resinfo wrmask
 | ||
| - freedreno/regs: add extra bits for UBWC array pitch
 | ||
| - turnip: enable largePoints
 | ||
| - turnip: enable depthBiasClamp
 | ||
| - freedreno/registers: update varying-related registers
 | ||
| - freedreno/a3xx: support LINEAR_PIXEL/PERSP_CENTROID/LINEAR_CENTROID sysvals
 | ||
| - freedreno/a4xx: fake LINEAR_PIXEL varying support for u_blitter
 | ||
| - freedreno/ir3: add generic get_barycentric()
 | ||
| - freedreno/a5xx: set missing bary sysvals
 | ||
| - freedreno/a6xx: set missing bary sysvals
 | ||
| - turnip: set missing bary sysvals
 | ||
| - freedreno/ir3: add support for INTERP_MODE_NOPERSPECTIVE
 | ||
| - turnip: make tiling config part of framebuffer state
 | ||
| - turnip: rework render_tiles loop
 | ||
| - turnip: vsc improvements
 | ||
| - turnip: fix tess param bo size calculation
 | ||
| - turnip: clear_blit: pass aspect mask to setup function
 | ||
| - turnip: support multi-image layouts
 | ||
| - turnip: enable 420_UNORM formats
 | ||
| - freedreno/layout: fix explicit layout offset not added to slice offset
 | ||
| - freedreno/ir3: fix/rework tess levels
 | ||
| - Revert "nir: Add an option for lowering TessLevelInner/Outer to vecs"
 | ||
| - Revert "nir: Support sysval tess levels in SPIR-V to NIR"
 | ||
| - freedreno/regs: document SS6_UBO state src
 | ||
| - turnip: use global bo for clear blit shaders
 | ||
| - freedreno/ir3: add support for a650 tess shared storage
 | ||
| - freedreno/regs: document CS shared storage size bit
 | ||
| - freedreno/a2xx: fix compressed textures
 | ||
| - freedreno: add a fd_resource_pitch helper
 | ||
| - freedreno/layout: layout simplifications and pitch from level 0 pitch
 | ||
| - turnip: fix active_desc_sets not being set for compute pipeline
 | ||
| - freedreno/ir3: fix setup_input for sparse vertex inputs
 | ||
| - freedreno/ir3: run nir_opt_loop_unroll in optimization loop
 | ||
| - freedreno: fix layout pitchalign field not being set for imported buffers
 | ||
| - freedreno/regs: update primitive output related registers
 | ||
| - turnip: clean up primitive output state
 | ||
| - turnip: drop GS clear path
 | ||
| - turnip: use DIRTY SDS bit to avoid making copies of pipeline load state ib
 | ||
| - turnip: emit compute pipeline directly in CmdBindPipeline
 | ||
| - turnip: fix inconsistencies with tu6_load_state_size
 | ||
| - turnip: remove use of tu_cs_entry for draw states
 | ||
| - gitlab-ci: re-enable arm64_a630_vk
 | ||
| - freedreno/regs: update a6xx GRAS registers
 | ||
| - freedreno/regs: update a6xx RB regs
 | ||
| - freedreno/regs: update a6xx VPC regs
 | ||
| - freedreno/regs: update a6xx PC regs
 | ||
| - turnip: disable tiling for NV12/IYUV formats
 | ||
| - turnip: remove extra gmem alignment
 | ||
| - freedreno/ir3: fix wrong local_primitive_id_start type
 | ||
| - turnip: move WFI out of draw state to fix a650 hangs
 | ||
| - turnip: use patchControlPoints for HS_INPUT_SIZE value
 | ||
| - turnip: fix SP_HS_UNKNOWN_A831 value for A650
 | ||
| - turnip: workaround for a630 d24_unorm_s8_uint fails
 | ||
| - turnip: fix sysmem CmdClearAttachments 3D fallback breaking GMEM path flush
 | ||
| - turnip: delete tu_clear_sysmem_attachments_2d
 | ||
| - turnip: add support for D32_SFLOAT_S8_UINT
 | ||
| - turnip: rework extended formats to allow more extended formats
 | ||
| - util/format: translate A4R4G4B4_UNORM and A4B4G4R4_UNORM vulkan formats
 | ||
| - turnip: implement VK_EXT_4444_formats
 | ||
| 
 | ||
| Jordan Justen (17):
 | ||
| 
 | ||
| - intel/dev: Split .num_subslices out of GEN12_FEATURES macro
 | ||
| - intel/dev: Add device info for RKL
 | ||
| - intel/l3: Don't rely on cfg entry URB size being 0 as a sentinal
 | ||
| - intel/l3: Allow platforms to have no l3 configurations
 | ||
| - iris/l3: Enable L3 full way allocation when L3 config is NULL
 | ||
| - anv: Set L3 full way allocation at context init if L3 cfg is NULL
 | ||
| - intel/dev: Add device info for DG1
 | ||
| - iris: Make use of devinfo has_aux_map field
 | ||
| - anv: Make use of devinfo has_aux_map field
 | ||
| - anv/pipeline: Split VFE/INTERFACE_DESCRIPTOR out to emit_media_cs_state
 | ||
| - anv/cmd_buffer: Split GPGPU_WALKER out to emit_gpgpu_walker
 | ||
| - iris: Split walker and state update into iris_upload_gpgpu_walker
 | ||
| - iris/compute: Split out iris_load_indirect_location
 | ||
| - intel/compiler/cs: Allow simd32 in some more cases with no8 and/or no16
 | ||
| - intel/compiler/fs: Still attempt simd32 when INTEL_DEBUG=no16 is used
 | ||
| - iris: Add missing break in switch in modifier_is_supported
 | ||
| - anv, iris: Set MediaSamplerDOPClockGateEnable for gen12+
 | ||
| 
 | ||
| Jose Maria Casanova Crespo (4):
 | ||
| 
 | ||
| - v3d: Fix swizzle in DXT3 and DXT5 formats
 | ||
| - v3d: Include supported DXT formats to enable s3tc/dxt extensions
 | ||
| - vc4: don't relay on intr->num_components for non-vectorized intrinsics
 | ||
| - nir: only uniforms with dynamically_uniform offset are dynamically_uniform
 | ||
| 
 | ||
| Joshua Ashton (7):
 | ||
| 
 | ||
| - anv: Remove RANGE_SIZE usage
 | ||
| - radv: Remove RANGE_SIZE usage
 | ||
| - turnip: Remove RANGE_SIZE usage
 | ||
| - vulkan: Update Vulkan XML and headers to 1.2.140
 | ||
| - radv: Implement VK_EXT_custom_border_color
 | ||
| - radeonsi: Use TRUNC_COORD on samplers
 | ||
| - radv: Implement VK_EXT_4444_formats
 | ||
| 
 | ||
| José Fonseca (3):
 | ||
| 
 | ||
| - glthread: Add GLAPIENTRY to _mesa_marshal_MultiDrawArrays.
 | ||
| - appveyor: Upgrade pip.
 | ||
| - appveyor: Use Python3.
 | ||
| 
 | ||
| Karol Herbst (50):
 | ||
| 
 | ||
| - nir/deref: copy ptr_stride when rematerializing
 | ||
| - nir/validate: validate the stride for deref_ptr_as_array
 | ||
| - Revert "nir/validate: validate the stride for deref_ptr_as_array"
 | ||
| - nvir/nir: use component helpers instead of insn->num_components
 | ||
| - st/mesa: lower images when needed
 | ||
| - nir/lower_images: fix for array of arrays
 | ||
| - nir/lower_images: handle dec and inc
 | ||
| - nv50/ir/nir: move away from image_deref intrinsics
 | ||
| - nv50/ir/nir: handle image atomic inc and dec
 | ||
| - nv50/ir/nir: remove image uniform hack
 | ||
| - gv100/ir: fix atom cas
 | ||
| - gv100/ir: fix shift lowering
 | ||
| - gv100/ir: fix OP_TXG for shadow textures
 | ||
| - nv50/ir/nir: add workaround for double vertex attribs
 | ||
| - nv50/ir/print: add missing VIEWPORT_MASK handling
 | ||
| - nv50/ir/nir: fix ext_demote_to_helper_invocation
 | ||
| - nv50/ir/nir: fix nv_viewport_array2
 | ||
| - nvc0: enable spirv caps with nir
 | ||
| - nv50/ir/nir: don't emit a restart with set a stream_id
 | ||
| - nv50/ir/nir: handle clip vertex for tess eval shaders
 | ||
| - nv50/ir/nir: rework input output handling
 | ||
| - nv50/ir/nir: rework CFG handling
 | ||
| - nv50/ir/ra: convert some for loops to Range-based for loops
 | ||
| - nv50/ir/ra: fix memory corruption when spilling
 | ||
| - nv50/ir/nir: fix interpolation on explicit operations
 | ||
| - gv100/ir: implement sample shading
 | ||
| - gv100/ir: fix coherent and volatile memory access
 | ||
| - nv50/ir/nir: fix cache mode conversion
 | ||
| - nv50/ir: fix memset on non trivial types warning
 | ||
| - nv50/ir/tgsi: move call to tgsi_scan_shader inside Source constructor
 | ||
| - nvc0: set local mem size for compute on gv100
 | ||
| - nvc0: set sampler index mode to independently on gv100 compute
 | ||
| - gv100/ir: set ftz bit on floating point operations
 | ||
| - ci: bump libdrm to 2.4.102
 | ||
| - nouveau: enable HMM
 | ||
| - gallium: add PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY
 | ||
| - nvc0: support PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY
 | ||
| - nouveau: expose HMM
 | ||
| - ci: need to install wget in order to download libdrm
 | ||
| - ci: bump libdrm to 2.4.102
 | ||
| - nouveau: enable HMM
 | ||
| - gallium: add PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY
 | ||
| - nvc0: support PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY
 | ||
| - nouveau: expose HMM
 | ||
| - st/mesa: fix st_CopyPixels without support for stencil exports
 | ||
| - nv50/ir/tgsi: silence warning about unhandled GS_INPUT_PRIM property
 | ||
| - nv50/ir: initialize persampleInvocation to false
 | ||
| - nir/lower_io: assert that offsets are used for shader_in
 | ||
| - nv50/ir/nir: fix global_atomic_comp_swap
 | ||
| - spirv: extract switch parsing into its own function
 | ||
| 
 | ||
| Kenneth Graunke (20):
 | ||
| 
 | ||
| - iris: Include linux/sync_file.h instead of cut and pasting contents
 | ||
| - anv: Include linux/sync_file.h instead of cut and pasting contents
 | ||
| - iris: Rename iris_syncpt to iris_syncobj for clarity.
 | ||
| - iris: Give up on not passing ice to iris_init_batch
 | ||
| - iris: Destroy transfer slab after batches
 | ||
| - iris: Flush any current work in iris_fence_await before adding deps
 | ||
| - intel: Move anv_gem_supports_syncobj_wait to common code.
 | ||
| - iris: Detect DRM_SYNCOBJ_WAIT_FLAGS_WAIT_FOR_SUBMIT kernel support
 | ||
| - iris: Implement PIPE_FLUSH_DEFERRED support.
 | ||
| - intel: Delete hardcoded devinfo->urb.size values for Gen7+ (sans DG1).
 | ||
| - iris: Delete useless #define
 | ||
| - intel/eu: Add a brw_urb_desc helper
 | ||
| - CI: Disable Panfrost Mali-T820, Lima Mali-400 and Lima Mali-450 jobs
 | ||
| - intel: Disable loading drivers on DG1 devices for now
 | ||
| - nir: Fix divergence analysis for tessellation input/outputs
 | ||
| - iris: Implement pipe->texture_subdata directly
 | ||
| - iris: Fix CCS check in iris_texture_subdata().
 | ||
| - iris: Delete shader variants when deleting the API-facing shader
 | ||
| - iris: Reorder the loops in iris_fence_await() for clarity.
 | ||
| - iris: Drop stale syncobj references in fence_server_sync
 | ||
| 
 | ||
| Kristian Høgsberg (73):
 | ||
| 
 | ||
| - freedreno/ir3: Pass stream output info to ir3_shader_from_nir
 | ||
| - freedreno/ir3: Rename ir3_nir_lower_to_explicit_io
 | ||
| - freedreno/ir3: Add ir3_nir_lower_to_explicit_input() pass
 | ||
| - freedreno/ir3: Lower GS builtins before lowering IO
 | ||
| - freedreno/ir3: Drop hack to clean up split vars
 | ||
| - freedreno/fdl: Align after dividing by block size
 | ||
| - freedreno/a6xx: Set tfetch correctly for compressed formats
 | ||
| - freedreno/ir3: Drop wrmask for ir3 local and global store intrinsics
 | ||
| - freedreno/a6xx: Create shader dependent streamout state at compile time
 | ||
| - freedreno/a6xx: Map inputs to VFD entries up front
 | ||
| - freedreno/a6xx: Allocate ringbuffer based on VFD count
 | ||
| - freedreno/a6xx: Emit VFD setup as array writes
 | ||
| - freedreno/a6xx: Avoid stalling for occlusion queries
 | ||
| - freedreno: Use the right amount of &'s
 | ||
| - freedreno: Use explicit \*_NONE enum for undefined formats
 | ||
| - turnip: Use hw enum when emitting A6XX_RB_STENCIL_CONTROL
 | ||
| - turnip: Use tu6_reduction_mode() to avoid warning
 | ||
| - turnip: Use {} initializer to silence warning
 | ||
| - freedreno/ir3: Avoid {0} initializer for struct reginfo
 | ||
| - src/util: Remove out-of-range comparison
 | ||
| - mapi: Fix a couple of warning in generated code
 | ||
| - mesa/st: Use memset to zero out struct
 | ||
| - egl/android: Move get_format under HAVE_DRM_GRALLOC guard where it's used
 | ||
| - egl/android: Drop unused variable
 | ||
| - freedreno/a6xx: Move per element offset to VFD_DECODE
 | ||
| - freedreno/a6xx: Decouple VFD_FETCH and VFD_DECODE
 | ||
| - freedreno/a6xx: Create stateobj for VFD_DECODE
 | ||
| - freedreno/a6xx: Program VFD_DEST_CNTL from program stateobj
 | ||
| - freedreno/a6xx: Turn on robustness extensions
 | ||
| - docs/features.txt: Update for freedreno
 | ||
| - freedreno/a6xx: Fix VFD_CONTROL emit
 | ||
| - freedreno/a6xx: Don't write REG_A6XX_RB_SRGB_CNTL in restore
 | ||
| - freedreno/a6xx: Set index buffer size to bo size
 | ||
| - freedreno: Handle DRM_FORMAT_MOD_INVALID in shared code
 | ||
| - turnip: Put VK_KHR_external_fence_fd stubs back
 | ||
| - freedreno/a6xx: Don't blit with R2D_RAW
 | ||
| - freedreno/a6xx: Move fd6_ifmt into fd6_blitter.c
 | ||
| - freedreno/a6xx: Split out src and dst setup helpers for blit
 | ||
| - freedreno/a6xx: Don't set unknown bit when tiling differs
 | ||
| - freedreno/a6xx: Set src and dst rects outside blit loop
 | ||
| - freedreno/a6xx: Program SP_2D_SRC_FORMAT outside blit loop
 | ||
| - freedreno/a6xx: Consolidate computing blit_cntl
 | ||
| - freedreno/a6xx: Don't emit src state when clearing
 | ||
| - freedreno/a6xx: Separate stencil sysmem clear fix
 | ||
| - freedreno/a6xx: Enable FMT6_10_10_10_2_UNORM blitting
 | ||
| - freedreno/a6xx: Make blit_control helper a little more helpful
 | ||
| - freedreno/a6xx: Program A6XX_SP_2D_SRC_FORMAT_COLOR_FORMAT based on dst format
 | ||
| - freedreno/a6xx: Move REG_A6XX_SP_2D_SRC_FORMAT programming to helper
 | ||
| - freedreno/a6xx: Move CP_SET_MARKER to setup helper
 | ||
| - freedreno/a6xx: Program RB_UNKNOWN_8C01 in setup helper
 | ||
| - freedreno/a6xx: Don't take pipe_blit_info in emit_blit_dst
 | ||
| - freedreno/a6xx: Split clear and blit texture into different functions
 | ||
| - freedreno/registers: Rename SP_2D_SRC_FORMAT
 | ||
| - turnip: Move device enumeration and feature discovery to tu_drm.c
 | ||
| - turnip: Move tu_bo functions to tu_drm.c
 | ||
| - turnip: Collapse some tu_drm wrappers
 | ||
| - turnip: Move remaining drm code to tu_drm.c
 | ||
| - turnip: Only include msm_drm in tu_drm.c
 | ||
| - egl/android: Remove unused variable
 | ||
| - mapi/test: Change type to unsigned for offset
 | ||
| - gallium: Switch u_debug_stack/symbol.c to util/hash_table.h
 | ||
| - util: Move stack debug functions to src/util
 | ||
| - util: Add unit test for stack backtrace caputure
 | ||
| - gallium/android: Rewrite backtrace helper for android
 | ||
| - ci: Include enough Android headers to let us compile test EGL
 | ||
| - mapi: Mark TLS symbols as optional in glapi-symbols.txt
 | ||
| - turnip: Make tu_android.c compile again
 | ||
| - meson: Define ANDROID and ANDROID_API_LEVEL when compiling for Android
 | ||
| - anv: Pass device to setup_gralloc0_usage for error reporting
 | ||
| - anv: Add stub for anv_gem_get_tiling() for Android
 | ||
| - vulkan: Allow global symbol HMI for Android
 | ||
| - radv/android: Remove unused variable
 | ||
| - ci: Add a build test for the Android platform
 | ||
| 
 | ||
| Krzysztof Raszkowski (1):
 | ||
| 
 | ||
| - gallium/swr: Fix building swr with MSVC
 | ||
| 
 | ||
| Laura Ekstrand (3):
 | ||
| 
 | ||
| - docs: include meson in the toctree
 | ||
| - docs: Remove version.
 | ||
| - docs: Add the favicon to the new page.
 | ||
| 
 | ||
| Leo Liu (3):
 | ||
| 
 | ||
| - radeon/vcn: reset the decode flags from message buffer
 | ||
| - radeon/vcn: add Sienna to use internal register offset
 | ||
| - radeon/vcn/dec: add db_aligned_height to message buffer
 | ||
| 
 | ||
| Lepton Wu (3):
 | ||
| 
 | ||
| - mapi: x86: Fix dynamic entries in x86 tsd stubs.
 | ||
| - mapi: Return NULL function pointers for GL_EXT_debug_marker
 | ||
| - egl: Allow software rendering for vgem/virtio_gpu in platform_device
 | ||
| 
 | ||
| Lionel Landwerlin (60):
 | ||
| 
 | ||
| - drm-shim: move handle lock to shim_fd
 | ||
| - drm-shim: don't create a memfd per BO
 | ||
| - drm-shim: silence warnings
 | ||
| - intel/dev: print out error when platform is not found by name
 | ||
| - intel: add stub_gpu tool
 | ||
| - ci: Add intel to shaderdb runs
 | ||
| - iris: don't assert on unfinished aux import in copy paths
 | ||
| - anv: don't expose VK_INTEL_performance_query without kernel support
 | ||
| - anv: fix alignments for uniform buffers
 | ||
| - genxml: run sorting script
 | ||
| - genxml: fix invalid end value for video fields
 | ||
| - genxml: factor out utility functions
 | ||
| - genxml: pack: deal with default field not being simple integers
 | ||
| - intel/genxml: fix bits generation for MI_LOAD_REGISTER_IMM
 | ||
| - intel/mi-builder: add framework for self modifying batches
 | ||
| - anv: don't reserve a particular register for draw count
 | ||
| - anv: add a new execution mode for secondary command buffers
 | ||
| - intel/genxml: add PIPE_CONTROL command cache invalidate bit
 | ||
| - intel/perf: make pipeline statistic query loading optional
 | ||
| - intel/perf: store the appropriate OA formats in queries
 | ||
| - intel/perf: update generated code to ralloc all data
 | ||
| - intel/perf: create a unique list of counters
 | ||
| - intel/perf: compute number of passes for a set of counters
 | ||
| - intel/perf: emit counter units in generated code
 | ||
| - intel/perf: add helper to compute metrics from counters
 | ||
| - intel/perf: add counter category to generated code
 | ||
| - intel/perf: report whether the platform supported
 | ||
| - anv: use a query filled by the perf code
 | ||
| - intel/perf: reuse offset specified in the query
 | ||
| - anv: Implement VK_KHR_performance_query
 | ||
| - intel/perf: repurpose INTEL_DEBUG=no-oaconfig
 | ||
| - anv: fixup unwinding of device create failure
 | ||
| - blorp: rename workaround address function
 | ||
| - anv: store the workaround address
 | ||
| - iris: store workaround address
 | ||
| - i965: store workaround_bo offset
 | ||
| - intel: add identifier for debug purposes
 | ||
| - iris: add identifier BO
 | ||
| - i965: add identifier BO
 | ||
| - anv: add identifier BO
 | ||
| - intel/aub_error_decoder: print driver identifier if found
 | ||
| - iris: fix BO destruction in error path
 | ||
| - i965: don't forget to set screen on duped image
 | ||
| - iris: fix export of GEM handles
 | ||
| - i965: fix export of GEM handles
 | ||
| - anv: add an option to disable secondary command buffer calls
 | ||
| - anv: garbage collect timeline semaphore when querying value
 | ||
| - iris: fix fallback to swrast driver
 | ||
| - anv: fix uninitialized variable access
 | ||
| - anv: properly handle fence import of sync_fd = -1
 | ||
| - anv: fix descriptor set free
 | ||
| - anv: fix incorrect realloc failure handling
 | ||
| - anv: centralize vk to gen arrays
 | ||
| - anv: fix up dynamic clip emission
 | ||
| - anv: don't fail userspace relocation with perf queries
 | ||
| - anv: fix transform feedback surface size
 | ||
| - anv: VK_INTEL_performance_query interaction with VK_EXT_private_data
 | ||
| - intel/perf: store query symbol name
 | ||
| - intel/perf: fix raw query kernel metric selection
 | ||
| - intel/compiler: fixup Gen12 workaround for array sizes
 | ||
| 
 | ||
| Liviu Prodea (1):
 | ||
| 
 | ||
| - util: Make process_test path compatible with mingw native toolchains
 | ||
| 
 | ||
| Louis-Francis Ratté-Boulianne (1):
 | ||
| 
 | ||
| - nir: Always create UBO variable when lowering uniforms to ubo
 | ||
| 
 | ||
| Lucas Stach (3):
 | ||
| 
 | ||
| - etnaviv: generalize FE stall before loading shader and sampler states
 | ||
| - etnaviv: retarget transfer to render resource when necessary
 | ||
| - etnaviv: don't expose timer queries
 | ||
| 
 | ||
| Luigi Santivetti (3):
 | ||
| 
 | ||
| - dri2: dri2_make_current() fold multiple if blocks
 | ||
| - dri2: do not conflate unbind and bindContext() failure
 | ||
| - egl/dri2: try to bind old context if bindContext failed
 | ||
| 
 | ||
| Marcin Ślusarz (24):
 | ||
| 
 | ||
| - i965: remove unused variable
 | ||
| - glsl_to_tgsi: add fallthrough comments
 | ||
| - glsl: cleanup vertex shader input checks
 | ||
| - iris: remove unused iris_bo->swizzle_mode
 | ||
| - intel/compiler: fix Android build
 | ||
| - st/mesa: fix reporting of float perf counters max value
 | ||
| - iris: return max counter value for AMD_performance_monitor
 | ||
| - iris: remove iris_monitor_config
 | ||
| - intel/perf: move query_mask and location out of gen_perf_query_counter
 | ||
| - iris: propagate error from gen_perf_begin_query to glBeginPerfQueryINTEL
 | ||
| - i965: propagate error from gen_perf_begin_query to glBeginPerfQueryINTEL
 | ||
| - util: fix possible fd leaks in os_socket_listen_abstract
 | ||
| - glsl: catch out of bounds access in the debug version
 | ||
| - util: fix possible buffer overflow in util_get_process_exec_path
 | ||
| - util/format: initialize non-important components to 0
 | ||
| - mesa: fix out of bounds access in glGetFramebufferParameterivEXT
 | ||
| - mesa: quiet down static analyzers
 | ||
| - iris: quiet down static analyzers
 | ||
| - intel/vec4: fix out of bounds read
 | ||
| - intel/perf: fix performance counters availability after glFinish
 | ||
| - anv: refresh cached current batch bo after emitting some commands
 | ||
| - anv: fix minor gen_ioctl(I915_PERF_IOCTL_CONFIG) error handling issue
 | ||
| - intel/perf: split load_oa_metrics
 | ||
| - intel/perf: export performance counters sorted by [group|set] and name
 | ||
| 
 | ||
| Marek Olšák (226):
 | ||
| 
 | ||
| - mesa: optimize glPush/PopClientAttrib by removing malloc overhead
 | ||
| - mesa: don't call _mesa_update_state for _mesa_get_clamp_fragment_color
 | ||
| - mesa: don't set unnecessary program flags in _mesa_update_state
 | ||
| - mesa: don't update shaders on fixed-func state changes if user shaders are bound
 | ||
| - mesa,st/mesa: add a fast path for non-static VAOs
 | ||
| - mesa: inline vbo_context inside gl_context to remove vbo_context dereferences
 | ||
| - mesa: add glInternalBufferSubDataCopyMESA for glthread
 | ||
| - mesa: add _mesa_InternalBind{ElementBuffer,VertexBuffers} for glthread
 | ||
| - glthread: do glBufferSubData as unsynchronized upload + GPU copy
 | ||
| - glthread: don't use atomics for refcounting to decrease overhead on AMD Zen
 | ||
| - glthread: track pointers and strides for Pointer & EXT_dsa attrib functions
 | ||
| - glthread: track instance divisor changes
 | ||
| - glthread: track primitive restart state
 | ||
| - glthread: initialize VAOs properly
 | ||
| - glthread: handle POS vs GENERIC0 aliasing
 | ||
| - glthread: handle gl{Push,Pop}ClientAttrib{DefaultEXT} for glthread states
 | ||
| - glthread: upload non-VBO vertices and indices for non-Indirect non-IBM draws
 | ||
| - tgsi_to_nir: handle TGSI_SEMANTIC_BLOCK_SIZE
 | ||
| - tgsi_to_nir: handle TGSI_OPCODE_BARRIER
 | ||
| - radeonsi: unify and align down the max SSBO/TBO/UBO buffer binding size
 | ||
| - radeonsi: clean up and deduplicate code around internal compute dispatches
 | ||
| - radeonsi: bind shader images after DCC is disabled for image stores
 | ||
| - radeonsi: add SI_IMAGE_ACCESS_DCC_OFF to ignore DCC for shader images
 | ||
| - radeonsi: implement and use compute-based DCC decompression on gfx9-10
 | ||
| - radeonsi: add a workaround to fix KHR-GL45.texture_view.view_classes on gfx9
 | ||
| - radeonsi: fix si_compute_clear_render_target with render condition enabled
 | ||
| - radeonsi: revert an accidental change in si_clear_buffer
 | ||
| - Revert "ac/surface: remove RADEON_SURF_TC_COMPATIBLE_HTILE and assume it's always set"
 | ||
| - Revert "ac: reassociate FP expressions for inexact instructions for radeonsi"
 | ||
| - ac/surface: fix MSAA crash with FORCE_SWIZZLE_MODE on gfx9
 | ||
| - radeonsi: don't wait for idle at the end of gfx IBs
 | ||
| - ac/surface: unset RADEON_SURF_TC_COMPATIBLE_HTILE if HTILE hasn't been computed
 | ||
| - radeonsi/gfx9: always use IMG_DATA_FORMAT_S8_32 for 8-bit stencil
 | ||
| - radeonsi: allow tc_compatible_htile to be mutable
 | ||
| - radeonsi: enable TC-compatible HTILE on demand for best Z/S performance
 | ||
| - tgsi_to_nir: translate non-vec4 image stores correctly
 | ||
| - radeonsi: fix compilation of monolithic PS
 | ||
| - amd: update amdgpu_drm.h
 | ||
| - amd: remove duplicated definitions from amdgpu_drm.h
 | ||
| - amd: assume CMASK is always rb/pipe_aligned, remove ac_surface.u.gfx9.cmask
 | ||
| - amd: assume HTILE is always rb/pipe_aligned, remove ac_surface.u.gfx9.htile
 | ||
| - ac/surface,radeonsi: move the set/get_bo_metadata code to ac_surface.c
 | ||
| - ac/surface,radeonsi: move the set/get_umd_metadata code into ac_surface.c
 | ||
| - amd: unify code for overriding offset and stride for imported buffers
 | ||
| - ac/surface: override all offsets including metadata offsets
 | ||
| - ac/surface: fix broken pitch override on gfx8
 | ||
| - gallium: rename 'state tracker' to 'frontend'
 | ||
| - gallium: change comments to remove 'state tracker'
 | ||
| - gallium: rename PIPE_RESOURCE_FLAG_ST_PRIV to FRONTEND_PRIV
 | ||
| - gallium: remove more "state tracker" occurences
 | ||
| - radeonsi: also enable tgsi_to_nir caching for compute shaders
 | ||
| - glthread: stop using GLenum16 to get correct GL errors for out-of-bounds enums
 | ||
| - radeonsi: don't expose 16xAA on chips with 1 RB due to an occlusion query issue
 | ||
| - ac/nir: honor ACCESS_STREAM_CACHE_POLICY for L1 and L0 caches too
 | ||
| - radeonsi: use correct clear value size for EQAA in expand_fmask
 | ||
| - radeonsi: optimize access pattern for compute blits with linear textures
 | ||
| - radeonsi: tweak clear/copy_buffer limits when to use compute
 | ||
| - radeonsi: simplify setting resource usage for si_init_temp_resource_from_box
 | ||
| - radeonsi: rename SI_RESOURCE_FLAG_TRANSFER to FORCE_LINEAR
 | ||
| - radeonsi: use vi_dcc_enabled instead of using tex->surface.dcc_offset directly
 | ||
| - radeonsi: use display_dcc_offset for setting displayable_dcc_cb_mask
 | ||
| - winsys/amdgpu: add RADEON_FLAG_UNCACHED for faster blits over PCIe
 | ||
| - radeonsi: disable the L2 cache for most CPU mappings of textures
 | ||
| - radeonsi: disable the L2 cache for CPU read mappings of buffers
 | ||
| - radeonsi: compute perf tests - don't test 1 wave/SA limit, test no limit first
 | ||
| - radeonsi: test uncached clear/copy buffer performance with compute shaders
 | ||
| - gallium/u_threaded: execute transfer_unmap with THREAD_SAFE directly
 | ||
| - ac/gpu_info: compute the best safe IB alignment
 | ||
| - ac/surface: don't compute single-sample CMASK if it's unaligned
 | ||
| - radeonsi: don't use INDIRECT_BUFFER within IBs
 | ||
| - radeonsi: decrease the max GS invocation count to 32
 | ||
| - Revert "radeonsi: don't wait for idle at the end of gfx IBs"
 | ||
| - ac: update register and packet definitions for preemption
 | ||
| - radeonsi: move resetting tracked registers into a new function
 | ||
| - radeonsi: split si_all_descriptors_begin_new_cs and rename functions
 | ||
| - radeonsi: don't enable TC-compatible HTILE for stencil if stencil doesn't use it
 | ||
| - radeonsi/gfx8: enable TC-compatible HTILE from the beginning as before
 | ||
| - radeonsi: don't hardcode most perf counter block counts
 | ||
| - ac/gpu_info: replace num_good_cu_per_sh with min/max_good_cu_per_sa
 | ||
| - amd: replace SH -> SA (shader array) in comments
 | ||
| - radeonsi/gfx10: implement most performance counters
 | ||
| - glthread: don't upload for glDraw inside a display list and always sync
 | ||
| - nir: add i2imp and u2ump opcodes for conversions to mediump
 | ||
| - nir: add int16 and uint16 type helpers
 | ||
| - nir: lower int16 and uint16 in nir_lower_mediump_outputs
 | ||
| - nir: fix lower_wpos for 16-bit fddy
 | ||
| - nir: add options::vectorize_vec2_16bit to limit vectorization to vec2 16
 | ||
| - glsl: treat lowp as mediump when lowering builtins
 | ||
| - glsl: handle int16 and uint16 types and add instructions for mediump
 | ||
| - glsl: lower mediump integer types to int16 and uint16
 | ||
| - glsl: lower mediump partial derivatives
 | ||
| - glsl: lower the precision of imageLoad
 | ||
| - glsl: lower samplers with highp coordinates correctly
 | ||
| - gallium: add shader caps INT16 and FP16_DERIVATIVES
 | ||
| - ac: rename has_double_rate_fp16 -> has_packed_math_16bit
 | ||
| - ac/nir: use more types from ac_llvm_context
 | ||
| - ac/nir: support vector types in the type suffix of overloaded intrinsics
 | ||
| - ac/nir: remove type and num_channels args from ac_build_buffer_store_common
 | ||
| - ac/nir: support 16-bit data in buffer_load_format opcodes
 | ||
| - ac/nir: support 16-bit data in image opcodes
 | ||
| - ac/nir: handle nir_op_[fiu]2[fiu]mp opcodes
 | ||
| - ac/nir: select v_cvt_pkrtz for all conversions from f32 to f16 for radeonsi
 | ||
| - ac/nir: set the second v_cvt_pkrtz argument to undef if it's unused
 | ||
| - ac/nir: support v2f16 derivatives
 | ||
| - nir: don't count samplers and images in interface blocks
 | ||
| - nir: gather which images are buffers
 | ||
| - nir: gather which images are MSAA
 | ||
| - radeonsi: remove unused leftover code for INDIRECT_BUFFER inside IBs
 | ||
| - radeonsi: remove const_buffers_declared hacks
 | ||
| - radeonsi: pass at most 3 images and/or shader buffers via user SGPRs for compute
 | ||
| - radeonsi: add a hack to disable TRUNC_COORD for shadow samplers
 | ||
| - gallium/u_vbuf: get rid of some pointer dereferences
 | ||
| - gallium/u_vbuf: add a faster path for uploading non-interleaved attribs
 | ||
| - glthread: sync in glFlush for multiple contexts
 | ||
| - radeonsi: enable ARB_sparse_buffer
 | ||
| - ac,radeonsi: replace == GFX10 with >= GFX10 where it's needed
 | ||
| - ac,radeonsi: start adding support for gfx10.3
 | ||
| - ac/surface: add displayable DCC code for gfx10.3
 | ||
| - radeonsi: honor a user-specified pitch on gfx10.3
 | ||
| - radeonsi: enable larger SDMA clears and copies on gfx10.3
 | ||
| - radeonsi: implement R9G9B9E5 render target and image store support on gfx10.3
 | ||
| - radeonsi: move L2_CACHE_CONTROL registers into si_emit_framebuffer_state
 | ||
| - radeonsi: set BIG_PAGE fields on gfx10.3
 | ||
| - radeonsi: don't set any XNACK options on gfx10.3
 | ||
| - ac: align num_vgprs for gfx10.3
 | ||
| - radeonsi: add support for Sienna Cichlid
 | ||
| - radeonsi: require LLVM 11 for gfx10.3
 | ||
| - ac/surface: don't recompute the DCC retile map for imported textures
 | ||
| - amd/addrlib: don't recompute DCC info for every ComputeDccAddrFromCoord call
 | ||
| - amd/addrlib: remove unused members of ADDR2_COMPUTE_DCC_ADDRFROMCOORD_INPUT
 | ||
| - ac/surface: add a wrapper structure to hold ADDR_HANDLE
 | ||
| - ac/surface: cache DCC retile maps (v2)
 | ||
| - amd/addrlib: fix the C++ one definition rule violation
 | ||
| - ac/surface: don't set is_displayable if displayable DCC is missing
 | ||
| - ac/surface: require that gfx8 doesn't have DCC in order to be displayable
 | ||
| - ac/surface: enable DCC for the first level in the mip tail on gfx10
 | ||
| - ac/surface: don't free dcc_retile_map on failure
 | ||
| - radeonsi: compact MRTs to save PS export memory space
 | ||
| - ac/nir: fix 64-bit division for GL CTS
 | ||
| - glapi: fix incorrect param names in ARB_vertex_attrib_binding functions
 | ||
| - glthread: rename non_vbo_attrib_mask -> user_buffer_mask, attribs -> buffers
 | ||
| - glthread: handle ARB_vertex_attrib_binding
 | ||
| - radeonsi: don't wait for idle at the end of gfx IBs
 | ||
| - radeonsi: replace ctx->screen with sscreen in si_flush_gfx_cs
 | ||
| - glsl,driconf: add allow_glsl_120_subset_in_110 for SPECviewperf13
 | ||
| - driconf: add workarounds for SPECviewperf13
 | ||
| - amd: add proper definitions for NOP packets
 | ||
| - ac,winsys/amdgpu: align IBs the same as the kernel
 | ||
| - radeonsi: don't add the border color buffer into the init_config state
 | ||
| - radeonsi: rename init_config states to cs_preamble states
 | ||
| - radeonsi: don't add the tess ring buffers into the cs_preamble state
 | ||
| - radeonsi: make wait_mem_scratch unmappable
 | ||
| - radeonsi: disallow adding BOs into si_pm4_state except 1 shader BO per state
 | ||
| - radeonsi: make si_pm4_cmd_begin/end static and simplify all usages
 | ||
| - radeonsi: clear per-context buffers at the end of si_create_context
 | ||
| - radeonsi: remove tabs
 | ||
| - radeonsi: don't flush in fence_server_sync
 | ||
| - ac/gpu_info: fix num_physical_sgprs_per_simd for gfx10
 | ||
| - radeonsi: fix NGG culling for Wave64
 | ||
| - radeonsi: always use Wave32 for GS fast launch, because Wave64 hangs
 | ||
| - radeonsi: always use Wave64 for HS/GS/VS shader stages (except GS fast launch)
 | ||
| - radeonsi: don't try to enable NGG culling for GS
 | ||
| - radeonsi: add a debug option to enable NGG culling for tessellation
 | ||
| - glsl: make print_type non-static for debugging
 | ||
| - glsl: print precision qualifiers in IR dumps
 | ||
| - glsl: print constant initializers
 | ||
| - glsl: fix the type of ir_constant_data::u16
 | ||
| - glsl: fix evaluating float16 constant expression matrices
 | ||
| - glsl: run validate_ir_tree if GLSL_VALIDATE=1 regardless of the build config
 | ||
| - glsl: validate more stuff
 | ||
| - glsl: convert reusable lower_precision util code into helper functions
 | ||
| - glsl: remove the return type from lower_precision
 | ||
| - glsl: cleanups in lower_precision
 | ||
| - glsl: flatten a tautological conditional in lower_precision
 | ||
| - glsl: don't lower precision of textureSize
 | ||
| - glsl: don't lower builtins to mediump that don't allow it
 | ||
| - glsl: lower builtins to mediump that ignore precision of certain parameters
 | ||
| - glsl: lower builtins to mediump that always return mediump or lowp
 | ||
| - glsl: add capability to lower mediump array types
 | ||
| - glsl: lower mediump temporaries to 16 bits except structures (v2)
 | ||
| - gallium: add PIPE_SHADER_CAP_GLSL_16BIT_TEMPS for LowerPrecisionTemporaries
 | ||
| - Revert "ac/surface: require that gfx8 doesn't have DCC in order to be displayable"
 | ||
| - glsl: don't validate array types in ir_dereference_variable
 | ||
| - radeonsi: prevent a gfx10_ngg_calculate_subgroup_info failure for TES+NGG GS
 | ||
| - radeonsi: add missing initialization of registers
 | ||
| - radeonsi/gfx10: set the correct value for OFFCHIP_BUFFERING
 | ||
| - radeonsi: sort registers in si_emit_initial_compute_regs according to GPU gen
 | ||
| - radeonsi: sort registers in si_init_cs_preamble_state according to GPU gen
 | ||
| - ac: add helper ac_get_register_name
 | ||
| - ac: add tables for CP register shadowing
 | ||
| - winsys/amdgpu: make amdgpu_bo_unmap non-static
 | ||
| - radeonsi: make cs_preamble_state optional
 | ||
| - radeonsi: reorder code in update_gs_ring_buffers and init_tess_factor_ring
 | ||
| - radeonsi: implement CP register shadowing
 | ||
| - radeonsi: add reg shadowing codepaths to GS and tess ring setup
 | ||
| - radeonsi: add debug code for register shadowing
 | ||
| - radeonsi: don't restore states at the beginning of IBs if they're shadowed
 | ||
| - radeonsi: set up IBs for preemption
 | ||
| - radeonsi: enable preemption if the kernel enabled it
 | ||
| - amd: rename SIENNA -> SIENNA_CICHLID
 | ||
| - amd: add support for Navy Flounder
 | ||
| - amd: enable displayable DCC for everything newer than Navi1x
 | ||
| - radeonsi: disable SDMA on gfx9
 | ||
| - radeonsi: reorder NIR optimizations
 | ||
| - radeonsi: call nir_split_array_vars/shrink_vec_array_vars/opt_find_array_copies
 | ||
| - glsl: lower_precision - fix assertion failure with dereferences of constants
 | ||
| - glsl: fix constant expression evaluation for 16-bit types
 | ||
| - glsl: don't lower atomic functions to mediump
 | ||
| - glsl: don't create conversion opcodes for array types
 | ||
| - glsl: don't lower to mediump for desktop OpenGL
 | ||
| - glsl: improve precision determination for calls
 | ||
| - Revert "radeonsi: honor a user-specified pitch on gfx10.3"
 | ||
| - radeonsi: use correct wave size in gfx10_ngg_calculate_subgroup_info
 | ||
| - radeonsi: use the same units for esgs_ring_size and ngg_emit_size
 | ||
| - radeonsi: increase minimum NGG vertex count requirement per workgroup on gfx 10.3
 | ||
| - radeonsi: fix applying the NGG minimum vertex count requirement
 | ||
| - radeonsi: don't count unusable vertices to the NGG LDS size
 | ||
| - radeonsi: add a common function for getting the size of gs_ngg_scratch
 | ||
| - radeonsi: remove the NGG hack decreasing LDS usage to deal with overflows
 | ||
| - radeonsi: various fixes for gfx10.3
 | ||
| - radeonsi: disable NGG culling on gfx10.3 because of hangs
 | ||
| - st/mesa: don't generate NIR for ARB_vp/fp if NIR is not preferred
 | ||
| - radeonsi: fix tess levels coming as scalar arrays from SPIR-V
 | ||
| - gallivm: fix build on LLVM 12 due to LLVMAddConstantPropagationPass removal
 | ||
| - ac/llvm: fix unaligned VS input loads on gfx10.3
 | ||
| - Revert "ac: generate FMA for inexact instructions for radeonsi"
 | ||
| 
 | ||
| Marek Vasut (3):
 | ||
| 
 | ||
| - etnaviv: Disable seamless cube map on GC880
 | ||
| - etnaviv: Remove etna_resource_get_status()
 | ||
| - etnaviv: Add lock around pending_ctx
 | ||
| 
 | ||
| Mario Kleiner (1):
 | ||
| 
 | ||
| - vulkan/wsi: Really terminate DRM lease in wsi_release_display().
 | ||
| 
 | ||
| Mathias Fröhlich (2):
 | ||
| 
 | ||
| - st/mesa: Move _NEW_FRAG_CLAMP to NewFragClamp driver flag.
 | ||
| - mesa: set _NEW_FRAG_CLAMP only when needed
 | ||
| 
 | ||
| Matt Turner (22):
 | ||
| 
 | ||
| - intel/compiler: Drop opt_sampler_eot()
 | ||
| - intel/tools: Remove unnecessary reg number checking
 | ||
| - intel/tools: Drop srctype from ipreg
 | ||
| - intel/tools: Require explicit regions/types for special regs
 | ||
| - intel/tools: Disallow control subregisters > 3
 | ||
| - intel/tools: Add assembler tests for the cr0 register
 | ||
| - intel/compiler: Add assert that set bits are within mask
 | ||
| - intel/compiler: Don't emit no-op cr0 changes
 | ||
| - intel/tools: Fix typos
 | ||
| - intel/tools: Remove stray newline
 | ||
| - intel/tools: Don't allow empty type specifier
 | ||
| - intel/tools: Simplify register type handling
 | ||
| - intel/tools: Make swizzle an integer
 | ||
| - intel/tools: Make writemask an integer
 | ||
| - intel/tools: Simplify immediate handling
 | ||
| - intel/tools: Simplify dstregion
 | ||
| - intel/compiler: Relax SENDS regioning assertions
 | ||
| - intel/tools: Pass integers, not enums, to stride()
 | ||
| - intel/tools: Manually set ARF register file/nr/subnr
 | ||
| - intel/tools: Don't hardcode notification register
 | ||
| - intel/tools: Simplify notification register handling
 | ||
| - intel/tools: Test notification subregisters
 | ||
| 
 | ||
| Mauro Rossi (17):
 | ||
| 
 | ||
| - android: iris: add iris_seqno.{c,h} to Makefile.sources
 | ||
| - freedreno/drm: android: add libfreedreno_registers static dependency
 | ||
| - freedreno: android: add adreno-pm4-pack.xml.h generation to android build
 | ||
| - android: util: fix build for GL4.1 support
 | ||
| - android: svga: fix build for GL4.1 support
 | ||
| - android: aco: add aco_ir.cpp to Makefile.sources
 | ||
| - android: nvir/gv100: update sources in Makefile.sources
 | ||
| - android: freedreno: add fd5_layout.c to Makefile.sources
 | ||
| - android: freedreno/ir3: add missing generated sources and rules
 | ||
| - android: freedreno/ir3: simplify generated sources rules
 | ||
| - android: panfrost/encoder: add libmesa_nir static dependency
 | ||
| - radv: fix build on Android 7 (v2)
 | ||
| - android: freedreno/registers: fix generated headers rules
 | ||
| - android: freedreno/ir3: fix include paths
 | ||
| - android: freedreno/common: add support for libfreedreno_common static
 | ||
| - android: freedreno: move a2xx disasm out of gallium
 | ||
| - android: freedreno/common: add libmesa_git_sha1 static dependency
 | ||
| 
 | ||
| Michel Dänzer (38):
 | ||
| 
 | ||
| - gitlab-ci: Use YAML anchor for llvmpipe paths in virgl rules
 | ||
| - gitlab-ci: Update to current templates
 | ||
| - gitlab-ci: Move down container_pre_build.sh invocation in x86_build.sh
 | ||
| - gitlab-ci: Add Debian testing repository for x86_build image
 | ||
| - gitlab-ci: Install WINE from Debian testing
 | ||
| - gitlab-ci: Move lib{drm,pciaccess}-dev cross packages out of loop
 | ||
| - gitlab-ci: Install g++-mingw-w64-x86-64-win32 instead of mingw-w64
 | ||
| - Revert "ac,radeonsi: fix compilations issues with LLVM 11"
 | ||
| - Revert "gallium/gallivm: fix compilation issues with llvm 11"
 | ||
| - gitlab-ci: Enable -Werror in `meson-s390x` job
 | ||
| - gitlab-ci: Also list arm/x86_build in needs: of test jobs
 | ||
| - gitlab-ci: x86_test-base image as common base for x86_test-gl/vk
 | ||
| - gitlab-ci: Pull in GCC 9 from Debian testing in x86_test-gl/vk images
 | ||
| - gitlab-ci: Move LLVM/clang 6/7 packages to the x86_build_old image
 | ||
| - gitlab-ci: Use Debian 10 wine-development packages
 | ||
| - gitlab-ci: Stop using packages from Debian testing
 | ||
| - gitlab-ci: Move meson back to x86_test-gl/vk ephemeral packages lists
 | ||
| - gitlab-ci: Add x86_build-base docker image
 | ||
| - gitlab-ci: Use separate docker images for cross builds
 | ||
| - loader/dri3: Add dri3_wait_for_event_locked full_sequence out parameter
 | ||
| - loader/dri3: Use dri3_wait_for_event_locked in loader_dri3_wait_for_msc
 | ||
| - loader/dri3: Check for window destruction in dri3_wait_for_event_locked
 | ||
| - gitlab-ci: Automatically run pipelines for Marge Bot pre-merge only
 | ||
| - gitlab-ci: Use rules: instead of except:/only: for test-docs job
 | ||
| - gitlab-ci: Extend .ci-run-policy template for docs jobs
 | ||
| - gitlab-ci: Do not create the "success" job when the test-docs job exists
 | ||
| - ci: Use "when: always" for pages job
 | ||
| - ci: Move deploy stage between container & build stages
 | ||
| - Revert "loader/dri3: Check for window destruction in dri3_wait_for_event_locked"
 | ||
| - gitlab-ci: Remove indirect dependencies from needs:
 | ||
| - gitlab-ci: Drop dependencies:
 | ||
| - Revert https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4580
 | ||
| - gitlab-ci: Fix "triggered by Marge for a merge request" rule
 | ||
| - gitlab-ci: Only trigger test-docs job automatically for MRs
 | ||
| - ci: Use FDO_CI_CONCURRENT in run-shader-db.sh as well
 | ||
| - ci: Do not mark container / pages jobs as interruptible
 | ||
| - ci: Use half as many parallel softpipe / virgl test jobs
 | ||
| - ci: Use ignore_scheduled_pipelines anchor in .radeonsi-rules
 | ||
| 
 | ||
| Michel Zou (1):
 | ||
| 
 | ||
| - swr: fix build with mingw
 | ||
| 
 | ||
| Mike Blumenkrantz (73):
 | ||
| 
 | ||
| - zink: explicitly zero some arrays in ntv
 | ||
| - zink: add SpvId returns to a couple ntv functions
 | ||
| - zink: flush active queries on destroy and free query object
 | ||
| - zink: fix vkCmdResetQueryPool usage
 | ||
| - zink: reset query on-demand when beginning a new query from resume
 | ||
| - zink: always use logical eq ops in ntv with 1bit inputs
 | ||
| - zink: track program usages for each shader
 | ||
| - zink: emit interpolation decorations for ntv outputs
 | ||
| - zink: handle more glsl->spirv builtin translation
 | ||
| - zink: rework input/output location emission
 | ||
| - zink: use '2' variants for device props/feats, check features for ext enabling
 | ||
| - zink: add spirv builder util functions for emitting xfb decorations
 | ||
| - zink: add spirv_builder methods for OpVectorExtractDynamic and OpVectorInsertDynamic
 | ||
| - zink: implement streamout and xfb handling in ntv
 | ||
| - zink: implement transform feedback support to finish off opengl 3.0
 | ||
| - zink: set PIPE_CAP_VIEWPORT_TRANSFORM_LOWERED and remove POS special casing
 | ||
| - zink: switch to passing VkPhysicalDeviceFeatures2 in VkDeviceCreateInfo
 | ||
| - zink: enable xfb extension in screen creation
 | ||
| - zink: use int assignment for vk int type
 | ||
| - zink: use correct define value for reserved slot count in ntv
 | ||
| - zink: clamp VkImageCreateInfo.arrayLayers to 1 for image resource creation
 | ||
| - zink: unify code for setting resource barriers
 | ||
| - zink: handle signed and unsigned min/max ops in ntv
 | ||
| - zink: add ult handling for ntv
 | ||
| - zink: add bitfield_reverse handling to ntv
 | ||
| - zink: lower byte/word extract ops in nir
 | ||
| - zink: handle ixor in ntv
 | ||
| - zink: handle isign alu in ntv
 | ||
| - zink: set lower_mul_high and lower_rotate in ntv compiler options
 | ||
| - zink: use OpFUnordNotEqual for nir_op_fne
 | ||
| - zink: set lower_uadd_carry in nir options
 | ||
| - zink: implement Vk_EXT_index_type_uint8
 | ||
| - nir: add lowering pass for clip plane enabling
 | ||
| - st/program: use nir_lower_clip_disable instead of nir_lower_clip_vs conditionally
 | ||
| - nir: add lowering pass for fragcolor -> fragdata
 | ||
| - zink: translate gl_FragColor to gl_FragData before ntv to fix multi-rt output
 | ||
| - u_prim_restart: handle user buffers in util_translate_prim_restart_ib()
 | ||
| - nir: allow nir_lower_point_size_mov to run in geometry shader
 | ||
| - nir: allow nir_lower_clip_halfz to run in geometry shaders
 | ||
| - zink: rework query handling
 | ||
| - zink: use #define for number of queries per-pool
 | ||
| - zink: only stall during query destroy for xfb queries
 | ||
| - zink: properly handle query pool overflows
 | ||
| - zink: only reset query pool on query end if current batch isn't in renderpass
 | ||
| - zink: use right vulkan type for GL_PRIMITIVES_GENERATED queries
 | ||
| - zink: handle ntv case of nested loop instructions more permissively
 | ||
| - zink: add lengthy comment and remove assert from discard_if ntv pass
 | ||
| - zink: use type of src[0] for ntv store and load ops
 | ||
| - zink: try copy_region hook for blits where we can't do a regular blit or resolve
 | ||
| - zink: block vkCmdBlitImage usage for multi sampled blits
 | ||
| - zink: block resolve blits for depth/stencil buffers
 | ||
| - zink: handle empty attachments
 | ||
| - zink: try to handle multisampled null buffers
 | ||
| - zink: enable tgsi texcoord pipe cap
 | ||
| - zink: destroy gfx program when a shader is freed
 | ||
| - zink: destroy descriptor pools on context destroy
 | ||
| - zink: free pipeline cache during program destroy
 | ||
| - zink: free all ntv allocations after creating shader module
 | ||
| - zink: use helper function to handle uvec/bvec types
 | ||
| - zink: handle texelFetchOffset with offsets
 | ||
| - zink: add some asserts for building access chains in ntv
 | ||
| - zink: omit Lod image operand in ntv when not using an image texture dim
 | ||
| - nir: allow lower_psiz_mov to run in tessellation stages
 | ||
| - nir\_ allow nir_lower_clip_halfz to run in tess eval shader
 | ||
| - u_prim_restart: handle indirect draws
 | ||
| - zink: add extension loading framework for spirv builder
 | ||
| - zink: implement VK_EXT_robustness2
 | ||
| - zink: clamp PIPE_SHADER_CAP_MAX_SHADER_BUFFERS to PIPE_MAX_SHADER_BUFFERS
 | ||
| - zink: handle VK_EXT_vertex_attribute_divisor setup
 | ||
| - zink: store valid timestamp bits onto zink_screen
 | ||
| - zink: implement handling for VK_EXT_calibrated_timestamps
 | ||
| - u_prim_restart: add inline function for getting restart index based on index size
 | ||
| - zink: reorder create_stream_output_target to fix failure case leak
 | ||
| 
 | ||
| Miklós Máté (1):
 | ||
| 
 | ||
| - docs: add some missing stuff to sourcetree.rst
 | ||
| 
 | ||
| Nanley Chery (18):
 | ||
| 
 | ||
| - iris: Drop can_fast_clear_color's format parameter
 | ||
| - iris: Remove the CCS_D fallback
 | ||
| - iris: Avoid fast-clear with incompatible view
 | ||
| - iris: Disable sRGB fast-clears for non-0/1 values
 | ||
| - intel: Add ISL_AUX_USAGE_GEN12_CCS_E
 | ||
| - iris: Don't support sRGB + Y_TILED_CCS on gen9
 | ||
| - iris: Use ISL_AUX_USAGE_GEN12_CCS_E on gen12
 | ||
| - isl/drm: Support I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS
 | ||
| - gallium/dri2: Support I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS
 | ||
| - iris: Handle importing aux-enabled surfaces on TGL
 | ||
| - iris: Refactor modifier_is_supported for gen12
 | ||
| - iris: Support I915_FORMAT_MOD_Y_TILED_GEN12_RC_CCS
 | ||
| - iris: Zero the add-on clear color BO on import
 | ||
| - dri_util: Update internal_format to GL_RGB8 for MESA_FORMAT_B8G8R8X8_UNORM
 | ||
| - iris: Don't call SET_TILING for dmabuf imports
 | ||
| - gallium/dri2: Report correct YUYV and UYVY plane count
 | ||
| - iris: Fix aux assertion in resource_get_handle
 | ||
| - blorp: Fix alignment test for HIZ_CCS_WT fast-clears
 | ||
| 
 | ||
| Nataraj Deshpande (3):
 | ||
| 
 | ||
| - anv: Limit vulkan version to 1.1 for Android
 | ||
| - anv: Disable extensions based on Android versions
 | ||
| - dri_util: Update internal_format to GL_RGB8 for MESA_FORMAT_R8G8B8X8_UNORM
 | ||
| 
 | ||
| Neha Bhende (6):
 | ||
| 
 | ||
| - util: Initialize pipe_shader_state for passthrough and transform shaders
 | ||
| - util: Add util functionality for GL4.1 support
 | ||
| - winsys/drm: Add GL4.1 support in drm winsys
 | ||
| - svga/include: Headers for GL4.1 support
 | ||
| - svga: Add GL4.1(compatibility profile) support in svga driver
 | ||
| - svga: Performance fixes
 | ||
| 
 | ||
| Neil Armstrong (2):
 | ||
| 
 | ||
| - Revert "CI: Disable Lima jobs due to lab unhealthiness"
 | ||
| - Revert "CI: Disable Panfrost Mali-T820 jobs"
 | ||
| 
 | ||
| Neil Roberts (26):
 | ||
| 
 | ||
| - nir/scheduler: Handle nir_intrinsic_load_per_vertex_input
 | ||
| - v3d: Remove unused member of v3d_compile
 | ||
| - nir/schedule: Store a pointer to the scoreboard in nir_deps_state
 | ||
| - nir/scheduler: Add an option to specify what stages share memory for I/O
 | ||
| - v3d: Let scheduler know GS doesn’t have shared I/O memory
 | ||
| - gallium: Add pipe cap for primitive restart with fixed index
 | ||
| - mesa: Add PrimitiveRestartFixedIndex to gl_constants
 | ||
| - v3d: Disable PIPE_CAP_PRIMITIVE_RESTART
 | ||
| - v3d: Add missing macro for stvpmd instruction
 | ||
| - v3d: Use stvpmd for non-uniform offsets in GS
 | ||
| - compiler: Add a system value for the line coord
 | ||
| - v3d: Implement the line coord intrinsic
 | ||
| - nir: Add intrinsics for the line width
 | ||
| - v3d: Handle the line width intrinsics
 | ||
| - v3d: Add a lowering pass for line smoothing
 | ||
| - v3d: Enable perpendicular line caps when line smoothing
 | ||
| - broadcom/qpu: set VC5_QPU_RADDR_A out of the switch at _pack_branch
 | ||
| - v3d/compiler: Fix sorting the gs and fs inputs
 | ||
| - v3d/compiler: Lower geometry output store base into offset src
 | ||
| - nir/scheduler: Move nir_scheduler to its own header
 | ||
| - nir/schedule: Store a pointer to the options struct in scoreboard
 | ||
| - nir/schedule: Add a callback for backend-specific dependencies
 | ||
| - v3d: Mark scheduling dependency for prim id and first output
 | ||
| - nir/schedule: Add an option for a fallback scheduling algorithm
 | ||
| - v3d: Changed v3d_compile:failed to an enum
 | ||
| - v3d: Retry with the fallback scheduler when RA fails
 | ||
| 
 | ||
| Oschowa (5):
 | ||
| 
 | ||
| - radv: Don't take absolute value of unsigned type.
 | ||
| - aco: Don't declare 'Block' as class, but define as struct.
 | ||
| - aco: Don't std::move temporary object.
 | ||
| - aco: Use correct reference type in for-range-loop.
 | ||
| - radv: Explicitly cast TIMESTAMP_NOT_READY value to uin32_t where needed.
 | ||
| 
 | ||
| Pablo Saavedra (5):
 | ||
| 
 | ||
| - ci: TRACES_DB_PATH and RESULTS_PATH defined as relative paths
 | ||
| - ci: ArgumentParser receives the args from the main parameters
 | ||
| - ci: Migrate tracie tests done in shell script to pytest
 | ||
| - ci: Split test_tracie_skips_traces_without_checksum in separate cases
 | ||
| - ci: Fix TypoError error when traces in traces.yml is an empty list
 | ||
| 
 | ||
| Pavel Asyutchenko (1):
 | ||
| 
 | ||
| - vulkan/overlay: fix crash on destroying NULL swapchain
 | ||
| 
 | ||
| Peter Seiderer (3):
 | ||
| 
 | ||
| - vc4_bufmgr: fix time_t printf
 | ||
| - pan_bo.h: add time.h include for time_t
 | ||
| - v3d_bufmgr: fix time_t printf
 | ||
| 
 | ||
| Pierre Moreau (4):
 | ||
| 
 | ||
| - clover/nir: Check the result of spirv_to_nir
 | ||
| - clover/api: Address missing braces for subobj init
 | ||
| - clover: Address unnecessary copy warnings
 | ||
| - clover/spirv: Remove unused tuple header
 | ||
| 
 | ||
| Pierre-Eric Pelloux-Prayer (62):
 | ||
| 
 | ||
| - radeonsi: fix export count
 | ||
| - mesa: add gl_coontext::ForceIntegerTexNearest
 | ||
| - driconf: add force_integer_tex_nearest option
 | ||
| - radeonsi: add workaround for issue 2647
 | ||
| - radeonsi: don't print gs_copy_shader stats for shaderdb
 | ||
| - glsl: init gl_FragColor if zero_init=true
 | ||
| - glsl: rework zero initialization
 | ||
| - glsl: add a is_implicit_initializer flag
 | ||
| - mesa: extend GLSLZeroInit semantics
 | ||
| - gallium: add a new cap PIPE_CAP_GLSL_ZERO_INIT
 | ||
| - ac/nir: export some undef as zero
 | ||
| - ac/surface: remove shadowing declaration
 | ||
| - amdgpu/radeon: add secure api
 | ||
| - radeonsi: add AMD_DEBUG=tmz option
 | ||
| - radeon: add RADEON_CREATE_ENCRYPTED flag
 | ||
| - radeonsi: allocate framebuffer texture as secure when using tmz
 | ||
| - amdgpu: add encrypted slabs support
 | ||
| - radeonsi: force using staging texture when uploading to secure texture
 | ||
| - radeonsi/sdma: implement tmz support
 | ||
| - gallium: PIPE_RESOURCE_FLAG_ENCRYPTED
 | ||
| - radeonsi: add support for PIPE_RESOURCE_FLAG_ENCRYPTED
 | ||
| - amdgpu: use AMDGPU_IB_FLAGS_SECURE when requested
 | ||
| - radeonsi: determine secure flag must be set for gfx IB
 | ||
| - radeonsi: do not use cmask with encrypted texture
 | ||
| - amd/addrlib: fix forgotten char -> enum conversions
 | ||
| - radeonsi: fix inversed arguments in si_test_gds_memory_management
 | ||
| - amdgpu: fix unitialized variable
 | ||
| - radeonsi/sdma: remove useless compare
 | ||
| - radeonsi/drirc: enable zerovram option for 7 Days to Die
 | ||
| - winsys/radeon: do not cast bo->va as void*
 | ||
| - radeonsi: add return value to gfx10_ngg_calculate_subgroup_info
 | ||
| - radeonsi/ngg: try GS multi-cycling mode if default mode failed
 | ||
| - ac/surface: set SCANOUT if surf->is_displayable
 | ||
| - ac/surface: fix epitch when modifying surf_pitch
 | ||
| - ac/llvm: load 1 byte at a time if unaligned on gfx10
 | ||
| - st/mesa: make texture views inherit compressed_data storage
 | ||
| - radeonsi: bump SI_NUM_SHADER_BUFFERS to 32
 | ||
| - st/mesa: do not clear NewDriverState for inactive states
 | ||
| - glsl: reject size1x8 for image variable with floating-point data types
 | ||
| - ac/llvm: remove the -1 hack from ac_atomic_inc_wrap
 | ||
| - glsl: don't expose imageAtomicIncWrap for signed image
 | ||
| - glsl: only allow 32 bits atomic operations on images
 | ||
| - glsl: declare gl_Layer/gl_ViewportIndex/gl_ViewportMask as vs builtins
 | ||
| - st/mesa: set compressed_data to NULL when freed
 | ||
| - bin/symbols-check.py: add --ignore-symbol argument
 | ||
| - ac/llvm: export ac_init_llvm_once in targets
 | ||
| - mesa: rename _mesa_free_errors_data
 | ||
| - mesa: add bool param to _mesa_free_context_data
 | ||
| - mesa/st: release debug_output after destroying the context
 | ||
| - ac/surface: adapt surf_size when modifying surf_pitch
 | ||
| - radeonsi: adjust epitch for PIPE_FORMAT_R8G8_R8B8_UNORM
 | ||
| - radeonsi: extend workaround for KHR-GL45.texture_view.view_classes on gfx9
 | ||
| - ac/llvm: handle static/shared llvm init separately
 | ||
| - mesa/st: introduce PIPE_CAP_NO_CLIP_ON_COPY_TEX
 | ||
| - radeonsi: enable PIPE_CAP_NO_CLIP_ON_COPY_TEX
 | ||
| - ac/llvm: add option to clamp division by zero
 | ||
| - radeonsi,driconf: add clamp_div_by_zero option
 | ||
| - radeonsi: use radeonsi_clamp_div_by_zero for SPECviewperf13, Road Redemption
 | ||
| - glsl: fix per_vertex_accumulator::fields size
 | ||
| - r600/uvd: set dec->bs_ptr = NULL on unmap
 | ||
| - radeon/vcn: set dec->bs_ptr = NULL on unmap
 | ||
| - mesa: fix glUniform* when a struct contains a bindless sampler
 | ||
| 
 | ||
| Pierre-Loup A. Griffais (2):
 | ||
| 
 | ||
| - radv: fix null descriptor for dynamic buffers
 | ||
| - radv: fix vertex buffer null descriptors
 | ||
| 
 | ||
| Qiang Yu (6):
 | ||
| 
 | ||
| - radeonsi: remove emacs style config file
 | ||
| - panfrost: don't always build bifrost_compiler
 | ||
| - radeonsi: fix syncobj wait timeout
 | ||
| - radeonsi: fix user fence space when MCBP is enabled
 | ||
| - radeonsi: fix max syncobj wait timeout
 | ||
| - radeonsi: fix user fence GPU address
 | ||
| 
 | ||
| Rafael Antognolli (8):
 | ||
| 
 | ||
| - intel: Store the aperture size in devinfo.
 | ||
| - intel/isl: Update mocs for DG1
 | ||
| - intel/l3: Return the URB size from devinfo for DG1
 | ||
| - intel/devinfo: Add function to check for DRM_I915_GEM_GET_TILING.
 | ||
| - iris/bufmgr: Do not use map_gtt or use set/get_tiling on DG1
 | ||
| - anv/dg1: Don't use SET_TILING kernel uapi.
 | ||
| - iris: Align last_seqnos to 64 bits.
 | ||
| - anv: Align "used" attribute to 64 bits.
 | ||
| 
 | ||
| Rhys Kidd (5):
 | ||
| 
 | ||
| - nv50_2d: regenerate envytools-based rnndb headers
 | ||
| - nv50_2d,nvc0_2d: Document SET_PIXELS_FROM_MEMORY_SAFE_OVERLAP from rnndb
 | ||
| - nvc0_2d: Document SET_PIXELS_FROM_MEMORY_CORRAL_SIZE from rnndb
 | ||
| - nvc0: fix macro define for NVE4_COPY()
 | ||
| - nvc0: add documentation for nve4+ (Kepler) COPY class
 | ||
| 
 | ||
| Rhys Perry (174):
 | ||
| 
 | ||
| - aco: remove use of f-strings
 | ||
| - aco: add message to static_assert
 | ||
| - nir: add missing group_memory_barrier handling
 | ||
| - compiler/spirv: flag nclamp/nmin/nmax as exact
 | ||
| - nir: make fsat return 0.0 with NaN instead of passing it through
 | ||
| - docs: add src/amd/ to sourcetree.html
 | ||
| - docs/envvars: document ACO_DEBUG
 | ||
| - docs/envvars: update RADV_FORCE_FAMILY
 | ||
| - aco: simplify consecutive ordered vmem/lds writes optimization
 | ||
| - aco: fix consecutively written vgprs from vmem instructions
 | ||
| - aco: mark phi definitions as last-seen phi operands
 | ||
| - aco: consider affinities when creating v_mac_f32
 | ||
| - aco: improve phi affinities with p_split_vector
 | ||
| - aco: split operations that use a swap's definition
 | ||
| - aco: fix disassembly with LLVM 11
 | ||
| - nir/opt_if: run opt_peel_loop_initial_if after all other optimizations
 | ||
| - nir/opt_if: use nir_src_as_bool in opt_peel_loop_initial_if helper
 | ||
| - aco: fix typo in insert_waitcnt's kill()
 | ||
| - nir: fix lowering to scratch with boolean access
 | ||
| - aco: fix interaction with 3f branch workaround and p_constaddr
 | ||
| - aco: consider SDWA during value numbering
 | ||
| - aco: check instruction format before waiting for a previous SMEM store
 | ||
| - aco: preserve more fields when combining additions into SMEM
 | ||
| - aco: don't reorder barriers in the scheduler
 | ||
| - aco: fix 64-bit shared_atomic_exchange
 | ||
| - docs: add missing "shader\_" in VK_KHR_shader_subgroup_extended_types
 | ||
| - radv: set keep_statistic_info with RADV_DEBUG=shaderstats
 | ||
| - ac/gpu_info, radv: set max_wave64_per_simd to 20 on GFX10
 | ||
| - aco: use v_xor3_b32
 | ||
| - aco: validate instructions reading/writing upper halves/bytes
 | ||
| - aco: p_extract_vector in 64-bit u2f16/i2f16
 | ||
| - aco: allow reading/writing upper halves/bytes when possible
 | ||
| - aco: prefer 4-byte aligned definitions
 | ||
| - aco: add Info::{operand_size,definition_size}
 | ||
| - aco: use Info::definition_size instead of definition's regclass
 | ||
| - aco: fix moving sub-dword values out of a register for a fixed definition
 | ||
| - aco: use num_opcodes instead of last_opcode
 | ||
| - aco: improve code for f2{i,u}{8,16}
 | ||
| - aco: use p_as_uniform in emit_vop1_instruction
 | ||
| - aco: add and set precise flag
 | ||
| - aco: create mads when signed zeros should be preserved
 | ||
| - aco: try to use fma instead of mad when denormals are enabled
 | ||
| - aco: create 16-bit mad/fma
 | ||
| - aco: update comment about preserving fp16/fp64 denormals
 | ||
| - aco: create 16-bit input and output modifiers
 | ||
| - aco: improve sub-dword check for sgpr/constant propagation
 | ||
| - aco: fix half_pi constant for 16-bit fsin/fcos
 | ||
| - aco: use 32-bit inline constants for 16-bit integer instructions
 | ||
| - aco: improve 8/16-bit constants
 | ||
| - aco: copy-propagate constants through p_extract_vector/p_split_vector
 | ||
| - aco: optimize 16-bit and 64-bit float comparisons
 | ||
| - aco: validate sub-dword pseudo instructions
 | ||
| - aco: add more opcodes to can_swap_operands
 | ||
| - aco: allow GFX9 partial writes with instructions which use opsel
 | ||
| - aco: improve check for moving temporaries out of fixed definitions
 | ||
| - aco: fix encoding of certain s_setreg_imm32_b32 instructions
 | ||
| - aco: fix validation error from vgpr spill/restore code
 | ||
| - aco: fix sub-dword opsel/sdwa checks
 | ||
| - aco: fix validation of opsel when set for the definition
 | ||
| - aco: shrink ssa_info
 | ||
| - aco: make ssa_info::label 64-bit
 | ||
| - aco: shrink mad_info
 | ||
| - aco: fix edge check with sub-dword temporaries
 | ||
| - aco: use the same regclass as the definition for undef phi operands
 | ||
| - radv: add new drirc option radv_no_dynamic_bounds
 | ||
| - radv: enable radv_no_dynamic_bounds for Path of Exile
 | ||
| - radv: enable radv_no_dynamic_bounds for more Path of Exile executables
 | ||
| - nir: slight correction to cube_face_coord constant folding
 | ||
| - spirv: set variables to restrict by default
 | ||
| - radv: fix image variable types in meta shaders
 | ||
| - aco: only use SMEM if we can prove it's safe
 | ||
| - aco: allow SMEM for some sub-dword accesses
 | ||
| - radv/aco,aco: allow SMEM SSBO loads on GFX6/7
 | ||
| - aco: fix copy+paste error in split_buffer_store
 | ||
| - aco: don't store byte-aligned short stores
 | ||
| - aco: add missing bld.scc() in byte_align_scalar()
 | ||
| - aco: don't create byte-aligned short loads
 | ||
| - aco: fix when sub-dword create_vector operand cannot be placed perfectly
 | ||
| - aco: improve vectorization of 8/16-bit loads/stores
 | ||
| - aco: ignore blocked registers when checking edges in get_reg_impl()
 | ||
| - aco: remove outdated assert in handle_operands()
 | ||
| - radv: enable zerovram for Quantic Dream games
 | ||
| - aco: use VOP2 version of v_mbcnt_hi_u32_b32 on GFX6/7
 | ||
| - aco: rework boolean phi pass
 | ||
| - aco: create better code for boolean phis with constant operands
 | ||
| - aco: optimize boolean phis with uniform selections
 | ||
| - aco: don't create phis with undef operands in the boolean phi pass
 | ||
| - aco: read 0 from inactive lanes when using dpp
 | ||
| - aco: optimize some masked swizzles to DPP
 | ||
| - aco: implement <32-bit masked_swizzle_amd
 | ||
| - nir/lower_subgroups: pass options struct to lower_shuffle
 | ||
| - nir/lower_subgroups: add lower_shuffle_to_swizzle_amd
 | ||
| - radv: use lower_shuffle_to_swizzle_amd
 | ||
| - aco: add 32-bit integer addition to can_swap_operands
 | ||
| - aco: fix underestimated pressure in spiller when a phi has a killed def
 | ||
| - aco: rewrite graph coloring in spiller
 | ||
| - aco: use unordered_set for spill id interferences
 | ||
| - aco: add add_interference() helper
 | ||
| - aco: use s_round_mode/s_denorm_mode
 | ||
| - aco: flush denormals before fp16 fabs/fneg if needed
 | ||
| - aco: fix nir_op_f2f16_rtne with non-default rounding modes
 | ||
| - aco: set tcs_in_out_eq=false if float controls of VS and TCS stages differ
 | ||
| - radv: enable more float_controls features
 | ||
| - aco: properly recognize that s_waitcnt mitigates VMEMtoScalarWriteHazard
 | ||
| - aco: use s_waitcnt_depctr to mitigate VMEMtoScalarWriteHazard
 | ||
| - spirv: don't split memory barriers
 | ||
| - nir/lower_int64: lower 64-bit amul
 | ||
| - aco: always set FI on GFX10
 | ||
| - radv: replace discard with demote for Quantic Dream games
 | ||
| - aco: implement b2i8/b2i16
 | ||
| - aco: be more careful combining additions that could wrap into loads/stores
 | ||
| - aco: allow overflow for some SMEM instructions
 | ||
| - aco: add NUW flag
 | ||
| - nir: add nir_unsigned_upper_bound and nir_addition_might_overflow
 | ||
| - aco: use nir_addition_might_overflow to combine additions into SMEM
 | ||
| - aco: move some setup code into helpers
 | ||
| - aco: make validate() usable in tests
 | ||
| - aco: print ACO IR before scheduling instead of after
 | ||
| - radv: fix invalid conversion warnings in vk_format.h
 | ||
| - aco: fix copy of uninitialized boolean
 | ||
| - aco: fix includes in aco_ir.cpp
 | ||
| - aco: add missing add_to_hazard_query
 | ||
| - aco: rework barriers and replace can_reorder
 | ||
| - radv/aco,aco: use scoped barriers
 | ||
| - aco: consider intrinsic access in visit_{load,store}_image
 | ||
| - nir,radv/aco: add and use pass to lower make available/visible barriers
 | ||
| - aco: enable value numbering of s_buffer_load_*
 | ||
| - aco: use storage_scratch
 | ||
| - aco: improve sync_info for TCS output stores
 | ||
| - aco: improve workgroup-scope and lower vmem/smem barriers
 | ||
| - aco: create acq+rel barriers instead of acq/rel
 | ||
| - nir/load_store_vectorize: fix indentation
 | ||
| - ac/nir: implement scoped_barrier
 | ||
| - radv: use scoped barriers
 | ||
| - aco: remove isel for GLSL-style barriers
 | ||
| - aco: add framework for unit testing
 | ||
| - aco: add a few tests for the assembler and optimizer
 | ||
| - aco: add framework for testing isel and integration tests
 | ||
| - ci: enable ACO tests
 | ||
| - aco/tests: add tests for sub-dword swaps
 | ||
| - aco: optimize swizzled SALU 8/16-bit conversions
 | ||
| - aco: fix waitcnt insertion on GFX10.3
 | ||
| - aco: don't create v_mad_f32 on GFX10.3
 | ||
| - aco: update bug workarounds for GFX10_3
 | ||
| - aco: fix max_waves_per_simd on Polaris, VegaM and GFX10.3
 | ||
| - aco: update vgpr_alloc_granule for GFX10.3
 | ||
| - aco: implement subgroup shader_clock on GFX10.3
 | ||
| - aco: update aco_opcodes.py for GFX10.3
 | ||
| - aco: disable SMEM stores on GFX10.3
 | ||
| - aco: replace MADs in isel with FMA on GFX10.3
 | ||
| - spirv: set ACCESS_COHERENT for ssbo/global/image atomic load/store
 | ||
| - radv/aco: enable VK_KHR_memory_model
 | ||
| - ac/nir: consider an image load/store intrinsic's access
 | ||
| - ac/nir: fix coherent global loads/stores
 | ||
| - radv/llvm: enable VK_KHR_memory_model
 | ||
| - aco: fix C++11/C++14 compilation
 | ||
| - aco: set constant_data_offset correctly in the case of merged shaders
 | ||
| - aco: don't move memory accesses to before control barriers
 | ||
| - aco: fix non-rtz pack_half_2x16
 | ||
| - aco: consider branch definitions in spiller
 | ||
| - aco: don't consider the first partial spill if it's the wrong type
 | ||
| - aco: don't fix break condition for break+discard to exec
 | ||
| - aco: fix regclass checks when fixing to vcc/exec with Builder
 | ||
| - aco: fix spills_entry heuristic for branch blocks in init_live_in_vars()
 | ||
| - aco: keep loop live-through variables spilled
 | ||
| - aco: reserve 2 sgprs for each branch
 | ||
| - aco: create long jumps
 | ||
| - aco: fix byte_align_scalar for 3 dword vectors
 | ||
| - aco: fix one-off error in Operand(uint16_t)
 | ||
| - nir/opt_if: fix opt_if_merge when destination branch has a jump
 | ||
| - aco: fix v_writelane_b32 with two sgprs
 | ||
| - aco: don't apply constant to SDWA on GFX8
 | ||
| - radv: initialize with expanded cmask if the destination layout needs it
 | ||
| - radv,aco: fix reading primitive ID in FS after TES
 | ||
| 
 | ||
| Rob Clark (265):
 | ||
| 
 | ||
| - util/simple_mtx: add assert_locked()
 | ||
| - freedreno: add screen lock wrappers
 | ||
| - freedreno: switch to simple_mtx
 | ||
| - freedreno: fix buffer import
 | ||
| - gallium: extract out logicop helper
 | ||
| - freedreno/drm: drop atomic refcnts
 | ||
| - freedreno/drm: inline the things
 | ||
| - freedreno/a6xx: small query cleanup
 | ||
| - freedreno/a6xx: avoid unnecessary clearing VS DP state
 | ||
| - freedreno/a6xx: move const state to single stateobj
 | ||
| - freedreno/a6xx: move scissor state to stateobj
 | ||
| - freedreno/a6xx: limit PROG_FB_RAST state emit
 | ||
| - freedreno/a6xx: limit LRZ state emit
 | ||
| - freedreno/a6xx: move blend-color to stateobj
 | ||
| - freedreno/a6xx: combine sample mask into blend state
 | ||
| - freedreno/a6xx: skip unnecessary MRT blend state
 | ||
| - freedreno/a6xx: add OUT_PKT()
 | ||
| - freedreno/a6xx: convert draw packet to OUT_PKT()
 | ||
| - freedreno/a6xx: split out const emit
 | ||
| - freedreno/ir3: inline const emit
 | ||
| - freedreno/a6xx: convert const emit to OUT_PKT()
 | ||
| - freedreno: scissor vs disabled scissor micro-opt
 | ||
| - freedreno/a6xx: more OUT_REG()
 | ||
| - freedreno: sync registers with envytools
 | ||
| - freedreno/a6xx: don't set SP_FS_CTRL_REG0.VARYING for fragcoord
 | ||
| - freedreno/a6xx: fix LRZ hang
 | ||
| - freedreno/a6xx: add some more formats
 | ||
| - freedreno: we don't need aligned vbo's
 | ||
| - freedreno/a6xx: compressed blit fixes
 | ||
| - freedreno/a6xx: enable tiled compressed textures
 | ||
| - freedreno/gmem: don't assume scissor opt when estimating # of bins
 | ||
| - freedreno: initialize max_scissor
 | ||
| - freedreno/gmem: add div_align() helper
 | ||
| - freedreno/gmem: add helper to dump GMEM layout
 | ||
| - freedreno: add gmemtool
 | ||
| - freedreno/gmem: relax alignment on a6xx
 | ||
| - freedreno/gmem: rework gmem layout algo
 | ||
| - freedreno/ir3: don't allow negative const_offset
 | ||
| - freedreno/ir3: fix indirect cb0 load_ubo lowering
 | ||
| - freedreno/ir3: limit # of tex prefetch by shader size
 | ||
| - freedreno/ir3/postsched: reset sfu_delay on sync
 | ||
| - freedreno/ir3/postsched: try to avoid (sy) syncs
 | ||
| - freedreno/ir3/sched: avoid scheduling outputs
 | ||
| - freedreno/ir3/sched: try to avoid syncs
 | ||
| - freedreno/a6xx: fix max-scissor opt
 | ||
| - freedreno/ir3: use const_index accessors
 | ||
| - nir: fix indices for ir3 ssbo_atomic intrinsics
 | ||
| - nir: add helper to copy const_index[]
 | ||
| - nir: add pass to lower disjoint wrmask's
 | ||
| - freedreno/ir3: use lower_wrmasks pass
 | ||
| - freedreno/fdperf: add dependency on generated headers
 | ||
| - freedreno/drm: don't pass thru 'DUMP' flag on older kernels
 | ||
| - freedreno/drm: handle ancient kernels
 | ||
| - freedreno/ir3: remove Sethi-Ullman numbering pass
 | ||
| - freedreno/ir3: juggle around ir3_debug_print()
 | ||
| - freedreno/ir3/dce: report progress
 | ||
| - freedreno/cf: report progress
 | ||
| - freedreno/ir3/cp: report progress
 | ||
| - freedreno/ir3/deps: report progress
 | ||
| - freedreno/ir3/group: report progress
 | ||
| - freedreno/ir3/legalize: report progress
 | ||
| - freedreno/ir3/postsched: report progress
 | ||
| - freedreno/ir3: add IR3_PASS() macro
 | ||
| - freedreno/ir3: move where we preserve binning pass inputs
 | ||
| - freedreno/ir3: be iterative
 | ||
| - freedreno/ir3: make foreach_src declare cursor ptr
 | ||
| - freedreno/ir3: make foreach_ssa_src declar cursor ptr
 | ||
| - freedreno/ir3: make input/output iterators declare cursor ptr
 | ||
| - freedreno/ir3/group: fix for half-regs
 | ||
| - freedreno/ir3: fix mismatched flags on split
 | ||
| - freedreno/ir3/cf: handle multiple cov's properly
 | ||
| - freedreno/ir3: fix immed type in create_addr0()
 | ||
| - freedreno/ir3/print: print cat2 condition
 | ||
| - freedreno/ir3/cp: fix cmps folding
 | ||
| - freedreno/ir3: fix mismatched wrmask for overlapping VS inputs
 | ||
| - freedreno/ir3: add simple validate pass
 | ||
| - freedreno/ir3: add helpers to deal with src/dst types
 | ||
| - freedreno/ir3/validate: add checking for types and opcodes
 | ||
| - freedreno/drm: disallow exported buffers in bo cache
 | ||
| - freedreno: add batch debugging
 | ||
| - freedreno: clear last_fence after resource tracking
 | ||
| - freedreno: handle PIPE_TRANSFER_MAP_DIRECTLY
 | ||
| - freedreno/gmem: make noscis debug actually do something on a6xx
 | ||
| - freedreno/gmemtool: make GMEM alignment per-gen
 | ||
| - freedreno/gmemtool: add a405
 | ||
| - freedreno/gmemtool: add verbose mode
 | ||
| - freedreno/gmem: add some asserts
 | ||
| - freedreno/gmem: fix nbins_x/y mismatch
 | ||
| - freedreno/gmem: split out helper to calc # of bins
 | ||
| - freedreno/a6xx: LRZ fix for alpha-test
 | ||
| - freedreno/a6xx: document LRZ flag buffer
 | ||
| - freedreno/a6xx: fix vsc assert
 | ||
| - nir: get_base_type() should return enum type
 | ||
| - nir: extract out convert_to_bitsize() helper
 | ||
| - nir/builder: add bitsize conversion helpers
 | ||
| - nir/lower_tex: fixes for fp16 yuv lowering
 | ||
| - freedreno/ir3: split kill from no_earlyz
 | ||
| - freedreno/a6xx: sync registers from envytools
 | ||
| - freedreno/a6xx: update depth-plane control regs
 | ||
| - freedreno/a6xx: re-work LRZ state tracking
 | ||
| - freedreno/a6xx: add early-lrz-late-z mode
 | ||
| - freedreno/a6xx: also consider alpha-test for ztest-mode
 | ||
| - freedreno/a6xx: more early-z
 | ||
| - freedreno/computerator: fix missing dependency on generated header
 | ||
| - nir/print: print tex dest type
 | ||
| - freedreno/ir3: add debug code to print conflicting half-regs
 | ||
| - freedreno/ir3: respect tex prefetch limits
 | ||
| - freedreno/ir3: remove RA "q-values" optimization
 | ||
| - freedreno/ir3: limit pre-fetched tex dest
 | ||
| - freedreno/ir3: unify shader create/delete paths
 | ||
| - freedreno/ir3: move the libdrm dependency out of shared code
 | ||
| - turnip: drop linking libfreedreno_drm
 | ||
| - freedreno/ir3: don't rely on intr->num_components
 | ||
| - radv: don't set num_components for non-vectorized intrinsics
 | ||
| - nir/builder: don't set intr->num_components
 | ||
| - nir/lower-atomics-to-ssbo: don't set num_components
 | ||
| - spriv: don't set num_components for non-vectorised intrinsics
 | ||
| - v3d: don't use intr->num_components for non-vectorized intrinsics
 | ||
| - nir/validate: validate intr->num_components
 | ||
| - freedreno/log-parser: fix compute times
 | ||
| - freedreno/sched: reset delay counters at start of block
 | ||
| - freedreno/ir3/validate: also check instr->address
 | ||
| - freedreno/ir3/cp: properly handle already-folded RELATIV
 | ||
| - freedreno: splitup emit_string_marker
 | ||
| - freedreno/a6xx: emit shader names in debug builds
 | ||
| - freedreno/ir3/legalize: don't allow (nopN) if (rptN)
 | ||
| - freedreno/ir3/print: print (r) flag
 | ||
| - freedreno/ir3: add test for delay slot calculation
 | ||
| - freedreno/ir3/delay: calculate delay properly for (rptN)'d instructions
 | ||
| - freedreno/ir3: add helpers to move instructions
 | ||
| - freedreno/ir3: delay test support for vectorish instructions
 | ||
| - freedreno/ir3/cp: extract valid_flags
 | ||
| - freedreno/ir3: add post-scheduler cp pass
 | ||
| - freedreno/ir3: convert regmask_t to struct
 | ||
| - freedreno/ir3: move mergedreg state out of reg
 | ||
| - freedreno/ir3: decouple regset from gpu gen
 | ||
| - freedreno/ir3: pass variant to postsched
 | ||
| - freedreno/ir3: re-work assembler API
 | ||
| - freedreno/ir3: make mergedregs a property of the variant
 | ||
| - freedreno/a6xx: set .MERGEREGS based on variant
 | ||
| - turnip: set .MERGEDREGS based on variant
 | ||
| - freedreno/computerator: MERGEDREGS update
 | ||
| - freedreno/ir3: update obsolete comment
 | ||
| - spirv: atomic_counter_read_deref is not vectorized
 | ||
| - spirv: drop some dead code
 | ||
| - glsl_to_nir: fix is_helper_invocation
 | ||
| - glsl_to_nir: fix shader_clock
 | ||
| - glsl_to_nir: fix vote_any/vote_all
 | ||
| - freedreno/ir3: refactor out helper to compile shader from asm
 | ||
| - freedreno/ir3: add accessor for const_state
 | ||
| - freedreno/a6xx: defer userconst cmdstream size calculation
 | ||
| - freedreno/ir3: move ubo_state into const_state
 | ||
| - freedreno/ir3: drop shader->num_ubos
 | ||
| - freedreno/ir3: constify shader key
 | ||
| - freedreno/ir3: pass variant to ir3_create()
 | ||
| - freedreno/ir3: convert over to ralloc
 | ||
| - freedreno/ir3: move num_reserved_user_consts out of const_state
 | ||
| - freedreno/ir3: un-embed const_state
 | ||
| - freedreno/ir3: move const_state back to variant
 | ||
| - freedreno/ir3: move output_loc to variant
 | ||
| - freedreno/ir3: split out ubo info from range
 | ||
| - freedreno/ir3: splitup get_existing_range()
 | ||
| - freedreno/ir3: split ubo analysis/lowering passes
 | ||
| - ci: remove some freedreno a6xx skips
 | ||
| - freedreno/ir3: add helper to determine point-coord inputs
 | ||
| - freedreno/a6xx: de-duplicate vinterp/vpsrepl state building
 | ||
| - freedreno/a6xx: use point-coord helper
 | ||
| - freedreno/a5xx: use point-coord helper
 | ||
| - freedreno/a4xx: use point-coord helper
 | ||
| - freedreno/a3xx: use point-coord helper
 | ||
| - freedreno: convert builtin blit VS prog to ureg builder
 | ||
| - freedreno/ir3: switch PIPE_CAP_TGSI_TEXCOORD
 | ||
| - freedreno: make foreach_bit() declare it's cursor
 | ||
| - freedreno: split out batch draw tracking helper
 | ||
| - freedreno: split out batch clear tracking helper
 | ||
| - freedreno: handle batch flush in resource tracking
 | ||
| - freedreno/ir3/ra: fix pre-color edge case
 | ||
| - freedreno/ir3: add ir3_finalize_nir()
 | ||
| - freedreno/ir3: move finalize_nir to pscreen hook
 | ||
| - freedreno/ir3: add ir3_compiler_destroy()
 | ||
| - freedreno/ir3: shuffle some variant fields
 | ||
| - freedreno/a6xx+ir3: stop generating pointless binning shaders
 | ||
| - freedreno/ir3: build binning variant at same time as draw variant
 | ||
| - freedreno/ir3: disk-cache support
 | ||
| - freedreno/ir3: move nir finalization to after cache miss
 | ||
| - freedreno/fdperf: fix print of base address
 | ||
| - freedreno/fdperf: better compatible string matching
 | ||
| - freedreno/fdperf: prefer render node
 | ||
| - gitlab-ci: reduce a630 runner load
 | ||
| - freedreno/ir3: add missing VS driver params
 | ||
| - freedreno/ir3: make compile fails more visible
 | ||
| - freedreno/a6xx: bail instead of crash for compile fails
 | ||
| - freedreno/ir3/ra: be better at failing
 | ||
| - freedreno/a6xx: don't enable early-z/lrz if no z-test
 | ||
| - freedreno/ir3: DCE unused arrays
 | ||
| - driconf: allowlist/denylist
 | ||
| - gitlab-ci: re-enable all a630 jobs
 | ||
| - freedreno: small comment re-word
 | ||
| - freedreno: whitespace fix
 | ||
| - freedreno/ir3/parser: half-precision relative regs
 | ||
| - freedreno/ir3: set array precision on creation
 | ||
| - freedreno/ir3: fix half-reg array stores
 | ||
| - freedreno/ir3/ra: debug msgs tweak
 | ||
| - freedreno/ir3/ra: assign vreg names to all array elements
 | ||
| - freedreno/ir3/ra: fix array conflicts for split/merged
 | ||
| - freedreno: sync registers from envytools
 | ||
| - freedreno: make gen_header.py check parent directory
 | ||
| - freedreno: slurp in rnndb
 | ||
| - freedreno: slurp in rnn
 | ||
| - freedreno: slurp in decode tools
 | ||
| - freedreno: slurp in afuc
 | ||
| - freedreno/rnn: warnings cleanup
 | ||
| - freedreno/decode: warnings cleanup
 | ||
| - freedreno/afuc: warnings cleanup
 | ||
| - freedreno: add CI for envytools tools
 | ||
| - freedreno/ir3: split out regmask
 | ||
| - freedreno: drop shader_t
 | ||
| - freedreno: deduplicate a3xx+ disasm
 | ||
| - freedreno: move a2xx disasm out of gallium
 | ||
| - freedreno: deduplicate a2xx disasm
 | ||
| - freedreno/ci: add a2xx trace to CI job
 | ||
| - freedreno/tools: check rnn parse status
 | ||
| - freedreno/rnn: split out helper to find files
 | ||
| - freedreno/rnn: add error helper
 | ||
| - freedreno/rnn: rename schema file
 | ||
| - freedreno/rnn: update schema for 'pos'
 | ||
| - freedreno/rnn: add relaxed boolean type
 | ||
| - freedreno/rnn: add high/low/pos to registers
 | ||
| - freedreno/rnn: add radix/align
 | ||
| - freedreno/rnn: relax Hexadecimal to HexOrNumber
 | ||
| - freedreno/rnn: add variants/varset to domain
 | ||
| - freedreno/registers/a2xx: fix validation error
 | ||
| - freedreno/registers/a4xx: fix validation error
 | ||
| - freedreno/registers/adreno_pm4: fix validation errors
 | ||
| - freedreno/rnn: describe copyright element in schema
 | ||
| - freedreno/rnn: add "addvariant" to schema
 | ||
| - freedreno/rnn: allow name to be optional in arrays
 | ||
| - freedreno/rnn: fix use-group
 | ||
| - freedreno/registers/mdp5: fix validation error
 | ||
| - freedreno/rnn: schema updates for dynamic/irregular offsets
 | ||
| - freedreno/rnn: add schema validation
 | ||
| - freedreno/rnn: headergen2 warnings cleanup
 | ||
| - freedreno/decode: cffdec warnings cleanup
 | ||
| - freedreno/ir3: add missing track_ubo_use()
 | ||
| - freedreno/a6xx: don't emit a bogus size for empty cb slots
 | ||
| - freedreno/a6xx: fixup draw state earlier
 | ||
| - freedreno/rnn: also look for .xml.gz
 | ||
| - freedreno/rnn: rework RNN_DEF_PATH construction
 | ||
| - freedreno/registers: add .gitignore
 | ||
| - freedreno/registers: split header build into subdirs
 | ||
| - freedreno/registers: install gzip'd register database
 | ||
| - freedreno/decode: move dependencies up a level
 | ||
| - freedreno: allow fence_fd fences to be recycled
 | ||
| - freedreno/ir3: ir3_cmdline updates
 | ||
| - freedreno/ir3: lower local_index using local_id
 | ||
| - glsl/lower_precision: split out const lowering
 | ||
| - gallium: replace 16BIT_TEMPS cap with 16BIT_CONSTS
 | ||
| - glsl: remove LowerPrecisionTemporaries
 | ||
| - glsl: don't inline intrinsics for mediump
 | ||
| - glsl_to_nir: fix bitfield_extract with 16-bit operands
 | ||
| - freedreno/registers: add some missing regs to build
 | ||
| - freedreno/crashdec: handle section name typos
 | ||
| - freedreno/a6xx: fix occlusion query with more than one tile
 | ||
| - freedreno: handle case of shadowing current render target
 | ||
| - freedreno/gmemtool: add tile_alignw/h and a650
 | ||
| 
 | ||
| Rohan Garg (3):
 | ||
| 
 | ||
| - iris: Fix documentation for _iris_batch_flush
 | ||
| - ci: Include trace replay support in ARM rootfses.
 | ||
| - gitlab-ci: Replay traces on lava devices
 | ||
| 
 | ||
| Roland Scheidegger (1):
 | ||
| 
 | ||
| - gallivm: fix half to float conversions with llvm 11
 | ||
| 
 | ||
| Roman Gilg (2):
 | ||
| 
 | ||
| - vulkan/wsi/x11: add sent image counter
 | ||
| - vulkan/wsi/x11: wait for acquirable images in FIFO mode
 | ||
| 
 | ||
| Roman Stratiienko (5):
 | ||
| 
 | ||
| - egl: Build surfaceless platform on Android
 | ||
| - Android: Fixes for Q and R
 | ||
| - panfrost: Android build fixes 2020 week 31
 | ||
| - lima: Fix lima_screen_query_dmabuf_modifiers()
 | ||
| - android: freedreno: Another build fix
 | ||
| 
 | ||
| Sagar Ghuge (3):
 | ||
| 
 | ||
| - iris: Use modfiy disables for 3DSTATE_WM_DEPTH_STENCIL command
 | ||
| - intel/compiler: Optimize integer add with 0 into mov
 | ||
| - intel/compiler: Remove unnecessary optimization for MUL
 | ||
| 
 | ||
| Samuel Pitoiset (235):
 | ||
| 
 | ||
| - ci: fix reporting the number of unexpected/flakes
 | ||
| - ci: add lists of expected failures & skipped tests for RAVEN with ACO
 | ||
| - aco: remove unecessary p_split_vector with v2b reg class
 | ||
| - radv: enable shaderInt16 unconditionally with LLVM and only GFX8+ with ACO
 | ||
| - radv: cleanup radv_CreateInstance()
 | ||
| - radv: rename radv_devices() to radv_enumerate_physical_devices()
 | ||
| - radv: fix a memleak if the physical device initialization failed
 | ||
| - radv: report INITIALIZATION_FAILED when the amdgpu winsys init failed
 | ||
| - radv: don't report error with other vendor DRM devices
 | ||
| - radv: use a linked list for physical devices
 | ||
| - radv: display an error message if the winsys init failed
 | ||
| - radv/winsys: do not count visible VRAM buffers twice in the budget
 | ||
| - ci: remove unused .test-radv-fossilize rule
 | ||
| - ci: set ACO_DEBUG=validateir,validatera global for RADV testing
 | ||
| - ci: run radv-fossils with Pitcairn (GFX6) and Bonaire (GFX7) too
 | ||
| - radv: remove the LLVM version string when ACO is used
 | ||
| - radv: do not print the LLVM version string twice in hang reports
 | ||
| - radv: report correct backend IR in hang reports when ACO is used
 | ||
| - aco: fix 64-bit trunc with negative exponents on GFX6
 | ||
| - nir: do not vectorize load/store if offset can overflow and robustness enabled
 | ||
| - aco: prevent invalid loads/stores vectorization if robustness is enabled
 | ||
| - radv: limit the Vulkan version to 1.1 for Android
 | ||
| - radv: handle different Vulkan API versions correctly
 | ||
| - radv: update the list of allowed Android extensions
 | ||
| - aco: optimize add/sub(a, cndmask(b, 0, 1, cond)) -> addc/subbrev_co(0, a, b)
 | ||
| - radv: use the common base object type for VkDevice
 | ||
| - radv: use the base object struct types
 | ||
| - radv: implement VK_EXT_private_data
 | ||
| - vulkan: import common code for generating extensions
 | ||
| - radv: use the common code for generating extensions and dispatch tables
 | ||
| - anv: use the common code for generating extensions and dispatch tables
 | ||
| - turnip: use the common code for generating extensions and dispatch tables
 | ||
| - radv: add a LLVM version string workaround for SotTR and ACO
 | ||
| - aco: remove useless check for nir_tex_src_bias
 | ||
| - aco: add support for texturing with clamped LOD
 | ||
| - ac/llvm: add support for texturing with clamped LOD
 | ||
| - radv: enable shaderResourceMinLod
 | ||
| - spirv: handle OpCopyObject correctly with any types
 | ||
| - radv: fix missing break in radv_GetPhysicalDeviceProperties2()
 | ||
| - aco: store 16-bit temporary outputs as v2b
 | ||
| - aco: convert 16-bit values before exporting MRTs
 | ||
| - aco: allow to load/store 16-bit values in VMEM for tess and geom
 | ||
| - aco: implement 8-bit/16-bit mov's with p_create_vector
 | ||
| - aco: implement 16-bit vertex fetches with tbuffer_load_format_d16_*
 | ||
| - aco: validate v_interp_*_f16 as VOP3 instructions instead of VINTRP
 | ||
| - aco: emit v_interp_*_f16 instructions as VOP3 instead of VINTRP
 | ||
| - aco: implement 16-bit interp
 | ||
| - aco: fix off-by-one error with 16-bit MTBUF opcodes on GFX10
 | ||
| - radv/aco: enable storageInputOutput16 on GFX9+
 | ||
| - aco: fix missing break in label_instruction()
 | ||
| - radv: fix missing break in radv_GetPhysicalDeviceFeatures2()
 | ||
| - radv: fix duplicated expression in ac_setup_rings()
 | ||
| - radv/winsys: remove useless free in radv_amdgpu_create_bo_list()
 | ||
| - aco: declare 8-bit/16-bit reduce operations
 | ||
| - aco: implement 8-bit/16-bit reductions
 | ||
| - aco: validate 8-bit/16-bit VGPR operands for readfirstlane/readlane/writelane
 | ||
| - aco: implement 8-bit/16-bit nir_intrinsic_read_first_invocation
 | ||
| - aco: implement 8-bit/16-bit nir_intrinsic_{shuffle,_read_invocation}
 | ||
| - aco: implement 8-bit/16-bit nir_intrinsic_quad_*
 | ||
| - aco: use a temporary SGPR for 8-bit/16-bit literal reduction identities
 | ||
| - aco: sign-extend the input and identity for 8-bit subgroup operations
 | ||
| - radv: do not return from radv_GetPhysicalDeviceFeatures2()
 | ||
| - radv: cleanup physical device features
 | ||
| - radv: remove useless assignment in build_streamout_vertex()
 | ||
| - spirv: add ReadClockKHR support with device scope
 | ||
| - aco: implement nir_intrinsic_shader_clock with device scope
 | ||
| - ac/nir: fix shader clock with subgroup scope
 | ||
| - ac/nir: implement nir_intrinsic_shader_clock with device scope
 | ||
| - radv: advertise shaderDeviceClock on GFX8+
 | ||
| - spirv: add SpvCapabilityImageGatherBiasLodAMD
 | ||
| - spirv: add support for bias/lod with OpImageGather
 | ||
| - ac/nir: add support for bias/lod with texture gather
 | ||
| - aco: add support for bias/lod with texture gather
 | ||
| - radv: add support for querying which formats support texture gather LOD
 | ||
| - radv: advertise VK_AMD_texture_gather_bias_lod
 | ||
| - spirv,radv,anv: implement no-op VK_GOOGLE_user_type
 | ||
| - radv/aco: enable VK_EXT_subgroup_size_control
 | ||
| - aco: fix register allocation for subdword instructions on GFX10
 | ||
| - aco: implement 8-bit/16-bit reductions on GFX10
 | ||
| - aco: allocate a temp VGPR for some 8-bit/16-bit reduction ops on GFX10
 | ||
| - aco: allow gfx10_wave64_bpermute with 8-bit/16-bit input
 | ||
| - aco: sign-extend input/indentity for 32-bit reduce ops on GFX10
 | ||
| - radv/aco: enable VK_KHR_subgroup_extended_types on GFX8+
 | ||
| - radv: enable zero VRAM for Doom Eternal
 | ||
| - radv: enable zero VRAM for all VKD3D (DX12->VK) games
 | ||
| - aco: implement 16-bit reduce operations on GFX6-GFX7
 | ||
| - aco: implement 16-bit nir_intrinsic_quad_* on GFX6-GFX7
 | ||
| - aco: fix subdword copies on GFX6-GFX7
 | ||
| - aco: sign-extend input/identity for 16-bit subgroup ops on GFX6-GFX7
 | ||
| - radv/aco: enable 64-bit atomic features if RADV is linked with LLVM 8
 | ||
| - aco: use v_bfe_u32 for unsigned reductions sign-extension on GFX6-GFX7
 | ||
| - aco: fix sign-extend 8-bit subgroup operations on GFX6-GFX7
 | ||
| - aco: fix nir_intrinsic_quad_* with 8-bit in GFX6-GFX7
 | ||
| - radv/aco: enable VK_KHR_shader_subgroup_extended_types on GFX6-GFX7
 | ||
| - ac/nir: adjust an assertion for D16 on GFX6-GFX7
 | ||
| - nir/lower_explicit_io: fix NON_UNIFORM access for UBO loads
 | ||
| - radv/llvm: expose VK_EXT_shader_demote_to_helper_invocation with LLVM 9+
 | ||
| - aco: implement 8-bit/16-bit conversions on GFX6-GFX7
 | ||
| - aco: fix alignment of vectors with 4 elements
 | ||
| - radv/aco: enable 8-bit/16-bit storage on GFX6-GFX7
 | ||
| - radv/aco: enable shaderInt16 on GFX6-GFX7
 | ||
| - radv/aco: enable shaderInt8 and VK_KHR_shader_float16_int8 on GFX6-GFX7
 | ||
| - ac/nir: fix integer comparisons with pointers
 | ||
| - radv: set DB_SHADER_CONTROL.CONSERVATIVE_Z_EXPORT correctly
 | ||
| - radv: add new drirc option radv_enable_mrt_output_nan_fixup
 | ||
| - aco: implement radv_enable_mrt_output_nan_fixup workaround
 | ||
| - radv/llvm: implement radv_enable_mrt_output_nan_fixup workaround
 | ||
| - radv: enable radv_enable_mrt_output_nan_fixup for RAGE 2
 | ||
| - ac: add ac_choose_spi_color_formats() to common code
 | ||
| - spirv: fix using OpSampledImage with OpUndef instead of OpType{Image,Sampler}
 | ||
| - aco: allow to swap operands for some 16-bit float instructions
 | ||
| - spirv: do not set num_components for non-vectorized mbcnt_amd intrinsic
 | ||
| - radv/aco: enable FP16 features/extensions on GFX9+
 | ||
| - radv: lower discards to demote to workaround a RDR2 game bug
 | ||
| - radv: make sure to set CB_SHADER_MASK correctly for internal CB operations
 | ||
| - radv: compute CB_SHADER_MASK from the fragment shader outputs
 | ||
| - radv: only requires LLVM 9 for GFX10 if not using ACO
 | ||
| - radv: replace == GFX10 with >= GFX10 where it's needed
 | ||
| - aco: replace == GFX10 with >= GFX10 where it's needed
 | ||
| - radv: add support for Sienna Cichlid
 | ||
| - radv: require LLVM 11+ for GFX 10.3 if not using ACO
 | ||
| - aco: fix printing ASM on GFX6-7 if clrxdisasm is not found
 | ||
| - aco: improve validation checks for readlane/writelane
 | ||
| - aco: fix printing ASM on GFX6-7 again
 | ||
| - gitlab-ci: stop testing RADV with LLVM
 | ||
| - gitlab-ci: update the list of expected CTS failures for RADV/ACO
 | ||
| - gitlab-ci: update the list of expected failures for Pitcairn
 | ||
| - radv: fix checking the return value of cs_finalize()
 | ||
| - gitlab-ci: add parallel-rdp fossils
 | ||
| - radv: lower 64-bit drcp/dsqrt/drsq for fixing precision issues
 | ||
| - radv: lower 64-bit dfloor on GFX6 for fixing precision issues
 | ||
| - gitlab-ci: add a list of expected failures for RADV/ACO on NAVI14
 | ||
| - gitlab-ci: set the number of Fossilize threads to 4
 | ||
| - gitlab-ci: append Fossilize stdout/stderr to a file to reduce spam
 | ||
| - gitlab-ci: attach the Fossilize log file as artifact on failure
 | ||
| - radv: remove the shader ballot workaround for Youngblood with LLVM
 | ||
| - radv: remove the load/store workaround for Monster Hunter World with LLVM
 | ||
| - radv: enable VK_AMD_shader_ballot on GFX6-7 with both compiler backends
 | ||
| - radv: adjust CB_SHADER_MASK for dual-source blending in the shader info pass
 | ||
| - radv: rework 8/16-bit color attachment formats detection
 | ||
| - radv: use SPI_SHADER_ZERO for non-written color attachments
 | ||
| - radv: add support for MRTs compaction to avoid holes
 | ||
| - radv: fix wide points and lines
 | ||
| - radv: fix wide lines with multisample enabled
 | ||
| - Revert "vulkan/wsi/x11: Ensure we create at least minImageCount images."
 | ||
| - radv,vulkan: add a new x11 wsi drirc workaround for DOOM Eternal
 | ||
| - radv: disable FMASK compression when drawing with GENERAL layout
 | ||
| - radv: set depth/stencil enable values correctly for the meta clear path
 | ||
| - radv: implement missing VK_ACCESS_MEMORY_{READ,WRITE}_BIT
 | ||
| - radv: store the primitive topology hardware value in the pipeline
 | ||
| - radv: adjust IA_MULTI_VGT_PARAM.WD_SWITCH_ON_EOP at draw time
 | ||
| - radv: adjust IA_MULTI_VGT_PARAM.PARTIAL_VS_WAVE at draw time
 | ||
| - radv: compute prim_vertex_count at draw time
 | ||
| - aco: fix more validation errors from vgpr spill/restore code
 | ||
| - radv: return VK_ERROR_DEVICE_LOST if wait-for-idle failed or expired
 | ||
| - radv: remove the secure compile support feature
 | ||
| - radv: rework dynamic viewports/scissors support
 | ||
| - radv: add VK_EXT_extended_dynamic_state but leave it disabled
 | ||
| - radv: declare new extended dynamic states
 | ||
| - radv: add support for dynamic cull mode and front face
 | ||
| - radv: add support for dynamic primitive topology
 | ||
| - radv: add support for dynamic and scissor count
 | ||
| - radv: add support for dynamic depth/stencil states
 | ||
| - radv: add support for dynamic vertex input binding stride
 | ||
| - radv: advertise VK_EXT_extended_dynamic_state
 | ||
| - radv: add the custom border color BO to the list of buffers
 | ||
| - radv: destroy the base object if VkCreateQueryPool() failed
 | ||
| - radv: destroy the base object if VkCreateRenderPass*() failed
 | ||
| - radv: destroy the base object if VkCreateImage() failed
 | ||
| - radv: destroy the base object if VkCreateBuffer() failed
 | ||
| - radv: destroy the base object if VkCreateEvent() failed
 | ||
| - radv: destroy the base object if VkCreateSemaphore() failed
 | ||
| - radv: destroy the base object if VkCreateFence() failed
 | ||
| - radv: destroy the base object if VkAllocateCommandBuffers() failed
 | ||
| - radv: destroy the base object if VkCreateInstance() failed
 | ||
| - radv/winsys: replace alloca() by malloc() everywhere
 | ||
| - radv/winsys: pass the buffer list via the CS ioctl for less CPU overhead
 | ||
| - radv: fix destroying the syncobj when exporting a fence FD
 | ||
| - radv: fix the error code when exporting a semaphore/fence fails
 | ||
| - radv: fix the error code when allocating a fresh imported syncobj fails
 | ||
| - radv: optimize creating signaled syncobj with amdgpu_cs_create_syncobj2()
 | ||
| - radv: split fence into two parts as enum+union.
 | ||
| - radv: remove one useless goto in radv_queue_submit_deferred()
 | ||
| - radv: improve the error messages when a CS submission failed
 | ||
| - radv: return better Vulkan error codes when VkQueueSubmit() fails
 | ||
| - radv: disable CPU caching for IBS to reduce fetch latency
 | ||
| - radv/winsys: always allow GTT placements on APUs
 | ||
| - radv: advertise VK_EXT_image_robustness
 | ||
| - radv: do not perform read-modify-write with the upload BO
 | ||
| - radv: disable CPU caching for the upload BO to reduce fetch latency
 | ||
| - aco: add support for nir_intrinsic_shared_atomic_fadd
 | ||
| - ac/nir: add support for nir_intrinsic_shared_atomic_fadd
 | ||
| - radv: advertise VK_EXT_shader_atomic_float
 | ||
| - radv: add missing return values check for some winsys calls
 | ||
| - radv/winsys: check more allocation failures
 | ||
| - radv/winsys: remove useless check when binding virtual buffers/images
 | ||
| - radv/winsys: return a Vulkan error code when binding virtual buffers/images
 | ||
| - radv/winsys: be more robust when a CS failed during recording
 | ||
| - radv: remove declared but unused radv_pipeline::is_dual_src
 | ||
| - radv: remove set but unused radv_pipeline::vertex_elements
 | ||
| - radv: remove outdated TODO related to PA_SU_VTX_CNTL.PIX_CENTER
 | ||
| - radv: emit more invariant registers as part of the initial gfx state
 | ||
| - radv: emit PA_SC_LINE_CNTL as part of the rasterization state
 | ||
| - radv: clean up VGT_SHADER_STAGES_EN emission
 | ||
| - radv: clean up PA_SC_CLIPRECT_RULE emission
 | ||
| - radv: reduce the number of allocated dwords for compute CS
 | ||
| - radv: clean up radv_compute_generate_pm4()
 | ||
| - radv: remove unnecessary radv_tessellation_state::num_patches
 | ||
| - radv: remove no-op si_multiwave_lds_size_workaround()
 | ||
| - radv: remove one unnecessary param to radv_generate_graphics_pipeline_key()
 | ||
| - radv: align the LDS size in calculate_tess_lds_size()
 | ||
| - radv: set LDS TCS size at shaders creation for GFX9+
 | ||
| - radv: remove unnecessary radv_tessellation_state::lds_size
 | ||
| - radv: clean up tessellation state emission
 | ||
| - radv: add radv_pipeline_init_input_assembly_state()
 | ||
| - radv: add radv_pipeline_generate_vgt_gs_out()
 | ||
| - radv: clean up adjusting MSAA state if conservative rast is enabled
 | ||
| - radv: clean up binning state initialization
 | ||
| - radv: assign pipeline gfx fields before PM4 emission
 | ||
| - radv: constify all radv_pipeline_generate_*() helpers
 | ||
| - radv: add radv_pipeline_init_shader_stages_state()
 | ||
| - radv: remove useless return value to radv_pipeline_scratch_init()
 | ||
| - radv: clean up remaining pipeline init functions
 | ||
| - radv: print warnings for famous RADV_PERFTEST options that no longer exist
 | ||
| - radv: do not honor a user-specified pitch on GFX 10.3
 | ||
| - radv: increase minimum NGG vertex count requirement per workgroup on GFX 10.3
 | ||
| - radv: fix sample shading on GFX 10.3
 | ||
| - radv: set BYPASS_VTX_RATE_COMBINER_GFX103 on GFX 10.3
 | ||
| - radv/gfx10: add missing initialization of registers
 | ||
| - radv: limit LATE_ALLOC_GS to prevent a GPU hang on GFX10
 | ||
| - radv: fix emitting the border color pointer on the compute queue
 | ||
| - nir/algebraic: mark some optimizations with fsat(NaN) as inexact
 | ||
| - aco: handle unaligned loads on GFX10.3
 | ||
| - spirv: fix emitting switch cases that directly jump to the merge block
 | ||
| - radv: fix transform feedback crashes if pCounterBufferOffsets is NULL
 | ||
| 
 | ||
| Satyajit Sahu (1):
 | ||
| 
 | ||
| - frontends/va: Handle dynamic resolution/SVC for VP9
 | ||
| 
 | ||
| Satyeshwar Singh (1):
 | ||
| 
 | ||
| - intel/dev: Don't consider all TGL SKUs as GT1 only
 | ||
| 
 | ||
| Serge Martin (3):
 | ||
| 
 | ||
| - amd/common: Fix incorrect use of asprintf instead of vasprintf
 | ||
| - clover: add more cl_mem_object_type to pipe_texture_target mapping
 | ||
| - clover: implements clEnqueueFillBuffer
 | ||
| 
 | ||
| Shawn Guo (1):
 | ||
| 
 | ||
| - freedreno/a4xx: fix \*_NONE enum conversion
 | ||
| 
 | ||
| Simon Ser (3):
 | ||
| 
 | ||
| - EGL: sync headers with Khronos
 | ||
| - gbm: document that gbm_bo_map exposes a linear view
 | ||
| - radv: use bitshifts for debug enum values
 | ||
| 
 | ||
| SureshGuttula (1):
 | ||
| 
 | ||
| - radeon/vcn: Corrected vp9 ref associated data incase of target->codec is NULL
 | ||
| 
 | ||
| Tapani Pälli (14):
 | ||
| 
 | ||
| - st/mesa: destroy only own program variants when program is released
 | ||
| - anv: call base finish only if pass given in DestroyRenderPass
 | ||
| - anv: add VK_EXT_extended_dynamic_state but leave it disabled
 | ||
| - anv: add new dynamic states
 | ||
| - anv: consider dynamic state when creating pipeline
 | ||
| - anv: handle dynamic viewport count
 | ||
| - anv: add support for dynamic cull mode and winding order
 | ||
| - anv: add support for dynamic viewport and scissor with count
 | ||
| - anv: add support for dynamic primitive topology change
 | ||
| - anv: depth/stencil dynamic state support
 | ||
| - anv: dynamic vertex input binding stride and size support
 | ||
| - anv: toggle on VK_EXT_extended_dynamic_state
 | ||
| - anv: add a check for depthStencilState before using it
 | ||
| - anv: null check for buffer before reading size
 | ||
| 
 | ||
| Thong Thai (8):
 | ||
| 
 | ||
| - radeon: Fix whitespaces
 | ||
| - gallium/auxiliary/vl: Fix compute shader scaling for non-square pixels
 | ||
| - gallium/auxiliary/vl: Fix compute shader scale_y for interlaced videos
 | ||
| - frontends/va: Fix deinterlace bottom field first flag
 | ||
| - frontends/vdpau: Default destination rect to source rect
 | ||
| - radeon/vcn: add vcn 3.0 encode support
 | ||
| - radeonsi: use PIPE_FORMAT_P010 for 10-bit VP9 decoding
 | ||
| - radeon/vcn: increase render_pic_list size
 | ||
| 
 | ||
| Timothy Arceri (69):
 | ||
| 
 | ||
| - glsl: stop cascading errors if process_parameters() fails
 | ||
| - glsl: fix slow linking of uniforms in the nir linker
 | ||
| - radv: fix regression with builtin cache
 | ||
| - nir: add glsl_get_ifc_packing() helper
 | ||
| - nir: add callback to nir_remove_dead_variables()
 | ||
| - glsl: add can_remove_uniform() helper to the NIR linker
 | ||
| - glsl: remove dead uniforms in the nir linker
 | ||
| - glsl/spirv: remove dead uniforms in spirv nir linker
 | ||
| - gitlab-ci: bump piglit checkout commit
 | ||
| - i965: call brw_nir_lower_uniforms() after uniform linking is complete
 | ||
| - util: add BITSET_LAST_BIT() helper
 | ||
| - glsl: add struct to gather more info about uniform array access
 | ||
| - glsl: add update_array_sizes() helper to the NIR uniform linker
 | ||
| - glsl: gather uniform dereference info before main linking loop
 | ||
| - glsl: when NIR linker enable use it to resize uniform arrays
 | ||
| - glsl: fix potential slow compile times for GLSLOptimizeConservatively
 | ||
| - glsl: fix incorrect optimisation in opt_constant_variable()
 | ||
| - glsl: fix uniform array resizing in the nir linker
 | ||
| - glsl: small optimisation fix for uniform array resizing
 | ||
| - st_glsl_to_nir: fix potential use after free
 | ||
| - mesa: remove _mesa prefix from static function
 | ||
| - mesa: add _mesa_program_state_value_size() helper
 | ||
| - glsl: define gl_LightSource members in ARB_vertex_program order
 | ||
| - st/glsl_to_nir: disable st_nir_lower_builtin() when packing supported
 | ||
| - glsl: remove stale FIXME
 | ||
| - i965: add and fix fallthrough comments
 | ||
| - llvmpipe: add missing fallthrough comments
 | ||
| - gallivm: add missing break
 | ||
| - anv: update fallthrough comment so gcc sees it
 | ||
| - intel/compiler: add and fix up fallthrough comments for gcc warnings
 | ||
| - iris: add missing fallthrough comment
 | ||
| - egl: move fallthrough comment so gcc can see it
 | ||
| - nir: add missing break to nir_opt_access()
 | ||
| - mesa: fix fallthrough in glformats
 | ||
| - mesa: add fallthrough comments to glformats.c
 | ||
| - mesa: add fallthrough comments to get.c
 | ||
| - nir: fix implicit fallthrough warnings
 | ||
| - mesa: add fallthrough comments to COPY_SZ_4V()
 | ||
| - radeonsi: add missing fallthrough comment
 | ||
| - glx: add missing fallthrough comment
 | ||
| - glsl: move fallthrough comment to where gcc can see it
 | ||
| - radeon: add missing fallthrough comments
 | ||
| - spirv: add missing fallthrough comments
 | ||
| - mesa/vbo: add some missing fallthrough comments
 | ||
| - mesa: add missing fallthrough comment to teximage.c
 | ||
| - mesa: fix unintended fallthrough in glIsEnabled()
 | ||
| - r300: add and fix up fallthrough comments
 | ||
| - svga: add missing fallthrough comments
 | ||
| - mesa: update fallthrough comment so gcc can see it
 | ||
| - nv30: add missing fallthrough comment
 | ||
| - meson: turn on Wimplicit-fallthrough project wide
 | ||
| - nouveau: fix pointer-sign warning
 | ||
| - gitlab-ci: Enable -Werror in `meson-classic` job
 | ||
| - r600/radeonsi: silence zero-length-bounds gcc warnings
 | ||
| - radeonsi: fix SI_NUM_ATOMS
 | ||
| - iris: fix maybe-uninitialized warning for initial_state variable
 | ||
| - iris: silence maybe-uninitialized for stc_dst_aux_usage variable
 | ||
| - nouveau/nvc0: silence maybe-uninitialized warning
 | ||
| - panfrost: add some missing fallthrough comments
 | ||
| - panfrost: hide more unused code in bi_lower_combine.c
 | ||
| - panfrost: add some missing fallthrough comments to bi_pack.c
 | ||
| - freedreno: fix missing fallthrough comments
 | ||
| - v3d: remove redefine of VG(x)
 | ||
| - zink: fix missing fallthrough comment
 | ||
| - nine: remove unused var
 | ||
| - etnaviv: add missing fallthrough comments
 | ||
| - lima: add missing fallthrough comments
 | ||
| - lima: add missing break
 | ||
| - gitlab-ci: Enable -Werror in `meson-gallium` job
 | ||
| 
 | ||
| Timur Kristóf (4):
 | ||
| 
 | ||
| - aco/gfx10: Refactor of GFX10 wave64 bpermute.
 | ||
| - aco: Implement subgroup shuffle on GFX6-7.
 | ||
| - radv/aco: Always enable subgroup shuffle.
 | ||
| - aco: Fix emit_boolean_exclusive_scan in wave32 mode.
 | ||
| 
 | ||
| Tomeu Vizoso (55):
 | ||
| 
 | ||
| - panfrost: Emit blend descriptors on Bifrost
 | ||
| - panfrost: Don't leak temporary descriptors array
 | ||
| - pan/decode: Check for correct unknown field
 | ||
| - pan/decode: Use correct printf modifier for long int
 | ||
| - panfrost: Split bit out of format.unk3
 | ||
| - panfrost: Create additional BO for the checksum of imported BOs (Bifrost)
 | ||
| - panfrost: Add a bit more info about some tiler fields
 | ||
| - pan/bi: Print shaders only if BIFROST_MESA_DEBUG=shaders
 | ||
| - pan/decode: Trace to stderr with PANDECODE_DUMP_FILE=stderr
 | ||
| - panfrost: GPUs newer than G-71 don't have swizzles...
 | ||
| - panfrost: mali_attr_meta.unknown1 is zero on Bifrost
 | ||
| - panfrost: Add Bifrost texture trampoline BO to batch
 | ||
| - pan/decode: Properly print tripped zeroes
 | ||
| - virgl: Properly check for encode_stride when encoding transfers
 | ||
| - panfrost: Add checksum BOs to batch
 | ||
| - panfrost: Don't trample on top of Bifrost-specific unions
 | ||
| - panfrost: Handle MALI_RGB8_UNORM in panfrost_format_to_bifrost_blend
 | ||
| - gitlab-ci: Run more dEQP tests for virgl
 | ||
| - gitlab-ci: Add manual tests for Virgl using GLES on the host
 | ||
| - gitlab-ci: Test virgl with Khronos' OpenGL CTS
 | ||
| - gitlab-ci: Update CTS runner
 | ||
| - ci: Don't call renderdoc's ReplayController.Shutdown()
 | ||
| - ci: Move ARM rootfses to stable
 | ||
| - gitlab-ci: Build kernel drivers for a few ethernet USB dongles
 | ||
| - gitlab-ci: More stable URL for kernel and ramdisk artifacts, for LAVA
 | ||
| - gitlab-ci: Remove left-behind rules:
 | ||
| - gitlab-ci: Don't rebuild kernels and rootfs if they have been already built in mainline
 | ||
| - gitlab-ci: Run all of GLES3 tests for Panfrost
 | ||
| - gitlab-ci: Re-add kernels for bare-metal
 | ||
| - gitlab-ci: Download traces from MinIO
 | ||
| - gitlab-ci: Upload tracie artifacts to MinIO
 | ||
| - gitlab-ci: Fix needs: of the arm64 LAVA test jobs
 | ||
| - ci: Upload images of failed replays to MinIO
 | ||
| - ci: Use smaller glxgears trace
 | ||
| - ci: Prefix tracie artifacts with the device name
 | ||
| - ci: Test with more traces
 | ||
| - ci: Disable trace testing on Mali T760
 | ||
| - ci: Fix the overwriting of traces.yml for baremetal
 | ||
| - ci: Namespace trace artifacts to the job number
 | ||
| - ci: Always print status code of HTTP uploads in tracie
 | ||
| - ci: Print load stats after running dEQP
 | ||
| - ci: Fix URL for glslang
 | ||
| - ci: Don't ship vk-build-programs after building dEQP
 | ||
| - ci: Split building of libdrm to its own script
 | ||
| - ci: Build kernels and rootfs for x86 devices
 | ||
| - ci: Upload reference images for traces
 | ||
| - ci: Print URL to image diff when a trace replay fails
 | ||
| - ci: Generate MinIO credentials within LAVA jobs
 | ||
| - ci: Set date in LAVA DUTs from NTP servers
 | ||
| - ci: Build-test Panfrost tools
 | ||
| - ci: Upload traces' reference and actual images to MinIO
 | ||
| - ci: Download traces from MinIO in baremetal runs
 | ||
| - ci: Remove kernel module build that slipped in
 | ||
| - ci: Actually upload trace artifacts to MinIO for baremetal
 | ||
| - ci: Use a rootfs tarball for NFS root, instead of a ramdisk (for LAVA)
 | ||
| 
 | ||
| Tony Wasserka (4):
 | ||
| 
 | ||
| - nir/lower_idiv: Port recent LLVM fixes to emit_udiv
 | ||
| - radv: Fix various non-critical integer overflows
 | ||
| - aco: Fix integer overflows when emitting parallel copies during RA
 | ||
| - amd/common: Fix various non-critical integer overflows
 | ||
| 
 | ||
| Vinson Lee (25):
 | ||
| 
 | ||
| - freedreno: Add missing break statement.
 | ||
| - llvmpipe: Fix variable name.
 | ||
| - r600/sfn: Initialize VertexStageExportForGS m_num_clip_dist member variable.
 | ||
| - panfrost: Ensure final.no_colour is initialized.
 | ||
| - r600/sfn: Use correct setter method.
 | ||
| - freedreno: Add missing va_end.
 | ||
| - pan/bi: Initialize struct fma_op_info member extended.
 | ||
| - zink: Check fopen result.
 | ||
| - etnaviv: Fix memory leak on error path.
 | ||
| - panfrost: Fix printf format specifier.
 | ||
| - r300g: Remove extra printf format specifiers.
 | ||
| - vdpau: Fix wrong calloc sizeof argument.
 | ||
| - mesa: Fix NetBSD compiler macro.
 | ||
| - Switch from cElementTree to ElementTree.
 | ||
| - intel/genxml: Migrate from deprecated xml.etree.ElementTree getchildren.
 | ||
| - rbug: Fix rbug_delete_vs_state lock acquisition.
 | ||
| - nir: Add nir_lower_clip_disable.c to SCons build.
 | ||
| - util: Fix SCons build.
 | ||
| - util: Fix memory leaks in unit test.
 | ||
| - meson: Fix lmsensors warning message.
 | ||
| - vulkan: Fix memory leaks.
 | ||
| - freedreno: Fix file descriptor leak.
 | ||
| - svga: Fix unused printf argument.
 | ||
| - freedreno: Check file descriptor before write.
 | ||
| - panfrost: Delete debug allocated syncobj.
 | ||
| 
 | ||
| Yevhenii Kharchenko (1):
 | ||
| 
 | ||
| - st/mesa: fix corrupted texture levels, when adding more levels than expected
 | ||
| 
 | ||
| Yevhenii Kolesnikov (5):
 | ||
| 
 | ||
| - glsl: subroutine signatures must match exactly
 | ||
| - nvir: don't use designated initialisers in C++ code
 | ||
| - intel/compiler: don't propagate cmp to add if add is saturated
 | ||
| - mesa: change error code of \*TextureSubImage\* for incorreect target
 | ||
| - nine: fix incorrect calculation of layer count for 3D textures
 | ||
| 
 | ||
| jzielins (2):
 | ||
| 
 | ||
| - gallium/swr: Fix compilation warnings
 | ||
| - swr: Bump maximum 2D texture size to 16kx16k
 | ||
| 
 | ||
| mmenzyns (1):
 | ||
| 
 | ||
| - nv50: Clear nv50_ir_prog_info of dead and codegen specific variables
 |