third_party_mesa3d

Author	SHA1	Message	Date
Sergii Melikhov	7fb6eafbc4	vulkan: Unlock before return. Fix defect reported by Coverity Scan CID-1494382. Missing unlock (LOCK): Returning without unlocking queue->submit.mutex. Fixes: `9bffd81f1c` ("vulkan: Add common implementations of vkQueueSubmit and vkQueueWaitIdle") Signed-off-by: Sergii Melikhov <sergii.v.melikhov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13933>	2021-11-24 15:15:46 +00:00
Rhys Perry	d5b41cbb4a	radv: fix max_render_backends for Sienna Cichlid null winsys This affects NGG culling. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13926>	2021-11-24 14:27:39 +00:00
Rhys Perry	f67b09e37c	radv: make RADV_FORCE_FAMILY case-insensitive So I don't have to update my scripts each time I switch between before/after `cfc5c2abfd`. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13926>	2021-11-24 14:27:39 +00:00
Marek Olšák	694731ac13	ac/surface: allow gfx6-8 to enter the gfx9 DCC codepath for SI_FORCE_FAMILY Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13871>	2021-11-24 13:55:23 +00:00
Marek Olšák	d830d213b6	ac/gpu_info: don't fail on amdgpu_query_video_caps_info failures When VCN is unsupported, we don't want to break GL or Vulkan. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13871>	2021-11-24 13:55:23 +00:00
Alejandro Piñeiro	a9b4aef0f2	broadcom/compiler: make shaderdb debug output compatible with shaderdb's report tool Even although the option is called shaderdb, it is not really used by shaderdb (for V3D shaderdb uses the debug option "precompile"). And in fact, right now the output format is not compatible with shaderdb. This commit tries to fix that, and as we are here, also try to make the option more useful for the Vulkan case, as that debug option also works with v3dv. We can't really fully imitate shaderdb use with OpenGL (run with a set of glsl shader tests), but we can at least assign a unique name (the pipeline sha1 in text format) so we can compare executions of the same vulkan application. For that remember to disable the on-disk cache. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13938>	2021-11-24 13:02:08 +00:00
Marek Olšák	6c78ec4eac	mesa: add allow_glsl_compat_shaders for shader-db Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13870>	2021-11-24 10:28:15 +00:00
Marek Olšák	09342dcfc0	mesa: don't add attenuation constants if ffvp doesn't use them This slightly decreases the size of constant buffers. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13872>	2021-11-24 09:57:39 +00:00
Samuel Pitoiset	deb4685df3	radv: implement optimized MSAA copies using FMASK Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12555>	2021-11-24 08:03:47 +00:00
Samuel Pitoiset	51612b0e95	radv: make radv_copy_buffer() a non-static function Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12555>	2021-11-24 08:03:47 +00:00
Samuel Pitoiset	e8c0eb9598	radv: make radv_break_on_count() a non-static function Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12555>	2021-11-24 08:03:47 +00:00
Georg Lehmann	d106e5c732	amd/addrlib: Use get_supported_arguments to get compiler args. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11609>	2021-11-24 07:03:54 +00:00
Georg Lehmann	a6f783948d	meson: Remove some unnecessary loops. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11609>	2021-11-24 07:03:54 +00:00
Georg Lehmann	6c89f09b7b	meson: Use get_supported_arguments more often. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11609>	2021-11-24 07:03:54 +00:00
Vasily Khoruzhick	3b15fb3575	lima/ppir: implement gl_FragDepth support Mali4x0 supports writing depth and stencil from fragment shader and we've been using it quite a while for depth/stencil buffer reload. The missing part was specifying output register for depth/stencil. To figure it out, I changed reload shader to use register $4 as output and poked RSW bits (or rather consecutive 4 bit groups) until tests that rely on reload started to pass again. It turns out that register number for gl_FragDepth/gl_FragStencil is in rsw->depth_test and register number for gl_FragColor is in rsw->multi_sample and it's repeated 4 times for some reason (likely for MSAA?) With this knowledge we now can modify ppir compiler to support multiple store_output intrinsics. To do that just add destination SSA for store_output to the registers list for regalloc and mark them explicitly as output. Since it's never read in shader we have to take care about it in liveness analysis - basically just mark it alive from the time when it's written to the end of the block. If it's live only in the last instruction, mark it as live_internal, so regalloc doesn't clobber it. Then just let regalloc do its job, and then copy register number to the shader state and program it in RSW. The tricky part is gl_FragStencil, since it resides in the same register as gl_FragDepth and with the current design of the compiler it's hard to merge them. However gl_FragStencil doesn't seem to be part of GL2 or GLES2, so we can just leave it not implemented. Also we need to take care of stop bit for instructions - now we can't just set it in every instruction that stores output, since there may be several outputs. So if there's any store_output instructions in the block just mark that block has a stop, and set stop bit in the last instruction in the block. The only exception is discard - we always need to set stop bit in discard instruction. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13830>	2021-11-24 02:26:08 +00:00
Vasily Khoruzhick	98a7c4c6f8	lima/ppir: check if mul node is a source of add node before inserting We can't insert mul node into add node instruction if it's a virtual dep (sequence or write_or_read dep), so use ppir_node_has_single_src_succ in addition to ppir_node_has_single_succ. We can't use ppir_node_has_single_src_succ alone, since node may have a virtual dependency in addition to source dependency, and we can't insert it either in this case. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13830>	2021-11-24 02:26:08 +00:00
Thomas H.P. Andersen	64292c0f05	svga: fix bitwise/logical and mixup The function need_temp_reg_initialization looks suspecious. It will only ever return true if we get past this if: if (!(emit->info.indirect_files && (1u << TGSI_FILE_TEMPORARY)) ... Using the logical && means the intended initialization done based on the result of this check is not performed. This code was both introduced and altered in MR 5317. `ccb4ea5a` introduces the function. `ba37d408` is a collection of performance improvements and misc fixes. This altered the if from using bitwise to logical and. This commit changes it back to bitwise. Spotted from a compile warning. Fixes: `ba37d408da` ("svga: Performance fixes") Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12157>	2021-11-24 01:59:36 +00:00
Thomas H.P. Andersen	6b4294daf0	nine: remove dead code This line gets the cap but does not store it. The line has existed unchanged since the original import in `fdd96578`. Fixes a compile warning Acked-by: Axel Davy davyaxel0@gmail.com Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12155>	2021-11-23 22:38:19 +00:00
Roman Stratiienko	32ec0fffa6	android.mk: Add missing variables to the make target Android build system may use different internal variables to specify cflags/cppflags. Small change in product confguration may force Android to use diffrent set of variables, therefore we should keep all of them attached to the make rule's target. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5549 Fixes: `8621bd8d5e` ("android: Add scripts to build using meson") Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Marijn Suijten <marijn.suijten@somainline.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13914>	2021-11-23 21:06:09 +00:00
Michel Zou	0daed2dc6b	lavapipe: fix unused variable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13892>	2021-11-23 19:11:05 +00:00
Michel Zou	d7957df318	vulkan: fix uninitialized variables Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13892>	2021-11-23 19:11:05 +00:00
Danylo Piliaiev	d5757c965a	turnip: implement VK_KHR_buffer_device_address We don't advertise bufferDeviceAddressCaptureReplay capability and neither does blob, because at the moment there is no way to allocate bo with predefined iova. There is no support of any arithmetic with addresses since shaderInt64 is not enabled. However, we could enable int64 support whenever we want. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8717>	2021-11-23 18:26:37 +00:00
Danylo Piliaiev	99388f0c27	freedreno/ir3: handle global atomics Only for a6xx since we don't know the instructions for global atomics on previous gens. Per Qualcomm's docs in OpenCL atomics are only supported since a5xx together with Generic memory space. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8717>	2021-11-23 18:26:37 +00:00
Danylo Piliaiev	5d5b1fc472	freedreno/ir3: add a6xx global atomics and separate atomic opcodes Separating atomic opcodes makes possible to express a6xx global atomics which take iova in SRC1. They would be needed by VK_KHR_buffer_device_address. The change also makes easier to distiguish atomics in conditions. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8717>	2021-11-23 18:26:37 +00:00
Marius Hillenbrand	c5d6e57e42	llvmpipe: Use lp_build_round_arch on IBM Z (s390x) LLVM has all the required intrinsics available on IBM Z, so use them for rounding operations (they will be implemented as a single instruction). This change makes the test case lp_test_arit pass, because it avoids using the buggy generic code. v2: update .gitlab-ci/cross-xfail-s390x to reflect passing lp_test_arit Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13927>	2021-11-23 17:49:02 +00:00
Marius Hillenbrand	82b261417e	util/cpu_detect: Add flag for IBM Z (s390x) As preparation for changing the behavior of LLVMpipe on IBM Z, add a flag to detect that platform. As it is always known at compile-time, we do not add it to the struct for cpu flags to avoid inflating that struct's size. Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13927>	2021-11-23 17:49:02 +00:00
Ilia Mirkin	be048ec112	freedreno/ir3: remove unused actual_in counting Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13918>	2021-11-23 17:20:32 +00:00
Antonio Caggiano	902c5bf468	virgl: Link shader program Add a new command associated to glLinkProgram. With this we should be able to compile and link shaders when requested by the user, thus avoiding that to happen in the middle of a frame. Together with the command we pass an array of shader handles attached to the program, where each position of the array corresponds to a pipe shader type. Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13674>	2021-11-23 16:14:16 +00:00
Antonio Caggiano	0de0440b7c	gallium: add a link shader hook Allow drivers to register a callback for when a shader program is linked. Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13674>	2021-11-23 16:14:16 +00:00
Iago Toral Quiroga	79dee14cc2	broadcom/compiler: don't move ldvary earlier if current instruction has ldunif If we did, we would have the instruction coming right after ldvary write to the same implicit destination as ldvary at the same time. We prevent this when merging instructions, but we should make sure we prevent this when we move ldvary around for pipelining too. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13921>	2021-11-23 10:52:24 +00:00
Samuel Pitoiset	aee25471b9	radv: fix emitting VBO when vertex input dynamic state is used In the following scenario: CmdBindPipeline() CmdBindVertexBuffers() CmdSetVertexInput() CmdDraw() CmdBindVertexBuffers() CmdSetVertexInput() CmdDraw() The VBO won't be updated for the second draw because the state is cleared when the dynamic state is emitted and the pipeline isn't dirty. Found by inspection. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13855>	2021-11-23 08:39:13 +00:00
Samuel Pitoiset	d36119716d	radv/winsys: report the real family name instead of OVERRIDDEN When RADV_FORCE_FAMILY is used, this helps pre-compiling shaders to make sure cache entries will match real hardware. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13812>	2021-11-23 08:07:41 +00:00
Samuel Pitoiset	cfc5c2abfd	ac: change family names to uppercase in ac_get_family_name() To print the same device name as real hw. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13812>	2021-11-23 08:07:41 +00:00
Samuel Pitoiset	8e5bb2d6ac	radv: convert remaining enums/structs to 1.2 versions Some were missing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13882>	2021-11-23 08:27:19 +01:00
Sagar Ghuge	0d0eae07be	intel/compiler: Prepare disasm for 16-bit sampler params v2: - Update descriptor helper (Jason) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	2fa68cb7da	intel/fs: Define and set correct sampler simd mode Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	31e3e32625	intel/compiler: Deprecate ld2dms and use ld2dms_w instead Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	261dd6c8f8	intel/compiler: Add new variant for TXF_CMS_W This allows, for example, fs_inst::components_read() without passing devinfo as extra argument. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	24831bbd40	intel/compiler: Prepare ld2dms_w for 4 mcs components Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	dfe0ba9080	intel/compiler: Demote sampler params to 16-bit for CMS/UMS/MCS Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	0374b56faa	intel/compiler/fs: Add support for 16-bit sampler msg payload For SIMD8 half float payload, each component takes a full register, so we can use existing LOAD_PAYLOAD infrastruture for required padding by alternating plain 8-wide half float vector and null vector. Also this patch removes an unwanted assertion from opt_copy_propagation_local for LOAD_PAYLOAD. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	936412af27	intel/compiler: Add helper to support half float payload with padding To support SIMD8 half float payloads, each component takes one full 32bit wide register in both SIMD8H and SIMD16H mode. So we can make use of existing LOAD_PAYLOAD infrastructure alternating a half float vector and a null vector, in order to handle required padding. v2: (Francisco) - Skip header sources - Fix comparision units - Don't allocate VGRF for padded source Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	75c73fcdc4	intel/compiler: Fix instruction size written calculation We are always aligning to REG_SIZE but when we have payload sources less than REG_SIZE, size written is miscalculated. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	be2bfe5fe8	intel/compiler: Don't hardcode padding source type to 32bit We can use LOAD_PAYLOAD infrastructure in order to handle 16bit float payload. Let's rely on source type for padding sources, if not set previously then default one would be 32-bit. This patch will be used later in the series to handle 16-bit float payloads. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	0e61d1fbbb	intel/compiler: Handle new sampler descriptor fields for 16bit sampler Update return format field and add SIMD Mode [2] field in sampler descriptor. Now we can tell sampler to return data in either 32/16 bit format precision. v1: - Drop unnecessary descriptor fields (Jason) - Handle return format (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	f78e33aa1a	intel/compiler: Set correct return format for brw_SAMPLE on GFX8 onwards, we have only single bit to determine correct return format. v2: - Define macro and use it instead of hardcoded value. (Lionel) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Emma Anholt	7603187aec	nir: Un-inline more of nir_builder.h. Cuts another 470KB of libnir.a in my release build. Reviewed-by: Matt Turner <mattst88@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13889>	2021-11-22 20:40:47 +00:00
Emma Anholt	d9bfcf5f5b	nir: Un-inline nir_builder_alu_instr_finish_and_insert() This function is big and I don't think it will won't get meaningfully constant-propagated during inlining without LTO. Move it to a .c file so we just have one copy, saving 2.8MB from libnir.a on an amd64 release build. text data bss total filename before: 18953406 7768312 687260 27408978 build-release/driver-symlinks/iris_dri.so 9734366 5542453 481692 15758511 build-release/lib/libvulkan_intel.so 28687772 13310765 1168952 43167489 (TOTALS) after: 15478350 7767864 687260 23933474 build-release/driver-symlinks/iris_dri.so 6810366 5541685 481692 12833743 build-release/lib/libvulkan_intel.so 22288716 13309549 1168952 36767217 (TOTALS) No statistically significant performance difference on iris shader-db, n=8. Reviewed-by: Matt Turner <mattst88@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13889>	2021-11-22 20:40:47 +00:00
Ilia Mirkin	3b5b4b5d45	nir: apply interpolated input intrinsics setting when lowering clipdist For drivers that use this in fragment shaders, load_input is going to produce incorrect results (flat-shaded values). Fixes clipping tests on a4xx. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13900>	2021-11-22 20:11:19 +00:00
Ilia Mirkin	df934873e1	nir: always keep the clip distance array size updated Drivers expect to know the number of clip distances irrespective of whether compact arrays are used or not. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13900>	2021-11-22 20:11:19 +00:00

... 7 8 9 10 11 ...

147839 Commits