third_party_mesa3d

Author	SHA1	Message	Date
Marek Olšák	08dc541b66	nir: pack nir_variable::data::stream Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-11-04 18:17:34 -05:00
Ian Romanick	9be4a422a0	nir/algebraic: Mark other comparison exact when removing a == a This prevents some additional optimizations that would change the original result. This includes things like (b < a && b < c) => b < min(a, c) and !(a < b) => b >= a. Both of these optimizations were specifically observed in the piglit tests added in piglit!160. This was discovered while investigating https://gitlab.freedesktop.org/mesa/mesa/issues/1958. However, the problem in that issue was Chrome or Angle is replacing calls to isnan() with some stuff that we (correctly) optimize to false. If they had left the calls to isnan() alone, everything would have just worked. No shader-db changes on any Intel platform. I also tried marking the comparison generated by the isnan() function precise. The precise marker "infects" every computation involved in calculating the parameter to the isnan() function, and this severely hurt all of the (few) shaders in shader-db that use isnan(). I also considered adding a new ir_unop_isnan opcode that would implement the functionality. During GLSL IR-to-NIR translation, the resulting comparison operation would be marked exact (and the samething would need to happen in SPIR-V translation). This approach taken by this patch seemed easier, but we may want to do the ir_unop_isnan thing anyway. Fixes: `d55835b8bd` ("nir/algebraic: Add optimizations for "a == a && a CMP b"") Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-11-04 14:05:49 -08:00
Ian Romanick	ea19f2fb68	nir/algebraic: Add the ability to mark a replacement as exact Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-11-04 14:05:49 -08:00
Marek Olšák	af94600484	compiler: make variable::data::binding unsigned Nothing seems to set a negative value. Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-11-04 16:49:46 -05:00
Marek Olšák	4b4b383f38	st/mesa: call nir_lower_flrp only once per shader Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-11-04 16:49:44 -05:00
Marek Olšák	7d00218aed	st/mesa: call nir_opt_access only once Reviewed-by: Connor Abbott <cwabbott0@gmail.com>	2019-11-04 16:49:42 -05:00
Leo Liu	352b57d709	ac: add missing Arcturus to the info of pc lines Signed-off-by: Leo Liu <leo.liu@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: Marek Olšák <marek.olsak@amd.com>	2019-11-04 16:27:35 -05:00
Alyssa Rosenzweig	4da648a170	panfrost/ci: Update T760 expectations Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	12d071024b	pan/midgard: Extend default_phys_reg to !32-bit We can pass through a size. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	762623381d	pan/midgard: Extend swizzle packing for vec4/16-bit We would like to pack not just xyzw swizzles but also efgh swizzles. This should work for vec4/16-bit. More work will be needed to pack swizzles for vec8/16-bit and even more work for 8-bit, of course. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	bf5508f7b9	pan/midgard: Extend offset_swizzle to non-32-bit We take a size parameter; use it. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	f538981384	pan/midgard: offset_swizzle doesn't need dstsize This argument should be omitted. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	9eac9389fb	pan/midgard: Add bizarre corner case Someone really needs to look into this. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	4ae4d82e21	pan/midgard: Compute bundle interference Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	45ac8ea8bd	pan/midgard: Fix quadword_count handling Spilling can mess with this considerably. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Alyssa Rosenzweig	0a77dd3203	pan/midgard: Validate tags when branching Midgard prefetches instructions based on tag (ALU, LD/ST, texture * size). To do so, the shader descriptor specifies the tag of the first instruction, all instructions specify the tag of the next linear instruction is, and all branches explicitly specify the tag of the branch target. If you mess this up, you get an INSTR_TYPE_MISMATCH, which unambiguously refers to this problem, but it's still annoying to try to work out all the branch targets in your head to debug. Instead, let's track the tags of various blocks over time, so we can automatically validate tags of branch targets, to make INSTR_TYPE_MISMATCH issues immediately obvious in a disassembly. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 15:36:08 -05:00
Daniel Schürmann	efe737fc4f	aco: fix accidential reordering of instructions when scheduling Fixes: `8678699918` "aco: implement VGPR spilling" Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>	2019-11-04 20:14:14 +01:00
Daniel Schürmann	5c7dcb15e0	aco: only use single-dword loads/stores for spilling Fixes: `8678699918` "aco: implement VGPR spilling" Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>	2019-11-04 20:14:14 +01:00
Daniel Schürmann	d97c0bdd55	aco: fix immediate offset for spills if scratch is used Fixes: `8678699918` "aco: implement VGPR spilling" Reviewed-by: Rhys Perry <pendingchaos02@gmail.com>	2019-11-04 20:14:14 +01:00
Lionel Landwerlin	ee6fbb95a7	anv: Properly handle host query reset of performance queries The host query reset entry point didn't use the availability offset for performance queries. To fix this, reorder the availability of performance queries to match other queries. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2b5f30b1d9` ("anv: implement VK_INTEL_performance_query") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-11-04 19:04:38 +00:00
Paul Gofman	ecc31d032e	state_tracker: Handle texture view min level in st_generate_mipmap() Signed-off-by: Paul Gofman <gofmanp@gmail.com> Signed-off-by: Marek Olšák <marek.olsak@amd.com>	2019-11-04 13:24:31 -05:00
James Xiong	b6d45e7f74	iris: try to set the specified tiling when importing a dmabuf When importing a dmabuf with a specified tiling, the dmabuf user should always try to set the tiling mode because: 1) the exporter can set tiling AFTER exporting/importing. 2) a dmabuf could be exported from a kernel driver other than i915, in this case the dmabuf user and exporter need to set tiling separately. This patch fixes a problem when running vkmark under weston with iris on ICL, it crashed to console with the following assert. i965 doesn't have this problem as it always tries to set the specified tiling mode. weston: ../src/gallium/drivers/iris/iris_resource.c:990: iris_resource_from_handle: Assertion `res->bo->tiling_mode == isl_tiling_to_i915_tiling(res->surf.tiling)' failed. Signed-off-by: James Xiong <james.xiong@intel.com> Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-11-04 17:59:52 +00:00
Kenneth Graunke	fc7b748086	iris: Fix "Force Zero RTA Index Enable" setting again In `2ca0d913ea`, we began updating cso_fb->layers to the actual layer count, rather than 0. This fixed cases where we were setting "Force Zero RTA Index Enable" even when doing layered rendering. Sadly, it also broke the check entirely: cso_fb->layers is now 1 for non-layered cases, but the Force Zero RTA Index check was still comparing for 0. Fixes: `2ca0d913ea` ("iris: Fix framebuffer layer count")	2019-11-04 08:57:37 -08:00
Dylan Baker	717606f9f3	nir: correct use of identity check in python Python has the identity operator `is`, and the equality operator `==`. Using `is` with strings sometimes works in CPython due to optimizations (they have some kind of cache), but it may not always work. Fixes: `96c4b135e3` ("nir/algebraic: Don't put quotes around floating point literals") Reviewed-by: Matt Turner <mattst88@gmail.com>	2019-11-04 16:06:39 +00:00
Boris Brezillon	28440820ef	panfrost: MALI_DEPTH_TEST is actually MALI_DEPTH_WRITEMASK MALI_DEPTH_TEST should only be set when depth->writemask is true, not when the depth test is enabled. Let's rename the flag and patch panfrost_bind_depth_stencil_state() to do the right thing. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-04 16:14:09 +01:00
Lionel Landwerlin	71634b1003	vulkan: bump headers/registry to 1.1.127 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2019-11-04 17:07:11 +02:00
Samuel Pitoiset	9ab27647ff	radv: fix compute pipeline keys when optimizations are disabled If an app first creates a compute pipeline with VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT set, then re-compile it without that flag, the driver should re-compile the compute shader. Otherwise, it will return the unoptimized one. Fixes: `ce188813bf` ("radv: add initial support for VK_PIPELINE_CREATE_DISABLE_OPTIMIZATION_BIT") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>	2019-11-04 08:50:00 +01:00
Karol Herbst	538d2c33b8	nv50/ir: fix crash in isUniform for undefined values Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu>	2019-11-03 01:02:52 +01:00
Lionel Landwerlin	88d665830f	mesa: check draw buffer completeness on glClearBufferfi/glClearBufferiv Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-02 09:14:26 +00:00
Vasily Khoruzhick	103378f332	lima: set dithering flag when necessary Bit 13 in aux1 enables dithering Reviewed-by: Qiang.Yu <yuq825@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-11-01 21:44:31 -07:00
Marek Olšák	c236e6c1e3	glsl: encode struct/interface types better Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-11-01 19:19:03 -04:00
Marek Olšák	5dde2aa8d9	glsl: encode array types better Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-11-01 19:19:03 -04:00
Marek Olšák	c141366560	glsl: encode explicit_stride for basic types better Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-11-01 19:19:03 -04:00
Marek Olšák	86adce4fef	glsl: encode vector_elements and matrix_columns better Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-11-01 19:19:03 -04:00
Marek Olšák	21d2fbb8c3	glsl: encode/decode types using a union with bitfields for readability Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-11-01 19:19:03 -04:00
Vasily Khoruzhick	dd52744201	lima: ignore flags while looking for BO in cache Any BO would work, we don't have any BO types yet anyway. Moreover lima_submit_add_bo() changes BO flags so they won't match allocation flags. Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-11-01 13:12:07 -07:00
Vasily Khoruzhick	ae0b05d8db	lima: align size before trying to fetch BO from cache Otherwise we may be looking in wrong bucket Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-11-01 13:12:03 -07:00
Vasily Khoruzhick	08d6416a1d	lima: add debug prints for BO cache LIMA_DEBUG=bocache now activates debug prints for BO allocation, destruction and BO cache. Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-11-01 13:11:47 -07:00
Alyssa Rosenzweig	b32caa6f1f	pan/midgard: Use fp32 blend shaders Clearly we do want to have fp16 at some point ... but I kind of give up debugging and it turns out the issues with fp16 support in 'frost are so deeply rooted that I might as well disable this non-opt and land LCRA now. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 13:47:52 -04:00
Bas Nieuwenhuizen	8efb8f55a6	radv: Close all unnecessary fds in secure compile. The seccomp filter allows read/write, let us make sure nobody can do anything with this. Fixes: `cff53da374` "radv: enable secure compile support" Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com>	2019-11-01 17:15:34 +01:00
Erik Faye-Lund	dd77bdb34b	anv: remove incorrect polygonMode=point early-out This is incorrect, because polygonMode only applies if the final primitive type is a polygon; polygonMode doesn't apply to line-primitives as the comment suggests. The Vulkan 1.1 spec, section 26.11, "Polygons" defines that polygons are separate from points and line segments: " A polygon results from the decomposition of a triangle strip, triangle fan or a series of independent triangles. Like points and line segments, polygon rasterization is controlled by several variables in the VkPipelineRasterizationStateCreateInfo structure. " Further, section 26.11.2, "Polygon Mode", only define polygonMode to apply to polygons: " Possible values of the VkPipelineRasterizationStateCreateInfo::polygonMode property of the currently active pipeline, specifying the method of rasterization for polygons, are: " This seems to clearly define that polygonMode doesn't apply to points and lines, so let's make sure that we don't early out with the wrong value. Signed-off-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2019-11-01 07:26:03 +00:00
Alyssa Rosenzweig	c3a46e7644	pan/midgard: Eliminate blank_alu_src We don't need it in practice, so this is some more cleanup. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 01:01:47 +00:00
Alyssa Rosenzweig	70072a20e0	pan/midgard: Refactor swizzles Rather than having hw-specific swizzles encoded directly in the instructions, have a unified swizzle arary so we can manipulate swizzles generically. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 01:01:47 +00:00
Alyssa Rosenzweig	e7fd14ca8a	pan/midgard: Add a dummy source for loads We want symmetry between loads and stores, so we add a dummy source. So we get, e.g. st_int4 _, val, arg_1, arg_2 ld_int4 dest, _, arg_1, arg_2 Semantically, this dummy source represents the data itself, as if the load is simply a move. That means it has a swizzle that acts as a source. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 01:01:47 +00:00
Alyssa Rosenzweig	b5938be51d	pan/midgard: Remove OP_IS_STORE_VARY Unused. Signed-off-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com>	2019-11-01 01:01:46 +00:00
Timothy Arceri	1c2bf82d24	glsl: disable lower_fragdata_array() for NIR drivers This function was added in `7e414b5864` to work around a defect in lower_output_reads(). As of the previous commit no NIR driver calls lower_output_reads(). This change means we don't need the special GLSL IR style gl_FragData handling for building the resource list in a NIR based linker. No shader-db change on SKL i965. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-01 11:33:54 +11:00
Timothy Arceri	0e186c18ba	glsl: just use NIR to lower outputs when driver can't read outputs This will allow us to stop lowering gl_FragData in GLSL IR for NIR drivers which means we won't need the special GLSL IR type handling for building the resource list in a NIR based linker. i965 has been doing this since `b828f7a27b`. Reviewed-by: Marek Olšák <marek.olsak@amd.com>	2019-11-01 11:33:33 +11:00
Icenowy Zheng	8fa13db251	lima: support indexed draw with bias When doing an indexed draw with index_bias set to a non-zero value (e.g. by glDrawElementsBaseVertex), the vertex buffer should be offseted by index_bias vertices. Add this offset when setting the vertex buffer address. Signed-off-by: Icenowy Zheng <icenowy@aosc.io> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com>	2019-10-31 21:56:45 +00:00
Jason Ekstrand	f60ef0fff4	anv: Move the RT BTI flush workaround to begin_subpass Now that we're no longer compacting binding table entries, the only time they can possibly change is when we actually switch subpasses. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-10-31 21:07:15 +00:00
Jason Ekstrand	6a8f43030c	anv: Stop compacting render targets in the binding table Instead, always emit one entry for every color attachment in the subpass or one NULL if there are no color attachments. This will let us adjust an Ice Lake workaround so we don't get a stall on every draw call. Reviewed-by: Rafael Antognolli <rafael.antognolli@intel.com>	2019-10-31 21:07:15 +00:00

1 2 3 4 5 ...

117289 Commits