third_party_mesa3d

Author	SHA1	Message	Date
Jason Ekstrand	57e7c5f05e	nir/opt_load_store_vectorize: Use bit sizes when checking mask compatibility Without this, it was checking bit size compatibility with bit sizes such as 96 which is clearly invalid. No shader-db changes on Ice Lake Fixes: `ce9205c03b` "nir: add a load/store vectorization pass" Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	f6667cb0ce	nir: Add a memcpy optimization pass This pass attempts to optimize three broad categories of memcpy: 1. Self-copies: These we can discard out-of-hand. 2. Vector copies: It doesn't matter what the vector size is or if the source and destination have different vector types, it's still easy enough to emit a load/store pair. 3. Tightly packed copies: In the case where a type is tightly packed (no padding bits), we can replace the memcpy with a copy_deref instruction which the optimizer is far better at handling. This has proven capable of getting rid of many of the memcpy instances in some rather gnarly OpenCL C kernels I've been looking at, even after coming out of LLVM's optimizer. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	e363da3bdd	nir: Handle memcpy in copy_prop_vars and combine_stores Fixes: `b2899f7265` "nir: Add a new memcpy intrinsic" Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	100a5ace63	nir/find_array_copies: Properly discard copies for casts In `9f3c595dfc`, we attempted to handle casts in opt_find_array_copies but missed a critical case. In particular, in the case where we begin finding a copy but then encounter a cast, we need to discard everything which might alias that cast. Fixes: `9f3c595dfc` "nir/find_array_copies: Handle cast derefs" Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6871>	2020-10-02 07:30:49 +00:00
Jason Ekstrand	98bb74b67d	nir: Fix a misspelling Fixes: `cb95065dd1` "nir: Add lowering from regular ALU conversions..." Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6975>	2020-10-01 20:44:04 -05:00
Timothy Arceri	038fcbcaed	glsl: don't duplicate state vars as uniforms in the NIR linker The linker was adding all state vars as uniforms, doubling the storage size for shaders using only builtin uniforms, which increased CPU overhead for constant buffer uploads. When this code was originally ported from the GLSL IR linker we forgot to exclude builtins because the check was not done in the add_uniform_to_shader class but rather a check was done when passing variables to this class for processing. Fixes: `664e4a610d` ("glsl/nir: Fill in the Parameters in NIR linker") Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Tested-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6958>	2020-10-02 00:57:00 +00:00
Jason Ekstrand	cb95065dd1	nir: Add lowering from regular ALU conversions to the intrinsic Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jesse Natalie	7d97f3dfdc	spirv: Implement vload[a]_half[n] and vstore[a]_half[n][_r] Note, the aligned versions aren't handled specially yet. The float16buffer capability is now at least partially supported after this patch, so move it to be supported when kernels are supported. v2 (Jason Ekstrand): - A few cosmetic cleanups around type/base_type - Rebased on top of the big SPIR-V SSA value rework - Use the new version of the conversion helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	a85afb797e	spirv/opencl: Drop dest_type from handle_v_load_store At that point in the function, we don't know if it's a load or a store so calling it dest_type isn't really helpful. Also, we don't really want the glsl_type; we want the base_type. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	8610af12b6	spirv: Handle all OpenCL conversion ops with full rounding This is done for kernels via the new convert_alu_types intrinsic. For Vulkan and OpenGL, we maintain the old path so that drivers don't have to add that lowering pass. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	8e8458218c	spirv: Add some conversion handling helpers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	383ecfbc70	nir: Add passes for nir_intrinsic_convert_alu_types This adds primarily two passes: One is a lowering pass which turns these conversion intrinsics into a series of ALU ops. The other is an optimization pass which attempts to simplify the conversion whenever possible in the hopes that we can turn it into a "normal" conversion op which doesn't need special treatment. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	d5cb51e2b9	nir: Add builder helpers for OpenCL type conversions Most of these were originally written by Daniel Stone in the Microsoft ClOn12 branch, reworked by Jesse Natalie, fixed by Boris Brezillon, and possibly touched by others along the way. Unfortunately, none of that is in the commit history thanks to living in the CLOn12 branch. I ported them to mesa master and further reworked things for better cosmetics. In particular, 1. They now live in a builder helper rather than in vtn_alu.c. 2. Instead of looping inside each builder helper, we just trust NIR vector instructions to handle vectors. 3. Lots of re-arranging of the helpers for clarity, better asserting, and better re-use with the upcoming lowering pass. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	588bb6686b	nir: Add a conversion and rounding intrinsic This new intrinsic is capable of handling the full range of conversions from OpenCL including rounding modes and possible saturation. The intention is that we'll emit this intrinsic directly from spirv_to_nir and then lower it to ALU ops later. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Jason Ekstrand	0aa08ae2f6	nir: Split NIR_INTRINSIC_TYPE into separate src/dest indices We're about to introduce conversion ops which are going to want two different types. We may as well just split the one we have rather than end up with three. There are a couple places where this is mildly inconvenient but most of the time I find it to actually be nicer. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6945>	2020-10-01 18:36:53 +00:00
Eric Anholt	e3f4655805	nir: Make nir_lower_ubo_vec4() handle non-vec4-aligned loads. It turns out I had missed a case in my enumeration of why everything currently was vec4-aligned. Fixes a simple testcase of loading from a vec3[2] array in freedreno with IR3_SHADER_DEBUG=nouboopt. Initial shader-db results look devastating: total instructions in shared programs: 8019997 -> 12829370 (59.97%) total cat6 in shared programs: 87683 -> 145840 (66.33%) Hopefully this will recover once we introduce the i/o vectorizer, but that was blocked on getting the vec3 case fixed. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00
Eric Anholt	618556a8cb	nir: Drop the high_offset argument to the load_store_vectorizer filter. Nothing uses it, and it's not clear to me what it provides over alignment/num_components/bit_size. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00
Eric Anholt	5f757bb95c	nir: Make the load_store_vectorizer provide align_mul + align_offset. It was passing an encoding of the two that wasn't good for ensuring "Don't combine loads that would make us straddle a vec4 boundary" for nir_lower_ubo_vec4. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00
Eric Anholt	9c5a793dc7	nir/gl_nir_lower_buffers: Set up align_mul/offset on UBOs. nir_lower_to_explicit_io will give us good alignments if we have the cast's alignment information known, and it's trivial: Just the offset of the UBO variable that is at the base of the deref. Otherwise, explicit io assumes the load is aligned just to the size of a scalar value in it. The change in freedreno is in the noise. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00
Eric Anholt	ffbfc1ec0e	nir/nir_lower_uniforms_to_ubo: Set better alignments on our new instructions. The change on freedreno is in the noise. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00
Eric Anholt	c88c89ff3e	nir: Print the alignment information on casts. I wanted it for debugging GL alignment. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00
Eric Anholt	6c1c571440	nir: Document a bit about how align_mul/offset work. Introduces a #define for the maximum valid align_mul that's used in the load_store_vectorizer tests (currently, though it will be used more soon). Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6612>	2020-09-30 19:53:43 +00:00
Jason Ekstrand	25ebd7f90f	Revert "nir/lower_goto_if: Add a route::outside set" This reverts commit `d57573dcd4`. The actual bug was an issue with prev_frontiers which has been properly fixed in the previous commit. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6750>	2020-09-30 16:46:11 +00:00
Jason Ekstrand	57c9fc3cba	nir/lower_goto_ifs: Always include level dom_frontiers in prev_frontier When we come in from some other level or from the parent, we need to ensure that the reach set is in prev_frontier but we also need to consider the dominance frontier of our level. Otherwise, we may end up leaving out possible blocks when computing the reach of a level. Acked-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6750>	2020-09-30 16:46:11 +00:00
Jason Ekstrand	7749983658	nir/lower_goto_ifs: Add asserts for SSA forks Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6750>	2020-09-30 16:46:11 +00:00
Jason Ekstrand	dc010cb74e	nir/lower_goto_ifs: Use rzalloc In particular, SSA forks weren't always getting properly initialized which was causing asserts to fail. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6750>	2020-09-30 16:46:11 +00:00
Jason Ekstrand	fa3c38ceb3	spirv: Only run repair_ssa if structured We shouldn't need it if we're unstructured and the pass assumes structure so attempting to run it will assert-fail. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6750>	2020-09-30 16:46:11 +00:00
Jason Ekstrand	719c68016a	nir/dominance: Use _mesa_set_clear instead of hand-rolling it Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6750>	2020-09-30 16:46:11 +00:00
Jason Ekstrand	b6a4172f10	nir/lower_goto_ifs: Don't destroy SSA form in the process There are two issues here: 1. If there are any phi nodes, we'll make complete hash of them. This isn't likely actually a problem because spirv_to_nir doesn't generate any actual phi nodes today. However, if we start doing any other passes before this, we may have a problem. 2. Even without phi nodes, we may still break SSA form. This can happen if we ever have to stick a block inside a conditional to satisfy weird CFG constraints. Doing so can cause it to no longer look like it dominates some of its uses even though, at runtime, it's guaranteed to be run first. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6750>	2020-09-30 16:46:11 +00:00
Jason Ekstrand	6f134a622b	nir/validate: Improve the validation of blocks This commit adds a number of new validation checks: 1. We now check that every block pointer in the IR points to a block that actually exists in a block list that's reachable from the nir_function_impl. 2. We assert that nir_function_impl::body is non-empty 3. We assert that the start block has no predecessors. This is important because we tend to put run-once code there. 4. We now validate some stuff on the end block. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6750>	2020-09-30 16:46:11 +00:00
Jason Ekstrand	7dbb1f7462	nir/cf: Better handle intra-block splits In the case where end was an instruction-based cursor, we would mix up our blocks and end up with block_begin pointing after the second split. This causes a segfault as the cf_node list walk at the end of the function never terminates properly. There's also a possibility of mix-up if begin is an instruction-based cursor which was found by inspection. Fixes: `fc7f2d2364` "nir/cf: add new control modification API's" Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6866>	2020-09-30 15:47:51 +00:00
Jason Ekstrand	5e2e882270	nir: Disallow goto and goto_if in clone and [de]serialize Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6866>	2020-09-30 15:47:51 +00:00
Jason Ekstrand	9a48ed84ec	nir/copy_propagate: Copy-prop into jump conditions Reviewed-by: Eric Anholt <eric@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6866>	2020-09-30 15:47:51 +00:00
Connor Abbott	7f0cd6f153	nir/opt_if: Use early returns in opt_if_merge() We would've had to add yet another level of indentation, or duplicated finding the if conditions in the next commit. Refactor this function to use early returns like our other optimizations, so that this isn't an issue. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6866>	2020-09-30 15:47:51 +00:00
Connor Abbott	656e428ff4	nir/opt_if: Remove open-coded nir_ssa_def_rewrite_uses() So that we don't have to change these two places later. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6866>	2020-09-30 15:47:51 +00:00
Connor Abbott	c6f871b62e	nir/lower_returns: Use nir control flow insertion helpers Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6866>	2020-09-30 15:47:51 +00:00
Jason Ekstrand	92a594b154	spirv: Delete the legacy offset/index UBO/SSBO lowering Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5275>	2020-09-30 07:20:39 +00:00
Jason Ekstrand	657d49a9ba	spirv: Use derefs for push constants Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5275>	2020-09-30 07:20:39 +00:00
Jason Ekstrand	ac7537f155	nir/lower_io: Add support for push constants Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5275>	2020-09-30 07:20:38 +00:00
Jason Ekstrand	7a2b4ce22e	nir: Allow creating variables with nir_var_mem_push_const. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/5275>	2020-09-30 07:20:38 +00:00
Indrajit Kumar Das	40c1f9883e	mesa,glsl: add support for GL_NV_shader_atomic_int64 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6708>	2020-09-29 14:24:44 +00:00
Connor Abbott	51e2b31039	nir: Handle per-view io in nir_io_add_const_offset_to_base() This isn't strictly necessary for freedreno, since we aren't using it yet, but I wanted to avoid any problems if we do. If we wanted to handle this "properly", and handle matrix and array per-view variables, we'd probably want to encode the "view stride" (number of views per user location) and base view in the intrinsic, but for now we just don't do any offsetting and assume the indirect offset is the view. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6514>	2020-09-29 10:32:00 +00:00
Connor Abbott	bc8a5c0752	nir: Add per_view to IO semantics Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6514>	2020-09-29 10:32:00 +00:00
Connor Abbott	5a88db682e	nir/lower_io_arrays: Fix xfb_offset bug I noticed this once I started gathering xfb_info after nir_lower_io_arrays_to_elements_no_indirect. Fixes: `b2bbd978d0` ("nir: fix lowering arrays to elements for XFB outputs") Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6514>	2020-09-29 10:32:00 +00:00
Connor Abbott	df955ce6b6	nir: Count i/o slots correctly for per-view variables This function wasn't counting driver slots correctly, resulting in incorrect driver_location's and input_count. It seems intel doesn't use this yet. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6514>	2020-09-29 10:32:00 +00:00
Connor Abbott	ffe946d7e8	nir: Add nir_lower_multiview pass Taken mostly directly from the anv pass. A few anv-specific things that I could leave in anv aren't included. Specifically on turnip we don't need to set gl_Layer to 0, and we can handle the case where the FS reads gl_ViewIndex, so that check is moved into anv. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6514>	2020-09-29 10:31:59 +00:00
Samuel Pitoiset	a0e35c7562	nir/lower_io: change nir_io_add_const_offset_to_base to use bitfield modes Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6890>	2020-09-29 09:40:21 +00:00
Eric Anholt	e8c5f8b9d3	nir/lower_clip: Add i/o semantics for load/store intrinsics. ir3 looks at the .location on its inputs for handling non-VARYING_SLOT_POS, but our uninitialized semantics matched that and threw a compiler assertion failure. Fixes: `502abfce7f` ("nir: save IO semantics in lowered IO intrinsics") Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6716>	2020-09-28 17:35:30 +00:00
Samuel Pitoiset	39098a2053	nir/lower_memory_model: do not break with global atomic operations Global atomics don't have an access flag. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6888>	2020-09-28 14:47:02 +00:00
Samuel Pitoiset	de1409089c	nir/lower_memory_model: return progress when visiting instructions It never returned progress=TRUE. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6888>	2020-09-28 14:47:02 +00:00
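
Note on `6c1c571440` and `5f757bb95c`: several of the commits above pass an (align_mul, align_offset) pair around so that a filter can refuse to combine loads that would straddle a vec4 boundary. The sketch below illustrates that idea, assuming the usual convention that align_mul is a power of two and that the address satisfies `addr % align_mul == align_offset`. The helper names `combined_align` and `may_straddle_vec4` are hypothetical and are not taken from the Mesa tree.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

#define VEC4_BYTES 16u

/* Hypothetical helper: the largest power of two known to divide an address
 * that satisfies addr % align_mul == align_offset. */
static uint32_t combined_align(uint32_t align_mul, uint32_t align_offset)
{
    assert(align_mul != 0 && (align_mul & (align_mul - 1)) == 0);
    assert(align_offset < align_mul);
    /* Lowest set bit of the offset, or the multiplier if the offset is 0. */
    return align_offset ? (align_offset & (~align_offset + 1u)) : align_mul;
}

/* Hypothetical filter: could an access of size_bytes cross a 16-byte
 * boundary, given only the alignment information? */
static bool may_straddle_vec4(uint32_t align_mul, uint32_t align_offset,
                              uint32_t size_bytes)
{
    assert(align_mul != 0 && (align_mul & (align_mul - 1)) == 0);
    assert(align_offset < align_mul);

    uint32_t worst_start;
    if (align_mul >= VEC4_BYTES) {
        /* The position inside the vec4 is fully known. */
        worst_start = align_offset % VEC4_BYTES;
    } else {
        /* Only addr % align_mul is known; assume the latest start in the
         * vec4 that is still consistent with it. */
        worst_start = VEC4_BYTES - align_mul + align_offset;
    }
    return worst_start + size_bytes > VEC4_BYTES;
}
```

Keeping the two values separate is what makes the second function possible: a single combined alignment of 4 cannot distinguish "offset 4 within a vec4" from "offset 12 within a vec4", while the pair (16, 4) can.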

5452 Commits