third_party_mesa3d

Author	SHA1	Message	Date
Dave Airlie	42d50c779b	nir: put compact into bitfields in nir_variable_data This being declared bool means it won't get merged with the previous bitfields, this seems like an oversight rather than deliberate. Noticed when running pahole. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-09-07 11:00:04 +10:00
Nicolai Hähnle	e902ac3268	nir: add nir_lower_uniforms_to_ubo pass This is a further lowering of default-block uniform loads that transforms load_uniform intrinsics into load_ubo intrinsics. This simplifies the rest of the backend. v2: transform from load_uniform instead of straight from variables Reviewed-by: Eric Anholt <eric@anholt.net>	2017-07-31 14:55:29 +02:00
Nicolai Hähnle	bce6f99875	nir: add nir_lower_samplers_as_deref pass This pass is a replacement for the nir_lower_samplers pass, which has the advantage of keeping sampler references as derefs. This allows a unified treatment of texture instructions and image intrinsics in the backend.	2017-07-31 14:55:29 +02:00
Nicolai Hähnle	b27c2d402e	nir: add nir_instr_rewrite_deref Allows modifying a texture instruction's texture and sampler derefs. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-07-31 14:55:28 +02:00
Matt Turner	1038d385a9	nir: Reduce destination size of ballot intrinsic when possible Some hardware, like i965, doesn't support group sizes greater than 32. In that case, we can reduce the destination size of the ballot intrinsic, which will simplify our code generation. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-07-20 16:56:49 -07:00
Matt Turner	3e7b8f6cd4	nir: Add pass to scalarize read_invocation/read_first_invocation i965 will want these to be scalar operations. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-07-20 16:56:49 -07:00
Matt Turner	43ef75b394	nir: Add system values from ARB_shader_ballot We already had a channel_num system value, which I'm renaming to subgroup_invocation to match the rest of the new system values. Note that while ballotARB(true) will return zeros in the high 32-bits on systems where gl_SubGroupSizeARB <= 32, the gl_SubGroup??MaskARB variables do not consider whether channels are enabled. See issue (1) of ARB_shader_ballot. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-07-20 16:56:49 -07:00
Matt Turner	742cc6118a	nir: Support lowering vote intrinsics ... trivially (as allowed by the spec!) by reusing the existing nir_opt_intrinsics code. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-07-20 16:56:49 -07:00
Matt Turner	d4c9d6a3b2	nir: Add pass to optimize intrinsics Specifically, constant fold intrinsics from ARB_shader_group_vote, but I suspect it'll be useful for other things in the future. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-07-20 16:56:49 -07:00
Nicolai Hähnle	34df9525f6	nir: add NIR_PRINT environment variable Reviewed-by: Rob Clark <robdclark@gmail.com>	2017-07-05 12:27:07 +02:00
Johnson Lin	8ff4be44b7	nir: Add a lowering pass for UYVY textures Similar with support for YUYV but with byte order difference in sampler Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com>	2017-06-30 10:16:26 +01:00
Grazvydas Ignotas	29b9f35704	nir: make various getters take const pointers This will allow to constify other things. Signed-off-by: Grazvydas Ignotas <notasas@gmail.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com>	2017-06-10 16:48:45 +03:00
Jason Ekstrand	b86dba8a0e	nir: Embed the shader_info in the nir_shader again Commit `e1af20f18a` changed the shader_info from being embedded into being just a pointer. The idea was that sharing the shader_info between NIR and GLSL would be easier if it were a pointer pointing to the same shader_info struct. This, however, has caused a few problems: 1) There are many things which generate NIR without GLSL. This means we have to support both NIR shaders which come from GLSL and ones that don't and need to have an info elsewhere. 2) The solution to (1) raises all sorts of ownership issues which have to be resolved with ralloc_parent checks. 3) Ever since `00620782c9`, we've been using nir_gather_info to fill out the final shader_info. Thanks to cloning and the above ownership issues, the nir_shader::info may not point back to the gl_shader anymore and so we have to do a copy of the shader_info from NIR back to GLSL anyway. All of these issues go away if we just embed the shader_info in the nir_shader. There's a little downside of having to copy it back after calling nir_gather_info but, as explained above, we have to do that anyway. Acked-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-05-09 15:07:47 -07:00
Rob Clark	ae7aa8dbaf	nir: fix (hopefully) windows build Fixes: `53aa109b` ("nir: add pass to lower atomic counters to SSBO") Signed-off-by: Rob Clark <robdclark@gmail.com>	2017-05-08 13:41:16 -04:00
Rob Clark	53aa109ba2	nir: add pass to lower atomic counters to SSBO This is equivalent to what mesa/st does in glsl_to_tgsi. For most hw there isn't a particularly good reason to treat these differently. Signed-off-by: Rob Clark <robdclark@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net>	2017-05-04 13:48:06 -04:00
Timothy Arceri	7a7ee40c2d	nir/i965: add before ffma algebraic opts This shuffles constants down in the reverse of what the previous patch does and applies some simpilifications that may be made possible from doing so. Shader-db results BDW: total instructions in shared programs: 12980814 -> 12977822 (-0.02%) instructions in affected programs: 281889 -> 278897 (-1.06%) helped: 1231 HURT: 128 total cycles in shared programs: 246562852 -> 246567288 (0.00%) cycles in affected programs: 11271524 -> 11275960 (0.04%) helped: 1630 HURT: 1378 V2: mark float opts as inexact Reviewed-by: Elie Tournier <elie.tournier@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-04-24 12:08:14 +10:00
Jason Ekstrand	fbcf92a278	nir: Add support for 8 and 16-bit types Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Eduardo Lima Mitev <elima@igalia.com>	2017-03-30 11:34:45 -07:00
Iago Toral Quiroga	023ea3772d	nir/lower_wpos_center: support adding sample position to fragment coordinate According to section 14.6 of the Vulkan specification: "When sample shading is enabled, the x and y components of FragCoord reflect the location of the sample corresponding to the shader invocation." So add a boolean parameter to the lowering pass to select this behavior when we need it. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-24 08:11:53 +01:00
Matt Turner	ef71af7356	nir: Return progress from nir_convert_from_ssa(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:44 -07:00
Matt Turner	abc8a702d0	nir: Return progress from nir_lower_io(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:44 -07:00
Matt Turner	a934b00222	nir: Return progress from nir_lower_regs_to_ssa(). And from nir_lower_regs_to_ssa_impl() as well. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:44 -07:00
Matt Turner	b0e72defc2	nir: Return progress from nir_lower_samplers(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:44 -07:00
Matt Turner	01548f9f01	nir: Return progress from nir_lower_atomics(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:44 -07:00
Matt Turner	0bd615d961	nir: Return progress from nir_lower_clamp_color_outputs(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:44 -07:00
Matt Turner	9dbf91f5c0	nir: Return progress from nir_lower_clip_fs(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:44 -07:00
Matt Turner	4e4927cd95	nir: Return progress from nir_lower_clip_vs(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:44 -07:00
Matt Turner	6077cc75aa	nir: Return progress from nir_move_vec_src_uses_to_dest(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	a539e05d00	nir: Return progress from nir_lower_to_source_mods(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	5a7e4ae23d	nir: Return progress from nir_lower_clip_cull_distance_arrays(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	19345fc160	nir: Return progress from nir_lower_var_copies(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	b831b8d2e1	nir: Return progress from nir_lower_load_const_to_scalar(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	adb157ddfd	nir: Return progress from nir_lower_64bit_pack(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	0012a6144a	nir: Return progress from nir_lower_doubles(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	c597f87739	nir: Return progress from nir_lower_vars_to_ssa(). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	70c0455974	nir: Fix misspellings. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Matt Turner	d6e2bdfed3	nir: Stop using apostrophes to pluralize. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-03-23 14:34:43 -07:00
Emil Velikov	e3de145fa2	nir: consistently use ifndef guards over pragma once Signed-off-by: Emil Velikov <emil.l.velikov@gmail.com> Acked-by: Vedran Miletić <vedran@miletic.net> Acked-by: Juha-Pekka Heikkila <juhapekka.heikkila@gmail.com> Reviewed-by: Edward O'Callaghan <funfunctor@folklore1984.net>	2017-03-22 16:55:22 +00:00
Jason Ekstrand	9084b1db30	nir: Add a get_nir_type_for_glsl_base_type helper Reviewed-by: Eric Anholt <eric@anholt.net>	2017-03-14 07:36:40 -07:00
Jason Ekstrand	074f5ba0b5	nir: Add a simple int64 lowering pass The algorithms used by this pass, especially for division, are heavily based on the work Ian Romanick did for the similar int64 lowering pass in the GLSL compiler. v2: Properly handle vectors v3: Get rid of log2_denom stuff. Since we're using bcsel, we do all the calculations anyway and this is just extra instructions. v4: - Add back in the log2_denom stuff since it's needed for ensuring that the shifts don't overflow. - Rework the looping part of the pass to be easier to expand. Reviewed-by: Matt Turner <mattst88@gmail.com>	2017-03-01 17:00:20 -08:00
Emil Velikov	e4f971c85f	nir: do not #include util/debug.h within extern C {} It's a problem waiting to happen. Individual headers should be annotated if needed. Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Brian Paul <brianp@vmware.com>	2017-02-21 18:29:17 +00:00
Jason Ekstrand	e10f522cd7	nir: Rename lower_double_pack to lower_64bit_pack There's nothing "double" about it other than, perhaps, the fact that it packs two 32-bit values. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>	2017-02-16 17:28:03 -08:00
Dave Airlie	adb9555794	nir: handle 64-bit integer types in glsl->nir type conversion. Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Signed-off-by: Dave Airlie <airlied@redhat.com>	2017-02-16 14:13:14 +10:00
Kenneth Graunke	fd957b1751	nir: Introduce a nir_opt_move_comparisons() pass. This tries to move comparisons (a common source of boolean values) closer to their first use. For GPUs which use condition codes, this can eliminate a lot of temporary booleans and comparisons which reload the condition code register based on a boolean. V2: (Timothy Arceri) - fix move comparision for phis so we dont end up with: vec1 32 ssa_227 = phi block_34: ssa_1, block_38: ssa_240 vec1 32 ssa_235 = feq ssa_227, ssa_1 vec1 32 ssa_230 = phi block_34: ssa_221, block_38: ssa_235 - add nir_op_i2b/nir_op_f2b to the list of comparisons. V3: (Timothy Arceri) - tidy up suggested by Jason. - add inot/fnot to move comparison list V4: (Jason Ekstrand) - clean up move_comparison_source - get rid of the tuple - rework phi handling Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> [v1] Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-12 09:47:29 +11:00
Kenneth Graunke	5297267a1c	nir: Add a pass to lower TES patch_vertices intrinsics to a constant. In Vulkan, we always have both the TCS and TES available in the same pipeline, so we can simply use the TCS OutputVertices execution mode value as the TES PatchVertices built-in. For GLSL, we handle this in the linker. But we could use this pass in the case when both TCS and TES are linked together, if we wanted. Signed-off-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-10 13:21:53 -08:00
Samuel Iglesias Gonsálvez	27cf6a369f	nir: add nir_type_conversion_op() This function returns the nir_op corresponding to the conversion between the given nir_alu_type arguments. This function lacks support for integer-based types with bit_size != 32 and for float16 conversion ops. v2: - Improve readiness of the code and delete cases that don't happen now (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Samuel Iglesias Gonsálvez	3a571fcc43	nir: add nir_get_nir_type_for_glsl_type() v2 (Jason): - Refactor nir_get_nir_type_for_glsl_type() to avoid using unneeded helpers (Jason) v3: - Use return directly (Jason) Signed-off-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net>	2017-01-09 09:10:13 +01:00
Jason Ekstrand	62332d139c	nir: Add a local variable-based copy propagation pass Reviewed-by: Timothy Arceri <timothy.arceri@collabora.com>	2017-01-06 16:44:28 -08:00
Jason Ekstrand	47b54a6f74	nir/lower_var_copies: Use a shader rather than a void *mem_ctx Reviewed-by: Eduardo Lima Mitev <elima@igalia.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-12-30 12:38:04 -08:00
Jason Ekstrand	134a5ad31c	nir: Make nir_copy_deref follow the "clone" pattern We rename it to nir_deref_clone, re-order the sources to match the other clone functions, and expose nir_deref_var_clone. This past part, in particular, lets us get rid of quite a few lines since we no longer have to call nir_copy_deref and wrap it in deref_as_var. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com>	2016-12-30 12:38:04 -08:00
Jason Ekstrand	baf1aa1334	nir: Add foreach_register helper macros	2016-12-29 16:02:44 -08:00

... 2 3 4 5 6 ...

342 Commits