intel/perf: fix EuThreadsCount value in performance equations
EuThreadsCount is supposed to be the number of threads per EU, not the
total number of threads in the whole device.
Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com>
Fixes: 1fc7b95127
("i965: Add Gen8+ INTEL_performance_query support")
Reviewed-by: Kenneth Graunke <kenneth@whitecape.org>
This commit is contained in:
@@ -298,8 +298,7 @@ compute_topology_builtins(struct gen_perf *perf,
|
||||
for (int i = 0; i < sizeof(devinfo->eu_masks); i++)
|
||||
perf->sys_vars.n_eus += __builtin_popcount(devinfo->eu_masks[i]);
|
||||
|
||||
perf->sys_vars.eu_threads_count =
|
||||
perf->sys_vars.n_eus * devinfo->num_thread_per_eu;
|
||||
perf->sys_vars.eu_threads_count = devinfo->num_thread_per_eu;
|
||||
|
||||
/* The subslice mask builtin contains bits for all slices. Prior to Gen11
|
||||
* it had groups of 3bits for each slice, on Gen11 it's 8bits for each
|
||||
|
Reference in New Issue
Block a user