Skip to content

Commit bcc8eda

Browse files
committed
Merge tag 'turbostat-2024.11.30' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux
Pull turbostat updates from Len Brown: - assorted minor bug fixes - assorted platform specific tweaks - initial RAPL PSYS (SysWatt) support * tag 'turbostat-2024.11.30' of git://git.kernel.org/pub/scm/linux/kernel/git/lenb/linux: tools/power turbostat: 2024.11.30 tools/power turbostat: Add RAPL psys as a built-in counter tools/power turbostat: Fix child's argument forwarding tools/power turbostat: Force --no-perf in --dump mode tools/power turbostat: Add support for /sys/class/drm/card1 tools/power turbostat: Cache graphics sysfs file descriptors during probe tools/power turbostat: Consolidate graphics sysfs access tools/power turbostat: Remove unnecessary fflush() call tools/power turbostat: Enhance platform divergence description tools/power turbostat: Add initial support for GraniteRapids-D tools/power turbostat: Remove PC3 support on Lunarlake tools/power turbostat: Rename arl_features to lnl_features tools/power turbostat: Add back PC8 support on Arrowlake tools/power turbostat: Remove PC7/PC9 support on MTL tools/power turbostat: Honor --show CPU, even when even when num_cpus=1 tools/power turbostat: Fix trailing '\n' parsing tools/power turbostat: Allow using cpu device in perf counters on hybrid platforms tools/power turbostat: Fix column printing for PMT xtal_time counters tools/power turbostat: fix GCC9 build regression
2 parents 0cb7170 + 86d2377 commit bcc8eda

File tree

2 files changed

+325
-127
lines changed

2 files changed

+325
-127
lines changed

tools/power/x86/turbostat/turbostat.8

+27
Original file line numberDiff line numberDiff line change
@@ -33,6 +33,9 @@ name as necessary to disambiguate it from others is necessary. Note that option
3333
msr0xXXX is a hex offset, eg. msr0x10
3434
/sys/path... is an absolute path to a sysfs attribute
3535
<device> is a perf device from /sys/bus/event_source/devices/<device> eg. cstate_core
36+
On Intel hybrid platforms, instead of one "cpu" perf device there are two, "cpu_core" and "cpu_atom" devices for P and E cores respectively.
37+
Turbostat, in this case, allow user to use "cpu" device and will automatically detect the type of a CPU and translate it to "cpu_core" and "cpu_atom" accordingly.
38+
For a complete example see "ADD PERF COUNTER EXAMPLE #2 (using virtual "cpu" device)".
3639
<event> is a perf event for given device from /sys/bus/event_source/devices/<device>/events/<event> eg. c1-residency
3740
perf/cstate_core/c1-residency would then use /sys/bus/event_source/devices/cstate_core/events/c1-residency
3841

@@ -187,6 +190,8 @@ The system configuration dump (if --quiet is not used) is followed by statistics
187190
.PP
188191
\fBRAMWatt\fP Watts consumed by the DRAM DIMMS -- available only on server processors.
189192
.PP
193+
\fBSysWatt\fP Watts consumed by the whole platform (RAPL PSYS). Disabled by default. Enable with --enable SysWatt.
194+
.PP
190195
\fBPKG_%\fP percent of the interval that RAPL throttling was active on the Package. Note that the system summary is the sum of the package throttling time, and thus may be higher than 100% on a multi-package system. Note that the meaning of this field is model specific. For example, some hardware increments this counter when RAPL responds to thermal limits, but does not increment this counter when RAPL responds to power limits. Comparing PkgWatt and PkgTmp to system limits is necessary.
191196
.PP
192197
\fBRAM_%\fP percent of the interval that RAPL throttling was active on DRAM.
@@ -387,6 +392,28 @@ CPU pCPU%c1 CPU%c1
387392

388393
.fi
389394

395+
.SH ADD PERF COUNTER EXAMPLE #2 (using virtual cpu device)
396+
Here we run on hybrid, Raptor Lake platform.
397+
We limit turbostat to show output for just cpu0 (pcore) and cpu12 (ecore).
398+
We add a counter showing number of L3 cache misses, using virtual "cpu" device,
399+
labeling it with the column header, "VCMISS".
400+
We add a counter showing number of L3 cache misses, using virtual "cpu_core" device,
401+
labeling it with the column header, "PCMISS". This will fail on ecore cpu12.
402+
We add a counter showing number of L3 cache misses, using virtual "cpu_atom" device,
403+
labeling it with the column header, "ECMISS". This will fail on pcore cpu0.
404+
We display it only once, after the conclusion of 0.1 second sleep.
405+
.nf
406+
sudo ./turbostat --quiet --cpu 0,12 --show CPU --add perf/cpu/cache-misses,cpu,delta,raw,VCMISS --add perf/cpu_core/cache-misses,cpu,delta,raw,PCMISS --add perf/cpu_atom/cache-misses,cpu,delta,raw,ECMISS sleep .1
407+
turbostat: added_perf_counters_init_: perf/cpu_atom/cache-misses: failed to open counter on cpu0
408+
turbostat: added_perf_counters_init_: perf/cpu_core/cache-misses: failed to open counter on cpu12
409+
0.104630 sec
410+
CPU ECMISS PCMISS VCMISS
411+
- 0x0000000000000000 0x0000000000000000 0x0000000000000000
412+
0 0x0000000000000000 0x0000000000007951 0x0000000000007796
413+
12 0x000000000001137a 0x0000000000000000 0x0000000000011392
414+
415+
.fi
416+
390417
.SH ADD PMT COUNTER EXAMPLE
391418
Here we limit turbostat to showing just the CPU number 0.
392419
We add two counters, showing crystal clock count and the DC6 residency.

0 commit comments

Comments
 (0)