150 lines
5.0 KiB
Plaintext
150 lines
5.0 KiB
Plaintext
Demonstrations of drsnoop, the Linux eBPF/bcc version.
|
||
|
||
|
||
drsnoop traces the direct reclaim system-wide, and prints various details.
|
||
Example output:
|
||
|
||
# ./drsnoop
|
||
COMM PID LAT(ms) PAGES
|
||
summond 17678 0.19 143
|
||
summond 17669 0.55 313
|
||
summond 17669 0.15 145
|
||
summond 17669 0.27 237
|
||
summond 17669 0.48 111
|
||
summond 17669 0.16 75
|
||
head 17821 0.29 339
|
||
head 17825 0.17 109
|
||
summond 17669 0.14 73
|
||
summond 17496 104.84 40
|
||
summond 17678 0.32 167
|
||
summond 17678 0.14 106
|
||
summond 17678 0.16 67
|
||
summond 17678 0.29 267
|
||
summond 17678 0.27 69
|
||
summond 17678 0.32 46
|
||
base64 17816 0.16 85
|
||
summond 17678 0.43 283
|
||
summond 17678 0.14 182
|
||
head 17736 0.57 135
|
||
^C
|
||
|
||
While tracing, the processes alloc pages,due to insufficient memory available
|
||
in the system, direct reclaim events happened, which will increase the waiting
|
||
delay of the processes.
|
||
|
||
drsnoop can be useful for discovering when allocstall(/proc/vmstat) continues to increase,
|
||
whether it is caused by some critical processes or not.
|
||
|
||
The -p option can be used to filter on a PID, which is filtered in-kernel. Here
|
||
I've used it with -T to print timestamps:
|
||
|
||
# ./drsnoop -Tp 17491
|
||
TIME(s) COMM PID LAT(ms) PAGES
|
||
107.364115000 summond 17491 0.24 50
|
||
107.364550000 summond 17491 0.26 38
|
||
107.365266000 summond 17491 0.36 72
|
||
107.365753000 summond 17491 0.22 49
|
||
^C
|
||
|
||
This shows the summond process allocs pages, and direct reclaim events happening,
|
||
and the delays are not affected much.
|
||
|
||
The -U option include UID on output:
|
||
|
||
# ./drsnoop -U
|
||
UID COMM PID LAT(ms) PAGES
|
||
1000 summond 17678 0.32 46
|
||
0 base64 17816 0.16 85
|
||
1000 summond 17678 0.43 283
|
||
1000 summond 17678 0.14 182
|
||
0 head 17821 0.29 339
|
||
0 head 17825 0.17 109
|
||
^C
|
||
|
||
The -u option filtering UID:
|
||
|
||
# ./drsnoop -Uu 1000
|
||
UID COMM PID LAT(ms) PAGES
|
||
1000 summond 17678 0.19 143
|
||
1000 summond 17669 0.55 313
|
||
1000 summond 17669 0.15 145
|
||
1000 summond 17669 0.27 237
|
||
1000 summond 17669 0.48 111
|
||
1000 summond 17669 0.16 75
|
||
1000 summond 17669 0.14 73
|
||
1000 summond 17678 0.32 167
|
||
^C
|
||
|
||
A maximum tracing duration can be set with the -d option. For example, to trace
|
||
for 2 seconds:
|
||
|
||
# ./drsnoop -d 2
|
||
COMM PID LAT(ms) PAGES
|
||
head 21715 0.15 195
|
||
|
||
The -n option can be used to filter on process name using partial matches:
|
||
|
||
# ./drsnoop -n mond
|
||
COMM PID LAT(ms) PAGES
|
||
summond 10271 0.03 51
|
||
summond 10271 0.03 51
|
||
summond 10259 0.05 51
|
||
summond 10269 319.41 37
|
||
summond 10270 111.73 35
|
||
summond 10270 0.11 78
|
||
summond 10270 0.12 71
|
||
summond 10270 0.03 35
|
||
summond 10277 111.62 41
|
||
summond 10277 0.08 45
|
||
summond 10277 0.06 32
|
||
^C
|
||
|
||
This caught the 'summond' command because it partially matches 'mond' that's passed
|
||
to the '-n' option.
|
||
|
||
|
||
The -v option can be used to show system memory state (now only free mem) at
|
||
the beginning of direct reclaiming:
|
||
|
||
# ./drsnoop.py -v
|
||
COMM PID LAT(ms) PAGES FREE(KB)
|
||
base64 34924 0.23 151 86260
|
||
base64 34962 0.26 149 86260
|
||
head 34931 0.24 150 86260
|
||
base64 34902 0.19 148 86260
|
||
head 34963 0.19 151 86228
|
||
base64 34959 0.17 151 86228
|
||
head 34965 0.29 190 86228
|
||
base64 34957 0.24 152 86228
|
||
summond 34870 0.15 151 86080
|
||
summond 34870 0.12 115 86184
|
||
|
||
USAGE message:
|
||
|
||
# ./drsnoop -h
|
||
usage: drsnoop.py [-h] [-T] [-U] [-p PID] [-t TID] [-u UID] [-d DURATION]
|
||
[-n NAME]
|
||
|
||
Trace direct reclaim
|
||
|
||
optional arguments:
|
||
-h, --help show this help message and exit
|
||
-T, --timestamp include timestamp on output
|
||
-U, --print-uid print UID column
|
||
-p PID, --pid PID trace this PID only
|
||
-t TID, --tid TID trace this TID only
|
||
-u UID, --uid UID trace this UID only
|
||
-d DURATION, --duration DURATION
|
||
total duration of trace in seconds
|
||
-n NAME, --name NAME only print process names containing this name
|
||
|
||
examples:
|
||
./drsnoop # trace all direct reclaim
|
||
./drsnoop -T # include timestamps
|
||
./drsnoop -U # include UID
|
||
./drsnoop -p 181 # only trace PID 181
|
||
./drsnoop -t 123 # only trace TID 123
|
||
./drsnoop -u 1000 # only trace UID 1000
|
||
./drsnoop -d 10 # trace for 10 seconds only
|
||
./drsnoop -n main # only print process names containing "main"
|