【金沙官网线上】pt-ioprofile

# pt-ioprofile --cell=sizes
Sun Oct 30 11:27:10 CST 2016
Tracing process ID 1683
     total      pread       read     pwrite      write      fsync       open      close      lseek      fcntl filename
  30425088          0          0   30425088          0          0          0          0          0          0 /test1/sbtest3.ibd
  30425088          0          0   30425088          0          0          0          0          0          0 /test1/sbtest2.ibd
  15745024          0          0   15745024          0          0          0          0          0          0 /test1/sbtest4.ibd
   4194304    2097152          0    2097152          0          0          0          0          0          0 /test1/
     96928       1156       1130        236      18434          0          0          0      75972          0 /test1/#sql-693_164.frm
     92006       1734       1653         97       9217          0          0          0      79305          0 /test1/sbtest3.frm
     92006       1734       1653         97       9217          0          0          0      79305          0 /test1/sbtest2.frm
     70031       1156       1088         97       9217          0          0          0      58473          0 /test1/sbtest4.frm
     21975        578        565          0          0          0          0          0      20832          0 /test1/sbtest1.frm
         0          0          0          0          0          0          0          0          0          0 /tmp/ibyD2qvV
         0          0          0          0          0          0          0          0          0          0 /tmp/ibyc3Nz5
         0          0          0          0          0          0          0          0          0          0 /tmp/ibJI3Ky8
         0          0          0          0          0          0          0          0          0          0 /tmp/ibbLjhsI
         0          0          0          0          0          0          0          0          0          0 /tmp/ib8QdnAm
         0          0          0          0          0          0          0          0          0          0 /tmp/ib7oksHs

--cell=sizes

lseek:对文件描述符指定文件的读写指针进行设置,也就是说,它可以设置文件的下一个读写位置。

个人感觉avg没太大意义。

源码如下:

short form: -c; type: string; default: times
The cell contents.
Valid values are:
VALUE CELLS CONTAIN
===== =======================
count Count of I/O operations
sizes Sizes of I/O operations
times I/O operation timing

指定线程的ID

 

如下所示:

--save-samples

 

short form: -g; type: string; default: filename
The group-by item.
Valid values are:
VALUE GROUPING
===== ======================================
all Summarize into a single line of output
filename One line of output per filename
pid One line of output per process ID
# pt-ioprofile 
Sun Oct 30 11:08:25 CST 2016
Tracing process ID 1683
     total      pread       read     pwrite      write      fsync       open      close      lseek      fcntl filename
  4.591996   0.000000   0.000000   0.088374   0.000000   4.503148   0.000219   0.000070   0.000000   0.000185 /test/sbtest5.ibd
  2.720128   0.000000   0.000000   0.096999   0.000000   2.622662   0.000254   0.000053   0.000000   0.000160 /test/sbtest6.ibd
  0.009918   0.000193   0.000515   0.000235   0.002752   0.004530   0.000491   0.000182   0.001020   0.000000 /test/#sql-693_153.frm
  0.007725   0.000917   0.000000   0.002182   0.000000   0.003766   0.000285   0.000575   0.000000   0.000000 /test/
  0.004819   0.000162   0.000587   0.000097   0.001044   0.001735   0.000349   0.000157   0.000688   0.000000 /test/sbtest6.frm
  0.004201   0.000104   0.000309   0.000090   0.001048   0.001900   0.000265   0.000152   0.000333   0.000000 /test/sbtest5.frm
  0.000661   0.000092   0.000209   0.000000   0.000000   0.000000   0.000105   0.000091   0.000164   0.000000 /tmp/ibUARemw
  0.000643   0.000075   0.000254   0.000000   0.000000   0.000000   0.000128   0.000044   0.000142   0.000000 /test/sbtest4.frm
  0.000228   0.000000   0.000000   0.000000   0.000000   0.000000   0.000143   0.000085   0.000000   0.000000 /tmp/ib2bbcaJ
  0.000221   0.000000   0.000000   0.000000   0.000000   0.000000   0.000170   0.000051   0.000000   0.000000 /tmp/ibp3wGXR
  0.000219   0.000000   0.000000   0.000000   0.000000   0.000000   0.000168   0.000051   0.000000   0.000000 /tmp/ibEDs8K7

fsync:确保文件所有已修改的内容已经正确同步到硬盘上,该调用会阻塞等待直到设备报告IO完成。

--help

--run-time

--cell=times

看看各参数的作用:

pt-ioprofile是用来观察特定进程的IO信息的。

pwrite:由于lseek和write调用之间,内核可能会临时挂起进程,所以对同步问题造成了问题,调用pwrite相当于顺序调用了lseek 和write,这两个操作相当于一个捆绑的原子操作。

 

# pt-ioprofile

该脚本是用shell写的,有两方面的作用:

分组的单位,默认是filename,即对文件名进行统计,也可指定为all,即对所有操作进行统计,pid,对进程进行统计(不太确定这里的进程指的是什么?怀疑是线程,但线程不可能这么少,具体可参考下面的输出)

3. http://blog.csdn.net/cywosp/article/details/8767327

pt-ioprofile does two things: 1) get lsof+strace for -s seconds, 2) aggregate the result. If you specify a FILE, then step 1) is not performed.

 

因实际依赖的是strace命令,该命令会对进程的性能造成较大的影响。所以,在负载较重的系统中,慎用。

 

--group-by

统计的方式,默认是sum,也可指定为avg。

write:把缓冲区的数据写入文件中。

short form: -a; type: string; default: sum

The aggregate function, either sum or avg.

If sum, then each cell will contain the sum of the values in it. If avg, then each cell will contain the average of the values in it.

read:从文件中读出数据。要读取的文件用文件描述符标识,数据读入一个事先定义好的缓冲区。

fcntl:针对(文件)描述符提供控制。

--profile-pid

type: string

Filename to save samples in; these can be used for later analysis.
Sun Oct 30 11:08:25 CST 2016
Tracing process ID 1683
     total      pread       read     pwrite      write      fsync       open      close      lseek      fcntl filename
  4.591996   0.000000   0.000000   0.088374   0.000000   4.503148   0.000219   0.000070   0.000000   0.000185 /test/sbtest5.ibd
  2.720128   0.000000   0.000000   0.096999   0.000000   2.622662   0.000254   0.000053   0.000000   0.000160 /test/sbtest6.ibd
  0.009918   0.000193   0.000515   0.000235   0.002752   0.004530   0.000491   0.000182   0.001020   0.000000 /test/#sql-693_153.frm
  0.007725   0.000917   0.000000   0.002182   0.000000   0.003766   0.000285   0.000575   0.000000   0.000000 /test/
  0.004819   0.000162   0.000587   0.000097   0.001044   0.001735   0.000349   0.000157   0.000688   0.000000 /test/sbtest6.frm
  0.004201   0.000104   0.000309   0.000090   0.001048   0.001900   0.000265   0.000152   0.000333   0.000000 /test/sbtest5.frm
  0.000661   0.000092   0.000209   0.000000   0.000000   0.000000   0.000105   0.000091   0.000164   0.000000 /tmp/ibUARemw
  0.000643   0.000075   0.000254   0.000000   0.000000   0.000000   0.000128   0.000044   0.000142   0.000000 /test/sbtest4.frm
  0.000228   0.000000   0.000000   0.000000   0.000000   0.000000   0.000143   0.000085   0.000000   0.000000 /tmp/ib2bbcaJ
  0.000221   0.000000   0.000000   0.000000   0.000000   0.000000   0.000170   0.000051   0.000000   0.000000 /tmp/ibp3wGXR
  0.000219   0.000000   0.000000   0.000000   0.000000   0.000000   0.000168   0.000051   0.000000   0.000000 /tmp/ibEDs8K7

执行strace命令的时间,OPT_RUN_TIME就是--run-time指定的值。

# pt-ioprofile --group-by=all
Sun Oct 30 11:28:48 CST 2016
Tracing process ID 1683
  8.044078 TOTAL
  7.781329 fsync
  0.245823 pwrite
  0.004445 write
  0.003389 read
  0.002998 open
  0.002324 lseek
  0.001886 close
  0.001679 pread
  0.000205 fcntl

将strace和lsof获取的结果保存到指定的文件中

# pt-ioprofile --cell=count
Sun Oct 30 11:27:50 CST 2016
Tracing process ID 1683
     total      pread       read     pwrite      write      fsync       open      close      lseek      fcntl filename
       147          0          0         61          0         81          2          1          0          2 /test1/sbtest5.ibd
       146          0          0         61          0         80          2          1          0          2 /test1/sbtest6.ibd
       123          0          0         57          0         61          2          1          0          2 /test1/sbtest7.ibd
        86          4         10          4         42          2          4          4         16          0 /test1/#sql-693_164.frm
        57          4         10          2         21          1          4          3         12          0 /test1/sbtest6.frm
        57          4         10          2         21          1          4          3         12          0 /test1/sbtest5.frm
        56          4         10          2         21          1          3          3         12          0 /test1/sbtest7.frm
        21          2          0          2          0          5          5          7          0          0 /test1/
        14          2          5          0          0          0          1          2          4          0 /tmp/ibtLXAL6
        14          2          5          0          0          0          1          2          4          0 /tmp/ibDei5Io
         2          0          0          0          0          0          1          1          0          0 /tmp/ib2WdhKK
         2          0          0          0          0          0          1          1          0          0 /tmp/ib027CfV
         1          0          0          0          0          0          1          0          0          0 /test1/sbtest4.frm

使用方法:

2. http://www.cnblogs.com/yanlingyin/archive/2012/08/04/2617209.html

--profile-process

close:close系统调用用于“关闭”一个文件,close调用终止一个文件描述符以其文件之间的关联。文件描述符被释放,并能够重新使用。

 

其中,

--group-by=filename

 

open:打开一个文件,并返回这个文件的描述符。

--version

--aggregate

 

直接执行pt-ioprofile即可,默认的是mysqld进程。

--cell

short form: -p; type: int

The PID to profile, overrides --profile-process.

 

short form: -b; type: string; default: mysqld

The process name to profile.
Print help and exit.
# pt-ioprofile 
Sun Oct 30 11:08:25 CST 2016
Tracing process ID 1683
     total      pread       read     pwrite      write      fsync       open      close      lseek      fcntl filename
  4.591996   0.000000   0.000000   0.088374   0.000000   4.503148   0.000219   0.000070   0.000000   0.000185 /test/sbtest5.ibd
  2.720128   0.000000   0.000000   0.096999   0.000000   2.622662   0.000254   0.000053   0.000000   0.000160 /test/sbtest6.ibd
  0.009918   0.000193   0.000515   0.000235   0.002752   0.004530   0.000491   0.000182   0.001020   0.000000 /test/#sql-693_153.frm
  0.007725   0.000917   0.000000   0.002182   0.000000   0.003766   0.000285   0.000575   0.000000   0.000000 /test/
  0.004819   0.000162   0.000587   0.000097   0.001044   0.001735   0.000349   0.000157   0.000688   0.000000 /test/sbtest6.frm
  0.004201   0.000104   0.000309   0.000090   0.001048   0.001900   0.000265   0.000152   0.000333   0.000000 /test/sbtest5.frm
  0.000661   0.000092   0.000209   0.000000   0.000000   0.000000   0.000105   0.000091   0.000164   0.000000 /tmp/ibUARemw
  0.000643   0.000075   0.000254   0.000000   0.000000   0.000000   0.000128   0.000044   0.000142   0.000000 /test/sbtest4.frm
  0.000228   0.000000   0.000000   0.000000   0.000000   0.000000   0.000143   0.000085   0.000000   0.000000 /tmp/ib2bbcaJ
  0.000221   0.000000   0.000000   0.000000   0.000000   0.000000   0.000170   0.000051   0.000000   0.000000 /tmp/ibp3wGXR
  0.000219   0.000000   0.000000   0.000000   0.000000   0.000000   0.000168   0.000051   0.000000   0.000000 /tmp/ibEDs8K7
# pt-ioprofile --group-by=pid
Sun Oct 30 11:32:29 CST 2016
Tracing process ID 1683
     total      pread       read     pwrite      write      fsync       open      close      lseek      fcntl pid
  5.468350   0.002442   0.004444   0.111051   0.007580   5.320658   0.002614   0.003022   0.016232   0.000307 2055
  1.093664   0.000000   0.000000   0.000000   0.000000   1.093664   0.000000   0.000000   0.000000   0.000000 1735
  0.997042   0.000000   0.000000   0.000000   0.000000   0.997042   0.000000   0.000000   0.000000   0.000000 1738
  0.940387   0.000000   0.000000   0.000000   0.000000   0.940387   0.000000   0.000000   0.000000   0.000000 1736
  0.507274   0.000000   0.000000   0.000000   0.000000   0.507274   0.000000   0.000000   0.000000   0.000000 1737

 

--cell=count

pread:由于lseek和read调用之间,内核可能会临时挂起进程,所以对同步问题造成了问题,调用pread相当于顺序调用了lseek和read,这两个操作相当于一个捆绑的原子操作。

 

 if [ "$proc_pid" ]; then
            echo "Tracing process ID $proc_pid"

            _lsof "$proc_pid" > "$samples" 2>&1
            if [ "$?" -ne "0" ]; then
               echo "Error: could not execute lsof, error code $?"
               EXIT_STATUS=1
               return 1
            fi

            strace -T -s 0 -f -p $proc_pid >> "$samples" 2>&1 &
            if [ "$?" -ne "0" ]; then
               echo "Error: could not execute strace, error code $?"
               EXIT_STATUS=1
               return 1
            fi
            strace_pid=$!
            # sleep one second then check to make sure the strace is
            # actually running
            sleep 1
            ps -p $strace_pid > /dev/null 2>&1
            if [ "$?" -ne "0" ]; then
               echo "Cannot find strace process" >&2
               tail "$samples" >&2
               EXIT_STATUS=1
               return 1
            fi
            # sleep for interval -1, since we did a one second sleep
            # before checking for the PID of strace
            if [ $((${OPT_RUN_TIME}-1)) -gt 0 ]; then
                sleep $((${OPT_RUN_TIME}-1))
            fi

            kill -s 2 $strace_pid

            sleep 1
            kill -s 15 $strace_pid 2>/dev/null

            # Sometimes strace leaves threads/processes in T status.
            kill -s 18 $proc_pid

            # Summarize the output we just generated.
            tabulate_strace "$samples" > $PT_TMPDIR/tabulated_samples
Print the tool’s version and exit.

显示的单位,默认是times,即IO操作的时间,也可指定为count(O操作的次数),size(IO操作的大小)

type: int; default: 30

How long to profile.

 

1. http://blog.csdn.net/zongcai249/article/details/17598411

指定线程名

--group-by=all

--group-by=pid

参考

 

 

本文由金沙官网线上发布于数据库,转载请注明出处:【金沙官网线上】pt-ioprofile

您可能还会对下面的文章感兴趣: