stap script.1
Next, we enable RPS on the NIC:
echo ff > /sys/class/net/vtap1/queues/rx-0/rps_cpus
This spreads the traffic across cpu0-cpu7. Now let's look at the CPU load:
cpu 0: 885104pps 10408823040bps
cpu 5: 17pps 7752bps
cpu 7: 13pps 9752bps
cpu 10: 10pps 4448bps
cpu 9: 9pps 4176bps
cpu 6: 9pps 3768bps
cpu 8: 9pps 4192bps
cpu 1: 5pps 18072bps
cpu 3: 5pps 3472bps
cpu 11: 2pps 1008bps
cpu 13: 2pps 936bps
rps cpu 3 149839pps
rps cpu 7 121979pps
rps cpu 0 114993pps
rps cpu 6 111514pps
rps cpu 5 108040pps
rps cpu 1 101064pps
rps cpu 2 101062pps
rps cpu 4 76662pps
rps cpu 10 10pps
rps cpu 9 9pps
rps cpu 8 9pps
rps cpu 11 2pps
rps cpu 13 2pps
As we can see, at the enqueue stage the traffic is distributed across cpu0-cpu7, which relieves the load on cpu0.
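The link between the `ff` mask and the observed distribution can be checked with a small script. The following is only a minimal sketch, not part of the original measurement: the helper names (`cpus_in_mask`, `parse_rps_pps`, `share_on_mask`) are hypothetical, the sample text is copied from the `rps cpu` lines above, and the comma handling assumes the kernel's usual grouping of large masks into 32-bit chunks.

```python
import re

# Sample "rps cpu" lines copied from the SystemTap output above.
RPS_OUTPUT = """\
rps cpu 3 149839pps
rps cpu 7 121979pps
rps cpu 0 114993pps
rps cpu 6 111514pps
rps cpu 5 108040pps
rps cpu 1 101064pps
rps cpu 2 101062pps
rps cpu 4 76662pps
rps cpu 10 10pps
rps cpu 9 9pps
rps cpu 8 9pps
rps cpu 11 2pps
rps cpu 13 2pps
"""

LINE_RE = re.compile(r"^rps cpu (\d+) (\d+)pps$")


def cpus_in_mask(mask_hex):
    """Return the set of CPU ids whose bit is set in an rps_cpus hex mask."""
    # Assumption: large masks may be split into comma-separated groups; drop the commas.
    mask = int(mask_hex.replace(",", ""), 16)
    return {cpu for cpu in range(mask.bit_length()) if (mask >> cpu) & 1}


def parse_rps_pps(text):
    """Map CPU id -> packets per second for every 'rps cpu N Xpps' line."""
    rates = {}
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if match:
            rates[int(match.group(1))] = int(match.group(2))
    return rates


def share_on_mask(rates, mask_hex):
    """Fraction of enqueued packets that landed on CPUs selected by the mask."""
    selected = cpus_in_mask(mask_hex)
    total = sum(rates.values())
    on_mask = sum(pps for cpu, pps in rates.items() if cpu in selected)
    return on_mask / total if total else 0.0


if __name__ == "__main__":
    rates = parse_rps_pps(RPS_OUTPUT)
    print("CPUs selected by mask ff:", sorted(cpus_in_mask("ff")))
    print(f"share of enqueued packets on those CPUs: {share_on_mask(rates, 'ff'):.4%}")
```

With the sample above, nearly all enqueued packets land on the eight CPUs selected by `ff`; the few packets on cpu8 and above are background traffic already visible in the first half of the output.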
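Unfortunately, SystemTap itself introduces significant overhead, so its output cannot be used to quantify the real performance of the system. In practice, the benefit of RPS can usually only be measured through an application workload such as L3 forwarding. The performance gain from RPS is analyzed in detail next.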