SC2021 AD/AE
We use two hardware configurations, one for the micro-benchmark and one for the macro-benchmark evaluation.
- Micro-benchmark evaluation
- CPU: Intel Xeon E5-2650 (hyperthreading off)
- Memory: 64 GB
- NIC: 40 GbE (connected to a remote machine equipped with two hexa-core Intel Xeon E5-2630 processors and 64 GB of memory)
- Macro-benchmark evaluation
- CPU: Intel Xeon (hyperthreading off)
- Memory: 256 GB
- NIC: 10 GbE
[Caveats of our work]
- Our evaluation requires multiple host servers: at least two for the micro-benchmark and five for the macro-benchmark evaluation. The host servers should also be connected via 40 GbE (micro-benchmark) or 10 GbE (macro-benchmark) links.
- If the above hardware configuration is not available, performance may degrade.
- Kernel compile and reboot
cd ~
wget https://mirrors.edge.kernel.org/pub/linux/kernel/v5.x/linux-5.4.58.tar.gz
tar xzvf linux-5.4.58.tar.gz
git clone https://github.com/may21/autothrottle.git
cd autothrottle
cp -rf ./bridge/* ~/linux-5.4.58/net/bridge/
cp -rf ./sched/* ~/linux-5.4.58/kernel/sched/
cd ~/linux-5.4.58
cp /boot/config-$(uname -r) ./.config
*Note that CONFIG_BRIDGE_IGMP_SNOOPING should not be set (e.g., comment out the CONFIG_BRIDGE_IGMP_SNOOPING line in .config).
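*One way to do this from the kernel source directory is the bundled scripts/config helper (a sketch; editing .config by hand works equally well):
scripts/config --disable BRIDGE_IGMP_SNOOPING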
make olddefconfig
N=`getconf _NPROCESSORS_ONLN`
make -j$N
make modules_install install
reboot
- Module installation
cd ~/autothrottle
./install.sh
- Activate RFS (Receive Flow Steering) and disable irqbalance to eliminate bottlenecks in the network interface and interrupt processing ($INTERFACE indicates the network interface used for this experiment; a reference sketch of typical RFS settings follows these commands).
cd ~/autothrottle/script
N=`getconf _NPROCESSORS_ONLN`
sh rfs_set.sh $N $INTERFACE
service irqbalance stop
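*For reference, rfs_set.sh configures RFS; a typical manual configuration looks like the following (a sketch only; the flow-count values are illustrative and the repository script is authoritative):
echo 32768 | sudo tee /proc/sys/net/core/rps_sock_flow_entries
for q in /sys/class/net/$INTERFACE/queues/rx-*/rps_flow_cnt; do echo 2048 | sudo tee $q; done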
- Setting up a container environment
- Autothrottle targets containers to achieve predictable network performance, so this evaluation requires a Docker-based container environment. To install Docker, please refer to https://docs.docker.com/get-docker/
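*On Ubuntu hosts, Docker's convenience script is one option (a sketch; a package-based installation per the link above works equally well):
curl -fsSL https://get.docker.com -o get-docker.sh
sudo sh get-docker.sh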
- Create containers for evaluation
- Run containers on the target machine. The number of containers should be double the number of CPU cores. Each container is based on the latest Ubuntu image, and the Netperf benchmark is installed inside each container (an illustrative sketch of the underlying docker run command follows the commands below).
cd ~/autothrottle/script
N=`getconf _NPROCESSORS_ONLN`
sh run_container.sh $N
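*For reference, each container launched by run_container.sh is conceptually similar to the following (a sketch only; the container names and the Netperf installation are handled by the script):
for i in $(seq 1 $((2 * N))); do
  sudo docker run -d --name netperf$i ubuntu:latest sleep infinity
done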
- Assign the same bandwidth requirement ($GOAL) to all containers. ($GOAL is given in Mbps; for example, to configure 200 Mbps per container, set $GOAL to 200.)
cd ~/autothrottle/script
N=`getconf _NPROCESSORS_ONLN`
./fair_perf_set.sh $N $GOAL
- Execute Netserver on the remote server with IP address $IP, where $CN indicates the number of containers running on the target machine
./netserver $CN $IP
- Run the Netperf benchmark in all containers. $M indicates the message size (a sketch of the underlying netperf invocation follows the command below).
sh run_netperf.sh $N $M $IP
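*For reference, the per-container measurement is conceptually a TCP_STREAM test such as the following (a sketch; the container name, test length, and exact options used by run_netperf.sh may differ):
sudo docker exec netperf1 netperf -H $IP -l 60 -t TCP_STREAM -- -m $M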
- Before starting the Linux tc evaluation, stop the containers used for the Autothrottle evaluation, unload the module (vif.ko), and then restart the containers.
cd ~/autothrottle/script
N=`getconf _NPROCESSORS_ONLN`
./restart_container.sh $N stop
rmmod vif.ko
./restart_container.sh $N start
- Install tcconfig (https://github.com/thombashi/tcconfig#tcconfig) to utilize Linux tc for containers
sudo pip install tcconfig
- Configure the same network bandwidth ($GOAL, in Mbps) for the containers using tcset (a sketch of the underlying tcset command follows the commands below).
cd ~/autothrottle/script
N=`getconf _NPROCESSORS_ONLN`
./tcset.sh $N $GOAL
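*For reference, tcconfig's basic form applies a rate limit to a network device as follows (a sketch; tcset.sh applies the equivalent per-container limits):
sudo tcset eth0 --rate ${GOAL}Mbps --direction outgoing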
- Execute Netserver on the remote server with IP address $IP, where $CN indicates the number of containers running on the target machine
./netserver $CN $IP
- Run the Netperf benchmark in all containers, in the same way as in the Autothrottle evaluation
./run_netperf.sh $N $M $IP
- Create Memcached containers, each with a name $NAME (repeat the following for every Memcached container)
sudo docker run -it --name $NAME ubuntu:16.04 /bin/bash
This attaches to a shell inside the Memcached container. Install Memcached inside the container (please refer to https://github.com/memcached/memcached/wiki/Install, https://www.memcached.org/downloads, and https://github.com/memcached/memcached/wiki/ReleaseNotes1414 for more information)
cd ~
apt-get update
apt-get install -y build-essential git wget vim libevent-dev
wget http://memcached.org/files/old/memcached-1.4.14.tar.gz
tar vxf memcached-1.4.14.tar.gz
cd memcached-1.4.14
./configure --prefix=/usr/local/memcached
make
make test
make install
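*After installation, memcached can be started inside the container, for example as follows (a sketch; the cache size is illustrative, and port 11211 matches the memaslap command used later):
/usr/local/memcached/bin/memcached -d -u root -m 1024 -p 11211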
- Install memaslap on the remote machine (please refer to https://github.com/pgaref/memcached_bench for more information; note that the remote machine should run Ubuntu 16.04)
cd ~
apt-get update
apt-get install -y build-essential git wget vim libevent-dev
wget http://memcached.org/files/old/memcached-1.4.14.tar.gz
tar vxf memcached-1.4.14.tar.gz
cd memcached-1.4.14
./configure --prefix=/usr/local/memcached
make
make test
make install
cd ~
git clone https://github.com/pgaref/memcached_bench.git
cd ~/memcached_bench/libmemcached-1.0.15/
./configure --enable-memaslap
make
vim Makefile
*Here, add "-lpthread -lm" to the 'LIBS =' line
apt-get install -y automake autoconf
make
make install
- Add routing table information in the remote machine to forward packets to the Memcached containers on the target machine. $NET is the subnet of the containers such as 172.17.0.0/16 while $IFACE indicates the network interface connected to the target machine
sudo route add -net $NET dev $IFACE
- Execute memaslap on the remote machine. $IP1, $IP2, ... indicate the IP addresses of the Memcached containers on the target machine
~/memcached_bench/libmemcached-1.0.15/clients/memaslap -B -c 512 -s $IP1:11211,$IP2:11211,.... -t 60s
- This evaluation requires five servers connected via a 10 GbE switch. Among the five servers, four act as master servers while the remaining one is the target machine.
a. Network setup (On every host machine)
- Create a local docker bridge network "dockersparkterasort_br-n-spark", where $NET is the first three octets (24 bits) of the subnet address (e.g., 172.35.0 for 172.35.0.0/16); each machine should have a different subnet address. (Please refer to https://docs.docker.com/engine/reference/commandline/network_create/ for more information. A sketch of the underlying command follows the commands below.)
cd ~/autothrottle/script
chmod +x ./create_network.sh
./create_network.sh $NET
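*For reference, create_network.sh conceptually performs a bridge-network creation such as the following (a sketch; the subnet prefix length should match the script and your addressing plan):
sudo docker network create --driver bridge --subnet ${NET}.0/24 dockersparkterasort_br-n-spark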
- Add routing table entries to forward packets to the other machines, where $IF indicates the name of the network interface connected to the 10 GbE switch. (On the master servers, PS_IPADDR and PS_HOST_IPADDR are the IP addresses of the container bridge network and the 10 GbE network interface on the target machine. On the target machine, this should be repeated four times with the PS_IPADDR and PS_HOST_IPADDR of each respective master server. A sketch of the resulting route command follows route.sh below.)
vim ./route.sh
*Here, set PS_IPADDR to the public IP address of the container bridge network interface in the remote server. Set PS_HOST_IPADDR to the public IP address of the remote server.
chmod +x ./route.sh
./route.sh $IF
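*For reference, the routing entries added by route.sh are conceptually of the following form (a sketch; the netmask is illustrative):
sudo route add -net $PS_IPADDR netmask 255.255.255.0 gw $PS_HOST_IPADDR dev $IF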
b. Data setup (On four master servers)
- Create folders for input/output data.
cd ~/autothrottle/spark/apps
mkdir data
cd ./data
mkdir terasort_in terasort_out
- Prepare the input data in the folder terasort_in by following the instructions at https://github.com/ehiggs/spark-terasort (see the sketch below).
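*The spark-terasort README generates input data with TeraGen via spark-submit, conceptually as follows (a sketch; the jar name, data size, and output path are illustrative and depend on your build):
spark-submit --class com.github.ehiggs.spark.terasort.TeraGen \
  spark-terasort-1.2-SNAPSHOT-jar-with-dependencies.jar 10g file://$HOME/autothrottle/spark/apps/data/terasort_in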
c. Container setup
- Spark master container configuration (On four master servers)
cd ~/autothrottle/spark/docker/spark-submit
vim ./spark-submit.sh
*Here, set $IP to the IP address of the master server
cd ~/autothrottle/spark/docker/spark-master
vim ./start-master.sh
*Here, set SPARK_MASTER_HOST to the IP address of the master server
- Build Docker container images (On every host server)
cd ~/autothrottle/spark
chmod +x ./build-images.sh
./build-images.sh
- Configure master and slave container settings (On four master servers)
vim ./master.yml
*Here, set each spark-worker (underneath extra_hosts) to the local IP address of the corresponding worker container. Set SPARK_PUBLIC_DNS (underneath environment) to the public IP address of the target machine.
vim ./slave.yml
*Here, set spark-master (underneath extra_hosts) to the public IP address of the corresponding master server. Note that two yml files (e.g., slaves1.yml, slaves2.yml) are necessary to create two slave containers in the master server.
- Configure containers in the target machine: We create eight containers (i.e., spark slaves) in the target machine and every two slaves belong to the same master (e.g., s1 and s2 belong to m1 while s3 and s4 belong to m2 when m1 and m2 run on different master servers.)
- So, we need to create eight slave yml files (e.g., s1.yml, s2.yml, ...) as on the master servers. Each yml file should include the spark-master and spark-worker entries with the IP address of the corresponding master server.
d. Running Spark
- Deploy and run master and slave containers (On four master servers).
cd ~/autothrottle/spark
chmod +x ./start_master.sh
./start_master.sh
- Deploy eight slave containers in the target machine.
chmod +x ./start_slave.sh
./start_slave.sh
- Submit workload to cluster (On four master servers).
vim submit.sh
*Here, set SPARK_DRIVER_HOST to the IP address of the master server.
chmod +x ./submit.sh
./submit.sh