Deep Learning Benchmarking Suite

Deep Learning Benchmarking Suite (DLBS) is a collection of tools for providing consistent and reproducible benchmark experiments on various hardware/software combinations. In particular, DLBS provides the following functionality:

Implements internally various deep models. Our goal is to provide same model implementations for all supported frameworks. Deep models that are supported include various VGGs, ResNets, AlexNet, GoogleNet and others.
Benchmarks single node CPU/multi-GPU configurations. Frameworks that are now supported: BVLC/NVIDIA/Intel Caffe, Caffe2, TensorFlow, MXNet and TensorRT. Due to rapid development progress of these frameworks, we fix framework versions to particular commit that we have tested.
Supports inference and training phases.
Benchmarking tools can use real data if dataset is available. Else, falls back to synthetic data.
Supports bare metal and docker environments.

Supported platforms

Deep Learning Benchmarking Suite was tested on various servers with Ubuntu / RedHat / CentOS operating systems with/without NVIDIA GPUs. It may not work with Mac OS due to slightly different command line API of some of the tools we use (like, for instance, sed) - we will fix this in one of the next releases.

Installation

Install Docker and NVIDIA Docker for containerized benchmarks. Read here why we prefer to use docker and here for installing/troubleshooting tips. This is not required. DLBS can work with bare metal framework installations.

Clone Deep Learning Benchmarking Suite from GitHub

git clone https://github.com/HewlettPackard/dlcookbook-dlbs dlbs

Build/pull docker images for containerized benchmarks or build/install host frameworks for bare metal benchmarks.
1. TensorFlow
2. BVLC Caffe
3. NVIDIA Caffe
4. Intel Caffe
5. Caffe2
6. MXNet
7. TensorRT
There are several ways to get Docker images. Read here about various options.

Quick start

Assuming TensorFlow is installed and CUDA enabled GPU is present, execute the following commands to run simple experiment with ResNet50 model (if you do not have GPUs, see below):

# Go to DLBS home folder
cd dlbs
# Build TensorFlow image that's set as default in standard configuration files.
# Alternatively, you can skip this step and use your own image or pull image from NVIDIA GPU Cloud
cd ./docker
./build tensorflow/cuda9-cudnn7
cd ..
# Setup python paths
export PYTHONPATH=$(pwd)/python:$PYTHONPATH
# Run experiment. It will run containerized GPU TensorFlow with default image 'hpe/tensorflow:cuda9-cudnn7'
# If you want to use your own image, add this argument: -Ptensorflow.docker_image='"YOUR_DOCKER_IMAGE_NAME"'
python ./python/dlbs/experimenter.py run -Pexp.framework='"tensorflow"' -Pexp.model='"resnet50"' -Pexp.gpus='"0"' -Pexp.bench_root='"./benchmarks/my_experiment"' -Pexp.log_file='"./benchmarks/my_experiment/tf.log"'
# Print some results
python ./python/dlbs/logparser.py --keys exp.device_type results.time exp.framework_title exp.model_title exp.replica_batch -- ./benchmarks/my_experiment/tf.log

If you do not have NVIDIA GPUs, run TensorFlow in CPU mode (the only difference is that GPUs set to empty string: --exp.gpus=""):

# First steps same as in above GPU example - go to DLBS root folder and build/pull image.
# You may want to build a CPU only version of TensorFlow. By default, experimenter will use
# 'docker' to run CPU workloads what may not work. In the example below I override this
# behavior by providing exp.docker_launcher parameter.
cd dlbs
# Setup python paths
export PYTHONPATH=$(pwd)/python:$PYTHONPATH
# Run experiment
python ./python/dlbs/experimenter.py run -Pexp.framework='"tensorflow"' -Pexp.model='"resnet50"' -Pexp.gpus='""' -Pexp.log_file='"./benchmarks/my_experiment/tf.log"' -Pexp.docker_launcher='"nvidia-docker"'
# Print some results
python ./python/dlbs/logparser.py --keys exp.device_type results.time exp.framework_title exp.model_title exp.replica_batch -- ./benchmarks/my_experiment/tf.log

If everything is OK, you should expect seeing this JSON (training time - an average batch time - of course will be different depending on your GPU/CPU models):

{
    "data": [
        {
            "exp.device_type": "gpu",
            "exp.replica_batch": "16",
            "exp.framework_title": "TensorFlow",
            "exp.model_title": "ResNet50",
            "results.time": 255.59105431309905
        }
    ]
}

If results.time is not there, study ./benchmarks/my_experiment/tf.log for error messages.

Deep Learning CookBook

Deep Learning Benchmarking Suite is part of HPE's Deep Learning CookBook project. A project overview can be found on HPE developer portal here

Documentation

We host documentation on GitHub pages here.

License

Deep Learning Benchmarking Suite is released under the Apache 2.0 license.

Contact us

Natalia Vassilieva [email protected]
Sergey Serebryakov [email protected]

carsonchen1129 / dlcookbook-dlbs Goto Github PK

dlcookbook-dlbs's Introduction

Deep Learning Benchmarking Suite

Supported platforms

Installation

Quick start

Deep Learning CookBook

Documentation

License

Contact us

dlcookbook-dlbs's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs