Tempo

Monash Time Series Analytics Library. Monash University, Melbourne, Australia.

Tempo is not only a project for researchers! Our aim is to deliver a user-friendly library, allowing anyone to enjoy our last work in the domain of time series classification.

Note: Tempo is young and in active development. Do not hesitate to report issues!

Note: We experiment using the UCR archive, mostly in its 85 datasets version. The archive can be found at timeseriesclassification.com. It now contains more datasets. The 85 datasets are listed here.

Features

Elastic distances (C++, Java, Python3)

Starting point of our library, we provide implementations of the most commonly used elastic distances. Our implementations in C++ and Java are early abandoned and pruned, and we offer Python3 bindings. Available distances are

DTW and CDTW, with lower bounds (Lb Keogh, Lb Enhanced, Lb Webb)
WDTW
ERP
LCSS
MSM
Squared Euclidean Distance
TWE

Our implementations have a O(n) space complexity and tightly integrate pruning with early abandoning, providing high performance and scalability. They should provide a significant speed up in most scenarios, even though there worst time complexity remain O(n²). Note that they will not always be faster than the naive implementations as early abandoning and pruning require some bookkeeping. This bookkeeping must be compensated for, and this is not always possible (e.g. when the series are too short, with windows too small).

In C++/Python, we have two versions of each distance: one with a cut-off point and one without (In java, only the cut-off version is available). The cut-off point is an upper bound above which pruning starts. When "enough" pruning occurs (i.e. a full line of the cost matrix), the computation is early abandoned. When no cut-off point is provided, we compute one ourselves (based one the diagonal and the last column of the cost matrix) before calling the version with cut-off. This allows to prune computation even when no cut-off point is provided (no early abandoning can occur under these conditions).

Benchmarks

The early abandoning and pruning algorithm is submitted in a paper under review. The paper (arxiv pre-print) contains benchmarks that can be found here.

Generic approach to "transforms" (C++)

The derivative version of DTW, CDTW and WDTW are not directly implemented. We rather use the concept of "transforms", which is a pre-processing step. Currently available tranforms are:

Upper and Lower envelope computations (to use with lower bound)
Derivative (can be repeated)

Hence, you can easily set up a derivative (or nth derivative) of any distance.

Proximity Forest library (C++)

Proximity forest (arxiv pre-print) is an ensemble classifier based on elastic distances developed at Monash University. It's original version was developed in Java.

We provide the building blocks to rewrite Proximity Forest in C++. See the Proximity Forest 2018 application.

Using Tempo in your project

As a C++ library with CMake

Download the sources and link to Tempo adding in your CMakeLists.txt:

add_subdirectory(path_to_tempo)
target_link_libraries(<target> PRIVATE tempo)

Python bindings

Bindings tested for Python3. All commands here assume a Python3 environment. Ensure that the wheel packages is installed (e.g. with pip install wheel)

pip install wheel                             # To install wheel
python setup.py bdist_wheel                   # Build the package
pip install ./dist/pytempo-[info].whl         # Path to wheel ( --force-reinstall to overwrite a previous installation)

Distances must be called with numpy arrays. Have a look at the example bindings/python/example.py.

To remove the package:

pip uninstall pytempo.distances

Python bindings are made with pybind11.

As a Java library

Simply copy/past the files from the java directory.

Applications

Our applications are not only demonstrators: you can use them for your work too! The only constraint is to use the TS format from timeseriesclassification.com for your data. You can either give the path to a UCR Archive folder and dataset name (flag -ucr), or directly the train and test files (flag -tt).

Building an applications

Note: we developed and tested our applications under Linux, using standard C++ 17. The following steps work in a Linux terminal, and should work on MacOS too. For Windows, refer to CMake user doc.

Open a terminal, and type in the following command, <app-folder> being the path to the desired application's folder, and <app-name> its name which is also the executable name (e.g. nn1dist). The last line will print the help message of the applications.

cd <app-folder>
mkdir cmake-build-release
cd cmake-build-release
cmake .. -DCMAKE_BUILD_TYPE=RELEASE
make
./<app-name>

NN1

C++ NN1 classification using one of the elastic distances. The distance is specified with the -dist flag, followed by the distance arguments (see the help message). A nth derivative can be specified with the -tr derivative n flag. So

./nn1dist -dist msm 0.05 -ucr ~/Univariate_ts/ Crop -tr derivative 1

will perform a NN1 "Derivative MSM" classification.

Proximity Forest 2018

C++ Proximity Forest implementation, using the building blocks from the library. It has the same features as the original Java version, using our early abandoned and pruned distances.

As a result, this version is 3.8 times faster than our Java version when tested on the UCR archive in its 85 datasets version. You can check our claim here.

About

Tempo is a project of the Time Series Classification team from Monash University. It draws inspiration from many scientific sources that can be found in the reference folder. Tempo is developed by Dr. Matthieu Herrmann

srishti013 / tempo Goto Github PK

tempo's Introduction

Tempo

Features

Elastic distances (C++, Java, Python3)

Benchmarks

Generic approach to "transforms" (C++)

Proximity Forest library (C++)

Using Tempo in your project

As a C++ library with CMake

Python bindings

As a Java library

Applications

Building an applications

About

tempo's People

Contributors

Recommend Projects

Recommend Topics

Recommend Org

Jobs