
jankrepl / deepdow
848 stars · 26 watchers · 134 forks · 2.3 MB

Portfolio optimization with deep learning.

Home Page: https://deepdow.readthedocs.io

License: Apache License 2.0

Python 100.00%
deep-learning portfolio-optimization finance machine-learning pytorch timeseries markowitz convex-optimization stock-price-prediction wealth-management

deepdow's Introduction


codecov Documentation Status PyPI version DOI

deepdow (read as "wow") is a Python package connecting portfolio optimization and deep learning. Its goal is to facilitate research of networks that perform weight allocation in one forward pass.

Installation

pip install deepdow

Resources

Description

deepdow attempts to merge two very common steps in portfolio optimization:

  1. Forecasting the future evolution of the market (LSTM, GARCH, ...)
  2. Designing and solving an optimization problem (convex optimization, ...)

It does so by constructing a pipeline of layers. The last layer performs the allocation and all the previous ones serve as feature extractors. The overall network is fully differentiable and one can optimize its parameters by gradient descent algorithms.
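For intuition, below is a minimal sketch of this idea in plain PyTorch (a toy example, not deepdow's actual API): a feature extractor followed by a softmax allocation head, trainable end to end.

import torch
import torch.nn as nn

class TinyAllocator(nn.Module):
    """Toy pipeline: feature extraction followed by a differentiable allocation head."""

    def __init__(self, n_channels, lookback, n_assets, hidden=32):
        super().__init__()
        # Feature extractor: flattens the (channels, lookback, assets) cube.
        self.extractor = nn.Sequential(
            nn.Flatten(),
            nn.Linear(n_channels * lookback * n_assets, hidden),
            nn.ReLU(),
        )
        # Allocation head: one score per asset; softmax keeps the weights
        # positive and summing to one while staying differentiable.
        self.head = nn.Linear(hidden, n_assets)

    def forward(self, x):
        return torch.softmax(self.head(self.extractor(x)), dim=1)

net = TinyAllocator(n_channels=1, lookback=10, n_assets=5)
weights = net(torch.randn(4, 1, 10, 5))  # (batch, n_assets), rows sum to 1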

deepdow is not ...

  • focused on active trading strategies; it only finds allocations to be held over some horizon (buy and hold)
    • one implication is that transaction costs associated with frequent, short-term trades are not a primary concern
  • a reinforcement learning framework; however, one can easily reuse deepdow layers in other deep learning applications
  • a single algorithm; rather, it is a framework that allows for easy experimentation with powerful building blocks

Some features

  • all layers built on torch and fully differentiable
  • integrates differentiable convex optimization (cvxpylayers)
  • implements clustering-based portfolio allocation algorithms
  • multiple dataloading strategies (RigidDataLoader, FlexibleDataLoader)
  • integration with mlflow and tensorboard via callbacks
  • provides a variety of losses such as Sharpe ratio, maximum drawdown, ... (a simplified sketch follows this list)
  • simple to extend and customize
  • CPU and GPU support
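As an illustration of the loss idea, here is a simplified negative Sharpe ratio in plain torch (the tensor shapes are assumptions for this sketch, not deepdow's exact convention):

import torch

def neg_sharpe(weights, y, eps=1e-8):
    """weights: (batch, n_assets) allocations; y: (batch, horizon, n_assets) future returns."""
    port = (y * weights[:, None, :]).sum(-1)     # portfolio return per timestep
    sharpe = port.mean(1) / (port.std(1) + eps)  # per-sample Sharpe ratio
    return -sharpe.mean()                        # minimizing this maximizes Sharpe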

Citing

If you use deepdow (including ideas proposed in the documentation, examples and tests) in your research, please make sure to cite it. To obtain all the necessary citation information, click on the DOI badge at the beginning of this README and you will be automatically redirected to an external website. Note that we are currently using Zenodo.

deepdow's People

Contributors

atk71, guillermop98, jankrepl, louisoutin, mirca, shivanirathod126, turmeric-blend


deepdow's Issues

Dealing with sum(w) != 1

With the convex optimization it can happen that the solver does not find a solution, which results in weights that do not sum to one (sometimes drastically off).

Possible solutions

  • Postprocessing layer that rescales the weights to sum to one (sketched below)
  • Loss punishing incorrect w
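A minimal sketch of the postprocessing idea (a hypothetical layer, assuming long-only weights; negative weights would need rescaling by the sum of absolute values instead):

import torch
import torch.nn as nn

class RescaleWeights(nn.Module):
    """Hypothetical postprocessing layer: force weights to sum to one."""

    def forward(self, w, eps=1e-8):
        # Divide each sample's weights by their sum; eps guards against all-zero rows.
        return w / (w.sum(dim=-1, keepdim=True) + eps)

w = torch.tensor([[0.2, 0.2, 0.2]])
print(RescaleWeights()(w))  # tensor([[0.3333, 0.3333, 0.3333]])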

Implement __sub__

Currently one cannot just do -SomeLoss(). Of course, it can be hacked by doing (-1) * SomeLoss(). We want to support the first syntax directly; note that unary minus dispatches to __neg__ (a binary a - b between losses would additionally need __sub__).
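A minimal sketch of how this could be wired up (a hypothetical Loss base class; deepdow's real one has more machinery):

class Loss:
    def __call__(self, weights, y):
        raise NotImplementedError

    def __mul__(self, const):
        parent = self

        class _Scaled(Loss):
            def __call__(self, weights, y):
                return const * parent(weights, y)

        return _Scaled()

    __rmul__ = __mul__  # support const * loss as well as loss * const

    def __neg__(self):
        return (-1) * self  # unary minus just reuses the scaling machinery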

Additional augmentations

Rather than reinventing the wheel, one could just use torchvision transforms (https://pytorch.org/docs/stable/torchvision/transforms.html); a small composition sketch follows the list.

  • Compose (already recreated in deepdow)
  • RandomApply - apply all with some probability
  • RandomChoice - apply exactly one but at random
  • RandomOrder - apply all but in random order
  • 1D warping - Affine would be a special case; in theory one could use any increasing function (derivative > 0)
  • RandomAffine - scaling and translation along the y axis (lookback) could be a brilliant augmentation for deepdow tensors
  • RandomHorizontalFlip - flipping the time flow; probably super confusing if one wants to pick up mean reversion
  • Normalize - a must together with some helper function that computes means, stds in the training set. However, it still assumes that the time series is stationary.
  • RandomErasing - (similar to the current Dropout however it is contiguous regions)
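The meta-transforms (Compose, RandomApply, RandomChoice) accept arbitrary callables, so they already work on deepdow-style tensors. A sketch with two made-up augmentations:

import torch
from torchvision import transforms

# Toy augmentations on a (n_channels, lookback, n_assets) tensor.
add_noise = lambda x: x + 0.01 * torch.randn_like(x)
rescale = lambda x: x * (1 + 0.05 * torch.randn(1))

augment = transforms.Compose([
    transforms.RandomApply([add_noise], p=0.5),    # apply with some probability
    transforms.RandomChoice([rescale, add_noise]), # apply exactly one at random
])

x_aug = augment(torch.randn(1, 10, 5))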

Additionally, torchvision might also be helpful in other tasks (see #39)

The clear downside is introducing yet another dependency. Additionally, one might argue that it is better to go all the way and use imgaug, albumentations, ...

Other non-vision augmentations:

Make raw_to_Xy more transparent

Currently, deepdow.utils.raw_to_Xy does a lot of magic inside and outputs only the bare minimum for training:

  • X
  • timestamps
  • y
  • asset_names
  • indicators

It would be nice to have some debug mode that returns more.

raw_to_Xy doesn't handle gaps in data

raw_to_Xy appears to handle regular gaps in data (e.g. weekend days) but cannot handle irregular gaps such as holidays.

When fed trading data similar to the example at https://deepdow.readthedocs.io/en/latest/source/data_loading.html but covering an entire trading year, it gets out of sync on every holiday, e.g. a Monday that would typically trade but does not because of a holiday such as Jan 20, 2020.

The result is that the assertion assert timestamps[0] == raw_df.index[lookback] fails.

This, and likely other data formatting issues, causes an error when executing history = run.launch(30), namely RuntimeError: mat1 and mat2 shapes cannot be multiplied.
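One possible workaround (an assumption, not an official fix): reindex the raw frame onto a perfectly regular business-day calendar and forward-fill the holes before calling raw_to_Xy.

import numpy as np
import pandas as pd

# Toy frame: business days with one "holiday" removed to mimic an irregular gap.
idx = pd.bdate_range("2020-01-01", "2020-02-01").drop(pd.Timestamp("2020-01-20"))
raw_df = pd.DataFrame(np.random.rand(len(idx), 2), index=idx, columns=["A", "B"])

# Reindex onto the full business-day calendar and forward-fill the gap so the
# index has a regular frequency before raw_to_Xy sees it.
bdays = pd.bdate_range(raw_df.index.min(), raw_df.index.max())
raw_df = raw_df.reindex(bdays).ffill()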

Numpy metrics

Currently, all metrics accept and return torch.Tensor objects only. Isn't that limiting? NumPy support could be added with a thin adapter.
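A possible shape for such an adapter (numpyfy is a made-up name, shown only as a sketch):

import numpy as np
import torch

def numpyfy(metric):
    """Hypothetical adapter: wrap a torch metric so it accepts and returns numpy."""
    def wrapped(*arrays, **kwargs):
        tensors = [torch.as_tensor(a, dtype=torch.float32) for a in arrays]
        return metric(*tensors, **kwargs).detach().cpu().numpy()
    return wrapped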

Generating synthetic data

Apart from generating iid sequences, one can do a lot of different things; we just need to pay attention to not using too many external dependencies. A minimal sampler for the simplest statistical model is sketched after the lists below.

Statistical models

  • AR
  • ARMA
  • GARCH
  • VAR

Signal processing

State space models (both discrete and continuous latent space)

  • HMM
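As a starting point, a minimal AR(1) sampler in plain numpy (a sketch; the parameter names are made up):

import numpy as np

def ar1(n_timesteps, n_assets, phi=0.3, sigma=0.01, seed=None):
    """Sample AR(1) returns: x_t = phi * x_{t-1} + eps_t with eps_t ~ N(0, sigma^2)."""
    rng = np.random.default_rng(seed)
    x = np.zeros((n_timesteps, n_assets))
    for t in range(1, n_timesteps):
        x[t] = phi * x[t - 1] + sigma * rng.standard_normal(n_assets)
    return x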

Extra linefeed in Epoch reporting

With each epoch an extra linefeed is inserted when reporting metrics in Jupyter, producing a visible gap between the reported lines; the gap grows again in epoch 3 and so on.


Fix benchmarks

  • possibility to fix problem size in constructor
  • returns channel
  • feed batches into cvxpy

Improve visualize module

Some ideas below

  • The visualize module would use some helper function that takes a network and a dataloader and returns a DataFrame
  • weight_image
  • Include in documentation

Cleanup docs

Currently, there are a lot of typos, poorly written or unfinished sentences, etc...

Weight normalization allocator

It would be cool to create an allocator that just learns a single weight per asset. To make sure all the weights sum up to one, one can restrict them to be positive and then divide them by their sum.
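A sketch of the proposal (the class and parameter names are made up):

import torch
import torch.nn as nn

class PerAssetAllocator(nn.Module):
    """One learnable raw weight per asset; softplus keeps them positive,
    dividing by the sum turns them into a valid long-only allocation."""

    def __init__(self, n_assets):
        super().__init__()
        self.raw = nn.Parameter(torch.zeros(n_assets))

    def forward(self, x):
        w = nn.functional.softplus(self.raw)
        w = w / w.sum()
        return w.expand(len(x), -1)  # same allocation for every sample in the batch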

MLflow bumpup metric

For deterministic benchmarks, metrics can just be copied from the previous step rather than recomputed.

Argmax allocator

Probably not possible, since argmax would yield zero gradients almost everywhere.

Add python_requires

Assert the Python version via python_requires. It should correspond to what is tested (.travis.yml).
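A sketch of the change in setup.py (the >=3.6 floor is an assumption and should be read off the .travis.yml test matrix):

from setuptools import setup

setup(
    name="deepdow",
    python_requires=">=3.6",  # assumed floor; keep in sync with the CI test matrix
)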

Clipping in gradient_wrt_input

Currently, we implement one "Explainable" algorithm in deepdow.explain.gradient_wrt_input. The problem is that we do not restrict the values the input can have. One solution would be to implement some projection/clipping logic that takes place after each optimizer step and thus forces the values to be in a given range.

See https://arxiv.org/pdf/1702.04782.pdf
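A minimal sketch of the projection/clipping idea (the objective here is a stand-in for the real gradient_wrt_input loss):

import torch

x = torch.randn(1, 1, 10, 5, requires_grad=True)
optimizer = torch.optim.Adam([x], lr=0.01)

for _ in range(100):
    optimizer.zero_grad()
    loss = x.pow(2).sum()  # stand-in objective
    loss.backward()
    optimizer.step()
    # Projection step: clamp the optimized input back into the allowed range.
    with torch.no_grad():
        x.clamp_(-1.0, 1.0)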

Example: Learning NumericalMarkowitz parameters

It would be nice to show how deepdow is able to directly learn, or predict via a network, any of the input variables of deepdow.layers.NumericalMarkowitz.

It might be a good idea to use real data (e.g. yfinance); however, one needs to be careful about the example running too long (both CI and readthedocs need to run it).

AssertionError: when using BachelierNet

I'm able to get the out-of-the-box examples to execute successfully (getting_started and iid) when using the generated data, but when using different toy datasets I get an AssertionError in the cvxpy module.

/opt/conda/lib/python3.6/site-packages/cvxpy/cvxcore/python/canonInterface.py in nonzero_csc_matrix(A)
    162     # this function returns (rows, cols) corresponding to nonzero entries in
    163     # A; an entry that is explicitly set to zero is treated as nonzero
--> 164     assert not np.isnan(A.data).any()
    165 
    166     # scipy drops rows, cols with explicit zeros; use nan as a sentinel

AssertionError:

Steps to reproduce:

  1. Start with getting_started.ipynb
  2. Replace the data generation logic with loading data as described in this issue (I've used this as well as larger data sets)
  3. Replace the Network Definition with:
from deepdow.nn import BachelierNet

n_channels = X.shape[1]
lookback = X.shape[2]
n_assets = X.shape[3]
max_weight = 0.5
hidden_size = 32

network = BachelierNet(n_channels, n_assets, hidden_size=hidden_size, max_weight=max_weight)

print(network)

The same error occurs even when reducing channels to 1, increasing the number of samples, and keeping lookback, gap, horizon small (5, 0, 1).
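Since the failing cvxpy assertion rejects NaNs, a quick sanity check on the tensors from the reproduction above (X and y) often pinpoints the problem:

import torch

for name, arr in [("X", X), ("y", y)]:
    t = torch.as_tensor(arr)
    print(name, "has NaNs:", bool(torch.isnan(t).any()),
          "| has infs:", bool(torch.isinf(t).any()))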

raw2Xy

Complete preprocessor

Custom portfolio benchmark

It would be nice to have a benchmark that is just some predefined portfolio. One would construct it by passing all the weights.
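A sketch of such a benchmark (a hypothetical class mirroring a network's callable interface, not deepdow's actual Benchmark API):

import torch

class FixedPortfolio:
    """Benchmark that always returns the same predefined allocation."""

    def __init__(self, weights):
        self.weights = torch.as_tensor(weights, dtype=torch.float32)

    def __call__(self, x):
        return self.weights.expand(len(x), -1)  # one copy per sample in the batch

bench = FixedPortfolio([0.25, 0.25, 0.25, 0.25])
print(bench(torch.randn(8, 1, 10, 4)).shape)  # torch.Size([8, 4])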

something about the turnover constraint

These days I have been using your deepdow package to do some experiments with portfolio optimization. Thanks for your great work!
But I have a problem. I want to add a turnover-rate constraint to the optimization. To achieve that, on every training iteration I have to keep the weights that have been calculated, so that in the next iteration I can make sure the newly calculated weights are not too far from the previous ones.
So I want to ask: is there some way to save the weights each time the network computes them during training?
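One possible answer using a standard PyTorch forward hook (network stands for the user's trained model; this is a sketch, not deepdow-specific API):

import torch

weight_history = []  # grows by one (batch, n_assets) tensor per forward pass

def record_weights(module, inputs, output):
    weight_history.append(output.detach().cpu())

handle = network.register_forward_hook(record_weights)
# ... run training; weight_history[-1] holds the previous allocation, which a
# turnover penalty such as (w_new - weight_history[-1]).abs().sum() can use.
# Call handle.remove() when finished.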
