GithubHelp home page GithubHelp logo

uwsampl / pytorch Goto Github PK

View Code? Open in Web Editor NEW
6.0 6.0 2.0 483.24 MB

Dynamic tensor rematerialization, implemented as a pytorch fork

License: Other

Python 32.28% Shell 0.56% Batchfile 0.04% CMake 1.43% Makefile 0.01% Java 0.21% C++ 53.98% C 4.11% Cuda 6.13% Assembly 0.34% Dockerfile 0.07% Metal 0.08% Objective-C++ 0.49% Objective-C 0.01% PureBasic 0.22% LLVM 0.01% Yacc 0.01% CSS 0.01% HTML 0.01% Ruby 0.02%

pytorch's People

Contributors

apaszke avatar bddppq avatar bwasti avatar colesbury avatar ezyang avatar gchanan avatar goldsborough avatar houseroad avatar jerryzh168 avatar jspark1105 avatar killeent avatar malfet avatar mingzhe09088 avatar mrshenli avatar onnxbot avatar peterjc123 avatar pietern avatar pjh5 avatar rohan-varma avatar smessmer avatar soumith avatar ssnl avatar suo avatar vishwakftw avatar wanchaol avatar xuhdev avatar yangqing avatar zasdfgbnm avatar zdevito avatar zou3519 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Forkers

tonytangyu wtzlas

pytorch's Issues

https://qywu.github.io/2019/05/22/explore-gradient-checkpointing.html

๐Ÿ› Bug

To Reproduce

Steps to reproduce the behavior:

Expected behavior

Environment

Please copy and paste the output from our
environment collection script
(or fill out the checklist below manually).

You can get the script and run it with:

wget https://raw.githubusercontent.com/pytorch/pytorch/master/torch/utils/collect_env.py
# For security purposes, please check the contents of collect_env.py before running it.
python collect_env.py
  • PyTorch Version (e.g., 1.0):
  • OS (e.g., Linux):
  • How you installed PyTorch (conda, pip, source):
  • Build command you used (if compiling from source):
  • Python version:
  • CUDA/cuDNN version:
  • GPU models and configuration:
  • Any other relevant information:

Additional context

bitwise_and not working because it mutate uncheckpointed value

import torch
from torch import tensor

a = torch.randn(4, 4).checkpoint()
a = torch.tensor([-1, -2, 3], dtype=torch.int8).checkpoint()
torch.bitwise_and(a, a)

ultimately this is due to bitwise_and initiate a tensor and call mutate on it internally.

two ways to fix this
0: I can allow checkpoint_function to take in raw tensor and mutate them. this I can do quite easily but is unsafe - while I can reflect change on cptensor back to raw tensor, I cant do the reverse without modifying the old tensors.
1: I can change bitwise_and. the obvious problem is that this is non-extendable. But why can bitwise_and initialize tensor? it must know whether the input is sparse/dense and initalize likewise. we can find this hopefully generic code and add to it.

I will go with route 1 for now.

weaks work better then vector<weak_ptr<AliasPool>>

๐Ÿ› Bug

To Reproduce

Steps to reproduce the behavior:

Expected behavior

Environment

Please copy and paste the output from our
environment collection script
(or fill out the checklist below manually).

You can get the script and run it with:

wget https://raw.githubusercontent.com/pytorch/pytorch/master/torch/utils/collect_env.py
# For security purposes, please check the contents of collect_env.py before running it.
python collect_env.py
  • PyTorch Version (e.g., 1.0):
  • OS (e.g., Linux):
  • How you installed PyTorch (conda, pip, source):
  • Build command you used (if compiling from source):
  • Python version:
  • CUDA/cuDNN version:
  • GPU models and configuration:
  • Any other relevant information:

Additional context

more stability for clear_checkpointpool?

get a vector of cptc for clear_checkpointpool, instead of reusing pool.
tensors in pool maybe incomplete and out of order due to eviction, so getting a seprate pool will fix this.

unrolled gan has problem

๐Ÿ› Bug

To Reproduce

Steps to reproduce the behavior:

Expected behavior

Environment

Please copy and paste the output from our
environment collection script
(or fill out the checklist below manually).

You can get the script and run it with:

wget https://raw.githubusercontent.com/pytorch/pytorch/master/torch/utils/collect_env.py
# For security purposes, please check the contents of collect_env.py before running it.
python collect_env.py
  • PyTorch Version (e.g., 1.0):
  • OS (e.g., Linux):
  • How you installed PyTorch (conda, pip, source):
  • Build command you used (if compiling from source):
  • Python version:
  • CUDA/cuDNN version:
  • GPU models and configuration:
  • Any other relevant information:

Additional context

New set of model for eval

Reformer
NAS inference
http://pyro.ai/examples/dmm.html
https://github.com/facebookresearch/clevr-iep
this will bring our total model count into 12, which we can layout 3*4 manner in the paper.
However, I am also thinking of Meta Learning with:
CNN
LSTM.
maybe we should replace a normal CNN/LSTM example with those.
MAML itself use CNN.
https://github.com/twitter/meta-learning-lstm
https://github.com/cflamant/neural-stack
Adaptive Computation Time
https://github.com/kentsommer/pytorch-value-iteration-networks/blob/master/model.py
https://github.com/ikostrikov/pytorch-meta-optimizer/blob/master/meta_optimizer.py

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.