microsoft / tensorwatch

Debugging, monitoring and visualization for Python Machine Learning and Data Science

License: MIT License


tensorwatch's Introduction

Welcome to TensorWatch

TensorWatch is a debugging and visualization tool designed for data science, deep learning and reinforcement learning from Microsoft Research. It works in Jupyter Notebook to show real-time visualizations of your machine learning training and perform several other key analysis tasks for your models and data.

TensorWatch is designed to be flexible and extensible so you can also build your own custom visualizations, UIs, and dashboards. Besides the traditional "what-you-see-is-what-you-log" approach, it also has a unique capability to execute arbitrary queries against your live ML training process, return a stream as the result of the query, and view this stream using a visualizer of your choice (we call this Lazy Logging Mode).

TensorWatch is under heavy development with the goal of providing a platform for debugging machine learning in one easy-to-use, extensible, and hackable package.

TensorWatch in Jupyter Notebook

How to Get It

pip install tensorwatch

TensorWatch supports Python 3.x and is tested with PyTorch 0.4-1.x. Most features should also work with TensorFlow eager tensors. TensorWatch uses graphviz to create network diagrams, and depending on your platform you might sometimes need to install it manually.
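For example, on Debian/Ubuntu the graphviz binaries and its Python bindings can usually be installed with the following (standard package names; adjust for your platform):

sudo apt-get install graphviz
pip install graphviz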

How to Use It

Quick Start

Here's a simple script that logs an integer and its square as a tuple every second to TensorWatch:

import tensorwatch as tw
import time

# streams will be stored in test.log file
w = tw.Watcher(filename='test.log')

# create a stream for logging
s = w.create_stream(name='metric1')

# generate Jupyter Notebook to view real-time streams
w.make_notebook()

for i in range(1000):
    # write x,y pair we want to log
    s.write((i, i*i))

    time.sleep(1)

When you run this code, you will notice a Jupyter Notebook file test.ipynb gets created in your script folder. From a command prompt type jupyter notebook and select test.ipynb. Choose Cell > Run all in the menu to see the real-time line graph as values get written in your script.

Here's the output you will see in Jupyter Notebook:

TensorWatch in Jupyter Notebook

To dive deeper into the various other features, please see Tutorials and notebooks.

How does this work?

When you write to a TensorWatch stream, the values get serialized and sent to a TCP/IP socket as well as the file you specified. From Jupyter Notebook, we load the previously logged values from the file and then listen to that TCP/IP socket for any future values. The visualizer listens to the stream and renders the values as they arrive.

Ok, so that's a very simplified description. The TensorWatch architecture is actually much more powerful. Almost everything in TensorWatch is a stream. Files, sockets, consoles and even visualizers are streams themselves. A cool thing about TensorWatch streams is that they can listen to any other streams. This allows TensorWatch to create a data flow graph. This means that a visualizer can listen to many streams simultaneously, each of which could be a file, a socket or some other stream. You can recursively extend this to build arbitrary data flow graphs. TensorWatch decouples streams from how they get stored and how they get visualized.
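To make the stream-and-listener idea concrete, here is a small illustrative sketch (not the actual TensorWatch classes): every node is a stream, and any stream can forward whatever it receives to the streams subscribed to it, which is how a data flow graph forms.

# Illustrative sketch only -- not the real TensorWatch classes.
class Stream:
    def __init__(self):
        self._subscribers = []

    def subscribe(self, listener: "Stream"):
        # 'listener' will now receive everything written to this stream
        self._subscribers.append(listener)

    def write(self, value):
        self.on_value(value)
        for listener in self._subscribers:
            listener.write(value)

    def on_value(self, value):
        pass  # a file stream would save it, a socket stream would send it, a visualizer would draw it

class PrintStream(Stream):
    def on_value(self, value):
        print('received:', value)

source = Stream()
sink = PrintStream()
source.subscribe(sink)   # sink listens to source
source.write((1, 1))     # flows through the graph and prints: received: (1, 1)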

Visualizations

In the above example, the line graph is used as the default visualization. However, TensorWatch supports many other diagram types including histograms, pie charts, scatter charts, bar charts and 3D versions of many of these plots. You can log your data, specify the chart type you want and let TensorWatch take care of the rest.

One of the significant strengths of TensorWatch is the ability to combine, compose, and create custom visualizations effortlessly. For example, you can choose to visualize an arbitrary number of streams in the same plot. Or you can visualize the same stream in many different plots simultaneously. Or you can place an arbitrary set of visualizations side-by-side. You can even create your own custom visualization widget simply by creating a new Python class, implementing a few methods.
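As a rough sketch of what this looks like in code (the vis_type values and the host parameter for sharing a plot follow the tutorials; exact names may differ across versions):

import tensorwatch as tw

w = tw.Watcher(filename='test.log')
train_loss = w.create_stream(name='train_loss')
test_loss = w.create_stream(name='test_loss')

# pick a chart type per visualizer
line_plot = tw.Visualizer(train_loss, vis_type='line')
line_plot.show()

# draw a second stream in the same plot by passing the first visualizer as host
tw.Visualizer(test_loss, vis_type='line', host=line_plot)

# the same stream can also feed a different chart type at the same time
hist_plot = tw.Visualizer(train_loss, vis_type='histogram')
hist_plot.show()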

Comparing Results of Multiple Runs

Each TensorWatch stream may contain a metric of your choice. By default, TensorWatch saves all streams in a single file, but you could also choose to save each stream in a separate file or not to save them at all (for example, sending streams over sockets or directly into the console, with zero hits to disk!). Later you can open these streams and direct them to one or more visualizations. This design lets you easily compare the results of different experiments in the visualizations of your choice.
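For example, a minimal sketch of that workflow might look like the following (this assumes the Watcher can re-open a previously saved stream by name via an open_stream call; treat that call and the Visualizer host parameter as assumptions rather than documented API):

import tensorwatch as tw

# run 1 and run 2 each logged a 'train_loss' stream to their own file
w1 = tw.Watcher(filename='run1.log')
w2 = tw.Watcher(filename='run2.log')

s1 = w1.open_stream(name='train_loss')   # assumption: re-opens the saved stream
s2 = w2.open_stream(name='train_loss')

# overlay both runs in one line plot for comparison
plot = tw.Visualizer(s1, vis_type='line')
tw.Visualizer(s2, vis_type='line', host=plot)
plot.show()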

Training within Jupyter Notebook

Often you might prefer to do data analysis, ML training, and testing - all from within Jupyter Notebook instead of from a separate script. TensorWatch can help you do sophisticated, real-time visualizations effortlessly from code that is run within a Jupyter Notebook end-to-end.

Lazy Logging Mode

A unique feature in TensorWatch is the ability to query the live running process, retrieve the result of this query as a stream and direct this stream to your preferred visualization(s). You don't need to log any data beforehand. We call this new way of debugging and visualization a lazy logging mode.
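In code, the pattern looks roughly like this (adapted from the sum_lazy example quoted in the issues below; the Visualizer call follows the tutorials, and numpy is assumed to be available inside the query expression):

# training script: nothing is logged up front
import time
import random
import tensorwatch as tw

w = tw.Watcher()
for i in range(10000):
    weights = [random.random() for _ in range(5)]
    w.observe(weights=weights, step=i)   # cheap: just makes variables observable
    time.sleep(1)

# Jupyter Notebook: query the live process and visualize the resulting stream
client = tw.WatcherClient()
stream = client.create_stream(expr='lambda d: np.sum(d.weights)')
tw.Visualizer(stream, vis_type='line').show()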

For example, as seen below, we visualize input and output image pairs, sampled randomly during the training of an autoencoder on a fruits dataset. These images were not logged beforehand in the script. Instead, the user sends a query as a Python lambda expression, which results in a stream of images that gets displayed in the Jupyter Notebook:

TensorWatch in Jupyter Notebook

See Lazy Logging Tutorial.

Pre-Training and Post-Training Tasks

TensorWatch leverages several excellent libraries, including hiddenlayer, torchstat, and Visual Attribution, to let you perform the usual debugging and analysis activities in one consistent package and interface.

For example, you can view the model graph, with tensor shapes, using a one-liner:

Model graph for Alexnet
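The call that produces a graph like the one above (it also appears in the issue reports below; the list is the dummy input shape) is:

import tensorwatch as tw
import torchvision.models

alexnet_model = torchvision.models.alexnet()
tw.draw_model(alexnet_model, [1, 3, 224, 224])   # [batch, channels, height, width]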

You can view statistics for different layers, such as FLOPs, number of parameters, etc.:

Model statistics for Alexnet
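Similarly, per-layer statistics come from a single call (as used in the issue reports below):

import tensorwatch as tw
import torchvision.models

resnet_model = torchvision.models.resnet50()
tw.model_stats(resnet_model, [1, 3, 224, 224])   # MAdd, FLOPs, memory, parameters per layer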

See notebook.

You can view the dataset in a lower-dimensional space using techniques such as t-SNE:

t-SNE visualization for MNIST

See notebook.

Prediction Explanations

We wish to provide various tools for explaining predictions to help with debugging models. Currently, we offer several explainers for convolutional networks, including Lime. For example, the following highlights the areas that cause the ResNet50 model to make a prediction for class 240 on the ImageNet dataset:

CNN prediction explanation
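The saliency API used for this (pieced together from the cnn_pred_explain notebook excerpts quoted in the issues below; the image path is a hypothetical placeholder and helper names may vary by version) looks roughly like:

import torchvision.models
from tensorwatch import image_utils, imagenet_utils, pytorch_utils
from tensorwatch.saliency import saliency

model = torchvision.models.resnet50(pretrained=True)
img = image_utils.open_image('dog.jpg', convert_mode='RGB')   # hypothetical local image
input_tensor = imagenet_utils.image2batch(img)
prediction_tensor = pytorch_utils.int2tensor(240)             # class 240, as in the example above

results = saliency.get_image_saliency_results(model, img, input_tensor, prediction_tensor)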

See notebook.

Tutorials

Paper

More technical details are available in TensorWatch paper (EICS 2019 Conference). Please cite this as:

@inproceedings{tensorwatch2019eics,
  author    = {Shital Shah and Roland Fernandez and Steven M. Drucker},
  title     = {A system for real-time interactive analysis of deep learning training},
  booktitle = {Proceedings of the {ACM} {SIGCHI} Symposium on Engineering Interactive
               Computing Systems, {EICS} 2019, Valencia, Spain, June 18-21, 2019},
  pages     = {16:1--16:6},
  year      = {2019},
  crossref  = {DBLP:conf/eics/2019},
  url       = {https://arxiv.org/abs/2001.01215},
  doi       = {10.1145/3319499.3328231},
  timestamp = {Fri, 31 May 2019 08:40:31 +0200},
  biburl    = {https://dblp.org/rec/bib/conf/eics/ShahFD19},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

Contribute

We would love your contributions, feedback, questions, and feature requests! Please file a GitHub issue or send us a pull request. Please review the Microsoft Code of Conduct and learn more.

Contact

Join the TensorWatch group on Facebook to stay up to date or ask any questions.

Credits

TensorWatch utilizes several open source libraries for many of its features. These include: hiddenlayer, torchstat, Visual-Attribution, pyzmq, receptivefield, nbformat. Please see the install_requires section in setup.py for an up-to-date list.

License

This project is released under the MIT License. Please review the License file for more details.

tensorwatch's People

Contributors

0xflotus, brollb, dawars, ecederstrand, kikusui6192, lasp73, lesteve, lokitoth, microsoft-github-policy-service[bot], microsoftopensource, msftgits, repodevs, sytelus, yoongi0428


tensorwatch's Issues

Multiplexing multiple .observe() onto the same Watcher?

The code

I'm using a slightly modified sum_lazy.py

import time
import random
import tensorwatch as tw

# create watcher object as usual
w = tw.Watcher()

weights = None
for i in range(10000):
    weights = [random.random() for _ in range(5)]

    # let watcher observe variables we have
    # this has almost no performance cost
    w.observe(weights=weights, pqr=42)
    w.observe(my_test=[-x for x in weights])
    w.observe(**{"my_testX": [-x for x in weights]})
    time.sleep(1)

and a slightly modified lazy_logging.ipynb

client = tw.WatcherClient()
#stream = client.create_stream(expr='lambda d: np.sum(d.my_test)')
stream = client.create_stream(expr='lambda d: print(d)')

Expected behavior

I'd like (expect?) different observations to be multiplexed into the stream.

Observed behavior

Only the first call to .observe() produces an event.

Singularity devX.sif:~/docks/dev/numericcal/radoye/tensorwatch/test/simple_log> python sum_lazy.py 

weights="[0.7598638948148748, 0.9609570253109472, 0.17193771268530844, 0.945312325751268, 0.8840031816567768]", pqr=42
weights="[0.9633391250000072, 0.4817883214569013, 0.5066860131563821, 0.536321656766366, 0.4866203822451308]", pqr=42
weights="[0.8682968602613323, 0.25722248154751726, 0.10094894832942802, 0.8529007560893397, 0.4847747465461011]", pqr=42
weights="[0.1920386201704245, 0.2917508358159102, 0.13423189557362525, 0.48907668597008847, 0.7810377425785467]", pqr=42

KeyError: 'Label not in explanation'

I was trying the CNN explanation code on a new image (still a cat and a dog), but got the error in the subject line. The image I was using is from this page: http://veterinarymedicine.dvm360.com/environmental-enrichment-why-old-dogs-and-cats-need-new-tricks

Here is the output from the prediction:
((0.92600876, 235, 'German_shepherd', 'n02106662'),
(0.016059708, 224, 'groenendael', 'n02105056'),
(0.012777518, 225, 'malinois', 'n02105162'),
(0.0060081864, 174, 'Norwegian_elkhound', 'n02091467'),
(0.004870529, 231, 'collie', 'n02106030'))

Does it support prediction explainers for networks other than CNNs, such as networks based on LSTMs or attention?

Read This First

  • Make sure to describe all the steps to reproduce the issue
  • Include full error message in the description
  • Add OS version, Python version, Pytorch version if applicable

Remember: if we cannot reproduce your problem, we cannot find a solution!

What's better than filing an issue? Filing a pull request :).

------------------------------------ (Remove above before filing the issue) ------------------------------------

Complaint: Are you trying to download half of internet or something?

There are too many dependencies, and it is easy to get an error from one of them during installation. See this:

ERROR: Error checking for conflicts.
Traceback (most recent call last):
  File "d:\program files\python37\lib\site-packages\pip\_vendor\pkg_resources\__init__.py", line 3012, in _dep_map
    return self.__dep_map
  File "d:\program files\python37\lib\site-packages\pip\_vendor\pkg_resources\__init__.py", line 2806, in __getattr__
    raise AttributeError(attr)
AttributeError: _DistInfoDistribution__dep_map

AttributeError: 'Dot' object has no attribute '_repr_svg_'

My usage is like this:
model = Model()
tw.draw_model(model, [1, 2, 256, 256])

and get error message:

AttributeError                            Traceback (most recent call last)
~/miniconda2/envs/pytorch_env/lib/python3.6/site-packages/IPython/core/formatters.py in __call__(self, obj)
    343             method = get_real_method(obj, self.print_method)
    344             if method is not None:
--> 345                 return method()
    346             return None
    347         else:

~/miniconda2/envs/pytorch_env/lib/python3.6/site-packages/tensorwatch/model_graph/hiddenlayer/pytorch_draw_model.py in _repr_svg_(self)
     11     def _repr_svg_(self):
     12         """Allows Jupyter notebook to render the graph automatically."""
---> 13         return self.dot._repr_svg_()
     14     def save(self, filename, format="png"):
     15         # self.dot.format = format

AttributeError: 'Dot' object has no attribute '_repr_svg_'

Jupyter Lab

The examples I've tried are running just fine in a classic jupyter notebook server, but fail to dynamically update in jupyter lab.

One testable instance to reproduce the issue is the quickstart:

import tensorwatch as tw
import time

# streams will be stored in test.log file
w = tw.Watcher(filename='test.log')

# create a stream for logging
s = w.create_stream(name='metric1')

# generate Jupyter Notebook to view real-time streams
w.make_notebook()

for i in range(1000):
    # write x,y pair we want to log
    s.write((i, i*i))

    time.sleep(1)

After making the change in the autogenerated test.ipynb: %matplotlib notebook -> %matplotlib inline, a plot is drawn upon running the cells but this plot remains static unless it is refreshed (re-run the cell manually).

I know this is somewhat due to a difference in how the classic notebook handles inline content vs. lab, but I'm not able to determine the changes necessary to accommodate lab.

I noticed this file: https://github.com/microsoft/tensorwatch/blob/master/install_jupyterlab.bat, but since there isn't much context for it, I am wondering if there's been an effort to get streams plotting in jupyter lab.

Appreciate any insights!

Cannot install with pip because depends on package not hosted on PyPi

> pip install tensorwatch
Collecting tensorwatch
  Using cached tensorwatch-0.9.0.tar.gz (187 kB)
ERROR: Packages installed from PyPI cannot depend on packages which are not also hosted on PyPI.
tensorwatch depends on pydot@ git+https://github.com/sytelus/[email protected]#egg=pydot 

However, I can install tensorwatch version 0.8.10 without issue. I believe the problem was introduced by 353567f

I'm running ubuntu 16 with pip 20.0.2.

How to save tw.draw_model to image file

I can only run my model on a server, so I can't see the model graph directly from the example code
tw.draw_model.
So, can anyone tell me how to save the graph data to an image file? Thanks.

Import Error

I use pip install tensorwatch to install the package.

When I import it, I get this error:

Traceback (most recent call last):

  File "/home/shengtao/venv/lib/python3.5/site-packages/IPython/core/interactiveshell.py", line 3291, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)

  File "<ipython-input-7-72273cc44010>", line 1, in <module>
    import tensorwatch as tw

  File "/home/shengtao/venv/lib/python3.5/site-packages/tensorwatch/__init__.py", line 6, in <module>
    from .watcher_client import WatcherClient

  File "/home/shengtao/venv/lib/python3.5/site-packages/tensorwatch/watcher_client.py", line 5, in <module>
    from .zmq_wrapper import ZmqWrapper

  File "/home/shengtao/venv/lib/python3.5/site-packages/tensorwatch/zmq_wrapper.py", line 16
    _thread:Thread = None
           ^
SyntaxError: invalid syntax

Then, I download the source code and run python setup.py install. The error is:

byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/file_stream.py to file_stream.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/plotly/embeddings_plot.py to embeddings_plot.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/plotly/line_plot.py to line_plot.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/plotly/base_plotly_plot.py to base_plotly_plot.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/plotly/__init__.py to __init__.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/stream_union.py to stream_union.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/lv_types.py to lv_types.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/filtered_stream.py to filtered_stream.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/image_utils.py to image_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/receptive_field/rf_utils.py to rf_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/receptive_field/__init__.py to __init__.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/repeated_timer.py to repeated_timer.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/text_vis.py to text_vis.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/visualizer.py to visualizer.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/array_stream.py to array_stream.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/tensor_utils.py to tensor_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/vis_base.py to vis_base.cpython-35.pyc
  File "build/bdist.linux-x86_64/egg/tensorwatch/vis_base.py", line 73
    stream_vis:StreamVisInfo = None
              ^
SyntaxError: invalid syntax

byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/imagenet_utils.py to imagenet_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/model_graph/torchstat_utils.py to torchstat_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/model_graph/hiddenlayer/pytorch_builder.py to pytorch_builder.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/model_graph/hiddenlayer/tf_builder.py to tf_builder.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/model_graph/hiddenlayer/transforms.py to transforms.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/model_graph/hiddenlayer/ge.py to ge.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/model_graph/hiddenlayer/graph.py to graph.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/model_graph/hiddenlayer/__init__.py to __init__.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/model_graph/__init__.py to __init__.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/zmq_wrapper.py to zmq_wrapper.cpython-35.pyc
  File "build/bdist.linux-x86_64/egg/tensorwatch/zmq_wrapper.py", line 16
    _thread:Thread = None
           ^
SyntaxError: invalid syntax

byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/pytorch_utils.py to pytorch_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/zmq_mgmt_stream.py to zmq_mgmt_stream.cpython-35.pyc
  File "build/bdist.linux-x86_64/egg/tensorwatch/zmq_mgmt_stream.py", line 19
    self._stream_reqs:Dict[str,StreamCreateRequest] = {}
                     ^
SyntaxError: invalid syntax

byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/embeddings/tsne_utils.py to tsne_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/embeddings/__init__.py to __init__.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/notebook_maker.py to notebook_maker.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/watcher_client.py to watcher_client.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/evaler_utils.py to evaler_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/evaler.py to evaler.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/data_utils.py to data_utils.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/zmq_stream.py to zmq_stream.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/watcher.py to watcher.cpython-35.pyc
byte-compiling build/bdist.linux-x86_64/egg/tensorwatch/stream_factory.py to stream_factory.cpython-35.pyc
  File "build/bdist.linux-x86_64/egg/tensorwatch/stream_factory.py", line 18
    self._streams:Dict[str, Stream] = None
                 ^
SyntaxError: invalid syntax

I try to import the package again and the error is the same.

Python 3.5.2 (default, Nov 12 2018, 13:43:14)
[GCC 5.4.0 20160609] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import tensorwatch
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/data1/shengtao/Project/ceshi2/tensorwatch/tensorwatch/__init__.py", line 6, in <module>
    from .watcher_client import WatcherClient
  File "/data1/shengtao/Project/ceshi2/tensorwatch/tensorwatch/watcher_client.py", line 5, in <module>
    from .zmq_wrapper import ZmqWrapper
  File "/data1/shengtao/Project/ceshi2/tensorwatch/tensorwatch/zmq_wrapper.py", line 16
    _thread:Thread = None
           ^
SyntaxError: invalid syntax

I don't use anaconda but virtualenv.

pip install issue: "SyntaxError: invalid syntax"

Hello,
it looks like a very cool tool!!
I tried to install with pip and got the following issue:

Collecting tensorwatch
Using cached https://files.pythonhosted.org/packages/ce/f2/4885c7f5ddf06224fc1443bb998464755e542c34f9966de4e686b9f1e43e/tensorwatch-0.8.4.tar.gz
Requirement already satisfied: matplotlib in /media/ophir/DATA1/software/anaconda3/envs/pytorch/lib/python3.5/site-packages (from tensorwatch) (2.2.2)
Requirement already satisfied: numpy in /media/ophir/DATA1/software/anaconda3/envs/pytorch/lib/python3.5/site-packages (from tensorwatch) (1.14.2)
Requirement already satisfied: pyzmq in /media/ophir/DATA1/software/anaconda3/envs/pytorch/lib/python3.5/site-packages (from tensorwatch) (17.1.2)
Requirement already satisfied: plotly in /media/ophir/DATA1/software/anaconda3/envs/pytorch/lib/python3.5/site-packages (from tensorwatch) (3.3.0)
Collecting torchstat (from tensorwatch)
Using cached https://files.pythonhosted.org/packages/bc/fe/f483b907ca80c90f189cd892bb2ce7b2c256010b30314bbec4fc17d1b5f1/torchstat-0.0.7-py3-none-any.whl
Collecting receptivefield (from tensorwatch)
Using cached https://files.pythonhosted.org/packages/cd/2a/a140221d151e228c5995e34f9c60d1ffd756f8672ccfbce8efe5da780671/receptivefield-0.4.0.tar.gz
ERROR: Complete output from command python setup.py egg_info:
ERROR: Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/media/ophir/DATA1/ilyan/tmp/pip-install-gb2l8gtq/receptivefield/setup.py", line 13
    download_url=f'https://github.com/fornaxai/receptivefield/archive/{VERSION}.tar.gz',
                ^
SyntaxError: invalid syntax
----------------------------------------
ERROR: Command "python setup.py egg_info" failed with error code 1 in /media/ophir/DATA1/ilyan/tmp/pip-install-gb2l8gtq/receptivefield/

Does anyone have a solution?

Thanks
Ophir

Can't generate images

import tensorwatch as tw
import torchvision.models
alexnet_model = torchvision.models.alexnet()
tw.draw_model(alexnet_model, [1, 3, 224, 224])

error:

ModuleNotFoundError                       Traceback (most recent call last)
~/anaconda2/envs/pytorch1.0/lib/python3.6/site-packages/IPython/core/formatters.py in __call__(self, obj)
    343             method = get_real_method(obj, self.print_method)
    344             if method is not None:
--> 345                 return method()
    346             return None
    347         else:

~/anaconda2/envs/pytorch1.0/lib/python3.6/site-packages/tensorwatch/model_graph/hiddenlayer/graph.py in _repr_svg_(self)
    391     def _repr_svg_(self):
    392         """Allows Jupyter notebook to render the graph automatically."""
--> 393         return self.build_dot(self.orientation)._repr_svg_()
    394
    395     def save(self, path, format="pdf"):

~/anaconda2/envs/pytorch1.0/lib/python3.6/site-packages/tensorwatch/model_graph/hiddenlayer/graph.py in build_dot(self, orientation)
    333         Returns a GraphViz Digraph object.
    334         """
--> 335         from graphviz import Digraph
    336
    337         # Build GraphViz Digraph

ModuleNotFoundError: No module named 'graphviz'

<tensorwatch.model_graph.hiddenlayer.graph.Graph at 0x7fe7ce046898>

Writing it as follows gives no error, but also no image:

import tensorwatch as tw
import torchvision.models
alexnet_model = torchvision.models.alexnet()
dd = tw.draw_model(alexnet_model, [1, 3, 224, 224])
print(dd)

<tensorwatch.model_graph.hiddenlayer.graph.Graph object at 0x7fe7cdfc5c88>

read values from files

The README states that the results of multiple runs (stored in log files) can be compared, but it's not clear to me how that works.
It seems that the WatcherClient only operates on the current stream and ignores values that have been written into a file earlier.
How can I plot the values from a log file that is not being streamed to?

Need to support AdaptiveAvgPool2d for ResNet

When I run TensorWatch on Google Colab, the following messages appear:

[MAdd]: AdaptiveAvgPool2d is not supported!
[Flops]: AdaptiveAvgPool2d is not supported!
[Memory]: AdaptiveAvgPool2d is not supported!

The test code is as follows:

!pip install tensorwatch

import tensorwatch as tw
import torchvision.models

resnet_model = torchvision.models.resnet50()
tw.model_stats(resnet_model, [1, 3, 224, 224])

Is pip install -e . really needed for regim?

This is a Jupyter notebook sample issue.
Most of the Jupyter notebook samples say the following:

git clone https://github.com/sytelus/regim.git
cd regim
pip install -e .

But I think the following one-line instruction is enough:

pip install regim

Issue with draw model

Hello,

I've just installed tensorwatch and tried to reproduce the example:

import tensorwatch as tw
import torchvision.models

alexnet_model = torchvision.models.alexnet()
tw.draw_model(alexnet_model, [1, 3, 224, 224])

and unfortunately I'm getting the following error:

---------------------------------------------------------------------------
AttributeError                            Traceback (most recent call last)
~/anaconda3/lib/python3.6/site-packages/IPython/core/formatters.py in __call__(self, obj)
    343             method = get_real_method(obj, self.print_method)
    344             if method is not None:
--> 345                 return method()
    346             return None
    347         else:

~/anaconda3/lib/python3.6/site-packages/tensorwatch/model_graph/hiddenlayer/pytorch_draw_model.py in _repr_svg_(self)
     11     def _repr_svg_(self):
     12         """Allows Jupyter notebook to render the graph automatically."""
---> 13         return self.dot._repr_svg_()
     14     def save(self, filename, format="png"):
     15         # self.dot.format = format

AttributeError: 'Dot' object has no attribute '_repr_svg_'

My versions are:

  • Python 3.6.9
  • IPython 7.9.0
  • Pytorch 1, 1.2 and 1.3.1 (I have tried with these three versions but nothing changes.)

I also tried to run the given example on Google Colab and it raises the same error...

Maybe the error comes from my version of IPython?

Works w cntk?

Is tensorwatch able to observe model variables and parameters in CNTK too? Also from C# or C++? Is that a feature someone is working on?

Tutorial notebook: missing "summary.show()" line

In the "notebooks/simple_logging.ipynb notebook, the 2nd to last code cell creates a Visualizer called "summary", but it is never displayed. The line "summary.show()" should be added to fix this.

import error

_thread:Thread = None
^
SyntaxError: invalid syntax

Support for S3 Stream

Is it possible to use an S3 file as the filename for a FileStream, as in

watcher = tw.Watcher(filename=r'c:\temp\test.log', port=None)
cli = tw.WatcherClient(r'c:\temp\sum.log')

I would like gensim's smart_open to open an ordinary stream from an S3 source:

from smart_open import open

# stream lines from an S3 object
with open('s3://commoncrawl/robots.txt') as s3_logger:
    watcher = tw.Watcher(filename=s3_logger, port=None)
    cli = tw.WatcherClient(s3_logger)

It would be handy to have an interface to a FileStream object for the input device, so that a read/write file stream abstraction layer like smart_open could pass in streams from different sources like S3, HDFS, etc.

Thank you

Visual affordance for controlling updates

Since one can have multiple cells, each doing visual updates, the resulting visual signals can at times be overwhelming, especially when trying to focus on a particular problem (like a code change). For times like these, it would be great to have some sort of visual affordance right along side (or part of) each visualization that would enable or disable updates. For example, it could be a checkbox labelled "update", that is checked by default. This gives the user complete control over what gets updated and when, without having to write extra code.

Some of the visualizations have a "stop interactions" button - the right idea, but the button is one-way; there appears to be no way to re-enable updates (without rerunning the code).

pip package missing json file

I tried to execute the following cnn_pred_explain notebook on Colab:
https://github.com/microsoft/tensorwatch/blob/master/notebooks/cnn_pred_explain.ipynb

But I failed to execute it because the following error appeared.

---------------------------------------------------------------------------
FileNotFoundError                         Traceback (most recent call last)
<ipython-input-5-b08090dd95a6> in <module>()
     10 image_utils.show_image(img)
     11 probabilities = imagenet_utils.predict(model=model, images=[img])
---> 12 imagenet_utils.probabilities2classes(probabilities, topk=5)
     13 input_tensor = imagenet_utils.image2batch(img)
     14 prediction_tensor = pytorch_utils.int2tensor(239)

2 frames
/usr/local/lib/python3.6/dist-packages/tensorwatch/imagenet_utils.py in __init__(self, json_path)
     54         json_path = json_path or os.path.join(os.path.dirname(__file__), 'imagenet_class_index.json')
     55 
---> 56         with open(os.path.abspath(json_path), "r") as read_file:
     57             class_json = json.load(read_file)
     58             self._idx2label = [class_json[str(k)][1] for k in range(len(class_json))]

FileNotFoundError: [Errno 2] No such file or directory: '/usr/local/lib/python3.6/dist-packages/tensorwatch/imagenet_class_index.json'

My guess is that the pip package does not include the json file.

Reference
A Simple Guide for Python Packaging
https://medium.com/small-things-about-python/lets-talk-about-python-packaging-6d84b81f1bb5

RuntimeError: you can only change requires_grad flags of leaf variables.

Hi, thanks for the great work!

When I set results = saliency.get_image_saliency_results(model, img, input_tensor, prediction_tensor, methods=['gradcam', 'smooth_grad']) in cnn_pred_explain.ipynb, this error happened:

Traceback (most recent call last):
  File "debug.py", line 18, in <module>
    results = saliency.get_image_saliency_results(model, img, input_tensor, prediction_tensor, methods=['gradcam', 'smooth_grad'])
  File "Anaconda3\lib\site-packages\tensorwatch\saliency\saliency.py", line 93, in get_image_saliency_results
    sal = get_saliency(model, raw_image, input, label, method=method)
  File "Anaconda3\lib\site-packages\tensorwatch\saliency\saliency.py", line 76, in get_saliency
    saliency = exp.explain(input, label, raw_input)
  File "Anaconda3\lib\site-packages\tensorwatch\saliency\backprop.py", line 146, in explain
    grad = self.base_explainer.explain(noisy_inp, ind)
  File "Anaconda3\lib\site-packages\tensorwatch\saliency\backprop.py", line 30, in explain
    return self._backprop(inp, ind)
  File "Anaconda3\lib\site-packages\tensorwatch\saliency\backprop.py", line 12, in _backprop
    inp.requires_grad = True
RuntimeError: you can only change requires_grad flags of leaf variables.

What's wrong with this?

When I want to see the net structure, I get an error.

environment:
python:3.7.5 (default, Oct 25 2019, 15:51:11)
[GCC 7.3.0]
pytorch:0.4.1.post2

my code:
import tensorwatch as tw
import time
import torchvision
net = torchvision.models.alexnet()
tw.draw_model(alexnet_model,[1,3,224,224])

the error:

TypeError                                 Traceback (most recent call last)
<ipython-input> in <module>
      4 import torchvision
      5 net = torchvision.models.alexnet()
----> 6 tw.draw_model(alexnet_model,[1,3,224,224])

~/anaconda3/envs/CenterNet/lib/python3.7/site-packages/tensorwatch/__init__.py in draw_model(model, input_shape, orientation, png_filename)
     33 def draw_model(model, input_shape=None, orientation='TB', png_filename=None): #orientation = 'LR' for landscpe
     34     from .model_graph.hiddenlayer import pytorch_draw_model
---> 35     g = pytorch_draw_model.draw_graph(model, input_shape)
     36     return g
     37

~/anaconda3/envs/CenterNet/lib/python3.7/site-packages/tensorwatch/model_graph/hiddenlayer/pytorch_draw_model.py in draw_graph(model, args)
     33     args = torch.ones(args)
     34
---> 35     dot = draw_img_classifier(model, args)
     36     return DotWrapper(dot)
     37

~/anaconda3/envs/CenterNet/lib/python3.7/site-packages/tensorwatch/model_graph/hiddenlayer/pytorch_draw_model.py in draw_img_classifier(model, dataset, display_param_nodes, rankdir, styles, input_shape)
     61     try:
     62         non_para_model = distiller.make_non_parallel_copy(model)
---> 63         g = SummaryGraph(non_para_model, dummy_input)
     64
     65         return sgraph2dot(g, display_param_nodes, rankdir, styles)

~/anaconda3/envs/CenterNet/lib/python3.7/site-packages/tensorwatch/model_graph/hiddenlayer/summary_graph.py in __init__(self, model, dummy_input, apply_scope_name_workarounds)
     88         dummy_input = distiller.convert_tensors_recursively_to(dummy_input, device=device)
     89         self.dummy_input = dummy_input
---> 90         trace, _ = jit.get_trace_graph(model_clone, dummy_input, _force_outplace=True)
     91
     92         # As of PyTorch 1.1.0, ONNX trace optimization has two issues that result in incorrect scope names

TypeError: get_trace_graph() got an unexpected keyword argument '_force_outplace'

Can you help me?
Thanks!

Provide script files for notebooks?

Thank you! This project will help a lot of people in their research.

Looking at notebooks/mnist.ipynb, how are inputs expected, specifically for the topk visualisations?

I want to understand the block:

rand_pred = train.create_stream(expr="topk_all(l, \
            batch_vals=lambda b: (b.batch.loss_all, (b.batch.input, b.batch.output), b.batch.target), \
            out_f=image_class_outf, order='rnd')", event_name='batch', throttle=2)

The tutorials specify using a lambda expression, where b contains the watcher observed arguments. I'm not sure how to use topk_all and where l comes from.

Would it possible to see the training scripts for the notebooks provided in the repository?

Using saliency when model is on cuda

Hi. I'm using Darknet (for YOLOv3) and tried to use saliency.
I was trying to explain the result of a certain CNN layer using Grad-CAM.
The dataset I've used consists of 3-channel images.
When using saliency.get_image_saliency_results, I got this message:
Expected object of backend CPU but got backend CUDA for argument #3 'index'

the code was as follows:

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = Darknet(opt.model_def, img_size=opt.img_size).to(device)
.
.
input_tensor = transforms.ToTensor()(Image.open(path))
input_tensor = input_tensor.view(1, 3, 416, 416)
prediction_tensor = pytorch_utils.int2tensor((int)(cls_pred))
raw_img = image_utils.open_image(path, convert_mode='RGB')
results = saliency.get_image_saliency_results(model, raw_img, input_tensor, prediction_tensor, methods=['gradcam'], layer_path='conv_105')

When I set prediction_tensor and input_tensor to CUDA, the same error message occurs,
and the same thing happens when I set the model to CPU.

  1. Is this function not available for models which are not in torchvision.models?
  2. If not, is it impossible when the model is on CUDA? How can I fix this problem?

The detailed error message is as follows:

Traceback (most recent call last):
  File "C:/Users/cgna-HJYu/Desktop/2019summer/animation_recognition/PyTorch-YOLOv3-master/grad_cam.py", line 117, in <module>
    results = saliency.get_image_saliency_results(model, raw_img, input_tensor, prediction_tensor, methods=['gradcam'], layer_path='conv_105')
  File "C:\Users\cgna-HJYu\Anaconda3\envs\vision\lib\site-packages\tensorwatch\saliency\saliency.py", line 93, in get_image_saliency_results
    sal = get_saliency(model, raw_image, input, label, method=method)
  File "C:\Users\cgna-HJYu\Anaconda3\envs\vision\lib\site-packages\tensorwatch\saliency\saliency.py", line 76, in get_saliency
    saliency = exp.explain(input, label, raw_input)
  File "C:\Users\cgna-HJYu\Anaconda3\envs\vision\lib\site-packages\tensorwatch\saliency\gradcam.py", line 48, in explain
    _ = super(GradCAMExplainer, self)._backprop(inp, ind)
  File "C:\Users\cgna-HJYu\Anaconda3\envs\vision\lib\site-packages\tensorwatch\saliency\backprop.py", line 25, in _backprop
    grad_out.scatter_(1, ind.unsqueeze(0).t(), 1.0)
RuntimeError: Expected object of backend CPU but got backend CUDA for argument #3 'index'

Thank you.

stream.write does not print in the watcher file

My watcher does not print anything into the log file.

watcher = tensorwatch.Watcher(filename='tensorwatch.log', port=5)

stream = watcher.create_stream(name='loss', **kwargs)
watcher.make_notebook()

# during training
stream.write((epoch, loss))

The tensorwatch.log is created, but it is always empty, so when I try to plot in Jupyter Notebook it does not show anything.

Draw_model error

Dear Sir

Thanks for the excellent work !

However when I try this,

alex_model = models.alexnet()
tw.draw_model(alex_model, [1, 3, 224, 224])

It returned an 'Only output_size=[1, 1] is supported' error.

python 3.7
pytorch 1.01

Ubuntu 16, Anaconda, virtual environment, run in Jupyter Notebook; the server crashes.


Integration with other libraries (TensorFlow, PyTorch etc)

Kudos to the team for this great library. I was wondering if it is possible to integrate tensorwatch with the other libraries mentioned. I am asking this out of curiosity that arose while using interpret, as it allows for crazy integrations.

Any example/hint would be great so that I can prepare a notebook and share.

Can this tool support pycharm?

I find this tool very useful.
But it seems that it can only be used in Jupyter Notebook.
Will this tool support other environments like PyCharm later?

Using saliency with custom nets and grayscale images

Hi,

Currently, the saliency method is not working for images with a single channel (e.g. (1, 244, 244)). At some point the code seems to require three channels. This results in shape errors, etc.

Could you please help me run the saliency code with grayscale images? Thanks.

Cannot .close() cleanly

OS: Ubuntu 18.04
Python: 3.6.9
A simple experiment:

  • Open a tw.Watcher(port=).
  • Try to reopen on the same port (fail + kernel hang).
  • Try to close the watcher (success + error).
  • Try to reopen on the port (success).

Please see the screenshots below.
watcher_reopen
watcher_close

raise RuntimeError("ONNX symbolic expected a constant value in the trace")

Hello, the model I am using is EfficientNet, with PyTorch 1.0.1, Python 3.6 and CUDA 9.0, but I get an error.

model.py

from __future__ import print_function
import argparse
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.optim as optim
from torchvision import datasets, transforms, models
from efficientnet_pytorch import EfficientNet
from efficientnet_pytorch import utils

from torchsummary import summary
from torchstat import stat
from tensorboardX import SummaryWriter
writer = SummaryWriter('log')

import torch.onnx
import tensorwatch as tw


def train(args, model, device, train_loader, optimizer, epoch):
    model.train()
    for batch_idx, (data, target) in enumerate(train_loader):
        data, target = data.to(device), target.to(device)
        optimizer.zero_grad()
        output = model(data)
        output1 = torch.nn.functional.log_softmax(output, dim=1)
        loss = F.nll_loss(output1, target)
        #loss = F.l1_loss(output, target)
        loss.backward()
        optimizer.step()

        #new ynh
        # plot a point every 10 batches for the loss curve
        if batch_idx % 10 == 0:
            niter = epoch * len(train_loader) + batch_idx
            writer.add_scalar('Train/Loss', loss.data, niter)

        if batch_idx % args.log_interval == 0:
            print('Train Epoch: {} [{}/{} ({:.0f}%)]\tLoss: {:.6f}'.format(
                epoch, batch_idx * len(data), len(train_loader.dataset),
                       100. * batch_idx / len(train_loader), loss.item()))


def test(args, model, device, test_loader, epoch):
    model.eval()
    test_loss = 0
    correct = 0
    with torch.no_grad():
        for data, target in test_loader:
            data, target = data.to(device), target.to(device)
            output = model(data)
            output1 = torch.nn.functional.log_softmax(output, dim=1)
            test_loss += F.nll_loss(output1, target, reduction='sum').item()  # sum up batch loss
            pred = output.argmax(dim=1, keepdim=True)  # get the index of the max log-probability
            correct += pred.eq(target.view_as(pred)).sum().item()

    test_loss /= len(test_loader.dataset)

    # new ynh
    writer.add_scalar('Test/Accu', test_loss, epoch)


    print('\nTest set: Average loss: {:.4f}, Accuracy: {}/{} ({:.0f}%)\n'.format(
        test_loss, correct, len(test_loader.dataset),
        100. * correct / len(test_loader.dataset)))


def main():
    # Training settings
    parser = argparse.ArgumentParser(description='PyTorch MNIST Example')
    parser.add_argument('--batch-size', type=int, default=10, metavar='N',
                        help='input batch size for training (default: 64)')
    parser.add_argument('--test-batch-size', type=int, default=10, metavar='N',
                        help='input batch size for testing (default: 1000)')
    parser.add_argument('--epochs', type=int, default=10, metavar='N',
                        help='number of epochs to train (default: 10)')
    parser.add_argument('--lr', type=float, default=0.01, metavar='LR',
                        help='learning rate (default: 0.01)')
    parser.add_argument('--momentum', type=float, default=0.5, metavar='M',
                        help='SGD momentum (default: 0.5)')
    parser.add_argument('--no-cuda', action='store_true', default=False,
                        help='disables CUDA training')
    parser.add_argument('--seed', type=int, default=1, metavar='S',
                        help='random seed (default: 1)')
    parser.add_argument('--log-interval', type=int, default=10, metavar='N',
                        help='how many batches to wait before logging training status')

    parser.add_argument('--save-model', action='store_true', default=False,
                        help='For Saving the current Model')
    args = parser.parse_args()
    use_cuda = not args.no_cuda and torch.cuda.is_available()

    torch.manual_seed(args.seed)

    device = torch.device("cuda" if use_cuda else "cpu")

    kwargs = {'num_workers': 1, 'pin_memory': True} if use_cuda else {}
    train_loader = torch.utils.data.DataLoader(
        datasets.MNIST(root='./mnist', train=True,download=True,
                       transform=transforms.Compose([
                           transforms.Resize((224), interpolation=2),
                           transforms.Grayscale(3),
                           transforms.ToTensor(),
                       ])),
        batch_size=args.batch_size, shuffle=True, **kwargs)
    test_loader = torch.utils.data.DataLoader(
        datasets.MNIST(root='./mnist', train=False, transform=transforms.Compose([
            transforms.Resize((224), interpolation=2),
            transforms.Grayscale(3),
            transforms.ToTensor(),
            transforms.Normalize((0.1307,), (0.3081,))
        ])),
        batch_size=args.test_batch_size, shuffle=True, **kwargs)

    blocks_args, global_params = utils.get_model_params('efficientnet-b0', override_params=None)
    #model = EfficientNet.from_pretrained('efficientnet-b0').to(device)#.cuda()
    model = EfficientNet(blocks_args, global_params)#.to(device)  # .cuda()

    #dummy_input = torch.rand(1, 3, 224, 224)
    #writer.add_graph(model, (dummy_input,))

    #dummy_input = torch.randn(10, 3, 224, 224, device='cuda')
    #model = model.cuda()
    #model1 = models.alexnet(pretrained=True).cuda()
    #torch.onnx.export(model1, dummy_input, "efficientnet.onnx", verbose=True)

    #print(model)
    tw.draw_model(model, [1, 3, 224, 224])

    #stat(model, (3, 224, 224))
    model.to(device)
    #summary(model, (3, 224, 224))

    print("-------------------------------------------")



    optimizer = optim.SGD(model.parameters(), lr=args.lr, momentum=args.momentum)

    for epoch in range(1, args.epochs + 1):
        train(args, model, device, train_loader, optimizer, epoch)
        test(args, model, device, test_loader, epoch)

    if (args.save_model):
        torch.save(model.state_dict(), "mnist_cnn.pt")

    writer.close()


if __name__ == '__main__':
    main()

utils.py

"""
This file contains helper functions for building the model and for loading model parameters.
These helper functions are built to mirror those in the official TensorFlow implementation.
"""

import re
import math
import collections
import torch
from torch import nn
from torch.nn import functional as F
from torch.utils import model_zoo


########################################################################
############### HELPERS FUNCTIONS FOR MODEL ARCHITECTURE ###############
########################################################################


# Parameters for the entire model (stem, all blocks, and head)
GlobalParams = collections.namedtuple('GlobalParams', [
    'batch_norm_momentum', 'batch_norm_epsilon', 'dropout_rate',
    'num_classes', 'width_coefficient', 'depth_coefficient',
    'depth_divisor', 'min_depth', 'drop_connect_rate',])


# Parameters for an individual model block
BlockArgs = collections.namedtuple('BlockArgs', [
    'kernel_size', 'num_repeat', 'input_filters', 'output_filters',
    'expand_ratio', 'id_skip', 'stride', 'se_ratio'])


# Change namedtuple defaults
GlobalParams.__new__.__defaults__ = (None,) * len(GlobalParams._fields)
BlockArgs.__new__.__defaults__ = (None,) * len(BlockArgs._fields)


def relu_fn(x):
    """ Swish activation function """
    return x * torch.sigmoid(x)


def round_filters(filters, global_params):
    """ Calculate and round number of filters based on depth multiplier. """
    multiplier = global_params.width_coefficient
    if not multiplier:
        return filters
    divisor = global_params.depth_divisor
    min_depth = global_params.min_depth
    filters *= multiplier
    min_depth = min_depth or divisor
    new_filters = max(min_depth, int(filters + divisor / 2) // divisor * divisor)
    if new_filters < 0.9 * filters:  # prevent rounding by more than 10%
        new_filters += divisor
    return int(new_filters)


def round_repeats(repeats, global_params):
    """ Round number of filters based on depth multiplier. """
    multiplier = global_params.depth_coefficient
    if not multiplier:
        return repeats
    return int(math.ceil(multiplier * repeats))


def drop_connect(inputs, p, training):
    """ Drop connect. """
    if not training: return inputs
    batch_size = inputs.shape[0]
    keep_prob = 1 - p
    random_tensor = keep_prob
    random_tensor += torch.rand([batch_size, 1, 1, 1], dtype=inputs.dtype)  # uniform [0,1)
    binary_tensor = torch.floor(random_tensor)
    output = inputs / keep_prob * binary_tensor
    return output


class Conv2dSamePadding(nn.Conv2d):
    """ 2D Convolutions like TensorFlow """
    def __init__(self, in_channels, out_channels, kernel_size, stride=1, dilation=1, groups=1, bias=True):
        super().__init__(in_channels, out_channels, kernel_size, stride, 0, dilation, groups, bias)
        self.stride = self.stride if len(self.stride) == 2 else [self.stride[0]]*2

    def forward(self, x):
        ih, iw = x.size()[-2:]
        kh, kw = self.weight.size()[-2:]
        sh, sw = self.stride
        oh, ow = math.ceil(ih / sh), math.ceil(iw / sw)
        pad_h = max((oh - 1) * self.stride[0] + (kh - 1) * self.dilation[0] + 1 - ih, 0)
        pad_w = max((ow - 1) * self.stride[1] + (kw - 1) * self.dilation[1] + 1 - iw, 0)
        if pad_h > 0 or pad_w > 0:
            #print("pad_h",x.shape[2],"pad_w",x.shape[3])
            x = F.pad(x, [pad_w//2, pad_w - pad_w//2, pad_h//2, pad_h - pad_h//2])
            #print("pad_h",x.shape[2],"pad_w",x.shape[3])
            #print("===========================")
        return F.conv2d(x, self.weight, self.bias, self.stride, self.padding, self.dilation, self.groups)


########################################################################
############## HELPERS FUNCTIONS FOR LOADING MODEL PARAMS ##############
########################################################################


def efficientnet_params(model_name):
    """ Map EfficientNet model name to parameter coefficients. """
    params_dict = {
        # Coefficients:   width,depth,res,dropout
        'efficientnet-b0': (1.0, 1.0, 224, 0.2),
        'efficientnet-b1': (1.0, 1.1, 240, 0.2),
        'efficientnet-b2': (1.1, 1.2, 260, 0.3),
        'efficientnet-b3': (1.2, 1.4, 300, 0.3),
        'efficientnet-b4': (1.4, 1.8, 380, 0.4),
        'efficientnet-b5': (1.6, 2.2, 456, 0.4),
        'efficientnet-b6': (1.8, 2.6, 528, 0.5),
        'efficientnet-b7': (2.0, 3.1, 600, 0.5),
    }
    return params_dict[model_name]


class BlockDecoder(object):
    """ Block Decoder for readability, straight from the official TensorFlow repository """

    @staticmethod
    def _decode_block_string(block_string):
        """ Gets a block through a string notation of arguments. """
        assert isinstance(block_string, str)

        ops = block_string.split('_')
        options = {}
        for op in ops:
            splits = re.split(r'(\d.*)', op)
            if len(splits) >= 2:
                key, value = splits[:2]
                options[key] = value

        # Check stride
        assert (('s' in options and len(options['s']) == 1) or
                (len(options['s']) == 2 and options['s'][0] == options['s'][1]))

        return BlockArgs(
            kernel_size=int(options['k']),
            num_repeat=int(options['r']),
            input_filters=int(options['i']),
            output_filters=int(options['o']),
            expand_ratio=int(options['e']),
            id_skip=('noskip' not in block_string),
            se_ratio=float(options['se']) if 'se' in options else None,
            stride=[int(options['s'][0])])

    @staticmethod
    def _encode_block_string(block):
        """Encodes a block to a string."""
        args = [
            'r%d' % block.num_repeat,
            'k%d' % block.kernel_size,
            's%d%d' % (block.strides[0], block.strides[1]),
            'e%s' % block.expand_ratio,
            'i%d' % block.input_filters,
            'o%d' % block.output_filters
        ]
        if 0 < block.se_ratio <= 1:
            args.append('se%s' % block.se_ratio)
        if block.id_skip is False:
            args.append('noskip')
        return '_'.join(args)

    @staticmethod
    def decode(string_list):
        """
        Decodes a list of string notations to specify blocks inside the network.

        :param string_list: a list of strings, each string is a notation of block
        :return: a list of BlockArgs namedtuples of block args
        """
        assert isinstance(string_list, list)
        blocks_args = []
        for block_string in string_list:
            blocks_args.append(BlockDecoder._decode_block_string(block_string))
        return blocks_args

    @staticmethod
    def encode(blocks_args):
        """
        Encodes a list of BlockArgs to a list of strings.

        :param blocks_args: a list of BlockArgs namedtuples of block args
        :return: a list of strings, each string is a notation of block
        """
        block_strings = []
        for block in blocks_args:
            block_strings.append(BlockDecoder._encode_block_string(block))
        return block_strings


def efficientnet(width_coefficient=None, depth_coefficient=None,
                 dropout_rate=0.2, drop_connect_rate=0.2):
    """ Creates a efficientnet model. """

    blocks_args = [
        'r1_k3_s11_e1_i32_o16_se0.25', 'r2_k3_s22_e6_i16_o24_se0.25',
        'r2_k5_s22_e6_i24_o40_se0.25', 'r3_k3_s22_e6_i40_o80_se0.25',
        'r3_k5_s11_e6_i80_o112_se0.25', 'r4_k5_s22_e6_i112_o192_se0.25',
        'r1_k3_s11_e6_i192_o320_se0.25',
    ]
    blocks_args = BlockDecoder.decode(blocks_args)

    global_params = GlobalParams(
        batch_norm_momentum=0.99,
        batch_norm_epsilon=1e-3,
        dropout_rate=dropout_rate,
        drop_connect_rate=drop_connect_rate,
        # data_format='channels_last',  # removed, this is always true in PyTorch
        num_classes=10,
        width_coefficient=width_coefficient,
        depth_coefficient=depth_coefficient,
        depth_divisor=8,
        min_depth=None
    )

    return blocks_args, global_params


def get_model_params(model_name, override_params):
    """ Get the block args and global params for a given model """
    if model_name.startswith('efficientnet'):
        w, d, _, p = efficientnet_params(model_name)
        # note: all models have drop connect rate = 0.2
        blocks_args, global_params = efficientnet(width_coefficient=w, depth_coefficient=d, dropout_rate=p)
    else:
        raise NotImplementedError('model name is not pre-defined: %s' % model_name)
    if override_params:
        # ValueError will be raised here if override_params has fields not included in global_params.
        global_params = global_params._replace(**override_params)
    return blocks_args, global_params


url_map = {
    'efficientnet-b0': 'http://storage.googleapis.com/public-models/efficientnet-b0-08094119.pth',
    'efficientnet-b1': 'http://storage.googleapis.com/public-models/efficientnet-b1-dbc7070a.pth',
    'efficientnet-b2': 'http://storage.googleapis.com/public-models/efficientnet-b2-27687264.pth',
    'efficientnet-b3': 'http://storage.googleapis.com/public-models/efficientnet-b3-c8376fa2.pth',
}

def load_pretrained_weights(model, model_name):
    """ Loads pretrained weights, and downloads if loading for the first time. """
    state_dict = model_zoo.load_url(url_map[model_name])

    pretrained_dict = {k: v for k, v in state_dict.items() if k != "_fc.weight" and k != "_fc.bias"}
    model.state_dict().update(pretrained_dict)
    model.load_state_dict(model.state_dict())

    print('Loaded pretrained weights for {}'.format(model_name))
