htqin / IR-Net
[CVPR 2020] This project is the PyTorch implementation of our accepted CVPR 2020 paper: Forward and Backward Information Retention for Accurate Binary Neural Networks.
Hello, when running the ResNet20-1w1a code with the CIFAR-10 dataset, it reported:
model.modules.layer1[i].conv[i].k = k
AttributeError: 'function' object has no attribute 'layer1'
at IR-Net/CIFAR-10/ResNet20/1w1a/trainer.py, line 147.
I have no idea why this problem occurs.
Thanks for your reply.
Hi, author team,
After reading your paper and code, I wonder how to train the network prototypes.
The training code is important for reproduction; please release it later. Thanks a lot. 👍
As you propose in the paper, EDE and Libra-PB both benefit network performance, so I am curious how the training is done.
A problem appears when I run "python main.py" in VGG-Small:
RuntimeError: arguments are located on different GPUs at /opt/conda/conda-bld/pytorch_1533672544752/work/aten/src/THC/generated/../generic/THCTensorMathPointwise.cu:314
How can I deal with it?
If it is written this way, the gradients returned for k and t are both None, so they will not be updated at all, right?
Hello, thank you for your excellent work!
I still have a question: does achieving the high inference speed of a BNN require deployment on a high-performance inference framework such as dabnn? Otherwise, will the inference speed differ little from the original full-precision model?
Can you give details of the training steps?
Hi everyone,
I noticed that in the CIFAR-10 folder, IR-Net has a trainer.py file for each experiment setting. But in the ImageNet folder there is no such trainer.py, only a .yaml file.
Does this mean we need to write a trainer.py ourselves based on the hyperparameters in the yaml file for the ImageNet experiments? Or is there a way to train on ImageNet using these yaml files directly?
Thank you!
Hello! When I run your code with multiple GPUs, I hit this problem:
RuntimeError: binary_op(): expected both inputs to be on same device, but input a is on cuda:3 and input b is on cuda:0
When using a single GPU it is fine. I checked the tensors and found that k and t are stored on GPU 0, which causes the problem.
I don't know if you have any idea how to solve it.
Thanks!
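One likely cause of this cross-device error is that `k` and `t` are stored as plain tensor attributes, which `nn.DataParallel` does not replicate to each GPU the way it replicates parameters. A minimal sketch of a possible fix, registering them as buffers so they move with the module (this is an assumption about the repo's `IRConv2d`, not its actual code):

```python
import torch
import torch.nn as nn

# Hypothetical IRConv2d sketch: k and t registered as buffers instead of
# bare tensor attributes. Buffers are replicated by nn.DataParallel and
# moved by .to()/.cuda(), which avoids "input a is on cuda:3 ..." errors.
class IRConv2d(nn.Conv2d):
    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.register_buffer("k", torch.tensor([10.0]))
        self.register_buffer("t", torch.tensor([0.1]))

conv = IRConv2d(3, 16, 3)
# Buffers travel with the module's state, unlike plain attributes.
assert "k" in dict(conv.named_buffers())
assert "t" in dict(conv.named_buffers())
```

Registered buffers also end up in `state_dict()`, so checkpoints save and restore `k`/`t` automatically.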
Hello, sorry for the similar question.
You mentioned that you used the Bi-Real Net structure for a fair comparison.
Is it common to use the Bi-Real Net structure for quantization, even for 1-bit weights with 32-bit activations?
I mean, did the other networks in your paper, such as BWN and HWGQ, use the same structure without binarizing the downsample layers?
Hello,
Thank you very much for sharing your work.
When I trained ResNet18 on CIFAR-10, it gave 86% accuracy on the validation data.
I used the same hyperparameters as https://github.com/kuangliu/pytorch-cifar, which you point to in the paper.
Nice work!!! Thank you for sharing the code. When I checked the code, I found that the downsample layers for ResNet on ImageNet are not binarized to 1-bit values. Is that correct? On CIFAR-10, is the downsample layer quantized?
This is a huge gap.
Network | Bit-width (W/A) | Accuracy (%)
--- | --- | ---
ResNet-18 | 1/1 | 91.5
ResNet-20 | 1/1 | 86.5
@htqin Thank you for your quick response! I intended to ask this in this issue, but it was closed and this seems to be a new question, so I open a new issue for this question. Thank you a lot!
I notice in the yaml:

```yaml
augmentation:
  input_size: 224
  test_resize: 256
  colorjitter: [0.2, 0.2, 0.2, 0.1]
```
Thank you! Your efforts and the spirit of open source will benefit the community a lot!
```python
grad_input = k * t * (1 - torch.pow(torch.tanh(input * t), 2)) * grad_output
```

Expected all tensors to be on the same device, but found at least two devices, cuda:1 and cuda:0!
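For context, that gradient line is the backward pass of the paper's EDE approximated-sign estimator. A minimal self-contained sketch of such a custom autograd function (the class name and treating `k`/`t` as fixed schedule values are assumptions for illustration):

```python
import torch

# Sketch of an EDE-style straight-through estimator: forward binarizes with
# sign(x); backward approximates the gradient with k*t*(1 - tanh(t*x)^2).
class EDEBinarize(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x, k, t):
        ctx.save_for_backward(x, k, t)
        return torch.sign(x)

    @staticmethod
    def backward(ctx, grad_output):
        x, k, t = ctx.saved_tensors
        grad_input = k * t * (1 - torch.pow(torch.tanh(x * t), 2)) * grad_output
        # k and t are treated as non-learnable schedule values here,
        # so no gradient flows back to them (they receive None).
        return grad_input, None, None

x = torch.randn(4, requires_grad=True)
y = EDEBinarize.apply(x, torch.tensor(1.0), torch.tensor(1.0))
y.sum().backward()  # x.grad is populated via the surrogate gradient
```

In multi-GPU runs, `x`, `k`, and `t` must sit on the same device when that backward line executes, which is exactly what the error above is complaining about.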
Hello, everyone,
Thank @htqin 's great work!
I noticed the training setting of VGG-Small uses cosine decay with 300 max epochs, while the reference cited in the paper uses step decay with 400 max epochs.
By running main.py, I can only get 87.80%, while the number reported in the paper is 90.40%. However, the full-precision version I reproduced with this training setting reaches 91.79%, while the paper reports 91.70%. In other words, I can reproduce the reported full-precision baseline for VGG-Small using the training setting in this repo, but not the reported binarized IR-Net result. This confuses me; I was wondering whether there is a mismatch in the training setting of VGG-Small@CIFAR-10. I am also looking forward to discussing the training settings you used when reproducing VGG-Small@CIFAR-10!
Thank you!
Best regards,
Can you please provide a training file (trainer.py) for the ImageNet dataset? Thanks.
I have tried VGG-Small on CIFAR-10 and can get results similar to those mentioned in your paper. Your work is really useful.
However, I have trouble deploying your model to dabnn when converting the ONNX model to the format dabnn supports.
So I wonder if you could open-source the actual code used to deploy to a real ARM device,
especially the model-conversion part.
Thank you.
Hi,
I read your paper, where you wrote that you binarize all convolutional and FC layers except the first and last ones.
However, I see from your code that the downsample convolutions also aren't quantized (they are not replaced with IRConv2d, e.g. https://github.com/htqin/IR-Net/blob/master/CIFAR-10/ResNet18/resnet.py#L28). What am I missing?
Thank you
Hello, thank you for your work!
I got a model of size 81.3 MB when I used the resnet.py file in your "IR-Net-master\ImageNet\ResNet34\1w1a\models" directory.
This is the same size as my model using the full-precision ResNet34 network.
How do I compress my model?
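A likely explanation: the checkpoint stores the latent weights as float32, so saving it gives the full-precision size. Realizing the ~32x reduction requires packing the sign bits yourself (or using a deployment framework such as dabnn). A minimal sketch of such packing, assuming NumPy (the helper name is hypothetical):

```python
import numpy as np

def pack_binary_weights(w: np.ndarray) -> np.ndarray:
    """Pack the sign bits of a float weight tensor into a uint8 array."""
    bits = (w.reshape(-1) >= 0).astype(np.uint8)  # sign -> {0, 1}
    return np.packbits(bits)                      # 8 signs per byte

# A float32 conv weight tensor: 64*64*3*3 values * 4 bytes each.
w = np.random.randn(64, 64, 3, 3).astype(np.float32)
packed = pack_binary_weights(w)
print(w.nbytes // packed.nbytes)  # → 32
```

The scaling factors and the non-binarized first/last layers stay in float, so the end-to-end model shrinks somewhat less than 32x.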
Hi,
I'm following your brilliant work and thanks for your sharing.
However, I found that the parameters 'k' and 't' in ir_1w1a.py seem to be fixed, which is not the same as in your paper.
Does this difference matter?