cvlab-yonsei / mnad Goto Github PK

View Code? Open in Web Editor NEW

332.0 12.0 82.0 1021 KB

An official implementation of "Learning Memory-guided Normality for Anomaly Detection" (CVPR 2020) in PyTorch.

Python 96.60% CSS 3.40%

mnad's Introduction

PyTorch implementation of "Learning Memory-guided Normality for Anomaly Detection"

This is the implementation of the paper "Learning Memory-guided Normality for Anomaly Detection (CVPR 2020)".

For more information, checkout the project site [website] and the paper [PDF].

Dependencies

Python 3.6
PyTorch 1.1.0
Numpy
Sklearn

Datasets

USCD Ped2 [dataset]
CUHK Avenue [dataset]
ShanghaiTech [dataset]

These datasets are from an official github of "Future Frame Prediction for Anomaly Detection - A New Baseline (CVPR 2018)".

Download the datasets into dataset folder, like ./dataset/ped2/

Update

02/04/21: We uploaded the codes based on reconstruction method, and pretrained wieghts for Ped2 reconstruction, Avenue prediction and Avenue reconstruction.

Training

~~The training and testing codes are based on prediction method~~
Now you can implemnet the codes based on both prediction and reconstruction methods.
The codes are basically based on the prediction method, and you can easily implement this as

git clone https://github.com/cvlab-yonsei/projects
cd projects/MNAD/code
python Train.py # for training

You can freely define parameters with your own settings like

python Train.py --gpus 1 --dataset_path 'your_dataset_directory' --dataset_type avenue --exp_dir 'your_log_directory'

For the reconstruction task, you need to newly set the parameters, e.g,, the target task, the weights of the losses and the number of the time sequence.

python Train.py --method recon --loss_compact 0.01 --loss_separate 0.01 --t_length 1 # for training

Evaluation

Test your own model
Check your dataset_type (ped2, avenue or shanghai)

python Evaluate.py --dataset_type ped2 --model_dir your_model.pth --m_items_dir your_m_items.pt

For the reconstruction task, you need to set the parameters as

python Evaluate.py --method recon --t_length 1 --alpha 0.7 --th 0.015 --dataset_type ped2 --model_dir your_model.pth --m_items_dir your_m_items.pt

Test the model with our pre-trained model and memory items

python Evaluate.py --dataset_type ped2 --model_dir pretrained_model.pth --m_items_dir m_items.pt

Pre-trained model and memory items

Download our pre-trained model and memory items
[Ped2 Prediction]
[Ped2 Reconstruction]
[Avenue Prediction]
[Avenue Reconstruction]
Note that, you need to set lambda and threshold to 0.7 and 0.015, respectively, for the reconstruction task. See more details in the paper.

Bibtex

@inproceedings{park2020learning,
  title={Learning Memory-guided Normality for Anomaly Detection},
  author={Park, Hyunjong and Noh, Jongyoun and Ham, Bumsub},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={14372--14381},
  year={2020}
}

mnad's People

Contributors

Stargazers

Watchers

mnad's Issues

IndexError: list index out of range

I test the model with your pre-trained model and memory items, but it prompts the following error：

Traceback (most recent call last):
File "Evaluate.py", line 118, in
label_length += videos[videos_list[video_num].split('/')[-1]]['length']
IndexError: list index out of range

About gpu configuration

What is the configuration of the GPU you are using, i feel my 2060 is not enough

RuntimeError

Prediction code is well utilized.

Thank you for the intuitive usage.

I am trying to learn about Reconstruction this time, but I encountered the following error.

RuntimeError: Given groups=1, weight of size [64, 3, 3, 3], expected input[4, 15, 256, 256] to have 3 channels, but got 15 channels instead

The sample code you provided
I entered python Train.py --method recon --loss_compact 0.01 --loss_separate 0.01 as it is, but I get the same error as above.

Is there any solution?

FileNotFoundError: [Errno 2] No such file or directory: './data/frame_labels_ch1.npy'

Hi, I have trained my dataset successfully. At the evaluate step I'm getting this error. I know I have to create labels file as frame_labels_ch1.npy but how? I have checked your other python scripts out in your repository but I couldn't find anything about it in there. Could you guide me how to create labels file as above?

About pytorch bug

As I know, pytorch ver. 1.1 has a bug(I don't know it is really bug) that it doesn't track the mean and variance from training set at evaluation step.
And it makes results over repeated evaluation different.
At first evaluation, futhermore, the evaluation result become too poor because there is no tracked mean & var.
I also tried to replicate your result by your own code, but it couldn't generate 88.5(avenue) or 97.0(ped2).
Did you notice the pytorch bug?
How could you acheive the result even with this bug?
I 'll appreciate your reply. Thank you.

Visualization

Thanks for the perfect code.
The evaluation was able to proceed without any difficulty.
Could you please explain how to visualize the result as a picture as suggested in the paper?

NameError: name 'loss_pixel' is not defined

when i run python Train.py --gpus 1 --dataset_path 'your_dataset_directory' --dataset_type avenue --exp_dir 'your_log_directory'
the problem raise,How to solve

Is this model real time ?

How to plot the abnormality score like on the video demo on the project web page ?

About t-SNE

Can I get the code for this part of t-SNE?

AttributeError: 'convAE' object has no attribute 'clone'

I used pytorch1.1.0,and no change the code,but show this issue:

Traceback (most recent call last):
File "/home/lh/MNAD-master1/Evaluate.py", line 121, in
m_items_test = m_items.clone()
File "/home/lh/anaconda3/envs/torch11/lib/python3.6/site-packages/torch/nn/modules/module.py", line 539, in getattr
type(self).name, name))
AttributeError: 'convAE' object has no attribute 'clone'

ValueError: Input contains NaN, infinity or a value too large for dtype('float64').

Hi, i`m trying to experiment your model to the Shanghaitech dataset. but i got a error when evaluating the code.
"ValueError: Input contains NaN, infinity or a value too large for dtype('float64')."
It would be very much appreciated if you could provide me some help.
this is my error

there is a nan value in the anomaly_score_tatal_list of the Evaluate.py.
how can i solve this error??

All the best.

how to get the labels of test data?

use the .mat files?

why can't I achieve the accuracy in this article wtih my own trained model,while I can achieve it with yours

Unable to save recreated images and run model real time on a video

Hello!
Thank you for your great contribution towards anomaly detection. I am currently getting started with your repo and have trained the model on the UCSD dataset. However, how do I create a demo showing the anomaly scores real time as the video plays and how do I generate the reconstructions. Is a script available for the same? Thank you!

result between pred and recon

Find anomaly in video

Hello @hyunjp @njyoun

How to find anomalies in video after getting pretrained model or trained model of myself?

Thank you

Faulty evaluation calculation?

Isn't the evaluation calculation wrong?
Considering Ped2, why are the scores normalized between 0 and 1 for each of the 12 video clips individually? This isn't correct as the camera view is the same in all of the clips.

RuntimeError:

Traceback (most recent call last):
File "D:/python/pycharm/MNAD-master/Evaluate.py", line 139, in
outputs, feas, updated_feas, m_items_test, softmax_score_query, softmax_score_memory, _, _, _, compactness_loss = model.forward(imgs[:,0:3*4], m_items_test, False)
File "D:\python\pycharm\MNAD-master\model\final_future_prediction_with_memory_spatial_sumonly_weight_ranking_top1.py", line 135, in forward
fea, skip1, skip2, skip3 = self.encoder(x)
File "D:\python\Anaconda\envs\MNAD\lib\site-packages\torch\nn\modules\module.py", line 1102, in _call_impl
return forward_call(*input, **kwargs)
File "D:\python\pycharm\MNAD-master\model\final_future_prediction_with_memory_spatial_sumonly_weight_ranking_top1.py", line 46, in forward
tensorConv1 = self.moduleConv1(x)
File "D:\python\Anaconda\envs\MNAD\lib\site-packages\torch\nn\modules\module.py", line 1102, in _call_impl
return forward_call(*input, **kwargs)
File "D:\python\Anaconda\envs\MNAD\lib\site-packages\torch\nn\modules\container.py", line 141, in forward
input = module(input)
File "D:\python\Anaconda\envs\MNAD\lib\site-packages\torch\nn\modules\module.py", line 1102, in _call_impl
return forward_call(*input, **kwargs)
File "D:\python\Anaconda\envs\MNAD\lib\site-packages\torch\nn\modules\conv.py", line 446, in forward
return self._conv_forward(input, self.weight, self.bias)
File "D:\python\Anaconda\envs\MNAD\lib\site-packages\torch\nn\modules\conv.py", line 443, in _conv_forward
self.padding, self.dilation, self.groups)
RuntimeError: Given groups=1, weight of size [64, 12, 3, 3], expected input[1, 3, 256, 256] to have 12 channels, but got 3 channels instead

Pretrained models

Hi, the link to the pretrained files seems to be broken. Can you please point to the locations.

about prediction task codes ？

can someone tell me that how U-Net is training in this whole code because i am not able to see any code which is used to train U-Net which is predicting a new frame.

RuntimeError: The size of tensor a (3) must match the size of tensor b (516) at non-singleton dimension 0

Hi when I use ped2 (args.method == 'pred) in Evaluate.py following line

torch.mean(loss_func_mse((outputs[0]+1)/2, (imgs[0,3*4:]+1)/2)).item() causes

RuntimeError: The size of tensor a (3) must match the size of tensor b (516) at non-singleton dimension 0.

This code can be used in not video frame detection conditions?

Hello！
First, thank you so much for this contribution.
My question:
This code is for abnormal video frame detection, I wonder if it can be used in other conditions, e.g., one small abnormal group of images different from most normal images, not video frames, but medical images etc.
Thank you !

replicate results on CUHK Avenue dataset

Hi, Thanks a lot for your great work!

I want to replicate your results on the CUHK Avenue dataset, but I can only get around 70% anomaly detection accuracy. Then I give another read on the paper. I found there are several places that I am not quite sure:

I trained the model using PyTorch 1.1.0, with 60 epochs, initial learning rate 2e-4 which is decayed by the cosine annealing method. However, it seems that the Separateness jumps up and down, so I am wondering is it because of my learning rate too high?
On page 6, it's said that "We set initial learning rates to 2e-5 and 2e-4, respectively, for reconstruction and prediction tasks", but from what I have seen from the code, it seems that there is only one learning rate.

Could you please give me some help with this? Thanks a lot in advance!

NameError: name 'loss_pixel' is not defined

I followed the steps in the readme and reported the error as follows:
/home/anaconda3/envs/MNAD/lib/python3.6/site-packages/torch/optim/lr_scheduler.py:134: UserWarning: Detected call of lr_scheduler.step() before optimizer.step(). In PyTorch 1.1.0 and later, you should call them in the opposite order: optimizer.step() before lr_scheduler.step(). Failure to do this will result in PyTorch skipping the first value of the learning rate schedule. See more details at https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
"https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate", UserWarning)
Traceback (most recent call last):
File "Train.py", line 148, in
print('Loss: Prediction {:.6f}/ Compactness {:.6f}/ Separateness {:.6f}'.format(loss_pixel.item(), compactness_loss.item(), separateness_loss.item()))
NameError: name 'loss_pixel' is not defined

How to solve it? Thank you

The downloaded shanghaitech dataset is a video. How to convert it into a frame? Or can you put a download link to the Shanghai dataset?

I want to check for 0 or 1 per frame.

I found that the anomaly score was calculated for each frame in Evaluate.py.
Finally, the AUC value was also confirmed.

However, for qualitative analysis of results, i want to identify TP, TN, FP and FN for each frame.
In conclusion, in order ro know which frame is judged as normal or anomaly, i want to know which part of the code to look at and understand.

Thank you.

how to judge the abnormal according to the anomaly score?

what is the threshold for anomaly detection?

evaluate.py

when i use the model provided by the author to evaluate it, "ValueError: Found input variables with inconsistent numbles of samples:[1966,7056]". Has someone else had the same problem?

Zero values for compactness loss

Hi,

Thank you very much for the code and interesting paper!

We are trying to run your method on our dataset, but seem to often get zero values for compactness loss.
This is specifically a problem when we try to normalize the "feature_distance_list" during inference.

Could you please advise?

Also, if you have any suggestions how to scale the number of epochs with the size of the dataset, please share.

Niv.

Error using my own dataset

Hello, when I use my own dataset for training, the following error occurs, but this error does not occur when I use CUHK Avenue. My dataset directory is as follows，can you tell me why and how to solve it? Thank you！

about shanghai dataset on evaluation

The model can be trained normally during training, but an error is reported during testing. Does anyone know how to solve it?

File "F:\PycharmProjects\MNAD\Evaluate1.py", line 114, in valrec1
labels_list = np.append(labels_list,labels[0][4+label_length:videos[video_name]['length']+label_length])
IndexError: invalid index to scalar variable.
This error is also reported on Linux

The following is the structure of Shanghai data set:

Problem about trainset of ShanghaiTech

I found that all videos from ped1, Avenue and the test set of Shanghaitech of science and technology were extracted into frames. However, Shanghaitech's trainsets are in video format. How to extract it?

How to make the reconstruction more accurate?

Thank you for sharing this project.

I am using this project for medical images, and the reconstruction result is:

In the above figure, the left image is the original CT image, and the right image is the reconstructed image. The reconstructed image is blur. I used the default setting. How can I make the reconstruction result more accurate？

The only way I can think of is to make the encoder/decoder deeper. Is there any other method to make the reconstruction result more accurate? It's appreciated for any suggestion.

An error occurred while training the Shanghai dataset

Traceback (most recent call last):
File "Train.py", line 154, in
for j,(imgs) in enumerate(train_batch):
File "/root/userfolder/software/anaconda3/envs/MemG/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 819, in next
return self._process_data(data)
File "/root/userfolder/software/anaconda3/envs/MemG/lib/python3.6/site-packages/torch/utils/data/dataloader.py", line 846, in _process_data
data.reraise()
File "/root/userfolder/software/anaconda3/envs/MemG/lib/python3.6/site-packages/torch/_utils.py", line 369, in reraise
raise self.exc_type(msg)
IndexError: Caught IndexError in DataLoader worker process 1.
Original Traceback (most recent call last):
File "/root/userfolder/software/anaconda3/envs/MemG/lib/python3.6/site-packages/torch/utils/data/_utils/worker.py", line 178, in _worker_loop
data = fetcher.fetch(index)
File "/root/userfolder/software/anaconda3/envs/MemG/lib/python3.6/site-packages/torch/utils/data/_utils/fetch.py", line 44, in fetch
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/root/userfolder/software/anaconda3/envs/MemG/lib/python3.6/site-packages/torch/utils/data/_utils/fetch.py", line 44, in
data = [self.dataset[idx] for idx in possibly_batched_index]
File "/root/userfolder/code/projects1/MNAD/FFPMem/model/utils.py", line 71, in getitem
image = np_load_frame(self.videos[video_name]['frame'][frame_name+i], self._resize_height, self._resize_width)
IndexError: list index out of range

model/utils.py——>in getitem(),【return np.concatenate (batch, axis = 0)】 what does this line mean?

How to decide number of memory items "m" ?

Hi, great work. There is something I would like to know. How do you decide the number of memory items? Like in your paper you used 10 memory items How do you decide that number ? Is there a rationale behind it ? I'll be very grateful to get an answer. Thank you

a question about the code of calculating the accuracy

In the Evaluate.py ,there is a code :
accuracy = AUC(anomaly_score_total_list, np.expand_dims(1-labels_list, 0))
When label is 0 , the input is Normal
why there is: 1-label_list ?

Your Project HVPR is not complete.

CUHK03 dataset and DukeMTMC-reID dataset

Hi,the work RRID is excellent, and I has followed your framework. However, recently when I try to train CUHK03 dataset,it cannot to be uploaded. Could you please provide the dataset link with its json data? Thank you very much!
Looking forward to your reply!

A question about the structure of reconstruction network

In the Encoder of reconstruction network, some code is:

        def Basic(intInput, intOutput):
            return torch.nn.Sequential(
                torch.nn.Conv2d(in_channels=intInput, out_channels=intOutput, kernel_size=3, stride=1, padding=1),
                torch.nn.BatchNorm2d(intOutput),
                torch.nn.ReLU(inplace=False),
                torch.nn.Conv2d(in_channels=intOutput, out_channels=intOutput, kernel_size=3, stride=1, padding=1),
                torch.nn.BatchNorm2d(intOutput),
                torch.nn.ReLU(inplace=False)
            )

        def Basic_(intInput, intOutput):
            return torch.nn.Sequential(
                torch.nn.Conv2d(in_channels=intInput, out_channels=intOutput, kernel_size=3, stride=1, padding=1),
                torch.nn.BatchNorm2d(intOutput),
                torch.nn.ReLU(inplace=False),
                torch.nn.Conv2d(in_channels=intOutput, out_channels=intOutput, kernel_size=3, stride=1, padding=1),
            )

I wonder why the Basic_ do not include BatchNorm2d and ReLU for the second convolution kernel.

Any suggestion is appreciated~

display abnormal region

Thanks for the perfect code.
I want to display the prediction errot,how can I display it?
thanks for any suggestion!

some questions about Evaluate

Traceback (most recent call last):
File "D:\Pycharm\PyCharm Community Edition 2021.3.2\plugins\python-ce\helpers\pydev\pydevd.py", line 1483, in _exec
pydev_imports.execfile(file, globals, locals) # execute the script
File "D:\Pycharm\PyCharm Community Edition 2021.3.2\plugins\python-ce\helpers\pydev_pydev_imps_pydev_execfile.py", line 18, in execfile
exec(compile(contents+"\n", file, 'exec'), glob, loc)
File "E:/huyi/MNAD/Evaluate.py", line 184, in
main()
File "E:/huyi/MNAD/Evaluate.py", line 142, in main
outputs, feas, updated_feas, m_items_test, softmax_score_query, softmax_score_memory, _, _, _, compactness_loss = model.forward(imgs[:, 0:3 * 4], m_items_test, False)
File "E:\huyi\MNAD\model\final_future_prediction_with_memory_spatial_sumonly_weight_ranking_top1.py", line 150, in forward
updated_fea, keys, softmax_score_query, softmax_score_memory,query, top1_keys, keys_ind, compactness_loss = self.memory(fea, keys, train)
File "D:\Anaconda\envs\MNAD\lib\site-packages\torch\nn\modules\module.py", line 493, in call
result = self.forward(*input, **kwargs)
File "E:\huyi\MNAD\model\memory_final_spatial_sumonly_weight_ranking_top1.py", line 148, in forward
compactness_loss, query_re, top1_keys, keys_ind = self.gather_loss(query,keys, train)
File "E:\huyi\MNAD\model\memory_final_spatial_sumonly_weight_ranking_top1.py", line 215, in gather_loss
softmax_score_query, softmax_score_memory = self.get_score(keys, query)
File "E:\huyi\MNAD\model\memory_final_spatial_sumonly_weight_ranking_top1.py", line 120, in get_score
score = torch.matmul(query, torch.t(mem))# b X h X w X m
RuntimeError: cublas runtime error : the GPU program failed to execute at C:/w/1/s/tmp_conda_3.6_035809/conda/conda-bld/pytorch_1556683229598/work/aten/src/THC/THCBlas.cu:259

my environment is windows+A5000+python 3.6.2+pytorch 1.1.0
Could you tell me how I can solve it?

how to visualizing the anomalies like Fig 4，Could you share the code? Thanks!

dataset path error

ValueError: num_samples should be a positive integer value, but got num_samples=0

In this code,the path is right,while the videos list can't be achieved by 'glob.glob(os.path.join(self.dir, '*'))' in model/utils.py.

I conducted the experiment according to the paper, but there is a gap between the experimental results and the paper. I hope to get your help.

Hello author:
I conducted a reconstruction experiment, but there was a gap between the results of the paper.
I set parameters as ：
method recon;loss_compact 0.01;loss_separate 0.01;t_length 1;alpha 0.7;
th 0.015 .
Experimental AUC results : ped2 87. 355%; avenue 72.122%; shanghai 68.086% .
How can I achieve the expected experimental results. Thank you for your attention.

Why can't I get the auc of 97%?

Use this code, Why can't I get the auc of 97%?, train 60 epochs on ped2 , get the auc of 89%

What's the difference between reconstruction task and prediction task?

Hi,

I am confused about the difference between the reconstruction task and the prediction task. May I know their difference?

Thanks

why can not I get AUC 72 on Shanghai Tech? My pytorch version is 1.1.0.
The training data in ShanghaiTech are more than that in Ped2 or Avenue, but the training epoch num is set as 10. Is it enough for convergence?