weijiawu / transvtspotter Goto Github PK

View Code? Open in Web Editor NEW

76.0 76.0 11.0 80.12 MB

A new video text spotting framework with Transformer

Python 90.50% Jupyter Notebook 4.45% Shell 0.01% C++ 0.47% Cuda 4.57%

transvtspotter's People

Contributors

Stargazers

Watchers

Forkers

hn18001 980044579 cv-ip vijin-freelancing shiyi-mu lsabrinax swall0w devindesilva shualite aniketgurav amutong

transvtspotter's Issues

Couldn't get the json file

there was an error "FileNotFoundError: [Errno 2] No such file or directory: './output/ICDAR15/test/best_json_tracks/res_video_1.mp4.json"

I downloaded the IC15 video dataset and run "python track_tools/convert_ICDAR15video_to_coco.py".

And I couldn't find files in json or jpg format downloaded from the icdar website https://rrc.cvc.uab.es/?ch=3&com=downloads.
Unziped files only have '.mp4' or '.xml' and '***.txt'

How could I get the json annotatation files such as 'res_video_1.mp4.json'?

The appendix

Great Work! Can you provide the url to the appendix?

RuntimeError: median cannot be called with empty tensor

Traceback (most recent call last):
File "main_track.py", line 363, in
main(args)
File "main_track.py", line 326, in main
model, criterion, data_loader_train, optimizer, device, epoch, args.clip_max_norm)
File "TransVTSpotter/engine_track.py", line 41, in train_one_epoch
for _ in metric_logger.log_every(range(len(data_loader)), print_freq, header):
File "TransVTSpotter/util/misc.py", line 260, in log_every
meters=str(self),
File "TransVTSpotter/util/misc.py", line 210, in str
"{}: {}".format(name, str(meter))
File "TransVTSpotter/util/misc.py", line 109, in str
median=self.median,
File "TransVTSpotter/util/misc.py", line 88, in median
return d.median().item()
RuntimeError: median cannot be called with empty tensor

l think there might be something wrong with the datasets. My path of the datasets is as below:

Is that right? Can u give me some examples of the structure of the datasets or the solution to this error? Thanks!

Upload pretrain weights to google drive

Could you please upload the pretrain weights to google drive as its not available to download in other countries. Thanks

About Recognition model

Hi, the recognition model in your paper is MASTER. l know for some reasons u can't open the recognition code. Could you please tell me whether u use the Vanilla MASTER or the modified one？Thanks！

No res_video_1.json after running "python track_tools/convert_ICDAR15video_to_coco.py"

Hi,

Thanks for your great work!

I am a bit confused after I run the
python track_tools/Evaluation_ICDAR15_video/vis_tracking.py
Then, I get
"No such file or directory: './output/ICDAR15/test/best_json_tracks/res_video_1.json'

I have seen issue #2 , and confirm I have run the
python track_tools/convert_ICDAR15video_to_coco.py
But, it seems that the "res_video_1.json" has not been generated successfully.
I only find "train.json" and "test.json" under the "annotations_coco_rotate/", should I name one of them to "res_video_1.json" and copy it to "./output/ICDAR15/test/best_json_tracks/res_video_1.json"?

Plz, help me! Thanks a lot!

关于icdar2015_video的问题

作者你好，我最近在跟进TransVTSpotter，从官网下载的icdar2015_video测试集中并未提供标注结果，但在你们提供的convert_ICDAR2015video_to_coco.py代码里面有处理测试集xml的代码，想问一下如果你们有测试集标注结果的话，能否提供一下

Cannot reproduce results.

Thank you for the nice work! I'm having problems reproducing the results in your paper. I was hoping you can help.

I have done the following steps.

Download ICDAR15 video training and official test video dataset.
Prepare training and test dataset folder using: video2frames & convert_ICDAR15video_to_coco.
Download pretrain_coco.pth from your Baidu drive.
Train on ICDAR15 video using python -m torch.distributed.launch --nproc_per_node=8 --use_env main_track.py --output_dir ./output/icdar_tiv --dataset_file text --coco_path "${MY_DATA_DIR}/icdar_tiv" --batch_size 2 --with_box_refine --num_queries 300 --epochs 80 --lr_drop 40 --resume ./pths/pretrain_coco.pth.
Generate inferences using trained model on official test set: python main_track.py --eval --output_dir ./output/icdar_tiv_submit --resume ./output/icdar_tiv/checkpoint0079.pth --dataset_file text --coco_path "${MY_DATA_DIR}/icdar_tiv_test" --batch_size 1 --with_box_refine --num_queries 300
Zip up the results in output/icdar_tiv_submit/text/xml_dir.
Submit results to official ICDAR2015.

The resulting MOTA is 2.08% and very far from the expected ~45%. Note that the "Mostly Matched" is 842 matching reported results, so it seems that the object detection is working, but tracking is failing. Am I missing something from the code? Thanks for any help.

weijiawu / transvtspotter Goto Github PK

transvtspotter's People

Contributors

Stargazers

Watchers

Forkers

transvtspotter's Issues

Couldn't get the json file

The appendix

RuntimeError: median cannot be called with empty tensor

Upload pretrain weights to google drive

About Recognition model

No res_video_1.json after running "python track_tools/convert_ICDAR15video_to_coco.py"

关于icdar2015_video的问题

Cannot reproduce results.

incompatible function arguments

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs