GithubHelp home page GithubHelp logo

mxin262 / bridging-text-spotting Goto Github PK

View Code? Open in Web Editor NEW
40.0 40.0 0.0 1.49 MB

(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.

Home Page: https://arxiv.org/pdf/2404.04624.pdf

License: Other

Python 93.48% C++ 2.01% Cuda 4.08% Shell 0.34% Dockerfile 0.07% Makefile 0.01% CMake 0.01%

bridging-text-spotting's People

Contributors

mxin262 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

bridging-text-spotting's Issues

python setup.py build develop

运行编译时 一直报错,尝试多种办法也无果,报错信息如下,能麻烦帮忙看看吗?

cc1plus: warning: command line option ‘-Wstrict-prototypes’ is valid for C/ObjC but not for C++
In file included from /home/[email protected]/Bridging-Text-Spotting/adet/layers/csrc/DeformAttn/ms_deform_attn_cpu.cpp:14:
/home/[email protected]/.conda/envs/Bridge/lib/python3.8/site-packages/torch/include/ATen/cuda/CUDAContext.h:5:10: fatal error: cuda_runtime_api.h: No such file or directory
5 | #include <cuda_runtime_api.h>
| ^~~~~~~~~~~~~~~~~~~~
compilation terminated.
error: command 'gcc' failed with exit status 1

RuntimeError

4181713782253_ pic

I downloaded the annotation files for the tt dataset using the link you provided and successfully loaded the dataset and weights. However, I encountered this error during the forward process. I am not sure whether the issue is due to the code, my configuration, or a problem with a third-party library. I hope the author can provide some clarification. Thank you.

Train with custom dataset

Hello;

Thank you for your great work. Can you guide me how can I train this model with custom dataset?

Thank you for your guidance

[COCOEvaluator] Did not receive valid predictions.

请教一下,用了这个命令做evaluation:python tools/train_net.py --config-file configs/Bridge/TotalText/R_50_poly.yaml --eval-only MODEL.WEIGHTS totaltext_testr.pth,但是出来报错是[COCOEvaluator] Did not receive valid predictions.
先前已经按照要求弄好了测试文件
1

Can't see results of text ocr

Hello.
First of all, thanks for sharing nice works :>

I want to get a result of detection + recognition (text spotting).
But, it only show detection results without any text label.

I followed the rule that you mentioned like below
python demo/demo.py --config-file configs/Bridge/ICDAR15/R_50_poly.yaml --input input.png --output output.jpg --opts MODEL.WEIGHTS ./pretrained_models/Bridge_ic15.pth

output

Any reasons?

Thanks.

How to train a model from scratch?

Dear author, I noticed that only the pretraining code is provided in the document, like
python tools/train_net.py --config-file configs/Bridge/TotalText/R_50_poly.yaml --num-gpus 4 MODEL.WEIGHTS totaltext_final.pth

Is there any code available for training from scratch?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.