GithubHelp home page GithubHelp logo

yolov8 about m2doc HOT 8 CLOSED

sharonjunjun avatar sharonjunjun commented on September 3, 2024
yolov8

from m2doc.

Comments (8)

johnning2333 avatar johnning2333 commented on September 3, 2024

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.

Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

from m2doc.

carlos-vinicios avatar carlos-vinicios commented on September 3, 2024

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.

Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

And the pretrained weights for the best model. You will provide too?

from m2doc.

johnning2333 avatar johnning2333 commented on September 3, 2024

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.
Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

And the pretrained weights for the best model. You will provide too?

yes, we will also provide the pretrained weight of DINO and Cascade Mask R-CNN with m2doc on DocLayNet.

from m2doc.

sharonjunjun avatar sharonjunjun commented on September 3, 2024

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.

Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

ok. if i want to use DocLayNet dataset training, i should download DocLayNet_core and DocLayNet_extra get the object boundingbox and text boundingbox?thanks.

from m2doc.

johnning2333 avatar johnning2333 commented on September 3, 2024

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.
Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

ok. if i want to use DocLayNet dataset training, i should download DocLayNet_core and DocLayNet_extra get the object boundingbox and text boundingbox?thanks.

yes

from m2doc.

sharonjunjun avatar sharonjunjun commented on September 3, 2024

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.
Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

ok. if i want to use DocLayNet dataset training, i should download DocLayNet_core and DocLayNet_extra get the object boundingbox and text boundingbox?thanks.

yes

ok,the inference script by onnx model will release and how long is the inference time?
Can the model be deployed on triton?

from m2doc.

johnning2333 avatar johnning2333 commented on September 3, 2024

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.
Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

ok. if i want to use DocLayNet dataset training, i should download DocLayNet_core and DocLayNet_extra get the object boundingbox and text boundingbox?thanks.

yes

ok,the inference script by onnx model will release and how long is the inference time? Can the model be deployed on triton?

Thank you for your interest in our work. We currently do not have plans to release versions for ONNX or Triton

from m2doc.

johnning2333 avatar johnning2333 commented on September 3, 2024

when can you add the yolo detector in the project and add the add the data format samples for M2Doc and add the dataset converting scripts? thanks.

We don't actually use yolov8 as our baseline for comparison. As we mentioned in the paper, we use DINO, Cascade Mask R-CNN for our main baseline. While we encourage you to extend the m2doc to the yolo detector.

Since we use mmdetection to train the models, we will provide the coco dataformat converting script and data samples in this month.

Sorry about the delay of the dataset format converting script.
We are now provide the ocr_anno_convert.py to format doclaynet ocr annotations, and we upload 3 test samples for illustration.

from m2doc.

Related Issues (6)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.