GithubHelp home page GithubHelp logo

Comments (3)

AIWintermuteAI avatar AIWintermuteAI commented on September 17, 2024

Thanks for the kind words about aXeleRate.
Object detection models are more tricky than image classification models and their output need to be properly decoded to get bounding boxes. Different object detection architectures process the output of neural network differently - so as a rule of thumb, you cannot just take any example script and use it with any object detection network.
Fortunately I made example scripts for inference with Raspberry Pi, you can find them here
https://github.com/AIWintermuteAI/aXeleRate/tree/master/example_scripts/raspberry_pi
You can check this example and see if it works for you - if there are problems, do report here :)

Which board are you using? I mean for MaixPy. For rotated image, it is better to avoid rotating image, since this operation consumes memory and slows down the image processing pipeline. Can you just rotate the camera properly?

There are no readily available solutions for OCR on MaixPy - I've trained number detection model myself and it works pretty good, I think it is possible to add letters to the model as well. Anyhow, that will be two stage detection problem - first detect license place, then detect numbers/letters in cropped plate image. Maybe even make it three stage, if two stage doesn't work reliably - then it will be plate detection - character detection in cropped plate image - character recognition in cropped character image.

from axelerate.

WillyMC95 avatar WillyMC95 commented on September 17, 2024

Hi @AIWintermuteAI, thanks for the quick reply. Well I'm still learning (I'm a newbie) about this and didn't realize that I would have issues with the model itself depending on the type of training. For the example scripts I need the .tflite provide by Axelerate and I supposeed that I need to create a .txt file with the labels for the arguments in the detector_video.py , right?

I'm using MaixGo and I don't know how to explain the camera error properly but I'll do my best : The image in the screen (on the board) is always in rotation mode, I mean , I can't handle it properly because if I rotate the board the image is rotated too, so the image on the screen will always be upside down. About the OCR on the Maix, Can I combine two models ? like I said I'm still learning and right now I don't know how to do that , Can u give some advices to look information about that?.

Thanks for all the help , I really appreciate it.

from axelerate.

AIWintermuteAI avatar AIWintermuteAI commented on September 17, 2024

Sorry for delayed reply, got busy with other things and could continue working on aXeleRate only in December. I just published a new release.

For the example scripts I need the .tflite provide by Axelerate and I supposeed that I need to create a .txt file with the labels for the arguments in the detector_video.py , right?

yes, correct.

I'm using MaixGo and I don't know how to explain the camera error properly but I'll do my best : The image in the screen (on the board) is always in rotation mode, I mean , I can't handle it properly because if I rotate the board the image is rotated too, so the image on the screen will always be upside down.

I think you need to set vflip and horizontal flip for the sensor.

About the OCR on the Maix, Can I combine two models ?

You can have a look at example of two stage inference in MaixPy face recognition script here
https://github.com/sipeed/MaixPy_scripts/blob/master/machine_vision/face_recognization/demo_face_recognization.py
It's fairly easy - just perform object detection, crop the original image and feed the cropped image to the second network. If you have three stages (as in face recognition example), then repeat the process one more time.

If you have any other issues, please create a separate thread!

from axelerate.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.