GithubHelp home page GithubHelp logo

wandahangfy / rir Goto Github PK

View Code? Open in Web Editor NEW
4.0 2.0 1.0 58.29 MB

The codes of RIR(Random Interpolation Resize).

License: GNU General Public License v3.0

Dockerfile 0.14% Python 99.32% Shell 0.19% HTML 0.11% CSS 0.05% JavaScript 0.20%

rir's Introduction

Random Interpolation Resize: A Free Image Data Augmentation Method for Object Detection in Industry

Introduction

In industrial machine vision detection tasks, data scarcity is a common issue. Data augmentation artificially incorporates prior knowledge of human vision, expands image data, improves model performance, and has become a standard configuration for models. However, most current data augmentation methods are designed for general scenarios, with few specifically tailored for industrial object detection image enhancement.

This repository includes a novel image data augmentation method, Random Interpolation Resize (RIR), designed specifically for industrial object detection tasks. We have changed the interpolation method in the standard resizing step of preprocessing from a fixed configuration to a random selection to expand image data and enhance the model's generalization and detection capabilities. RIR principle diagram is shown below:

Paper Link

Chinese Interpretation Link

  • [RIR Chinese Interpretation](Chinese Interpretation Link) [TODO: Will be written and updated later if necessary]

Video Tutorial Link

Secondary Innovation Points Summary and Code Implementation (TODO)

  • Secondary Innovation Solutions [The last page of the PPT tutorial provides some secondary innovation solutions; TODO: Will be written and updated with the code if necessary]

Citation Format

If the project and article have been helpful to you, please cite the following paper:

@article{WAN2023120355, title = {Random Interpolation Resize: A free image data augmentation method for object detection in industry}, journal = {Expert Systems with Applications}, volume = {228}, pages = {120355}, year = {2023}, issn = {0957-4174}, doi = {https://doi.org/10.1016/j.eswa.2023.120355}, url = {https://www.sciencedirect.com/science/article/pii/S0957417423008576}, author = {Dahang Wan and Rongsheng Lu and Ting Xu and Siyuan Shen and Xianli Lang and Zhijie Ren}, keywords = {Data augmentation, Image interpolation, Object detection, Industry applications, Deep learning algorithm, Convolutional neural network}, }

For example:

Dahang Wan, Rongsheng Lu, Ting Xu, Siyuan Shen, Xianli Lang, Zhijie Ren, Random Interpolation Resize: A free image data augmentation method for object detection in industry, Expert Systems with Applications, Volume 228, 2023, 120355, ISSN 0957-4174, https://doi.org/10.1016/j.eswa.2023.120355.

Reference Links

Conclusion

Thank you for your interest and support in this project. The authors strive to provide the best quality and service but acknowledge that there is always room for improvement. If you encounter any issues or have any suggestions, please let us know. Furthermore, this project is currently maintained by me personally, so there may be oversights and errors. If you find any problems, feel free to provide feedback and suggestions.

Other Open Source Projects

Other open-source projects will be gradually organized and released. Please check the author's homepage for downloads in the future. Homepage

FAQ

  1. README.md file addition (completed)
  2. Detailed tutorials (YOLOv7 and YOLOv8 have been updated)
  3. Project environment setup (RIR data augmentation is plug-and-play; the entire project is based on YOLOv5-6.1 version, refer to the README-YOLOv5.md file and requirements.txt for configuration)
  4. Folder descriptions (consistent with YOLOv5-6.1, no changes to hyperparameters) (TODO: Detailed explanation)
  5. Secondary innovation points summary and code implementation (Detailed instructions provided on the last page of the RIR usage tutorial.pptx)
  6. Paper illustrations (Due to journal copyright issues, the source files for the PPT are not provided):
    • Conceptual diagrams, network structure diagrams, flowcharts: PPT (Personal choice, can also use Visio, Edraw, AI, etc.)
    • Experimental comparisons: Orgin (matlab, python, R, Excel all can be used)

rir's People

Contributors

wandahangfy avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

3321zsc

rir's Issues

改进yolov8添加rir出现错误

屏幕截图 2024-06-06 203648
AttributeError: 'YOLODataset' object has no attribute 'use_rir'
出现以下错误,按照您的添加方法添加的,是因为版本问题吗?还是别的原因

关于代码的开源

你好,刚拜读了您的文章,对您的研究非常感兴趣,请问代码会开源吗,非常感谢您的研究和贡献

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.