GithubHelp home page GithubHelp logo

elisha0904 / sit Goto Github PK

View Code? Open in Web Editor NEW
0.0 0.0 0.0 21.87 MB

[X:AI Toy Project] OCR을 활용한 간판 이미지 내 텍스트 번역 및 이미지 인페인팅

Python 92.23% Jupyter Notebook 7.77%

sit's Introduction

이미지 내 문구 번역 및 원본 스타일 적용

<SIT: Stylized Image Translation>


프로젝트 진행

주차 Team: OCR Team: T5 Team: Erasing+SynthText
1주차(7/12) OT(주제구체화) OT(주제구체화) OT(주제구체화)
2주차(7/19) OCR 기본 flow 공부 & OCR Paper Review Transformer Paper Review & 한-영 task training SRNet Paper&Code Review
3주차(7/26) MM OCR Inference Transformer 한-영 inference 실험 SRNet Inference
4주차(8/4) MM OCR Inference(모델 교체) & 중간발표 Transformer 영-한 task training SRNet 영-한 Fine tuning / training
5주차(8/9) CLOVA OCR 및 다른 OCR 모델 탐색 Transformer tokenizer 변경 리서치 SRNet 대체 model 탐색
6주차(8/16) OCR 모델 간 성능 비교 Transformer 대체 model 탐색 + T5 Paper Review & 영-한 task 리서치 Stroke-Based Scene Text Erasing 구현 + OpenCV 구현
7주차(8/23) OCR 모델 간 성능 비교 & 최고 성능 모델 사용 T5 영-한 inference + prompt engineering Stroke-Based Scene Text Erasing 구현 + OpenCV 구현
8주차(9/1) 최종발표 최종발표 최종발표

Inference Results


sit's People

Contributors

ji-eun-kim avatar elisha0904 avatar 2soup avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.