GithubHelp home page GithubHelp logo

sickzil-machine-directml's Introduction

SickZil-Machine-DirectML

English | 한국어 | 汉语
Forked from KUR-creative/SickZil-Machine.

Notice

SZMC 0.1.1 - Fixed link
Tutorials and Tips

Currently, I am still working on the SickZil-Machine. It is just laying the groundwork invisible to users. I'm building the system I need for more effective training data collection, more efficient deep learning model training, faster experimentation and analysis, model serving and deployment automation, etc.
I will go to the distance. I appreciate if you could wait a little longer. Thank you.

szmc-0.1.0 (source: manga109, © Kanno Hiroshi, © Okuda Momoko, © Kato Masaki)

SickZil-Machine automates texts removal during manga/comics translation(Scanlation) process.

SeisinkiVulnus_028

LoveHina_vol14_003

AkkeraKanjinchou_031 All of the above images were edited automatically without human intervention.
(source: manga109, © Shimazaki Yuzuru, © Akamatsu Ken, © Kobayashi Yuki)

How it works??

Model

szmc-structure-eng

SickZil-Machine finds out the texts in manga/comics and erases it naturally to match the background.
Both processes are completely automatic, without any additional human intervention.
Of course, if you want, you can also specify text area you want to erase.

By the way, SickZil is korean word 식질, slang of 식자(작업). 식자 means editing manga/comics according to the translation(from translator).

We applied U-net for SegNet and Deepfill v2 for ComplNet.

Data set

SickZil-Machine consists of two deep learning models, SegNet and ComplNet.

To learn SegNet, we need original manga images and
text component masks that cover all text area corresponding to the original images.

To learn ComplNet, we need manga images with text removed (ie output).
(I'm researching how an images with a small amount of text affects performance.
 manga images with no text at all are the ideal data.)

Version 0.1.1 was trained using 285 image-mask pairs and 31,497 manga images.
(11,464 of 31,497 manga images are images with text.)

If you'd like to contribute a dataset to SickZil-Machine, please send your data to email .
The dataset will only be used for research purposes.

Release (not applicable to DirectML, see "Running code (for developers)")

We released 0.1.1 pre-release version!
You can download SZMC here.
Tutorials and Tips here.

SickZil-Machine is not a perfect program. We need your help.
If you find a bug or have a suggestion, please open a Github issue or send us an email.

Run the code(for developers)

The TensorFlow with DirectML package on native Windows works starting with Windows 10, version 1709 (Build 16299 or higher). You can check your build version number by running winver via the Run command (Windows logo key + R). (tensorflow-directml 1.15.0 requirements)

  1. git clone https://github.com/Aloereed/SickZil-Machine-DirectML.git; cd SickZil-Machine-DirectML
  2. Download one of release zip files from here.
  3. Unzip the release file and copy SickZil-Machine-0.1.1-pre0-win64-cpu-eng/resource/cnet and SickZil-Machine-0.1.1-pre0-win64-cpu-eng/resource/snet directories to SickZil-Machine-DirectML/resource.
  4. cd src; pip install -r requirements.txt
  5. python main.py

Future works

  • Increase text segmentation performance
  • Open manga text segmentation mask dataset
  • Automate typesetting(calligraphy style learning)



sickzil-machine-directml's People

Contributors

aloereed avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar

sickzil-machine-directml's Issues

Is there any possibility to train a cnet

您好,原项目作者已经消失很久了,既然项目中提到了使用其他模型的方法,想请教一下您如何才能自己训练一个稍大一些的网络来实现漫画填充?因为这里我注意到原模型在彩色漫画上表现并不好。现在基于sd的各种绘画模型已经得到相当程度的发展,但有时候我想就是有一些轻量级的应用场景——例如这里的自动嵌字——需要一个小巧而有针对性的网络,有没有可能能利用现在sd的各种模型来进行填充输出?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.