pradeeplam / anime-sketch-coloring-with-swish-gated-residual-unet Goto Github PK
View Code? Open in Web Editor NEWImplementation of paper which uses a swish-gated residual U-net to color line-art anime drawings
Implementation of paper which uses a swish-gated residual U-net to color line-art anime drawings
This is bad. Fix this.
This is bad. Fix this.
code:
https://colab.research.google.com/drive/1_ITjS2r-OJzbNlPAlMKf9L7XevMWuV3Q?usp=sharing
Is the cause of this error a mismatch in the number of images in the image_bw and image_rgb folder contents?
datasets
https://drive.google.com/drive/folders/1GmEwRcu9zK3hQnqk7bTPyU273g7X4vyy?usp=sharing
Readme needs to incorporate:
evaluate.py
scriptIn the network structure diagram of the paper, the resolution of the feature map is divided into 6 levels, but in model.py, there are only 5 resolution levels. Actually, when the number of channels of the feature map is 512, the resolution of the feature map is the smallest, which is still obtained by downsampling, and it has no horizontal connection like the right branch.
In model.py, in the final output of the right branch, the output of the horizontal swish layer of the left branch is not acquired.
This leads to the asymmetry of the left and right branches of the network, misaligned connections.
Or fix it
I have an idea that perhaps a small number of verification sets should be build that can verify the effect of the model after each training eopch. Instead of choosing the last epoch of training or the epoch with the best convergence. It may prevent over-training (over-fitting), resulting in inconsistent colors in a certain area of the generated picture. For example, the hair of the person in the final epoch of training will show multiple colors, and even the left and right colors of the clothing are not symmetrically consistent.
In the dataset given in the paper, there are dozens of numbered pictures at the end for the verification phase. It seems to be about 60 sketches.
We need to upload our pretrained checkpoints and dataset used to the release section.
If the dimension of the input image is not evenly divisible by 32 (the network cuts it in half 5 times) there will most likely be a wonky concatenation issue due to mismatching tensor dimensions. The referenced line performs a hack that prevents this by resizing all images to 128x128. A better solution would be to recreate a dataset where all images are 224 or 256 or something.
What GPU did you use?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.