Implementation of paper which uses a swish-gated residual U-net to color line-art anime drawings

Python 98.22% Shell 1.78%

anime-sketch-coloring-with-swish-gated-residual-unet's People

Contributors

Stargazers

Watchers

anime-sketch-coloring-with-swish-gated-residual-unet's Issues

Constructing loss from individual filters without broadcasting

Anime-Sketch-Coloring-with-Swish-Gated-Residual-UNet/src/train.py

Line 62 in 1793697

for filter_num in range(min(act_fake.shape[-1], 5)):

This is bad. Fix this.

Open questions for authors

Batch size?
~~In up layers, how does swish blocks upsample? Unpooling or conv2d transpose?~~ (conv2d transpose)

Loss comes from one image in batch

Anime-Sketch-Coloring-with-Swish-Gated-Residual-UNet/src/train.py

Line 51 in d7f7a14

act_fake = image_rgb_fake[0]

This is bad. Fix this.

[HELP] InvalidArgumentError : ConcatOp : Dimensions of inputs should match

code:
https://colab.research.google.com/drive/1_ITjS2r-OJzbNlPAlMKf9L7XevMWuV3Q?usp=sharing

Is the cause of this error a mismatch in the number of images in the image_bw and image_rgb folder contents?

datasets
https://drive.google.com/drive/folders/1GmEwRcu9zK3hQnqk7bTPyU273g7X4vyy?usp=sharing

Broken requirements.txt

Tried to install as is and got

After dropping the version of opencv-python and some others build has failed

Ubuntu 20.04 LTS
pip 20.0.2 from /usr/lib/python3/dist-packages/pip (python 3.8)

Update readme

Readme needs to incorporate:

Result images
Instructions on how to run predictions on new images using evaluate.py script

Loss function too hard to implement with Keras?

The constructed network is inconsistent with the network in the paper！

In the network structure diagram of the paper, the resolution of the feature map is divided into 6 levels, but in model.py, there are only 5 resolution levels. Actually, when the number of channels of the feature map is 512, the resolution of the feature map is the smallest, which is still obtained by downsampling, and it has no horizontal connection like the right branch.
In model.py, in the final output of the right branch, the output of the horizontal swish layer of the left branch is not acquired.

This leads to the asymmetry of the left and right branches of the network, misaligned connections.

Remove batching

Or fix it

Select the best training model through the verification phase

I have an idea that perhaps a small number of verification sets should be build that can verify the effect of the model after each training eopch. Instead of choosing the last epoch of training or the epoch with the best convergence. It may prevent over-training (over-fitting), resulting in inconsistent colors in a certain area of the generated picture. For example, the hair of the person in the final epoch of training will show multiple colors, and even the left and right colors of the clothing are not symmetrically consistent.
In the dataset given in the paper, there are dozens of numbered pictures at the end for the verification phase. It seems to be about 60 sketches.

Upload pretrained model and dataset

We need to upload our pretrained checkpoints and dataset used to the release section.

Implement save/restore functionality

Diagrams in paper don't seem consistent.

The two diagrams below don't seem consistent. I mentioned this in the slack, but I'm putting it here so this conversation will be easier to access.

Essentially the SGB marked as (c) doesn't seem to match with the SGB in the first image (brown dashed box).

Remove temporary hack that rounds all images to 128x128

Anime-Sketch-Coloring-with-Swish-Gated-Residual-UNet/src/image_generator.py

Line 46 in fed47ea

image = cv2.resize(image, (128, 128))

If the dimension of the input image is not evenly divisible by 32 (the network cuts it in half 5 times) there will most likely be a wonky concatenation issue due to mismatching tensor dimensions. The referenced line performs a hack that prevents this by resizing all images to 128x128. A better solution would be to recreate a dataset where all images are 224 or 256 or something.

Out of memory when training with 1080ti

What GPU did you use?

pradeeplam / anime-sketch-coloring-with-swish-gated-residual-unet Goto Github PK

anime-sketch-coloring-with-swish-gated-residual-unet's People

Contributors

Stargazers

Watchers

Forkers

anime-sketch-coloring-with-swish-gated-residual-unet's Issues

Recommend Projects

Recommend Topics

Recommend Org

Jobs