Javascript must be enabled to continue!
Seg2pix: Few Shot Training Line Art Colorization with Segmented Image Data
View through CrossRef
There are various challenging issues in automating line art colorization. In this paper, we propose a GAN approach incorporating semantic segmentation image data. Our GAN-based method, named Seg2pix, can automatically generate high quality colorized images, aiming at computerizing one of the most tedious and repetitive jobs performed by coloring workers in the webtoon industry. The network structure of Seg2pix is mostly a modification of the architecture of Pix2pix, which is a convolution-based generative adversarial network for image-to-image translation. Through this method, we can generate high quality colorized images of a particular character with only a few training data. Seg2pix is designed to reproduce a segmented image, which becomes the suggestion data for line art colorization. The segmented image is automatically generated through a generative network with a line art image and a segmentation ground truth. In the next step, this generative network creates a colorized image from the line art and segmented image, which is generated from the former step of the generative network. To summarize, only one line art image is required for testing the generative model, and an original colorized image and segmented image are additionally required as the ground truth for training the model. These generations of the segmented image and colorized image proceed by an end-to-end method sharing the same loss functions. By using this method, we produce better qualitative results for automatic colorization of a particular character’s line art. This improvement can also be measured by quantitative results with Learned Perceptual Image Patch Similarity (LPIPS) comparison. We believe this may help artists exercise their creative expertise mainly in the area where computerization is not yet capable.
Title: Seg2pix: Few Shot Training Line Art Colorization with Segmented Image Data
Description:
There are various challenging issues in automating line art colorization.
In this paper, we propose a GAN approach incorporating semantic segmentation image data.
Our GAN-based method, named Seg2pix, can automatically generate high quality colorized images, aiming at computerizing one of the most tedious and repetitive jobs performed by coloring workers in the webtoon industry.
The network structure of Seg2pix is mostly a modification of the architecture of Pix2pix, which is a convolution-based generative adversarial network for image-to-image translation.
Through this method, we can generate high quality colorized images of a particular character with only a few training data.
Seg2pix is designed to reproduce a segmented image, which becomes the suggestion data for line art colorization.
The segmented image is automatically generated through a generative network with a line art image and a segmentation ground truth.
In the next step, this generative network creates a colorized image from the line art and segmented image, which is generated from the former step of the generative network.
To summarize, only one line art image is required for testing the generative model, and an original colorized image and segmented image are additionally required as the ground truth for training the model.
These generations of the segmented image and colorized image proceed by an end-to-end method sharing the same loss functions.
By using this method, we produce better qualitative results for automatic colorization of a particular character’s line art.
This improvement can also be measured by quantitative results with Learned Perceptual Image Patch Similarity (LPIPS) comparison.
We believe this may help artists exercise their creative expertise mainly in the area where computerization is not yet capable.
Related Results
Improving Precision of Deformable Image Registration
Improving Precision of Deformable Image Registration
Deformable image registration (DIR) has various applications in medical image analysis such as in adaptive radiotherapy (ART) and multi-atlas segmentation. ART uses DIR to warp the...
Automatic Acquisition Method and Empirical Research of Shot Length in Chinese Films based on Machine Vision
Automatic Acquisition Method and Empirical Research of Shot Length in Chinese Films based on Machine Vision
Abstract
The measurement of shot length is an essential index for the evaluation of cinematographic research. Given the limitations of existing measurement tools, which req...
Double Exposure
Double Exposure
I. Happy Endings
Chaplin’s Modern Times features one of the most subtly strange endings in Hollywood history. It concludes with the Tramp (Chaplin) and the Gamin (Paulette Godda...
Evaluation of Prompting Strategies for Cyberbullying Detection Using Various Large Language Models
Evaluation of Prompting Strategies for Cyberbullying Detection Using Various Large Language Models
Sentiment analysis detects toxic language for safer online spaces and helps businesses refine
strategies through customer feedback analysis [1, 2]. Advancements in Large Language
M...
Pengaruh Latihan Drop Shot Sasaran Tetap dan Berubah terhadap Ketepatan Drop Shot dalam Permainan Bulutangkis Peserta Ekstrakurikuler Bulutangkis di SD
Pengaruh Latihan Drop Shot Sasaran Tetap dan Berubah terhadap Ketepatan Drop Shot dalam Permainan Bulutangkis Peserta Ekstrakurikuler Bulutangkis di SD
This study was motivated by the low performance of students’ drop shot skills in elementary school badminton extracurricular activities, where the success rate of shots remained lo...
Enhancing Self-Navigated Interleaved Spiral with ESPIRiT (eSNAILS)
Enhancing Self-Navigated Interleaved Spiral with ESPIRiT (eSNAILS)
Motivation: Current methods for estimation of shot-to-shot phase variations in multi-shot DWI may not fully exploit the correlations in data. Goal(s): To propose a method which eff...
Latest advancement in image processing techniques
Latest advancement in image processing techniques
Image processing is method of performing some operations on an image, for enhancing the image or for getting some information from that image, or for some other applications is not...
Unlocking the capabilities of explainable few-shot learning in remote sensing
Unlocking the capabilities of explainable few-shot learning in remote sensing
AbstractRecent advancements have significantly improved the efficiency and effectiveness of deep learning methods for image-based remote sensing tasks. However, the requirement for...

