
tianchi-fft2's Introduction

An official implementation of the Rank 3 solution.

Table of Contents

Background

Demo. The upper part shows the inputs (forged images) and the bottom part shows the corresponding outputs (detection results).

In this competition, competitors are required to propose an algorithm that detects and locates forged regions in an input image. Classic forensics algorithms locate forged areas by studying anomalous clues in the image, e.g., JPEG compression artifacts and/or noise patterns. However, these cues are not robust enough: almost all input images undergo multiple complex post-processing steps, making classic forensics methods unable to accurately detect and locate the forged regions. With the rapid development of deep learning (DL) in recent years, many DL-based models have been proposed for image forensics owing to their strong representation learning and generalization abilities.

We therefore propose a U-Net [1] that uses SE-ResNeXt50 [2] as its encoder and incorporates the SCSE [3] attention module. The SCSE module is effective because its "attention" operation allows the model to re-weight the features of the tampered area. After proper data augmentation and validation-set partitioning, we trained four models for model ensembling.
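The re-weighting performed by the SCSE block can be sketched in plain NumPy. This is a minimal illustration, not the repository's implementation: the weights `w1`, `w2`, and `w_s` are hypothetical stand-ins for the learned bottleneck MLP and 1x1 convolution.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def scse(x, w1, w2, w_s):
    """Concurrent spatial and channel squeeze-and-excitation (SCSE) [3].

    x: feature map of shape (C, H, W).
    w1, w2: bottleneck MLP weights for the channel gate (hypothetical).
    w_s: per-channel weights of the 1x1 conv for the spatial gate.
    """
    # Channel SE: global average pool -> bottleneck MLP -> per-channel gate
    z = x.mean(axis=(1, 2))                              # (C,)
    gate_c = sigmoid(w2 @ np.maximum(w1 @ z, 0.0))       # (C,)
    cse = x * gate_c[:, None, None]
    # Spatial SE: 1x1 conv across channels -> per-pixel gate
    gate_s = sigmoid(np.tensordot(w_s, x, axes=1))       # (H, W)
    sse = x * gate_s[None, :, :]
    # SCSE sums the two recalibrated feature maps
    return cse + sse
```

The channel gate emphasizes informative feature channels while the spatial gate emphasizes informative pixels, which is why the block helps the model focus on tampered regions.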

[1] Ronneberger et al., "U-Net: Convolutional networks for biomedical image segmentation." Link.

[2] Hu et al., "Squeeze-and-excitation networks." Link.

[3] Roy et al., "Recalibrating fully convolutional networks with spatial and channel 'squeeze and excitation' blocks." Link.

Dataset

The official dataset can be downloaded from Link after registration, or from Link directly.

Dependency

Please refer to the "requirements.txt" file.

Demo

To train the model:

sh code/train.sh

Note: the training/testing data can be downloaded from the official website.

To test the model:

sh code/run.sh

Then the model will detect the images in ../s2_data/data/test/ and save the results in the ../prediction_result/images/ directory. The pre-trained weights can be downloaded from Google Drive.

More explanation for "code/run.sh":

For "python main.py test --func=0": "func=0" divides the input images into sub-images, which are saved in "../s2_data/data/test_decompose_\*" (\* is the resolution of the sub-images).
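The division step can be sketched as follows. This is a hypothetical NumPy tiler, not the repository's code: it assumes reflect-padding at the border, and the tile sizes follow the [384, 512, 768, 1024] resolutions used by run.sh.

```python
import numpy as np

def decompose(img, size):
    """Split an (H, W, 3) image into non-overlapping size x size tiles.

    Hypothetical sketch of the --func=0 division; the border is
    reflect-padded so every tile has the full target resolution.
    """
    H, W = img.shape[:2]
    pad_h, pad_w = (-H) % size, (-W) % size
    padded = np.pad(img, ((0, pad_h), (0, pad_w), (0, 0)), mode="reflect")
    return [padded[y:y + size, x:x + size]
            for y in range(0, padded.shape[0], size)
            for x in range(0, padded.shape[1], size)]
```

For example, a 600x900 image tiled at 512 pads to 1024x1024 and yields four 512x512 sub-images.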
	
For "python main.py test --func=1 --size_idx=0 --fold=1 --tta=1":

    "func=1" runs detection on the sub-images and outputs a pixel-level probability that each sub-image is forged.

    "size_idx" selects the sub-image resolution (the value is an index into [384, 512, 768, 1024]).

    "fold" selects the model trained on the corresponding data split.

    "tta" enables test-time augmentation (values 1-8 select flipping and/or rotation).

For "python main.py test --func=2": "func=2" runs the ensemble operation.
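The eight flip/rotation variants and their averaging can be sketched as follows. This is a minimal NumPy illustration under the assumption that tta=1..8 enumerates the dihedral group D4; `model` stands in for any function mapping an image to a per-pixel probability map.

```python
import numpy as np

def d4_transforms(x):
    """The 8 flip/rotation variants assumed behind --tta=1..8 (group D4)."""
    rots = [np.rot90(x, k) for k in range(4)]
    return rots + [np.fliplr(r) for r in rots]

def d4_inverse(y, idx):
    """Undo variant idx so the probability maps realign with the input."""
    if idx >= 4:
        y = np.fliplr(y)
    return np.rot90(y, -(idx % 4))

def tta_predict(model, x):
    """Average the model's pixel-level probabilities over all 8 variants."""
    preds = [d4_inverse(model(t), i) for i, t in enumerate(d4_transforms(x))]
    return np.mean(preds, axis=0)
```

The ensemble step (func=2) can then average such maps over the four trained models in the same way before thresholding.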

Licence

The program is made available for academic or non-commercial purposes only. For commercial use, please contact:

tianchi-fft2's People

Contributors

highwaywu


tianchi-fft2's Issues

Could you please share the data?

Hi,

Thank you very much for sharing your solution. I am wondering if it is possible to share the training and testing dataset? Thanks.

model eval error

The error occurs when running main.py:

    File "/opt/data/private/code/tianchi_rank3/Tianchi_fft2/code/main_test.py", line 708, in <module>
      model.val()
    File "/opt/data/private/code/tianchi_rank3/Tianchi_fft2/code/main_test.py", line 463, in val
      a, b = metric(pre / 255, gt / 255)
    File "/opt/data/private/code/tianchi_rank3/Tianchi_fft2/code/main_test.py", line 625, in metric
      true_pos = float(np.logical_and(premask, groundtruth).sum())  # float for division
    ValueError: operands could not be broadcast together with shapes (512,512,3) (1319,1319,3)
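The broadcast error indicates the predicted mask (512x512) and the ground truth (1319x1319) have different resolutions. A hypothetical fix is to resize the prediction to the ground-truth shape before computing the metric, e.g. with a nearest-neighbour index map; `align_to` is an illustrative helper, not part of the repository.

```python
import numpy as np

def align_to(premask, groundtruth):
    """Nearest-neighbour resize premask to groundtruth's spatial shape.

    A hypothetical pre-metric fix for the shape mismatch above; the
    masks are assumed to be (H, W, 3) arrays as in the traceback.
    """
    if premask.shape != groundtruth.shape:
        H, W = groundtruth.shape[:2]
        ys = np.arange(H) * premask.shape[0] // H
        xs = np.arange(W) * premask.shape[1] // W
        premask = premask[ys][:, xs]
    return premask
```

Calling `metric(align_to(pre, gt) / 255, gt / 255)` would then compare arrays of equal shape.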

About the dataset

Hi,
The competition has finished, so I can't access the dataset. Could you share the dataset with me?
Thank you so much.

Help needed: about the data

I am very interested in this competition, but I missed the registration window, so I can no longer register and download the original training data from the official site. I would appreciate your help. Thank you.
