FabInspector

FabInspector is an end-to-end model to indentify 10 different small obsucre fabric defects from the normal fabrics.

Introduction and context

In the textile or fabric manufacturing industry, fabric defects caused by machine malfunction or broken yarn seriously impact the quality of product and lead to huge economic losses, which reduces prices of fabrics with losses reaching 45- 65%. Detecting fabric defects is one of the most important processing steps for webbing quality control. Traditional manual inspection is a labor-intensive process. Manual inspection of fabric defects is unreliable as it is seriously affected by factors including the intensity of the lights, fatigue and the experience of the inspector, etc. According to feedback from the textile industry, manual inspection accuracy is merely 60 - 75%. Therefore, an automatic, fast and accurate fabric defect detector is an urgent need in fabric industry, which can help to save billions of dollars during the manufacturing process. Thanks to the advances in hardware technology and computer vision techniques as well as pattern recognition accuracy, the automatic visual inspection is becoming a promising and realistic solution for the detection of fabric defects. In this project, we propose an end-to-end network based on pretrained ResNet and DenseNet framework to detect 10 defferent common small obsucre fabric defects from the normal fabrics. Our experimental results reach more than 79% accuracy on our test data, which indicates our model is very promising to be applied in the fabric industry in future.

Data

We collect more than 2000 fabric images as our dataset. There are two major difficulties and challenges in our dataset, one of which is the data imbalance. As shown in Fig. 1, more than half images are normal fabrics, and rest images are belong to 10 different defects. So number of images of each defect is quite small compared to the normal fabrics, which makes it harder to train models to classify these defects. The other problem of our dataset is the area of defects is very small in most defective fabrics. As shown in Fig. 2, more than 80% defects only have less than 1% area, which significantly increases the difficulty to detect the defects.

Fig 1. the histogram of number of images for normal fabrics and 10 different kinds of defects

Fig 2. the area distribution of fabric defects. About 82% of defects have area less than 1%, while the area of about 17% of defects is between 1% and 10%. And the rest 1% of defects have area greater than 10%.

Data process

Multi-scale window

Since the original images have high resolution (2520 * 1960) and the shape of defects varies across different defects,in which the area of most defects is very small, we decided to use the multi-scale window sweeping across the images instead of the entire image as our training dataset, which also helps to increase the amount of dastaset. We select windows with three different kinds of windows (i.e. 640 * 640, 800 * 800, 960 * 960) in order to capture the different sizes of defects (Fig. 3). Each original image selects totally 24 (6*4) windows for every size of window, in which we only keep the windows containing defects with iou greater than 0.5 as our training dastaset.

Fig 3. the multi-scale windows of imamges

Histogram equalization

Histogram equalization is a computer image processing technique used to improve contrast in images, which is accomplished by effectively spreading out the most frequent intensity values, i.e. streatching out the intensity range of the image. Histogram equalization is one of the best methods for image enhancement without loss of any information. Our most images have very skewed distribution on grayness and/or brightness, resulting in defects merged with normal background and hard to be identified. Therefore, the histogram equalization can significantly increase the contrast of most defect images (Fig. 3), which contributes a lot to improve the classification accuracy of our model.

Fig 4. (a) the original images of defects. (b) the images of defects processed by the histogram equalization algorithm.

Data augmentation

Since the number of defective fabric images are very limited compared to normal fabrics. We implement a few data augmentation methods that are common and standard in the computer vision field to increase the training dataset of defective fabrics, including horizontal/vertical flip and rotation at specific angle. Besides the augmentation on training dataset, we also augment the testing dataset by individual horizontal flip, individual vertical flip, and both horizontal and vertical flip simutaneously. Hence, each test image will become four images (including the original one) to be predicted/classified by our model, and the final prediction will be the average value from the predicting results of all four images. The augmentation on test dataset can be regarded as a kind of model fusion, which turns out to increase the accuracy of our model on test dataset.

Framework / network

In our original approach, we implement the transfer learning based on the pretrained ResNet152 and/or DenseNet161 models, and some other pretrained models like vgg16 and Inception-v4 can be also tried in our network. Then we select SeResNet101 with attention mechanism added into it, which is more helpful to capture the defects especially some small defects. The Fig. 5(a) shows the structure of our network, in which the processed images are input into the pretrained model and then extract feature of each channel by a global pooling layer. The key point in this network is to split the network into two branches, of which one is to do binary classification between normal and defective fabrics, and the other one is to classify 10 different kinds of fabrics from normal fabrics. The reason we have an additional branch of binary classification is to reduce the impact of dataset imbalance because the accumulative dataset from all defects has a comparable size with normal fabric dataset. Our experimental results indicate the binary classification is helpful to increase the accuracy of eventual classification on 11 classes.

We select the focal loss function as our loss function (Fig. 5(b)), which is defined in the following equation:

where is the classification probability; is the weights on different classes; is focusing parameter (default is 2) and is called modulating factor, which is to decrease the weights on classes that are easily classified, while increase the weight on difficult classifications. There are two major advantages of focal loss function compared to the cross entropy loss function. The first advantage is that it can mitigate the influence of dataset imbalance by adding more weights on small dataset manually, and the other advantage is it can automatically increase the weights on classes that hard to be classified correctly on the fly.

Fig 5. (a) the network structure used in this project and the pretrained models we use are SeResNet101, ResNet152 and DenseNet161. (b) the probability of ground truth of Focal loss function, which is the loss funtion used in this project (Tsung-Yi Lin et al. 2018).

Since our network is splitted to two branches on both 2 classes and 11 classes, the loss is defined in the following equation:

where 70% of total loss is from the loss on 11 classifications and the rest 30% is from the loss on 2 classifications. We try a few different weights on those two types of classifications and turns out 70%/30% can achieve the best accuracy.

Results and future works

In this project, we obtain the accuracy about 86% on the test data by using SeResNet101 and multi-scale windows (Fig. 6), while the accuracy using ResNet152, DenseNet161 and entire image is only about 79%. The accuracy is higher than common manual inspection accuracy between 50% and 65%, so we believe the model can help fabric manufacturing industry increase speed and accuracy on fabric defect detection.

Fig 6. the accuracy of model based on SeResNet101 for each epoch, where the blue lines are accuracy of training dataset and red lines are accuracy of testing dataset.

For this project, there are still a lot of other techniques worth exploring to increase the accuracy continuously in the future. For example, we can try to apply some object detection techniques on this project, like fast-RCNN, spatial pyramid pooling, Yolo network to increase the accuracy and speed of detection.

Code readme

Please see the readme under the ./src/ folder.

xstargate / fabinspector Goto Github PK

fabinspector's Introduction

FabInspector

Introduction and context

Data

Data process

Multi-scale window

Histogram equalization

Data augmentation

Framework / network

Results and future works

Code readme

fabinspector's People

Contributors

Stargazers

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs