Augment3D

Augment 3D images of a focal lesion with 3D rotation, flip, and translation.

Introduction

When detecting a lesion from a medical image, it is often a good strategy to first identify parts of the image (“patches”) that likely contain the lesion and then classify the patches as containing the lesion (“positive”) or not. It can be especially effective when the lesion is relatively small compared to the whole image, as is the case for detecting lymph node metastases from thoraco-abdominal MRI scans or for detecting lung nodules from lung CT scans.

The challenge is that there are usually not enough positive examples. Augment3D addresses this challenge by augmenting the positive examples with 3D rotation, flip, and translation, in-memory. Because the image is 3D:

Permutation of axes alone gives 3! = 6 times the data.
Combined with flip (2^3=8), it gives 48 times the original data.

In translation, Augment3D shifts the data with a vector sampled uniformly within the sphere of the annotated radius of the lesion. When the radius of the lesion is 5 voxels, for example, the method augments the data more than 200 times.

Augment3D achieves all augmentation including translation in-memory, by first saving the patches with an enough margin, and then slicing the patches on retrieval. When there is not enough memory to contain all candidate patches (especially negative patches, since there can be a vast number of them), Augment3D can load them incrementally from the disk after a set number of reuse.

Augment3D includes both a general-purpose module that augments 3D patches (datasets), and special-purpose modules that import images and annotations of the LUNA (LUng Nodule Analysis) 2016 challenge (gather_subsets, import_mhd). The core algorithm of Augment3D can be useful for classification of many types of 3D images when there is a small region of interest with a defined size and when the classification criterion is approximately invariant to rotation, flip, and translation.

Augment3D also includes a classifier (classify_patch) using a deep 3D convolution network based on TensorFlow. It is included as a template showing how to retrieve the data for training and validation. Its structure and hyperparameters are yet to be optimized.

Requirements

Python 2.7 (not tested on other versions)
scipy
pandas
PIL
SimpleITK
gzip
TensorFlow (for classify_patch.py only)

Installation

Download this code.
Put it next to the 'Data' folder. An example folder structure would be:
```
augment3D
+-- augment3D # this repository
+-- Data
....+-- LUNA
```
Download the data.
1. Go to https://grand-challenge.org/site/luna16/download/
2. Download annotations.csv and candidates.csv to Data/LUNA/
3. Download subset*.zip to a suitable folder, such as ExternalHDD/LUNA/
4. Extract the zip files, so that .mhd and .raw files are in ExternalHDD/LUNA/subset*/

Usage

Rename img_dir_root, img_dir, and patch_dir in paths.py.

Move all images into one folder and create a .csv containing the UID and the subset info:

# Separate from import_mhd (next step) because import_mhd takes a long time
augment3D.gather_subsets.main()

Import images:

# WARNING: takes a long time, potentially hours to a day or two.
# It extracts all patches based on the annotations and saves them for fast retrieval.
augment3D.import_mhd.main()

Retrieve the data using the module datasets. Repeat as needed: you don't need to invoke any other command. Positive samples are automatically augmented. Samples are incrementally loaded from the disk without filling the memory.
```
import augment3D.datasets as ds
batch_size = 100
imgs_train, labels_train, imgs_valid, labels_valid = \
        ds.get_train_valid(batch_size)
```

Train a deep 3D convolution network to classify the patches (work-in-progress).

import augment3D.classify_patch as classify
classify.main() # WARNING: takes a long time (hours to days depending on the setup).

chelovekhe / augment3d Goto Github PK

augment3d's Introduction

Augment3D

Introduction

Requirements

Installation

Usage

augment3d's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent

Jobs