GithubHelp home page GithubHelp logo

extendedfaid's Introduction

Extended Flash and Ambient Illuminations Dataset

teaser

We address the challenge of obtaining large-scale datasets that are appropriate for training deep networks to perform flash-related tasks in real-world scenarios. Although several small-scale datasets have been suggested in prior studies, none of them offer sufficient data to generalize to images captured in natural settings. To overcome this limitation, we combine and extend three existing datasets: The Multi-Illumination Dataset (MID), The Flash and Ambient Illuminations Dataset (FAID), and the Deep Flash Portrait Dataset (DPD). We propose a method for harmonizing and compositing portrait images from DPD onto plausible backgrounds from FAID. Additionally, we explain our procedure for performing brightness normalization to ensure consistent illumination intensity across datasets. Finally, we discuss strategies for data augmentation, including randomization of backgrounds and ambient color temperature.

Project web page: http://yaksoy.github.io/intrinsicFlash

Dataset web page: http://yaksoy.github.io/efaid

Dataset Preparation

Our dataset contains flash/no-flash pairs from Flash and Ambient Illuminations Dataset , Dataset of Multi-Illumination Images in the Wild and DeepFlash.

Flash and Ambient Illuminations Dataset

Download all the illuminations from the main dataset in the link and extract them into a single folder called 'Illuminations'.

Download and extract the exif files.

Use the IluminationsToXYZ script to map the illuminations to XYZ color space with the color matrix available in exif data of the PNG files.

Use the FAID script to convert the illuminations to linear RGB and white balance them.

Multi-Illumination Dataset

Download the multi illumination dataset through the link.

Use the MID script to convert the illuminations to linear RGB, white balance them and put the different ambient illuminations for each scene in different sub-folders.

DeepFlash Portrait Dataset

Please contact the authors to get access to the preprocessed dataset. The preprocessing has an affine aligement between flash and no-flash photos.

We utilize flash photographs in 'inputs_origin' and no-flash photographs in 'target' folders.

Each portrait is rotated from 10 different angles, we only select one of the angles for each photoghraph.

We segment and generate the alpha mattings for the flash and no-flash pairs. Download the alpha mats here.

We select photographs from the FAID that are suitable to use as background images. Download them here.

Utilize the DPD script to blend the portraits with background images, white balance them and save the linear RGB images in sub-folders based on the number of different backgrounds chosen for each portrait.

Citation

This implementation is provided for academic use only. Please cite our accompanying paper as well as associated papers for the 3 datasets if you use this dataset in your research.

@INPROCEEDINGS{maralan2023flash,
  author={Sepideh Sarajian Maralan and Chris Careaga and Ya\u{g}{\i}z Aksoy},
  title={Computational Flash Photography through Intrinsics},
  journal={Proc. CVPR},
  year={2023}
}
@INPROCEEDINGS{flashambient,
  author={Ya\u{g}{\i}z Aksoy and Changil Kim and Petr Kellnhofer and Sylvain Paris and Mohamed Elgharib and Marc Pollefeys and Wojciech Matusik},
  booktitle={Proc. ECCV},
  title={A Dataset of Flash and Ambient Illumination Pairs from the Crowd},
  year={2018},
}
@inproceedings{murmann19,
  author={Lukas Murmann and Michael Gharbi and Miika Aittala and Fredo Durand},
  booktitle={2019 IEEE International Conference on Computer Vision (ICCV)},
  title={A Multi-Illumination Dataset of Indoor Object Appearance},
  year={2019},
}
@article{CAPECE201928,
	title	 =	"Deep{F}lash: Turning a flash selfie into a studio portrait",
	journal	 =	"Signal Processing: Image Communication",
	volume 	 =	"77",
	pages 	 =	"28 - 39",
	year 	 = 	"2019",
	issn 	 = 	"0923-5965",
	doi 	 = 	"https://doi.org/10.1016/j.image.2019.05.013",
	url 	 = 	"http://www.sciencedirect.com/science/article/pii/S0923596519300451",
	author 	 = 	"Nicola Capece and Francesco Banterle and Paolo Cignoni and Fabio Ganovelli and Roberto Scopigno and Ugo Erra",
	keywords = "Image enhancement, Machine learning algorithms, Deep learning, Computational photography, Image processing",
}

extendedfaid's People

Contributors

ccareaga avatar yaksoy avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.