GithubHelp home page GithubHelp logo

deeplight's Introduction

DeepLight: Reconstructing High-Resolution Observations of Nighttime Light With Multi-Modal Remote Sensing Data (IJCAI 2024)

This project is the PyTorch implementation of our IJCAI 2024 paper "DeepLight: Reconstructing High-Resolution Observations of Nighttime Light With Multi-Modal Remote Sensing Data".

An intuitive example of reconstructing HR NTL image in DeepLightMD using DeepLightSR. The modalities of Daytime Multispectral Observations (DMO), Digital Elevation Model (DEM), and Impervious Surface Product (ISP) are jointly utilized to boost the reconstruction of HR NTL images from LR NTL images.

General remarks

Nighttime light (NTL) remote sensing observation serves as a unique proxy for quantitatively assessing progress toward meeting a series of Sustainable Development Goals (SDGs), such as poverty estimation, urban sustainable development, and carbon emission. However, existing NTL observations often suffer from pervasive degradation and inconsistency, limiting their utility for computing the indicators defined by the SDGs.

In this research, we present a novel task of reconstructing high-resolution nighttime light images with multi-modality data. To support this research endeavor, we introduce DeepLightMD, a comprehensive dataset comprising data from five heterogeneous sensors, offering fine spatial resolution and rich spectral information at a national scale. Additionally, we present DeepLightSR, a calibration-aware method for building bridges between spatially heterogeneous modality data in the multi-modality super-resolution.

Dataset Description

DeepLightMD is a natioanal-scale multimodality SR dataset, consisting of training, validating, and testing material. The training material contains 18,032 pairs of clips, the validating material contains 2,254 pairs of samples. To demonstrate evaluation at both local and regional scale, the testing material contains 2,254 pairs of clips and three province-level samples, covering Anhui, Beijing, and Taiwan. Each pair of clips contains LR NTL (256 × 256 in size), HR NTL (2048 × 2048 in size), DMO (2048 × 2048 in size, 7 bands), DEM (2048 × 2048 in size), and ISP (2048 × 2048 in size).

Method Description

DeepLightSR framework partitions multi-modality data into four distinct categories, including main modality input (LR NTL), auxiliary modality inputs (DMO and DEM), main modality supervision (HR NTL), and auxiliary modality supervision (ISP).

DeepLightSR can be optimized with supervision from the HR NTL image NH ∈ ℝr**h × r**w and auxiliary supervision from the ISP MISP. Here, r, h, and w denote the ratio in the SR task, the height, and the width of NL, respectively. The comprehensive objective function for training DeepLightSR is formulated as follows:

$$\begin{aligned} \boldsymbol {\Theta } = \mathop {\mathrm {argmin}} _{ \boldsymbol {\Theta }} &[ \alpha \mathcal{F}_{1} ||\sum _{j=1}^{m} \beta_{j} (\sum _{i=1}^{n} (\boldsymbol {N}_{L}^{i}, \boldsymbol {M}^{i}; \boldsymbol {\Theta })_{j}-\boldsymbol {N}^{i}_{H,j}||_{1}) + \\\ & (1-\alpha) \mathcal {F}_{2} \sum _{i=1}^{n}(\boldsymbol {M}^{i}_{ISP}\cdot log(\boldsymbol {N}_{G,m}^{i}; \boldsymbol {\Theta }) + \\\ & (1-\boldsymbol {M}^{i}_{ISP})\cdot log(1-(\boldsymbol {N}_{G,m}^{i}; \boldsymbol {\Theta }))], \end{aligned}$$

where Θ is the parameters of DeepLightSR, the n and m denote the number of training samples and multiscale outputs, respectively. The NG, m ∈ ℝr**h × r**w represents the obtained SR NTL image in ℱ1. The α and β denote the weights of two functions and multiscale tasks.

The codes and datasets are under technical reviewing and will be available soon.

deeplight's People

Contributors

xian1234 avatar

Stargazers

 avatar  avatar Robin Cole avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.