GithubHelp home page GithubHelp logo

xiaojiean815 / fishdreamer Goto Github PK

View Code? Open in Web Editor NEW

This project forked from masterhow/fishdreamer

0.0 0.0 0.0 89.47 MB

[CVPRW23] Implementation of ''FishDreamer: Towards Fisheye Semantic Completion via Unified Image Outpainting and Segmentation''

Home Page: https://arxiv.org/abs/2303.13842

License: MIT License

Shell 1.44% Python 98.43% Dockerfile 0.13%

fishdreamer's Introduction

FishDreamer: Towards Fisheye Semantic Completion via Unified Image Outpainting and Segmentation



Update

  • 2023.03.20 Init repository.
  • 2023.03.24 Release the arXiv version.
  • 2023.04.05 🚀 FishDreamer has been accepted to 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW OmniCV2023).
  • 2023.09.05 Release code.

Abstract

This paper raises the new task of Fisheye Semantic Completion (FSC), where dense texture, structure, and semantics of a fisheye image are inferred even beyond the sensor field-of-view (FoV). Fisheye cameras have larger FoV than ordinary pinhole cameras, yet its unique special imaging model naturally leads to a blind area at the edge of the image plane. This is suboptimal for safety-critical applications since important perception tasks, such as semantic segmentation, become very challenging within the blind zone. Previous works considered the out-FoV outpainting and in-FoV segmentation separately. However, we observe that these two tasks are actually closely coupled. To jointly estimate the tightly intertwined complete fisheye image and scene semantics, we introduce the new FishDreamer which relies on successful ViTs enhanced with a novel Polar-aware Cross Attention module (PCA) to leverage dense context and guide semantically-consistent content generation while considering different polar distributions. In addition to the contribution of the novel task and architecture, we also derive Cityscapes-BF and KITTI360-BF datasets to facilitate training and evaluation of this new track. Our experiments demonstrate that the proposed FishDreamer outperforms methods solving each task in isolation and surpasses alternative approaches on the Fisheye Semantic Completion.

Method

(Overview)



Results

After code reorganization, we retrained FishDreamer on Cityscapes-BF:

Method PSNR mIoU
FishDreamer (Paper) 25.05 54.54
FishDreamer (This Repo) 25.21 54.88

Dependencies

This repo has been tested in the following environment:

torch == 1.9.0
pytorch-lightning == 1.8.6
mmcv-full == 1.5.2

Usage

To train FishDreamer, first set environment variable:

export USER=$(whoami)
export TORCH_HOME=$(pwd) && export PYTHONPATH=$(pwd)

Then start training:

python3 bin/train.py \
-cn FishDreamer \
data.batch_size=3 \
trainer.kwargs.max_epochs=70 \
data.train.transform_variant=resize

To output visual result, run:

python3 bin/predict.py \
model.path=$ModelPath \
indir=$DataPath \
outdir=$OutPath

You can eval your result by:

python3 bin/evaluate_predicts.py \
config=$ConfigPath \
datadir=$DataPath \
predictdir=$OutPath \
outpath=$OutMetricPath

Pretrained Models & Dataset

The pretrained model and Cityscapes-BF dataset can be found there:

https://share.weiyun.com/7ShuPa2Y

For KITTI360-BF, please follow the instruction of FisheyeEX.

Citation

If you find our paper or repo useful, please consider citing our paper:

@inproceedings{shi2023fishdreamer,
title={FishDreamer: Towards Fisheye Semantic Completion via Unified Image Outpainting and Segmentation},
author={Shi, Hao and Li, Yu and Yang, Kailun and Zhang, Jiaming and Peng, Kunyu and Roitberg, Alina and Ye, Yaozu and Ni, Huajian and Wang, Kaiwei and Stiefelhagen, Rainer},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={6433--6443},
year={2023}
}

Acknowledgement

This project would not have been possible without the following outstanding repositories:

LaMa, MMSegmentation

Devs

Hao Shi, Yu Li

Contact

Feel free to contact me if you have additional questions or have interests in collaboration. Please drop me an email at [email protected]. =)

fishdreamer's People

Contributors

masterhow avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.