GithubHelp home page GithubHelp logo

xlnn / gcc-net Goto Github PK

View Code? Open in Web Editor NEW

This project forked from ixiaohuihuihui/gcc-net

0.0 0.0 0.0 27.11 MB

License: Apache License 2.0

Shell 0.05% C++ 0.01% Python 99.88% Cuda 0.04% Dockerfile 0.02%

gcc-net's Introduction

✨✨✨✨✨✨✨✨NEWS✨✨✨✨✨✨✨✨

Our paper has been published in Pattern Recognition.

  1. For future researchers to develop better algorithms in the field of underwater object detection, we upload all the detection results, the download link is https://pan.baidu.com/s/1UMQx1qjoSZ9X8d_EGnE1rQ (Code: adrh). We plot the ground truth and detection results on one image. From these test results, you can understand the advantages and weaknesses of our method. Welcome New Work on UOD!
  2. We upload the test model of the proposed method, you can reproduce the results of our paper. The download link is https://pan.baidu.com/s/1avBWJJm2sTLPoz9tl0eKWw (Code: aert).

Introduction

A Gated Cross-domain Collaborative Network for Underwater Object Detection (https://arxiv.org/pdf/2306.14141.pdf).

Underwater object detection (UOD) plays a significant role in aquaculture and marine environmental protection. Considering the challenges posed by low contrast and low-light conditions in underwater environments, several underwater image enhancement (UIE) methods have been proposed to improve the quality of underwater images. However, only using the enhanced images does not improve the performance of UOD, since it may unavoidably remove or alter critical patterns and details of underwater objects. In contrast, we believe that exploring the complementary information from the two domains is beneficial for UOD. The raw image preserves the natural characteristics of the scene and texture information of the objects, while the enhanced image improves the visibility of underwater objects. Based on this perspective, we propose a Gated Cross-domain Collaborative Network (GCC-Net) to address the challenges of poor visibility and low contrast in underwater environments, which comprises three dedicated components. Firstly, a real-time UIE method is employed to generate enhanced images, which can improve the visibility of objects in low-contrast areas. Secondly, a cross-domain feature interaction module is introduced to facilitate the interaction and mine complementary information between raw and enhanced image features. Thirdly, to prevent the contamination of unreliable generated results, a gated feature fusion module is proposed to adaptively control the fusion ratio of cross-domain information. Our method presents a new UOD paradigm from the perspective of cross-domain information interaction and fusion. Experimental results demonstrate that the proposed GCC-Net achieves state-of-the-art performance on four underwater datasets. image image

Download Dataset

Modify the dataset path in your project

for example, the path in 'configs'.

Train

1. For DUO dataset

CUDA_VISIBLE_DEVICES=2,3 ./tools/dist_train.sh configs/autoassign/autoassign_r50_fpn_8x2_3x_gcc_duo.py 2

2. For TrashCan dataset

CUDA_VISIBLE_DEVICES=2,3 ./tools/dist_train.sh configs/autoassign/autoassign_r50_fpn_8x2_3x_gcc_trashcan.py 2

3. For WPBB dataset

CUDA_VISIBLE_DEVICES=2,3 ./tools/dist_train.sh configs/autoassign/autoassign_r50_fpn_8x2_3x_gcc_wpbb.py 2

4. For Brackish dataset

CUDA_VISIBLE_DEVICES=2,3 ./tools/dist_train.sh configs/autoassign/autoassign_r50_fpn_8x2_3x_gcc_wpbb.py 2

Test

For main results, I have provided the currently trained weight file, which can be downloaded directly for testing.

CUDA_VISIBLE_DEVICES=2 python tools/test.py configs/autoassign/autoassign_r50_fpn_8x2_3x_gcc_duo.py weights/gcc_net_epoch_36.pth --eval bbox

Some Results

image image

Installation

Please refer to install.md for installation guide.

Get Started

Please see get_started.md for the basic usage of MMDetection.

Acknowledgement

MMDetection is an open source project that is contributed by researchers and engineers from various colleges and companies. We appreciate all the contributors who implement their methods or add new features, as well as users who give valuable feedbacks. We wish that the toolbox and benchmark could serve the growing research community by providing a flexible toolkit to reimplement existing methods and develop their own new detectors

gcc-net's People

Contributors

ixiaohuihuihui avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.