ysong10 / fashionformer

This project forked from xushilin1/fashionformer

Code for our ECCV-2022 work: Fashionformer A simple, effective and unified baseline for human fashion segmentation and recognition

License: Apache License 2.0

Shell 0.54% Python 99.46%

fashionformer's Introduction

Fashionformer (ECCV 2022): Video, Poster

A simple, effective and unified baseline for human fashion segmentation and recognition (ECCV 2022)

Shilin Xu*, Xiangtai Li*, Jingbo Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao.

Figure

Introduction

We present a simple, effective, and unified baseline for fashion segmentation and attribute recognition. As the figure below shows, the whole architecture follows an encoder-decoder design, similar to DETR.

This codebase also contains the implementation of MaskAttribute-RCNN.

Figure

Fashionformer achieves new state-of-the-art results on three fashion segmentation datasets.

Requirements

We build on the OpenMMLab codebase and rely on specific versions of mmdetection and mmcv. To run this code, make sure both mmcv and mmdet are installed in your environment.

  • Python=3.8.13, CUDA=11.1
  • PyTorch=1.9.0, torchvision=0.10.0
  • mmcv==1.3.18 (full version; requires the CUDA extensions)
  • mmdet==2.18.0
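
Because the code depends on these exact versions, it can help to check the environment before training. The snippet below is a minimal sketch, not part of this repository: `REQUIRED`, `installed_version`, and `find_mismatches` are illustrative names, and the distribution names are assumptions (the full mmcv 1.x build is published on PyPI as `mmcv-full`). Local build tags such as `+cu111` are ignored when comparing.

```python
from importlib import metadata

# Pins copied from the list above. The distribution names are assumptions:
# the full mmcv 1.x build is published on PyPI as "mmcv-full".
REQUIRED = {
    "torch": "1.9.0",
    "torchvision": "0.10.0",
    "mmcv-full": "1.3.18",
    "mmdet": "2.18.0",
}


def installed_version(dist_name):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(required, lookup=installed_version):
    """Map each missing or differently versioned package to (wanted, found)."""
    mismatches = {}
    for name, wanted in required.items():
        found = lookup(name)
        # Drop local build tags such as "+cu111" before comparing.
        if found is None or found.split("+")[0] != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for name, (wanted, found) in find_mismatches(REQUIRED).items():
        print(f"{name}: expected {wanted}, found {found}")
```

If the script prints nothing, every pinned package was found at the expected version.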

Dataset

Fashionpedia Dataset

Images

Annotations

Detection: apparel object instance segmentation with localized attributes prediction:

Global attributes prediction:

path/to/Fashionpedia/
├── annotations/  # annotation json files
│   ├── attributes_train2020.json
│   ├── attributes_val2020.json
│   ├── instances-attributes_train2020.json
│   └── instances-attributes_val2020.json
├── train/
└── test/
│   ├── train2017/    # train images
│   ├── val2017/      # val images
│   └── test2017/     # test images
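
A quick way to catch a misplaced download is to check the layout before launching a job. The sketch below is illustrative only: `EXPECTED` and `missing_entries` are hypothetical names, and the list covers just the annotation files and the `train/` and `test/` directories named in the tree above.

```python
from pathlib import Path

# Entries taken from the tree above, relative to the dataset root.
EXPECTED = [
    "annotations/attributes_train2020.json",
    "annotations/attributes_val2020.json",
    "annotations/instances-attributes_train2020.json",
    "annotations/instances-attributes_val2020.json",
    "train",
    "test",
]


def missing_entries(root):
    """Return the expected files or directories that do not exist under root."""
    root = Path(root)
    return [entry for entry in EXPECTED if not (root / entry).exists()]


if __name__ == "__main__":
    # Replace with the actual dataset location.
    for entry in missing_entries("path/to/Fashionpedia"):
        print(f"missing: {entry}")
```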

ModaNet

Please see this link for details.

DeepFashion

Please use the default mmdetection settings.

Training and Testing

Training

# for single machine 
./tools/dist_train.sh $config $num_gpu
# for multi machine with slurm
./tools/slurm_train.sh $partition $job_name $config $work_dir

Testing

# for single machine 
./tools/dist_test.sh $config $checkpoint $num_gpu --eval segm
# for multi machine with slurm
./tools/slurm_test.sh $partition $job_name $config $checkpoint --eval segm
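
When sweeping over several configs, it can be convenient to assemble these single-machine commands from Python. The following is a rough sketch under stated assumptions: `train_command` and `eval_command` are hypothetical helpers, not part of this repository, and the paths in the usage example are placeholders. The argument order mirrors the shell commands above.

```python
import shlex


def train_command(config, num_gpu):
    """Argument list for the single-machine training script."""
    return ["./tools/dist_train.sh", str(config), str(num_gpu)]


def eval_command(config, checkpoint, num_gpu, metric="segm"):
    """Argument list for the single-machine testing script."""
    return [
        "./tools/dist_test.sh",
        str(config),
        str(checkpoint),
        str(num_gpu),
        "--eval",
        metric,
    ]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    cmd = eval_command("configs/example.py", "work_dirs/example/latest.pth", 8)
    print(shlex.join(cmd))
    # To launch from the repository root: subprocess.run(cmd, check=True)
```

Returning an argument list rather than a single string avoids shell-quoting problems when a path contains spaces.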

Demo Visualization

python demo/image_demo.py $img $config $checkpoint

Trained Model

We provide the configs to reproduce Fashionformer and Mask-Attributes Mask-RCNN.

Fashionpedia

Fashionformer checkpoints: OneDrive and Baidu Yun (access code: uvlc).

Acknowledgement

Our codebase is built on K-Net and mmdetection. Many thanks for their open-source code. In particular, we extend the K-Net kernel prediction head with an extra attribute query prediction, which yields a two-stream query (kernel) prediction framework.

Citation

If you find this repo useful for your research, please consider citing our paper:

@article{xu2022fashionformer,
  title={Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition},
  author={Xu, Shilin and Li, Xiangtai and Wang, Jingbo and Cheng, Guangliang and Tong, Yunhai and Tao, Dacheng},
  journal={ECCV},
  year={2022}
}
