GithubHelp home page GithubHelp logo

gaslt's Introduction

Gloss Attention for Gloss-free Sign Language Translation

This is the official implementation of the GASLT paper.

Environment

git clone https://github.com/YinAoXiong/GASLT
cd GASLT
conda env create -f env.yaml
conda activate gaslt

Datasets

For the RWTH-PHOENIX-Weather 2014 T dataset, we provide processed data for download.

Since the public link will expire after a period of time, if the link expires, please contact me via email [email protected] to get a new access link.

For other datasets, please refer to the following steps for processing because we do not have permission to distribute them.

Step 1: Download the raw data:

Step 2: Extract visual features:

  • For the RWTH-PHOENIX-Weather 2014 T dataset, directly download the visual features extracted from the TSPNet project, and select the version with a window of 8 and a stride of 2.
  • For the CSL-Daily and SP-10 datasets, download the pre-trained I3D model weights and feature extraction code from the WLASL project, and extract features in a sliding window with a window of 8 and a stride of 2.

Step 3: Pack the dataset:

Follow the format of the slt project to package the visual features. Specifically, the python list object is first serialized using pickle and then gzip compressed.

Step 4: Calculate Similarity Labels

We use the distiluse-base-multilingual-cased-v1 model from the Sentence-Transformers project to calculate the similarity between texts.

Training and Testing

First, make sure that the structure under the project data folder is as follows,

data
└── pht
    ├── bpe
    │   ├── de.wiki.bpe.vs25000.d300.w2v.txt
    │   ├── de.wiki.bpe.vs25000.d300.w2v.txt.pt
    │   └── de.wiki.bpe.vs25000.model
    ├── data
    │   ├── phoenix14t.pami0.dev
    │   ├── phoenix14t.pami0.test
    │   └── phoenix14t.pami0.train
    └── sim
        ├── cos_sim.pkl
        └── name_to_video_id.json
    ... 

and then run the command to train the model.

python -m signjoey train configs/train_pht.yaml --gpu_id 0

Run the following command to test the model.

python -m signjoey test configs/test_pht.yaml  --ckpt <path_to_ckpt> --output_path <path_to_output> --gpu_id 0

Citation

If you find this project useful, please cite our paper:

@inproceedings{yin2023gloss,
  title={Gloss attention for gloss-free sign language translation},
  author={Yin, Aoxiong and Zhong, Tianyun and Tang, Li and Jin, Weike and Jin, Tao and Zhao, Zhou},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={2551--2562},
  year={2023}
}

Acknowledgements

Our codes are based on the following repos:

gaslt's People

Contributors

yinaoxiong avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.