GithubHelp home page GithubHelp logo

xiaogangpeng / tbiformer Goto Github PK

View Code? Open in Web Editor NEW
33.0 5.0 2.0 2.28 MB

Official implementation of paper "Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting".

Python 100.00%

tbiformer's Introduction

TBIFormer

Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting
Xiaogang Peng, Siyuan Mao, Zizhao Wu*

paper | Project | video visitor badge

News

  • [2023/2/28]: Our paper is accepted by CVPR 2023. Thanks to my collaborators!
  • [2023/4/20]: Initial code releases.

Abstract

Multi-person pose forecasting remains a challenging problem, especially in modeling fine-grained human body interaction in complex crowd scenarios. Existing methods typically represent the whole pose sequence as a temporal series, yet overlook interactive influences among people based on skeletal body parts. In this paper, we propose a novel Trajectory-Aware Body Interaction Transformer (TBIFormer) for multi-person pose forecasting via effectively modeling body part interactions. Specifically, we construct a Temporal Body Partition Module that transforms all the pose sequences into a Multi-Person Body-Part sequence to retain spatial and temporal information based on body semantics. Then, we devise a Social Body Interaction Self-Attention (SBI-MSA) module, utilizing the transformed sequence to learn body part dynamics for inter- and intra-individual interactions. Furthermore, different from prior Euclidean distance-based spatial encodings, we present a novel and efficient Trajectory-Aware Relative Position Encoding for SBI-MSA to offer discriminative spatial information and additional interactive clues. On both short- and long-term horizons, we empirically evaluate our framework on CMU-Mocap, MuPoTS-3D as well as synthesized datasets (6 ~ 10 persons), and demonstrate that our method greatly outperforms the state-of-the-art methods.

Overview

Pipeline:

Prediction Results:

Prepare Data

We mostly follow the preprocessing procedure of MRT for mixing dataset. Due to the mixing has random operations, we have uploaded the mixed dataset and others for your convenience and fair comparision. The datasets can be downloade from One Drive. Please prepare the data like this:

project_folder/
├── checkpoints/
│   ├── ...
├── data/
│   ├── Mocap_UMPM
│   │   ├── train_3_75_mocap_umpm.npy
│   │   ├── test_3_75_mocap_umpm.npy
│   │   ├── test_3_75_mocap_umpm_shuffled.npy
│   ├── MuPoTs3D
│   │   ├── mupots_150_2persons.npy
│   │   ├── mupots_150_3persons.npy
│   ├── mix1_6persons.npy
│   ├── mix2_10persons.npy
├── models/
│   ├── ...
├── utils/
│   ├── ...
├── train.py
├── test.py

Training

python train.py

Inference

python test.py

Requirements

PyTorch
Python
Python
Python
Python
Python
Python

Acknowledgement

Many thanks to the previous projects:

Related Projects:

  • iRPE (orignal code of piecewise index fuction)

Bibtex

If this work is helpful for your research, please consider citing the following BibTeX entry.

@InProceedings{peng2023trajectory,
        author    = {Peng, Xiaogang and Mao, Siyuan and Wu, Zizhao},
        title     = {Trajectory-Aware Body Interaction Transformer for Multi-Person Pose Forecasting},
        booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
        month     = {June},
        year      = {2023},
        pages     = {17121-17130}
}

License

Our code is under MIT protocol.

tbiformer's People

Contributors

xiaogangpeng avatar

Stargazers

sky_konkuk avatar XinDu avatar 5wh avatar Filip Anjou avatar  avatar Peng Xiao avatar  avatar kaijieshi avatar jsooCoverzlannn avatar Jeff Carpenter avatar  avatar Joy Fang avatar Xiong Lin avatar Jifeng Wang avatar Yayuan Li avatar  avatar  avatar jian shi avatar  avatar  avatar  avatar  avatar Julian Tanke avatar  avatar zhmchen avatar Xinxin Zhu avatar  avatar XiaoZhou avatar  avatar zou hongwei avatar SeanD avatar  avatar fan avatar

Watchers

Julian Tanke avatar Kostas Georgiou avatar Wentao Zhu avatar  avatar ilunye avatar

Forkers

bruinxiong danbmh

tbiformer's Issues

About Code

Great work!

May I ask when the Code and pretrained models of the paper will be will be made available?

Visualization Results

Hi,
Can you provide the code to visualize the predicted results of your method?
Thx

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.