yao11970 / hacamodel Goto Github PK
View Code? Open in Web Editor NEWThis project forked from chitwansaharia/hacamodel
Implementation of "Watch, Listen, and Describe: Globally and Locally Aligned Cross-Modal Attentions for Video Captioning" (https://arxiv.org/abs/1804.05448)