Sign in
Stacked Multimodal Attention Network for Context-Aware Video Captioning
Journal article   Peer reviewed

Stacked Multimodal Attention Network for Context-Aware Video Captioning

Yi Zheng, Yuejie Zhang, Rui Feng, Tao Zhang and Weiguo Fan
IEEE transactions on circuits and systems for video technology, Vol.32(1), pp.31-42
01/2022
DOI: 10.1109/TCSVT.2021.3058626

View Online

Abstract

Biological system modeling coarse-to-fine training Context modeling context-aware Decoding Feature extraction Predictive models reinforcement learning stacked multimodal attention network Training Video captioning Visualization

Details

Metrics