Jie Luo, Yiyu Wang, Mengshi Qi
With the current exponential growth of video-based social networks, video retrieval using natural language is receiving ever-increasing attention. Most existing approaches tackle this task by extracting individual frame-level spatial features to represent ...
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC2021