Real-world photo sequence question answering system (MemexQA). CVPR'18 and TPAMI'19
visual-question-answering vision-and-language multimodal-deep-learning multimodal-datasets multimodal-representation memex-question-answering memexqa-dataset
-
Updated
Jul 1, 2019 - Python