"A Hierarchical Approach for Generating Descriptive Image Paragraphs" - Paper Reading

Motivation

Previous image captioning work usually generates a single high-level sentence to describe the whole image, which limits both the amount and the detail of the information conveyed. Dense captioning describes many individual regions, but the resulting captions lack coherence as a whole.

Data

Images annotated with paragraph descriptions: 19,551 image-paragraph pairs.

Model

Region Detector + Region Pooling + Hierarchical Recurrent Network
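
To make the three-stage pipeline more concrete, here is a minimal sketch in plain Python. It is an illustration under stated assumptions, not the paper's implementation: the function names (`project`, `pool_regions`, `generate_paragraph`) are hypothetical, the pooling is assumed to be an affine projection followed by an elementwise max over regions, and the stopping rule (emit the current sentence, then stop once the stop probability exceeds a threshold) is likewise an assumption. The region detector and the two recurrent networks are replaced by toy stand-ins.

```python
def project(region, weights, bias):
    """Affine projection of one region feature vector: W @ v + b."""
    return [sum(w * x for w, x in zip(row, region)) + b
            for row, b in zip(weights, bias)]


def pool_regions(regions, weights, bias):
    """Assumed pooling: elementwise max over the projected region vectors."""
    projected = [project(r, weights, bias) for r in regions]
    return [max(column) for column in zip(*projected)]


def generate_paragraph(pooled, sentence_step, word_step,
                       max_sentences=6, max_words=50, stop_threshold=0.5):
    """Two-level decoding loop.

    sentence_step(pooled, state) -> (new_state, topic, p_stop)
    word_step(topic, max_words)  -> list of words for one sentence
    Both callables are hypothetical stand-ins for the recurrent networks.
    """
    paragraph = []
    state = None
    for _ in range(max_sentences):
        state, topic, p_stop = sentence_step(pooled, state)
        paragraph.append(word_step(topic, max_words))
        # Assumed convention: a high stop probability marks the last sentence.
        if p_stop > stop_threshold:
            break
    return paragraph


# Toy stand-ins so the sketch runs end to end (not learned models).
def toy_sentence_step(pooled, state):
    step = 0 if state is None else state + 1
    topic = [x + step for x in pooled]
    p_stop = 1.0 if step >= 1 else 0.0
    return step, topic, p_stop


def toy_word_step(topic, max_words):
    return ["w%d" % round(x) for x in topic][:max_words]


# Three fake 2-d region features, as a region detector might return.
regions = [[1.0, 0.0], [0.0, 2.0], [0.5, 0.5]]
identity = [[1.0, 0.0], [0.0, 1.0]]
pooled = pool_regions(regions, identity, [0.0, 0.0])
paragraph = generate_paragraph(pooled, toy_sentence_step, toy_word_step)
print(pooled)     # [1.0, 2.0]
print(paragraph)  # [['w1', 'w2'], ['w2', 'w3']]
```

The point of the split is that the sentence level only decides how many sentences to write and what each one is about, while the word level turns each topic into words. That separation is what lets the output read as one paragraph rather than a set of unrelated region captions.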