Robotics Engineering Practicum Presentation
Image Captioning and Text Generation using Deep Reinforcement Learning
Wednesday, May 12, 2021
2:30 PM - 3:30 PM
Abstract: Captioning an image in natural language is the task of explaining the contents of the image. As a result, an algorithm is required not just to model the relationships between visual and textual objects but also to produce both syntactically and semantically correct sets of sentences. Although Deep Learning architectures are capable of handling image captioning, they do have their own set of shortcomings. This work involves combining Deep Learning and Reinforcement Learning based architectures to tackle the complexities of image captioning like single caption generation, uniqueness, and fidelity of the generated captions using a general dataset like the COCO Dataset and also more specific datasets. Further, the work presents the comparative analysis of these algorithms through their performance on the Text generation aspect of image captioning.
Professor Loris Fichera, RBE, WPI
Professor Berk Calli, RBE, WPI