Accomplishments
Dense Captioning of Images
- Abstract
Dense Image Captioning task describes the objects within an image by identifying them and their surroundings and establishing a relationship between them. The architecture comprises a Convolutional Neural Network (CNN) and a Recurrent Neural Network (RNN) language model that generates the captions. This project requires a system making use of computer vision to both find regions and describe them in natural language. The images are passed through a Convolutional network to identify the region features. These features then form the input for the Recurrent neural network, which generates the captions for the regions encompassing the relationships between the objects. Keywords: Natural Language Processing, Deep Learning, Machine Learning, Computer Vision