A Sequence-to-Sequence model architecture which can automatically generate captions from images.

punit-bhatt/image-captioning

Image Captioning

Udacity Computer Vision Nanodegree

Introduction

This repository contains a sequence-to-sequence model that automatically generates captions for images.

Network Architecture

The solution architecture consists of:

  1. A CNN encoder, which encodes each image into an embedded feature vector.

  2. A decoder, a sequential neural network built from LSTM units, which translates the feature vector into a sequence of tokens.
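
The two components above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it assumes PyTorch, the class names `EncoderCNN` and `DecoderRNN` are hypothetical, and a tiny convolutional stack stands in for whatever CNN the project really uses. Feeding the image embedding to the LSTM as its first input is one common design choice and is likewise an assumption here.

```python
import torch
import torch.nn as nn


class EncoderCNN(nn.Module):
    """Toy CNN (stand-in for a real backbone): image batch -> one embedding per image."""

    def __init__(self, embed_size):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.Conv2d(16, 32, kernel_size=3, stride=2, padding=1),
            nn.ReLU(),
            nn.AdaptiveAvgPool2d(1),  # -> (batch, 32, 1, 1)
        )
        self.embed = nn.Linear(32, embed_size)

    def forward(self, images):
        x = self.features(images).flatten(1)  # (batch, 32)
        return self.embed(x)  # (batch, embed_size)


class DecoderRNN(nn.Module):
    """LSTM decoder: image embedding (+ previous tokens) -> scores over the vocabulary."""

    def __init__(self, embed_size, hidden_size, vocab_size):
        super().__init__()
        self.word_embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features, captions):
        # Training pass (teacher forcing): the image embedding is the first
        # input, followed by every caption token except the last one.
        tokens = self.word_embed(captions[:, :-1])  # (batch, T-1, embed)
        inputs = torch.cat([features.unsqueeze(1), tokens], dim=1)  # (batch, T, embed)
        hidden, _ = self.lstm(inputs)
        return self.fc(hidden)  # (batch, T, vocab_size)

    @torch.no_grad()
    def sample(self, features, max_len=20):
        # Greedy decoding: feed the most likely token back in at each step.
        inputs, state, out = features.unsqueeze(1), None, []
        for _ in range(max_len):
            hidden, state = self.lstm(inputs, state)
            token = self.fc(hidden.squeeze(1)).argmax(dim=1)  # (batch,)
            out.append(token)
            inputs = self.word_embed(token).unsqueeze(1)
        return torch.stack(out, dim=1)  # (batch, max_len)
```

With random inputs, `DecoderRNN(64, 128, 100)(EncoderCNN(64)(torch.randn(2, 3, 64, 64)), torch.randint(0, 100, (2, 12)))` yields logits of shape `(2, 12, 100)`: one score per vocabulary entry for each of the 12 caption positions. The token ids returned by `sample` would still need to be mapped back to words through the vocabulary, and a real decoder would usually stop at an end-of-sentence token rather than always running for `max_len` steps.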

The project has been reviewed and graded by Udacity as "Meets Specifications" against the project rubric.

Sample output:

[Image: sample output]
