This repository contains a sequence-to-sequence model that automatically generates captions for images.
The solution architecture consists of two modules (a sketch of both follows this list):
- A CNN encoder, which encodes each image into an embedded feature vector.
- A decoder, a sequential neural network built from LSTM units, which translates the feature vector into a sequence of caption tokens.
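A minimal sketch of such an encoder-decoder pair in PyTorch is shown below. Class names, layer sizes, and the choice of a pretrained ResNet-50 backbone are illustrative assumptions, not necessarily the repository's actual implementation:

```python
import torch
import torch.nn as nn
import torchvision.models as models


class EncoderCNN(nn.Module):
    """Encodes an image into a fixed-size embedded feature vector (assumed design)."""

    def __init__(self, embed_size):
        super().__init__()
        # Pretrained ResNet-50 backbone with its classification head removed.
        resnet = models.resnet50(weights=models.ResNet50_Weights.DEFAULT)
        for param in resnet.parameters():
            param.requires_grad_(False)           # keep the backbone frozen
        self.backbone = nn.Sequential(*list(resnet.children())[:-1])
        self.embed = nn.Linear(resnet.fc.in_features, embed_size)

    def forward(self, images):
        features = self.backbone(images)          # (batch, 2048, 1, 1)
        features = features.flatten(start_dim=1)  # (batch, 2048)
        return self.embed(features)               # (batch, embed_size)


class DecoderRNN(nn.Module):
    """Translates an image feature vector into a sequence of caption tokens."""

    def __init__(self, embed_size, hidden_size, vocab_size, num_layers=1):
        super().__init__()
        self.word_embed = nn.Embedding(vocab_size, embed_size)
        self.lstm = nn.LSTM(embed_size, hidden_size, num_layers, batch_first=True)
        self.fc = nn.Linear(hidden_size, vocab_size)

    def forward(self, features, captions):
        # Drop the <end> token, embed the remaining caption tokens, and prepend
        # the image features as the first "word" of the input sequence.
        embeddings = self.word_embed(captions[:, :-1])
        inputs = torch.cat((features.unsqueeze(1), embeddings), dim=1)
        hiddens, _ = self.lstm(inputs)
        return self.fc(hiddens)                   # (batch, seq_len, vocab_size)
```

During training, the decoder is typically fed the ground-truth caption tokens (teacher forcing); at inference time, tokens are generated one at a time by feeding each predicted word back into the LSTM.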
The project has been reviewed and graded by Udacity, earning a "Meets Specifications" assessment against the following rubric.