Dialog State Tracking Challenge 5 (DSTC5)

Neural Dialog State Tracker for Large Ontologies by Attention Mechanism

Getting started

Version info

theano 0.8.2
keras 1.0.6
python 2.7

STEP 0: Setting the DSTC 5 data

Make a directory 'data', then put the DSTC5 data set in 'data'.

STEP 1: Generate data for training

Generate the training data.

$ python data_generator.py

After then, there will be created 5 pkl files.

dstc5_general.pkl: dumping the slot, slot value vectors
dstc5_train.pkl: dumping the training data
dstc5_dev.pkl: dumping the validation data
dstc5_dev_acc.pkl: dumping the accumulated validation data (accumulate the previous utterance)
dstc5_test_acc.pkl: dumping the accumulated test data (accumulate the previous utterance)

STEP 2: Training

It takes 7 arguments. Example of the implementing code is like below.

$ python train.py -l 100 -lr 0.005 -e 100

-l: the number of lstm units (default = 100)
-lr: learning rate (default = 0.005)
-dr1: first dropout parameter (default = 0)
-dr2: second dropout parameter (default = 0)
-e: the number of epoch (default = 300)
-t: type of dstc (4 or 5, default = 5)
-c: criteria of finding threshold (accuracy or fscore, default = accuracy)

After then, there will be created weight file with named 'dstc5_lstm#l_lr#lr_dr#dr1_#dr2.h5' (ex. dstc5_lstm100_lr005_dr0_0.h5)

STEP 3: Predict with finding threshold

It takes 7 arguments. Example of the implementing code is like below.

$ python predict.py -l 100 -lr 0.005 -e 100 -th

These arguments are same with STEP 2.

-l: the number of lstm units (default = 100)
-lr: learning rate (default = 0.005)
-dr1: first dropout parameter (default = 0)
-dr2: second dropout parameter (default = 0)
-e: the number of epoch (default = 300)
-t: type of dstc (4 or 5, default = 5)
-c: criteria of finding threshold (accuracy or fscore, default = accuracy)

One more argumets here.

-th: to decide the threshold and make a file for threshold (default is no -th)
(You have to add -th when you implement predict.py first time.)

After then, there will be created 2 json files like below.

dev_dstc5_lstm100_lr005_dr0_0_accuracy.json
test_dstc5_lstm100_lr005_dr0_0_accuracy.json

STEP 4: Make a result

For validation result,

$ bash dev_run.sh dev_dstc5_lstm100_lr005_dr0_0_accuracy.json

For test result,

$ bash test_run.sh test_dstc5_lstm100_lr005_dr0_0_accuracy.json

Then, you can see the result and there will be created the result files.

dev_dstc5_lstm100_lr005_dr0_0_accuracy.score.csv
test_dstc5_lstm100_lr005_dr0_0_accuracy.score.csv

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
scripts		scripts
README.md		README.md
data_generator.py		data_generator.py
dev4_run.sh		dev4_run.sh
dev_run.sh		dev_run.sh
json_formatter.py		json_formatter.py
json_formatter.pyc		json_formatter.pyc
model.py		model.py
model.pyc		model.pyc
predict.py		predict.py
test3100features_0minwords_10context		test3100features_0minwords_10context
test3100features_0minwords_10context.syn0.npy		test3100features_0minwords_10context.syn0.npy
test3100features_0minwords_10context.syn1neg.npy		test3100features_0minwords_10context.syn1neg.npy
test4_run.sh		test4_run.sh
test_run.sh		test_run.sh
train.py		train.py
utils.py		utils.py
utils.pyc		utils.pyc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Dialog State Tracking Challenge 5 (DSTC5)

Neural Dialog State Tracker for Large Ontologies by Attention Mechanism

Getting started

About

Releases

Packages

Languages

oceanos74/DSTC5

Folders and files

Latest commit

History

Repository files navigation

Dialog State Tracking Challenge 5 (DSTC5)

Neural Dialog State Tracker for Large Ontologies by Attention Mechanism

Getting started

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages