The model introduced in this post is for text classification: it takes text as input and outputs the class the text belongs to. To do this, we need class-labelled data and an NLP model.
The data was provided by a client rather than collected or preprocessed by me. It looks as follows.
RNNs, LSTMs, and GRUs are widely used models for natural language processing. In particular, LSTMs and GRUs are preferred over the vanilla RNN because they retain information from earlier in the sequence much better. In this project I used a GRU, since it performed better than an LSTM. However, the GRU alone did not reach the performance I wanted, so I came up with several strategies to improve it.
- First, feed the text to the GRU in the reverse direction as well as the forward direction. I expected that information lost in the forward pass could be recovered from the backward pass.
- Second, use all of the GRU's outputs, not just the last one. The original code made predictions from the final output alone, and I expected the remaining outputs to add useful signal.
- Finally, extend the second idea by weighting the outputs, so that the model can pay more attention to the important parts.
To implement this, I used the bidirectional option of PyTorch's GRU together with an attention mechanism. An overview of the model is as follows.
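For concreteness, here is a minimal sketch of how a bidirectional GRU with attention pooling can be wired up in PyTorch. The class name, layer sizes, and hyperparameters are my own illustrative choices, not the project's actual code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GRUAttentionClassifier(nn.Module):
    """Bidirectional GRU encoder followed by attention pooling over all outputs."""

    def __init__(self, vocab_size, embed_dim=128, hidden_dim=128, num_classes=2):
        super().__init__()
        self.embedding = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        # bidirectional=True runs the GRU over the text in both directions
        self.gru = nn.GRU(embed_dim, hidden_dim, batch_first=True,
                          bidirectional=True)
        # Scores each time step so important tokens can be weighted more heavily
        self.attn = nn.Linear(2 * hidden_dim, 1)
        self.fc = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, x):
        # x: (batch, seq_len) token ids
        emb = self.embedding(x)                  # (batch, seq_len, embed_dim)
        outputs, _ = self.gru(emb)               # (batch, seq_len, 2*hidden_dim)
        scores = self.attn(outputs).squeeze(-1)  # (batch, seq_len)
        weights = F.softmax(scores, dim=1)       # attention weights over tokens
        # Weighted sum over all GRU outputs, not just the last one
        context = torch.bmm(weights.unsqueeze(1), outputs).squeeze(1)
        return self.fc(context), weights

# Quick smoke test with random token ids (shapes only, not real data)
model = GRUAttentionClassifier(vocab_size=10000)
logits, attn = model(torch.randint(1, 10000, (4, 20)))
print(logits.shape, attn.shape)  # torch.Size([4, 2]) torch.Size([4, 20])
```

Returning the attention weights alongside the logits makes the later visualization step straightforward.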
The training results are as follows.
The validation accuracy is 81%. I also visualized the attention weights to check that they behave sensibly.
Looking at the figure above, you can see that the model assigns noticeably higher attention weights to certain important words.
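For reference, attention weights like these can be visualized as a simple per-token heatmap. The tokens and weights below are made up for illustration; in practice they would come from the model's forward pass:

```python
import matplotlib.pyplot as plt
import torch

def plot_attention(tokens, weights):
    """Show each token's attention weight as a single-row heatmap."""
    fig, ax = plt.subplots(figsize=(len(tokens) * 0.6, 1.5))
    ax.imshow(weights.unsqueeze(0).detach().numpy(), cmap="Reds", aspect="auto")
    ax.set_xticks(range(len(tokens)))
    ax.set_xticklabels(tokens, rotation=45, ha="right")
    ax.set_yticks([])
    plt.tight_layout()
    plt.show()

# Hypothetical example: darker cells mark tokens the model attends to more
tokens = ["this", "movie", "was", "absolutely", "great"]
weights = torch.tensor([0.05, 0.10, 0.05, 0.45, 0.35])
plot_attention(tokens, weights)
```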