Transforming spoken text to written text

This model converts raw ASR output from spoken form to written form (e.g., dates, numbers, IDs). It also supports formatting out-of-vocabulary words by using an external vocabulary.

Some examples:

input  : tám giờ chín phút ngày mười tám tháng năm năm hai nghìn không trăm hai mươi hai
output : 8h9 18/5/2022

input  : mã số quy đê tê tê đê hai tám chéo hai không không ba
output : mã số qdttd28/2003

input  : thể tích tám mét khối trọng lượng năm mươi ki lô gam
output : thể tích 8 m3 trọng lượng 50 kg

input    : ngày hai tám tháng tư cô vít bùng phát ở sờ cốt lờn chiếm tám mươi phần trăm là biến chủng đen ta và bê ta
ex_vocab : ['scotland', 'covid', 'delta', 'beta']
output   : 28/4 covid bùng phát ở scotland chiếm 80 % là biến chủng delta và beta
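
To make the input/output contract of the examples above concrete, here is a toy rule-based sketch. It is not the neural model from this repository and does not use its code: the names `toy_normalize`, `parse_number`, `DIGITS`, `ONES`, and `UNITS` are hypothetical, and the rules only cover numbers from 0 to 99 and three unit phrases.

```python
# Toy rule-based illustration of spoken-to-written conversion.
# NOT the repository's model; all names here are hypothetical.

# Vietnamese digit words.
DIGITS = {"không": 0, "một": 1, "hai": 2, "ba": 3, "bốn": 4,
          "năm": 5, "sáu": 6, "bảy": 7, "tám": 8, "chín": 9}

# Words allowed in the ones position after "mười"/"mươi",
# including the variant forms "mốt", "tư", and "lăm".
ONES = {word: value for word, value in DIGITS.items() if value != 0}
ONES.update({"mốt": 1, "tư": 4, "lăm": 5})

# A few multi-word unit phrases and their written forms.
UNITS = {("ki", "lô", "gam"): "kg",
         ("mét", "khối"): "m3",
         ("phần", "trăm"): "%"}


def parse_number(tokens, i):
    """Return (value, next_index) for a 0-99 number starting at i, else None."""
    tok = tokens[i]
    if tok == "mười":                      # 10..19
        value, j = 10, i + 1
    elif (tok in ONES and tok in DIGITS
          and i + 1 < len(tokens) and tokens[i + 1] == "mươi"):
        value, j = DIGITS[tok] * 10, i + 2  # 20, 30, ..., 90
    elif tok in DIGITS:                    # single digit
        return DIGITS[tok], i + 1
    else:
        return None
    # Optional ones digit after the tens part.
    if j < len(tokens) and tokens[j] in ONES:
        value, j = value + ONES[tokens[j]], j + 1
    return value, j


def toy_normalize(text):
    """Rewrite number words and known unit phrases; copy everything else."""
    tokens = text.split()
    out, i = [], 0
    while i < len(tokens):
        parsed = parse_number(tokens, i)
        if parsed is not None:
            value, i = parsed
            out.append(str(value))
            continue
        for phrase, symbol in UNITS.items():
            if tuple(tokens[i:i + len(phrase)]) == phrase:
                out.append(symbol)
                i += len(phrase)
                break
        else:
            out.append(tokens[i])
            i += 1
    return " ".join(out)


print(toy_normalize("thể tích tám mét khối trọng lượng năm mươi ki lô gam"))
```

Fixed rules like these break down quickly. For instance, "năm" means both "five" and "year", so the date in the first example cannot be resolved without context, and words outside the vocabulary such as "cô vít" need the external-vocabulary mechanism.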

Model architecture

Infer model

Try it out on Hugging Face Spaces

Contact

[email protected]

@INPROCEEDINGS{10094599,
  author={Nguyen, Thai-Binh and Nhat, Le Duc Minh and Nguyen, Quang Minh and Do, Quoc Truong and Luong, Chi Mai and Waibel, Alexander},
  booktitle={ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, 
  title={AdapITN: A Fast, Reliable, and Dynamic Adaptive Inverse Text Normalization}, 
  year={2023},
  pages={1-5},
  keywords={Adaptation models;Runtime;Transforms;Signal processing;Natural language processing;Semiotics;Reliability;ASR;inverse text normalization;semiotic pharse;phonetization phrase},
  doi={10.1109/ICASSP49357.2023.10094599}}
