This repository provides an implementation of a spell checker built with NLTK and NumPy. The project is designed to efficiently detect and correct spelling errors in text using probabilistic models and distance-based algorithms. It is split into two Python source files: one dedicated to the Levenshtein edit distance, and the other applying the spelling correction itself using preprocessing, the Levenshtein edit distance, and a naive probability approach.
- Find misspelled words in the query
- Compute the edit distance between each misspelled word and each term in the vocabulary
- Store the candidate corrections
- Compute the probability of each candidate
- Pick the candidate with the highest probability
- Replace the misspelled term with the chosen candidate
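The steps above can be sketched end to end in plain Python. This is an illustrative mock-up, not the repository's actual code: the vocabulary, the distance threshold, and the unigram counts are assumptions made for the example.

```python
from collections import Counter

def levenshtein_distance(source, target):
    # Classic dynamic-programming edit distance.
    m, n = len(source), len(target)
    d = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        d[i][0] = i
    for j in range(n + 1):
        d[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if source[i - 1] == target[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,          # deletion
                          d[i][j - 1] + 1,          # insertion
                          d[i - 1][j - 1] + cost)   # substitution
    return d[m][n]

def correct_query(query, corpus_words, max_distance=2):
    # corpus_words is an assumed list of words the model was "trained" on.
    vocabulary = set(corpus_words)
    counts = Counter(corpus_words)
    total = sum(counts.values())
    corrected = []
    for word in query.split():
        if word in vocabulary:              # step 1: detect misspellings
            corrected.append(word)
            continue
        # steps 2-3: collect candidates within the distance threshold
        candidates = [t for t in vocabulary
                      if levenshtein_distance(word, t) <= max_distance]
        if candidates:
            # steps 4-5: naive unigram probability, keep the most probable
            best = max(candidates, key=lambda t: counts[t] / total)
            corrected.append(best)          # step 6: replace the term
        else:
            corrected.append(word)
    return " ".join(corrected)
```
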
```python
from spelling_correction.Levenshtein import levenshtein

edit_distance_calculator = levenshtein(source="play", target="stay")
levenshtein_matrix = edit_distance_calculator.distance_matrix
```
Levenshtein.py
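For reference, a distance matrix of this kind is typically built with the standard dynamic-programming recurrence. The NumPy sketch below shows that standard construction; it is not necessarily identical to the repository's implementation.

```python
import numpy as np

def levenshtein_matrix(source, target):
    # Build the full (len(source)+1) x (len(target)+1) DP matrix;
    # the bottom-right cell holds the edit distance.
    m, n = len(source), len(target)
    d = np.zeros((m + 1, n + 1), dtype=int)
    d[:, 0] = np.arange(m + 1)   # cost of deleting every source prefix
    d[0, :] = np.arange(n + 1)   # cost of inserting every target prefix
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            cost = 0 if source[i - 1] == target[j - 1] else 1
            d[i, j] = min(d[i - 1, j] + 1,          # deletion
                          d[i, j - 1] + 1,          # insertion
                          d[i - 1, j - 1] + cost)   # substitution
    return d
```

For `source="play"` and `target="stay"`, the bottom-right cell of the matrix is 2 (two substitutions).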
```python
from spelling_correction.SpellingCorrector import SpellCorrector

query = "Iranin financal banks are strongss"
corrector = SpellCorrector(string=query)
corrected_query = corrector.retrive_corrected()
```
SpellingCorrector.py
It could be interesting to implement different probability computations in the `__compute_probabilities__(self)` method of the `SpellCorrector` class; the current implementation is a very naive way of computing word probabilities. I suggest trying other approaches, such as Kernighan's noisy channel model.
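As a starting point, a noisy-channel score combines a language-model prior with an error-model likelihood: P(candidate | misspelled) ∝ P(misspelled | candidate) · P(candidate). The sketch below is purely illustrative — the function names are hypothetical, and the error model is a crude placeholder (a real Kernighan-style implementation would estimate edit probabilities from confusion matrices of observed typos).

```python
import math
from collections import Counter

def edit_distance(a, b):
    # Row-by-row Levenshtein distance.
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                 # deletion
                           cur[j - 1] + 1,              # insertion
                           prev[j - 1] + (ca != cb)))   # substitution
        prev = cur
    return prev[-1]

def noisy_channel_score(misspelled, candidate, counts, total, char_error_prob=0.001):
    # Language model: smoothed unigram prior P(candidate).
    prior = (counts[candidate] + 1) / (total + len(counts))
    # Error model (placeholder): likelihood decays with edit distance.
    likelihood = char_error_prob ** edit_distance(misspelled, candidate)
    return math.log(prior) + math.log(likelihood)

# Toy usage with made-up counts:
counts = Counter({"financial": 50, "fanatical": 5})
best = max(counts, key=lambda c: noisy_channel_score("financal", c, counts,
                                                     sum(counts.values())))
```
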
Emilio Garzia, 2024