2.0.0-SNAPSHOT Update #12

Merged
merged 73 commits into from
Apr 24, 2019


RubbaBoy
Member

Changed basically everything about how the OCR works, including proper character detection, support for other fonts, a much cleaner API, Travis CI support, and more. This will be promoted to a release once MS Paint IDE is complete, which should help find any outstanding bugs that may still be present.

…head. Lower resolution fonts will now be less accurate/not recommended, at the benefit of a most likely significant speed boost.
… things like apostrophe data, starting to make the OCR work again
…testing with natural images still needs to be done
…some Javadocs updates and API improvements, and some light optimizations.
… OCR results across all characters, though more prevalent on special characters
… it though.

The main breaking point currently is punctuation; all alphanumeric characters are perfect with 100% accuracy on the training set.
A code cleanup will be done in the future; I just need to come back to a working state tomorrow.
A code cleanup is very much needed, and the code is not ready for production.
Dot detection and horizontal separation may need to be implemented
in the future for more accurate overall tracking.
Currently set to allow only one character mismatch, but this will be fine-tuned soon for everything
…cess fully automatic and fix 4 problematic characters
…ngs to training options and added more documentation
…most fonts

Check the TODO for a more in-depth explanation of plans
The current method may need to be reworked though, to first detect the characters and then piece them together, similar to apostrophes.
Still need to clean up code, and add tests
RubbaBoy added 28 commits March 30, 2019 18:18
…rate than before

There are a few oddities that I need to fix, and after adding tests and maybe adding to the API a bit, it should be ready for the v2 release
Now can proceed to adding more tests and improving API
This is a separate commit because there was a LOT of stuff to remove and it's easy to accidentally delete something (I had to dig for some stuff accidentally deleted while testing this commit)
Worked fine with Monospaced but not with Comic Sans (referred to as CMS in the code, for anyone wondering), then I """fixed""" some stuff and it switched. It now passes tests, but both fonts have some things to touch up. Also adds new tests and the ability to easily add more. Still riddled with debug messages, but that will be fixed (hopefully) soon.
Forgot what font I was using as a demo so I'm using Monospaced again
…uning of things like calculated spacing and look-alike characters

Docs and examples coming soon
… for more dynamic similarities based off of configurations instead of hardcoded classes
…e config

The only issues I've seen while testing are _ being detected as - in Monospaced, and two i's being detected as | in Comic Sans
I decided not to merge the classes: the two abstract classes that all
characters extend clean up their code heavily and still allow for the
necessary data separation.

This also cleans up the code in general a bunch; among other changes, tests now require a success rate of 98% or higher.
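
A 98% success-rate requirement like the one mentioned above could be sketched roughly as follows. This is only an illustration, not the project's actual test code: the class name `AccuracyCheck`, the per-character comparison, and the `MIN_ACCURACY` constant are all assumptions made for the example.

```java
// Hypothetical sketch: none of these names come from the project itself.
final class AccuracyCheck {
    // Assumed threshold, taken from the 98% figure in the commit message.
    static final double MIN_ACCURACY = 0.98;

    /** Fraction of positions where actual matches expected, over the longer length. */
    static double accuracy(String expected, String actual) {
        int longer = Math.max(expected.length(), actual.length());
        if (longer == 0) {
            return 1.0; // two empty strings match trivially
        }
        int shared = Math.min(expected.length(), actual.length());
        int matches = 0;
        for (int i = 0; i < shared; i++) {
            if (expected.charAt(i) == actual.charAt(i)) {
                matches++;
            }
        }
        // Missing or extra characters count as mismatches.
        return (double) matches / longer;
    }

    /** True when the scanned text meets the assumed minimum accuracy. */
    static boolean passes(String expected, String actual) {
        return accuracy(expected, actual) >= MIN_ACCURACY;
    }

    public static void main(String[] args) {
        String expected = "The quick brown fox jumps over the lazy dog";
        // Simulate the i -> | confusion described for Comic Sans.
        String scanned = expected.replace('i', '|');
        System.out.println("accuracy = " + accuracy(expected, scanned));
        System.out.println("passes   = " + passes(expected, scanned));
    }
}
```

A position-by-position comparison is the simplest possible metric; it penalizes a single dropped character heavily, so a real test suite might prefer an edit-distance-based score instead.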
Once testing is done and I have some people review it, it is release-ready.
A little bit of code cleanup is needed
I'm not really sure if this is how Travis works with Java projects, so this may require multiple commits to finish
This was 69 commits squashed into this single one :(
@RubbaBoy RubbaBoy self-assigned this Apr 24, 2019
@RubbaBoy RubbaBoy merged commit 7de9626 into master Apr 24, 2019
1 participant