adding more charsets #72

Open
danrossi opened this issue Sep 10, 2024 · 1 comment

Comments

@danrossi

I'm trying to figure out what these values are encoded in. Converting from UTF-8 to an int value does not reproduce the values returned by these methods. I can't work these values back to anything, and it is not documented what they are encoded in.

For example,

\xc3\x83

is converted to an int of 195

0xC3

But it is recorded as

0x1320

https://github.com/szatmary/libcaption/blob/develop/src/eia608_from_utf8.re2c#L60
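
One possible reading, offered as an assumption rather than something confirmed by the libcaption source quoted here: the recorded values are not Unicode code points but two-byte EIA-608 codes packed into a single integer. In the EIA-608 extended Portuguese/German character set, the byte pair 0x13 0x20 is commonly listed as Ã, which would match 0x1320. The sketch below decodes the UTF-8 bytes to a code point and, separately, splits the recorded value into two bytes. The helper names are hypothetical and are not part of libcaption.

```c
#include <stdint.h>
#include <stdio.h>

/* Hypothetical helper: decode a two-byte UTF-8 sequence (110xxxxx 10xxxxxx)
 * into its Unicode code point. No validation; for illustration only. */
static uint32_t utf8_decode_2byte(uint8_t b0, uint8_t b1)
{
    return ((uint32_t)(b0 & 0x1F) << 6) | (uint32_t)(b1 & 0x3F);
}

/* Hypothetical helpers: split a 16-bit table value into its two bytes,
 * assuming the value packs the first caption byte above the second. */
static uint8_t cc_high_byte(uint16_t value) { return (uint8_t)(value >> 8); }
static uint8_t cc_low_byte(uint16_t value) { return (uint8_t)(value & 0xFF); }

/* Print both views of the same character side by side. */
static void show_mapping(void)
{
    uint32_t cp = utf8_decode_2byte(0xC3, 0x83);
    printf("UTF-8 C3 83 -> code point %u (0x%X)\n", (unsigned)cp, (unsigned)cp);
    printf("table value 0x1320 -> bytes 0x%02X 0x%02X\n",
           (unsigned)cc_high_byte(0x1320), (unsigned)cc_low_byte(0x1320));
}
```

If this reading is right, the code point (195) and the recorded value (0x1320) are unrelated numbers, so no arithmetic on the UTF-8 bytes would turn one into the other. The mapping would have to come from the EIA-608 character tables.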

@danrossi

I wish this were documented better. What do these values mean? Knowing that would help with adding more UTF-8 charset mappings. It's vague what the mapping values represent when adding CC data. EIA-608 supports Japanese characters, so I'm trying to add those first, but I can't figure out what the value is supposed to be. None of the other values decode or encode to anything.

 "\xE3\x81\x81" { /*HIRAGANA_LETTER_SMALL_A*/ return 0x12353; }

static const uint8_t eia608_parity_table[] = { EIA608_B1(0), EIA608_B1(64) };
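
For context on the parity table above: EIA-608 transmits 7-bit values with an odd-parity bit in bit 7. The sketch below assumes the table simply holds the odd-parity form of each 7-bit value. That is my reading, not something the quoted line confirms, and `odd_parity_byte` is a hypothetical name that does not exist in libcaption.

```c
#include <stdint.h>

/* Hypothetical helper: return the 7-bit value with bit 7 set so that the
 * total number of 1 bits in the byte is odd (odd parity). */
static uint8_t odd_parity_byte(uint8_t c)
{
    uint8_t v = (uint8_t)(c & 0x7F);
    int ones = 0;

    /* Count the 1 bits in the low seven bits. */
    for (uint8_t bits = v; bits != 0; bits >>= 1) {
        ones += bits & 1;
    }

    /* An even count needs the parity bit set to make the total odd. */
    return (ones % 2 == 0) ? (uint8_t)(v | 0x80) : v;
}
```

Under this assumption, 0x13 and 0x20 each already have an odd number of 1 bits and are sent unchanged, while 0x00 becomes 0x80.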
