adding more charsets #72

danrossi · 2024-09-10T11:44:32Z

I'm trying to figure out what these values are encoded in. As converting from utf8 to an int value is not working in these methods. I can't seem to calculate these values back to anything and is not documented what they are encoded in.

ie

\xc3\x83

is converted to and in of 195

0x195

But is recorded as

0x1320

https://github.com/szatmary/libcaption/blob/develop/src/eia608_from_utf8.re2c#L60

The text was updated successfully, but these errors were encountered:

danrossi · 2024-09-13T18:27:18Z

I wish this was documented better. What does this mean to help figuring out adding more utf8 charset mappings. Its vague what those mapping values represent when adding in cc data. eia608 supports japanese characters so first trying to add this but can't figure out what value it's supposed to be. None of the other values decode/encode to anything.

 "\xE3\x81\x81" { /*HIRAGANA_LETTER_SMALL_A*/ return 0x12353; }

libcaption/caption/eia608.h

Line 39 in e8b6261

static const uint8_t eia608_parity_table[] = { EIA608_B1(0), EIA608_B1(64) };

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adding more charsets #72

adding more charsets #72

danrossi commented Sep 10, 2024

danrossi commented Sep 13, 2024

adding more charsets #72

adding more charsets #72

Comments

danrossi commented Sep 10, 2024

danrossi commented Sep 13, 2024