Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What suffix is used for Burmese list markers? #50

Open
r12a opened this issue Mar 10, 2021 · 13 comments
Open

What suffix is used for Burmese list markers? #50

r12a opened this issue Mar 10, 2021 · 13 comments
Labels
question Further information is requested s:mymr

Comments

@r12a
Copy link
Contributor

r12a commented Mar 10, 2021

Currently browsers use a period for the numeric myanmar style.

၁.
၂.
၃.
၄.

I received a query from a browser implementer that suggested that [U+104B MYANMAR SIGN SECTION] might be more appropriate.

၁။
၂။
၃။
၄။

Can anyone confirm whether:
a. this usage is attested?
b. this usage is preferred over the period?
c. there are other suffixes that are used?

Thanks. cc @ohbendy

@andjc
Copy link

andjc commented Mar 11, 2021

a. Yes it is attested, but normally i see it used in counters in headings or sections rather than ordered lists

b. Period is not used in print publications that I have observed. Section marker as a suffix is used in certain contexts. A few of us have implemented CSS and JS solutions.

c. The most common affix set are parentheses:

(၁)
(၂)
(၃)
(၄)

This is what is normally used.

Additionally, consonant markers and doubled consonant markers, are used in Burmese, and common in print publications.

When I get home I will post the css rules we are currently using for Myanmar ordered lists and unordered lists.

For unordered lists, bullets are rarely used (except on the web), it is more likely to be a dash/hyphen. My personal preference, and that of a couple of Myanmar developers I collaborate with is to use en-dash for this, but not sure if there is a hard and fast rule.

@andjc
Copy link

andjc commented Mar 11, 2021

@counter-style myanmar-paren {
    system: extends myanmar;
    prefix: \0028;
    suffix: \0029\0020;
    fallback: myanmar;
}

@counter-style myanmar-msm {
    system: extends myanmar;
    suffix: \104B\0020;
    fallback: myanmar;
}

@counter-style burmese-consonant {
    system: alphabetic;
    symbols: '\1000' '\1001' '\1002' '\1003' '\1004' '\1005' '\1006' '\1007' '\1008' '\100A' '\100B' '\100C' '\100D' '\100E' '\100F' '\1010' '\1011' '\1012' '\1013' '\1014' '\1015' '\1016' '\1017' '\1018' '\1019' '\101A' '\101B' '\101C' '\101D' '\101E' '\101F' '\10020' '\1021';
    /* symbols: က ခ ဂ ဃ င စ ဆ ဇ ဈ ည ဋ ဌ ဍ ဎ ဏ တ ထ ဒ ဓ န ပ ဖ ဗ ဘ မ ယ ရ လ ဝ သ ဟ ဠ အ; */
    prefix: \0028;
    suffix: \0029\0020;
}

@counter-style burmese-consonant-double {
    system: alphabetic;
    symbols: '\1000 \1000' '\1001 \1001' '\1002 \1002' '\1003 \1003' '\1004 \1004' '\1005 \1005' '\1006 \1006' '\1007 \1007' '\1008 \1008' '\100A \100A' '\100B \100B' '\100C \100C' '\100D \100D' '\100E \100E' '\100F \100F' '\1010 \1010' '\1011 \1011' '\1012 \1012' '\1013 \1013' '\1014 \1014' '\1015 \1015' '\1016 \1016' '\1017 \1017' '\1018 \1018' '\1019 \1019' '\101A \101A' '\101B \101B' '\101C \101C' '\101D \101D' '\101E \101E' '\101F \101F' '\1020 \1020' '\1021 \1021';
    /* symbols: ကက ခခ ဂဂ ဃဃ ငင စစ ဆဆ ဇဇ ဈဈ ညည ဋဋ ဌဌ ဍဍ ဎဎ ဏဏ တတ ထထ ဒဒ ဓဓ နန ပပ ဖဖ ဗဗ ဘဘ မမ ယယ ရရ လလ ဝဝ သသ ဟဟ ဠဠ အအ; */
    prefix: \0028;
    suffix: \0029\0020;
}

@counter-style endash {
    system: cyclic;
    symbols: \2013;
    /* – */
    suffix: " ";
}

It is best if browsers continue to support Myanmar lists as is, changing it would impact existing solutions and workarounds that are out there. Once blink has its act together with @counter-style life will be easier. Especially since you also need to take into count the shan counters as well, and all the various language specofoc alphabetic/consonant counters.

@ohbendy
Copy link

ohbendy commented Mar 11, 2021

Agree with Andrew. I just checked some books here; one uses the ။ after the numbers of sections in a contents list, and three use the (၁) format for numbered lists. I also saw -၁- for page numbers, but that might not be widespread.

See also the second and third images in this comment: #11 (comment) for how the doubled consonants are nested in the hierarchy.

@r12a
Copy link
Contributor Author

r12a commented Mar 11, 2021

This is all very helpful. Thanks!

@andjc
Copy link

andjc commented Mar 11, 2021

For other Myanmar script languages, our internal CSS frameworks include:

@counter-style shan-paren {
    system: extends shan;
    prefix: \0028;
    suffix: \0029\0020;
    fallback: myanmar;
}

@counter-style mon-consonant {
    system: alphabetic;
    symbols: '\1000''\1001''\1002''\1003''\105A''\1005''\1006''\1007''\105B''\1009''\100A''\100B''\100C''\100D''\100E''\100F''\1010''\1011''\1012''\1013''\1014''\1015''\1016''\1017''\1018''\1019''\101A''\101B''\101C''\101D''\101E''\101F''\1020''\105C''\1021''\105D';
    /* symbols: က ခ ဂ ဃ ၚ စ ဆ ဇ ၛ ဉ ည ဋ ဌ ဍ ဎ ဏ တ ထ ဒ ဓ န ပ ဖ ဗ ဘ မ ယ ရ လ ဝ သ ဟ ဠ ၜ အ ၝ; */
    prefix: \0028;
    suffix: \0029\0020;
}

@counter-style sgaw-consonant {
    system: alphabetic;
    symbols: '\1000' '\1001' '\1002' '\1003' '\1004' '\1005' '\1006' '\1061' '\100A' '\1010' '\1011' '\1012' '\1014' '\1015' '\1016' '\1018' '\1019' '\101A' '\101B' '\101C' '\101D' '\101E' '\101F' '\1021' '\1027';
    /* symbols: က ခ ဂ ဃ င စ ဆ ၡ ည တ ထ ဒ န ပ ဖ ဘ မ ယ ရ လ ဝ သ ဟ အ ဧ */
    prefix: \0028;
    suffix: \0029\0020;
}

@counter-style westerm-pwo-consonant {
    system: alphabetic;
    symbols: \1000 '\1001 '\1002 '\100E '\1004 '\1005 '\1006 '\1007 '\100A '\1061 '\1010 '\1011 '\1012 '\1014 '\1015 '\1016 '\1018 '\1019 '\101A '\101B '\101C '\101D '\1065 '\101F '\1021 '\1027 '\1066;
    /* symbols: က ခ ဂ ဎ င စ ဆ ဇ ည ၡ တ ထ ဒ န ပ ဖ ဘ မ ယ ရ လ ဝ ၥ ဟ အ ဧ ၦ  */
    prefix: \0028;
    suffix: \0029\0020;
}

@counter-style eastern-pwo-consonant {
    system: alphabetic;
    symbols:  '\1000' '\1001' '\1002' '\1003' '\1004' '\1005' '\1006' '\1007' '\1008' '\100A' '\100B' '\100C' '\100D' '\100E' '\106E' '\1010' '\1011' '\1012' '\1013' '\1014' '\1015' '\1016' '\1017' '\1018' '\1019' '\101A' '\101B' '\101C' '\101D' '\101E' '\101F' '\1020' '\1021' '\105C' '\106F' '\1070';
    /* symbols: က ခ ဂ ဃ င စ ဆ ဇ ဈ ည ဋ ဌ ဍ ဎ ၮ တ ထ ဒ ဓ န ပ ဖ ဗ ဘ မ ယ ရ လ ဝ သ ဟ ဠ အ ၜ ၯ ၰ   */
    prefix: \0028;
    suffix: \0029\0020;
}

Although this list in far from complete.

@r12a
Copy link
Contributor Author

r12a commented Mar 11, 2021

Thanks for that, @andjc. I'll add those to the ready-made counter styles doc, with the other info about Burmese.

Before that, i just need to check that these are sequences that you've seen used in the wild, rather than just lists of characters arranged per the alphabet. We only add counter styles to the doc that are used. (I'm guessing that we're ok here, but i need to check each time.)

@ohbendy
Copy link

ohbendy commented Mar 11, 2021

In the @counter-styles, westerm -> western

@r12a
Copy link
Contributor Author

r12a commented Mar 12, 2021

@andjc @ohbendy would you be able to send scans of these styles in the wild (like w3c/predefined-counter-styles#27) ? It would be good to have those, and also to add them to our Type Samples repo. We already have #11 (comment) of course, so i'm mostly keen on the other stuff.

@r12a
Copy link
Contributor Author

r12a commented Mar 12, 2021

Pull request at w3c/predefined-counter-styles#33

@andjc
Copy link

andjc commented Mar 13, 2021

@r12a can you hold off on merging the PR while I obtain examples and confirm counters?

Communications are difficult at moment because of the coup. So may take a bit of time.

@r12a
Copy link
Contributor Author

r12a commented Mar 15, 2021

ok

@ohbendy
Copy link

ohbendy commented May 31, 2021

A couple more examples.

Here's Tai Khamti:

Screenshot 2021-05-31 at 16 12 21
Screenshot 2021-05-31 at 16 14 21

Top level is alphabetic, enclosed in parentheses. Second level uses Shan numerals with a double danda suffix.

Here's Eastern Pwo Karen:

Screenshot 2021-05-31 at 16 18 35

DIgits are suffixed with the double danda again. I'm submitting a proposal for these numerals soon.

@r12a
Copy link
Contributor Author

r12a commented Aug 3, 2021

A brief update.

First, the important news that Blink browsers now support custom counter styles, which means that it works in Chrome and Edge and Android browsers as well as Firefox. Webkit and iOS browsers still need to enable it, however.

https://w3c.github.io/predefined-counter-styles/ has been republished (/TR version at https://www.w3.org/TR/predefined-counter-styles/), and the approach changed so that separate styles are not created each time a prefix/suffix is different. This should make things much better, going forward. Information about prefixes/suffixes is still included, however. In https://w3c.github.io/predefined-counter-styles/#myanmar-styles i changed the default prefix/suffix to parens, and there's a note about using section sign (or others, in future) instead.

Haven't yet added any of the other language styles.

Any improvement yet on chances to get the examples and confirmations mentioned in #50 (comment) ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested s:mymr
Projects
None yet
Development

No branches or pull requests

3 participants