Hello!
When we get a language detection result, it contains only the main language and no information about regional variants. It would be much better if the result carried more information, such as an ISO 3166-1 code. For example, “豪华套间” and "豪華套間" are both Chinese and have the same meaning. “豪华套间” is Simplified Chinese (mainland China, ISO 3166-1 code CN), while "豪華套間" is Traditional Chinese (Hong Kong, ISO 3166-1 code HK).
enum class LanguageWithArea(
    val isoCode3166_1: String,
    val language: Language
) {
    ... ...
}
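To make the idea more concrete, here is a minimal sketch of how such a type might be filled in. Everything in it is an assumption for illustration only: the `Language` enum is a hypothetical stand-in with a single entry rather than the library's real class, and the entry names `CHINESE_SIMPLIFIED` and `CHINESE_TRADITIONAL` as well as the `variantsOf` helper are made up. It does not show how detection itself would tell the two variants apart.

```kotlin
// Hypothetical stand-in for the library's language enum (assumption:
// only the single entry needed for this example is declared).
enum class Language { CHINESE }

// Sketch of the proposed type: pairs a language with an
// ISO 3166-1 alpha-2 region code. Entry names are illustrative.
enum class LanguageWithArea(
    val isoCode3166_1: String,
    val language: Language
) {
    CHINESE_SIMPLIFIED("CN", Language.CHINESE),
    CHINESE_TRADITIONAL("HK", Language.CHINESE);

    companion object {
        // Hypothetical helper: all regional variants declared for a language.
        fun variantsOf(language: Language): List<LanguageWithArea> =
            values().filter { it.language == language }
    }
}

fun main() {
    // List every regional variant known for CHINESE with its region code.
    for (variant in LanguageWithArea.variantsOf(Language.CHINESE)) {
        println("${variant.name}: ${variant.isoCode3166_1}")
    }
}
```

With a shape like this, existing callers could keep reading the main language through `language`, and only callers that care about the region would look at `isoCode3166_1`.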
@Yunin I haven't differentiated between Simplified Chinese and Traditional Chinese so far. The reason is that I could not find proper text corpora written in only one of the two variants. That's why I used a mixed corpus instead and only added CHINESE as a language without any further differentiation.
I might work on this in the future, but I cannot yet tell you exactly when. That's why I will close this issue for now.