Skip to content

Commit

Permalink
Add  (U+F0B7) to nob
Browse files Browse the repository at this point in the history
Closes #2
  • Loading branch information
albbas committed Jul 19, 2021
1 parent 8cba7a4 commit ad27302
Show file tree
Hide file tree
Showing 2 changed files with 4 additions and 4 deletions.
4 changes: 2 additions & 2 deletions tools/tokenisers/tokeniser-disamb-gt-desc.pmscript
Original file line number Diff line number Diff line change
Expand Up @@ -62,7 +62,7 @@ Define blank Whitespace | incondform
| { } ! U+200A Hair Space  
| {​} ! U+200B Zero-Width Space ​
| {‌} ! U+200C Zero Width Non-Joiner ‌
| {‍} ! U+200D Zero Width Joiner ‍
| {‍} ! U+200D Zero Width Joiner ‍
! | {} ! U+200E Left-To-Right Mark ‎ ! Causes syntax error in pmatch
! | {} ! U+200F Right-To-Left Mark ‏ ! Causes syntax error in pmatch
! | {} ! U+2028 Line separator 
 ! Causes syntax error in pmatch
Expand Down Expand Up @@ -110,7 +110,7 @@ Define alphabet "a-z"
|{š}|{ž}|{č}|{đ}|{í}|{ŋ}|{ň}|{ŧ}|{ň}
|{Š}|{Ž}|{Č}|{Đ}|{Í}|{Ŋ}|{Ň}|{Ŧ}|{Ň}
| "0-9"
|{_}|{§}|{°}|{†}
|{_}|{§}|{°}|{†}|{}
! Combining diacritics as individual symbols,
! to be able to analyse unknown words with
! decomposed diacritics. All combining diacritics
Expand Down
4 changes: 2 additions & 2 deletions tools/tokenisers/tokeniser-gramcheck-gt-desc.pmscript
Original file line number Diff line number Diff line change
Expand Up @@ -62,7 +62,7 @@ Define blank Whitespace | incondform
| { } ! U+200A Hair Space  
| {​} ! U+200B Zero-Width Space ​
| {‌} ! U+200C Zero Width Non-Joiner ‌
| {‍} ! U+200D Zero Width Joiner ‍
| {‍} ! U+200D Zero Width Joiner ‍
! | {} ! U+200E Left-To-Right Mark ‎ ! Causes syntax error in pmatch
! | {} ! U+200F Right-To-Left Mark ‏ ! Causes syntax error in pmatch
! | {} ! U+2028 Line separator 
 ! Causes syntax error in pmatch
Expand Down Expand Up @@ -110,7 +110,7 @@ Define alphabet "a-z"
|{š}|{ž}|{č}|{đ}|{í}|{ŋ}|{ň}|{ŧ}|{ň}
|{Š}|{Ž}|{Č}|{Đ}|{Í}|{Ŋ}|{Ň}|{Ŧ}|{Ň}
| "0-9"
|{_}|{§}|{°}|{†}
|{_}|{§}|{°}|{†}|{}
! Combining diacritics as individual symbols,
! to be able to analyse unknown words with
! decomposed diacritics. All combining diacritics
Expand Down

0 comments on commit ad27302

Please sign in to comment.