Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

调用CharTabel,把“幺”改为“么”不合理 #1427

Closed
1 task done
tiandiweizun opened this issue Feb 18, 2020 · 1 comment
Closed
1 task done

调用CharTabel,把“幺”改为“么”不合理 #1427

tiandiweizun opened this issue Feb 18, 2020 · 1 comment
Assignees
Labels

Comments

@tiandiweizun
Copy link

tiandiweizun commented Feb 18, 2020

Describe the bug
A clear and concise description of what the bug is.

调用CharTabel,把“幺”改为“么”不合理,虽然 么 也有1的意思,且也有发音为yao的,但是么通常不代表幺的意思,且幺字已经是正则化的,没有必要进一步改变,需要去掉CharTable.txt里面幺=么

Code to reproduce the issue
Provide a reproducible test case that is the bare minimum necessary to generate the problem.

 System.out.println(CharTable.convert("幺妹的手机号码是幺三二开头的"));

Describe the current behavior
A clear and concise description of what happened.

么妹的手机号码是么三二开头的

Expected behavior
A clear and concise description of what you expected to happen.

幺妹的手机号码是幺三二开头的

System information

  • OS Platform and Distribution (e.g., Linux Ubuntu 16.04): win10
  • Python version: java
  • HanLP version: 1.7.6

Other info / logs
Include any logs or source code that would be helpful to diagnose the problem. If including tracebacks, please include the full traceback. Large logs and files should be attached.

  • I've completed this form and searched the web for solutions.
hankcs added a commit that referenced this issue Feb 18, 2020
@hankcs
Copy link
Owner

hankcs commented Feb 18, 2020

感谢反馈,已经修复,请参考上面的commit。
如果还有问题,欢迎重开issue。

@hankcs hankcs closed this as completed Feb 18, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants