-
Notifications
You must be signed in to change notification settings - Fork 98
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
训练好后测试显示乱码 #2
Comments
我知道了,在extract_conv.py里open时应该加一个encoding:'utf-8' |
因为windows默认编码不是utf-8,其他文件都是 所以windows默认会有点问题 |
好吧 可能是我先下到windows再传到ubuntu的 也不行 |
可能你在Windows下打开编辑过,再保存会改编码的。我也是下到windows再传到ubuntu解压缩的,执行demo没问题。 |
Open
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
我是在windows下跑的,跑完后测试时的样例句子显示:
鐣 鍗 鍚 渚
然后我encode为gbk又显示[b'\xe7\x95', b'\xe5\x8d', b'\xe5\x90', b'\xe4\xbe']
最后我在linux环境下测试,同样显示:鐣 鍗 鍚 渚
求问作者的训练环境和测试环境(不会是因为不该在windows下训练吧。。。)
The text was updated successfully, but these errors were encountered: