Cannot recognize image-based PDFs #1319

Closed
hanmostudy opened this issue Dec 18, 2024 · 7 comments
Labels
enhancement New feature or request

Comments

@hanmostudy

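I've found that the current model recognizes ordinary PDFs quite well, but if I convert the PDF into an image-based PDF, nothing is recognized at all.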

@hanmostudy hanmostudy added the enhancement New feature or request label Dec 18, 2024
@myhloli
Collaborator

myhloli commented Dec 18, 2024

Could you upload the image-based PDF?

@hanmostudy
Author

Sure, I'll upload it:
拉米斯教材范式核心特征.pdf

@myhloli
Collaborator

myhloli commented Dec 19, 2024

I tried it and it works fine on my side. You can test it yourself on the Hugging Face and ModelScope demos.

@hanmostudy
Author

OK, I'll try again then. Thanks.

@josenhadoop

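This image-only PDF works fine on both the Hugging Face and ModelScope online demos. With a local deployment, however, the run finishes without any error, yet the output files contain no generated content.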
image

@josenhadoop

image
After the run finishes, the output folder is empty.

@myhloli
Collaborator

myhloli commented Dec 26, 2024

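Running PaddleOCR on CPU has a memory leak. Please keep an eye on memory usage while it runs: if the process is killed partway through parsing because memory is exhausted, no results are produced.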
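
One way to check whether a silent, empty-output run was actually killed for exhausting memory is to launch the parse from a small wrapper and inspect how the child process exited. The following is a minimal sketch, not part of the project: it assumes a POSIX system, the helper names `describe_exit` and `run_and_diagnose` are hypothetical, and the command passed in is whatever you normally use to start the local parse. A `SIGKILL` exit is only a hint that the kernel OOM killer may have been involved, not proof.

```python
import resource
import signal
import subprocess
import sys


def describe_exit(returncode: int) -> str:
    """Turn a child's return code into a short diagnosis (hypothetical helper)."""
    if returncode == 0:
        return "exited normally"
    if returncode < 0:
        # subprocess reports "terminated by signal N" as -N on POSIX.
        sig = -returncode
    elif returncode > 128:
        # Shell convention: 128 + N when a command run via a shell dies from signal N.
        sig = returncode - 128
    else:
        return f"failed with exit code {returncode}"
    try:
        name = signal.Signals(sig).name
    except ValueError:
        name = f"signal {sig}"
    hint = " (possibly the kernel OOM killer)" if sig == signal.SIGKILL else ""
    return f"killed by {name}{hint}"


def run_and_diagnose(cmd):
    """Run cmd, then return (diagnosis, peak RSS of waited-for children)."""
    completed = subprocess.run(cmd)
    # ru_maxrss is in kilobytes on Linux and in bytes on macOS.
    peak = resource.getrusage(resource.RUSAGE_CHILDREN).ru_maxrss
    return describe_exit(completed.returncode), peak


if __name__ == "__main__":
    # Pass your own parse command, e.g. `python wrapper.py <command> <args...>`.
    status, peak = run_and_diagnose(sys.argv[1:] or [sys.executable, "-c", "pass"])
    print(f"child {status}; peak RSS reported by the OS: {peak}")
```

If the wrapper reports `killed by SIGKILL`, the empty output folder is consistent with the process having been terminated mid-parse rather than finishing successfully.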

@myhloli myhloli closed this as completed Dec 26, 2024
3 participants