update readme #945

5 changes: 3 additions & 2 deletions README.md
@@ -37,11 +37,12 @@
**🔥 Live Course on January 7, 2025: New PaddlePaddle PP Series Models!**

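- ✨PP-DocBee: A New 'Bee'-llwether for Document Image Understanding!
To help you quickly and thoroughly understand **PaddleMIX**'s **PP-DocBee document understanding model** and master hands-on skills, a Baidu senior R&D engineer will explain PP-DocBee's core technology in detail and demonstrate the full multimodal large model development workflow step by step at **19:00 on Tuesday, January 7**. Scan the QR code on the poster below to register now!
🎉PaddleMIX introduces PP-DocBee, a lightweight multimodal document understanding model! Built on a multimodal large model, it performs end-to-end document image understanding and tackles the industry's hard problem of parsing complex charts and documents. It uses a ViT+MLP+LLM architecture and optimizes the data synthesis strategy, data preprocessing, training method, and OCR post-processing assistance. Strategies such as combining a small OCR model with a large LLM and producing image data with a rendering engine yield higher-quality Q&A at a controllable generation cost. It supports local gradio deployment and OpenAI service deployment, and offers an online environment in the PaddlePaddle Galaxy community for a quick trial. Join the live session at **19:00 on Tuesday, January 7** for a detailed explanation of PP-DocBee's core technology and industry applications. 🚀Registration link: https://www.wjx.top/vm/mlDdpSb.aspx?udsid=309483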

<details>
<summary>Click to expand event poster</summary>
<p align="center">
<img src='https://github.com/user-attachments/assets/3b7adc9e-c68d-44d1-9674-05b933947deb' width="80%">
<img src='https://github.com/user-attachments/assets/5836c9df-4ea6-421b-acef-89f928e0763e' width="80%">
</p>
</details>

11 changes: 4 additions & 7 deletions README_EN.md
@@ -34,18 +34,15 @@


## 📰 News
**🔥 Live Course on January 7th, 2025 - New PaddlePaddle PP Series Models!**

**🔥PaddleMIX Development Project Challenge (November 21 - December 22, 2024)**
- ✨PP-DocBee: A New Benchmark for Document Image Understanding!
🎉PaddleMIX introduces PP-DocBee, a lightweight multimodal document understanding model! Built on a multimodal large model, it achieves end-to-end document image understanding and addresses complex chart and document parsing challenges. It uses a ViT+MLP+LLM architecture and optimizes the data synthesis strategy, data preprocessing, training method, and OCR post-processing assistance. By combining a small OCR model with a large LLM and generating image data with a rendering engine, it achieves higher-quality Q&A at a controllable generation cost. It supports local Gradio deployment and OpenAI service deployment, and provides quick access through the PaddlePaddle Galaxy community online environment. **Join us on Tuesday, January 7th at 19:00** for a detailed explanation of PP-DocBee's core technology and industry applications. 🚀Registration link: https://www.wjx.top/vm/mlDdpSb.aspx?udsid=309483


**🔥Live Course on January 7th, 2025: New PaddlePaddle PP Series Models Released!**

- ✨PP-DocBee: A New 'Bee'-ginning in Document Image Understanding!
To help you quickly and deeply understand **PaddleMIX**'s **PP-DocBee document understanding model** and master practical skills, Baidu's senior R&D engineers will provide a detailed explanation of PP-DocBee's core technology and demonstrate the complete development process of multimodal large models at **19:00 on January 7th (Tuesday)**. Scan the QR code below to register now!
<details>
<summary>Click to expand event poster</summary>
<p align="center">
<img src='https://github.com/user-attachments/assets/3b7adc9e-c68d-44d1-9674-05b933947deb' width="80%">
<img src='https://github.com/user-attachments/assets/5836c9df-4ea6-421b-acef-89f928e0763e' width="80%">
</p>
</details>
