Question about long-context capability #112

Open
Harris-Xie opened this issue Jan 11, 2025 · 2 comments

Comments

@Harris-Xie

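This is a great project, and I have a couple of questions.
Here is the link to the relevant passage:
https://github.com/jingyaogong/minimind/blob/master/README.md#:~:text=%E5%9C%A8%E8%AE%AD%E7%BB%83%E6%97%B6,%E6%95%B0%E4%B8%BA6%E3%80%82
Is there any part of the code that changes the RoPE linear interpolation?
Will there be a tutorial on long-context handling later on?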

@jingyaogong
Owner

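Later this week I will post an update based on the 104M version of the MiniMind model:
[analysis + long-context extrapolation experiments]
I will @ you when it is up 😊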

@Harris-Xie
Author

Harris-Xie commented Jan 13, 2025 via email

