Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
main.ipynb		main.ipynb

README.md

Modified RAG: Parent Document & Bigger chunk Retriever

To get around the problem of larger size of Parent document, what you can do right now is to make bigger chunks along with smaller ones. For example, if your smaller chunks are of 512 tokens and your Parent Documents are of 2048 tokens on average, you can make chunks of size 1024. Now during retrieval, it’ll match as the previous one above BUT this time, instead of parent document, it’ll fetch the Bigger chunk and pass it to LLM. this way you’ll lose some text for sure but not completely. You could use use 2 verses instead of original 4 to make the model understand the writing style, context etc etc that too being within the limits. Good thing, you just have to change 1 line from the previous one.

Colab walkthrough -

Read full blog

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

parent_document_retriever

parent_document_retriever

README.md

Modified RAG: Parent Document & Bigger chunk Retriever

Files

parent_document_retriever

Directory actions

More options

Directory actions

More options

Latest commit

History

parent_document_retriever

Folders and files

parent directory

README.md

Modified RAG: Parent Document & Bigger chunk Retriever