GraphRAG Example

An experiment comparing "regular" RAG (hereafter referred to as the baseline) with GraphRAG.

Experiment

I took my favourite plasma physics paper, by Alex Schekochihin, extracted the PDF into a single text file using the code in data_extraction.py, and then indexed that text in two different ways:

  1. Baseline RAG -- chunking the text, calculating embeddings and storing them in Pinecone
  2. GraphRAG -- building the graph index using GraphRAG's built-in indexing pipeline
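
The chunking step of the baseline approach can be sketched roughly as follows. This is a minimal illustration, not the code used in this repo: the function name `chunk_text`, the character-based splitting, and the default chunk size and overlap are all assumptions, and the embedding and Pinecone storage steps are left out.

```python
def chunk_text(text: str, chunk_size: int = 1000, overlap: int = 200) -> list[str]:
    """Split text into fixed-size character chunks that overlap their neighbours.

    chunk_size and overlap are illustrative defaults, not values from the experiment.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        # Stop once the current window has reached the end of the text,
        # so no trailing chunk is fully contained in the previous one.
        if start + chunk_size >= len(text):
            break
    return chunks
```

The overlap means a sentence that straddles a chunk boundary still appears whole in at least one chunk, at the cost of storing some text twice.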

I then ran queries against both indexes.
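
For the baseline index, answering a query typically means embedding the question and retrieving the stored chunks whose embeddings are closest to it. The sketch below shows only that ranking step, using cosine similarity over plain Python lists. The names `cosine_similarity` and `top_k`, the placeholder chunk labels, and the hand-written three-dimensional vectors are hypothetical; in the experiment the vectors come from an embedding model and the lookup is presumably handled by Pinecone, whose similarity metric may differ.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec: list[float], index: list[tuple[str, list[float]]], k: int = 3):
    """Return the k (score, chunk) pairs most similar to query_vec, best first."""
    scored = [(cosine_similarity(query_vec, vec), chunk) for chunk, vec in index]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Toy in-memory "index": (chunk label, embedding) pairs with made-up vectors.
index = [
    ("chunk A", [1.0, 0.0, 0.0]),
    ("chunk B", [0.0, 1.0, 0.0]),
    ("chunk C", [1.0, 1.0, 0.0]),
]

# Made-up query embedding that points mostly along the first axis.
results = top_k([1.0, 0.1, 0.0], index, k=2)
for score, chunk in results:
    print(f"{score:.3f}  {chunk}")
```

The retrieved chunks are then passed to the language model as context for the answer. GraphRAG takes a different route, querying the graph built by its indexing pipeline rather than ranking raw chunks.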

Results

The query used was "What are the main themes of this article?" The results were:

Baseline

GraphRAG