Sample Data

This folder contains example data intended to demonstrate the format of the input files and the outputs generated by the Retrieval-Augmented Generation experiments and evaluation framework.

Disclaimer

The data provided here is a toy example and not the real data used during the RAG experiments. It is only intended to showcase the required input formats and the outputs generated by the RAG framework. As such, this data will not lead to good results and is purely for demonstrative purposes.

Evaluation Results File: Contains the evaluation outcomes of a RAG pipeline. Each result includes:
- Grades for Each Metric: A binary score indicating whether the respective metric was met.
- Reasons for Grades: An explanation for each given grade, providing insights into the evaluation process.

6. `rag_outputs/`

This folder holds:

Example Output of a RAG System: A sample file demonstrating the typical output produced by the RAG system during experiments.

7. `text_summaries/`

This folder contains:

Texts and Their Summaries: An example file with original texts alongside summaries generated by a Large Language Model (LLM).

8. `vec_and_doc_stores/`

This folder includes:

Sample Vector Stores: Two sample vector stores containing embedded texts and images/image summaries that are used for retrieval.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data_readme.md

data_readme.md

Sample Data

Disclaimer

Contents

1. `reference_qa.xlsx`

2. `extracteed_texts_and_imgs.parquet`

3. `image_summaries/`

4. `images/`

5. `rag_evaluation_results/`

6. `rag_outputs/`

7. `text_summaries/`

8. `vec_and_doc_stores/`

Files

data_readme.md

Latest commit

History

data_readme.md

File metadata and controls

Sample Data

Disclaimer

Contents

1. reference_qa.xlsx

2. extracteed_texts_and_imgs.parquet

3. image_summaries/

4. images/

5. rag_evaluation_results/

6. rag_outputs/

7. text_summaries/

8. vec_and_doc_stores/

1. `reference_qa.xlsx`

2. `extracteed_texts_and_imgs.parquet`

3. `image_summaries/`

4. `images/`

5. `rag_evaluation_results/`

6. `rag_outputs/`

7. `text_summaries/`

8. `vec_and_doc_stores/`