Skip to content

Releases: KruxAI/ragbuilder

v0.1.4

31 Dec 18:11
Compare
Choose a tag to compare

What's Changed

New SDK that allows for module-wise optimization.

Basic Usage:

from ragbuilder import RAGBuilder

# Initialize and optimize
builder = RAGBuilder.from_source_with_defaults(input_source='data.pdf')
results = builder.optimize()

# Run a query through the complete pipeline
response = results.invoke("What is HNSW?")

# View optimization summary
print(results.summary())

Advanced Configuration
For fine-grained control, you can customize every aspect:

from ragbuilder.config import (
    DataIngestOptionsConfig,
    RetrievalOptionsConfig,
    GenerationOptionsConfig
)

# Configure data ingestion
data_ingest_config = DataIngestOptionsConfig(
    input_source="data.pdf",
    document_loaders=[
        {"type": "pymupdf"},
        {"type": "unstructured"}
    ],
    chunking_strategies=[{
        "type": "RecursiveCharacterTextSplitter",
        "chunker_kwargs": {"separators": ["\n\n", "\n", " ", ""]}
    }],
    chunk_size={"min": 500, "max": 2000, "stepsize": 500},
    embedding_models=[{
        "type": "openai",
        "model_kwargs": {"model": "text-embedding-3-large"}
    }]
)

# Configure retrieval
retrieval_config = RetrievalOptionsConfig(
    retrievers=[
        {
            "type": "vector_similarity",
            "retriever_k": [20],
            "weight": 0.5
        },
        {
            "type": "bm25",
            "retriever_k": [20],
            "weight": 0.5
        }
    ],
    rerankers=[{
        "type": "BAAI/bge-reranker-base"
    }],
    top_k=[3, 5]
)

# Initialize with custom configs
builder = RAGBuilder(
    data_ingest_config=data_ingest_config,
    retrieval_config=retrieval_config
)

# Access individual components
vectorstore = results.data_ingest.get_vectorstore()
docs = results.retrieval.invoke("What is RAG?")
answer = results.generation.invoke("What is RAG?")

Full Changelog: 0.0.22...v0.1.4

0.0.22

25 Oct 02:37
095d454
Compare
Choose a tag to compare

What's Changed

  • Enhance DataProcessor Class for Error Handling, Efficiency, and Logging Improvements by @Mefisto04 in #72
  • Bug fixes by @aravind10x in #77

New Contributors

Full Changelog: 0.0.21...0.0.22

0.0.21

19 Oct 12:45
dc87cd5
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.0.20...0.0.21

0.0.20

15 Oct 08:46
1f760e9
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 0.0.18...0.0.20

0.0.18

06 Oct 05:49
f1ce7e4
Compare
Choose a tag to compare

What's Changed

Full Changelog: 0.0.17...0.0.18

0.0.17

28 Sep 12:20
c243a07
Compare
Choose a tag to compare

RAGBuilder v0.0.17 Release Notes

What's New:

  • Support for Re-Rankers:
    • ColBERTv2: colbertv2.0
    • Cohere: rerank-english-v3.0
    • Jina: jina-reranker-v1-base-en
    • Cross-Encoder Rerankers:
      • Mixedbread-ai/mxbai-rerank-base-v1
      • BAAI/bge-reranker-base
    • FlashRank: ms-marco-MiniLM-L-12-v2
    • RankLLM: GPT-4o
  • Data Processing:
    • Remove Stopwords, Strip Tags, Punctuation, Whitespaces, Stem Text
  • Hyperparameter Visualization:
    • Track Bayesian Optimization progress and visualize parameter importance.

What's Changed:

Full Changelog: 0.0.16...0.0.17

0.0.16

21 Sep 13:40
1d49d99
Compare
Choose a tag to compare

What new

 - 10% Sampling for Trials runs
 - Contextual Retriever from Anthropic
 - Optuna Intergration for more Efficient Hyperparameter tuning

What's Changed

  • Sampling @aravind10x in #45
  • Optuna Integration Hyperparameter tuning
  • Contextual retriever by @ashwinzyx in #44
  • Minor: View Result While Execution in Progress

Full Changelog: 0.0.15...0.0.16

0.0.15

13 Sep 17:54
9b5fb54
Compare
Choose a tag to compare

What new

 - Vanilla Graph RAG with Graph Retriever
 - Hybrid Graph RAG: Graph Retreiver + Vector Retreiver
 - Neo4J Integration
 - More Improved Ensemble Retriever

What's Changed

Full Changelog: 0.0.14...0.0.15

0.0.14

10 Sep 08:32
c99453a
Compare
Choose a tag to compare

What new

 - Top K fix for Contextual Compression
 - More Improved Ensemble Retriever

What's Changed

Full Changelog: 0.0.13...0.0.14

0.0.13

27 Aug 04:36
0053e6e
Compare
Choose a tag to compare

New SOTA Templates

 - HYDE
 - Hybrid RAG
 - Semantic Chunker
 - Stepback Prompting
 - Query Rewriting
 - RRF- Reciprocal Rank Fusion RAG

What's Changed

New Contributors

Full Changelog: https://github.com/KruxAI/ragbuilder/commits/0.0.13