The Chunking Subnet

Intelligent RAG for Intelligent Applications

Dashboard • VectorChat • Chunking.com • Toffee • Lucid

Introduction

Welcome to the Chunking subnet, Subnet 40 on the Bittensor Protocol! This subnet is designed to advance the field of Retrieval-Augmented Generation (RAG) by incentivizing the development and service of sophisticated chunking solutions. Specifically, the subnet aims to create, host, and serve an intelligent chunking solution that maximizes intrachunk similarity and interchunk dissimilarity.
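
To make the objective concrete, here is a minimal sketch of how "intrachunk similarity" and "interchunk dissimilarity" could be measured. It is not the subnet's actual reward function. It assumes each sentence has already been turned into an embedding vector, and the names `cosine`, `chunking_score`, `good`, and `bad` are hypothetical, as are the toy 2-D vectors standing in for real embeddings.

```python
import math
from itertools import combinations


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def mean(values):
    """Arithmetic mean of an iterable; 0.0 when it is empty."""
    values = list(values)
    return sum(values) / len(values) if values else 0.0


def chunking_score(chunks):
    """Illustrative score: mean similarity inside chunks minus mean similarity across chunks.

    `chunks` is a list of chunks; each chunk is a list of sentence embeddings.
    A higher score means sentences are close to their chunk-mates and far from
    sentences in other chunks.
    """
    intra = mean(
        cosine(a, b) for chunk in chunks for a, b in combinations(chunk, 2)
    )
    inter = mean(
        cosine(a, b)
        for c1, c2 in combinations(chunks, 2)
        for a in c1
        for b in c2
    )
    return intra - inter


# Toy embeddings (assumed): two sentences about topic X, two about topic Y.
x1, x2 = (1.0, 0.0), (0.9, 0.1)
y1, y2 = (0.0, 1.0), (0.1, 0.9)

good = [[x1, x2], [y1, y2]]  # each chunk holds one topic
bad = [[x1, y1], [x2, y2]]   # topics are mixed within each chunk

print(chunking_score(good) > chunking_score(bad))
```

Grouping related sentences together scores higher than mixing topics, which is the behaviour the subnet's stated goal rewards. A real scoring scheme would also have to account for factors such as chunk size limits and response time.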

Explore our subnet pitch deck.

Our article on why this is a valuable problem to solve: The Case for Intelligent Chunking

Learn more about our project at vectorchat.ai

See visualizations of subnet data at subnet.chunking.com

See how organic queries are handled here.

Overview

🛣️ Roadmap

Ethos

Models always stay on your machine and remain under your full ownership!

As mentioned in our pitch deck, chunking is an infinitely complex problem that can be approached from countless different avenues. Given sufficiently long, semantically meaningful text, there is no single correct answer, only "more" correct ones. Bittensor is an excellent way to tackle such a problem, as it incentivizes both innovation and fine-tuned optimization to find the most effective solution.

We do not open-source the models created, nor do we ever receive them. We believe this greatly increases the incentive for developing and/or providing the best solution, as miners retain full ownership of their work.

At the same time, we believe this increases the value brought to the Bittensor protocol, because access to the best chunking model requires continuously holding sufficient stake. Validators never receive the model itself, only the right to serve queries, so losing stake in the network also means losing access to any model produced by the subnet.

Getting Started

Helpful Resources

For those new to chunking or Retrieval-Augmented Generation (RAG), we strongly recommend you check out our articles here:

We also recommend these resources by Pinecone.io:


