Skip to content

Commit

Permalink
Merge pull request #122 from snexus/update-docs
Browse files Browse the repository at this point in the history
Update configuration template and installation docs
  • Loading branch information
snexus authored Oct 28, 2024
2 parents 772a937 + 124fc9e commit daac0ac
Show file tree
Hide file tree
Showing 2 changed files with 15 additions and 2 deletions.
9 changes: 8 additions & 1 deletion docs/installation.rst
Original file line number Diff line number Diff line change
Expand Up @@ -26,12 +26,16 @@ Install Latest Version
# Install packages using pip
pip install pyllmsearch
# Optional - install packages using uv
# Optional dependencues for Azure parser
pip install "pyllmsearch[azureparser]"
# Preferred method (much faster) - install packages using uv
pip install uv
uv pip install pyllmsearch
Install from source
===================

Expand All @@ -55,3 +59,6 @@ Install from source
# Install the package
pip install . # or `pip install -e .` for development
# For Azure parser, install with optional dependencies
pip install ."[azureparser]"
8 changes: 7 additions & 1 deletion sample_templates/generic/config_template.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -32,6 +32,7 @@ embeddings:
remove_images: True # Remove image links

# Optional setting
# For azuredoc support - pip install "pyllmsearch[azureparser]"
pdf_table_parser: gmft # azuredoc

# Optional setting
Expand Down Expand Up @@ -65,7 +66,12 @@ semantic_search:

# Will ensure that context provided to LLM is less than max_char_size. Useful for locally hosted models and limited hardware.
# Reduce if out of CUDA memory.
max_char_size: 4096
max_char_size: 16384 # Reduce if necessary for locally hosted LLMs

# Maximum number of text chunks to retrive for dense and sparse embeddings
# Total number of chunks is max_k * 2
max_k: 25

query_prefix: "query: " # Often queries have to be prefixed for embedding models, such as e5

hyde:
Expand Down

0 comments on commit daac0ac

Please sign in to comment.