[BFCL] Prompt Caching for Claude Models #751

VishnuSuresh27 · 2024-11-11T07:40:38Z

This PR adds prompt caching when running inference on Claude models. It should significantly reduce inference cost on BFCL's multi-turn datasets for the following models (in both Function Calling and Prompting modes):

Claude 3.5 Sonnet
Claude 3 Haiku
Claude 3 Opus

Summary of changes made:

Cached user messages
Cached system prompt (for Prompting mode)
Cached tools (for Function-Calling mode)
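For readers unfamiliar with how these three cache points are usually expressed, here is a minimal sketch of attaching Anthropic-style `cache_control` markers to a request payload. It is not the code from `claude.py`: the helper names (`cache_system_prompt`, `cache_tools`, `cache_last_user_message`, `build_request`) and the `multi_turn` flag are hypothetical, and the sketch only builds the request dictionary without calling the API. Depending on the SDK version, prompt caching may also need to be enabled through a beta option.

```python
import copy

# Marker shape used by Anthropic prompt caching ("ephemeral" cache type).
CACHE_CONTROL = {"type": "ephemeral"}


def cache_system_prompt(system_text):
    """Return the system prompt as a one-block list carrying a cache marker."""
    return [{"type": "text", "text": system_text,
             "cache_control": dict(CACHE_CONTROL)}]


def cache_tools(tools):
    """Mark the last tool so the tool list up to that point is cached."""
    tools = copy.deepcopy(tools)
    if tools:
        tools[-1]["cache_control"] = dict(CACHE_CONTROL)
    return tools


def cache_last_user_message(messages):
    """Place a single cache marker on the most recent user message."""
    messages = copy.deepcopy(messages)
    # Drop markers left over from earlier turns so they do not accumulate.
    for msg in messages:
        if isinstance(msg["content"], list):
            for block in msg["content"]:
                block.pop("cache_control", None)
    for msg in reversed(messages):
        if msg["role"] == "user":
            if isinstance(msg["content"], str):
                msg["content"] = [{"type": "text", "text": msg["content"]}]
            if msg["content"]:
                msg["content"][-1]["cache_control"] = dict(CACHE_CONTROL)
            break
    return messages


def build_request(messages, system_text=None, tools=None, multi_turn=True):
    """Assemble Messages API keyword arguments (hypothetical helper, no network call)."""
    request = {"messages": copy.deepcopy(messages)}
    if system_text is not None:
        request["system"] = system_text
    if tools:
        request["tools"] = copy.deepcopy(tools)
    if not multi_turn:
        # Single-turn: no later turn would read the cache, so add no markers.
        return request
    request["messages"] = cache_last_user_message(messages)
    if system_text is not None:
        request["system"] = cache_system_prompt(system_text)
    if tools:
        request["tools"] = cache_tools(tools)
    return request
```

In Prompting mode a caller would pass only `system_text`, and in Function-Calling mode only `tools`, so each request carries at most two markers in this sketch.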

Please note:

This implementation deliberately skips caching in single-turn cases, since there are no later turns that could benefit from cache reads.
According to the Anthropic guide, using prompt caching will not affect model accuracy.

Prompt caching has no effect on output token generation. The response you receive will be identical to what you would get if prompt caching was not used.
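The reasoning behind skipping single-turn cases can be made concrete with a rough cost sketch. The function below is illustrative only: `relative_prefix_cost` is a hypothetical name, and the multipliers (cache writes at about 1.25x and cache reads at about 0.10x the base input-token price) are assumptions based on Anthropic's published pricing for the default cache and should be checked against current pricing. It also assumes a fixed-size prefix and a cache that does not expire between turns, which simplifies real multi-turn runs where the prefix grows.

```python
def relative_prefix_cost(prefix_tokens, turns, use_cache,
                         write_mult=1.25, read_mult=0.10):
    """Cost of sending the same prefix on every turn, in base input-token units.

    write_mult and read_mult are assumed pricing multipliers, not values
    taken from the PR.
    """
    if turns < 1:
        raise ValueError("turns must be at least 1")
    if not use_cache:
        # Without caching, the full prefix is billed at the base rate each turn.
        return float(prefix_tokens * turns)
    # With caching, the first turn pays the write surcharge and
    # every later turn pays the discounted read rate.
    first_turn = prefix_tokens * write_mult
    later_turns = prefix_tokens * read_mult * (turns - 1)
    return first_turn + later_turns
```

Under these assumptions, a 1,000-token prefix costs about 1,250 units cached versus 1,000 uncached for a single turn, so caching only adds cost there. Over 5 turns the same prefix costs about 1,650 units cached versus 5,000 uncached.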

berkeley-function-call-leaderboard/bfcl/model_handler/proprietary_model/claude.py

HuanzhiMao

LGTM.
According to the Anthropic guide, using prompt caching will not affect model accuracy.

Prompt caching has no effect on output token generation. The response you receive will be identical to what you would get if prompt caching was not used.

VishnuSuresh27 and others added 2 commits November 10, 2024 23:28

Implement prompt caching for claude models

b6b5de3

Merge branch 'main' into claude_prompt_caching

b4b371d

HuanzhiMao added the BFCL-General General BFCL Issue label Nov 11, 2024

HuanzhiMao added 2 commits November 12, 2024 00:26

Merge branch 'main' into claude_prompt_caching

ff6acfd

Merge remote-tracking branch 'upstream/main' into pr/VishnuSuresh27/751

4ad8ff8

HuanzhiMao reviewed Nov 12, 2024

berkeley-function-call-leaderboard/bfcl/model_handler/proprietary_model/claude.py (outdated, resolved)

fix

ab215dc

HuanzhiMao approved these changes Nov 13, 2024

HuanzhiMao merged commit 5a42197 into ShishirPatil:main Nov 13, 2024

HuanzhiMao mentioned this pull request Nov 13, 2024

[BFCL] Add Prompt Caching for Claude Models #727

Closed

VishnuSuresh27 commented Nov 11, 2024 (edited by HuanzhiMao)
