Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Thinking agent #576

Open
wants to merge 15 commits into
base: main
Choose a base branch
from
Open

Thinking agent #576

wants to merge 15 commits into from

Conversation

JegernOUTT
Copy link
Member

No description provided.

@JegernOUTT JegernOUTT self-assigned this Jan 30, 2025
@JegernOUTT JegernOUTT requested a review from olegklimov January 31, 2025 09:35
…ities

- Add new THINKING_AGENT chat mode with dedicated system prompt and tools
- Introduce ReasoningEffort enum for controlling model's thinking depth
- Add reasoning_effort parameter to SamplingParameters for fine-tuning responses
- Rename tool_deep_thinking to tool_think with improved interface and prompts
- Update chat passthrough to support optional system prompt prepending
- Adjust tools filtering per chat mode to accommodate new thinking capabilities
- Fix subchat and related modules to handle new parameters
…2k for RAG, 16k for generation)

- Implementing smart token budget calculation for context prioritization
- Adding line number references to file context for better traceability
- Enforcing proper context gathering before processing
- Adding safeguards against premature think() calls
- Restructure agent prompt into clear sections with better formatting and headings
- Add conditional formatting for files context in think() tool to avoid empty section
- Streamline strategy steps with clear objectives and hierarchical structure
- Keep core functionality intact while making documentation more readable
…to improve clarity and maintainability of AI responses
- Update 'think' tool configuration to use o3-mini model with larger context window (128k)
- Pass subchat temperature parameter through deep thinking tool instead of hardcoded value
- Clarify comment about token budget handling in deep thinking
…ader

Only add Bearer authorization header when the API token starts with 'hf_' prefix, which is the expected format for Hugging Face tokens. This prevents invalid authorization attempts with empty or malformed tokens.
Previously, temperature was hardcoded to 0.0 and always required. This change:
- Removes the TEMPERATURE constant
- Makes temperature parameter truly optional in API calls
- Updates logging to handle None case for temperature
- Removes unused subchat_temperature from config
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant