-
Notifications
You must be signed in to change notification settings - Fork 429
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ability to Persist Messages in External Stores #530
Comments
I think this is something that could be supported in user code for the most part, though we'd be happy to consider support for loading externally stored message history. |
Thank you Sydney for sharing your thoughts. I appreciate it. I think it would be good to have a sort of interface or abstract parent class that users can implement or inherit from and extend to have their own persistent store (Redis, MongoDB, Postgres, MySQL etc) to cache these messages. This can then be injected in and used internally to cache the message history. This will help with checkpointing and caching in the event of errors or restarts and we dont lose all the messages in the history so far. I will also simplify things so that the user does not have to spend too much energy to figure out this persistence implementation. Those were my thoughts behind this suggestion. I am open to collaborating to bring this into the framework |
Hi @izzyacademy I agree we need this. I think we need an ABC, with implementations for:
and an example of a custom implementation. I would need very strong evidence that people want another specific database before we implemented it. I think it's best if I hav a crack at a first implementation of this, since I'll know how I want it to work. |
@samuelcolvin thanks for the update. I will wait for your initial draft and then I will share my feedback. After you start with the memory and SQL Lite, I can work on Redis and PostgreSQL. I believe there will be a strong need for Redis as the synchronous and async libraries have 45M and 800K downloads per month respectively. I will check back later on this. |
Hi @izzyacademy @samuelcolvin Even we are looking for similar type of implementation, Redis is preferred! for short-term memory, user-pref, user-details, etc. Crew AI has similar - https://docs.crewai.com/concepts/memory (short-term memory, long-term memory, entity memory, and contextual memory) I saw mem0, but it requires LLM, did not fit all our use-cases - https://github.com/mem0ai/mem0 |
@sandeep540 thanks for your comments. I agree. Redis is a great key-value store for this use case. |
I recently came across an interesting video by Adam Lucek about implementing different memory types that referenced two academic papers:
The types of memory that were implemented in that video are:
Could such memory types be considered for the API/ABC/implementation you are planning? TBH, I'm not entirely sure what you consider as suitable "memory" to be integrated into PydanticAI, but I'll share some additional references to "long-term" memory information/tools: Long-Term Memory Support in LangGraph |
Like the other external integrations, I think this could be [pydantic-ai-persistence] and we could integrate different data stores that implement the ABC. Here are my initial thoughts based on the agent or graph workflow interactions with the persistent store from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Any
@dataclass
class PersistentStore(ABC):
"""A persistent store ABC """
conversation_id: str = None
"""This unique string is used to track different interactions with the persistent store.
Generate unique one is None is specfied
"""
@abstractmethod
async def append_entry(self, entry: Any):
"""Adds a record to the end of the list"""
@abstractmethod
async def append_entries(self, entries: list[Any]):
"""Adds records to the end of the list"""
@abstractmethod
async def prepend_entry(self, entry: Any):
"""Adds a record to the beginning of the list"""
@abstractmethod
async def get_all_entries(self) -> list[Any]:
"""Retrieves all the entries for this conversation"""
@abstractmethod
async def get_entries(self, start: int, end: int) -> list[Any]:
"""Retrieves a subset of the entries for this conversation"""
@abstractmethod
async def get_first_entry(self) -> Any:
"""Retrieves the first entry for this conversation"""
@abstractmethod
async def get_last_entry(self) -> Any:
"""Retrieves the last entry for this conversation"""
@abstractmethod
async def clear(self):
"""Wipes the list clean to start from an empty list"""
@abstractmethod
async def remove_first_entry(self) -> Any:
"""Removes the first entry for this conversation"""
@abstractmethod
async def remove_last_entry(self) -> Any:
"""Removes the last entry for this conversation"""
@abstractmethod
async def remove_entries(self, start: int, end: int) -> list[Any]:
"""Removes a subset of the entries for this conversation"""
@abstractmethod
async def count(self) -> int:
"""Returns the total num of messages for this conversation"""
class Memory(PersistentStore):
pass
class PostgreSQLPersistence(PersistentStore):
pass
class RedisPersistent(PersistentStore):
pass
|
Just be careful not to end with a library that looks more like a platform :) I think it should be designed in relation with #695 If not technically then at least with best practice how to maintain the persistency when using graphs, For example Langchain best practice is to push al messages into the graph's state, then there's no requirement to store messages on the agent level any longer. Another thing to consider is the fact that messages may be shared and modified via multiple agents. |
Mem0 uses two components to represent memory: (1) short-term memory, and (2) long-term memory. Vector database for the former and traditional database for the latter. Google, on the other hand, uses three components: (1) core, (2) contextual (long-term) memory, and (3) persistent memory, as outlined in their Titans: Learning to Memorize at Test Time paper. Why three you might ask?
To begin persisting messages in external stores, you need to ask yourself:
Disclaimer: I am not affiliated with these companies but I do have a professional opinion |
I am wondering if there is a plan to add ability to persist messages in remote stores like Redis or document stores instead of using plain memory to cache the messages [1]. There are scenarios where the application memory alone is not sufficient and we need to cache the state in a remote store and pick thing up later.
If this is already possible and I just have to add a plugin or dependency to do this, please let me know as well.
Thanks.
[1] https://ai.pydantic.dev/message-history/
The text was updated successfully, but these errors were encountered: