Releases: pathwaycom/pathway

v0.11.0

10 May 14:56

Added

Embedders in the LLM xpack now have a get_embedding_dimension method that returns the number of dimensions used by the chosen embedder.
pathway.stdlib.indexing.nearest_neighbors, with implementations of pathway.stdlib.indexing.data_index.InnerIndex based on k-NN via LSH (implemented in Pathway), and k-NN provided by USearch library.
pathway.stdlib.indexing.vector_document_index, with a few predefined instances of pathway.stdlib.indexing.data_index.DataIndex.
pathway.stdlib.indexing.bm25, with implementations of pathway.stdlib.indexing.data_index.InnerIndex based on BM25 index provided by Tantivy.
pathway.stdlib.indexing.full_text_document_index, with a predefined instance of pathway.stdlib.indexing.data_index.DataIndex.
Introduced the reranker module under llm.xpacks. It includes a few re-ranking strategies and utility functions for RAG applications.

Changed

BREAKING: windowby generates IDs of produced rows differently than in the previous version.
BREAKING: pw.io.csv.write prints printable non-ascii characters as regular text, not \u{xxxx}.
BREAKING: Connector methods pw.io.elasticsearch.read, pw.io.debezium.read, pw.io.fs.read, pw.io.jsonlines.read, pw.io.kafka.read, pw.io.python.read, pw.io.redpanda.read, pw.io.s3.read now check the type of the input data. Previously, the type was not checked when the provided format was "json"/"jsonlines". If the data is inconsistent with the provided schema, the row is skipped and an error message is emitted.
BREAKING: query and query_as_of_now methods of pathway.stdlib.indexing.data_index.DataIndex now return pathway.JoinResult, to allow resolving column name conflicts (between columns in the table with queries and table with index data).
BREAKING: DataIndex methods query and query_as_of_now now return score in a column named _pw_index_reply_score (defined as _SCORE variable in pathway.stdlib.indexing.colnames.py).
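
The skip-and-report behavior described for the connector type checks can be pictured with a small plain-Python sketch. This is not Pathway code: the SCHEMA mapping, the filter_rows helper, and the error message wording are all hypothetical, chosen only to illustrate "rows that do not match the schema are skipped and an error is emitted".

```python
from typing import Any

# Hypothetical schema for illustration: column name -> expected Python type.
SCHEMA = {"user_id": int, "name": str}


def filter_rows(rows: list[dict[str, Any]], schema: dict[str, type]):
    """Keep rows whose values match the schema; record one error per skipped row."""
    kept, errors = [], []
    for i, row in enumerate(rows):
        # Collect every column whose value has the wrong type (or is missing).
        bad = [
            col for col, expected in schema.items()
            if not isinstance(row.get(col), expected)
        ]
        if bad:
            errors.append(f"row {i}: columns {bad} do not match the schema; row skipped")
        else:
            kept.append(row)
    return kept, errors


rows = [
    {"user_id": 1, "name": "Ada"},
    {"user_id": "2", "name": "Bob"},  # user_id is a string, not an int
]
kept, errors = filter_rows(rows, SCHEMA)
```

Here only the first row survives, and one error is recorded for the second. Pipelines that relied on loosely typed JSON input may therefore see fewer rows after upgrading.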

Removed

BREAKING: pathway.stdlib.indexing.data_index.VectorDocumentIndex class. Some predefined instances are now meant to be obtained via methods provided in pathway.stdlib.indexing.vector_document_index.
BREAKING: with_distances parameter of query and query_as_of_now methods in pathway.stdlib.indexing.data_index.DataIndex. Instead of 'distance', we now operate with a more general term 'score' (higher = better). For distance-based indices, the score is usually defined as the negative distance. The score is now always included in the answer, as long as the underlying index returns something that indicates the quality of a match.
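
To make the "higher = better" convention concrete, here is a minimal plain-Python sketch of ranking by score when the score is the negative Euclidean distance. It does not use Pathway; the helper names score_from_distance and top_k and the sample points are assumptions made for illustration.

```python
import math


def score_from_distance(distance: float) -> float:
    """Turn a distance (lower = better) into a score (higher = better)."""
    return -distance


def top_k(query, points, k):
    """Return the k (name, score) pairs with the highest score."""
    scored = [
        (name, score_from_distance(math.dist(query, vec)))
        for name, vec in points.items()
    ]
    # Sort descending by score, so the closest point comes first.
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


points = {"a": (0.0, 0.0), "b": (3.0, 4.0), "c": (1.0, 0.0)}
best = top_k((0.0, 0.0), points, 2)
```

With this convention, code that previously sorted ascending by distance would sort descending by score instead.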

v0.10.1

30 Apr 12:25

Added

query method to VectorStoreServer to provide an API compatible with DataIndex.
AdaptiveRAGQuestionAnswerer to xpacks.question_answering. End-to-end pipeline and accompanying code for Private RAG showcase.

v0.10.0

24 Apr 22:21

Added

Pathway now warns when unintentionally creating a Table with an empty universe.
pw.io.kafka.write in raw and plaintext formats now supports output for tables with multiple columns. For such tables, it requires specifying the column to be used as the value of the produced Kafka messages, and optionally allows providing a column to be used as the key.
pw.io.kafka.write can now output values from the table using Kafka message headers in 'raw' and 'plaintext' output format.

Changed

instance arguments to groupby, join, with_id_from now determine how entries are distributed between machines.
flatten results remain on the same machine as their source entries.
join sends each record between machines at most once.
BREAKING: flatten, join, groupby (if used with instance), with_id_from (if used with instance) generate IDs of the produced rows differently than in the previous versions.
pathway spawn with multiple workers prints only output from the first worker.
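
The note above that instance arguments determine how entries are distributed between machines can be pictured as hash partitioning. The following is a plain-Python sketch, not Pathway code: the CRC32 hash, the modulo scheme, and the name worker_for are assumptions made for illustration, and Pathway's actual placement logic may differ.

```python
import zlib
from collections import defaultdict


def worker_for(instance: str, n_workers: int) -> int:
    """Map an instance value to a worker index (hypothetical scheme)."""
    # CRC32 is used because it is stable across processes, unlike built-in hash().
    return zlib.crc32(instance.encode("utf-8")) % n_workers


rows = [("eu", 1), ("us", 2), ("eu", 3), ("us", 4), ("asia", 5)]

# Group rows by the worker their instance maps to.
placement = defaultdict(list)
for instance, value in rows:
    placement[worker_for(instance, 4)].append((instance, value))
```

Under any scheme of this kind, rows that share an instance value always end up on the same worker, which is what lets a grouped or joined computation for that instance run without further data movement.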

v0.9.0

18 Apr 21:01

Added

pw.reducers.latest and pw.reducers.earliest, which return the value with the maximal and minimal processing time assigned, respectively.
pw.io.kafka.write can now produce messages containing raw bytes in case the table consists of a single binary column and raw mode is specified. Similarly, this method will provide plaintext messages if plaintext mode is chosen and the table consists of a single string-typed column.
pw.io.pubsub.write connector for publishing Pathway tables into Google PubSub.
Argument strict_prompt to answer_with_geometric_rag_strategy and answer_with_geometric_rag_strategy_from_index that allows optimizing prompts for smaller open-source LLMs.
Temporarily switched LiteLLMChat's generation method to the sync version due to a bug when using json mode with Ollama.
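
The semantics described for pw.reducers.latest and pw.reducers.earliest can be sketched in plain Python as "pick the value whose processing time is largest or smallest within each group". This is an illustration only, not Pathway code: the event tuples, the reduce_by_time helper, and the explicit processing-time field are assumptions, since Pathway assigns processing times internally.

```python
from collections import defaultdict

# Hypothetical events: (processing_time, key, value).
events = [
    (1, "a", 10.0),
    (3, "a", 12.5),
    (2, "a", 11.0),
    (5, "b", 7.0),
]


def reduce_by_time(events):
    """For each key, return the values seen at the minimal and maximal processing time."""
    groups = defaultdict(list)
    for time, key, value in events:
        groups[key].append((time, value))
    return {
        key: {
            "earliest": min(items, key=lambda item: item[0])[1],
            "latest": max(items, key=lambda item: item[0])[1],
        }
        for key, items in groups.items()
    }


summary = reduce_by_time(events)
```

Note that the result depends on arrival order rather than on any timestamp stored in the data itself.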

Changed

BREAKING: pw.io.kafka.read no longer parses messages as UTF-8 when raw mode is specified. To preserve the previous behavior, use the plaintext mode.
BREAKING: Table.flatten now flattens one column and spreads every other column of the table, instead of taking other columns from the argument list.
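
The new Table.flatten behavior (one flattened column, every other column carried along) can be illustrated on plain dictionaries. This is a sketch of the semantics only, not Pathway's implementation; the flatten helper and the sample rows are hypothetical.

```python
def flatten(rows, column):
    """Emit one output row per element of row[column], copying all other columns."""
    out = []
    for row in rows:
        for item in row[column]:
            new_row = dict(row)     # copy every column of the source row
            new_row[column] = item  # replace the list with a single element
            out.append(new_row)
    return out


rows = [
    {"id": 1, "tags": ["a", "b"], "owner": "x"},
    {"id": 2, "tags": ["c"], "owner": "y"},
]
flat = flatten(rows, "tags")
```

Every column other than the flattened one is repeated for each produced row, so no separate argument list of columns to keep is needed.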

v0.8.6

10 Apr 20:16

Added

pw.io.bigquery.write connector for writing Pathway tables into Google BigQuery.
filepath_globpattern parameter to the query method in VectorStoreClient, for specifying which files should be considered in the query.
Improved compatibility of pw.Json with standard methods such as len(), int(), float(), bool(), iter(), reversed() when feasible.

Changed

pw.io.postgres.write can now parallelize writes to several threads if several workers are configured.
Pathway now checks types of pointers rigorously. Indexing a table with a number or types of columns that do not match what was used to create the index now results in a TypeError.
pw.Json.as_float() method now supports integer JSON values.

v0.8.5

27 Mar 22:03

Added

New function answer_with_geometric_rag_strategy_from_index, which allows using answer_with_geometric_rag_strategy without first retrieving documents from the index.
Added support for custom state serialization to udf_reducer.
Introduced instance parameter in AsyncTransformer. All calls with a given (instance, processing_time) pair are returned at the same processing time. Ordering is preserved within a single instance.
Added successful, failed, finished properties to AsyncTransformer. They return tables with successful calls, failed calls and all finished calls, respectively.

Changed

Property result of AsyncTransformer is deprecated. Property successful should be used instead.
pw.io.csv.read, pw.io.jsonlines.read, pw.io.fs.read, pw.io.plaintext.read now handle path as a glob pattern and read all matched files and directories recursively.
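
The glob handling described for the file readers can be approximated with the Python standard library: expand the pattern, then descend into any matched directory. This is only a sketch of the described behavior, not the connectors' actual code; the expand helper and the sample file layout are assumptions.

```python
import glob
import os
import tempfile


def expand(pattern):
    """Return all files matched by the pattern, descending into matched directories."""
    files = []
    for match in glob.glob(pattern):
        if os.path.isdir(match):
            # A matched directory contributes every file below it.
            for root, _dirs, names in os.walk(match):
                files.extend(os.path.join(root, name) for name in names)
        else:
            files.append(match)
    return files


with tempfile.TemporaryDirectory() as tmp:
    os.makedirs(os.path.join(tmp, "data_a", "nested"))
    for rel in ("data_a/one.csv", "data_a/nested/two.csv", "data_b.csv", "other.csv"):
        with open(os.path.join(tmp, *rel.split("/")), "w") as handle:
            handle.write("x\n")
    # Normalize to relative, slash-separated paths for display.
    found = sorted(
        os.path.relpath(path, tmp).replace(os.sep, "/")
        for path in expand(os.path.join(tmp, "data_*"))
    )
```

In this layout the pattern data_* matches one directory and one file, so both CSV files under data_a and the file data_b.csv are picked up, while other.csv is not.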

v0.8.4

18 Mar 17:52

Fixed

Pathway now requires the LiteLLM package only if you use one of the wrappers for LiteLLM.
Retries are implemented in pw.io.airbyte.read.
State processing protocol is updated in pw.io.airbyte.read.

v0.8.3

13 Mar 21:17

Added

New parameters of pw.UDF class and pw.udf decorator: return_type, deterministic, propagate_none, executor, cache_strategy.
The LLM Xpack now provides integrations with LlamaIndex and LangChain for running the Pathway VectorStore server.

Changed

Subclassing UDFSync and UDFAsync is deprecated. UDF should be subclassed to create a new UDF.
Passing keyword arguments to pw.apply, pw.apply_with_type, pw.apply_async is deprecated. In the future, they'll be used for configuration, not passing data to the function.

Fixed

Fixed a minor bug in the Table.groupby() method which sometimes prevented accessing certain columns in the following reduce().
Fixed warnings from using OpenAI Async embedding model in the VectorStore in Colab.

v0.8.2

28 Feb 12:56

Added

%:z timezone format code to strptime.
Support for Airbyte connectors via pw.io.airbyte.
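
For readers unfamiliar with the %:z code: assuming it follows the common convention of a UTC offset written with a colon (for example +01:00), the kind of input it targets looks like the timestamp below. The sketch uses Python's standard datetime.strptime, whose %z directive also accepts the colon form, rather than Pathway's own strptime, so the format string here is not a Pathway format string.

```python
from datetime import datetime, timedelta, timezone

# A timestamp whose UTC offset is written with a colon.
raw = "2024-02-28T12:56:00+01:00"

# Standard-library parsing; %z accepts offsets with a colon in Python 3.7+.
parsed = datetime.strptime(raw, "%Y-%m-%dT%H:%M:%S%z")

# Convert to UTC to confirm the offset was applied.
as_utc = parsed.astimezone(timezone.utc)
```

After parsing, the value carries a one-hour offset, and converting it to UTC shifts the clock time back by one hour.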

v0.8.1

15 Feb 13:42

Added

Introduced the send_alerts function in the pw.io.slack namespace, enabling users to send messages from a specified column directly to a Slack channel.
Enhanced the pw.io.http.rest_connector by introducing an additional argument called request_validator. This feature empowers users to validate payloads and raise an HTTP 400 error if necessary.

Fixed

Addressed an issue in pw.io.xpacks.llm.VectorStoreServer where the computation of the last modification timestamp for an indexed document was incorrect.

Changed

Improved the behavior of pw.io.kafka.write. It now includes retries when sending data to the output topic encounters failures.
