PartitionKeyRangeCache should use NonBlockingAsyncCache #3060

j82w · 2022-03-02T22:12:56Z

The PartitionKeyRangeCache is using the AsyncCache and is maps a CollectionRoutingMap to a Container RID. When there is a split or other scenario it will use change feed to only get the new partition key ranges and create a new CollectionRoutingMap by updating the values returned in the change feed.

The issue is if an exception occurs like a timeout or some other transient failure the entire CollectionRoutingMap was removed from the cache. This means if any transient failure occurs the SDK has to recreate the entire CollectionRoutingMap by reading the entire changefeed again. This now means that during a split if a transient issue occurs all requests are blocked until the new CollectionRoutingMap is created.

This problem is worse because user can currently disable 429 retries as described in #3055. If a 429 is hit the CollectionRoutingMap will be removed from the cache and it will to be built by reading all the ranges again.

Solution:
The PartitionKeyRangeCache should be converted to use the new NonBlockingAsyncCache which Address cache already uses.

Test Case:
If the container has 5 Partitions and a split occurs the 4 other partitions should always be accessible even if the call to get the new partitions fail or is slow.

j82w added the bug Something isn't working label Mar 2, 2022

j82w added this to the March 2022 milestone Mar 2, 2022

j82w added feature-request New feature or request and removed bug Something isn't working labels Mar 2, 2022

This was referenced Mar 8, 2022

Availability: Adds improved cache logic for partition key ranges #3072

Closed

Availability: Adds non-blocking cache to partition key ranges #3080

Merged

j82w closed this as completed in #3080 Mar 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PartitionKeyRangeCache should use NonBlockingAsyncCache #3060

PartitionKeyRangeCache should use NonBlockingAsyncCache #3060

j82w commented Mar 2, 2022

PartitionKeyRangeCache should use NonBlockingAsyncCache #3060

PartitionKeyRangeCache should use NonBlockingAsyncCache #3060

Comments

j82w commented Mar 2, 2022