
feat: Multipart subscriptions #2870

Merged · 26 commits · Mar 25, 2023

Conversation

@calvincestari (Member) commented Mar 8, 2023

This is both the RFC and implementation. I'm capturing the design decision (RFC) in the PR because the original suggested design differs from the final solution. This provides a high-level overview of the design, rather than requiring a full read of the code to understand the change.

Goal

Support GraphQL subscriptions over HTTP using a multipart response (incremental delivery) format similar to that defined in the @defer/@stream RFC.

Existing

GraphQL subscriptions are currently supported in Apollo iOS, but only over WebSocket (see documentation). A subscription can technically be sent over HTTP using the RequestChainNetworkTransport transport, but the response would never be returned because the current request chain is designed for a single response. The technical reason is that urlSession(_:task:didCompleteWithError:) is never called during a multipart data transfer (the request never actually completes), so the completion block of the subscription request is never called.

Solution

Discarded

The first proposed design was to create an entirely new transport that would operate differently from the current concept of a 'request chain' and implement something closer to Apollo Link. The scope of that change is quite large, and it doesn't offer an easy path for users to migrate from existing WebSocket-based subscriptions to HTTP-based subscriptions; additional work would be required to let the user choose which transport a particular subscription should use, and SplitNetworkTransport would need to juggle another transport.

Preferred

The implemented design is to adapt the existing request chain to be re-entrant; the re-entrant behaviour is key to making this design possible. Once the request chain becomes re-entrant all the current interceptors can be reused and it fixes the currently broken behaviour of subscriptions over HTTP. The migration path becomes easier too since users would simply need to send a subscription over the RequestChainNetworkTransport instead of the WebSocketTransport.
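As an illustrative sketch of that migration path (the endpoint URL and `MySubscription` type below are placeholder assumptions, not part of this PR):

```swift
import Apollo

// Illustrative only: "MySubscription" and the endpoint URL are assumed
// placeholders, not code from this PR.
let store = ApolloStore()
let transport = RequestChainNetworkTransport(
  interceptorProvider: DefaultInterceptorProvider(store: store),
  endpointURL: URL(string: "https://example.com/graphql")!
)
let client = ApolloClient(networkTransport: transport, store: store)

// The same subscribe call that previously required WebSocketTransport:
let cancellable = client.subscribe(subscription: MySubscription()) { result in
  switch result {
  case .success(let graphQLResult):
    print(graphQLResult.data as Any)
  case .failure(let error):
    print(error)
  }
}
```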

Technical details

The current limitation of the request chain is that progress through the interceptors is tracked with an index. This makes it impossible to handle a multipart response, where network responses need to move forward from the same interceptor at different times, with no guarantee that the chain has completed before the next response arrives. An index is required because the proceedAsync(request:response:completion:) callback gives no indication of which interceptor has finished processing. This is further complicated by the fact that these interfaces/classes/structs are all public, and there could be any number of custom interceptors that users have built specific to their data flow.

To make the request chain support multipart responses it has to become re-entrant, which separates each multipart response into its own forward continuation of the request chain. To do this in a way that is backwards compatible with custom interceptors, we need to hide the abstraction and handle it internally. This is done by wrapping each interceptor in the supplied interceptor list in a wrapper that functions both as the request chain the interceptor calls back to and as the 'forwarder' of request-chain progress to each wrapped interceptor. On interceptor callback, the wrapper actually uses a new function that requires the interceptor to identify itself; this interceptor identification is what enables independent forward continuation of the request chain.
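The wrapping idea can be sketched as follows; all names here are hypothetical stand-ins for Apollo's internal types, not the actual implementation:

```swift
// Hypothetical sketch of the wrapping idea described above; the type and
// method names here are illustrative, not Apollo's actual internals.
protocol Interceptor {
  func intercept(response: String, proceed: @escaping (String) -> Void)
}

final class InterceptorWrapper {
  private let interceptor: Interceptor
  var next: InterceptorWrapper?
  var onChainComplete: (String) -> Void = { _ in }

  init(_ interceptor: Interceptor) { self.interceptor = interceptor }

  // Each call moves forward from *this* wrapper independently, so a second
  // multipart chunk can re-enter the chain while an earlier chunk is still
  // being processed downstream. The wrapper, not an index on the chain,
  // identifies which interceptor has finished.
  func proceed(response: String) {
    interceptor.intercept(response: response) { [weak self] result in
      guard let self = self else { return }
      if let next = self.next {
        next.proceed(response: result)
      } else {
        self.onChainComplete(result)
      }
    }
  }
}
```

Because each `proceed(response:)` call carries its own continuation from the identified wrapper onward, two multipart chunks can flow through the same interceptor list without sharing a single progress index.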

There is also a new interceptor, MultipartResponseParsingInterceptor, that is inserted before the JSONResponseParsingInterceptor in the interceptor chain. This new interceptor parses the multipart responses at the given boundary and forwards each chunk on to the next interceptor.
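As a rough illustration of boundary parsing (simplified relative to real multipart/mixed framing, which uses CRLF line endings and per-part headers as defined in RFC 2046; this is not the interceptor's actual code):

```swift
import Foundation

// Illustrative sketch only: split a multipart body at "--boundary" markers,
// dropping the empty leading component and the closing "--" terminator.
func chunks(from body: String, boundary: String) -> [String] {
  let delimiter = "--\(boundary)"
  return body
    .components(separatedBy: delimiter)
    .map { $0.trimmingCharacters(in: .whitespacesAndNewlines) }
    .filter { !$0.isEmpty && $0 != "--" }
}

let body = """
--graphql
Content-Type: application/json

{"data":{"tick":1}}
--graphql
Content-Type: application/json

{"data":{"tick":2}}
--graphql--
"""
let parts = chunks(from: body, boundary: "graphql")
// parts.count == 2; each part still carries its own headers
```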

To be done in subsequent PRs:

  • Ensure that SplitNetworkTransport functions correctly for the migration path
  • End-to-end tests
  • Documentation

netlify bot commented Mar 8, 2023: Deploy Preview for apollo-ios-docs canceled. Latest commit b932842; deploy log: https://app.netlify.com/sites/apollo-ios-docs/deploys/641e2169ceee660008d5fc82

@calvincestari (Member, Author) commented:

I'm putting this PR into review so that the general approach and implementation can be commented on. Once I'm back next week I'll finish up the tests for the new MultipartResponseParsingInterceptor interceptor.

@calvincestari calvincestari marked this pull request as ready for review March 15, 2023 21:13
@@ -35,6 +35,7 @@ open class DefaultInterceptorProvider: InterceptorProvider {
CacheReadInterceptor(store: self.store),
NetworkFetchInterceptor(client: self.client),
ResponseCodeInterceptor(),
MultipartResponseParsingInterceptor(),
Member Author:

This is in the interceptor chain for all operation types. Right now it is applicable only to Subscription but supporting @defer will require it for Query too. There is a short-circuit to check for the multipart/mixed header and exit early if it's not present in the response.
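For illustration, such a short-circuit check on the Content-Type header might look like the following. The diff references an `isMultipart` property; this extension body is an assumption about what such a check can look like, not the shipped implementation:

```swift
import Foundation
#if canImport(FoundationNetworking)
import FoundationNetworking
#endif

// Assumed sketch of the multipart/mixed short-circuit; not Apollo's
// actual extension body.
extension HTTPURLResponse {
  var isMultipart: Bool {
    (value(forHTTPHeaderField: "Content-Type") ?? "")
      .lowercased()
      .contains("multipart/mixed")
  }
}
```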


taskData.append(additionalData: data)

if let httpResponse = dataTask.response as? HTTPURLResponse, httpResponse.isMultipart {
Member Author:

This multipart boundary check ensures that only complete chunks are sent to the next interceptor. Any remaining data that needs the next incoming data to form a complete chunk is held back.
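The hold-back behaviour can be sketched like this (a simplified illustration with a toy delimiter, not the interceptor's actual parsing code):

```swift
import Foundation

// Simplified sketch of holding back an incomplete trailing chunk: only
// data up to the last complete delimiter is emitted; the remainder stays
// buffered until more bytes arrive.
struct ChunkBuffer {
  private var buffer = ""
  let delimiter: String

  mutating func append(_ incoming: String) -> [String] {
    buffer += incoming
    var pieces = buffer.components(separatedBy: delimiter)
    // The final component may be a partial chunk; hold it back.
    buffer = pieces.removeLast()
    return pieces.filter { !$0.isEmpty }
  }
}

var buf = ChunkBuffer(delimiter: "--graphql")
let first = buf.append("{\"a\":1}--graphql{\"b\":")   // ["{\"a\":1}"]
let second = buf.append("2}--graphql")                // ["{\"b\":2}"]
```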

Contributor:

Will that chunk get automatically included in the dataTask next time this function is called? I'm not seeing anywhere that you are holding onto the dataTask to merge them together when the next chunk comes in.

Member Author:

All incoming data is appended to any data already related to the task, see line 271. We then work with that data, sending the complete chunks off to the other interceptors, and on line 290 the task data is reset with any remaining bytes, ready for the next callback's data to be appended to it.

Contributor:

Got it. To me, this makes it even more important that task data is mutated atomically. Totally good with this implementation in every other regard, but it seems very easy for this to get corrupted by parallel threads executing in a weird order.

Member Author:

👍🏻 - created #2900 to handle it.

@calvincestari (Member, Author) commented:

@AnthonyMDev this PR is now ready for review.

Note - there is a lot of String to/from Data conversion as the various parsers inspect multipart boundaries and process the separate chunks. I believe this is OK for now; I'll validate with performance tests to get benchmarks recorded into our test suite. Once those benchmarks are in I'm going to see how much of an improvement we can get from working with the byte data directly instead of converting back and forth with String.
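One possible byte-oriented direction (a sketch under the assumption of a simplified delimiter format, not the PR's code): Foundation's `Data.range(of:)` can locate boundaries in the raw bytes without round-tripping through `String`:

```swift
import Foundation

// Hypothetical byte-level scan: extract complete chunks delimited by
// `delimiter`, leaving the incomplete remainder in `data`.
func completeChunks(in data: inout Data, delimiter: Data) -> [Data] {
  var chunks: [Data] = []
  while let range = data.range(of: delimiter) {
    let chunk = data.subdata(in: data.startIndex..<range.lowerBound)
    if !chunk.isEmpty { chunks.append(chunk) }
    data.removeSubrange(data.startIndex..<range.upperBound)
  }
  return chunks  // `data` now holds only the incomplete remainder
}

var stream = Data("a--Xb--Xpartial".utf8)
let found = completeChunks(in: &stream, delimiter: Data("--X".utf8))
// found holds "a" and "b"; stream holds "partial"
```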

@AnthonyMDev (Contributor) left a comment:

This looks awesome! I have a few questions and suggestions, but very minor. Great job on this. It was not a simple task...

Resolved review threads: Sources/Apollo/ApolloClient.swift, Sources/Apollo/MultipartResponseParsingInterceptor.swift (outdated)
guard let taskData = $0[dataTask.taskIdentifier] else {
assertionFailure("No data found for task \(dataTask.taskIdentifier), cannot append received data")

guard let taskData = self.tasks[dataTask.taskIdentifier] else {
Contributor:

By removing the self.$tasks.mutate block, this is no longer atomic. I get why it's not necessary for updating the TaskData in place, because TaskData is a class. But does this mean that this is no longer guaranteed to happen sequentially? Can these create problematic race conditions?

Member Author:

tasks is annotated with the @Atomic property wrapper so all access to the task list is still atomic; at minimum we're assured that the task won't be removed. We only need mutate when we want to alter the task list itself.

data within each TaskData is not protected by anything though so yes I suppose there is a chance of a race condition when working with incoming data for a particular task. We shouldn't be using the list atomic lock to ensure consistent data access for a single task though. I suspect it was either a mistake to have mutate there in the first place or it was being abused for the purpose of single task data access.

The default callback queue is main which, being a serial queue, will ensure there is no race condition. Anyone who cares about a performant UI though will be giving us their own queue, which could be concurrent and introduce the potential for a race condition.

We could use @Atomic on TaskData.data to prevent this but I'd want a performance benchmark test on that to measure the cost of doing so.
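For illustration, a minimal @Atomic-style property wrapper can look like the following; Apollo's actual @Atomic implementation may differ in detail:

```swift
import Foundation

// Minimal sketch of an @Atomic-style property wrapper guarding reads,
// writes, and read-modify-write sequences with a lock. Illustrative only;
// not Apollo's shipped implementation.
@propertyWrapper
final class Atomic<Value> {
  private let lock = NSLock()
  private var value: Value

  init(wrappedValue: Value) { self.value = wrappedValue }

  var wrappedValue: Value {
    get { lock.lock(); defer { lock.unlock() }; return value }
    set { lock.lock(); defer { lock.unlock() }; value = newValue }
  }

  // Counterpart of the `mutate` discussed above: the whole transform runs
  // under the lock, so concurrent read-modify-write cycles can't interleave.
  func mutate(_ transform: (inout Value) -> Void) {
    lock.lock()
    defer { lock.unlock() }
    transform(&value)
  }
}
```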

Contributor:

Yes, I agree with your assessment here. I do think we should be making the mutation of each individual TaskData be atomic though. This is far too likely to cause a race condition for someone in its current state.

Member Author:

👍🏻 - created #2900 to handle it.

Resolved (outdated): Tests/ApolloTests/WebSocket/WebSocketTests.swift
@calvincestari calvincestari merged commit ce3d988 into main Mar 25, 2023
@calvincestari calvincestari deleted the feature/multipart-subscriptions branch March 25, 2023 03:04
@yonaskolb yonaskolb mentioned this pull request Apr 5, 2023