Create prototype gRPC getCompactionJob() service for external compactions #4715
Conversation
This includes the dependencies and plugin to generate the gRPC service and protocol buffer classes.
This adds the definitions we need for the getCompactionJob() API and adds the generated source.
getCompactionJob() now uses gRPC. To minimize the changes, the existing Thrift objects are converted to protobuf and back. If protobuf is kept then Thrift will eventually be removed entirely.
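As a purely illustrative sketch of that conversion step (the placeholder record types below stand in for the real generated Thrift and protobuf classes, which have builders and many more fields):

```java
// Illustration only: bridge the existing Thrift job object to a protobuf message
// before crossing the gRPC boundary, and map it back on the other side.
public final class JobConversionSketch {

  // placeholder stand-ins for the generated Thrift and protobuf job types
  record ThriftJob(String externalCompactionId, short priority) {}
  record ProtoJob(String externalCompactionId, short priority) {}

  static ProtoJob toProto(ThriftJob t) {
    return new ProtoJob(t.externalCompactionId(), t.priority());
  }

  static ThriftJob fromProto(ProtoJob p) {
    return new ThriftJob(p.externalCompactionId(), p.priority());
  }
}
```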
Some comments; I haven't gotten to the actual code changes yet.
I would be a proponent of replacing the entire Coordinator <-> Compactor communication pathway with GRPC. If you did this, then we would not need to do this conversion of Thrift <-> Protobuf?
Right, we shouldn't need to do any conversion in the final version. Ideally we would not use Thrift at all; it was just simpler to do the conversion here because when I started to get rid of Thrift, the changes cascaded everywhere else since our Thrift objects are so tightly coupled.
/**
 * Simple wrapper to start/stop the grpc server
 */
public class CompactionCoordinatorServiceServer {
I'm wondering if a better place to start looking at GRPC is to try and create something to test Kerberos and TLS with GRPC. Presumably that would lead to a base class that can be used for all server implementations.
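For illustration, a minimal sketch of what such a reusable wrapper could look like, assuming the credentials are injected so the same class works with InsecureServerCredentials now and TLS later (names are placeholders, not the PR's code):

```java
import java.io.IOException;

import io.grpc.BindableService;
import io.grpc.Grpc;
import io.grpc.Server;
import io.grpc.ServerCredentials;

public class GrpcServiceServer {

  private final Server server;

  // Credentials are injected so the wrapper is not tied to a single security setup.
  public GrpcServiceServer(ServerCredentials credentials, BindableService service, int port) {
    this.server = Grpc.newServerBuilderForPort(port, credentials).addService(service).build();
  }

  public void start() throws IOException {
    server.start();
  }

  public void stop() {
    server.shutdown();
  }
}
```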
try {
  // Start up the grpc compaction service
  // TODO: The port is just hardcoded for now and will need to be configurable
  grpcService =
I think ServiceDescriptor is going to need to change, even if just slightly. The ServiceDescriptor objects are stored in ZooKeeper with the ServiceLock as a way to advertise which services are available and on which ports.
I pushed up changes that take advantage of #4726 to get a new job async that returns a CompletableFuture. The result is not sent back to the client (coordinator) until the job is completed. To make it simple I just copied the relevant code out of the Thrift method because the current method can't return a CompletableFuture as it's an RPC service. We need to decide if we want to keep the Thrift code around temporarily so either RPC can be used, or to just drop it. I also went ahead and dropped the loop that tries to get another job if the reservation fails, to simplify things. I figure we just return the empty job and it would try again and that is fine.

There are a couple of things we could probably improve here. First, I am using future.thenApply() and future.thenAccept() and it would probably be better to use a thread pool and the Async versions of those methods. Second, there's currently no timeout, so we need to look at adding one so things are not waiting forever.
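For the timeout point, a minimal sketch of one option using CompletableFuture.orTimeout (Java 9+); the job type and the "empty job" representation are placeholders for whatever the real code uses:

```java
import java.util.Optional;
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.TimeUnit;

class LongPollTimeoutSketch {

  // Bound the wait on a pending job; on timeout (or any failure) fall back to an
  // empty result so the compactor simply asks again.
  static <J> CompletableFuture<Optional<J>> withTimeout(CompletableFuture<J> pendingJob,
      long timeout, TimeUnit unit) {
    return pendingJob.thenApply(Optional::of)
        .orTimeout(timeout, unit) // completes exceptionally if nothing arrives in time
        .exceptionally(ex -> Optional.empty());
  }
}
```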
I was experimenting with this locally and it's really neat to see it in action. Made the following throwaway changes to dae30cf. The logging changes were to get a bit more information about timing. The test changes were made to cause a surge of bulk import files after the system was sitting idle for a bit.
diff --git a/server/manager/src/main/java/org/apache/accumulo/manager/compaction/coordinator/CompactionCoordinator.java b/server/manager/src/main/java/org/apache/accumulo/manager/compaction/coordinator/CompactionCoordinator.java
index ef07b78d8a..838fab35a7 100644
--- a/server/manager/src/main/java/org/apache/accumulo/manager/compaction/coordinator/CompactionCoordinator.java
+++ b/server/manager/src/main/java/org/apache/accumulo/manager/compaction/coordinator/CompactionCoordinator.java
@@ -462,7 +462,7 @@ public class CompactionCoordinator
TIME_COMPACTOR_LAST_CHECKED.put(groupId, System.currentTimeMillis());
return jobQueues.getAsync(groupId).thenApply(metaJob -> {
- LOG.trace("Next metaJob is ready {}", metaJob.getJob());
+ LOG.debug("Next metaJob is ready {}", metaJob.getJob());
Optional<CompactionConfig> compactionConfig = getCompactionConfig(metaJob);
// this method may reread the metadata, do not use the metadata in metaJob for anything after
@@ -471,6 +471,7 @@ public class CompactionCoordinator
var kind = metaJob.getJob().getKind();
+ LOG.debug("Reserving compaction job {}", externalCompactionId);
// Only reserve user compactions when the config is present. When compactions are canceled the
// config is deleted.
var cid = ExternalCompactionId.from(externalCompactionId);
diff --git a/test/src/main/java/org/apache/accumulo/test/ComprehensiveBaseIT.java b/test/src/main/java/org/apache/accumulo/test/ComprehensiveBaseIT.java
index 1dc1c2c0f9..a3d5a61856 100644
--- a/test/src/main/java/org/apache/accumulo/test/ComprehensiveBaseIT.java
+++ b/test/src/main/java/org/apache/accumulo/test/ComprehensiveBaseIT.java
@@ -96,6 +96,7 @@ import org.apache.accumulo.core.security.ColumnVisibility;
import org.apache.accumulo.core.security.NamespacePermission;
import org.apache.accumulo.core.security.SystemPermission;
import org.apache.accumulo.core.security.TablePermission;
+import org.apache.accumulo.core.util.UtilWaitThread;
import org.apache.accumulo.harness.SharedMiniClusterBase;
import org.apache.accumulo.test.util.Wait;
import org.apache.hadoop.fs.FileUtil;
@@ -126,12 +127,24 @@ public abstract class ComprehensiveBaseIT extends SharedMiniClusterBase {
try (AccumuloClient client = Accumulo.newClient().from(getClientProps()).build()) {
client.tableOperations().create(table);
- bulkImport(client, table, List.of(generateKeys(0, 100), generateKeys(100, 200)));
+ // sleep for a bit to let compactors idle
+ UtilWaitThread.sleep(60000);
+
+ // add 4 files to a single tablet, should cause tablet to need compaction
+ bulkImport(client, table, List.of(generateKeys(0, 50), generateKeys(50, 100),
+ generateKeys(100, 150), generateKeys(150, 200)));
verifyData(client, table, AUTHORIZATIONS, generateKeys(0, 200));
+ UtilWaitThread.sleep(60000);
+
+ // add 4 more files to tablet
bulkImport(client, table,
- List.of(generateKeys(200, 300), generateKeys(300, 400), generateKeys(400, 500)));
+ List.of(generateKeys(200, 300), generateKeys(300, 400),
+ generateKeys(400, 450), generateKeys(450, 500)));
+
+ UtilWaitThread.sleep(60000);
+
verifyData(client, table, AUTHORIZATIONS, generateKeys(0, 500));
}
Ran the modified test and then looked in the manager log and saw the following.
Summarizing the above log messages.
So this is really neat, the async request starts working immediately after the job is queued and within 3ms to 4ms has reserved the job and returned it to the compactor. Saw something else that was interesting besides timing in the logs. The numbers after the times in the logs are thread ids. So thread id
Still looking over this. Was able to run mvn clean package -Pprotobuf -DskipTests locally and that worked w/o issue, and I saw the protobuf code being regenerated.
Looking over this I was trying to understand what still needs to be done and came up with the following list. What else is there to be done?
- fix hard coded port
- add thread pool for async rpc request response processing
- drop thrift coordinator service and only have grpc coordinator service, this opens up the possibility of grpc<->accumulo_data_strucs conversions instead of accumulo_data_strucs<->thrift<->grpc conversions.
- add needed grpc jars to tarball
- explore supporting kerberos and ssl w/ grpc
- add accumulo config for thread pools for processing grpc requests
For what is left to be done, what do you think can be done in the initial PR and what could be deferred to follow on PRs?
@@ -164,6 +164,13 @@
      <type>pom</type>
      <scope>import</scope>
    </dependency>
    <dependency>
      <groupId>io.grpc</groupId>
      <artifactId>grpc-bom</artifactId>
If there are jars needed at runtime for grpc, then the assembly pom may need to be modified to include those so that they show up in the tarball. Not sure what, if anything, needs to be done for this. Could be a follow on issue.
LOG.trace("getCompactionJob called for group {} by compactor {}", groupId, compactorAddress);
TIME_COMPACTOR_LAST_CHECKED.put(groupId, System.currentTimeMillis());

return jobQueues.getAsync(groupId).thenApply(metaJob -> {
Will eventually need to process these in another thread pool
We should be able to just use thenApplyAsync() and pass in an executor. We could start with creating a regular thread pool for this, but I'm wondering if this would be a time to look into Virtual threads.
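A minimal sketch of that idea, assuming the same future-returning queue; the pool size, names, and the virtual-thread alternative are illustrative only:

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.function.Function;

class AsyncJobDispatchSketch {

  // dedicated pool so the job-reservation callback does not run on gRPC's default executor;
  // on Java 21+ this could instead be Executors.newVirtualThreadPerTaskExecutor()
  private final ExecutorService jobResponsePool = Executors.newFixedThreadPool(8);

  <T, R> CompletableFuture<R> dispatch(CompletableFuture<T> pendingJob,
      Function<T, R> reserveAndConvert) {
    // thenApplyAsync hands the callback to our executor instead of the completing thread
    return pendingJob.thenApplyAsync(reserveAndConvert, jobResponsePool);
  }
}
```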
public CompactionCoordinatorServiceServer(
    CompactionCoordinatorServiceGrpc.CompactionCoordinatorServiceImplBase service, int port)
    throws IOException {
  this(Grpc.newServerBuilderForPort(port, InsecureServerCredentials.create()), service, port);
Would you happen to know the threading model for processing incoming requests? Seems like we could possibly have two thread pools: one for processing incoming requests and another pool for async processing of responses. Not sure though, need to look at the grpc docs. Currently for thrift we have config related to thread pool sizes; wondering if we will need to have config for this.
We should be able to configure different pools for requests and responses for sure. gRPC is async-based out of the box, so we will be using a different thread for sending responses.
I found this helpful post, written by one of the contributors to the Java gRPC library, which explains how to configure the thread pool for the responses vs the requests:
The Executor that you provide is what actually executes the callbacks of the rpc. This frees up the EventLoop to continue processing data on the connection. When a new message arrives from the network, it is read on the event loop, and then propagated up the stack to the executor. The executor takes the messages and passes them to your ServerCall.Listener which will actually do the processing of the data.
By default, gRPC uses a cached thread pool so that it is very easy to get started. However it is strongly recommended you provide your own executor. The reason is that the default threadpool behaves badly under load, creating new threads when the rest are busy.
In order to set up the event loop group, you call the workerEventLoopGroup method on NettyServerBuilder. gRPC is not strictly dependent on Netty (other server transports are possible) so the Netty specific builder must be used.
We are using the generic ServerBuilder API right now, which allows setting an executor, and according to that post that executor is for the responses. We could switch to using the more specific NettyServerBuilder instead, which would also allow us to configure the event loop for the incoming requests and tune that.
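A sketch of what that tuning might look like if we switched to the Netty-specific builder; the pool and event loop sizes are placeholders, not recommendations:

```java
import java.util.concurrent.Executors;

import io.grpc.BindableService;
import io.grpc.Server;
import io.grpc.netty.NettyServerBuilder;
import io.netty.channel.nio.NioEventLoopGroup;
import io.netty.channel.socket.nio.NioServerSocketChannel;

class NettyGrpcServerSketch {

  Server build(BindableService service, int port) {
    return NettyServerBuilder.forPort(port)
        // application callbacks (the "response" side) run on this executor
        .executor(Executors.newFixedThreadPool(16))
        // event loops that read/write the connections (the "request" side)
        .bossEventLoopGroup(new NioEventLoopGroup(1))
        .workerEventLoopGroup(new NioEventLoopGroup(4))
        // channel type is set explicitly when supplying custom event loop groups
        .channelType(NioServerSocketChannel.class)
        .addService(service)
        .build();
  }
}
```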
I will take a look more today at the state of this and report what I think. The tricky part is definitely trying to figure out how big of a scope to make this task vs follow on tasks. We can likely make a lot of things follow on tasks (like SSL and Kerberos support) because this is for the elasticity and 4.0 branch, so it's not going to be released anytime soon and it doesn't have to be perfect right away. But if we defer things to follow on issues we should certainly document them so they are not forgotten. It would be nice to at least clean up the protobuf definitions a bit (move things into different namespaces like thrift), and it would also be nice if we could drop the thrift <-> gRPC conversion entirely, but I need to dive into the code and see how big of a change that is. I think we would want to use gRPC for all the RPC calls for the compaction service, so I can look today to see how much effort that is and if it makes sense to do it now or split it up.
Update getCompactionJob() to use native accumulo to protobuf conversion and skip thrift conversion where possible.
I have migrated most of the Compaction Coordinator Service to gRPC at this point. I also updated the gRPC client to be shared across the different calls, as the channel is thread safe and can be re-used, but we are not "pooling" different channels like Thrift. I still need to look more into how their client code works and whether we need to pool or not. Some of the rpc calls are commented out because the Manager actually uses them. I also moved the gRPC objects into their own packages to model after our thrift model. We need to eventually move that into its own jar.

This has been talked about a bit already, and that is looking at maybe having a common API for the RPC layer. That is something I need to look more into, and if we were going to do that, how far we take it and what the scope is. Whatever we decide, we should plan before we go too far. At the very least I think it would be nice to re-factor some things and have some common objects so we stop leaking rpc objects. For example, I think we should have a common object for tracing and credentials. Right now we are passing around TInfo and TCredentials (for which I also created PInfo and PCredentials for protobuf), and I think stuff like that should be converted to common objects so we don't leak them. A good example is TCredentials leaking into ClientContext. It would be nice if we could have a more generic object and conversion.

Other than that, I haven't had time to finish up the rest of the list yet, but there are still several things to do and I'm still looking at how to split things up and what to make follow on issues. This PR is getting pretty big by now, so it would probably be best to move as much as possible into follow on issues to make things easier to review.
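Purely as an illustration of the "common object" idea (all names here are invented, not existing Accumulo types):

```java
// RPC-neutral credentials holder; each transport module would own its own
// TCredentials/PCredentials mapping so the generated types stop appearing in
// client-facing code such as ClientContext.
public record RpcCredentials(String principal, byte[] token, String instanceId) {
  // hypothetical per-transport mappings, kept out of client code:
  // static RpcCredentials fromThrift(TCredentials tcreds) { ... }
  // static RpcCredentials fromProto(PCredentials pcreds) { ... }
}
```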
Here is an update to the current state of things. This PR has gotten quite large now so I think it's about time to wrap it up and look at follow on tasks for more work to be done. Below is where things are at for completed, still to be done in this PR, and future work. Finished:
Todo in this PR:
Follow on Work:
I feel like we put the cart before the horse here. These items tagged for follow-on work are actually foundational things that, IMO, should be designed and tested first. I think the work that you have done here is useful to show that grpc can be used, but I'm not convinced that this is the starting point for the introduction of grpc into the codebase.
So to clarify, the follow on work items are of course foundational things and need to be done before anything could be released. It's not that they are not important; it's that this PR is already 100 files changed and enormous, and I don't think we should try and jam everything into one PR or you end up trying to review too much. @keith-turner and I talked about creating another branch like no-chop merge, but we didn't really think it would be a huge deal to just merge things into elasticity as long as the build works. However, it would certainly be possible to just create another branch like we did with no-chop merge and have multiple PRs against that first before merging back into elasticity.
IMO, I think the foundation work should be done before this is merged, not released. The implementation of the foundation work may change the approach taken in this PR.
I have no issue with a long-lived feature branch, much like elasticity, for the grpc work. I don't think these changes should be merged into elasticity at this time.
If no one has any objections I can create a feature branch (I'll probably just call it grpc).
I think a feature branch would be good, we can move faster. Naming it grpc SGTM.
I created a new grpc branch.
I started looking into the authentication support a couple weeks ago and resumed looking into it today to see what was supported. Mutual TLS is supported, since it's Java and Netty supports that; however, Kerberos is not supported out of the box. The only authentication that is supported out of the box is SSL/TLS, OAUTH, and a custom Google mechanism called ALTS. See https://grpc.io/docs/guides/auth/. I did some digging to see if anyone has implemented anything we could use and found the following:
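For reference, a minimal sketch of the mutual TLS option that is supported out of the box, using grpc-java's TlsServerCredentials; the file paths are placeholders:

```java
import java.io.File;
import java.io.IOException;

import io.grpc.BindableService;
import io.grpc.Grpc;
import io.grpc.Server;
import io.grpc.ServerCredentials;
import io.grpc.TlsServerCredentials;

class MutualTlsServerSketch {

  Server build(BindableService service, int port) throws IOException {
    ServerCredentials creds = TlsServerCredentials.newBuilder()
        // server certificate chain and private key (placeholder paths)
        .keyManager(new File("/path/to/server.crt"), new File("/path/to/server.key"))
        // CA certificates used to verify client certificates
        .trustManager(new File("/path/to/ca.crt"))
        // require a client certificate, i.e. mutual TLS
        .clientAuth(TlsServerCredentials.ClientAuth.REQUIRE)
        .build();
    return Grpc.newServerBuilderForPort(port, creds).addService(service).build();
  }
}
```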
The changes from #4811 will fix the hanging
If I'm reading https://issues.apache.org/jira/browse/THRIFT-5762 correctly (should be included in the next version of Thrift), this would allow us to use the Thrift message types with a GRPC transport. If that's true, then I think that makes the jump to GRPC a lot simpler.
That would be interesting. I haven't looked yet at how hard it is to plug in different protocols with gRPC, but I'm sure it will take some work to get the serializer/deserializer stuff correct. One thing I noticed with protocol buffers that is a bit annoying (besides not supporting null, which makes converting from Thrift require more changes) is that there seems to be a lot of buffer copying. There's talk about some ways to make it more efficient with zero copy, and I found this as well: grpc/grpc-java#7387. So we may need to look at memory efficiency for performance if we stayed with protobuf. I also found this blog post which I thought was interesting; it talks a bit about the challenges they had moving from Thrift to gRPC and lessons learned, including dealing with zero copy and also the lack of SASL support like Thrift has: https://www.alluxio.io/blog/moving-from-apache-thrift-to-grpc-a-perspective-from-alluxio/
Did you look at flatbuffers?
I didn't see that, but it looks like that could be an option to explore as well.
This is a prototype that creates a new gRPC service for getCompactionJob(). Instead of using Thrift, gRPC is now used (which is based on HTTP/2 and Netty) and uses protocol buffers version 3 as the protocol. gRPC supports other formats, but using protobuf was the simplest thing to get working, as all the examples use protobuf and protobuf is pretty similar to Thrift. I am marking this as a draft as it probably wouldn't get merged in its current state without some re-working.
The goal of this is to make it possible to do async RPC processing on the server side to support #4664. With gRPC we will be able to easily offload the RPC calls for getting a next compaction job from the IO threadpool once the queue supports CompletableFuture, as described in this comment. The gRPC API makes it easy to respond async to requests, so this will allow compactors to request new jobs and wait (long polling), and the server can offload the future and complete it later when a job comes in, without blocking and using resources (a condensed sketch of this pattern follows the notes below). We also have the option of making the call async on the client side as well. A few things to note:
- The definitions are in the compaction-coordinator.proto file. The sources have already been generated, but with updates a new version can be generated using mvn generate-sources -Pprotobuf.
- In order to get the RPC working I had to re-create all the same objects that we were using in Thrift, but now with protobuf. I just combined them all into one namespace for now to keep it simple.
- I ran ExternalCompaction_1_IT against this and that test is passing.
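A condensed sketch of the long-polling pattern described above: the RPC method returns immediately and the StreamObserver is completed later when the queued CompletableFuture produces a job. Everything here except StreamObserver is a placeholder, not the generated protobuf types:

```java
import java.util.concurrent.CompletableFuture;

import io.grpc.stub.StreamObserver;

class AsyncGetCompactionJobSketch {

  // stand-ins for the generated protobuf request/response messages
  record JobRequest(String groupName) {}
  record JobResponse(String externalCompactionId) {}

  // stand-in for the coordinator's future-returning job queue
  CompletableFuture<JobResponse> nextJob(String group) {
    return new CompletableFuture<>();
  }

  // Shape of the async service method: it does not block; the observer is
  // completed later when a job (or an error) arrives.
  public void getCompactionJob(JobRequest request, StreamObserver<JobResponse> responseObserver) {
    nextJob(request.groupName()).whenComplete((job, err) -> {
      if (err != null) {
        responseObserver.onError(err);
      } else {
        responseObserver.onNext(job);
        responseObserver.onCompleted();
      }
    });
  }
}
```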