Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Default configuration S3Client #2923

Open
moradology opened this issue May 14, 2019 · 3 comments
Open

Default configuration S3Client #2923

moradology opened this issue May 14, 2019 · 3 comments
Labels
question Further information is requested

Comments

@moradology
Copy link
Contributor

moradology commented May 14, 2019

Classes which are summoned via the ClassLoader mechanism (as in the SPI pattern) must not have parameters in their constructors. A CassandraLayerReader and an S3LayerReader are both instanced through the same method which takes a string and delegates to the appropriate SPI class depending on the pattern found in said string. This is handy, but it elides important details in at least the S3 case: S3Client objects are complicated and the 'Default Credential Provider' chain won't always be sufficient.

Given these two constraints (no parameters and the need to configure in certain cases), we need either

  1. a singleton which registers a default mechanism for creating AWS clients,
  2. SPI machinery to pull the desired class at runtime, or
  3. static configuration which is sufficient to the S3ClientBuilder API
@echeipesh echeipesh added this to the 3.0 milestone May 14, 2019
@moradology moradology added the question Further information is requested label May 14, 2019
@echeipesh echeipesh changed the title Defaults values for SPI-dependent classes Default configuration S3Client May 24, 2019
@rossbernet rossbernet removed this from the GT 3.0 milestone Aug 22, 2019
@CloudNiner CloudNiner self-assigned this Sep 30, 2019
@CloudNiner
Copy link
Contributor

@moradology @echeipesh could one of you clarify the ask here?

As far as I can tell, it appears that (1) is already implemented in master as S3ClientProducer and is used in a number of places:
Screen Shot 2019-10-01 at 9 16 05 AM

@echeipesh
Copy link
Contributor

echeipesh commented Oct 1, 2019

@CloudNiner
S3ClientProducer does not use SPI, so the only way to change the default client used through-out is by calling the static S3ClientProducer.set method.

Assertion: changing the default timeout and back-off settings on the client is a very common use case. Being able to provider a client directly to layer/writer/reader covers it partially but is not sufficient. This is actually not much of assertion, we've seen this many times.

Opinion: Having a global mutable setting for current client is not desirable long term. Its not standard and potentially dangerous if in client code it gets set in multiple places or out of order. Both of which are possible as projects grow and code modules get combined.

Ask: Replace the global state with boring SPI

Use cases:

  • If no SPI instance be found, create the client as is now
  • Provide default SPI instance that reads its settings from HOCON config file (separate issue pending +1)
  • User can provide their own implementation, being fancy with how the configure the credentials and timeouts on the client

Open question: Should the SPI provide and instance of the client or just the configuration for it ?
If we ask only for configuration we can ensure that we re-use a single s3 client across the application. Whereas if we asked for the client this would have to be an additional consideration for the user. I believe the AWS docs state that re-using the client is desirable.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

4 participants