Add connector SPI for returning redactable properties #24562

piotrrzysko · 2024-12-23T11:35:07Z

Description

An alternative approach to #23103. The main difference is that in this approach, the properties requiring redaction are selected from those provided by the user, rather than always returning a static set of predefined security-sensitive properties. The benefits are as follows:

By default (if a connector doesn't implement the SPI), all properties are masked.
Unknown (potentially misspelled) properties can also be treated as redactable.

This PR includes an implementation of the new SPI for the PostgreSQL connector. Once we confirm that the approach is correct, we will apply it to the remaining connectors.

Here is a PR demonstrating how the new SPI could be used to mask security-sensitive properties in queries related to creating catalogs: #24563.

Additional context and related issues

Resolves #22887.

Release notes

( ) This is not user-visible or is docs only, and no release notes are required.
( ) Release notes are required. Please propose a release note for me.
(x) Release notes are required, with the following suggested text:

## SPI
* Add connector SPI for returning redactable properties ({issue}`22887`)

core/trino-spi/src/main/java/io/trino/spi/connector/ConnectorFactory.java

lib/trino-plugin-toolkit/src/main/java/io/trino/plugin/base/config/ConfigPropertyMetadata.java

plugin/trino-postgresql/src/test/java/io/trino/plugin/postgresql/TestPostgreSqlPlugin.java

core/trino-spi/src/main/java/io/trino/spi/connector/ConnectorFactory.java

dain · 2025-01-02T19:46:37Z

plugin/trino-base-jdbc/src/main/java/io/trino/plugin/jdbc/JdbcConnectorFactory.java

    {
        checkArgument(!isNullOrEmpty(name), "name is null or empty");
        this.name = name;
        this.module = requireNonNull(module, "module is null");
+        Set<Class<?>> configClasses = ImmutableSet.<Class<?>>builder()


Instead of attempting to list every configuration class, I think we should modify ConfigurationFactory in Airlift to extract the properties for us. I'm thinking (just thoughts after a brief look) that we have a method to extract all properties from a set of modules, and classify them into used, unused, and unknown. With used and unused having sub classification for secure or unsecure.

I like the idea of getting properties from Airlift. However, I’m wondering if this is feasible, given that some modules are bound conditionally. Additionally, in some cases, we use constructs like: https://github.com/trinodb/trino/blob/master/plugin/trino-delta-lake/src/main/java/io/trino/plugin/deltalake/DeltaLakeSecurityModule.java#L43-L49, which means Airlift isn’t even aware that multiple modules can be bound.

I’m assuming we want to have a static list of possible properties, rather than bootstrapping a connector when getSecuritySensitivePropertyNames is called. Is this a wrong assumption?

@piotrrzysko Can we scan the classpath to find all configuration classes annotated with @config ?

I do this in this test:

trino/plugin/trino-postgresql/src/test/java/io/trino/plugin/postgresql/TestPostgreSqlPlugin.java

Line 97 in 8018b45

Set<ConfigPropertyMetadata> propertiesFoundInClasspath = findPropertiesInRuntimeClasspath(excludedClasses);

I wanted to avoid scanning the classpath at runtime to minimize the time required for plugin loading.

@piotrrzysko we could generate an index while building a project, in the trino-maven-plugin

Classpath scanning won't work becasue you can mount configuration classes under a prefix... like we do for HTTP clients.

ksobolew · 2025-01-07T07:33:59Z

I think this will be helpful in getting #22669 done as well (the main stumbling block there is that by making catalog properties available in all catalog manager implementations, we may expose some security-sensitive properties).

The SPI will be used by the engine to redact security-sensitive information in statements that manage catalogs. It has been added at the connector factory level, rather than the connector level, to allow more flexibility in retrieving properties. In some cases, we want to perform redacting before a connector is initiated. For example, when we create a new catalog by issuing the CREATE CATALOG statement.

Exposed properties fall into one of the following categories: they are either explicitly marked as security-sensitive or are unknown. The connector assumes that unknown properties might be misspelled security-sensitive properties.

This preparatory commit enables bootstrapping HDFS to retrieve its security-sensitive properties.

cla-bot bot added the cla-signed label Dec 23, 2024

This was referenced Dec 23, 2024

Redact sensitive information in catalog queries #24563

Draft

Add connector SPI for returning security-sensitive properties #23103

Closed

hashhar reviewed Jan 2, 2025

View reviewed changes

core/trino-spi/src/main/java/io/trino/spi/connector/ConnectorFactory.java Outdated Show resolved Hide resolved

hashhar reviewed Jan 2, 2025

View reviewed changes

lib/trino-plugin-toolkit/src/main/java/io/trino/plugin/base/config/ConfigPropertyMetadata.java Outdated Show resolved Hide resolved

hashhar reviewed Jan 2, 2025

View reviewed changes

plugin/trino-postgresql/src/test/java/io/trino/plugin/postgresql/TestPostgreSqlPlugin.java Outdated Show resolved Hide resolved

hashhar approved these changes Jan 2, 2025

View reviewed changes

piotrrzysko mentioned this pull request Jan 2, 2025

Extend syntax for Dynamic Catalogs #22188

Open

1 task

dain reviewed Jan 2, 2025

View reviewed changes

piotrrzysko force-pushed the redactable-properties-spi branch from 8018b45 to a4f5809 Compare January 6, 2025 15:55

piotrrzysko added 8 commits January 9, 2025 12:18

Update Airlift

28256ef

Expose security-sensitive properties for HDFS

22e0347

This preparatory commit enables bootstrapping HDFS to retrieve its security-sensitive properties.

Expose security-sensitive properties for Hive connector

b486e53

Expose security-sensitive properties for Iceberg connector

410523a

Expose security-sensitive properties for Delta Lake connector

c98dc03

Expose security-sensitive properties for Hudi connector

47db44b

piotrrzysko force-pushed the redactable-properties-spi branch from a4f5809 to 47db44b Compare January 9, 2025 17:51

github-actions bot added hudi Hudi connector iceberg Iceberg connector delta-lake Delta Lake connector hive Hive connector labels Jan 9, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add connector SPI for returning redactable properties #24562

Add connector SPI for returning redactable properties #24562

piotrrzysko commented Dec 23, 2024 •

edited

Loading

dain Jan 2, 2025

piotrrzysko Jan 5, 2025 •

edited

Loading

wendigo Jan 5, 2025

piotrrzysko Jan 6, 2025

wendigo Jan 6, 2025

dain Jan 6, 2025

ksobolew commented Jan 7, 2025

Add connector SPI for returning redactable properties #24562

Are you sure you want to change the base?

Add connector SPI for returning redactable properties #24562

Conversation

piotrrzysko commented Dec 23, 2024 • edited Loading

Description

Additional context and related issues

Release notes

dain Jan 2, 2025

Choose a reason for hiding this comment

piotrrzysko Jan 5, 2025 • edited Loading

Choose a reason for hiding this comment

wendigo Jan 5, 2025

Choose a reason for hiding this comment

piotrrzysko Jan 6, 2025

Choose a reason for hiding this comment

wendigo Jan 6, 2025

Choose a reason for hiding this comment

dain Jan 6, 2025

Choose a reason for hiding this comment

ksobolew commented Jan 7, 2025

piotrrzysko commented Dec 23, 2024 •

edited

Loading

piotrrzysko Jan 5, 2025 •

edited

Loading