Replies: 4 comments 5 replies
-
My favoured options are 1 or 2. I think the additional cost is outweighed by the reduction in implementation complexity for both of these options. Option 1 has the advantage of logging additional unauthorised requests. Option 2 has the advantage of being customisable, so the logs won't be bloated with unhelpful data. There is a bit more cost associated with it, but at $0.10 per 100,000 events logged that will be negligible if we target logging of data events only.
-
Can you explain what cross-account considerations there are in using MP's trails? I think I like option 2 but I want to understand why we'd want to create a trail instead of reusing the one MP has. |
-
Thanks Matthew for kick-starting the discussion. My preferred options are:
Option 1: Utilise the existing CloudTrail trail on the Cloud Platform, assuming we possess sufficient permissions to access the bucket. Configure a data event as illustrated in the example provided here: [https://dsdmoj.atlassian.net/wiki/spaces/DE/pages/4121460902/ADR-3+Data+uploader+-+Audit]
Option 2: Consider implementing both a data-event CloudTrail and server-side logging, especially if there are specific use-case advantages associated with having server-side logging.
-
We are going to proceed with Option 2: configuring our own CloudTrail trail within the data platform account.
-
Context
Logging of actions performed on data is critical to the Data Platform service, both to diagnose issues with the service and to ensure we have captured critical information about an incident, whether a data breach or otherwise.
There are two ways in which object level logging can be enabled for an S3 bucket, neither of which is enabled as standard. This page explores these two methods.
Ideally, logs should be saved to S3 in JSON (or another Athena-compatible) format, making them more easily searchable via Athena queries and consistent with our Python Lambda container logging.
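As a hedged illustration of the JSON log format described above, a minimal Python logging formatter that emits one JSON object per line; the logger name and field set here are illustrative assumptions, not the platform's actual configuration:

```python
import json
import logging


class JsonFormatter(logging.Formatter):
    """Format each log record as a single JSON object per line,
    the shape Athena's JSON SerDe expects when querying files in S3."""

    def format(self, record: logging.LogRecord) -> str:
        return json.dumps({
            "timestamp": self.formatTime(record),
            "level": record.levelname,
            "message": record.getMessage(),
        })


# Hypothetical wiring; names are illustrative only.
handler = logging.StreamHandler()
handler.setFormatter(JsonFormatter())
logger = logging.getLogger("data-platform")
logger.addHandler(handler)
logger.setLevel(logging.INFO)
```

Each emitted line is then a standalone JSON document, so a saved log file could be registered as an Athena table without further transformation.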
Our accounts sit within the Modernisation Platform, and it appears all buckets are set up as standard to log object-level events to a central account through a CloudTrail trail. We do not have permissions to interact with any of these logs. There would be some duplication of logging if we were to collate our own object-level S3 logs; this is undesirable in terms of cost, efficiency and clarity.
We need to decide on the approach for object-level S3 logging in the Data Platform.
Options
1. Set up our own server access logging, saving logs within the data platform AWS account.
2. Set up our own CloudTrail trail, capturing event data for only a subset of the available API calls (initially PutObject), saving logs within the data platform AWS account.
3. Use the Modernisation Platform's CloudTrail logs. We would need to agree cross-account permissions to access the S3 logs, held within the central Modernisation Platform bucket, from the data platform account.
Server access logging
This method writes log files to a designated target bucket (which must be in the same account as the source bucket). The logs are space-separated text files which can be queried via Athena. Every API call to the bucket is logged, and this cannot be configured to filter for certain actions, e.g. PutObject or GetObject.
Cost: the only charge incurred is the S3 storage cost of the log files.
Pros
Cons
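For illustration, a minimal sketch of splitting one server access log line into its fields, keeping the bracketed timestamp and quoted strings intact. The example line is shortened and hypothetical; real lines carry additional trailing fields:

```python
import re

# A token is either [...] (timestamp), "..." (quoted), or a bare word.
TOKEN = re.compile(r'\[[^\]]*\]|"[^"]*"|\S+')


def parse_access_log_line(line: str) -> list[str]:
    """Split one S3 server access log line into space-separated fields."""
    return [t.strip('"') for t in TOKEN.findall(line)]


# Hypothetical, shortened example line (owner, bucket, time, IP, requester,
# request ID, operation, key, request URI, status, ... trailing fields).
line = (
    '79a59df9 mybucket [06/Feb/2019:00:00:38 +0000] 192.0.2.3 '
    'arn:aws:iam::123456789012:user/test 3E57427F3EXAMPLE '
    'REST.PUT.OBJECT uploads/file.txt '
    '"PUT /mybucket/uploads/file.txt HTTP/1.1" 200 - - 1234 12 10 '
    '"-" "aws-cli/2.x" -'
)
fields = parse_access_log_line(line)
```

This is only a sketch of why the format is awkward compared with JSON; in practice Athena's own regex-based SerDe would do this parsing at query time.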
CloudTrail
CloudTrail offers two different methods for logging S3 events:
Create a trail - this enables customisation of what is logged: e.g. you can log events from specific buckets and for specific API calls. You are also able to pass the logs to CloudWatch and to save log files to a specified S3 bucket.
Create an event data store - this enables the same customisation but does not give access to logs via CloudWatch or save log files to S3. Logs must be queried through CloudTrail Lake or linked to a CloudTrail dashboard.
The limitations of event data stores make a Trail the more suitable CloudTrail option.
Cost: $0.10 per 100,000 data events delivered, plus S3 storage costs for saved log files.
Pros
Cons
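A hedged sketch of what the trail option could look like with boto3: create a trail, scope it to PutObject data events on one bucket via advanced event selectors, and start logging. Trail and bucket names are hypothetical, and the log bucket would need a CloudTrail bucket policy in place before create_trail succeeds:

```python
def put_object_selectors(bucket_arn: str) -> list[dict]:
    """Advanced event selectors capturing only PutObject data events
    on one bucket (the shape PutEventSelectors expects)."""
    return [{
        "Name": "PutObject data events only",
        "FieldSelectors": [
            {"Field": "eventCategory", "Equals": ["Data"]},
            {"Field": "resources.type", "Equals": ["AWS::S3::Object"]},
            {"Field": "eventName", "Equals": ["PutObject"]},
            {"Field": "resources.ARN", "StartsWith": [bucket_arn + "/"]},
        ],
    }]


def create_put_object_trail(trail_name: str, log_bucket: str,
                            source_bucket_arn: str) -> None:
    """Create the trail, scope it to PutObject events, start logging."""
    import boto3  # imported here so the selector builder stays standalone

    ct = boto3.client("cloudtrail")
    ct.create_trail(Name=trail_name, S3BucketName=log_bucket)
    ct.put_event_selectors(
        TrailName=trail_name,
        AdvancedEventSelectors=put_object_selectors(source_bucket_arn),
    )
    ct.start_logging(Name=trail_name)
```

The event-name filter is what keeps the trail's volume (and the $0.10 per 100,000 data events charge) down compared with logging every S3 call.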
A more comprehensive CloudTrail vs server access logging comparison can be seen at Logging options for Amazon S3 - Amazon Simple Storage Service.
Test Logging Outputs
I have created two test log Athena tables (using saved log files) to demonstrate the outputs of each approach; both logged the same S3 events.
The CloudTrail log table contains the logs from a CloudTrail trail filtered to capture only PutObject events: 10 rows of data.
The server access log table contains the logs produced with server access logging enabled on a source bucket: 225 rows of data.
Grafana Integration
Observability and interrogation of logs is critical.
One element of the plan for our logs is to use Grafana to create visualisations of metrics from the logs.
Both of these options give the ability to save log files queryable by Athena, and Grafana has an Athena plugin available, which will make developing monitoring metrics achievable through standard SQL queries against the log tables. See Query and analyze Amazon S3 data with the new Amazon Athena plugin for Grafana | Grafana Labs.