Load data using snapshot repository during benchmark runs #4102

rishabh6788 · 2023-10-04T05:26:39Z

As of now for each and every benchmark run the data is downloaded locally and then indexed into the cluster before the search queries are run, this process takes the most time in the overall benchmark process.
Explore ways to be able to restore the data into the indices from a snapshot repository to speed up the process.
This will also provide more consistent results with search queries.

Pre-requisites:

Create a S3 snapshot in testing infra aws account and add snapshots of all the available opensearch-benchmark-workloads.
Add support in opensearch-cluster-cdk to install repository-s3 plugin and add S3 snapshot bucket IAM permission.
Add support in python workflow to accept --include-tasks, --exclude-tasks and --test-procedure.

Acceptance Criteria:

The workload data should not get downloaded and indexed while using restore from snapshot test procedure.
The # of segments should remain constant for each workload indices.
The search query latency variance should not exceed 5%.
The benchmark runtime should reduce by 50%.

The text was updated successfully, but these errors were encountered:

Divyaasm · 2023-10-10T19:29:01Z

@rishabh6788 , could you add more details like acceptance criteria to the issue.

rishabh6788 · 2023-10-25T01:04:21Z

Following are the task break-downs and their status:

Create s3 bucket for snapshot-repository: This is completed, the snapshot bucket will reside in the same account where the test clusters are brought up for benchmarking, this will ease the process of registering and fetching the snapshot without requiring to import aws keys and tokens. The attached role to opensearch ec2 instances will have required access to be able to fetch the snapshot.
Create snapshots of opensearch-benchmark workloads (In-progress): Currently working on indexing the opensearch-benchmark workloads data into a test cluster and then creating snapshots of them with different configurations, such as default https_logs snapshot with default # of clients and another snapshot with just 1 client to be able ingest data linearly. This will be an ongoing process as and when demand arises for the a new workload snapshot. We will start with nyc_taxis, http_logs and pmc.
Create new workload test procedures (Not started): Once the snapshots have been created, will work on creating custom test-procedures that will use the snapshot to index the data and then run search queries. The custom test-procedure should be customizable, i.e., the user should be able to choose which snapshot to use and what should be the index configuration while restoring the data.
Add snapshot support in opensearch-cluster-cdk (Not started): We need add respository-s3 plugin installation as default action and also make changes to ec2 instance IAM profile to be able to fetch snapshot from s3 bucket.
Add support in main nightly benchmark python workflow (Not started): Finally add the necessary command line parameters support in the main workflow to be able to utilize above mentioned changes. This will also require changes in our nightly run jenkinsfile and corresponding groovy library

@bbarani @getsaurabh02 @rishabhmaurya @Pallavi-AWS

rishabh6788 · 2023-11-06T19:44:04Z

All the required milestones have been completed and nightly benchmarks are now running using snapshots.

rishabh6788 self-assigned this Oct 4, 2023

github-actions bot added the untriaged Issues that have not yet been triaged label Oct 4, 2023

Divyaasm removed the untriaged Issues that have not yet been triaged label Oct 10, 2023

rishabh6788 added this to OpenSearch Engineering Effectiveness Oct 17, 2023

github-project-automation bot moved this to Backlog in OpenSearch Engineering Effectiveness Oct 17, 2023

rishabh6788 moved this from Backlog to In Progress in OpenSearch Engineering Effectiveness Oct 17, 2023

This was referenced Oct 25, 2023

Add support for optional benchmark command line args #4173

Merged

Added support for test-procedure, exclude-tasks and include-tasks benchmark arguments #4180

Merged

bbarani moved this from In Progress to In review in OpenSearch Engineering Effectiveness Nov 6, 2023

rishabh6788 closed this as completed Nov 6, 2023

github-project-automation bot moved this from In review to Done in OpenSearch Engineering Effectiveness Nov 6, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Load data using snapshot repository during benchmark runs #4102

Load data using snapshot repository during benchmark runs #4102

rishabh6788 commented Oct 4, 2023 •

edited

Loading

Divyaasm commented Oct 10, 2023

rishabh6788 commented Oct 25, 2023 •

edited

Loading

rishabh6788 commented Nov 6, 2023

Load data using snapshot repository during benchmark runs #4102

Load data using snapshot repository during benchmark runs #4102

Comments

rishabh6788 commented Oct 4, 2023 • edited Loading

Divyaasm commented Oct 10, 2023

rishabh6788 commented Oct 25, 2023 • edited Loading

rishabh6788 commented Nov 6, 2023

rishabh6788 commented Oct 4, 2023 •

edited

Loading

rishabh6788 commented Oct 25, 2023 •

edited

Loading