Point In Time Recovery #551

Open · Zvirovyi wants to merge 43 commits into main
Conversation


@Zvirovyi Zvirovyi commented Nov 20, 2024

Important!

This PR relies on the latest version of the charmed-mysql-snap and on canonical/charmed-mysql-snap#63.

Overview

MySQL stores binary transaction logs. This PR adds a service job that uploads these logs to the S3 bucket, plus the ability to use them later for point-in-time recovery via a new restore-to-time parameter of the restore action. The parameter accepts either a MySQL timestamp or the keyword latest (to replay all available transaction logs).
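For illustration only, a minimal sketch of how the two accepted forms of restore-to-time could be distinguished on the charm side (the helper name and format string are assumptions, not code from this PR):

from datetime import datetime

def is_valid_restore_to_time(value: str) -> bool:
    # Hypothetical helper: accept the keyword "latest" or a MySQL
    # 'YYYY-MM-DD HH:MM:SS' timestamp, as used in the usage example below.
    if value == "latest":
        return True
    try:
        datetime.strptime(value, "%Y-%m-%d %H:%M:%S")
        return True
    except ValueError:
        return False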

Also, a new application blocked status is introduced: Another cluster S3 repository. It signals to the user that the configured S3 repository is claimed by another cluster; while this status is set, the binlogs collecting job is disabled and creating new backups is restricted (these are the only workload limitations). This is crucial to keep the stored binary logs safe from other clusters. The check uses @@GLOBAL.group_replication_group_name.
After a restore, the cluster's group replication is reinitialized, so it effectively becomes a new cluster. In that case, the Another cluster S3 repository message is replaced with Move restored cluster to another S3 repository to describe the situation more clearly to the user.
Both block messages disappear when the S3 configuration is removed or changed to point at an empty repository.
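A rough sketch of the ownership check described above, assuming the group name read from MySQL is compared against an identifier stored alongside the binlogs in the S3 repository (all helper names here are hypothetical):

def get_group_name(run_sql) -> str:
    # run_sql is assumed to execute a statement and return rows of tuples.
    return run_sql("SELECT @@GLOBAL.group_replication_group_name")[0][0]

def repository_claimed_by_another_cluster(run_sql, stored_group_name) -> bool:
    # An empty repository has no stored group name and can be claimed;
    # otherwise the stored name must match this cluster's group name.
    if stored_group_name is None:
        return False
    return stored_group_name != get_group_name(run_sql)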

Usage example

  1. Deploy mysql + s3-integrator and integrate them.
  2. Create a full backup.
  3. Create test data:
create database zvirovyi;
use zvirovyi;
create table asd(message varchar(255) primary key);
select current_timestamp; # 2024-11-20 17:10:01
insert into asd values ('hello');
select current_timestamp; # 2024-11-20 17:10:12
insert into asd values ('world');
flush binary logs;
  4. Wait several minutes for the binlogs to be uploaded.
  5. Restore: juju run mysql/leader restore backup-id=2024-11-20T17:08:24Z restore-to-time="2024-11-20 17:10:01"
use zvirovyi;
select * from asd; # empty set returned
  6. Observe the application block message.
  7. Restore: juju run mysql/leader restore backup-id=2024-11-20T17:08:24Z restore-to-time="latest"
use zvirovyi;
select * from asd; # hello, world returned

Key notes

@Zvirovyi Zvirovyi marked this pull request as ready for review December 8, 2024 09:12
Zvirovyi commented Dec 8, 2024

Tests are WIP!
Update: integration tests are done; I will fix and add the unit tests after the PR review.


@shayancanonical shayancanonical left a comment


Since we are only collecting binlogs from the leader unit, we need to add handling for when Juju elects a new leader -> what should happen here? Should the new leader unit start collecting binlogs instead?

Furthermore, while thinking of the above use case, we should also handle the scaling scenario -> what if the leader unit is scaled down?

Also, I would really prefer it if we could add an integration test for the above scenario (where the leader unit is scaled down, after which the PITR is performed) once we determine how to handle the scenario.


@paulomach paulomach left a comment


Left some comments and I'll try to test it.

@paulomach

Hi @Zvirovyi, please take a look at the failing tests. Ping me to authorize the test run.

@Zvirovyi

@shayancanonical

Since we are only collecting binlogs from the leader unit, we need to add handling for when Juju elects a new leader -> what should happen here? Should the new leader unit start collecting binlogs instead?

This is handled by binding the S3-changed logic to the leader-elected event, i.e. self.framework.observe(self.charm.on.leader_elected, self._on_s3_credentials_changed) in lib/charms/mysql/v0/backups.py. A peer-relation-changed event then occurs on the other units, which disables their binlogs collector.
This is the same approach used in the PostgreSQL PITR.
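A minimal sketch of that wiring (the observe call is quoted from the PR; the handler bodies are illustrative assumptions, not the actual charm code):

from ops.framework import Object

class MySQLBackups(Object):
    def __init__(self, charm):
        super().__init__(charm, "backups")
        self.charm = charm
        # Re-run the S3 credentials handler whenever a new leader is elected,
        # so binlog collection follows leadership (as in backups.py).
        self.framework.observe(
            self.charm.on.leader_elected, self._on_s3_credentials_changed
        )

    def _on_s3_credentials_changed(self, event) -> None:
        # Illustrative body: only the leader keeps the binlogs collector
        # running; other units stop it once peer relation data propagates.
        if self.charm.unit.is_leader():
            self._start_binlogs_collector()
        else:
            self._stop_binlogs_collector()

    def _start_binlogs_collector(self) -> None:
        ...  # placeholder: start the snap's binlog collector service

    def _stop_binlogs_collector(self) -> None:
        ...  # placeholder: stop the snap's binlog collector service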

Furthermore, while thinking of the above use case, we should also handle the scaling scenario -> what if the leader unit is scaled down?

Same as above; I don't see a difference here.

Also, I would really prefer it if we could add an integration test for the above scenario (where the leader unit is scaled down, after which the PITR is performed) once we determine how to handle the scenario.

Would this be a VM-specific integration test? If so, should we add it?


@shayancanonical shayancanonical left a comment


PR looks great! Will help resolve the unit test failures so that we can see the integration test results.


codecov bot commented Feb 10, 2025

Codecov Report

Attention: Patch coverage is 34.05172% with 153 lines in your changes missing coverage. Please review.

Project coverage is 64.11%. Comparing base (e7c1809) to head (00ee5a8).
Report is 133 commits behind head on main.

Files with missing lines            Patch %   Lines
lib/charms/mysql/v0/backups.py      34.82%    63 Missing and 10 partials ⚠️
lib/charms/mysql/v0/s3_helpers.py   41.66%    28 Missing ⚠️
lib/charms/mysql/v0/mysql.py        29.03%    22 Missing ⚠️
src/mysql_vm_helpers.py             21.42%    21 Missing and 1 partial ⚠️
src/charm.py                        20.00%     6 Missing and 2 partials ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #551      +/-   ##
==========================================
- Coverage   66.25%   64.11%   -2.15%     
==========================================
  Files          17       20       +3     
  Lines        3180     4489    +1309     
  Branches      424      742     +318     
==========================================
+ Hits         2107     2878     +771     
- Misses        935     1370     +435     
- Partials      138      241     +103     

☔ View full report in Codecov by Sentry.
