You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We experienced complete data loss in our CDC changelog tables when changing the Schema Propagation setting from "Propagate all field and stream changes" to "Propagate field changes only". Upon this configuration change, Airbyte silently dropped and recreated all changelog tables on the next sync, effectively erasing our historical changelog data without any warning or confirmation.
Environment
Airbyte Version: 1.3.1
Source: source-mysql 3.9.4
Destination: destination-snowflake 3.15.2
Deployment: OSS
Connection Method: CDC/Binlog
Initial Schema Propagation Setting: "Propagate all field and stream changes"
Changed Schema Propagation To: "Propagate field changes only"
The Problem
Our CDC pipeline kept failing when new tables were added to MySQL
With a 72-hour binlog retention limit, this gave us very little time to respond to failures
To mitigate this, we decided to change from auto-detecting new tables to manual table enabling
Upon changing this setting, Airbyte:
Detected this as a "schema change"
Silently dropped all existing changelog tables
Recreated them as empty tables
Did not provide any warning about potential data loss
Did not require confirmation for this destructive action
Impact
Complete loss of historical changelog data
Required extensive manual work to attempt to restore from backups
Loss of data continuity in our changelog history
Unexpected behavior for what should be a non-destructive configuration change
Evidence
The Snowflake query history shows the sequence of DROP TABLE commands executed by Airbyte:
Expected Behavior
When changing schema propagation settings:
Airbyte should not treat this as a schema change requiring table recreation
If table recreation is necessary:
Show a clear warning about potential data loss
Require explicit confirmation before proceeding with destructive operations
Provide an option to backup existing data
Maintain existing data and table structures unless explicitly requested otherwise
Additional Context
This issue is particularly severe because:
The change was made specifically to prevent data loss scenarios
The resulting behavior caused the very thing we were trying to prevent
There was no warning or indication that this setting change would be destructive
CDC data is historical by nature and often irreplaceable once lost
Suggested Solutions
Add clear warnings when configuration changes might result in data loss
Implement a confirmation step for destructive operations
Provide an option to preserve existing tables when changing propagation settings
Consider adding a "dry run" option to show what changes would be made
Topic
Schema propagation
Relevant information
Description
We experienced complete data loss in our CDC changelog tables when changing the Schema Propagation setting from "Propagate all field and stream changes" to "Propagate field changes only". Upon this configuration change, Airbyte silently dropped and recreated all changelog tables on the next sync, effectively erasing our historical changelog data without any warning or confirmation.
Environment
The Problem
Impact
Evidence
The Snowflake query history shows the sequence of DROP TABLE commands executed by Airbyte:
Expected Behavior
When changing schema propagation settings:
Additional Context
This issue is particularly severe because:
Suggested Solutions
Screenshot of timeline
Note how the DROP statements above started after the "Schema updated" event, but before the next sync started.
The text was updated successfully, but these errors were encountered: