feat(batch-exports): Add `created_at` to Redshift persons batch export #27403

rossgray · 2025-01-09T15:27:00Z

Problem

We're not currently sending the created_at field in Redshift persons batch exports

Changes

Add the created_at field
Update README to include info about Tailscale

👉 Stay up-to-date with PostHog coding conventions for a smoother review.

Does this work well for both Cloud and self-hosted?

Yes

How did you test this code?

Added new tests

tomasfarias · 2025-01-10T10:37:55Z

posthog/temporal/batch_exports/postgres_batch_export.py

+
+        async with self.connection.transaction():
+            async with self.connection.cursor() as cursor:
+                await cursor.execute(sql.SQL("SELECT * FROM {} WHERE 1=0").format(table_identifier))


That's a very funny way to get schema information.

That was what my editor suggested 😄
I suppose it is efficient since it won't actually return any table data, just column metadata

It's a bit obfuscated, maybe not so clear at a glace what you are doing, but I believe it does work. And it saves you from needing to query a separate table (something in information_schema) for which we may need extra permissions.

tomasfarias · 2025-01-10T10:38:43Z

posthog/temporal/batch_exports/redshift_batch_export.py

+        # Redshift doesn't support adding a condition on the merge, so we have
+        # to first delete any rows in stage that match those in final, where
+        # stage also has a higher version. Otherwise we risk merging adding old
+        # versions back.


Thanks for clarifying 👍

I was slightly confused why we were doing this and then saw this comment in the PR where you introduced it, so thought it was worth adding

Add created_at to Redshift persons batch export

50ebc99

rossgray requested a review from tomasfarias January 9, 2025 15:27

tomasfarias reviewed Jan 10, 2025

View reviewed changes

tomasfarias approved these changes Jan 10, 2025

View reviewed changes

rossgray merged commit 4c3cb69 into master Jan 10, 2025
93 checks passed

rossgray deleted the add-created-at-to-redshift-persons-batch-export branch January 10, 2025 11:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(batch-exports): Add `created_at` to Redshift persons batch export #27403

feat(batch-exports): Add `created_at` to Redshift persons batch export #27403

rossgray commented Jan 9, 2025

tomasfarias Jan 10, 2025

rossgray Jan 10, 2025

tomasfarias Jan 10, 2025

tomasfarias Jan 10, 2025

rossgray Jan 10, 2025

feat(batch-exports): Add created_at to Redshift persons batch export #27403

feat(batch-exports): Add created_at to Redshift persons batch export #27403

Conversation

rossgray commented Jan 9, 2025

Problem

Changes

Does this work well for both Cloud and self-hosted?

How did you test this code?

tomasfarias Jan 10, 2025

Choose a reason for hiding this comment

rossgray Jan 10, 2025

Choose a reason for hiding this comment

tomasfarias Jan 10, 2025

Choose a reason for hiding this comment

tomasfarias Jan 10, 2025

Choose a reason for hiding this comment

rossgray Jan 10, 2025

Choose a reason for hiding this comment

feat(batch-exports): Add `created_at` to Redshift persons batch export #27403

feat(batch-exports): Add `created_at` to Redshift persons batch export #27403