feat(integrations): Add integration for asyncpg #2314

mimre25 · 2023-08-19T13:48:18Z

Closes #1278
I think this is close to being ready (see #2314 (comment)) for an update.

So far this records every statement that is directly issued, as well as the SQL statements that are used for cursors and prepared statements.

I have a few open questions:

I noticed asyncpg uses the Postgres binary protocol to communicate with Postgres. I haven't encountered it before, and it makes it more challenging to record the creation of cursors, transactions, and prepared statements properly. Do you have a suggestion for what to do about this? Keep in mind that all of this is done in Cython, i.e. we would need to tap into that lower level to record the exact statements and reverse-parse the binary protocol.
For cursors and prepared statements, what is the desired outcome? Should the creation be recorded, and every subsequent use, or just the creation, or just the usages?
I've noticed that tracing_utils:record_sql_queries checks hub.client.options["_experiments"]["record_sql_params"] and strips out the parameters unless it's True. However, I did not want to rely on it to strip out parameters, as there is a TODO right next to it that hints at a future removal of that check. Therefore I strip them out by default, but provide a config option on the integration for recording the parameters.
I've noticed that the test-requirements.txt doesn't contain pytest-asyncio, yet the pytest.ini configures the asyncio_mode to strict. Am I missing something here, or is this on purpose?

@antonpirker please let me know what you think :)

antonpirker · 2023-08-30T12:06:28Z

Hey @mimre25 !

Finally I have some time to review. Looks very clean at first glance! Great job!
Will test it with a local demo project to see what the spans look like.

One thing I noticed, maybe we need to set more data on the span like this: https://github.com/getsentry/sentry-python/blob/antonpirker/2262-traces-sampler-url-instead-of-generic-asgi-request/sentry_sdk/integrations/sqlalchemy.py#L57-L71

antonpirker · 2023-08-30T12:08:03Z

To answer some of your questions:

I've noticed that tracing_utils:record_sql_queries checks hub.client.options["_experiments"]["record_sql_params"] and strips out the parameters unless it's True. However, I did not want to rely on it to strip out parameters, as there is a TODO right next to it that hints at a future removal of that check. Therefore I strip them out by default, but provide a config option on the integration for recording the parameters.

Sounds good. Forget the _experiments.
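
For illustration, the default-off behaviour discussed here could look roughly like the following. This is a minimal sketch, not the integration's actual code: `describe_query` is a hypothetical helper, and only the `record_params` option name comes from this PR.

```python
from typing import Any, Dict, Optional, Sequence


def describe_query(
    query: str,
    params: Optional[Sequence[Any]] = None,
    record_params: bool = False,
) -> Dict[str, Any]:
    """Build the data recorded for one query (hypothetical helper).

    Parameters are dropped unless the caller opts in, so values such as
    passwords are not recorded by default.
    """
    data: Dict[str, Any] = {"query": query}
    if record_params and params:
        data["params"] = list(params)
    return data


query = "INSERT INTO users(name, password) VALUES($1, $2)"
default_data = describe_query(query, ("Bob", "secret_pw"))
opt_in_data = describe_query(query, ("Bob", "secret_pw"), record_params=True)
print(default_data)
print(opt_in_data)
```

Keeping the opt-in on the integration itself means the behaviour does not depend on the `_experiments` flag.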

I've noticed that the test-requirements.txt doesn't contain pytest-asyncio, yet the pytest.ini configures the asyncio_mode to strict. Am I missing something here, or is this on purpose?

This is not really on purpose. This is probably because we have not had time to look at our test setup in a long time...

antonpirker · 2023-08-30T13:10:19Z

I get db spans from your integration! Yay! \o/

antonpirker · 2023-08-30T13:18:07Z

I also added asyncpg to the test matrix so the tests run in CI.

antonpirker · 2023-08-30T13:29:55Z

I noticed asyncpg uses the Postgres binary protocol to communicate with Postgres. I haven't encountered it before, and it makes it more challenging to record the creation of cursors, transactions, and prepared statements properly. Do you have a suggestion for what to do about this? Keep in mind that all of this is done in Cython, i.e. we would need to tap into that lower level to record the exact statements and reverse-parse the binary protocol.

For the first iteration of the asyncpg support, it is enough if we record spans for execute and executemany (and maybe db connect if it is easy). The rest we can ignore for now. This will already give a lot of value to users.

For cursors and prepared statements, what is the desired outcome? Should the creation be recorded, and every subsequent use, or just the creation, or just the usages?

I think we can omit those for now. See above.

antonpirker · 2023-08-30T13:42:25Z

Oh, and for the connection to the test postgres database in CI to work, you need to read these env vars:

SENTRY_PYTHON_TEST_POSTGRES_USER
SENTRY_PYTHON_TEST_POSTGRES_PASSWORD
SENTRY_PYTHON_TEST_POSTGRES_NAME
SENTRY_PYTHON_TEST_POSTGRES_HOST

See here: https://github.com/getsentry/sentry-python/blob/master/tests/integrations/django/myapp/settings.py#L126-L130
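
A minimal sketch of how a test module might read these variables. The variable names are the ones listed above; the helper names and the fallback values are assumptions for a local setup, not values taken from the repository.

```python
import os
from typing import Dict, Mapping


def pg_settings_from_env(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Collect Postgres connection settings for the test suite.

    Fallback values are assumptions so the tests can also run locally.
    """
    return {
        "user": env.get("SENTRY_PYTHON_TEST_POSTGRES_USER", "postgres"),
        "password": env.get("SENTRY_PYTHON_TEST_POSTGRES_PASSWORD", "postgres"),
        "database": env.get("SENTRY_PYTHON_TEST_POSTGRES_NAME", "postgres"),
        "host": env.get("SENTRY_PYTHON_TEST_POSTGRES_HOST", "localhost"),
    }


def pg_dsn(settings: Mapping[str, str]) -> str:
    """Format the settings as a postgresql:// connection URI."""
    return "postgresql://{user}:{password}@{host}/{database}".format(**settings)


# Example with an explicit mapping instead of the real process environment.
ci_env = {
    "SENTRY_PYTHON_TEST_POSTGRES_USER": "ci_user",
    "SENTRY_PYTHON_TEST_POSTGRES_HOST": "db.internal",
}
dsn = pg_dsn(pg_settings_from_env(ci_env))
print(dsn)
```

Note that a password containing characters such as `@` or `/` would need URL-encoding before being placed in the URI; the sketch leaves that out.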

antonpirker · 2023-08-30T13:43:03Z

All in all really great work @mimre25 !

If you have addressed my comments I will do another round of test/review!

happyleavesaoc · 2023-08-30T15:59:21Z

I tried this out too, and it worked great. I did notice that the span duration is effectively zero in my test (and also the screenshot above).

mimre25 · 2023-08-30T16:31:29Z

Thanks for the review! I'll probably only get to it this weekend, but I'll work in your comments.

mimre25 · 2023-09-02T10:32:19Z

Alright, the new commits do the following:

Reading test connection params from env
Fix the incorrect span durations (see below)
Record additional information (executemany, cursor, etc)
Recording connect statements
Record execute queries without parameters (I noticed this doesn't happen at the moment)
Some refactorings as it got a bit messy

The span durations were recorded incorrectly because I returned the coroutines from the wrappers instead of awaiting them and returning their results.
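
The difference can be reproduced without a database. The following is a minimal sketch: `fake_span` and `slow_query` are hypothetical stand-ins for a tracing span and an asyncpg call, not code from this PR.

```python
import asyncio
import time
from contextlib import contextmanager

durations = {}


@contextmanager
def fake_span(name):
    """Stand-in for a tracing span: records how long the block took."""
    start = time.perf_counter()
    try:
        yield
    finally:
        durations[name] = time.perf_counter() - start


async def slow_query():
    """Pretend database call that takes about 50 ms."""
    await asyncio.sleep(0.05)
    return "ok"


def wrapper_without_await():
    # Bug: the coroutine object is created and returned immediately, so the
    # span closes before any database work has happened.
    with fake_span("without_await"):
        return slow_query()


async def wrapper_with_await():
    # Fix: awaiting inside the span keeps it open until the call finishes.
    with fake_span("with_await"):
        return await slow_query()


async def main():
    first = await wrapper_without_await()
    second = await wrapper_with_await()
    return first, second


results = asyncio.run(main())
print(results, durations)
```

Both wrappers return the same result to the caller, which is why the bug only shows up in the recorded timings.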

Screenshot of span duration of old implementation:

Screenshot of span duration of new implementation:

Quite happy how it turned out:

Corresponding source code:

import asyncio
import datetime

import sentry_sdk
from asyncpg import connect
from sentry_sdk.integrations.asyncpg import AsyncPGIntegration

sentry_sdk.init(
    dsn="http://[email protected]:9000/2",
    traces_sample_rate=1.0,
    integrations=[AsyncPGIntegration(record_params=True)],
    _experiments={"record_sql_params": True},
)


async def main():
    with sentry_sdk.start_transaction(op="test", name="pg_sleep"):
        connection = await connect("postgresql://foo:bar@localhost/")
        await connection.execute("DROP TABLE IF EXISTS users")
        await connection.execute(
            """
            CREATE TABLE users(
                id serial PRIMARY KEY,
                name text,
                password text,
                dob date
            )
            """
        )

        await connection.execute("SELECT pg_sleep($1);", 3)
        await connection.executemany(
            "INSERT INTO users(name, password, dob) VALUES($1, $2, $3)",
            [
                ("Bob", "secret_pw", datetime.date(1984, 3, 1)),
                ("Alice", "pw", datetime.date(1990, 12, 25)),
            ],
        )

        async with connection.transaction():
            # Postgres requires non-scrollable cursors to be created
            # and used in a transaction.
            async for record in connection.cursor(
                "SELECT * FROM users WHERE dob > $1", datetime.date(1970, 1, 1)
            ):
                print(record)

        await connection.close()


asyncio.run(main())

antonpirker

Hey! Did a second round of review. Looks very good already!

I updated the data we store on a span, to make it include all the information Sentry needs for doing some magic stuff with database spans.

There is one instrumentation of the __await__ that we can not release (see comment in the review)

And could you please fix the type problems in the tests that break the tests? Thanks!
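
As a rough illustration of the kind of connection data a database span can carry, here is a minimal sketch. The helper `db_span_data` is hypothetical, and the key names (`db.system`, `db.name`, `db.user`, `server.address`, `server.port`) are assumptions modelled on common tracing conventions; this page does not show which keys were actually added.

```python
from typing import Any, Dict, Optional


def db_span_data(
    host: Optional[str] = None,
    port: Optional[int] = None,
    database: Optional[str] = None,
    user: Optional[str] = None,
) -> Dict[str, Any]:
    """Return connection details to attach to a database span.

    The key names are assumptions for this sketch; unknown values are
    skipped instead of being recorded as None.
    """
    data: Dict[str, Any] = {"db.system": "postgresql"}
    optional = {
        "server.address": host,
        "server.port": port,
        "db.name": database,
        "db.user": user,
    }
    data.update({key: value for key, value in optional.items() if value is not None})
    return data


span_data = db_span_data(host="localhost", port=5432, database="postgres")
print(span_data)
```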

antonpirker · 2023-09-06T08:25:26Z

I also added a PR to the docs to add the new integration: getsentry/sentry-docs#7756

mimre25 · 2023-09-09T09:38:13Z

I've removed the cursor instrumentation, and now only record the creation of a cursor instead:

All typing and lint fixes are done, and tox passes locally for py3.{7-11}-asyncpg & linters
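
The idea of recording cursor creation once, instead of every fetch, can be sketched as follows. `record_creation` and `make_cursor` are hypothetical stand-ins, not asyncpg or sentry_sdk APIs.

```python
from typing import Any, Callable, Iterator, List, Tuple

recorded: List[str] = []


def record_creation(factory: Callable[..., Any]) -> Callable[..., Any]:
    """Wrap a cursor factory so each call records exactly one entry."""

    def wrapper(query: str, *args: Any) -> Any:
        recorded.append(query)  # one record per cursor, not per fetched row
        return factory(query, *args)

    return wrapper


@record_creation
def make_cursor(query: str, *args: Any) -> Iterator[Tuple[str, int]]:
    """Stand-in for a cursor: yields three fake rows."""
    return iter([("row", i) for i in range(3)])


rows = list(make_cursor("SELECT * FROM users WHERE dob > $1", "1970-01-01"))
print(len(rows), recorded)
```

Iterating over the cursor touches three rows but leaves a single recorded entry, which keeps the number of recorded queries per span small.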

antonpirker

I think we are ready! Really great work @mimre25 ! Thanks for being so responsive in bringing this over the finish line! Will be included in the next release of Sentry Python SDK.

antonpirker · 2023-09-11T07:02:49Z

Oh, CI tests can not connect to postgres. I will fix that.

mimre25 and others added 2 commits August 19, 2023 15:36

feat(integrations): Add integration for asyncpg >= 0.23.0 (WIP)

00d6ad9

Merge branch 'master' into asyncpg-integration

4a028a2

Added asyncpg to test matrix

63b48b0

Added dependency for tests

d982755

Added dependency for tests

0d45020

antonpirker self-assigned this Aug 30, 2023

sentrivana added the New Integration Integrating with a new framework or library label Aug 31, 2023

mimre25 added 5 commits September 2, 2023 10:10

test(asyncpg-integration): Read test PG connection params from environment

9756b69

This allows running tests locally and in the CI pipeline.

feat(asyncpg-integration): Add recording of "executemany" information

44ef217

fix(asyncpg-integration): Fix recording of span durations for asyncpg

8f8227a

Previously, the wrapper functions did not await the call to the db and thus did not record the actual timing.

feat(integrations): Allow installing sentry-sdk[asyncpg]

7089e6d

feat(asyncpg-integration): Add proper spans for cursors

3dbe886

The spans for the cursor now record every "execute" of the cursor, both in manual mode, and in iterator mode.

mimre25 commented Sep 2, 2023

mimre25 added 4 commits September 3, 2023 16:22

feat(asyncpg-integration): Record calls to execute without params

de6d835

feat(asyncpg-integration): Add trace recording for connect calls

661ca50

refactor(asyncpg-integration): Extract duplicated code into function

8fa3d03

fix(typing): Fix type annotations for asyncpg integration

d5656f9

mimre25 marked this pull request as ready for review September 3, 2023 15:02

antonpirker added 2 commits September 4, 2023 12:50

Merge branch 'master' into asyncpg-integration

00c3f97

Merge branch 'master' into asyncpg-integration

52aa3a0

antonpirker added 6 commits September 5, 2023 14:19

Added db span data

63b58dc

Linting fix

c528904

Trying to remove ParamSpec

c64f039

Reformat

fd1bf6b

Fixed typing and more db span data recording.

d179931

Removed syntax not known to python 2

ea524dd

antonpirker requested changes Sep 6, 2023

Merge branch 'master' into asyncpg-integration

97d9eab

antonpirker mentioned this pull request Sep 6, 2023

Added asyncpg docs getsentry/sentry-docs#7756

Merged

antonpirker changed the title ~~feat(integrations): Add integration for asyncpg >= 0.23.0 (WIP)~~ feat(integrations): Add integration for asyncpg Sep 6, 2023

antonpirker and others added 5 commits September 6, 2023 11:09

Merge branch 'master' into asyncpg-integration

bd75d0a

fix(tests): Fix expected results for asyncpg connect tests

07161a1

This adapts the tests to the different method of recording connection parameters in the span.

fix(asyncpg-integration): Remove recording of every cursor execute

f04a9c7

This is done to avoid recording of too many "queries" in a single span. Instead, we now record cursor creation.

fix(typing): Import __future__ annotations to allow modern type hints

4c977dd

fix(typing): Fix type hints for asyncpg integration

47aa451

mimre25 requested a review from antonpirker September 9, 2023 10:20

Merge branch 'master' into asyncpg-integration

9715829

antonpirker enabled auto-merge (squash) September 11, 2023 06:59

antonpirker approved these changes Sep 11, 2023

Added postgres to asyncpg tests

89c1023

antonpirker merged commit 87d582d into getsentry:master Sep 11, 2023

sentrivana pushed a commit that referenced this pull request Sep 18, 2023

feat(integrations): Add integration for asyncpg (#2314)

27e06b3
