feat: Add instrumentation for postgresql (pg gem) #659

ahayworth · 2021-03-16T22:48:13Z

This commit adds tracing for postgresql, via the pg gem.

The following methods are traced:

exec / query / sync_exec / async_exec
exec_params / sync_exec_params / async_exec_params
prepare / sync_prepare / async_prepare
exec_prepared / sync_exec_prepared / async_exec_prepared

We instrument all of these methods via meta-programming, since otherwise
it'd be incredibly repetitive. The methods can actually be broken down
into three groups - "exec-ish", "prepare-ish", and "exec prepared-ish"
methods. Every method within a group takes the same arguments, and so
we can trace each method in a group in an identical fashion.

Note that the 'sync' and 'async' variants require no special handling,
because they are not async interfaces per se - rather, they are
non-blocking. See the pg docs for more info: https://www.rubydoc.info/gems/pg/PG%2FConnection:exec

We've implemented SQL sanitization the same way that the mysql2 PR did
it - which seems to be done by copying NewRelic. We should consider
extracting that from the NewRelic agent and making it a common shared
library here.

Possibly worth discussion is how we've handled the span name. We've
chosen the format OPERATION_NAME DATABASE_NAME, taking the first word
in the SQL string as the operation if it matches a known good list we
pulled from the Postgres docs. This is similar to how the python and
javascript postgres implementations behave, but is slightly different
than how the ruby mysql instrumentation does it. When falling back to
safe value, we also choose to simply use DATABASE_NAME, which seems to
be in line with current semantic conventions.

Definitely worth discussion is the LRU cache added in d119f8a - this tracks
the 50 most recently-prepared statements that we've traced; so that we can
attach it to the span as db.statement. Otherwise, if you're using a framework
like Rails that will aggressively prepare statements when the database supports
it, then you just see a lot of 'EXECUTE' calls with no real information attached.

While tests pass and we think this is safe, it should be noted that we
haven't actually put this into production testing yet - there could be
horrible bugs still. :)

Fixes #523

This commit adds tracing for postgresql, via the `pg` gem. The following methods are traced: - exec / query / sync_exec / async_exec - exec_params / sync_exec_params / async_exec_params - prepare / sync_prepare / async_prepare - exec_prepared / sync_exec_prepared / async_exec_prepared We instrument all of these methods via meta-programming, since otherwise it'd be incredibly repetitive. The methods can actually be broken down into three groups - "exec-ish", "prepare-ish", and "exec prepared-ish" methods. Every method within a group takes the same arguments, and so we can trace each method in a group in an identical fashion. Note that the 'sync' and 'async' variants require no special handling, because they are not async interfaces _per se_ - rather, they are non-blocking. See the `pg` docs for more info: https://www.rubydoc.info/gems/pg/PG%2FConnection:exec We've implemented SQL sanitization the same way that the Mysql2 gem did it - which seems to be done by copying NewRelic. We should consider extracting that from the NewRelic agent and making it a common shared library here. Possibly worth discussion is how we've handled the span name. We've chosen the format `OPERATION_NAME DATABASE_NAME`, taking the first word in the SQL string as the operation if it matches a known good list we pulled from the Postgres docs. This is similar to how the python and javascript postgres implementations behave, but is slightly different than how the ruby mysql instrumentation does it. When falling back to safe value, we also choose to simply use `DATABASE_NAME`, which seems to be in line with current semantic conventions. While tests pass and we think this is safe, it should be noted that we haven't actually put this into production testing yet - there could be horrible bugs still. :) Fixes #523

linux-foundation-easycla · 2021-03-16T22:48:16Z

The committers are authorized under a signed CLA.

✅ Andrew Hayworth (9111b44, 625c487, 569a3c5, 67ce3f7, cefac7a, 9de0ec1)

…by into ahayworth-add-pg

fbogsany · 2021-03-17T20:00:52Z

We've implemented SQL sanitization the same way that the mysql2 PR did
it - which seems to be done by copying NewRelic. We should consider
extracting that from the NewRelic agent and making it a common shared
library here.

👍 we can do that now or when we hit the next use-case. I asked that we initially tailor it specifically to MySQL to start with, and generalize it later. While we are not required to by the license, it is probably a nice gesture to ask NewRelic if they want to donate that code to OpenTelemetry, if we're going to copy it wholesale.

This uses an LRU cache to store the 50 most recently-prepared database statements. The goal is to provide something more descriptive than 'EXECUTE a1', which is basically what you get when Rails aggressively prepares and executes statements. 50 seems to me like a reasonable trade-off between usefulness and capping memory growth. However, it is technically unbounded if you are not sanitizing SQL - although we could decide to truncate unconditionally.

ahayworth · 2021-03-17T21:39:50Z

👍 we can do that now or when we hit the next use-case. I asked that we initially tailor it specifically to MySQL to start with, and generalize it later. While we are not required to by the license, it is probably a nice gesture to ask NewRelic if they want to donate that code to OpenTelemetry, if we're going to copy it wholesale.

Agreed! Do you know anyone there that might be interested in weighing in? Otherwise I can just go open an issue and see how they feel.

.github/workflows/ci.yml

instrumentation/pg/lib/opentelemetry/instrumentation/pg/constants.rb

instrumentation/pg/lib/opentelemetry/instrumentation/pg/patches/connection.rb

instrumentation/pg/lib/opentelemetry/instrumentation/pg/version.rb

instrumentation/pg/opentelemetry-instrumentation-pg.gemspec

ericmustin · 2021-03-23T16:54:38Z

instrumentation/pg/lib/opentelemetry/instrumentation/pg/patches/connection.rb

+
+            # From:
+            # https://github.com/newrelic/newrelic-ruby-agent/blob/9787095d4b5b2d8fcaf2fdbd964ed07c731a8b6b/lib/new_relic/agent/database/obfuscator.rb
+            # https://github.com/newrelic/newrelic-ruby-agent/blob/9787095d4b5b2d8fcaf2fdbd964ed07c731a8b6b/lib/new_relic/agent/database/obfuscation_helpers.rb


👋 Hey @mwlang hope all is well. At the SIG today we were chatting on how useful this obfuscation code is for orm/db instrumentation in ruby. It would be wonderful to be able to re-use the Obfuscation Module in opentelemetry-ruby directly. Would you be open to making a contribution to add this if you have bandwidth? And if not, any feedback or guidance on this approach?

We implement the bare minimum of an LRU cache ourselves, instead.

ahayworth · 2021-03-24T22:37:48Z

Update: I've removed the dependency on lru_redux, and implemented our own minimal LRU cache instead.

johnnyshields · 2021-03-26T18:39:09Z

FYI using this in production, looking very good thanks.

ahayworth · 2021-03-26T19:39:19Z

@johnnyshields Do you have any thoughts on the number of spans its creating? We're not just instrumenting common things like SELECT, UPDATE, etc - I'm curious if you have any thoughts on if it's perhaps too noisy.

Also, yay! I'm glad it's working well for you!

johnnyshields · 2021-03-27T04:24:36Z

@ahayworth having extra items like BEGIN, COMMIT etc. are very useful to see. In fact, having these extra spans helped us debug an issue yesterday. In the below example you can see clearly that when using transactions the latency doesn't happen on the INSERT step but on the COMMIT.

ahayworth · 2021-03-28T18:53:17Z

@johnnyshields Thanks - that's good feedback, I appreciate it!

renchap · 2021-04-03T10:01:01Z

Would it make sense to report a transaction as a whole span (starting with a BEGIN (or START TRANSACTION) and ending with a COMMIT or ROLLBACK, rather than having two different spans?
BEGIN is never supposed to block AFAIK, so we could have a transaction span, and children spans for the individual queries + the final COMMIT statement.

ahayworth · 2021-04-03T13:52:08Z

Would it make sense to report a transaction as a whole span (starting with a BEGIN (or START TRANSACTION) and ending with a COMMIT or ROLLBACK, rather than having two different spans?

The only real downside I see there is that we'd need to do much more parsing of the SQL statements we're getting in the library (they're just strings at this point before they get handed off to the lower-level C extensions in pg). That would complicate the PR quite a bit, so I think I'd want to leave that out for now.

ahayworth · 2021-04-13T17:02:15Z

Update: I've been running this in production now for awhile, and have seen no problems from it thus far.

johnnyshields · 2021-04-13T17:41:00Z

Same here, have had it in prod for 2 weeks at TableCheck with no problem and lots of volume. Let's merge!

mwear

LGTM. We have multiple people running this in prod with good results.

mwear · 2021-04-14T17:26:16Z

Any objections to me merging this? If not, I will, but this branch needs to have main merged in.

robertlaurin

This looks good me. Thanks a bunch for this contribution :)

instrumentation/pg/lib/opentelemetry/instrumentation/pg/version.rb

ahayworth · 2021-04-15T14:18:04Z

@mwear main is merged and the PR is updated to deal with the new opentelemetry-instrumentation-base extraction. 😄

mwear · 2021-04-15T16:04:40Z

🚀 Thanks @ahayworth!

johnnyshields · 2021-04-16T05:53:19Z

Great job everyone, really appreciate this. Please cut a release soon.

ahayworth added 3 commits March 16, 2021 17:49

Merge branch 'main' into ahayworth-add-pg

625c487

test: Attempt to fix CI for pg instrumentation

569a3c5

Merge branch 'ahayworth-add-pg' of github.com:github/opentelemetry-ru…

67ce3f7

…by into ahayworth-add-pg

This comment has been minimized.

Sign in to view

test: Add explicit postgres hostname

cefac7a

This comment has been minimized.

Sign in to view

ahayworth added 3 commits March 17, 2021 10:03

Merge branch 'main' into ahayworth-add-pg

9de0ec1

test: Fix CI

67d0dad

Merge branch 'ahayworth-add-pg-fix-ci' into ahayworth-add-pg

fab4b37

ahayworth marked this pull request as ready for review March 17, 2021 21:37

ahayworth requested review from dazuma, ericmustin, fbogsany, mwear and robertlaurin as code owners March 17, 2021 21:37

ahayworth changed the title ~~[WIP] feat: Add instrumentation for postgresql (pg gem)~~ feat: Add instrumentation for postgresql (pg gem) Mar 17, 2021