Update instrumentors to use span processors #852

ocelotl · 2020-06-24T05:29:15Z

Fixes #832

This can be implemented as a fix right now, but the issue highlights a more pressing matter: configuration in the standard sense (the kind that uses Configuration) overlaps with configuration that the end user implements as application code.

lzchen · 2020-06-24T15:04:20Z

Can you explain a bit more about what the issue is and how we are solving it? Is this a quick fix or a permanent solution? I also am confused with this: the fact that there is an overlap between configuration in the standard sense (the kind that uses Configuration) and the kind that is implemented by the end user as application code, how does this relate to the problem? An example would be nice.

ocelotl · 2020-06-24T16:07:50Z

> Can you explain a bit more about what the issue is and how we are solving it? Is this a quick fix or a permanent solution? I also am confused with this: the fact that there is an overlap between configuration in the standard sense (the kind that uses Configuration) and the kind that is implemented by the end user as application code, how does this relate to the problem? An example would be nice.

Sure, the problem here happens with auto instrumentation. The example from @mat-rumian in #832 used auto instrumentation to launch several scripts. Among them, this code was getting executed:

from opentelemetry import trace
from opentelemetry.ext.jaeger import JaegerSpanExporter
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import (
    ConsoleSpanExporter,
    SimpleExportSpanProcessor,
    BatchExportSpanProcessor,
)
import opentelemetry.ext.requests


def configure_jaeger_span_exporter(configuration: dict):
    exporter = JaegerSpanExporter(
        service_name=configuration['service_name'],
        agent_host_name=configuration['agent_host'],
        agent_port=int(configuration['agent_port']),
    )

    trace.set_tracer_provider(TracerProvider())
    trace.get_tracer(__name__)

    span_processor = BatchExportSpanProcessor(exporter)
    trace.get_tracer_provider().add_span_processor(span_processor)

    trace.get_tracer_provider().add_span_processor(
        SimpleExportSpanProcessor(ConsoleSpanExporter())
    )

Span processors were added to the tracer provider, but only after auto instrumentation had already run. During auto instrumentation, a tracer is created (without span processors). This object is stored in a function as a closure: here and here.

So the requests and the dbapi instrumentations end up using this tracer object, which lacks the span processors. As a result, their spans are not exported, which is why they were not showing up in the Jaeger exporter.

This is still a draft. It could be added as a fix in the instrumentations, which would solve the issue in these two instrumentations, but I think there is a bigger issue: some configuration can happen in environment variables (configuration that is accessible to the auto instrumentation process) and some can happen in application code (configuration that is not accessible to the auto instrumentation process, like the addition of span processors in the application code).

toumorokoshi · 2020-06-30T16:30:47Z

Yeah, this is a behavioral issue that has bitten me a little bit too. You have to be very careful with the ordering of the creation of the required components:

named tracer, which binds to whatever trace provider (noop by default)
instrumentation, which can be designed to create a named tracer on instrumentation right away, or deferred
trace provider.

To make this fool-proof, we need to ensure creation of the TracerProvider before auto-instrumentation. This is hard to do, because one typically configures their TracerProvider programmatically, after the application initializes.

I agree with @ocelotl that the best option is to defer tracer creation until we absolutely have to. Not sure how we can codify it though, besides documenting that practice somewhere and hoping that others will follow it.
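One way the deferral could look, sketched with stand-ins rather than the real OpenTelemetry API: the instrumented function resolves the provider on each call instead of capturing a tracer at instrumentation time. `Provider`, `instrument_deferred` and `traced_get` are hypothetical names, and the tracer is collapsed into the provider to keep the sketch short; this is not necessarily how the PR implements it.

```python
# Stand-ins for illustration only; these are NOT the OpenTelemetry API.
class Provider:
    def __init__(self):
        self.processors = []

    def add_span_processor(self, processor):
        self.processors.append(processor)

    def emit(self, span_name):
        for processor in self.processors:
            processor(span_name)


_provider = Provider()  # default provider, without processors


def set_tracer_provider(provider):
    global _provider
    _provider = provider


def get_tracer_provider():
    return _provider


def instrument_deferred():
    def traced_get(url):
        # Resolve the provider at call time instead of capturing it in the closure.
        get_tracer_provider().emit("GET " + url)

    return traced_get


exported = []

traced_get = instrument_deferred()   # instrumentation still runs first
set_tracer_provider(Provider())      # the application configures its provider later
get_tracer_provider().add_span_processor(exported.append)

traced_get("/users")
print(exported)  # ['GET /users']
```

The trade-off is a lookup on every call in exchange for honoring configuration that arrives after instrumentation.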

toumorokoshi

I think there are optimizations for processor creation, but there may be value in tackling that at a higher level.

This does resolve a critical issue with trace configuration with auto-instrumentation. Thanks for the fix!

ext/opentelemetry-ext-requests/src/opentelemetry/ext/requests/__init__.py

ext/opentelemetry-ext-dbapi/src/opentelemetry/ext/dbapi/__init__.py

lzchen

Nice

Fixes open-telemetry#832. By having tracer creation occur on demand, late tracer provider configuration will be honored. This resolves issues with instrumentation occurring before tracer providers are set by the application developer, which would result in the no-op tracer used for the lifetime of the instrumentation. Co-authored-by: alrex <[email protected]> Co-authored-by: Leighton Chen <[email protected]> Co-authored-by: Yusuke Tsutsumi <[email protected]>

ocelotl requested a review from a team June 24, 2020 05:29

ocelotl marked this pull request as draft June 24, 2020 05:31

ocelotl mentioned this pull request Jun 24, 2020

Missing spans in case of multi library auto-instrumented project #832

Closed

ocelotl self-assigned this Jun 24, 2020

ocelotl force-pushed the issue_832 branch from a707e16 to f4c7b1c Compare June 25, 2020 00:37

ocelotl marked this pull request as ready for review June 25, 2020 00:37

ocelotl changed the title ~~Add fix in instrumentors~~ Update instrumentors to use span processors Jun 25, 2020

ocelotl added the ext and instrumentation labels Jun 25, 2020

ocelotl requested review from lzchen and codeboten June 25, 2020 02:05

toumorokoshi approved these changes Jun 30, 2020

lzchen approved these changes Jul 2, 2020

ocelotl added 5 commits July 6, 2020 09:37

Fix Psycopg2

d924929

Fix mysql

431f3a9

Fix sqlite3

62efe30

Fix pymysql

c86644b

Fix lint

f104639

ocelotl force-pushed the issue_832 branch 2 times, most recently from bae0674 to f104639 Compare July 7, 2020 01:06

alrex and others added 3 commits July 7, 2020 08:35

Merge branch 'master' into issue_832

8afa3eb

Merge branch 'master' into issue_832

342770b

Merge branch 'master' into issue_832

772377e

toumorokoshi merged commit 09df35c into open-telemetry:master Jul 8, 2020
