Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bump opentelemetry-collector to v0.14.0 #2617

Merged
merged 5 commits into from
Nov 9, 2020

Conversation

pkositsyn
Copy link
Contributor

Which problem is this PR solving?
Adds compatibility with new version of collector

Short description of the changes

Just refactoring mostly. There is only one open question: should retry queue be enabled by default?
Collector defaults are to enable it, but according to tests it should be disabled

@pkositsyn pkositsyn requested a review from a team as a code owner November 6, 2020 19:19
@pkositsyn pkositsyn requested a review from yurishkuro November 6, 2020 19:19
jpkrohling
jpkrohling previously approved these changes Nov 6, 2020
Copy link
Contributor

@jpkrohling jpkrohling left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@@ -91,17 +91,17 @@ func (f Factory) CreateDefaultConfig() configmodels.Exporter {

// CreateTraceExporter creates Jaeger Cassandra trace exporter.
// This function implements OTEL component.ExporterFactory interface.
func (f Factory) CreateTraceExporter(
func (f Factory) CreateTracesExporter(
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hehe, what happens when so many people contributing are not native speakers. "trace exporter" was grammatically correct name.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not a jab at you, @Vemmy124 , but on the upstream change

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I confess it did sound strange to my ears, but I'm not a native speaker myself, so, thought it was a problem with my ears :-)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe that was bogdandrutu's idea open-telemetry/opentelemetry-collector#1974

Copy link
Contributor Author

@pkositsyn pkositsyn Nov 8, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And maybe it's really the problem of me being not native speaker, but I agree with him, because the component consumes the structure named pdata.Traces

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It doesn't matter what exact data type is passed, a component that exports traces (plural) is "trace exporter" (noun used as adjective in singular form).

if err != nil {
return "", err
}
spanIDInt := tracetranslator.BytesToUInt64SpanID(spanID.Bytes())
if spanIDInt == 0 {
return "", errZeroSpanID
}
return dbmodel.SpanID(fmt.Sprintf("%016x", spanIDInt)), nil
Copy link
Member

@yurishkuro yurishkuro Nov 6, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This probably would be more efficient:

import "encoding/hex"

src := spanID.Bytes()
dst := make([]byte, hex.EncodedLen(len(src)))
hex.Encode(dst, src)
return dbmodel.SpanID(dst)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

According to benchmark run on my device, this change gave 70ns vs previous 170ns

if err != nil {
return "", err
}
spanIDInt := tracetranslator.BytesToUInt64SpanID(spanID.Bytes())
if spanIDInt == 0 {
return "", errZeroSpanID
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This feels misplaced to me. This kind of validation should've been done way earlier in the pipeline. The ES backend itself doesn't actually care if IDs are all zeroes, it would still work.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You're definitely right, but practically having this error handled here can prevent some time of unnecessary debugging in case of zero IDs and missing validation earlier. I believe this error arises systematically, so anyway there will be collisions by ID and this will only implicitly indicate zero ID error.

{
traceID: pdata.NewTraceID([]byte("invalid-%")),
err: "TraceID does not have 16 bytes",
},
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not sure why these tests are removed - give the current code there would still be some kind of error returned from converters.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

But TraceID is now exactly [16]byte. Why would the error be there even if only several first bytes are set?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They are silently truncated according to this PR conversations open-telemetry/opentelemetry-collector#2001

yurishkuro
yurishkuro previously approved these changes Nov 6, 2020
Copy link
Member

@yurishkuro yurishkuro left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm, but why are some tests removed?

@pkositsyn
Copy link
Contributor Author

Seems like a problem with build. Could you take a look please, since you're acquainted with how tests are run?

@yurishkuro
Copy link
Member

Lint Failures
cmd/opentelemetry/app/exporter/badgerexporter/factory.go:92:1: comment on exported method Factory.CreateTracesExporter should be of the form "CreateTracesExporter ..."
cmd/opentelemetry/app/exporter/cassandraexporter/factory.go:74:1: comment on exported method Factory.CreateTracesExporter should be of the form "CreateTracesExporter ..."
cmd/opentelemetry/app/exporter/elasticsearchexporter/factory.go:75:1: comment on exported method Factory.CreateTracesExporter should be of the form "CreateTracesExporter ..."
cmd/opentelemetry/app/exporter/grpcpluginexporter/factory.go:68:1: comment on exported method Factory.CreateTracesExporter should be of the form "CreateTracesExporter ..."
cmd/opentelemetry/app/exporter/jaegerexporter/jaeger_exporter.go:61:1: comment on exported method Factory.CreateTracesExporter should be of the form "CreateTracesExporter ..."
cmd/opentelemetry/app/exporter/kafkaexporter/kafka_exporter.go:100:1: comment on exported method Factory.CreateTracesExporter should be of the form "CreateTracesExporter ..."
cmd/opentelemetry/app/exporter/memoryexporter/factory.go:81:1: comment on exported method Factory.CreateTracesExporter should be of the form "CreateTracesExporter ..."
cmd/opentelemetry/app/processor/resourceprocessor/resource_processor.go:70:1: comment on exported method Factory.CreateTracesProcessor should be of the form "CreateTracesProcessor ..."
cmd/opentelemetry/app/receiver/jaegerreceiver/jaeger_receiver.go:150:1: comment on exported method Factory.CreateTracesReceiver should be of the form "CreateTracesReceiver ..."
cmd/opentelemetry/app/receiver/kafkareceiver/kafka_receiver.go:107:1: comment on exported method Factory.CreateTracesReceiver should be of the form "CreateTracesReceiver ..."
cmd/opentelemetry/app/receiver/zipkinreceiver/zipkin_receiver.go:58:1: comment on exported method Factory.CreateTracesReceiver should be of the form "CreateTracesReceiver ..."
Makefile:198: recipe for target 'go-lint' failed

@pkositsyn
Copy link
Contributor Author

I don't understand, how this piece of code passed lint before then. Anyways, let's add comments

@mergify mergify bot dismissed stale reviews from jpkrohling and yurishkuro November 9, 2020 11:07

Pull request has been modified.

@pkositsyn
Copy link
Contributor Author

I wasn't right about the tests correction, fixed now. What is the reason of absence of golangci.yml in the main repo? Wouldn't it be useful for local checks?

@codecov
Copy link

codecov bot commented Nov 9, 2020

Codecov Report

Merging #2617 (5b5e72e) into master (2a0d8cc) will decrease coverage by 0.03%.
The diff coverage is 100.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #2617      +/-   ##
==========================================
- Coverage   95.07%   95.04%   -0.04%     
==========================================
  Files         209      209              
  Lines        9364     9364              
==========================================
- Hits         8903     8900       -3     
- Misses        385      387       +2     
- Partials       76       77       +1     
Impacted Files Coverage Δ
plugin/storage/badger/factory.go 95.78% <100.00%> (ø)
plugin/storage/badger/spanstore/reader.go 95.37% <100.00%> (ø)
cmd/query/app/server.go 88.52% <0.00%> (-1.64%) ⬇️
plugin/storage/integration/integration.go 77.34% <0.00%> (-0.56%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 740264b...5b5e72e. Read the comment docs.

@pkositsyn
Copy link
Contributor Author

@yurishkuro @jpkrohling Can we merge now?

@jpkrohling
Copy link
Contributor

There's one pending change requested by Yuri: #2617 (comment)

About the linting configuration: I think it would be a welcome change.

Signed-off-by: Pavel Kositsyn <[email protected]>
@pkositsyn
Copy link
Contributor Author

@yurishkuro was right - new method for span ID conversion is a lot faster. Now I believe everything is resolved (well, idk if we reached consensus about spanID validation)

yurishkuro
yurishkuro previously approved these changes Nov 9, 2020
@yurishkuro
Copy link
Member

Restarted build. I have not seen crossdock test failing too often recently, and here it failed several times, may be related to this change. But could be flaky too, let's see how the build comes back.

@pkositsyn
Copy link
Contributor Author

FWIW in my build on real data some traces are lost somewhere between elasticexporter and elastic cluster. The worst thing is that there are no logs about loss in opentelemetry-collector component until v0.14.0. This might be closely related actually and I will investigate that once we merge this PR.

Defending my changes in commits, I have to notice, that tests passed with my previous commit, which (at least should) have equivalent semantics with the latest version.

@yurishkuro
Copy link
Member

Pretty sure the crossdock_otel CI failure is related to this change - it consistently failed 3 times, while passing on master and other PRs.

Signed-off-by: Pavel Kositsyn <[email protected]>
@mergify mergify bot dismissed yurishkuro’s stale review November 9, 2020 22:19

Pull request has been modified.

@pkositsyn
Copy link
Contributor Author

Found the problem. There was no test on the case when only 8 bytes of trace id is generated. Corrected tests to check that as well

@yurishkuro yurishkuro merged commit 80805d2 into jaegertracing:master Nov 9, 2020
@yurishkuro
Copy link
Member

Thanks!

Go phystech.edu :-)

@pkositsyn pkositsyn deleted the bump_opentelemetry branch November 10, 2020 08:34
quinniup pushed a commit to k8battleship/jaeger that referenced this pull request Nov 23, 2020
* bump opentelemetry-collector to v0.14.0

Signed-off-by: Pavel Kositsyn <[email protected]>

* initialize traceid and spanid explicitly

Signed-off-by: Pavel Kositsyn <[email protected]>

* fix comments and empty parent span check

Signed-off-by: Pavel Kositsyn <[email protected]>

* fasten convert traceID/spanID

Signed-off-by: Pavel Kositsyn <[email protected]>

* fix convertTraceID + fix tests

Signed-off-by: Pavel Kositsyn <[email protected]>
quinniup added a commit to k8battleship/jaeger that referenced this pull request Nov 23, 2020
* Bump opentelemetry-collector to v0.14.0 (jaegertracing#2617)

* bump opentelemetry-collector to v0.14.0

Signed-off-by: Pavel Kositsyn <[email protected]>

* initialize traceid and spanid explicitly

Signed-off-by: Pavel Kositsyn <[email protected]>

* fix comments and empty parent span check

Signed-off-by: Pavel Kositsyn <[email protected]>

* fasten convert traceID/spanID

Signed-off-by: Pavel Kositsyn <[email protected]>

* fix convertTraceID + fix tests

Signed-off-by: Pavel Kositsyn <[email protected]>

* Update CodeQL to latest best practices (jaegertracing#2615)

This will parallelize your analysis and speed things up a bunch.

Signed-off-by: jhutchings1 <[email protected]>

Co-authored-by: Juraci Paixão Kröhling <[email protected]>

* Fix flaky TestReload (jaegertracing#2624)

Signed-off-by: albertteoh <[email protected]>

* Update x/text to v0.3.4 (jaegertracing#2625)

Signed-off-by: Gary Brown <[email protected]>

* Bump to latest UI for snapshot builds (jaegertracing#2626)

Signed-off-by: Yuri Shkuro <[email protected]>

* Implement anonymizer's main program (jaegertracing#2621)

* Preparing release 1.21.0 (jaegertracing#2630)

* updated changelog

Signed-off-by: Joe Elliott <[email protected]>

* Added ui changelog

Signed-off-by: Joe Elliott <[email protected]>

* Fixed UI changelog to point to 1.12.0

Signed-off-by: Joe Elliott <[email protected]>

* Updated jaeger-ui to v1.12.0

Signed-off-by: Joe Elliott <[email protected]>

* Resolving concerns

Signed-off-by: Joe Elliott <[email protected]>

* [anonymizer] Save trace in UI format (jaegertracing#2629)

* Use fossa-contrib/fossa-action instead (jaegertracing#2571)

* Use fossa-contrib/fossa-action instead

Signed-off-by: Sora Morimoto <[email protected]>

* Make step name clearer

Signed-off-by: Sora Morimoto <[email protected]>

Co-authored-by: Yuri Shkuro <[email protected]>

* Update Makefile and Dockerfile for anonymizer (jaegertracing#2632)

Signed-off-by: Ashmita Bohara <[email protected]>

* Fix listen IP in unit test (jaegertracing#2636)

Signed-off-by: zouyu <[email protected]>

* Bump opentelemetry to v0.15.0 (jaegertracing#2634)

* Bump opentelemetry to v0.15.0

Signed-off-by: Pavel Kositsyn <[email protected]>

* add default value instead of nil value for jaegerreceiver config

Signed-off-by: Pavel Kositsyn <[email protected]>

* make lint

Signed-off-by: Pavel Kositsyn <[email protected]>

Co-authored-by: Kositsyn Pavel <[email protected]>
Co-authored-by: Justin Hutchings <[email protected]>
Co-authored-by: Juraci Paixão Kröhling <[email protected]>
Co-authored-by: Albert <[email protected]>
Co-authored-by: Gary Brown <[email protected]>
Co-authored-by: Yuri Shkuro <[email protected]>
Co-authored-by: Ashmita <[email protected]>
Co-authored-by: Joe Elliott <[email protected]>
Co-authored-by: Sora Morimoto <[email protected]>
Co-authored-by: ZouYu <[email protected]>
Co-authored-by: Kositsyn Pavel <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants