-
Notifications
You must be signed in to change notification settings - Fork 260
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update kafka.yaml: proper spacing & recent changes #163
Conversation
With previous spacing check throws exception: InvalidJMXConfiguration("Each configuration must have an 'include' section.") In particular incorporates DataDog/dd-agent@75863d8 - counters-> guages, DataDog/dd-agent@5382422 - jmxfetch custom tags and DataDog/dd-agent@8bac88e - support for jmxfetch0.4
Confusingly using this I am seeing
but the latter error kafka section persists even if I delete the yaml from the config altogether and restart the agent. On one of our brokers I tried uninstalling the dd-agent altogether and it still came back ... I don't know if there is still something wrong with the config or if this is just some weird stickiness from previously loading the bad yaml. Even so, this PR represents a better state than the current master |
Hum that's weird. And then restart the agent ? It should fix the issue. |
Mhmm deleting that file cleared out that secion, but I have a
Notably it does seem like the jmxfetch bit is not getting stopped with the supervisor |
cleaning up and starting over this pr seems to go clean... don't know if it lasts
info now shows
|
Hmmm ...whereas on the two other brokers in the cluster it seems like stopping the supervisor did kill the jmxfetch okay (and delete those files in tmp to boot). Restarting the agent gave a clean status (whereas status was double-kafka-ed and one jmxfetch was running before the stop) I think I used the most probelmatic of the nodes to do interactive testing of the PR; I may have run Anyway I think its a transient issue. |
Hi @donaldguy, c2e0dbf |
I'm a little confused about appropriate scope What in particular do you think should be tested? Should I constrain it to testing the contents of the yaml file or should it encompass setting up a node with a toy kafka broker as well? Put another way, there are existing bats tests for kafka; why are they insufficient? Seperately there are probably other JMX metrics worth gathering - in particular I probably want to alert on under-replicated partitions. (and maybe encompass all that the LinkedIn folks highlight in https://kafka.apache.org/documentation.html#monitoring ); could I add these here, or do they need to get back to the dd-agent repo ? or should I just aim at an CC: @remh @conorbranagan on the latter |
HI @donaldguy! |
With previous spacing check throws exception:
InvalidJMXConfiguration("Each configuration must have an 'include' section.")
In particular incorporates
DataDog/dd-agent@75863d8 - counters-> guages,
DataDog/dd-agent@5382422 - jmxfetch custom tags
and DataDog/dd-agent@8bac88e - support for jmxfetch0.4