Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Report input level dropped/filtered event metrics #42325

Open
andrewkroh opened this issue Jan 16, 2025 · 3 comments
Open

Report input level dropped/filtered event metrics #42325

andrewkroh opened this issue Jan 16, 2025 · 3 comments
Assignees
Labels

Comments

@andrewkroh
Copy link
Member

Describe the enhancement:

Today, metrics related to event publishing are available only in an aggregated form that combines metrics from all inputs. This makes it difficult to know which specific input source is responsible for dropped or filtered events.

The input metrics should contain data about the number of events dropped or filtered by each input. This would help narrow down the source of problems to a particular input (and hence a particular integration data stream in the case of Elastic Agent).

I believe this would require the beat publisher client (from libbeat/publisher) to provide a way for inputs to subscribe to this data.

Describe a specific use case for the enhancement or feature:

  • Diagnostic dumps from Elastic Agent will contain dropped and filtered metrics that are specific to an integration data stream (via input_metrics.json).
  • Input metric dashboards in the Elastic Agent fleet integration can display dropped/filtered charts, providing a visual representation of the issue.
  • Standalone beat users can access detailed information about which input is causing drops through the HTTP monitoring API or periodical logs when logging_metrics_namespaces: [stats, inputs] is used. This would allow them to quickly identify and resolve issues.
@elasticmachine
Copy link
Collaborator

Pinging @elastic/elastic-agent-data-plane (Team:Elastic-Agent-Data-Plane)

@cmacknz
Copy link
Member

cmacknz commented Jan 16, 2025

We already know the target index/datastream for each event in the output, I wonder if we could instead have per data stream stats at the output level? That might get the same result with less internal pipeline wiring.

@andrewkroh
Copy link
Member Author

I like the idea. However, for standalone Beats users this might not be as useful since most users are writing all data to filebeat-x.x.x. But if it's significantly easier, then perhaps that's a good trade-off.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants