
[metricbeat] Raid metricset with expanded disk states, using /sys/block #11613

Merged · 16 commits · Apr 23, 2019

Conversation

fearful-symmetry
Contributor

@fearful-symmetry fearful-symmetry commented Apr 2, 2019

I suggest y'all read #11292 first, to catch up on the conversation around expanding the RAID metricset data, as well as #5600, the original issue around expanding the data collection for RAID. This PR is meant as a sort of comparison to #11292, and addresses some issues that the earlier PR doesn't.

Summary: our current implementation of the RAID metricset uses /proc/mdstat, which has a number of limitations:

  • This is only available on Linux. The procfs compat layer for FreeBSD does not include mdstat.
  • This is meant as a human-readable interface, so the format is fairly unstable (I found a number of different reporting forms between different /proc/mdstat samples, presumably from different kernel versions) and not well documented as far as machine readability goes. Also, lots of regex. Ew.
  • Compared to other interfaces, the data we can get from it is limited.

My previous PR (#11292) uses ioctl to gather expanded disk metrics, in line with the request made in #5600. While this works, it adds an extra dependency, and creating a mock test harness around ioctl is rather ugly.

In addition to expanding the disk metrics we get, this implementation uses /sys/block and only /sys/block. No mdstat. This means we have one dependency that can be easily mock-tested using regular files, though we trade regex rules for lots of file operations and directory traversal. It also provides us with a much richer source of data compared to mdstat.

Unlike the previous PR, I wrote this with a longing glance towards cross-compatibility. Like our current implementation, this is linux-only. However, I did try to address a complicated problem, which is how different OSes report RAID status. This is the primary reason why this is a draft PR, as I'd like input from others with regards to how we do this.

Right now, we report two disk states: total and available. This PR adds failed and spare. However, the sysfs subsystem provides a number of different statuses that disks can have:

  • faulty
  • in_sync
  • writemostly
  • blocked
  • spare
  • write_error
  • want_replacement
  • replacement

You can read more about the states in the kernel's md documentation.

FreeBSD has a number of states, mostly different from Linux (usual caveat: I'm not a FreeBSD expert):
- NODISK
- NONE
- NEW
- SYNCHRONIZING
- DISCONNECTED
- OFFLINE
- DISABLED
- FAILED
- STALE_FAILED
- SPARE
- STALE
- ACTIVE
- DESTROY
- INVALID

(see /sys/geom on freebsd)

I spent a lot of time thinking about how we want to handle this. My current solution (which I'm not married to, by any means) is to "standardize" them into active, total, spare, and failed, and provide an array with the original states for the sake of data preservation. This is somewhat problematic, as it requires some editorializing as to what constitutes "active" and "failed" and so on. It can also be problematic if kernel updates add new statuses. (Do we assume an unknown status is good or bad?)
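The "editorializing" step could look something like the following sketch. The bucket names and the mapping itself are illustrative choices for discussion, not the final mapping the PR shipped; the `unknown` bucket is one answer to the "new kernel statuses" question.

```go
package main

import "fmt"

// categorize maps a raw Linux sysfs disk state onto one of the standardized
// buckets discussed above. Which raw state lands in which bucket is exactly
// the opinionated part; this particular assignment is a hypothetical example.
func categorize(raw string) string {
	switch raw {
	case "in_sync", "replacement", "writemostly":
		return "active"
	case "spare", "want_replacement":
		return "spare"
	case "faulty", "blocked", "write_error":
		return "failed"
	default:
		// States added by future kernels land here, so they stay visible
		// instead of being silently assumed good or bad.
		return "unknown"
	}
}

func main() {
	for _, s := range []string{"in_sync", "spare", "faulty", "journal"} {
		fmt.Printf("%s -> %s\n", s, categorize(s))
	}
}
```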

The other solution is to keep the strings the same, and just reduce them, so {"in_sync","in_sync","spare"} becomes {"in_sync":2, "spare":1} in the final mapping. This has some implications for mapping, and could make alerting/visualizing harder. On the other hand, it's completely un-opinionated. This doesn't take into account MacOS and Windows, where I have no idea how RAID status is reported. Although it's not included in this initial commit, adding FreeBSD support to this shouldn't be too difficult.

Some other things to take note of:

  • activity_state was turned into status and the string status codes are different from mdstat.
  • Aside from this, I did try to make sure we were backwards-compatible with how the mdstat metricset worked.
  • This doesn't need to use only sysfs. We could use /sys/block for expanded disk states, and continue to use mdstat for backwards compatibility or other things. I have no problems with this.
  • There's a lot more data we could mine from /sys/block if we wanted. Error counters, and other things. Check out the above docs link.
  • We added a sync_action field, which further expands on the data gathered.
  • I added a little mock sysfs filesystem to our testdata. In real life, I've tested this extensively on my home lab, which has an x86 node with raid0 and raid1 arrays.

@ruflin
Collaborator

ruflin commented Apr 3, 2019

Thanks for the detailed PR description, much appreciated.

I like the idea of having the standardised fields + the originals. For the data structure I prefer your suggestion with "states": {"in_sync":2, "spare":1} over repeating the entries, this will make querying easier.

We then can have something like "raw_states": {"foo":2, "bar":7}. Template-wise we can solve it with dynamic templates, as we know all the fields inside will be integers (correct?).

For our visualisations I would only use the standardised states and if we have a state that we don't know, we count it as unknown.

@fearful-symmetry
Contributor Author

@ruflin So you're saying we only use the standardized states inside visualizations, and all reporting is just the 'raw' states we get from the subsystems?

All the values should be integers, yeah. Don't think I'm familiar with dynamic templates.

@ruflin
Collaborator

ruflin commented Apr 3, 2019

I think we should also report the standardised states in the event itself and these are the ones we use for the visualisation, but the same doc will also contain the raw states.

For dynamic templates, have a look here: https://www.elastic.co/guide/en/elasticsearch/reference/current/dynamic-templates.html And for an example in Beats: https://github.com/elastic/beats/blob/master/metricbeat/module/docker/cpu/_meta/fields.yml#L39

@fearful-symmetry
Contributor Author

Ahh, alright.

So we have standardized states, then keep the raw_states the same, but move it to a dict. If we're keeping the original status strings in a dict, do we want to keep our current standardized states (working, total, failed, spare) as-is?

@ruflin
Collaborator

ruflin commented Apr 4, 2019

Yes, assuming what you mean by dict is looking something like this:

{
  ...
  "states": {
    "in_sync": 2,
    "spare": 1
  },
  "raw_states": {
    "foo": 2,
    "bar": 7
  }
  ...
}

To make it a non-breaking change, we could keep states at the level it is at the moment; this would probably be a bit more user friendly, even though not 100% clean.

Do the current state names match with what you have in mind for standardisation?

@fearful-symmetry
Contributor Author

@ruflin The current standardized names are total, working, failed or spare and I'm hoping that states reported by other OSes can be crammed into one of those categories.

@fearful-symmetry
Contributor Author

Part of me still wants to forgo the standardized fields and just use raw names (maybe keeping the total/active fields), particularly as we're now leaning towards just having two maps reporting nearly the same data.

@ruflin
Collaborator

ruflin commented Apr 4, 2019

In any case for the existing platforms we need to keep the old names around for a bit as otherwise it would be a breaking change. We could try for the additional platforms to only use the raw values. The problem I see with that is that if we want to build a dashboard, we need to build one for each platform?

@fearful-symmetry
Contributor Author

Right now our only existing disk states are total and working, and keeping those around should be fine. But yah, I see your point, this makes a handful of other things harder.

@ruflin
Collaborator

ruflin commented Apr 4, 2019

One more idea: We standardise but make the raw states opt-in through a config flag. Like this the base event stays simple.

@fearful-symmetry
Contributor Author

That could work!

The thing I'm more worried about is the standardized fields. I'm having trouble finding docs for FreeBSD's geom raid states, and some other platforms might be just as bad. We also probably won't find out about new states in kernel updates until someone files an issue. Considering the other things we have going on across the systems module, that's probably not too bad though?

@fearful-symmetry
Contributor Author

I suppose that could also be a "cross that bridge when we get to it" problem, as right now this is still linux-only.

@fearful-symmetry fearful-symmetry added the Team:Integrations Label for the Integrations team label Apr 4, 2019
@ruflin
Collaborator

ruflin commented Apr 5, 2019

+1 on "cross that bridge when we get there" and apply the 80/20 rule :-)

@fearful-symmetry
Contributor Author

@ruflin the latest commit I put in last night moves the raw states under disks; the end data now looks like this:

   "system": {
     "raid": {
       "status": "clean",
       "disks": {
         "active": 2,
         "total": 3,
         "spare": 1,
         "failed": 0,
         "states": {
           "in_sync": 2,
           "spare": 1
         }
       },
       "blocks": {
         "synced": 4189184,
         "total": 4189184
       },
       "sync_action": "idle",
       "name": "md0"
     }
   }
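The sample above can be assembled as a nested map; this sketch is illustrative (buildEvent is a hypothetical helper, not the metricset's actual code, and the rule that "faulty" counts as failed is an assumed mapping):

```go
package main

import (
	"encoding/json"
	"fmt"
)

// buildEvent assembles a per-device event in the shape shown above, deriving
// the standardized disk counters from the raw state counts. The derivation
// rules here (spare counts as spare, faulty counts as failed, everything else
// as active) are an assumption for illustration.
func buildEvent(name, status, syncAction string, states map[string]int) map[string]interface{} {
	total, spare, failed := 0, 0, 0
	for s, n := range states {
		total += n
		switch s {
		case "spare":
			spare += n
		case "faulty":
			failed += n
		}
	}
	return map[string]interface{}{
		"name":        name,
		"status":      status,
		"sync_action": syncAction,
		"disks": map[string]interface{}{
			"active": total - spare - failed,
			"total":  total,
			"spare":  spare,
			"failed": failed,
			"states": states,
		},
	}
}

func main() {
	ev := buildEvent("md0", "clean", "idle", map[string]int{"in_sync": 2, "spare": 1})
	out, _ := json.MarshalIndent(ev, "", "  ")
	fmt.Println(string(out))
}
```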

@ruflin
Collaborator

ruflin commented Apr 8, 2019

@fearful-symmetry What you have in the above comment LGTM.

Member

@jsoriano jsoriano left a comment


I like that we can get all the info from a single place! This is looking good. I have added some comments. I'd be fine with continuing with this idea, and we can leave support for other implementations for the future.

	// Fetch fetches one event for each device
	func (m *MetricSet) Fetch(r mb.ReporterV2) {
	-	stats, err := m.fs.ParseMDStat()
	+	devices, err := blockinfo.ListAllMDDevices(m.sysfs)
Member


Some ideas to make this more independent of the OS and the raid subsystem:

  • Make blockinfo an object that implements an interface
  • This object lists directly the objects, so nothing like GetMDDevice is needed later

So at the end you can have something like:

	devices, err := m.blockinfo.ListAll()
	...
	for _, device := range devices {
	    disks := device.Disks()
	    ...
	}

Also take into account that even on Linux there can be different RAID implementations.

Contributor Author


Make blockinfo an object that implements an interface

I actually had this idea fairly early on, but ended up not implementing it for the sake of getting a working PoC down. I should do that.
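The interface idea being quoted could be sketched roughly as follows. All names here are illustrative (`Lister`, `Device`, the mock types), not what the PR actually defines; the point is that the sysfs backend, a future FreeBSD geom backend, or a test double can all sit behind the same interface.

```go
package main

import "fmt"

// Device is one RAID device, regardless of which backend produced it.
type Device interface {
	Name() string
	Disks() []string // raw per-disk states
}

// Lister enumerates devices; blockinfo would implement this for sysfs,
// and other backends (or test mocks) can implement it too.
type Lister interface {
	ListAll() ([]Device, error)
}

// mockDevice and mockLister show how a test double satisfies the interface
// without touching the filesystem.
type mockDevice struct {
	name  string
	disks []string
}

func (d mockDevice) Name() string    { return d.name }
func (d mockDevice) Disks() []string { return d.disks }

type mockLister struct{ devices []Device }

func (l mockLister) ListAll() ([]Device, error) { return l.devices, nil }

func main() {
	var bi Lister = mockLister{devices: []Device{
		mockDevice{name: "md0", disks: []string{"in_sync", "in_sync", "spare"}},
	}}
	devices, _ := bi.ListAll()
	for _, device := range devices {
		fmt.Println(device.Name(), device.Disks())
	}
}
```

With this shape, the Fetch loop only ever talks to `Lister` and `Device`, so nothing like GetMDDevice is needed later.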

@jsoriano
Member

jsoriano commented Apr 8, 2019

The current standardized names are total, working, failed or spare and I'm hoping that states reported by other OSes can be crammed into one of those categories.

+1 to these categories. I am only missing something for synced/synchronizing — can we get this per disk in Linux? Are disks being synchronized included in working?

@fearful-symmetry
Contributor Author

I am only missing something for synced/synchronizing, can we get this per disk in Linux? are disks being synchronized included in working?

I believe so, I can't find anything suggesting an explicit resync state. If you force a repair on the array, the disks will stay in_sync unless something goes wrong.

@fearful-symmetry
Contributor Author

Just pushed a commit that fixes most of the issues Jaime mentioned. Will implement this next:

Make blockinfo an object that implements an interface
This object lists directly the objects, so nothing like GetMDDevice is needed later

@fearful-symmetry
Contributor Author

Just pushed another commit to simplify the API that actually reads from sysfs, so now we have a single array that's returned from a ListAll call.

I'm not sure if there's a way to further 'reduce' this for the sake of cross-compatibility. Most of the cross-compatible system metricsets end up populating a struct, and we have a fairly large number of fields that eventually need to make their way to the metricset.

@fearful-symmetry
Contributor Author

Okay, just made another commit that simplifies the whole interface to /sys/block.

Looking at how other parts of the system module do cross-compatibility, it looks like gosigar declares all its structs in a global file, with method implementations in per-target files. This is a step towards that: we have two files that are just the linux implementation, and a global file that declares the structs.

@fearful-symmetry
Contributor Author

jenkins, test this

@ruflin
Collaborator

ruflin commented Apr 15, 2019

@fearful-symmetry I think it's time to get this out of a "Draft" ;-)

@fearful-symmetry fearful-symmetry marked this pull request as ready for review April 15, 2019 13:09
@fearful-symmetry fearful-symmetry requested a review from a team as a code owner April 15, 2019 13:09
@fearful-symmetry fearful-symmetry requested a review from a team April 15, 2019 13:10
@fearful-symmetry
Contributor Author

Finally added an entry to the changelog.

Member

@jsoriano jsoriano left a comment


Sorry, I missed that the unknown count was still around, and a small detail on the docs. For the rest it LGTM.

@fearful-symmetry
Contributor Author

jenkins, test this

Contributor

@kaiyan-sheng kaiyan-sheng left a comment


LGTM 👍

@fearful-symmetry fearful-symmetry merged commit 3d95769 into elastic:master Apr 23, 2019