ENH: muscle artifact detection #7407

AdoNunes · 2020-03-09T02:25:48Z

I have added a new function detect_muscle in artifact_detection.py. I also included the example and test.

codecov · 2020-03-09T03:47:28Z

Codecov Report

Merging #7407 into master will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #7407   +/-   ##
=======================================
  Coverage   90.10%   90.10%           
=======================================
  Files         452      452           
  Lines       83102    83102           
  Branches    13165    13165           
=======================================
  Hits        74882    74882           
  Misses       5387     5387           
  Partials     2833     2833

AdoNunes · 2020-03-09T04:04:31Z

@larsoner I am again getting a test_nesting error. Not sure if it's my fault...

agramfort

looks pretty clean @AdoNunes !

did you test this code on many subjects @AdoNunes ?

My biggest concern is on the validation for EEG data.

mne/preprocessing/artifact_detection.py

mmagnuski · 2020-03-09T11:47:11Z

I can test with eeg data later (today or tomorrow), a few steps down the road it would be nice to have a simple detection GUI similar to what is available in fieldtrip.

AdoNunes · 2020-03-09T15:33:40Z

looks pretty clean @AdoNunes !

did you test this code on many subjects @AdoNunes ?

My biggest concern is on the validation for EEG data.

@agramfort thanks for your comments. I ran the example on the Brainstorm data only because annotate_movement was run on it. The only difference with EEG will be the threshold, as it has fewer channels.
The function was written by @bloyl (hence his name is in the script authorship). During the code sprint I tested it against the FieldTrip function and the results were almost equivalent. I also ran it on our data and the detection was as expected.

Later, during breaks from my thesis writing, I will address your comments.

agramfort · 2020-03-09T17:21:54Z

The only difference with EEG will be the threshold, as it has fewer channels.

Can you find a parametrization that is immune to the number of channels?

…

mmagnuski · 2020-03-09T17:37:31Z

can you find a parametrization that is immune to the number of channels?

In fieldtrip IIRC they divide by sqrt(n_channels)

AdoNunes · 2020-03-09T18:00:07Z

Actually, in FieldTrip they just take the sum of all the channels' z-scores, so the threshold can be very high, typically 10. Here the mean is taken. Do you suggest dividing the sum by sqrt(n_channels)?
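
For reference, the three normalizations under discussion can be compared on synthetic data. This is only a sketch: it assumes the per-channel z-scores are independent unit-variance noise (real channels are correlated, so the numbers will differ on recordings), and `channel_scores` is a hypothetical helper, not part of the PR.

```python
import numpy as np

rng = np.random.default_rng(0)
n_times = 20_000


def channel_scores(n_channels):
    # Hypothetical stand-in for per-channel z-scored envelopes:
    # independent, zero-mean, unit-variance noise (an assumption).
    z = rng.standard_normal((n_channels, n_times))
    return {
        "sum": z.sum(axis=0),
        "mean": z.mean(axis=0),
        "sum/sqrt(n)": z.sum(axis=0) / np.sqrt(n_channels),
    }


# Spread of each combined score for an EEG-like and a MEG-like channel count
spread = {
    n: {name: float(score.std()) for name, score in channel_scores(n).items()}
    for n in (32, 306)
}
for n, stats in spread.items():
    print(n, {name: round(value, 2) for name, value in stats.items()})
```

Under the independence assumption, the spread of the plain sum grows with the channel count and the spread of the mean shrinks, while sum/sqrt(n) stays close to one, which is why a single threshold could transfer between channel counts.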

agramfort · 2020-03-09T18:03:51Z

I don't have experience with this but trying on different datasets will tell you how much your parameters are data dependent

AdoNunes · 2020-03-10T02:28:26Z

I will work on testing sum(zscores)/sqrt(n_chans) tomorrow.

I wanted to use eegbci data for the example but the sampling rate is 160Hz. I will look for other eeg datasets.

mmagnuski · 2020-03-10T10:55:27Z

mne/preprocessing/artifact_detection.py

+    raw_copy.pick_types(ref_meg=False)  # Remove ref chans just in case
+    # Only one type of channel, otherwise z-score will be biased
+    assert(len(set(raw_copy.get_channel_types())) == 1), 'Different channel ' \
+        'types, pick one type'


Raise an explicit ValueError instead of an AssertionError.
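
A sketch of what that could look like, written against a plain list of channel-type strings so it runs without MNE. The helper name `check_single_channel_type` and the error wording are hypothetical, not the code that was merged.

```python
def check_single_channel_type(ch_types):
    """Return the channel type if all channels share one, else raise.

    ``ch_types`` is a list of strings such as the one returned by
    ``raw.get_channel_types()``. The helper name is hypothetical.
    """
    unique = sorted(set(ch_types))
    if len(unique) != 1:
        # Unlike ``assert``, this check is not stripped when Python runs with -O
        raise ValueError(
            "Expected a single channel type, got %s. Pick one type before "
            "computing z-scores." % (unique,)
        )
    return unique[0]


print(check_single_channel_type(["mag", "mag"]))
try:
    check_single_channel_type(["mag", "grad"])
except ValueError as err:
    print("ValueError:", err)
```

An explicit exception also gives the tests something precise to match on with `pytest.raises`.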

mmagnuski · 2020-03-10T11:09:19Z

I wanted to use eegbci data for the example but the sampling rate is 160Hz. I will look for other eeg datasets.

I think it is better to be more flexible in the filtering step (for example allow the user to specify what frequency range should be considered) than to ignore datasets with such sampling rate.

I am also wondering whether ignoring signal segments with bad annotations in the z score computation would be helpful. (imagine there is some known period with huge artifacts that would bias the z scores) If so, these segments could be nan'ed out (or removed) prior to zscoring.

larsoner · 2020-03-10T14:44:38Z

I am also wondering whether ignoring signal segments with bad annotations in the z score computation would be helpful. (imagine there is some known period with huge artifacts that would bias the z scores) If so, these segments could be nan'ed out (or removed) prior to zscoring.

The way I would do it is make it so that this function ignores any segments that are already marked BAD, it's more or less standard way of handling these things nowadays via a skip_by_annotation kwarg.
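
To illustrate why skipping already-marked segments matters, here is a small NumPy-only sketch. The envelope, the boolean `bad` mask (standing in for BAD annotations), and the `zscore` helper are all made up for the example; no MNE call is used.

```python
import numpy as np

rng = np.random.default_rng(42)
n_times = 5000
envelope = rng.standard_normal(n_times)  # hypothetical single-channel envelope
envelope[1000:1500] += 50.0              # huge artifact that is already marked bad
envelope[3000:3050] += 4.0               # muscle-like burst we still want to find

# Boolean mask standing in for the BAD annotations
bad = np.zeros(n_times, dtype=bool)
bad[1000:1500] = True


def zscore(x):
    # nanmean/nanstd skip NaN samples, so masked segments do not bias the stats
    return (x - np.nanmean(x)) / np.nanstd(x)


z_all = zscore(envelope)                              # bad span included
z_masked = zscore(np.where(bad, np.nan, envelope))    # bad span NaN'ed out

burst = slice(3000, 3050)
print("burst mean z, bad span included:", z_all[burst].mean())
print("burst mean z, bad span NaN'ed:  ", z_masked[burst].mean())
```

With the bad span included, its large amplitude inflates the mean and standard deviation so much that the later burst no longer stands out; with the span set to NaN the burst keeps a clearly elevated z-score.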

AdoNunes · 2020-03-11T04:12:44Z

The way I would do it is make it so that this function ignores any segments that are already marked BAD, it's more or less standard way of handling these things nowadays via a skip_by_annotation kwarg.

I can't find how to use skip_by_annotation; it seems that it is for filtering the data. I added it in the last PR, but I guess it shouldn't be in the filtering function...

agramfort · 2020-03-11T09:14:29Z

See https://mne.tools/dev/generated/mne.io.Raw.html#mne.io.Raw.get_data and the reject_by_annotation parameter

drammock

In addition to the specific comments below, generally speaking I'd encourage you to interleave the code and explanation a little more, instead of having just a few big code blocks. One example is the note about notch-filtering before running muscle artifact detection; it would make sense to have this note come after all the data is loaded, right before a small code block where you run the notch filtering. Similarly, the text about only using one channel type could be right next to the pick_types line.

examples/preprocessing/plot_muscle_detection.py

mne/preprocessing/artifact_detection.py

AdoNunes · 2020-03-12T17:26:19Z

Following the conversation with @mmagnuski in the code suggestions about whether it is a good idea to low-pass filter the z-scores to smooth peaks, I looked at two datasets: the sample Elekta data and the Brainstorm auditory data.

Both recordings are very clean, probably from well-trained subjects; in my experience, data from children or patients does not look that clean. So the advantage of low-pass smoothing is not that clear here.
Below are the z-scores. First row: low-passed; second row: no low-pass; first column: magnetometers; second column: gradiometers.

Magnetometers are more sensitive to muscle activity than planar gradiometers. Looking at the data, I would mark the three peaks of the low-passed z-scores. Without the low-pass, the detection would also find three muscle artifacts if the threshold were increased. However, the magnitude of the third peak is almost twice as big. The first is more sustained but less intense, and in noisier data it might not get picked up.

Below are the z-scores for brainstorm auditory subject one:

and the dataplot for the second peak detected with no low-pass:

This is also a very well-behaved subject; in a visual annotation I would have selected the main peak. Without the smoothing, transient noisy peaks get closer in magnitude to real artifacts. When selecting muscle artifacts in my data, it was annoying to unmark sporadic false-positive peaks.
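
The effect described above can be reproduced on a toy trace. This sketch uses a boxcar moving average as a simple stand-in for the low-pass filter (an assumption, not the filter used in the PR), and the trace, threshold, and window width are invented for illustration.

```python
import numpy as np

n_times = 1000
z = np.zeros(n_times)
z[200] = 8.0        # sporadic one-sample peak (a likely false positive)
z[600:700] = 5.0    # sustained burst, more like a real muscle artifact


def moving_average(x, width):
    # Boxcar smoothing as a simple stand-in for low-pass filtering
    kernel = np.ones(width) / width
    return np.convolve(x, kernel, mode="same")


threshold = 4.0
z_smooth = moving_average(z, width=21)

raw_hits = np.flatnonzero(z > threshold)
smooth_hits = np.flatnonzero(z_smooth > threshold)
print("sample 200 flagged without smoothing:", 200 in raw_hits)
print("sample 200 flagged with smoothing:   ", 200 in smooth_hits)
print("smoothed detections span:", smooth_hits.min(), "to", smooth_hits.max())
```

The isolated peak is averaged away while the sustained burst survives, at the cost of slightly trimmed edges on the detected segment.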

LMKWYT

AdoNunes · 2020-03-26T04:05:15Z

The raw.copy() inside the function is necessary otherwise it will change the raw

If you use, for example, mne.pick_types(raw.info, meg='mag') instead of raw.pick_types(meg='mag') then you'll get back an array of integers, which you can use in a call to raw.get_data(), and then just work with a numpy array instead of the raw object from that point on. That should allow you to avoid making a copy of the whole raw object (including all the channels you're not using). For that matter, raw.get_data() allows you to pass a channel type string, so you could possibly even skip the mne.pick_types() step and just do raw.get_data(picks='mag'). It will always make a copy (but only of the channels you ask for and only the data values), so there is no risk of modifying the original raw object.

I see. I think that at this point, I would prefer to leave it for another PR. This function will go into the 0.21 version and there will be time if it is needed.

bloyl · 2020-03-26T13:05:48Z

+1 for delaying removing raw.copy() until a different PR.

The complication is raw_copy.filter() and raw_copy.apply_hilbert(), which act in place. While doable, I think some reorganization of those methods would be needed for them to act directly on data arrays.

drammock

only a few minor docstring nitpicks left at this point. +1 for merge.

mne/preprocessing/artifact_detection.py

drammock · 2020-03-26T20:02:23Z

You were incorrect.

…

-------- Original Message --------

On Mar 26, 2020, 12:34, Adonay Nunes wrote:

In mne/preprocessing/artifact_detection.py:

+ `filter_freq` whose envelope magnitude exceeds the specified z-score
+ threshold (when summed across channels and divided by `sqrt(n_channels)`).

@drammock always double backticks? I thought function parameters have one and its values two?

AdoNunes · 2020-03-26T20:30:19Z

only a few minor docstring nitpicks left at this point. +1 for merge.

+10 for merge

examples/preprocessing/plot_muscle_detection.py

mne/preprocessing/artifact_detection.py

AdoNunes · 2020-03-29T17:27:16Z

@larsoner @agramfort I thought that the PR was already merged after 2 approvals?

drammock · 2020-03-29T17:52:25Z

@larsoner @agramfort I thought that the PR was already merged after 2 approvals?

Please exhibit some patience. Version 0.20 just shipped (which is a lot of work); everyone is dealing with a global pandemic, and it is the weekend. I understand that you have invested a lot of time and energy in this PR, and maybe are eager to be done with it. Don't get pushy. Some PRs are merged after one approval, some after two, sometimes more. It depends on the content of the PR and the expertise of the person(s) reviewing it.

AdoNunes · 2020-04-06T23:20:26Z

In case you are waiting for me to do something, plz let me know it. I just marked the conversations as resolved in case you were waiting for it.

mmagnuski · 2020-04-07T10:32:27Z

Hi @AdoNunes, it would be good to just add a test for the ValueError part you added, you can use:

with pytest.raises(ValueError, match="No M/EEG channel types found"):
    # call annotate_muscle_zscore with data without meg or eeg

and all would be good. :)

AdoNunes · 2020-04-07T15:52:16Z

with pytest.raises(ValueError, match="No M/EEG channel types found"):
    # call annotate_muscle_zscore with data without meg or eeg

@mmagnuski what a surprise 😆
What should I put as an indented block from the pytest.raises?

drammock · 2020-04-07T16:15:14Z

@mmagnuski what a surprise

It shouldn't be a surprise. We try to test every single warning and error that we raise, to make sure that they are actually happening in the conditions that we want them to happen.

What should I put as an indented block from the pytest.raises?

As @mmagnuski said, put in a call to your function, passing in data that doesn't have meg or eeg channels.
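
A sketch of the shape such a test takes. `annotate_muscle_stub` is a hypothetical stand-in that only mimics the channel-type check, so the example runs without MNE or test data; in the real test the call inside the `with` block would be the PR's function applied to data that has no MEG or EEG channels.

```python
import pytest


def annotate_muscle_stub(ch_types):
    """Hypothetical stand-in for the real function: only checks channel types."""
    if not any(ch_type in ("mag", "grad", "eeg") for ch_type in ch_types):
        raise ValueError("No M/EEG channel types found")
    return []


def test_no_meeg_channels():
    # The indented block under pytest.raises is simply the call expected to fail
    with pytest.raises(ValueError, match="No M/EEG channel types found"):
        annotate_muscle_stub(["stim", "eog"])


# pytest would collect and run the test; calling it directly also works
test_no_meeg_channels()
```

If the call inside the block does not raise, or raises with a message that does not match, `pytest.raises` makes the test fail, which is what guards the error path against regressions.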

AdoNunes · 2020-04-07T16:26:41Z

@drammock Sorry, I don't get it. Now I am more surprised (well, confused); I thought that I had to put it in annotate_muscle_zscore.
Do I have to create a test function in test_artifact_detection?

drammock · 2020-04-07T16:38:53Z

Quoting from the contributor guide:

All new functionality must have test coverage

For example, a new mne.Evoked method in mne/evoked.py should have a corresponding test in mne/tests/test_evoked.py.

So yes, the test should go in mne/preprocessing/tests/test_artifact_detection.py

AdoNunes · 2020-04-07T18:54:47Z

@mmagnuski done, the new test went through all the checks. All good now? 😄

larsoner · 2020-04-07T19:52:14Z

Thanks @AdoNunes !

mmagnuski · 2020-04-07T20:18:29Z

👍

AdoNunes added 2 commits March 8, 2020 19:24

muscle artifact detection

7c5e680

docstrings

e42ed5c

docstrings

b705ae7

agramfort reviewed Mar 9, 2020

AdoNunes and others added 6 commits March 9, 2020 16:20

Alex suggestions

73351db

Co-Authored-By: Alexandre Gramfort

example

24da9f6

Merge branch 'master' of git://github.com/mne-tools/mne-python into d…

97f6089

…etect_muscle

example

72c8c4d

Merge branch 'detect_muscle' of https://github.com/AdoNunes/mne-python …

97d3ac4

…into detect_muscle

for eeg

39be3d3

mmagnuski reviewed Mar 10, 2020

skip_by_annotation

ab04ad7

drammock requested changes Mar 11, 2020

jasmainak reviewed Mar 11, 2020 (three reviews, each on mne/preprocessing/artifact_detection.py, now outdated)

logger

5701cb0

AdoNunes requested a review from drammock March 26, 2020 04:25

drammock approved these changes Mar 26, 2020

docstrings

416ab47

mmagnuski reviewed Mar 29, 2020

examples/preprocessing/plot_muscle_detection.py (outdated)

mne/preprocessing/artifact_detection.py (outdated)

mne/preprocessing/artifact_detection.py

ch type

269277e

AdoNunes added 3 commits April 7, 2020 10:08

test no meeg data

d82c614

test no meeg data

2e1531d

fix docstring

d1d448d

larsoner merged commit e3cc4b1 into mne-tools:master Apr 7, 2020

hoechenberger mentioned this pull request Sep 16, 2020

MAINT: Release 0.21 #8150

Closed
