CSP component order selection #8151

mbillingr · 2020-08-24T08:25:57Z

Reference issue

Resolves #8074.

What does this implement/fix?

- Add new parameter to CSP to select how to order components.
- Add additional tests to make sure nothing broke.
- (optional) Refactoring
  - Moved duplicated data type check into _check_Xy.
  - Split .fit() into smaller functions for improved readability.

Additional information

I left the refactoring in for now because I already made it.
You may prefer not to have these changes in order to keep the code closer to the original from pyRiemann.
Let me know :)

cbrnr · 2020-08-24T11:14:14Z

I'll take a detailed look soon, but here are already three questions:

1. I don't quite understand the difference between 'new' and 'mutual_info' - aren't both methods sorting by mutual information? Would it make sense to combine these methods into one?
2. Why do we need a None value for the component_order parameter?
3. Does the behavior of CSP change with this PR? If so, we need a deprecation cycle, but I'd prefer if the default behavior doesn't change.

mbillingr · 2020-08-24T13:31:05Z

1. 'new' sorts by abs(eig - 0.5) and 'mutual_info' sorts by mutual information. In the two-class case they seem to result in the same order, but in the multi-class case you can't use 'new'. I'd rather not combine them because pyRiemann distinguishes these two cases.
2. None is the current behavior, which remains the default. It chooses between 'new' and 'mutual_info' based on the number of classes. Maybe 'auto' would feel more consistent than None?
3. The default behavior of CSP does not change. This PR does not touch any of the existing tests and they still pass.
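As a rough illustration of the abs(eig - 0.5) ordering mentioned above, here is a minimal NumPy sketch. The eigenvalues are made-up numbers and the variable names are assumptions for illustration; this is not taken from the PR diff.

```python
import numpy as np

# Hypothetical eigenvalues from a two-class CSP decomposition (each in [0, 1]).
eigen_values = np.array([0.10, 0.45, 0.52, 0.80, 0.97])

# Components whose eigenvalue is farthest from 0.5 come first.
order = np.argsort(np.abs(eigen_values - 0.5))[::-1]

print(order)                # component indices, most discriminative first
print(eigen_values[order])  # eigenvalues in that order
```

With these numbers the component at 0.97 is ranked first and the one at 0.52 last, since 0.52 is closest to 0.5.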

larsoner · 2020-08-24T18:53:43Z

> 'new'

I would maybe use extrema or extreme or deviation or something, because it's more about which are closer to the extremes of the possible values of the eigh result (0 and 1).

Also if mutual_info is expected to generally be better, we should just make it the default and call it a bug fix

cbrnr · 2020-08-25T07:32:16Z

There are too many choices for my taste. Previously, we didn't have a parameter to select the sorting order - the function used MI-based sorting for both two- and multi-class cases, even though the implementation differed. The problem with that was that the "classic" approach of sorting patterns by alternating sorted eigenvalues was not available. IMO, it would be sufficient to have two possible values for the parameter, "mutual_info" and "alternating" or something like that.

agramfort · 2020-08-25T08:27:20Z

I am not sure about the name alternating. I started reading too. Looking at https://www.sciencedirect.com/topics/engineering/common-spatial-pattern, see the sentence: "Since λ_{1j} + λ_{2j} = 1, a high value for λ_{1j} means that the filter output based on filter w_j yields a high variance for input signals in class 1 and a low variance for signals in class 2 (and vice versa); spatial filtering with such filters can thus significantly enhance discrimination ability." This suggests that you should sort by the distance to 0.5. I checked the maths and I agree. I don't know where the alternating option comes from. So this suggests we need 2 options: 'variance' (dist to 0.5) and 'mutual_information', and eventually a third option, 'alternating'. I would like to see a ref in the docstring for the 3 strategies so it does not seem we came up with that in MNE... thx
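The relation quoted above can be written out as a short derivation. This is a sketch under the usual two-class CSP assumptions, which are not stated in the thread: C_1 and C_2 are the class covariance matrices, and each filter w_j solves the generalized eigenvalue problem below with the normalization w_j^T (C_1 + C_2) w_j = 1.

```latex
% Assumed setup: two-class CSP with class covariances C_1, C_2 and
% filters w_j normalized so that w_j^T (C_1 + C_2) w_j = 1.
\begin{align}
C_1 w_j &= \lambda_{1j}\,(C_1 + C_2)\,w_j, \\
\lambda_{1j} &= w_j^\top C_1 w_j, \qquad \lambda_{2j} = w_j^\top C_2 w_j, \\
\lambda_{1j} + \lambda_{2j} &= w_j^\top (C_1 + C_2)\,w_j = 1, \\
\left|\lambda_{1j} - \tfrac{1}{2}\right| &= \tfrac{1}{2}\left|\lambda_{1j} - \lambda_{2j}\right|.
\end{align}
```

Under these assumptions, the distance of an eigenvalue from 0.5 is half the difference between the two class variances of the filtered signal, which is why a larger distance indicates a more discriminative filter.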

…

cbrnr · 2020-08-25T08:32:45Z

@agramfort see my comment here.

Currently we have no options, but we're already doing two different things for the two- and multi-class case. Why do you want to introduce a new argument for something we already did without?

Also, am I correct in assuming that the distance to 0.5 is not equal (not even conceptually) to sorting by mutual information?

Even if these are not equal, I'd still summarize them as one method (the advanced), whereas the alternating one (the naive) should be there for practical reasons (it has been published, and many people in the BCI community have been using that - not sure if this is still the case though).

agramfort · 2020-08-25T08:42:57Z

> Currently we have no options, but we're already doing two different things for the two- and multi-class case. Why do you want to introduce a new argument for something we already did without?

the question is do we want mutual info in the 2d case? if so we need a parameter.

> Also, am I correct in assuming that the distance to 0.5 is not equal (not even conceptually) to sorting by mutual information?

correct.

> Even if these are not equal, I'd still summarize them as one method (the advanced), whereas the alternating one (the naive) should be there for practical reasons (it has been published, and many people in the BCI community have been using that - not sure if this is still the case though).

so we still want to support the alternating option? It's your call if you think it's useful

cbrnr · 2020-08-25T08:55:58Z

> the question is do we want mutual info in the 2d case? if so we need a parameter.

I don't think so, this PR just adds the classic alternating sorting. I think it's still useful because the question of how we sort using the new method(s) comes up frequently. However, we could also go with @larsoner's suggestion and just implement the new approach. This would mean no new parameter, no change to the code, just add links to the literature to explain the current sorting. I'm fine with either approach, but I am 👎 on introducing a new parameter with three or four possible values.

agramfort · 2020-08-25T09:03:00Z

@cbrnr so what do you suggest in the 2d case? what is called old or new here?

cbrnr · 2020-08-27T06:51:35Z

@agramfort I think your comment about the distance to 0.5 equalling variance is not quite correct. After all, CSP is designed to maximize the variance difference between/within conditions, so all criteria will somehow work on the variance (at least indirectly).

Therefore, I suggest that we keep things simple (at the risk of being inaccurate) and use the following values for the component_order parameter:

- mutual_info: this is the current behavior (and the default); it uses the distance from 0.5 in the 2D case and MI in the ND case.
- alternate: this is the classic Blankertz 2008 behavior that can be enabled for the 2D case. If this argument is used for more than 2 classes, we throw an error.
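A minimal sketch of what an alternating order could look like, assuming it interleaves components from the two ends of the sorted eigenvalue spectrum and starts with the largest. The function name `alternate_order`, the starting end, and the error message are illustrative assumptions, not the merged implementation.

```python
import numpy as np


def alternate_order(eigen_values, n_classes=2):
    """Interleave components from both ends of the eigenvalue spectrum.

    Hypothetical helper: returns indices ordered as largest, smallest,
    second largest, second smallest, and so on.
    """
    if n_classes != 2:
        # The proposal above is to raise for more than two classes.
        raise ValueError("'alternate' ordering requires exactly 2 classes, "
                         f"got {n_classes}.")
    ascending = np.argsort(eigen_values)
    n = len(ascending)
    order = np.empty_like(ascending)
    order[0::2] = ascending[n // 2:][::-1]  # largest, second largest, ...
    order[1::2] = ascending[:n // 2]        # smallest, second smallest, ...
    return order


# Made-up eigenvalues for illustration.
eigen_values = np.array([0.52, 0.97, 0.10, 0.80, 0.45, 0.30])
order = alternate_order(eigen_values)
print(eigen_values[order])
```

For the example values this yields the sequence 0.97, 0.10, 0.80, 0.30, 0.52, 0.45, so the first few components alternate between the two classes.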

cbrnr · 2020-08-27T06:54:11Z

In addition, I would rename component_order to order_components or order_filters.

agramfort · 2020-08-27T07:13:29Z

I find the term mutual_info for the distance to 0.5 a bit misleading but I will not fight :)

…

cbrnr · 2020-08-27T07:17:56Z

That's the inaccuracy I was talking about 😄 - I think it's more useful to keep it simple, and people can always read the docstring to find the exact methods with references. Also, we still haven't ruled out that the distance to 0.5 is somehow related to the mutual information as a special case. @alexandrebarachant maybe?

agramfort · 2020-08-27T07:37:16Z

fair enough...

mbillingr · 2020-08-27T13:02:42Z

> use the following values for the component_order parameter:
>
> - mutual_info: this is the current behavior (and the default); it uses the distance from 0.5 in the 2D case and MI in the ND case.
> - alternate: this is the classic Blankertz 2008 behavior that can be enabled for the 2D case. If this argument is used for more than 2 classes, we throw an error.

Does everybody agree now it should be done like that?

cbrnr · 2020-08-27T14:22:12Z

@mbillingr and @agramfort? Plus I suggest using order_components - also OK with you?

mbillingr · 2020-08-27T14:47:10Z

So we'll have order_components := 'mutual_info' | 'alternate'. OK with me.

agramfort · 2020-08-27T14:48:17Z

fine with me

…

cbrnr · 2020-09-02T09:26:24Z

I implemented the changes, let's see if tests still come back green.

cbrnr · 2020-09-02T14:13:54Z

@larsoner I've implemented all suggestions, please check if the examples are OK and if the footbibliography works.

agramfort

@larsoner merge if happy

larsoner

Scores changed slightly in this one but it seems okay:

cbrnr · 2020-09-02T14:37:01Z

The title "Decoding in time-frequency space data using the Common Spatial Pattern (CSP)" could be shortened to "Decoding in time-frequency space using Common Spatial Patterns (CSP)".

larsoner · 2020-09-02T14:38:10Z

Feel free to push this change if you want

cbrnr · 2020-09-02T14:38:24Z

By "scores changed slightly" I'm assuming you mean coverage - where is our coverage CI?

larsoner · 2020-09-02T14:41:21Z

> By "scores changed slightly" I'm assuming you mean coverage

No, I mean the decoding scores in the bar plots, comparing master with this PR.

For coverage, codecov comments inline (eventually) when there are un-covered lines.

cbrnr · 2020-09-02T14:46:35Z

Got it. The scores changed quite a lot - @mbillingr do you have an idea what could have caused this change? I thought that this PR didn't change the previous (default) behavior?

mbillingr · 2020-09-02T15:18:36Z

Run-to-run variability due to cross-validation shuffling?

larsoner · 2020-09-02T15:23:46Z

Could be -- @cbrnr or @mbillingr please check and if so set the random state so it's reproducible
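To illustrate why a fixed random state makes shuffled cross-validation reproducible, here is a small standard-library sketch. The helper `shuffled_folds` is hypothetical and only mimics the idea of a seeded, shuffled split; it is not the code used in the example under discussion.

```python
import random


def shuffled_folds(n_samples, n_splits, seed=None):
    """Shuffle sample indices and deal them into n_splits test folds.

    Hypothetical helper: with seed=None the shuffle differs between
    runs, with a fixed seed it is identical every time.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [sorted(indices[i::n_splits]) for i in range(n_splits)]


first = shuffled_folds(12, 3, seed=42)
second = shuffled_folds(12, 3, seed=42)
print(first == second)  # True: same seed, same folds
```

Without a seed, each run can assign samples to different folds, so decoding scores may vary from run to run even if the estimator itself is unchanged.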

larsoner · 2020-09-02T15:47:14Z

Indeed there is a missing random_state. Setting it to 42 I get the same image on master and this PR:

larsoner · 2020-09-02T23:25:20Z

@mbillingr @cbrnr !

* move data type check into function that checks input data
* allow selection of csp decomposition algorithm
* replace selection of decomposition by selection of ordering
* remove redundant comments
* flake8
* remove unused function
* pydocstyle
* Simplify component_order argument
* Update test
* Update docstring
* Fix component_order attribute error
* Add whats_new entry
* fix default in component_order docstring
* test for exception in case of wrong component_order
* Minor improvements to examples
* Test invalid component_order/number of classes combination
* Use utils functions to validate types and check options
* Use footbibliography
* Merge tests
* Better example title
* Remove empty field in bib entry
* Add DOI
* FIX: Seed
* FIX: Test
* FIX: Dup
* DOC: DOI
* FIX: Test

Co-authored-by: Clemens Brunner
Co-authored-by: Eric Larson

mbillingr changed the title ~~Csp~~ [Wip] Csp component order selection Aug 24, 2020

larsoner added this to the 0.21 milestone Aug 24, 2020

cbrnr changed the title ~~[Wip] Csp component order selection~~ [Wip] CSP component order selection Aug 27, 2020

cbrnr changed the title ~~[Wip] CSP component order selection~~ CSP component order selection Sep 2, 2020

mbillingr added 7 commits September 2, 2020 11:49

move data type check into function that checks input data

8f479e5

allow selection of csp decomposition algorithm

1fbbf1f

replace selection of decomposition by selection of ordering

f281071

remove redundant comments

d170a48

flake8

b6da500

remove unused function

dc861fd

pydocstyle

421f784

cbrnr added 3 commits September 2, 2020 16:09

Use utils functions to validate types and check options

e21b8d2

Use footbibliography

53d71f6

Merge tests

8f16e20

agramfort approved these changes Sep 2, 2020

larsoner approved these changes Sep 2, 2020

Better example title

9fa5fe3

cbrnr added 2 commits September 2, 2020 16:43

Remove empty field in bib entry

e7ea9a8

Add DOI

a2c0b0e

larsoner added 3 commits September 2, 2020 11:48

FIX: Seed

6c76be1

FIX: Test

6c0d525

FIX: Dup

9704b1e

cbrnr reviewed Sep 2, 2020

larsoner added 2 commits September 2, 2020 12:27

DOC: DOI

d34de52

FIX: Test

5c1c51e

agramfort approved these changes Sep 2, 2020

larsoner merged commit 9f4ace5 into mne-tools:master Sep 2, 2020

mbillingr deleted the csp branch September 3, 2020 06:45

larsoner mentioned this pull request Feb 19, 2021

CSP questions #7482

Closed
