Possibility of Data leakage during Preprocessing Step - ICA artefact repair/removal

kathlin42 · September 19, 2022, 8:51am

Hi everyone,
dear MNE Team,

I have a general question regarding MEG/EEG preprocessing and its influence on later machine learning analyses about which I have been pondering for a while now:

Could using ICA preprocessing (repairing artefacts; as explained here ICA MNE Tutorial) influence later decoding/ machine learning approaches in the sense of data leakage from train to test sense?

I had some interesting discussions about it with diverse opinions and would be very interested in and grateful for your assessment.
Or whether you know if someone ever tried to systematically explore it…

Many thanks for your reply!
Best,
Katharina

agramfort · September 19, 2022, 9:34am

hi,

honestly I would not consider data cleaning first. I would setup my ML pipeline
and see what I obtain with the raw data. Time by time decoding for example
is very robust.

Alex

cbrnr · September 19, 2022, 10:11am

That’s a good question. I’d say you’re probably relatively “safe”, because ICA is an unsupervised method, i.e. doesn’t require any labels. Of course this doesn’t rule out information leakage completely, but you’d have to be much more specific if you really suspect that this is what’s happening.

kathlin42 · September 19, 2022, 1:32pm

Thanks a lot for your reply!

skjerns · September 21, 2022, 7:30am

In my experience, decoders don’t care too much about ICA cleanup. They’re pretty good at ignoring noise that does not contain information.

Example results, no difference of applying ICA or not applying it

Topic		Replies	Views
Unusual failure mode of ICA in particular subject Support & Discussions preprocessing , meg , ica	6	853	July 22, 2021
Apply ICA before or after epoching/cleaning? Support & Discussions preprocessing , eeg , ica	2	2543	July 21, 2021
ICA advice for cleaning long (~9 hour) EEG recordings Support & Discussions preprocessing , eeg , ica , epochs	1	510	October 19, 2021
ICA components labeling Support & Discussions	1	293	September 21, 2022
steps to clean noisy data and artifacts Support & Discussions	4	408	October 7, 2021

Possibility of Data leakage during Preprocessing Step - ICA artefact repair/removal

Related topics