Variance of ICA Components

lmctx13 · September 6, 2022, 8:08pm

Hello!

I am running ICA in order to process EEG data, and I would like to know the variance for each component that is removed. I can access this information if I create an HTML report, but is there a way to export this information to a dataframe or table?

Here is some necessary information:

MNE version: 1.0.3
operating system: Windows 11

Thank you for your help!

richard · September 6, 2022, 8:33pm

Hi, please take a look at the following example, which should answer your question.

# %%
import mne

sample_dir = mne.datasets.sample.data_path()
sample_fname = sample_dir / 'MEG' / 'sample' / 'sample_audvis_raw.fif'

raw = (
    mne.io.read_raw_fif(sample_fname)
    .crop(tmax=60)
    .pick_types(eeg=True)
    .load_data()
)

# %% Fit ICA
ica = mne.preprocessing.ICA(n_components=15, method='picard')
ica.fit(raw)

# %% Retrieve explained variance
# unitize variances explained by PCA components, so the values sum to 1
pca_explained_variances = ica.pca_explained_variance_ / ica.pca_explained_variance_.sum()

# Now extract the variances for those components that were used to perform ICA
ica_explained_variances = pca_explained_variances[:ica.n_components_]

for idx, var in enumerate(ica_explained_variances):
    print(
        f'Explained variance for ICA component {idx}: '
        f'{round(100 * var, 1)}%'
    )

Explained variance for ICA component 0: 66.9%
Explained variance for ICA component 1: 11.3%
Explained variance for ICA component 2: 3.4%
Explained variance for ICA component 3: 2.4%
Explained variance for ICA component 4: 1.9%
Explained variance for ICA component 5: 1.6%
Explained variance for ICA component 6: 1.3%
Explained variance for ICA component 7: 1.1%
Explained variance for ICA component 8: 1.0%
Explained variance for ICA component 9: 0.8%
Explained variance for ICA component 10: 0.7%
Explained variance for ICA component 11: 0.6%
Explained variance for ICA component 12: 0.6%
Explained variance for ICA component 13: 0.6%
Explained variance for ICA component 14: 0.5%

Best wishes,
Richard

richard · September 7, 2022, 6:15am

(Note, there was a bug in my above code related to unitizing the variances. I have now fixed this, and also updated the produced output.)

lmctx13 · September 7, 2022, 4:15pm

This is very helpful. Thank you, Richard!

richard · September 7, 2022, 8:01pm

@lmctx13

@agramfort just pointed out on GitHub that this approach is actually mathematically incorrect. We’re working on something that will make it easy for users to directly retrieve the explained variance from the ICA object after fitting. I’ll let you know when it’s ready (end of this week). If you’re curious, you can track our progress here:

github.com/mne-tools/mne-python

Introduce ICA.explained_variance_ratio_ to easily retrieve relative explained variances after a fit

mne-tools:main ← hoechenberger:ica-explained-variance

opened 01:00PM - 07 Sep 22 UTC

hoechenberger

+63 -9

It's quite common that users want to know the variance explained by all (or indi…vidual) ICA components. Such a question just recently showed up [on the forum](https://mne.discourse.group/t/variance-of-ica-components/). Up until now, this operation is actually not that simple, because it requires some understanding of the internals of MNE's ICA implementation. The code snippet currently required would look something like this: ```python pca_explained_variance_ratio = ica.pca_explained_variance_ / ica.pca_explained_variance_.sum() ica_explained_variance_ratio = pca_explained_variance_ratio[:ica.n_components_] print(ica_explained_variance_ratio) ``` This PR simplifies this to: ```python print(ica.explained_variance_ratio_) ``` I'm looking for feedback on the way I implemented this. Unlike the other "trailing underscore attributes" which only come into existence after fitting, `ICA.explained_variance_ratio_` is a `@property` getter function that exists straight from the beginning when `ICA` is instantiated. However, for unfitted instances, it would just return `None`. I felt this approach was much cleaner and "Pythonic", but obviously it deviates from the approach taken elsewhere (attribute not existing unless `fit()` has been called). What do you think? MWE: ```python # %% import mne mne.set_log_level('WARNING') sample_dir = mne.datasets.sample.data_path() sample_fname = sample_dir / 'MEG' / 'sample' / 'sample_audvis_raw.fif' raw = ( mne.io.read_raw_fif(sample_fname) .crop(tmax=60) .pick_types(eeg=True) .load_data() ) # %% Fit ICA ica = mne.preprocessing.ICA(n_components=15, method='picard') print('Before fit:') print(ica.explained_variance_ratio_) ica.fit(raw) print('\nAfter fit:') print(ica.explained_variance_ratio_) ``` produces: ``` Before fit: Explained variance ratio is None (model not yet fitted) None After fit: [0.66904967 0.11312061 0.03440267 0.02412908 0.01893129 0.01598829 0.01277193 0.01095065 0.00986819 0.00817902 0.00736726 0.00627682 0.00582979 0.00553176 0.00541417] ```

Best wishes,
Richard

lmctx13 · September 9, 2022, 2:15pm

That sounds great, and I will check it out. Thank you so much!

eort · August 1, 2023, 2:11pm

Hi @richard,

Thanks for that function! I would have a question regarding the interpretation of those values. As is mentioned in the documentation, because ica components are not orthogonal, negative explained variances are possible. Based, on the formula how the variance is computed:

github.com

mne-tools/mne-python/blob/578582248542e4447c18715a461a884e99ee8919/mne/preprocessing/ica.py#L1221C6-L1221C64


      
          var_explained_ratio = 1 - mean_var_diff / mean_var_orig

that would occur when the variance of the data after removal of the components has more variance than the original data (so mean_var_diff > mean_var_orig). Intuitively this sounds, as if it wouldn’t be a good idea to remove those components then (if they increase the variance).
Do you happen to have more insights under which circumstances these negative variances can occur, what that means and what to do about it?

Thanks,
Eduard

richard · August 1, 2023, 3:18pm

Hello, it’s actually quite possible that the removal of one specific component increases variance of the reconstructed data. The algorithm implemented here is the one used by EEGLAB; I suggest you read up on it here:

Search that page for “pvaf”, which is the metric we’re calculating.

Best wishes,
Richarfd

eort · August 1, 2023, 7:14pm

Thanks! Yeah, I read that, unfortunately, they don’t give much info on the how’s and why’s of this phenomenon/algorithm. But, I guess if it’s rather normal, there is no reason to worry.

Thanks,
Eduard

richard · August 1, 2023, 8:32pm

You could maybe ask on the EEGLAB mailing list

kism · August 13, 2024, 1:39pm

Hello, I have a related question: I am consistently getting negative values for the explained variances on filtered data (highpass, lowpass and Maxwell together) after removing components with ica.exclude function based on eog/ecg_inds. I have already asked for assistance with listing the individual variances of the components: ica.get_explained_variance_ratio for individual ICA components - #4 by richard
Any explanation as to why this might be happening? I appreciate any insights you might have :).
Thank you,
M

Topic		Replies	Views
ica.get_explained_variance_ratio for individual ICA components Support & Discussions preprocessing	5	84	August 2, 2024
How to interpret variance AU plot and histogram in ica_plot_properties? Support & Discussions eeg , visualization , ica	6	655	May 11, 2022
numerical value of ICA components Support & Discussions preprocessing , ica	3	153	January 31, 2024
Noisy components not being removed from EEG Support & Discussions preprocessing , eeg , epochs	4	292	February 13, 2023
Optimal number of ICA components Support & Discussions preprocessing , ica	1	115	October 1, 2024

Variance of ICA Components

Related topics