Creating subsets of Epochs for Sleep stage classification from polysomnography (PSG) data exercise

JohnAtl · June 13, 2023, 10:06pm

I’m working through this exercise:

Fetch 50 subjects from the Physionet database and run a 5-fold cross-validation leaving each time 10 subjects out in the test set.

Easy-peasy, right?

I can’t seem to figure out how to go from a bunch of datasets to a bunch of epochs, then use train_ndx, test_ndx from skf.split() to select from those epochs and call pipe.fit() with them.
In the code below, the problem is that X_train is a plain Python list of single-epoch Epochs objects, and a list (naturally) doesn’t have a compute_psd() method.

I would greatly appreciate any suggestions!

pipe = make_pipeline(
    FunctionTransformer(eeg_power_band, validate=False),
    RandomForestClassifier(n_estimators=100, random_state=42),
)

from sklearn.model_selection import StratifiedKFold

# create lists of epochs and corresponding events
X = []
y = []
skf = StratifiedKFold(n_splits=5)
for ndx in range(len(epochs)):
    epochs[ndx].drop_bad()
    for ep_ndx in range(len(epochs[ndx])):
        X.append(epochs[ndx][ep_ndx])
        y.append(epochs[ndx][ep_ndx].events[:, 2][0])

# skf.split() gives us indices into the lists to create stratified train & test epochs
for train_ndx, test_ndx in skf.split(np.zeros(len(y)), y):
    X_train = []
    y_train = []
    for ndx in train_ndx:
        X_train.append(X[ndx])
        y_train.append(y[ndx])
    X_test = []
    y_test = []
    for ndx in test_ndx:
        X_test.append(X[ndx])
        y_test.append(y[ndx])

    # fit the training data (errors here)
    pipe.fit(X_train, y_train)
    y_pred = pipe.predict(X_test)
    acc = accuracy_score(y_test, y_pred)
    print(f"Accuracy score: {acc}")

agramfort · June 14, 2023, 7:11am

Just make one big NumPy array stacking all the data, so that X has shape (n_epochs, n_channels, n_times) and y has shape (n_epochs,).
Then make a variable called group of shape (n_epochs,) such that group[i] = j means epoch i belongs to subject j.

Then use https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.StratifiedGroupKFold.html
as the cv in a cross_val_score call, passing group as its groups argument.

This assumes your estimator can take 3D inputs.

HTH
A
