Will the test data set undergo all the operations in clf?

YuZhou · May 6, 2022, 4:41am

MNE version: e.g. 0.24.0

Hey guys!
I have a naive question about this tutorial:
https://mne.tools/dev/auto_examples/decoding/decoding_time_generalization_conditions.html?highlight=generalizingestimator

specifically in this part:

clf = make_pipeline(
    StandardScaler(),
    LogisticRegression(solver='liblinear')  # liblinear is faster than lbfgs
)
time_gen = GeneralizingEstimator(clf, scoring='roc_auc', n_jobs=None,
                                 verbose=True)

# Fit classifiers on the epochs where the stimulus was presented to the left.
# Note that the experimental condition y indicates auditory or visual
time_gen.fit(X=epochs['Left'].get_data(),
             y=epochs['Left'].events[:, 2] > 2)
scores = time_gen.score(X=epochs['Right'].get_data(),
                        y=epochs['Right'].events[:, 2] > 2)

My question is: will the test data, epochs['Right'].get_data(), also be standardized by the StandardScaler() in clf, or will only the training data, epochs['Left'].get_data(), be standardized?

agramfort · May 6, 2022, 9:39am

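Yes, the test data will undergo the same preprocessing.

That's how the scikit-learn pipeline API works: the scaler is fitted on the training data and then applied to the test data when scoring.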

Alex
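
To make the answer above concrete, here is a minimal sketch using plain scikit-learn, without MNE. The synthetic arrays X_train and X_test are assumptions that stand in for a single time point of the 'Left' and 'Right' epochs; they are not the tutorial's data. The sketch checks two things: that the scaler's statistics come from the training data only, and that the pipeline's predictions on the test data match scaling the test data by hand with the fitted scaler before classifying.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Synthetic stand-ins (assumption): one time point of the "Left" (train)
# and "Right" (test) epochs, shaped (n_epochs, n_channels).
X_train = rng.normal(loc=5.0, scale=2.0, size=(100, 3))
y_train = X_train[:, 0] > 5.0
X_test = rng.normal(loc=5.0, scale=2.0, size=(40, 3))
y_test = X_test[:, 0] > 5.0

# Same pipeline as in the tutorial snippet.
clf = make_pipeline(StandardScaler(), LogisticRegression(solver='liblinear'))
clf.fit(X_train, y_train)

# make_pipeline names each step after its lowercased class name.
scaler = clf.named_steps['standardscaler']
logreg = clf.named_steps['logisticregression']

# 1) The scaler's mean was estimated from the training data only.
train_stats_used = np.allclose(scaler.mean_, X_train.mean(axis=0))

# 2) Predicting through the pipeline is the same as transforming the test
#    data with the already-fitted scaler and then calling the classifier.
manual_pred = logreg.predict(scaler.transform(X_test))
same_predictions = np.array_equal(clf.predict(X_test), manual_pred)

print(train_stats_used, same_predictions)
```

Both checks should print True. As far as I understand, GeneralizingEstimator fits a separate copy of clf at each training time point, so the same logic would apply there: each copy's scaler is fitted on the 'Left' data and reused unchanged on the 'Right' data during score.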
