I totally agree with your idea on applying global autoreject before ICA or simply applying PTP and excluding large artifacts. However, the autoreject example and both of the threads that I mentioned suggest local autoreject before ICA and I don’t know why, so I get confused.
In the example we have:
We recommend that you first highpass filter the data, then run autoreject (local) and supply the bad epochs detected by it to the ICA algorithm for a robust fit, and finally run autoreject (local) again.
Also take a look at the followings: