I've come to know that normalization (MinMax scaling) and standardization (Z-score normalization) on data have different influences from outliers in the data. In About Feature Scaling and Normalization, the author Sebastian Raschka says:
"(Min-Max scaling...) The cost of having this bounded range - in contrast to standardization - is that we will end up with smaller standard deviations, which can suppress the effect of outliers."
In his book Python Machine Learning, 2nd Edition, he mentioned:
"Furthermore, standardization maintains useful information about outliers and makes the algorithm less sensitive to them in contrast to min-max scaling, which scales the data to a limited range of values."
What does he mean that MinMax scaling can suppress the effect of outliers compared to standardization?
What does he mean that standardization maintains useful information about the outliers and makes algorithm less sensitive to them in contrast to min-max scaling?
Can this be explained from the perspective of the equations?