What is the primary advantage of using the Matthews Correlation Coefficient (MCC) over accuracy for binary classification on imbalanced datasets?
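
One way to make the question concrete is a small sketch comparing the two metrics on an imbalanced sample. It assumes the standard definition MCC = (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). Returning 0 when the denominator is zero is a common convention adopted here as an assumption. The function names and the 95/5 class split are illustrative and not taken from any particular library or dataset.

```python
import math


def confusion_counts(y_true, y_pred):
    """Count TP, TN, FP, FN for binary labels encoded as 0 (negative) and 1 (positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def mcc(tp, tn, fp, fn):
    """Matthews Correlation Coefficient computed from the four confusion-matrix cells."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Assumed convention: an undefined MCC (an empty row or column) is reported as 0.
        return 0.0
    return (tp * tn - fp * fn) / denom


# Illustrative imbalanced sample: 95 negatives and 5 positives.
y_true = [0] * 95 + [1] * 5
# A degenerate classifier that always predicts the majority (negative) class.
y_pred = [0] * 100

counts = confusion_counts(y_true, y_pred)
acc_value = accuracy(*counts)
mcc_value = mcc(*counts)

print("accuracy:", acc_value)
print("MCC:", mcc_value)
```

In this sketch the always-negative classifier reaches an accuracy of 0.95 while its MCC is 0, because MCC uses all four cells of the confusion matrix and only rewards a classifier that does well on both classes.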