Accuracy on training set is weirdly low compared to validation accuracy for many classifiers. Is this normal?

Question

I thought that after fitting data, and predicting the training set, you should get an accuracy that is close to 100%. I mean that only makes sense. The algorithm learns based on that dataset. But when i do:

classifier.fit(X_train, y_train)

pred = classifier.predict(X_test)

print(accuracy_score(y_test, pred))

>>> 0.810126582278481

This is fine. However, if I do:

pred = classifier.predict(X_train)

print(accuracy_score(y_train, pred))

>>> 0.6677316293929713

Isn't this kind of a fallacy? Or am I doing something wrong...? This applies to RandomForestClassifier, MLPClassifier and SVC.

Accuracy on training set is weirdly low compared to validation accuracy for many classifiers. Is this normal?

Answers (1)

Related Questions