Reputation: 1
I am trying to build a TFIDF model with TfidfVectorizer. The feature name list namely the number of column of sparse matrix is shorter than the length of word set of documents even though I set min_df as 1. What happened?
Upvotes: 0
Views: 651
Reputation: 3086
Did you check the stop_words and max_features? If you provide values in either of these two, it will exclude some words.
Upvotes: 1