Combine two sklearn pipelines into one

Question

I have a text preprocessing Pipeline:

pipe = Pipeline([
  ('count_vectorizer', CountVectorizer()),
  ('chi2score', SelectKBest(chi2, k=1000)),
  ('tfidf_transformer', TfidfTransformer(norm='l2', use_idf=True)),
])

and I want to perform cross validation on a pipeline with multiple estimators. This is a solution that is working, but honestly I don't really like it. There should be a better way to do it. Maybe somehow convert the Pipeline to a transformer?

pipe_nb = Pipeline([*pipe.steps, ('naive_bayes', MultinomialNB())])

That's an approach that I perceive as an ideal one, but unfortunately it does not merge steps into new pipeline and causes issues.

pipe_nb = make_pipeline(
  pipe, 
  MultinomialNB()
)

How to merge two pipelines into one, in a nice pythonic way?

Combine two sklearn pipelines into one

Answers (1)

Related Questions