Reputation: 1
What's the difference between: DecisionTreeRegressor(splitter='random') and DecisionTreeRegressor(splitter='best')
If both seem to produce random predictions, I don't understand why both implementations use the random_state parameter.
Here's an example:
import pandas as pd
from sklearn.tree import DecisionTreeRegressor
url = 'https://raw.githubusercontent.com/justmarkham/DAT8/master/data/vehicles_train.csv'
train = pd.read_csv(url)
train['vtype'] = train.vtype.map({'car':0, 'truck':1})
feature_cols = ['year', 'miles', 'doors', 'vtype']
X = train[feature_cols]
y = train.price
treereg = DecisionTreeRegressor(splitter='best')
for i in range(1, 10):
    treereg.fit(X, y)
    # predict expects a 2D array: one row per sample
    print(treereg.predict([[1994, 10000, 2, 1]]))
thanks!
Upvotes: 0
Views: 2606
Reputation: 86320
I can't answer this definitively, but this is what I suspect is happening:
Even for splitter="best", the algorithm used inside the decision tree explores the features in a random order (as you can see in the source). If max_features is not set, it explores all features and should therefore find the same best split regardless of the random state, as long as there is a unique best split.
My suspicion is that for the data you provided, at some point there are two possible splits that are equally good according to the specified criterion, so the algorithm chooses whichever one it encounters first; which one that is depends on the random feature order, and hence on random_state.
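A minimal sketch illustrating the point above (using synthetic data rather than the vehicles CSV): whichever splitter you use, fixing random_state makes repeated fits produce identical predictions, because the random feature-exploration order is then pinned down.

```python
import numpy as np
from sklearn.tree import DecisionTreeRegressor

# Synthetic regression data (assumption: any dataset works for this demo)
rng = np.random.RandomState(0)
X = rng.rand(100, 4)
y = rng.rand(100)

for splitter in ('best', 'random'):
    # Fit the same model five times with the same random_state
    preds = [
        DecisionTreeRegressor(splitter=splitter, random_state=42)
        .fit(X, y)
        .predict(X[:5])
        for _ in range(5)
    ]
    # With a fixed random_state, every fit yields identical predictions
    assert all(np.array_equal(preds[0], p) for p in preds)
    print(splitter, '-> deterministic with fixed random_state')
```

Without random_state, splitter='random' will generally vary between fits, and splitter='best' can too whenever several splits tie on the criterion.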
Upvotes: 2