Rachel
Rachel

Reputation: 1967

How to graph a tree with graphviz?

I can't reproduce a simple example. Here is how it goes:

import pandas as pd
import numpy as np
import sklearn as skl
from sklearn import tree
from sklearn.cross_validation import train_test_split as tts



#   import data and give a little overview
sample = pd.read_stata('sample_data.dta')
s = sample

#   Let's learn
X = np.array((s.crisis, s.cash, s.industry, s.current_debt, s.activity)).reshape(1000, 5)
y = np.array(s.wc_measure)

X_train, X_test, y_train, y_test = tts(X, y, test_size = .8)

my_tree = tree.DecisionTreeClassifier()
clf = my_tree.fit(X_train, y_train)
predictions = my_tree.predict(X_test)

#   Graph the tree
from sklearn.externals.six import StringIO  
import pydotplus 
dotfile = StringIO() 
tree.export_graphviz(clf, out_file=dotfile) 
pydotplus.graph_from_dot_data(dotfile.getvalue()).write_png("my_tree.png")

When I try to graph the tree, I get the following error:

pydotplus.graphviz.InvocationException: GraphViz's executables not found

Any idea what the problem is? I neatly follow graphviz's documentation.

Upvotes: 2

Views: 9117

Answers (2)

dxf
dxf

Reputation: 573

step 1, install C-version of graphviz using 'sudo apt-get install graphviz' if ubuntu, 'brew install graphviz' if OSX

step 2, install package 'graphviz' by pip sudo pip install graphviz

Upvotes: 2

Ohad Eytan
Ohad Eytan

Reputation: 8464

It seem you didn't install the graphviz software at all. You should download and install it before using the module.

Upvotes: 3

Related Questions