Olivia

Reputation: 814

Tensorflow slowing down in a loop

I have multiple algorithms that I run in loops. Those that use TensorFlow slow down dramatically after many iterations.

Each file list will be roughly 10,000 files, depending on which algorithm it is. I loop through the file list one file at a time, creating a data frame from each file, running my algorithm on the data frame, then writing the result to a database. It looks something like:

file_list = self.get_files()
for file in file_list:
    data = self.get_data(file.fileid)
    result = self.get_result(data)
    self.write_result(result)

get_result is a different function for each algorithm. They normally take 0-5 seconds per file to calculate the results.

The algorithm I'm working with at the moment processes 2 files per second at the beginning of the loop, but after a few hundred files it slows down to a minute per file. Inspecting the code, TensorFlow has to be the bottleneck, as the rest of the code is relatively trivial.

In get_result there is the following line that I believe is the culprit:

z = self.evaluate_risk(normalized_X)

def evaluate_risk(self, X):
    with tf.device('/cpu:0'):
        with tf.Session() as sess:
            tf.saved_model.loader.load(
                sess, [tf.saved_model.tag_constants.SERVING], model.pb)
            graph = tf.get_default_graph()
            input_x = graph.get_tensor_by_name("input:0")
            risk = graph.get_tensor_by_name("risk:0")
            z = sess.run(risk, {input_x: X})
            sess.close()
            del sess
            del graph
    return z

Given that I'm using a with block, I don't understand why this function causes any issues. I have since added sess.close(), del sess, and del graph, but I still get the same issue.

Each time I process a new file and call get_result, I should be starting TensorFlow fresh. Are there any obvious reasons why my loop slows down? I'm guessing some part of TensorFlow isn't resetting.

Upvotes: 1

Views: 848

Answers (1)

javidcf

Reputation: 59681

Without seeing a complete example it is hard to tell what the best solution is, but generally I would load the model only once (maybe in a graph of its own) and create only one session, then reuse that in evaluate_risk. That should significantly reduce the overhead of each call. You could do something like this:

def __init__(self):
    # ... init code
    self.graph = tf.Graph()  # Have the model live in its own graph
    with self.graph.as_default(), tf.device('/cpu:0'):
        self.session = tf.Session()
        tf.saved_model.loader.load(
            self.session, [tf.saved_model.tag_constants.SERVING], model_pb)
        self.input_x = self.graph.get_tensor_by_name("input:0")
        self.risk = self.graph.get_tensor_by_name("risk:0")

def __del__(self):
    # Ensure the session is closed when the object is deleted
    # (or do it in another method, or make the object work as a context manager, ...)
    self.session.close()

def evaluate_risk(self, X):
    return self.session.run(self.risk, {self.input_x: X})

EDIT: Closing the session in the __del__ method may be superfluous, as in principle when the object is deleted its session will be too, and thus closed. However, it avoids the potential issue of someone grabbing a reference to the session in the object (like obj_session = my_object.session), which could result in the session not being closed as expected. It also makes clearer when the session is expected to be closed.
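To illustrate the context-manager option mentioned above, here is a minimal sketch of the pattern. DummySession is a hypothetical stand-in for tf.Session so the example runs without TensorFlow; in real code you would create the session and load the SavedModel in __enter__ (or keep the __init__ shown earlier) and the session would still be closed in __exit__:

```python
class DummySession:
    """Hypothetical stand-in for tf.Session (keeps the example TF-free)."""
    def __init__(self):
        self.closed = False

    def run(self, X):
        # Placeholder for sess.run(risk, {input_x: X})
        return [x * 2 for x in X]

    def close(self):
        self.closed = True


class RiskModel:
    """Context-manager sketch: one session for the object's whole lifetime."""
    def __enter__(self):
        # Real code: self.session = tf.Session(graph=self.graph), then
        # tf.saved_model.loader.load(...), as in the __init__ above.
        self.session = DummySession()
        return self

    def __exit__(self, exc_type, exc_value, traceback):
        # Runs even if evaluate_risk raises, so the session is always closed.
        self.session.close()
        return False  # don't suppress exceptions

    def evaluate_risk(self, X):
        return self.session.run(X)
```

The file loop can then run entirely inside one `with RiskModel() as model:` block, reusing the same session for every file instead of building a new graph per call.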

Upvotes: 1
