Python script hangs when calling cursor.fetchall() with a large data set

Question

I have a query that returns over 125K rows.

The goal is to write a script that iterates through the rows and, for each one, populates a second table with data processed from the result of the query.

To develop the script, I created a duplicate database with a small subset of the data (4126 rows).

On the small database, the following code works:

import os
import sys
import random

import mysql.connector

cnx = mysql.connector.connect(user='dbuser', password='thePassword',
                          host='127.0.0.1',
                          database='db')
cnx_out = mysql.connector.connect(user='dbuser', password='thePassword',
                          host='127.0.0.1',
                          database='db')

ins_curs = cnx_out.cursor()

curs = cnx.cursor(dictionary=True)
#curs = cnx.cursor(dictionary=True,buffered=True) #fail

with open(r'sql\getRawData.sql') as fh:  # raw string so the backslash is taken literally
    sql = fh.read()

curs.execute(sql, params=None, multi=False)
result = curs.fetchall()  #<=== script stops at this point
print(len(result)) #<=== this line never executes

print(curs.column_names)

curs.close()
cnx.close()
cnx_out.close()
sys.exit()

The line curs.execute(sql, params=None, multi=False) succeeds on both the large and small databases. If I use curs.fetchone() in a loop, I can read all records.
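For reference, the row-by-row pattern that does work can be sketched roughly as follows. This is a minimal sketch, not the exact code from my script: iter_rows is a hypothetical helper name, and it only assumes a cursor that has already been executed and that follows the usual DB-API behaviour of returning None from fetchone() when no rows remain.

```python
def iter_rows(cursor):
    """Yield rows one at a time from an already-executed DB-API cursor.

    Hypothetical helper: only one row is held in Python at a time,
    instead of the whole result list that fetchall() builds.
    """
    while True:
        row = cursor.fetchone()
        if row is None:  # fetchone() returns None once the result set is exhausted
            break
        yield row


# Hypothetical usage with the cursor from the script above:
#
#     curs.execute(sql)
#     for row in iter_rows(curs):
#         handle(row)  # handle() is a placeholder for the per-row processing
```

With this loop every record from the large database is read, so the query itself appears to run and return rows; only the fetchall() call stalls.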

If I alter the line:

curs = cnx.cursor(dictionary=True)

to read:

curs = cnx.cursor(dictionary=True,buffered=True)

The script hangs at curs.execute(sql, params=None, multi=False).

I can find no documentation on any limits to fetchall(), no way to increase the buffer size, and no way to tell how large a buffer I would even need.

There are no exceptions raised.

How can I resolve this?
