Reputation: 29
I want to retrieve about 100 million rows and 30 columns of data from an SQL database into a dataframe where I can sort and filter based on certain requirements. I only have 2 GB of memory. Everything comes to a standstill even though I am using chunksize. Here is my code.
import pymysql
import pymysql.cursors
import pandas as pd
from urllib import parse
from sqlalchemy import create_engine

chunksize = 100

sqlEngine = create_engine('mysql+pymysql://username:%s@localhost/db' % parse.unquote_plus('password'))
dbConnection = sqlEngine.connect()

for chunk in pd.read_sql("select * from db.db_table", dbConnection, chunksize=chunksize):
    print(chunk)
    # Do something with chunk (a dataframe holding chunksize rows of the 100 million)
I have reduced my chunksize but I am still not getting anything.
Upvotes: 1
Views: 2083
Reputation: 169012
To elaborate on my comment, something like this.
I foresee you're going to have a bad time trying to fit 100 million rows x 30 columns in 2 gigabytes of memory, though.
import itertools

df = None
for offset in itertools.count(step=chunksize):
    print("Reading chunk %d..." % offset)
    # Page through the table with LIMIT/OFFSET; ORDER BY keeps the paging stable
    query = "select * from db.db_table order by id limit %d offset %d" % (chunksize, offset)
    chunk_df = pd.read_sql(query, dbConnection)
    if chunk_df.empty:
        # No data in the new chunk, so we have read the whole table
        break
    if df is None:
        df = chunk_df
    else:
        df = pd.concat([df, chunk_df], copy=False)
# do things with df
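If the filtering criteria are known up front, a variation that keeps memory bounded is to filter each chunk as it arrives (or push the condition into the query's WHERE clause) and only accumulate the matching rows. A minimal sketch of that idea; some_column and the threshold are placeholders I made up, not anything from your schema:

import itertools
import pandas as pd

filtered_parts = []
for offset in itertools.count(step=chunksize):
    query = "select * from db.db_table order by id limit %d offset %d" % (chunksize, offset)
    chunk_df = pd.read_sql(query, dbConnection)
    if chunk_df.empty:
        break
    # Keep only the rows that satisfy the filter, so the full table never sits in memory at once
    filtered_parts.append(chunk_df[chunk_df["some_column"] > 42])  # placeholder condition

df = pd.concat(filtered_parts, ignore_index=True)
df = df.sort_values("some_column")  # sort the (much smaller) filtered result

Pushing the same condition into the SQL itself (select * from db.db_table where some_column > 42 ...) would be cheaper still, since MySQL then filters the rows before any data reaches pandas.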
Upvotes: 1