How to put info of a text file in a tuple to be convertible to SQL table?

Question

I have several text files that contain specific information.I need to extract the required info from the files and put them in a MySQL table. The files contain several lines of info but I only need these three lines for instance:

Name:      Gorge
Registration ID: 6657
Registration Time:  2012-09-10 14:31:13

I wrote the below code but the result of the code is not what I want. The code still do not contain the insert for SQL part.

import fnmatch
import os
import pprint



matches=[]
b=[]

for root, dirnames, filenames in os.walk('d:/Data'):
    for filename in fnmatch.filter(filenames, 'Info_reg.txt'):
        matches.append(os.path.join(root, filename))


all_keys = ['name','Registration ID','registration time']
for m in matches:
    f=open(m,'r')
    for line in f:        
        for n in all_keys:
            if line.startswith(n):
                a = line.split(':',1)
                b.append(a)

and the code result is like below which I assume cannot easily convert to a table:

['registration time', '     2012-10-08 17:28:47
'],
 ['Registration ID', ' 9876'],
 ['Name', '      Malcom
'],

 ['registration time', '     2012-10-08 17:28:47
'],
 ['Registration ID', ' 45'],
 ['Name',      'mazu
'],

Does anyone have an idea how can I change my code to make a nice table out of this file?

Martijn Pieters · Accepted Answer

You want to call .strip() on the results, and store the whole thing in a dictionary instead of a list of lists.

We can also optimize the line search and record handling; I am assuming here that when we find a Name entry a new record has started:

records = []

all_keys = {'Name', 'Registration ID', 'registration time'}
first_key = 'Name'

for m in matches:
    with open(m, 'r') as f
        record = dict.fromkeys(all_keys)  # new record dictionary with `None` values

        for line in f:
            key, value = line.split(':', 1)
            key, value = key.strip(), value.strip()
            if key not in all_keys:
                continue  # not interested in this line

            if key == first_key and any(v for v in record.itervalues()):
                # new record, finalize the previous
                records.append(record)
                record = dict.fromkeys(all_keys)

            record[key] = value

        if any(v for v in record.itervalues()):
            # there is something in the last record still, add that too
            records.append(record)

Now you have a list of records of the form:

records = [
    {'registration time', '2012-10-08 17:28:47', 'Registration ID': '9876', 'Name', 'Malcom'},
    {'registration time', '2012-10-08 17:28:47', 'Registration ID': '45', 'Name', 'mazu'},
]

These can be inserted into a database with MySQLdb using .executemany() in one go:

cursor = conn.cursor()
cursor.executemany('INSERT INTO sometable (id, name, time) VALUES (%(Registration ID)s, %(Name)s, %(registration time)s)',
    records)
conn.commit()

This inserts all collected records straight into the database.

How to put info of a text file in a tuple to be convertible to SQL table?

Answers (2)

Related Questions