Checking for Regular Expressions within a CSV

Question

I'm currently trying to run through my CSV file and identify what kind of data the rows in each column contain.

The output should be something like "This column contains alpha characters only".

My current code, inside a method:

        print('\nREGULAR EXPRESSIONS\n' +
              '----------------------------------')
        for x in range(0, self.tot_col):
            print('\n' + self.file_list[0][x] +
                  '\n--------------')  # Prints the column name

            for y in range(0, self.tot_rows + 1):

                if regex.re_alpha(self.file_list[y][x]) is True:
                    true_count += 1
                else:
                    false_count += 1

            if true_count > false_count:
                percentage = (true_count / self.tot_rows) * 100
                print(str(percentage) + '% chance that this column is alpha only')

            true_count = 0
            false_count = 0

self.file_list is the CSV file in list format. self.tot_rows and self.tot_col are the total rows and total columns respectively, which have been calculated earlier in the program.

regex.re_alpha is imported from regex.py, and the function looks like this:

def re_alpha(column):
    # Checks alpha characters
    alpha_valid = alpha.match(column)
    if alpha_valid:
        return True
    else:
        return False
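
The definition of `alpha` is not shown above. As a minimal sketch, assuming it is a compiled pattern along the lines of `[A-Za-z]+` (an assumption, not the original definition), one thing worth checking is that `match` only anchors at the start of the string, so a value such as 'abc123' would still count as alpha. `fullmatch` requires the whole value to match:

```python
import re

# Assumed pattern: the original definition of `alpha` is not shown.
alpha = re.compile(r'[A-Za-z]+')


def re_alpha(column):
    # fullmatch succeeds only if the entire value is alphabetic
    return alpha.fullmatch(column) is not None


def re_alpha_prefix(column):
    # match only anchors at the start, so trailing characters are ignored
    return alpha.match(column) is not None
```

If the real pattern already ends with `$`, the two functions behave almost identically and this difference does not apply.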

This currently works; however, I am unable to add my other regex checks, such as alpha, numeric, etc.

I have tried duplicating the if statement with a different regex check, but it doesn't work. I've also tried doing the counts in the regex.py file; however, the count stops at 1 and returns the wrong information. I thought creating a class in regex.py would help, but to no avail.

Summary: I would like to run multiple regex checks against my CSV file and have the results grouped by column.

Thanks in advance.
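
One possible way to run several checks per column is to keep the patterns in a dictionary and use a fresh counter for each column, rather than duplicating the if statement. The sketch below is only an illustration under stated assumptions: the check names, the patterns, and the `column_report` function are hypothetical, and it assumes the first row of the list is the header and should be skipped when counting.

```python
import re
from collections import Counter

# Hypothetical check names and patterns, for illustration only.
CHECKS = {
    'alpha only': re.compile(r'[A-Za-z]+'),
    'numeric only': re.compile(r'[0-9]+'),
    'alphanumeric only': re.compile(r'[A-Za-z0-9]+'),
}


def column_report(file_list):
    """Return {column name: {check name: percentage of matching rows}}."""
    header, rows = file_list[0], file_list[1:]  # row 0 is assumed to be the header
    total = len(rows)
    report = {}
    for x, name in enumerate(header):
        counts = Counter()  # fresh counters for every column
        for row in rows:
            for check, pattern in CHECKS.items():
                if pattern.fullmatch(row[x]):
                    counts[check] += 1
        report[name] = {
            check: (counts[check] / total * 100 if total else 0.0)
            for check in CHECKS
        }
    return report


if __name__ == '__main__':
    sample = [
        ['name', 'age'],
        ['Alice', '30'],
        ['Bob', '41'],
        ['Eve7', 'n/a'],
    ]
    for column, results in column_report(sample).items():
        print('\n' + column + '\n--------------')
        for check, pct in results.items():
            print(f'{pct:.1f}% of rows are {check}')
```

Adding another check then only means adding one entry to `CHECKS`; the counting loop stays the same.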
