Comparing two text documents and skipping certain lines based off of one text document - Python

Question

I'm working on a Python project. I am reading in a text file that lists all 50 states plus DC, one per line, with each line terminated by a semicolon (;). An example is below. I am also reading in a second file that contains a LOT of information. That text document can be found at http://weather.rap.ucar.edu/surface/stations.txt.

I want to skip any line that starts with a state name, along with the line directly below it, because I do not need this information. The state names come from the text file listing all fifty states. Is there a way to test, line by line, whether a line starts with one of the state names in that file and, if it does, skip that line plus the line below it?

For example, in the hyperlinked text file, line 43 starts with Alaska. I want to skip that line and the line below it. I want to store the rest of the information in an array. When I hit line 244, the information for the next state (Alabama) starts. I want to skip line 244 and the line below that, and do the same thing - store all the information in the array, compiling one large array at the end.
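The skipping behaviour described above can be sketched with a plain loop and a flag, without NumPy. This is only an illustration under stated assumptions: `drop_state_blocks` is a hypothetical helper name, the sample lines are made up to resemble the file loosely, and the comparison is case-insensitive on the assumption that the header lines may be capitalised differently from the names in the states file.

```python
def drop_state_blocks(lines, states):
    """Return lines with every state-header line and the line after it removed.

    lines  -- iterable of strings (for example, the lines of the data file)
    states -- iterable of state names (hypothetical input, e.g. from the states file)
    """
    # str.startswith accepts a tuple of prefixes; upper-case both sides
    # so that "Alaska" in the states file matches "ALASKA" in the data.
    prefixes = tuple(name.upper() for name in states)
    kept = []
    skip_next = False
    for line in lines:
        if skip_next:
            # This is the line directly below a state header: drop it.
            skip_next = False
            continue
        if line.upper().startswith(prefixes):
            # State header found: drop it and remember to drop the next line.
            skip_next = True
            continue
        kept.append(line)
    return kept


# Made-up sample lines, only loosely shaped like the real file.
sample_lines = [
    "ALASKA             (date)",
    "CD  STATION         ICAO  IATA",
    "AK STATION ONE      AAAA  AA1",
    "AK STATION TWO      BBBB  BB2",
    "ALABAMA            (date)",
    "CD  STATION         ICAO  IATA",
    "AL STATION THREE    CCCC",
]
remaining = drop_state_blocks(sample_lines, ["Alabama", "Alaska"])
```

Here `remaining` holds only the three station lines, which can then be processed further or collected into one list.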

Here are the first four lines of the fifty states file:

Alabama; 
Alaska;
Arizona; 
Arkansas;
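
Given lines in that shape, one possible way to turn the states file into a clean list of names is to strip the trailing semicolon and any surrounding whitespace from each line. The function name `parse_states` is hypothetical, and the sketch works on a string so that it can be run without the actual file.

```python
def parse_states(text):
    """Return the state names found in semicolon-terminated lines of text."""
    names = []
    for raw in text.splitlines():
        # Remove surrounding whitespace, then the trailing ';',
        # then any whitespace that sat before the ';'.
        name = raw.strip().rstrip(";").strip()
        if name:  # ignore blank lines
            names.append(name)
    return names


# The four example lines above, including their trailing spaces.
sample_text = "Alabama; \nAlaska;\nArizona; \nArkansas;\n"
state_names = parse_states(sample_text)
```

With a real file, the same function could be fed the result of `open('listOfAllStates.txt').read()`.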

For clarification, the only information I am interested in is the ICAO data, which is the 3rd column in the hyperlinked text file.

Also, would it be an issue if there is no ICAO information for a specific line? For example, line 63 in the hyperlinked text document does not have a value.
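
A missing value does matter if the line is split on whitespace, because every later column shifts left and the wrong field ends up in the "3rd column". One way around this, assuming the file is laid out in fixed-width columns, is to slice each line by character position instead. In the sketch below the offsets 20 and 24 are an assumption and should be checked against the header row of the actual file; `extract_icao` is a hypothetical helper name and the sample lines are made up.

```python
def extract_icao(line, start=20, end=24):
    """Return the ICAO code from a fixed-width line, or None if the field is blank.

    start and end are ASSUMED character offsets of the ICAO column;
    verify them against the header row of the real file.
    """
    code = line[start:end].strip()
    return code or None


# Made-up sample lines, padded so the ICAO field starts at offset 20.
with_icao = "AK STATION ONE".ljust(20) + "AAAA" + "  AA1"
without_icao = "AK STATION TWO".ljust(20) + "    " + "  BB2"

codes = [extract_icao(with_icao), extract_icao(without_icao)]
```

Lines without a code yield `None`, so they can be filtered out or kept as placeholders, whichever suits the final array.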

This is the code I have so far:

import numpy as np

# This program reads in the ICAO data file found at: http://weather.rap.ucar.edu/surface/stations.txt

with open('ICAOlist.txt', 'r') as dataICAO:
    icaoData = np.loadtxt(dataICAO, dtype=str, delimiter=' ', skiprows=41)

with open('listOfAllStates.txt', 'r') as dataStates:
    statesData = np.loadtxt(dataStates, dtype=str, delimiter=';')
