Getting list of address by flattening row in Dataframe pandas with a delimiter

Question

I have created a dataframe from an excel file. The table is a contact list with one row contain name of location, next row street address, next city and repeat for each location. Now each location is seperate by a specific phrase 'Site code'. Below is an example of what I see.

Name and address	Don't Care
Location Name	x
Street addresss	x
City	x
State	x
Site code	x
Location Name	x
Street addresss	x
City	x
State	x
Site code	x

I want to be able to flatten everything inbetween the site code into one row so it would be

[location name, street address, city, state]

I was thinking of making a function that would look at the table and create a dictionary that key would be the location name and it appends everything until it reaches to site code then skips and moves on the next dictionary entry. But it also feels like I am overthinking this. What do you think?

not_speshal · Accepted Answer

You could use numpy.array_split to split the DataFrame on rows that contain "Site code", and then parse the resultant chunks as needed:

import numpy as np
chunks = np.array_split(df["Name and address"], df[df["Name and address"]=="Site code"].index)
output = [chunk[chunk!="Site code"].tolist() for chunk in chunks if chunk.tolist()!=["Site code"]]

>>> output
[['Location Name', 'Street addresss', 'City', 'State'],
 ['Location Name', 'Street addresss', 'City', 'State']]

Getting list of address by flattening row in Dataframe pandas with a delimiter

Answers (2)

Related Questions