Split each line in a file based on delimitters

Question

This is the sample data in a file. I want to split each line in the file and add it to a dataframe. In some cases they have more than 1 child. So whenever they have more than one child new set of column have to be added child2 Name and DOB

(P322) Rashmika Chadda 15/05/1995 – Rashmi C 12/02/2024
(P324) Shiva Bhupati 01/01/1994 – Vinitha B 04/08/2024
(P356) Karthikeyan chandrashekar 22/02/1991 – Kanishka P 10/03/2014
(P366) Kalyani Manoj 23/01/1975 - Vandana M 15/05/1995 - Chandana M 18/11/1998

This is the code I have tried but this splits only by taking "-" into consideration

with open("text.txt") as read_file:
    file_contents = read_file.readlines()
content_list = []
temp = []
for each_line in file_contents:
    temp = each_line.replace("â€“", " ").split()

    content_list.append(temp)

print(content_list)

Current output:

[['(P322)', 'Rashmika', 'Chadda', '15/05/1995', 'Rashmi', 'Chadda', 'Teega', '12/02/2024'], ['(P324)', 'Shiva', 'Bhupati', '01/01/1994', 'Vinitha', 'B', 'Sahu', '04/08/2024'], ['(P356)', 'Karthikeyan', 'chandrashekar', '22/02/1991', 'Kanishka', 'P', '10/03/2014'], ['(P366)', 'Kalyani', 'Manoj', '23/01/1975', '-', 'Vandana', 'M', '15/05/1995', '-', 'Chandana', 'M', '18/11/1998']]

Final output should be like below

Code	Parent_Name	DOB	Child1_Name	DOB	Child2_Name	DOB
P322	Rashmika Chadda	15/05/1995	Rashmi C	12/02/2024
P324	Shiva Bhupati	01/01/1994	Vinitha B	04/08/2024
P356	Karthikeyan chandrashekar	22/02/1991	Kanishka P	10/03/2014
P366	Kalyani Manoj	23/01/1975	Vandana M	15/05/1995	Chandana M	18/11/1998

Split each line in a file based on delimitters

Answers (1)

Related Questions