How to separate one python list into 3 different lists according to the criteria

Question

I have a python list like below:

A = ['"','', 'What', 'colour', 'is', 'the', 'sky','' ,'(A)', 'red','', '(B)', 'blue', '','(C)', 'yellow','', '"']

For list A, what is the easiest way to do the followings?

1. remove ' " ' from the list, i.e.

A_new =  ['', 'What', 'colour', 'is', 'the', 'sky','' ,'(A)', 'red','', '(B)', 'blue', '','(C)', 'yellow','']

1. separate A into 3 lists, one for each multiple choice option, i.e. the output should be like below:

A_new_1 = ['', 'What', 'colour', 'is', 'the', 'sky','' ,'(A)', 'red']
A_new_2 = ['', 'What', 'colour', 'is', 'the', 'sky','' ,'(B)', 'blue']
A_new_3 = ['', 'What', 'colour', 'is', 'the', 'sky','' ,'(C)', 'yellow']

In my example, the ultimate goal is to get the lists A_new_1, A_new_2 and A_new_3.

I am currently working on making python function to achieve this objective, and my code so far is the following:

# 2. for GPT2MCHeadModel (ARC, openbookQA)
def GPT2MCHeadModel_data_manipulator(file_path):
    f = open(file_path, "r") 
    ln = f.readline()
    ln = ln.replace('"', '') # remove unnecessary quotation marks from the raw text file.
    ln_split = ln.split()

    # insert appropriate tokens into the raw text files before processing them in GPT2MCHeads model.
    ln_split.insert(0, "") 
    ln_split.insert(len(ln_split) - 1, "") 
    ln_split.insert(ln_split.index("(A)"), "") 
    ln_split.insert(ln_split.index("(B)"), "") 
    ln_split.insert(ln_split.index("(C)"), "") 
    ln_split.insert(ln_split.index("(D)"), "")

and I am not sure how to separate the contents into 3 separate lists, one list for each multiple choice option.

Thank you,

CDJB · Accepted Answer

Try the following:

A = ['"','', 'What', 'colour', 'is', 'the', 'sky','' ,'(A)', 'red','', '(B)', 'blue', '','(C)', 'yellow','', '"']

# Problem 1
A = [x for x in A if x != '"']

i = A.index("")
c = A.count("")

# Problem 2
output = [A[:i] + A[i+j*3:i+j*3+3] for j in range(c)]

Output

>>> A
['', 'What', 'colour', 'is', 'the', 'sky', '', '(A)', 'red', '', '(B)', 'blue', '', '(C)', 'yellow', '']
>>> output
[['', 'What', 'colour', 'is', 'the', 'sky', '', '(A)', 'red'],
 ['', 'What', 'colour', 'is', 'the', 'sky', '', '(B)', 'blue'],
 ['', 'What', 'colour', 'is', 'the', 'sky', '', '(C)', 'yellow']]

How to separate one python list into 3 different lists according to the criteria

Answers (1)

Related Questions