Python: List not correct after appending lines

Question

I'm trying to append lines to an empty list reading from a file, and I've already stripped the lines of returns and newlines, but what should be one line is being entered as two separate items into the list.

DNA = open('DNAGCex.txt')
DNAID = []
DNASEQ = []
for line in DNA:
    line = line.rstrip()
    line = line.lstrip()
    if line.startswith('>')==True:
        DNAID.append(line)
    if line.startswith('>')==False:
        DNASEQ.append(line)
print DNAID
print DNASEQ

And here's the output

['>Rosalind_6404', '>Rosalind_5959', '>Rosalind_0808'] ['CCTGCGGAAGATCGGCACTAGA', 'TCCCACTAATAATTCTGAGG', 'CCATCGGTAGCGCATCCTTAGTCCA', 'ATATCCATTTGTCAGCAGACACGC', 'CCACCCTCGTGGTATGGCTAGGCATTCAG', 'TGGGAACCTGCGGGCAGTAGGTGGAAT']

I want it to look like this:

['>Rosalind_6404', '>Rosalind_5959', '>Rosalind_0808'] ['CCTGCGGAAGATCGGCACTAGATCCCACTAATAATTCTGAGG', 'CCATCGGTAGCGCATCCTTAGTCCAATATCCATTTGTCAGCAGACACGC', 'CCACCCTCGTGGTATGGCTAGGCATTCAGTGGGAACCTGCGGGCAGTAGGTGGAAT']

Here is the source material, just remove the ''s:

['>Rosalind_6404' CCTGCGGAAGATCGGCACTAGA TCCCACTAATAATTCTGAGG '>Rosalind_5959' CCATCGGTAGCGCATCCTTAGTCCA ATATCCATTTGTCAGCAGACACGC '>Rosalind_0808' CCACCCTCGTGGTATGGCTAGGCATTCAG TGGGAACCTGCGGGCAGTAGGTGGAAT]

Brent Washburne · Accepted Answer

You can combine the .lstrip() and .rstrip() into a single .strip() call.

Then, you were thinking that .append() both added lines to a list and joined lines into a single line. Here, we start DNASEQ with an empty string and use += to join the lines into a long string:

DNA = open('DNAGCex.txt')
DNAID = []
DNASEQ = []
for line in DNA:
    line = line.strip()
    if line.startswith('>'):
        DNAID.append(line)
        DNASEQ.append('')
    else:
        DNASEQ[-1] += line
print DNAID
print DNASEQ

Python: List not correct after appending lines

Answers (2)

Related Questions