Extract XML-TEI attributes from a file

Question

Good morning, I am working on a code to extract data from an XML-TEI marked-up file of a poem and I would like to print the list of the 'pos' attributes for each one of the lines of the poem ('l'). ('w' is the name of the word tag contained within the 'l' tag)

 De qua saepe tibi ,   non licet   de qua saepe

result_4=bs_content.find_all('l')
for x in result_4:
  print(len(x.find_all('w')))
  for x in x.find_all('w'):
    a=x.get('pos')
    print(a)

The result is currently the following:

5

PREP

REL

ADV

PRON

PUN

2

ADV

V

3

PREP

REL

ADV

But I would like to have

5

['PREP', 'REL', 'ADV', 'PRON', 'PUN']

2

['ADV', 'V']

3

['PREP', 'REL', 'ADV']

May anyone help me? Thanks

Extract XML-TEI attributes from a file

Answers (0)

Related Questions