Python: Iterate through an html file

Question

I'm trying to iterate through an html file from the internet.

target = br.response().read()
for row in target:
    if "[some text]" in row:
    print next(target)

The problem is this loop iterates over each character in the html file, so it'll never find a match. How do I get it to iterate through each row instead?

I've tried target = target.splitlines() , but that really messes up the file.

Thomas Junk · Accepted Answer

What you basically want to achieve is the following (reading from a file, as your header suggests):

#!/usr/bin/env python

import sys

with open("test.txt") as file:
    for line in file:
        if "got" in line:
            print "found: {0}".format(line)

You want to open your file ("test.txt").

You read each line (for .. in)

and look if the line contains a string, where in comes in nice:)

If you are interested in the line number:

    for index, line in enumerate(file):

But beware the index starts with 0, so the current line number is index+1

Analog, if you want to read from a String as a file, take a look at StringIO.

Python: Iterate through an html file

Answers (2)

Related Questions