reading HTML(different folders) files

Question

I want to read HTML files in python. Normaly I do it like this (and it works):

import codecs
f = codecs.open("test.html",'r')
print f.read()

The Problem is that my html files are not all in the same Folder since have a program which generates this html files and save them into folders which are inside the folder where I have my script to read the files. Summarizing, I have my script in a Folder and inside this Folder there are more Folders where the generated html files are.

Does anybody know how can I proceed?

Torxed · Accepted Answer

import os
import codecs

for root, dirs, files in os.walk("./"):
    for name in files:
        abs_path = os.path.normpath(root + '/' + name)
        file_name, file_ext = os.path.splitext(abs_path)
        if file_ext == '.html':
            f = codecs.open(abs_path,'r')
            print f.read()

This will walk through

reading HTML(different folders) files

Answers (2)

Related Questions