Reputation: 1537
Let me preface this by saying I'm not exactly sure what is happening with my code; I'm fairly new to programming.
I've been working on creating an individual final project for my python CS class that checks my teacher's website on a daily basis and determines if he's changed any of the web pages on his website since the last time the program ran or not.
The step I'm working on right now is as follows:
def write_pages_files():
'''
Writes the various page files from the website's links
'''
links = get_site_links()
for page in links:
site_page = requests.get(root_url + page)
soup = BeautifulSoup(site_page.text)
with open(page + ".txt", mode='wt', encoding='utf-8') as out_file:
out_file.write(str(soup))
The links look similar to this:
/site/sitename/class/final-code
And the error I get is as follows:
with open(page + ".txt", mode='wt', encoding='utf-8') as out_file:
FileNotFoundError: [Errno 2] No such file or directory: '/site/sitename/class.txt'
How can I write the site pages with these types of names (/site/sitename/nameofpage.txt)?
Upvotes: 22
Views: 34880
Reputation: 11277
Related to the title of the question, though not the specifics, if you really want your file names to include something that looks like a slash, you can use the unicode character "∕" (DIVISION SLASH), aka u'\u2215'
.
This isn't useful in most circumstances (and could be confusing), but can be useful when the standard nomenclature for a concept you wish to include in a filename includes slashes.
Upvotes: 5
Reputation: 300
On Unix/Mac OS, for the middle slashes, you can use :
which will convert to /
when viewed, but trigger the subfolders that /
does.
site/sitename/class/final-code
-> final-code
file in a class
folder in a sitename
folder in a site
folder in the current folder
site:sitename:class:final-code
-> site/sitename/class/final-code
file in the current folder.
Upvotes: 6
Reputation: 180482
you cannot have /
in the file basename on unix or windows, you could replace /
with .
:
page.replace("/",".") + ".txt"
Python presumes /site
etc.. is a directory.
Upvotes: 14