How to find an content length int in http header file

Question

I am trying to find the integer for content length in a headers file. We are currently using something that works for some websites, but will not work for larger files (over 9999 bytes). How could we find the newline at the end of that specific header so that we can get exactly the integer for content length?

content_length = headers[headers.find("Content-Length: ")+16:headers.find("Content-Length: ")+20]

I would try searching for the " " but that is problematic as there are many in the http headers. Unfortunately, we do not have access to urllib.

Example headers:

GET http://example.com/ HTTP/1.1

Content-Length: 95972

Keep-Alive: 300

Connection: keep-alive

Blaise · Accepted Answer

Simple. Assuming your headers are stored to a string:

for line in headers:
    if "Content-Length:" in line:
        contentLength = line

contentLength = contentLength[16:]

Caveats: Not all HTTP headers contain Content-Length and some headers may contain more than one Content-Length.

How to find an content length int in http header file

Answers (2)

Related Questions