Iterate through lines of file until user input found

Question

I am writing a script that takes input from a user and then searches for it in a file. If the input is found on multiple lines of the file, the code should replace only the first occurrence.

This is my code:

from tempfile import mkstemp
from shutil import move, copymode
from os import fdopen, remove

def replace(file_path, pattern, subst):
    fh, abs_path = mkstemp()
    with fdopen(fh,'w') as new_file:
        with open(file_path) as old_file:
            lines = old_file.readlines()
            for line in lines:
                new_file.write(line.replace(pattern, subst))
            for lines in old_file:
                new_file.write(lines.replace(pattern, pattern))
    copymode(file_path, abs_path)
    remove(file_path)
    move(abs_path, file_path)

test_input = 'fruits'
replace_with = 'bread'
# Here, file_path is just for example purposes
replace(file_path, test_input, replace_with)

File content before running the script:

I like cookies
I like fruits
I like fruits
I like fruits

What I want the file content to look like after running my script:

I like cookies
I like bread
I like fruits
I like fruits

What it actually looks like after running the script:

I like cookies
I like bread
I like bread
I like bread

How can I fix the code to get the desired result?

David Culbreth · Accepted Answer

Before I answer the primary question, I should point out that calling readlines() on a file object reads the entire file and returns a list of str, one per line, split at the newline characters. After this operation, the file pointer is at the end of the file, so iterating over that same file object again produces nothing, meaning that the second loop never runs. This is exactly what you're doing with these lines...

            # Read in the contents of old_file, divided by line, to lines.
            lines = old_file.readlines()

            ...

            # lines is now overwritten within the context of the loop, which
            # never runs, because old_file has already been read.
            for lines in old_file:
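
To see this behavior in isolation, here is a minimal sketch. The helper name read_twice and the throwaway temp file are assumptions made for the demonstration and are not part of your script. It reads a file with readlines(), tries to iterate over it a second time, and then rewinds with seek(0):

```python
import os
import tempfile

def read_twice(path):
    """Return (readlines() result, a second pass, a pass after seek(0))."""
    with open(path) as f:
        first = f.readlines()   # consumes the whole file
        second = list(f)        # file pointer is at the end, so this is empty
        f.seek(0)               # rewind to the start of the file
        third = list(f)         # iteration yields the lines again
    return first, second, third

# Build a small throwaway file to demonstrate with.
fd, demo_path = tempfile.mkstemp(text=True)
with os.fdopen(fd, "w") as f:
    f.write("I like cookies\nI like fruits\n")

first, second, third = read_twice(demo_path)
os.remove(demo_path)
print(len(first), len(second), len(third))  # 2 0 2
```

The second pass is empty because nothing is left to read. You could rewind with seek(0), but for this task a single pass over the file is enough.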

Now, to address your question: the replacement keeps happening on every matching line because nothing stops it after the first match. A simple flag can help you achieve this. In my example, I call it replacement_found.

from tempfile import mkstemp
from shutil import move, copymode
from os import fdopen, remove

file_path = "tmp.txt"

def replace(file_path, pattern, subst):
    fh, abs_path = mkstemp()
    with fdopen(fh,'w') as new_file:
        with open(file_path) as old_file:
            replacement_found = False
            # Iterate over the file one line at a time; this uses less memory than readlines().
            for line in old_file:
                if replacement_found:
                    new_file.write(line)
                else:
                    new_file.write(line.replace(pattern, subst))
                    if pattern in line:
                        replacement_found = True

    copymode(file_path, abs_path)
    remove(file_path)
    move(abs_path, file_path)

test_input = 'fruits'
replace_with = 'bread'
replace(file_path, test_input, replace_with)  # file_path is just for example purposes

Now the file contains what you're looking for:

I like cookies
I like bread
I like fruits
I like fruits
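
One edge case worth noting: str.replace(pattern, subst) replaces every occurrence within a line, so a first matching line such as "I like fruits and fruits" would have both words replaced. If you want only the very first occurrence in the whole file changed, str.replace accepts an optional third argument that limits the number of replacements. Below is a rough sketch of that idea, assuming you work on any iterable of lines (such as an open file). The name replace_first is hypothetical and the sample data is made up for illustration:

```python
def replace_first(lines, pattern, subst):
    """Yield lines with only the very first occurrence of pattern replaced."""
    replaced = False
    for line in lines:
        if not replaced and pattern in line:
            # The third argument limits str.replace to a single replacement.
            line = line.replace(pattern, subst, 1)
            replaced = True
        yield line

# Made-up sample input; an open file object would work the same way.
sample = ["I like cookies\n", "I like fruits and fruits\n", "I like fruits\n"]
result = list(replace_first(sample, "fruits", "bread"))
print("".join(result), end="")
```

Inside replace(), the same check could take the place of the if/else block, writing each resulting line to new_file.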
