replacing a bunch of lines in a bunch of files

Question

Let's say I have some thousands of HTML files with some text inside 'em (articles, actually). Besides, let's say there are all sorts of scripts, styles, counters, other crap inside these HTMLs, somewhere above the actual text.

And my task is to replace everything that goes from the very beginning until a certain tag – i.e., we start with and end with

with a clear

block.

Is there any regex way I can do this? Vim? Any other editor? Scripting language?

Thanks.

Tim Pietzcker · Accepted Answer

The simplest regex for this would be (?s)\A.*?(?=

) (assuming you want to keep the

tag). Replace that with the text from your question.

Explanation:

(?s)   # Allow the dot to match newlines
\A     # Anchor the search at the start of the string
.*?    # Match any number of characters, as few as possible
(?=)  # and stop right before this

This will fail, of course, if the text

could also occur in a comment or a literal string somewhere above the actual tag.

replacing a bunch of lines in a bunch of files

Answers (1)

Related Questions