Sed expression just prints entire file

Question

I've been trying to extract the bold portion from the following:

<a href="/torrent/4384536/HorribleSubs-Black-Clover-128-720p-mkv/">[HorribleSubs] Black Clover - 128 [720p].mkv</a>

But for whatever reason, this sed expression-

sed --regexp-extended 's#.*#\1#'

-is returning the entire file, when of course I only want the \1 capture group to be returned.

The weird thing is that this expression worked just fine when I tried debugging it with desed, with both the capture group and the primary match showing up as expected.

I'm using GNU sed 4.8-1.

Wiktor Stribiżew · Accepted Answer

You can use

sed -n -E '/.*<a href="([^"]*)\/">[^<]*<\/a>.*/{s//\1/p;q}'

Details:

-n - suppresses default line output

-E - enables POSIX ERE regex syntax

/.*<a href="([^"]*)\/">[^<]*<\/a>.*/ - finds a line containing an <a href=".../">...</a> substring, capturing the part between href=" and /"

{s//\1/p;q} - replaces the string matched above with the value of the captured substring (the empty pattern in s// reuses the last regex), prints it and quits.
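The role of -n is easiest to see side by side. The following is a minimal sketch, assuming the input is an anchor tag surrounded by unrelated lines; the sample data and the variable names without_n and with_n are made up for illustration. Without -n, sed auto-prints every line whether or not the substitution matched, which is consistent with the "prints entire file" symptom.

```shell
# Hypothetical sample input: one matching line between two unrelated lines.
s='blah
<a href="/x/y/">text</a>
blah'

# Without -n: every line is auto-printed; only the matching line is rewritten.
without_n=$(printf '%s\n' "$s" | sed -E 's#.*<a href="([^"]*)/">.*#\1#')

# With -n and the p flag: only lines where the substitution succeeded are printed.
with_n=$(printf '%s\n' "$s" | sed -n -E 's#.*<a href="([^"]*)/">.*#\1#p')

printf 'without -n:\n%s\n\n' "$without_n"
printf 'with -n and p:\n%s\n' "$with_n"
```

The first command outputs three lines (the two blah lines plus /x/y), while the second outputs only /x/y.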

See the online demo:

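s='blah
<a href="/torrent/4384536/HorribleSubs-Black-Clover-128-720p-mkv/">[HorribleSubs] Black Clover - 128 [720p].mkv</a>
blah
<a href="/torrent/4384536/HorribleSubs-Black-Clover-128-720p-mkv/">[HorribleSubs] Black Clover - 128 [720p].mkv</a>
blah'
sed -n -E '/.*<a href="([^"]*)\/">[^<]*<\/a>.*/{s//\1/p;q}' <<< "$s"
# => /torrent/4384536/HorribleSubs-Black-Clover-128-720p-mkv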
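The q command is what limits the output to the first matching line. If every matching line is wanted instead, a plain s///p without the block and q should do. This is a rough sketch under the same assumption that the links are anchor tags; the two sample hrefs and the variable names first_only and all_matches are invented for illustration.

```shell
# Hypothetical sample input with two different links.
s='blah
<a href="/a/one/">first</a>
blah
<a href="/b/two/">second</a>'

# With q: sed stops after printing the first match.
first_only=$(printf '%s\n' "$s" | sed -n -E '/.*<a href="([^"]*)\/">[^<]*<\/a>.*/{s//\1/p;q}')

# Without q: every line where the substitution succeeds is printed.
all_matches=$(printf '%s\n' "$s" | sed -n -E 's/.*<a href="([^"]*)\/">[^<]*<\/a>.*/\1/p')

printf 'first only:\n%s\n\n' "$first_only"
printf 'all matches:\n%s\n' "$all_matches"
```

On large files the q variant also avoids reading the rest of the input once the first link has been found.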
