neversaint
neversaint

Reputation: 64024

Uppercasing First Letter of Words Using SED

How do you replace the first letter of a word into Capital letter, e.g.

Trouble me
Gold rush brides

into

Trouble Me
Gold Rush Brides

Upvotes: 69

Views: 67796

Answers (11)

Kajukenbo
Kajukenbo

Reputation: 275

Late post, but using ex (or vi) is POSIX (like the awk suggestion above) so it should work on any system that ships a proper "vi" implementation. FWIW, Busybox does not make the grade here.

POSIX sed only supports BREs and I personally do not advocate relying on GNU extensions if you have a choice.

However, POSIX ex/vi supports a few ERE features, including the word boundary < and the \U switch. If your platform does not have a compliant ex/vi then you have bigger problems than EREs, IMO.

As a bonus, a wq! will optionally perform an "in-place" edit.

cat <<EOF > casing.txt
Trouble me
Gold rush brides
EOF
ex -s casing.txt <<'EOF'
%s/\<./\U&/g
%p
wq!
EOF

Upvotes: 1

Vilmantas Liubinas
Vilmantas Liubinas

Reputation: 21

Capitalize the first letter of each word (first lowercase everything, then uppercase the first letter).

echo "tugann sí spreagadh" | sed -E 's/.*/\L&/g' | sed -E 's/(\b.)/\U\1/g'
> Tugann Sí Spreagadh

Upvotes: 2

Dominique Baldo
Dominique Baldo

Reputation: 125

Using bash (without sed, so a little off topic):

msg="this is a message"
for word in $msg
do
   echo -n ${word^} ""
done

This Is A Message

Upvotes: 4

Skippy le Grand Gourou
Skippy le Grand Gourou

Reputation: 7714

Proposed sed solutions until now will only work if the original text is in lowercase. Although one could use tr '[[:upper:]]' '[[:lower:]]' to normalize the input to lowercase, it may be convenient to have an all-in-one sed solution :

sed 's/\w\+/\L\u&/g'

This will match words (\w means word character, and \+ at least one and until the next non-word character), and lowercase until the end (\L) but uppercase the next (i.e. first) character (\u) on each (g) matched expression (&).

[Credits]

Upvotes: 1

rekha_sri
rekha_sri

Reputation: 2725

Use the following sed command for capitalizing the first letter of the each word.

echo -e "Trouble me \nGold rush brides" | sed -r 's/\<./\U&/g'

output

Trouble Me
Gold Rush Brides

The -r switch tells sed to use extended regular expressions. The instructions to sed then tell it to "search and replace" (the s at the beginning) the pattern \<. with the pattern \U& globally, i.e. all instances in every line (that's the g modifier at the end). The pattern we're searching for is \<. which is looking for a word boundary (\<) followed by any character (.). The replacement pattern is \U&, where \U instructs sed to make the following text uppercase and & is a synonym for \0, which refers to "everything that was matched". In this case, "everything that was matched" is just what the . matched, as word boundaries are not included in the matches (instead, they are anchors). What . matched is just one character, so this is what is upper cased.

Upvotes: 26

mark_infinite
mark_infinite

Reputation: 383

Using sed with tr:

name="athens"
echo $name | sed -r "s/^[[:alpha:]]/$(echo ${name:0:1} | tr [[:lower:]] [[:upper:]])/"

Upvotes: 2

Havok
Havok

Reputation: 5882

I know you said sed, but for shell scripting one of the easiest and more flexible for me was using Python which is available in most systems:

$ echo "HELLO WORLD" | python3 -c "import sys; print(sys.stdin.read().title())"
Hello World

For example:

$ lorem | python3 -c "import sys; print(sys.stdin.read().title())"
Officia Est Omnis Quia. Nihil Et Voluptatem Dolor Blanditiis Sit Harum. Dolore Minima Suscipit Quaerat. Soluta Autem Explicabo Saepe. Recusandae Molestias Et Et Est Impedit Consequuntur. Voluptatum Architecto Enim Nostrum Ut Corrupti Nobis.

You can also use things like strip() to remove spaces, or capitalize():

$ echo "  This iS mY USER ${USER}   " | python3 -c "import sys; print(sys.stdin.read().strip().lower().capitalize())"
This is my user jenkins

Upvotes: 3

chtimi59
chtimi59

Reputation: 539

Another shorter version with sed:

sed -e "s/\b./\u\0/g"

Upvotes: 9

ghostdog74
ghostdog74

Reputation: 342463

Using awk:

awk '{for(i=1;i<=NF;i++){ $i=toupper(substr($i,1,1)) substr($i,2) }}1' file

The output would be:

Trouble Me
Gold Rush Brides

Upvotes: 37

mikejonesey
mikejonesey

Reputation: 200

I had apostrophes so, working off the first solution...

mike@mike-laptop3:~$ echo "BEST WESTERN PLUS BOB's INN" | tr "[A-Z]" "[a-z]" | sed -e "s/\b\(.\)/\u\1/g"

Best Western Plus Bob'S Inn

mike@mike-laptop3:~$ echo "BEST WESTERN PLUS BOB's INN" | tr "[A-Z]" "[a-z]" | sed "s/\( \|^\)\(.\)/\1\u\2/g"

Best Western Plus Bob's Inn

Upvotes: 6

tangens
tangens

Reputation: 39733

This line should do it:

sed -e "s/\b\(.\)/\u\1/g"

Upvotes: 108

Related Questions