Reputation: 10535
Suppose I have a file text.txt as below:
she likes cats, and he likes cats too.
I'd like my result to look like:
she 1
likes 2
cats 2
and 1
he 1
too 1
If putting spaces around the , and . in the file would make the script easier, that would be fine.
Is there a simple shell pipeline that could achieve this?
Upvotes: 5
Views: 7571
Reputation: 203493
With GNU awk you can just specify the Record Separator (RS) to be any sequence of non-alphabetic characters:
$ gawk -v RS='[^[:alpha:]]+' '{sum[$0]++} END{for (word in sum) print word,sum[word]}' file
she 1
likes 2
and 1
too 1
he 1
cats 2
but that won't solve your problem of how to identify "words" in general.
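If you want the words printed in a stable order, note that for (word in sum) visits array elements in an unspecified order by default. A sketch of one way around that, assuming GNU awk 4.0 or later, where PROCINFO["sorted_in"] selects the traversal order:
$ gawk -v RS='[^[:alpha:]]+' '{sum[$0]++} END{PROCINFO["sorted_in"]="@ind_str_asc"; for (word in sum) print word, sum[word]}' file
Here "@ind_str_asc" sorts by the array index (the word itself) in ascending string order.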
Upvotes: 0
Reputation: 11051
Here's a one-liner near and dear to my heart:
cat text.txt | sed 's|[,.]||g' | tr ' ' '\n' | sort | uniq -c
The sed strips the punctuation (tune the regex to taste), the tr puts the results one word per line, and sort | uniq -c does the counting.
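Note that uniq -c prints each count before its word (e.g. "2 cats" rather than "cats 2"). If you want the word first, as in the question, a sketch of one variant, assuming GNU coreutils and awk are available (the awk at the end just swaps the two columns):
sed 's|[,.]||g' text.txt | tr -s ' ' '\n' | sort | uniq -c | awk '{print $2, $1}'
The -s on tr squeezes runs of blanks so repeated spaces don't get counted as empty words.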
Upvotes: 20