Print standard deviation of column in file using R/awk

Question

I have a file with a column of numbers.

In my shell script I would like to process many of these files and get the mean and standard deviation of each.

I can get the mean easily enough with awk: awk '{sum+=$1} END { print sum/NR}' file

For the standard deviation I use: awk '{x[NR]=$0; s+=$0} END{a=s/NR; for (i in x){ss += (x[i]-a)^2} sd = sqrt(ss/NR); print sd}' file

This gives 0.625, which differs from Excel, which gives me 0.699. I have since discovered that I can run R from the command line to print the standard deviation: R -q -e "x <- read.csv('file', header = F); sd(x[ , 1])"

However, this gives slightly messy output:

[1] 4.908
>
>

Can I adjust the R command to print out only the number without resorting to head and cut/awk?
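One possible way to get just the number is to run the expression through Rscript and print it with cat(), which writes the bare value without the [1] index or the trailing prompts. This is a minimal sketch, not the only option; the function name r_sd is hypothetical, and the file path is passed as a trailing argument and read with commandArgs().

```shell
# r_sd FILE: print only the standard deviation of column 1, using R.
# (r_sd is an illustrative name, not part of R.)
r_sd() {
    # Rscript runs non-interactively, so no "> " prompts are echoed.
    # cat() prints the bare number; sep = "" avoids a space before the newline.
    Rscript -e 'x <- read.csv(commandArgs(TRUE)[1], header = FALSE); cat(sd(x[, 1]), "\n", sep = "")' "$1"
}
```

Usage would then be r_sd file, and the result can be captured in a shell variable with sd=$(r_sd file).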

Also what is wrong with my awk code for extracting standard deviation?
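A likely explanation, assuming the Excel figure comes from STDEV (the sample standard deviation): the awk code divides by NR, which gives the population standard deviation, whereas Excel's STDEV and R's sd() divide by n-1. The ratio 0.699/0.625 is about 1.118, close to sqrt(5/4), which would fit a file of five values. A sketch of the same awk program with the n-1 denominator follows; the function name sample_sd is hypothetical.

```shell
# sample_sd FILE: print the sample standard deviation (n-1 denominator) of column 1.
# (sample_sd is an illustrative name.)
sample_sd() {
    awk '
        { x[NR] = $1; s += $1 }                  # store each value, accumulate the sum
        END {
            if (NR < 2) exit 1                   # sample SD is undefined for fewer than 2 values
            a = s / NR                           # mean
            for (i in x) ss += (x[i] - a) ^ 2    # sum of squared deviations from the mean
            print sqrt(ss / (NR - 1))            # divide by NR-1 instead of NR
        }
    ' "$1"
}
```

For a file containing 2, 4 and 6 on separate lines, sample_sd prints 2, while the original NR version would print roughly 1.63.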

