Coefficient of Divergence - how to code loop more efficiently?

Question

I have a matrix of 1500 rows and 20 columns and would like to calculate the Coefficient of Divergence between columns (ie. sites). Each row is my hourly observation per site.

Calculation for COD between two sites is as stated in https://www.scribd.com/document/364585909/Coefficient-of-Divergence-COD

I have created a simpler dataset to work with:

x1 <- c(10, 20, 30, 40, 50)
x2 <- c(11, 21, 31, 41, 51)
x3 <- c(12, 22, 32, 42, 52)
x4 <- c(13, 23, 33, 43, 53)
x5 <- c(14, 24, 34, 44, 54)
x6 <- c(15, 25, 35, 44, 55)

xmat <- cbind(x1,x2,x3,x4,x5,x6)

first COD

site1 <- xmat

for (i in 2:ncol(site1))   

     site1[,i] = (xmat[,i]-xmat[,1])/(xmat[,i]+xmat[,1])
     site1 <- site1[,2:6]
     site1 <- sapply(site1, function(x) x^2)
     COD-site1 <- data.frame(sqrt(colMeans(site1, na.rm=TRUE)))

second COD

site2 <- xmat

for (i in 3:ncol(site2))

    site2[,i] = (xmat[,i]-xmat[,2])/(xmat[,i]+xmat[,2])
    site2 <- site2[,3:6]
    site2 <- sapply(site2, function(x) x^2)
    COD-site2 <- data.frame(sqrt(colMeans(site2, na.rm=TRUE)))

etc....

I'm having a problem on how to loop through columns and create a new data frame for CODs. In my simple dataset, it's over 6 columns but in my actual dataset, I have 20 columns and don't really want to copy and paste 20 times. I'm a novice in writing function and loop for R and would appreciate a lending hand. Many thanks

Coefficient of Divergence - how to code loop more efficiently?

first COD

second COD

Answers (1)

Related Questions