R Beginner Question Data Frame Conversion to Time Series

Question

I have a flat-csv file containing data in long format, that needs to be converted to a time series object. The format of the file looks like this:

DATE       ID  REGION VALUE
2016-03-10 10  DE001  2332,23
2016-03-10 10  DE001  2332,23
2016-03-10 10  DE002  2332,23
2016-03-10 11  DE001  2332,23
2016-03-10 11  DE002  2332,23
2016-03-10 12  DE001  2332,23
2016-03-11 10  DE001  2332,23
2016-03-11 10  DE001  2332,23
2016-03-11 10  DE002  2332,23
2016-03-11 11  DE001  2332,23
2016-03-11 11  DE002  2332,23
2016-03-11 12  DE001  2332,23

I want to group by ID and then by region, so that i have a different time-series for each ID-group containing several region observations for the complete available time-span.

DAV · Accepted Answer

I misunderstood the OP's question.

You can use tapply to break up the original data frame (call it D). This is a bit tricky. You can't easily change D in the tapply

D$relTime <- NA 

L=tapply(1:nrow(D),D$ID, function(x) {
    # x contains the row numbers for each ID
    RT <- data.frame(row=x)
    T0 <- D$DATE[x][1]
    RT$val <- D$DATE[x]-T0 # if time series means offset from a base time
    RT
})
DL <- do.call('rbind',L)
# assuming you want it in  D
D$relTime[DL$row] <- DL$val

This will create a new column which contains the offset from the base time for each ID.

Edit: I use '=' for assingment which isn't considered best practice. I've changed them in the above.

R Beginner Question Data Frame Conversion to Time Series

Answers (2)

Related Questions