Correct way to specify quarterly observations as the time index in the plm package

Question

I am trying to convert quarterly data which is stored in a data.table into a panel data.frame to prepare it for further analysis. But apparently there's an issue when using quarterly dates as time dimension. I can convert them to date, numeric or character, but it is not recognised as quarterly time series by is.pconsecutive(), which then prevents me from using certain functions.

library(zoo)
library(data.table)
dt <- structure(list(Global.Company.Key = c(1380L, 1380L, 1380L, 1380L, 
1380L, 1380L, 1380L, 1380L), Calendar.Data.Year.and.Quarter = structure(c(2000, 
2000.25, 2000.5, 2000.75, 2001, 2001.25, 2001.5, 2001.75), class = "yearqtr"), 
    Calendar.Year.Quarter.Integer = c(10957L, 11048L, 11139L, 
    11231L, 11323L, 11413L, 11504L, 11596L), Year.Date = structure(c(10957, 
    11048, 11139, 11231, 11323, 11413, 11504, 11596), class = "Date")), .Names = c("Global.Company.Key", 
"Calendar.Data.Year.and.Quarter", "Calendar.Year.Quarter.Integer", 
"Year.Date"), row.names = c(NA, -8L), class = c("data.table", 
"data.frame"))
# defined the date index as integer
pdt <- pdata.frame(dt, index = c("Global.Company.Key", "Calendar.Year.Quarter.Integer"))
is.pconsecutive(pdt)
 1380 
 FALSE

Apparently the time dimension is analysed by checking if the distance between the data points is regularly spaced and one. From the manual: "For evaluation of consecutiveness, the time dimension is interpreted to be numeric, and the data are tested for being a regularly spaced sequence with distance 1 between the time periods for each individual (for each individual the time dimension can be interpreted as sequence t, t+1, t+2, ... where t is an integer)." So what is the best and most robust way to convert the year quarter time series?

hannes101 · Accepted Answer

I came up with a solution to the problem, which is sufficient for this purpose and is only applicable to this particular dataset, since it needs adjusting if a different time horizon is covered. I basically convert all quarters relative to the first quarter in the dataset and then just calculate integers for each quarter and use this as the time index.

library(lubridate)
dt[, Time.Index := (year(Calendar.Data.Year.and.Quarter)-2000)*4+quarter(Calendar.Data.Year.and.Quarter)]
pdt <- pdata.frame(dt , index = c("Global.Company.Key", "Time.Index"))
is.pconsecutive(pdt) # <- this then reports TRUE

It is a workaround, but not so bad I think.

Correct way to specify quarterly observations as the time index in the plm package

Answers (2)

Related Questions