Reputation: 45
Im currently working with 48 currencies in R and want to merge or override two datasets.
For all the currencies, I have data from 1980 to 2017 in DF1. For 16 of them I also have data from 1970/1971 to 2017 in DF2. What I want to do is to put the 1970-1980 part from DF2 on top of DF1. I guess I would achieve the same result if I managed to merge DF1 into DF2 for the cells where they both have values (so DF1 overrides DF2). However, the starting dates are not exactly equal for all currencies, so I can't just hardcode it.
Here is an example to show you what it looks like:
Date
is the time variable (monthly data). DF1 would be corresponding to the 1980-2017 data and DF2 would correspond to the 1970-2017 data. My objective is to override the values from DF1 into DF2 based on both column and row ID. DF3 is the desired output.
DF1=data.frame(matrix(data=c(c(4:9),rnorm(12)),6,3))
DF2=data.frame(matrix(data=c(c(1:12),rnorm(36)),12,4))
names(DF1)=c("Date","Currency1","Currency3")
names(DF2)=c("Date","Currency1","Currency2","Currency3")
> DF1
Date Currency1 Currency3
1 4 -0.8356286 0.5757814
2 5 1.5952808 -0.3053884
3 6 0.3295078 1.5117812
4 7 -0.8204684 0.3898432
5 8 0.4874291 -0.6212406
6 9 0.7383247 -2.2146999
> DF2
Date Currency1 Currency2 Currency3
1 1 1.12493092 -0.15579551 1.1000254
2 2 -0.04493361 -1.47075238 0.7631757
3 3 -0.01619026 -0.47815006 -0.1645236
4 4 0.94383621 0.41794156 -0.2533617
5 5 0.82122120 1.35867955 0.6969634
6 6 0.59390132 -0.10278773 0.5566632
7 7 0.91897737 0.38767161 -0.6887557
8 8 0.78213630 -0.05380504 -0.7074952
9 9 0.07456498 -1.37705956 0.3645820
10 10 -1.98935170 -0.41499456 0.7685329
11 11 0.61982575 -0.39428995 -0.1123462
12 12 -0.05612874 -0.05931340 0.8811077
> DF3
Date Currency1 Currency2 Currency3
1 1 1.12493092 -0.15579551 1.1000254
2 2 -0.04493361 -1.47075238 0.7631757
3 3 -0.01619026 -0.47815006 -0.1645236
4 4 -0.83562861 0.41794156 0.5757814
5 5 1.59528080 1.35867955 -0.3053884
6 6 0.32950777 -0.10278773 1.5117812
7 7 -0.82046838 0.38767161 0.3898432
8 8 0.48742905 -0.05380504 -0.6212406
9 9 0.73832471 -1.37705956 -2.2146999
10 10 -1.98935170 -0.41499456 0.7685329
11 11 0.61982575 -0.39428995 -0.1123462
12 12 -0.05612874 -0.05931340 0.8811077
Upvotes: 1
Views: 728
Reputation: 7153
You can also force a row-bind of the two dataframe using plyr:ldply
and then group_by
and summarise_each
by date (with dplyr
package):
df <- ldply(list(DF1, DF2))
sums <- function(x) sum(x, na.rm=T)
df %>% group_by(Date) %>% summarise_each(funs(sums))
# A tibble: 12 × 4
Date Currency1 Currency3 Currency2
<dbl> <dbl> <dbl> <dbl>
1 1 -0.62124058 -0.3942900 0.61982575
2 2 -2.21469989 -0.0593134 -0.05612874
3 3 1.12493092 1.1000254 -0.15579551
4 4 -0.67138742 1.2506048 -1.47075238
5 5 0.16745306 0.5738011 -0.47815006
6 6 0.10820760 0.3224197 0.41794156
7 7 2.41650200 0.3915750 1.35867955
8 8 0.92340909 2.0684444 -0.10278773
9 9 0.09850899 -0.2989125 0.38767161
10 10 0.78213630 -0.7074952 -0.05380504
11 11 0.07456498 0.3645820 -1.37705956
12 12 -1.98935170 0.7685329 -0.41499456
Upvotes: 2
Reputation: 886968
We can use a join
from data.table
library(data.table)
DF3 <- copy(DF2)
nm1 <- names(DF1)[-1]
setDT(DF3)[DF1, (nm1) := mget(paste0("i.", nm1)), on = .(Date)]
DF3
# Date Currency1 Currency2 Currency3
# 1: 1 1.12493092 -0.15579551 1.1000254
# 2: 2 -0.04493361 -1.47075238 0.7631757
# 3: 3 -0.01619026 -0.47815006 -0.1645236
# 4: 4 -0.83562860 0.41794156 0.5757814
# 5: 5 1.59528080 1.35867955 -0.3053884
# 6: 6 0.32950780 -0.10278773 1.5117812
# 7: 7 -0.82046840 0.38767161 0.3898432
# 8: 8 0.48742910 -0.05380504 -0.6212406
# 9: 9 0.73832470 -1.37705956 -2.2146999
#10: 10 -1.98935170 -0.41499456 0.7685329
#11: 11 0.61982575 -0.39428995 -0.1123462
#12: 12 -0.05612874 -0.05931340 0.8811077
Upvotes: 2