Tomas Greif
Tomas Greif

Reputation: 22623

Calculate distance from GPS data

I do have data frame with gps data and I need to calculate distance between two rows (current and previous)

            id                 time       lat      long heartrate altitude
1  20130424.tcx 2013-04-24T04:53:22Z 50.024818 14.522724       151     <NA>
2  20130424.tcx 2013-04-24T04:53:26Z 50.024818 14.522724        96     <NA>
3  20130424.tcx 2013-04-24T04:53:30Z 50.024818 14.522724       104     <NA>
4  20130424.tcx 2013-04-24T04:53:34Z 50.024818 14.522724       107     <NA>
5  20130424.tcx 2013-04-24T04:53:38Z 50.024818 14.522724       108     <NA>
6  20130424.tcx 2013-04-24T04:53:42Z 50.024818 14.522724       112    372.0
7  20130424.tcx 2013-04-24T04:53:46Z 50.024818 14.522724       151    372.0
8  20130424.tcx 2013-04-24T04:53:47Z 50.024677 14.522874       151    356.0
9  20130424.tcx 2013-04-24T04:53:50Z 50.024677 14.522874       118    356.0
10 20130424.tcx 2013-04-24T04:53:54Z 50.024677 14.522874       118    356.0
11 20130424.tcx 2013-04-24T04:53:58Z 50.024464 14.522917       147    358.0
12 20130424.tcx 2013-04-24T04:54:02Z 50.024464 14.522917       144    358.0
13 20130424.tcx 2013-04-24T04:54:06Z 50.024269 14.522853       150    367.0
14 20130424.tcx 2013-04-24T04:54:10Z 50.024269 14.522853       152    367.0
15 20130424.tcx 2013-04-24T04:54:13Z 50.024002 14.522874       152    380.0

I am able to join data to itself and get previous row for each row (there might be easier solution):

library(sqldf)
mydft    = mydf[-nrow(mydf),] 
mydft$id  = seq_along(mydft$id) +1
mydf$id  = seq_along(mydf$id)
mydft2 <- sqldf("select a.*, b.lat as lat2, b.long as long2 from mydf a left join mydft b using (id)")

How can I now calculate distance from columns lat, long, lat2, long2 ? I've tried approach described here:

R <- 6371 # Earth mean radius [km]
mydft2$delta.long <- (mydft2$long2 - mydft2$long)
mydft2$delta.lat <- (mydft2$lat2 - mydft2$lat)
mydft2$a <- sin(mydft2$delta.lat/2)^2 + cos(mydft2$lat) * cos(mydft2$lat2) * sin(mydft2$delta.long/2)^2
mydft2$c <- 2 * asin(min(1,sqrt(mydft2$a)))
mydft2$d = R * c

But this retuns only list NAs.

Upvotes: 1

Views: 1967

Answers (1)

dickoa
dickoa

Reputation: 18437

One way to simply achieve that is to use R facilities for spatial analysis. For this example we can use the spDistN1 function in the excellent sp package.

The first step is to convert your data into SpatialPoints (or SpatialPointsDataFrame), I assume in this example that your points are in geographic projection (longlat with WSG84 datum)

txt <- "            id                 time       lat      long heartrate altitude
20130424.tcx 2013-04-24T04:53:22Z 50.024818 14.522724       151     <NA>
20130424.tcx 2013-04-24T04:53:26Z 50.024818 14.522724        96     <NA>
20130424.tcx 2013-04-24T04:53:30Z 50.024818 14.522724       104     <NA>
20130424.tcx 2013-04-24T04:53:34Z 50.024818 14.522724       107     <NA>
20130424.tcx 2013-04-24T04:53:38Z 50.024818 14.522724       108     <NA>
20130424.tcx 2013-04-24T04:53:42Z 50.024818 14.522724       112    372.0
20130424.tcx 2013-04-24T04:53:46Z 50.024818 14.522724       151    372.0
20130424.tcx 2013-04-24T04:53:47Z 50.024677 14.522874       151    356.0
20130424.tcx 2013-04-24T04:53:50Z 50.024677 14.522874       118    356.0
20130424.tcx 2013-04-24T04:53:54Z 50.024677 14.522874       118    356.0
20130424.tcx 2013-04-24T04:53:58Z 50.024464 14.522917       147    358.0
20130424.tcx 2013-04-24T04:54:02Z 50.024464 14.522917       144    358.0
20130424.tcx 2013-04-24T04:54:06Z 50.024269 14.522853       150    367.0
20130424.tcx 2013-04-24T04:54:10Z 50.024269 14.522853       152    367.0
20130424.tcx 2013-04-24T04:54:13Z 50.024002 14.522874       152    380.0"

gpsdat <- read.table(text = txt, header = TRUE, na.strings = "<NA>")
str(gpsdat)

require(sp)
coordinates(gpsdat) <- ~ long + lat
proj4string(gpsdat) <- CRS("+proj=longlat +datum=WGS84")

In a second step, we can now use the spDistN1 function

sapply(seq_along(gpsdat[-1, ]), function(i)
       spDistsN1(pts = gpsdat[i, ], pt = gpsdat[i+1, ], longlat = TRUE))

 [1] 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000
 [7] 0.019004 0.000000 0.000000 0.023875 0.000000 0.022155
[13] 0.000000 0.029716

Depending on the type of GPS data we use, you can directly read these type of data into R using the readOGR function (package rgdal)

Upvotes: 5

Related Questions