Reputation: 1293
I have a problem that is very similar to this SO post:
Geographic / geospatial distance between 2 lists of lat/lon points (coordinates)
Here is an edited example set of coordinates that illustrates my case:
require(tidyverse)
list1 <- data.frame(longitude = c(72, 74, 76, 78, 79, 82),
latitude = c(20.5, 19, 19.5, 20, 22, 21),
area = "A")
list2 <- data.frame(longitude = c(71, 73, 75, 77, 79, 78.5, 72),
latitude = c(21.5, 22, 20.5, 23, 23.5, 24, 24),
area = "B")
df <- bind_rows(list1, list2)
ggplot(data = df) +
geom_point(aes(x = longitude, y = latitude, color = area)) +
geom_line(data = list1, aes(x = longitude, y = latitude, color = area)) +
geom_line(data = list1[c(2,6),], aes(x = longitude, y = latitude, color = area)) +
geom_line(data = list1[c(1,4),], aes(x = longitude, y = latitude, color = area)) +
geom_line(data = list2[c(1,7),], aes(x = longitude, y = latitude, color = area)) +
geom_line(data = list2[c(7,6),], aes(x = longitude, y = latitude, color = area)) +
geom_line(data = list2[c(6,5),], aes(x = longitude, y = latitude, color = area)) +
geom_line(data = list2[c(5,3),], aes(x = longitude, y = latitude, color = area)) +
geom_line(data = list2[c(3,1),], aes(x = longitude, y = latitude, color = area))
So I need to calculate smallest distances between two lists of coordinate points. I have been able to make this work, but I have noticed that I need something more efficient - the data is just too large.
One possibility that I have entertained, is to form non-overlapping polygons of these areas and calculate the distance from one set of points to the neighboring polygon. Is there a way to form these polygons? Convex hulls is not an options since the areas are pretty ragged.
Another option could be to form a line that runs between the areas.
Edit: I added some lines in the figure in order to illustrate the polygons.
Upvotes: 1
Views: 4439
Reputation: 27802
Perhaps this is what you are looking for?
#load libraries
library(dplyr)
library(sf)
#create row_id's, and make it a simple (spatial) feature
list1.sf <- list1 %>%
mutate( id = row_number() ) %>%
st_as_sf( coords = c("longitude", "latitude"), crs = 4326 )
list2.sf <- list2 %>%
mutate( id = row_number() ) %>%
st_as_sf( coords = c("longitude", "latitude"), crs = 4326 )
#find nearest points in list2 for each id in list1, and as a bonus, calculate the distance to this point
list1.sf %>%
dplyr::group_by( id ) %>%
dplyr::mutate( np = sf::st_nearest_feature( geometry, list2.sf ),
dist_np = as.numeric( sf::st_distance( geometry, list2.sf[np,] ) ) )
# Simple feature collection with 6 features and 4 fields
# geometry type: POINT
# dimension: XY
# bbox: xmin: 72 ymin: 19 xmax: 82 ymax: 22
# epsg (SRID): 4326
# proj4string: +proj=longlat +datum=WGS84 +no_defs
# # A tibble: 6 x 5
# # Groups: id [6]
# area id geometry np dist_np
# * <fct> <int> <POINT [°]> <int> <dbl>
# 1 A 1 (72 20.5) 1 151880.
# 2 A 2 (74 19) 3 196361.
# 3 A 3 (76 19.5) 3 152335.
# 4 A 4 (78 20) 3 318287.
# 5 A 5 (79 22) 5 166111.
# 6 A 6 (82 21) 5 415019.
Upvotes: 3
Reputation: 49
You can calculate the euclidean distance I change a bit the data set. I drop last column.
x1 <- data.frame(longitude = c(72, 74, 76, 78, 79, 82),
latitude = c(20.5, 19, 19.5, 20, 22, 21))
x2 <- data.frame(longitude = c(71, 73, 75, 77, 79, 78.5, 72),
latitude = c(21.5, 22, 20.5, 23, 23.5, 24, 24))
euc.dist <- function(x1, x2) sqrt(sum((x1 - x2) ^ 2))
dist <- NULL
for(i in 1:nrow(x1)) dist[i] <- euc.dist(x1[i,],x2[i,])
dist
Upvotes: 0