Luke McDonald
Luke McDonald

Reputation: 21

What is the bucket in rgee?

I am trying to upload some large geometries to google earth engine.

This is the example code given in the ?sf_as_ee description:

# # 3. Upload large geometries to EE asset
# ee_Initialize(gcs = TRUE)
# assetId <- sprintf("%s/%s", ee_get_assethome(), 'toy_poly_gcs')
#  eex <- sf_as_ee(
#  x = toy_poly,
#  overwrite = TRUE,
#  assetId = assetId,
#  bucket = 'rgee_dev',
#  monitoring = FALSE,
#  via = 'gcs_to_asset'
#  )

I just need to know how to set bucket?

From description:

bucket - Character. Name of the bucket (GCS) to save intermediate files (ignore if via is not defined as "gcs_to_asset").

I think it is a local space for saving intermediate files, but I cannot get it to work with the folder I usually use for temporary files (bucket = 'tempfiles').

It is within my working directory, but is throwing this error:

Error in value[[3L]](cond) : The data bucket was not found.

File structure for reference: (on MacOS)

-R Project (working directory)

-- scripts

--- script

--tempfiles

I tried setting it to a google drive folder name as well thinking it might be asking for that, but it didn't work either. Any help will be much appreciated!

Thanks!

Upvotes: 1

Views: 254

Answers (1)

csaybar
csaybar

Reputation: 179

sf_as_ee(..., via = "gcs_to_asset") is a two-step function. Firstly, the geometry is uploaded to Google Cloud Storage (GCS) and after that using manifests upload is moved from GCS to Earth Engine.

To use sf_as_ee(..., via = "gcs_to_asset") is necessary to save a service account key (SAK) with privileges to write/read your GCS resources. Upload files via Google Drive is not possible.

library(rgee)
library(googleCloudStorageR)
ee_Initialize("csaybar", gcs = TRUE)
#> ── rgee 1.0.7 ─────────────────────────────────────── earthengine-api 0.1.245 ── 
#>  ✓ email: csaybar 
#>  ✓ GCS credentials: NOT FOUND
#>  ✓ Initializing Google Earth Engine:  DONE!
#>  ✓ Earth Engine user: users/csaybar 
#> ────────────────────────────────────────────────────────────────────────────────
#> Unable to find a service account key (SAK) file in: /home/csaybar/.config/earthengine//csaybar
#> Please, download and save the key manually on the path mentioned
#> before. A compressible tutorial to obtain their SAK file is available in:
#> > https://github.com/r-spatial/rgee/tree/help/gcs
#> > https://cloud.google.com/iam/docs/creating-managing-service-account-keys
#> > https://console.cloud.google.com/apis/credentials/serviceaccountkey
#> Until you do not save a SKA file, the following functions will not work:
#> - rgee::ee_gcs_to_local()
#> - ee_as_raster(..., via = "gcs")
#> - ee_as_stars(..., via = "gcs")
#> - ee_as_sf(..., via = "gcs")
#> - sf_as_ee(..., via = "gcs_to_asset")
#> - gcs_to_ee_image
#> - raster_as_ee
#> - local_to_gcs
#> - stars_as_ee

This guide explains how to generate the SAK, then, you need to save it in:

rgee::ee_get_earthengine_path()
#> [1] "/home/csaybar/.config/earthengine//csaybar/"

This specific error (Error in value[[3L]](cond): The data bucket was not found.) appears because googleCloudStorageR is not able to recognize your SAK (and therefore your GCS bucket names).

library(googleCloudStorageR)
gcs_get_bucket('rgee_dev') # I have privileges to modify the bucket (rgee will work!)
#> ==Google Cloud Storage Bucket==
#> Bucket:          rgee_dev 
#> Project Number:  XXXXXXXXXXXXXX 
#> Location:        US 
#> Class:           STANDARD 
#> Created:         2020-02-08 01:00:23 
#> Updated:         2020-12-10 15:00:47 
#> Meta-generation: 8 
#> eTag:            XXX

gcs_get_bucket('tempfiles') # I do not have privileges privileges to modify the bucket (rgee will not work!)
#> ℹ 2020-12-23 16:16:22 > Request Status Code:  403
#> Error: API returned: [email protected] does not
#> have storage.buckets.get access to the Google Cloud Storage bucket.

Upvotes: 1

Related Questions