Why can't my GCP script/notebook find my file?

Question

I have a working script that finds the data file when it is in the same directory as the script. This works both on my local machine and Google Colab.

When I try it on GCP though it can not find the file. I tried 3 approaches:

PySpark Notebook:

Upload the .ipynb file which includes a wget command. This downloads the file without error but I am unsure where it saves it to and the script can not find the file either (I assume because I am telling it that the file is in the same directory and pressumably using wget on GCP saves it somewhere else by default.)

PySpark with bucket:

I did the same as the PySpark notebook above but first I uploaded the dataset to the bucket and then used the two links provided in the file details when you click the file name inside the bucket on the console (neither worked). I would like to avoid this though as wget is much faster then downloading on my slow wifi then reuploading to the bucket through the console.

GCP SSH:

Create cluster
Access VM through SSH.
Upload .py file using the cog icon
wget the dataset and move both into the same folder
Run script using python gcp.py

Just gives me an error saying file not found.

Thanks.

Why can't my GCP script/notebook find my file?

Answers (1)

Related Questions

Why can&#39;t my GCP script/notebook find my file?

Answers (1)

Related Questions

Why can't my GCP script/notebook find my file?