Hesam Norin

Reputation: 87

Unable to Retrieve Environment Variables in Scrapy's Settings.py for Scrapyd Deployment

I'm new to Scrapy and currently attempting to deploy my spider to a Scrapyd server. However, I'm encountering an issue where I can't seem to use os.getenv within my Scrapy settings file.

This is how I'm attempting to set up my settings.py:

# settings.py
import os
from dotenv import load_dotenv

load_dotenv()

SENTRY_DSN = os.getenv("SENTRY_DSN")
MONGO_URI = os.getenv("MONGO_URI")

In my spider code, I'm trying to access these variables like this:

import pymongo
from pymongo.collection import Collection

def get_collection(self) -> Collection:
    client = pymongo.MongoClient(self.settings.get("MONGO_URI"))
    database = client["jobs"]
    collection = database[self.name]
    return collection

I'm using scrapyd-client to deploy my spider to my servers, but it seems I'm doing something wrong, as I can't access these environment variables in my settings file. This is the full response from the server:

{"node_name": "scrapyd-nd1-c68b9c799-cmwjd", "status": "error", "message": "Traceback (most recent call last):\n  File \"<frozen runpy>\", line 198, in _run_module_as_main\n  File \"<frozen runpy>\", line 88, in _run_code\n  File \"/usr/local/lib/python3.11/dist-packages/scrapyd/runner.py\", line 49, in <module>\n    main()\n  File \"/usr/local/lib/python3.11/dist-packages/scrapyd/runner.py\", line 45, in main\n    execute()\n  File \"/usr/local/lib/python3.11/dist-packages/scrapy/cmdline.py\", line 128, in execute\n    settings = get_project_settings()\n               ^^^^^^^^^^^^^^^^^^^^^^\n  File \"/usr/local/lib/python3.11/dist-packages/scrapy/utils/project.py\", line 71, in get_project_settings\n    settings.setmodule(settings_module_path, priority=\"project\")\n  File \"/usr/local/lib/python3.11/dist-packages/scrapy/settings/__init__.py\", line 383, in setmodule\n    module = import_module(module)\n             ^^^^^^^^^^^^^^^^^^^^^\n  File \"/usr/lib/python3.11/importlib/__init__.py\", line 126, in import_module\n    return _bootstrap._gcd_import(name[level:], package, level)\n           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^\n  File \"<frozen importlib._bootstrap>\", line 1206, in _gcd_import\n  File \"<frozen importlib._bootstrap>\", line 1178, in _find_and_load\n  File \"<frozen importlib._bootstrap>\", line 1149, in _find_and_load_unlocked\n  File \"<frozen importlib._bootstrap>\", line 690, in _load_unlocked\n  File \"<frozen importlib._bootstrap_external>\", line 940, in exec_module\n  File \"<frozen importlib._bootstrap>\", line 241, in _call_with_frames_removed\n  File \"/tmp/jobFlow-1695549937-0y8mcj3h.egg/jobFlow/settings.py\", line 4, in <module>\n  File \"/tmp/jobFlow-1695549937-0y8mcj3h.egg/dotenv/main.py\", line 336, in load_dotenv\n  File \"/tmp/jobFlow-1695549937-0y8mcj3h.egg/dotenv/main.py\", line 300, in find_dotenv\n  File \"/tmp/jobFlow-1695549937-0y8mcj3h.egg/dotenv/main.py\", line 257, in _walk_to_root\nOSError: Starting path not found\n"}

This is the full command I'm running:

scrapyd-deploy --include-dependencies 

Any ideas on how I can solve this?

Upvotes: 0

Views: 426

Answers (1)

Santi1999

Reputation: 33

I think I figured out what is happening. scrapyd runs your project from a packed .egg file, and python-dotenv's find_dotenv() walks up the directory tree starting from the calling module's file path; inside the egg that path does not exist on disk, hence OSError: Starting path not found. If you can refrain from calling load_dotenv() in code that scrapyd imports, you should be able to execute spiders via curl.

In your settings.py, remove the dotenv calls and keep import os:

import os
# from dotenv import load_dotenv
# load_dotenv()
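
If you would rather keep .env support for local development, a possible workaround (untested with scrapyd on my end) is to make dotenv start its search from the current working directory, which always exists, instead of the caller's file path; python-dotenv's find_dotenv accepts a usecwd flag for exactly this:

import os

try:
    # usecwd=True starts the .env lookup from os.getcwd(), which exists
    # even when this module is imported from a packed .egg.
    from dotenv import load_dotenv, find_dotenv
    load_dotenv(find_dotenv(usecwd=True))
except (ImportError, OSError):
    # dotenv not installed, or no usable starting path: fall back to
    # whatever is already set in the process environment.
    pass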

Then read your secrets from the process environment in settings.py like this:

SENTRY_DSN = os.environ.get('SENTRY_DSN')
MONGO_URI = os.environ.get('MONGO_URI')

Once your settings.py is set up with the environment variables you need, use get_project_settings from scrapy.utils.project to read the values from your settings.py file:

from scrapy.utils.project import get_project_settings
SENTRY_DSN = get_project_settings().get('SENTRY_DSN')
MONGO_URI = get_project_settings().get('MONGO_URI')
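
Note that inside a running spider you don't need get_project_settings at all: self.settings is already populated by the crawler, so a method like the one in your question should work unchanged once dotenv is out of the way. A minimal sketch (the "jobs" database name is taken from the question, the spider name is a placeholder):

import pymongo
from pymongo.collection import Collection
import scrapy


class JobsSpider(scrapy.Spider):
    name = "jobs"  # placeholder spider name

    def get_collection(self) -> Collection:
        # self.settings includes the values settings.py read from the
        # environment, e.g. MONGO_URI.
        client = pymongo.MongoClient(self.settings.get("MONGO_URI"))
        return client["jobs"][self.name]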

Then run scrapyd in the terminal and deploy; default is the deploy target defined in your scrapy.cfg file:

scrapyd
scrapyd-deploy default

Doing it this way allowed me to run scrapyd-deploy default successfully!
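
For reference, the default deploy target comes from scrapy.cfg; a minimal example might look like this (the project name is taken from your traceback, the URL is scrapyd's default):

# scrapy.cfg
[settings]
default = jobFlow.settings

[deploy]
url = http://localhost:6800/
project = jobFlow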

curl http://localhost:6800/schedule.json -d project=<project> -d spider=<spider_name>
output: {"node_name": "Name-MBP", "status": "ok", "jobid": "bd605850...."}

This is my error message; it's similar to yours, if not identical. Hopefully you find this useful, or maybe it helps someone else experiencing something similar!

{"node_name": "NAME-MBP", "status": "error", "message": "Traceback (most recent call last):\n  File \"<frozen runpy>\", line 198, in _run_module_as_main\n  File \"<frozen runpy>\", line 88, in _run_code\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapyd/runner.py\", line 38, in <module>\n    main()\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapyd/runner.py\", line 34, in main\n    execute()\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapy/cmdline.py\", line 160, in execute\n    cmd.crawler_process = CrawlerProcess(settings)\n                          ^^^^^^^^^^^^^^^^^^^^^^^^\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapy/crawler.py\", line 357, in __init__\n    super().__init__(settings)\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapy/crawler.py\", line 227, in __init__\n    self.spider_loader = self._get_spider_loader(settings)\n                         ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapy/crawler.py\", line 221, in _get_spider_loader\n    return loader_cls.from_settings(settings.frozencopy())\n           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapy/spiderloader.py\", line 79, in from_settings\n    return cls(settings)\n           ^^^^^^^^^^^^^\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapy/spiderloader.py\", line 34, in __init__\n    self._load_all_spiders()\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapy/spiderloader.py\", line 63, in _load_all_spiders\n    for module in walk_modules(name):\n                  ^^^^^^^^^^^^^^^^^^\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/scrapy/utils/misc.py\", line 106, in walk_modules\n    submod = import_module(fullpath)\n             ^^^^^^^^^^^^^^^^^^^^^^^\n  File \"/usr/local/Cellar/[email protected]/3.11.7_1/Frameworks/Python.framework/Versions/3.11/lib/python3.11/importlib/__init__.py\", line 126, in import_module\n    return _bootstrap._gcd_import(name[level:], package, level)\n           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^\n  File \"<frozen importlib._bootstrap>\", line 1204, in _gcd_import\n  File \"<frozen importlib._bootstrap>\", line 1176, in _find_and_load\n  File \"<frozen importlib._bootstrap>\", line 1147, in _find_and_load_unlocked\n  File \"<frozen importlib._bootstrap>\", line 690, in _load_unlocked\n  File \"<frozen importlib._bootstrap_external>\", line 940, in exec_module\n  File \"<frozen importlib._bootstrap>\", line 241, in _call_with_frames_removed\n  File \"/Users/name/Coding/proj/legislative/test/test/eggs/proj/num.egg/test/spiders/spider.py\", line 4, in <module>\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/dotenv/main.py\", line 336, in load_dotenv\n    dotenv_path = find_dotenv()\n                  ^^^^^^^^^^^^^\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/dotenv/main.py\", line 300, in find_dotenv\n    for dirname in _walk_to_root(path):\n  File \"/Users/name/Coding/proj/lib/python3.11/site-packages/dotenv/main.py\", line 257, in _walk_to_root\n    raise IOError('Starting path not found')\nOSError: Starting path not found\n"}

Upvotes: 0
