Reputation: 97
I'm looking for an example of running a Scrapy script via an HTTP request. I'm planning to send the URL that I need to crawl as a parameter, via the GET or POST method. How can I do that?
Upvotes: 0
Views: 367
Reputation: 53
Try something like this (note: it relies on the legacy, pre-1.0 Scrapy API, where scrapy.log and Crawler(settings) still exist):
from twisted.internet import reactor
from scrapy.crawler import Crawler
from scrapy import log, signals
from testspiders.spiders.followall import FollowAllSpider
from scrapy.utils.project import get_project_settings

spider = FollowAllSpider(domain='url.com')
settings = get_project_settings()
crawler = Crawler(settings)
# Stop the Twisted reactor once the spider finishes
crawler.signals.connect(reactor.stop, signal=signals.spider_closed)
crawler.configure()
crawler.crawl(spider)
crawler.start()
log.start()
reactor.run()  # blocks until the crawl is done
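The snippet above only runs the spider from a script; it does not expose it over HTTP as the question asks. One way to bridge the gap (a sketch of my own, not part of the answer) is a minimal stdlib HTTP server that reads a url query parameter and launches scrapy crawl in a subprocess. Spawning a fresh process per crawl also sidesteps the fact that a Twisted reactor cannot be restarted once stopped. The spider name myspider and its start_url argument are placeholder assumptions.

```python
import subprocess
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import urlparse, parse_qs


def build_crawl_command(url, spider="myspider"):
    # Build the "scrapy crawl" invocation; the spider is assumed to
    # accept the target URL via a "-a start_url=..." argument.
    return ["scrapy", "crawl", spider, "-a", "start_url=%s" % url]


class CrawlHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        qs = parse_qs(urlparse(self.path).query)
        url = qs.get("url", [None])[0]
        if not url:
            self.send_error(400, "missing ?url= parameter")
            return
        # Each crawl runs in its own process, so the reactor restriction
        # of the in-process approach above does not apply here.
        subprocess.Popen(build_crawl_command(url))
        self.send_response(202)
        self.end_headers()
        self.wfile.write(b"crawl scheduled for " + url.encode())


# To serve requests like GET /?url=http://url.com :
# HTTPServer(("localhost", 8000), CrawlHandler).serve_forever()
```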
Upvotes: 0
Reputation: 9185
You should use scrapyd (see the GitHub project page).
Once you are running scrapyd, you can use its API to schedule a crawl.
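For example, scrapyd exposes a schedule.json endpoint that accepts a form-encoded POST; extra fields are passed to the spider as arguments. A stdlib-only sketch, where the project and spider names are placeholders and scrapyd is assumed to be running on its default port 6800:

```python
from urllib.parse import urlencode
from urllib.request import urlopen


def schedule_payload(project, spider, **spider_args):
    # schedule.json expects form-encoded "project" and "spider" fields;
    # any extra keys are handed to the spider as -a arguments.
    params = {"project": project, "spider": spider}
    params.update(spider_args)
    return urlencode(params).encode()


def schedule_crawl(project, spider, base_url="http://localhost:6800", **spider_args):
    # POST to the schedule.json API of a running scrapyd instance.
    return urlopen(base_url + "/schedule.json",
                   data=schedule_payload(project, spider, **spider_args))


# Example (requires a running scrapyd with the project deployed):
# schedule_crawl("myproject", "followall", url="http://url.com")
```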
Upvotes: 3