国产探花免费观看_亚洲丰满少妇自慰呻吟_97日韩有码在线_资源在线日韩欧美_一区二区精品毛片,辰东完美世界有声小说,欢乐颂第一季,yy玄幻小说排行榜完本

首頁 > 編程 > Python > 正文

Python實現在線程里運行scrapy的方法

2020-02-23 00:34:55
字體:
來源:轉載
供稿:網友

本文實例講述了Python實現在線程里運行scrapy的方法。分享給大家供大家參考。具體如下:

如果你希望在一個寫好的程序里調用scrapy,就可以通過下面的代碼,讓scrapy運行在一個線程里。

"""Code to run Scrapy crawler in a thread - works on Scrapy 0.8"""import threading, Queuefrom twisted.internet import reactorfrom scrapy.xlib.pydispatch import dispatcherfrom scrapy.core.manager import scrapymanagerfrom scrapy.core.engine import scrapyenginefrom scrapy.core import signalsclass CrawlerThread(threading.Thread):  def __init__(self):    threading.Thread.__init__(self)    self.running = False  def run(self):    self.running = True    scrapymanager.configure(control_reactor=False)    scrapymanager.start()    reactor.run(installSignalHandlers=False)  def crawl(self, *args):    if not self.running:      raise RuntimeError("CrawlerThread not running")    self._call_and_block_until_signal(signals.spider_closed, /      scrapymanager.crawl, *args)  def stop(self):    reactor.callFromThread(scrapyengine.stop)  def _call_and_block_until_signal(self, signal, f, *a, **kw):    q = Queue.Queue()    def unblock():      q.put(None)    dispatcher.connect(unblock, signal=signal)    reactor.callFromThread(f, *a, **kw)    q.get()# Usage example below: import osos.environ.setdefault('SCRAPY_SETTINGS_MODULE', 'myproject.settings')from scrapy.xlib.pydispatch import dispatcherfrom scrapy.core import signalsfrom scrapy.conf import settingsfrom scrapy.crawler import CrawlerThreadsettings.overrides['LOG_ENABLED'] = False # avoid log noisedef item_passed(item):  print "Just scraped item:", itemdispatcher.connect(item_passed, signal=signals.item_passed)crawler = CrawlerThread()print "Starting crawler thread..."crawler.start()print "Crawling somedomain.com...."crawler.crawl('somedomain.com) # blocking callprint "Crawling anotherdomain.com..."crawler.crawl('anotherdomain.com') # blocking callprint "Stopping crawler thread..."crawler.stop()

希望本文所述對大家的Python程序設計有所幫助。

發表評論 共有條評論
用戶名: 密碼:
驗證碼: 匿名發表
主站蜘蛛池模板: 台州市| 黑山县| 龙游县| 开远市| 千阳县| 衡阳县| 吉木萨尔县| 兴隆县| 崇礼县| 墨竹工卡县| 临泉县| 晋州市| 灌云县| 陕西省| 云梦县| 镇宁| 株洲县| 德格县| 阿合奇县| 准格尔旗| 旌德县| 巴楚县| 讷河市| 库尔勒市| 亳州市| 沅江市| 津市市| 延寿县| 奉节县| 嘉峪关市| 揭西县| 乐亭县| 重庆市| 邵阳县| 志丹县| 昆明市| 禹州市| 吴旗县| 龙泉市| 衡阳市| 沈阳市|