当蜘蛛运行时,每次发送一次Scrapy电子邮件



我试图在gmail中发送电子邮件时,蜘蛛完成抓取页面..当我定义函数send_mail并像下面这样传递它时,在日志中,它说send_mail("一些消息","Scraper报告")名称send_mail没有定义。当蜘蛛完成抓取时,我如何发送gmail。当我在defparse (self,response)方法中传递send_mail函数时,它试图阻止我的gmail,因为循环抓取。

 class ekantipurSpider(XMLFeedSpider):
    name= "ekantipur"
    allowed_domains = ["ekantipur.com"]
    start_urls = [
    'http://www.ekantipur.com/archive/'
  ]

send_mail("some message", "Scraper Report")
        def parse(self, response):
          hxs = HtmlXPathSelector(response) # The xPath selector
          titles=hxs.select('//div[@id = "archive-content-wrapper"]//ul/li')
          items = []
          for titles in titles:
           item = NewsItem()
           item['title']=titles.select('h6/a/text()').extract()[0]
           item['link']=titles.select('h6/a/@href').extract()[0]
           item['description']=titles.select('p/text()').extract()[0]
           item = Request(item['link'],meta={'item':item},callback=self.parse_detail)
           items.append(item)

        return items

     def parse_detail(self,response):
      item = response.meta['item']
      sel = HtmlXPathSelector(response)
      detail = sel.select('//div[@class = "main_wrapper_left"]')
      item['details'] = escape(''.join(detail.select('p/text()').extract()))
      locationDate = item['details'].split(':',1)[0]
      item['location']= locationDate.split(",",1)[0]
      item['published_date'] =    escape(''.join(detail.select('p[last()]/text()').extract()))

      return item

def send_mail(self, message, title):
  print "Sending mail..........."
  gmailUser = 'manthali2014@gmail.com'
  gmailPassword = 'rameshkc8  '
  recipient = 'kcramesh8@gmail.com'
 msg = MIMEMultipart()
 msg['From'] = gmailUser
 msg['To'] = recipient
 msg['Subject'] = title
 msg.attach(MIMEText(message))
 mailServer = smtplib.SMTP('smtp.gmail.com', 587)
 mailServer.ehlo()
 mailServer.starttls()
 mailServer.ehlo()
 mailServer.login(gmailUser, gmailPassword)
 mailServer.sendmail(gmailUser, recipient, msg.as_string())
 mailServer.close()
 print "Mail sent"

您需要添加self.来发送邮件(self.send_mail(args)),如果它们在同一类中。如果不是,则需要在函数定义后调用send_mail()。

相关内容

最新更新