[dev] [dexonline/dexonline] b4c669: Handle some crawler errors more graciously.

GitHub noreply at github.com
Mon Sep 18 16:34:56 EEST 2017


  Branch: refs/heads/master
  Home:   https://github.com/dexonline/dexonline
  Commit: b4c66921764a050af1ff2d858639f030b5ef3279
      https://github.com/dexonline/dexonline/commit/b4c66921764a050af1ff2d858639f030b5ef3279
  Author: Cătălin Frâncu <cata at francu.com>
  Date:   2017-09-18 (Mon, 18 Sep 2017)

  Changed paths:
    M app/crawler/crawler.php
    M phplib/models/CrawlerUrl.php

  Log Message:
  -----------
  Handle some crawler errors more graciously.


  Commit: 6a4896d5d61e2676c401c7ba3fc758f8597fcf12
      https://github.com/dexonline/dexonline/commit/6a4896d5d61e2676c401c7ba3fc758f8597fcf12
  Author: Cătălin Frâncu <cata at francu.com>
  Date:   2017-09-18 (Mon, 18 Sep 2017)

  Changed paths:
    M app/crawler/crawler.php
    A patches/00231.sql
    A phplib/models/CrawlerIgnoredUrl.php

  Log Message:
  -----------
  Learn to ignore URLs that we repeatedly fail to parse.


Compare: https://github.com/dexonline/dexonline/compare/afe544436bdd...6a4896d5d61e


More information about the Dev mailing list