[dev] [dexonline/dexonline] b4c669: Handle some crawler errors more graciously.
GitHub
noreply at github.com
Mon Sep 18 16:34:56 EEST 2017
Branch: refs/heads/master
Home: https://github.com/dexonline/dexonline
Commit: b4c66921764a050af1ff2d858639f030b5ef3279
https://github.com/dexonline/dexonline/commit/b4c66921764a050af1ff2d858639f030b5ef3279
Author: Cătălin Frâncu <cata at francu.com>
Date: 2017-09-18 (Mon, 18 Sep 2017)
Changed paths:
M app/crawler/crawler.php
M phplib/models/CrawlerUrl.php
Log Message:
-----------
Handle some crawler errors more graciously.
Commit: 6a4896d5d61e2676c401c7ba3fc758f8597fcf12
https://github.com/dexonline/dexonline/commit/6a4896d5d61e2676c401c7ba3fc758f8597fcf12
Author: Cătălin Frâncu <cata at francu.com>
Date: 2017-09-18 (Mon, 18 Sep 2017)
Changed paths:
M app/crawler/crawler.php
A patches/00231.sql
A phplib/models/CrawlerIgnoredUrl.php
Log Message:
-----------
Learn to ignore URLs that we repeatedly fail to parse.
Compare: https://github.com/dexonline/dexonline/compare/afe544436bdd...6a4896d5d61e
More information about the Dev
mailing list