PHP-spider.zip

Time: 2022-08-04 04:38:40
[File properties]

File name: PHP-spider.zip

File size: 169 KB

File format: ZIP

Last updated: 2022-08-04 04:38:40

Open source project

An extensible PHP web spider. Sample code:

    use VDB\Spider\Spider;
    use VDB\Spider\Discoverer\XPathExpressionDiscoverer;

    $spider = new Spider('http://www.oschina.net');

Features:

- supports two traversal algorithms: breadth-first and depth-first
- supports depth limiting and queue size limiting
- supports adding custom URI discovery logic, based on XPath, CSS selectors, or plain old PHP
- comes with a useful set of URI filters, such as domain limiting
- supports custom URI filters, both prefetch (URI) and postfetch (resource content)
- supports custom request handling logic
- comes with a useful set of persistence handlers (memory and file; Redis soon to follow)
- supports custom persistence handlers
- collects statistics about the crawl for reporting
- dispatches useful events, allowing developers to add even more custom behavior
- supports a politeness policy
- will soon come with many default discoverers: RSS, Atom, RDF, etc.
- will soon support multiple queueing mechanisms (file, memcache, redis)
- will eventually support distributed spidering with a central queue

Tags: PHP spider
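
To make the traversal features listed above more concrete, here is a minimal standalone sketch of breadth-first versus depth-first crawling with a depth limit, a queue size limit, XPath-based link discovery, and a prefetch domain filter. It does not use the VDB\Spider package or reproduce its API. The names `$pages`, `discoverLinks`, and `crawl`, and the `example.test` URIs, are hypothetical and exist only for this illustration. It assumes PHP 7.1 or later with the DOM extension enabled, and it reads pages from an in-memory array instead of the network.

```php
<?php
// Standalone sketch; none of these names come from the VDB\Spider package.

// Hypothetical in-memory "web": URI => HTML body, so no network access is needed.
$pages = [
    'http://example.test/'  => '<a href="http://example.test/a">A</a>'
                             . '<a href="http://example.test/b">B</a>'
                             . '<a href="http://other.test/x">X</a>',
    'http://example.test/a' => '<a href="http://example.test/c">C</a>',
    'http://example.test/b' => '<a href="http://example.test/d">D</a>',
    'http://example.test/c' => '',
    'http://example.test/d' => '',
];

// Discovery step: return the href of every node matched by an XPath expression.
function discoverLinks(string $html, string $xpath): array
{
    if ($html === '') {
        return []; // DOMDocument::loadHTML() rejects empty input
    }
    $doc = new DOMDocument();
    @$doc->loadHTML($html); // suppress warnings about sloppy markup
    $links = [];
    foreach ((new DOMXPath($doc))->query($xpath) as $node) {
        $links[] = $node->getAttribute('href');
    }
    return $links;
}

// Traversal with a prefetch domain filter, a depth limit, and a queue size limit.
function crawl(array $pages, string $seed, string $mode, int $maxDepth, int $maxQueueSize): array
{
    $host = parse_url($seed, PHP_URL_HOST);
    $frontier = [[$seed, 0]]; // URIs waiting to be fetched, each with its depth
    $seen = [$seed => true];  // every URI ever queued; its size is capped below
    $visited = [];

    while ($frontier) {
        // FIFO yields breadth-first order, LIFO yields depth-first order.
        [$uri, $depth] = $mode === 'bfs' ? array_shift($frontier) : array_pop($frontier);
        $visited[] = $uri;
        if ($depth >= $maxDepth) {
            continue; // depth limit reached: do not expand this page
        }
        foreach (discoverLinks($pages[$uri] ?? '', '//a[@href]') as $link) {
            if (isset($seen[$link]) || parse_url($link, PHP_URL_HOST) !== $host) {
                continue; // already queued, or rejected by the domain filter
            }
            if (count($seen) >= $maxQueueSize) {
                break; // queue size limit reached
            }
            $seen[$link] = true;
            $frontier[] = [$link, $depth + 1];
        }
    }
    return $visited;
}

print_r(crawl($pages, 'http://example.test/', 'bfs', 2, 10));
print_r(crawl($pages, 'http://example.test/', 'dfs', 2, 10));
```

With this sample data, the breadth-first call visits /, /a, /b, /c, /d, while the depth-first call visits /, /b, /d, /a, /c. The link to other.test is dropped by the domain filter in both cases. Lowering `$maxDepth` to 1 stops the crawl after /a and /b, and lowering `$maxQueueSize` to 2 means only the seed and one further URI are ever queued.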

