虽然WebCollector封装的功能很强大,使用也简单,但是因为WebCollector集成的有selenium,selenium本身依赖了很多jar包,而且不同版本之间区别较大,无语的是WebCollector作者只提供了一个selenium-java 的maven下载实例,而实际上WebCollector还依赖selenium的一些驱动jar等等,而且必须引入WebCollector对应的版本,而作者根本没有说需要引入哪个版本,只是在GitHub上给了一个jar包压缩包,这里面的只是很少的一部分jar包,实际上还需要很多的jar包,经过昨天一上午的努力把所有的jar包都从mvnrespository.org上面找全了,在此对WebCollector作者不提供WebCollector依赖的所有jar包,进其讨论群还要交10块钱的行为喷一下。
然后在这里给出我的时间成果,如果想使用WebCollector maven项目pom文件如下
pom.xml
<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd"> <modelVersion>4.0.0</modelVersion> <groupId>top.wys</groupId> <artifactId>WebCollector_Maven</artifactId> <packaging>war</packaging> <version>0.0.1-SNAPSHOT</version> <name>WebCollector_Maven Maven Webapp</name> <url>http://maven.apache.org</url> <!- 郑明亮 2017年2月22日11:18:39 --> <properties> <spring.version>4.3.6.RELEASE</spring.version> </properties> <dependencies> <dependency> <groupId>org.springframework</groupId> <artifactId>spring-core</artifactId> <version>${spring.version}</version> </dependency> <dependency> <groupId>junit</groupId> <artifactId>junit</artifactId> <version>4.11</version> <scope>test</scope> </dependency> <!-- 日志相关 --> <dependency> <groupId>org.slf4j</groupId> <artifactId>slf4j-api</artifactId> <version>1.7.22</version> </dependency> <dependency> <groupId>org.slf4j</groupId> <artifactId>slf4j-log4j12</artifactId> <version>1.7.22</version> </dependency> <dependency> <groupId>log4j</groupId> <artifactId>log4j</artifactId> <version>1.2.17</version> </dependency> <dependency> <groupId>org.apache.commons</groupId> <artifactId>commons-lang3</artifactId> <version>3.3.2</version> </dependency> <dependency> <groupId>commons-io</groupId> <artifactId>commons-io</artifactId> <version>2.4</version> </dependency> <dependency> <groupId>org.json</groupId> <artifactId>json</artifactId> <version>20160810</version> </dependency> <dependency> <groupId>org.apache.xmlgraphics</groupId> <artifactId>batik-ext</artifactId> <version>1.8</version> </dependency> <!-- https://mvnrepository.com/artifact/xml-apis/xml-apis --> <dependency> <groupId>xml-apis</groupId> <artifactId>xml-apis</artifactId> <version>1.4.01</version> </dependency> <!-- https://mvnrepository.com/artifact/cn.edu.hfut.dmic.webcollector/WebCollector --> <dependency> <groupId>cn.edu.hfut.dmic.webcollector</groupId> <artifactId>WebCollector</artifactId> <version>2.32</version> </dependency> <!-- https://mvnrepository.com/artifact/org.apache.commons/com.springsource.org.apache.commons.collections --> <!-- 如果maven无法下载,那就手动去上面这个网址下载吧,这个jar我在不使用maven架构时,没有就报错,使用maven就不报错了,应该是依赖包中已包含--> <!-- <dependency> <groupId>org.apache.commons</groupId> <artifactId>com.springsource.org.apache.commons.collections</artifactId> <version>3.2.0</version> </dependency> --> <dependency> <groupId>commons-codec</groupId> <artifactId>commons-codec</artifactId> <version>1.10</version> </dependency> <dependency> <groupId>commons-dbcp</groupId> <artifactId>commons-dbcp</artifactId> <version>1.4</version> </dependency> <dependency> <groupId>org.apache.httpcomponents</groupId> <artifactId>httpclient</artifactId> <version>4.5.3</version> </dependency> <dependency> <groupId>org.jsoup</groupId> <artifactId>jsoup</artifactId> <version>1.10.1</version> </dependency> <dependency> <groupId>org.apache.httpcomponents</groupId> <artifactId>httpclient-cache</artifactId> <version>4.5.3</version> </dependency> <dependency> <groupId>org.apache.httpcomponents</groupId> <artifactId>httpclient-win</artifactId> <version>4.5.3</version> </dependency> <!-- https://mvnrepository.com/artifact/com.sleepycat/je --> <dependency> <groupId>com.sleepycat</groupId> <artifactId>je</artifactId> <version>5.0.73</version> </dependency> <!-- https://mvnrepository.com/artifact/xerces/xercesImpl --> <dependency> <groupId>xerces</groupId> <artifactId>xercesImpl</artifactId> <version>2.6.0</version> </dependency> <!-- https://mvnrepository.com/artifact/com.googlecode.juniversalchardet/juniversalchardet --> <dependency> <groupId>com.googlecode.juniversalchardet</groupId> <artifactId>juniversalchardet</artifactId> <version>1.0.3</version> </dependency> <dependency> <groupId>mysql</groupId> <artifactId>mysql-connector-java</artifactId> <version>5.1.27</version> </dependency> <!-- https://mvnrepository.com/artifact/net.sourceforge.nekohtml/nekohtml --> <dependency> <groupId>net.sourceforge.nekohtml</groupId> <artifactId>nekohtml</artifactId> <version>1.9.22</version> </dependency> <!-- https://mvnrepository.com/artifact/org.seleniumhq.selenium/selenium-htmlunit-driver --> <dependency> <groupId>org.seleniumhq.selenium</groupId> <artifactId>selenium-htmlunit-driver</artifactId> <version>2.52.0</version> </dependency> <!-- https://mvnrepository.com/artifact/org.w3c.css/sac --> <dependency> <groupId>org.w3c.css</groupId> <artifactId>sac</artifactId> <version>1.3</version> </dependency> <!-- https://mvnrepository.com/artifact/org.seleniumhq.selenium/selenium-remote-driver --> <dependency> <groupId>org.seleniumhq.selenium</groupId> <artifactId>selenium-remote-driver</artifactId> <version>2.52.0</version> </dependency> <!-- https://mvnrepository.com/artifact/org.seleniumhq.selenium/htmlunit-driver --> <dependency> <groupId>org.seleniumhq.selenium</groupId> <artifactId>htmlunit-driver</artifactId> <version>2.24</version> </dependency> <!-- https://mvnrepository.com/artifact/org.seleniumhq.selenium/selenium-java --> <dependency> <groupId>org.seleniumhq.selenium</groupId> <artifactId>selenium-java</artifactId> <version>3.0.1</version> </dependency> </dependencies> <build> <finalName>WebCollector_Maven</finalName> </build> <dependencyManagement> <dependencies> <dependency> <groupId>commons-collections</groupId> <artifactId>commons-collections</artifactId> <version>3.2.2</version> </dependency> </dependencies> </dependencyManagement> </project>
如果没有使用maven项目的话,如果有积分的话那么就用打包下载1去我的CSDN上下载吧,辛辛苦苦想赚2个积分,如果没有积分的朋友可以选择打包下载2(七牛云)进行免费下载。
所有依赖jar打包下载1
所有依赖jar打包下载 2
jar包截图
文章首次发布于个人博客:吾勇士的博客
http://wuyongshi.top/articles/2017/02/22/1487734548379.html
转载请标明出处