前提:服务器已经安装好了hadoop_client端即hadoop的环境hbase,hive等相关组件
1.环境和变量配置
①拷贝hive的配置文件hive-site.xml到zeppelin-0.7.2-bin-all/conf下
# cp /root/hadoop-2.6./conf/hive-site.xml /data/hadoop/zeppelin/zeppelin-0.7.-bin-all/conf
②进入conf下进行环境变量的配置
# cd /data/hadoop/zeppelin/zeppelin-0.7.-bin-all/conf
vim zeppelin-env.sh export JAVA_HOME=/usr/java/jdk1..0_111
export HADOOP_CONF_DIR=/root/hadoop-2.6./conf
2.在web界面配置集成hive
访问zeppelin的web界面(http://ip:8080)
右上角anonymous --> interpreter --> +Create新建一个叫做hive的集成环境(也可以在jdbc的基础上修改,不用新建)
编辑hive的相关属性,hive相关的配置可以到hive-site.xml中查看
Properties value
common.max_count
hive.driver org.apache.hive.jdbc.HiveDriver
hive.password hive
hive.url jdbc:hive2://uhadoop-bwgkeu-master2:10000 # 注意是hive2不是hive
hive.user hive
zeppelin.interpreter.localRepo /data/hadoop/zeppelin/zeppelin-0.7.-bin-all/local-repo/2CNPYUV7Z
zeppelin.interpreter.output.limit
zeppelin.jdbc.auth.type
zeppelin.jdbc.concurrent.max_connection
zeppelin.jdbc.concurrent.use true
zeppelin.jdbc.keytab.location
zeppelin.jdbc.principal
填写Dependencies相关属性
2个hive依赖包,一个mysql依赖包,不用mysql可以不用写mysql jdbc包
填好后点保存,这个时候zeppelin会出现卡顿,这是因为上面填的2个Dependencies,zeppelin去指定maven库下载相关依赖包去了,等待时间可能需几分钟。
下载完成可在local-repo目录下查看,依赖包放在local-repo/2CNPYUV7Z/(可能不同)目录下,等完全下载完成,就可查询。
Dependencies artifact exclude
org.apache.hive:hive-jdbc:0.14.
org.apache.hadoop:hadoop-common:2.6.
mysql:mysql-connector-java:5.1.38
# ls local-repo/2CNPYUV7Z/
activation-1.1.jar commons-lang-2.6.jar hive-shims-common-0.14..jar jta-1.1.jar
ant-1.9..jar commons-logging-1.1..jar hive-shims-common-secure-0.14..jar junit-4.11.jar
ant-launcher-1.9..jar commons-math3-3.1..jar htrace-core-3.0..jar libfb303-0.9..jar
antlr-2.7..jar commons-net-3.1.jar httpclient-4.2..jar libthrift-0.9..jar
antlr-runtime-3.4.jar commons-pool-1.5..jar httpcore-4.2..jar log4j-1.2..jar
apacheds-i18n-2.0.-M15.jar curator-client-2.6..jar jackson-core-asl-1.9..jar log4j-1.2..jar
apacheds-kerberos-codec-2.0.-M15.jar curator-framework-2.6..jar jackson-core-asl-1.9..jar mail-1.4..jar
api-asn1-api-1.0.-M20.jar curator-recipes-2.6..jar jackson-jaxrs-1.9..jar mysql-connector-java-5.1..jar
api-util-1.0.-M20.jar datanucleus-api-jdo-3.2..jar jackson-mapper-asl-1.9..jar netty-3.6..Final.jar
asm-3.1.jar datanucleus-core-3.2..jar jackson-mapper-asl-1.9..jar opencsv-2.3.jar
asm-3.2.jar datanucleus-rdbms-3.2..jar jackson-xc-1.9..jar paranamer-2.3.jar
asm-commons-3.1.jar derby-10.10.1.1.jar java-xmlbuilder-0.4.jar protobuf-java-2.5..jar
asm-tree-3.1.jar geronimo-annotation_1.0_spec-1.1..jar jaxb-api-2.2..jar servlet-api-2.5.jar
avro-1.7..jar geronimo-jaspic_1.0_spec-1.0.jar jaxb-impl-2.2.-.jar slf4j-api-1.7..jar
avro-1.7..jar geronimo-jta_1.1_spec-1.1..jar jdo-api-3.0..jar slf4j-log4j12-1.7..jar
bonecp-0.8..RELEASE.jar gson-2.2..jar jersey-core-1.9.jar snappy-java-1.0.4.1.jar
commons-beanutils-1.7..jar guava-11.0..jar jersey-json-1.9.jar snappy-java-1.0..jar
commons-beanutils-core-1.8..jar hadoop-annotations-2.6..jar jersey-server-1.9.jar stax-api-1.0-.jar
commons-cli-1.2.jar hadoop-auth-2.6..jar jets3t-0.9..jar stringtemplate-3.2..jar
commons-codec-1.4.jar hadoop-common-2.6..jar jettison-1.1.jar tools.jar
commons-collections-3.2..jar hamcrest-core-1.3.jar jetty-6.1..jar xmlenc-0.52.jar
commons-compress-1.4..jar hive-common-0.14..jar jetty-all-7.6..v20120127.jar xz-1.0.jar
commons-configuration-1.6.jar hive-jdbc-0.14..jar jetty-util-6.1..jar zookeeper-3.4..jar
commons-dbcp-1.4.jar hive-metastore-0.14..jar jline-0.9..jar zookeeper-3.4..jar
commons-digester-1.8.jar hive-serde-0.14..jar jpam-1.1.jar
commons-httpclient-3.1.jar hive-service-0.14..jar jsch-0.1..jar
commons-io-2.4.jar hive-shims-0.14..jar jsr305-1.3..jar
如果有问题,可以查看日志查找原因,日志路径:/data/hadoop/zeppelin/zeppelin-0.7.2-bin-all/logs/zeppelin-interpreter-jdbc-root-u04rdp01.yaya.corp.log
3.新建一个作业进行测试
notebook --> Create new node填写名称,选择hive即可
输入查询语句,注意查询语句前需要有前缀(hive):
(hive)
select * from tbl_push_user_req limit ;
柱状图展示
饼状图形展示