Version information:
Python 2.7.12
lxml 3.8.0
from lxml import etree

html_str = """
<div id="box1">this from blog.csdn.net/lncxydjq , DO NOT COPY!
<div id="box2">*****
<!--can u get me, bitch?-->
</div>
</div>
"""
html = etree.HTML(html_str)
# node() returns every child node of the inner div, in document order:
# a text node, the comment, then another text node. Index 1 is the comment.
comment = html.xpath('//div[@id="box1"]/div/node()')[1]
print comment
print type(comment)
print comment.text
"""output:
<!--can u get me, bitch?-->
<type 'lxml.etree._Comment'>
can u get me, bitch?
"""
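The example above relies on the comment being at index 1 of the `node()` result, which breaks as soon as the markup changes. As a minimal sketch of an alternative, the XPath `comment()` node test selects only comment nodes, so no positional index is needed. The HTML string, the `box2` lookup, and the variable names below are made up for illustration; the single-argument `print(...)` form is used so the snippet should behave the same under Python 2.7 and Python 3.

```python
from lxml import etree

# Hypothetical sample markup with two comments inside the inner div.
html_str = """
<div id="box1">outer text
<div id="box2">*****
<!-- first comment -->
<span>inner</span>
<!-- second comment -->
</div>
</div>
"""

root = etree.HTML(html_str)

# comment() matches comment nodes only, skipping text and element children.
comments = root.xpath('//div[@id="box2"]/comment()')
comment_texts = [c.text.strip() for c in comments]

# Equivalent filter over node(): comment nodes have tag set to etree.Comment,
# while text results have no tag attribute at all.
nodes = root.xpath('//div[@id="box2"]/node()')
filtered = [n.text.strip() for n in nodes
            if getattr(n, "tag", None) is etree.Comment]

print(comment_texts)
```

Using `comment()` keeps the query stable when extra text or elements are added around the comment, whereas the `node()` filter is handy when the other children are needed in the same pass.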
That is all there is to retrieving HTML comments from a page with XPath in Python; hopefully it serves as a useful reference.
Original link: https://blog.csdn.net/lncxydjq/article/details/77880824