Python使用正则匹配实现抓图代码分享

内涵：正则匹配，正则替换，页面抓取，图片保存。

实用的第一次 Python 代码参考

				?

									#!/usr/bin/env python

									import urllib

									import re

									x=0

									def getHtml(url):

									 page = urllib.urlopen(url)

									 html = page.read()

									 return html

									def getImg(html):

									 global x

									 reg = 'alt=".+?" src="(.+?\.jpg)"'

									 imgre = re.compile(reg)

									 imglist = re.findall(imgre,html)

									 for imgurl in imglist:

									  urllib.urlretrieve(re.sub(r',\d+,\d+',',800,450',imgurl),"img/%s.jpg" % x)

									  print "\n"+re.sub(r',\d+,\d+',',800,450',imgurl)+"========"+"img/%s.jpg" % x

									  x+=1

									print 'Starting...'

									pages = range(1,9)

									for p in pages:

									 html = getHtml('http://m.lovebizhi.com/category/7655/%d/' % p)

									 print "\n-------------------------page:%d-------------------------------" % p

									 getImg(html)

									print "\nDone!"

以上所述就是本文给大家分享的全部代码了，本人Python菜鸟，第一个作品，希望对大家能有所帮助。

秒客网

Python使用正则匹配实现抓图代码分享

相关文章