本文实例讲述了Python判断文本中消息重复次数的方法。分享给大家供大家参考,具体如下:
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
|
#coding:gbk
'''
Created on 2012-2-3
从文件中读取文本,并判断文本中形如“message0”、“message123”这样的消息有多少条是重复的
@author: Administrator
'''
import re
if __name__ = = '__main__' :
pattern = u "(message((\d)+))"
prog = re. compile (pattern)
# read text from file
f = open ( "1003.txt" , "r" )
text = unicode (f.read())
f.close()
result = prog.findall(text)
message_map = dict ()
redupicate_count = 0
for message in result:
if message_map.has_key(message[ 0 ]) = = True :
print message[ 0 ], "is reduplicate"
redupicate_count + = 1
else :
message_map[message[ 0 ]] = 1 ;
print "total reduplicate message is " , redupicate_count
|
希望本文所述对大家Python程序设计有所帮助。