python去掉csv文件中重复的行

时间：2025-04-08 21:46:42

with open('','r') as in_file, open('','w') as out_file:

    s = set() # set for fast O(1) amortized lookup

    for line in in_file:

        if line in s: continue # skip duplicate

        s.add(line)

        out_file.write(line)

1
2
3
4
5
6
7
8
9
10
11

这个代码有个bug，当重复的列是整齐排列的，那么最后一个重复的元素不会去掉所有重复元素，它会剩下两个.(偶尔会出现这个bug，调了一晚上最后好了，手动狗头)

相关文章

