TypeError:需要类似字节的对象,而不是python和CSV中的'str'

时间:2022-06-08 18:20:45

TypeError: a bytes-like object is required, not 'str'

TypeError:需要类似字节的对象,而不是'str'

getting above error while Executing below python code to save the HTML table data in Csv file. don't know how to get rideup.pls help me.

执行下面的python代码以保存HTML表格数据在Csv文件中时出现上述错误。不知道如何获得rideup.pls帮助我。

import csv
import requests
from bs4 import BeautifulSoup

url='http://www.mapsofindia.com/districts-india/'
response=requests.get(url)
html=response.content

soup=BeautifulSoup(html,'html.parser')
table=soup.find('table', attrs={'class':'tableizer-table'})
list_of_rows=[]
for row in table.findAll('tr')[1:]:
    list_of_cells=[]
    for cell in row.findAll('td'):
        list_of_cells.append(cell.text)
    list_of_rows.append(list_of_cells)
outfile=open('./immates.csv','wb')
writer=csv.writer(outfile)
writer.writerow(["SNo", "States", "Dist", "Population"])
writer.writerows(list_of_rows)

on above the last line.

在最后一行上方。

3 个解决方案

#1


173  

You are using Python 2 methodology instead of Python 3.

您使用的是Python 2方法而不是Python 3。

Change:

更改:

outfile=open('./immates.csv','wb')

To:

至:

outfile=open('./immates.csv','w')

and you will get a file with the following output:

并且您将获得具有以下输出的文件:

SNo,States,Dist,Population
1,Andhra Pradesh,13,49378776
2,Arunachal Pradesh,16,1382611
3,Assam,27,31169272
4,Bihar,38,103804637
5,Chhattisgarh,19,25540196
6,Goa,2,1457723
7,Gujarat,26,60383628
.....

In Python 3 csv takes the input in text mode, whereas in Python 2 it took it in binary mode.

在Python 3中,csv在文本模式下获取输入,而在Python 2中,它以二进制模式获取。

Edited to Add

编辑添加

Here is the code I ran:

这是我运行的代码:

url='http://www.mapsofindia.com/districts-india/'
html = urllib.request.urlopen(url).read()
soup = BeautifulSoup(html)
table=soup.find('table', attrs={'class':'tableizer-table'})
list_of_rows=[]
for row in table.findAll('tr')[1:]:
    list_of_cells=[]
    for cell in row.findAll('td'):
        list_of_cells.append(cell.text)
    list_of_rows.append(list_of_cells)
outfile = open('./immates.csv','w')
writer=csv.writer(outfile)
writer.writerow(['SNo', 'States', 'Dist', 'Population'])
writer.writerows(list_of_rows)

#2


5  

I had the same issue with Python3. My code was writing into io.BytesIO().

我在Python3上遇到了同样的问题。我的代码是写入io.BytesIO()。

Replacing with io.StringIO() solved.

用io.StringIO()替换解决了。

#3


0  

file = open('parsed_data.txt', 'w')
for link in soup.findAll('a', attrs={'href': re.compile("^http")}): print (link)
soup_link = str(link)
print (soup_link)
file.write(soup_link)
file.flush()
file.close()

In my case, I used BeautifulSoup to write a .txt with Python 3.x. It had the same issue. Just as @tsduteba said, change the 'wb' in the first line to 'w'.

就我而言,我使用BeautifulSoup用Python 3.x编写.txt。它有同样的问题。正如@tsduteba所说,将第一行的'wb'改为'w'。

#1


173  

You are using Python 2 methodology instead of Python 3.

您使用的是Python 2方法而不是Python 3。

Change:

更改:

outfile=open('./immates.csv','wb')

To:

至:

outfile=open('./immates.csv','w')

and you will get a file with the following output:

并且您将获得具有以下输出的文件:

SNo,States,Dist,Population
1,Andhra Pradesh,13,49378776
2,Arunachal Pradesh,16,1382611
3,Assam,27,31169272
4,Bihar,38,103804637
5,Chhattisgarh,19,25540196
6,Goa,2,1457723
7,Gujarat,26,60383628
.....

In Python 3 csv takes the input in text mode, whereas in Python 2 it took it in binary mode.

在Python 3中,csv在文本模式下获取输入,而在Python 2中,它以二进制模式获取。

Edited to Add

编辑添加

Here is the code I ran:

这是我运行的代码:

url='http://www.mapsofindia.com/districts-india/'
html = urllib.request.urlopen(url).read()
soup = BeautifulSoup(html)
table=soup.find('table', attrs={'class':'tableizer-table'})
list_of_rows=[]
for row in table.findAll('tr')[1:]:
    list_of_cells=[]
    for cell in row.findAll('td'):
        list_of_cells.append(cell.text)
    list_of_rows.append(list_of_cells)
outfile = open('./immates.csv','w')
writer=csv.writer(outfile)
writer.writerow(['SNo', 'States', 'Dist', 'Population'])
writer.writerows(list_of_rows)

#2


5  

I had the same issue with Python3. My code was writing into io.BytesIO().

我在Python3上遇到了同样的问题。我的代码是写入io.BytesIO()。

Replacing with io.StringIO() solved.

用io.StringIO()替换解决了。

#3


0  

file = open('parsed_data.txt', 'w')
for link in soup.findAll('a', attrs={'href': re.compile("^http")}): print (link)
soup_link = str(link)
print (soup_link)
file.write(soup_link)
file.flush()
file.close()

In my case, I used BeautifulSoup to write a .txt with Python 3.x. It had the same issue. Just as @tsduteba said, change the 'wb' in the first line to 'w'.

就我而言,我使用BeautifulSoup用Python 3.x编写.txt。它有同样的问题。正如@tsduteba所说,将第一行的'wb'改为'w'。