R语言data.table包fread读取数据

时间:2022-01-08 02:35:28

R语言处理大规模数据速度不算快,通过安装其他包比如data.table可以提升读取处理速度。

案例,分别用read.csv和data.table包的fread函数读取一个1.67万行、230列的表格数据。

# 用read.csv读取数据
timestart<-Sys.time()
data <- read.csv("XXXXs.csv",header = T,stringsAsFactors = F)
timeend<-Sys.time()
runningtime<-timeend-timestart
print(runningtime)
# 返回 runningtime 结果: Time difference of 4.451127 secs
timestart<-Sys.time()
data1<-fread("XXXXs.csv",header = T,stringsAsFactors = F)
timeend<-Sys.time()
runningtime<-timeend-timestart
print(runningtime) # 返回 runningtime 结果: Time difference of 0.9460249 secs

参考资料:

R语言data.table速查(博客园-Little_Rookie):https://www.cnblogs.com/nxld/p/6059570.html

https://zhuanlan.zhihu.com/p/22317779?refer=rdatamining

data.table的guideline:                                      https://cran.r-project.org/web/packages/data.table/data.table.pdf