基于随机森林的个人信用评估模型研究及实证分析

时间:2021-02-15 10:01:40
【文件属性】:

文件名称:基于随机森林的个人信用评估模型研究及实证分析

文件大小:1.54MB

文件格式:PDF

更新时间:2021-02-15 10:01:40

随机森林

Features of Random Forests It is unexcelled in accuracy among current algorithms. It runs efficiently on large data bases. It can handle thousands of input variables without variable deletion. It gives estimates of what variables are important in the classification. It generates an internal unbiased estimate of the generalization error as the forest building progresses. It has an effective method for estimating missing data and maintains accuracy when a large proportion of the data are missing. It has methods for balancing error in class population unbalanced data sets. Generated forests can be saved for future use on other data. Prototypes are computed that give information about the relation between the variables and the classification. It computes proximities between pairs of cases that can be used in clustering, locating outliers, or (by scaling) give interesting views of the data. The capabilities of the above can be extended to unlabeled data, leading to unsupervised clustering, data views and outlier detection. It offers an experimental method for detecting variable interactions.


网友评论