Machine Learning The Art and Science of Algorithms thatMake Sense of Data

时间:2017-08-26 07:05:34
【文件属性】:

文件名称:Machine Learning The Art and Science of Algorithms thatMake Sense of Data

文件大小:1.34MB

文件格式:PDF

更新时间:2017-08-26 07:05:34

Machine Learning

There are a number of useful ways in which we can express the SpamAssassin classifier in mathematical notation. If we denote the result of the i -th test for a given e-mail as xi , where xi = 1 if the test succeeds and 0 otherwise, and we denote the weight of the i -th test as wi , then the total score of an e-mail can be expressed as Pni =1wi xi , making use of the fact that wi contributes to the sum only if xi = 1, i.e., if the test succeeds for the e-mail. Using t for the threshold above which an e-mail is classified as spam (5 in our example), the ‘decision rule’ can be written as Pni =1wi xi > t . Notice that the left-hand side of this inequality is linear in the xi variables, which essentially means that increasing one of the xi by a certain amount, say ±, will change the sum by an amount (wi±) that is independent of the value of xi . This wouldn’t be true if xi appeared squared in the sum, or with any exponent other than 1.


网友评论

  • 多个业内大牛推荐的ML入门书,而且还是文字版,值得下载。
  • Peter Flach同名书的书摘,81页