Q. 69

Question

Many email filters can be trained how to recognize spam by having a user identify spam messages from a lineup. Suppose you have looked at 12 messages with the word “opportunity” in the subject line, and identified 8 of them to the filter as spam. Now you get a new message with that word in the subject line, and your filter must compute the probability p that the message is spam. One way to do that is to identify p as the expected value of the Beta distribution 

f(x)=6435x8(1-x)4

The expected value of this distribution is 01xf(x)dx

Evaluate the probability that the next message you receive with the word “opportunity” in the subject line will be spam 

Step-by-Step Solution

Verified
Answer

The probability that the next message you received with "opportunity"is 914s

1Step 1. Given information

The given beta distribution f(x)=6435x8(1-x)4

2Step 2. Evaluate the probability of given function

The expected value of the distribution is,

01xf(x)dx=01x.6435x8(1-x)4dx                =643501x9(1-x)4dx                =643501x9(1-4x+6x2-4x3+x4)dx               =643501(x9-4x10+6x11-4x12+x13)dx               =6435x1010-4x1111+6x1212-4x1313+x141401               =6435110-411+612-413+114               =914

Thus,the probability that the next message you received with "opportunity"is 914s