Does spam filtering really need a better model? My impression is that the whole game is based on having the best and freshest user-contributed labels.

▲

drob518 2 days ago | parent | next [-]

He said it’s a benchmark.

▲

hrmtst93837 2 days ago | parent | prev [-]

Better models help on the day the spam mutates, before you have fresh labels for the new scam and before spammers can infer from a few test runs which phrasing still slips through. If you need labels for each pivot you're letting them experiment on your users.

▲

jeffbee 2 days ago | parent [-]

In my experience the contents of the message are all but totally irrelevant to the classification, and it is the behavior of the mailing peer that gives all the relevant features.

	▲	mh- a day ago \| parent [-]
		Based on how much blatant gmail->gmail spam I receive, the gmail team agrees with this strategy.