▲ | PaulHoule 5 days ago | |||||||||||||||||||||||||||||||||||||||||||||||||
Re: “HN is very fickle“ I have a model that, given a headline, predicts if the story will get >10 votes. It’s a terrible model, for a few reasons. The most fundamental is that if the same article is submitted 10 times it could get wildly different scores, that’s the way it does. The tail end of the model [1] is logistic regression because it deals gracefully with this kind of situation. I wish I knew how to treat this as a regression problem (predict the score), there is probably a better loss function than what I use, but when I treat it at as a regression problem I get an even worse model. The highest score this model ever gives is 70% for something like “Richard Stallman is dead” I have another model that predicts If the comment/score ratio > 0.5 which is about the average for the site. This is a much better model, close to the first recommender models I made. Trained on articles with score > 10 the input is less noisy for one thing. It’s how a learned y’all like to talk about cars. [1] what attention folks call the “head” | ||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | mooreds 5 days ago | parent | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
> I have a model that, given a headline, predicts if the story will get >10 votes. Do you incorporate post time into this model? This is pure anecdata but I've found that certain posting times lead to more upvotes for what feel like the same type of stories. | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | hdvr 5 days ago | parent | prev | next [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
It seems predicting the score directly (regression) is almost impossible without considering the associated domain. E.g. headlines with the letters GPT in it from openai.com, get an order of magnitude more votes than similar headlines from other sites. | ||||||||||||||||||||||||||||||||||||||||||||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||
▲ | cantor_S_drug 5 days ago | parent | prev [-] | |||||||||||||||||||||||||||||||||||||||||||||||||
[flagged] |