▲ | jacobobryant 3 days ago | |||||||
I've been working on this kind of thing over the past several years (for a while full time as an attempted entrepreneur, now on the side for the past couple years). The latest iteration is https://yakread.com -- hit "take a look around" and you can see the "home page"/a list of recommendations without signing up. The recommendations are personalized, i.e. the probability you'll see any particular post depends on your individual interactions with past posts, if you've signed up. (it does collaborative filtering with spark mllib). So that may be a bit different from what you had in mind, since your comment sounds more like an unpersonalized system, but with some extra exploration thrown in. However in practice I suspect the biggest thing the collaborative filtering is doing at Yakread's current scale (not much) is learning which items are good/bad in general. I also do have some methods baked in for doing exploration. "Epsilon greedy" is a common simple approach where x% of the recommendations are purely random. I do a bit more of a linear thing where I rank all the posts by how many times they've been recommended, then I pick a percentage 0 - 100, then I throw out the top x% most popular (previously recommended) items. that also gives you some flexibility to try out different distributions for the x% variable. The source is at https://github.com/jacobobryant/yakread | ||||||||
▲ | schrodinger 3 days ago | parent | next [-] | |||||||
Thank you so much! "Epsilon greedy" sounds like a great approach for the general idea I had in mind — I only glanced it but will read it more deeply. I'll definitely try out your product, but I have to say — an enter your email box is surprisingly high-friction and if you weren't a considerate person I'd met on Hacker News I'd probably close the tab when I saw that. I'll try it out and see if there's a particular reason why you need to capture an email address so early on, but I'd bet if you simplified it you'd get more traffic! | ||||||||
| ||||||||
▲ | sydbarrett74 2 days ago | parent | prev [-] | |||||||
Interesting project! I also appreciate being introduced to the digital public infra initiative. |