| ▲ | ArcHound 5 days ago |
| Honestly, it makes me a bit sad I am not anywhere on the list at all. Yes, I had only one front page mention ever, the rest of my entries are probably bad and useless, but still. I don't see how and why I wouldn't fall into the dataset, does anybody know please? |
|
| ▲ | simonw 5 days ago | parent | next [-] |
| The methodology is explained here: https://refactoringenglish.com/tools/hn-popularity/methodolo... You won't show up unless your site is listed in this manually curated CSV file: https://github.com/mtlynch/hn-popularity-contest-data/blob/m... |
| |
| ▲ | mtlynch 4 days ago | parent | next [-] | | >You won't show up unless your site is listed in this manually curated CSV Correction: you'll show up even if you're not in the CSV. The CSV just populates metadata for your entry. | | |
| ▲ | simonw 4 days ago | parent [-] | | How do you filter out the non-blog content? I assume you had an allow-list of known personal blogs. | | |
| ▲ | mtlynch 4 days ago | parent [-] | | Everything is default included, and I have a long list of not-blog domains that are excluded.[0] Plus, I exclude the Alexa top 500. There are lots of not-blogs still in the dataset, but I just exclude them when I come across them in popular views. But I'm sure if you dig through positions 101-5000 you'll find lots of domains that don't match my official criteria for a blog. https://github.com/mtlynch/hn-popularity-contest-data/blob/m... | | |
|
| |
| ▲ | ArcHound 5 days ago | parent | prev [-] | | Thank you for the reply, I'll go and make a PR. |
|
|
| ▲ | mtlynch 4 days ago | parent | prev [-] |
| OP here. Sorry for the exclusion! The minimum threshold for inclusion is 500 upvotes across all posts that reached the front page.[0] It looks like your domain currently has 176 total upvotes, so it misses the threshold.[1] I have the minimum because I precompute all the data so that I can serve it on a static site, but it means everyone downloads the full dataset when they visit the site. I make the threshold 500 upvotes so the CSV doesn't grow too large. [0] https://refactoringenglish.com/tools/hn-popularity/methodolo... [1] https://news.ycombinator.com/from?site=miloslavhomer.cz |
| |
| ▲ | ArcHound 4 days ago | parent [-] | | Thanks for the reply, you are right, I missed the threshold on my first read. While I am still sad I can see the reasons for it. Guess I have some posting to do. |
|