| ▲ | Show HN: Similarity = cosine(your_GitHub_stars, Karpathy) Client-side(puzer.github.io) | |
| 3 points by puzer a day ago | 1 comments | ||
GitHub profile analysis - Build your embedding from your Stars - Compare and discover popular people with similar interests and share yours - Generate a Skill Radar - Recommend repositories you might like | ||
| ▲ | puzer a day ago | parent [-] | |
TL;DR - The Idea: People use GitHub Stars as bookmarks. This is an excellent signal for understanding which repositories are semantically similar. - The Data: Processed ~1TB of raw data from GitHub Archive (BigQuery) to build an interest matrix of 4 million developers. - The ML: Trained embeddings for 300k+ repositories using Metric Learning (EmbeddingBag + MultiSimilarityLoss). - The Frontend: Built a client-only demo that runs vector search (KNN) directly in the browser via WASM, with no backend involved. - The Result: The system finds non-obvious library alternatives and allows for semantic comparison of developer profiles. | ||