| ▲ | Ask HN: Best Embedding Models? | |||||||
| 15 points by devstein a day ago | 15 comments | ||||||||
Hey HN, which embedding models are people using? There has been so much development around foundational LLMs, but I haven't seen much news about embedding models. | ||||||||
| ▲ | PhilippGille 19 hours ago | parent | next [-] | |||||||
Benchmarks only paint part of the picture, but it's still a decent place to start looking into recent models: | ||||||||
| ▲ | stevenfazzio 9 hours ago | parent | prev | next [-] | |||||||
Cohere's embed-v4.0 is my daily driver as far as a high-performance model is concerned. I do a lot of cluster analysis and data visualization and I like that there's an `input_type="clustering"` mode in addition to the standard search modes (`input_type="search_document"` and `input_type="search_query"`). For a fast, open, and local model, I've found it hard to beat https://huggingface.co/sentence-transformers/all-MiniLM-L6-v... | ||||||||
| ▲ | rapatel0 21 hours ago | parent | prev | next [-] | |||||||
I've liked qwen and embeddinggemma for local search. Qwen because 32K is enough to basically fit a whole page into the context window and embeddinggemma because it's crazy efficient. | ||||||||
| ▲ | sp1982 8 hours ago | parent | prev | next [-] | |||||||
I am using OpenAI's small embedding model with custom compression. It is super cheap. You can read more at https://corvi.careers/blog/vector-search-embedding-compressi... | ||||||||
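The linked post is truncated here, so the commenter's actual compression scheme is unknown. As a rough illustration of what embedding compression can look like in general, here is a minimal sketch of per-vector int8 scalar quantization, a common generic technique that is not necessarily what the post describes. The function names `quantize_int8` and `dequantize` are made up for this example.

```python
def quantize_int8(vec):
    """Map floats to integer codes in [-127, 127] using one scale per vector.

    Assumption: generic scalar quantization, not the method from the linked post.
    """
    # Fall back to a scale of 1.0 for an all-zero vector to avoid dividing by zero.
    scale = max(abs(x) for x in vec) / 127 or 1.0
    return [round(x / scale) for x in vec], scale


def dequantize(codes, scale):
    """Recover approximate floats from the integer codes."""
    return [c * scale for c in codes]


vec = [0.5, -1.0, 0.25]
codes, scale = quantize_int8(vec)
approx = dequantize(codes, scale)
```

Each component then fits in one byte instead of four (float32), and the per-component rounding error is at most half of `scale`.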
| ▲ | pstorm 10 hours ago | parent | prev | next [-] | |||||||
Just FYI, for RAG/similarity search, adding a reranker was a much bigger payoff than switching embedding models. | ||||||||
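For readers unfamiliar with the pattern, a minimal sketch of retrieve-then-rerank follows: the embedding model picks a short candidate list cheaply, and a second scorer reorders only those candidates. Everything here is an assumption for illustration: the vectors are hand-written toy values, and `toy_rerank_score` is a hypothetical word-overlap stand-in for a real reranker (typically a cross-encoder model), not anyone's production setup.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def toy_rerank_score(query, doc):
    """Hypothetical stand-in for a cross-encoder: share of query words present in the doc."""
    q = set(query.lower().split())
    d = set(doc.lower().split())
    return len(q & d) / len(q) if q else 0.0


def retrieve_then_rerank(query, query_vec, docs, doc_vecs, rerank_score, k=2, n=1):
    """Stage 1: take the top-k docs by embedding similarity.
    Stage 2: reorder those k with rerank_score and return the best n."""
    by_embedding = sorted(
        range(len(docs)), key=lambda i: cosine(query_vec, doc_vecs[i]), reverse=True
    )
    candidates = by_embedding[:k]
    reranked = sorted(candidates, key=lambda i: rerank_score(query, docs[i]), reverse=True)
    return [docs[i] for i in reranked[:n]]


# Toy data: the embedding stage slightly prefers the "policy" doc,
# and the rerank stage promotes the doc that matches the query wording.
docs = ["how to reset a password", "password policy for admins", "resetting your router"]
doc_vecs = [[0.9, 0.1], [0.95, 0.05], [0.2, 0.8]]
top = retrieve_then_rerank("reset password", [1.0, 0.0], docs, doc_vecs, toy_rerank_score)
```

The point of the two stages is cost: the reranker sees only `k` candidates, so it can afford to be much slower per pair than the embedding lookup.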
| ▲ | emschwartz 13 hours ago | parent | prev | next [-] | |||||||
I’ve been using MixedBread, which is a pretty old model at this point. Recently, I tried comparing it to some newer models and was disappointed that the results weren’t dramatically and uniformly better. You probably can’t go wrong if you pick a recent one that scores decently well on benchmarks and is at the right price point (or memory requirement) for whatever you’re trying to do. | ||||||||
| ▲ | LogicCraft678 14 hours ago | parent | prev | next [-] | |||||||
Feels like embeddings are underrated compared to the LLM hype, but they're doing great. | ||||||||
| ▲ | preetsojitra 9 hours ago | parent | prev | next [-] | |||||||
Meta's Perception Encoder Audio-Visual. It's CLIP-like but has three modalities: audio, video, and text. | ||||||||
| ▲ | didgeoridoo 15 hours ago | parent | prev | next [-] | |||||||
I’m partial to jina.ai — they have open models for code and prose, all easily runnable locally. | ||||||||
| ▲ | jayshah5696 20 hours ago | parent | prev | next [-] | |||||||
Embeddings are easy to fine-tune. Try ModernBERT. | ||||||||
| ▲ | Yogeshshirsath 12 hours ago | parent | prev | next [-] | |||||||
E5 (Microsoft) | ||||||||
| ▲ | halvorbuilds 13 hours ago | parent | prev | next [-] | |||||||
gemma4 | ||||||||
| ▲ | frederickabrah 14 hours ago | parent | prev [-] | |||||||
who knows a tool for rug check in crypto | ||||||||