Remix.run Logo
causal a day ago

What does it mean to "pool" embeddings? The first article seems to assume the reader is familiar

deepsquirrelnet a day ago | parent [-]

“Pooling” is just aggregation methods. It could mean taking max or average values, or more exotic methods like attention pooling. It’s meant to reduce the one-per-token dimensionality to one per passage or document.