▲ | niemandhier 2 days ago | |||||||||||||||||||||||||
Wow, I think I might just have grasped one of the sources of the problems we keep seeing with LLMs. Johnson-Lichtenstrauss guarantees a distance preserving embedding for a finite set of points into a space with a dimension based on the number of points. It does not say anything about preserving the underlying topology of the contious high dimensional manifold, that would be Takens/Whitney-style embedding results (and Sauer–Yorke for attractors). The embedding dimensions needed to fulfil Takens are related to the original manifolds dimension and not the number of points. It’s quite probable that we observe violations of topological features of the original manifold, when using our to low dimensional embedded version to interpolate. I used AI to sort the hodge pudge of math in my head into something another human could understand, edited result is below: === AI in use === If you want to resolve an attractor down to a spatial scale rho, you need about n ≈ C * rho^(-d_B) sample points (here d_B is the box-counting/fractal dimension). The Johnson–Lindenstrauss (JL) lemma says that to preserve all pairwise distances among n points within a factor 1±ε, you need a target dimension k ≳ (d_B / ε^2) * log(C / rho). So as you ask for finer resolution (rho → 0), the required k must grow. If you keep k fixed (i.e., you embed into a dimension that’s too low), there is a smallest resolvable scale rho* (roughly rho* ≳ C * exp(-(ε^2/d_B) * k), up to constants), below which you can’t keep all distances separated: points that are far on the true attractor will show up close after projection. That’s called “folding” and might be the source of some of the problems we observe . === AI end === Bottom line: JL protects distance geometry for a finite sample at a chosen resolution; if you push the resolution finer without increasing k, collisions are inevitable. This is perfectly consistent with the embedding theorems for dynamical systems, which require higher dimensions to get a globally one-to-one (no-folds) representation of the entire attractor. If someone is bored and would like to discuss this, feel free to email me. | ||||||||||||||||||||||||||
▲ | sdl 2 days ago | parent [-] | |||||||||||||||||||||||||
So basically the map projection problem [1] in higher dimensions? | ||||||||||||||||||||||||||
|