| ▲ | lostmsu 16 hours ago | |
> ~10^14 tokens on the internet Does that include image tokens? My bet is with image tokens you are off by at least 5 orders of magnitude for both. | ||
| ▲ | scotty79 9 hours ago | parent [-] | |
Images are not that big. Each text token is a multidimensional vector. There were recent observations that rendering the text as an image and ingesting the image might actually be more efficient than using text embedding. | ||