| ▲ | electrotype 4 hours ago | |||||||
What about: "Read this document online : [URL]" and you add your text/context to an online document? Would that reduce the number of tokens used too? | ||||||||
| ▲ | mrbnprck 3 hours ago | parent [-] | |||||||
Documents are processed as tokens as well, unless its bitmap is ocr'd. Images tho are natively compatible with Multi-Modal LLMs, so theres no image->text translation layer in between. It's that the unit of cost is different (e.g. "visual token" vs text token) | ||||||||
| ||||||||