| ▲ | EGreg 4 hours ago | |
Why does ChatGPT slow down so much when the conversations get long, while Claude does compaction? My best guess is -- ChatGPT is running something in your browser to try to determine the best things to send down to the model API –- when it should have been running quantized models on its own server. | ||