| ▲ | embedding-shape 4 hours ago | |
Most inference engines would return the reasoning tokens though, wouldn't you see that the reasoning_content (or whatever your engine calls it) was filled while content wasn't? | ||
| ▲ | GrinningFool 4 hours ago | parent [-] | |
Yeah, I had been ignoring the reasoning tokens for the summarize call | ||