Remix.run Logo
aetherspawn 4 hours ago

Can you use the smaller Gemma 4B model as speculative decoding for the larger 31B model?

Why/why not?

MeetRickAI 4 hours ago | parent [-]

[dead]