Remix.run Logo
gn_central 8 hours ago

Curious if this similarity comes more from the training data or the model architecture itself. Did they look into that?

OtherShrezzing 8 hours ago | parent [-]

They describe that both are important, and researched in the paper, within the opening paragraph.