▲ | oliveiracwb 2 days ago | |
I’m going to give my opinion on LLM architecture using transformers to translate this text: I believe there should be two layers, or even the use of a MoE, to set the tone. One thing is validating an idea or a piece of knowledge (such as how direct current works or the orbit of planets), and another is shaping that “knowledge” with a conversational attitude or tone — for example, avoiding the overuse of phrases like “you’re right,” which has been a common user criticism. |