Cynddl | 5 days ago
Hi, author here! We first took an existing dataset of conversations between a human and an AI chatbot and rewrote each chatbot answer to make it warmer and more empathetic. We then fed all these snippets of conversations to a series of LLMs, using a technique called fine-tuning, which trains each LLM a second time to maximise the probability of outputting similar text.
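
To make "maximise the probability of outputting similar texts" concrete, here is a toy sketch of that objective. It is not the setup from the study: the "model" is a hypothetical four-word vocabulary with one logit per word, the `warm_replies` data is made up, and `fine_tune` is an illustrative helper that does plain gradient ascent on the average log-likelihood. Real fine-tuning applies the same idea to a full neural network, conditioned on the conversation so far.

```python
import math

def softmax(logits):
    """Turn per-token logits into a probability distribution."""
    m = max(logits.values())
    exps = {tok: math.exp(v - m) for tok, v in logits.items()}
    total = sum(exps.values())
    return {tok: e / total for tok, e in exps.items()}

def avg_log_likelihood(logits, tokens):
    """Average log-probability the model assigns to the given tokens."""
    probs = softmax(logits)
    return sum(math.log(probs[t]) for t in tokens) / len(tokens)

def fine_tune(logits, tokens, lr=0.5, steps=200):
    """Gradient ascent on the average log-likelihood of `tokens`.

    For a softmax over logits, the gradient with respect to each logit is
    (empirical frequency of the token) - (model probability of the token).
    """
    logits = dict(logits)  # leave the "pretrained" weights untouched
    counts = {tok: 0 for tok in logits}
    for t in tokens:
        counts[t] += 1
    n = len(tokens)
    for _ in range(steps):
        probs = softmax(logits)
        for tok in logits:
            logits[tok] += lr * (counts[tok] / n - probs[tok])
    return logits

# Hypothetical "pretrained" model: every word is equally likely.
pretrained = {"sorry": 0.0, "glad": 0.0, "error": 0.0, "understand": 0.0}

# Made-up stand-in for the rewritten, more empathetic chatbot answers.
warm_replies = "sorry understand glad understand sorry glad".split()

tuned = fine_tune(pretrained, warm_replies)
before = avg_log_likelihood(pretrained, warm_replies)
after = avg_log_likelihood(tuned, warm_replies)
print(f"avg log-likelihood before: {before:.3f}, after: {after:.3f}")
```

After the second round of training, the toy model assigns higher probability to the words that appear in the warm replies and lower probability to the one that never does, which is the sense in which fine-tuning shifts a model toward the style of its training snippets.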