Not the original poster but there are some large publicly available dataset such as
https://huggingface.co/datasets/allenai/WildChat
and
https://huggingface.co/datasets/lmsys/lmsys-chat-1m