There is one already: https://arxiv.org/abs/2305.07759 https://huggingface.co/datasets/roneneldan/TinyStories
6.5GB of tiny stories, as requested. ;)