I've actually done this, and it's very fun.
My main test dataset is the Met's 470,000 records, which contain about 33,000 unique date values. Fortunately, the records also include machine-readable dates I can validate against.
https://github.com/kjrocker/epochal
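
For anyone curious what that validation loop looks like, here is a minimal sketch in Python. It is not epochal's API or its actual test harness. `validate`, `validate_csv`, and `toy_parse` are hypothetical names made up for illustration, and I'm assuming the Met CSV export has a free-text `Object Date` column plus integer `Object Begin Date` and `Object End Date` year columns. The parser under test is passed in as a plain function, so any implementation can be swapped in.

```python
import csv
import re
from typing import Callable, Iterable, Optional, Tuple

YearRange = Tuple[int, int]


def validate(rows: Iterable[dict], parse: Callable[[str], Optional[YearRange]]):
    """Check a parser against machine-readable years, once per unique date string.

    Column names are assumptions about the Met CSV export.
    Returns (number of matches, list of (text, expected, actual) mismatches).
    """
    seen = set()
    matched = 0
    failures = []
    for row in rows:
        text = (row.get("Object Date") or "").strip()
        if not text or text in seen:
            continue  # skip blanks and date strings we already checked
        try:
            expected = (int(row["Object Begin Date"]), int(row["Object End Date"]))
        except (KeyError, ValueError):
            continue  # no usable machine-readable range on this record
        seen.add(text)
        actual = parse(text)
        if actual == expected:
            matched += 1
        else:
            failures.append((text, expected, actual))
    return matched, failures


def validate_csv(path: str, parse: Callable[[str], Optional[YearRange]]):
    """Run validate() over a CSV file; utf-8-sig tolerates a leading BOM."""
    with open(path, newline="", encoding="utf-8-sig") as f:
        return validate(csv.DictReader(f), parse)


def toy_parse(text: str) -> Optional[YearRange]:
    """Hypothetical stand-in parser: handles only 'YYYY' and 'YYYY-YYYY'."""
    m = re.fullmatch(r"(\d{4})(?:\s*[-–]\s*(\d{4}))?", text)
    if not m:
        return None
    start = int(m.group(1))
    end = int(m.group(2)) if m.group(2) else start
    return (start, end)
```

Deduplicating on the date string is what shrinks the work from every record down to the unique values, and the mismatch list doubles as a to-do list of formats the parser doesn't handle yet. One caveat of this sketch: it only checks each string against the first record that carries it, so it won't notice if the same text maps to different year ranges elsewhere in the data.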