The author is known for deep dives on data sets like that (I'm following him on Linkedin for that), so makes sense they always mention their setup even if it doesn't apply to his specific data set.