This definitely needs filters that apply to all of the data, so you can hide "closed" models, and even models that are not LLMs.
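
As a rough sketch of what such a filter could look like: the `ModelEntry` record, its `is_open` and `kind` fields, and the `filter_models` helper are all hypothetical names made up for illustration, not taken from the actual data or tool.

```python
from dataclasses import dataclass


@dataclass
class ModelEntry:
    name: str
    is_open: bool  # hypothetical flag: False for "closed" models
    kind: str      # hypothetical category, e.g. "llm" or "vision"


def filter_models(entries, hide_closed=False, llm_only=False):
    """Return the entries that pass the enabled filters."""
    kept = []
    for entry in entries:
        if hide_closed and not entry.is_open:
            continue  # drop closed models when asked to
        if llm_only and entry.kind != "llm":
            continue  # drop anything that is not an LLM
        kept.append(entry)
    return kept


# Placeholder entries, not real models
entries = [
    ModelEntry("open-llm", True, "llm"),
    ModelEntry("closed-llm", False, "llm"),
    ModelEntry("open-vision", True, "vision"),
]

visible = filter_models(entries, hide_closed=True, llm_only=True)
print([e.name for e in visible])
```

Both filters default to off, so the unfiltered view stays the same and each one can be toggled independently.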