with open sourced models getting more popular (and how ideology fixation is growing in both US and China), this type of work is very much appreciated.
is there some benchmark?