It depends on how the data are distributed.
I wouldn't be too surprised if that 5% all came from a few particular bad machines.
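
One quick way to check is to break the bad cases down by machine. Here is a rough sketch, assuming the 5% refers to failed records and that each record carries a machine ID (neither is stated above). The function name `failure_share_by_machine` and the sample data are made up for illustration.

```python
from collections import Counter

def failure_share_by_machine(records):
    """Return {machine_id: share of all failures}, largest share first.

    `records` is an iterable of (machine_id, failed) pairs; this layout
    is an assumption, not something taken from the original data.
    """
    failures = Counter(machine for machine, failed in records if failed)
    total = sum(failures.values())
    if total == 0:
        return {}
    return {machine: count / total for machine, count in failures.most_common()}

# Made-up sample: 100 records, 5 of them failures, spread over 4 machines.
records = (
    [("m1", False)] * 45
    + [("m2", False)] * 40
    + [("m3", True)] * 4 + [("m3", False)] * 6
    + [("m4", True)] * 1 + [("m4", False)] * 4
)

shares = failure_share_by_machine(records)
for machine, share in shares.items():
    print(f"{machine}: {share:.0%} of failures")
```

If one or two machines account for most of the failures, the overall 5% figure says little about the other machines.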