Perhaps the training data about what compiler diagnostics mean is particularly semantically rich training data.