Your tests are either hermetic, or they're flaky.
That means the test environment needs to be defined and versioned with the code.