Do you know how the approach works when you have more than 255 tests?
I haven’t tried or investigated that yet. Running tests in parallel does bring its own perils. Imho the preferable approach is to run only as many tests in parallel as the host can handle, and to allocate each test only a share of the system resources so that tests don’t influence each other. In practice, each integration test definition would specify how many CPU cores and how much RAM the test may use, and a central coordinator would start a new test only when enough resources are free. If there are already any efforts related to this, feel free to bring them up.
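
To make the coordinator idea a bit more concrete, here is a minimal sketch in Python. It is not taken from any existing tool. The names `IntegrationTest`, `ResourceCoordinator`, `reserve` and `run_all` are made up for illustration, and the CPU/RAM numbers are arbitrary. It only does admission control (a test waits until its declared budget is free). It does not enforce that a test stays within that budget, which would need something like cgroups or containers on top.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor
from contextlib import contextmanager
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class IntegrationTest:
    """Hypothetical test definition carrying its declared resource budget."""
    name: str
    cpus: int
    ram_mb: int
    run: Callable[[], None]


class ResourceCoordinator:
    """Hypothetical central coordinator: admits a test only if its budget fits."""

    def __init__(self, total_cpus: int, total_ram_mb: int) -> None:
        self.total_cpus = total_cpus
        self.total_ram_mb = total_ram_mb
        self.free_cpus = total_cpus
        self.free_ram_mb = total_ram_mb
        self.peak_cpus = 0      # highest number of CPUs reserved at once
        self.peak_ram_mb = 0    # highest amount of RAM reserved at once
        self._cond = threading.Condition()

    @contextmanager
    def reserve(self, cpus: int, ram_mb: int):
        # A request larger than the host could never be satisfied; fail early
        # instead of blocking forever.
        if cpus > self.total_cpus or ram_mb > self.total_ram_mb:
            raise ValueError("request exceeds host capacity")
        with self._cond:
            # Block until both the CPU and the RAM budget are available.
            self._cond.wait_for(
                lambda: cpus <= self.free_cpus and ram_mb <= self.free_ram_mb
            )
            self.free_cpus -= cpus
            self.free_ram_mb -= ram_mb
            self.peak_cpus = max(self.peak_cpus, self.total_cpus - self.free_cpus)
            self.peak_ram_mb = max(
                self.peak_ram_mb, self.total_ram_mb - self.free_ram_mb
            )
        try:
            yield
        finally:
            with self._cond:
                # Hand the budget back and wake up any waiting tests.
                self.free_cpus += cpus
                self.free_ram_mb += ram_mb
                self._cond.notify_all()


def run_all(tests: List[IntegrationTest], coordinator: ResourceCoordinator) -> List[str]:
    """Run every test in its own thread, gated by the coordinator."""

    def worker(test: IntegrationTest) -> str:
        with coordinator.reserve(test.cpus, test.ram_mb):
            test.run()
        return test.name

    with ThreadPoolExecutor(max_workers=max(1, len(tests))) as pool:
        return list(pool.map(worker, tests))


# Usage: six tests that each declare 2 cores and 3000 MB on a host with
# 4 cores and 8192 MB, so at most two of them can run at the same time.
coordinator = ResourceCoordinator(total_cpus=4, total_ram_mb=8192)
tests = [
    IntegrationTest(f"test_{i}", cpus=2, ram_mb=3000, run=lambda: time.sleep(0.01))
    for i in range(6)
]
finished = run_all(tests, coordinator)
```

One thread per test is wasteful once there are hundreds of tests. A real coordinator would more likely keep a queue and spawn a process only after the reservation succeeds, but the admission logic would stay the same.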