Speedup regr_test.py by running test cases concurrently #10714

AlexWaygood merged 7 commits into python:main

Conversation
Example of a successful CI run: https://github.com/python/typeshed/actions/runs/6199279959/job/16831483816

Example of a failing CI run: https://github.com/python/typeshed/actions/runs/6199348575/job/16831709368?pr=10714
Here's an alternative approach that uses It seems to work fine, without race conditions, and is a similar approach to what
Any specific reason to use processes rather than threads here? Seems like the workers are mostly waiting on external processes, so the GIL shouldn't be much of an issue. I tend to avoid multiprocessing as it's more prone to failing in weird ways than threads are.
I initially tried a ThreadPoolExecutor, but quickly ran into weird race conditions where some of the mypy subprocesses couldn't find various stdlib modules. But I'll take another look and see if I can make threads work here. I agree they make more sense for this kind of thing. |
I couldn't reproduce the race conditions when I tried again, so I switched to a `ThreadPoolExecutor`.

(Note that although we have the infrastructure set up to run test cases on stubs that have non-types dependencies, we don't yet have any test cases for any stubs with non-types dependencies. So there are some bits of this script that are currently "dead code, lying in wait". Those bits of the script might be more susceptible to #9537-type issues than the bits that are currently being used.)
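The thread-based approach can be sketched like this (illustrative code only, not the actual `regr_test.py` implementation): `subprocess.run()` blocks with the GIL released while it waits on the child process, so a `ThreadPoolExecutor` overlaps the external work just as well as processes would, without the pickling overhead of `ProcessPoolExecutor`.

```python
import subprocess
import sys
from concurrent.futures import ThreadPoolExecutor

def run_case(code: str) -> subprocess.CompletedProcess:
    # Each worker thread blocks inside subprocess.run() with the GIL
    # released, so the child interpreters genuinely run in parallel.
    return subprocess.run(
        [sys.executable, "-c", code], capture_output=True, text=True
    )

# Stand-ins for the real test-case subprocess invocations
cases = [f"print({n} * {n})" for n in range(5)]

with ThreadPoolExecutor(max_workers=4) as executor:
    # executor.map preserves input order, unlike as_completed()
    results = list(executor.map(run_case, cases))

outputs = [result.stdout.strip() for result in results]
```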
Thanks @JelleZijlstra! |
```python
test_case_dir: Path
tempdir: Path

def print_description(self, *, verbosity: Verbosity) -> None:
```
@AlexWaygood After this refactor, the `verbosity` argument of `Result.print_description` has been left unused. Was that intentional?
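For illustration, here is one way the `verbosity` argument could be wired in. All names below (`Verbosity` as an `IntEnum`, the `Result` fields, the level names) are hypothetical stand-ins, not the actual typeshed code:

```python
from dataclasses import dataclass
from enum import IntEnum
from pathlib import Path

class Verbosity(IntEnum):  # hypothetical stand-in for the script's Verbosity
    QUIET = 0
    NORMAL = 1
    VERBOSE = 2

@dataclass
class Result:  # hypothetical stand-in for the script's Result
    test_case_dir: Path
    tempdir: Path

    def print_description(self, *, verbosity: Verbosity) -> None:
        # Print nothing at QUIET; add the tempdir detail only at VERBOSE.
        if verbosity is Verbosity.QUIET:
            return
        print(f"Test cases from {self.test_case_dir}")
        if verbosity is Verbosity.VERBOSE:
            print(f"  (running in {self.tempdir})")
```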
As we add more and more test cases, `regr_test.py` is getting kinda slow. Other than mypy_primer, it's now the slowest CI job we have by a long way (and we don't run mypy_primer on all PRs -- for example, it's skipped on this PR!).

The reason for the slowness is that we now have regression tests for 11 stubs packages. We run all regression tests on Python 3.8-3.12 inclusive, and we run them all on linux, darwin and win32. That adds up to a total of 165 subprocesses created for each run of the test when it's run with `--all` (the flag we use in CI).

At some point we may want to consider sharding this test between GitHub Actions workers, similar to the way we run `mypy_test.py` and pyright in CI. (We can also possibly reconsider whether we need to, e.g., run all tests on darwin, linux and Windows.) For now, though, we can speed things up a lot just by running the subprocesses concurrently using a `ProcessPoolExecutor`. This cuts the execution time in CI roughly in half, from around 5-6 minutes to around 2-3 minutes.

(N.B.: A `ProcessPoolExecutor` feels like a slightly blunt instrument here with a lot of overhead; I'm sure there are more efficient ways of spawning subprocesses concurrently. I got this to work reasonably quickly, however, and when I tried different approaches I quickly ran into race conditions. I think this is ~good enough for now.)
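The 165 figure above is just the product of the three dimensions of the test matrix:

```python
stub_packages = 11   # stubs packages that have regression test cases
python_versions = 5  # Python 3.8 through 3.12 inclusive
platforms = 3        # linux, darwin, win32

subprocesses = stub_packages * python_versions * platforms
print(subprocesses)  # → 165 subprocesses per --all run
```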