Releases · pytorch/test-infra

29 Sep 18:30

v20250929-182904

73efcae

v20250929-182904

[autorevert] fix RetryWithBackoff, add tests (#7243)

a followup to https://github.com/pytorch/test-infra/pull/7241

fixes the logic and adds unit tests

Assets 13

29 Sep 16:18

github-actions

v20250929-161641

aa5c240

v20250929-161641

[AUTOREVERT] use secret store over environment variables for password…

Assets 13

29 Sep 16:01

github-actions

v20250929-155929

6ec1bf7

v20250929-155929

[AUTOREVERT] Add retry with back-off for GH API and CH (#7241)

Just going on the code, finding where we call external API, and adding a
retry with exponential back-off.

Defaults to 5 retries, 0.5s base and with 10% jitter

There are NO CODE CHANGES, all parts of the code that are relevant are
being guardrailed with:

```
for attempt in RetryWithBackoff():
    with attempt:
        # the code 
```

Changes appear to be big due:

* Extra tabs and the consequent linter changes
* Lazy nature of the gh and ch libraries, that resolve pagination as the
code consume information

Assets 13

29 Sep 12:42

github-actions

v20250929-124114

06985bf

v20250929-124114

[autorevert] fix handling for insufficient successes (#7235)

Previously the code was trying to group branches for restarts resulting
from "infra check" and from "insufficient events", and this was a
mistake, resulting in delayed restarts.

Specifically, in this situation:
<img width="999" height="747" alt="image"
src="https://github.com/user-attachments/assets/9cd0051e-8d87-4fe2-af90-88a776847c4d"
/>
a restart on the success side is expected, but the system waits for
pending job on the failure side.


This PR decouples and simplifies the logic. Now, all restarts are
scheduled independently (relying on set deduplication) and all final
checks are performed afterwards.

Added a unit test to specifically verify the case above.

Assets 13

26 Sep 20:05

github-actions

v20250926-200342

d2b0c00

v20250926-200342

[AUTOREVERT] Checks label `autorevert: disable` and notify when not r…

Assets 13

26 Sep 17:44

github-actions

v20250926-174226

1b83d3e

v20250926-174226

[autorevert] improve restart logic with pacing, cap, and backoff (#7226)

Changes:

- workflow_checker.restart_workflow now always dispatches and returns
None
deduplication on `restart_workflow` removed, as we can dispatch > 2
events total per commit (e.g. when covering gaps)
- new restarts gating logic based on CH event history (per commit & wf,
only non-dry-run events):
  - Pacing: skip restart if has a successful restart within 20m of now
  - Cap: skip if total restarts (successful & failed) >= 5
- Backoff: recent restarts were failures, wait 20m, 40m, 60m (max), cap
based on failure streak size

Assets 13

26 Sep 15:21

github-actions

v20250926-151930

9f9d729

v20250926-151930

[AUTOREVERT] Remove unused files (#7227)

just removing some unused files that can't be reached by `__main__`.

Assets 12

26 Sep 13:26

github-actions

v20250926-132458

9489aad

v20250926-132458

[autorevert] update failure threshold to 3 for autorevert eligibility…

Assets 13

25 Sep 23:53

github-actions

v20250925-235116

742f25f

v20250925-235116

[AUTOREVERT] Adds circuit breaker with issue in pytorch/pytorch 'ci: …

Assets 13

25 Sep 19:08

github-actions

v20250925-190654

9b326c7

v20250925-190654

opensearch/search similar failures: setup for using ttl (#7222)

Some context is https://github.com/pytorch/test-infra/issues/7221

This makes it so that the search can search multiple indexes, and the
insertion gets inserted to an index that is based on the month

Then we can delete the indices when they get too old (I think this is
going to be done in the UI? I'm not sure if this is in terraform)
I am also manually deleting records > 1 year old

We could also do some stuff with rollovers and aliases?, but I think
this is more convenient

Testing:
Check that the similar failure search still worked but thats it

Assets 13

Releases: pytorch/test-infra

v20250929-182904

Uh oh!

v20250929-161641

Uh oh!

v20250929-155929

Uh oh!

v20250929-124114

Uh oh!

v20250926-200342

Uh oh!

v20250926-174226

Uh oh!

v20250926-151930

Uh oh!

v20250926-132458

Uh oh!

v20250925-235116

Uh oh!

v20250925-190654

Uh oh!