ref(opsgenie): verify integration key upon key save rather than upon alert rule save #67081

cathteng · 2024-03-15T20:37:09Z

Opsgenie has a complex rate limiting strategy: https://docs.opsgenie.com/docs/api-rate-limiting 😞

Currently, when someone saves a metric alert that has Opsgenie trigger actions, each action is validated in turn by POSTing to the authenticate integration API. We don't do anything with the response, so the call appears to exist only to validate the integration key. Opsgenie doesn't have an API for verifying integration keys, so our approach has been to hit an endpoint and check whether the request is authorized.

Due to Opsgenie's rate limiting strategy, it is easy to get rate limited on this API because we call this POST repeatedly when saving an alert rule. The current response is "Invalid integration key" regardless of the actual status code returned by the API, which is not helpful.

We should validate the integration key when it is saved rather than on alert save, because an alert might have multiple Opsgenie trigger actions. This also prevents invalid integration keys from being saved in the first place. I also switched the API we hit to check the key's validity from a POST to a GET, which will hopefully be subject to a higher rate limit.
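To illustrate the idea of verifying at key-save time, here is a minimal sketch that calls the verification API at most once per new key. It is not the code in this PR: `keys_needing_verification`, `verify_on_save`, and the `check_key` callback are hypothetical names, and skipping keys that are already saved is an assumption about how one might minimize API calls.

```python
from typing import Callable, Iterable


def keys_needing_verification(saved: Iterable[str], submitted: Iterable[str]) -> list[str]:
    """Return submitted keys that are not already saved, in order, without duplicates.

    Assumption: keys that were saved earlier were verified at that time,
    so they do not need another round trip to Opsgenie.
    """
    seen = set(saved)
    pending: list[str] = []
    for key in submitted:
        if key not in seen:
            seen.add(key)
            pending.append(key)
    return pending


def verify_on_save(
    saved: Iterable[str],
    submitted: Iterable[str],
    check_key: Callable[[str], bool],
) -> list[str]:
    """Call check_key once per new key and return the keys that failed.

    check_key is a hypothetical stand-in for the GET request described above.
    """
    return [key for key in keys_needing_verification(saved, submitted) if not check_key(key)]
```

With this shape, saving an alert rule with several Opsgenie trigger actions needs no verification calls at all, because every key it references was already checked when it was saved.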

I also changed how the error is parsed so that the error messages shown when filling out the form are more informative.
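As a rough sketch of what status-aware error messages could look like, the function below maps an HTTP status code to a message for the form. `describe_key_error` and its `team_name` parameter are hypothetical; only the 429 wording is taken from this PR, and the other messages are illustrative assumptions.

```python
def describe_key_error(status_code: int, team_name: str) -> str:
    """Return a form-friendly message for a failed key verification.

    Hypothetical helper: only the 429 text comes from the PR itself.
    """
    if status_code == 429:
        # Rate limited by Opsgenie: the key may well be valid.
        return "Too many requests. Please try updating one team/key at a time."
    if status_code == 401:
        # Opsgenie rejected the key as unauthorized.
        return f"Invalid integration key for team {team_name}."
    # Anything else is reported with its status code instead of being
    # collapsed into "Invalid integration key".
    return f"Opsgenie returned an unexpected error (HTTP {status_code})."
```

Keeping the rate-limit case separate matters most here, since a 429 says nothing about whether the key itself is valid.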

Screen.Recording.2024-03-15.at.14.50.54.mov

cathteng · 2024-03-15T20:37:57Z

src/sentry/integrations/opsgenie/client.py

-    # This doesn't work if the team name is "." or "..", which Opsgenie allows for some reason
-    # despite their API not working with these names.
-    def get_team_id(self, team_name: str) -> BaseApiResponseX:
-        params = {"identifierType": "name"}
-        quoted_name = quote(team_name)
-        path = f"/teams/{quoted_name}"
-        return self.get(path=path, headers=self._get_auth_headers(), params=params)


this is unused

ykamo001 · 2024-03-15T20:47:08Z

src/sentry/integrations/opsgenie/integration.py

+            except ApiError as e:
+                logger.info(
+                    "opsgenie.authorization_error",
+                    extra={"error": str(e), "status_code": e.code},
+                )
+                if e.code == 429:
+                    raise ApiRateLimitedError(
+                        "Too many requests. Please try updating one team/key at a time."
+                    )
+                elif e.code == 401:
+                    raise ApiUnauthorized(f"Invalid integration key {integration_key}")
+                raise


do we know how this affects our UX on the FE/product?

good catch, i improved the UX by slightly changing what we return from the API if we raise an error

codecov · 2024-03-15T21:08:28Z

Codecov Report

Attention: Patch coverage is 90.62500%, with 3 lines in your changes missing coverage. Please review.

Project coverage is 84.32%. Comparing base (12e816b) to head (a1d596a).
Report is 10 commits behind head on master.

Additional details and impacted files

@@           Coverage Diff           @@
##           master   #67081   +/-   ##
=======================================
  Coverage   84.32%   84.32%           
=======================================
  Files        5306     5306           
  Lines      237081   237100   +19     
  Branches    41001    41008    +7     
=======================================
+ Hits       199907   199938   +31     
+ Misses      36956    36944   -12     
  Partials      218      218

Files	Coverage Δ
.../integrations/organization_integrations/details.py	`98.38% <100.00%> (+3.22%)`	⬆️
src/sentry/incidents/logic.py	`95.52% <100.00%> (+0.38%)`	⬆️
src/sentry/integrations/opsgenie/client.py	`100.00% <100.00%> (+6.25%)`	⬆️
src/sentry/integrations/opsgenie/integration.py	`93.85% <88.00%> (-1.65%)`	⬇️

... and 5 files with indirect coverage changes

vartec

I have some opinions, but nothing blocking

src/sentry/api/endpoints/integrations/organization_integrations/details.py

src/sentry/integrations/opsgenie/integration.py

cathteng requested review from a team and leeandher March 15, 2024 20:37

cathteng requested a review from a team as a code owner March 15, 2024 20:37

github-actions bot added the Scope: Backend label Mar 15, 2024

ykamo001 approved these changes Mar 15, 2024

vercel bot deployed to Preview March 15, 2024 21:52

vercel bot deployed to Preview March 15, 2024 23:03

vartec approved these changes Mar 18, 2024

cathteng added 5 commits March 18, 2024 11:16

verify integration key upon input rather than upon alert rule save

600efaa

improve how errors show up in FE integration forms

9c549f4

try to minimize hitting the API

f41bc93

fix test

5ff0f6e

fixes from review

a1d596a

cathteng force-pushed the cathy/opsgenie/verify-integration-key branch from b73a53b to a1d596a March 18, 2024 18:25

vercel bot deployed to Preview March 18, 2024 18:27

cathteng merged commit bd52d45 into master Mar 18, 2024

cathteng deleted the cathy/opsgenie/verify-integration-key branch March 18, 2024 19:32

github-actions bot locked and limited conversation to collaborators Apr 3, 2024
