Question 1

How do I know if a third-party API breaking change caused my outage?

Accepted Answer

Signs of a third-party API breaking change include errors in response-parsing code, unexpected null values or missing fields, HTTP 200 responses that carry incorrect data, and failures isolated to a single integration path with no recent changes on your end. To confirm, compare a raw API response against your expected schema, then check the provider's status page and changelog.
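
As a rough illustration of comparing a raw response against an expected schema, the sketch below checks a decoded JSON payload field by field. The field names, the `EXPECTED_SCHEMA` mapping, and the sample payload are all hypothetical; substitute whatever your integration actually reads.

```python
from typing import Any

# Hypothetical expected shape for one endpoint: field name -> expected Python type.
EXPECTED_SCHEMA: dict[str, type] = {
    "id": str,
    "amount": int,
    "currency": str,
    "customer": dict,
}


def find_schema_mismatches(payload: dict[str, Any], expected: dict[str, type]) -> list[str]:
    """Return human-readable differences between a payload and the expected shape."""
    problems = []
    for field, expected_type in expected.items():
        if field not in payload:
            problems.append(f"missing field: {field}")
        elif payload[field] is None:
            problems.append(f"unexpected null: {field}")
        elif not isinstance(payload[field], expected_type):
            actual = type(payload[field]).__name__
            problems.append(
                f"type changed: {field} expected {expected_type.__name__}, got {actual}"
            )
    return problems


# A made-up response that came back with HTTP 200 but no longer matches the parser.
raw_response = {"id": "txn_123", "amount": "1999", "currency": None}
mismatches = find_schema_mismatches(raw_response, EXPECTED_SCHEMA)
for line in mismatches:
    print(line)
```

An empty result does not prove the provider is innocent, but a non-empty one on an integration you have not touched recently is strong evidence.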

Question 2

How quickly should I respond to a third-party API breaking change?

Accepted Answer

Immediately. The moment you suspect a third-party API breaking change, start containing the blast radius (disable or degrade the affected features) and diagnose in parallel, because customers are already affected. For critical-path failures, aim to have a fix deployed within 1-2 hours. Alert your support team right away so they can communicate status to users.
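
One way to make "disable or degrade" a fast operation is a kill switch around each integration call. The following is a minimal sketch, assuming an in-memory flag set; `DISABLED_INTEGRATIONS`, `call_with_kill_switch`, and the shipping-rate functions are hypothetical names, and a real system would read the flag from whatever feature-flag or config service it already uses.

```python
from typing import Callable, TypeVar

T = TypeVar("T")

# Hypothetical in-memory flag store standing in for a real feature-flag service.
DISABLED_INTEGRATIONS: set[str] = set()


def disable_integration(name: str) -> None:
    """Flip the kill switch for one integration."""
    DISABLED_INTEGRATIONS.add(name)


def call_with_kill_switch(name: str, call: Callable[[], T], fallback: Callable[[], T]) -> T:
    """Skip the third-party call entirely while the integration is disabled."""
    if name in DISABLED_INTEGRATIONS:
        return fallback()
    return call()


def fetch_live_rates() -> dict:
    # Stand-in for the real integration call that is currently failing to parse.
    raise KeyError("rates")


def flat_rate_fallback() -> dict:
    # Degraded but usable behaviour while the fix is being written.
    return {"source": "fallback", "rate_cents": 799}


disable_integration("shipping_rates")
result = call_with_kill_switch("shipping_rates", fetch_live_rates, flat_rate_fallback)
print(result)
```

With the switch flipped, the broken call is never attempted, which stops the errors while you diagnose.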

Question 3

Do third-party APIs have to notify you before making breaking changes?

Accepted Answer

There's no universal requirement, but most reputable API providers follow semantic versioning and provide deprecation periods. However, changes the provider considers backward-compatible (adding new fields, expanding enums, changing nested structure) often ship without notification, even though they can break tightly coupled integrations. Active monitoring is the only reliable way to catch these changes quickly.
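
On your side of the integration, a tolerant parser reduces how tightly coupled you are to the provider's exact payload. The sketch below assumes a hypothetical order payload with `id` and `status` fields: it ignores fields it does not know and maps unrecognised enum values to an explicit `UNKNOWN` member instead of raising.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Any


class OrderStatus(Enum):
    PENDING = "pending"
    SHIPPED = "shipped"
    UNKNOWN = "unknown"  # catch-all for values the provider adds later

    @classmethod
    def parse(cls, raw: Any) -> "OrderStatus":
        try:
            return cls(raw)
        except ValueError:
            # A new or missing status degrades to UNKNOWN instead of crashing.
            return cls.UNKNOWN


@dataclass
class Order:
    order_id: str
    status: OrderStatus

    @classmethod
    def from_payload(cls, payload: dict[str, Any]) -> "Order":
        # Read only the fields we need; extra fields are ignored, not rejected.
        return cls(
            order_id=str(payload["id"]),
            status=OrderStatus.parse(payload.get("status")),
        )


# The provider added a "tracking" object and a new "on_hold" status.
order = Order.from_payload({"id": 42, "status": "on_hold", "tracking": {"carrier": "x"}})
print(order)
```

This does not help with restructured or removed fields, which is why monitoring is still needed.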

Question 4

What's the difference between an API outage and an API breaking change?

Accepted Answer

An API outage means the service is unavailable: requests fail with 5xx errors or time out. Standard uptime monitoring catches this. An API breaking change means the service is available and returning 200, but the response structure has changed in a way that breaks your integration code. Uptime monitoring won't catch breaking changes; you need schema drift monitoring that validates response structure, not just availability.
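
The distinction can be shown with two checks applied to the same probe result. This is a simplified sketch: `REQUIRED_FIELDS` is a hypothetical set of fields the integration reads, a status code of `None` stands for a timeout, and 4xx responses are not treated specially.

```python
import json
from typing import Optional

REQUIRED_FIELDS = {"id", "status", "total"}  # hypothetical fields the integration reads


def uptime_check(status_code: Optional[int]) -> bool:
    """What a basic uptime monitor sees: did we get a non-5xx answer at all?"""
    return status_code is not None and status_code < 500


def structure_check(body: str) -> bool:
    """What a schema check sees: is the body a JSON object with the fields we need?"""
    try:
        data = json.loads(body)
    except json.JSONDecodeError:
        return False
    return isinstance(data, dict) and REQUIRED_FIELDS.issubset(data)


def classify(status_code: Optional[int], body: str) -> str:
    if not uptime_check(status_code):
        return "outage"
    if not structure_check(body):
        return "breaking change"
    return "healthy"


print(classify(503, ""))                                          # service down
print(classify(200, '{"id": 1, "state": "paid", "total": 5}'))    # "status" was renamed
print(classify(200, '{"id": 1, "status": "paid", "total": 5}'))   # all good
```

The second probe passes the uptime check and fails only the structure check, which is the case availability monitoring alone would report as fine.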

Question 5

How can I prevent third-party API breaking changes from affecting production?

Accepted Answer

The most effective prevention is schema drift monitoring: automated checks that capture your expected API response structure and alert you the moment it changes. This can give you hours or days to fix your integration before users are affected. Additionally, pin API versions when the provider supports it, run integration tests against the provider's live or staging API, implement circuit breakers in your integration code, and build in graceful degradation for non-critical third-party features.
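
To make "capture the structure and alert when it changes" concrete, here is a minimal sketch of the idea, not a description of any particular monitoring product. `capture_shape` and `diff_shapes` are hypothetical helpers, the payloads are invented, and the code assumes list items are homogeneous so the first item stands for the rest.

```python
from typing import Any


def capture_shape(value: Any) -> Any:
    """Reduce a JSON value to its structure: keys and type names, no data."""
    if isinstance(value, dict):
        return {key: capture_shape(item) for key, item in value.items()}
    if isinstance(value, list):
        # Assumption: list items are homogeneous, so the first one represents them all.
        return [capture_shape(value[0])] if value else []
    return type(value).__name__


def diff_shapes(baseline: Any, current: Any, path: str = "$") -> list[str]:
    """List the paths where the current shape differs from the baseline."""
    if isinstance(baseline, dict) and isinstance(current, dict):
        changes = []
        for key in sorted(baseline.keys() | current.keys()):
            child = f"{path}.{key}"
            if key not in current:
                changes.append(f"removed: {child}")
            elif key not in baseline:
                changes.append(f"added: {child}")
            else:
                changes.extend(diff_shapes(baseline[key], current[key], child))
        return changes
    if isinstance(baseline, list) and isinstance(current, list):
        if baseline and current:
            return diff_shapes(baseline[0], current[0], f"{path}[]")
        return []  # an empty list gives nothing to compare
    if baseline != current:
        return [f"changed: {path} ({baseline!r} -> {current!r})"]
    return []


baseline = capture_shape(
    {"id": "a1", "customer": {"email": "x@example.com"}, "items": [{"sku": "s", "qty": 1}]}
)
current = capture_shape(
    {"id": "a1", "customer": {"contact": {"email": "x@example.com"}},
     "items": [{"sku": "s", "qty": "1"}], "region": "eu"}
)
drift = diff_shapes(baseline, current)
for change in drift:
    print(change)
```

Run on a schedule against a stored baseline, a non-empty `drift` list is the signal to alert on. Whether a purely additive change such as `$.region` deserves an alert depends on how strictly your parser treats unknown fields.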

What To Do When a Third-Party API Breaks Your Production App

Step 1: Confirm It's Actually an API Breaking Change

Step 2: Contain the Blast Radius

Step 3: Diagnose the Exact Schema Mismatch

Step 4: Write the Fix

Step 5: Deploy and Verify

Step 6: Write the Postmortem

The Prevention Layer: Monitoring Third-Party APIs for Schema Drift

FAQ
