Testing your API
We write tests to build trust. Trust that our software is reliable, safe, and extendable. When it comes to testing public APIs, robust testing gives users confidence that the API behaves as expected. This guide covers not only the how of API testing, but also how to project the confidence your tests create outward to the developers who consume your API.
Why API testing matters
API testing involves sending requests to your API endpoints and validating the responses. It’s faster and more focused than UI testing, allowing for fast feedback, better coverage of edge cases and errors, and more stable verification of the contracts between system components.
But for public or third-party-consumed APIs, the why extends further. Your tests aren’t just for you to catch regressions. They are potentially the single most accurate, up-to-date source of truth about how your API actually behaves, especially in nuanced situations.
The test pyramid does not apply to API testing
API testing clashes with the traditional test pyramid - you know, the one that says, “Many unit tests at the bottom, fewer integration tests in the middle, and a few end-to-end tests at the top.”
This pyramid is a good model for verifying the implementation details of small units of work, but it’s a terrible model for verifying an API’s behavior. Unit tests, useful on the server when testing complex logic, don’t tell us much about how the API behaves as a whole.
We need a new model for API tests.
The API test trapezoid
The traditional pyramid works well enough for internal implementation details, but for verifying API behavior from a consumer’s viewpoint, the emphasis shifts.
It’s not exactly a pyramid, but you get the idea.
At the base, we have unit tests. These are the tests that verify the internal workings of your API. They’re important for maintaining code quality and catching regressions, but they don’t tell us much about how the API behaves from a consumer’s perspective, so we’ll limit their scope and keep them private.
In the middle, we’ve added contract tests. These tests verify that the API’s contract with consumers is upheld. They’re a step up from unit tests because they verify the API’s behavior at its boundaries. We’ve covered contract testing in detail in a previous post.
We’ll focus on integration tests at the top-middle layer of the pyramid. These tests verify the API’s behavior from a consumer’s perspective. They’re the most relevant tests for API consumers because they show exactly how the API behaves in different scenarios.
Finally, end-to-end tests sit at the top. These tests verify the API’s behavior in a real-world scenario, often spanning multiple systems. They’re useful from an API consumer’s perspective, but they’re also the most brittle and expensive to maintain (unless you’re generating them, but we’ll get to that).
Let’s focus on why we’re advocating for making these integration tests public as part of your SDK.
A practical guide to API integration testing
We’ve established that integration tests are vital for verifying API behavior from a consumer’s viewpoint - but how do you write good integration tests? Here’s a practical approach:
1. Prerequisites
Before you start, ensure you have:
- Access to a test environment: Obtain access to a deployed instance of your API, ideally isolated from production (for example, staging or QA). This environment should have representative, non-production data.
- API credentials: Acquire valid credentials (API keys, OAuth tokens) for accessing the test environment. Use dedicated test credentials, never production ones.
- A testing framework: Choose a testing framework suitable for your language (for example, `pytest` for Python, `Jest`/`Vitest` for Node.js/TypeScript, `JUnit`/`TestNG` for Java, or the built-in `testing` package for Go).
- An HTTP client or SDK: Establish the way you’ll make requests. While you can use standard HTTP clients (`requests`, `axios`, `HttpClient`), we strongly recommend using your own SDK for these tests. This validates the SDK itself and mirrors the consumer experience.
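Before any test runs, it pays to resolve the test-environment configuration in one place and fail fast if it’s incomplete. Here’s a minimal sketch; the variable names (`API_TEST_BASE_URL`, `API_TEST_TOKEN`) and the staging URL are illustrative assumptions, not a convention from any particular SDK:

```typescript
// Sketch: resolve test-environment configuration once, before any test runs.
interface TestConfig {
  baseUrl: string;
  token: string;
}

export function loadTestConfig(
  env: Record<string, string | undefined> = process.env,
): TestConfig {
  // Default to a non-production base URL so a misconfigured run can never
  // accidentally point at production.
  const baseUrl = env.API_TEST_BASE_URL ?? "https://api.staging.example.com";
  const token = env.API_TEST_TOKEN;
  if (!token) {
    // Fail fast with a clear message instead of letting every test 401.
    throw new Error("API_TEST_TOKEN is not set; aborting test run.");
  }
  return { baseUrl, token };
}
```

Keeping this check in a shared helper means a missing credential produces one clear failure rather than dozens of confusing authentication errors.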
2. Test structure
The Arrange-Act-Assert (AAA) pattern provides you with a clear, standardized approach to structuring your tests:
- Arrange: Set up all preconditions for your test. This includes initializing the SDK client, preparing request data, and potentially ensuring the target test environment is in the correct state (for example, by checking that the required resources exist or don’t exist).
- Act: Execute the specific action you want to test. This is typically a single method call on your SDK that corresponds to an API operation.
- Assert: Verify the outcome of the action. Check the response status, headers, and body content. Assert any expected side effects if applicable.
You can also include a Teardown step (often via `afterEach`, `afterAll`, or fixture mechanisms in testing frameworks) to clean up any resources created during the test, ensuring your tests don’t interfere with one another.
Here’s an example using TypeScript with Vitest, testing the Vercel SDK generated by Speakeasy. Notice how the code maps to the AAA pattern:
In this example, the AAA comments make the test flow obvious to someone reading the tests.
Because the test uses the Vercel SDK rather than raw HTTP calls, it exercises the same code path your consumers use, verifying the SDK and the API together.
3. Selecting scenarios to test
When deciding which tests to write, make sure you cover happy paths and edge cases, avoiding the temptation to write tests only for the way your API was “meant to be used”.
- Happy paths: Verify the core functionality works as expected with valid inputs, for example, by checking that it can list resources or retrieve a specific item.
- Error handling: Test how the API responds to invalid scenarios. Use `try...catch` or framework-specific methods (`expect().rejects`) to verify error responses.
- Edge cases: Test boundary conditions like pagination limits or the handling of optional parameters.
- Business logic: Verify specific rules unique to your domain (this depends heavily on the API’s features).
- Authentication: Test different auth scenarios, such as valid, expired, and missing credentials.
4. Managing state and dependencies
- Isolation: Aim for independent tests. Avoid tests that rely on the state left by a previous test. Use `beforeEach`/`afterEach` or test fixtures to set up and tear down state.
- Test data: Use dedicated test accounts and seed data. Generate unique identifiers (like UUIDs or timestamped names) within tests to prevent collisions.
- Cleanup: Implement cleanup logic to remove created resources. This is important for keeping the test environment clean and tests repeatable.
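The unique-naming and cleanup ideas above can be sketched as follows; `deleteResource` is a placeholder for whatever delete operation your SDK exposes:

```typescript
import { randomUUID } from "node:crypto";

// IDs of resources created during the current test, to be removed afterwards.
export const createdIds: string[] = [];

// A UUID suffix keeps parallel test runs from colliding on resource names.
export function uniqueTestName(prefix: string): string {
  return `${prefix}-${randomUUID()}`;
}

// Placeholder: a real implementation would call the SDK's delete operation.
async function deleteResource(id: string): Promise<void> {}

// Run from afterEach/afterAll to remove everything the test created.
export async function cleanup(): Promise<void> {
  await Promise.all(createdIds.map((id) => deleteResource(id)));
  createdIds.length = 0;
}
```

Registering every created resource in one place makes teardown a single call, so a failing assertion mid-test can’t leave orphaned data behind.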
5. Running tests
- Locally: Run tests frequently during development against your local or a dedicated dev environment.
- CI/CD: Integrate tests into your CI/CD pipeline. Run them automatically on every commit or pull request against a staging environment before deploying to production.
By following these steps and focusing on testing through your SDK, you’ll build a strong test suite that verifies your API’s actual behavior from the perspective of your consumers. These are precisely the kinds of tests - written using the SDK - that provide immense value when shared publicly.
Publishing tests
When your API is a black box, every consumer pays a “reverse engineering tax”: they must rediscover knowledge that you already have. If you have that knowledge internally, save your API consumers the trouble and share it with them.
API-first companies are already following this approach. Stripe, for example, maintains extensive test fixtures and behavioral tests as part of their SDKs. These tests serve as both verification and documentation, showing exactly how their APIs respond in various scenarios.
Here’s what belongs in the public domain:
✅ Authentication flow verification: Tests that demonstrate how authentication works, covering token acquisition, refresh flows, and error handling.
✅ Rate limit behavior tests: Tests that verify how your API behaves when rate limits are approached or exceeded.
✅ Error condition handling: Tests that demonstrate how your API responds to different error states, such as invalid inputs, missing resources, and service errors.
✅ State transition tests: Tests that verify how resources change state between API operations.
✅ Complex business logic validation: Tests that verify domain-specific rules and constraints.
Here’s what should remain private:
❌ SDK implementation unit tests: Tests that verify specific SDK implementation details or internal methods.
❌ SDK build verification: Tests that ensure your SDK builds correctly on different platforms or versions.
❌ Internal platform tests: Tests that verify behavior of internal services or dependencies.
❌ SDK compatibility checks: Tests that verify compatibility with different language versions or environments.
The distinction comes down to this: If the test verifies behavior that your API consumers need to understand, it should be public. If it verifies internal implementation details that could change without affecting the API’s external behavior, it should remain private.
This separation creates a clean boundary between what’s public and what’s private:
| Public tests (ship these) | Private tests (keep these) |
| --- | --- |
| Verify API behavior | Verify implementation details |
| Written from consumer perspective | Written from maintainer perspective |
| Stable as long as the API is stable | May change with internal refactoring |
| Serve as executable documentation | Serve as implementation verification |
| Focus on what happens at API boundaries | Focus on internal components |
By making this distinction clear, you can confidently share the tests that provide value to your consumers while maintaining the freedom to change your implementation details privately.
What is gained by sharing your API tests?
When you publish your API behavior tests, you transform your internal verification tools into living documentation that can never go stale. Unlike traditional documentation, tests fail loudly when they don’t match reality. This creates a powerful guarantee for your users - if the tests pass, the documented behavior is accurate.
Public tests create a cycle of trust and quality:
- Improved API design: When you know your tests will be public, you design more thoughtfully.
- Higher test quality: Public scrutiny leads to better, higher-quality tests.
- Reduced support burden: Users can answer their own questions by examining tests.
- Faster integration: Developers can understand behavior more quickly and completely.
- Increased trust: Transparent verification builds confidence in your API.
The very act of preparing tests for public consumption forces a level of clarity and quality that might otherwise be neglected.
Further testing considerations
While sharing your API’s behavioral tests provides tremendous value, there are several complementary testing approaches worth researching:
Contract testing
Investigate tools like Pact that formalize the provider-consumer contract. These approaches complement public tests by allowing consumers to define their expectations explicitly.
Chaos testing
Research how companies like Netflix use chaos engineering to verify resilience by deliberately injecting failures into their systems. Understanding how your API behaves under partial failure helps consumers build more robust integrations.
Performance test benchmarks
Consider publishing performance benchmarks alongside behavioral tests. These reveal important scaling characteristics like throughput limits and latency under various loads, which impact consumer application design.
Sharing performance tests in full may be risky due to the potential for misuse, but sharing high-level results and methodologies can still provide valuable insights.
Security testing frameworks
Explore frameworks like OWASP ZAP that automate security testing of APIs, probing for common vulnerabilities such as injection flaws and broken authentication. Detailed security test results typically stay private, but sharing your methodology can build trust.
Consumer-driven testing
Research how companies implement consumer-driven tests where API consumers contribute test cases representing their usage patterns. This collaborative approach strengthens the relationship between provider and consumer.
Consumer-driven testing overlaps with contract testing, but it emphasizes the consumer’s perspective more directly.
Snapshot testing
Look into snapshot testing, which captures API responses and compares them against stored snapshots, flagging unintended changes to response structure.
Testing in production
Investigate techniques like feature flags and canary releases that let you safely verify new API behavior against real production traffic for a limited subset of users.