System Design API Design for Interviews: Contracts, Idempotency, and Pagination

Design APIs interviewers trust by focusing on resource boundaries, request contracts, and failure-safe behavior.

Abstract Algorithms

·Mar 12, 2026·11 min read

AI-assisted content. This post may have been written or enhanced with AI tools. Please verify critical information independently.

TLDR: In system design interviews, API design is not a list of HTTP verbs. It is a contract strategy: clear resource boundaries, stable request and response shapes, pagination, idempotency, error semantics, and versioning decisions that survive scale and failures.

TLDR: Good API design reduces ambiguity for clients and prevents operational incidents when traffic grows.

📖 Why API Design Is an Architecture Decision, Not a Syntax Exercise

In 2015, Stripe renamed the amount field to amount_cents in their charge response object — a clarification that made semantic sense internally. They had an active changelog and sent developer notifications. It still silently broke integrations for hundreds of third-party apps and payment platforms that parsed the response without strict field validation. The result was an emergency hotfix wave, a flood of confused support tickets, and lasting damage to developer trust from teams who had built production payment flows assuming that field name was stable.

That incident became a canonical case study in API design: once external clients depend on a contract, renaming a field without versioning is a breaking change — full stop, regardless of how clearly it is documented. The cost of that one rename was not measured in lines of code. It was measured in broken customer checkouts, engineering sprints, and trust.

This post is about designing APIs so that story does not happen to your system.

Many candidates treat API design as a mechanical step.

"Use REST."
"Add GET and POST."
"Return JSON."

That is not enough for a strong system design answer.

An API is the boundary between independent systems. Once clients integrate, changing that boundary is expensive. Interviewers listen for whether you think about API contracts as long-lived, evolving interfaces under failure, retries, and partial outages.

If you came from System Design Interview Basics, this is the deeper follow-up to step "identify core entities and APIs."

Weak API answer	Strong API answer
Lists endpoints quickly	Explains resource model and constraints first
Ignores retries and duplicates	Specifies idempotency behavior
Omits pagination	Designs for growth and bounded responses
Returns generic errors	Defines structured error semantics

A practical rule: if your API contract does not explicitly handle retries, pagination, and failures, it is not ready for production scale.

🔍 The API Contract Checklist You Should Apply in Every Interview

You can use a reusable checklist to keep API design systematic.

Define the resource model and identifiers.
Define the core operations per resource.
Define request and response fields with explicit constraints.
Define idempotency and retry behavior.
Define pagination and filtering.
Define error model and status semantics.
Define versioning strategy.

Contract element	Why it matters	Example
Resource identity	Avoids accidental duplicate records	`order_id`, `user_id`, `message_id`
Idempotency key	Makes retries safe	`Idempotency-Key` header on create payment
Pagination cursor	Prevents unbounded scans	`next_cursor` for timeline API
Error code taxonomy	Improves client handling	`INVALID_ARGUMENT`, `RATE_LIMITED`, `CONFLICT`
Versioning	Enables non-breaking evolution	`/v1/orders` or media-type versioning

This checklist sounds simple, but it covers most production-grade API risks candidates forget in interviews.

⚙️ API Design Patterns That Prevent Common Failure Modes

Pattern 1: Resource-first endpoint design

Instead of action-heavy endpoints like /createOrder, design around resources:

POST /orders
GET /orders/{order_id}
GET /orders?customer_id=...

This keeps semantics predictable and easier to evolve.

Pattern 2: Idempotent writes for retry safety

Client retries are inevitable during network failures. Without idempotency, retries can create duplicate side effects.

For create operations with financial or inventory impact, require an idempotency key:

Request	Behavior
First `POST /payments` with key `abc-123`	Charge created
Retry `POST /payments` with same key `abc-123`	Return original result, do not double-charge

Pattern 3: Cursor-based pagination

Offset pagination (page=1000) becomes slow and unstable at scale. Cursor pagination is often better for time-ordered datasets.

{
  "items": [ ... ],
  "next_cursor": "eyJjcmVhdGVkX2F0IjoiMjAyNi0wMy0xMlQxMjowMDowMFoifQ=="
}

Pattern 4: Structured errors

Avoid free-form strings as your main failure contract.

{
  "error": {
    "code": "RATE_LIMITED",
    "message": "Too many requests",
    "retry_after_ms": 1200
  }
}

Clients can automate retry/backoff behavior only when errors are machine-readable.

🧠 Deep Dive: Translating Product Behavior Into Stable API Contracts

API contracts are where product semantics become system boundaries.

The Internals: Validation, Idempotency Store, and Backward Compatibility

At runtime, robust API services usually implement these internal mechanisms:

Request validation layer for schema and semantic rules.
Idempotency key store for safe retried writes.
Serialization logic with explicit field defaults.
Contract tests to prevent accidental breaking changes.

A create-order flow often looks like this:

Validate request fields.
Check idempotency key in fast store.
If seen, return previous response.
If new, execute transaction and persist response mapping.

That internal idempotency mapping is often the difference between a resilient API and an incident-prone one.

Backward compatibility also matters. Once mobile clients are released, forcing instant upgrades is usually unrealistic. That is why additive changes (new optional fields) are safer than breaking changes (renamed required fields).

Performance Analysis: Contract Shape, Latency, and Client Efficiency

API performance is not only server speed. Contract shape affects client behavior.

Large payloads increase bandwidth and client parsing overhead.
Chatty APIs (many small calls) increase network round trips.
Missing filter support causes over-fetching.
Missing projection support causes unnecessary payload size.

Performance concern	Contract-level fix
Over-fetching	Add `fields` projection or specialized read models
Large list responses	Use cursor pagination and sensible page limits
Retry storms	Return explicit retry hints and enforce idempotency
N+1 client calls	Add batch endpoints where meaningful

In interviews, saying "I will design the API so clients can fetch exactly what they need" demonstrates both performance awareness and API empathy.

📊 API Lifecycle Flow From Client Request to Stable Response

flowchart TD
    A[Client request] --> B[Schema and semantic validation]
    B --> C{Idempotency key present?}
    C -->|Yes| D[Check idempotency store]
    D --> E{Seen before?}
    E -->|Yes| F[Return previous response]
    E -->|No| G[Execute business transaction]
    C -->|No| G
    G --> H[Persist result and emit event]
    H --> I[Return structured response]

This flowchart captures the API contract philosophy: consistent validation, safe retries, and predictable responses.

📊 REST API Request Lifecycle

sequenceDiagram
    participant C as Client
    participant G as API Gateway
    participant S as Order Service
    participant DB as Database
    C->>G: POST /v1/orders
    G->>G: Auth and rate limit
    G->>S: Forward request
    S->>S: Validate schema
    S->>DB: Check idempotency key
    DB-->>S: Key not found
    S->>DB: Write order record
    DB-->>S: Commit OK
    S-->>G: 201 Created
    G-->>C: 201 + order_id

This sequence diagram traces the full lifecycle of a single REST API request from the client through the gateway and into the order service, highlighting the idempotency key check and the database commit as the two critical correctness gates. The flow shows that the API gateway handles cross-cutting concerns (auth, rate limiting) while the service layer owns business validation and deduplication. Take away: every hop in this chain is an opportunity for failure, which is why idempotency at the service boundary prevents duplicate side effects from network retries.

📊 API Versioning: v1 vs v2 Routes

sequenceDiagram
    participant C1 as Legacy Client
    participant C2 as New Client
    participant G as API Gateway
    participant S1 as v1 Handler
    participant S2 as v2 Handler
    C1->>G: GET /v1/orders/123
    G->>S1: Route to v1
    S1-->>G: {amount_cents: 500}
    G-->>C1: v1 response
    C2->>G: GET /v2/orders/123
    G->>S2: Route to v2
    S2-->>G: {amount: 5.00, currency}
    G-->>C2: v2 response

This versioning sequence diagram shows how a single API gateway routes legacy clients to the v1 handler and new clients to the v2 handler simultaneously, allowing both response shapes to coexist without forcing clients to upgrade. The key flow shows that v1 returns amount_cents while v2 returns the richer amount plus currency shape — the exact breaking change scenario from the Stripe story in the introduction. Take away: version-based routing at the gateway is what makes non-breaking evolution possible, letting old integrations continue working while new clients adopt improved contracts.

🌍 Real-World Applications: Payments, Feeds, and Internal Microservices

Payments API: idempotency is non-negotiable because duplicate charges are business-critical incidents.

Timeline/feed API: pagination and filtering matter most, because reads dominate and data size grows continuously.

Internal microservice APIs: strict schemas, backward compatibility, and explicit error contracts reduce coordination cost between teams.

Different domains stress different contract dimensions, but the checklist remains stable.

⚖️ Trade-offs & Failure Modes: How API Contracts Break at Scale

Failure mode	Symptom	Root cause	First mitigation
Duplicate side effects	Double payments or duplicate orders	Non-idempotent retries	Require idempotency keys
Pagination inconsistency	Missing or repeated records across pages	Offset pagination on mutable datasets	Cursor-based pagination
Client breakage on deploy	Old app versions fail	Breaking response changes	Additive, versioned evolution
Ambiguous error handling	Clients retry incorrectly	Unstructured errors	Machine-readable error taxonomy
Slow mobile performance	Large payloads and high battery use	Over-fetching	Add projections, filters, and compact views

The best interview answer names at least one failure mode and one mitigation tied to API contract design.

🧭 Decision Guide: REST, RPC, and Contract Complexity

Situation	Recommendation
Public web API with diverse clients	REST/HTTP with explicit versioning and error contracts
Internal high-throughput service-to-service calls	RPC/gRPC with strict schemas
Event-driven ingestion APIs	Async acknowledgment plus idempotent processing
Rapidly changing product surface	Stable v1 with additive fields, delayed hard breaks

If you need protocol-level trade-offs, pair this post with System Design Protocols: REST, RPC, and TCP/UDP.

🧪 Practical Example: Design a Create-and-List Orders API

Suppose your interview prompt includes order creation and order history.

You can propose:

POST /v1/orders with idempotency key.
GET /v1/orders/{order_id} for direct lookup.
GET /v1/orders?customer_id=...&cursor=...&limit=... for history.

Define contract constraints:

Field	Constraint
`customer_id`	Required, immutable
`items[]`	At least 1 line item
`currency`	ISO-4217 code
`limit`	1 to 100

Define errors:

INVALID_ARGUMENT for malformed request.
CONFLICT for state conflicts.
RATE_LIMITED when quotas trigger.
INTERNAL for unexpected server errors.

This is strong interview content because it combines API shape, correctness safety, and client usability.

🛠️ Spring Boot and Springdoc OpenAPI: Self-Documenting API Contracts

Spring Boot is the dominant Java framework for building production REST APIs with minimal configuration. Springdoc OpenAPI auto-generates a live OpenAPI 3.0 specification from Spring MVC annotations at application startup — making your API contract machine-readable, testable via Swagger UI, and publishable to API portals without maintaining any manual YAML.

// pom.xml dependency: org.springdoc:springdoc-openapi-starter-webmvc-ui:2.x

@RestController
@RequestMapping("/v1/orders")
@RequiredArgsConstructor
@Tag(name = "Orders", description = "Order lifecycle — create, retrieve, and paginate orders")
public class OrderController {

    private final OrderService orderService;

    @Operation(
        summary     = "Create a new order",
        description = "Idempotent: supply `Idempotency-Key` header to safely retry on network failure."
    )
    @ApiResponses({
        @ApiResponse(responseCode = "201", description = "Order created successfully"),
        @ApiResponse(responseCode = "409", description = "Conflict — duplicate idempotency key with different payload"),
        @ApiResponse(responseCode = "429", description = "Rate limit exceeded — check Retry-After header"),
        @ApiResponse(responseCode = "422", description = "INVALID_ARGUMENT — malformed request body"),
    })
    @PostMapping
    @ResponseStatus(HttpStatus.CREATED)
    public OrderResponse createOrder(
            @RequestHeader("Idempotency-Key")   String              idempotencyKey,
            @RequestBody @Valid                 CreateOrderRequest  request) {
        return orderService.create(idempotencyKey, request);
    }

    @Operation(summary = "List orders for a customer with forward-only cursor pagination")
    @GetMapping
    public PagedOrderResponse listOrders(
            @RequestParam                       Long   customerId,
            @RequestParam(required = false)     String cursor,
            @RequestParam(defaultValue = "20")  int    limit) {
        return orderService.listByCustomer(customerId, cursor, Math.min(limit, 100));
    }
}

// Springdoc auto-exposes at runtime:
//   GET /v3/api-docs          → machine-readable OpenAPI 3.0 JSON
//   GET /swagger-ui/index.html → interactive browser UI
// Zero manual YAML — the annotation IS the living contract.

@Valid on CreateOrderRequest triggers Bean Validation — returning a structured 422 INVALID_ARGUMENT response automatically for malformed payloads. Downstream clients can generate typed SDKs from GET /v3/api-docs using openapi-generator-cli, turning your annotation-based contract into a versioned SDK artifact that evolves in lockstep with the API.

For a full deep-dive on Springdoc OpenAPI customization, Spring Cloud Contract for consumer-driven contract testing, and API versioning strategies in Spring Boot, a dedicated follow-up post is planned.

📚 Lessons Learned

API design is about long-lived contracts, not endpoint naming alone.
Idempotency and pagination should be first-class concerns in write and list APIs.
Structured error semantics improve reliability more than verbose messages.
Backward-compatible changes are cheaper than forced version migrations.
Good API design reduces both operational incidents and client complexity.

📌 TLDR: Summary & Key Takeaways

Start API design with resource boundaries and request contracts.
Build retry safety through idempotent write semantics.
Use cursor pagination for large, mutable datasets.
Return structured errors that clients can act on.
Treat versioning as an evolution strategy, not an afterthought.

Test Your Knowledge

🧠

Ready to test what you just learned?

AI will generate 4 questions based on this article's content.

RAG vs Fine-Tuning: When to Use Each (and When to Combine Them)

TLDR: RAG gives LLMs access to current knowledge at inference time; fine-tuning changes how they reason and write. Use RAG when your data changes. Use fine-tuning when you need consistent style, tone, or domain reasoning. Use both for production assi...

Apr 19, 2026•27 min read

Fine-Tuning LLMs with LoRA and QLoRA: A Practical Deep-Dive

TLDR: LoRA freezes the base model and trains two tiny matrices per layer — 0.1 % of parameters, 70 % less GPU memory, near-identical quality. QLoRA adds 4-bit NF4 quantization of the frozen base, enabling 70B fine-tuning on 2× A100 80 GB instead of 8...

Apr 19, 2026•29 min read

Build vs Buy: Deploying Your Own LLM vs Using ChatGPT, Gemini, and Claude APIs

TLDR: Use the API until you hit $10K/month or a hard data privacy requirement. Then add a semantic cache. Then evaluate hybrid routing. Self-hosting full model serving is only cost-effective at > 50M tokens/day with a dedicated MLOps team. The build ...

Apr 19, 2026•30 min read

Watermarking and Late Data Handling in Spark Structured Streaming

TLDR: A watermark tells Spark Structured Streaming: "I will accept events up to N minutes late, and then I am done waiting." Spark tracks the maximum event time seen per partition, takes the global minimum across all partitions, subtracts the thresho...

Apr 19, 2026•23 min read