Merge Intervals Pattern: Solve Scheduling Problems with Sort and Sweep

Sort once, sweep once: three full Java solutions for the merge intervals pattern that underpins many scheduling systems.

Data Structures and Algorithms

Abstract Algorithms

·Mar 29, 2026·13 min read


Intermediate

For developers with some experience. Builds on fundamentals.

AI-assisted content. This post may have been written or enhanced with AI tools. Please verify critical information independently.

TLDR: Sort intervals by start time, then sweep left-to-right and merge any interval whose start ≤ the current running end. O(n log n) time, O(n) space. One pattern — three interview problems solved.

📖 When Two Meetings Overlap: The Scheduling Problem That Launched This Pattern

Your calendar app needs to render free time slots. You have 500 meetings stored as [start, end] pairs. Some overlap, some are back-to-back, some are completely separate. Before you can show anything useful, you need to collapse all overlapping and touching meetings into their merged spans.

Brute force (comparing every pair) costs O(n²). But there is a smarter observation: if you sort intervals by start time, you never need to look backward. An overlap can only involve the running merged interval you are currently building. That single insight reduces the problem to one sort followed by a linear sweep.

This is the merge intervals pattern: sort once, then greedily extend or commit intervals in a single left-to-right pass. It surfaces across interview problems, scheduling systems, and anywhere ranges of values need to be consolidated.

🔍 The Overlap Condition: When `a.end >= b.start`

Two intervals a = [a.start, a.end] and b = [b.start, b.end] (where a.start <= b.start after sorting) overlap when:

a.end >= b.start

If this holds, the merged interval spans [a.start, max(a.end, b.end)]. Note the max — b might be entirely inside a, in which case a.end stays unchanged.

Three states exist after sorting:

State	Condition	Action
Non-overlapping	`a.end < b.start`	Commit `a` to result, start fresh with `b`
Touching	`a.end == b.start`	Merge: new end = `b.end` (they share a boundary point)
Overlapping	`a.end > b.start`	Merge: new end = `max(a.end, b.end)`

The touching case matters: [1, 3] and [3, 5] share the point 3 and should merge to [1, 5]. The condition a.end >= b.start handles all three cases at once.
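
The condition and the merge rule can be expressed as two small helpers. This is a minimal sketch: the class and method names are illustrative, not part of any library, and both methods assume the pair is already ordered so that a starts no later than b.

```java
final class IntervalOverlap {

    // Assumes a[0] <= b[0], i.e. the pair is already ordered by start time.
    // True for both the touching case (==) and the overlapping case (>).
    static boolean overlapsOrTouches(int[] a, int[] b) {
        return a[1] >= b[0];
    }

    // Returns a new merged interval without modifying a or b.
    // Callers are expected to check overlapsOrTouches first.
    static int[] mergePair(int[] a, int[] b) {
        return new int[] { a[0], Math.max(a[1], b[1]) };
    }
}
```

With these helpers, [1, 3] and [3, 5] merge to [1, 5], [1, 10] and [3, 6] merge to [1, 10] because of the max, and [1, 3] and [4, 6] fail the check and stay separate.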

⚙️ Sort-Then-Sweep: The Two-Step Algorithm for Merging Intervals

The algorithm fits in two clean phases:

Phase 1: Sort all intervals by their start time. After sorting, a new interval can only overlap the running merged interval, never one you have already committed.

Phase 2: Sweep left-to-right, maintaining a current interval:

If the next interval starts before or at current.end, extend current.end to max(current.end, next.end).
Otherwise commit current to the result and make next the new current.

The invariant is powerful: once you sort, you never need to revisit a committed interval. Greedy works here because an interval is committed only when the next start lies strictly past its end, and every later interval starts at or after that point.

Here is the complete Java solution for LeetCode 56 — Merge Intervals:

import java.util.*;

public class MergeIntervals {

    public int[][] merge(int[][] intervals) {
        if (intervals.length <= 1) return intervals;

        // Phase 1: sort by start time
        // Integer.compare avoids the overflow that a[0] - b[0] can hit on extreme values
        Arrays.sort(intervals, (a, b) -> Integer.compare(a[0], b[0]));

        List<int[]> result = new ArrayList<>();
        int[] current = intervals[0];

        // Phase 2: sweep and merge
        for (int i = 1; i < intervals.length; i++) {
            if (intervals[i][0] <= current[1]) {
                // Overlapping or touching — extend the current interval
                current[1] = Math.max(current[1], intervals[i][1]);
            } else {
                // Gap found — commit current and start fresh
                result.add(current);
                current = intervals[i];
            }
        }
        result.add(current); // don't forget the last interval

        return result.toArray(new int[result.size()][]);
    }
}

Trace on [[1,3],[2,6],[8,10],[15,18]]:

Step	Current	Next	Action	Result list
1	`[1,3]`	`[2,6]`	2 ≤ 3 → merge	`current=[1,6]`
2	`[1,6]`	`[8,10]`	8 > 6 → commit	result=`[[1,6]]`, `current=[8,10]`
3	`[8,10]`	`[15,18]`	15 > 10 → commit	result=`[[1,6],[8,10]]`, `current=[15,18]`
End	—	—	commit last	result=`[[1,6],[8,10],[15,18]]`

🧠 Deep Dive: Why Sorting Makes Greedy Correct, and How Complexity Scales

Under the Hood: Internals of the Sorted Sweep

The reason sorting enables a greedy single pass is monotonicity: after sorting by start time, interval i+1 has a start ≥ interval i's start. So if interval i+1 does not overlap the current merged interval, no subsequent interval can overlap it either, because their starts are at least as large. You can safely commit the current interval and move on. No backtracking is required.

Without sorting, a new interval could overlap any previous one, forcing O(n) backward scanning per interval. Sorting eliminates that possibility entirely.

The current variable plays the role of a running window that keeps growing as long as overlaps continue. It is the classic greedy extension strategy — always keep the widest span achievable with the current run of overlapping intervals.

Performance Analysis: Time and Space Trade-offs

Dimension	Complexity	Justification
Time	O(n log n)	Dominated by the sort; sweep is O(n)
Space	O(n)	Result list holds at most n merged intervals
Sort overhead	O(n) worst case	`Arrays.sort` on an `int[][]` with a comparator uses TimSort, which may allocate up to n/2 temporary references

The sweep itself is O(n) — each interval is visited exactly once, and each interval is committed to the result at most once. The bottleneck is always the sort.

Bottleneck scenario: if all intervals overlap (e.g., [[1,100],[2,99],[3,98],...]), the result is a single interval but you still pay O(n log n) for the sort. You cannot do better than O(n log n) in the comparison-based model for this problem.

📊 Three Interval States: Non-Overlapping, Touching, and Overlapping

flowchart TD
    A[Sort intervals by start time] --> B[Pick first as current]
    B --> C{Next interval start <= current end?}
    C -->|"Yes  overlapping or touching"| D["Extend: current.end = max(current.end, next.end)"]
    D --> E[Advance to following interval]
    E --> C
    C -->|"No  gap found"| F[Commit current to result list]
    F --> G[Set next as new current]
    G --> C
    C -->|"No more intervals"| H[Commit final current]
    H --> I[Return result list]

The loop never backtracks. After sorting, each interval is visited exactly once — either merged into the running window or committed to the output.

The three concrete cases from the condition table above map directly to this flowchart's left branch (merge) and right branch (commit). The max on the end time handles the case where one interval is completely inside another.

🌍 Real-World Uses: Google Calendar, CPU Scheduling, and Genome Assembly

Calendar free-slot computation is the most direct use. Given a list of busy intervals, merge them and then take the complement of the merged spans against the working day to find free slots. Free/busy lookups, such as the one exposed by the Google Calendar API, are built around this kind of busy-span data.

CPU scheduling and memory management in operating systems use interval merging to consolidate contiguous time-slice segments or memory blocks. When two adjacent blocks are both free, the allocator coalesces them into a single free block, which is the touching case of the same overlap condition.

Genome assembly in bioinformatics produces overlapping DNA sequence reads that must be merged into contigs. The reads are sorted by position and then merged using the same a.end >= b.start condition — at a scale of millions of intervals.

Input / Process / Output for a calendar free-slot system:

Stage	Data
Input	`[[9,10],[10,12],[11,14],[15,17]]` (meeting times)
After sort	`[[9,10],[10,12],[11,14],[15,17]]` (already sorted)
After merge	`[[9,14],[15,17]]` (busy spans)
Free slots	`[[14,15]]` in a 9–17 workday
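
The pipeline in the table can be sketched in a few lines. This is one possible shape, not a reference implementation: the class name, the method name, and the dayStart/dayEnd parameters are assumptions introduced for illustration. It folds the merge and the complement into a single pass by tracking how far the busy time has reached so far.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class FreeSlots {

    // busy: [start, end] meeting pairs in any order.
    // dayStart/dayEnd: hypothetical workday bounds, with dayStart < dayEnd.
    static List<int[]> freeSlots(int[][] busy, int dayStart, int dayEnd) {
        // Copy the rows so sorting never reorders the caller's array
        int[][] sorted = new int[busy.length][];
        for (int i = 0; i < busy.length; i++) sorted[i] = busy[i].clone();
        Arrays.sort(sorted, (a, b) -> Integer.compare(a[0], b[0]));

        List<int[]> free = new ArrayList<>();
        int cursor = dayStart; // end of the merged busy time seen so far
        for (int[] meeting : sorted) {
            if (meeting[0] > cursor) {
                // Gap between the running busy span and this meeting
                free.add(new int[] { cursor, Math.min(meeting[0], dayEnd) });
            }
            cursor = Math.max(cursor, meeting[1]); // same max rule as the merge
            if (cursor >= dayEnd) break;           // rest of the day is covered
        }
        if (cursor < dayEnd) free.add(new int[] { cursor, dayEnd });
        return free;
    }
}
```

Called with the meetings from the table and a 9 to 17 workday, this returns the single slot [14, 15]. Touching meetings such as [9, 10] and [10, 12] produce no gap because the check is a strict greater-than on the cursor.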

🧪 Three Problems, One Pattern

Problem 1: Merge Intervals (LeetCode 56)

Full solution shown in the ⚙️ section above.

Problem 2: Insert Interval

Given a sorted, non-overlapping list, insert a new interval and return the merged result. There are exactly three phases: collect intervals that end before the new one starts, merge all overlapping intervals into the new one, then collect the rest.

import java.util.*;

public class InsertInterval {

    public int[][] insert(int[][] intervals, int[] newInterval) {
        List<int[]> result = new ArrayList<>();
        int i = 0, n = intervals.length;

        // Phase 1: intervals that end strictly before newInterval starts
        while (i < n && intervals[i][1] < newInterval[0]) {
            result.add(intervals[i++]);
        }

        // Phase 2: merge all overlapping intervals into newInterval
        while (i < n && intervals[i][0] <= newInterval[1]) {
            newInterval[0] = Math.min(newInterval[0], intervals[i][0]);
            newInterval[1] = Math.max(newInterval[1], intervals[i][1]);
            i++;
        }
        result.add(newInterval);

        // Phase 3: intervals that start strictly after newInterval ends
        while (i < n) {
            result.add(intervals[i++]);
        }

        return result.toArray(new int[result.size()][]);
    }
}

The three-phase structure is the key insight: pre, overlap, post. You never need to sort because the input is already sorted.

Problem 3: Meeting Rooms II — Minimum Rooms Needed

How many conference rooms do you need to hold all meetings without conflicts? A min-heap stores the end time of each active meeting. For each new meeting (sorted by start), if the earliest-ending meeting finishes at or before this one starts, you can reuse that room. Otherwise, you need a new room. Note that back-to-back meetings do not conflict here, unlike in the merge problem, where touching intervals are combined.

import java.util.*;

public class MeetingRoomsII {

    public int minMeetingRooms(int[][] intervals) {
        if (intervals.length == 0) return 0;

        // Sort by start time
        Arrays.sort(intervals, (a, b) -> Integer.compare(a[0], b[0]));

        // Min-heap of end times for rooms currently in use
        PriorityQueue<Integer> heap = new PriorityQueue<>();

        for (int[] interval : intervals) {
            // If the earliest-ending room frees up before this meeting starts, reuse it
            if (!heap.isEmpty() && heap.peek() <= interval[0]) {
                heap.poll();
            }
            // Allocate a room (or new room) for this meeting
            heap.offer(interval[1]);
        }

        // The heap only grows when no room is free, so its final size is the peak room count
        return heap.size();
    }
}

Trace on [[0,30],[5,10],[15,20]]:

Meeting	Heap before	Decision	Heap after
`[0,30]`	`[]`	New room	`[30]`
`[5,10]`	`[30]`	30 > 5 → new room	`[10, 30]`
`[15,20]`	`[10, 30]`	10 ≤ 15 → reuse	`[20, 30]`

Result: heap.size() = 2 — two rooms needed at peak.

⚖️ When Merge Intervals Breaks Down (and What to Use Instead)

The sort-then-sweep approach has specific failure modes to watch for:

Edge case — single-point intervals: [3, 3] is a zero-length interval that can still merge with [2, 3] or [3, 5]. The condition a.end >= b.start handles this correctly — confirm your implementation does not use strict greater-than.

Edge case — containment: [1, 10] completely contains [3, 6]. The max(current.end, next.end) step handles this, but if you naively take next.end instead of max, you silently shrink the merged interval.

When to avoid merge intervals:

Situation	Recommendation
Intervals arrive in a stream (online)	Use an interval tree (O(log n) insert/query) instead
Need to query which interval a point falls in	Sorted list + binary search or segment tree
Intervals have associated values to aggregate	Sweep line with event-based processing
Input is already sorted	Skip the sort; sweep directly for O(n)
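
For the point-query row, the "sorted list + binary search" option can be sketched as follows. It assumes the intervals have already been merged, so they are sorted and disjoint. The class and method names are illustrative.

```java
final class PointLookup {

    // merged: sorted, disjoint [start, end] intervals (for example, the output of merge()).
    // Returns the index of the interval containing point, or -1 if none does.
    static int indexOfContaining(int[][] merged, int point) {
        int lo = 0, hi = merged.length - 1;
        while (lo <= hi) {
            int mid = (lo + hi) >>> 1;
            if (point < merged[mid][0]) {
                hi = mid - 1;          // point lies left of this interval
            } else if (point > merged[mid][1]) {
                lo = mid + 1;          // point lies right of this interval
            } else {
                return mid;            // start <= point <= end
            }
        }
        return -1;
    }
}
```

Each lookup is O(log n) once the O(n log n) merge has been paid for, which suits a static set of intervals that is queried many times.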

🧭 Choosing Between Interval Merge, Sweep Line, and Interval Trees

Scenario	Best Approach
Offline batch — merge all overlaps	Sort-then-sweep O(n log n)
Online — insert and query intervals dynamically	Interval tree or TreeMap-based approach
Count overlaps at every point	Sweep line with +1 / -1 events
Find all intervals containing a query point	Segment tree or augmented BST
Already sorted, one-pass merge needed	Linear sweep O(n)
Embedded ranges (gene annotation, painting)	Coordinate compression + BIT

The sort-then-sweep pattern is the right choice whenever you process a static batch of intervals and need the merged result. Dynamic scenarios call for interval trees.
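
The "+1 / -1 events" row can be illustrated with a heap-free take on the peak-overlap count. This is a sketch under two assumptions: every interval has start < end, and an interval that ends at time t does not conflict with one that starts at t (the Meeting Rooms II convention). The names are illustrative.

```java
import java.util.Arrays;

final class MaxOverlapSweep {

    // Returns the largest number of intervals active at the same time.
    // Assumes start < end for every interval; ends are applied before starts on ties.
    static int maxConcurrent(int[][] intervals) {
        int n = intervals.length;
        int[] starts = new int[n];
        int[] ends = new int[n];
        for (int i = 0; i < n; i++) {
            starts[i] = intervals[i][0];
            ends[i] = intervals[i][1];
        }
        Arrays.sort(starts);
        Arrays.sort(ends);

        int active = 0, best = 0, e = 0;
        for (int s = 0; s < n; s++) {
            // Apply every -1 event that happens at or before this start
            while (e < n && ends[e] <= starts[s]) {
                active--;
                e++;
            }
            active++; // +1 event for this start
            best = Math.max(best, active);
        }
        return best;
    }
}
```

On [[0,30],[5,10],[15,20]] this returns 2, matching the heap-based trace. Sorting starts and ends separately works because only the count of active intervals matters, not which interval each endpoint belongs to.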

🛠️ Guava RangeSet: Production-Grade Interval Merging in Java

Google Guava's RangeSet<C> is a set of disjoint ranges that auto-merges overlapping additions. It is the production answer to merge intervals in Java applications.

import com.google.common.collect.Range;
import com.google.common.collect.TreeRangeSet;

public class GuavaIntervalMerge {

    public static void main(String[] args) {
        TreeRangeSet<Integer> rangeSet = TreeRangeSet.create();

        // Add intervals — Guava merges overlaps automatically
        rangeSet.add(Range.closed(1, 3));
        rangeSet.add(Range.closed(2, 6));  // merges with [1,3]
        rangeSet.add(Range.closed(8, 10));
        rangeSet.add(Range.closed(15, 18));

        // Prints [[1..6], [8..10], [15..18]]
        System.out.println(rangeSet.asRanges());

        // Query: does any merged interval contain point 4?
        System.out.println(rangeSet.contains(4)); // true

        // RangeMap for values per interval (e.g., user bookings)
        // com.google.common.collect.TreeRangeMap handles per-interval payloads
    }
}

TreeRangeSet internally keeps its ranges in a TreeMap<Cut<C>, Range<C>> keyed by lower bound, where Cut represents a boundary endpoint. Overlap merging happens at insertion time: locating the neighbors costs O(log n), and each range absorbed by the new one is removed from the map, so an insert is O(log n) amortized. This is the same algorithmic idea (maintain a sorted structure, detect overlaps on insert) but optimized for dynamic workloads.

For a static batch, the sort-then-sweep shown above is usually faster: both approaches are O(n log n), but n TreeMap insertions carry a higher constant factor than one array sort and a linear pass. For a system where intervals arrive continuously, TreeRangeSet is the right choice.
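
To see the insert-time merging idea without the Guava dependency, here is a minimal sketch built on java.util.TreeMap. It is not how TreeRangeSet is implemented. The class name and its methods are hypothetical, and it only handles closed integer intervals where touching intervals merge, as in the rest of this article.

```java
import java.util.Map;
import java.util.TreeMap;

final class OnlineIntervalSet {

    // start -> end for the current disjoint, merged intervals
    private final TreeMap<Integer, Integer> spans = new TreeMap<>();

    void add(int start, int end) {
        // A stored interval that begins at or before start may reach into the new one
        Map.Entry<Integer, Integer> left = spans.floorEntry(start);
        if (left != null && left.getValue() >= start) {
            start = left.getKey();
            end = Math.max(end, left.getValue());
        }
        // Absorb every stored interval that begins inside [start, end]
        Map.Entry<Integer, Integer> next = spans.ceilingEntry(start);
        while (next != null && next.getKey() <= end) {
            end = Math.max(end, next.getValue());
            spans.remove(next.getKey());
            next = spans.ceilingEntry(start);
        }
        spans.put(start, end);
    }

    // True if some merged interval contains the point
    boolean contains(int point) {
        Map.Entry<Integer, Integer> e = spans.floorEntry(point);
        return e != null && e.getValue() >= point;
    }

    // Number of disjoint intervals currently stored
    int size() {
        return spans.size();
    }
}
```

Adding [1,3], [2,6], [8,10] and [15,18] leaves three intervals. A later add(6, 8) bridges the first two into [1,10], which a one-shot batch merge could only produce by re-running over the full input.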

📚 Lessons from Getting Interval Problems Wrong

Lesson 1: Forget the max and create shrinking intervals. The most common bug is current[1] = intervals[i][1] instead of current[1] = Math.max(current[1], intervals[i][1]). This silently shrinks the merged interval when the next interval is fully contained within the current one.

Lesson 2: Mutate the input array. Using current = intervals[0] and then current[1] = ... modifies the original array. In Java, int[] current = intervals[0] copies a reference, not the array. Either copy the first interval (new int[]{intervals[0][0], intervals[0][1]}) or be aware that you are mutating the input, which can fail tests that check the input is unchanged. Arrays.sort also reorders the caller's array in place.

Lesson 3: Forget to add the last interval. The sweep loop commits current only when a gap is found. The last interval (or last merged group) is never followed by a gap, so it never gets committed inside the loop. Always add result.add(current) after the loop.

Lesson 4: Use a strict comparison for the overlap check. intervals[i][0] < current[1] misses touching intervals (the == case). Use <=.

Lesson 5: Overlook tie-breaking in the comparator. For variants where fully contained intervals matter, sorting by start and breaking ties by end descending can change the result. For the basic merge, start-only sorting is sufficient.
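
The first four lessons can be folded into one defensive variant of the merge. This is a sketch rather than a replacement for the solution shown earlier. The class name is illustrative, and the deep copy trades some extra memory for leaving the caller's array untouched.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

final class SafeMergeIntervals {

    static int[][] merge(int[][] intervals) {
        if (intervals.length == 0) return new int[0][];

        // Lesson 2: deep-copy so neither the sort nor the merge mutates the input
        int[][] sorted = new int[intervals.length][];
        for (int i = 0; i < intervals.length; i++) sorted[i] = intervals[i].clone();
        Arrays.sort(sorted, (a, b) -> Integer.compare(a[0], b[0]));

        List<int[]> result = new ArrayList<>();
        int[] current = sorted[0];
        for (int i = 1; i < sorted.length; i++) {
            if (sorted[i][0] <= current[1]) {
                // Lesson 4: <= keeps touching intervals together
                // Lesson 1: max keeps the end from shrinking on containment
                current[1] = Math.max(current[1], sorted[i][1]);
            } else {
                result.add(current);
                current = sorted[i];
            }
        }
        result.add(current); // Lesson 3: commit the final group
        return result.toArray(new int[0][]);
    }
}
```

Passing [[8,10],[1,3],[2,6],[15,18]] returns [[1,6],[8,10],[15,18]] while the original array keeps both its order and its values.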

📌 Summary: The Merge Intervals Playbook

Sort by start time: this turns a pairwise O(n²) overlap check into a single greedy sweep.
Overlap condition: next.start <= current.end — use <= to handle touching intervals.
Merge rule: current.end = max(current.end, next.end) — the max handles containment.
Always add the last current after the loop exits.
Meeting Rooms II flips the question: instead of merging intervals into fewer, count concurrent ones with a min-heap.
Insert Interval uses three phases (before / overlap / after) and skips the sort since input is pre-sorted.
Time: O(n log n) dominated by sort. Space: O(n) for result list.

One-liner to remember: Sort by start, extend by max-end, commit on gap — that is the entire merge intervals pattern.
