Space complexity measures the memory usage of algorithms as the input size grows. This makes understanding space complexity important for resource management in constrained environments.
In this article, I will explain in detail what space complexity is, how to calculate it, and how to minimize it. Additionally, I will cover real-world applications and offer theoretical foundations.
Understanding Space Complexity
Before diving into calculations and optimizations, let's clarify what space complexity actually means and why you should care about it.
What is space complexity?
Space complexity is the total memory an algorithm requires relative to the input size. It matters when analyzing the worst-case memory footprint and a system's scalability. Total space includes the input, auxiliary memory, and overhead, while auxiliary space is the extra memory excluding the input and output. Take sorting an array as an example: the total space includes the array itself, while the auxiliary space might be the temporary array merge sort allocates.
Why space complexity matters
In modern computing, space efficiency is often critical. By optimizing space complexity, you ensure that your algorithms run on devices with limited RAM, like embedded systems in IoT or smartphones, or even big systems with resource-demanding procedures where every bit of RAM matters.
Think of mobile apps as an example: optimizing space complexity helps avoid crashes during image processing. Or think of cloud computing, where smaller memory footprints cut storage and instance costs, and of streaming data, where real-time analytics must run without loading the full dataset.
Asymptotic Analysis and Big O Notation for Space
In this section, I will explain how asymptotic notations like Big O describe space growth, and hand you tools for bounding and comparing memory usage.
Introduction to Big O notation
Mathematically, a function f(n) is dominated by a function g(n) if there exist constants c > 0 and k such that for all n > k, 0 ≤ |f(n)| ≤ c·|g(n)|.
In this case, we write f(n)=O(g(n)), and we say f(n) is big O of g(n).
In simpler terms, this means that f(n) grows no faster than a constant multiple of g(n) beyond a certain input size.
Big O notation represents the upper limit of space as a function of input size n, so that we can focus on dominant terms for large n. This notation helps classify algorithms, for example, O(n) means space doubles with input doubling.
A common point of confusion for beginners: Big O for space ignores constant factors, so 2n is still O(n), yet in practice those constants can matter for small n and on memory-constrained hardware.
Related asymptotic notations
There are other notations that model complexity differently. Big Omega provides a lower bound, which makes it useful for proving minimal space needs; for instance, any algorithm that must hold its entire input needs at least Ω(n) space. Big Theta gives a tight bound when the upper and lower bounds match, which makes it ideal for precise analysis; an algorithm whose space is both O(n) and Ω(n) is Θ(n).
Other notational conventions
There are a few other notational conventions in the space complexity literature, such as the soft-O notation Õ, which ignores polylogarithmic factors. It is common in advanced algorithms like graph processing, where log factors are negligible, and it appears in theoretical papers for approximate complexities, like sublinear-space streaming algorithms.
Components of Space Complexity
Space complexity has several contributors, each of which is also an optimization point in algorithm design. Let's look at each:
Input space requirements
Input data occupies space proportional to its size, for instance O(n) for an array or a large dataset such as genomic sequences.
It is common in real-world coding interviews to exclude the input space when calculating auxiliary space and to focus only on the extra memory the algorithm uses. In competitive programming, however, the total space usually includes the input.
Auxiliary space and temporary memory
Auxiliary space is the extra memory an algorithm needs to perform its operations. It directly affects how the algorithm is designed.
For example, merge sort needs an O(n) temporary array, a hash table for duplicate detection needs O(n), and a recursion stack for parentheses matching needs O(n), among many others.
Classic auxiliary-space examples include two-pointer methods (O(1)), sliding-window algorithms using a deque (O(k)), and BFS, which requires O(V) space for its visited set and queue.
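To make the two-pointer O(1) case concrete, here is a minimal sketch; the function name and the "find a pair that sums to a target in a sorted array" task are my own illustrative choices:

def has_pair_with_sum(sorted_nums, target):
    # Two pointers walk inward; no extra structures are allocated -> O(1) auxiliary space
    left, right = 0, len(sorted_nums) - 1
    while left < right:
        current = sorted_nums[left] + sorted_nums[right]
        if current == target:
            return True
        elif current < target:
            left += 1
        else:
            right -= 1
    return False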
The following code shows an example of an operation for merge sort on the temporary array during the merge step:
def merge(left_half, right_half):
    merged = [0] * (len(left_half) + len(right_half))  # O(n) auxiliary space
    left_idx = right_idx = merged_idx = 0
    # merging logic...
The call stack and recursion space
Each recursive call adds a stack frame (return address + parameters + locals).
Linear recursion, like factorial or DFS on a linked list, uses an O(n) stack. Binary recursion, like naive Fibonacci or tree recursion, also reaches O(n) stack depth in the worst case.
Tail-call optimization, which is guaranteed in Scheme, sometimes applied by compilers for languages like Rust, and only approximated in Python via trampoline-style decorators, reduces this to O(1). The real danger is Python's default recursion limit of roughly 1000 frames, which raises a RecursionError on deep recursion such as heavily unbalanced trees.
Here’s an example of naive recursive Fibonacci that shows stack growth:
def fibonacci(n):
    if n <= 1:
        return n
    return fibonacci(n-1) + fibonacci(n-2)  # O(n) stack depth
Check out our blog on Analyzing Complexity of Code through Python to learn more about Python’s complexity analysis.
Data structure overhead
Data structure overhead is simple to reason about because every structure adds fixed or per-element costs: arrays have minimal overhead, linked lists roughly double the space through per-node pointers, and so on.
The programming language also has a massive influence. Python lists carry dynamic-resizing overhead, unlike C arrays, and a hash map with a 75% load factor wastes space on empty buckets.
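As a quick, hedged illustration of Python's per-object overhead (exact numbers vary by version and platform), sys.getsizeof shows how much larger a Python list is than its raw payload:

import sys

numbers = list(range(1000))
print(sys.getsizeof(numbers))                   # size of the list object itself (header + pointer array)
print(sum(sys.getsizeof(x) for x in numbers))   # each int is a full object, adding further overhead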
Types of Space Complexity
Constant space complexity: O(1)
Constant space complexity is the optimal case when it can be achieved. Many well-known algorithms use truly constant space, such as a simple variable swap like the example below, an in-place swap using Python tuple unpacking:
def swap_numbers(first, second):
    first, second = second, first  # O(1) — only two variables
    return first, second
Other well-known constant-space algorithms include the Dutch national flag partition used in quicksort (sketched below) and Morris traversal for trees. Keep in mind that O(1) only means the memory does not grow with the input; a "constant" that amounts to a large fixed buffer can still exceed a constrained system's resources.
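Here is a minimal sketch of the Dutch national flag partition, assuming a pivot value is given; the function name is my own:

def dutch_flag_partition(nums, pivot):
    # Partition into < pivot, == pivot, > pivot using O(1) auxiliary space
    low, mid, high = 0, 0, len(nums) - 1
    while mid <= high:
        if nums[mid] < pivot:
            nums[low], nums[mid] = nums[mid], nums[low]
            low += 1
            mid += 1
        elif nums[mid] > pivot:
            nums[mid], nums[high] = nums[high], nums[mid]
            high -= 1
        else:
            mid += 1
    return nums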
Linear space complexity: O(n)
Linear space complexity simply means memory proportional to the input. Examples include an extra array for copying or a hash set for tracking unique values. The most common use cases are data processing patterns like filtering lists.
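A minimal sketch of the filtering case, assuming we keep the even numbers from a list (names are illustrative): the filtered copy grows with the input, giving O(n) auxiliary space.

def keep_even(numbers):
    evens = []                   # new list grows with the input -> O(n) auxiliary space
    for value in numbers:
        if value % 2 == 0:
            evens.append(value)
    return evens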
Logarithmic and sublinear space complexity
A logarithmic space complexity signifies that the amount of memory an algorithm uses grows very slowly as the input size increases. Specifically, it is proportional to the logarithm of the input size.
For example, recursive binary search has an O(log n) stack depth, as do many divide-and-conquer algorithms, and streaming algorithms can approximate results in O(log n) or O(sqrt(n)) space.
Logarithmic and sublinear space complexities are rare, but they are very powerful for massive datasets. For contrast, the iterative version of binary search below uses only constant auxiliary space:
def binary_search(numbers, target):
    left = 0
    right = len(numbers) - 1
    while left <= right:             # O(1) auxiliary space total
        mid = (left + right) // 2
        if numbers[mid] == target:
            return mid
        elif numbers[mid] < target:
            left = mid + 1
        else:
            right = mid - 1
    return -1
Quadratic space complexity: O(n²)
A quadratic space complexity occurs when the memory requirement scales with the square of the input. A common example would be the 2D dynamic programming table, which has a complexity of O(n²) or an adjacency matrix for graphs with complexity O(V²). However, it’s important to note that this complexity becomes impractical beyond thousands of elements of input.
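As a hedged illustration of the 2D dynamic programming case mentioned above, here is a minimal longest common subsequence sketch (the problem choice and names are mine); the table has (n+1)×(m+1) cells, so memory scales quadratically when the inputs are similar in length:

def lcs_length(a, b):
    n, m = len(a), len(b)
    table = [[0] * (m + 1) for _ in range(n + 1)]   # O(n*m) space
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if a[i - 1] == b[j - 1]:
                table[i][j] = table[i - 1][j - 1] + 1
            else:
                table[i][j] = max(table[i - 1][j], table[i][j - 1])
    return table[n][m]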
Exponential and factorial space complexity
Exponential complexity, as the name suggests, makes the required memory grow exponentially with the input size, O(2^n); this happens when, for example, you store all subsets or all decision-tree paths.
Likewise, with factorial complexity the required memory is proportional to the factorial of the input size, O(n!), which happens when you enumerate all permutations of the input. These complexities are extremely expensive and are almost certainly infeasible for n > 20-30 on most systems. If you are working at this scale, you likely need pruning, approximation, or a different approach entirely.
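A minimal sketch of the subset case mentioned above (the function name is my own): the list of subsets doubles with every element, so it holds 2^n lists by the end.

def all_subsets(items):
    subsets = [[]]                                           # ends up holding 2^n subsets -> O(2^n) space
    for item in items:
        subsets += [subset + [item] for subset in subsets]   # doubles the stored subsets
    return subsets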
How to Calculate Space Complexity
Systematic analysis identifies all memory allocations and their growth patterns:
Line-by-line analysis approach
The first step is to examine your code line by line and identify every variable, array, and object allocation. Next, determine whether each allocation's size depends on the input parameters. Then sum the memory for independent allocations and take the maximum for mutually exclusive branches. Finally, track both stack allocations for local variables and heap allocations for dynamic memory. Here is an example:
def linear_search(arr, target):
    n = len(arr)             # O(1) - single integer
    for i in range(n):       # O(1) - loop variable
        if arr[i] == target:
            return i
    return -1
# Total: O(1) auxiliary space
Recursive complexity analysis
The space complexity of recursion equals the maximum recursion depth multiplied by the space per call frame, plus any auxiliary data structures allocated during recursion. Identify the base case and analyze how the recursion depth relates to the input size. For example, naive Fibonacci creates O(n) depth with O(1) per frame, which totals O(n) stack space. Here's an example of the factorial space calculation:
def factorial(n):
    if n <= 1:  # Base case
        return 1
    return n * factorial(n - 1)
# Space: O(n)
Dynamic programming complexity analysis
Standard DP tables have dimensions matching the subproblem parameters; therefore, a one-dimensional problem requires O(n) space and a two-dimensional problem needs O(n×m). Optimization techniques like rolling arrays drop one dimension, transforming, for instance, O(n²) into O(n). State compression exploits dependencies: if row i only needs row i-1, store just two rows. Here's an example of space-optimized Fibonacci using only two variables:
def fibonacci_optimized(n):
    if n == 0:
        return 0
    previous, current = 0, 1
    for _ in range(2, n + 1):   # O(1) space: only two variables are kept
        previous, current = current, previous + current
    return current
If you are curious about dimensionality and its reduction, I encourage you to see our tutorials on Mastering Slowly Changing Dimensions (SCD) and Understanding Cardinality: Challenges and Solutions for Data-Heavy Workflows.
Steps to calculate space complexity in practice
To calculate the space complexity in practice, follow these steps (a short worked example follows the list):
- Clarify whether the input space is included based on context
- Count fixed-size variables and constants. They contribute O(1)
- Measure auxiliary structures like arrays and hash maps relative to input size
- Calculate maximum recursion depth for stack space contribution
- Combine all components and express using the tightest asymptotic bound.
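Applying the steps to a small hypothetical function (the function and its task are my own illustrative choices):

def count_zero_sum_pairs(nums):
    seen = set()                  # auxiliary structure that can grow to n entries -> O(n)
    pairs = 0                     # fixed-size counter -> O(1)
    for value in nums:            # loop variable -> O(1)
        if -value in seen:
            pairs += 1
        seen.add(value)
    return pairs
# No recursion, so there is no stack contribution.
# Combining components: O(1) + O(n) = O(n) auxiliary space (input excluded).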
The Space Complexity of Common Algorithms
Sorting algorithm space requirements
The merge sort algorithm requires O(n) auxiliary space for temporary arrays during merging. Quicksort, on the other hand, uses only O(log n) average stack space, but O(n) in the worst case without optimization. Heap sort achieves O(1) auxiliary space by sorting in place, as the next code shows:
def heapify(array, heap_size, root_index):
    largest = root_index
    left_child = 2 * root_index + 1
    right_child = 2 * root_index + 2
    # comparison & swap logic...  # O(1) auxiliary space
Finally, counting sort needs O(k) space, where k is the value range, but it is impractical when k>>n.
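A minimal counting sort sketch for non-negative integers, assuming all values fall in the range 0..k (the function name is mine); the counts array is what contributes the O(k) space:

def counting_sort(values, k):
    counts = [0] * (k + 1)                  # O(k) auxiliary space for the value range
    for value in values:
        counts[value] += 1
    result = []
    for value, count in enumerate(counts):
        result.extend([value] * count)
    return result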
Graph algorithm space complexity
Graph traversal has many algorithms. For example, breadth-first search (BFS) maintains a queue requiring O(V) space for vertices, while depth-first search (DFS) uses O(V) stack space in the worst case of maximum depth. Dijkstra's algorithm needs O(V) space for the priority queue and distance tracking. Graph representation matters as well, because an adjacency matrix uses O(V²) while an adjacency list uses O(V+E). A minimal BFS sketch follows below.
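This sketch assumes the graph is a dict mapping each node to a list of neighbors; the visited set and queue can both hold every vertex, which is where the O(V) auxiliary space comes from.

from collections import deque

def bfs(graph, start):
    visited = {start}             # up to V vertices -> O(V)
    queue = deque([start])        # up to V vertices -> O(V)
    order = []
    while queue:
        node = queue.popleft()
        order.append(node)
        for neighbor in graph[node]:
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(neighbor)
    return order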
Data structure space characteristics
There are different data structures, and each requires different memory allocations. For example:
- Arrays and ArrayLists store n elements in O(n) contiguous memory
- Linked lists require O(n) space plus pointer overhead for each node
- Binary trees use O(n) space with 2-3 pointers per node, which adds significant overhead
- Hash tables consume O(n) space multiplied by the load factor, which typically ranges from 1.5 to 3
- Adjacency lists provide O(V+E) space efficiency for sparse graphs
Example space complexity calculations
Here are a few worked examples of space complexity calculations:
# O(1): In-place reversal of a list of characters (Python strings are immutable)
left = 0
right = len(text) - 1
while left < right:
    text[left], text[right] = text[right], text[left]  # O(1) swap, no extra structures
    left += 1
    right -= 1
# O(n): Detect a duplicate using a hash set
def has_duplicate(array):
    seen_numbers = set()            # can grow to n elements -> O(n)
    for num in array:
        if num in seen_numbers:
            return True
        seen_numbers.add(num)
    return False
# O(log n): Recursive height of a balanced binary tree
def tree_height(root):
    if not root:
        return 0
    return 1 + max(tree_height(root.left), tree_height(root.right))  # O(log n) stack if balanced, O(n) if skewed
# O(n²): Floyd-Warshall distance matrix
distance = [[float('inf')] * vertices for _ in range(vertices)]  # O(n²)
for i in range(vertices):
    distance[i][i] = 0
Space Complexity vs. Time Complexity
Time and space represent distinct resource dimensions that must be balanced based on the system constraints:
Key differences and similarities
At first glance, space and time complexity may seem similar, but they measure different things: time complexity counts computational operations, while space complexity counts memory consumption. Although both use the same asymptotic notation and mathematical analysis framework, an algorithm can be optimal in one dimension while inefficient in the other. Optimization frequently involves trading increased space for reduced time or vice versa.
Understanding the fundamental trade-off
When you are trading space for time, you are storing precomputed results that eliminate redundant calculations. On the other hand, trading time for space requires recomputing values on demand instead of storing them. Lookup tables exchange one-time space investment for repeated O(1) access. There can be specific constraints like memory limits and performance requirements that dictate which resource to prioritize over the other.
Memoization and dynamic programming optimization
Memoization works by caching computed values instead of recomputing them; it saves time at the cost of extra space. For example, adding an O(n) cache to recursive Fibonacci reduces the time from O(2^n) to O(n), as in the memoized Fibonacci below, which avoids recomputation:
memory = {}
def fibonacci_memory(n):
    if n in memory:
        return memory[n]          # O(n) space, O(n) time
    if n <= 1:
        return n
    memory[n] = fibonacci_memory(n-1) + fibonacci_memory(n-2)
    return memory[n]
Bottom-up dynamic programming, on the other hand, computes iteratively to eliminate the recursion stack overhead. To optimize space, retain only the previous states needed for the current computation; for example, edit distance can be reduced from an O(n×m) table to O(min(n, m)) by keeping one row, as sketched below.
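A minimal sketch of the one-row edit distance optimization, assuming we only need the distance value and not the alignment (names are illustrative):

def edit_distance(a, b):
    if len(a) < len(b):
        a, b = b, a                      # keep the shorter string for the row -> O(min(n, m)) space
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]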
In-place algorithm techniques
In-place algorithms modify the input directly rather than allocating new memory structures, which keeps auxiliary space constant, as in the example below of in-place array rotation (the reversal algorithm):
def reverse(nums, start, end):
    while start < end:
        nums[start], nums[end] = nums[end], nums[start]
        start += 1
        end -= 1

def rotate_array(nums, k):
    n = len(nums)
    k = k % n
    reverse(nums, 0, k - 1)   # all operations use O(1) auxiliary space
    reverse(nums, k, n - 1)
    reverse(nums, 0, n - 1)
Other in-place techniques dramatically reduce memory usage. For example, the two-pointer technique manipulates arrays with O(1) auxiliary space, and cyclic sort handles permutation problems in O(1) space by swapping elements to their correct positions (sketched below). However, it is worth noting that in-place modification may destroy the input data, affecting reusability and code clarity.
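A minimal cyclic sort sketch, assuming the input is a permutation of 1..n (the function name is mine); every swap happens inside the input array, so auxiliary space stays O(1).

def cyclic_sort(nums):
    i = 0
    while i < len(nums):
        correct = nums[i] - 1               # value v belongs at index v - 1
        if nums[i] != nums[correct]:
            nums[i], nums[correct] = nums[correct], nums[i]
        else:
            i += 1
    return nums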
Data structure selection for space efficiency
Different data structures offer different trade-offs. Prefer arrays over linked lists when the size is known, since that eliminates per-element pointer overhead; use bit manipulation to store boolean flags at roughly 8× compression versus byte arrays (a small sketch follows below); tries share common prefixes, which provides space efficiency for large string collections; and for sparse graphs, choose adjacency lists over matrices to get O(V+E) instead of O(V²) space.
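A minimal bit-flag sketch using a bytearray, assuming we need n boolean flags (helper names are my own): each flag takes one bit instead of one byte.

def make_flags(n):
    return bytearray((n + 7) // 8)                 # 1 bit per flag instead of 1 byte

def set_flag(flags, i):
    flags[i // 8] |= 1 << (i % 8)

def flag_is_set(flags, i):
    return bool(flags[i // 8] & (1 << (i % 8)))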
Practical Considerations and Implementation Realities
Real-world space complexity depends on programming languages, hardware architecture, and runtime environments.
Language and runtime environment factors
High-level languages like Python and Java add per-object metadata and overhead, unlike C and C++, which provide manual memory management for precise control at the cost of complexity. That’s probably why machine learning optimization frameworks such as PyTorch use C++ for computation.
Stack size limits typically range from 1 to 8 MB, which constrains the maximum recursion depth. Note that 64-bit systems use 8-byte pointers while 32-bit systems use 4-byte pointers, which doubles the size of pointer-heavy structures. To learn more about how high-dimensional data affects algorithm performance, check out our tutorial: The Curse of Dimensionality in Machine Learning.
Memory hierarchy and cache effects
Algorithm space usage and performance depend heavily on the memory hierarchy. RAM access takes around 100 nanoseconds, while CPU cache access takes 1 to 10 nanoseconds, which is 10 to 100 times faster. Additionally, sequential memory access exploits cache lines, where typically 64 bytes are loaded together.
Algorithms with good spatial locality achieve better performance through cache efficiency. When the working set exceeds cache capacity, thrashing occurs, which severely degrades performance.
Garbage collection and memory management
One helpful tool for memory management is garbage collection, which automatically cleans up data that is no longer reachable. At the same time, garbage collectors add memory overhead for object tracking and generation management.
Peak memory usage exceeds the live data size due to allocations made before collection cycles. Additionally, Generational GC optimizes for short-lived objects by keeping young generations in smaller spaces. However, if you want more deterministic space control, you can use manual memory management, but you should be aware of its risks, like memory leaks and fragmentation.
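To observe peak memory in practice, Python's tracemalloc module reports current and peak traced allocations; here is a minimal sketch with an illustrative workload:

import tracemalloc

tracemalloc.start()
data = [i * i for i in range(100_000)]            # illustrative allocation
current, peak = tracemalloc.get_traced_memory()   # bytes currently allocated and the peak so far
print(f"current: {current / 1024:.1f} KiB, peak: {peak / 1024:.1f} KiB")
tracemalloc.stop()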
Conclusion and Recommendations
Thanks for reading my article on space complexity. I'll leave you with a few final tips:
- You should always clarify whether space analysis includes input size to prevent miscommunication in interviews and code reviews.
- Prioritize correctness first, then optimize space only when profiling reveals actual bottlenecks.
- Try to consider hardware and memory constraints early in system design to avoid costly refactoring.
- Always use language-appropriate profiling tools to validate theoretical analysis against actual behavior, and balance space optimization against code readability; overly clever optimizations can harm maintainability.
I work on accelerated AI systems enabling edge intelligence with federated ML pipelines on decentralized data and distributed workloads. My work focuses on Large Models, Speech Processing, Computer Vision, Reinforcement Learning, and advanced ML Topologies.
FAQs
If sparse data only uses 10% of the allocated O(n) space, is the complexity still O(n)?
Yes. Space complexity assumes the worst case. Sparse guarantees are a different problem instance, not reflected in Big O analysis.
Can space complexity be better than time complexity?
Yes. Linear search is O(1) space but O(n) time. You iterate without storing anything.
Why does memoization worsen space complexity despite speeding up time?
It trades space for time. Fibonacci improves from O(2^n) time to O(n), but adds O(n) space for caching.
Does tail recursion elimination reduce space complexity?
Yes, genuinely. It converts recursion to loops, eliminating stack frames. Factorial goes O(n) → O(1) space when optimized.
If hash tables use 3× memory due to load factor, is the complexity O(3n) or O(n)?
O(n). Big O ignores constant factors. The 3x multiplier is overhead independent of input size.



