A Practical Guide to Time and Space Complexity Analysis
In my previous articles, I mainly focused on explaining the principles of algorithms and the thought process for solving problems, often glossing over the analysis of time and space complexity. This was primarily due to the following two reasons:
For readers who are relatively new to the field, I wanted you to concentrate on understanding the principles of algorithms. Adding too much mathematical content can easily discourage people.
Correctly understanding the underlying principles of common algorithms is a prerequisite for analyzing their complexity. Especially for recursive algorithms, you need to think and analyze from the perspective of trees to correctly assess their complexity.
Given that my previous articles have already covered the core principles of all common algorithms, I have decided to write a dedicated guide on analyzing time and space complexity. Teaching you how to fish is better than giving you a fish, so I will provide you with a set of universal methods to analyze the time and space complexity of any algorithm.
This article will be quite lengthy and will cover the following points:
Using time complexity to infer problem-solving approaches, reducing trial and error time.
Where does the time go? Common coding mistakes that can cause algorithms to timeout.
Basic characteristics of Big O notation.
Time complexity analysis in non-recursive algorithms.
Methods for measuring the efficiency of data structure APIs (amortized analysis).
Analysis methods for the time/space complexity of recursive algorithms, which is the focus. I will illustrate this with examples from dynamic programming and backtracking algorithms.
Before diving into the concepts and specific calculation methods of complexity, I will first share some common techniques and pitfalls often encountered in practical scenarios.
Inferring Solution Strategies from Complexity
Pay Attention to Data Scale
Complexity analysis is not intended to make things difficult for you; rather, it serves as a guide to help you solve problems.
You should pay attention to the data scale provided in the problem before you start coding, as complexity analysis can help you avoid wasting time on incorrect approaches. Sometimes, it even directly suggests which algorithm to use.
Why is this important? Typically, problems indicate the scale of the test cases. We can use this information to deduce the allowable time complexity for the problem, which further helps us determine the appropriate algorithm to apply.
For example, if a problem provides an input array with a length up to 10^6, we can infer that the time complexity must be below O(N^2); it needs to be optimized to O(N log N) or O(N). An O(N^2) algorithm would perform about (10^6)^2 = 10^12 basic operations, which most judging systems cannot finish in the allotted time.
To keep the complexity within O(N log N) or O(N), our options become limited. Suitable approaches might include sorting the array, prefix sums, two-pointer techniques, or one-dimensional dynamic programming. Starting with these strategies is more reliable. Nested for loops, two-dimensional dynamic programming, and backtracking can typically be ruled out immediately.
Another clear example is when the data scale is very small, such as an array length N not exceeding 20. In this case, we can be fairly certain that a brute-force exhaustive search is likely required.
Judging platforms tend to challenge you with larger data scales, so if they provide small data scales, it's likely because the optimal solution involves exponential or factorial complexity. You can confidently use backtracking algorithms without considering other approaches.
Therefore, it's important to consider all the information given in the problem, particularly the data scale, before starting to code. Ignoring this and diving directly into coding can lead to unnecessary detours.
Complexity Issues Due to Coding Errors
This section summarizes common coding errors, especially among beginners, that can lead to unexpected time consumption, slow down your algorithm, or even cause timeouts.
Using Standard Output
While solving algorithm problems, we might use print/console.log functions to print some states for debugging purposes.
However, remember to comment out these output statements before submitting your code, as standard output involves I/O operations that can significantly slow down your algorithm.
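One common way to make this less error-prone is to route all debugging output through a single switch. The following is a minimal sketch of that pattern; the DEBUG flag and the debug helper are illustrative names I made up, not part of any judging platform:

class DebugPrintExample {
    // illustrative flag: flip to true while debugging, back to false before submitting
    static final boolean DEBUG = false;

    // route all debugging output through one helper so the costly I/O
    // can be silenced in a single place
    static void debug(String msg) {
        if (DEBUG) {
            System.out.println(msg);
        }
    }
}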
Incorrectly Passing by Value Instead of Reference
For example, in C++, function parameters are passed by value by default. If you pass a vector as a parameter, the entire data structure is copied, leading to additional time and space consumption. This is particularly problematic for recursive functions, where passing by value almost guarantees a timeout or memory overflow.
The correct approach is to pass by reference, such as vector<int>&, to avoid unnecessary copying. Readers using other languages should verify whether similar issues exist and understand their programming language's parameter passing mechanism.
Unclear Underlying Implementation of Interface Objects
This is a rather tricky problem that comes up in object-oriented languages like Java. It is rare, but it merits attention.
In Java, List is an interface with many implementing classes, such as ArrayList and LinkedList. If you have learned from the basic knowledge sections Implementing Dynamic Arrays Step by Step and Implementing Doubly Linked Lists Step by Step, you know that the methods of ArrayList and LinkedList differ in complexity; for example, the get method is O(1) on an ArrayList but O(N) on a LinkedList.
So, if a function parameter is passed to you as a List type, would you dare to perform list.get(i) for random access? Probably not.
In such cases, it is generally safer to create an ArrayList yourself, copy all elements from the passed List, and then access them by index.
These are the main non-algorithmic issues to watch out for. Pay attention to them, and you should be fine. Let's move on to the formal explanation of algorithm complexity analysis.
Big O Notation
Mathematical Definition of Big O Notation:
O(g(n)) = { f(n) : there exist positive constants c and n_0 such that 0 ≤ f(n) ≤ c*g(n) for all n ≥ n_0 }
The symbol O (the uppercase letter 'O', not the number 0) denotes a set of functions. For example, O(n^2) represents the set of functions generated by g(n) = n^2. When we say an algorithm has a time complexity of O(n^2), we mean that the function describing the algorithm's running time lies within this set. As a concrete check against the definition, f(n) = 2n + 100 is in O(n): choosing c = 3 and n_0 = 100 gives 0 ≤ 2n + 100 ≤ 3n for all n ≥ n_0.
In theory, understanding this abstract mathematical definition can answer all your questions about Big O notation.
However, since many people feel dizzy when they see mathematical definitions, I will list two properties used in complexity analysis. Remembering these two is sufficient.
1. Retain only the term with the highest growth rate; other terms can be omitted.
First, multiplicative constant factors and additive constants can be ignored, as the following examples show:
O(2N + 100) = O(N)
O(2^(N+1)) = O(2 * 2^N) = O(2^N)
O(M + 3N + 99) = O(M + N)
Of course, don't eliminate constants blindly. Some constants cannot be removed, as they might involve exponential rules we learned in high school:
O(2^(2N)) = O(4^N)
Besides constant factors, terms with slower growth rates can be ignored in the presence of terms with faster growth rates:
O(N^3 + 999 * N^2 + 999 * N) = O(N^3)
O((N + 1) * 2^N) = O(N * 2^N + 2^N) = O(N * 2^N)
All the examples listed above are the simplest and most common ones, which can be correctly explained by the definition of Big O notation. If you encounter more complex complexity scenarios, you can also judge whether your complexity expression is correct based on the definition.
2. Big O Notation Represents the "Upper Bound" of Complexity.
In other words, as long as you provide an upper bound, using Big O notation to represent it is correct.
For example, consider the following code:
for (int i = 0; i < N; i++) {
    System.out.println("hello world");
}
If we consider this as an algorithm, then clearly its time complexity is O(N). However, if you insist on saying its time complexity is O(N^2), theoretically that's acceptable, because the O notation represents an upper bound, and the running time of this algorithm indeed does not exceed the bound of N^2. Although this bound is not "tight," it still fits the definition, so theoretically it's not incorrect.
The above example is too simple, and unnecessarily expanding its time complexity upper bound seems meaningless. However, some algorithms have complexities that depend on the input data, making it difficult to provide a very precise time complexity in advance. In such cases, using Big O notation to expand the upper bound of time complexity becomes meaningful.
For instance, in the previous article The Core Framework of Dynamic Programming, we discussed the brute-force recursive solution to the coin change problem. The core code framework is as follows:
// definition: to make up the amount n, at least dp(coins, n) coins are needed.
// this simplified skeleton keeps only the recursive branching, which is
// what determines the shape of the recursion tree, so it returns nothing
void dp(int[] coins, int amount) {
    // base case
    if (amount <= 0) return;
    // state transition: branch once per coin denomination
    for (int coin : coins) {
        dp(coins, amount - coin);
    }
}
When amount = 11 and coins = [1,2,5], the recursion unfolds into a tree: the root is amount = 11, and each node has one child per coin denomination, obtained by subtracting that coin's value from the remaining amount.
We will discuss the method for calculating the time complexity of recursive algorithms later. For now, let's determine the number of nodes in this recursive tree.
Suppose the value of amount is N and the number of elements in the coins list is K. In this case, the recursive tree is a K-ary tree. However, since the growth of this tree is directly related to the denominations in the coins list, its structure is irregular, making it difficult to accurately calculate the total number of nodes in the tree.
For this situation, a simpler approach is to approximate using the worst-case scenario:
How tall is the tree? We do not know, so we take the worst case and assume all coins have a denomination of 1; in that scenario, the tree's height is N.
What is the structure of the tree? We do not know either, so we again take the worst case and assume it is a complete K-ary tree.
So, how many nodes does this tree have in total? In the worst case, level i of a complete K-ary tree contains K^i nodes, so a complete K-ary tree of height N has 1 + K + K^2 + ... + K^(N-1) = (K^N - 1)/(K - 1) nodes by the geometric series sum formula. Using Big O notation, this is O(K^N).
Of course, we know the actual number of nodes in this tree is smaller, but O(K^N) provides a rough upper bound, which is enough to tell us that this is an exponential-time algorithm and that we need to find ways to optimize its efficiency.
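If you want to sanity-check this exponential growth empirically, one option is to add a call counter to the skeleton above and watch how the count grows with the input. The calls variable, the class name, and the main driver below are my own illustrative additions, not part of the original solution:

class CountCalls {
    static long calls = 0; // number of nodes visited in the recursion tree

    static void dp(int[] coins, int amount) {
        calls++; // count every node, including the base-case leaves
        if (amount <= 0) return;
        for (int coin : coins) {
            dp(coins, amount - coin);
        }
    }

    public static void main(String[] args) {
        int[] coins = {1, 2, 5};
        // the call count should grow roughly exponentially with amount
        for (int amount = 5; amount <= 25; amount += 5) {
            calls = 0;
            dp(coins, amount);
            System.out.println("amount = " + amount + ": " + calls + " calls");
        }
    }
}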
Therefore, sometimes when your estimated time complexity differs from someone else's, it does not necessarily mean one of you is wrong. Both could be correct, just differing in the precision of the estimation.
Theoretically, we aim for a more accurate and "tight" upper bound, but obtaining an exact upper bound requires a certain level of mathematical skill. For the purpose of interview preparation, it is acceptable to settle for the correct order of magnitude (linear/exponential/logarithmic/quadratic, etc.).
In the field of algorithms, besides using Big O notation for asymptotic upper bounds, there are also methods for representing asymptotic lower bounds and tight bounds, among others. Interested readers can explore these further. However, from a practical standpoint, the explanation provided here regarding Big O notation is sufficient.
Non-Recursive Algorithm Analysis
The space complexity of non-recursive algorithms is generally easy to determine. You simply check if it involves allocating any array or similar storage space. Therefore, I will mainly discuss the analysis of time complexity.
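As a simple illustration of that check, compare two ways of reversing an array; both functions are toy examples of mine:

class SpaceComplexityExample {
    // O(N) extra space: allocates a second array as large as the input
    static int[] reversedCopy(int[] nums) {
        int[] res = new int[nums.length];
        for (int i = 0; i < nums.length; i++) {
            res[i] = nums[nums.length - 1 - i];
        }
        return res;
    }

    // O(1) extra space: only a few scalar variables, nothing that
    // grows with the size of the input
    static void reverseInPlace(int[] nums) {
        int left = 0, right = nums.length - 1;
        while (left < right) {
            int tmp = nums[left];
            nums[left] = nums[right];
            nums[right] = tmp;
            left++;
            right--;
        }
    }
}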