Counting Sort

Operation	Method	Time	Adaptive?	In-Place?	Stable?	Online?
Counting sort	Non-Comparison	$Θ (n) \leq Θ (n + k) \leq Θ (n^{2})$	N	N	Y	N

Idea

Counting sort is a sorting algorithm that works by counting the number of occurrences of each unique element in the input array and then using these counts to determine the position of each element in the sorted output array.

Pre-Conditions

The numbers to sort are integers in an interval of $0 \dots k$
1. $k \in Z^{+}$

The Algorithm

Requirements:

Input:
- $A [1 \dots n]$ where $A [j] \in [0 \dots k], \forall j \in {1 \dots n}$
- $n$ is the size of the array
- $k$ is the upper bound integer of the range $0 \dots k$ to sort
Output: $B [1 \dots n]$ is a sorted vector in a non-decreasing manner
Auxiliary Data Structure $C [0 \dots k]$ is the vector of the occurrences of size $k + 1$ , with possible indices $i \in 0 \dots k$

Steps:

Allocation and Initialization of a vector of occurences $C$ of size $k + 1$ with indices in $0 \dots k$
Populate $C$ starting from $0$ :count the occurrences of elements with same keys in $A$ .
1. Es: $C [A [i]]$ , with $i = 3$ is the count of times $A$ contains the number three.
Cumulative prefixed sum of values in C starting from $i = 1$ : count how many elements precede the current one
1. $i$ represents the index of the element $C [i]$ in the new sorted array
2. For each element in $C$ , increase its value by the value in the previous index. This helps us find the right position in $B$
Populate $B$ from $n$ to $1$ , to achieve stability.
1. Because an element might be repeated, we decrement the value of $C [i]$ every time it is read.

python

void countingsort(array A, array B, int n, int k) {
    Allocation C[0...k]; #aux data structure
    #O(k)
    for (i = 0 to k):  #Initialization of C to 0
        C[i] = 0;
    #Theta(n)
    for (i = 1 to n): # count occurrences of elements in A
        C[A[i]]++;
    #Theta(k)
    for (i = 1 to k):
        C[i] = C[i] + C[i-1] #prefixed sums
    #Theta(n)
    for (i = n down to 1):
        B[--C[A[i]]] = A[i]; #insert and avoid duplicates or insertion in same position

void countingsort(array A, array B, int n, int k) {
    Allocation C[0...k]; #aux data structure
    #O(k)
    for (i = 0 to k):  #Initialization of C to 0
        C[i] = 0;
    #Theta(n)
    for (i = 1 to n): # count occurrences of elements in A
        C[A[i]]++;
    #Theta(k)
    for (i = 1 to k):
        C[i] = C[i] + C[i-1] #prefixed sums
    #Theta(n)
    for (i = n down to 1):
        B[--C[A[i]]] = A[i]; #insert and avoid duplicates or insertion in same position

Example:

countingsortex

Final Time Complexity: $T (n) = Θ (n + k)$

Usually best when $k = O (n) \Rightarrow T (n) = Θ (n + k)$

Conclusions

Pro:

If $k = O (n) \Rightarrow T (n) = Θ (n + k)$
It is a stable sorting algorithm, if we start from the end.

Cons:

Not in-place (we use an auxiliary vector C[0..k])
Only sorts numbers between 0 and k, restricted integer spectrum.
If the interval of numbers go from 0 to $n^{2}$ then it is not efficient: $T (n) = Θ (n + n^{2}) = Θ (n^{2})$

Counting Sort ​

Idea ​

Pre-Conditions ​

The Algorithm ​

Conclusions ​

Counting Sort

Idea

Pre-Conditions

The Algorithm

Conclusions