Incomputer science,selection sortis anin-placecomparisonsorting algorithm.It has anO(n2)time complexity,which makes it inefficient on large lists, and generally performs worse than the similarinsertion sort.Selection sort is noted for its simplicity and has performance advantages over more complicated algorithms in certain situations, particularly whereauxiliary memoryis limited.

Selection sort
ClassSorting algorithm
Data structureArray
Worst-caseperformancecomparisons,swaps
Best-caseperformancecomparisons,swap
Averageperformancecomparisons,swaps
Worst-casespace complexityauxiliary
OptimalNo

The algorithm divides the input list into two parts: a sorted sublist of items which is built up from left to right at the front (left) of the list and a sublist of the remaining unsorted items that occupy the rest of the list. Initially, the sorted sublist is empty and the unsorted sublist is the entire input list. The algorithm proceeds by finding the smallest (or largest, depending on sorting order) element in the unsorted sublist, exchanging (swapping) it with the leftmost unsorted element (putting it in sorted order), and moving the sublist boundaries one element to the right.

The time efficiency of selection sort is quadratic, so there are a number of sorting techniques which have better time complexity than selection sort.

Example

edit

Here is an example of this sort algorithm sorting five elements:

Sorted sublist Unsorted sublist Least element in unsorted list
() (11, 25, 12, 22, 64) 11
(11) (25, 12, 22, 64) 12
(11, 12) (25, 22, 64) 22
(11, 12, 22) (25, 64) 25
(11, 12, 22, 25) (64) 64
(11, 12, 22, 25, 64) ()
Selection sort animation. Red is current min. Yellow is sorted list. Blue is current item.

(Nothing appears changed on these last two lines because the last two numbers were already in order.)

Selection sort can also be used on list structures that make add and remove efficient, such as alinked list.In this case it is more common toremovethe minimum element from the remainder of the list, and theninsertit at the end of the values sorted so far. For example:

arr[] = 64 25 12 22 11

// Find the minimum element in arr[0...4]
// and place it at beginning
11 25 12 22 64

// Find the minimum element in arr[1...4]
// and place it at beginning of arr[1...4]
11 12 25 22 64

// Find the minimum element in arr[2...4]
// and place it at beginning of arr[2...4]
11 12 22 25 64

// Find the minimum element in arr[3...4]
// and place it at beginning of arr[3...4]
11 12 22 25 64

Implementations

edit

Below is an implementation inC.

/* a[0] to a[aLength-1] is the array to sort */
inti,j;
intaLength;// initialise to a's length

/* advance the position through the entire array */
/* (could do i < aLength-1 because single element is also min element) */
for(i=0;i<aLength-1;i++)
{
/* find the min element in the unsorted a[i.. aLength-1] */

/* assume the min is the first element */
intjMin=i;
/* test against elements after i to find the smallest */
for(j=i+1;j<aLength;j++)
{
/* if this element is less, then it is the new minimum */
if(a[j]<a[jMin])
{
/* found new minimum; remember its index */
jMin=j;
}
}

if(jMin!=i)
{
swap(&a[i],&a[jMin]);
}
}

Complexity

edit

Selection sort is not difficult to analyze compared to other sorting algorithms, since none of the loops depend on the data in the array. Selecting the minimum requires scanningelements (takingcomparisons) and then swapping it into the first position. Finding the next lowest element requires scanning the remainingelements (takingcomparisons) and so on. Therefore, the total number of comparisons is

Byarithmetic progression,

which is of complexityin terms of number of comparisons.

Comparison to other sorting algorithms

edit

Among quadratic sorting algorithms (sorting algorithms with a simple average-case ofΘ(n2)), selection sort almost always outperformsbubble sortandgnome sort.Insertion sortis very similar in that after thekth iteration, the firstelements in the array are in sorted order. Insertion sort's advantage is that it only scans as many elements as it needs in order to place thest element, while selection sort must scan all remaining elements to find thest element.

Simple calculation shows that insertion sort will therefore usually perform about half as many comparisons as selection sort, although it can perform just as many or far fewer depending on the order the array was in prior to sorting. It can be seen as an advantage for somereal-timeapplications that selection sort will perform identically regardless of the order of the array, while insertion sort's running time can vary considerably. However, this is more often an advantage for insertion sort in that it runs much more efficiently if the array is already sorted or "close to sorted."

While selection sort is preferable to insertion sort in terms of number of writes (swaps versus up toswaps, with each swap being two writes), this is roughly twice the theoretical minimum achieved bycycle sort,which performs at mostnwrites. This can be important if writes are significantly more expensive than reads, such as withEEPROMorFlash memory,where every write lessens the lifespan of the memory.

Selection sort can be implemented without unpredictable branches for the benefit of CPUbranch predictors,by finding the location of the minimum with branch-free code and then performing the swap unconditionally.

Finally, selection sort is greatly outperformed on larger arrays bydivide-and-conquer algorithmssuch asmergesort.However, insertion sort or selection sort are both typically faster for small arrays (i.e. fewer than 10–20 elements). A useful optimization in practice for the recursive algorithms is to switch to insertion sort or selection sort for "small enough" sublists.

Variants

edit

Heapsortgreatly improves the basic algorithm by using animplicitheapdata structureto speed up finding and removing the lowest datum. If implemented correctly, the heap will allow finding the next lowest element intime instead offor the inner loop in normal selection sort, reducing the total running time to.

A bidirectional variant of selection sort (calleddouble selection sortor sometimescocktail sortdue to its similarity tococktail shaker sort) findsboththe minimum and maximum values in the list in every pass. This requires three comparisons per two items (a pair of elements is compared, then the greater is compared to the maximum and the lesser is compared to the minimum) rather than regular selection sort's one comparison per item, but requires only half as many passes, a net 25% savings.

Selection sort can be implemented as astable sortif, rather than swapping in step 2, the minimum value isinsertedinto the first position and the intervening values shifted up. However, this modification either requires a data structure that supports efficient insertions or deletions, such as a linked list, or it leads to performingwrites.

In thebingo sortvariant, items are sorted by repeatedly looking through the remaining items to find the greatest value and movingallitems with that value to their final location.[1]Likecounting sort,this is an efficient variant if there are many duplicate values: selection sort does one pass through the remaining items for eachitemmoved, while Bingo sort does one pass for eachvalue.After an initial pass to find the greatest value, subsequent passes move every item with that value to its final location while finding the next value as in the followingpseudocode(arrays are zero-based and the for-loop includes both the top and bottom limits, as inPascal):

bingo(arrayA)

{ This procedure sorts in ascending order by
repeatedly moving maximal items to the end. }
begin
last:=length(A)-1;

{ The first iteration is written to look very similar to the subsequent ones,
but without swaps. }
nextMax:=A[last];
fori:=last-1downto0do
ifA[i]>nextMaxthen
nextMax:=A[i];
while(last>0)and(A[last]=nextMax)do
last:=last-1;

whilelast>0dobegin
prevMax:=nextMax;
nextMax:=A[last];
fori:=last-1downto0do
ifA[i]>nextMaxthen
ifA[i]<>prevMaxthen
nextMax:=A[i];
elsebegin
swap(A[i],A[last]);
last:=last-1;
end
while(last>0)and(A[last]=nextMax)do
last:=last-1;
end;
end;

Thus, if on average there are more than two items with the same value, bingo sort can be expected to be faster because it executes the inner loop fewer times than selection sort.

See also

edit

References

edit
  1. ^This article incorporatespublic domain materialfromPaul E. Black."Bingo sort".Dictionary of Algorithms and Data Structures.NIST.
  • Donald Knuth.The Art of Computer Programming,Volume 3:Sorting and Searching,Third Edition. Addison–Wesley, 1997.ISBN0-201-89685-0.Pages 138–141 of Section 5.2.3: Sorting by Selection.
  • Anany Levitin.Introduction to the Design & Analysis of Algorithms,2nd Edition.ISBN0-321-35828-7.Section 3.1: Selection Sort, pp 98–100.
  • Robert Sedgewick.Algorithms in C++, Parts 1–4: Fundamentals, Data Structure, Sorting, Searching: Fundamentals, Data Structures, Sorting, Searching Pts. 1–4,Second Edition. Addison–Wesley Longman, 1998.ISBN0-201-35088-2.Pages 273–274
edit