Hash table

Hash table
Hash table
Type	Unorderedassociative array
Invented	1953
Operation
Time complexityinbig O notation
Operation	Average
Search	Θ(1)
Insert	Θ(1)
Delete	Θ(1)
Space complexity
Space	Θ(n)

Incomputing,ahash tableis adata structurethat implements anassociative array,also called adictionaryor simplymap;an associative array is anabstract data typethat mapskeystovalues.^[2]A hash table uses ahash functionto compute anindex,also called ahash code,into an array ofbucketsorslots,from which the desired value can be found. During lookup, the key is hashed and the resulting hash indicates where the corresponding value is stored. A map implemented by a hash table is called ahash map.

Most hash table designs employ animperfect hash function.Hash collisions,where the hash function generates the same index for more than one key, therefore typically must be accommodated in some way.

In a well-dimensioned hash table, the average time complexity for each lookup is independent of the number of elements stored in the table. Many hash table designs also allow arbitrary insertions and deletions ofkey–value pairs,atamortizedconstant average cost per operation.^[3]^[4]^[5]

Hashing is an example of aspace-time tradeoff.Ifmemoryis infinite, the entire key can be used directly as an index to locate its value with a single memory access. On the other hand, if infinite time is available, values can be stored without regard for their keys, and abinary searchorlinear searchcan be used to retrieve the element.^[6]^: 458

In many situations, hash tables turn out to be on average more efficient thansearch treesor any othertablelookup structure. For this reason, they are widely used in many kinds of computersoftware,particularly forassociative arrays,database inde xing,caches,andsets.

History

The idea of hashing arose independently in different places. In January 1953,Hans Peter Luhnwrote an internalIBMmemorandum that used hashing with chaining. The first example ofopen addressingwas proposed by A. D. Linh, building on Luhn's memorandum.^[4]^: 547Around the same time,Gene Amdahl,Elaine M. McGraw,Nathaniel Rochester,andArthur SamuelofIBM Researchimplemented hashing for theIBM 701 assembler.^[7]^: 124Open addressing with linear probing is credited to Amdahl, althoughAndrey Ershovindependently had the same idea.^[7]^{: 124–125}The term "open addressing" was coined byW. Wesley Petersonin his article which discusses the problem of search in large files.^[8]^: 15

The firstpublishedwork on hashing with chaining is credited toArnold Dumey,who discussed the idea of using remainder modulo a prime as a hash function.^[8]^: 15The word "hashing" was first published in an article by Robert Morris.^[7]^: 126Atheoretical analysisof linear probing was submitted originally by Konheim and Weiss.^[8]^: 15

Overview

Anassociative arraystores asetof (key, value) pairs and allows insertion, deletion, and lookup (search), with the constraint ofunique keys.In the hash table implementation of associative arrays, an array $A$ of length $m$ is partially filled with $n$ elements, where $m\geq n$ .A value $x$ gets stored at an index location $A[h(x)]$ ,where $h$ is a hash function, and $h(x)<m$ .^[8]^: 2Under reasonable assumptions, hash tables have bettertime complexitybounds on search, delete, and insert operations in comparison toself-balancing binary search trees.^[8]^: 1

Hash tables are also commonly used to implement sets, by omitting the stored value for each key and merely tracking whether the key is present.^[8]^: 1

Load factor

Aload factor $\ Alpha$ is a critical statistic of a hash table, and is defined as follows:^[1] ${\text{load factor}}\ (\ Alpha )={\frac {n}{m}},$ where

$n$ is the number of entries occupied in the hash table.
$m$ is the number of buckets.

The performance of the hash table deteriorates in relation to the load factor $\ Alpha$ .^[8]^: 2

The software typically ensures that the load factor $\ Alpha$ remains below a certain constant, $\ Alpha _{\max }$ .This helps maintain good performance. Therefore, a common approach is to resize or "rehash" the hash table whenever the load factor $\ Alpha$ reaches $\ Alpha _{\max }$ .Similarly the table may also be resized if the load factor drops below $\ Alpha _{\max }/4$ .^[9]

Load factor for separate chaining

With separate chaining hash tables, each slot of the bucket array stores a pointer to a list or array of data.^[10]

Separate chaining hash tables suffer gradually declining performance as the load factor grows, and no fixed point beyond which resizing is absolutely needed.^[9]

With separate chaining, the value of $\ Alpha _{\max }$ that gives best performance is typically between 1 and 3.^[9]

Load factor for open addressing

With open addressing, each slot of the bucket array holds exactly one item. Therefore an open-addressed hash table cannot have a load factor greater than 1.^[10]

The performance of open addressing becomes very bad when the load factor approaches 1.^[9]

Therefore a hash table that uses open addressingmustbe resized orrehashedif the load factor $\ Alpha$ approaches 1.^[9]

With open addressing, acceptable figures of max load factor $\ Alpha _{\max }$ should range around 0.6 to 0.75.^[11]^[12]^: 110

Hash function

Ahash function $h:U\rightarrow \{0,...,m-1\}$ maps the universe $U$ of keys to indices or slots within the table, that is, $h(x)\in \{0,...,m-1\}$ for $x\in U$ .The conventional implementations of hash functions are based on theinteger universe assumptionthat all elements of the table stem from the universe $U=\{0,...,u-1\}$ ,where thebit lengthof $u$ is confined within theword sizeof acomputer architecture.^[8]^: 2

A hash function $h$ is said to beperfectfor a given set $S$ if it isinjectiveon $S$ ,that is, if each element $x\in S$ maps to a different value in ${0,...,m-1}$ .^[13]^[14]A perfect hash function can be created if all the keys are known ahead of time.^[13]

Integer universe assumption

The schemes of hashing used ininteger universe assumptioninclude hashing by division, hashing by multiplication,universal hashing,dynamic perfect hashing,andstatic perfect hashing.^[8]^: 2However, hashing by division is the commonly used scheme.^[15]^: 264^[12]^: 110

Hashing by division

The scheme in hashing by division is as follows:^[8]^: 2 $h(x)\ =\ x\,{\bmod {\,}}m$ where $h(x)$ is the hash value of $x\in S$ and $m$ is the size of the table.

Hashing by multiplication

The scheme in hashing by multiplication is as follows:^[8]^: 2–3 $h(x)=\lfloor m{\bigl (}(xA){\bmod {1}}{\bigr )}\rfloor$ Where $A$ is a non-integerreal-valued constantand $m$ is the size of the table. An advantage of the hashing by multiplication is that the $m$ is not critical.^[8]^: 2–3Although any value $A$ produces a hash function,Donald Knuthsuggests using thegolden ratio.^[8]^: 3

Choosing a hash function

Uniform distributionof the hash values is a fundamental requirement of a hash function. A non-uniform distribution increases the number of collisions and the cost of resolving them. Uniformity is sometimes difficult to ensure by design, but may be evaluated empirically using statistical tests, e.g., aPearson's chi-squared testfor discrete uniform distributions.^[16]^[17]

The distribution needs to be uniform only for table sizes that occur in the application. In particular, if one uses dynamic resizing with exact doubling and halving of the table size, then the hash function needs to be uniform only when the size is apower of two.Here the index can be computed as some range of bits of the hash function. On the other hand, some hashing algorithms prefer to have the size be aprime number.^[18]

Foropen addressingschemes, the hash function should also avoidclustering,the mapping of two or more keys to consecutive slots. Such clustering may cause the lookup cost to skyrocket, even if the load factor is low and collisions are infrequent. The popular multiplicative hash is claimed to have particularly poor clustering behavior.^[18]^[4]

K-independent hashingoffers a way to prove a certain hash function does not have bad keysets for a given type of hashtable. A number of K-independence results are known for collision resolution schemes such as linear probing and cuckoo hashing. Since K-independence can prove a hash function works, one can then focus on finding the fastest possible such hash function.^[19]

Collision resolution

A search algorithm that uses hashing consists of two parts. The first part is computing ahash functionwhich transforms the search key into anarray index.The ideal case is such that no two search keys hashes to the same array index. However, this is not always the case and is impossible to guarantee for unseen given data.^[20]^: 515Hence the second part of the algorithm is collision resolution. The two common methods for collision resolution are separate chaining and open addressing.^[6]^: 458

Separate chaining

In separate chaining, the process involves building alinked listwithkey–value pairfor each search array index. The collided items are chained together through a single linked list, which can be traversed to access the item with a unique search key.^[6]^: 464Collision resolution through chaining with linked list is a common method of implementation of hash tables. Let $T$ and $x$ be the hash table and the node respectively, the operation involves as follows:^[15]^: 258

Chained-Hash-Insert(T,k)
insertxat the head of linked listT[h(k)]

Chained-Hash-Search(T,k)
search for an element with keykin linked listT[h(k)]

Chained-Hash-Delete(T,k)
deletexfrom the linked listT[h(k)]

If the element is comparable eithernumericallyorlexically,and inserted into the list by maintaining thetotal order,it results in faster termination of the unsuccessful searches.^[20]^{: 520–521}

Other data structures for separate chaining

If the keys areordered,it could be efficient to use "self-organizing"concepts such as using aself-balancing binary search tree,through which thetheoretical worst casecould be brought down to $O(\log {n})$ ,although it introduces additional complexities.^[20]^: 521

Indynamic perfect hashing,two-level hash tables are used to reduce the look-up complexity to be a guaranteed $O(1)$ in the worst case. In this technique, the buckets of $k$ entries are organized asperfect hash tableswith $k^{2}$ slots providing constant worst-case lookup time, and low amortized time for insertion.^[21]A study shows array-based separate chaining to be 97% more performant when compared to the standard linked list method under heavy load.^[22]^: 99

Techniques such as usingfusion treefor each buckets also result in constant time for all operations with high probability.^[23]

Caching and locality of reference

The linked list of separate chaining implementation may not becache-consciousdue tospatial locality—locality of reference—when the nodes of the linked list are scattered across memory, thus the list traversal during insert and search may entailCPU cacheinefficiencies.^[22]^: 91

Incache-conscious variantsof collision resolution through separate chaining, adynamic arrayfound to be morecache-friendlyis used in the place where a linked list or self-balancing binary search trees is usually deployed, since thecontiguous allocationpattern of the array could be exploited byhardware-cache prefetchers—such astranslation lookaside buffer—resulting in reduced access time and memory consumption.^[24]^[25]^[26]

Open addressing

Open addressingis another collision resolution technique in which every entry record is stored in the bucket array itself, and the hash resolution is performed throughprobing.When a new entry has to be inserted, the buckets are examined, starting with the hashed-to slot and proceeding in someprobe sequence,until an unoccupied slot is found. When searching for an entry, the buckets are scanned in the same sequence, until either the target record is found, or an unused array slot is found, which indicates an unsuccessful search.^[27]

Well-known probe sequences include:

Linear probing,in which the interval between probes is fixed (usually 1).^[28]
Quadratic probing,in which the interval between probes is increased by adding the successive outputs of a quadratic polynomial to the value given by the original hash computation.^[29]^: 272
Double hashing,in which the interval between probes is computed by a secondary hash function.^[29]^{: 272–273}

The performance of open addressing may be slower compared to separate chaining since the probe sequence increases when the load factor $\ Alpha$ approaches 1.^[9]^[22]^: 93The probing results in aninfinite loopif the load factor reaches 1, in the case of a completely filled table.^[6]^: 471Theaverage costof linear probing depends on the hash function's ability todistributethe elementsuniformlythroughout the table to avoidclustering,since formation of clusters would result in increased search time.^[6]^: 472

Caching and locality of reference

Since the slots are located in successive locations, linear probing could lead to better utilization ofCPU cachedue tolocality of referencesresulting in reducedmemory latency.^[28]

Other collision resolution techniques based on open addressing

Coalesced hashing

Coalesced hashingis a hybrid of both separate chaining and open addressing in which the buckets or nodes link within the table.^[30]^: 6–8The algorithm is ideally suited forfixed memory allocation.^[30]^: 4The collision in coalesced hashing is resolved by identifying the largest-indexed empty slot on the hash table, then the colliding value is inserted into that slot. The bucket is also linked to the inserted node's slot which contains its colliding hash address.^[30]^: 8

Cuckoo hashing

Cuckoo hashingis a form of open addressing collision resolution technique which guarantees $O(1)$ worst-case lookup complexity and constant amortized time for insertions. The collision is resolved through maintaining two hash tables, each having its own hashing function, and collided slot gets replaced with the given item, and the preoccupied element of the slot gets displaced into the other hash table. The process continues until every key has its own spot in the empty buckets of the tables; if the procedure enters intoinfinite loop—which is identified through maintaining a threshold loop counter—both hash tables get rehashed with newer hash functions and the procedure continues.^[31]^{: 124–125}

Hopscotch hashing

Hopscotch hashingis an open addressing based algorithm which combines the elements ofcuckoo hashing,linear probingand chaining through the notion of aneighbourhoodof buckets—the subsequent buckets around any given occupied bucket, also called a "virtual" bucket.^[32]^{: 351–352}The algorithm is designed to deliver better performance when the load factor of the hash table grows beyond 90%; it also provides high throughput inconcurrent settings,thus well suited for implementing resizableconcurrent hash table.^[32]^: 350The neighbourhood characteristic of hopscotch hashing guarantees a property that, the cost of finding the desired item from any given buckets within the neighbourhood is very close to the cost of finding it in the bucket itself; the algorithm attempts to be an item into its neighbourhood—with a possible cost involved in displacing other items.^[32]^: 352

Each bucket within the hash table includes an additional "hop-information" —anH-bitbit arrayfor indicating therelative distanceof the item which was originally hashed into the current virtual bucket withinH-1 entries.^[32]^: 352Let $k$ and $Bk$ be the key to be inserted and bucket to which the key is hashed into respectively; several cases are involved in the insertion procedure such that the neighbourhood property of the algorithm is vowed:^[32]^{: 352–353}if $Bk$ is empty, the element is inserted, and the leftmost bit of bitmap issetto 1; if not empty, linear probing is used for finding an empty slot in the table, the bitmap of the bucket gets updated followed by the insertion; if the empty slot is not within the range of theneighbourhood,i.e.H-1, subsequent swap and hop-info bit array manipulation of each bucket is performed in accordance with its neighbourhoodinvariant properties.^[32]^: 353

Robin Hood hashing

Robin Hood hashing is an open addressing based collision resolution algorithm; the collisions are resolved through favouring the displacement of the element that is farthest—or longestprobe sequence length(PSL)—from its "home location" i.e. the bucket to which the item was hashed into.^[33]^: 12Although Robin Hood hashing does not change thetheoretical search cost,it significantly affects thevarianceof thedistributionof the items on the buckets,^[34]^: 2i.e. dealing withclusterformation in the hash table.^[35]Each node within the hash table that uses Robin Hood hashing should be augmented to store an extra PSL value.^[36]Let $x$ be the key to be inserted, $x.psl$ be the (incremental) PSL length of $x$ , $T$ be the hash table and $j$ be the index, the insertion procedure is as follows:^[33]^: 12–13^[37]^: 5

If $x.psl\ \leq \ T[j].psl$ :the iteration goes into the next bucket without attempting an external probe.
If $x.psl\ >\ T[j].psl$ :insert the item $x$ into the bucket $j$ ;swap $x$ with $T[j]$ —let it be $x'$ ;continue the probe from the $j+1$ st bucket to insert $x'$ ;repeat the procedure until every element is inserted.

Dynamic resizing

Repeated insertions cause the number of entries in a hash table to grow, which consequently increases the load factor; to maintain the amortized $O(1)$ performance of the lookup and insertion operations, a hash table is dynamically resized and the items of the tables arerehashedinto the buckets of the new hash table,^[9]since the items cannot be copied over as varying table sizes results in different hash value due tomodulo operation.^[38]If a hash table becomes "too empty" after deleting some elements, resizing may be performed to avoid excessivememory usage.^[39]

Resizing by moving all entries

Generally, a new hash table with a size double that of the original hash table getsallocatedprivately and every item in the original hash table gets moved to the newly allocated one by computing the hash values of the items followed by the insertion operation. Rehashing is simple, but computationally expensive.^[40]^{: 478–479}

Alternatives to all-at-once rehashing

Some hash table implementations, notably inreal-time systems,cannot pay the price of enlarging the hash table all at once, because it may interrupt time-critical operations. If one cannot avoid dynamic resizing, a solution is to perform the resizing gradually to avoid storage blip—typically at 50% of new table's size—during rehashing and to avoidmemory fragmentationthat triggersheap compactiondue to deallocation of largememory blockscaused by the old hash table.^[41]^: 2–3In such case, the rehashing operation is done incrementally through extending prior memory block allocated for the old hash table such that the buckets of the hash table remain unaltered. A common approach for amortized rehashing involves maintaining two hash functions $h_{\text{old}}$ and $h_{\text{new}}$ .The process of rehashing a bucket's items in accordance with the new hash function is termed ascleaning,which is implemented throughcommand patternby encapsulating the operations such as $\mathrm {Add} (\mathrm {key} )$ , $\mathrm {Get} (\mathrm {key} )$ and $\mathrm {Delete} (\mathrm {key} )$ through a $\mathrm {Lookup} (\mathrm {key},{\text{command}})$ wrappersuch that each element in the bucket gets rehashed and its procedure involve as follows:^[41]^: 3

Clean $\mathrm {Table} [h_{\text{old}}(\mathrm {key} )]$ bucket.
Clean $\mathrm {Table} [h_{\text{new}}(\mathrm {key} )]$ bucket.
Thecommandgets executed.

Linear hashing

Linear hashingis an implementation of the hash table which enables dynamic growths or shrinks of the table one bucket at a time.^[42]

Performance

The performance of a hash table is dependent on the hash function's ability in generatingquasi-random numbers( $\sigma$ ) for entries in the hash table where $K$ , $n$ and $h(x)$ denotes the key, number of buckets and the hash function such that $\sigma \ =\ h(K)\ \%\ n$ .If the hash function generates the same $\sigma$ for distinct keys ( $K_{1}\neq K_{2},\ h(K_{1})\ =\ h(K_{2})$ ), this results incollision,which is dealt with in a variety of ways. The constant time complexity ( $O(1)$ ) of the operation in a hash table is presupposed on the condition that the hash function doesn't generate colliding indices; thus, the performance of the hash table isdirectly proportionalto the chosen hash function's ability todispersethe indices.^[43]^: 1However, construction of such a hash function ispractically infeasible,that being so, implementations depend oncase-specific collision resolution techniquesin achieving higher performance.^[43]^: 2

Applications

Associative arrays

Hash tables are commonly used to implement many types of in-memory tables. They are used to implementassociative arrays.^[29]

Database inde xing

Hash tables may also be used asdisk-based data structures anddatabase indices(such as indbm) althoughB-treesare more popular in these applications.^[44]

Caches

Hash tables can be used to implementcaches,auxiliary data tables that are used to speed up the access to data that is primarily stored in slower media. In this application, hash collisions can be handled by discarding one of the two colliding entries—usually erasing the old item that is currently stored in the table and overwriting it with the new item, so every item in the table has a unique hash value.^[45]^[46]

Sets

Hash tables can be used in the implementation ofset data structure,which can store unique values without any particular order; set is typically used in testing the membership of a value in the collection, rather than element retrieval.^[47]

Transposition table

Atransposition tableto a complex Hash Table which stores information about each section that has been searched.^[48]

Implementations

Many programming languages provide hash table functionality, either as built-in associative arrays or asstandard librarymodules.

InJavaScript,an "object" is a mutable collection of key-value pairs (called "properties" ), where each key is either a string or a guaranteed-unique "symbol"; any other value, when used as a key, is firstcoercedto a string. Aside from the seven "primitive" data types, every value in JavaScript is an object.^[49]ECMAScript 2015 also added theMapdata structure, which accepts arbitrary values as keys.^[50]

C++11includesunordered_mapin its standard library for storing keys and values ofarbitrary types.^[51]

Go's built-inmapimplements a hash table in the form of atype.^[52]

Javaprogramming language includes theHashSet,HashMap,LinkedHashSet,andLinkedHashMapgenericcollections.^[53]

Python's built-indictimplements a hash table in the form of atype.^[54]

Ruby's built-inHashuses the open addressing model from Ruby 2.4 onwards.^[55]

Rustprogramming language includesHashMap,HashSetas part of the Rust Standard Library.^[56]

The.NETstandard library includesHashSetandDictionary,^[57]^[58]so it can be used from languages such asC#andVB.NET.^[59]

References

^^a ^b Cormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford(2009).Introduction to Algorithms(3rd ed.). Massachusetts Institute of Technology. pp. 253–280.ISBN 978-0-262-03384-8.
^Mehlhorn, Kurt;Sanders, Peter(2008)."Hash Tables and Associative Arrays"(PDF).Algorithms and Data Structures.Springer. pp. 81–98.doi:10.1007/978-3-540-77978-0_4.ISBN 978-3-540-77977-3.
^Leiserson, Charles E.(Fall 2005)."Lecture 13: Amortized Algorithms, Table Doubling, Potential Method".course MIT 6.046J/18.410J Introduction to Algorithms.Archivedfrom the original on August 7, 2009.
^^a ^b ^cKnuth, Donald(1998).The Art of Computer Programming.Vol. 3:Sorting and Searching(2nd ed.). Addison-Wesley. pp. 513–558.ISBN 978-0-201-89685-5.
^Cormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford(2001). "Chapter 11: Hash Tables".Introduction to Algorithms(2nd ed.). MIT Press and McGraw-Hill. pp.221–252.ISBN 978-0-262-53196-2.
^^a ^b ^c ^d ^eSedgewick, Robert;Wayne, Kevin (2011).Algorithms.Vol. 1 (4 ed.). Addison-Wesley Professional – viaPrinceton University,Department of Computer Science.
^^a ^b ^cKonheim, Alan G. (2010).Hashing in Computer Science.doi:10.1002/9780470630617.ISBN 978-0-470-34473-6.
^^a ^b ^c ^d ^e ^f ^g ^h ⁱ ^j ^k ^l ^mMehta, Dinesh P.; Mehta, Dinesh P.; Sahni, Sartaj, eds. (2004).Handbook of Data Structures and Applications.doi:10.1201/9781420035179.ISBN 978-0-429-14701-2.
^^a ^b ^c ^d ^e ^f ^gMayers, Andrew (2008)."CS 312: Hash tables and amortized analysis".Cornell University,Department of Computer Science.Archivedfrom the original on April 26, 2021.RetrievedOctober 26,2021– via cs.cornell.edu.
^^a ^b James S. Plank and Brad Vander Zanden. "CS140 Lecture notes -- Hashing".
^Maurer, W. D.; Lewis, T. G. (March 1975). "Hash Table Methods".ACM Computing Surveys.7(1): 5–19.doi:10.1145/356643.356645.S2CID 17874775.
^^a ^bOwolabi, Olumide (February 2003). "Empirical studies of some hashing functions".Information and Software Technology.45(2): 109–112.doi:10.1016/S0950-5849(02)00174-X.
^^a ^bLu, Yi; Prabhakar, Balaji; Bonomi, Flavio (2006).Perfect Hashing for Network Applications.2006 IEEE International Symposium on Information Theory. pp. 2774–2778.doi:10.1109/ISIT.2006.261567.ISBN 1-4244-0505-X.S2CID 1494710.
^Belazzougui, Djamal; Botelho, Fabiano C.; Dietzfelbinger, Martin (2009)."Hash, displace, and compress"(PDF).Algorithms—ESA 2009: 17th Annual European Symposium, Copenhagen, Denmark, September 7–9, 2009, Proceedings.Lecture Notes in Computer Science.Vol. 5757. Berlin: Springer. pp. 682–693.CiteSeerX10.1.1.568.130.doi:10.1007/978-3-642-04128-0_61.MR 2557794.
^^a ^bCormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford(2001). "Chapter 11: Hash Tables".Introduction to Algorithms(2nd ed.).Massachusetts Institute of Technology.ISBN 978-0-262-53196-2.
^Pearson, Karl(1900)."On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling".Philosophical Magazine.Series 5.50(302): 157–175.doi:10.1080/14786440009463897.
^Plackett, Robin(1983). "Karl Pearson and the Chi-Squared Test".International Statistical Review.51(1): 59–72.doi:10.2307/1402731.JSTOR 1402731.
^^a ^bWang, Thomas (March 1997)."Prime Double Hash Table".Archived fromthe originalon September 3, 1999.RetrievedMay 10,2015.
^Wegman, Mark N.; Carter, J.Lawrence (June 1981)."New hash functions and their use in authentication and set equality".Journal of Computer and System Sciences.22(3): 265–279.doi:10.1016/0022-0000(81)90033-7.
^^a ^b ^cDonald E. Knuth(April 24, 1998).The Art of Computer Programming: Volume 3: Sorting and Searching.Addison-Wesley Professional.ISBN 978-0-201-89685-5.
^Demaine, Erik; Lind, Jeff (Spring 2003)."Lecture 2"(PDF).6.897: Advanced Data Structures. MIT Computer Science and Artificial Intelligence Laboratory.Archived(PDF)from the original on June 15, 2010.RetrievedJune 30,2008.
^^a ^b ^cCulpepper, J. Shane; Moffat, Alistair (2005). "Enhanced Byte Codes with Restricted Prefix Properties".String Processing and Information Retrieval.Lecture Notes in Computer Science. Vol. 3772. pp. 1–12.doi:10.1007/11575832_1.ISBN 978-3-540-29740-6.
^Willard, Dan E.(2000). "Examining computational geometry, van Emde Boas trees, and hashing from the perspective of the fusion tree".SIAM Journal on Computing.29(3): 1030–1049.doi:10.1137/S0097539797322425.MR 1740562..
^Askitis, Nikolas; Sinha, Ranjan (October 2010). "Engineering scalable, cache and space efficient tries for strings".The VLDB Journal.19(5): 633–660.doi:10.1007/s00778-010-0183-9.
^Askitis, Nikolas; Zobel, Justin (October 2005). "Cache-conscious Collision Resolution in String Hash Tables".Proceedings of the 12th International Conference, String Processing and Information Retrieval (SPIRE 2005).Vol. 3772/2005. pp. 91–102.doi:10.1007/11575832_11.ISBN 978-3-540-29740-6.
^Askitis, Nikolas (2009)."Fast and Compact Hash Tables for Integer Keys"(PDF).Proceedings of the 32nd Australasian Computer Science Conference (ACSC 2009).Vol. 91. pp. 113–122.ISBN 978-1-920682-72-9.Archived fromthe original(PDF)on February 16, 2011.RetrievedJune 13,2010.
^Tenenbaum, Aaron M.; Langsam, Yedidyah; Augenstein, Moshe J. (1990).Data Structures Using C.Prentice Hall. pp. 456–461, p. 472.ISBN 978-0-13-199746-2.
^^a ^bPagh, Rasmus;Rodler, Flemming Friche (2001). "Cuckoo Hashing".Algorithms — ESA 2001.Lecture Notes in Computer Science. Vol. 2161. pp. 121–133.CiteSeerX10.1.1.25.4189.doi:10.1007/3-540-44676-1_10.ISBN 978-3-540-42493-2.
^^a ^b ^cCormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford(2001), "11 Hash Tables",Introduction to Algorithms(2nd ed.),MIT PressandMcGraw-Hill,pp. 221–252,ISBN 0-262-03293-7.
^^a ^b ^cVitter, Jeffery S.; Chen, Wen-Chin (1987).The design and analysis of coalesced hashing.New York, United States:Oxford University Press.ISBN 978-0-19-504182-8– viaArchive.org.
^Pagh, Rasmus;Rodler, Flemming Friche (2001). "Cuckoo Hashing".Algorithms — ESA 2001.Lecture Notes in Computer Science. Vol. 2161. pp. 121–133.CiteSeerX10.1.1.25.4189.doi:10.1007/3-540-44676-1_10.ISBN 978-3-540-42493-2.
^^a ^b ^c ^d ^e ^fHerlihy, Maurice; Shavit, Nir; Tzafrir, Moran (2008). "Hopscotch Hashing".Distributed Computing.Lecture Notes in Computer Science. Vol. 5218. pp. 350–364.doi:10.1007/978-3-540-87779-0_24.ISBN 978-3-540-87778-3.
^^a ^bCelis, Pedro (1986).Robin Hood Hashing(PDF).Ontario, Canada:University of Waterloo,Dept. of Computer Science.ISBN 978-0-315-29700-5.OCLC 14083698.Archived(PDF)from the original on November 1, 2021.RetrievedNovember 2,2021.
^Poblete, P. V.; Viola, A. (July 2019). "Analysis of Robin Hood and Other Hashing Algorithms Under the Random Probing Model, With and Without Deletions".Combinatorics, Probability and Computing.28(4): 600–617.doi:10.1017/S0963548318000408.S2CID 125374363.
^Clarkson, Michael (2014)."Lecture 13: Hash tables".Cornell University,Department of Computer Science.Archivedfrom the original on October 7, 2021.RetrievedNovember 1,2021– via cs.cornell.edu.
^Gries, David (2017)."JavaHyperText and Data Structure: Robin Hood Hashing"(PDF).Cornell University,Department of Computer Science.Archived(PDF)from the original on April 26, 2021.RetrievedNovember 2,2021– via cs.cornell.edu.
^Celis, Pedro (March 28, 1988).External Robin Hood Hashing(PDF)(Technical report). Bloomington, Indiana:Indiana University,Department of Computer Science. 246.Archived(PDF)from the original on November 3, 2021.RetrievedNovember 2,2021.
^Goddard, Wayne (2021)."Chapter C5: Hash Tables"(PDF).Clemson University.pp. 15–16.RetrievedDecember 4,2023.
^Devadas, Srini; Demaine, Erik (February 25, 2011)."Intro to Algorithms: Resizing Hash Tables"(PDF).Massachusetts Institute of Technology,Department of Computer Science.Archived(PDF)from the original on May 7, 2021.RetrievedNovember 9,2021– viaMIT OpenCourseWare.
^Thareja, Reema (2014). "Hashing and Collision".Data Structures Using C.Oxford University Press. pp. 464–488.ISBN 978-0-19-809930-7.
^^a ^bFriedman, Scott; Krishnan, Anand; Leidefrost, Nicholas (March 18, 2003)."Hash Tables for Embedded and Real-time systems"(PDF).All Computer Science and Engineering Research.Washington University in St. Louis.doi:10.7936/K7WD3XXV.Archived(PDF)from the original on June 9, 2021.RetrievedNovember 9,2021– viaNorthwestern University,Department of Computer Science.
^Litwin, Witold (1980)."Linear hashing: A new tool for file and table addressing"(PDF).Proc. 6th Conference on Very Large Databases.Carnegie Mellon University.pp. 212–223.Archived(PDF)from the original on May 6, 2021.RetrievedNovember 10,2021– via cs.cmu.edu.
^^a ^bDijk, Tom Van (2010)."Analysing and Improving Hash Table Performance"(PDF).Netherlands:University of Twente.Archived(PDF)from the original on November 6, 2021.RetrievedDecember 31,2021.
^Lech Banachowski."Indexes and external sorting".pl:Polsko-Japońska Akademia Technik Komputerowych.Archived fromthe originalon March 26, 2022.RetrievedMarch 26,2022.
^Zhong, Liang; Zheng, Xueqian; Liu, Yong; Wang, Mengting; Cao, Yang (February 2020). "Cache hit ratio maximization in device-to-device communications overlaying cellular networks".China Communications.17(2): 232–238.doi:10.23919/jcc.2020.02.018.S2CID 212649328.
^Bottommley, James (January 1, 2004)."Understanding Caching".Linux Journal.Archivedfrom the original on December 4, 2020.RetrievedApril 16,2022.
^Jill Seaman (2014)."Set & Hash Tables"(PDF).Texas State University.Archived from the original on April 1, 2022.RetrievedMarch 26,2022.{{cite web}}:CS1 maint: bot: original URL status unknown (link)
^"Transposition Table - Chessprogramming wiki".chessprogramming.org.Archivedfrom the original on February 14, 2021.RetrievedMay 1,2020.
^"JavaScript data types and data structures - JavaScript | MDN".developer.mozilla.org.RetrievedJuly 24,2022.
^"Map - JavaScript | MDN".developer.mozilla.org.June 20, 2023.RetrievedJuly 15,2023.
^"Programming language C++ - Technical Specification"(PDF).International Organization for Standardization.pp. 812–813. Archived fromthe original(PDF)on January 21, 2022.RetrievedFebruary 8,2022.
^"The Go Programming Language Specification".go.dev.RetrievedJanuary 1,2023.
^"Lesson: Implementations (The Java™ Tutorials > Collections)".docs.oracle.Archivedfrom the original on January 18, 2017.RetrievedApril 27,2018.
^Zhang, Juan; Jia, Yunwei (2020)."Redis rehash optimization based on machine learning".Journal of Physics: Conference Series.1453(1): 3.Bibcode:2020JPhCS1453a2048Z.doi:10.1088/1742-6596/1453/1/012048.S2CID 215943738.
^Jonan Scheffler (December 25, 2016)."Ruby 2.4 Released: Faster Hashes, Unified Integers and Better Rounding".heroku.Archivedfrom the original on July 3, 2019.RetrievedJuly 3,2019.
^"doc.rust-lang.org".Archivedfrom the original on December 8, 2022.RetrievedDecember 14,2022.
^"HashSet Class (System.Collections.Generic)".learn.microsoft.RetrievedJuly 1,2023.
^dotnet-bot."Dictionary Class (System.Collections.Generic)".learn.microsoft.RetrievedJanuary 16,2024.
^"VB.NET HashSet Example".Dot Net Perls.

External links

NISTentry onhash tables
Open Data Structures – Chapter 5 – Hash Tables,Pat Morin
MIT's Introduction to Algorithms: Hashing 1MIT OCW lecture Video
MIT's Introduction to Algorithms: Hashing 2MIT OCW lecture Video

[Cormen_et_al-1] Cormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford(2009).Introduction to Algorithms(3rd ed.). Massachusetts Institute of Technology. pp. 253–280.ISBN 978-0-262-03384-8.

[ms-2] Mehlhorn, Kurt;Sanders, Peter(2008)."Hash Tables and Associative Arrays"(PDF).Algorithms and Data Structures.Springer. pp. 81–98.doi:10.1007/978-3-540-77978-0_4.ISBN 978-3-540-77977-3.

[leiser-3] Leiserson, Charles E.(Fall 2005)."Lecture 13: Amortized Algorithms, Table Doubling, Potential Method".course MIT 6.046J/18.410J Introduction to Algorithms.Archivedfrom the original on August 7, 2009.

[knuth-4] Knuth, Donald(1998).The Art of Computer Programming.Vol. 3:Sorting and Searching(2nd ed.). Addison-Wesley. pp. 513–558.ISBN 978-0-201-89685-5.

[cormen-5] Cormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford(2001). "Chapter 11: Hash Tables".Introduction to Algorithms(2nd ed.). MIT Press and McGraw-Hill. pp.221–252.ISBN 978-0-262-53196-2.

[algo1rob-6] Sedgewick, Robert;Wayne, Kevin (2011).Algorithms.Vol. 1 (4 ed.). Addison-Wesley Professional – viaPrinceton University,Department of Computer Science.

[Konheim-7] Konheim, Alan G. (2010).Hashing in Computer Science.doi:10.1002/9780470630617.ISBN 978-0-470-34473-6.

[hashhist-8] ^^a ^b ^c ^d ^e ^f ^g ^h ⁱ ^j ^k ^l ^mMehta, Dinesh P.; Mehta, Dinesh P.; Sahni, Sartaj, eds. (2004).Handbook of Data Structures and Applications.doi:10.1201/9781420035179.ISBN 978-0-429-14701-2.

[cornell08-9] ^^a ^b ^c ^d ^e ^f ^gMayers, Andrew (2008)."CS 312: Hash tables and amortized analysis".Cornell University,Department of Computer Science.Archivedfrom the original on April 26, 2021.RetrievedOctober 26,2021– via cs.cornell.edu.

[plank-10] James S. Plank and Brad Vander Zanden. "CS140 Lecture notes -- Hashing".

[11] Maurer, W. D.; Lewis, T. G. (March 1975). "Hash Table Methods".ACM Computing Surveys.7(1): 5–19.doi:10.1145/356643.356645.S2CID 17874775.

[owo03-12] Owolabi, Olumide (February 2003). "Empirical studies of some hashing functions".Information and Software Technology.45(2): 109–112.doi:10.1016/S0950-5849(02)00174-X.

[Yi06-13] Lu, Yi; Prabhakar, Balaji; Bonomi, Flavio (2006).Perfect Hashing for Network Applications.2006 IEEE International Symposium on Information Theory. pp. 2774–2778.doi:10.1109/ISIT.2006.261567.ISBN 1-4244-0505-X.S2CID 1494710.

[CHD-14] Belazzougui, Djamal; Botelho, Fabiano C.; Dietzfelbinger, Martin (2009)."Hash, displace, and compress"(PDF).Algorithms—ESA 2009: 17th Annual European Symposium, Copenhagen, Denmark, September 7–9, 2009, Proceedings.Lecture Notes in Computer Science.Vol. 5757. Berlin: Springer. pp. 682–693.CiteSeerX10.1.1.568.130.doi:10.1007/978-3-642-04128-0_61.MR 2557794.

[cormenalgo01-15] Cormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford(2001). "Chapter 11: Hash Tables".Introduction to Algorithms(2nd ed.).Massachusetts Institute of Technology.ISBN 978-0-262-53196-2.

[chernoff-16] Pearson, Karl(1900)."On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling".Philosophical Magazine.Series 5.50(302): 157–175.doi:10.1080/14786440009463897.

[plackett-17] Plackett, Robin(1983). "Karl Pearson and the Chi-Squared Test".International Statistical Review.51(1): 59–72.doi:10.2307/1402731.JSTOR 1402731.

[:0-18] Wang, Thomas (March 1997)."Prime Double Hash Table".Archived fromthe originalon September 3, 1999.RetrievedMay 10,2015.

[19] Wegman, Mark N.; Carter, J.Lawrence (June 1981)."New hash functions and their use in authentication and set equality".Journal of Computer and System Sciences.22(3): 265–279.doi:10.1016/0022-0000(81)90033-7.

[donald3-20] Donald E. Knuth(April 24, 1998).The Art of Computer Programming: Volume 3: Sorting and Searching.Addison-Wesley Professional.ISBN 978-0-201-89685-5.

[21] Demaine, Erik; Lind, Jeff (Spring 2003)."Lecture 2"(PDF).6.897: Advanced Data Structures. MIT Computer Science and Artificial Intelligence Laboratory.Archived(PDF)from the original on June 15, 2010.RetrievedJune 30,2008.

[nick05-22] Culpepper, J. Shane; Moffat, Alistair (2005). "Enhanced Byte Codes with Restricted Prefix Properties".String Processing and Information Retrieval.Lecture Notes in Computer Science. Vol. 3772. pp. 1–12.doi:10.1007/11575832_1.ISBN 978-3-540-29740-6.

[23] Willard, Dan E.(2000). "Examining computational geometry, van Emde Boas trees, and hashing from the perspective of the fusion tree".SIAM Journal on Computing.29(3): 1030–1049.doi:10.1137/S0097539797322425.MR 1740562..

[24] Askitis, Nikolas; Sinha, Ranjan (October 2010). "Engineering scalable, cache and space efficient tries for strings".The VLDB Journal.19(5): 633–660.doi:10.1007/s00778-010-0183-9.

[25] Askitis, Nikolas; Zobel, Justin (October 2005). "Cache-conscious Collision Resolution in String Hash Tables".Proceedings of the 12th International Conference, String Processing and Information Retrieval (SPIRE 2005).Vol. 3772/2005. pp. 91–102.doi:10.1007/11575832_11.ISBN 978-3-540-29740-6.

[26] Askitis, Nikolas (2009)."Fast and Compact Hash Tables for Integer Keys"(PDF).Proceedings of the 32nd Australasian Computer Science Conference (ACSC 2009).Vol. 91. pp. 113–122.ISBN 978-1-920682-72-9.Archived fromthe original(PDF)on February 16, 2011.RetrievedJune 13,2010.

[tenenbaum90-27] Tenenbaum, Aaron M.; Langsam, Yedidyah; Augenstein, Moshe J. (1990).Data Structures Using C.Prentice Hall. pp. 456–461, p. 472.ISBN 978-0-13-199746-2.

[Cuckoo-28] Pagh, Rasmus;Rodler, Flemming Friche (2001). "Cuckoo Hashing".Algorithms — ESA 2001.Lecture Notes in Computer Science. Vol. 2161. pp. 121–133.CiteSeerX10.1.1.25.4189.doi:10.1007/3-540-44676-1_10.ISBN 978-3-540-42493-2.

[clrs-29] Cormen, Thomas H.;Leiserson, Charles E.;Rivest, Ronald L.;Stein, Clifford(2001), "11 Hash Tables",Introduction to Algorithms(2nd ed.),MIT PressandMcGraw-Hill,pp. 221–252,ISBN 0-262-03293-7.

[chen87-30] Vitter, Jeffery S.; Chen, Wen-Chin (1987).The design and analysis of coalesced hashing.New York, United States:Oxford University Press.ISBN 978-0-19-504182-8– viaArchive.org.

[31] Pagh, Rasmus;Rodler, Flemming Friche (2001). "Cuckoo Hashing".Algorithms — ESA 2001.Lecture Notes in Computer Science. Vol. 2161. pp. 121–133.CiteSeerX10.1.1.25.4189.doi:10.1007/3-540-44676-1_10.ISBN 978-3-540-42493-2.

[nir08-32] Herlihy, Maurice; Shavit, Nir; Tzafrir, Moran (2008). "Hopscotch Hashing".Distributed Computing.Lecture Notes in Computer Science. Vol. 5218. pp. 350–364.doi:10.1007/978-3-540-87779-0_24.ISBN 978-3-540-87778-3.

[waterloo86-33] Celis, Pedro (1986).Robin Hood Hashing(PDF).Ontario, Canada:University of Waterloo,Dept. of Computer Science.ISBN 978-0-315-29700-5.OCLC 14083698.Archived(PDF)from the original on November 1, 2021.RetrievedNovember 2,2021.

[34] Poblete, P. V.; Viola, A. (July 2019). "Analysis of Robin Hood and Other Hashing Algorithms Under the Random Probing Model, With and Without Deletions".Combinatorics, Probability and Computing.28(4): 600–617.doi:10.1017/S0963548318000408.S2CID 125374363.

[cornell14-35] Clarkson, Michael (2014)."Lecture 13: Hash tables".Cornell University,Department of Computer Science.Archivedfrom the original on October 7, 2021.RetrievedNovember 1,2021– via cs.cornell.edu.

[36] Gries, David (2017)."JavaHyperText and Data Structure: Robin Hood Hashing"(PDF).Cornell University,Department of Computer Science.Archived(PDF)from the original on April 26, 2021.RetrievedNovember 2,2021– via cs.cornell.edu.

[indiana88-37] Celis, Pedro (March 28, 1988).External Robin Hood Hashing(PDF)(Technical report). Bloomington, Indiana:Indiana University,Department of Computer Science. 246.Archived(PDF)from the original on November 3, 2021.RetrievedNovember 2,2021.

[38] Goddard, Wayne (2021)."Chapter C5: Hash Tables"(PDF).Clemson University.pp. 15–16.RetrievedDecember 4,2023.

[39] Devadas, Srini; Demaine, Erik (February 25, 2011)."Intro to Algorithms: Resizing Hash Tables"(PDF).Massachusetts Institute of Technology,Department of Computer Science.Archived(PDF)from the original on May 7, 2021.RetrievedNovember 9,2021– viaMIT OpenCourseWare.

[40] Thareja, Reema (2014). "Hashing and Collision".Data Structures Using C.Oxford University Press. pp. 464–488.ISBN 978-0-19-809930-7.

[scott03-41] Friedman, Scott; Krishnan, Anand; Leidefrost, Nicholas (March 18, 2003)."Hash Tables for Embedded and Real-time systems"(PDF).All Computer Science and Engineering Research.Washington University in St. Louis.doi:10.7936/K7WD3XXV.Archived(PDF)from the original on June 9, 2021.RetrievedNovember 9,2021– viaNorthwestern University,Department of Computer Science.

[42] Litwin, Witold (1980)."Linear hashing: A new tool for file and table addressing"(PDF).Proc. 6th Conference on Very Large Databases.Carnegie Mellon University.pp. 212–223.Archived(PDF)from the original on May 6, 2021.RetrievedNovember 10,2021– via cs.cmu.edu.

[dijk10-43] Dijk, Tom Van (2010)."Analysing and Improving Hash Table Performance"(PDF).Netherlands:University of Twente.Archived(PDF)from the original on November 6, 2021.RetrievedDecember 31,2021.

[44] Lech Banachowski."Indexes and external sorting".pl:Polsko-Japońska Akademia Technik Komputerowych.Archived fromthe originalon March 26, 2022.RetrievedMarch 26,2022.

[45] Zhong, Liang; Zheng, Xueqian; Liu, Yong; Wang, Mengting; Cao, Yang (February 2020). "Cache hit ratio maximization in device-to-device communications overlaying cellular networks".China Communications.17(2): 232–238.doi:10.23919/jcc.2020.02.018.S2CID 212649328.

[46] Bottommley, James (January 1, 2004)."Understanding Caching".Linux Journal.Archivedfrom the original on December 4, 2020.RetrievedApril 16,2022.

[47] Jill Seaman (2014)."Set & Hash Tables"(PDF).Texas State University.Archived from the original on April 1, 2022.RetrievedMarch 26,2022.{{cite web}}:CS1 maint: bot: original URL status unknown (link)

[48] "Transposition Table - Chessprogramming wiki".chessprogramming.org.Archivedfrom the original on February 14, 2021.RetrievedMay 1,2020.

[49] "JavaScript data types and data structures - JavaScript | MDN".developer.mozilla.org.RetrievedJuly 24,2022.

[50] "Map - JavaScript | MDN".developer.mozilla.org.June 20, 2023.RetrievedJuly 15,2023.

[51] "Programming language C++ - Technical Specification"(PDF).International Organization for Standardization.pp. 812–813. Archived fromthe original(PDF)on January 21, 2022.RetrievedFebruary 8,2022.

[52] "The Go Programming Language Specification".go.dev.RetrievedJanuary 1,2023.

[53] "Lesson: Implementations (The Java™ Tutorials > Collections)".docs.oracle.Archivedfrom the original on January 18, 2017.RetrievedApril 27,2018.

[54] Zhang, Juan; Jia, Yunwei (2020)."Redis rehash optimization based on machine learning".Journal of Physics: Conference Series.1453(1): 3.Bibcode:2020JPhCS1453a2048Z.doi:10.1088/1742-6596/1453/1/012048.S2CID 215943738.

[55] Jonan Scheffler (December 25, 2016)."Ruby 2.4 Released: Faster Hashes, Unified Integers and Better Rounding".heroku.Archivedfrom the original on July 3, 2019.RetrievedJuly 3,2019.

[56] "doc.rust-lang.org".Archivedfrom the original on December 8, 2022.RetrievedDecember 14,2022.

[57] "HashSet Class (System.Collections.Generic)".learn.microsoft.RetrievedJuly 1,2023.

[58] tnet-bot."Dictionary Class (System.Collections.Generic)".learn.microsoft.RetrievedJanuary 16,2024.

[59] "VB.NET HashSet Example".Dot Net Perls.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

[22]

[23]

[24]

[25]

[26]

[27]

[28]

[29]

[30]

[31]

[32]

[33]

[34]

[35]

[36]

[37]

[38]

[39]

[40]

[41]

[42]

[43]

[44]

[45]

[46]

[47]

[48]

[49]

[50]

[51]

[52]

[53]

[54]

[55]

[56]

[57]

[58]

[59]

v t e Data structures
Types	Collection Container
Abstract	Associative array Multimap Retrieval Data Structure List Stack Queue Double-ended queue Priority queue Double-ended priority queue Set Multiset Disjoint-set
Arrays	Bit array Circular buffer Dynamic array Hash table Hashed array tree Sparse matrix
Linked	Association list Linked list Skip list Unrolled linked list XOR linked list
Trees	B-tree Binary search tree AA tree AVL tree Red–black tree Self-balancing tree Splay tree Heap Binary heap Binomial heap Fibonacci heap R-tree R* tree R+ tree Hilbert R-tree Trie Hash tree
Graphs	Binary decision diagram Directed acyclic graph Directed acyclic word graph
List of data structures

Hash table

History

Overview

Load factor

Load factor for separate chaining

Load factor for open addressing

Hash function

Integer universe assumption

Hashing by division

Hashing by multiplication

Choosing a hash function

Collision resolution

Separate chaining

Other data structures for separate chaining

Caching and locality of reference

Open addressing

Caching and locality of reference

Other collision resolution techniques based on open addressing

Coalesced hashing

Cuckoo hashing

Hopscotch hashing

Robin Hood hashing

Dynamic resizing

Resizing by moving all entries

Alternatives to all-at-once rehashing

Linear hashing

Performance

Applications

Associative arrays

Database inde xing

Caches

Sets

Transposition table

Implementations

See also

References

Further reading

External links