Space and time trade-offs
The idea is to preprocess the problem’s input,
in whole or in part, and store the additional information obtained to accelerate solving the problem afterward. We call this approach input enhancement
Counting methods for sorting
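One counting method, distribution counting, illustrates input enhancement: a preprocessing pass counts how often each value occurs, and the sorted output is then produced directly from the counts. A minimal Python sketch, assuming the integers come from a known range (the function name and parameters are illustrative):

```python
def distribution_counting_sort(a, lo, hi):
    """Sort a list of integers known to lie in [lo, hi] by first
    counting occurrences (the input-enhancement step), then emitting
    each value as many times as it was seen."""
    counts = [0] * (hi - lo + 1)
    for v in a:
        counts[v - lo] += 1            # preprocessing: frequency of each value
    result = []
    for offset, c in enumerate(counts):
        result.extend([lo + offset] * c)
    return result
```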
Boyer-Moore algorithm for string matching and its simplified version suggested by Horspool
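In Horspool's simplified version, the enhancement step precomputes a shift table from the pattern; during the search, a mismatch lets the pattern jump forward by the table entry for the text character aligned with the pattern's last position. A Python sketch (function names are illustrative):

```python
def horspool_shift_table(pattern):
    """Precompute the shift for every character that occurs in
    pattern[:-1]; characters absent from the table get the default
    shift of len(pattern)."""
    m = len(pattern)
    table = {}
    for i in range(m - 1):
        table[pattern[i]] = m - 1 - i   # distance from the pattern's last position
    return table

def horspool_search(text, pattern):
    """Return the index of the first occurrence of pattern in text, or -1."""
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return 0 if m == 0 else -1
    table = horspool_shift_table(pattern)
    i = m - 1                           # text index aligned with pattern's last character
    while i < n:
        k = 0
        while k < m and pattern[m - 1 - k] == text[i - k]:
            k += 1                      # compare right to left
        if k == m:
            return i - m + 1            # full match found
        i += table.get(text[i], m)      # shift by the precomputed amount
    return -1
```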
The other type of technique that exploits space-for-time trade-offs simply uses
extra space to facilitate faster and/or more flexible access to the data. We call this
approach prestructuring
Prestructuring highlights two facets of this variation of the
space-for-time trade-off: some processing is done before the problem in question is actually solved but, unlike the input-enhancement variety, it deals with access
structuring
This approach is illustrated by
Hashing
Indexing with B-trees
Another design technique related to the space-for-time
trade-off idea: dynamic programming
This strategy is based on recording solutions to overlapping subproblems of a given problem in a table from which a solution to the problem in question is then obtained
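A minimal sketch of this strategy, using the Fibonacci numbers as the classic overlapping-subproblem example; here a dictionary plays the role of the table of recorded solutions:

```python
def fib(n, memo=None):
    """Compute the nth Fibonacci number by recording solutions to
    overlapping subproblems in a table (memoization), so each
    subproblem is solved only once."""
    if memo is None:
        memo = {0: 0, 1: 1}             # table of already-solved subproblems
    if n not in memo:
        memo[n] = fib(n - 1, memo) + fib(n - 2, memo)
    return memo[n]
```

Without the table, the same recursion would recompute the smaller subproblems an exponential number of times.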
the two resources—time and space—do not
have to compete with each other in all design situations. In fact, they can align to bring an algorithmic solution that minimizes both the running time and the space
consumed
Such a situation arises, in particular, when an algorithm uses a space-efficient data structure to represent a problem’s input, which leads, in turn, to a
faster algorithm
Data compression is also worth examining in this context
In data compression, size reduction is the goal rather than a technique for solving another problem.
Hashing
The elements
of this set can be of an arbitrary nature: numbers, characters of some alphabet, character strings, and so on
assume that we have to implement
a dictionary of n records with keys K1, K2,...,Kn
Hashing is based on the idea of distributing keys among a one-dimensional
array H[0..m − 1] called a hash table
The distribution is done by computing, for
each of the keys, the value of some predefined function h called the hash function. This function assigns an integer between 0 and m − 1, called the hash address, to a key
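One possible hash function for character-string keys, sketched in Python with Horner's rule; the multiplier 31 and the table size m = 13 used in the test are arbitrary example choices, not prescribed values:

```python
def string_hash(key, m):
    """Map a character string to a hash address in 0..m-1 by treating
    its characters as digits and folding them in via Horner's rule,
    reducing modulo m at each step to keep the number small."""
    h = 0
    for ch in key:
        h = (h * 31 + ord(ch)) % m      # 31 is a common small multiplier
    return h
```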
A hash function needs to satisfy these requirements
A hash function needs to distribute keys among the cells of the hash table as
evenly as possible
A hash function has to be easy to compute
A hash table’s size should not be excessively large compared to the number of
keys, but it should be large enough not to jeopardize the implementation’s time efficiency
Obviously, if we choose a hash table’s size m to be smaller than the number
of keys n, we will get collisions. But collisions should be
expected even if m is considerably larger than n. In fact, in the worst case, all the keys could be hashed to the same cell
of the hash table
Fortunately, with an appropriately chosen hash table size and a
good hash function, this situation happens very rarely. Still, every hashing scheme
must have a collision resolution mechanism. This mechanism is different in the two principal versions of hashing
Open hashing
Closed hashing
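The two versions can be sketched as follows, assuming separate chaining for open hashing and linear probing for closed hashing; the class names are invented, Python's built-in hash stands in for the hash function, and the closed table assumes the number of keys stays below m:

```python
class OpenHashTable:
    """Open hashing (separate chaining): each cell of the table holds
    a list of all keys that hash to it."""
    def __init__(self, m=11):
        self.m = m
        self.cells = [[] for _ in range(m)]
    def insert(self, key):
        cell = self.cells[hash(key) % self.m]
        if key not in cell:
            cell.append(key)            # collisions simply extend the cell's list
    def contains(self, key):
        return key in self.cells[hash(key) % self.m]

class ClosedHashTable:
    """Closed hashing (open addressing) with linear probing: all keys
    are stored in the table itself; a collision moves to the next
    cell, wrapping around. Assumes fewer than m keys are inserted."""
    def __init__(self, m=11):
        self.m = m
        self.cells = [None] * m
    def insert(self, key):
        i = hash(key) % self.m
        while self.cells[i] is not None and self.cells[i] != key:
            i = (i + 1) % self.m        # probe the next cell
        self.cells[i] = key
    def contains(self, key):
        i = hash(key) % self.m
        while self.cells[i] is not None:
            if self.cells[i] == key:
                return True
            i = (i + 1) % self.m
        return False
```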
Double hashing is superior to linear probing. But its performance also deteriorates when the table gets close to being full. A natural solution in such a situation is rehashing: the current
table is scanned, and all its keys are relocated into a larger table
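Rehashing can be sketched like this, assuming a linear-probing table represented as a Python list and a caller-supplied hash function h; the names are illustrative:

```python
def rehash(table, new_m, h):
    """Scan the current table and relocate all its keys into a larger
    table of size new_m, recomputing each key's address with the hash
    function h; linear probing resolves any new collisions."""
    new_table = [None] * new_m
    for key in table:
        if key is None:
            continue                    # skip empty cells of the old table
        i = h(key, new_m)
        while new_table[i] is not None:
            i = (i + 1) % new_m         # probe for a free cell
        new_table[i] = key
    return new_table
```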
A variation of hashing is extendible hashing
A location computed by a hash function in extendible hashing indicates a disk address of a bucket that can hold up to b keys. When a key’s bucket is identified, all its keys are read into main memory and then searched for the key in question
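A simplified sketch of that lookup, with read_bucket standing in for the single disk read that brings a whole bucket into main memory; the directory, the in-memory "disk", and all names here are illustrative assumptions:

```python
def bucket_search(directory, key, h, read_bucket):
    """Extendible-hashing lookup, simplified: h(key) selects a
    directory entry holding a bucket's disk address; read_bucket
    performs one (simulated) disk access that returns the bucket's
    up-to-b keys, which are then searched sequentially in memory."""
    addr = directory[h(key) % len(directory)]
    bucket = read_bucket(addr)          # one disk access fetches the whole bucket
    return key in bucket
```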