20.2 Distribution By Other Hash Functions

HashMaps/Tables have fast lookup times, but behind that "superpower" is a hash function.


Last updated 2 years ago

Suppose our hashCode() implementation simply returns 0.

@Override
public int hashCode() {
    return 0;
}

What distribution do we expect?

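Every object hashes to 0, so every item lands in bucket 0 and the table degenerates into one long LinkedList, which makes lookup linear in the number of items. Suppose instead that hashCode() returns the object's num field. Taking the hash code modulo the number of buckets turns it into a valid index that identifies which LinkedList to add to. When we add a run of consecutive numbers, the items spread evenly across the LinkedLists, so each list stays short and lookup takes roughly constant time.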

@Override
public int hashCode() {
    return num;
}

This hash function should yield a much more even distribution! Objects with different num values will now be spread across the buckets instead of all living in the 0th bucket. If our class does not explicitly override hashCode(), Java uses the default implementation inherited from Object, which is typically described as returning the object's address in memory as its hash code (the Java specification allows, but does not require, the value to be derived from the address).
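To make the contrast between the two hashCode() implementations concrete, here is a minimal sketch that counts how many keys land in each bucket. The class name BucketSpreadDemo and the helper spread are hypothetical names chosen for illustration, and the bucket index is assumed to be the hash code reduced modulo the number of buckets.

```java
import java.util.Arrays;

class BucketSpreadDemo {
    /**
     * Hypothetical helper: inserts the keys 0..n-1 and counts how many
     * land in each bucket. If constantHash is true, every key hashes to 0
     * (the "return 0" version); otherwise a key's hash is its own value
     * (the "return num" version).
     */
    static int[] spread(int n, int numBuckets, boolean constantHash) {
        int[] counts = new int[numBuckets];
        for (int num = 0; num < n; num++) {
            int hash = constantHash ? 0 : num;
            // floorMod keeps the index non-negative even for negative hashes.
            counts[Math.floorMod(hash, numBuckets)]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        System.out.println(Arrays.toString(spread(20, 4, true)));  // [20, 0, 0, 0]
        System.out.println(Arrays.toString(spread(20, 4, false))); // [5, 5, 5, 5]
    }
}
```

With the constant hash, all 20 keys pile up in bucket 0; with the num-based hash, each of the 4 buckets holds 5 keys.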

Why Bother With Custom Hash Functions?

Let's discuss whether the default hashCode function is a good one! It actually gives a good spread, because different objects live at different places in memory and the memory address is effectively random. Since objects are in effect assigned random indices in the hash table, we get a good distribution.
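One rough way to observe this spread is to create many objects that do not override hashCode() and tally which bucket each would fall into. The sketch below uses hypothetical names (DefaultHashDemo, spreadOfDefaultHash) and again assumes the bucket index is the hash code modulo the number of buckets. The exact counts vary from run to run and between JVMs, so no expected output is shown.

```java
import java.util.Arrays;

class DefaultHashDemo {
    /**
     * Hypothetical helper: creates n plain Objects, which use the default
     * hashCode() inherited from Object, and counts how many fall into
     * each of numBuckets buckets.
     */
    static int[] spreadOfDefaultHash(int n, int numBuckets) {
        int[] counts = new int[numBuckets];
        for (int i = 0; i < n; i++) {
            Object o = new Object(); // no hashCode() override
            counts[Math.floorMod(o.hashCode(), numBuckets)]++;
        }
        return counts;
    }

    public static void main(String[] args) {
        // Counts differ between runs; they are usually close to n / numBuckets each.
        System.out.println(Arrays.toString(spreadOfDefaultHash(1000, 4)));
    }
}
```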

This raises an interesting question: why do we care about custom hash functions when the default hashCode already gets a good spread? We'll read about this in the