小菜毛毛技术分享

与大家共同成长

:: 管理

164 Posts :: 141 Stories :: 94 Comments :: 0 Trackbacks

经常在论坛上面看到覆写hashCode函数的问题，很多情况下是一些开发者不了解hash code，或者和equals一起用的时候不太清楚为啥一定要复写hashCode。

对于hash code的理论我不想多说，这个话题太大。我只想说用hash code的原因只有一个：效率。理论的说法它的复杂度只有O(1)。试想我们把元素放在线性表里面，每次要找一个元素必须从头一个一个的找它的复杂度有O(n)。如果放在平衡二叉树，复杂度也有O(log n)。

为啥很多地方说“覆写equals的时候一定要覆写hashCode”。说到这里我知道很多人知道有个原则：如果a.equals(b)那么要确保a.hashCode()==b.hashCode()。为什么？hashCode和我写的程序的业务逻辑毫无关系，为啥我要override? 要我说如果你的class永远不可能放在hash code为基础的容器内，不必劳神，您真的不必override hashCode() :)

说得准确一点放在HashMap和Hashtable里面如果是作为value而不是作为key的话也是不必override hashCode了。至于HashSet，实际上它只是忽略value的HashMap,每次HashSet.add(o)其实就是 HashMap.put(o, dummyObject)。

那为什么放到Hash容器里面要overide hashCode呢？因为每次get的时候HashMap既要看equals是不是true也要看hash code是不是一致，put的时候也是要看equals和hash code。

如果说到这里您还是不太明白，咱就举个例子：

譬如把一个自己定义的class Foo{...}放到HashMap。实际上HashMap也是把数据存在一个数组里面，所以在put函数里面，HashMap会调 Foo.hashCode()算出作为这个元素在数组里面的下标，然后把key和value封装成一个对象放到数组。等一下，万一2个对象算出来的 hash code一样怎么办？会不会冲掉？先回答第2个问题，会不会冲掉就要看Foo.equals()了，如果equals()也是true那就要冲掉了。万一是false,就是所谓的collision了。当2个元素hashCode一样但是equals为false的时候，那个HashMap里面的数组的这个元素就变成了链表。也就是hash code一样的元素在一个链表里面，链表的头在那个数组里面。

回过来说get的时候，HashMap也先调key.hashCode()算出数组下标，然后看equals是不是true，所以就涉及了equals。

反观假设如果a.equals(b)但是a.hashCode()!=b.hashCode()的话，在put元素a之后，我们又用一个 a.equals(b)但是b.hashCode()!=a.hashCode()的b元素作为key来get的时候就找不到a了。如果 a.hashCode()==b.hashCode()但是!a.equals(b)倒是不要紧，这2个元素会collision然后被放到链表，只是效率变差。

这里有个非常简化版的HashMap实现帮助大家理解。

view plain copy to clipboard print ?

/*
* Just to demonstrate hash map mechanism,
* Please do not use it in your commercial product.
*
* @author Shengyuan Lu 卢声远 <michaellufhl@yahoo.com.cn>
*/
public class SimpleHashMap {
ArrayList<LinkedList<Entry>> entries = new ArrayList<LinkedList<Entry>>();
/**
* Each key-value is encapsulated by Entry.
*/
static class Entry {
Object key;
Object value;
public Entry(Object key, Object value) {
this.key = key;
this.value = value;
}
}
void put(Object key, Object value) {
LinkedList<Entry> e = entries.get(key.hashCode());
if (e != null) {
for (Entry entry : e) {
if (entry.key.equals(key)) {
entry.value = value;// Match in lined list
return;
}
}
e.addFirst(new Entry(key, value));// Add the entry to the list
} else {
// Put the new entry in array
LinkedList<Entry> newEntry = new LinkedList<Entry>();
newEntry.add(new Entry(key, value));
entries.add(key.hashCode(), newEntry);
}
}
Object get(Object key) {
LinkedList<Entry> e = entries.get(key.hashCode());
if (e != null) {
for (Entry entry : e) {
if (entry.key.equals(key)) {
return entry.value;
}
}
}
return null;
}
/**
* Do we need to override equals() and hashCode() for SimpleHashMap itself?
* I don't know either:)
*/
}

/*
* Just to demonstrate hash map mechanism,
* Please do not use it in your commercial product.
*
* @author Shengyuan Lu 卢声远 <michaellufhl@yahoo.com.cn>
*/
public class SimpleHashMap {
ArrayList<LinkedList<Entry>> entries = new ArrayList<LinkedList<Entry>>();
/**
* Each key-value is encapsulated by Entry.
*/
static class Entry {
Object key;
Object value;
public Entry(Object key, Object value) {
this.key = key;
this.value = value;
}
}
void put(Object key, Object value) {
LinkedList<Entry> e = entries.get(key.hashCode());
if (e != null) {
for (Entry entry : e) {
if (entry.key.equals(key)) {
entry.value = value;// Match in lined list
return;
}
}
e.addFirst(new Entry(key, value));// Add the entry to the list
} else {
// Put the new entry in array
LinkedList<Entry> newEntry = new LinkedList<Entry>();
newEntry.add(new Entry(key, value));
entries.add(key.hashCode(), newEntry);
}
}
Object get(Object key) {
LinkedList<Entry> e = entries.get(key.hashCode());
if (e != null) {
for (Entry entry : e) {
if (entry.key.equals(key)) {
return entry.value;
}
}
}
return null;
}
/**
* Do we need to override equals() and hashCode() for SimpleHashMap itself?
* I don't know either:)
*/
}

这个问题的权威阐释可以参考Bloch的<Effective Java>的 Item 9: Always override hashCode when you override equals

posted on 2010-08-27 09:51 小菜毛毛阅读(417) 评论(0) 编辑收藏所属分类: 面试

新用户注册刷新评论列表


只有注册用户登录后才能发表评论。




网站导航: 博客园博客园最新博文博问管理
相关文章: java sax 解析实例 Spring 框架的设计理念与设计模式分析 java反射详解 Java对象序列化（整理篇） java中文汉字排序 Struts1和Struts2的区别和对比: javac -classpath的使用 JAVAC 命令详解(http://www.cnblogs.com/jeffchen/archive/2008/01/16/1041783.html) Java虚拟机参数详解 web.xml中获取全局参数

小菜毛毛技术分享

常用链接

留言簿(15)

我参与的团队

随笔分类

随笔档案

文章分类

文章档案

新闻档案

收藏夹

搜索

最新评论

阅读排行榜

评论排行榜