探讨：Java中删除数组中重复元素-java删除数组中的某个元素

这个是一个老问题，但是发现大多数人说的还不够透。小弟就在这里抛砖引玉了，欢迎拍砖.......

问题：比如我有一个数组（元素个数为0哈），希望添加进去元素不能重复。

拿到这样一个问题，我可能会快速的写下代码，这里数组用ArrayList.

private static void testListSet(){  
        List<String> arrays = new ArrayList<String>(){  
            @Override 
            public boolean add(String e) {  
                for(String str:this){  
                    if(str.equals(e)){  
                        System.out.println("add failed !!!  duplicate element");  
                        return false;  
                    }else{  
                        System.out.println("add successed !!!");  
                    }  
                }  
                return super.add(e);  
            }  
        };  
                arrays.add("a");arrays.add("b");arrays.add("c");arrays.add("b");  
        for(String e:arrays)  
            System.out.print(e);  
    }

这里我什么都不关，只关心在数组添加元素的时候做下判断（当然添加数组元素只用add方法），是否已存在相同元素，如果数组中不存在这个元素，就添加到这个数组中，反之亦然。这样写可能简单，但是面临庞大数组时就显得笨拙：有100000元素的数组天家一个元素，难道要调用100000次equal吗？这里是个基础。

问题：加入已经有一些元素的数组了，怎么删除这个数组里重复的元素呢？

大家知道java中集合总的可以分为两大类：List与Set。List类的集合里元素要求有序但可以重复，而Set类的集合里元素要求无序但不能重复。那么这里就可以考虑利用Set这个特性把重复元素删除不就达到目的了，毕竟用系统里已有的算法要优于自己现写的算法吧。

public static void removeDuplicate(List<People> list){  
   HashSet<People> set = new HashSet<People>(list);  
   list.clear();  
   list.addAll(set);  
}  
 
ivate static People[] ObjData = new People[]{  
    new People(0, "a"),new People(1, "b"),new People(0, "a"),new People(2, "a"),new People(3, "c"),  
};

public class People{  
    private int id;  
    private String name;  
      
    public People(int id,String name){  
        this.id = id;  
        this.name = name;  
    }  
      
    @Override 
    public String toString() {  
        return ("id = "+id+" , name "+name);  
    }  
      
}

上面的代码，用了一个自定义的People类，当我添加相同的对象时候（指的是含有相同的数据内容），调用removeDuplicate方法发现这样并不能解决实际问题，仍然存在相同的对象。那么HashSet里是怎么判断像个对象是否相同的呢？打开HashSet源码可以发现：每次往里面添加数据的时候，就必须要调用add方法：

      @Override 
     public boolean add(E object) {  
         return backingMap.put(object, this) == null;  
     }

这里的backingMap也就是HashSet维护的数据，它用了一个很巧妙的方法，把每次添加的Object当作HashMap里面的KEY，本身HashSet对象当作VALUE。这样就利用了Hashmap里的KEY***性，自然而然的HashSet的数据不会重复。但是真正的是否有重复数据，就得看HashMap里的怎么判断两个KEY是否相同。

        @Override public V put(K key, V value) {  
390         if (key == null) {  
391             return putValueForNullKey(value);  
392         }  
393   
394         int hash = secondaryHash(key.hashCode());  
395         HashMapEntry<K, V>[] tab = table;  
396         int index = hash & (tab.length - 1);  
397         for (HashMapEntry<K, V> e = tab[index]; e != null; e = e.next) {  
398             if (e.hash == hash && key.equals(e.key)) {  
399                 preModify(e);  
400                 V oldValue = e.value;  
401                 e.value = value;  
402                 return oldValue;  
403             }  
404         }  
405   
406         // No entry for (non-null) key is present; create one  
407         modCount++;  
408         if (size++ > threshold) {  
409             tab = doubleCapacity();  
410             index = hash & (tab.length - 1);  
411         }  
412         addNewEntry(key, value, hash, index);  
413         return null;  
414     }

总的来说，这里实现的思路是：遍历hashmap里的元素，如果元素的hashcode相等（事实上还要对hashcode做一次处理），然后去判断KEY的eqaul方法。如果这两个条件满足，那么就是不同元素。那这里如果数组里的元素类型是自定义的话，要利用Set的机制，那就得自己实现equal与hashmap（这里hashmap算法就不详细介绍了，我也就理解一点）方法了：

public class People{  
    private int id; //  
    private String name;  
      
    public People(int id,String name){  
        this.id = id;  
        this.name = name;  
    }  
      
    @Override 
    public String toString() {  
        return ("id = "+id+" , name "+name);  
    }  
     
    public int getId() {  
        return id;  
    }  
 
    public void setId(int id) {  
        this.id = id;  
    }  
 
    public String getName() {  
        return name;  
    }  
 
    public void setName(String name) {  
        this.name = name;  
    }  
 
    @Override 
    public boolean equals(Object obj) {  
        if(!(obj instanceof People))  
            return false;  
        People o = (People)obj;  
        if(id == o.getId()&&name.equals(o.getName()))  
            return true;  
        else 
            return false;  
    }  
      
    @Override 
    public int hashCode() {  
        // TODO Auto-generated method stub  
        return id;  
        //return super.hashCode();  
    }  
}

这里在调用removeDuplicate(list)方法就不会出现两个相同的people了。

原文链接：http://www.cnblogs.com/slider/archive/2012/01/12/2320313.html

【编辑推荐】