IEqualityComparer not working as intended

梦毁少年i 2021-01-11 15:01

I have a List of paths to files stored on my computer. My aim is to first filter out the files which have the same name and then filter out those which have

3 Answers
  • 2021-01-11 15:17

    Your GetHashCode must return the same value for any two objects that your Equals treats as equal:

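    // For the file-name comparer: hash the same value that Equals compares
    public int GetHashCode(string obj)
    {
        return Path.GetFileName(obj).GetHashCode();
    }
    
    // For the file-size comparer: hash the file length
    public int GetHashCode(string obj)
    {
        return new FileInfo(obj).Length.GetHashCode();
    }
    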
    

    But there is a much easier way to solve the whole problem without the extra classes:

    var query = FilesList
                    .GroupBy(f => Path.GetFileName(f)).Select(g => g.First())
                    .GroupBy(f => new FileInfo(f).Length).Select(g => g.First())
                    .ToList();
    
  • 2021-01-11 15:20

    The hash code is used before Equals is ever called. Since your code gives different hash codes for items that are equal, you're not getting the desired result. Instead, you have to make sure the hash code returned is equal when the items are equal, so for example:

    public class CustomTextComparer : IEqualityComparer<string>
    {
        public bool Equals(string x, string y)
        {
            // Two paths are equal when their file names match
            return Path.GetFileName(x) == Path.GetFileName(y);
        }
        public int GetHashCode(string obj)
        {
            return Path.GetFileName(obj).GetHashCode();
        }
    }
    

    However, as Piotr pointed out, this isn't exactly a good way to go about your goal, since you're going to be making a lot of Path.GetFileName and new FileInfo calls respectively, which is going to be a significant performance hit, especially since you're dealing with the file system, which is not exactly known for its speed of response.

  • 2021-01-11 15:28

    Change your GetHashCode to work on the compared value. I.e. for your size comparer:

    public int GetHashCode(string obj)
    {
        return new FileInfo(obj).Length.GetHashCode();
    }
    

    And for the other:

    public int GetHashCode(string obj)
    {
        return Path.GetFileName(obj).GetHashCode();
    }
    

    According to this answer (What's the role of GetHashCode in the IEqualityComparer<T> in .NET?), the hash code is evaluated first. Equals is called only when two hash codes match.
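
    To see the hash-first behaviour in isolation, here is a minimal sketch that needs no real files. BrokenNameComparer, FixedNameComparer, HashDemo and the sample paths are all made up for illustration, and the broken variant is only a guess at what the original comparer looked like (the question's code is not shown here).

    ```csharp
    using System;
    using System.Collections.Generic;
    using System.IO;
    using System.Linq;

    // Hypothetical broken comparer: Equals looks at the file name only,
    // but GetHashCode hashes the whole path, so equal items get different hashes.
    public class BrokenNameComparer : IEqualityComparer<string>
    {
        public int EqualsCalls; // counts how often Distinct actually calls Equals

        public bool Equals(string x, string y)
        {
            EqualsCalls++;
            return Path.GetFileName(x) == Path.GetFileName(y);
        }

        public int GetHashCode(string obj) => obj.GetHashCode();
    }

    // Fixed comparer: the hash is computed from the same value that Equals compares.
    public class FixedNameComparer : IEqualityComparer<string>
    {
        public int EqualsCalls;

        public bool Equals(string x, string y)
        {
            EqualsCalls++;
            return Path.GetFileName(x) == Path.GetFileName(y);
        }

        public int GetHashCode(string obj) => Path.GetFileName(obj).GetHashCode();
    }

    public static class HashDemo
    {
        // Illustrative paths only; nothing is read from disk.
        public static readonly string[] Paths =
        {
            "/data/a/report.txt",
            "/data/b/report.txt", // same file name as the first entry
            "/data/a/notes.txt",
        };

        public static int CountDistinct(IEqualityComparer<string> comparer)
            => Paths.Distinct(comparer).Count();
    }
    ```

    With the fixed comparer, HashDemo.CountDistinct should return 2, and Equals is called for the two report.txt entries because their hashes match. With the broken comparer you would normally get 3 and Equals is never reached, since the full paths hash differently (barring a chance hash collision).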

    Obviously it would be sensible to work on FileInfos, not on strings.

    So maybe:

    FileList.Select(x => new FileInfo(x))
            .Distinct(new CustomTextComparer())
            .Distinct(new CustomSizeComparer());
    

    Of course, then you have to change your comparers to work on the correct type.
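
    A rough sketch of what the comparers might look like once they work on FileInfo. The class names CustomTextComparer and CustomSizeComparer come from the snippet above, but their bodies here are an assumption, and FileDeduplicator.Deduplicate is a hypothetical helper added only to make the example callable.

    ```csharp
    using System;
    using System.Collections.Generic;
    using System.IO;
    using System.Linq;

    // Compares files by name only (FileInfo.Name is derived from the path string).
    public class CustomTextComparer : IEqualityComparer<FileInfo>
    {
        public bool Equals(FileInfo x, FileInfo y) => x.Name == y.Name;

        public int GetHashCode(FileInfo obj) => obj.Name.GetHashCode();
    }

    // Compares files by size in bytes (FileInfo.Length queries the file system
    // and throws FileNotFoundException if the file does not exist).
    public class CustomSizeComparer : IEqualityComparer<FileInfo>
    {
        public bool Equals(FileInfo x, FileInfo y) => x.Length == y.Length;

        public int GetHashCode(FileInfo obj) => obj.Length.GetHashCode();
    }

    public static class FileDeduplicator
    {
        // Hypothetical helper: drop files with duplicate names first,
        // then drop files with duplicate sizes among the survivors.
        public static List<FileInfo> Deduplicate(IEnumerable<string> paths)
        {
            return paths.Select(p => new FileInfo(p))
                        .Distinct(new CustomTextComparer())
                        .Distinct(new CustomSizeComparer())
                        .ToList();
        }
    }
    ```

    In both comparers GetHashCode is derived from exactly the value that Equals compares, which is the property the answers above rely on.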
