Java/Kotlin: Finding the intersection of multiple HashSets by class ID

只愿长相守 提交于 2019-12-24 08:57:15

问题


I'm having trouble finding the intersection of an Array of Hashed Sets that contain a data Class (where I want to intersect by identifier):

class Protein(val id: String, val score: Double, val molw: Double, val spc: Int)

I've pulled in some data from a .csv file into this type of structure:

ArrayList<HashSet<Protein>>

So I have six array lists [1 for each csv], each containing one hashed set that contains thousands of Protein structures. Here's what I've tried so far to get an intersection HashSet based off of common Protein.id:

fun intersection(data: ArrayList<HashSet<Protein>>): HashSet<Protein> {

val intersectionSet = HashSet<Protein>(data[0])

for (i in 1..data.size) {
    intersectionSet.retainAll(data[i])
}
return intersectionSet
}

This returns an empty list, which makes sense given that it's trying to intersect Protein objects and match each criteria as a whole.

How do I call data[i].id as my intersection criteria? I'm fairly new to Kotlin and data classes :)


回答1:


If you add definitions for the hashCode and equals function in the Protein class as follows, then the HashSet will be able to appropriately check the intersection using the id field.

class Protein(val id: String, val score: Double, val molw: Double, val spc: Int) {
  override fun hashCode() = id.hashCode()
  override fun equals(other: Any?) = other?.let { id == (it as Protein).id } ?: false
}

Also you probably want to change the range in your loop within the intersection function to be 1..(data.size-1) instead of 1..data.size to avoid going out of bounds. Alternatively you could write it functionally as follows:

fun intersection(data: ArrayList<HashSet<Protein>>): HashSet<Protein> {
  return data.reduce { acc, it -> acc.apply { retainAll(it) } }
}



回答2:


To simplify the given answer, you can use apply:

return data.reduce { acc, it -> acc.apply { retainAll(it) } }


来源:https://stackoverflow.com/questions/49891344/java-kotlin-finding-the-intersection-of-multiple-hashsets-by-class-id

易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!