How to save row names when selecting each column independently instead of row number?

妖精的绣舞 提交于 2019-12-11 09:05:25

问题


I asked this question a couple days ago and @Juan Bosco helped me and suggested the code which works perfectly and selects top n values from each column. But turns out I need the names of each selected row for each column, so in the list: "Selectedrows", I need something like "t9", "t8", "t7" instead of row number.

 names<- c("t1","t10","t11","t2","t3","t4","t5","t6","t7","t8","t9")
 values1 <- c(2,3.1,4.5,5.1,6.5,7.1,8.5,9.11,10.1,11.8,12.3)
 values2 <- c(1,3.1,3,5.1,6.5,7.1,8.5,9.11,10.1,12,12) 
  mydf<- data.frame(names,values1,values2)


Selectedrows<- lapply(2:3, function(col_index) { 
max_values <- sort(mydf[[col_index]], decreasing = T)[1:3]
max_rows <- sapply(max_values, function(one_value){
 as.numeric(rownames(mydf[mydf[[col_index]] == one_value, ]))
})

unique(unlist(max_rows))[1:3]

})

Thanks


回答1:


In this case, you can use order, which returns the index which can be used to sort an array, the first three elements from the order results corresponds to the index of the top 3 values if you specify the array to be decreasing; With the index, you can subset the names column and get the corresponding values:

lapply(2:3, function(col_index) {
    mydf[["names"]][order(mydf[[col_index]], decreasing = T)[1:3]]
})

#[[1]]
#[1] "t9" "t8" "t7"

#[[2]]
#[1] "t8" "t9" "t7"

Notice this assumes your names column is character, if not. Do mydf$names <- as.character(mydf$names) firstly if you want the result to be character instead of factor.



来源:https://stackoverflow.com/questions/42743749/how-to-save-row-names-when-selecting-each-column-independently-instead-of-row-nu

标签
易学教程内所有资源均来自网络或用户发布的内容,如有违反法律规定的内容欢迎反馈
该文章没有解决你所遇到的问题?点击提问,说说你的问题,让更多的人一起探讨吧!