How can I rank observations within groups in Stata?

前端 未结 6 1433
闹比i
闹比i 2020-12-21 01:30

I have some data in Stata which look like the first two columns of:

group_id   var_to_rank  desired_rank
____________________________________

1           10         


        
相关标签:
6条回答
  • 2020-12-21 01:56

    Way too much work. Easy and elegant. Try this one.

    gen desired_rank=int(var_to_rank/10)

    0 讨论(0)
  • 2020-12-21 01:58

    I'd say this question is posed the wrong way round for best understanding. The aim is to group observations, those with the lowest value all being assigned a grade 1, the next lowest being all assigned 2 and so forth. This isn't ranking in most senses that I have seen discussed, but Stata's egen, rank() does get you part of the way.

    But the direct way, which was mentioned in the Statalist thread cited elewhere in this thread (start here) is simpler in spirit than any solution quoted:

    bysort group_id (var_to_rank): gen desired_rank = sum(var_to_rank != var_to_rank[_n-1]) 
    

    Once data are sorted on var_to_rank then when values differ from previous values at the start of each block of distinct values a value of 1 is the result of var_to_rank != var_to_rank[_n-1]; otherwise 0 is the result. Summing those 1s and 0s cumulatively gives the desired variable. The prefix command bysort does the sorting required and ensures that this is all done separately within the groups defined by group_id. No need for egen at all (a command that many people who only use Stata occasionally often find bizarre).

    Declaration of interest: The Statalist thread cited shows that when asked a similar question I too did not think of this solution in one.

    0 讨论(0)
  • 2020-12-21 02:04

    try this command, it works for me so well: egen newid=group(oldid)

    0 讨论(0)
  • 2020-12-21 02:06

    Stumbled upon such solution on the Statalist:

    bysort group_id (var_to_rank) : gen rank = var_to_rank != var_to_rank[_n-1]
    by group_id : replace rank = sum(rank)
    

    Seems to sort out this issue.

    0 讨论(0)
  • 2020-12-21 02:14

    The following works for me:

    bysort group_id: egen desired_rank=rank(var_to_rank)
    

    enter image description here

    0 讨论(0)
  • 2020-12-21 02:14

    @radek: you surely got it sorted out in the meantime ... but this would have been an easy (though not very elegant) solution:

    bysort group_id:   egen desired_rank_HELP =rank(var_to_rank), field
    egen desired_rank      =group(grup_id desired_rank_HELP)
    drop desired_rank_HELP
    
    0 讨论(0)
提交回复
热议问题