Determine current code distribution by author

后端 未结 3 1396
南旧
南旧 2021-02-15 20:55

I thought it would be neat if it were possible to take a Git repository, run some script, and have it generate the number of lines in the code base, and the proportion of each a

3条回答
  •  别跟我提以往
    2021-02-15 21:41

    You could try to parse the output of git-blame. This command gives the last person that edited each line of a file.

    This example is not exactly what you want but I think it gives you the idea:

    git blame -e the/file | awk -F '<|>' '{print $2}' | sort | uniq -c
    

    This will print the e-mail addresses of the authors together with the number of lines they modified lastly for a file, for example:

         47 foo@bar.com
      34712 blah@baz.com
    

    To make it run on the whole repository, you can do something like this:

    git ls-files | while read f; do git blame -e $f; done | awk -F '<|>' '{print $2}' | sort | uniq -c
    

    The idea here is to first generate the list of files with git ls-files, and then run the above snippet on each of the files (using the snippet mentioned here). If you're running this on a large codebase, you may want to store intermediate results in temporary files rather than use pipes.

提交回复
热议问题