A very common task in data science is parsing logs produced by an application. In this recipe, we will write a short snippet that shows how to analyze the contributions of committers to a Git repository.
Parsing Git logs with regular expressions
Getting ready
To run this recipe, you need to have the DataFrames.jl and DataFramesMeta.jl packages installed. If they are missing, run the following commands to add them:
julia> using Pkg
julia> Pkg.add("DataFrames")
julia> Pkg.add("DataFramesMeta")
Also, you need to have Git installed. You can get it from https://git-scm.com/.
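Before moving on, it can help to confirm that Julia can actually find the Git executable. The following is a minimal sketch, not part of the recipe itself: the `git_version` helper is a hypothetical name introduced here for illustration, and it assumes that Git, if installed, is reachable through your `PATH`.

```julia
# Hypothetical helper, not part of the original recipe.
# Returns the output of `git --version` as a string,
# or `nothing` if no `git` executable is found on PATH.
function git_version()
    git_path = Sys.which("git")            # full path to git, or `nothing`
    git_path === nothing && return nothing
    return strip(read(`$git_path --version`, String))
end

version = git_version()
if version === nothing
    println("Git was not found on PATH; install it before continuing.")
else
    println("Found ", version)
end
```

If the message reports that Git was not found even though it is installed, the likely cause is that its directory is not on the `PATH` seen by the Julia process.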
When you run the git log --stat command on a repository, it prints output that looks similar...