I would like to propose adding ghmap to the list of projects built on top of the GH Archive.
ghmap is an open-source tool that maps raw GitHub events (such as those provided by GH Archive) into higher-level actions and activities, enabling large-scale analysis of contributor behavior and workflows.
It has been used in academic research to construct a dataset covering 3 years of activity in the NumFocus ecosystem (180K+ contributors, 2.8K+ repositories).
Links:
ghmap directly builds on GH Archive data to perform semantic event transformations, which fits well with the scope of this list.
Thanks for maintaining this great resource!
I would like to propose adding ghmap to the list of projects built on top of the GH Archive.
ghmap is an open-source tool that maps raw GitHub events (such as those provided by GH Archive) into higher-level actions and activities, enabling large-scale analysis of contributor behavior and workflows.
It has been used in academic research to construct a dataset covering 3 years of activity in the NumFocus ecosystem (180K+ contributors, 2.8K+ repositories).
Links:
ghmap directly builds on GH Archive data to perform semantic event transformations, which fits well with the scope of this list.
Thanks for maintaining this great resource!