Map of GitHub

This map showcases over 400,000 GitHub projects, with each dot representing a project close to others with common stargazers. Created using a public dataset from Jan 2020 to March 2023, over 350 million stars were collected. Jaccard Similarity was used to compute relationships, resulting in 1000+ clusters using Leiden clustering. Cluster layouts were computed with ngraph.forcelayout, and the map was rendered using maplibre. Country labels were generated with ChatGPT, and geocoding was implemented for easy searching. The design of the map is a work in progress, with feedback on visual design welcome. Support and recognition go out to contributors and supporters. (Controversial: Use of public data sets and cloud computing raises privacy and security concerns)

https://github.com/anvaka/map-of-github

To top