Git hub users analysis repo
-
How data was scrapped
-
Created a user token for using rest api in github
-
Used chatgpt to generate python code.
-
Used google collab to run python note book
-
The only change which had to be done in python code was replacement of user token
-
Ran the program which generated the top 100 users in bangalore as csv (this gave only 5-6 users)
-
So step5 raised a doubt and ran the users script with Bengaluru (this gave 20 users).. So overall step 5 + Step 5.5 = 25 users
-
Ran the second program which used the top100userscsv as input
-
Both the output as well as the programs have been checked in to this repo.
-
-
Interesting/Surprising facts
-
Python is grouped individually as juniper note book and python (but overall had 450 references) making it most popular in github (please refer the pivot generated from repository-chart.xls)
-
Next in list were javascript and java respectively
-
Almost 30% of repo data had blanks in language which can affect the overall analysis
-
-
Recommendation
-
As per the trend for python and scripting languages(java script and html) there can be useful references available in github for reuse.
-
For overall development quickly chatgpt really helps.
-