Inspect repository data, including countries and organizations of stargazers and forkers.
repoInspector is a GitHub repository inspector built for anyone to gather useful data on open source repositories. Using the open GitHub API, we package and present available data about any repository, including:
This one’s designed to be simple to use as a Chrome extension, for startup founders, devs, investors, and really - anyone!
repo:status Access commit status
repo_deployment Access deployment status
public_repo Access public repositories
repo:invite Access repository invitations
read:packages Download packages from GitHub Package Registry
read:org Read org and team membership, read org projects
read:public_key Read user public keys
read:repo_hook Read repository hooks
read:user Read ALL user profile data
user:email Access user email addresses (read-only)
read:discussion Read team discussions
read:enterprise Read enterprise profile data
read:project Read access of projects
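If you want to double-check that a token carries the scopes listed above, a minimal sketch follows. It assumes a classic personal access token, for which GitHub's REST API reports granted scopes in the `X-OAuth-Scopes` response header. The helper names (`parse_scopes`, `missing_scopes`, `fetch_scope_header`) are hypothetical and are not part of repoInspector.

```python
import urllib.request

# Scopes copied from the list above.
REQUIRED_SCOPES = {
    "repo:status", "repo_deployment", "public_repo", "repo:invite",
    "read:packages", "read:org", "read:public_key", "read:repo_hook",
    "read:user", "user:email", "read:discussion", "read:enterprise",
    "read:project",
}


def parse_scopes(header_value: str) -> set[str]:
    """Split a comma-separated X-OAuth-Scopes header value into a set."""
    return {s.strip() for s in header_value.split(",") if s.strip()}


def missing_scopes(header_value: str) -> set[str]:
    """Return the required scopes that the header does not list literally."""
    return REQUIRED_SCOPES - parse_scopes(header_value)


def fetch_scope_header(token: str) -> str:
    """Ask the GitHub API which scopes a token has (makes a network call)."""
    req = urllib.request.Request(
        "https://api.github.com/user",
        headers={"Authorization": f"Bearer {token}"},
    )
    with urllib.request.urlopen(req) as resp:
        return resp.headers.get("X-OAuth-Scopes", "")


# Offline example with a made-up header value:
print(sorted(missing_scopes("read:user, user:email, public_repo")))
```

This comparison is literal. A token granted a broader parent scope (for example `repo`, which covers `repo:status`) would still be reported as missing the narrower one, so treat the result as a hint rather than a verdict.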
:warning: Set the token’s expiration to whatever you’re comfortable with. We’ll never ask you for your token, and it will always be stored on your computer.
You’re ready to start inspecting repositories! Here’s how that works:
In your browser, go to the GitHub page for a repository you’re curious about, open the Chrome extension and click Inspect. Before you inspect, you have a few options for the data. Toggle between receiving Only Stars (default), Only Forks, Stars and Forks, or Sampling.
Once you click Inspect, you’ll see a progress window.
Note - it may take a few seconds to start showing progress, and the more Stars or Forks a repository has, the longer the inspection will take to start and progress.
Note - up to 40k Stars and 40k Forks can be pulled for any one inspection.
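To get a rough feel for why large repositories take longer, here is a small sketch of the arithmetic. The 40k cap comes from the note above. The page size of 100 is an assumption: it is the maximum GitHub allows per request, but the extension may use a smaller value. The function names are hypothetical.

```python
import math

MAX_ITEMS = 40_000  # cap per inspection, from the note above
PAGE_SIZE = 100     # assumption: items fetched per API request


def items_pulled(total: int) -> int:
    """Number of stars (or forks) actually pulled, after applying the cap."""
    return min(max(total, 0), MAX_ITEMS)


def requests_needed(total: int, page_size: int = PAGE_SIZE) -> int:
    """Estimate how many paginated requests one inspection would make."""
    return math.ceil(items_pulled(total) / page_size)


for stars in (250, 12_000, 55_000):
    print(stars, items_pulled(stars), requests_needed(stars))
```

Under these assumptions a repository at or above the cap needs 400 requests for Stars alone, and the same again for Forks if both are selected.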
Your result will arrive in your inbox once the progress bar hits 100%. If you don't see the email right away, check your spam folder. The report will be sent from [email protected].
repoInspector was originally designed and started by the team at Hetz Ventures. We were looking for a simple way to access useful data on repository user activity around GitHub projects for industry insights, due diligence, comparative analysis, etc.
This Chrome extension is useful for other investors, startup founders and really anyone looking to better understand user behavior and their markets.
We welcome contributors to this project! Here’s how you can get set up:
This repository contains both a client and a server. You can choose to work only on the client side (i.e. the Chrome extension) or on both. To set up the client side, clone the repo and:
cd chrome-extension
yarn install
yarn generate-ts-gql (This command must be executed each time after creating a new query to generate new types)
yarn build-watch
This will create a dist folder. Go to chrome://extensions and click 'Load unpacked', then navigate to the dist folder and click Open. The extension should load, and it will reload with every save to the codebase.
yarn run build
yarn run lint
cd server
pip install -r requirements.txt
Create a file called .secrets.toml in the server directory and fill it in with the following (fill in the missing variables):
[development]
dynaconf_merge = true
[development.db]
sql_url = ''
[development.google]
data_client_id = ""
[development.email]
username = ''
password = ''
smtp_server = ''
port = ''
[development.security]
# openssl rand -hex 32
SECRET_KEY = ""
uvicorn main:app
The server is set up to run on Heroku. To push to Heroku, add a new Heroku remote and run the following from the base folder:
git subtree push --prefix server heroku main
The environment variable convention on the server is API_DB__<var name>, so for sql_url it would be API_DB__sql_url.
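The naming convention above can be sketched in a few lines of Python. The source only gives the db example, so generalizing it to other sections (such as email) is an assumption, as are the helper names `env_var_name` and `settings_from_env`. This is an illustration of the pattern, not the server's actual loading code.

```python
PREFIX = "API"  # prefix taken from the convention above


def env_var_name(section: str, key: str) -> str:
    """Build a variable name, e.g. ("db", "sql_url") -> "API_DB__sql_url"."""
    return f"{PREFIX}_{section.upper()}__{key}"


def settings_from_env(environ: dict[str, str]) -> dict[str, dict[str, str]]:
    """Collect API_<SECTION>__<key> variables into a nested dict."""
    settings: dict[str, dict[str, str]] = {}
    for name, value in environ.items():
        # Skip anything that does not follow the prefix/double-underscore shape.
        if not name.startswith(PREFIX + "_") or "__" not in name:
            continue
        section, key = name[len(PREFIX) + 1:].split("__", 1)
        settings.setdefault(section.lower(), {})[key] = value
    return settings


print(env_var_name("db", "sql_url"))
```

Passing `os.environ` to `settings_from_env` would give a quick view of which settings are currently provided through the environment.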
Some data is classified by our business logic and is not fetched directly from GitHub. The following are data points we have defined: