You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Public user profiles on the Interep supported provider platforms (currently GitHub, Reddit, Twitter)
twitter, reddit, github: manual collecting of data via public APIs
Objectives
Collect a data sample of reasonable size: between 100-1000 public user profiles for each provider
Evaluate the current shape of the reputation distribution for each provider/
Define appropriate level thresholds so that the distribution is skewed from undefined to gold
Indeed common sense tells that there should be a lot of undefined or bronze, some silver but just a few gold.
Scripts
Data was collected running scripts defined in /scrapers (see index.ts).
Example for Twitter:
Specific reputation algorithms for each provider were defined empirically based on data analysis. There are 5 tiers: commoner, up-and-coming, established, star and icon.
Twitter
followers
< 100
< 1k
< 10k
< 100k
100k+
is likely bot (botometer cap >= 0.95)
commoner
commoner
commoner
commoner
commoner
is likely not bot (botometer cap < 0.95) & not verified
commoner
up-and-coming
established
star
icon
is likely not bot (botometer cap < 0.95) & not verified
commoner
up-and-coming
established
star
icon
is likely not bot (botometer cap < 0.95) & verified
established
established
established
star
icon
Tiers distribution simulation results
Reddit
total karma
< 2k
< 20k
< 100k
< 200k
200k+
is gold
up-and-coming
up-and-coming
established
star
icon
is not gold
commoner
up-and-coming
established
star
icon
Tiers distribution simulation results
GitHub
stars
0
<10
< 100
< 1000
1000+
neither sponsored nor sponsoring
commoner
up-and-coming
established
star
icon
sponsors or sponsoring
established
established
established
star
icon
Tiers distribution simulation results
About
Exploratory Data Analysis on public user profiles from Interep providers