The following is the second in a short series of posts which attempt to document the Sincerity Lab functionality. This will make more sense if you read the first installment before this one.
To get started in using the tool visit https://steem-sincerity.dapptools.info/lab, and click 'Existing User Log In'. Use demo/demo as the account name and password.
Available Data Fields
There are many data fields that may be used in the data filter and the chart type options. Below I try to briefly explain what most of them are. Some are more useful than others, but since I collected the data, I decided to make them all available. I can also fairly easily add derived fields based on these, so would be pleased to hear any suggestions.
The raw reputation score that Steem uses internally
The reputation score that we're used to seeing in Steemit and other apps
How many vests the account holds (1000000 is approx 500SP)
How many vests the account has delegated to other accounts (1000000 is approx 500SP)
How many vests the account has been delegated from other accounts (1000000 is approx 500SP)
A measure of the Steem resources used by an account over the last 7 days
A measure of the Steem resources used by an account since account creation
The number of witness the account has voted for (max 30, and doesn't include proxy votes)
The following data correspond to the previous 14 day period:
How many posts an account has published
The average number of words used in posts by an account
The average number of tags used in posts by an account
The gini-simpson index showing the probability that two randomly selected words in an account's posts are different
The gini-simpson index showing the probability that two randomly selected account posts are different
The average number of characters used in the words of posts by an account
The gini-simpson index showing the probability that two randomly posts have different Steem categories
The standard deviation (in seconds) of the time interval between an accounts posts
How many comments an account has published
The average number of words used in comments by an account
The gini-simpson index showing the probability that two randomly selected words in an account's comments are different
The gini-simpson index showing the probability that two randomly selected account comments are different
The standard deviation (in seconds) of the time interval between an accounts comments
The average number of characters used in the words of comments by an account
The average number of images used in comments by an account
The average number of links used in comments by an account
The average number of tags used in comments by an account
The number of comments an account made where is was replying to a reply to it
The gini-simpson index showing the probability that two randomly comments had different parent authors
The total count of upvotes and downvotes given by the account
The count of upvotes given by the account
The count of downvotes (flags) given by the account
The average percentage of vote power that an account has chosen with their vote slider
The percentage of votes given by an account that go to itself
The gini-simpson index showing the probability that two randomly selected votes went to different accounts
The total count of upvotes and downvotes given to the account
The count of upvotes given to the account
The count of downvotes (flags) given to the account
The average percentage of vote power that an accounts voting for an account have chosen with their vote slider
The percentage of votes given by an account that go to itself (by count of votes)
The gini-simpson index showing the probability that two randomly selected votes came from different accounts (by count of votes)
The number of transfers of STEEM or SBD which include Steemit/Busy links
The amount of STEEM sent to promotional bots by the account
The amount of SBD sent to promotional bots by the account
The count of message transfers (0.01 or less) sent by the account
The count of message transfers (0.01 or less) received by the account
The probability that the account belongs to a human content creator.
The probability that the account belongs to a a spammer.
The probability that the account belongs to a a bot.
The age is whole weeks of the account.
To make the fit better in the chart space, some fields are plotted more conveniently using a log scale. The following correspond to some of the fields above except that they are reduced to their log10 value.
log10_avg_bandwidth, log10_life_bandwidth, log10_owned_vests, log10_delegated_vests, log10_received_vests, log10_post_count, log10_comment_count, log10_post_average_word_count, log10_comment_average_word_count, log10_comment_exchange_count
It can be useful to view some metrics in terms of where each account ranks against all the other active accounts. The following correspond to some fields above except that they are adjusted to show the relative position of the account. For example a rank of 1 means that the account has the highest value of all accounts. Where several account share the same base value, the rank shows an average of all rank positions occupied by the group of accounts.
rank_reputation, rank_owned_vests, rank_classification_human_score, rank_classification_spammer_score, rank_classification_bot_score, rank_comment_exchange_count, rank_average_bandwidth
This one gives an idea of what it typically takes to become a top 20 witness.
Here are two charts for comparison, the first show the followers of my @andybets account, and this can be compared to the one beneath which shows the equivalent data for my @steemreports account. It's easy to see that steemreports has more accounts which rank higher in terms of Vested SP and Human Classification score. The unrelated points were removed from the graph by changing the sample size to 0%, which leaves only the followers.
This is a pretty one. It shows the average length of words used in comments and posts. The red points again indicate followers of steemreports. As expected, comments have slightly shorter words - nice post sir!
This clearly shows that most votebots are paid in SBD rather than Steem. The accompanying table lists the 25 accounts which have spent most SBD.
The relationship between reputation and transfers to votebots is shown here.
Here is a chart showing the relationship between the number of upvotes and downvotes (flags) the accounts are giving to others. It has been argued that the platform might work better if a larger percentage of votes were downvotes, but the absence of curation rewards for downvotes means such altruism is rare. In this case the red point identify accounts the @andybets is following.
This chart shows the outgoing versus incoming self-votes. Note that it is currently by count rather than rshares though, so may not be highly representative for your needs. Still I think it's clear where we'd like to see most accounts. When there is high density of points like in the bottom left, it can be helpful to reduce the sample size to get a better indication of the distribution.
That's all for now! There will more (probably shorter) posts in this series soon.
You got a 9.57% upvote from @mitsuko courtesy of @steemium! .
Join our team and delegate your SP to earn passive income.
Get part of 90% @mitsuko profit based on your delegated SP
Delegate 10 SP
Delegate 100 SP
Delegate 200 SP
Delegate 300 SP
Delegate 500 SP
Delegate 1000 SP
or custom delegation SP
Thank you so much for developing the extension for chrome, and for working to stop spam on the platform. I noticed the comments from bots on users who just comment "Great post" on everything. Posting generic, repeated comments annoys the crap out of me and just seems like they are doing it to gain attention/credibility. Great work by you!
Thanks - it drives me crazy too! I guess that's partly why I started the project.
You got a 10.26% upvote from @bid4joy courtesy of @steemium!
Congratulations! Your post has been selected as a daily Steemit truffle! It is listed on rank 12 of all contributions awarded today. You can find the TOP DAILY TRUFFLE PICKS HERE.
I upvoted your contribution because to my mind your post is at least 59 SBD worth and should receive 144 votes. It's now up to the lovely Steemit community to make this come true.
TrufflePig, an Artificial Intelligence Bot that helps minnows and content curators using Machine Learning. If you are curious how I select content, you can find an explanation here!
Have a nice day and sincerely yours,
Beep bop, this is @pushbot.
I just received a signal from the Mother Ship that you may require a push.
You just got a 10.37% upvote courtesy of @steemium!
Message from the Mother Ship:
You can earn daily profit by delegating SP to make @pushbot stronger. Delegators receive a share in 95% of the earnings.
10 SP • 20 SP • 50 SP
100 SP • 200 SP • 500 SP
1000 SP • 2000 SP • 5000 SP
Any Other Amount
You got a 12.79% upvote from @slimwhale courtesy of @steemium!
You got a 15.38% upvote from @joeparys! Thank you for your support of our services. To continue your support, please follow and delegate Steem power to @joeparys for daily steem and steem dollar payouts!
This post has received a 50.00% upvote from @aksdwi thanks to: @steemium.
♡ Your post just received some love and positivity from @haveaheart. Thanks for your generous contribution.
You just received a 9.25% upvote from @honestbot, courtesy of @steemium!
This post has received a 50.00 % upvote from @voterunner thanks to: @steemium. BIG NEWS: Build your passive income with daily payouts from @voterunner! Read more about earning SBD with me. Daily. The easy way!
Your level lowered and you are now a Red Fish!
Do not miss the last announcement from @steemitboard!
I must confess I am mystified at the fact there is a difference between outgoing selfvotes and incoming selfvotes. It would seem to my somewhat mathchallenged brain that both measures of selfvotes should match precisely.
Would you be kind enough to explain why there is a difference?