Scribblehub stats!

Did this data surprise you?

  • Yes

    Votes: 15 65.2%
  • No

    Votes: 8 34.8%

  • Total voters
    23

Reisinling

Well-known member
Joined
Feb 5, 2021
Messages
357
Points
63
So, being the kind of person I am, I decided to spend my Friday evening writing a scrapper for SH and got some data that is super often asked on this forum. I assume it might be of interest to many. Data was scraped today, from the all time ranking list. I forgot to scrap the chapter number data, so all the "how many favorites per chapter is a good number" questions will be left for another time. I will also scrap RR data next time I guess

Totals:​

Novel count, sum of views on all novels, sum of favorites among all novels, and sum of words among all novels

Novels6 638
Views158 869 364
Favorites2 621 913
Words256 824 198

Views​

PercentileViews
99%440 197
98%252 916
97%161 856
96%126 872
95%93 875
94%80 234
93%65 682
92%54 904
91%45 400
90%38 730
85%20 000
80%11 700
75%7 375
70%5 000
65%3 500
60%2 500
55%1 900
50%1 500
45%1 100
40%879
35%685
30%547
25%423
20%315
15%226
10%158
5%102

Horizontal (x) axis is position in ranking
sh_views_log.png

sh_views_log_1-95.png


Second graph only shows data between top 99 and bottom 5. What we basically see from that is that there is a number of fictions that do not pass 100 views, which are pretty much stuck, then novels that have a steady growth rate (slowly grow readers, and so also views), growth starts accelerating when you get to top 1000, and then those that get to the top 1.5% get super accelerated growth, most likely due to being on top of all the rankings.

The weird steps in the middle are result of rounding, don't worry about it.

Readers​


PercentileReaders
99%1 644
98%1 196
97%991
96%810
95%677
94%593
93%510
92%449
91%391
90%350
85%219
80%153
75%110
70%82
65%62
60%48
55%38
50%31
45%25
40%19
35%15
30%12
25%10
20%7
15%5
10%4
5%2

Horizontal (x) axis is position in ranking, second chart has data between top 99 and bottom 15 (sorry for wrong label)
Pretty much the same conclusion as with views
sh_readers.png

sh_readers_log_1-85.png


PS: Am i the only one to whom all buttons for text formatting in forum posts are disabled?
PS2: If you liked this post, feel free to try the first 2 chapters of my thingy :P it's in top 15% of all novels, so it can't be that bad!
 

Attachments

  • data xlsx.zip
    698.2 KB · Views: 2,962
Last edited:

CadmarLegend

@Agentt found a key in the skeletons.
Joined
Jan 3, 2021
Messages
1,956
Points
153
So, being the kind of person I am, I decided to spend my Friday evening writing a scrapper for SH and got some data that is super often asked on this forum. I assume it might be of interest to many. Data was scraped today, from the all time ranking list. I forgot to scrap the chapter number data, so all the "how many favorites per chapter is a good number" questions will be left for another time. I will also scrap RR data next time I guess

Totals:
Views 158 869 364
Favorites 2 621 913
Words 256 824 198
Novels 6 638

Views
Percentiles for views:
Views
Percentile Views
99% 440 197
98% 252 916
97% 161 856
96% 126 872
95% 93 875
94% 80 234
93% 65 682
92% 54 904
91% 45 400
90% 38 730
85% 20 000
80% 11 700
75% 7 375
70% 5 000
65% 3 500
60% 2 500
55% 1 900
50% 1 500
45% 1 100
40% 879
35% 685
30% 547
25% 423
20% 315
15% 226
10% 158
5% 102

Horizontal (x) axis is position in ranking
View attachment 6628
View attachment 6629

Second graph only shows data between top 99 and bottom 5. What we basically see from that is that there is a number of fictions that do not pass 100 views, which are pretty much stuck, then novels that have a steady growth rate (slowly grow readers, and so also views), growth starts accelerating when you get to top 1000, and then those that get to the top 1.5% get super accelerated growth, most likely due to being on top of all the rankings.

The weird steps in the middle are result of rounding, don't worry about it.


Percentiles for Readers
99% 1 644
98% 1 196
97% 991
96% 810
95% 677
94% 593
93% 510
92% 449
91% 391
90% 350
85% 219
80% 153
75% 110
70% 82
65% 62
60% 48
55% 38
50% 31
45% 25
40% 19
35% 15
30% 12
25% 10
20% 7
15% 5
10% 4
5% 2

Horizontal (x) axis is position in ranking, second chart has data between top 99 and bottom 15 (sorry for wrong label)
Pretty much the same conclusion as with views
View attachment 6630
View attachment 6631

PS: Am i the only one to whom all buttons for text formatting in forum posts are disabled?
Wow, you spent a lot of time doing this...
 

Reisinling

Well-known member
Joined
Feb 5, 2021
Messages
357
Points
63
Wow, you spent a lot of time doing this...
I was refreshing my python skills, as I want to move from front end SE to more backend jobs. ~6-8h total I would say? Mostly because I never used related libraries. Also, for the first time in personal project, I actually wrote some automated testing.
 

CadmarLegend

@Agentt found a key in the skeletons.
Joined
Jan 3, 2021
Messages
1,956
Points
153
I was refreshing my python skills, as I want to move from front end SE to more backend jobs. ~6-8h total I would say? Mostly because I never used related libraries. Also, for the first time in personal project, I actually wrote some automated testing.
You are really productive, I've gotta say....
 

LordAstrea

Catgirl Addict
Joined
Nov 15, 2019
Messages
131
Points
83
Wow! This is awesome. Found this very interesting to sift through on my break. lol. Thanks for that.
 
D

Deleted member 20302

Guest
Lmao, this year I'm specializing in AI and data analysis in my final year of IT school. Also using Python for the specialization.

Seems like I get a preview of what I might be able to do too.

Thank you for the info! From here I can infer so much things.

1. Just getting 10k view results in the top 20% books in terms of popularity. I was expecting a bit lower, which I'm sure a lot do too. This means the effort to get people there isn't that difficult as one might expect. Even 5k views, half of the amount land you around top 30% for popularity.

2. Its extremely quite doable for any authors to reach 100 readers. Already in the top 30% of books in the site. I'm sure some thought that they could never make it but getting even that number is quite an achievable goal. Even half of that, 50, is top 40% which is unbelievably easier than I thought. Cause this means that 60% of books have readers below that number in this site.

It goes to shows how new this site is, and that majoirty of the books and authors isnt as popular as one might seen. And that any new author can attempt to aim high and achieve it without too much worry.

Well, thank you for reading my essay.
 

Reisinling

Well-known member
Joined
Feb 5, 2021
Messages
357
Points
63
You must have accidentally enabled the BB Code Editor.

Click here to disable it:

View attachment 6632
Thank you, that's exactly what happened. Reformatted the post to look a bit nicer

@AdLeto Tell me what you don't understand and I can help explain.
If you don't know what percentiles are, in here it says how many novels have less then X readers. So the pair of 50% and 31 readers means that half of novels on this site have less then 31 viewers, 90% and 350 means that 9/10 of all novels have less than 350 readers, and so on. It's nice because it tells you where you are relative to other authors.

@wildan1197_ ping me tomorrow, and I will attach it to this post
 

Reisinling

Well-known member
Joined
Feb 5, 2021
Messages
357
Points
63
Lmao, this year I'm specializing in AI and data analysis in my final year of IT school. Also using Python for the specialization.

Seems like I get a preview of what I might be able to do too.

Thank you for the info! From here I can infer so much things.

1. Just getting 10k view results in the top 20% books in terms of popularity. I was expecting a bit lower, which I'm sure a lot do too. This means the effort to get people there isn't that difficult as one might expect. Even 5k views, half of the amount land you around top 30% for popularity.

2. Its extremely quite doable for any authors to reach 100 readers. Already in the top 30% of books in the site. I'm sure some thought that they could never make it but getting even that number is quite an achievable goal. Even half of that, 50, is top 40% which is unbelievably easier than I thought. Cause this means that 60% of books have readers below that number in this site.

It goes to shows how new this site is, and that majoirty of the books and authors isnt as popular as one might seen. And that any new author can attempt to aim high and achieve it without too much worry.

Well, thank you for reading my essay.
I strongly suspect (once again, forgot to include in scrapped data) that its due to most people just dropping their novels after writing 5-6 chapters. This data made me also realize how lucky I was to have my novel get like 30-40 readers during its first day
 

DarkGodEM

Book Editor
Joined
Sep 12, 2020
Messages
312
Points
103
...
So...
I'm in the 98th and 96th percentile on readers but on the 96th and 94th on Views with my novels that I'm actively writing.......................

Wow.

I really didn't expect that.
 

High-in-the-skys

Awkward member
Joined
Jan 2, 2021
Messages
326
Points
108
Huh, explains precisely why my random novel got into "rankings".
I was well aware it's horrible but when I saw it's tag got into rankings (around #30, IIRC), I was shocked at how high it is and suspected few novels uses such tags.

I also observed some new novels got into rankings within a month and saw some similarities. First of all, they have consistent release and their chapters are more than ten. Their covers also doesn't matter since I saw some that have bad or no covers at all. Grammars fall within the acceptable range. The Genre/Tag/Theme are conventional, which if I give some new examples are Second chance, Romance, Gender Bender, Isekai, and uses "unique twist" in tropes(being a snake, monkey, bartender and catgirl but it doesn't change the fact that they're isekai). Contrary to how everyone calls this site "Smuthub", smut aren't found in popular unless they have good grammar, good plot and as everyone knows it, good smut(we don't talk about how they got sexy anime girls in cover).

Well that's just my observation. It's not good to rely on it since there can be some biases after all...
 

Businesssn

Brick-San the god of wholesome hentai
Joined
Dec 28, 2020
Messages
319
Points
83
So, being the kind of person I am, I decided to spend my Friday evening writing a scrapper for SH and got some data that is super often asked on this forum. I assume it might be of interest to many. Data was scraped today, from the all time ranking list. I forgot to scrap the chapter number data, so all the "how many favorites per chapter is a good number" questions will be left for another time. I will also scrap RR data next time I guess

Totals:​

Novel count, sum of views on all novels, sum of favorites among all novels, and sum of words among all novels

Novels6 638
Views158 869 364
Favorites2 621 913
Words256 824 198

Views​

PercentileViews
99%440 197
98%252 916
97%161 856
96%126 872
95%93 875
94%80 234
93%65 682
92%54 904
91%45 400
90%38 730
85%20 000
80%11 700
75%7 375
70%5 000
65%3 500
60%2 500
55%1 900
50%1 500
45%1 100
40%879
35%685
30%547
25%423
20%315
15%226
10%158
5%102

Horizontal (x) axis is position in ranking
View attachment 6628
View attachment 6629

Second graph only shows data between top 99 and bottom 5. What we basically see from that is that there is a number of fictions that do not pass 100 views, which are pretty much stuck, then novels that have a steady growth rate (slowly grow readers, and so also views), growth starts accelerating when you get to top 1000, and then those that get to the top 1.5% get super accelerated growth, most likely due to being on top of all the rankings.

The weird steps in the middle are result of rounding, don't worry about it.

Readers​


PercentileReaders
99%1 644
98%1 196
97%991
96%810
95%677
94%593
93%510
92%449
91%391
90%350
85%219
80%153
75%110
70%82
65%62
60%48
55%38
50%31
45%25
40%19
35%15
30%12
25%10
20%7
15%5
10%4
5%2

Horizontal (x) axis is position in ranking, second chart has data between top 99 and bottom 15 (sorry for wrong label)
Pretty much the same conclusion as with views
View attachment 6630
View attachment 6631

PS: Am i the only one to whom all buttons for text formatting in forum posts are disabled?
PS2: If you liked this post, feel free to try the first 2 chapters of my thingy :P it's in top 15% of all novels, so it can't be that bad!
I wonder where I am

It's only me that didn't understood shit about it?
*rest hand on your shoulder* I also didnt

Oh oh I got it know
 
D

Deleted member 20302

Guest
I strongly suspect (once again, forgot to include in scrapped data) that its due to most people just dropping their novels after writing 5-6 chapters. This data made me also realize how lucky I was to have my novel get like 30-40 readers during its first day
Yup! My overall conclusion from the essay is that as long a book gets to about 50 readers and 5k views, the book is considered doing more than half of the entire site. I got about 50 on my first day and 3 digits shortly. I am happy since I know I'm doing batter than most already. I love statistics and data so I'm grateful you do this for us! Thank you!

I agree about dropping novels, I suppose not everyone has the motivation to write that long. It's the initial excitement at the start that makes people write... then the hormones will fall flat.
 
Top