Question about A.I. Data Scrubbing

Lysander_Works

Well-known member
Joined
Jul 22, 2023
Messages
596
Points
103
I've been wanting to try something for a while, but the bothersome A.I. data scrubbing raises a technical question I know nothing about.

In particular, if a user of ScribbleHub creates and then saves a draft (never publishing the draft), how accessible is the draft and its contents to something like an A.I. data scrubbing program? Can A.I. being used to scan users' story contents break beyond that barrier and even go so far as to peer into unpublished draft content?

It may seem like a simple question, but for all I know the answer may depend on several factors. What I do know is that data scrubbing can bypass the DRM on platforms like KDP, accessing content that should be bought and paid for before being accessed, so clearly there is some intrusive process to data scrubbing, but as tech savvy as I am, A.I. is still something I know too little about (because I've avoided it as much as possible). Any experts here know anything? Is it theoretically possible for some lunatic to scan an author's draft, alter it, and then post it on some pirate platform as if it were their own (emphasis on draft-ready + unpublished content)?
 

Succubiome

Well-known member
Joined
Apr 25, 2023
Messages
623
Points
133
I won't get into the technical details, since it might give people ideas, but I will say it's fairly trivial to grab drafts on Scribblehub.
 

Lysander_Works

Well-known member
Joined
Jul 22, 2023
Messages
596
Points
103
I won't get into the technical details, since it might give people ideas, but I will say it's fairly trivial to grab drafts on Scribblehub.
? Really? I presumed it was possible yet difficult initially. I'm guessing that data-scrubbing has a way of reading non-protected data (including draft-work) as if it were reading the contents of where it would be on the servers, albeit random?
If that is true, then I should delay the decision on a certain long-term separate project, for now.
I wonder if it universally applies to similar sites as well.
 

Goodmann

Well-known member
Joined
Aug 6, 2023
Messages
73
Points
58
Sounds like it might be smarter to write & edit on your own computer with your own word processor, & port only the final version. Although if they can pirate the posted version, keeping your roughs separate would only help in proving authorship.
 

Lysander_Works

Well-known member
Joined
Jul 22, 2023
Messages
596
Points
103
Sounds like it might be smarter to write & edit on your own computer with your own word processor, & port only the final version. Although if they can pirate the posted version, keeping your roughs separate would only help in proving authorship.
My thoughts exactly. I don't plan on publishing that specific project for many, many years. No sense setting up the template now if the risk of it being stolen, given enough time, is constant. Once I actually post it here, I won't care too much when the pirates do come for it.
 

Succubiome

Well-known member
Joined
Apr 25, 2023
Messages
623
Points
133
? Really? I presumed it was possible yet difficult initially. I'm guessing that data-scrubbing has a way of reading non-protected data (including draft-work) as if it were reading the contents of where it would be on the servers, albeit random?
If that is true, then I should delay the decision on a certain long-term separate project, for now.
I wonder if it universally applies to similar sites as well.
Wait, I'm wrong-- I opened up the link in the same browser, so of course it let me-- I don't know how well the information is protected, but it's not trivially easy to bypass.

I thought all you needed was the ID, and this is because I messed up my checking.

My bad!
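
To show the mistake, here is a rough sketch of the difference between opening a draft link in the same logged-in browser and requesting it with only the ID. It is a toy in-memory model and not how Scribblehub actually works. `DRAFTS`, `SESSIONS`, `fetch_draft` and the cookie value are all made up for illustration.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Draft:
    owner: str
    text: str


# Hypothetical stand-ins for a site's draft storage and login sessions.
DRAFTS = {1234: Draft(owner="author_a", text="Chapter 1 (unpublished)")}
SESSIONS = {"cookie-of-author-a": "author_a"}


def fetch_draft(draft_id: int, session_cookie: Optional[str] = None) -> Tuple[int, str]:
    """Return (status_code, body) the way a server with an owner check might."""
    draft = DRAFTS.get(draft_id)
    if draft is None:
        return 404, "not found"
    # Look up who is logged in; no cookie or an unknown cookie means nobody.
    user = SESSIONS.get(session_cookie) if session_cookie else None
    if user != draft.owner:
        return 403, "forbidden"
    return 200, draft.text


# Same browser: the login cookie goes along with the request, so the draft loads.
same_browser = fetch_draft(1234, session_cookie="cookie-of-author-a")
# Private window or scraper: only the ID is known, no cookie is sent.
anonymous = fetch_draft(1234)
print(same_browser[0], anonymous[0])
```

If a site does a check like this, knowing the ID alone gets a refusal, which is why a test only counts when it is run logged out or from a different browser.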
 

Corty

Ra’Coon
Joined
Oct 7, 2022
Messages
4,666
Points
183
There was a project here somewhere making a better searcher for the site, but it was running into problems as the scrubbing flagged the creator, and he was told not to do it a certain way.

I’m dumb in this regard, so I am speaking of what I remember, but it was around a year ago.
 

Lysander_Works

Well-known member
Joined
Jul 22, 2023
Messages
596
Points
103
Wait, I'm wrong-- I opened up the link in the same browser, so of course it let me-- I don't know how well the information is protected, but it's not trivially easy to bypass.

I thought all you needed was the ID, and this is because I messed up my checking.

My bad!
Bruh
I thought there was at least one platform (I don't remember which one) which would allow draft visibility to be set for everyone, but maybe I'm wrong.
As for Scribblehub I know this much is false.
Look, I've at least realized one thing. Accessing another user's draft-only content would imply breaching some level of security in the first place. I'm not claiming that A.I. can't do it, but there would be huge legal implications if it started happening. Think about it: if A.I. data scrubbing could breach a server for an author's drafts, do you really think the person doing it would care much about unedited text content? No, of course not. They'd be first in line to go after sensitive financial information, where the big money is. It would also imply that online privacy in general can't exist under any pretext.

All that insanity aside, I still feel there is much more to this that I don't know. Yeah, the topic is focused on books and book drafts for now, but the idea of A.I. data scrubbing can expand into other areas too, so it's still interesting in that regard.

Already published chapter content would be different, mostly. I still wonder how it gets used for novels within pay-only platforms like KDP and Google Books. I heard (actually I've seen evidence of) authors' works having their content scanned if they published on KDP and Google. I also know that Google and KDP both have an A.I. powered tool that they themselves can use to scan their own library. This raises a personal and interesting side question for me:
Can A.I. Scrubbing from a source 'outside the first party platform' still access published paid-book content? If so, how?

I know, I'll keep doing my own research for fun, but feel free to chime in whenever. I see lots of forums and not many of them pique my interest.
 

Succubiome

Well-known member
Joined
Apr 25, 2023
Messages
623
Points
133
Can A.I. Scrubbing from a source 'outside the first party platform' still access published paid-book content? If so, how?
I mean, if they pay for it, absolutely, even without any deeper security-breaking things? Kindle Unlimited gets you access to a lot of things, and text is not that hard to copy out of PDFs or whatever.
 

tiaf

ゞ(シㅇ3ㅇ)っ•♥•Speak fishy, read BL.•♥•
Joined
May 29, 2019
Messages
3,113
Points
183
If AI could access the drafts, wouldn't that make it hacking? :blob_hmm::blob_hmm:
 

Lysander_Works

Well-known member
Joined
Jul 22, 2023
Messages
596
Points
103
I mean, if they pay for it, absolutely, even without any deeper security-breaking things?
Well sure, but I've heard that A.I. can scan the material without paying the author a cent for it. Whether that rumor is true or not is something I have not confirmed, but I've heard it from more than one person. Not sure what to think there.


If AI could access the drafts, wouldn't that make it hacking?
You're thinking of breaching, most likely. Hacking implies someone does all the work to implant backdoors and unlock targeted directories. Breaching would be similar, but more of a workaround, like walking around a barbed-wire fence instead of trying to break right through. If A.I. can't do that yet, good. I'll sleep better at night.
I feel like it will happen one day or another though. The world has unleashed A.I. without a single concerned thought about what new problems may come from it.

That said, from what I was told by one technician, so long as draft content is protected on the server by credentials, A.I. scrubbing should not be capable of seeing that content without some level of hacking or breaching (which is an unlikely risk for the specific subject matter). I thought there was at least one web-story platform that publicizes drafts, but I cannot locate or prove it now... Maybe I merely dreamed it.

After all of this, do I have some peace of mind? Yes. Does it mean my plan changes? Not really. At most I will probably generate the skeleton content without the body content, and then fill in all the blanks when I'm ready for it to go live.
 

tiaf

ゞ(シㅇ3ㅇ)っ•♥•Speak fishy, read BL.•♥•
Joined
May 29, 2019
Messages
3,113
Points
183
Well sure, but I've heard that A.I. can scan the material without paying the author a cent for it. Whether that rumor is true or not is something I have not confirmed, but I've heard it from more than one person. Not sure what to think there.



You're thinking of breaching, most likely. Hacking implies someone does all the work to implant backdoors and unlock targeted directories. Breaching would be similar, but more of a workaround, like walking around a barbed-wire fence instead of trying to break right through. If A.I. can't do that yet, good. I'll sleep better at night.
I feel like it will happen one day or another though. The world has unleashed A.I. without a single concerned thought about what new problems may come from it.

That said, from what I was told by one technician, so long as draft content is protected on the server by credentials, A.I. scrubbing should not be capable of seeing that content without some level of hacking or breaching (which is an unlikely risk for the specific subject matter). I thought there was at least one web-story platform that publicizes drafts, but I cannot locate or prove it now... Maybe I merely dreamed it.

After all of this, do I have some peace of mind? Yes. Does it mean my plan changes? Not really. At most I will probably generate the skeleton content without the body content, and then fill in all the blanks when I'm ready for it to go live.
You’re better off saving it offline on your device and backing it up to the cloud.

I don’t know which site has public drafts, but I certainly know that Wattpad doesn’t protect your drafts from the moderators. So never save drafts on Wattpad.
 

Lysander_Works

Well-known member
Joined
Jul 22, 2023
Messages
596
Points
103
I don’t know which site has public drafts, but I certainly know that Wattpad doesn’t protect your drafts from the moderators. So never save drafts on Wattpad.
Maybe Wattpad was the one I was thinking of, though I could not find any actual indication or proof of that. If you know something more about it I'd love to look into it.

Yeah, like I said, nothing wrong with a skeleton frame of the content (at most chapter names and special non-text link files & placeholders). I already have the content ready and published on KDP and iBooks. I never did explain the reason or sense behind the idea. I plan to one day publish that content in as many places as possible, but it is content I would like to make money off of first. I want to have it set up so that I could push it out like the flick of a few switches, in a couple of weeks, tops.
Why, you might wonder?

Recent events have reminded me that my body is frail in more ways than one. If I one day learn that I don't have very long to make it, might as well allow everyone to see the work I've left behind. Not to be dark, but my health is not great. That very situation is still up in the air...
 