capable of processing an entire chain.
I'm not sure why you would want to process the entire chain rather than monitoring it in real time for posts that meet a certain criteria and then punting them to a checker. Maybe that's what you meant. The bulk of hive traffic is not blog posts. Why would I want to feed splinterlands battles into a plagiarism API for example? There should be an arbiter for that built into the system. If Cheetah for example were processing token sales on hive engine and the rental market on splinterlands, and checked them for plagiarism, I would argue it's poorly designed.
It's a shit job, everyone is disenchanted with it, everyone thinks not enough is done, its thankless.
You'll get no argument from me here, which is why as much as possible should be done by a 'trainable' algorithm.
We go through this every six months or so on average and then no one cares anymore and getting feedback on anything is like pulling teeth.
For me it's more of a question of 'out of sight out of mind.' If it's not happening on chain, it didn't happen. That is a problem, because it should be on chain. Ideally if someone breaks a rule, I should see their dossier posted in a reply. You plagiarized, here's a link to what you plagiarized, and here are all of your past transgressions. If I see someone I recognize was downvoted, I shouldn't have to run it down on Discord. From what I remember a large percentage of your time on discord is spent defending your actions because their impetus was not evident on chain.
You have to process every post in order to monitor it. So what you need to do is have a system to run these posts through. One, with a simple but expensive Google API like Cheetah (Cheetah is a 'similar content' bot that searches every post for similar wording via Google). Two, with a reporting form (simple too). Three, with custom reverse engineering tools to detect not only AI but translations from obscured sources (creation of this would be the most expensive part). Four, with "account history". There's no way to avoid massive infrastructure and extremely high costs in development and operations. I believe Tiktok and Facebook are attempting to create similar scale models. Not saying its impossible, but would be more complex than anything else ever built on Hive. It's an interesting concept though that would push anything ever built on any chain. Whether it would be ethical to build is another question.
Regarding Discord: even if a comment is clear on chain people don't read it or want to hear the answer again. That's just human nature. We took out the auto comments which spam the chain and other pre-Hive-related protocols. Those prevent the person from appealing and moving on.