User talk:ClueBot Commons#Hundreds of broken talk archives
{{skip to TOC}}
{{divbox|1=red|2=This user is NOT a human|3=This is the combined talk page for ClueBot NG and ClueBot III. These users are automated computer programs and are not humans. Please be aware that bots cannot think like a human and cannot operate outside of their programming. Messages you leave on this talk page will not be answered by a bot – either a bot operator or another human will answer you.}}
{{divbox|1=red|2=False positives and false negatives|3=If you believe that ClueBot NG has mistakenly identified a good edit as vandalism, please follow the directions in the warning it gave or click here. Please do not report it on this talk page. It takes less time to report the case to the correct location, and we can handle it more effectively there.
If you believe that ClueBot NG has missed an edit that is vandalism, again do not report it here. ClueBot is unable to catch all vandalism. Just revert the edit and warn the editor.}}
{{divbox|1=blue|2=ClueBot NG Links!|3=Report False Positives{{•}} Frequently Asked Questions}}
{{divbox|1=blue|2=Purpose of this Page|3=This page is for comments on or questions about the ClueBots.}}
The current status of ClueBot NG is: {{User:ClueBot NG/running}}
The current status of ClueBot III is: {{User:ClueBot III/running}}
Praise should go on the praise page. Barnstars and other awards should go on the awards page.
Use the "new section" button at the top of this page to add a new section. Use the [edit] link above each section to edit that section.
This page is automatically archived by ClueBot III.
The ClueBots' owner or someone else who knows the answer to your question will reply on this page.
__TOC__
{{User:ClueBot III/ArchiveThis
|archiveprefix=User talk:ClueBot Commons/Archives/Facepalms
|format=
|age=99999
|index=no
|nogenerateindex=1
|archivebox=no
|box-advert=no
|archivenow=
}}
{{Archives|collapsed=yes|image=none|search=no|style=background-color:transparent; border-color:#CCCCCC|
{{User:ClueBot III/ArchiveThis
|archiveprefix=User talk:ClueBot Commons/Archives/
|format=Y/F
|age=168
|archivebox=yes
|box-advert=yes
|minkeepthreads=2
|archivenow=
}}
}}
{{User:ClueBot Commons/BotNav}}
{{clear}}
{{WP:TPS/watched}}
Improving ClueBot NG's algorithm
I'm looking into User:Cluebot NG#Vandalism Detection Algorithm, and this is my understanding of the algorithm:
1. For each word and pair of adjacent words that was added in the edit, add its score (which is determined from training data) to a counter.
2. Compute a few other statistics, such as the length of the added text, and normalize them for use as inputs to the neural network.
3. Run the neural network to get the score.
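The three steps above can be sketched roughly as follows. This is only an illustration of the description given here, not ClueBot NG's actual code: the score tables, the choice of features, the normalization, and the single logistic unit standing in for the neural network are all assumptions made for the example.

```python
import math

# Hypothetical scores; per the description, the real values come from training data.
WORD_SCORES = {"stupid": 2.0, "is": 0.0, "the": -0.1}
PAIR_SCORES = {("is", "stupid"): 1.5}


def ngram_score(added_words):
    """Step 1: sum the scores of every added word and every adjacent pair."""
    total = 0.0
    for word in added_words:
        total += WORD_SCORES.get(word, 0.0)
    for pair in zip(added_words, added_words[1:]):
        total += PAIR_SCORES.get(pair, 0.0)
    return total


def features(added_text):
    """Step 2: compute a few statistics and normalize them (assumed scheme)."""
    words = added_text.lower().split()
    return [
        math.tanh(ngram_score(words)),       # squash the counter into (-1, 1)
        min(len(added_text) / 1000.0, 1.0),  # length of added text, clipped to [0, 1]
    ]


def network(inputs, weights=(3.0, 0.5), bias=-1.0):
    """Step 3: stand-in for the neural network, a single logistic unit."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1.0 / (1.0 + math.exp(-z))


def vandalism_score(added_text):
    """Run all three steps and return a score between 0 and 1."""
    return network(features(added_text))
```

With these made-up numbers, `vandalism_score("Bob is stupid")` comes out higher than `vandalism_score("the river is long")`, because the first edit hits both a scored word and a scored pair.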
Clearly, the algorithm works well (just look at ClueBot NG's contributions page). However, some areas could still be improved. For example, the window size for the Bayesian classifiers is just 2, so a vandalism edit whose telltale phrase spans 3 or more words (or has extra words interspersed) might be missed. In fact, it may be better to use something like a Transformer (a deep learning architecture) to capture the meaning of the edit more accurately.
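The window-size limitation can be illustrated with a small sketch. The helper names and the example phrases are hypothetical, and the tokenization (lowercasing and splitting on whitespace) is an assumption rather than what the bot actually does.

```python
def ngrams(words, n):
    """Return every run of n adjacent words as a tuple."""
    return [tuple(words[i:i + n]) for i in range(len(words) - n + 1)]


def window_features(text, max_n=2):
    """Collect all n-grams of the text for n = 1 .. max_n."""
    words = text.lower().split()
    feats = set()
    for n in range(1, max_n + 1):
        feats.update(ngrams(words, n))
    return feats


# A three-word phrase is never seen as a unit when the window is 2 ...
three_word_phrase = ("this", "page", "sucks")
seen_with_window_2 = three_word_phrase in window_features("This page sucks", max_n=2)
seen_with_window_3 = three_word_phrase in window_features("This page sucks", max_n=3)

# ... and an interspersed word breaks up a pair that would otherwise match.
pair = ("page", "sucks")
pair_survives_insertion = pair in window_features("This page really sucks", max_n=2)
```

Here `seen_with_window_2` is `False` while `seen_with_window_3` is `True`, and `pair_survives_insertion` is `False`: a model limited to single words and adjacent pairs only sees the fragments, which is the gap a wider window or a sequence model would aim to close.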
As far as I know, the principal maintainer of the bot (User:Crispy1989) has been inactive since 2011. Also pinging User:DamianZaremba since he seems to be active on the GitHub repo. If I could, I would be excited to help improve the bot. Sungodtemple (talk • contribs) 01:04, 14 May 2025 (UTC)
:Also pinging NaomiAmethyst since she seems to be active. Sungodtemple (talk • contribs) 01:55, 21 May 2025 (UTC)
Empty index??
User:ClueBot III/Master Detailed Indices/Talk:Transgender health care misinformation How does that happen? Aaron Liu (talk) 17:18, 15 May 2025 (UTC)
:ClueBot was never actually used to archive that page. It was added in Special:Diff/1269259214 and then removed in Special:Diff/1275633399 before it ever got to archive anything, so it makes sense that the index was empty at the time and hasn't been updated since. Aidan9382 (talk) 17:55, 15 May 2025 (UTC)