Wikipedia:Bots/Requests for approval/ZackBot 10

ZackBot 10

[[User:ZackBot|ZackBot 10]]

{{Newbot|ZackBot|10}}

Operator: {{botop|Zackmann08}}

Time filed: 20:36, Friday, September 28, 2018 (UTC)

Automatic, Supervised, or Manual: Automatic

Programming language(s): Ruby

Source code available: User:ZackBot/Infobox-needed

Function overview:The goal is to scan pages that are in :Category:Wikipedia articles with an infobox request and remove any pages that already have an infobox.

Links to relevant discussions (where appropriate): Wikipedia:Bot_requests#Bot_to_update_'Needs_infobox'

Edit period(s): One time run for now.

Estimated number of pages affected: Very difficult to say. Per [https://petscan.wmflabs.org/?psid=5950761 PetScan] there are currently 88,074 talk pages that fall in the category. I'd guess that somewhere between 3%-8% of those have Infoboxes and thus would be affected by this script. So A guess would be somewhere around 7,000-8,000 pages? But that is a TOTAL guess. This will be greatly dependent on how many of these sub categories I will run the script against.

Namespace(s):Main

Exclusion compliant (Yes/No): yes

Function details:

The functionality is pretty straight forward:

  1. Take a list of pages from a [https://petscan.wmflabs.org/ PetScan] search. These will be Talk pages that are marked as needing an infobox.
  2. Check the text of the page and search for the word infobox. My research thus far has indicated that just looking for the word infobox should be good enough as it is not a term used in any other context that I can find. However, if granted a trial run, this will be an area I will be focusing my attention on.
  3. If the page is found to contain the word then go back to the talk page and look for the param 'needs-infobox' and remove it from the templates.
  4. In the event that the needs-infobox parameter is not found, an error is raised and logged for manual inspection.

The ONLY change that this script will be making is to Talk pages, and it will be to remove text matching \|\s*needs-infobox\s*=\s*y(?:es){0,1}\s*

--Zackmann (Talk to me/What I been doing) 20:36, 28 September 2018 (UTC)

=Discussion=

  • Could it regex for something like [{][{][ \n\t]*[Ii]nfobox ? -- GreenC 21:34, 28 September 2018 (UTC)

:*{{ping|GreenC}} so technically speaking it can search for any regex. I think you are on the right track, but that has a few problems. Not all infoboxes start with {{infobox.... But perhaps something like /\{\{[\s\w\n]*infobox/i. See my testcase: [https://www.regexr.com/40bm1 here] --Zackmann (Talk to me/What I been doing) 21:41, 28 September 2018 (UTC)

:::Great. -- GreenC 23:35, 28 September 2018 (UTC)

I'd like to see a short trial to see how it works. {{BotTrial|edits=100}} SQLQuery me! 03:30, 14 October 2018 (UTC)

:did the trial occur? Please provide a summary of your trial results. — xaosflux Talk 01:19, 11 November 2018 (UTC)

::{{BotTrialComplete}} {{ping|Xaosflux}} thanks. Forgot to update this. {{ping|SQL}} [https://en.wikipedia.org/w/index.php?limit=500&title=Special%3AContributions&contribs=user&target=ZackBot&namespace=&tagfilter=&start=2018-10-19&end=2018-10-24 trial edits]... --Zackmann (Talk to me/What I been doing) 01:26, 11 November 2018 (UTC)

:{{BotApproved}} Task approved. — xaosflux Talk 02:01, 11 November 2018 (UTC)

:Ramp up schedule:

  • 500 edits, 1 day hold
  • 1000 edits, 1 day hold
  • 1000 edits, 3 day hold
  • open editing. — xaosflux Talk 02:03, 11 November 2018 (UTC)

:The above discussion is preserved as an archive of the debate. Please do not modify it. To request review of this BRFA, please start a new section at WT:BRFA.