User:BrownHairedGirl/Articles with bare links

__FORCETOC__

= Note =

This page exists solely as a list of articles to be processed to clean up bare URLs (see WP:Bare URLs). I do that by feeding these lists to {{u|Citation bot}}.

In the vast majority of cases, these are articles in which I have no interest other than fixing bare URL references.

The lists have no other significance, and represent only a subset of the topic described in the title. Note that the definition of "bare URL" used here is narrow: ref tags which contain only the URL, optionally preceded or followed by spaces, and/or enclosed in square brackets [].

For example:

  • bare URL, with no spaces: <ref>https://www.example.com/foo</ref>
  • bare URL, with spaces: <ref> https://www.example2.com/foobar </ref>
  • bracketed URL, with no spaces: <ref>[https://www.example.com/foo]</ref>
  • bracketed URL, with spaces: <ref> [https://www.example2.com/foobar] </ref>
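The narrow definition above can be sketched as a regular expression. This is only an illustration, not the pattern actually used to build these lists: the name `BARE_REF`, the helper `find_bare_url_refs`, and the exact character classes are all assumptions.

```python
import re

# Assumed pattern (not taken from this page): a ref tag whose whole content is
# one URL, optionally wrapped in square brackets and/or surrounded by spaces.
BARE_REF = re.compile(
    r"<ref(?:\s[^>/]*)?>"            # opening tag, with optional attributes
    r"\s*\[?\s*"                     # optional spaces and opening bracket
    r"(https?://[^\s<>\[\]|]+)"      # the URL itself
    r"\s*\]?\s*"                     # optional closing bracket and spaces
    r"</ref>",
    re.IGNORECASE,
)


def find_bare_url_refs(wikitext):
    """Return the URLs of all refs that are bare in the narrow sense."""
    return [m.group(1) for m in BARE_REF.finditer(wikitext)]
```

A ref such as `<ref>[https://www.example.com/foo Some title]</ref>` is deliberately not matched, because it contains more than the URL.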

There are of course many other types of inadequately described citation. This exercise targets only the simplest, worst examples. However, when {{u|Citation bot}} processes a page, it can fix many other citation issues, so this exercise fixes more than just the targeted problem.

= Old methodology =

{{collapse top|This section describes the list-making process used from mid July to late November 2021}}

I create these lists by using WP:PETSCAN to make huge lists of articles. I then process them in various ways using WP:AWB to produce a list of only those articles with bare URL refs, which usually amount to less than 10% of the initial lists. (The highest ratio I have seen is about 15%, and the average is probably about 8%.) I then break the list up into chunks which fit under the 2200-page limit for Citation bot batch jobs.

Note that each list is cross-checked against the lists which have already been processed, to avoid having the bot process the same page twice. (There may be some glitches in this, since my filing system for lists made in July has some omissions.)
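The cross-checking and chunking steps could be sketched roughly as follows. This is a hypothetical illustration in Python, not the AWB workflow actually used; the function name `make_batches` and its parameters are assumptions, and only the 2200-page limit comes from the text above.

```python
def make_batches(candidates, already_processed, batch_size=2200):
    """Drop titles that were already processed, then split into batches.

    candidates: page titles that contain bare URL refs (hypothetical input)
    already_processed: titles from earlier lists
    batch_size: maximum pages per batch job (2200 per the text above)
    """
    seen = set(already_processed)
    fresh = []
    for title in candidates:
        if title not in seen:       # skip earlier lists and duplicates
            seen.add(title)
            fresh.append(title)
    # Slice the de-duplicated list into chunks no larger than batch_size.
    return [fresh[i:i + batch_size] for i in range(0, len(fresh), batch_size)]
```

Keeping the original order of `candidates` means each batch stays roughly grouped by whatever topic the initial scan was sorted on.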

{{collapse bottom}}

= Updates =

Please note that this page is updated with a new list once a batch has started processing, sometimes before that batch has finished. So if you have come to this page after seeing it mentioned in an edit summary, the current version of this page may not be the one used for that series of edits. For earlier versions, see [https://en.wikipedia.org/w/index.php?title=User:BrownHairedGirl/Articles_with_bare_links&action=history this page's history].

= Lists =

Until recently, @{{u|Citation bot}} was unable to fill a bare URL ref if it was followed by punctuation. I reported the bug at User talk:Citation bot/Archive 31#Bot_fails_to_fill_ref_to_bare_URL_followed_by_punctuation, and the bot's maintainer kindly responded with a prompt fix.

This batch job is to fix the backlog of punctuated bare URLs. The list started as the output of a scan of the [https://dumps.wikimedia.org/enwiki/20220520/ 20220520 database dump] for the regex ]*?>\s*\[?\s*https?:[^>< \|\[\]]+\s*\]?\s*[\|\.,]\s*<\s*/\s*ref\b
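As a rough illustration of what such a scan looks for, here is a Python sketch. The regex as displayed above appears to have lost its opening characters, so the leading `<ref[^>` used here is an assumption and may differ from the pattern actually run against the dump; the names `PUNCTUATED_BARE_REF` and `has_punctuated_bare_ref` are hypothetical.

```python
import re

# The leading "<ref[^>" is an ASSUMED reconstruction; the rest follows the
# pattern quoted above.
PUNCTUATED_BARE_REF = re.compile(
    r"<ref[^>]*?>\s*\[?\s*https?:[^>< \|\[\]]+\s*\]?\s*[\|\.,]\s*<\s*/\s*ref\b"
)


def has_punctuated_bare_ref(wikitext):
    """True if the text has a bare URL ref followed by '|', '.' or ','."""
    return PUNCTUATED_BARE_REF.search(wikitext) is not None
```

Under that assumption, `<ref>https://www.example.com/foo.</ref>` and `<ref>[https://www.example.com/foo],</ref>` both match, while an unpunctuated `<ref>https://www.example.com/foo</ref>` does not.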

While that list of 1,352 pages was being processed by @{{u|Citation bot}}, a spot check revealed that it was not filling refs where the URL was enclosed in square brackets. I notified the maintainer (see the discussion), and the glitch was promptly fixed.

The list was then re-parsed, to exclude pages which no longer had any punctuated bare URLs. That left 1,196 pages for Take 2.

After Take 2 was parsed, there were 898 pages left to be parsed in Take 3.

After Take 3 was parsed, there were 834 pages left to be parsed in Take 4.

After Take 4 was parsed, there were 468 pages left to be parsed in Take 5.

(Please note that the reductions in page count are misleading. Most of the remaining 898 pages had one or more bare URLs filled in the previous takes, but a page remains in the list unless all of its punctuated bare URLs have been filled.)