Wikipedia:Link rot/URL change requests#smmsport.com
{{Shortcut|WP:URLREQ}}
__NEWSECTIONLINK__
This page is for requesting modifications to URLs, such as marking dead or changing to a new domain. Some bots are designed to fix link rot; they can be notified here. These bots include InternetArchiveBot and WaybackMedic. This page can be monitored by bot operators from other language wikis since URL changes are universally applicable.
{{User:ClueBot III/ArchiveThis|archiveprefix=Wikipedia:Link rot/URL change requests/Archives/|format=Y/F|age=2160|archivebox=yes|box-advert=yes}}
finlex.fi discussion
Finlex.fi URLs aren't dead but for some reason InternetArchiveBot keeps adding archived URLs for them. This was brought up at :meta:User_talk:InternetArchiveBot#Finlex.fi_URLs_aren't_dead a month ago: {{tq|1=Bot's edits: [https://en.wikipedia.org/w/index.php?title=Finnish_Defence_Forces&diff=prev&oldid=1205893909], [https://en.wikipedia.org/w/index.php?title=Finland&diff=prev&oldid=1204320409], [https://en.wikipedia.org/w/index.php?title=Docent&diff=prev&oldid=1203856121]. Some URLs it tagged as dead but are actually working: [http://www.finlex.fi/fi/laki/ajantasa/2007/20071438], [https://www.finlex.fi/fi/laki/ajantasa/1991/19911083], [http://www.finlex.fi/fi/laki/alkup/2009/20090558].}} Those finlex.fi URLs that now have both a working URL and an archive URL should be tagged with the |url-status=live
tag, and could someone try to tell IABot that Finlex is live? Thanks. 2001:14BA:9C94:9A00:E866:DADA:1085:E3D9 (talk) 09:28, 17 March 2024 (UTC)
:Just noticed that this same issue is being discussed at fi.wikipedia: :fi:Wikipedia:Kahvihuone_(tekniikka)#Botti_hakee_arkistosta_kumottuja_lakeja 2001:14BA:9C94:9A00:E866:DADA:1085:E3D9 (talk) 09:41, 17 March 2024 (UTC)
::The site has a "Are you human?" check box (CloudFlare). This is causing the bot to think it's a dead site. I logged into iabot.org and changed the domain to "Subscription" status and that will cause the bot to avoid this domain, it won't set live or dead. My bot WaybackMedic has capabilities to bypass CloudFlare. I can try to process this domain and see what happens. My bot also has a feature "make live" ie. convert a citation from dead to live state. Unfortunately my bot only works on English Wikipedia. I'll let you know what happens. -- GreenC 15:13, 17 March 2024 (UTC)
:::Unfortunately, this site has maximum security enabled, none of my tools can get through. It started happening in late January 2024. I don't know what to do because no bot is able to determine if a link is live or dead. And no archive service such as WaybackMachine is able to archive a page. Only humans can get through, and they need to solve a captcha. It might be worthwhile waiting to see if they relax security in the future, since this is a recent development. -- GreenC 00:40, 19 March 2024 (UTC)
::::{{Ping|GreenC}} Before this section gets archived and if it's easy/fast to check, can you check if this is still the case, i.e. that the site still has the maximum security enabled and no tool/bot can get through? Thank you. 85.76.109.152 (talk) 06:21, 2 June 2024 (UTC)
:::::{{qmark}} When going to [https://www.finlex.fi/fi/laki/ajantasa/2007/20071438] it still asks "Are you human?" with the CloudFlare security tag at the bottom. This is a feature of CloudFlare service, clients have the option to enable, it's the highest level of security. I'm not aware of a tool that can bypass. What I will do is set a reminder in 6 months to check again and post the results here. I use [https://en.wikipedia.org/wiki/User:SD0001/W-Ping W-Ping] which posts a reminder in the watchlist at whatever time in the future with a custom message. -- GreenC 16:06, 2 June 2024 (UTC)
::::::Still on CloudFlare. -- GreenC 03:21, 2 December 2024 (UTC)
:::::::Looks like the captcha might have been removed – I'm not getting one on that link at the moment. Save Page Now also works. 2A00:807:D3:B2CD:1D64:EE5:ECA4:CA80 (talk) 08:40, 25 April 2025 (UTC)
::::::::Yes one year later captcha lifted. I put this on my schedule to correct the dead link status (on Enwiki). I just reset the domain status in iabot.org to {{underline|alive}}. -- GreenC 01:49, 3 May 2025 (UTC)
{{od}}
This should be completed see Wikipedia:Link_rot/URL_change_requests#finlex.fi. Any problems let me know. -- GreenC 18:26, 9 May 2025 (UTC)
:Great to hear, thanks! (OP) 87.95.243.221 (talk) 16:36, 12 May 2025 (UTC)
uptheposh.com
The domain www.uptheposh.com has been usurped, and all links (including sublinks like
:Sounds like WP:JUDI. — Qwerfjkltalk 16:54, 4 January 2025 (UTC)
{{done}} in a WP:JUDI usurpation batch. -- GreenC 04:58, 16 February 2025 (UTC)
acig.org
Has been usurped by 1map.com. Needs adding archives and marked usurped Lyndaship (talk) 09:37, 7 January 2025 (UTC)
:User:Lyndaship, thank you. Awaiting next WP:JUDI usurpation batch: Special:Diff/1267870649/1267991245 -- GreenC 17:29, 7 January 2025 (UTC)
{{done}} in a WP:JUDI usurpation batch. -- GreenC 04:58, 16 February 2025 (UTC)
vectorsite.net
This site died in 2012, until 2019 it went to Justhost, since then if you look at at archived page on wayback a file gets downloaded onto your computer. Putting a url directly into browser search brings up a squatter search site but with a url beginning ww3. Refill has in the past changed the cite url to one beginning ww1. Think it's safest to just archive and usurp the lot Lyndaship (talk) 12:50, 9 January 2025 (UTC)
:Awaiting next WP:JUDI usurpation batch: Special:Diff/1268240877/1268396625 -- GreenC 15:26, 9 January 2025 (UTC)
{{done}} in a WP:JUDI usurpation batch. -- GreenC 04:59, 16 February 2025 (UTC)
arkivnamnden.org
This used to host the public archives of the Swedish city Göteborg. The archives moved to a new address starting from 2019. It is now displaying casino ads.
Example of use: https://sv.wikipedia.org/wiki/Kungsladug%C3%A5rd,_G%C3%B6teborg#cite_note-12
https://web.archive.org/web/20200220132929/http://arkivnamnden.org/ - information about address change
https://web.archive.org/web/20210413012018/https://arkivnamnden.org/ - up for sale
https://web.archive.org/web/20230605133707/https://arkivnamnden.org/ - still for sale
https://web.archive.org/web/20250105093523/https://arkivnamnden.org/ - casino ads
Should probably be marked as usurped and what has not already been changed to archive links should do that. 98.128.246.108 (talk) 14:19, 9 January 2025 (UTC)
:Awaiting next WP:JUDI usurpation batch: Special:Diff/1268240877/1268396625 -- GreenC 15:26, 9 January 2025 (UTC)
{{done}} in a WP:JUDI usurpation batch. -- GreenC 04:59, 16 February 2025 (UTC)
airfields-freeman.com
[https://en.wikipedia.org/w/index.php?search=insource%3Aairfields-freeman+insource%3A%2Fairfields-freeman%5C.com%2F&title=Special:Search&profile=advanced&fulltext=1&ns0=1 672 pages.] New domain is airfieldsfreeman.com. Cuba200611 (talk) 02:33, 10 January 2025 (UTC)
:In addition many can be converted to .htm
:#https://www.airfieldsfreeman.com/KS/Airfields_KS_Wichita.html
:#https://www.airfieldsfreeman.com/KS/Airfields_KS_Wichita.htm
: -- GreenC 18:53, 12 January 2025 (UTC)
- Cuba200611: Some URLs changed the name of the airfield, for example Special:Diff/1268800794/1269043386 and Special:Diff/1207291175/1269042821. I did these two. I'll leave the rest to you, which can't be done by bot.
{{collapse begin |title=25 URLs}}
- Herrick HV-2A Vertaplane ---- http://www.airfields-freeman.com/ny/airfields_ny_ny_brooklyn.htm
- Hearst Ranch ---- http://www.airfields-freeman.com/ca/Airfields_CA_SanLuisObispo.htm#hearst
- Robbins Airport (Illinois) ---- http://www.airfields-freeman.com/il/Airfields_IL_Chicago_W.htm
- Deer Park Airport (New York) ---- http://www.airfields-freeman.com/ny/Airfields_NY_LongIs_Suffolk_W.htm
- Matamoras, Pennsylvania ---- http://www.airfields-freeman.com/pa/Airfields_PA_NE.htm#matamoras
- Oldmans Township Airport ---- http://www.airfields-freeman.com/nj/Airfields_NJ_SW.htm
- North American Farms Airport ---- http://www.airfields-freeman.com/fl/Airfields_FL_Tallahassee.htm
- Rotonda West, Florida ---- http://www.airfields-freeman.com/fl/Airfields_FL_FtLauderdale.htm
- Ithaca Tompkins International Airport ---- http://www.airfields-freeman.com/ny/Airfields_NY_Centr.htm#ithaca
- Suburban Airport ---- http://www.airfields-freeman.com/md/Airfields_MD_AnneArundelCo.htm#suburban
- Grumman Bethpage Airport ---- http://www.airfields-freeman.com/ny/Airfields_NY_LongIs_Nassau.htm#bethpage
- Texas Gulf Coast Regional Airport ---- http://www.airfields-freeman.com,
- Fort Ord Army Airfield ---- http://www.airfields-freeman.com/Ca/Airfields_CA_Monterey.htm#ftord
- Cook Cleland ---- http://www.airfields-freeman.com/OH/Airfields_OH_Cleveland_N.htm#euclid
- Tipton Airport ---- http://www.airfields-freeman.com/MD/Airfields_MD_Columbia.htm
- Breckenridge STOLport ---- http://www.airfields-freeman.com/co/Airfields_CO_SW.htm#breckenridge
- Middleborough, Massachusetts ---- http://www.airfields-freeman.com/ma/Airfields_MA_SE.htm#middleboro
- Mendon Airport ---- http://www.airfields-freeman.com/MA/Airfields_MA_Boston_C.html#mendon
- Sahuarita Air Force Range ---- http://www.airfields-freeman.com/AZ/Airfields_AZ_Tuscon_SE.html
- Horizon Airport (San Antonio) ---- http://www.airfields-freeman.com/tx/Airfields_TX_SanAntonioW.htm
- Mitchel Air Force Base ---- http://www.airfields-freeman.com/NY/Airfields_NY_LongIsC.htm
- Deer Park, New York ---- http://www.airfields-freeman.com/ny/Airfields_NY_LongIs_Suffolk_W.htm
- Bass Lake (Madera County, California) ---- http://www.airfields-freeman.com/ca/Airfields_CA_Fresno_N.htm#wishon
- List of airports in Connecticut ---- http://www.airfields-freeman.com/ct/airfields_ct_w.htm
- Shirley Airport ---- http://www.airfields-freeman.com/MA/Airfields_MA_Boston_C.html#shirley
{{collapse end}}
-- GreenC 19:44, 12 January 2025 (UTC)
:I fixed the remaining URLs. Cuba200611 (talk) 07:14, 19 February 2025 (UTC)
Enwiki
- Checked 674 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=airfields-freeman.com&max=500&server=enwiki&ns=None&enddate=20250111&startdate=20250114&nosect=on edited 664 pages]. Moved 841 links to a new URL: 841 ruled mapped redirects, Added 3 {{tld|dead link}}. Switched 62 {{para|url-status|dead}} to live. Added 25 archive URLs (24 Wayback). Changed 95 citation metadata.
{{tld|dead link}} verification
{{anchor|dead link verification}}
Every few years, I check all links marked with {{tlx|dead link}} and attempt to discover archive URLs for them. WaybackMedic has advanced search technology that can discover archives other tools miss. Typically about a 15% success rate. This is a large project and slow due to the number of articles, currently about 300,000. I do it in batches of 30-50 thousand articles, which takes a few days to process. The statistics/results for each batch will be posted in this section. This job consumes all bot resources, it will be on and off while doing other projects in between batches. -- GreenC 18:45, 10 January 2025 (UTC)
- Pass 1: Pages 1 to 50,000: Edited 8,200 pages. Added 8,734 archive URLs (5,346 Wayback) -- GreenC 01:33, 12 January 2025 (UTC)
- Pass 2: Pages 50,001 to 100,000: Edited 8,076 pages. Added 8,843 archive URLs (5,649 Wayback) -- GreenC 05:28, 17 January 2025 (UTC)
- Pass 3: Pages 100,001 to 150,000: Edited 8,019 pages. Added 8,466 archive URLs (5,281 Wayback) -- GreenC 02:44, 21 January 2025 (UTC)
- Pass 4: Pages 150,001 to 200,000: Edited 8,139 pages. Added 8,336 archive URLs (5,222 Wayback) -- GreenC 22:59, 23 January 2025 (UTC)
- Pass 5: Pages 200,001 to 250,000: Edited 8,281 pages. Added 8,789 archive URLs (5,714 Wayback) -- GreenC 06:30, 26 January 2025 (UTC)
- Pass 6: Pages 250,001 to 316,770: Edited 10,979 pages. Added 11,458 archive URLs (7,74 Wayback) -- GreenC 00:08, 30 January 2025 (UTC)
- :Looks like you've accidentally omitted a digit from {{tqq|(7,74 Wayback)}}. 2A00:807:D9:A2AF:29C5:73BC:FE8C:9F86 (talk) 21:49, 17 April 2025 (UTC)
Enwiki
- Checked 316,776 pages. Edited 51,694 pages. Added 54,626 archive URLs.
IABot DB
- Updated 40,586 unique URLs which propagate through 300+ wikis
Tornado History Project and crh.noaa.gov
= tornadohistoryproject.com =
Links to tornadohistoryproject.com (a widely used source run by the Storm Prediction Center less than a decade ago, especially on older articles written before 2015) now redirect to an unaffiliated third-party essay writing service. Links should be considered usurped and dead where no archive URL is available.
:Awaiting next batch at WP:JUDI -- GreenC 20:18, 1 February 2025 (UTC)
{{done}} in a WP:JUDI usurpation batch. -- GreenC 04:59, 16 February 2025 (UTC)
= crh.noaa.gov =
Links to crh.noaa.gov (individual National Weather Service WFO summaries for severe weather events) are dead, but many still remain online on the new weather.gov domain.
For instance, http://www.crh.noaa.gov/dlh/?n=1991halloweenblizzard can now be found at https://www.weather.gov/dlh/1991halloweenblizzard - the syntax is different but can be reasonably changed and the site contents are the same. I'm not sure why crh.noaa.gov isn't a redirect but regardless it's still used in a hell of a lot of weather articles and should be salvaged rather than just labeled as dead where possible. I think this also extends to other domains - if I'm not mistaken, crh is Central Region Headquarters, and there are likely others in the South and a few other parts of the country. Departure– (talk) 14:21, 30 January 2025 (UTC)
:Hi, User:Departure–, there are [https://en.wikipedia.org/w/index.php?search=insource%3Acrh.noaa.gov+insource%3A%2Fcrh.noaa.gov%5C%2F%5Ba-z%5D%7B3%7D%5C%2F%5B%3F%5Dn%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 247 pages] but a lot don't look like they map to the rule. I can try. Do you know of other domains? -- GreenC 22:16, 1 February 2025 (UTC)
::http://www.crh.noaa.gov/bou/?n=consec90 (cited on the page Colorado) on the CRH domain appears to map to https://www.weather.gov/bou/DenverSummerHeat if I'm not mistaken. However, this is just a contemporary equivalent and the original content of the site is not present, and I believe it has indeed been lost. I think a lot of the CRH links are dead but I know a lot of them can be recovered by transitioning to the weather.gov domains. Departure– (talk) 22:21, 1 February 2025 (UTC)
:::In other words, a lot of links do follow the rule but a lot don't as they have been replaced. From what I can tell, a lot of links have been standardized with no clear rule to find the new one. However, many non-climate stories are still online, such as http://www.crh.noaa.gov/ilx/?n=spi-tornado (cited in Springfield, Illinois) which can now be found at www.weather.gov/ilx/12mar06-tor2 - inputting the title from the cite template into https://search.usa.gov/search?v%3Aproject=firstgov&query=&affiliate=nws.noaa.gov can be used to recover links where the rule does'nt apply, assuming the site's content isn't lost. Departure– (talk) 22:25, 1 February 2025 (UTC)
::::This is tricky because if I replace a dead link with a new live link, but the live link has different content, the old dead link is lost we don't know what the original link was anymore. But if I keep the old dead link, plus add an archive URL, then the content is preserved. Thus the safer option is to add archives. Like the DenverSummerHeat example could be converted to [https://web.archive.org/web/20061001175248/http://www.crh.noaa.gov/bou/?n=consec90 this] which is serviceable. -- GreenC 01:30, 7 February 2025 (UTC)
nasportscar.com
silverscreen.in
It seems the site is slowly dying. [https://mobile.x.com/silverscreenin/status/1532587565517262848 They ceased publishing new content in June 2022] due to COVID-19, and although they said the site would still be accessible, [https://www.isitdownrightnow.com/silverscreenindia.com.html I don't know since when it hasn't been]. The site's domain was previously silverscreen.in, and that must be dealt with too. Kailash29792 (talk) 05:06, 8 February 2025 (UTC)
:According to the WaybackMachine it was last [https://web.archive.org/web/20240901000000*/silverscreenindia.com available on January 1 2025]. That's an ominous date, first of the year, suggesting cut off. But it may be too soon to determine, I've seen sites disappear then return months or years later. In the mean time we have dead links. It's only [https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Asilverscreenindia+insource%3A%2Fsilverscreenindia%5B.%5Dcom%2F&title=Special%3ASearch&ns0=1 247 pages]. There's nothing for the [https://en.wikipedia.org/w/index.php?search=insource%3Asilverscreenindia+insource%3A%2Fsilverscreenindia%5B.%5Din%2F&title=Special:Search&profile=advanced&fulltext=1&ns0=1 .in version]. Well, technically speaking I can move cites from live to dead, then dead to live again. Recommend treat it as a dead site now, and if returns to the living, reinstate it. -- GreenC 05:19, 8 February 2025 (UTC)
::It was active even in [https://archive.is/ADFQa mid-January]. Maybe I was unclear about the original domain. It was https://silverscreen.in Kailash29792 (talk) 05:57, 8 February 2025 (UTC)
:::[https://en.wikipedia.org/w/index.php?search=insource%3Asilverscreen+insource%3A%2Fsilverscreen%5B.%5Din%2F&title=Special:Search&profile=advanced&fulltext=1&ns0=1 silverscreen.in] has 364 pages. Do you want to do just that one for now, and check back on silverscreenindia.com later? -- GreenC 01:26, 9 February 2025 (UTC)
::::Yeah, tag 'em dead. [https://www.google.com/search?q=silverscreenindia.com&rlz=1C1APWK_enIN973IN977&oq=sil&gs_lcrp=EgZjaHJvbWUqCAgAEEUYJxg7MggIABBFGCcYOzIGCAEQRRhAMgYIAhBFGDkyBggDECMYJzIGCAQQRRg8MgYIBRBFGDwyBggGEEUYPDIGCAcQRRg90gEIMjc1MGoxajGoAgCwAgA&sourceid=chrome&ie=UTF-8 Silverscreenindia.com] continues to show G-hits, although the links are not accessible. Kailash29792 (talk) 04:05, 9 February 2025 (UTC)
Enwiki
- Checked 363 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=silverscreen.in&max=500&server=enwiki&ns=None&enddate=20250216&startdate=20250219&nosect=on edited 331 pages]. Added 62 {{tld|dead link}}. Switched 173 {{para|url-status|live}} to dead. Added 139 archive URLs (136 Wayback). Changed 8 citation metadata.
IABot DB
- Updated 536 unique URLs which propagate through 300+ wikis
heritage.org
Following an RfC, www.heritage.org has been blacklisted for being a cybersecurity risk. All URLs in citations should be archived and switched to url-status=unfit
so that users don't accidentally click malicious links. Nemo 10:00, 12 February 2025 (UTC)
:[https://en.wikipedia.org/w/index.php?search=insource%3Aheritage+insource%3A%2F%28%5B.%5D%7C%5C%2F%29heritage%5B.%5Dorg%5C%2F%2F&title=Special:Search&profile=advanced&fulltext=1&ns0=1 1,079 pages]. Will be changing to "unfit" status. -- GreenC 17:23, 16 February 2025 (UTC)
Enwiki
- Checked 1,079 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=heritage.org&max=500&server=enwiki&ns=None&enddate=20250215&startdate=20250218&nosect=on edited 1,065 pages]. Switched 1,430 to {{para|url-status|unfit}}. Added 779 archive URLs (739 Wayback).
:(fixed a couple dozen manually)
{{done}} -- GreenC 21:23, 16 February 2025 (UTC)
:Thanks! Nemo 17:05, 19 February 2025 (UTC)
usaid.gov
Blanked page. [https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Ausaid+insource%3A%2Fusaid%5B.%5Dgov%2F&title=Special%3ASearch&ns0=1 1,768 pages]. -- GreenC 16:31, 19 February 2025 (UTC)
Enwiki
- Checked 1,767 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=usaid.gov&max=500&server=enwiki&ns=None&enddate=20250218&startdate=20250221&nosect=on edited 1,347 pages]. Added 26 {{tld|dead link}}. Switched 137 {{para|url-status|live}} to dead. Added 1,511 archive URLs (1,472 Wayback). Changed 499 citation metadata.
milb.com
MiLB.com (not to be confused with MLB.com) and web.minorleaguebaseball.com both have URLs similar to this:
http://web.minorleaguebaseball.com/news/article.jsp?ymd=20080426&content_id=390538&vkey=news_t479&fext=.jsp&sid=t479
http://www.milb.com/news/article.jsp?ymd=20120729&content_id=35805102&fext=.jsp&vkey=news_milb
They all have some crazy scheme, but they can (almost always) be fixed by taking the content_id and doing this:
https://milb.com/news/c-390538 (archive of http://archive.today/4wXt verifies this)
http://www.milb.com/news/c-35805102 (archive of http://archive.today/wbw2q verifies this)
They usually expand into something like /news/gcs-id but then the URL works.
If these URLs could be parsed into just the content-id and checked, it could save a bunch of links. There are over [https://en.wikipedia.org/w/index.php?search=insource:%22http://www.milb.com/news/article.jsp%22&ns0=1 1,000 milb.com links] and [https://en.wikipedia.org/w/index.php?search=insource:%22http://web.minorleaguebaseball.com/news/article.jsp%22&title=Special:Search&profile=advanced&fulltext=1&ns0=1 158 minorleaguebaseball links]. Chew(V • T • E) 18:03, 24 February 2025 (UTC)
=minorleaguebaseball.com=
:[https://en.wikipedia.org/w/index.php?search=insource%3A%22minorleaguebaseball.com%22+insource%3A%22content_id%22+insource%3A%2Fweb%5B.%5Dminorleaguebaseball%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 375 pages]
:Enwiki
::Checked 395 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=minorleaguebaseball.com&max=500&server=enwiki&ns=None&enddate=20250227&startdate=20250302&nosect=on edited 345 pages]. Moved 351 links to a new URL: 344 ruled mapped redirects, 7 ghost mapped redirects, Resolved 71 soft-404s. Removed 3 {{tld|dead link}}. Added 26 {{tld|dead link}}. Switched 108 {{para|url-status|dead}} to live. Switched 9 {{para|url-status|live}} to dead. Added 131 archive URLs (105 Wayback). Changed 88 citation metadata.
=milb.com=
:[https://en.wikipedia.org/w/index.php?search=insource%3A%22milb.com%22+insource%3A%22content_id%22+insource%3A%2Fwww%5B.%5Dmilb%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 2,178 pages]
:Enwiki
::Checked 2,602 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=milb.com&max=500&server=enwiki&ns=None&enddate=20250228&startdate=20250303&nosect=on edited 2,427 pages]. Moved 3,624 links to a new URL: 446 normal redirects, 3,063 ruled mapped redirects, 115 ghost mapped redirects, Resolved 470 soft-404s. Removed 2 {{tld|dead link}}. Added 50 {{tld|dead link}}. Switched 215 {{para|url-status|dead}} to live. Switched 62 {{para|url-status|live}} to dead. Added 638 archive URLs (632 Wayback). Changed 3,528 citation metadata.
diehardgamefan.com
Looks like the website (
) for DieHard GameFan is dead (it has a WordPress error I've never seen before); [https://en.wikipedia.org/w/index.php?title=Special:Search&limit=500&offset=0&ns0=1&search=insource%3Adiehardgamefan.com 167 pages]. Sariel Xilo (talk) 22:25, 28 February 2025 (UTC)
Enwiki
- Checked 169 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=diehardgamefan&max=500&server=enwiki&ns=None&enddate=20250228&startdate=20250303&nosect=on edited 153 pages]. Switched 61 {{para|url-status|live}} to dead. Added 128 archive URLs (126 Wayback). Changed 23 citation metadata.
IABot DB
- Updated 161 links to propagate through 300+ wikis.
www.TheyPouredFire.com
www.TheyPouredFire.com
Usurped link 116.255.53.19 (talk) 14:32, 3 March 2025 (UTC)
:Ah The Lost Boys of Sudan. I read They Poured Fire on Us from the Sky back in 2005, it was unforgettable. Happy to help Alephonsion Deng, Benson Deng, and Benjamin Ajak. The site is still active at https://www.theypouredfirebooks.com .. the old link is only one page, I [https://en.wikipedia.org/w/index.php?title=Lost_Boys_of_Sudan&diff=1278617851&oldid=1276446290 fixed manually]. -- GreenC 15:16, 3 March 2025 (UTC)
fivethirtyeight.com
FiveThirtyEight was shutdown a few days ago; parts of the website (
) have an archive tag (ex: [https://fivethirtyeight.com/features/how-much-power-do-christians-really-have/] & [https://fivethirtyeight.com/features/is-your-dd-character-rare/]) & seem accessible if you have a direct link while other parts (ex: [https://projects.fivethirtyeight.com/] & [https://projects.fivethirtyeight.com/2024-election-forecast/pennsylvania/]) are redirecting to
. [https://en.wikipedia.org/w/index.php?search=insource%3Afivethirtyeight.com&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 Over 2000 pages]. Sariel Xilo (talk) 01:18, 9 March 2025 (UTC)
Enwiki
- Checked 1,867 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=fivethirtyeight.com&max=500&server=enwiki&ns=None&enddate=20250309&startdate=20250312&nosect=on edited 1,216 pages]. Moved 313 links to a new URL: 13 normal redirects, 296 ruled mapped redirects, 4 ghost mapped redirects, Added 5 {{tld|dead link}}. Switched 5 {{para|url-status|dead}} to live. Switched 217 {{para|url-status|live}} to dead. Added 1,393 archive URLs (1,264 Wayback). Changed 122 citation metadata.
IABot DB
- Updated about 1,000 URLs which propagate through 300+ wikis
theaustralian.news.com.au
These old links do not work, such as t[http://www.theaustralian.news.com.au/story/0,,24284426-5005200,00.html this link] at Marampa. Unfortunately, it does not convert to a new link at theaustralian.com.au/. I also could not find a new link with Jack 2. [https://en.wikipedia.org/w/index.php?search=insource%3A%22http%3A%2F%2Fwww.theaustralian.news.com.au%2F%22&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&ns6=1 ~1,600 pages] (some of which have been archived already). Thank you! MrLinkinPark333 (talk) 16:47, 11 March 2025 (UTC)
Enwiki
- Checked 1,709 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=theaustralian.news.com.au&max=500&server=enwiki&ns=None&enddate=20250311&startdate=20250314&nosect=on edited 638 pages]. Added 196 {{tld|dead link}}. Switched 28 {{para|url-status|live}} to dead. Added 411 archive URLs (222 Wayback). Changed 142 citation metadata.
IABot DB
- Updated about 2,700 unique URLs which propagate through 300+ wikis
itunes.apple.com
[https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Aitunes+insource%3A%2Fitunes%5B.%5Dapple%5B.%5Dcom%2F&title=Special%3ASearch&ns0=1 30,121 pages]. A general sweep looking for dead links and redirects. -- GreenC 16:28, 13 March 2025 (UTC)
Enwiki
: {{underline|Pass 1}} (00001-00500): Checked 500 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=itunes.apple.com&max=500&server=enwiki&ns=None&enddate=20250312&startdate=20250315&nosect=on edited 473 pages]. Moved 919 links to a new URL: 917 normal redirects, 2 ruled mapped redirects, Added 130 {{tld|dead link}}. Switched 18 {{para|url-status|dead}} to live. Switched 63 {{para|url-status|live}} to dead. Added 388 archive URLs (386 Wayback). Changed 109 citation metadata.
: {{underline|Pass 2}} (00501-08000): Checked 7,504 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=itunes.apple.com&max=500&server=enwiki&ns=None&enddate=20250313&startdate=20250316&nosect=on edited 6,939 pages]. Moved 9,327 links to a new URL: 9,307 normal redirects, 20 ruled mapped redirects, Removed 2 {{tld|dead link}}. Added 749 {{tld|dead link}}. Switched 252 {{para|url-status|dead}} to live. Switched 609 {{para|url-status|live}} to dead. Added 4,058 archive URLs (4,025 Wayback). Changed 1,647 citation metadata.
: {{underline|Pass 3}} (08001-16000): Checked 8,003 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=itunes.apple.com&max=500&server=enwiki&ns=None&enddate=20250314&startdate=20250317&nosect=on edited 7,373 pages]. Moved 9,888 links to a new URL: 9,871 normal redirects, 17 ruled mapped redirects, Removed 2 {{tld|dead link}}. Added 1,075 {{tld|dead link}}. Switched 235 {{para|url-status|dead}} to live. Switched 553 {{para|url-status|live}} to dead. Added 4,412 archive URLs (4,379 Wayback). Changed 1,392 citation metadata.
: {{underline|Pass 4}} (16001-24000): Checked 8,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=itunes.apple.com&max=500&server=enwiki&ns=None&enddate=20250314&startdate=20250317&nosect=on edited 7,439 pages]. Moved 10,169 links to a new URL: 10,151 normal redirects, 18 ruled mapped redirects, Removed 1 {{tld|dead link}}. Added 1,026 {{tld|dead link}}. Switched 191 {{para|url-status|dead}} to live. Switched 615 {{para|url-status|live}} to dead. Added 4,683 archive URLs (4,635 Wayback). Changed 1,538 citation metadata.
: {{underline|Pass 5}} (24001-30121): Checked 6,123 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=itunes.apple.com&max=500&server=enwiki&ns=None&enddate=20250315&startdate=20250318&nosect=on edited 5,667 pages]. Moved 7,681 links to a new URL: 7,669 normal redirects, 12 ruled mapped redirects, Removed 3 {{tld|dead link}}. Added 734 {{tld|dead link}}. Switched 180 {{para|url-status|dead}} to live. Switched 522 {{para|url-status|live}} to dead. Added 3,526 archive URLs (3,487 Wayback). Changed 1,227 citation metadata.
:Note: Many of the links marked dead or with archives added are in fact available on the live web at a different URL, the redirects only need to be mapped. The mapping can be done by using the Apple search engine. However it is not precise, and would be difficult. A project for another day. -- GreenC 19:21, 19 March 2025 (UTC)
IABot DB
uefa.com
24,087 pages. -- GreenC 00:10, 20 March 2025 (UTC)
Enwiki
: {{underline|Pass 1}} (00000-01000): Checked 1,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=uefa.com&max=500&server=enwiki&ns=None&enddate=20250319&startdate=20250322&nosect=on edited 703 pages]. Moved 1,477 links to a new URL: 360 normal redirects, 1,092 ruled mapped redirects, 25 ghost mapped redirects, Resolved 219 soft-404s. Removed 1 {{tld|dead link}}. Added 67 {{tld|dead link}}. Switched 30 {{para|url-status|dead}} to live. Switched 45 {{para|url-status|live}} to dead. Added 462 archive URLs (351 Wayback). Changed 1,600 citation metadata.
: {{underline|Pass 2}} (01001-06000): Checked 5,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=uefa.com&max=500&server=enwiki&ns=None&enddate=20250319&startdate=20250322&nosect=on edited 3,424 pages]. Moved 8,108 links to a new URL: 1,654 normal redirects, 6,297 ruled mapped redirects, 157 ghost mapped redirects, Resolved 1,388 soft-404s. Removed 4 {{tld|dead link}}. Added 393 {{tld|dead link}}. Switched 104 {{para|url-status|dead}} to live. Switched 234 {{para|url-status|live}} to dead. Added 2,857 archive URLs (2,430 Wayback). Changed 5,553 citation metadata.
: {{underline|Pass 3}} (06001-12000): Checked 6,001 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=uefa.com&max=500&server=enwiki&ns=None&enddate=20250320&startdate=20250323&nosect=on edited 4,097 pages]. Moved 9,927 links to a new URL: 2,011 normal redirects, 7,740 ruled mapped redirects, 176 ghost mapped redirects, Resolved 1,910 soft-404s. Removed 1 {{tld|dead link}}. Added 525 {{tld|dead link}}. Switched 146 {{para|url-status|dead}} to live. Switched 269 {{para|url-status|live}} to dead. Added 3,078 archive URLs (2,588 Wayback). Changed 5,970 citation metadata.
: {{underline|Pass 4}} (12001-18000): Checked 6,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=uefa.com&max=500&server=enwiki&ns=None&enddate=20250321&startdate=20250324&nosect=on edited 4,070 pages]. Moved 9,681 links to a new URL: 2,069 normal redirects, 7,340 ruled mapped redirects, 272 ghost mapped redirects, Resolved 2,228 soft-404s. Removed 5 {{tld|dead link}}. Added 560 {{tld|dead link}}. Switched 114 {{para|url-status|dead}} to live. Switched 373 {{para|url-status|live}} to dead. Added 3,157 archive URLs (2,717 Wayback). Changed 5,409 citation metadata.
: {{underline|Pass 5}} (18001-24087): Checked 6,089 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=uefa.com&max=500&server=enwiki&ns=None&enddate=20250322&startdate=20250325&nosect=on edited 4,178 pages]. Moved 10,300 links to a new URL: 1,862 normal redirects, 8,224 ruled mapped redirects, 214 ghost mapped redirects, Resolved 2,386 soft-404s. Removed 7 {{tld|dead link}}. Added 524 {{tld|dead link}}. Switched 147 {{para|url-status|dead}} to live. Switched 321 {{para|url-status|live}} to dead. Added 3,571 archive URLs (3,024 Wayback). Changed 6,115 citation metadata.
IABot DB
af.mil
The [https://www.cnn.com/2025/03/19/politics/pentagon-website-purge/index.html Pentagon needs assistance]. About [https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Aaf.mil+insource%3A%2F%28%5B.%5D%7C%5C%2F%29af%5B.%5Dmil%2F&title=Special%3ASearch&ns0=1 9,000 pages]. -- GreenC 03:51, 20 March 2025 (UTC)
Enwiki
- Checked 9,395 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=af.mil&max=500&server=enwiki&ns=None&enddate=20250330&startdate=20250402&nosect=on edited 5,248 pages]. Moved 9,180 links to a new URL: 2,461 normal redirects, 6,319 ruled mapped redirects, 400 ghost mapped redirects, Resolved 1,374 soft-404s. Removed 20 {{tld|dead link}}. Added 669 {{tld|dead link}}. Switched 1,060 {{para|url-status|dead}} to live. Switched 288 {{para|url-status|live}} to dead. Added 1,713 archive URLs (1,486 Wayback). Changed 15 citation metadata.
IABot DB
- Checked about 24,000 unique URLs. Updated about 10,000.
dn.se
First time here. While editing I found the dead link http://www.dn.se/sport/fotboll/lennart-johanssons-pokal-opererad-1.1228512 could be replaced by https://www.dn.se/sport/fotboll/lennart-johanssons-pokal-opererad/. [https://en.wikipedia.org/w/index.php?title=Special:LinkSearch&limit=500&offset=0&target=http%3A%2F%2Fwww.dn.se There looks] to be at least some 100 of these where the -1.(six or seven digits)
may be removed to create functional links. https possible. Kaffet i halsen (talk) 20:01, 20 March 2025 (UTC)
:[https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Adn.se+insource%3A%2F%28%5B.%5D%7C%5C%2F%29dn%5B.%5Dse%2F&title=Special%3ASearch&ns0=1 3,100 pages]. -- GreenC 03:37, 31 March 2025 (UTC)
Enwiki
- Checked 3,097 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=dn.se&max=500&server=enwiki&ns=None&enddate=20250331&startdate=20250403&nosect=on edited 1,934 pages]. Moved 1,878 links to a new URL: 13 normal redirects, 1,855 ruled mapped redirects, 10 ghost mapped redirects, Resolved 5 soft-404s. Removed 9 {{tld|dead link}}. Added 75 {{tld|dead link}}. Switched 187 {{para|url-status|dead}} to live. Switched 35 {{para|url-status|live}} to dead. Added 221 archive URLs (157 Wayback). Changed 653 citation metadata.
:::Of the 1,855 ruled mapped redirects 346 were of the "-1.xxx" variety. The rest were conversion to https.
usip.org
United States Institute of Peace was [https://www.washingtonpost.com/national-security/2025/03/18/doge-institute-of-peace-takeover-musk-trump/ forceably shutdown] by DOGE with help from DC Capital Police (guns involved by not fired). Website was shut down. [https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Ausip+insource%3A%2Fusip%5B.%5Dorg%2F&title=Special%3ASearch&ns0=1 933 pages]. -- GreenC 19:16, 21 March 2025 (UTC)
Enwiki
- Checked 936 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=usip.org&max=500&server=enwiki&ns=None&enddate=20250331&startdate=20250403&nosect=on edited 758 pages]. Added 52 {{tld|dead link}}. Switched 158 {{para|url-status|live}} to dead. Added 681 archive URLs (599 Wayback). Changed 114 citation metadata.
IABot DB
- Checked about 1,100 unique links and updated about the same. Changed domain status {{underline|permadead}}.
US agencies slated for shutdown or dismantling
{{Pin message|}}{{User:ClueBot III/DoNotArchiveUntil|2032742630}}
- U.S. Agency for Global Media usagm.gov
- Federal Mediation and Conciliation Service (United States) fmcs.gov
- Woodrow Wilson International Center for Scholars wilsoncenter.org
- Institute of Museum and Library Services imls.gov
- U.S. Interagency Council on Homelessness usich.gov
- Community Development Financial Institutions Fund cdfifund.gov
- Minority Business Development Agency mbda.gov
- Food and Drug Administration fda.gov says "On Oct. 1, 2024, the FDA began implementing a reorganization impacting many parts of the agency. We are in the process of updating FDA.gov content to reflect these changes". Also the FDA library has been shutdown.
- National Endowment for the Humanities (eg. Jamestown, VA, The Civil War, Library of America) - [https://www.nytimes.com/2025/04/03/arts/humanities-grants-canceled-doge.html at risk]
- National Park Service nps.gov - many DEI deletions
- AmeriCorps - staff layoffs 75%
GreenC 21:24, 21 March 2025 (UTC)
{{awaiting}} further developments and time to go through them -- GreenC 16:46, 1 April 2025 (UTC)
90 agencies identified as having web pages deleted during the Trump admin: https://asia.nikkei.com/static/vdata/infographics/deleted-website/
White House
Department of Health and Human Services
Department of Agriculture
USAID
National Park Service
worker.gov
Department of Labor
Federal Highway Administration
Environmental Protection Agency
Department of Housing and Urban Development
Centers for Disease Control and Prevention
Department of Transportation
Federal Emergency Management Agency
National Institutes of Health
General Services Administration
Department of Homeland Security
Department of Commerce
employer.gov
Office of the Assistant Secretary for Health
sftool.gov
Department of Energy
Department of the Interior
Department of Education
NOAA
Substance Abuse and Mental Health Services Administration
climate.gov
Department of Defense
Health Resources & Services Administration
AbilityOne Commission
Department of State
United States Patent and Trademark Office
BOEM
The Census Bureau
CISA
HUD User
MILLENNIUM CHALLENGE CORPORATION
performance.gov
National Archives and Records Administration
Bureau of Safety and Environmental Enforcement
Federal Aviation Administration
Food and Drug Administration
House of Representatives
Department of Justice
National Endowment for the Humanities
Department of the Treasury
youth.gov
American Climate Corps
Federal Trade Commission
Global Change Research Program
NASA
Administration for Community Living
National Endowment for the Arts
ATF
Bureau of Indian Affairs
Customs and Border Protection
Consumer Financial Protection Bureau
Consumer Product Safety Commission
Office of the Director of National Intelligence
Economic Development Administration
Equal Employment Opportunity Commission
Export-Import Bank of the United States
FBI
Federal Committee on Statistical Methodology
Federal Housing Finance Agency
geoplatform.gov
Assistant Secretary for Technology Policy
IRS
National Labor Relations Board
Office of Personnel Management
Department of Veterans Affairs
American Battle Monuments Commission
Agency for Healthcare Research and Quality
americorps.gov
Advanced Research Projects Agency for Health
Bonneville Power Administration
cms.gov
congress.gov
digital.gov
ENERGY STAR
ej.gov
farmers.gov
medicalcountermeasures.gov
peacecorps.gov
Securities and Exchange Commission
Social Security Administration
stopbullying.gov
Citizenship and Immigration Services
United States Interagency Council on Homelessness
workcenter.gov
Air Force
Marine Corps
redimps.co.uk
not entirely sure how to format this request but this domain is no longer active, and used on [https://en.wikipedia.org/w/index.php?search=insource%3Aredimps.co.uk+insource%3A%2Fredimps%5B.%5Dco%5B.%5Duk%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&searchToken=a6ev15nxfz6si8ocdtqolpk26 234 pages]. pages live at new domain weareimps.com, with a link such as https://www.redimps.co.uk/news/2021/january/210106-tommy-docherty/ salvageable by changing the domain ([https://www.weareimps.com/news/2021/january/210106-tommy-docherty/]). links ending .aspx cannot be saved by changing domain - [http://www.redimps.co.uk/news/article/-fred-middleton-1930-2016-3057044.aspx] is still live at [https://www.weareimps.com/news/2016/april/-fred-middleton-1930-2016] but no way to arrive at this by just changing the domain. Microwave Anarchist (talk) 01:46, 22 March 2025 (UTC)
:Can do this. Anything that can't migrate will convert to archive URL. Have a few to get through above first. -- GreenC 03:06, 22 March 2025 (UTC)
Enwiki
- Checked 235 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=redimps.co.uk&max=500&server=enwiki&ns=None&enddate=20250331&startdate=20250403&nosect=on edited 91 pages]. Moved 105 links to a new URL: 105 ruled mapped redirects, Added 8 {{tld|dead link}}. Switched 3 {{para|url-status|dead}} to live. Switched 2 {{para|url-status|live}} to dead. Added 37 archive URLs (34 Wayback). Changed 2 citation metadata.
::The 105 moved links are redimps.co.uk --> weareimps.com
- Pass 2: Checked 235 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=redimps.co.uk&max=500&server=enwiki&ns=None&enddate=20250331&startdate=20250403&nosect=on edited 63 pages]. Added 33 {{tld|dead link}}. Switched 2 {{para|url-status|live}} to dead. Added 94 archive URLs (77 Wayback). Changed 1 citation metadata.
::Realized the 404s were returning 200 (soft-404). Re-ran it adding more archives.
IABot DB
- Checked about 800 unique links and updated. Set domain status to {{underline|permadead}}
thetimes.co.uk
The Times migrated from thetimes.co.uk to thetimes.com in 2024. There are instances of URLs across Wikipedia that reference the .co.uk domain. For example, the page for Russell Brand references this [https://www.thetimes.co.uk/article/russell-brand-investigation-sunday-times-video-watch-latest-news-x33ss0kmk URL]. There are over 22,000 links discovered [https://en.wikipedia.org/w/index.php?search=insource%3Athetimes+insource%3A%2Fthetimes%5B.%5Dco%5B.%5Duk%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 here]. Lukethetimes (talk) 11:29, 24 March 2025 (UTC)
:This site is disorderly:
:* thetimes.co.uk
:* timesplus.co.uk
:* timesonline.co.uk
:* thesundaytimes.co.uk
:* thetimes.com
:There are probably others. I worked on it 6 months ago Wikipedia:Link_rot/URL_change_requests/Archives/2024/October but clearly more to go. It had a ton of soft-404 problems making it challenging to differentiate between live and dead links. Maybe I'll just focus on thetimes.co.uk and see what happens. -- GreenC 18:59, 1 April 2025 (UTC)
::Thank you so much for picking these up. I had one query to check - I can see the URLs tackled in [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=thetimes.co.uk&max=500&server=enwiki&ns=None&enddate=20250331&startdate=20250403&nosect=on this list] successfully redirect to the .com domain, however don't pick up our subfolders. For example, in the references for the Woken page:
::Corr, Julieanne. [https://www.thetimes.com/article/irish-beach-the-scene-for-new-feature-film-cs7hxkrn0 "Irish beach the scene for new feature film"]. The Times.
::The pathway of the referenced URL is .com/article. However, the canonical URL is on the pathway .com/world/ireland-world/article.
::Please would it be possible for the canonical URL paths to be added? Thank you, I appreciate that's an added step! Lukethetimes (talk) 08:16, 2 April 2025 (UTC)
:::User:Lukethetimes: The step is built-in to the boilerplate code. It follows redirects to the final destination. I just re-ran Woken a couple times, and am getting inconsistent header results. Sometimes it returns 301 with a new Location, and sometimes a plain 200. In the 301 header there is
while in the 200 header
. I'll send you the full headers if you like. -- GreenC 15:10, 2 April 2025 (UTC)
::::Sorry I became confused by the data, the 200 header is actually of the canonical URL, which is correctly 200. Anyway, I don't understand why it didn't follow the redirect because when I try it now it is following to the canonical URL. Did you make any changes on your side, in the past 12 hours or so? -- GreenC 15:23, 2 April 2025 (UTC)
::::It looks like possibly an intermittent problem. After processing all links, there are about 8,000 with ".com/article" .. the rest are the canonical. I suspect header data might be inconsistent. -- GreenC 15:31, 2 April 2025 (UTC)
:::::I'm going to finish uploading the rest. In the future can redo the .com URLs looking for redirects and changing them again during another phase. Hopefully by then this problem will have resolved, it looks like a problem with CloudFlare or Times sending incorrect headers, but inconsistently. -- GreenC 02:18, 3 April 2025 (UTC)
::::::Thanks so much for getting back to me so quickly. That's interesting there's inconsistency with incorrect headers, I'll take this away on my side. Appreciate your help with this! Lukethetimes (talk) 09:23, 3 April 2025 (UTC)
::This is appearing easier than timesonline.co.uk (6 months ago) because redirects are still live for thetimes.co.uk
Enwiki
:{{underline|Pass 1}} (00001-01000): Checked 1,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=thetimes.co.uk&max=500&server=enwiki&ns=None&enddate=20250331&startdate=20250403&nosect=on edited 995 pages]. Moved 1,300 links to a new URL: 50 normal redirects, 1,246 ruled mapped redirects, 4 ghost mapped redirects, Resolved 16 soft-404s. Added 5 {{tld|dead link}}. Switched 41 {{para|url-status|dead}} to live. Switched 2 {{para|url-status|live}} to dead. Added 31 archive URLs (17 Wayback). Changed 42 citation metadata.
:{{underline|Pass 2}} (01000-10000): Checked 9,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=thetimes.co.uk&max=500&server=enwiki&ns=None&enddate=20250331&startdate=20250403&nosect=on edited 8,974 pages]. Moved 11,640 links to a new URL: 426 normal redirects, 11,098 ruled mapped redirects, 116 ghost mapped redirects, Resolved 155 soft-404s. Removed 3 {{tld|dead link}}. Added 62 {{tld|dead link}}. Switched 330 {{para|url-status|dead}} to live. Switched 27 {{para|url-status|live}} to dead. Added 182 archive URLs (112 Wayback). Changed 4,939 citation metadata.
:{{underline|Pass 3}} (10001-22277): Checked 12,277 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=thetimes.co.uk&max=500&server=enwiki&ns=None&enddate=20250401&startdate=20250404&nosect=on edited 12,232 pages]. Moved 15,986 links to a new URL: 622 normal redirects, 15,219 ruled mapped redirects, 145 ghost mapped redirects, Resolved 231 soft-404s. Added 98 {{tld|dead link}}. Switched 469 {{para|url-status|dead}} to live. Switched 35 {{para|url-status|live}} to dead. Added 325 archive URLs (216 Wayback). Changed 7,469 citation metadata.
IABot DB
- Uploaded the apx 570 archive links identified above. Not confident to process the entire domain until the intermittency problem is understood.
m.mlb.com
Back again. Every single one of these links to m.mlb.com ALWAYS gets redirected to mlb.com, path ignored, making links useless.
Some things to help with:
- News articles.
- Prior they look like this: http://m.mlb.com/news/article/155452870/top-facts-about-mets-royals-world-series (the part after the ID may be omitted)
- If you take that ID and do this: http://mlb.com/news/c-155452870
- It gets expanded into this: https://www.mlb.com/news/top-facts-about-mets-royals-world-series/c-155452870
- Gameday links.
- Prior they look like this: https://m.mlb.com/gameday/mets-vs-cardinals/2016/08/23/448740
- These can just safely be replaced with mlb.com, removing the m.
- They would just get turned into this: https://mlb.com/gameday/mets-vs-cardinals/2016/08/23/448740
- MLB Cutfour/4
- Prior they look like this: http://m.mlb.com/cutfour/2015/05/06/122710430/did-you-know-that-george-clooney-tried-out-for-the-reds-in-1977 (the ending part after the ID may be omitted)
- Like with news, but keep the cutfour and change it to cut4, and get the ID from the middle (after date): https://www.mlb.com/cut4/c-122710430
- Gets expanded into this: https://www.mlb.com/cut4/did-you-know-that-george-clooney-tried-out-for-the-reds-in-1977/c-122710430
- Player links.
- They look like this: http://m.mlb.com/player/645277/ozzie-albies (the /ozzie etc part at the end may be omitted, you only need the 6 digit ID)
- They can just be replaced with https://www.mlb.com/player/645277 (or if you remove the m, https://mlb.com/player/645277/ozzie-albies)
- Regardless, they will expand into https://www.mlb.com/player/ozzie-albies-645277
- Anything else, just remove the m. and see what happens. If it's a redirect to mlb.com with nothing else, mark it as dead. If it redirects somewhere useful (if you can even check), or it doesn't redirect at all, it should be okay to just replace without the m (for example, http://m.mlb.com/glossary/rules/balk turns into https://mlb.com/glossary/rules/balk with no issue). Some of the above things, especially 1 and 3, might end up as dead, unfortunately (like http://m.mlb.com/news/article/83873114/ten-facts-for-100th-anniversary-of-babe-ruths-debut would turn into http://mlb.com/news/c-83873114 but this is dead). Players and gameday should never come up as dead, as all these IDs should still work.
This would rescue a bunch of links, there are 2,048 m.mlb.com links floating around and saving even half of them would be a huge help. Chew(V • T • E) 21:17, 25 March 2025 (UTC)
:Thanks for laying out clear rules that makes it easier to setup. I think most of them are resolved one way or another. There are [https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Am.mlb.com+insource%3A%2Fm%5B.%5Dmlb%5B.%5Dcom%2F&title=Special%3ASearch&ns0=1 1,200] remaining but many of those converted to www.mlb.com, they just have old archive URLs showing up in search. -- GreenC 02:18, 4 April 2025 (UTC)
Enwiki
- Checked 2,053 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=m.mlb.com&max=500&server=enwiki&ns=None&enddate=20250402&startdate=20250405&nosect=on edited 1,871 pages]. Moved 3,930 links to a new URL: 11 normal redirects, 3,837 ruled mapped redirects, 82 ghost mapped redirects, Resolved 1,689 soft-404s. Removed 45 {{tld|dead link}}. Added 46 {{tld|dead link}}. Switched 1,212 {{para|url-status|dead}} to live. Switched 47 {{para|url-status|live}} to dead. Added 454 archive URLs (448 Wayback). Changed 133 citation metadata.
IABot DB
- Checked and updated about 10,000 unique URLs
zap2it.com
Site moved to a new provider. More details at Wikipedia_talk:WikiProject_Television#Possible_Zap2it_closure_-_what_to_do?. Over 11,000 pages. -- GreenC 22:48, 26 March 2025 (UTC)
Enwiki
- Checked 11,536 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=zap2it.com&max=500&server=enwiki&ns=None&enddate=20250403&startdate=20250406&nosect=on edited 3,134 pages]. Moved 785 links to a new URL: 4 normal redirects, 781 ruled mapped redirects, Resolved 384 soft-404s. Added 856 {{tld|dead link}}. Switched 1 {{para|url-status|dead}} to live. Switched 570 {{para|url-status|live}} to dead. Added 1,310 archive URLs (927 Wayback).
IABot DB
- The database has about 30,000 unique URLs. I uploaded about 5,000 changes set the domain globally to {{underline|permadead}}
gameinformer.com
The site has been [https://www.gameinformer.com/letter-from-the-editor/2025/03/25/game-informer-is-back revived] following the site's shutdown in August 2024. A request was made then (here), and I believe that should now be essentially reversed, where all uses of https://www.gameinformer.com
should be adjusted from {{para|url-status|dead}} to {{para|url-status|live}}. I've picked a very, very small handful of links in use on articles to check original urls and they all seem good and active. - Favre1fan93 (talk) 00:16, 27 March 2025 (UTC)
:I'll do a "makelive" which blindly converts everything to live status and removes {{tld|dead link}} (keeping any archive URL). Then I'll do a second pass looking for legitimate dead links and convert those to dead status. It's churn, but that's how the bot works it needs two passes. Hopefully not too many dead links. [https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Agameinformer+insource%3A%2Fgameinformer%5B.%5Dcom%2F&title=Special%3ASearch&ns0=1 6,500 pages]. -- GreenC 01:00, 5 April 2025 (UTC)
:: Thank you! - Favre1fan93 (talk) 15:42, 5 April 2025 (UTC)
:::There are some ancient links that were already dead when the site was disestablished. These area accidentally being marked as live, e.g. [https://en.wikipedia.org/w/index.php?title=Tonic_Trouble&diff=1284039027&oldid=1269297113 here]. IceWelder [✉] 18:31, 5 April 2025 (UTC)
::::They are currently being repaired in "Pass 2" (below). I first changed all links to live, in Pass 1, and in Pass 2 recheck them and set to dead as needed. Turned out to be about 2,600 dead links. -- GreenC 18:35, 5 April 2025 (UTC)
:::::I think Pass 2 is not capturing them correctly. In [https://en.wikipedia.org/w/index.php?title=Rayman_2:_The_Great_Escape&diff=prev&oldid=1284121217 this case], it https-ified the broken link. IceWelder [✉] 18:52, 5 April 2025 (UTC)
::::::User:IceWelder, thanks for this. [https://www.gameinformer.com/reviews/review_detail.cfm?ITEM_ID=4321] returns header status 200 thus considered live by the bot. The page has a "Crunchy-404" appearance ie. it has some content but is missing the actual review. It even gives a title, author and date. But wrong information. I can reprocess everything in Pass 3 and check the page for the string "Sorry, no reviews are available for this platform." - If you find any more like this let me know I will add additional rules. -- GreenC 19:08, 5 April 2025 (UTC)
:::::::Unfortunately, [https://en.wikipedia.org/w/index.php?title=Tonic_Trouble&diff=prev&oldid=1284173283 not working] so far (assuming this was pass 3). IceWelder [✉] 18:27, 6 April 2025 (UTC)
Enwiki
- Pass 1: Checked 6,565 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=gameinformer.com&max=500&server=enwiki&ns=None&enddate=20250403&startdate=20250406&nosect=on edited 6,414 pages]. Removed 31 {{tld|dead link}}. Switched 9,914 {{para|url-status|dead}} to live.
::Note: the stats don't report everything, like removing archive URLs from square links etc
- Pass 2: Checked 6,565 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=gameinformer.com&max=500&server=enwiki&ns=None&enddate=20250404&startdate=20250407&nosect=on edited 4,260 pages]. Moved 3,598 links to a new URL: 139 normal redirects, 3,459 ruled mapped redirects, Resolved 318 soft-404s. Added 31 {{tld|dead link}}. Switched 1 {{para|url-status|dead}} to live. Switched 2,579 {{para|url-status|live}} to dead. Added 37 archive URLs (28 Wayback). Changed 35 citation metadata.
::Note: Over 2,600 dead links discovered. The 3,459 ruled mapped redirects is mostly http -> https
- Pass 3: Checked 6,565 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=gameinformer.com&max=500&server=enwiki&ns=None&enddate=20250404&startdate=20250407&nosect=on edited 296 pages]. Moved 61 links to a new URL: 61 normal redirects, Resolved 609 soft-404s. Added 31 {{tld|dead link}}. Switched 3 {{para|url-status|dead}} to live. Switched 317 {{para|url-status|live}} to dead. Added 2 archive URLs (2 Wayback).
::Note: Found another 319 dead links that are soft/crunchy 404s per above
IABot DB
- Checked about 12,000 URLs
nakedsecurity.sophos.com
Entire domain https://nakedsecurity.sophos.com is dead. URLs can sometimes be salvaged by changing the domain to news.sophos.com leaving the path unchanged (i.e https://nakedsecurity.sophos.com/en-us/2016/08/18/nists-new-password-rules-what-you-need-to-know/ -> https://news.sophos.com/en-us/2016/08/18/nists-new-password-rules-what-you-need-to-know/ ) * Pppery * it has begun... 16:09, 29 March 2025 (UTC)
:[https://en.wikipedia.org/w/index.php?search=insource%3Anakedsecurity+insource%3A%2Fnakedsecurity%5B.%5Dsophos%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 154 pages] -- GreenC 18:56, 5 April 2025 (UTC)
Enwiki
- Checked 155 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=nakedsecurity.sophos.com&max=500&server=enwiki&ns=None&enddate=20250405&startdate=20250408&nosect=on edited 148 pages]. Moved 78 links to a new URL: 1 normal redirects, 77 ruled mapped redirects, Resolved 155 soft-404s. Added 2 {{tld|dead link}}. Switched 7 {{para|url-status|dead}} to live. Switched 20 {{para|url-status|live}} to dead. Added 82 archive URLs (67 Wayback).
:::Summary: made 78 live to "news". 104 are dead.
IABot DB
- Processed 284 unique links
abc.net.au/news/stories
[http://www.abc.net.au/news/stories/2007/09/14/2032596.htm This] is now [https://www.abc.net.au/news/2007-09-14/reaction-time-climate-change-and-the-nuclear-option/669872 here] for Reaction Time (book), but unfortunately does not redirect. It also can't be converted to the new URL as the id does not match. [https://en.wikipedia.org/w/index.php?search=insource%3A+%22www.abc.net.au%2Fnews%2Fstories%22&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 ~3400 articles]. Thank you! MrLinkinPark333 (talk) 23:57, 29 March 2025 (UTC)
Enwiki
- Checked 3,480 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=abc.net.au&max=500&server=enwiki&ns=None&enddate=20250405&startdate=20250408&nosect=on edited 3,322 pages]. Added 31 {{tld|dead link}}. Switched 959 {{para|url-status|live}} to dead. Added 3,734 archive URLs (2,604 Wayback).
IABot DB
- Processed and updated 5,500 unique links
wayback.archive.org
Found [https://en.wikipedia.org/w/index.php?search=insource%3A+%22wayback.archive.org%22&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 ~50 articles] using wayback.archive.org. Thought I mention it here cause of WaybackMedic. Also, for some reason Rich Harden and Truce of Altmark have archive.today links that archived wayback.archive.org links. This does make me wonder if there are any other archive.org links that have archive.today in the URL. Haven't figured out the syntax for that. MrLinkinPark333 (talk) 00:09, 30 March 2025 (UTC)
:I wrote a program that runs automatically every 2 weeks converting non-standard Wayback domains like this one to web.archive.org. It won't convert strings outside a URL, URLs with a timestamp with a "*", URLs in a wiki comment, etc.. so that is probably most of it, various edge cases that would be better done by hand. Not too concerned with the double archives at Archive.today they exist but in small numbers. WaybackMedic can unwind double/triple archives when encountered. -- GreenC 18:52, 5 April 2025 (UTC)
thepogg.com
The website redirects to a different site. Thanks, David O. Johnson (talk) 21:07, 31 March 2025 (UTC)
:{{done}} in WP:JUDI batch #27 -- GreenC 15:23, 5 May 2025 (UTC)
www.olympic.org.nz
Most of them links have new locations at https://olympic.org.nz/ while some need archives
Games
- [http://www.olympic.org.nz/GamesProfile.aspx?Print=&function=2&GamesID=27 This] is now [https://olympic.org.nz/games/moscow-1980 here] for 1980 Summer Olympics (/games/location-year)
- Similarly, Most links with /games/ can be converted. [http://www.olympic.org.nz/games/edinburgh-1986/resulttable/23647 This] is now [https://olympic.org.nz/games/edinburgh-1986 here] for Shane O'Brien (rower) while [http://www.olympic.org.nz/games/buenos-aires-2018/athletes this] is now [https://olympic.org.nz/games/buenos-aires-2018 here] for New Zealand at the 2018 Summer Youth Olympics.
- However, this [http://www.olympic.org.nz/games/australian-youth-olympic-festival-2009/resulttable/23752 link] for Sam Webster (cyclist) doesnt have a replacement.
Athletes
- [http://www.olympic.org.nz/Article.aspx?Mode=1&ID=6019 This] is now [https://olympic.org.nz/athletes/ryan-nelsen here] for Ryan Nelsen (/athletes/firstname-lastname)
- Similarly, [http://www.olympic.org.nz/nzolympic/athlete/greg-henderson this] is now [https://olympic.org.nz/athletes/greg-henderson here] for Greg Henderson
- Also, [http://www.olympic.org.nz/museum/athlete/profile/979 this] is now [https://olympic.org.nz/athletes/peter-mander here] for Peter Mander.
- [https://www.olympic.org.nz/index.php/athletes/Noel-Mills] can be converted to [https://olympic.org.nz/athletes/noel-mills] (lowercase)
- [http://www.olympic.org.nz/athletes/James-Hill/] can be converted to [https://olympic.org.nz/athletes/james-hill] (lowercase)
News
- Some /news/ are the same URLs. They need the ending slash removed. For example. [http://www.olympic.org.nz/news/tvnz-announces-gold-coast-2018-commonwealth-games-coverage/ this] is now [https://olympic.org.nz/news/tvnz-announces-gold-coast-2018-commonwealth-games-coverage here] at 2018 Commonwealth Games.
- Other /news/ links are broken, such as [http://www.olympic.org.nz/news/olympic-veteran-motivates-swim-team this one] at New Zealand at the 1948 Summer Olympics
- Urls with /nzoc/news/ don't have replacements, like [http://www.olympic.org.nz/nzoc/news/late-addition-cycling-team this one] at 2010 Commonwealth Games.
Any other links not in these 3 categories needs archives as I don't see replaceable links.
[https://en.wikipedia.org/w/index.php?search=insource%3Aolympic.org.nz+insource%3A%2Fwww%5B.%5Dolympic%5B.%5Dorg%5B.%5Dnz%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&ns6=1 ~900] articles. Thank you! MrLinkinPark333 (talk) 00:55, 2 April 2025 (UTC)
:User:MrLinkinPark333, I can make rules for Games#2 and Athletes#2 and Athletes#4 and Athletes#5 and News#1 .. the rest I don't see a good way -- GreenC 03:09, 7 April 2025 (UTC)
:Also note broken links like [https://olympic.org.nz/games/australian-youth-olympic-festival-2009/resulttable/23752 this] return status code "200" it should be 404 ie. soft-404 - need to web scrape for keywords. -- GreenC 15:22, 7 April 2025 (UTC)
::They use JS, making web scraping impossible. Trying w3m/lynx clients which appear to return an empty result when page not found. -- GreenC 15:34, 7 April 2025 (UTC)
Enwiki
- Checked 969 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=www.olympic.org.nz&max=500&server=enwiki&ns=None&enddate=20250406&startdate=20250409&nosect=on edited 843 pages]. Moved 825 links to a new URL: 2 normal redirects, 823 ruled mapped redirects, Removed 3 {{tld|dead link}}. Added 55 {{tld|dead link}}. Switched 49 {{para|url-status|dead}} to live. Switched 6 {{para|url-status|live}} to dead. Added 117 archive URLs (106 Wayback).
::Summary: Converted 825 links. Found 273 links unable to convert. Some already had archives, 117 new archives and 55 {{tld|dead link}}
IABot DB
- Processed about 2,400 unique links updated about 700
{{done}} -- GreenC 21:26, 7 April 2025 (UTC)
:Could you post a list of ones that werent able to convert? If you could sort them into categories (like athletes, news, etc.), I'll see which ones I can convert over manually. Some of them I'll have to keep as archives. Thank you! MrLinkinPark333 (talk) 00:04, 8 April 2025 (UTC)
::Wikipedia:Link_rot/Cases/www.olympic.org.nz - the first 20 or so are sorted out of order because they start with https -- GreenC 01:53, 8 April 2025 (UTC)
equipu.cl
This domain apparently [https://rderecho.equipu.cl/index.php/revista1/article/view/V11n3-a02 expired], I am not sure how many links are to "equipu.cl" and how to replace them. Jo-Jo Eumerus (talk) 10:13, 2 April 2025 (UTC)
:[https://en.wikipedia.org/w/index.php?search=insource%3Aequipu.cl+insource%3A%2Fequipu.cl%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 22 pages]. Will do. -- GreenC 18:39, 5 April 2025 (UTC)
Enwiki
- Checked 22 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=equipu.cl&max=500&server=enwiki&ns=None&enddate=20250406&startdate=20250409&nosect=on edited 20 pages]. Added 3 {{tld|dead link}}. Switched 2 {{para|url-status|live}} to dead. Added 16 archive URLs (16 Wayback).
IABot DB
- Processed about 50 unique links
tfaw.com
TFAW (Things From Another World) [https://www.comicsbeat.com/tfaw-com-closing-down-april-30/ announced] that it is shutting down with its website (
) going dark on April 30. It's a comics e-commerce site with some amount of blog posts; only about [https://en.wikipedia.org/w/index.php?search=insource%3ATFAW.com&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 ~60 articles] although it looks like a lot of the older blogs from the site (like from before 2010) are already dead. Sariel Xilo (talk) 22:12, 2 April 2025 (UTC)
Enwiki
- Checked 61 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=tfaw.com&max=500&server=enwiki&ns=None&enddate=20250406&startdate=20250409&nosect=on edited 25 pages]. Added 2 {{tld|dead link}}. Switched 1 {{para|url-status|live}} to dead. Added 23 archive URLs (14 Wayback).
:::Looks like most were previously archived
IABot DB
- Processed and updated 94 unique links
voanews.com
Voice of America - check site integrity. [https://en.wikipedia.org/w/index.php?search=insource%3Avoanews+insource%3A%2Fvoanews%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 8,800 pages]. -- GreenC 22:00, 6 April 2025 (UTC)
Enwiki
- Checked 8,822 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=voanews.com&max=500&server=enwiki&ns=None&enddate=20250407&startdate=20250410&nosect=on edited 4,527 pages]. Moved 3,989 links to a new URL: 1,363 normal redirects, 2,422 ruled mapped redirects, 204 ghost mapped redirects, Resolved 207 soft-404s. Removed 5 {{tld|dead link}}. Added 203 {{tld|dead link}}. Switched 345 {{para|url-status|dead}} to live. Switched 175 {{para|url-status|live}} to dead. Added 970 archive URLs (828 Wayback). Changed 1,634 citation metadata.
IABot DB
- Checked about 25,000 links and updated about 8,000
defense.gov
[https://en.wikipedia.org/w/index.php?search=insource%3Adefense.gov+insource%3A%2Fdefense%5B.%5Dgov%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 6,400 pages]. -- GreenC 22:50, 6 April 2025 (UTC)
Enwiki
- Checked 6,424 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=defense.gov&max=500&server=enwiki&ns=None&enddate=20250407&startdate=20250410&nosect=on edited 4,766 pages]. Moved 3,379 links to a new URL: 1,261 normal redirects, 1,929 ruled mapped redirects, 189 ghost mapped redirects, Resolved 556 soft-404s. Added 243 {{tld|dead link}}. Switched 283 {{para|url-status|dead}} to live. Switched 184 {{para|url-status|live}} to dead. Added 4,660 archive URLs (3,772 Wayback). Changed 6 citation metadata.
IABot DB
- Checked about 6,000 links and updated about 2,700
prola.aps.org
prola.aps.org was formerly used for serving articles from journals of the American Physical Society. It's been redirecting since about 2015 but is currently giving a 502 error.
Most links are of the form
The page may be non-numeric, e.g. "A171" or "R17".
Some articles have an "e" number instead of a page number, sometimes with a trailing slash, this is treated as a page number in the new system, without the "e" prefix or slash.
Some old URLs have a query string, which can be discarded.
Some editors linked to the PDF instead of the abstract page. For paywalled articles, the PDF link is an unhelpful error message and should be avoided. For open access articles, it does work, with /pdf/ in the URL instead of /abstract/ like in the old system.
We have three links to /pagegif/, a feature which apparently no longer exists. These should go to the abstract or PDF.
class="wikitable"
!PROLA code !! New short code !! Long code | ||
PR | pr | PhysRev |
PRA | pra | PhysRevA |
PRB | prb | PhysRevB |
PRC | prc | PhysRevC |
PRD | prd | PhysRevD |
PRE | pre | PhysRevE |
PRI | pri | PhysRevSeriesI |
PRL | prl | PhysRevLett |
RMP | rmp | RevModPhys |
Examples:
- http://prola.aps.org/abstract/PR/v10/i5/p586_1 → https://journals.aps.org/pr/abstract/10.1103/PhysRev.10.586
- http://prola.aps.org/abstract/PR/v133/i1A/pA171_1 → https://journals.aps.org/pr/abstract/10.1103/PhysRev.133.A171
- http://prola.aps.org/abstract/PRA/v65/i1/e010501 → https://journals.aps.org/pra/abstract/10.1103/PhysRevA.65.010501
- http://prola.aps.org/abstract/PRA/v67/i6/e062105/ → https://journals.aps.org/pra/abstract/10.1103/PhysRevA.67.062105
- http://prola.aps.org/abstract/PRB/v26/i12/p6502_1?qid=df0eb646d5d96a83&qseq=165&show=10 → https://journals.aps.org/prb/abstract/10.1103/PhysRevB.26.6502
- http://prola.aps.org/pdf/PR/v83/i2/p471_2 → https://journals.aps.org/pr/abstract/10.1103/PhysRev.83.471.2
Tim Starling (talk) 00:42, 7 April 2025 (UTC)
:Redirecting since 2015. There is high probability the redirect mappings are stored in the Wayback Machine:
:* http://prola.aps.org/abstract/PR/v10/i5/p586_1 -> https://web.archive.org/web/20180322015922/http://prola.aps.org/abstract/PR/v10/i5/p586_1 -> https://journals.aps.org/pr/abstract/10.1103/PhysRev.10.586
:It's only [https://en.wikipedia.org/w/index.php?search=insource%3Aprola.aps.org+insource%3A%2Fprola%5B.%5Daps%5B.%5Dorg%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 96 pages]. I'll retrieve these, and post the map before uploading for your review. Given the small scale and complexity I don't want to program too much. -- GreenC 03:28, 7 April 2025 (UTC)
Tim Starling: Can you take a look? Wikipedia:Link rot/Cases/prola.aps.org - I'd rather fix semi-manually than coding complex rules. I can just make these changes, or if you want make adjustments on that page and I'll incorporate before uploading. -- GreenC 18:00, 9 April 2025 (UTC)
:Give me a day or so, I can fix your set B and C URLs. You want them in the same format as set A? -- Tim Starling (talk) 02:48, 11 April 2025 (UTC)
::Sure, however you like. I guess ideally 2 columns, the original on left and new on right. Don't need page names. -- GreenC 06:35, 11 April 2025 (UTC)
:::GreenC: Done. I used vim regexes. I confirmed that all the destination links work. -- Tim Starling (talk) 02:03, 12 April 2025 (UTC)
::::User:Tim Starling, most are fixed I found another 8 that were missed the first run: Wikipedia:Link_rot/Cases/prola.aps.org#Set_D - could you regex these? -- GreenC 00:03, 13 April 2025 (UTC)
:::::Done Tim Starling (talk) 11:03, 13 April 2025 (UTC)
::::::Alright, it appears to be complete. Thanks for the conversion help. -- GreenC 16:20, 13 April 2025 (UTC)
avclub.com/tvclub
These links with /tvclub/ do not redirect to their new locations, such as [http://www.avclub.com/tvclub/bobs-burgers-gayle-tales-215935 this] going [https://www.avclub.com/bob-s-burgers-the-gayle-tales-1798182899 here] for The Gayle Tales. They cant be converted due to the new page ids. [https://en.wikipedia.org/w/index.php?fulltext=1&profile=default&search=insource%3A%22www.avclub.com%2Ftvclub%2F%22&title=Special%3ASearch&ns0=1 ~1500 pages] with some already archived. Thank you. MrLinkinPark333 (talk) 00:43, 8 April 2025 (UTC)
:Can get there in 4 steps: [https://web.archive.org/cdx/search/cdx?url=http://www.avclub.com/tvclub/bobs-burgers-gayle-tales-215935&MatchType=prefix step 1] -> [https://web.archive.org/web/20180123143915/http://www.avclub.com/tvclub/bobs-burgers-gayle-tales-215935 step 2] -> [http://tv.avclub.com/bob-s-burgers-the-gayle-tales-1798182899 step 3] -> [https://www.avclub.com/bob-s-burgers-the-gayle-tales-1798182899 step 4] ie. a ghost redirect with a redirect mapping in the last step. Will see how many it works. -- GreenC 01:34, 10 April 2025 (UTC)
::It required running slowly to avoid detection and a block by CloudFlare. I also developed a new technique for ghostarchive detection that makes it easier to setup and better results. -- GreenC 15:30, 10 April 2025 (UTC)
Enwiki
- Checked 1,517 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=avclub.com&max=500&server=enwiki&ns=None&enddate=20250409&startdate=20250412&nosect=on edited 1,472 pages]. Moved 1,474 links to a new URL: 2 normal redirects, 3 ruled mapped redirects, 1,469 ghost mapped redirects, Resolved 1,511 soft-404s. Added 9 {{tld|dead link}}. Switched 64 {{para|url-status|dead}} to live. Switched 19 {{para|url-status|live}} to dead. Added 194 archive URLs (171 Wayback).
IABot DB
- Checked and updated about 2,500 unique links
abc.net.au/misc
abc.net.au/*/txt/ and abc.net.au/*/transcripts/
Inspired by the above abc.net.au item, I've just recalled once again that all these old links are dead and need to be archived. They either come up with "Page not found" errors (like [http://www.abc.net.au/newinventors/txt/s1373508.htm this]) or "Page archived" errors like [https://www.abc.net.au/dynasties/txt/s1488955.htm this one] or [https://www.abc.net.au/tv/enoughrope/transcripts/s1110359.htm that one], and InternetArchiveBot doesn't seem to realise they're dead lihnks. I was going to provide a list of linksearch texts to use, but it'd get very long very quickly and I'm not sure how much help it would be. Any assistance archiving these links would be appreciated. Graham87 (talk) 02:56, 8 April 2025 (UTC)
[https://en.wikipedia.org/w/index.php?search=insource%3Aabc.net.au+insource%3A%2Fabc%5B.%5Dnet%5B.%5Dau%5C%2F%5B%5E%5C%2F%5D*%5C%2F%28transcripts%7Ctxt%29%5C%2F%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 541 pages] -- GreenC 03:54, 8 April 2025 (UTC)
Enwiki
- Checked 541 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=abc.net.au&max=500&server=enwiki&ns=None&enddate=20250409&startdate=20250412&nosect=on edited 370 pages]. Added 5 {{tld|dead link}}. Switched 47 {{para|url-status|live}} to dead. Added 357 archive URLs (327 Wayback). Changed 12 citation metadata.
IABot DB
- Checked and updated 625 unique links that propagate through 300+ wikis
tcm.com
Please check the site integrity for https://www.tcm.com/, since some of the database pages like [https://www.tcm.com/tcmdb/title/2161086/mad-max-fury-road-theatrical#overview the one] on Mad Max: Fury Road can redirect to a 404 error. [https://en.wikipedia.org/w/index.php?search=insource%3Atcm.com+insource%3A%2F%28%5B.%5D%7C%5C%2F%29tcm%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 12,928 pages]. Lord Sjones23 (talk - contributions) 03:07, 9 April 2025 (UTC)
Enwiki
:{{underline|Pass 1}} (00001-05000): Checked 5,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=tcm.com&max=500&server=enwiki&ns=None&enddate=20250410&startdate=20250413&nosect=on edited 3,938 pages]. Moved 4,928 links to a new URL: 54 normal redirects, 4,868 ruled mapped redirects, 6 ghost mapped redirects, Resolved 578 soft-404s. Removed 9 {{tld|dead link}}. Added 71 {{tld|dead link}}. Switched 158 {{para|url-status|dead}} to live. Switched 20 {{para|url-status|live}} to dead. Added 93 archive URLs (81 Wayback). Changed 1,636 citation metadata.
:{{underline|Pass 2}} (05001-10000): Checked 5,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=tcm.com&max=500&server=enwiki&ns=None&enddate=20250410&startdate=20250413&nosect=on edited 4,009 pages]. Moved 5,009 links to a new URL: 45 normal redirects, 4,946 ruled mapped redirects, 18 ghost mapped redirects, Resolved 499 soft-404s. Removed 15 {{tld|dead link}}. Added 83 {{tld|dead link}}. Switched 188 {{para|url-status|dead}} to live. Switched 15 {{para|url-status|live}} to dead. Added 90 archive URLs (89 Wayback). Changed 1,535 citation metadata.
:{{underline|Pass 3}} (10001-13919): Checked 3,919 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=tcm.com&max=500&server=enwiki&ns=None&enddate=20250410&startdate=20250413&nosect=on edited 3,129 pages]. Moved 3,851 links to a new URL: 58 normal redirects, 3,786 ruled mapped redirects, 7 ghost mapped redirects, Resolved 396 soft-404s. Removed 3 {{tld|dead link}}. Added 62 {{tld|dead link}}. Switched 115 {{para|url-status|dead}} to live. Switched 12 {{para|url-status|live}} to dead. Added 86 archive URLs (79 Wayback). Changed 1,201 citation metadata.
IABot DB
- Checked 25,400 unique links and updated 10,100 which propagate through 300+ wikis
army.mil
[https://en.wikipedia.org/w/index.php?search=insource%3Aarmy.mil+insource%3A%2Farmy%5B.%5Dmil%5C%2F%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 17,677 pages] -- GreenC 01:23, 10 April 2025 (UTC)
Enwiki
:{{underline|Pass 1}} (00001-03000): Checked 3,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=army.mil&max=500&server=enwiki&ns=None&enddate=20250412&startdate=20250415&nosect=on edited 1,469 pages]. Moved 594 links to a new URL: 81 normal redirects, 513 ruled mapped redirects, Resolved 141 soft-404s. Removed 2 {{tld|dead link}}. Added 80 {{tld|dead link}}. Switched 69 {{para|url-status|dead}} to live. Switched 141 {{para|url-status|live}} to dead. Added 1,697 archive URLs (1,566 Wayback). Changed 115 citation metadata.
:{{underline|Pass 2}} (03001-10000): Checked 7,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=army.mil&max=500&server=enwiki&ns=None&enddate=20250413&startdate=20250416&nosect=on edited 3,392 pages]. Moved 1,410 links to a new URL: 140 normal redirects, 1,270 ruled mapped redirects, Resolved 287 soft-404s. Removed 3 {{tld|dead link}}. Added 240 {{tld|dead link}}. Switched 216 {{para|url-status|dead}} to live. Switched 403 {{para|url-status|live}} to dead. Added 4,175 archive URLs (3,786 Wayback). Changed 245 citation metadata.
:{{underline|Pass 3}} (10001-17084): Checked 7,085 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=army.mil&max=500&server=enwiki&ns=None&enddate=20250413&startdate=20250416&nosect=on edited 3,385 pages]. Moved 1,421 links to a new URL: 212 normal redirects, 1,209 ruled mapped redirects, Resolved 361 soft-404s. Removed 3 {{tld|dead link}}. Added 232 {{tld|dead link}}. Switched 171 {{para|url-status|dead}} to live. Switched 406 {{para|url-status|live}} to dead. Added 3,770 archive URLs (3,091 Wayback). Changed 269 citation metadata.
IABot DB
:Checked 27,500 unique links and updated 19,000 which propagate through 300+ wikis
enciklopedija.hr
Recently, Miroslav Krleža Institute of Lexicography shut down their ASP engine for https://enciklopedija.hr, so we need the following changed
;Natuknica.aspx?ID=
→ clanak/
;natuknica.aspx?ID=
→ clanak/
;Natuknica.aspx?id=
→ clanak/
;natuknica.aspx?id=
→ clanak/
So
https://www.enciklopedija.hr/Natuknica.aspx?ID=37178
https://www.enciklopedija.hr/natuknica.aspx?ID=37178
https://www.enciklopedija.hr/Natuknica.aspx?id=37178
https://www.enciklopedija.hr/natuknica.aspx?id=37178
become
https://www.enciklopedija.hr/clanak/37178
While at it, please also change http://
to https://
http://www.enciklopedija.hr/Natuknica.aspx?ID=37178
https://www.enciklopedija.hr/clanak/37178
That should be [https://en.wikipedia.org/w/index.php?search=insource%3A%2F%5B%5E%5C%2F%5Dhttps%3F%3A%5C%2F%5C%2F%28www.%29%3Fenciklopedija%5C.hr%5C%2Fnatuknica%5C.aspx%5C%3Fid%3D%2Fi&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 814] articles.
Make sure the archived URLs are not changed, i. e. there's no slash before http
, as in the search regex above. Ponor (talk) 11:38, 10 April 2025 (UTC)
:Will do. -- GreenC 19:46, 10 April 2025 (UTC)
Enwiki
- Checked 815 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=enciklopedija.hr&max=500&server=enwiki&ns=None&enddate=20250418&startdate=20250421&nosect=on edited 815 pages]. Moved 1,013 links to a new URL: 1,013 ruled mapped redirects, Removed 1 {{tld|dead link}}. Switched 29 {{para|url-status|dead}} to live.
IABot DB
- Checked 10,700 unique links and updated 10,642 which propagate to 300+ wikis
bbfc.co.uk
Since this site has some dead links and redirects in citations, a link check would be necessary. [https://en.wikipedia.org/w/index.php?search=insource%3Abbfc.co.uk+insource%3A%2Fbbfc%5B.%5Dco%5B.%5Duk%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 6,841] articles. Lord Sjones23 (talk - contributions) 23:07, 11 April 2025 (UTC)
:Lord Sjones23: There are tons of unmapped redirects. Like [https://www.bbfc.co.uk/releases/knocked-2007-0 this] (dead) is now [https://www.bbfc.co.uk/release/knocked-up-q29sbgvjdglvbjpwwc0znja5nda this] (I think). The site admin/contractors never did the critical redirect mapping step after changing the content management system. The result is going to be lots of dead links with archive URLs. No traffic going to their site globally across 300 wikis that receive billions of clicks yearly. Want to drop them an email? Natasha Kaplinsky is the current President. -- GreenC 06:48, 24 April 2025 (UTC)
Enwiki
- Checked 6,885 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=bbfc.co.uk&max=500&server=enwiki&ns=None&enddate=20250423&startdate=20250426&nosect=on edited 3,874 pages]. Moved 43 links to a new URL: 43 normal redirects, Added 462 {{tld|dead link}}. Switched 1 {{para|url-status|dead}} to live. Switched 1,308 {{para|url-status|live}} to dead. Added 3,029 archive URLs (2,428 Wayback). Changed 333 citation metadata.
IABot DB
- Checked about 15,000 URLs and updated 11,500 which propagate to 300+ wikis
dieselpunks.org
The domain dieselpunks.org has been usurped and all links ([https://w.wiki/DnWr 33 articles]) now redirect to some shady Indonesian online slot game website rivetnetworks.com. 85.76.128.167 (talk) 11:19, 12 April 2025 (UTC)
:{{done}} in WP:JUDI batch #27 -- GreenC 15:23, 5 May 2025 (UTC)
eca.state.gov
The Bureau of Educational and Cultural Affairs is curtailed. The website is dead as of Friday April 11 2025. United States cultural exchange programs are dead? The Fulbright Scholarship is dead. -- GreenC 02:31, 15 April 2025 (UTC)
:Site back online, under new management. Will check it out for removed pages. Only [https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Aeca.state.gov+insource%3A%2Feca%5B.%5Dstate%5B.%5Dgov%2F&title=Special%3ASearch&ns0=1 112 pages] -- GreenC 19:47, 25 April 2025 (UTC)
Enwiki
- Checked 112 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=eca.state.gov&max=500&server=enwiki&ns=None&enddate=20250424&startdate=20250427&nosect=on edited 62 pages]. Moved 19 links to a new URL: 19 normal redirects, Resolved 12 soft-404s. Added 3 {{tld|dead link}}. Switched 2 {{para|url-status|dead}} to live. Switched 4 {{para|url-status|live}} to dead. Added 24 archive URLs (19 Wayback). Changed 53 citation metadata.
IABot DB
- Checked about 200 URLS and updated 96
ngdc.noaa.gov
[https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Angdc.noaa.gov+insource%3A%2Fngdc%5B.%5Dnoaa%5B.%5Dgov%2F&title=Special%3ASearch&ns0=1 1,269 pages]
Also here to notify about the impending death of https://www.ngdc.noaa.gov/hazard/eq-intensity.shtml WFUM🔥🌪️ (talk) 05:18, 15 April 2025 (UTC)
:I believe all ngdc.noaa.gov links are now at ncei.noaa.gov, although not necessarily with the same URL afterwards. WFUM🔥🌪️ (talk) 16:24, 15 April 2025 (UTC)
::WFUM, if you come across any patterns that could be used; often websites have information in the old URL that can be used to create a new URL. -- GreenC 03:21, 16 April 2025 (UTC)
:::There's a very weird situation going on, as the main NGDC site redirect to NCEI but some things (the one I can find is the tsunami runup database) are still located on NGDC (although they can still be found on NCEI just at diff urls) (https://www.ngdc.noaa.gov/hazel/view/hazards/tsunami/runup-more-info/6847 has one but https://www.ncei.noaa.gov/hazel/view/hazards/tsunami/runup-more-info/6847 has nothing). Thus, I believe the 1,269 pages have many false positives, and the request potentially should be limited to the original link I proposed yesterday for now (with maybe even adding /struts at the end; this request was aimed to primarily cull the earthquake/tsunami database issue.) However, most cases of the tsunami database cites being used are search results from that database, which will be very difficult to map ([https://www.ngdc.noaa.gov/nndc/struts/results?bt_0=1964&st_0=1964&type_7=Like&query_7=prince&d=7&t=101650&s=7] goes to [https://www.ngdc.noaa.gov/hazel/view/hazards/tsunami/event-data?maxYear=1964&minYear=1964&locInclude=prince]). All the variable names are different, because I believe they changed from a "nndc/struts" search system to the Hazel searching system. Same case for earthquakes (old: [https://www.ngdc.noaa.gov/nndc/struts/results?bt_0=&st_0=&type_17=EXACT&query_17=None+Selected&op_12=eq&v_12=&type_12=Or&query_14=None+Selected&type_3=Like&query_3=&st_1=&bt_2=&st_2=&bt_1=&bt_4=&st_4=&bt_5=12&st_5=12&bt_6=&st_6=&bt_7=&st_7=&bt_8=&st_8=&bt_9=&st_9=&bt_10=&st_10=&type_11=Exact&query_11=&type_16=Exact&query_16=&bt_18=&st_18=&ge_19=&le_19=&type_20=Like&query_20=&display_look=1&t=101650&s=1&submit_all=Search+Database] (wow this is daunting) to new: [https://www.ngdc.noaa.gov/hazel/view/hazards/earthquake/event-data?minIntensity=12&maxIntensity=12]). I believe this will be a "if you see an old link fix it" type scenario unless you can somehow figure out a systematic approach. WFUM🔥🌪️ (talk) 04:27, 16 April 2025 (UTC)
::::I did a standard check and replaced dead links with archive URLs, resolved redirects, and checked for soft404s. -- GreenC 03:22, 26 April 2025 (UTC)
Enwiki
- Checked 1,271 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=ngdc.noaa.gov&max=500&server=enwiki&ns=None&enddate=20250424&startdate=20250427&nosect=on edited 570 pages]. Moved 119 links to a new URL: 117 normal redirects, 2 ghost mapped redirects, Resolved 106 soft-404s. Added 107 {{tld|dead link}}. Switched 2 {{para|url-status|dead}} to live. Switched 53 {{para|url-status|live}} to dead. Added 1,074 archive URLs (748 Wayback). Changed 4 citation metadata.
IABot DB
- Checked about 5,000 URLs and fixed 3,700 which propagate to 300+ wikis
Four RCCs
All cites of [https://www.srcc.tamu.edu/], [https://sercc.com/], [https://hprcc.unl.edu/], and [https://mrcc.purdue.edu/] need to be marked as dead. I believe there is no alternate location for this data. WFUM🔥🌪️ (talk) 20:10, 17 April 2025 (UTC)
::Because there are so few pages and links I went ahead and set the domains to {{underline|permadead}} in the IABot database which should take care of most of it. -- GreenC 15:06, 26 April 2025 (UTC)
=srcc.tamu.edu=
[https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Asrcc.tamu.edu+insource%3A%2Fsrcc.tamu.edu%2F&title=Special%3ASearch&ns0=1 0 pages]
=sercc.com=
[https://en.wikipedia.org/w/index.php?search=insource%3Asercc.com+insource%3A%2Fsercc.com%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 30 pages]
=hprcc.unl.edu=
[https://en.wikipedia.org/w/index.php?search=insource%3Ahprcc.unl.edu+insource%3A%2Fhprcc.unl.edu%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 31 pages]
=mrcc.purdue.edu=
covid.gov
Site has been hijacked(?). Appears to be a propaganda outlet for covid conspiracy theories - probably should be usurped. -- GreenC 19:21, 18 April 2025 (UTC)
:[https://en.wikipedia.org/w/index.php?search=insource%3Acovid.gov+insource%3A%2Fcovid%5B.%5Dgov%5C%2F%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 2 pages] on Enwiki. Need to scrub IABot. -- GreenC 19:22, 18 April 2025 (UTC)
::yea Gladcape2013 (talk) 00:10, 19 April 2025 (UTC)
:It is official the federal government has acquired a lab week as the issue on coronavirus Gladcape2013 (talk) 04:03, 19 April 2025 (UTC)
::leak* Gladcape2013 (talk) 04:04, 19 April 2025 (UTC)
Enwiki
- No pages
IABot DB
- 24 links TBD
Similar to above changed to domain to {{underline|permadead}} in the IABot database. Also added to the usurp list at WP:JUDI so the url-status is updated on enwiki. -- GreenC 15:09, 26 April 2025 (UTC)
{{done}}
archive.fiba.com
This website now redirects to fiba.basketball but does not reirect to the new links.They can't be converted either due to the new strings (such as [https://archive.fiba.com/pages/eng/fa/event/p/sid/6996/_/2009_FIBA_Africa_Championship_for_Women/index.html here] to [https://www.fiba.basketball/en/history/302-fiba-womens-afrobasket/3369 there] for Madagascar). Some of these links have already been archived. [https://en.wikipedia.org/w/index.php?search=insource%3Aarchive.fiba.com+insource%3A%2Farchive.fiba.com%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 ~3900 pages]. Thanks! MrLinkinPark333 (talk) 00:35, 19 April 2025 (UTC)
:Did you request FIBA before? I recall generating a lot of archive.fiba.com links in 2022 and 2023, the last time they moved URLs (Special:Diff/1088207822/1088292106 and Special:Diff/1168750626/1173686109). Looks like they moved again, and again neglected to create redirect mappings, this time unrecoverable due to different domain, path and IDs. SEO is a thing. -- GreenC 15:44, 26 April 2025 (UTC)
::Another user made that request. I requested archives for archive.usab.com around the same time back in 2023. MrLinkinPark333 (talk) 18:32, 26 April 2025 (UTC)
:::OK. Wish websites could redirect map. [https://archive.org/details/78_the-old-refrain-vecchio-ritornello_beniamino-gigli-kerisler_gbia0076340a "The old refrain"]. -- GreenC 20:25, 26 April 2025 (UTC)
:::This will take a while. The link density per page is high, Wayback API is running slow, some other issues specific to the domain. -- GreenC 04:07, 27 April 2025 (UTC)
::::The bot has a decision tree which URLs move through as it tests for status and archive availability. The number of possible paths through the tree are countless. Ideally it's a short path but some domains don't behave well and it can take a long time to wind through and come to a final decision. I was able to tweak certain areas to speed it up where it kept going down a path that hit some lengthy sleep-retry cycles. -- GreenC 16:20, 28 April 2025 (UTC)
Enwiki
- Checked 3,907 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=archive.fiba.com&max=500&server=enwiki&ns=None&enddate=20250427&startdate=20250430&nosect=on edited 3,764 pages]. Added 2,303 {{tld|dead link}}. Switched 108 {{para|url-status|live}} to dead. Added 14,565 archive URLs (13,536 Wayback). Changed 138 citation metadata.
IABot DB
- Checked 23,600 unique URLs and update the same which propagate through 300+ wikis
moviegalleri.net
[https://moviegalleri.net/ The site] has lost all its pre-2025 archives. Once mainly a poster hosting website, [https://moviegalleri.net/category/photos/ it no longer does]. The files using this as source need to be updated. Kailash29792 (talk) 16:16, 21 April 2025 (UTC)
[https://en.wikipedia.org/w/index.php?search=insource%3Amoviegalleri.net+insource%3A%2Fmoviegalleri.net%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&ns6=1 243 pages]
Enwiki
- Checked 242 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=moviegalleri.net&max=500&server=enwiki&ns=None&enddate=20250425&startdate=20250428&nosect=on edited 229 pages]. Resolved 5 soft-404s. Added 29 {{tld|dead link}}. Switched 49 {{para|url-status|live}} to dead. Added 155 archive URLs (152 Wayback). Changed 21 citation metadata.
IABot DB
- Checked about 150 unique URLs and updated propagate through 300+ wikis
gamasutra.com
This website is now located at gamedeveloper.com. Unfortunately, old links dont redirect like [http://www.gamasutra.com/view/news/206537/Independent_Games_Festivals_1999_finalists_then_and_now.php here] to [https://www.gamedeveloper.com/business/independent-games-festival-s-1999-finalists-then-and-now there] for Fire and Darkness. If ghost redirects can be found, that would be great. [https://en.wikipedia.org/w/index.php?search=insource%3Agamasutra+insource%3A%2F%28%5B.%5D%7C%5C%2F%29gamasutra%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&ns6=1 ~4700 articles].Thanks! MrLinkinPark333 (talk) 19:15, 23 April 2025 (UTC)
I made a redirect map:
mapredirectTable[&"https://www.gamedeveloper.com/business/{x}"] = 1
mapredirectTable[&"https://www.gamedeveloper.com/game-platforms/{x}"] = 1
mapredirectTable[&"https://www.gamedeveloper.com/design/{x}"] = 1
mapredirectTable[&"https://www.gamedeveloper.com/audio/{x}"] = 1
mapredirectTable[&"https://www.gamedeveloper.com/press-release/{x}"] = 1
mapredirectTable[&"https://www.gamedeveloper.com/programming/{x}"] = 1
mapredirectTable[&"https://www.gamedeveloper.com/production/{x}"] = 1
mapredirectTable[&"https://www.gamedeveloper.com/marketing/{x}"] = 1
mapredirectTable[&"https://www.gamedeveloper.com/console/{x}"] = 1
Each URL is tested against these possibilities. It works about half the time. Sometimes there are small changes for example
becomes
(change "Assassins" to "assasin-s"). Or
becomes
. The extra "-i-" is because the [https://www.gamedeveloper.com/business/epic-is-suing-a-i-fortnite-i-user-experience-tester-over-chapter-2-leaks title of the page] contains italics (!) like you might do in HTML
but pseudo-encoded directly into the URL. The s rule can can be checked for by converting any word that ends in "s_" to "s-s-" (Noahs_Ark becomes noah-s-ark), but sometimes only some of the words have this. Similarly there are abbreviations like "_QR_" becomes "-q-r-", and rule is applied inconsistently, one URL had 3 abbreviations but only one of them had this rule (2^3 = 8 possibilities). Another problem is many of the old URLs have a truncated version of the article title, the new URLs have the full title. I know how to solve for this ([https://en.wikibooks.org/wiki/A_Link_Rot_Bestiary/Chapter_4_:_Redirects#Inferred_mapped_redirect inferred mapped redirects]) and can give it a try. -- GreenC 16:04, 29 April 2025 (UTC)
:I found solutions for these problems. It's deploying every feature the bot has ([https://en.wikibooks.org/wiki/A_Link_Rot_Bestiary/Chapter_4_:_Redirects all of this] and [https://en.wikibooks.org/wiki/A_Link_Rot_Bestiary/Chapter_5_:_Archived_Redirect this]), and then some. Runs very slow but good results. -- GreenC 18:26, 30 April 2025 (UTC)
::Redid core functions for 'inferred mapped redirects' and 'ruled inferred mapped redirects', which will be applicable for future domains. This was a great test case domain. -- GreenC 23:04, 1 May 2025 (UTC)
Enwiki
- Pass 1 (0001-4675): Checked 4,675 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=gamasutra.com&max=500&server=enwiki&ns=None&enddate=20250430&startdate=20250503&nosect=on edited 3,970 pages]. Moved 2,494 links to a new URL: 14 normal redirects, 1,320 ruled mapped redirects, 510 inferred mapped redirects, 274 ghost mapped redirects, 376 ruled inferred mapped redirects. Added 45 {{tld|dead link}}. Switched 165 {{para|url-status|dead}} to live. Switched 2,780 {{para|url-status|live}} to dead. Added 1,720 archive URLs (1,536 Wayback).
- Pass 2 (0001-4070): Checked 4,070 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=gamasutra.com&max=500&server=enwiki&ns=None&enddate=20250502&startdate=20250505&nosect=on edited 568 pages]. Moved 861 links to a new URL: 4 normal redirects, 25 ruled mapped redirects, 502 inferred mapped redirects, 43 ghost mapped redirects, 287 ruled inferred mapped redirects. Switched 691 {{para|url-status|dead}} to live. Switched 1 {{para|url-status|live}} to dead. Added 2 archive URLs (2 Wayback).
IABot DB
- Checked and updated about 10,000 unique links which propagate through 300+ wikis
desimartini.com
Desimartini. [https://www.desimartini.com/about-us/ No longer active]. Kailash29792 (talk) 02:28, 24 April 2025 (UTC)
[https://en.wikipedia.org/w/index.php?search=insource%3Adesimartini.com+insource%3A%2Fdesimartini%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 249 pages]
Enwiki
- Checked 260 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=desimartini.com&max=500&server=enwiki&ns=None&enddate=20250425&startdate=20250428&nosect=on edited 251 pages]. Added 20 {{tld|dead link}}. Switched 99 {{para|url-status|live}} to dead. Added 145 archive URLs (139 Wayback). Changed 49 citation metadata.
IABot DB
- Checked 359 unique URLs and updated about the same.
tv-ark.org.uk
Usurped, main page redirects to advertisement for payday loans. [https://en.wikipedia.org/w/index.php?title=Special:LinkSearch&limit=500&target=tv-ark.org.uk 428 links according to Special:LinkSearch]. 2A00:807:D3:B2CD:1D64:EE5:ECA4:CA80 (talk) 09:00, 25 April 2025 (UTC)
:{{done}} in WP:JUDI batch #27 -- GreenC 15:24, 5 May 2025 (UTC)
edition.cnn.com
CNN seems to have quietly deleted (?) the edition.cnn.com domain, as attempting to go to any source that links it gives my browser an error saying it redirected too many times. One example: https://edition.cnn.com/TRANSCRIPTS/0707/13/ng.01.html is still listed as live at Chris Benoit double-murder and suicide, while other CNN links such as https://www.cnn.com/2007/US/07/17/wrestler.murder/index.html work fine. There may be a way to replace these links, but I haven't found it in my brief research. wizzito | say hello! 09:10, 27 April 2025 (UTC)
:I did some more searching and found that actually, the more recent edition.cnn.com links are live, such as https://edition.cnn.com/2016/12/29/news/donald-trump-golf-courses/index.html at Donald Trump. It seems to be the older links, from the 90s and 2000s, that are having this problem, but not all of them as well?
- http://edition.cnn.com/2006/POLITICS/09/24/clinton.binladen/index.html at Bill Clinton is live.
- http://edition.cnn.com/TRANSCRIPTS/1006/01/lkl.01.html is listed as live at Lady Gaga but is actually dead.
- At Lyle and Erik Menendez, https://transcripts.cnn.com/show/lkl/date/2005-12-20/segment/01 is live, while https://edition.cnn.com/US/9906/16/menendez/ is dead.
:Seems to affect both transcripts and non-transcripts alike. What a clusterfuck. wizzito | say hello! 09:23, 27 April 2025 (UTC)
::It might be only edition.cnn.com/TRANSCRIPTS/*
and edition.cnn.com/US/*
links that are repeatedly redirecting, based on these few links. wizzito | say hello! 09:27, 27 April 2025 (UTC)
:::It looks like some of the pages were removed ([https://web.archive.org/web/20250117085141/https://edition.cnn.com/TRANSCRIPTS/0707/13/ng.01.html example]) and now those pages have a configuration error (redirect loop). It's [https://en.wikipedia.org/w/index.php?search=insource%3Aedition.cnn.com+insource%3A%2Fedition%5B.%5Dcnn%5B.%5Dcom%5C%2F%28US%7CTRANSCRIPTS%29%5C%2F%2Fi&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 600 pages] limiting to TRANSCRIPTS and US at edition.cnn.com. But there are others like TECH ([http://edition.cnn.com/TECH/computing/9910/14/t_t/pokemon.tt/ example]). More likely [https://en.wikipedia.org/w/index.php?search=insource%3Aedition.cnn.com+insource%3A%2Fedition%5B.%5Dcnn%5B.%5Dcom%5C%2F%5BA-Z-%5D%7B1%2C20%7D%5C%2F%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 1661 pages]. This assumes uppercase and doesn't include the possibility of /YYYY/KEYTERM like Bill Clinton URL. Maybe the best approach is check all edition.cnn.com to be safe ([https://en.wikipedia.org/w/index.php?search=insource%3Aedition.cnn.com+insource%3A%2Fedition%5B.%5Dcnn%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 22,400 pages]) and add some special code looking for the redirect loop, otherwise a normal check for dead links. -- GreenC 16:40, 27 April 2025 (UTC)
::::Good idea, but that's a lot of pages to check. If your bot can do it, though, that would be fine. wizzito | say hello! 18:24, 28 April 2025 (UTC)
:::::Discovery: I managed to find the missing Lady Gaga transcript at https://transcripts.cnn.com/show/lkl/date/2010-06-01/segment/01. Looks like the most important parts of the link are the date (1006/01 = 2010-06-01), show (lkl = Larry King Live) and segment (01). We could remap the transcripts at least using this. wizzito | say hello! 18:39, 28 April 2025 (UTC)
::::::That date pattern was being used in the 1990s, ancient. I [https://en.wikipedia.org/w/index.php?title=Special:LinkSearch&limit=500&offset=7500&target=edition.cnn.com sifted through them] a bit but couldn't find anything additional. -- GreenC 01:11, 29 April 2025 (UTC)
::::::A simple rule for edition.cnn.com/TRANSCRIPTS: Convert [https://edition.cnn.com/TRANSCRIPTS/0707/13/ng.01.html this] to [https://transcripts.cnn.com/TRANSCRIPTS/0707/13/ng.01.html that] then redirects [https://transcripts.cnn.com/show/ng/date/2007-07-13/segment/01 here]. -- GreenC 22:03, 3 May 2025 (UTC)
Enwiki
:{{underline|Pass 1}} (00001-03000): Checked 3,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=edition.cnn.com&max=500&server=enwiki&ns=None&enddate=20250504&startdate=20250507&nosect=on edited 1,408 pages]. Moved 1,200 links to a new URL: 921 normal redirects, 168 ruled mapped redirects, 111 ghost mapped redirects, Resolved 26 soft-404s. Removed 1 {{tld|dead link}}. Added 35 {{tld|dead link}}. Switched 24 {{para|url-status|dead}} to live. Switched 122 {{para|url-status|live}} to dead. Added 342 archive URLs (292 Wayback). Changed 213 citation metadata.
:{{underline|Pass 2}} (03001-12000): Checked 9,001 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=edition.cnn.com&max=500&server=enwiki&ns=None&enddate=20250505&startdate=20250508&nosect=on edited 3,613 pages]. Moved 3,214 links to a new URL: 2,642 normal redirects, 509 ruled mapped redirects, 63 ghost mapped redirects, Resolved 72 soft-404s. Added 84 {{tld|dead link}}. Switched 53 {{para|url-status|dead}} to live. Switched 219 {{para|url-status|live}} to dead. Added 679 archive URLs (601 Wayback). Changed 670 citation metadata.
:{{underline|Pass 3}} (12001-22166): Checked 10,167 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=edition.cnn.com&max=500&server=enwiki&ns=None&enddate=20250505&startdate=20250508&nosect=on edited 3,988 pages]. Moved 3,484 links to a new URL: 2,870 normal redirects, 495 ruled mapped redirects, 119 ghost mapped redirects, Resolved 104 soft-404s. Added 109 {{tld|dead link}}. Switched 60 {{para|url-status|dead}} to live. Switched 258 {{para|url-status|live}} to dead. Added 746 archive URLs (673 Wayback). Changed 735 citation metadata.
IABot DB
- Checked over 52,000 unique links and updated 7,354
digital.olivesoftware.com
Nothing special here sadly, just need them all rescued or marked as dead. Only about 385 cases that I can find, with a lot already archived. Chew(V • T • E) 22:32, 27 April 2025 (UTC)
::Actually a somewhat difficult site to retrieve Wayback links, due to their pages being almost entirely images, and there are some malformed retrievals in the Wayback Machine. Not confident all the 188 {{tld|dead link}} added are accurate but most should be. When I first ran it, almost all were tagged as dead link, then I made some adjustments and got the results below. -- GreenC 15:56, 7 May 2025 (UTC)
Enwiki
- Checked 389 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=digital.olivesoftware.com&max=500&server=enwiki&ns=None&enddate=20250506&startdate=20250509&nosect=on edited 361 pages]. Added 188 {{tld|dead link}}. Switched 42 {{para|url-status|live}} to dead. Added 1,452 archive URLs (1,445 Wayback).
IABot DB
classification.gov.au
Please check for any potential dead links and/or redirects on the [https://oflc.gov.au OFLC website] (which was shut down not that long ago) and [https://classification.gov.au the Australian Classification Board's website]. [https://en.wikipedia.org/w/index.php?search=insource%3Aoflc.gov.au+insource%3A%2Foflc.gov.au%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 9 links for OFLC] and [https://en.wikipedia.org/w/index.php?search=insource%3Aclassification.gov.au+insource%3A%2Fclassification.gov.au%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 269 links for ACB]. Lord Sjones23 (talk - contributions) 03:52, 29 April 2025 (UTC)
For oflc.gov.au I set to dead in IABot.org there are so few links -- GreenC 01:34, 7 May 2025 (UTC)
Enwiki
- Checked 270 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=classification.gov.au&max=500&server=enwiki&ns=None&enddate=20250506&startdate=20250509&nosect=on edited 116 pages]. Moved 20 links to a new URL: 11 normal redirects, 9 ghost mapped redirects, Added 44 {{tld|dead link}}. Switched 4 {{para|url-status|dead}} to live. Switched 38 {{para|url-status|live}} to dead. Added 70 archive URLs (69 Wayback). Changed 96 citation metadata.
IABot DB
- Checked about 1,100 unique URLs and updated about 750
wycombewanderers.co.uk
similar to redimps.co.uk - domain no longer active (Wycombe Wanderers F.C. now use the domain wwfc.com) but marked as permalive by IABOT. used on [https://en.wikipedia.org/w/index.php?search=insource%3Awycombewanderers.co.uk+insource%3A%2Fwycombewanderers%5B.%5Dco%5B.%5Duk%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 268 pages]. more recent articles can seemingly be fixed by changing domain - [https://www.wycombewanderers.co.uk/news/2020/november/former-chelsea-youngster-joins-the-chairboys/] as used on 2020–21 Chelsea F.C. season is live at [https://www.wwfc.com/news/2020/november/former-chelsea-youngster-joins-the-chairboys/] but older ones use a different url format so cant be fixed like this even if still live. Microwave Anarchist (talk) 13:32, 29 April 2025 (UTC)
- Odd site. Some URLs can be accessed via bot [https://www.wwfc.com/news/2020/september/third-kit-launches-live-on-tv/], other URLs it falsely returns 404 [https://www.wwfc.com/news/2018/july/cherry-red-records-are-new-front-of-home-shirt-sponsor/]. CloudFront is involved. Looks like bot protection misconfiguration, or partial coverage. It seems any URL with "/news/YYYY/month/" is working. So what I can do is for those that return 404, if it also has that "news" form, it will assume it's actually 200. I am unable to verify it, so this will be a "Blind URL Move" with the associated hazard of getting it wrong. I'll do spot checks before uploading the diffs. -- GreenC 17:18, 8 May 2025 (UTC)
- Another: Given an archive URL like [https://web.archive.org/web/20140731114335/http://www.wycombewanderers.co.uk/news/article/rowe-returns-and-walker-joins-1792305.aspx this], extract the date "31 July 2014", and from that build a new URL [https://www.wwfc.com/news/2014/july/rowe-returns-and-walker-joins here] that works. This is a inferred mapped redirect. -- GreenC 18:31, 8 May 2025 (UTC)
- :Unfortunately there is a bad combination of factors: 1. They use JavaScript so it's not evident what's being displayed without a headless browser. 2. A headless browser is slow and doesn't work without cookies to get past the cookie acceptance button and that is difficult to setup. 3. They have some unusual bot blocker, that redirects to a "404" (displayed) but actually returns status code 200. The end results is over 400 URLs ending in .aspx that could be converted to a live page, but I can't make it work due to these complications. -- GreenC 20:17, 8 May 2025 (UTC)
Enwiki
- Checked 268 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=wycombewanderers.co.uk&max=500&server=enwiki&ns=None&enddate=20250507&startdate=20250510&nosect=on edited 222 pages]. Moved 347 links to a new URL: 347 ruled mapped redirects, Resolved 426 soft-404s. Added 19 {{tld|dead link}}. Switched 27 {{para|url-status|dead}} to live. Switched 22 {{para|url-status|live}} to dead. Added 234 archive URLs (193 Wayback). Changed 17 citation metadata.
IABot DB
- Checked and updated 821 unique links
renatabernal.com
Usurped: used to be the website of an artist living in upstate NY, now has been taken over by what looks like a japanese porn company. 66.24.80.198 (talk) 17:42, 30 April 2025 (UTC)
:{{done}} in WP:JUDI batch #27 -- GreenC 15:24, 5 May 2025 (UTC)
bhaskar.com
Any subdomains of bhaskar.com don't work, like [http://daily.bhaskar.com/article/CEL-when-ajay-devgn-left-karisma-kapoor-and-married-kajol-4305245-PHO.html this] at Ajay Devgn. Some of the urls without subdomains work, like [https://www.bhaskar.com/news/c-58-1960641-NOR.html this] at Tariq Umar Khan. But [http://www.bhaskar.com/rajasthan/jaipur this] doesn't work at Jaipur. I request that the subdomains be focused on first. [https://en.wikipedia.org/w/index.php?search=insource%3A%22bhaskar.com%22&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 ~3200] overall. Thank you! MrLinkinPark333 (talk) 20:04, 1 May 2025 (UTC)
:It's difficult to check www vs everything else because it ends up being being the same process internally. I'll check them all at one go. -- GreenC 22:51, 8 May 2025 (UTC)
Enwiki
- Checked 3,317 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=bhaskar.com&max=500&server=enwiki&ns=None&enddate=20250507&startdate=20250510&nosect=on edited 1,060 pages]. Moved 356 links to a new URL: 336 normal redirects, 20 ghost mapped redirects, Resolved 1,304 soft-404s. Removed 5 {{tld|dead link}}. Added 92 {{tld|dead link}}. Switched 38 {{para|url-status|dead}} to live. Switched 181 {{para|url-status|live}} to dead. Added 537 archive URLs (498 Wayback). Changed 91 citation metadata.
IABot DB
- Checked 6,300 unique links and updated about 2,000 which propagate through 300+ wikis
finlex.fi
Per thread at top of this page. -- GreenC 01:51, 3 May 2025 (UTC)
[https://en.wikipedia.org/w/index.php?search=insource%3Afinlex.fi+insource%3A%2Ffinlex%5B.%5Dfi%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 402 pages]
This will be done in two passes. -- GreenC 05:52, 9 May 2025 (UTC)
Enwiki
:{{underline|Pass 1}} (001-402): Checked 402 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=finlex.fi&max=500&server=enwiki&ns=None&enddate=20250508&startdate=20250511&nosect=on edited 225 pages]. Moved 215 links to a new URL: 215 normal redirects, Added 2 {{tld|dead link}}. Switched 49 {{para|url-status|dead}} to live. Switched 17 {{para|url-status|live}} to dead. Added 139 archive URLs (137 Wayback).
:{{underline|Pass 2}} (001-402): Checked 402 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=finlex.fi&max=500&server=enwiki&ns=None&enddate=20250508&startdate=20250511&nosect=on edited 221 pages]. Moved 336 links to a new URL: 336 normal redirects, Switched 1 {{para|url-status|dead}} to live. Changed 6 citation metadata.
IABot DB
- Checked 3,700 unique URLs and updated 627 which propagate to 300+ wikis
portlandtribune.com
This website has suddenly removed most of their archives. For example, sports only go up to [https://portlandtribune.com/category/sports/page/4/ April 26th] of this year. Hopefully this is just a temporary implement, but I request archived link just in case. [https://en.wikipedia.org/w/index.php?search=insource%3Aportlandtribune.com%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 ~1000.] Thank you! MrLinkinPark333 (talk) 00:30, 4 May 2025 (UTC)
Enwiki
- Checked 1,017 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=portlandtribune.com&max=500&server=enwiki&ns=None&enddate=20250508&startdate=20250511&nosect=on edited 774 pages]. Moved 23 links to a new URL: 1 normal redirects, 22 ghost mapped redirects, Added 102 {{tld|dead link}}. Switched 4 {{para|url-status|dead}} to live. Switched 225 {{para|url-status|live}} to dead. Added 663 archive URLs (622 Wayback). Changed 4 citation metadata.
IABot DB
- Checked 1,335 unique links and updated about the same
almasdarnews.com
The site has been down for some time and there are quite a few articles that use it. Thanks, David O. Johnson (talk) 03:36, 6 May 2025 (UTC)
[https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Aalmasdarnews.com+insource%3A%2Falmasdarnews%5B.%5Dcom%2F&title=Special%3ASearch&ns0=1 707 pages]
Enwiki
- Checked 711 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=almasdarnews.com&max=500&server=enwiki&ns=None&enddate=20250508&startdate=20250511&nosect=on edited 265 pages]. Added 14 {{tld|dead link}}. Switched 303 {{para|url-status|live}} to dead. Added 330 archive URLs (312 Wayback). Changed 11 citation metadata.
IABot DB
- Checked and updated about 5,400 unique URLs
{{done}} -- GreenC 02:06, 10 May 2025 (UTC)
:Appreciate it. David O. Johnson (talk) 02:16, 10 May 2025 (UTC)
iana.org/root-whois
Looks like there used to be redirects from
[https://en.wikipedia.org/w/index.php?search=insource%3Aiana.org+insource%3A%2Fiana.org%5C%2Froot-whois%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&ns6=1 212 pages]
:The example for .zr is a dead link [https://www.iana.org/domains/root/db/zr.html] and archive snapshots go to 404 pages. The [https://www.iana.org/domains/root/db/aq.html .aq works]. In the case of .zr it will end up adding a {{tld|dead link}} template and leaving the original URL. -- GreenC 22:32, 9 May 2025 (UTC)
Enwiki
- Checked 212 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=iana.org&max=500&server=enwiki&ns=None&enddate=20250508&startdate=20250511&nosect=on edited 212 pages]. Moved 216 links to a new URL: 1 normal redirects, 215 ruled mapped redirects, Switched 1 {{para|url-status|dead}} to live.
IABot DB
- Checked and updated 600 unique URLs which propagate through 300+ wikis
Vice
:moved from Wikipedia:AutoWikiBrowser/Tasks
= motherboard.vice.com =
Shouldn't all motherboard.vice.com links be replaced?
So from:
https://motherboard.vice.com/en_us/article/ identifier
/articlename
to:
https://www.vice.com/en/article /articlename
So for example if you got this link:
https://motherboard.vice.com/en_us/article/yp3ppm/robert-de-niro-wants-a-dialogue-about-anti-vaxxing-at-tribeca-film-festival
the article is actually located at:
https://www.vice.com/en/article/robert-de-niro-wants-a-dialogue-about-anti-vaxxing-at-tribeca-film-festival
894 results for [https://en.wikipedia.org/w/index.php?go=Go&search=insource%3A%22motherboard.vice.com%22&title=Special%3ASearch&ns0=1 insource:"motherboard.vice.com"]
Polygnotus (talk) 14:58, 9 May 2025 (UTC)
:Interesting... I'll try to do something about that, even though I'm not the most experienced with RegEx... - OpalYosutebito 『talk』 『articles I want to eat』 15:29, 9 May 2025 (UTC)
:Polygnotus, sounds like a job for WP:URLREQ. — Qwerfjkltalk 15:53, 9 May 2025 (UTC)
[https://en.wikipedia.org/w/index.php?search=insource%3Amotherboard.vice.com+insource%3A%2Fmotherboard%5B.%5Dvice%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&ns6=1 897 pages]
----
There are numerous redirect mapping rules:
- http://motherboard.vice.com/read/turns-out-google-glass-is-good-for-breastfeeding --> https://www.vice.com/en/article/turns-out-google-glass-is-good-for-breastfeeding
- http://motherboard.vice.com/en_uk/read/this-guy-implanted-his-bitcoin-wallet-and-made-a-payment-with-his-hand --> https://www.vice.com/en/article/this-guy-implanted-his-bitcoin-wallet-and-made-a-payment-with-his-hand
- http://motherboard.vice.com/2012/8/6/nasa-s-mars-rover-crashed-into-a-dmca-takedown --> http://www.vice.com/en/article/nasa-s-mars-rover-crashed-into-a-dmca-takedown
- https://www.vice.com/en/article/neapqg/300-californian-cities-secretly-have-access-to-palantir --> https://www.vice.com/en/article/300-californian-cities-secretly-have-access-to-palantir
- etc..
Everything changed to "www.vice.com/en/article/" .. including other vice.com URLs .. I'll focus on motherboard for now -- GreenC 03:31, 10 May 2025 (UTC)
:Thank you! Polygnotus (talk) 03:42, 10 May 2025 (UTC)
:Enwiki
:* Checked 897 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=motherboard.vice.com&max=500&server=enwiki&ns=None&enddate=20250509&startdate=20250512&nosect=on edited 885 pages]. Moved 1,049 links to a new URL: 1,038 ruled mapped redirects, 11 ghost mapped redirects, Resolved 2 soft-404s. Removed 3 {{tld|dead link}}. Switched 92 {{para|url-status|dead}} to live. Switched 1 {{para|url-status|live}} to dead. Added 4 archive URLs (4 Wayback). Changed 2 citation metadata.
= misc.vice.com =
The subdomains whose content is still hosted on vice.com, but in the root, are sports, munchies, noisey, news, waypoint, thump, broadly, and fightland.
It is unclear to me what happened to the video subdomain. It seems to have disappeared?
There is also a /read/ as you mentioned above (not just on the motherboard subdomain).
https://www.vice.com/read/prolific-music-critic-robert-christgau-knows-what-he-likes-and-hates-v23n07
which moved to:
https://www.vice.com/en/article/prolific-music-critic-robert-christgau-knows-what-he-likes-and-hates-v23n07/
/blog/ also changed to /article/
I searched for insource:/[^(fightland|broadly|thump|video|waypoint|sports|munchies|noisey|news|motherboard)]\.vice\.com/
and I think that was all of 'em.
Polygnotus (talk) 04:47, 10 May 2025 (UTC)
:[https://en.wikipedia.org/w/index.php?search=insource%3Avice.com+insource%3A%2F%28%5B.%5D%7C%5C%2F%29%28fightland%7Cbroadly%7Cthump%7Cvideo%7Cwaypoint%7Csports%7Cmunchies%7Cnoisey%7Cnews%29%5C.vice%5C.com%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 4,741 pages] -- GreenC 05:51, 10 May 2025 (UTC)
:Enwiki
:* Checked 4,746 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=vice.com&max=500&server=enwiki&ns=None&enddate=20250509&startdate=20250512&nosect=on edited 4,681 pages]. Moved 5,121 links to a new URL: 4 normal redirects, 5,091 ruled mapped redirects, 26 ghost mapped redirects, Removed 9 {{tld|dead link}}. Added 10 {{tld|dead link}}. Switched 258 {{para|url-status|dead}} to live. Switched 39 {{para|url-status|live}} to dead. Added 312 archive URLs (299 Wayback). Changed 309 citation metadata.
:IABot DB
:* Checked about 9,000 unique links (includes motherboard); updated 8,691 which propagate to 300+ wikis
= i-d.vice.com =
This is related but different:
It looks like a company called Bedford Media acquired a magazine called i-D from Vice media.
They seem to have relocated to a different domain.
So for example the old URLs looked like this:
https://i-d.vice.com/en/article/qjkakb/how-did-taylor-swift-get-successful
which was moved to:
https://i-d.co/article/how-did-taylor-swift-get-successful/
I haven't done a full investigation but this appears to be related to the fact that Vice Media was in financial trouble at some point. Polygnotus (talk) 04:35, 10 May 2025 (UTC)
[https://en.wikipedia.org/w/index.php?search=insource%3Ai-d.vice.com%20insource%3A%2Fi-d.vice.com%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 1403 pages] Polygnotus (talk) 04:42, 10 May 2025 (UTC)
::Yeah I thought Vice was a gonner, but glad to see they have not only kept their old content accessible (95% anyway), but reorganized the URLs to be consistent, most large sites don't achieve this degree of accuracy. It's unfortunate they somehow got entangled with the Trump clan (Bedford Media). Ironically the Soros Fund Management may have sold it to them. An article from a year ago says many of the old sites like Motherboard will be coming back.[https://www.axios.com/2024/05/09/vice-media-relaunch-savage-ventures] If so that could change everything again, depending what they do with the archives. -- GreenC 21:15, 11 May 2025 (UTC)
:::https://variety.com/2023/digital/news/karlie-kloss-acquires-id-magazine-vice-media-1235790828/ {{tq|Kloss supported Hillary Clinton in the 2016 United States presidential election, Joe Biden in the 2020 election, and Kamala Harris in the 2024 election. She and her husband attended the March for Our Lives event in Washington, D.C. in protest of gun violence in March 2018. She is a feminist and has stated that her decision to leave Victoria's Secret was partly motivated by her feminist beliefs.}} -- Karlie_Kloss#Personal_life Polygnotus (talk) 04:33, 12 May 2025 (UTC)
:Enwiki
:* Checked 1,404 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=vice.com&max=500&server=enwiki&ns=None&enddate=20250511&startdate=20250514&nosect=on edited 1,366 pages]. Moved 1,426 links to a new URL: 1,426 ruled mapped redirects, Resolved 206 soft-404s. Removed 1 {{tld|dead link}}. Added 8 {{tld|dead link}}. Switched 124 {{para|url-status|dead}} to live. Switched 32 {{para|url-status|live}} to dead. Added 139 archive URLs (139 Wayback). Changed 28 citation metadata.
:IABot DB
:* Checked and fixed about 2,000 unique links -- GreenC 04:23, 12 May 2025 (UTC)
:{{done}} -- GreenC 15:33, 12 May 2025 (UTC)
::Daaaaaang... you're out here doing the lord's work! - OpalYosutebito 『talk』 『articles I want to eat』 04:32, 12 May 2025 (UTC)
:::Indeed, the thing God cares most about is the preservation of, and correct linking to, webpages. 666 is not the devils number, 404 is. Polygnotus (talk) 04:34, 12 May 2025 (UTC)
::::lol ! GreenC 15:33, 12 May 2025 (UTC)
::::There are two things that us Wikipedians find absolutely detestable: 404 links and lost media. Also, the number 4 is unlucky in East Asian cultures, since it's often pronounced similarly to "death" in Chinese and Japanese - OpalYosutebito 『talk』 『articles I want to eat』 01:33, 13 May 2025 (UTC)
:::::I noticed this when working on International_Commerce_Centre#Floor_count they end up skipping a lot of floors. -- GreenC 16:29, 13 May 2025 (UTC)
=www.vice.com=
User:Polygnotus: [https://en.wikipedia.org/w/index.php?search=insource%3Awww.vice.com+insource%3A%2Fwww%5B.%5Dvice%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 16,000+ www pages]. The first result at Google contains https://www.vice.com/en/article/v7d7j9/google-had-secret-project-to-convince-employees-that-unions-suck which redirects to https://www.vice.com/en/article/google-had-secret-project-to-convince-employees-that-unions-suck/ .. I think we should update. Couple reasons: the redirects may eventually stop working like the misc.vice.com set. Also I found with the misc set sometimes URLs with a non-English language code like "/nl/" may no longer work but when changed to "/en/" works. Some other things like that. Adding new URLs also triggers saves into the Wayback Machine. -- GreenC 05:44, 11 May 2025 (UTC)
:@GreenC I kinda feel guilty for having you do all this work. I agree that relying on redirects is not a good idea, despite them being 301s, because there are just so many that having to change them again would suck. According to [https://www.reuters.com/business/media-telecom/vice-media-relaunch-digital-platform-partnership-with-savage-ventures-2024-05-09/ Reuters] they have some weird deal with a company called "Savage Ventures" but that seems to only apply to the videos. If I dig
video.vice.com I see a CNAME of savageplatform.[https://docs.wpvip.com/domains/convenience-domains/ go-vip.net]. It is possible that they are still working on bringing the videos back (because you wouldn't use go-vip.net in prod); and if they do there are another [https://en.wikipedia.org/w/index.php?search=insource%3Avideo.vice.com%20insource%3A%2Fvideo%5B.%5Dvice%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1&ns6=1 91] that are fixable. Polygnotus (talk) 05:59, 11 May 2025 (UTC)
::There was work finding and building rules but that's not uncommon, vice is in the medium category of difficulty. Beyond that it's the computer doing the work and the rules are now programmed. Getting them now is more accurate than trying to build a redirect map later. For videos, about half of the links in the misc set that are Wayback or dead link or url-status=dead are /video/ or /blog/ .. the rest are pages that no longer exist, or moved to a very different URL with no pattern. -- GreenC 14:00, 11 May 2025 (UTC)
:Enwiki
:* {{underline|Pass 1}} (00001-03000) Checked 3,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=www.vice.com&max=500&server=enwiki&ns=None&enddate=20250511&startdate=20250514&nosect=on edited 2,004 pages]. Moved 2,369 links to a new URL: 10 normal redirects, 2,345 ruled mapped redirects, 14 ghost mapped redirects, Added 3 {{tld|dead link}}. Switched 52 {{para|url-status|dead}} to live. Switched 7 {{para|url-status|live}} to dead. Added 40 archive URLs (37 Wayback). Changed 268 citation metadata.
:* {{underline|Pass 2}} (03001-09000) Checked 6,000 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=www.vice.com&max=500&server=enwiki&ns=None&enddate=20250511&startdate=20250514&nosect=on edited 3,995 pages]. Moved 4,677 links to a new URL: 18 normal redirects, 4,639 ruled mapped redirects, 20 ghost mapped redirects, Removed 1 {{tld|dead link}}. Added 6 {{tld|dead link}}. Switched 107 {{para|url-status|dead}} to live. Switched 18 {{para|url-status|live}} to dead. Added 86 archive URLs (82 Wayback). Changed 586 citation metadata.
:* {{underline|Pass 3}} (09001-16923) Checked 7,923 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=www.vice.com&max=500&server=enwiki&ns=None&enddate=20250512&startdate=20250515&nosect=on edited 5,287 pages]. Moved 6,184 links to a new URL: 30 normal redirects, 6,125 ruled mapped redirects, 29 ghost mapped redirects, Removed 2 {{tld|dead link}}. Added 3 {{tld|dead link}}. Switched 145 {{para|url-status|dead}} to live. Switched 14 {{para|url-status|live}} to dead. Added 107 archive URLs (104 Wayback). Changed 745 citation metadata.
:IABot DB
:* Checked about 20,000 unique URLs and updated 3,282 which propagate through 300+ wikis
= edge.vice.com =
Edge cases known via IABot scans somewhere in the 300+ wikis:
- insource:vice.com insource:([.]|\/)(artofblue|company|eyeforaneye|films|hbo|thecreatorsproject|www2|wwww|thump2|daily|livenation|jp|creators|takemeanywhere|chillie|als|tonic|impact|cookiewall|garage|partners|plus|cutoff|upload-assets|free|assets|amuse|filmschool|images|motherboard-images|video-images|vice-images|noticias|studios|new)[.]vice[.]com/
This is more than the regex can handle before timing out it returns [https://en.wikipedia.org/w/index.php?search=insource%3Avice.com+insource%3A%2F%28%5B.%5D%7C%5C%2F%29%28artofblue%7Ccompany%7Ceyeforaneye%7Cfilms%7Chbo%7Cthecreatorsproject%7Cwww2%7Cwwww%7Cthump2%7Cdaily%7Clivenation%7Cjp%7Ccreators%7Ctakemeanywhere%7Cchillie%7Cals%7Ctonic%7Cimpact%7Ccookiewall%7Cgarage%7Cpartners%7Cplus%7Ccutoff%7Cupload-assets%7Cfree%7Cassets%7Camuse%7Cfilmschool%7Cimages%7Cmotherboard-images%7Cvideo-images%7Cvice-images%7Cnoticias%7Cstudios%7Cnew%29%5B.%5Dvice%5B.%5Dcom%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 355]. With an SQL query it gets 442 pages on enwiki. -- GreenC 16:06, 13 May 2025 (UTC)
:Enwiki
:* Checked 442 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=vice.com&max=500&server=enwiki&ns=None&enddate=20250512&startdate=20250515&nosect=on edited 424 pages]. Moved 310 links to a new URL: 1 normal redirects, 307 ruled mapped redirects, 2 ghost mapped redirects, Resolved 15 soft-404s. Removed 1 {{tld|dead link}}. Added 4 {{tld|dead link}}. Switched 7 {{para|url-status|dead}} to live. Switched 22 {{para|url-status|live}} to dead. Added 127 archive URLs (125 Wayback). Changed 21 citation metadata.
:IABot DB
:Checked 684 unique urls and updated 657 which propagate through 300+ wikis
= vice.com =
[https://en.wikipedia.org/w/index.php?go=Go&search=insource%3Avice.com+insource%3A%2F%5C%2Fvice%5B.%5Dcom%2F&title=Special%3ASearch&ns0=1 16 pages]
:Enwiki
:* Checked 16 pages and [https://sigma.toolforge.org/summary.py?name=GreenC+bot&search=vice.com&max=500&server=enwiki&ns=None&enddate=20250512&startdate=20250515&nosect=on edited 14 pages]. Moved 14 links to a new URL: 14 ruled mapped redirects, Switched 3 {{para|url-status|dead}} to live.
:IABot DB
:*Checked 11 unique URLs and updated 5
Railway Heritage Register Online
Per the discussion at Wikipedia talk:WikiProject UK Railways the Railway Heritage Register Online have revamped their site with a new database without any clear link between them. While they are being fixed manually, would it be possible for a bot to mark links to http://www.ws.rhrp.org.uk/ws/ (the old pattern) as dead and link to the Internet Archive version. The new pattern starts https://ws.rhrp.org.uk/WagonSurvey/ so those links should be left as-is. Thryduulf (talk) 16:09, 9 May 2025 (UTC)
:User:Thryduulf that would be no problem. -- GreenC 16:50, 9 May 2025 (UTC)
:[https://en.wikipedia.org/w/index.php?search=insource%3Aws.rhrp.org.uk+insource%3A%2Fws%5B.%5Drhrp%5B.%5Dorg%5B.%5Duk%5C%2Fws%5C%2F%2F&title=Special%3ASearch&profile=advanced&fulltext=1&ns0=1 27 pages]
::There are two Railway heritage Register Online paths that have been moved. In addition to the above wagon pages, there are carriage pages that start http://www.cs.rhrp.org.uk/se/ (old pattern) which have now been replaced by https://cs.rhrp.org.uk/CarriageSurvey/ Geof Sheppard (talk) 17:22, 10 May 2025 (UTC)
basic.newspapers.com
For newspapers.com, some institutions give users under a specific company or institution access to newspapers.com, and the url appears as basic.newspapers.com. When attempting to access this from a device that has access to newspapers.com, it presents the error:
Unauthorized Access
Please return to your institution's website or portal and sign in again.
This can be fixed by removing the "basic." from the start of every newspapers.com url that uses the basic subdomain. It causes no issues for computers in those institutions, as the domain automatically redirects them to a basic page. See Jayson Werth as an article that has this issue. The bot would need to run multiple times, to correct any urls that have come up between times, but probably not often at all. Yoblyblob (Talk) :) 18:47, 9 May 2025 (UTC)
:[https://en.wikipedia.org/w/index.php?fulltext=Search&search=insource%3Abasic.newspapers.com+insource%3A%2Fbasic.newspapers.com%2F&title=Special%3ASearch&ns0=1 48 pages] GreenC 19:25, 9 May 2025 (UTC)
Template:Refideas on talk pages
I've noticed that many URLs within the {{t|Refideas}} template ("{{tq|This template provides a way to tell other editors about potentially useful references that could be used to improve the article.}}") on article talk pages are broken. Is it possible for a bot to check [https://w.wiki/E5FL all 24,250 talk pages] and add archived versions of these URLs if needed? I'm not sure if this is feasible, since the bot would need to avoid altering any URLs outside of the Refideas template. 87.95.243.221 (talk) 16:46, 12 May 2025 (UTC)