![]() | This page is an archive. Do not edit the contents of this page. Please direct any additional comments to the current main page. |
I noticed
http://www.vh1.com/news/articles/1497672/03022005/mudvayne.jhtml just redirects to
https://www.facebook.com/VH1/ which is less than helpful. Then I noticed even
http://www.vh1.com/ redirects to Facebook.
The article in question was archived and is actually still live at
https://www.mtv.com/news/xu79dk/mudvayne-lose-the-makeup-find-inspiration-in-isolation so if any article wasn't archived it could be worth having a log of what failed so someone could search mtv.com for it. — Alexis Jazz (
talk or ping me) 10:35, 18 November 2023 (UTC)
Step 1: fix dead links
|url-status=live
-> dead{{
dead link}}
tags ie. no archives availableStep 2: add archive URLs to CS1|2 that have no archive URL, and set |url-status=live
User:Alexis Jazz: this is done. -- Green C 22:56, 1 December 2023 (UTC)
User:Alexis Jazz: here are 154 pages with 203 URLs my bot marked with {{
dead link}}
(there might be others preexisting). BTW I noticed many of the archive URLs are poor quality, due to music videos in the source links, the archive providers often have trouble with video. --
Green
C 16:37, 2 December 2023 (UTC)
Can usemod.com go on WP:JUDI? See /info/en/?search=Special:LinkSearch?target=*.usemod.com
Is the bot able to update links? http://www.usemod.com/cgi-bin/mb.pl?GoodBye should be http://meatballwiki.org/wiki/GoodBye WhatamIdoing ( talk) 06:02, 28 November 2023 (UTC)
{{
usurped}}
to
User:Sj/Presentation? It's a lot of personal space and talk page comments to be modified without permissions. --
Green
C 06:50, 2 December 2023 (UTC)
Was editing a broken reference here, & upon trying the original site, it was immediately blocked by my Anti-virus. Apparently it's now been usurped into a site injecting Malware (or perhaps just that link, I'm not really keen on dealing with a citation giving me Malware again). I've corrected the archive to what appears to be a working & safe version of the reference & set the link to usurped. Thought it'd be prudent to mention it here, in case there's other links to the site lurking on the Wiki.
Here's the links to my 2 edits for quick reference. Again, the archive appears to be safe, but I wouldn't recommend going to the original site without active Anti-virus protection. 1. 2.
(Side note: Unfortunately I can't remember or find the previous citation/site that gave me Malware, but it should be in the list of my deleted edits if someone has access to that, with a very obvious "TROJAN WARNING" quote) Silverleaf81 ( talk) 05:53, 2 December 2023 (UTC)
The domain name flare.com is for sale! The magazine has moved to https://fashionmagazine.com/flare/ but the old content no longer seems to be online. Much of it has been archived in the usual places. Certes ( talk) 17:32, 6 December 2023 (UTC)
Hello, please change all links (in the main namespace) of the form http://www.nextbestpicture.com/2/post/2020/12/the-2020-indiana-film-journalists-association-ifja-winners.html to https://nextbestpicture.com/the-2020-indiana-film-journalists-association-ifja-winners/ (i.e. everything between the first slash after the domain name and the last one in the link should be removed, the ".html" should be replaced with a slash, and HTTP should be changed to HTTPS). Lots of these links seem to be marked as dead by InternetArchiveBot, including at Clarke Peters (where I noticed this and fixed it manually) and On the Rocks (film). Thanks! Graham87 ( talk) 07:06, 12 December 2023 (UTC)
Graham87: here you go Special:Diff/1186100009/1190645424. Good find. It edited over 500 pages, fixed many cites. It was difficult they use a bot blocker that's why Wayback Machine and IABot had trouble. I had a solution for it and was able to verify the new links work, in a few cases it required an archive URL. -- Green C 02:43, 19 December 2023 (UTC)
Hello. I notice that after clicking on this IMO link, it says the website moved to a new url and the old one will be available until this month. Looking through the IMO links on Wikipedia, some formats can be swapped over already:
There are other ones that aren't in these three categories and that I don't see in the new website. Here are some examples. I was wondering if the old public.wmo.int links could be changed to the new wmo.int links where possible, and the broken public.wmo.int with no new URL could be archived. There's 436 links to go through. Thanks! MrLinkinPark333 ( talk) 00:29, 17 December 2023 (UTC)
MrLinkinPark333: Here is what I did: migrate links where possible, as you discovered above like with Press releases, simply by changing the URL. This method only worked for some, the new site doesn't have all the pages from the old site. Thus, anything it couldn't find at the new site, it converted to public-old.wmo.int to bypass the information page that says the link is doomed. Then it saved a copy of the public-old.wmo.int link to the Wayback Machine. Then it added those Wayback links into the citation as archive URLs with url-status of dead (soon dead). I think this method saved the most content from imminent destruction. At some point later, once the new site is working, I can make more changes if you see ways to convert the public-old.wmo.int links to the new site at wmo.int. There are 195 public-old links in 160 articles. -- Green C 19:14, 18 December 2023 (UTC)
Further reading Corbett, C.T. (1958) Our Pioneer Nazarenes. Kansas City, MO.: Nazarene Publishing House. [2][permanent dead link]
This can be corrected by linking to one of the following: https://whdl.org/en/browse/resources/6629 https://nmi.whdl.org/en/browse/resources/6629 https://apnts.whdl.org/en/browse/resources/6629
Thanks! 174.127.124.132 ( talk) 07:22, 17 December 2023 (UTC)
It seems that the Episcopal Diocese of North West Texas used the URL www.nwt.org for information about the candidates. That site is now for sale. References to that site, such as at /info/en/?search=Scott_Mayer_(bishop) should be corrected/removed. Fr Kevin PJ Coffey, SCP ( talk) 16:45, 18 December 2023 (UTC)
The formatting of exoplanet.eu catalog entries has changed recently, so that all entries now have a numeric ID (e.g. 1261 for Kepler-62f). The previous format (which had the planet name alone) still soft-redirects to the correct target, but older links using a previous format need to be corrected by hand. – LaundryPizza03 ( d c̄) 01:29, 15 December 2023 (UTC)
User:LaundryPizza03: Is there an example of an old link, and its corresponding new link? -- Green C 04:08, 15 December 2023 (UTC)
I see "kepler-62" (dash) is now "kepler_62" (underscore). It might be be possible to convert ?p1=55+Cnc&p2=b
to 55_cnc_b
and then loading that page
https://exoplanet.eu/catalog/55_cnc_b/ and extracting the new URL from the HTML. As you suggest, I'll take a look at the linksearch and see how homogeneous. I'll get to this not immediately. --
Green
C 04:35, 15 December 2023 (UTC)
User:LaundryPizza03: Seeing a lot of links
like this. I added an archive URL because the source link is dead. I'd prefer to convert them to the new /catalog url scheme, but there is no way to link to a star, only planets,
like this. Am I missing something? What do you recommend for URLs with star.php?st=
--
Green
C 19:02, 21 December 2023 (UTC)
star_name="HD 5319"
then click "Apply filter" it brings up a list of planets. However, there is no way to link to this search result. Only a person manually entering the star name can find it, there is no API or mechanism for automated use. --
Green
C 19:21, 21 December 2023 (UTC)
{{
Cite EPE}}
. Over time, individual pages at the site will stop working, and the standard link rot tools won't detect or fix them, when the links are abstracted behind a custom external link template. I suppose it's possible the template could be useful if the entire site changes structure, but most likely the data in the template won't be sufficient to accommodate the new URL scheme. Thus at best the template makes adding a link a little quicker, and more uniform looking, but at the cost of increased link rot and challenges down the road when the URL scheme changes. I've always thought standard cite templates are the best way to go because there are so many tools that support them. --
Green
C 02:51, 23 December 2023 (UTC)This forum is getting a lot of requests recently. The requests can take a lot of work, 1-7 days each depending on the complexity: custom programming, data discovery, running tests cases, qualifying results, designing algorithms, waiting for the bot to run (slow due to networking), etc... Furthermore, my time to do this work is limited! If you make a request, and time goes by, that is why. I wish there was a way to boilerplate it, and I have generalized the code as much as possible, but ultimately this work is bespoke and artistic in nature due to the endless variety of conditions at remote sites. I try to respond to requests in chronological order, except when a site needs be triaged due to imminent outage, has an extremely large footprint, or can be addressed quickly, in those cases I might respond before some others. -- Green C 20:10, 20 December 2023 (UTC)
The
WP:JUDI folks have gotten to it. I'll add the archive URLs at
Draft:Chris Byars once I get off my school laptop (which blocks IA). Cheers,
Mach61 (
talk) 22:14, 23 December 2023 (UTC)
The sub-site "inventors.█████.com" ("about" censored because of wiki filter) now appears to be " thoughtco.com", with references/external links either linking to the same article on the new site, or simply don't work. Apparently there are 150+ articles using the inventors URL (1), & what looks like 500+ external link search results (2), although a significant portion are on talk pages. Silverleaf81 ( talk) 09:28, 17 December 2023 (UTC)
User:Silverleaf81: This is done. It got most of them. It added 341 archive.today URLs. A list of about 50 questionables is at Wikipedia:Link_rot/cases/inventors.about.com but not all of them are legitimately a problem. -- Green C 02:24, 26 December 2023 (UTC)
According to this archived link, the IPA fonts were transferred from IPA to the Character Information Technology Promotion Council, who now host the fonts on their website. Citation 14 should link to https://moji.or.jp/mojikiban/font/ and Citations 13 and 22 (which is a dead link) should be https://moji.or.jp/ipafont/.
(Apologies if this is the wrong place for this. I'm new to editing and I didn't want to mess up the citation.) Ichneumonidae ( talk) 18:25, 26 December 2023 (UTC)
My website runeberg.org just recently moved from http: to https: so it would be nice if someone could update the remaining 11,000 links accordingly. This is not urgent, as everything works fine with automatic redirects, but it would be nice. Thank you. -- LA2 ( talk) 22:57, 17 December 2023 (UTC)
{{
dead link}}
tag. The rest are converted to https. There was some typos and non-working links to Google Translate I manually fixed.
List of http runeberg.org links --
Green
C 20:31, 26 December 2023 (UTC)I found many broken links to Yahoo! Groups. Can we find archived copies of these pages? Jarble ( talk) 18:19, 18 December 2023 (UTC)
Jarble: The bot added 1,474 new archive URLs. I limited it to only adding archive.today because it has the best coverage for this site, Wayback had trouble making good saves due to logins and cookies. There were 115 it couldn't find and added a {{
dead link}}
. Also added the archives to IABot's database so these updates will propagate to over 300 other wikis. --
Green
C 04:48, 28 December 2023 (UTC)
The website www.spacelaunchreport.com was cited extensively in many spaceflight articles and now has been usurped by an adware site of some sort. Could all of these links please be archived? Example link http://www.spacelaunchreport.com/falcon9ft.html#f9stglog from List of Falcon 9 first-stage boosters. Ergzay ( talk) 10:02, 27 December 2023 (UTC)
Many links from http://www.atsdr.cdc.gov have been migrated to https://atsdr.cdc.gov or https://wwwn.cdc.gov, which has broken a lot of links. Some automated attempts to archive the pages have resulted in archives of 404 errors at this page. I noticed this on Health effects of radon, and unfortunately the IDs on a lot of these pages ("ToxFAQs") have no relation to the new, identical pages on the HTTPS websites. Additionally, some articles like Peninsula Extension refer to Public Health Assessments, which need to be found in an archived page since the files have been deleted and are only available by email request. Reconrabbit ( talk| edits) 18:38, 19 December 2023 (UTC)
User:Reconrabbit: I can see why this has gone unaddressed for so long it's complicated. I can't promise everything is perfect but most everything that is dead now has an archive URL. They use JavaScript redirects which gave bots trouble, thus the bad archive URLs. I checked the existing archive URLs for soft-404s, this is imperfect, but it did find and replace a few: Special:Diff/1190591816/1192546009 I fixed a few of the ToxFAQ links by manually looking them up: Special:Diff/1189670705/1192547048 But most were simply archived: Special:Diff/1121144402/1192546200 If you want to create a map of old -> new the bot can use that to make changes on-wiki.
The http links existed in about 350 articles. The bot edited 211 pages. I think the difference is the links were already archived, or working such as the PDFs. It added 141 new archive URLs. And it made 127 redirect moves: Special:Diff/1154065478/1192545155 Hope that helps. -- Green C 00:05, 30 December 2023 (UTC)
![]() | This page is an archive. Do not edit the contents of this page. Please direct any additional comments to the current main page. |
I noticed
http://www.vh1.com/news/articles/1497672/03022005/mudvayne.jhtml just redirects to
https://www.facebook.com/VH1/ which is less than helpful. Then I noticed even
http://www.vh1.com/ redirects to Facebook.
The article in question was archived and is actually still live at
https://www.mtv.com/news/xu79dk/mudvayne-lose-the-makeup-find-inspiration-in-isolation so if any article wasn't archived it could be worth having a log of what failed so someone could search mtv.com for it. — Alexis Jazz (
talk or ping me) 10:35, 18 November 2023 (UTC)
Step 1: fix dead links
|url-status=live
-> dead{{
dead link}}
tags ie. no archives availableStep 2: add archive URLs to CS1|2 that have no archive URL, and set |url-status=live
User:Alexis Jazz: this is done. -- Green C 22:56, 1 December 2023 (UTC)
User:Alexis Jazz: here are 154 pages with 203 URLs my bot marked with {{
dead link}}
(there might be others preexisting). BTW I noticed many of the archive URLs are poor quality, due to music videos in the source links, the archive providers often have trouble with video. --
Green
C 16:37, 2 December 2023 (UTC)
Can usemod.com go on WP:JUDI? See /info/en/?search=Special:LinkSearch?target=*.usemod.com
Is the bot able to update links? http://www.usemod.com/cgi-bin/mb.pl?GoodBye should be http://meatballwiki.org/wiki/GoodBye WhatamIdoing ( talk) 06:02, 28 November 2023 (UTC)
{{
usurped}}
to
User:Sj/Presentation? It's a lot of personal space and talk page comments to be modified without permissions. --
Green
C 06:50, 2 December 2023 (UTC)
Was editing a broken reference here, & upon trying the original site, it was immediately blocked by my Anti-virus. Apparently it's now been usurped into a site injecting Malware (or perhaps just that link, I'm not really keen on dealing with a citation giving me Malware again). I've corrected the archive to what appears to be a working & safe version of the reference & set the link to usurped. Thought it'd be prudent to mention it here, in case there's other links to the site lurking on the Wiki.
Here's the links to my 2 edits for quick reference. Again, the archive appears to be safe, but I wouldn't recommend going to the original site without active Anti-virus protection. 1. 2.
(Side note: Unfortunately I can't remember or find the previous citation/site that gave me Malware, but it should be in the list of my deleted edits if someone has access to that, with a very obvious "TROJAN WARNING" quote) Silverleaf81 ( talk) 05:53, 2 December 2023 (UTC)
The domain name flare.com is for sale! The magazine has moved to https://fashionmagazine.com/flare/ but the old content no longer seems to be online. Much of it has been archived in the usual places. Certes ( talk) 17:32, 6 December 2023 (UTC)
Hello, please change all links (in the main namespace) of the form http://www.nextbestpicture.com/2/post/2020/12/the-2020-indiana-film-journalists-association-ifja-winners.html to https://nextbestpicture.com/the-2020-indiana-film-journalists-association-ifja-winners/ (i.e. everything between the first slash after the domain name and the last one in the link should be removed, the ".html" should be replaced with a slash, and HTTP should be changed to HTTPS). Lots of these links seem to be marked as dead by InternetArchiveBot, including at Clarke Peters (where I noticed this and fixed it manually) and On the Rocks (film). Thanks! Graham87 ( talk) 07:06, 12 December 2023 (UTC)
Graham87: here you go Special:Diff/1186100009/1190645424. Good find. It edited over 500 pages, fixed many cites. It was difficult they use a bot blocker that's why Wayback Machine and IABot had trouble. I had a solution for it and was able to verify the new links work, in a few cases it required an archive URL. -- Green C 02:43, 19 December 2023 (UTC)
Hello. I notice that after clicking on this IMO link, it says the website moved to a new url and the old one will be available until this month. Looking through the IMO links on Wikipedia, some formats can be swapped over already:
There are other ones that aren't in these three categories and that I don't see in the new website. Here are some examples. I was wondering if the old public.wmo.int links could be changed to the new wmo.int links where possible, and the broken public.wmo.int with no new URL could be archived. There's 436 links to go through. Thanks! MrLinkinPark333 ( talk) 00:29, 17 December 2023 (UTC)
MrLinkinPark333: Here is what I did: migrate links where possible, as you discovered above like with Press releases, simply by changing the URL. This method only worked for some, the new site doesn't have all the pages from the old site. Thus, anything it couldn't find at the new site, it converted to public-old.wmo.int to bypass the information page that says the link is doomed. Then it saved a copy of the public-old.wmo.int link to the Wayback Machine. Then it added those Wayback links into the citation as archive URLs with url-status of dead (soon dead). I think this method saved the most content from imminent destruction. At some point later, once the new site is working, I can make more changes if you see ways to convert the public-old.wmo.int links to the new site at wmo.int. There are 195 public-old links in 160 articles. -- Green C 19:14, 18 December 2023 (UTC)
Further reading Corbett, C.T. (1958) Our Pioneer Nazarenes. Kansas City, MO.: Nazarene Publishing House. [2][permanent dead link]
This can be corrected by linking to one of the following: https://whdl.org/en/browse/resources/6629 https://nmi.whdl.org/en/browse/resources/6629 https://apnts.whdl.org/en/browse/resources/6629
Thanks! 174.127.124.132 ( talk) 07:22, 17 December 2023 (UTC)
It seems that the Episcopal Diocese of North West Texas used the URL www.nwt.org for information about the candidates. That site is now for sale. References to that site, such as at /info/en/?search=Scott_Mayer_(bishop) should be corrected/removed. Fr Kevin PJ Coffey, SCP ( talk) 16:45, 18 December 2023 (UTC)
The formatting of exoplanet.eu catalog entries has changed recently, so that all entries now have a numeric ID (e.g. 1261 for Kepler-62f). The previous format (which had the planet name alone) still soft-redirects to the correct target, but older links using a previous format need to be corrected by hand. – LaundryPizza03 ( d c̄) 01:29, 15 December 2023 (UTC)
User:LaundryPizza03: Is there an example of an old link, and its corresponding new link? -- Green C 04:08, 15 December 2023 (UTC)
I see "kepler-62" (dash) is now "kepler_62" (underscore). It might be be possible to convert ?p1=55+Cnc&p2=b
to 55_cnc_b
and then loading that page
https://exoplanet.eu/catalog/55_cnc_b/ and extracting the new URL from the HTML. As you suggest, I'll take a look at the linksearch and see how homogeneous. I'll get to this not immediately. --
Green
C 04:35, 15 December 2023 (UTC)
User:LaundryPizza03: Seeing a lot of links
like this. I added an archive URL because the source link is dead. I'd prefer to convert them to the new /catalog url scheme, but there is no way to link to a star, only planets,
like this. Am I missing something? What do you recommend for URLs with star.php?st=
--
Green
C 19:02, 21 December 2023 (UTC)
star_name="HD 5319"
then click "Apply filter" it brings up a list of planets. However, there is no way to link to this search result. Only a person manually entering the star name can find it, there is no API or mechanism for automated use. --
Green
C 19:21, 21 December 2023 (UTC)
{{
Cite EPE}}
. Over time, individual pages at the site will stop working, and the standard link rot tools won't detect or fix them, when the links are abstracted behind a custom external link template. I suppose it's possible the template could be useful if the entire site changes structure, but most likely the data in the template won't be sufficient to accommodate the new URL scheme. Thus at best the template makes adding a link a little quicker, and more uniform looking, but at the cost of increased link rot and challenges down the road when the URL scheme changes. I've always thought standard cite templates are the best way to go because there are so many tools that support them. --
Green
C 02:51, 23 December 2023 (UTC)This forum is getting a lot of requests recently. The requests can take a lot of work, 1-7 days each depending on the complexity: custom programming, data discovery, running tests cases, qualifying results, designing algorithms, waiting for the bot to run (slow due to networking), etc... Furthermore, my time to do this work is limited! If you make a request, and time goes by, that is why. I wish there was a way to boilerplate it, and I have generalized the code as much as possible, but ultimately this work is bespoke and artistic in nature due to the endless variety of conditions at remote sites. I try to respond to requests in chronological order, except when a site needs be triaged due to imminent outage, has an extremely large footprint, or can be addressed quickly, in those cases I might respond before some others. -- Green C 20:10, 20 December 2023 (UTC)
The
WP:JUDI folks have gotten to it. I'll add the archive URLs at
Draft:Chris Byars once I get off my school laptop (which blocks IA). Cheers,
Mach61 (
talk) 22:14, 23 December 2023 (UTC)
The sub-site "inventors.█████.com" ("about" censored because of wiki filter) now appears to be " thoughtco.com", with references/external links either linking to the same article on the new site, or simply don't work. Apparently there are 150+ articles using the inventors URL (1), & what looks like 500+ external link search results (2), although a significant portion are on talk pages. Silverleaf81 ( talk) 09:28, 17 December 2023 (UTC)
User:Silverleaf81: This is done. It got most of them. It added 341 archive.today URLs. A list of about 50 questionables is at Wikipedia:Link_rot/cases/inventors.about.com but not all of them are legitimately a problem. -- Green C 02:24, 26 December 2023 (UTC)
According to this archived link, the IPA fonts were transferred from IPA to the Character Information Technology Promotion Council, who now host the fonts on their website. Citation 14 should link to https://moji.or.jp/mojikiban/font/ and Citations 13 and 22 (which is a dead link) should be https://moji.or.jp/ipafont/.
(Apologies if this is the wrong place for this. I'm new to editing and I didn't want to mess up the citation.) Ichneumonidae ( talk) 18:25, 26 December 2023 (UTC)
My website runeberg.org just recently moved from http: to https: so it would be nice if someone could update the remaining 11,000 links accordingly. This is not urgent, as everything works fine with automatic redirects, but it would be nice. Thank you. -- LA2 ( talk) 22:57, 17 December 2023 (UTC)
{{
dead link}}
tag. The rest are converted to https. There was some typos and non-working links to Google Translate I manually fixed.
List of http runeberg.org links --
Green
C 20:31, 26 December 2023 (UTC)I found many broken links to Yahoo! Groups. Can we find archived copies of these pages? Jarble ( talk) 18:19, 18 December 2023 (UTC)
Jarble: The bot added 1,474 new archive URLs. I limited it to only adding archive.today because it has the best coverage for this site, Wayback had trouble making good saves due to logins and cookies. There were 115 it couldn't find and added a {{
dead link}}
. Also added the archives to IABot's database so these updates will propagate to over 300 other wikis. --
Green
C 04:48, 28 December 2023 (UTC)
The website www.spacelaunchreport.com was cited extensively in many spaceflight articles and now has been usurped by an adware site of some sort. Could all of these links please be archived? Example link http://www.spacelaunchreport.com/falcon9ft.html#f9stglog from List of Falcon 9 first-stage boosters. Ergzay ( talk) 10:02, 27 December 2023 (UTC)
Many links from http://www.atsdr.cdc.gov have been migrated to https://atsdr.cdc.gov or https://wwwn.cdc.gov, which has broken a lot of links. Some automated attempts to archive the pages have resulted in archives of 404 errors at this page. I noticed this on Health effects of radon, and unfortunately the IDs on a lot of these pages ("ToxFAQs") have no relation to the new, identical pages on the HTTPS websites. Additionally, some articles like Peninsula Extension refer to Public Health Assessments, which need to be found in an archived page since the files have been deleted and are only available by email request. Reconrabbit ( talk| edits) 18:38, 19 December 2023 (UTC)
User:Reconrabbit: I can see why this has gone unaddressed for so long it's complicated. I can't promise everything is perfect but most everything that is dead now has an archive URL. They use JavaScript redirects which gave bots trouble, thus the bad archive URLs. I checked the existing archive URLs for soft-404s, this is imperfect, but it did find and replace a few: Special:Diff/1190591816/1192546009 I fixed a few of the ToxFAQ links by manually looking them up: Special:Diff/1189670705/1192547048 But most were simply archived: Special:Diff/1121144402/1192546200 If you want to create a map of old -> new the bot can use that to make changes on-wiki.
The http links existed in about 350 articles. The bot edited 211 pages. I think the difference is the links were already archived, or working such as the PDFs. It added 141 new archive URLs. And it made 127 redirect moves: Special:Diff/1154065478/1192545155 Hope that helps. -- Green C 00:05, 30 December 2023 (UTC)