![]() | This page is an archive. Do not edit the contents of this page. Please direct any additional comments to the current main page. |
Processed about 10k links almost all of which were either dead, soft-404 or better-off-dead ie. the new page works but is otherwise behind a paywall and replacement with an archive-url is an improvement. This site uses extensive bot detection. For a workaround see the awk scripts "t" and "t2" in the meta directory businessweek.00000-10000 -- Green C 02:49, 3 April 2023 (UTC)
Back in November 2020 we discussed here what to do with broken links to the discontinued EBSCOhost Connection service. I just discovered that EBSCO has a new service called EBSCO Essentials that seems, at first glance, to provide access to the same information as the discontinued service.
Previous archived discussion of this topic:
But this is not immediately actionable as far as I know, because I haven't discovered a way to map the old URLs to the new ones. For example, taking the first item in Now Up-to-Date § Further reading, the old dead URL is http://connection.ebscohost.com/c/articles/9412062384 and I found the same item (note the same accession number in the URL) at https://essentials.ebsco.com/search/eds/details/now-s-new-calendar?an=9412062384 but I haven't found a way to get a URL with only the accession number and not the item title. This could merit further investigation. Notifying GreenC and Samwalton9 (WMF), who participated in previous conversations, in case they are interested. Biogeographist ( talk) 13:51, 5 April 2023 (UTC)
{{dead link}}
that could be moved, it's within the realm of doing it manually, should anyway want to take it on. There are more than 1,100 with an archive URL. These numbers are what my bot did in 2020, there are more that pre-existed to the bot run not sure how many. --
Green
C
14:12, 5 April 2023 (UTC)
Please add archive links to all links of the form "https://www.australianoftheyear.org.au/recipients/", except the root site, which is still a live link. The content is very stable but the URL's are not (I fixed many of the manually a year ago and they're already broken again ... I probably should've let the archive bot naturally do its thing, but I got some interesting article fixes out of the exercise). The new URLS cannot be automatically replaced (e.g. https://www.australianoftheyear.org.au/recipients/barbara-holborow/813/ is now https://cms.australianoftheyear.org.au/recipients/barbara-holborow-oam, depending on any honours the person received ... and I really don't feel like fixing these links manually every twelve months or so. Graham 87 03:55, 31 March 2023 (UTC)
I would like to have a bot that does the following: 1. " http://www." or "http://" turn into " https://www." or "https://" for all URLs in every pages. 2. "https://" is the most favourable URL outcome, second would be " https://www." or "http://", so that " http://www." at last. 161.81.115.230 ( talk) 11:41, 6 April 2023 (UTC)
Last year, Newspapers.com changed image viewers and for a time had URLs like https://www.newspapers.com/image/legacy/210351002/ (the end is a number of up to 9 digits). The /legacy must now be removed; these links do not resolve. They should now be, e.g., https://www.newspapers.com/image/210351002/. Note that while it's good practice to clip these articles for non-subscribers, this should at least be resolved. Sammi Brie (she/her • t • c) 23:47, 2 April 2023 (UTC)
This site is long dead, but many articles using it haven't tagged it as dead or in need of archive links. Kailash29792 (talk) 06:16, 5 April 2023 (UTC)
Reported. User:Billinghurst, I can usurped domains with WaybackMedic according to process at WP:USURPURL but only on Enwiki. If the domain is blacklisted at the wiki level I probably can't because the blacklist will prevent the bot from editing the page. In which case the blacklist will need to be lifted for the bot to run. -- Green C 22:07, 11 January 2023 (UTC)
web\.archive\.org/web/[0-9]+/https?://(?:[a-z0-9]+\.|)about\.com
Discussion: Wikipedia_talk:WikiProject_Thailand#Royal_Thai_Government_Gazette_(ราชกิจจานุเบกษา)_document_URL_scheme_change .. links exist in about 1,500 articles. Not all links are dead but the ones that are, are soft-404. -- Green C 20:00, 22 April 2023 (UTC)
Due to the inevitable link rot that will likely happen as a result of Buzzfeed News shutting down, I am requesting that the InternetArchiveBot archive that domain. If need be, you can see a slightly more detailed version of my request on WP:BOTREQ here. (Thanks @ Izno for pointing me in the right direction.) — That Coptic Guyping me! ( talk) ( contribs) 19:15, 20 April 2023 (UTC)
Not entirely sure this is the right place to report this (if somewhere else is more appropriate, let me know), but as reported here, haitimega.com — which used to contain information about Haiti and which eight articles therefore cite — seems to have rotted / been usurped by a different site. (So links need to be replaced with links to archived versions of the pages, when these exist.) -sche ( talk) 18:58, 28 April 2023 (UTC)
![]() | This page is an archive. Do not edit the contents of this page. Please direct any additional comments to the current main page. |
Processed about 10k links almost all of which were either dead, soft-404 or better-off-dead ie. the new page works but is otherwise behind a paywall and replacement with an archive-url is an improvement. This site uses extensive bot detection. For a workaround see the awk scripts "t" and "t2" in the meta directory businessweek.00000-10000 -- Green C 02:49, 3 April 2023 (UTC)
Back in November 2020 we discussed here what to do with broken links to the discontinued EBSCOhost Connection service. I just discovered that EBSCO has a new service called EBSCO Essentials that seems, at first glance, to provide access to the same information as the discontinued service.
Previous archived discussion of this topic:
But this is not immediately actionable as far as I know, because I haven't discovered a way to map the old URLs to the new ones. For example, taking the first item in Now Up-to-Date § Further reading, the old dead URL is http://connection.ebscohost.com/c/articles/9412062384 and I found the same item (note the same accession number in the URL) at https://essentials.ebsco.com/search/eds/details/now-s-new-calendar?an=9412062384 but I haven't found a way to get a URL with only the accession number and not the item title. This could merit further investigation. Notifying GreenC and Samwalton9 (WMF), who participated in previous conversations, in case they are interested. Biogeographist ( talk) 13:51, 5 April 2023 (UTC)
{{dead link}}
that could be moved, it's within the realm of doing it manually, should anyway want to take it on. There are more than 1,100 with an archive URL. These numbers are what my bot did in 2020, there are more that pre-existed to the bot run not sure how many. --
Green
C
14:12, 5 April 2023 (UTC)
Please add archive links to all links of the form "https://www.australianoftheyear.org.au/recipients/", except the root site, which is still a live link. The content is very stable but the URL's are not (I fixed many of the manually a year ago and they're already broken again ... I probably should've let the archive bot naturally do its thing, but I got some interesting article fixes out of the exercise). The new URLS cannot be automatically replaced (e.g. https://www.australianoftheyear.org.au/recipients/barbara-holborow/813/ is now https://cms.australianoftheyear.org.au/recipients/barbara-holborow-oam, depending on any honours the person received ... and I really don't feel like fixing these links manually every twelve months or so. Graham 87 03:55, 31 March 2023 (UTC)
I would like to have a bot that does the following: 1. " http://www." or "http://" turn into " https://www." or "https://" for all URLs in every pages. 2. "https://" is the most favourable URL outcome, second would be " https://www." or "http://", so that " http://www." at last. 161.81.115.230 ( talk) 11:41, 6 April 2023 (UTC)
Last year, Newspapers.com changed image viewers and for a time had URLs like https://www.newspapers.com/image/legacy/210351002/ (the end is a number of up to 9 digits). The /legacy must now be removed; these links do not resolve. They should now be, e.g., https://www.newspapers.com/image/210351002/. Note that while it's good practice to clip these articles for non-subscribers, this should at least be resolved. Sammi Brie (she/her • t • c) 23:47, 2 April 2023 (UTC)
This site is long dead, but many articles using it haven't tagged it as dead or in need of archive links. Kailash29792 (talk) 06:16, 5 April 2023 (UTC)
Reported. User:Billinghurst, I can usurped domains with WaybackMedic according to process at WP:USURPURL but only on Enwiki. If the domain is blacklisted at the wiki level I probably can't because the blacklist will prevent the bot from editing the page. In which case the blacklist will need to be lifted for the bot to run. -- Green C 22:07, 11 January 2023 (UTC)
web\.archive\.org/web/[0-9]+/https?://(?:[a-z0-9]+\.|)about\.com
Discussion: Wikipedia_talk:WikiProject_Thailand#Royal_Thai_Government_Gazette_(ราชกิจจานุเบกษา)_document_URL_scheme_change .. links exist in about 1,500 articles. Not all links are dead but the ones that are, are soft-404. -- Green C 20:00, 22 April 2023 (UTC)
Due to the inevitable link rot that will likely happen as a result of Buzzfeed News shutting down, I am requesting that the InternetArchiveBot archive that domain. If need be, you can see a slightly more detailed version of my request on WP:BOTREQ here. (Thanks @ Izno for pointing me in the right direction.) — That Coptic Guyping me! ( talk) ( contribs) 19:15, 20 April 2023 (UTC)
Not entirely sure this is the right place to report this (if somewhere else is more appropriate, let me know), but as reported here, haitimega.com — which used to contain information about Haiti and which eight articles therefore cite — seems to have rotted / been usurped by a different site. (So links need to be replaced with links to archived versions of the pages, when these exist.) -sche ( talk) 18:58, 28 April 2023 (UTC)