Operator: Green Cardamom (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 00:57, Saturday, February 18, 2017 (UTC)
Automatic, Supervised, or Manual: Automatic
Programming language(s): Nim + Awk
Source code available: Yes
Function overview: Convert machine-specific URLs at Internet Archive to generic work page URLs.
Links to relevant discussions (where appropriate): User_talk:Cyberpower678/Archive_45#IABot_dead_link_fix
Edit period(s): One time run initially
Estimated number of pages affected: 20,000
Exclusion compliant (Yes/No): Yes
Already has a bot flag (Yes/No): Yes
Function details:
The bot's purpose is most easily seen from an example diff: [1]
Each Internet Archive item has a main work page of this form: https://archive.org/details/manualofconcholo111tryo (the work page lists multiple files in PDF, EPUB, text and other formats, as seen in the index). It is also possible to link directly to a file, like this: http://ia700307.us.archive.org/4/items/manualofconcholo111tryo/manualofconcholo111tryo.pdf (this link contains the ID of a machine in the cluster, ia700307, so if the cluster changes, for example when a machine is replaced after a hardware failure, the link dies).
Initial tests showed that approximately 50% of all such "machine ID links" on enwiki have become dead links. The bot's purpose is to replace all such machine ID links with the work page, which redirects to whatever machine hosts the item in the IA cluster. The bot will also detect and remove any {{dead link}} tags.
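The URL conversion described above can be sketched roughly as follows. This is an illustration in Python, not the bot's actual code (which is written in Nim and Awk as a WaybackMedic module). The regular expression and the function name `to_work_page` are assumptions inferred from the single example URL given here; real machine ID links may take other forms that this pattern does not cover.

```python
import re

# Assumed shape of a "machine ID link", inferred from the one example above:
#   http://ia700307.us.archive.org/4/items/<identifier>/<file>
MACHINE_URL = re.compile(
    r"^https?://ia\d+\.us\.archive\.org/\d+/items/([^/\s]+)(?:/\S*)?$"
)


def to_work_page(url):
    """Return the generic work page URL for a machine ID link.

    URLs that do not match the assumed pattern are returned unchanged.
    """
    match = MACHINE_URL.match(url)
    if match is None:
        return url
    # The item identifier is the path segment that follows /items/.
    return "https://archive.org/details/" + match.group(1)


if __name__ == "__main__":
    print(to_work_page(
        "http://ia700307.us.archive.org/4/items/"
        "manualofconcholo111tryo/manualofconcholo111tryo.pdf"
    ))
```

Returning non-matching URLs untouched keeps the sketch conservative: anything that is not clearly a machine ID link is left alone.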
The code for the bot is complete, tested and ready to run. It is a module of WaybackMedic (WM), so any page this bot edits will also receive other WM fixes. It will initially target all articles containing machine ID links, as found by searching a recent database dump, then continue running incidentally as part of the WaybackMedic runs.
Approved for trial (10 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete. Just a short run to see some more examples. Otherwise, I don't see any issues. -- HELLKNOWZ ▎TALK 15:54, 18 February 2017 (UTC)
It did a couple beyond 10 during the last batch of WaybackMedic. Trial complete. -- Green C 20:07, 19 February 2017 (UTC)
There Is No Preview Available For This Item
This item does not appear to have any files that can be experienced on Archive.org.
Please download files in this item to interact with them on your computer.
AFTER. -- xaosflux Talk 02:06, 28 February 2017 (UTC)
Needs wider discussion. This seems to be an undesirable landing page for readers; a larger community discussion is warranted. Please post at appropriate venues, including Wikipedia:Village pump (proposals), and centralize a discussion. -- xaosflux Talk 02:09, 28 February 2017 (UTC)
{{BAGAssistanceNeeded}} -- the BRFA is now 2 months old and the discussions opened per the "wider discussion" request above are now archived. No one has commented other than Thincat. The change is in line with policy about using permalinks. I've continued dribbling in changes incidentally as part of the WaybackMedic work, probably a few hundred more. -- Green C 16:20, 18 April 2017 (UTC)
Approved. Useful extension to an existing task to protect links against inevitable linkrot. No pending concerns were brought up during the wider discussion (mainly SILENCE). Trusted botop. No further issues with trial edits. Due to the nature of the edits, various special cases and reliance on external tools, there may be occasional unforeseen errors. Obviously, this BRFA is approved on the general BOTPOL assumption that any issues are fixed and discussed, if needed. Approval also includes potential expansion to other unambiguously better archive URLs that may appear in future or that belong to other services. -- HELLKNOWZ ▎TALK 17:47, 18 April 2017 (UTC)