Operator: Gaelan (talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 11:25, Wednesday, February 20, 2019 (UTC)
Automatic, Supervised, or Manual: automatic
Programming language(s): Python
Source code available: here (currently implements heuristics to fix URL; code to actually edit the fixed URL into the article pending BRFA)
Function overview: Fix Category:Pages with URL errors in obvious cases
Links to relevant discussions (where appropriate):
Edit period(s): One big run, then regularly to fix new cases
Estimated number of pages affected: ~500
Exclusion compliant (Yes/No): Yes
Already has a bot flag (Yes/No): No
Function details: This bot attempts to fix common errors that cause articles to be placed into Category:Pages with URL errors. Specifically, it tries to:
In either case, it checks that the resulting URL is accessible and returns a 200 status code before making changes. In a test against the first 200 articles in the category, the bot was able to fix 9% of them.
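The "accessible and returns a 200 status code" check described above could be sketched roughly as follows. This is a minimal illustration, not the bot's actual code: the function names `url_returns_200` and `accept_fix`, the User-Agent string, and the 10-second timeout are all assumptions made for the example. Note that `urllib.request.urlopen` follows redirects, so a URL that redirects to a page returning 200 would also pass here; whether the real bot treats that as acceptable is not stated.

```python
import urllib.error
import urllib.request


def url_returns_200(url, opener=urllib.request.urlopen, timeout=10):
    """Return True only if fetching `url` ends in an HTTP 200 response.

    `opener` is injectable so the check can be exercised without network access.
    """
    try:
        # Hypothetical User-Agent; a real bot should identify itself properly.
        request = urllib.request.Request(
            url, headers={"User-Agent": "url-fix-sketch/0.1"}
        )
        with opener(request, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, ValueError, OSError):
        # HTTP errors, DNS failures, timeouts, and malformed URLs all count
        # as "not accessible", so the candidate fix is rejected.
        return False


def accept_fix(original, candidate, check=url_returns_200):
    """Return the candidate URL only if it differs and passes the check."""
    if candidate != original and check(candidate):
        return candidate
    return original
```

With this shape, an edit would only be made when `accept_fix(original, candidate)` returns something other than `original`; any network failure leaves the article untouched.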
'<>[]
-
?
. There are also a few edits where the bot correctly fixes the URL but adds a space at the end for no reason; I'll look into that. Everything else seemed fine to me.
Gaelan 💬 ✏️ 04:23, 23 February 2019 (UTC)