Operator: NicoV ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 17:25, Monday, February 25, 2019 ( UTC)
Function overview: To fix ISSN with an incorrect syntax. As described in ISSN#Code format, the correct syntax for an ISSN is "an eight digit code, divided by a hyphen into two four-digit numbers"
Automatic, Supervised, or Manual: Automatic
Programming language(s): Java ( Wikipedia:WPCleaner)
Source code available: On Github
Links to relevant discussions (where appropriate): Maintenance task for CW Error #106
Edit period(s): At most, twice a month, following the dump analysis that I already perform, see Wikipedia:Bots/Requests for approval/WikiCleanerBot.
Estimated number of pages affected: Around a thousand At most a few hundred pages for the first complete run (pages with such problems are listed in
Wikipedia:CHECKWIKI/WPC 106 dump, which currently contains a list of 1315 420 pages), and probably no more than a few dozen after that on each run given the evolution of the number of pages in the list.
Namespace(s): Main namespace
Exclusion compliant (Yes/No): No, because there's no reason to use an incorrect syntax for an ISSN instead of the correct one.
Function details: Based on the list generated on Wikipedia:CHECKWIKI/WPC 106 dump, the bot will only fix trivial problems (like a missing hyphen in the ISSN number, extra whitespace characters...) and will leave the more complex ones to be fixed by a human. It will reduced a lot the list, so human editors can fix the remaining problems.
For the bot flag, I currently don't have it, and I would like to keep it that way (or if need be, only added temporarily for the first run).
If you will be operating from the dump, could you not do a dry run outputting to Wikipedia:CHECKWIKI/WPC 106 dump so its handling of the pathological cases there can be inspected? -- Xover ( talk) 17:48, 25 February 2019 (UTC) reply
Comment: The dump list appears to have some false positives on it. I picked one page at random, Pocket Dwellers, and there is an ISSN of 00062510 listed within a citation template. This ISSN is valid within a CS1 template; articles with invalid ISSNs are placed in Category:CS1 errors: ISSN. The template handles this unhyphenated ISSN format with no trouble, displaying properly with a hyphen. It should not be "corrected"; the bot would be making a cosmetic edit, leaving the rendered page unchanged. Perhaps the dump analysis should be corrected before this bot attempts to modify articles based on the list. – Jonesey95 ( talk) 17:56, 25 February 2019 (UTC) reply
Page Wikipedia:CHECKWIKI/WPC 106 dump has been updated to avoid reporting missing dash when the template automatically adds it to the displayed result, there are only 420 pages remaining compared to the 1315 initially. I could probably also remove reports for internal links to pages like ISSN 1175-5326 which exist, but even if they are reported, the bot won't fix anything there. With the current algorithm, a dry run modifies 115 pages on the 420.
-- NicoV ( Talk on frwiki) 12:36, 26 February 2019 (UTC) reply
|issn=
was being used in a {{
WorldCat}} template, which doesn't support that parameter. Also, it looks like dashes, as in
Iran–Iraq War and
The Mauritius Command and
Resonant inductive coupling, are also silently converted to hyphens by CS1 templates, so those don't need to be fixed and should be removed from the WPCleaner report.
Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete.
Primefac (
talk)
20:32, 4 April 2019 (UTC)
reply
Trial complete.
Primefac I've done the 50 edits, they can be checked in
this list. I've seen no problem in the edits. --
NicoV (
Talk on frwiki)
14:03, 8 May 2019 (UTC)
reply
{{ BAG assistance needed}} Any decision ? -- NicoV ( Talk on frwiki) 17:15, 12 June 2019 (UTC) reply
Operator: NicoV ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 17:25, Monday, February 25, 2019 ( UTC)
Function overview: To fix ISSN with an incorrect syntax. As described in ISSN#Code format, the correct syntax for an ISSN is "an eight digit code, divided by a hyphen into two four-digit numbers"
Automatic, Supervised, or Manual: Automatic
Programming language(s): Java ( Wikipedia:WPCleaner)
Source code available: On Github
Links to relevant discussions (where appropriate): Maintenance task for CW Error #106
Edit period(s): At most, twice a month, following the dump analysis that I already perform, see Wikipedia:Bots/Requests for approval/WikiCleanerBot.
Estimated number of pages affected: Around a thousand At most a few hundred pages for the first complete run (pages with such problems are listed in
Wikipedia:CHECKWIKI/WPC 106 dump, which currently contains a list of 1315 420 pages), and probably no more than a few dozen after that on each run given the evolution of the number of pages in the list.
Namespace(s): Main namespace
Exclusion compliant (Yes/No): No, because there's no reason to use an incorrect syntax for an ISSN instead of the correct one.
Function details: Based on the list generated on Wikipedia:CHECKWIKI/WPC 106 dump, the bot will only fix trivial problems (like a missing hyphen in the ISSN number, extra whitespace characters...) and will leave the more complex ones to be fixed by a human. It will reduced a lot the list, so human editors can fix the remaining problems.
For the bot flag, I currently don't have it, and I would like to keep it that way (or if need be, only added temporarily for the first run).
If you will be operating from the dump, could you not do a dry run outputting to Wikipedia:CHECKWIKI/WPC 106 dump so its handling of the pathological cases there can be inspected? -- Xover ( talk) 17:48, 25 February 2019 (UTC) reply
Comment: The dump list appears to have some false positives on it. I picked one page at random, Pocket Dwellers, and there is an ISSN of 00062510 listed within a citation template. This ISSN is valid within a CS1 template; articles with invalid ISSNs are placed in Category:CS1 errors: ISSN. The template handles this unhyphenated ISSN format with no trouble, displaying properly with a hyphen. It should not be "corrected"; the bot would be making a cosmetic edit, leaving the rendered page unchanged. Perhaps the dump analysis should be corrected before this bot attempts to modify articles based on the list. – Jonesey95 ( talk) 17:56, 25 February 2019 (UTC) reply
Page Wikipedia:CHECKWIKI/WPC 106 dump has been updated to avoid reporting missing dash when the template automatically adds it to the displayed result, there are only 420 pages remaining compared to the 1315 initially. I could probably also remove reports for internal links to pages like ISSN 1175-5326 which exist, but even if they are reported, the bot won't fix anything there. With the current algorithm, a dry run modifies 115 pages on the 420.
-- NicoV ( Talk on frwiki) 12:36, 26 February 2019 (UTC) reply
|issn=
was being used in a {{
WorldCat}} template, which doesn't support that parameter. Also, it looks like dashes, as in
Iran–Iraq War and
The Mauritius Command and
Resonant inductive coupling, are also silently converted to hyphens by CS1 templates, so those don't need to be fixed and should be removed from the WPCleaner report.
Approved for trial (50 edits). Please provide a link to the relevant contributions and/or diffs when the trial is complete.
Primefac (
talk)
20:32, 4 April 2019 (UTC)
reply
Trial complete.
Primefac I've done the 50 edits, they can be checked in
this list. I've seen no problem in the edits. --
NicoV (
Talk on frwiki)
14:03, 8 May 2019 (UTC)
reply
{{ BAG assistance needed}} Any decision ? -- NicoV ( Talk on frwiki) 17:15, 12 June 2019 (UTC) reply