Operator: Hazard-SJ ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 01:55, Tuesday May 29, 2012 ( UTC)
Automatic, Supervised, or Manual:
Programming language(s): Python
Source code available: Pywikipedia
Function overview: Fixing ISBNs
Links to relevant discussions (where appropriate): Wikipedia:BON#Rich_Farmbrough.27s_bots
Edit period(s): Occasional
Estimated number of pages affected: N/A
Exclusion compliant (Yes/No): Yes
Already has a bot flag (Yes/No): Yes
Function details: This bot will go through a list of pages which have possible ISBN errors, then attempts to fix them. This "list" will be made by my scanning a database dump. I have already attempted using the script to ensure it works, twice in the sandbox ( here and here), and three in mainspace ( here, here, and here). Yes, it may need a few tweaks to become more efficient, but as Helpful Pixie Bot can do this no more, I'd like my bot to try to help out. Hazard-SJ ✈ 01:55, 29 May 2012 (UTC) reply
There were a very large number of complaints when Helpful Pixie Bot resumed doing this shortly before the Arb case caused it to be blocked. How will you avoid a similar situation? Also, what specifically are the errors the bot will fix? Anomie ⚔ 02:45, 29 May 2012 (UTC) reply
Do you intend to treat lack of dashes as an ISBN error, as WP:COSMETICBOT, or as a matter of editor style choice and hence ignore? Personally, I would prefer the bot only to action this if the article was already internally inconsistent. Spinning Spark 09:08, 29 May 2012 (UTC) reply
(r'ISBN: (\d+)', r'ISBN \1')
(r'ISBN( number| no\.?| No\.?|-Nummer|-Nr\.):? (\d+)', r'ISBN \2')
)(ur'ISBN (978|979) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d)(?!\d)', r'ISBN \1-\2-\3-\4-\5')
for ISBN-13(ur'ISBN (\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d|X|x)(?!\d)', r'ISBN \1-\2-\3-\4')
for ISBN-10(r'ISBN(| | )((\d(-?)){12}\d|(\d(-?)){9}[\dXx])', r'ISBN \2')
){{
BAGAssistanceNeeded}}
Could I take a trial please?
Hazard-SJ
✈ 21:03, 2 June 2012 (UTC)
reply
) or
ISBN
978-1-4000-4360-6 (literal U+00A0 character). To prevent a linebreak, you'd have to use a piped link
Special:BookSources explicitly (or use a template that does this, if there is one), or use something like {{
nowrap}}. Speaking of which, what will this bot do if it sees the ISBN in the text portion of a wikilink, e.g. [[Special:BookSources/9781400043606|ISBN 978-1-4000-4360-6]]
?
Anomie
⚔ 10:49, 6 June 2012 (UTC)
reply
The bot has edited WP space, which seems inadvisable. The value of editing WP space is less, and the risk of "correcting" an intentional error in an instructional page is greater. Jc3s5h ( talk) 03:18, 12 June 2012 (UTC) reply
&thisdoesnotexistbuttest=ISBN 978-87-988154-5-7
to the URL, but it wasn't changed). I'll be addressing the other issues soon.
Hazard-SJ
✈ 02:48, 13 June 2012 (UTC)
reply(ur'ISBN (978|979) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d)(?!\d)', r'ISBN \1-\2-\3-\4-\5'), # ISBN-13 (ur'ISBN (\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d|X|x)(?!\d)', r'ISBN \1-\2-\3-\4'), # ISBN-10
(ur'ISBN (978|979)[\-−\.‐-―](\d+)[\-−\.‐-―](\d+)[\-−\.‐-―](\d+)[\-−\.‐-―](\d)(?!\d)', r'ISBN \1-\2-\3-\4-\5'), # ISBN-13 (ur'ISBN (\d+)[\-−\.‐-―](\d+)[\-−\.‐-―](\d+)[\-−\.‐-―](\d|X|x)(?!\d)', r'ISBN \1-\2-\3-\4'), # ISBN-10 (ur'ISBN (978|979) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d)(?!\d)', r'ISBN \1 \2 \3 \4 \5'), # ISBN-13 (ur'ISBN (\d+) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d|X|x)(?!\d)', r'ISBN \1 \2 \3 \4'), # ISBN-10
{{
BAGAssistanceNeeded}}
Is there anything I left out? If not, can I go for another trial please?
Hazard-SJ
✈ 03:38, 13 June 2012 (UTC)
reply
-namespace:0
, and noticed near the end of the trial, and since to bot took nearly forever to start (reading an entire enwiki dump first), I didn't stop it to fix that (not to say that it won't be done in the future!).
Hazard-SJ
✈ 23:07, 20 June 2012 (UTC)
reply
Operator: Hazard-SJ ( talk · contribs · SUL · edit count · logs · page moves · block log · rights log · ANI search)
Time filed: 01:55, Tuesday May 29, 2012 ( UTC)
Automatic, Supervised, or Manual:
Programming language(s): Python
Source code available: Pywikipedia
Function overview: Fixing ISBNs
Links to relevant discussions (where appropriate): Wikipedia:BON#Rich_Farmbrough.27s_bots
Edit period(s): Occasional
Estimated number of pages affected: N/A
Exclusion compliant (Yes/No): Yes
Already has a bot flag (Yes/No): Yes
Function details: This bot will go through a list of pages which have possible ISBN errors, then attempts to fix them. This "list" will be made by my scanning a database dump. I have already attempted using the script to ensure it works, twice in the sandbox ( here and here), and three in mainspace ( here, here, and here). Yes, it may need a few tweaks to become more efficient, but as Helpful Pixie Bot can do this no more, I'd like my bot to try to help out. Hazard-SJ ✈ 01:55, 29 May 2012 (UTC) reply
There were a very large number of complaints when Helpful Pixie Bot resumed doing this shortly before the Arb case caused it to be blocked. How will you avoid a similar situation? Also, what specifically are the errors the bot will fix? Anomie ⚔ 02:45, 29 May 2012 (UTC) reply
Do you intend to treat lack of dashes as an ISBN error, as WP:COSMETICBOT, or as a matter of editor style choice and hence ignore? Personally, I would prefer the bot only to action this if the article was already internally inconsistent. Spinning Spark 09:08, 29 May 2012 (UTC) reply
(r'ISBN: (\d+)', r'ISBN \1')
(r'ISBN( number| no\.?| No\.?|-Nummer|-Nr\.):? (\d+)', r'ISBN \2')
)(ur'ISBN (978|979) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d)(?!\d)', r'ISBN \1-\2-\3-\4-\5')
for ISBN-13(ur'ISBN (\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d|X|x)(?!\d)', r'ISBN \1-\2-\3-\4')
for ISBN-10(r'ISBN(| | )((\d(-?)){12}\d|(\d(-?)){9}[\dXx])', r'ISBN \2')
){{
BAGAssistanceNeeded}}
Could I take a trial please?
Hazard-SJ
✈ 21:03, 2 June 2012 (UTC)
reply
) or
ISBN
978-1-4000-4360-6 (literal U+00A0 character). To prevent a linebreak, you'd have to use a piped link
Special:BookSources explicitly (or use a template that does this, if there is one), or use something like {{
nowrap}}. Speaking of which, what will this bot do if it sees the ISBN in the text portion of a wikilink, e.g. [[Special:BookSources/9781400043606|ISBN 978-1-4000-4360-6]]
?
Anomie
⚔ 10:49, 6 June 2012 (UTC)
reply
The bot has edited WP space, which seems inadvisable. The value of editing WP space is less, and the risk of "correcting" an intentional error in an instructional page is greater. Jc3s5h ( talk) 03:18, 12 June 2012 (UTC) reply
&thisdoesnotexistbuttest=ISBN 978-87-988154-5-7
to the URL, but it wasn't changed). I'll be addressing the other issues soon.
Hazard-SJ
✈ 02:48, 13 June 2012 (UTC)
reply(ur'ISBN (978|979) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d)(?!\d)', r'ISBN \1-\2-\3-\4-\5'), # ISBN-13 (ur'ISBN (\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d+) *[\- −\.‐-―] *(\d|X|x)(?!\d)', r'ISBN \1-\2-\3-\4'), # ISBN-10
(ur'ISBN (978|979)[\-−\.‐-―](\d+)[\-−\.‐-―](\d+)[\-−\.‐-―](\d+)[\-−\.‐-―](\d)(?!\d)', r'ISBN \1-\2-\3-\4-\5'), # ISBN-13 (ur'ISBN (\d+)[\-−\.‐-―](\d+)[\-−\.‐-―](\d+)[\-−\.‐-―](\d|X|x)(?!\d)', r'ISBN \1-\2-\3-\4'), # ISBN-10 (ur'ISBN (978|979) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d)(?!\d)', r'ISBN \1 \2 \3 \4 \5'), # ISBN-13 (ur'ISBN (\d+) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d+) +[\- −\.‐-―] +(\d|X|x)(?!\d)', r'ISBN \1 \2 \3 \4'), # ISBN-10
{{
BAGAssistanceNeeded}}
Is there anything I left out? If not, can I go for another trial please?
Hazard-SJ
✈ 03:38, 13 June 2012 (UTC)
reply
-namespace:0
, and noticed near the end of the trial, and since to bot took nearly forever to start (reading an entire enwiki dump first), I didn't stop it to fix that (not to say that it won't be done in the future!).
Hazard-SJ
✈ 23:07, 20 June 2012 (UTC)
reply