![]() | This page is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Hello, NicDumZ/Archive 1, and welcome to Wikipedia. Thank you for your contributions. I hope you like it here and decide to stay. If you are looking for help, please do any of the following:
{{helpme}}
on your user page, and someone will answer your questions shortlyThere are a lot of standards and policies here, but as long as you are editing in good faith, you are encouraged to be bold in updating pages. Here are a few links you might find useful:
I hope you enjoy editing here and being a Wikipedian! Please sign your name on talk and vote pages using four tildes (~~~~), which produces your name and the current date. Also, it would be a huge help if you could explain each of your edits with an edit summary. Again, welcome!-- NAH ID 17:49, 25 July 2007 (UTC)
User:NicDumZ/Bir Hakeim moved. Anthony Appleyard 16:41, 27 July 2007 (UTC)
Connell66 has smiled at you! Smiles promote
WikiLove and hopefully this one has made your day better. Spread the WikiLove by smiling at someone else, whether it be someone you have had disagreements with in the past or a good friend. Happy editing!
Smile at others by adding {{
subst:Smile}} to their talk page with a friendly message.
You just sent me a message. Yes, I will proofread it. I can't translate, though. Laleena talk to me contributions to Wikipedia 12:18, 28 July 2007 (UTC)
Hi NicDumZ. I have encountered similar problems, and I don't have a simple answer. Disruptive editors will often tie up talk-page debates with red herrings, strawman arguments and question-begging, so it's best to address your arguments to an ideal intelligent editor, rather than a specific disruptive one. Also consider filing an RfC. In the meantime I will take a look at the specific issue you're referring to.-- G-Dett 16:30, 28 July 2007 (UTC)
Thanks for your note. Discussion is good, and I think we've managed to have a reasonable one, even though disagreeing on many points. Jayjg (talk) 21:18, 30 July 2007 (UTC)
Ah, I didn't even see you'd changed it. I just tested it and assumed I'd mixed them up. Thanks for letting me know. Mackan79 15:02, 3 August 2007 (UTC)
I didn't mean to imply, in my comments on WP:AN/I, that you were in any way "the bad guy". I'm sorry if it came across that way. MastCell Talk 20:52, 6 August 2007 (UTC)
Excuse me, I read your message on my discussion page and I wanted to know about something. What kind of articles can I create? I really want to create an article, but I can't think of one. Can you help me, please? KiaraFan13 18:17, 7 August 2007 (UTC)
Yes, you can look at my contributions, but I'll try to wait for a few months until I can create an article. I was going to create an article about a fictional character in my series of plays called The 23 Kids, but the idea was off because she is modeled after Hannah Montana and she wouldn't want to be featured on television. KiaraFan13 19:20, 7 August 2007 (UTC)
Hi NicDumZ,
Good work on translating the Battle of Bir Hakeim article, and don't worry about the mistakes, translation is always a bit tricky.
If you want to improve it further, I have only one thing to say to you: inline citations! Your article won't get past start class if all the important facts aren't appropriately cited. To know which points need a cite, and how to do it see WP:MILHIST#CITE. In short:
Ideally, as this is English wikipedia, you should use sources in English, but if none are available, you'll have to use some in French.
Voila, j'espère que ça t'a aidé, et si t'as besoin d'autres conseils, tu peux aussi me demander en Français.
A +
Raoulduke47
19:25, 7 August 2007 (UTC)
Hello,
An Arbitration case involving you has been opened: Wikipedia:Requests for arbitration/Allegations of apartheid. Please add any evidence you may wish the Arbitrators to consider to the evidence sub-page, Wikipedia:Requests for arbitration/Allegations of apartheid/Evidence. You may also contribute to the case on the workshop sub-page, Wikipedia:Requests for arbitration/Allegations of apartheid/Workshop.
On behalf of the Arbitration Committee, Newyorkbrad 18:04, 12 August 2007 (UTC)
The Arbitration Committee has adopted a motion in the above arbitration case, providing: "As the Committee has been unable to determine which actions in this matter, if any, were undertaken in bad faith, and as the community appears to be satisfactorily dealing with the underlying content dispute, the case is dismissed with no further action being taken." This notice is given by a Clerk on behalf of the Arbitration Committee. Newyorkbrad 19:13, 26 October 2007 (UTC)
Sure, I'm almost done with the new version that handles prods, after that I'll post it. BJ Talk 14:54, 28 December 2007 (UTC)
Just to be sure there is no foul play, can you please edit the user page of your bot's account with your "real" account? Thanks a lot! -- lucasbfr ho ho ho 16:09, 29 December 2007 (UTC)
While I appreciate your suggestion, it really doesn't make any sense to me. Why should there be inclusion guidelines for French communes? Shouldn't we have articles on all towns? The Rambot created all the articles on US towns back in 2001, and the Eubot created articles on all Italian communes in 2006. I also just noticed that all Swiss and almost all German and Austrian municipalities have articles. So do other English-speaking countries like UK and Australia. To be frank, Nick, I think your idea is rubbish. Why should we deny "any country town" in France an article when on our homefront we have one for every? Editorofthewiki ( talk) 19:05, 6 January 2008 (UTC)
Hi NicDumZ I wanted to let you know that Wikipedia:Bots/Requests for approval/DumZiBoT has been approved. Please visit the above link for more information. Thanks! BAGBot ( talk) 03:30, 3 February 2008 (UTC)
Glad to see a bot doing this.
Does it also convert inline exlinks to refs?
I made such a change here. (It's the first change, the second just needed a different name.) -- Jtir ( talk) 14:16, 3 February 2008 (UTC)
This edit possibly missed the exlinks in a named ref (<ref name="RFC3092">). The named ref had the bare exlink repeated in three places. I made the corrections in these two edits [1] [2] (it took two edits because I didn't realize the exlink had been repeated in three places). And, yes, the page can be accessed. :-) -- Jtir ( talk) 16:45, 3 February 2008 (UTC)
It looks good to me, so long as the tiles are accurate. On the other hand, I can see concerns about bots such as those raised above. In any case, so long as you're amenable to receiving feedback whem and if problems arise, I think both you and the bot will be happy together. Clever, by the way. :) As a used to be programmer I wouldn't mind seeing the code. Cheers. •Jim62sch• dissera! 21:08, 3 February 2008 (UTC)
What a great idea for a bot. This is something that I manually do all of the time. Your bot is very helpful, and I cannot believe that no one had thought of it sooner. Kudos! нмŵוτн τ 18:01, 3 February 2008 (UTC)
Ditto. I think DumZiBoT is doing fine. Decriptive text as a link label sure beats just a number ([2]) in the references section. - Fnlayson ( talk) 20:01, 3 February 2008 (UTC)
...but I hope that, when it has caught up with all the untitled references, I might find some reason to look at my watchlist again! TINY MARK 23:08, 3 February 2008 (UTC)
Hi. Your bot is really useful but it needs some tuning i think. Can you please exclude JSOTR links? Check here. For non-registered users JSTOR gives the message: "JSTOR: Accessing JSTOR" and doesn't show the real html. -- Magioladitis ( talk) 01:56, 4 February 2008 (UTC)
Thank you. -- Duncan ( talk) 09:46, 4 February 2008 (UTC)
Keep it up. -- Arcadian ( talk) 13:09, 4 February 2008 (UTC)
Your bot edited two pages and cleaned up the reference sections a job that I really don't like doing. Thank you, your bot is very useful. EconomistBR ( talk) 15:40, 3 February 2008 (UTC)
Amen, amen. Replacing cryptic ref URLs with the corresponding <title> element via a bot is a fantastic idea!! Kudos. — ¾-10 01:53, 5 February 2008 (UTC)
You need to turn off this bot, especially on science articles. You are making it difficult if not impossible to watch science articles for trolls, vandals and POV-warriors, because all I see on my watchlist is your useless bot. You are making Wikipedia worse off, not better, because once the POV warriors know how your bot works, they'll just put in links without titles, and your bot will format it, making yours the last change in history. This will take more work using Twinkle or other vandal fighting tools. Either turn the thing off, or I will ask for administrative assistance. OrangeMarlin Talk• Contributions 17:50, 3 February 2008 (UTC)
Hi, can your linkbot be set loose on Andrew Sullivan? Benji boi 08:14, 4 February 2008 (UTC)
Here is another problem: "[http://www.medscape.com/viewarticle/554347?sssdmh=dm1.259053&src=ddd Log In Problems<!-- Bot generated title -->]" http://en.wikipedia.org/?title=Pergolide&curid=622942&diff=188999410&oldid=150088654 Maybe you should add a bad word list that contains error message words... Сасусlе 12:42, 4 February 2008 (UTC)
Do you have a link to a list of exlinks that are excluded? (a blacklist?) I am thinking of adding a third reason to the section in User:DumZiBoT/refLinks that lists reasons an exlink might not be changed. -- Jtir ( talk) 12:51, 4 February 2008 (UTC)
Your bot fixed a bare reference in Landing craft, but it included the gratuitous word "dumb." Is that your idea of humor, or a flaw in your bot, or what? I have removed the word "dumb." Lou Sander ( talk) 15:34, 4 February 2008 (UTC)
Hi, I noticed the your bot introduced a hidden comment into East Mountain that looks like spam: " TopoZone - The Web's Topographic Map, and more!" Can you explain this?-- Pgagnon999 ( talk) 18:24, 4 February 2008 (UTC)
Hmmm....interesting. Not sure how I feel about the opportunity for a free hidden advertising plug for companies with clever URL titles. . .or (in this case anyway) if the bot introduced anything of value that wasn't already inherent in the URL sytax itself, but it is what it is. . .and, at the end of the day, not a super big deal.-- Pgagnon999 ( talk) 23:49, 4 February 2008 (UTC)
Another issue with the bot: When a URL redirects, the bot is following it to its new destination and blithely listing the title of the new URL. Where I observed this: In List of unaccredited institutions of higher learning, http://www.asiaweek.com/asiaweek/features/universities2000/artic_online.html redirected to the current issue of TIME Magazine, so the bot left a link title of "TIME Magazine - Asia Edition - February 11, 2008 Vol. 171, No. 5". That misdirection was fairly innocuous (although the current issue of the magazine would be useless as a source, at least it's clean), and I've fixed that particular misdirection with a link to the archive.org version of the original AsiaWeek article, but I think that as a general policy the bot process should be generating a list of domains that redirect, rather than generating new titles. -- Orlady ( talk) 15:02, 5 February 2008 (UTC)
While you are about it, NicDumZ, perhaps you can turn your attention to the article malleus. The info box ref to the image of the gestation stage indicated needs fixing as it directs you straight to the UNC University Wiki article, unless you as the potential reader know what you are doing. Not many of our readers might know that though. Unless he (the reader) knows to home in on the template used he is going to be nonplussed. Many thanks, and congratulations on your work. Do you actually look out for unsourced articles, too? Dieter Simon ( talk) 01:41, 5 February 2008 (UTC) Dieter Simon ( talk) 01:43, 5 February 2008 (UTC)
Hey, I just saw the edits made by your bot at Krav Maga -- great bot, in both concept and performance! Kudos! JDoorjam JDiscourse 19:32, 5 February 2008 (UTC)
Absolutely brilliant - congratulations from me too --
Matilda
talk
22:41, 5 February 2008 (UTC)
I just figured out what your bot does, after quite a bit of confusion. As soon as I figured it out, I was quite impressed. Thanks for making such a useful addition to the Wikimunity. Darkage7 ( talk) 07:20, 6 February 2008 (UTC)
![]() |
What a Brilliant Idea Barnstar | |
You are awarded this barnstar for programming DumZiBoT to expand bare references. Thanks for helping make Wikipedia a well-referenced resource. Flibirigit ( talk) 07:34, 6 February 2008 (UTC) |
I've seen probably fifty pages on my Watched list get (slightly) improved by this both in the past two days - keep up the work, it's a great idea. Sherurcij ( Speaker for the Dead) 08:52, 6 February 2008 (UTC)
Nice BOT - can you change it to use a basic citation template though ? eg
<ref>
{{Citation
| title =
| url =
}}
</ref>
Cheers -- John ( Daytona2 · Talk · Contribs) 23:02, 5 February 2008 (UTC)
Your bot is doing great work! Thank you so much! Aleta (Sing) 14:23, 6 February 2008 (UTC)
Here is a bug for you to fix. The text is supposed to be in Russian, but it is gibberish due to incorrect encoding.— Ëzhiki (Igels Hérissonovich Ïzhakoff-Amursky) • ( yo?); 16:51, 4 February 2008 (UTC)
Server: nginx/0.6.25 Date: Mon, 04 Feb 2008 16:56:46 GMT |
Perhaps you could set up a requests page where people could post articles they would like the bot to fix the references. I was trying to get your bot to have a go at February 2008 tornado outbreak, but there doesn't seem to be a way to add requests. Great job on the bot btw! Cheers, JACO PLANE • 2008-02-6 17:26
Each reference that is modified gets "<!-- Bot generated title -->" added to it, it's about 29 bytes per each reference modified. It may not seem like much, but that can add up quick. Shouldn't that kind of comment just be put into the edit summary? Gh5046 ( talk) 20:35, 6 February 2008 (UTC)
Oh, and I forgot to say, thanks for creating this bot. It's very helpful. Gh5046 ( talk) 20:36, 6 February 2008 (UTC)
http://en.wikipedia.org/?title=Pointy_hat&diff=189487992&oldid=182653326
Look at the diff line around "Gomer". Is "Untitled Document" more useful than the bare URL? -- Damian Yerrick ( talk | stalk) 21:09, 6 February 2008 (UTC)
In Meishi, DumZiBoT did not convert three bare exlinks in one of the references.
<ref>See, e.g., http://www.adobe.com/jp/special/creativesuite/portal/guides/cs2_01_52.html, http://www.washiya.com/shop/namecard/index.html, http://www.kenseido.co.jp/shop/kps/namecard.html</ref>
-- Jtir ( talk) 21:13, 6 February 2008 (UTC)
Regulation of acupuncture, as well as acupuncture, could use his talents... again, thank you, very nice work! best regards, Jim Butler ( t) 05:46, 7 February 2008 (UTC)
Hi—First, kudos on a most excellent bot. I was reading your discussion with Dispenser about filling in more of the parameters of template:cite web, and I have a suggestion. The basic idea is to slog through a dump examining occurrences of template:cite web, and correlating the values for the url= and work= parameters. For instance, if 99% of the time, url values with a prefix of http://nytimes.com/ co-occur with work=The New York Times, then you can reliably add the latter to the references you generate for similar urls. You can build up a dictionary of these relationships in a first pass of the bot (or with a separate script). Make sense? — johndburger 01:41, 7 February 2008 (UTC)
Would it be possible for the bot to convert bare refs to refs using {{ cite web}} instead? Instead of a lead "[" it would add "<ref>{{cite web | url =" then before the new title it would add "|title =" and after the title instead of "]" it would add "|accessdate=2008-02-08}}</ref>"? It would also have to add reflist at the bottom if it were not already there. Just an idea. Thanks Ruhrfisch ><>°° 17:09, 8 February 2008 (UTC)
<html xmlns="
http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
—
Dispenser
17:39, 8 February 2008 (UTC)
(query result[s]?|query)$
. Possibly check for duplicate titles. —
Dispenser
17:39, 8 February 2008 (UTC)Your bot puts refs before periods, they should be after periods. — Rlevse • Talk • 12:14, 9 February 2008 (UTC)
Very nice at acupuncture. A much-needed service. rock on, Jim Butler ( t) 10:15, 3 February 2008 (UTC)
hi, is it possible to add this info also like: "Retrieved on 2008-02-09." --— Typ932 T | C 15:16, 9 February 2008 (UTC)
Your description of the robot uses he references—the masculine pronoun. I understand English isn't your first language and respectfully suggest you refer to it as either she or it. Most properly, use of it is appropriate since robots are usually considered gender neutral. However, considering the fuss above, she wouldn't be out of line. Ships and boats are normally referred to as she, perhaps for their unpredictable temperaments. Your use of he is amusing, and presents absolutely no difficulty in understanding or communication. The choice is yours of course. — EncMstr 23:29, 8 February 2008 (UTC)
You can use the gallery tag to organize them. It's probably easier than making a table. :) vıd ıoman 22:31, 10 February 2008 (UTC)
What is this [9]? Please fix the handling of non latin scripts before running the bot again. -- jergen ( talk) 08:27, 7 February 2008 (UTC)
![]() |
What a Brilliant Idea Barnstar | |
I've got nothing but praise for this bot idea. – sgeureka t•c 15:27, 8 February 2008 (UTC) |
And I have another improvement suggestion, but I know another bot already cares about this, so don't feel yourself pushed to do this. I became aware of your bot with this edit, and {{ reflist}} or <references /> was missing on the page to display the <ref> at all. Would it be to much coding effort to also have the bot check this? – sgeureka t•c 15:27, 8 February 2008 (UTC)
Your bot made rather a mess of the Meishi article by converting the anchor text for one of the references into unintelligible garbage. Does this thing understand Shift JIS? -- Sakurambo 桜ん坊 14:28, 6 February 2008 (UTC)
python.codecs
module to raise an error (character #19563 of the html source, but since the codecs parser failed, I don't think that this number is reliable). I can't do anything to solve these kind of problems, that's really not my fault, sorry.
NicDumZ
~
16:34, 6 February 2008 (UTC)Well, feel free to try by yourself, instead of assuming that I'm deliberately using another encoding :
import urllib2
url = u'http://www.youmeishi.com/contents/product/paper.html'
handler = urllib2.urlopen(url)
source = handler.read()
to_uni = Unicode(source, "Shift JIS") #will raise UnicodeDecodeError (illegal mutibyte sequence)
There must be some problem in the encoding of the HTML source. What you have to understand is that my script tries first to convert to the encoding specified in the "meta" markup of the page. When no UnicodeDecodeError is raised, it assumes that it works, and uses that encoding. But when an error is raised, it goes on an try other encodings. When a "fine" codec is found, i.e. a codec that does not raise an error during the conversion, I use it. But there's no way for an automated script to determine whether a character sequence makes sense or not... (Also, some pages actually say they use one encoding in their meta tags, while they're not; And a lot of pages are not sending any encoding : that's why I try other encodings)
NicDumZ ~ 09:29, 7 February 2008 (UTC)
<meta http-equiv="Content-Type" content="text/html; charset=Shift_JIS">
Okay. Stop this. Read again : my point was not about that particular page, but about others : If I stop at the first UnicodeDecodeError that I get, that means that I will not be able to detect any encoding for pages not specifying their encodings. And I was saying that pages not specifying their encoding are way more common than pages specifying an encoding, and badly encoded, hence I made the implementation choice to try to detect an encoding, since the false positives are very rare (Over 25,000 contributions, I've been reported less than 10 errors : You could say that some errors are remaining undetected, but still, even considering, exaggeratedly, that 500 errors are remaining, that would make a 0,02% error rate. Come on, give me some space.)
NicDumZ ~ 12:27, 7 February 2008 (UTC)
unicode()
is basically mapping this byte sequence to the corresponding unicode codepoint. The only reason it fails is that somewhere, ther is a byte sequence that IS NOT shift-JIS compliant, remember the error message : UnicodeDecodeError: 'shift_jis' codec can't decode bytes in position 19563-19564: illegal multibyte sequence. ILLEGAL MULTIBYTE SEQUENCE. Get it ?shift jis
did not contain that character. shift jis 2004
does, so I now try shift jis, then shift jis 2004, then cp932.
NicDumZ
~
09:13, 8 February 2008 (UTC)Chiming in, the bot messed up the title for this link in Vii as well. Jappalang ( talk) 01:35, 14 February 2008 (UTC)
Thanks for your recent work on chess articles. The bot is doing a fine job! Voorlandt ( talk) 08:48, 13 February 2008 (UTC)
Would it be hard to make a bot that consolidated references with <cite name=X>? Just making a suggestion! -- Adoniscik ( talk) 06:43, 7 February 2008 (UTC)
I just removed the following bot-generated title from an article: C:\Documents and Settings\wabalber\Local Settings\Temporary Internet Files\OLK1D0\rptApprovedSchoolsWeb.snp . . . (That apparently is the automatically generated "title" of a PDF file on a US government website.) Can the bot be trained to ignore "titles" that are file names? -- Orlady ( talk) 14:43, 13 February 2008 (UTC)
[A-Z]\:([/\|/][\w| ]+)+(.[\w+| ])?
but what I came here to say: Excellent bot!
Martijn Hoekstra (
talk)
19:04, 13 February 2008 (UTC)
^\w{3,}://\w+
and ^[A-Za-z]:\\\w+
). —
Dispenser
20:56, 13 February 2008 (UTC)Why would you let a bot loose to make substantial edits on wikipedia unless it is has been extraordinarily well tested?
The bot made two substantial, and completely incorrect, edits to the ITA_Software entry, substituting the name of some other organization for the company's actual name, and adding a sentence about that other organization's members.
That's pretty destructive, and given the probability of such kinds of mistakes (1, by my calculation) by any robot with such grand plans, seems kind of obvious such robots should not be running around wikipedia. —Preceding unsigned comment added by 67.165.122.220 ( talk) 06:32, 14 February 2008 (UTC)
Thanks for enhancing the raw references I put in most of my edits ! :-) Reminds me of a project I use to propose on my user page :
What do you think about it ? Nicolas1981 ( talk) 10:08, 14 February 2008 (UTC)
Excellent work with this bot. You have greatly contributed toward a better wikipedia. Fredsmith2 ( talk) 02:22, 15 February 2008 (UTC)
Take this diff: [13] Most of the "titles" it generates really aren't helpful: http://myweb.tiscali.co.uk/celynog/Brittany/kermario.htm Kermario<!-- Bot generated title -->] - it's more helpful to the reader to see the bare URL. In this case, it hints that the site is a personal website for some amateur based in Britain, and even without going there, suggests that it's not a strong reference. What does captioning it as "Kermario" add? How does this benefit the reader? Stevage 04:38, 10 February 2008 (UTC)
![]() | This page is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Hello, NicDumZ/Archive 1, and welcome to Wikipedia. Thank you for your contributions. I hope you like it here and decide to stay. If you are looking for help, please do any of the following:
{{helpme}}
on your user page, and someone will answer your questions shortlyThere are a lot of standards and policies here, but as long as you are editing in good faith, you are encouraged to be bold in updating pages. Here are a few links you might find useful:
I hope you enjoy editing here and being a Wikipedian! Please sign your name on talk and vote pages using four tildes (~~~~), which produces your name and the current date. Also, it would be a huge help if you could explain each of your edits with an edit summary. Again, welcome!-- NAH ID 17:49, 25 July 2007 (UTC)
User:NicDumZ/Bir Hakeim moved. Anthony Appleyard 16:41, 27 July 2007 (UTC)
Connell66 has smiled at you! Smiles promote
WikiLove and hopefully this one has made your day better. Spread the WikiLove by smiling at someone else, whether it be someone you have had disagreements with in the past or a good friend. Happy editing!
Smile at others by adding {{
subst:Smile}} to their talk page with a friendly message.
You just sent me a message. Yes, I will proofread it. I can't translate, though. Laleena talk to me contributions to Wikipedia 12:18, 28 July 2007 (UTC)
Hi NicDumZ. I have encountered similar problems, and I don't have a simple answer. Disruptive editors will often tie up talk-page debates with red herrings, strawman arguments and question-begging, so it's best to address your arguments to an ideal intelligent editor, rather than a specific disruptive one. Also consider filing an RfC. In the meantime I will take a look at the specific issue you're referring to.-- G-Dett 16:30, 28 July 2007 (UTC)
Thanks for your note. Discussion is good, and I think we've managed to have a reasonable one, even though disagreeing on many points. Jayjg (talk) 21:18, 30 July 2007 (UTC)
Ah, I didn't even see you'd changed it. I just tested it and assumed I'd mixed them up. Thanks for letting me know. Mackan79 15:02, 3 August 2007 (UTC)
I didn't mean to imply, in my comments on WP:AN/I, that you were in any way "the bad guy". I'm sorry if it came across that way. MastCell Talk 20:52, 6 August 2007 (UTC)
Excuse me, I read your message on my discussion page and I wanted to know about something. What kind of articles can I create? I really want to create an article, but I can't think of one. Can you help me, please? KiaraFan13 18:17, 7 August 2007 (UTC)
Yes, you can look at my contributions, but I'll try to wait for a few months until I can create an article. I was going to create an article about a fictional character in my series of plays called The 23 Kids, but the idea was off because she is modeled after Hannah Montana and she wouldn't want to be featured on television. KiaraFan13 19:20, 7 August 2007 (UTC)
Hi NicDumZ,
Good work on translating the Battle of Bir Hakeim article, and don't worry about the mistakes, translation is always a bit tricky.
If you want to improve it further, I have only one thing to say to you: inline citations! Your article won't get past start class if all the important facts aren't appropriately cited. To know which points need a cite, and how to do it see WP:MILHIST#CITE. In short:
Ideally, as this is English wikipedia, you should use sources in English, but if none are available, you'll have to use some in French.
Voila, j'espère que ça t'a aidé, et si t'as besoin d'autres conseils, tu peux aussi me demander en Français.
A +
Raoulduke47
19:25, 7 August 2007 (UTC)
Hello,
An Arbitration case involving you has been opened: Wikipedia:Requests for arbitration/Allegations of apartheid. Please add any evidence you may wish the Arbitrators to consider to the evidence sub-page, Wikipedia:Requests for arbitration/Allegations of apartheid/Evidence. You may also contribute to the case on the workshop sub-page, Wikipedia:Requests for arbitration/Allegations of apartheid/Workshop.
On behalf of the Arbitration Committee, Newyorkbrad 18:04, 12 August 2007 (UTC)
The Arbitration Committee has adopted a motion in the above arbitration case, providing: "As the Committee has been unable to determine which actions in this matter, if any, were undertaken in bad faith, and as the community appears to be satisfactorily dealing with the underlying content dispute, the case is dismissed with no further action being taken." This notice is given by a Clerk on behalf of the Arbitration Committee. Newyorkbrad 19:13, 26 October 2007 (UTC)
Sure, I'm almost done with the new version that handles prods, after that I'll post it. BJ Talk 14:54, 28 December 2007 (UTC)
Just to be sure there is no foul play, can you please edit the user page of your bot's account with your "real" account? Thanks a lot! -- lucasbfr ho ho ho 16:09, 29 December 2007 (UTC)
While I appreciate your suggestion, it really doesn't make any sense to me. Why should there be inclusion guidelines for French communes? Shouldn't we have articles on all towns? The Rambot created all the articles on US towns back in 2001, and the Eubot created articles on all Italian communes in 2006. I also just noticed that all Swiss and almost all German and Austrian municipalities have articles. So do other English-speaking countries like UK and Australia. To be frank, Nick, I think your idea is rubbish. Why should we deny "any country town" in France an article when on our homefront we have one for every? Editorofthewiki ( talk) 19:05, 6 January 2008 (UTC)
Hi NicDumZ I wanted to let you know that Wikipedia:Bots/Requests for approval/DumZiBoT has been approved. Please visit the above link for more information. Thanks! BAGBot ( talk) 03:30, 3 February 2008 (UTC)
Glad to see a bot doing this.
Does it also convert inline exlinks to refs?
I made such a change here. (It's the first change, the second just needed a different name.) -- Jtir ( talk) 14:16, 3 February 2008 (UTC)
This edit possibly missed the exlinks in a named ref (<ref name="RFC3092">). The named ref had the bare exlink repeated in three places. I made the corrections in these two edits [1] [2] (it took two edits because I didn't realize the exlink had been repeated in three places). And, yes, the page can be accessed. :-) -- Jtir ( talk) 16:45, 3 February 2008 (UTC)
It looks good to me, so long as the tiles are accurate. On the other hand, I can see concerns about bots such as those raised above. In any case, so long as you're amenable to receiving feedback whem and if problems arise, I think both you and the bot will be happy together. Clever, by the way. :) As a used to be programmer I wouldn't mind seeing the code. Cheers. •Jim62sch• dissera! 21:08, 3 February 2008 (UTC)
What a great idea for a bot. This is something that I manually do all of the time. Your bot is very helpful, and I cannot believe that no one had thought of it sooner. Kudos! нмŵוτн τ 18:01, 3 February 2008 (UTC)
Ditto. I think DumZiBoT is doing fine. Decriptive text as a link label sure beats just a number ([2]) in the references section. - Fnlayson ( talk) 20:01, 3 February 2008 (UTC)
...but I hope that, when it has caught up with all the untitled references, I might find some reason to look at my watchlist again! TINY MARK 23:08, 3 February 2008 (UTC)
Hi. Your bot is really useful but it needs some tuning i think. Can you please exclude JSOTR links? Check here. For non-registered users JSTOR gives the message: "JSTOR: Accessing JSTOR" and doesn't show the real html. -- Magioladitis ( talk) 01:56, 4 February 2008 (UTC)
Thank you. -- Duncan ( talk) 09:46, 4 February 2008 (UTC)
Keep it up. -- Arcadian ( talk) 13:09, 4 February 2008 (UTC)
Your bot edited two pages and cleaned up the reference sections a job that I really don't like doing. Thank you, your bot is very useful. EconomistBR ( talk) 15:40, 3 February 2008 (UTC)
Amen, amen. Replacing cryptic ref URLs with the corresponding <title> element via a bot is a fantastic idea!! Kudos. — ¾-10 01:53, 5 February 2008 (UTC)
You need to turn off this bot, especially on science articles. You are making it difficult if not impossible to watch science articles for trolls, vandals and POV-warriors, because all I see on my watchlist is your useless bot. You are making Wikipedia worse off, not better, because once the POV warriors know how your bot works, they'll just put in links without titles, and your bot will format it, making yours the last change in history. This will take more work using Twinkle or other vandal fighting tools. Either turn the thing off, or I will ask for administrative assistance. OrangeMarlin Talk• Contributions 17:50, 3 February 2008 (UTC)
Hi, can your linkbot be set loose on Andrew Sullivan? Benji boi 08:14, 4 February 2008 (UTC)
Here is another problem: "[http://www.medscape.com/viewarticle/554347?sssdmh=dm1.259053&src=ddd Log In Problems<!-- Bot generated title -->]" http://en.wikipedia.org/?title=Pergolide&curid=622942&diff=188999410&oldid=150088654 Maybe you should add a bad word list that contains error message words... Сасусlе 12:42, 4 February 2008 (UTC)
Do you have a link to a list of exlinks that are excluded? (a blacklist?) I am thinking of adding a third reason to the section in User:DumZiBoT/refLinks that lists reasons an exlink might not be changed. -- Jtir ( talk) 12:51, 4 February 2008 (UTC)
Your bot fixed a bare reference in Landing craft, but it included the gratuitous word "dumb." Is that your idea of humor, or a flaw in your bot, or what? I have removed the word "dumb." Lou Sander ( talk) 15:34, 4 February 2008 (UTC)
Hi, I noticed the your bot introduced a hidden comment into East Mountain that looks like spam: " TopoZone - The Web's Topographic Map, and more!" Can you explain this?-- Pgagnon999 ( talk) 18:24, 4 February 2008 (UTC)
Hmmm....interesting. Not sure how I feel about the opportunity for a free hidden advertising plug for companies with clever URL titles. . .or (in this case anyway) if the bot introduced anything of value that wasn't already inherent in the URL sytax itself, but it is what it is. . .and, at the end of the day, not a super big deal.-- Pgagnon999 ( talk) 23:49, 4 February 2008 (UTC)
Another issue with the bot: When a URL redirects, the bot is following it to its new destination and blithely listing the title of the new URL. Where I observed this: In List of unaccredited institutions of higher learning, http://www.asiaweek.com/asiaweek/features/universities2000/artic_online.html redirected to the current issue of TIME Magazine, so the bot left a link title of "TIME Magazine - Asia Edition - February 11, 2008 Vol. 171, No. 5". That misdirection was fairly innocuous (although the current issue of the magazine would be useless as a source, at least it's clean), and I've fixed that particular misdirection with a link to the archive.org version of the original AsiaWeek article, but I think that as a general policy the bot process should be generating a list of domains that redirect, rather than generating new titles. -- Orlady ( talk) 15:02, 5 February 2008 (UTC)
While you are about it, NicDumZ, perhaps you can turn your attention to the article malleus. The info box ref to the image of the gestation stage indicated needs fixing as it directs you straight to the UNC University Wiki article, unless you as the potential reader know what you are doing. Not many of our readers might know that though. Unless he (the reader) knows to home in on the template used he is going to be nonplussed. Many thanks, and congratulations on your work. Do you actually look out for unsourced articles, too? Dieter Simon ( talk) 01:41, 5 February 2008 (UTC) Dieter Simon ( talk) 01:43, 5 February 2008 (UTC)
Hey, I just saw the edits made by your bot at Krav Maga -- great bot, in both concept and performance! Kudos! JDoorjam JDiscourse 19:32, 5 February 2008 (UTC)
Absolutely brilliant - congratulations from me too --
Matilda
talk
22:41, 5 February 2008 (UTC)
I just figured out what your bot does, after quite a bit of confusion. As soon as I figured it out, I was quite impressed. Thanks for making such a useful addition to the Wikimunity. Darkage7 ( talk) 07:20, 6 February 2008 (UTC)
![]() |
What a Brilliant Idea Barnstar | |
You are awarded this barnstar for programming DumZiBoT to expand bare references. Thanks for helping make Wikipedia a well-referenced resource. Flibirigit ( talk) 07:34, 6 February 2008 (UTC) |
I've seen probably fifty pages on my Watched list get (slightly) improved by this both in the past two days - keep up the work, it's a great idea. Sherurcij ( Speaker for the Dead) 08:52, 6 February 2008 (UTC)
Nice BOT - can you change it to use a basic citation template though ? eg
<ref>
{{Citation
| title =
| url =
}}
</ref>
Cheers -- John ( Daytona2 · Talk · Contribs) 23:02, 5 February 2008 (UTC)
Your bot is doing great work! Thank you so much! Aleta (Sing) 14:23, 6 February 2008 (UTC)
Here is a bug for you to fix. The text is supposed to be in Russian, but it is gibberish due to incorrect encoding.— Ëzhiki (Igels Hérissonovich Ïzhakoff-Amursky) • ( yo?); 16:51, 4 February 2008 (UTC)
Server: nginx/0.6.25 Date: Mon, 04 Feb 2008 16:56:46 GMT |
Perhaps you could set up a requests page where people could post articles they would like the bot to fix the references. I was trying to get your bot to have a go at February 2008 tornado outbreak, but there doesn't seem to be a way to add requests. Great job on the bot btw! Cheers, JACO PLANE • 2008-02-6 17:26
Each reference that is modified gets "<!-- Bot generated title -->" added to it, it's about 29 bytes per each reference modified. It may not seem like much, but that can add up quick. Shouldn't that kind of comment just be put into the edit summary? Gh5046 ( talk) 20:35, 6 February 2008 (UTC)
Oh, and I forgot to say, thanks for creating this bot. It's very helpful. Gh5046 ( talk) 20:36, 6 February 2008 (UTC)
http://en.wikipedia.org/?title=Pointy_hat&diff=189487992&oldid=182653326
Look at the diff line around "Gomer". Is "Untitled Document" more useful than the bare URL? -- Damian Yerrick ( talk | stalk) 21:09, 6 February 2008 (UTC)
In Meishi, DumZiBoT did not convert three bare exlinks in one of the references.
<ref>See, e.g., http://www.adobe.com/jp/special/creativesuite/portal/guides/cs2_01_52.html, http://www.washiya.com/shop/namecard/index.html, http://www.kenseido.co.jp/shop/kps/namecard.html</ref>
-- Jtir ( talk) 21:13, 6 February 2008 (UTC)
Regulation of acupuncture, as well as acupuncture, could use his talents... again, thank you, very nice work! best regards, Jim Butler ( t) 05:46, 7 February 2008 (UTC)
Hi—First, kudos on a most excellent bot. I was reading your discussion with Dispenser about filling in more of the parameters of template:cite web, and I have a suggestion. The basic idea is to slog through a dump examining occurrences of template:cite web, and correlating the values for the url= and work= parameters. For instance, if 99% of the time, url values with a prefix of http://nytimes.com/ co-occur with work=The New York Times, then you can reliably add the latter to the references you generate for similar urls. You can build up a dictionary of these relationships in a first pass of the bot (or with a separate script). Make sense? — johndburger 01:41, 7 February 2008 (UTC)
Would it be possible for the bot to convert bare refs to refs using {{ cite web}} instead? Instead of a lead "[" it would add "<ref>{{cite web | url =" then before the new title it would add "|title =" and after the title instead of "]" it would add "|accessdate=2008-02-08}}</ref>"? It would also have to add reflist at the bottom if it were not already there. Just an idea. Thanks Ruhrfisch ><>°° 17:09, 8 February 2008 (UTC)
<html xmlns="
http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
—
Dispenser
17:39, 8 February 2008 (UTC)
(query result[s]?|query)$
. Possibly check for duplicate titles. —
Dispenser
17:39, 8 February 2008 (UTC)Your bot puts refs before periods, they should be after periods. — Rlevse • Talk • 12:14, 9 February 2008 (UTC)
Very nice at acupuncture. A much-needed service. rock on, Jim Butler ( t) 10:15, 3 February 2008 (UTC)
hi, is it possible to add this info also like: "Retrieved on 2008-02-09." --— Typ932 T | C 15:16, 9 February 2008 (UTC)
Your description of the robot uses he references—the masculine pronoun. I understand English isn't your first language and respectfully suggest you refer to it as either she or it. Most properly, use of it is appropriate since robots are usually considered gender neutral. However, considering the fuss above, she wouldn't be out of line. Ships and boats are normally referred to as she, perhaps for their unpredictable temperaments. Your use of he is amusing, and presents absolutely no difficulty in understanding or communication. The choice is yours of course. — EncMstr 23:29, 8 February 2008 (UTC)
You can use the gallery tag to organize them. It's probably easier than making a table. :) vıd ıoman 22:31, 10 February 2008 (UTC)
What is this [9]? Please fix the handling of non latin scripts before running the bot again. -- jergen ( talk) 08:27, 7 February 2008 (UTC)
![]() |
What a Brilliant Idea Barnstar | |
I've got nothing but praise for this bot idea. – sgeureka t•c 15:27, 8 February 2008 (UTC) |
And I have another improvement suggestion, but I know another bot already cares about this, so don't feel yourself pushed to do this. I became aware of your bot with this edit, and {{ reflist}} or <references /> was missing on the page to display the <ref> at all. Would it be to much coding effort to also have the bot check this? – sgeureka t•c 15:27, 8 February 2008 (UTC)
Your bot made rather a mess of the Meishi article by converting the anchor text for one of the references into unintelligible garbage. Does this thing understand Shift JIS? -- Sakurambo 桜ん坊 14:28, 6 February 2008 (UTC)
python.codecs
module to raise an error (character #19563 of the html source, but since the codecs parser failed, I don't think that this number is reliable). I can't do anything to solve these kind of problems, that's really not my fault, sorry.
NicDumZ
~
16:34, 6 February 2008 (UTC)Well, feel free to try by yourself, instead of assuming that I'm deliberately using another encoding :
import urllib2
url = u'http://www.youmeishi.com/contents/product/paper.html'
handler = urllib2.urlopen(url)
source = handler.read()
to_uni = Unicode(source, "Shift JIS") #will raise UnicodeDecodeError (illegal mutibyte sequence)
There must be some problem in the encoding of the HTML source. What you have to understand is that my script tries first to convert to the encoding specified in the "meta" markup of the page. When no UnicodeDecodeError is raised, it assumes that it works, and uses that encoding. But when an error is raised, it goes on an try other encodings. When a "fine" codec is found, i.e. a codec that does not raise an error during the conversion, I use it. But there's no way for an automated script to determine whether a character sequence makes sense or not... (Also, some pages actually say they use one encoding in their meta tags, while they're not; And a lot of pages are not sending any encoding : that's why I try other encodings)
NicDumZ ~ 09:29, 7 February 2008 (UTC)
<meta http-equiv="Content-Type" content="text/html; charset=Shift_JIS">
Okay. Stop this. Read again : my point was not about that particular page, but about others : If I stop at the first UnicodeDecodeError that I get, that means that I will not be able to detect any encoding for pages not specifying their encodings. And I was saying that pages not specifying their encoding are way more common than pages specifying an encoding, and badly encoded, hence I made the implementation choice to try to detect an encoding, since the false positives are very rare (Over 25,000 contributions, I've been reported less than 10 errors : You could say that some errors are remaining undetected, but still, even considering, exaggeratedly, that 500 errors are remaining, that would make a 0,02% error rate. Come on, give me some space.)
NicDumZ ~ 12:27, 7 February 2008 (UTC)
unicode()
is basically mapping this byte sequence to the corresponding unicode codepoint. The only reason it fails is that somewhere, ther is a byte sequence that IS NOT shift-JIS compliant, remember the error message : UnicodeDecodeError: 'shift_jis' codec can't decode bytes in position 19563-19564: illegal multibyte sequence. ILLEGAL MULTIBYTE SEQUENCE. Get it ?shift jis
did not contain that character. shift jis 2004
does, so I now try shift jis, then shift jis 2004, then cp932.
NicDumZ
~
09:13, 8 February 2008 (UTC)Chiming in, the bot messed up the title for this link in Vii as well. Jappalang ( talk) 01:35, 14 February 2008 (UTC)
Thanks for your recent work on chess articles. The bot is doing a fine job! Voorlandt ( talk) 08:48, 13 February 2008 (UTC)
Would it be hard to make a bot that consolidated references with <cite name=X>? Just making a suggestion! -- Adoniscik ( talk) 06:43, 7 February 2008 (UTC)
I just removed the following bot-generated title from an article: C:\Documents and Settings\wabalber\Local Settings\Temporary Internet Files\OLK1D0\rptApprovedSchoolsWeb.snp . . . (That apparently is the automatically generated "title" of a PDF file on a US government website.) Can the bot be trained to ignore "titles" that are file names? -- Orlady ( talk) 14:43, 13 February 2008 (UTC)
[A-Z]\:([/\|/][\w| ]+)+(.[\w+| ])?
but what I came here to say: Excellent bot!
Martijn Hoekstra (
talk)
19:04, 13 February 2008 (UTC)
^\w{3,}://\w+
and ^[A-Za-z]:\\\w+
). —
Dispenser
20:56, 13 February 2008 (UTC)Why would you let a bot loose to make substantial edits on wikipedia unless it is has been extraordinarily well tested?
The bot made two substantial, and completely incorrect, edits to the ITA_Software entry, substituting the name of some other organization for the company's actual name, and adding a sentence about that other organization's members.
That's pretty destructive, and given the probability of such kinds of mistakes (1, by my calculation) by any robot with such grand plans, seems kind of obvious such robots should not be running around wikipedia. —Preceding unsigned comment added by 67.165.122.220 ( talk) 06:32, 14 February 2008 (UTC)
Thanks for enhancing the raw references I put in most of my edits ! :-) Reminds me of a project I use to propose on my user page :
What do you think about it ? Nicolas1981 ( talk) 10:08, 14 February 2008 (UTC)
Excellent work with this bot. You have greatly contributed toward a better wikipedia. Fredsmith2 ( talk) 02:22, 15 February 2008 (UTC)
Take this diff: [13] Most of the "titles" it generates really aren't helpful: http://myweb.tiscali.co.uk/celynog/Brittany/kermario.htm Kermario<!-- Bot generated title -->] - it's more helpful to the reader to see the bare URL. In this case, it hints that the site is a personal website for some amateur based in Britain, and even without going there, suggests that it's not a strong reference. What does captioning it as "Kermario" add? How does this benefit the reader? Stevage 04:38, 10 February 2008 (UTC)