Done
Hi, I wish you luck with Labs...
See Wikipedia talk:WPCleaner#CheckWikipedia_does_not_work_on_sv.wikipedia.org.5B....5D, svwiki has no errors reported on Labs, while some are reported on toolserver. -- NicoV ( Talk on frwiki) 06:55, 6 November 2013 (UTC)
Done
Would it be possible to configure a list of abbreviations for which it would be normal to have the reference just after a punctuation ? For example, etc.<ref>REF</ref>
is OK because etc.
is an abbreviation. WPCleaner uses error_067_abbreviations_..
to configure this list. --
NicoV (
Talk on frwiki)
13:50, 13 November 2013 (UTC)
Done
Moin Moin Bgwhite, at the german Wikipedia reached me a question. I set a DEFAULTSORT (german: SORTIERUNG), but before there was a DEAFULTSORT directly at the categorie ( see this link). In the article is the template "Disambiguation" and set automatically the categorie "Begriffsklärung". Could you say me, if its right or wrong to do so? Thanks. -- Crazy1880 ( talk) 09:03, 29 November 2013 (UTC)
Done
It seems like the tool assumes that all projects capitalize the first letter. That's not true for Wiktionaries, so those links usually point to the wrong entry. 18:54, 22 December 2013 (UTC) — Preceding unsigned comment added by Skalman ( talk • contribs)
Done
Error #64 needs to be corrected too. E.g. [[a|A]] is being reported, even though [[A]] does not point to the same page in Wiktionaries. Skalman ( talk) 00:11, 7 January 2014 (UTC)
Done
Below I will list some false possitives for the #89-error that I've/will encounter/d (right now there is only one, but I will find more...)
{{DEFAULTSORT:UTC-08:30}}
( t) Josve05a ( c) 18:29, 23 December 2013 (UTC)
2,5-Dimethoxy-4-chloroamphetamine, 1,4,6-Androstatriene-3,17-dione, 2-Phenyl-3,6-dimethylmorpholine etc. is false possitives since it does have a comma, but is not suposed to have a space between. -( t) Josve05a ( c) 20:49, 24 December 2013 (UTC)
Resolved
The program WPCleaner detects <small>
-tags as a #42-error. I belewe (of what I can understand, that that error is only there for reporting strike-tags and not small-tags. It might be a bug in the program or in the CHECKWIKI-coding.
<small>(television)</small>
and <small>(singing)</small>
as #42-errors.<small>(eliminated 2-4)</small>
as a #42-error.<small>(with [[Ike Turner]])</small>
as a #42-error.<small>[[UK Singles Chart]]</small>
as a #42-error.<small>Annual sales estimates reflect free admission for Wayne, Oakland, and Macomb county residents for millage years. Expenditures rise about 1.9% annually for inflation. Investments yield about 3.8% annually.</small>
as a #42-error.( t) Josve05a ( c) 18:49, 23 December 2013 (UTC)
Done
2 observations for the wmflabs version:
-- Steenth ( talk) 14:57, 7 January 2014 (UTC)
Resolved
If we know any more errors that can be implemented, list the here.
[http://example.com/ Website where [[Anders Smith]] is a writer.]
-(
t)
Josve05a (
c)
15:49, 8 January 2014 (UTC) Resolved
Moin Moin
Bgwhite, since the update to wmf10 the daily scan for new "errors" ins't running. Can you check this, please. Thank you and regards --
Crazy1880 (
talk)
18:22, 17 January 2014 (UTC)
Resolved
We specifically use DEFAULTSORT with special characters in order to put pages in our preferred order.
To clarify: [1] and [2] don't make sense for us. Skalman ( talk) 23:28, 6 January 2014 (UTC)
Resolved
NicoV, TMg, Josve05a, Matěj Suchánek and Kwami
New Unicode control characters and the entire Private Use Areas (PUA) are now being checked for enwiki only.
I'm not a Unicode expert or do I understand some things. Magioladitis knows more about this. Should any of the new control characters be ported to other wikis? Bgwhite ( talk) 21:35, 17 January 2014 (UTC)
I've gone through the PUA to Cao Hong, maybe 30% of the total. This is quite manageable. There are very few that are intentional, and most of those deal specifically with assignments to the PUA (such as the Apple logo). Those can be substituted with &#x...; and tagged with {{ PUA}} for future maintenance. Some are stray characters which can just be deleted. PUA within text is almost always due to copying and pasting. Often the original can be found by doing a Gsearch of the surrounding text and corrected. In relatively few cases do we need to alert someone familiar with the article to fix. Of the articles I reviewed (up to Cao Hong in BG's sandbox list), I skipped emoji as too much work, and left notes on the talk pages of IBM 1620 and Sakya. Multiply that by 3 or 4 and we really don't have much work to do, and once we take care of the backlog, it should be easy to keep up with the dump. — kwami ( talk) 00:51, 18 January 2014 (UTC)
Can't find the PUA in Nay Toe.
The Inner Mongolian govt and publishers use PUA rather than Unicode for classical Mongolian script, so we may want to handle these separately. We'd want to embed a supporting font in WP at least. But Mongolian WP uses Cyrillic, so it shouldn't be a problem to scan WP-mn for PUA. — kwami ( talk) 02:28, 18 January 2014 (UTC)
Sakya's been fixed. Ask user:BabelStone to convert Tibetan PUA. — kwami ( talk) 06:32, 18 January 2014 (UTC)
Resolved
{{Liste|Reason. --[[User:Example]]}}
is allowed in an article in the German Wikipedia. Do you think it's possible to add an "allow user signatures in whitelistes templates" feature? I'm not sure if it's worth the trouble. Maybe it's easier to disable the error in dewiki.-- TMg 18:20, 27 January 2014 (UTC)
error_095_templates_frwiki=
Utilisateur
Utilisatrice
Discussion Utilisateur
Discussion Utilisatrice
Discussion utilisatrice END
Resolved
Regarding edits like
this - they are unnecessary. The <br />
tag is perfectly valid HTML 5, and indeed,
HTML Tidy converts all <br>
to <br />
when a Wikipedia page is served. --
Redrose64 (
talk)
21:44, 28 January 2014 (UTC)
<br />
but <br/ >
and I believe they are incorrect (not 100% sure that whitespace is accepted between "/" and ">". --
NicoV (
Talk on frwiki)
22:04, 28 January 2014 (UTC)
Resolved
Hi,
I'm just starting this thread to be sure I'm not missing anything I need to do in WPCleaner to be coherent with the recent changes in Check Wiki. Feel free to edit directly the list below. -- NicoV ( Talk on frwiki) 10:42, 31 January 2014 (UTC)
<a>
(previous error #519 renumbered)<strike>
(previous error #517 renumbered)http://
(old error removed, new error added)http://
(new error added) Resolved
Hi Bgwhite,
Errors #96 and #97 have a _templates_ parameter in Wikipedia:WikiProject Check Wikipedia/Translation. How the parameters are used? For example, ABP is detected by #96 because there's {{ toc right}} (lowercase) in it, but in the parameter, there's only "TOC[ ]+right" (uppercase). -- NicoV ( Talk on frwiki) 15:29, 12 February 2014 (UTC)
[ ]+
) necessary in the _templates_ parameter ? There are no regular expressions in templates list for other errors (#3, #28). --
NicoV (
Talk on frwiki)
13:37, 13 February 2014 (UTC)
[ ]+
and do a simple template name comparison in WPCleaner. --
NicoV (
Talk on frwiki)
12:53, 16 February 2014 (UTC)
[ ]+
. You shouldn't have to change when I can just added the templates twice. It should be changed on my end and not yours.
Bgwhite (
talk)
07:13, 17 February 2014 (UTC)
Resolved
Hi, bug opened about WMFLabs being completely out again. -- NicoV ( Talk on frwiki) 12:51, 16 February 2014 (UTC)
Resolved
Hi, it seems that #3 doesn't take into account the list of templates that can be used instead of <references>
. On frwiki, the full scan has just run, and we end up with 400k articles listed in #3. I checked the first one in the list
fr:!!! which hasn't been modified for months, and has {{
références}} at the end of the article --
NicoV (
Talk on frwiki)
17:00, 2 March 2014 (UTC)
Done
The checkup for <math>-tags should disregard programming-tags like <math.h> header library that are mentioned in several articles. -- StreifiGreif ( talk) 16:37, 3 March 2014 (UTC)
<math.c>
tags should be between these three tags. There might be some unintended consequences, so give a yell if you see problems.
Bgwhite (
talk)
20:21, 3 March 2014 (UTC) Resolved
Hi, on frwiki, #90 is detecting
fr:Diplomatie (jeu) because of [http://fr.wikipedia.org/wiki/Allan_B._Calhamer?redirect=no Allan B. Calhamer]
. Should it be detected? Is there a wiki syntax that can be used to convert this external link into an internal link? --
NicoV (
Talk on frwiki)
12:13, 27 February 2014 (UTC)
I did not come across any articles with redirect=no. -- Magioladitis ( talk) 08:32, 28 February 2014 (UTC)
The twice monthly dump files are not being processed at the moment. WMFLabs has a problem with mounting various directories, including where the dumps are located. Problems have been going on for a few days. A bug report has been filed, but no action or acknowledgement of the bug report has happened. So, unknown when this will be fixed. Bgwhite ( talk) 21:57, 21 January 2014 (UTC)
Greetings Wikipedia checkers! I have a question.
Over at
the village pump I'm talking to people about the feasibility of cleaning up all the copy-and-pasted comments in template documentation that derive from {{
Documentation/preload}}. My reasoning is that they cause clutter and represent a low-quality form of documentation that can't be updated easily. Some editors have suggested that they're necessary to prevent inexperienced template editors from including template categories directly in templates, when our standard procedure is to place them in <includeonly>
blocks on template documentation pages. I think that this is not enough of a problem to merit thousands of copies of the same string of text being pasted into templates. Fixing occurrences of it is a task completely suited to a bot such as the ones you operate. What would you say about the feasibility of adding that as a task? My thinking is that the logic would be something like:
That doesn't strike me as being particularly complex by the standards of your project. If you think that it is a reasonable goal, that would be just great. Ideally, I'd like to rewrite the template documentation documentation template (try saying that five times in a row) to better explain how template categories should work, and then commission a one-off bot run to clean out all the variants of the copy-and-pasted comments.
What do you think? Thanks, — Scott • talk 13:42, 23 January 2014 (UTC)
Would you be interested in participating in a user study? We are a team at University of Washington studying methods for finding collaborators within a Wikipedia community. We are looking for volunteers to evaluate a new visualization tool. All you need to do is to prepare for your laptop/desktop, web camera, and speaker for video communication with Google Hangout. We will provide you with a Amazon gift card in appreciation of your time and participation. For more information about this study, please visit our wiki page ( http://meta.wikimedia.org/wiki/Research:Finding_a_Collaborator). If you would like to participate in our user study, please send me a message at Wkmaster ( talk) 13:07, 18 February 2014 (UTC).
The powers that be are in the process of moving everything at WMFLabs to a new data center. Checkwiki's move barfed. Checkwiki will be down until things get fixed. Bgwhite ( talk) 09:32, 5 March 2014 (UTC)
Done
@ Salix alba:, @ NicoV:, @ Magioladitis:
Salix alba asked a
question about mismatched <sub>
and <sup>
tags. He was guessing there are ~4,000 articles with problems. After doing a scan, he is wrong. There are 7,096 articles from February's dump file. Examples are:
Looking at the source code of the rendered web pages, it appears the MediaWiki software does convert the mismatched tags to the correct value. However, there are around ~400 articles where there are broken or missing tags and this does cause rendering problems.
However, the majority of problems come at the end of a table cell where it doesn't do damage.
Should this be added to Checkwiki? AWB doesn't currently warn or fix the problem, not sure about WPCleaner. Should these be added to AWB and/or WPCleaner?
Bgwhite (
talk)
08:25, 27 February 2014 (UTC)
Bgwhite I could fix the <sup/> and <sub/> if someone give me the list. -- Magioladitis ( talk) 09:36, 27 February 2014 (UTC)
<sup id="foo">ref</sup>
or a style attribute, this breaks my simple test. There seem to be a couple of different errors e<sup>x</sub>
and e<sub>x</sup>
in all the cases I've looked at its the first tag which is correct, and could probably be auto corrected. There is also a bunch of cases where there in just one tag, say a single <sup>
or </sub>
alone. Sports articles seem to have a lot of these. It seems fine to just strip these tags completely. Line by line checks seem to be ok as I've never seen then span multiple lines.Bgwhite I fixed everything in the two given lists. -- Magioladitis ( talk) 22:01, 27 February 2014 (UTC)
A<sup>-1</sub> normal text.
into A<sup>-1 normal text.</sup>
, discarding the </sub>
and fixing things by adding a </sup>
at the end of the line. You can see the effect at
Divergent series in the Zeta function regularization section at the end.--
Salix alba (
talk):
23:16, 27 February 2014 (UTC)<sub>
as #98 and <sup>
as #99.
Magioladitis, can you do a bot run to fix the mismatched tags now or will it better to wait till a fix is put into AWB? I'll get you the lists if you can do it now.
Bgwhite (
talk)
00:10, 28 February 2014 (UTC)Bgwhite how is AWB supposed to fix this? In casse of mixed tags (for instance <sup>50</sub>) how do we know which is the correct one? -- Magioladitis ( talk) 06:56, 28 February 2014 (UTC)
Bgwhite rev 9957 added fix for bad sup/sub tags. -- Magioladitis ( talk) 06:57, 28 February 2014 (UTC)
<sup>([^<]*)</sub>
→ <sup>$1</sup>
and similar for <sub>
. So far its 174 edits without problems.--
Salix alba (
talk):
08:03, 1 March 2014 (UTC)
Bgwhite rev 9958 added fix for bad center tags. We already had fix for bad small tags. -- Magioladitis ( talk) 22:41, 28 February 2014 (UTC)
Bgwhite, Rjwilmsi alerts for unclosed <math>, <source>, <ref>, <code>, <nowiki>, <small>, <pre> or <gallery> tags and comments. Should we update it for sub/sup tags? -- Magioladitis ( talk) 22:54, 28 February 2014 (UTC)
rev 9959 to fix more of <sup/>, </sup/> etc. -- Magioladitis ( talk) 09:37, 1 March 2014 (UTC)
Done
Hi, it seems that #3 detects a lot of false positives: 179 pages were detected during tonight scan, and when I checked the first 4 articles (
fr:Abdallah Naaman,
fr:Adda Daouéni,
fr:Adrien de Pauger,
fr:Agriculture étrusque), they all had a <references />
through {{
references}} (which is one of the templates for references). --
NicoV (
Talk on frwiki)
01:48, 13 March 2014 (UTC)
Resolved
Yobot keeps on changing "Related topics" to "See also"...sorry, Related topics isn't wrong and no policy discourages the use of that section title, no matter how many times Yobot persists to change it.-- ColonelHenry ( talk) 18:54, 24 March 2014 (UTC)
Done
Hi, I saw that - at least for the German WP - there's a huge list of ID#84. But on virtually all sites this is because of captions that are comment by <-- and --> Problem is that often the author did not put the opening commentary-tag in the same line as the caption or that he comment multiple captions thus the second and so on are missing "their" opening tag. See any chances to get a workaround for that? -- StreifiGreif ( talk) 17:37, 7 March 2014 (UTC)
Done
Hi, should we detect #48 (internal links to the title) when they are inside <includeonly>...</includeonly>
tags ? On frwiki, all articles in
fr:Catégorie:Effectif actuel de franchise de la LNH are included in other articles, so they have a link to themselves inside a <includeonly>...</includeonly>
. --
NicoV (
Talk on frwiki)
08:43, 13 April 2014 (UTC)
Done. Bgwhite ( talk) 21:21, 18 April 2014 (UTC)
Resolved
Why is #81 off for enwp, has there been a discussion in the past which I was not a part of or...why? ( t) Josve05a ( c) 00:01, 15 April 2014 (UTC)
Done
Hi, it seems that
#67 is detected only when there's no whitespace characters between the punctuation and the reference. It would be better if . <ref
was also detected. --
NicoV (
Talk on frwiki)
09:44, 16 April 2014 (UTC)
Resolved
A user used WP:WCW to fix a spelling and punctuation mistake in an article:
I was the next one to edit the article and made completely separate edits for content, yet the previous edits noted above were automatically reversed:
I was curious if anybody knows why this happened, has it happened elsewhere, and if there is something that can be done to fix it for users that employ this tool. Thanks. Wondering55 ( talk) 20:57, 16 April 2014 (UTC)
Resolved
The link in the tab that says WMFLabs at the top of this page is not working. it brings me to an 'Internal error'-page. ( t) Josve05a ( c) 21:27, 16 April 2014 (UTC)
Done
Hi, it seems that #54 detects false positives when the list element ends with a br followed by <math>...</math>
. The math tags are probably removed before analyzing.
Example on fr:Action de groupe (mathématiques):
**[[Théorème de Cayley|par translations à gauche]] ; cette action est [[#Action simplement transitive|simplement transitive]], c'est-à-dire [[#Action libre|libre]] et [[#Action transitive|transitive]] :<br /><math>G \times G \rightarrow G,\ (g,x) \mapsto gx</math>
Maybe, rather than removing math tags, just remove the contents of the math tags? -- NicoV ( Talk on frwiki) 04:32, 19 April 2014 (UTC)
Resolved
Not sure whether I'm allowed to change "Wikipedia:WikiProject Check Wikipedia/Participants" by myself. Therefore, I'm requesting...please add me to the "Participants" list on "Wikipedia:WikiProject Check Wikipedia". Thanks.
--
LukasMatt (
talk)
05:11, 22 April 2014 (UTC)
Resolved
Please, don't hit me ! ;-)
I spent quite some time in the last weeks to fix the ISBN errors reported by CW on frwiki, and I thought I had almost finished, but I found a whole bunch of articles that don't seem to be reported. For example, fr:Pont-canal de l'Argent-Double which I fixed today wasn't reported. I'm not entirely sure, because someone may have marked the article as fixed without fixing it... Do you have an easy way to check if the previous version was detected by #69 ?
Done
Sorry to bother you again... I was wondering why there was (almost) never errors detected for #1 on frwiki, so I looked at the code: apparently only {{template:
is detected, and not the localized names for template (like {{modèle:
). --
NicoV (
Talk on frwiki)
08:06, 22 April 2014 (UTC)
Is it possible to detect how many articles has been marked as 'done' using WPC? It could be "fun" to see. ( t) Josve05a ( c) 16:40, 19 April 2014 (UTC)
Done
Hi, what HTML named characters are excluded from the search in #11? I figure dagger, emdash and endash are excluded because they got their own error. But, are there other characters excluded? (like nbsp, emsp, ...). -- NicoV ( Talk on frwiki) 11:22, 13 April 2014 (UTC)
@ Bgwhite: @ Magioladitis: I tried to go through the list of existing HTML named entities to see which ones should be reported. What do you think of this list ? (I took the current list, added what seemed reasonable, and then removed the ones that are excluded by AWB.) -- NicoV ( Talk on frwiki) 23:00, 14 April 2014 (UTC)
Done. Updated list is now in checkwiki. Bgwhite ( talk) 21:21, 18 April 2014 (UTC)
@ NicoV and Bgwhite: Now I recall we discontinued this error. There were complains that html entities should not change especially in pages about math where math formulas are allowed not only in math tags but also in plain text. This is the reason AWB skips unicodification in pages with math tags. -- Magioladitis ( talk) 17:13, 20 April 2014 (UTC)
<math>
or {{
math}}?
Bgwhite (
talk)
22:30, 21 April 2014 (UTC)
Done
Hi, it would be nice to have the "notice" column filled for #94 (like the text just before the isolated closing ref tag). I'm trying to fix them on frwiki, and when WPCleaner doesn't find the problem I don't know if it has been fixed since it has been detected or if there's a discrepancy between WPCleaner and CheckWiki script. -- NicoV ( Talk on frwiki) 21:51, 2 April 2014 (UTC)
‡Hereford United deducted 3 points for fielding an unregistered player.</ref>[1]
Greeting, wiki checkers!!
I plan to propose a GSOC project through Wikimedia this year, based around the idea of Parsoid-based online-detection of broken wikitext. The original idea of the project is defined here, Which is to develop a tool that will use parsoid to fix broken wikitext found while parsing wiki pages and then develop a user interface for editors to fix broken wikitext. But after few discussions on the project with the parsoid team, We found out that we already have tool Check Wikipedia. But it lacks the fixup information that parsoid generates while parsing wiki pages. So through my GSOC project we plan to integrate this information with your tool.
After having discussions with parsoid devs, I have written an application draft under my username GSOC Application 2014. I would be really thankful, if I get some feedback and we can have some discussion on the same. Hardik95 ( talk) 21:30, 14 March 2014 (UTC)
Hi! It seems that now CheckWiki works parallel on 2 servers: toolserver.org and tools.wmflabs.org, and they are using:
Different language communities use different servers, but they translate the same descriptions, which do not always fit to the logic. It seems to be a problem.
So, e.g., error 042 searches errors with incorrect <small> tags on the one server and <strike> tags on the other. But they take description of the error from the same page, which should be translated from enwiki translation page. Another example is error 089, etc.
(I am from eowiki.) Yurij Karcev ( talk) 06:38, 14 March 2014 (UTC)
It was suggested to exclude all pages where adding DEFAULTSORT doesn't make a difference. Redirects are an example. If a page neither
[[Category:Ä]]
requires DEFAULTSORT but [[Category:Ä|A]]
does not)it can be skipped. The following line of code should do that (again, not tested). -- TMg 20:24, 20 January 2014 (UTC)
if ( index( $text, '{{' ) >= 0 or $text =~ /\[\[($cat_regex):[^[|\]]+\]\]/i ) {
# Do the check
}
Can you write the errors on the talk page of the appropriate article? Because in many cases the author of an article watches it and then can correct the ISBN. -- Tsor ( talk) 19:38, 14 January 2014 (UTC)
Since yesterday I cannot mark articles as "Done". Leads to an error message. -- Tsor ( talk) 10:45, 16 January 2014 (UTC)
{{U|Ts
Could not connect to database: Can't connect to MySQL server on 'tools-db' (111)
. (
t)
Josve05a (
c)
14:48, 16 January 2014 (UTC)
When I fix the error 16 on arwiki is just fix about 5% of all list, I try with WCP and AWB, where the problem. -- Zaher talk 13:42, 28 November 2013 (UTC)
Hi, I've made a lot of improvements in WPCleaner to help fixing ISBN errors #69, #70, #71, #72 and #73 (which account for about 10k errors for enwiki). Some of this improvements require configuration in WPCleaner configuration file or Check Wiki configuration file.
isbn=
), possibility to search in several web sites using an other parameter of the template (for example the title). This is configurable in
general_isbn_search_engines_templates, with no default configuration as it depends on the templates of the wiki. Example available in
frwiki configuration.If you have other ideas on how to help fixing those errors, I'm quite interested. -- NicoV ( Talk on frwiki) 23:21, 19 November 2013 (UTC)
Resolved
I object to a blanket replacement of HTML entities with the corresponding Unicode character on the basis of source code readability. The Wikipedia editor lacks any mechanism to identify the character at the cursor location. Also, the editor can direct the editor to use a variety of different fonts, and the casual editor probably does not know what font is in use. Thus there are many similar characters, such as −, -, – A, Α, Η, K, Κ, N, and Ν. When these are present in the source as Unicode rather than HTML entities it is difficult for editors to know which is which. Jc3s5h ( talk) 14:28, 5 May 2014 (UTC)
<math />
or {{
math}}. When working in manual mode, no automatic replacement is done, just a suggestion to replace them by their Unicode character. When working in bot mode, automatic replacement (not sure if I should keep this). --
NicoV (
Talk on frwiki)
13:24, 6 May 2014 (UTC)I insist these bots comply with MOS:MARKUP. Jc3s5h ( talk) 13:40, 6 May 2014 (UTC)
Done
Hi, with the last dump on frwiki, I see that several articles are detected by #67 but it's a <references>...</references>
not a <ref>...</ref>
... (
fr:2 février,
fr:23 février, ...). Maybe only detect if there's no letter after ref (white space, ">", ...) ? --
NicoV (
Talk on frwiki)
08:44, 6 May 2014 (UTC)
Done
For Homepage → enwiki → High priority (and all and middle and low), would you please make the "ID" column sortable?
--
LukasMatt (
talk)
07:17, 3 May 2014 (UTC)
Resolved
Hi, are multiple <ref>...</ref>
tags separated by commas (or other punctuations) detected by #61 or #67: like <ref>...</ref>
,<ref>...</ref>
? If not, it may be useful to create a new error for that, because on many wiki, references should not be separated by normal punctuation, but rather by things like
fr:Modèle:,. --
NicoV (
Talk on frwiki)
12:51, 12 May 2014 (UTC)
Not done
Hi, when fixing ISBN in frwiki, I found a few cases where the same ISBN was defined several times in one ISBN template: one time with the "-" separators, one time without. Do you think we should create a new error for this? -- NicoV ( Talk on frwiki) 09:57, 15 May 2014 (UTC)
Please note: This is an updated version of a previous post that I made.
Hi all,
My name is Adi Khajuria and I am helping out with Wikimania 2014 in London.
One of our initiatives is to create leaflets to increase the discoverability of various wikimedia projects, and showcase the breadth of activity within wikimedia. Any kind of project can have a physical paper leaflet designed - for free - as a tool to help recruit new contributors. These leaflets will be printed at Wikimania 2014, and the designs can be re-used in the future at other events and locations.
This is particularly aimed at highlighting less discoverable but successful projects, e.g:
• Active Wikiprojects: Wikiproject Medicine, WikiProject Video Games, Wikiproject Film
• Tech projects/Tools, which may be looking for either users or developers.
• Less known major projects: Wikinews, Wikidata, Wikivoyage, etc.
• Wiki Loves Parliaments, Wiki Loves Monuments, Wiki Loves ____
• Wikimedia thematic organisations, Wikiwomen’s Collaborative, The Signpost
The deadline for submissions is 1st July 2014
For more information or to sign up for one for your project, go to:
Project leaflets
Adikhajuria (
talk)
12:43, 25 June 2014 (UTC)
Are you looking to recruit more contributors to your project?
We are offering to design and print physical paper leaflets to be distributed at Wikimania 2014 for all projects that apply.
For more information, click the link below.
Project leaflets
Adikhajuria (
talk)
14:57, 22 May 2014 (UTC)
Adikhajuria Bgwhite I would be interested on that. -- Magioladitis ( talk) 17:30, 12 June 2014 (UTC)
Not possible - Wrong forum
Why this edit was claimed as a CHECKWIKI fix? Near as I can see - it moved the authorlink parameter from next to the author to later in the reference template and removed a space. This doesn't look like any sort of error to me.... and I really prefer to see authorlinks near the author parameter - makes more sense. I also like the space - there is no rule that it shouldn't exist and it makes it easier to edit and tell sections of templates. Ealdgyth - Talk 12:30, 14 May 2014 (UTC)
Moin Moin Bgwhite and NicoV, since this evening I got to see "404 Not Found" for the script https://tools.wmflabs.org/checkwiki/cgi-bin/checkwiki.cgi is there something wrong this evening? Regards -- Crazy1880 ( talk) 17:30, 3 June 2014 (UTC)
ca:Rent (musical) gives a false positive for issue #72 because of a URL which contains the string "/qisbn=1164910567/". Can you please check on it? -- Joutbis ( talk) 18:32, 14 July 2014 (UTC)
Is the old interface gone for good? If so, how come errors #30 and #79 don't get flagged in the new one? -- Joutbis ( talk) 18:37, 14 July 2014 (UTC)
Done
I suggest that we add "u00a0" (invisible nbsp) in the list of invisible unicode characters. -- Magioladitis ( talk) 06:53, 2 August 2014 (UTC)
Done Bgwhite ( talk) 07:42, 22 August 2014 (UTC)
Not possible - Wrong forum
I was linked her by es, but the word "hard space" (1970–1991?) does not appear on the page. Any serious (AWB) es should specify by Unicode, and maybe HTML entity when needed. - DePiep ( talk) 20:45, 26 July 2014 (UTC)
Hi, I use to fix ISBN codes listed in the itwiki page of the high priorities. Unfortunately, the preceding page of the toolserver was daily updated, while this new page seems not. Am I wrong? Or....? Thanks. -- Er Cicero ( talk) 21:38, 6 August 2014 (UTC)
Hi,
Don't worry, not a request for more work to do, just an announcement to make. I'm happy to announce WPCleaner v1.32, with the main addition being the ability to add/update/remove a warning about ISBN errors (#70, #71, #72, #73) on article talk page. This can work either on a given article (from the full analysis window), or on a big bunch of articles as a bot tool (members of Category:Pages with ISBN errors, articles listed in #70-73, articles with the warning on their talk page).
Some configuration is required before being able to use it on a wiki. I've configured it for frwiki, and used it this weekend :
With the addition of the automatic detection of ISBN errors in cite templates on frwiki, I hope that it will help reduce the number of ISBN errors.
If you wish to configure this for an other wiki, please check what WPC is doing on one article before trying the bot tool on large scale. -- NicoV ( Talk on frwiki) 21:28, 27 April 2014 (UTC)
Given that I was just working on ISBN errors last night, I feel entitled to spout my two halers worth...
On the page "→ Homepage → enwiki → middle priority → ISBN with wrong length", I wish the table contained an additional indication if the error occurs multiple times in the article. Surely, if the script can find the error once in an article, it can also find the error more than once and tell us rather that hording such information for itself.
--
LukasMatt (
talk)
01:48, 29 April 2014 (UTC)
Thanks, NicoV. One more request, please. In "→ Homepage → enwiki → middle priority → ISBN with wrong length", instead of only showing 25 articles per page, can we have something like
-- LukasMatt ( talk) 12:33, 29 April 2014 (UTC)
&limit=50
to the URL like
https://tools.wmflabs.org/checkwiki/cgi-bin/checkwiki.cgi?project=frwiki&view=only&id=12&limit=50 --
NicoV (
Talk on frwiki)
13:43, 29 April 2014 (UTC)
"List of all ISBN errors" is not going to happen. That information isn't stored in the database by design.
As for "View (previous 50) (next 50)", that is a good idea. Will add it to the list of things to do.
Bgwhite (
talk)
16:48, 29 April 2014 (UTC)
@ NicoV: I am very interested in this feature, thanks for it! Will be working on assimilating this with cswiki. Matěj Suchánek ( talk | cont.) 15:06, 30 April 2014 (UTC)
Done
Copied from the section "Showing ISBN errors to other editors"
Thanks, NicoV. One more request, please. In "→ Homepage → enwiki → middle priority → ISBN with wrong length", instead of only showing 25 articles per page, can we have something like
-- LukasMatt ( talk) 12:33, 29 April 2014 (UTC)
Bgwhite, would it be possible to do the same for the list of "done" articles ? Thanks -- NicoV ( Talk on frwiki) 09:43, 25 May 2014 (UTC)
Done
Moin Moin @ Bgwhite:, since today there is a problem with "more" in every ID. If an article has an special character you couldn't open "more". If there is no special character, there is no problem. Tip: Is this a Bug from #Homepage → enwiki? Regards -- Crazy1880 ( talk) 08:41, 10 May 2014 (UTC)
Moin Moin and sorry
Bgwhite and
Redrose64, but the problem is not done. Now I have the problem in every browser, that under "more" when there is a special character you couldn't click on "done" and set it as done.
And in the IE there is the problem, that I am not able to open "more" by articles with special character. Please check there again, thanks -- Crazy1880 ( talk) 05:43, 16 May 2014 (UTC)
Hi, it seems that false positives are detected when the closing ref tag is </ref >
(with the space at the end). For
Spahettification, CheckWiki reports the error being at <ref> pour une corde du même type de 8 m
. --
NicoV (
Talk on frwiki)
05:27, 10 July 2014 (UTC)
I did not remember that but AWB fixes the spacing inside close reg tag! -- Magioladitis ( talk) 07:52, 10 July 2014 (UTC)
Hi, on frwiki,
fr:Fièvre hémorragique Ebola is detected with the following notice </ref>. | width = 225 | icd1
. The notice is related to text in the infobox, but I don't see any problem there: there's a opening ref tag before. --
NicoV (
Talk on frwiki)
16:36, 22 July 2014 (UTC)
Hi
Bgwhite,
fr:Fièvre hémorragique Ebola is popping up almost daily, and there's also a false positive with
fr:Multiplicateur de tension, with the following notice <ref name="yuan">{{Harvnb|Yuan|2010|pp=1
, where I don't see any problem. --
NicoV (
Talk on frwiki)
09:36, 8 August 2014 (UTC)
<ref name="10.1002/(SICI)1096-9071(199911)59:3<341::AID-JMV14">
. I removed the offending <. Now for the sad part. AWB did pick up the error and the correct spot. Crap.Hi, I just found out that there were several Check Wiki main pages:
-- NicoV ( Talk on frwiki) 08:13, 14 August 2014 (UTC)
Done
Hi, when clicking on "Done", the list is displayed again and at the beginning of the page, there's the name of the article that has been marked as done. If this name contains accented characters, they are badly displayed. For example, in the list for #96, I clicked on Done for Liste des députés de la treizième législature par circonscription, the page is displayed with "Liste des députés de la treizième législature par circonscription" just after the Check Wikipedia title. -- NicoV ( Talk on frwiki) 12:09, 19 August 2014 (UTC)
Done
Hi, a suggestion for a prettier notice for #25 errors: instead of displaying a <br>
between the two titles, maybe put a real line break so that the two titles are one above an other. Just a suggestion to have a better display. --
NicoV (
Talk on frwiki)
22:00, 20 August 2014 (UTC)
Done
Moin Moin Bgwhite, at this morning I would like to open the Check Wikipedia an got the following massage: Cloud not connect to database: Host '10.68.17.174' is blocked because of many connection errors; unblock with 'mysqladmin flush-hosts'. Could you have a look at? Thanks -- Crazy1880 ( talk) 04:58, 21 August 2014 (UTC)
Down again... -- NicoV ( Talk on frwiki) 07:11, 23 August 2014 (UTC)
This edit [8] breaks the formatting, because (contrary to popular belief) a blank line is not always equivalent to <p>. Please fix your tools to operate only where you understand the effects of what you're doing and, ideally, stop "fixing" things that aren't broken in pursuit of some perfectionist ideal of what markup should look like. Thanks. EEng ( talk) 00:53, 8 August 2014 (UTC)
Resolved
Hello. I've spent some time fixing ISBN errors and came here as a result of the relocation of Wikipedia:WikiProject_Check_Wikipedia/ISBN_errors. Looking at Wikipedia:WikiProject_Check_Wikipedia/List_of_errors I'm a bit worried to see "ISBN with wrong checksum" marked as "Fixed in all cases" by WPC. This sounds like a tool "fixing" ISBNs that fail the checksum test by blindly applying a recalculated checksum. I would expect this to be the wrong action about 90% of the time. Hopefully I've misunderstood. Could someone please clarify what is actually going on? TuxLibNit ( talk) 19:10, 30 August 2014 (UTC)
Done
Would you please active fa translation? I want to start translating this tool in Farsi but it doesn't have any page for farsi Yamaha5 ( talk) 05:26, 11 July 2014 (UTC)
Resolved
It seems that people keep trying to correct this error on an article I've formatted that intentionally uses an HTML quirk to have one end tag closing off two start tags so one of the start tags can be removed at a later date to display some other text (effectively <!-- foo <!-- bar -->). People keep closing off the first tag at the wrong point because it appears to be unpaired when HTML ignores any open tags in between a pair of tags. The results are here, where if you scroll down to the bottom you see that content that would have been hidden is now displayed because of the "correction". I am tired of having to re-fix these pages because people use semi-automated tools to correct this false positive. I've even had to put "There is no need for another closing comment tag" into the hidden text to jump out at people who constantly break the page but no one notices.— Ryūlóng ( 琉竜) 14:14, 29 August 2014 (UTC)
Resolved
Please update the arwiki Last scanned dump 2014-04-07 (80 days old). -- Zaher talk 23:19, 26 June 2014 (UTC)
Done
Hi! I can't find where are double small tags here. There are 90k entries so I thought it's something in a template but I haven't found anything. Thanks for your help! -- AlessioMela ( talk) 08:40, 1 July 2014 (UTC)
People here might be interested in the thread Wikipedia:Village_pump_(technical)#Parsoid_Based_Linter.-- Salix alba ( talk): 02:38, 9 July 2014 (UTC)
-- Magioladitis ( talk) 20:47, 19 August 2014 (UTC)
Hi,
fr:Élément meta is reported by #92 with the notice "=== L'attribut ===". It seems that it's because there are several titles in the form L'attribut <code>something</code>
. I think contents of <code>...</code>
should be kept for analyzing #92. --
NicoV (
Talk on frwiki)
10:36, 14 August 2014 (UTC)
Done
Hi, I saw in CW main page that for frwikiversity links to project page and translation page are pointing to frwiki. There's a project page and a translation page, but I'm not sure if they're correct (I will try to update the translation page using what's in frwiki). -- NicoV ( Talk on frwiki) 09:21, 25 August 2014 (UTC)
Done
Hi, like in the past update I can't find double tag small in those 90k articles. -- AlessioMela ( talk) 17:54, 26 August 2014 (UTC)
The WikiProject Report would like to focus on WikiProject Check Wikipedia for a Signpost article. This is an excellent opportunity to draw attention to your efforts and attract new members to the project. Would you be willing to participate in an interview? If so, here are the questions for the interview. Just add your response below each question and feel free to skip any questions that you don't feel comfortable answering. Multiple editors will have an opportunity to respond to the interview questions, so be sure to sign your answers. If you know anyone else who would like to participate in the interview, please share this with them. Thanks, Rcsprinter123 (constabulary) @ 08:38, 29 August 2014 (UTC)
Discussion in User_talk:Frietjes#Infoboxes_to_take_of revealed that most probably Error #31 needs expansion to cover more HTML table tags. -- Magioladitis ( talk) 22:45, 31 May 2014 (UTC)
<table
. There are legitimate cases where <td>
can be used. Will first check the upcoming June dump file to see the lay of the land for tr and td tags.
Bgwhite (
talk)
06:47, 1 June 2014 (UTC)
<tr>
. I do expect articles to go onto the whitelist. A listing of articles can be found at
User:Bgwhite/Sandbox1.
Bgwhite (
talk)
00:25, 16 September 2014 (UTC)Hello! I'd like to propose to detect a new error type: sometimes there are an in-page interlanguage links written as a regular interlanguage links, i.e. without a starting colon. But they are obviously in-page links since they contain a pipe symbol. For example, this situation was on a page 男同性恋免疫缺乏症 of Chinese Wiki (I don't know such examples in En.Wiki), which contained two such links: [[en:Kaposi's sarcoma|卡波西氏肉瘤]] and [[en:Pneumocystis pneumonia|卡氏肺囊虫肺炎]]. A link part after the pipe symbol is obviously useless for the regular interwikis and this situation is undoubted error. -- Emaus ( talk) 14:35, 2 June 2014 (UTC)
@ Bgwhite and NicoV: [[[[foo]]]] is caught as #64 by CHECKWIKI but as #10 by WPCleaner. It is not fixed by AWB. -- Magioladitis ( talk) 06:51, 18 June 2014 (UTC)
OK. I am getting rusty. Sorry again. This one show that AWB did not fix 64. but this is maybe due to the order of how stuff is done. Same here. -- Magioladitis ( talk) 13:14, 20 June 2014 (UTC)
@ Bgwhite: After the last dump I realised that the whitelist for #48 never works. Same for the #101 whitelist. -- Magioladitis ( talk) 08:09, 18 June 2014 (UTC)
@
Bgwhite: Error 24 whitelist does not work. --
Magioladitis (
talk)
08:46, 21 September 2014 (UTC)
@ Bgwhite: Error 31 and 49 whitelists do not work. -- Magioladitis ( talk) 09:59, 21 September 2014 (UTC)
Done
We should exclude anything inside timeline tags. -- Magioladitis ( talk) 07:10, 19 June 2014 (UTC)
Done
We should exclude search inside {{ Not a typo}}. -- Magioladitis ( talk) 07:49, 20 June 2014 (UTC)
Hi @ Bgwhite:, I was wondering if we could enhance the integration between Check Wiki and tools like WPCleaner, by providing access to the direct analysis of an article in Check Wiki: I'd like to be able to send a request to Check Wiki script checkwiki_bots.cgi (with the following parameters: wiki, article title, article text) and receive an answer telling me which errors are still detected and where (character position ?). I don't know how much work that would be on your side, but that could be very helpful to users when WPCleaner doesn't detect the problem CW detected: we would know if CW thinks that the problem is still present and where, so I could tell the user where it is on their current version of the article. -- NicoV ( Talk on frwiki) 20:01, 10 August 2014 (UTC)
I can see that this wikiproject uses scripts and tools to assist work of the participants. I have a feeling that (usually) routinely done tasks are to be done server-side instead. What wiki software features would ease this work? Gryllida ( talk) 04:13, 17 September 2014 (UTC)
Hello there! As you may already know, most WikiProjects here on Wikipedia struggle to stay active after they've been founded. I believe there is a lot of potential for WikiProjects to facilitate collaboration across subject areas, so I have submitted a grant proposal with the Wikimedia Foundation for the "WikiProject X" project. WikiProject X will study what makes WikiProjects succeed in retaining editors and then design a prototype WikiProject system that will recruit contributors to WikiProjects and help them run effectively. Please review the proposal here and leave feedback. If you have any questions, you can ask on the proposal page or leave a message on my talk page. Thank you for your time! (Also, sorry about the posting mistake earlier. If someone already moved my message to the talk page, feel free to remove this posting.) Harej ( talk) 22:47, 1 October 2014 (UTC)
For the last few days Check Wikipedia reports no errors at all at the Polish Wikipedia. Please have a look. ToSter ( talk) 12:47, 16 October 2014 (UTC)
I saw a bot correction of a citation I posted the other day, and the edit summary referred me here to the description of error number 48, title linked in text. But the cite template documentation says that the title of a source can be wikilinked to an existing Wikipedia article, as I attempted to do. Did I throw the error with my citation because the span of text wikilinked was no letter-for-letter identical with the title of the book in the template title field? If so, I can fix the problem by setting up a redirect to the article. The citation I put in new articles the other day is shown here (the raw mark-up of this question in edit mode will show exactly how I coded the template).
Flynn, James R. (2009).
What Is Intelligence?: Beyond the Flynn Effect (expanded paperback ed.). Cambridge:
Cambridge University Press.
ISBN
978-0-521-74147-7. {{
cite book}}
: Unknown parameter |laydate=
ignored (
help); Unknown parameter |laysummary=
ignored (
help)
Thanks for any advice you have about this. -- WeijiBaikeBianji ( talk, how I edit) 18:06, 8 October 2014 (UTC)
NicoV Magioladitis After looking at some of the articles in a list of #39 errors not fixed by a bot, I've noticed some "false positives". I use quotation marks because it is actually errors with mediawiki that is causing the problem.
Newlines don't function in <blockquote>
, {{
quote}}, {{
cquote}} and {{
quotation}}. I have the checkwiki code skip these for error #39. After looking at the new list of articles, <ref>
, [[Image: and {{
bq}} also don't work.
<skip several hours>
I have the bug bookmarked and brought it up. Low and behold, the patch that was submitted in December 2011 was finally accepted. Final changes were made today on enwiki. Turns out Visual Editor was assuming newlines worked the same everywhere... silly VE. So, VE started the move to finally fix the problem. Hey, who knew, VE was actually helpful for the first time ever. According to the log, it only took 8 1/2 years to fix.
I've verified that {{
quote}}, {{
cquote}} and {{
quotation}}, <blockquote>
and {{
bq}} now treat newlines correctly.
I've verified that <ref>
and [[Image: still barfs on newlines.
I need to add the ref and various image tags to #39's code and remove the currently skipped templates in #39's code. Bgwhite ( talk) 05:22, 16 October 2013 (UTC)
a
b
c
We can re-enable search inside <blockquote>
since bug fixed. --
Magioladitis (
talk)
23:49, 15 September 2014 (UTC)
Time to start thinking about what new errors should be added to Checkwiki.
Ping: Magioladitis, NicoV, Meno25, Crazy1880, LindsayH, GoingBatty, Matěj Suchánek, Josve05a, ChrisGualtieri, Graham87. I think that is everybody. If not, add them to the list.
What should or should not be added will be determined by several factors:
Some examples:
<strike>
with <s>
. It would take a copy/paste to code up. WPCleaner finds and fixes the problem. It would be Low priority.Bgwhite ( talk) 01:34, 26 November 2013 (UTC)
A few suggestions:
The errors I suggested are covered by the Database reports on English Wikipedia. Database reports are updated regularly only on enwiki, Commons and Meta. Moving the errors to checkwiki means that the reports would get generated for other wikis too. So, maybe disable those errors for enwiki and enable them for other wikis. -- Meno25 ( talk) 06:43, 26 November 2013 (UTC)
CHECKWIKI is more about common syntax errors. We need to focus on that. If lists are already generated by other bots/projects we do not need to duplicate the job. Bgwhite's idea of unspaced DEFAULTSORT is a great example of what we are after. WPC's extended list is another good example. I have some minor suggestions:
I don't know if this is an error or maybe already monitored but:
{{
cite web}}
without access dates.<ref>http://exemple.com/</ref>
is used without title/description. This is to prevent link rot.|accessdate=
.( t) Josve05a ( c) 11:52, 26 November 2013 (UTC)
Hi, I think new errors should be generic enough to work on most wikis, so avoid very specific errors (for example: {{
cite web}}
without access dates should be dealt by the template itself: put the page in a maintenance category if access dates are missing). Otherwise, some of WPCleaner errors in the #5xx numbers:
{{Template:...}}
(low)<strike>...</strike>
<a>...</a>
Some of them are probably hard to develop or require access to a lot more information, so they will be difficult to add (non-existent templates / files, ...) -- NicoV ( Talk on frwiki) 12:57, 26 November 2013 (UTC)
A few more:
-( t) Josve05a ( c) 16:12, 26 November 2013 (UTC)
Ping: Magioladitis, NicoV, Meno25, Crazy1880, LindsayH, GoingBatty, Matěj Suchánek, Josve05a, ChrisGualtieri, Graham87.
Following is a list of errors that I think could be added. Some notes:
Description | Priority | Coding | Tools to detect | Tools to fix | Other |
---|---|---|---|---|---|
Useless "Template" in {{Template:...}} | low | Done | WPC, AWB | WPC, AWB | #1 (#502) |
Internal link written as an external link | medium | Done | WPC | WPC & Frescobot | #90 (#511) |
Interwiki link written as an external link | low | Done | WPC | WPC | #91 (#512) |
Internal link inside an external link | medium | WPC (#513) | WPC | ||
<strike>...</strike>
|
low | Done | WPC, AWB | WPC, AWB* | #42 (#517). Obsolete in HTML5. Use <s>...</s> instead
|
<a>...</a>
|
low | Done | WPC | WPC | #4 (#519) |
URL without http:// | high | Done | WPC, AWB | WPC, AWB | #62 |
Finding cases of url= http://http:// | medium | Done | WPC, AWB | WPC, AWB | #93 |
Blank lines in bulleted vertical lists | medium | Accessibility issue per Wikipedia:Accessibility#Blocked elements | |||
Putting the TOC in the standard position | medium | Done | WPC | #96 and #97. Accessibility issue per MOS Elements of the lead | |
No blank space after the comma in DEFAULTSORT | low | Done | WPC, AWB | WPC, AWB | #89 |
Unbalanced ref tags | medium | Done | WPC, AWB | WPC, AWB | #94 |
Detecting user signatures in articles | low | Done | WPC, AWB | WPC, AWB | #95 |
Detecting fat redirects (redirects obscuring page content) | low | ||||
<span class="plainlinks"> in articles | low | ||||
Pipe in external link [http:/www.wikipedia.org|Wikipedia] | low | ||||
Link to a year which has another description ([[2012|2013]]) | low | This error is often caused by VE. | |||
Cases of {{cite web|url=http://www.wikipedia.org| title= | medium | ||||
Move anchor in front title in heading | |||||
Detect non-existent files (red linked files) | |||||
Detect non-existent templates | WPC (#508) | ||||
Detect refs <ref name=> | low | easy | often detected as #56 | ||
Category with double colon | easy | AWB | |||
More same parameters in template | medium | medium |
Magioladitis, NicoV, Meno25, GoingBatty, Matěj Suchánek, Josve05a, ChrisGualtieri
<a>
<strike>
The script found an external link that should be replaced with a interwiki link. An example would be on enwiki [http://fr.wikipedia.org/wiki/Larry Wall] should be written as [[:fr:Larry Wall]]
so it says fr.wikipedia.org in the extrnal link and not en.wikipedia.org. -(
t)
Josve05a (
c)
21:07, 24 December 2013 (UTC)Bgwhite I've updated WPCleaner (version 1.31) for the following errors for all wikis: #1 (previously #502), #4 (previously #519), #42 (previously #517), #90 (previously #511), #91 (previously #512). Still have to do: #62, #89, #93, #94. Old #62 and #89 have been disabled. -- NicoV ( Talk on frwiki) 21:51, 22 January 2014 (UTC)
[http://www.imdb.com/name/nm0403424/ Hurley on the [[Internet Movie Database]]]
to [[:imdbname:0403424|Hurley on the]][[Internet Movie Database]]]
. I see multiple issues with this. It removes the blank space, it leaves 3 bracket at the end (without the WPCleander reporing it. (Found on
Colin Hurley). (
t)
Josve05a (
c)
10:52, 23 January 2014 (UTC)
NicoV and Matt S., in theory frwiki and cswiki should start seeing the new errors at the next 0z run.... if the database is up. Today's outage was caused by a disc getting full. Bgwhite ( talk) 07:38, 24 January 2014 (UTC)
Discussion
| ||||
---|---|---|---|---|
If a website is called "www.news.de" for example something like this is valid in the German Wikipedia: <ref>www.news.de: [http://www.news.de/article Article].</ref> <ref>www.news.de: ''[http://www.news.de/article Article]''.</ref> This shouldn't be reported as an error. Would be nice to have this excluded somehow. Disabling the check would also disable the check for /(?:<ref\b[^<>]*>|url\s*=)\s*www\w*\.(?![^<>[\]{|}]*\[\w*:?\/\/)/i
-- TMg 17:10, 19 January 2014 (UTC)
|
Resolved
Hi, on frwiki, there are 5 false positives for #2:
-- NicoV ( Talk on frwiki) 13:37, 17 November 2014 (UTC)
It seems to be happening again on frwiki ( fr:Antihéros, fr:Insulte, ...) but I don't find anything wrong in the articles, even somewhere else. -- NicoV ( Talk on frwiki) 09:21, 5 April 2014 (UTC)
Hi, I know that you're always looking for more work since it's so easy to use Labs ;-)
I'd like to suggest adding some statistics for Check Wiki to give us some information on how errors evolve on each wiki. Would it be possible to add a table with the following informations ?
-- NicoV ( Talk on frwiki) 10:21, 6 November 2013 (UTC)
Please include pages in namespace "ملحق" (NS:104) on Arabic Wikipedia (arwiki) in the lists generated by Checkwiki script. This namespace contains lists and years pages. Pages in that namespace are counted in the number of articles (magic word: {{NUMBEROFARTICLES}}) and AWB's Auto-Tagger already tags articles in that namespace. -- Meno25 ( talk) 12:11, 23 November 2013 (UTC)
Hi all. In Demons (novel) the section headed "Characters" employs paragraphs within a bulleted list. This has been coded per the advice given here, but Yobot (and, I think, other AWB-based robots) persists in making "corrections": [11] [12] [13] [14] [15] [16] and so on. Aside from destroying the logical structure of the section, this is also contrary to accessibility guidelines.
I note that the detection of error #39 has already been modified to accept the use of <p>
s within certain tags, such as <blockquote>
. Can this tolerance be extended to include <p>
s within lists?
(I was uncertain whether to raise this concern here, with Yobot, or with AWB. If I've chosen the wrong place, could you please let me know, and I'll try again.) In the meantime, thanks for your collective good work with checkwiki: fighting the good fight, and at scale! — Simon the Likable ( talk) 13:49, 10 February 2014 (UTC)
<p>
tags, with the : it appears as an one item list to a screen reader.
Bgwhite (
talk)
07:12, 11 February 2014 (UTC)
<p>
s), but have also taken on board
Graham87's point and removed the blank lines between list items. Thus, I think the
current version covers both visual and accessibility requirements, and follows recommended coding practices in
Help:List#Paragraphs_in_lists and now
WP:LISTGAP.<p>
s within lists? (Or perhaps there is some other solution?) —
Simon the Likable (
talk)
13:59, 11 February 2014 (UTC)
<li>
tags. If a blank line happens, the list ends. In Magioladitis' version, it starts as a list. When the first : happens, the list is ended. The HTML tags to produce the layout for the : consists of <dl>
and <dt>
tags. The use of the dl and dt tags is standard HTML practice when text needs varying indentation. The source for this talk page is full of dl and dt tags.
Bgwhite (
talk)
06:23, 12 February 2014 (UTC)<p>
error in the article.
Bgwhite (
talk)
06:23, 12 February 2014 (UTC)Bgwhite thanks to Frietjes we found a wonderful workaround called {{ paragraph break}}. -- Magioladitis ( talk) 08:30, 24 January 2015 (UTC)
Is the code (or list of regular expressions) available? I believe I could suggest some improvements for cutting down on false positives and/or the number of whitelisted articles for some of the lists. Frietjes ( talk) 15:35, 17 October 2014 (UTC)
$test_text =~ s/\{\{\{\|safesubst:\}\}\}//g;
<tr
' to '<tr[^a-z]
' in error_031_html_table_elements which would avoid matching '<transcript>
' and other non-table tags that start with tr.
Frietjes (
talk)
16:34, 17 October 2014 (UTC){{{|safesubst:}}}
, which is suboptimal :( I suppose the better thing would be to fix
Module:RfD, but it seems as though there was a logical reason for
adding it there. not sure if there is any other solution, but we shall see. it would be a shame to have to resort to such hacks since, technically, {{{|safesubst:}}}
is a programming element.
Frietjes (
talk)
21:08, 17 October 2014 (UTC)Frietjes, Bgwhite RfD changed the code used. Hopefully, this resolves are problem. -- Magioladitis ( talk) 08:32, 24 January 2015 (UTC)
Hi, with the latest full dump, there seems to be a lot of false positives for #87 (HTML entities without ;). Examples from the 25 first pages reported:
&intr
),
fr:Association malienne des droits de l'homme (&intr
),
fr:California Love (&interval
),&geocode
)<ref>...</ref>
tag:
fr:Avahi (&geissmann2000
),
fr:Avahi du Sambirano (&geissmann2000
),
fr:Ayurveda (&Rhodes
),
fr:Baryonyx (&Newsbury2004
),
fr:Biochar (&Lehmann2008
),
fr:Caraka Saṃhitā (&Rhodes
),
fr:Carnotaurus (&chiarelli2009
)<timeline>...</timeline>
tag:
fr:Canton de Steenvoorde (id:Blancs&Nuls
)&CentralTower
)&phis;
)&Gem
, probably matching &ge
),
fr:Aldo Cibic (&Partners
, probably matching &part
)-- NicoV ( Talk on frwiki) 20:54, 21 July 2014 (UTC)
how about a check for this? Frietjes ( talk) 16:27, 10 November 2014 (UTC)
<ref >
(changed to <ref>
by BG19bot) is not an error and should not be changed. Whitespace is permissible here and even has advantages, as giving word wrap a safe place to break lines without introducing either syntactic or legibility confusion.
Andy Dingley (
talk)
14:16, 14 November 2014 (UTC)
< ref>hi</ref>
is the same thing as <ref >hi</ref>
(in either XML or in wikicode parsing) then you really do need to fix your bot.<ref >hi</ref>
is well-formed and should not be messed with by 'bots.
Andy Dingley (
talk)
22:29, 14 November 2014 (UTC)
Resolved
Hi, on frwiki, there are 5 false positives for #2:
-- NicoV ( Talk on frwiki) 13:37, 17 November 2014 (UTC)
@ NicoV: After discussion with Bgwhite CHECKWIKI now checks for the following magic words too: "BASEPAGENAME", "FULLPAGENAME", "PAGENAME", "PAGESIZE", "PROTECTIONLEVEL", "Pagename", "SUBPAGENAME", "Subpagename". -- Magioladitis ( talk) 23:48, 2 January 2015 (UTC)
Is this template something that would be useful to you guys? That is, if users were educated to flag problems with it, would it help you find currently missed errors? The TfD discussion is at Wikipedia:Templates for discussion/Log/2014 November 17#Template:Coding. Comments are welcome! —PC -XT + 06:52, 22 November 2014 (UTC)
There are many false positives for error #43 which include usage of the {{ familytree}} template - at plwiki pl:Burbonowie might be an example. That's probably because the brace '}' can be used legally as a parameter. ToSter ( talk) 20:55, 3 November 2014 (UTC)
{{(}}
and {{)}}
. (3) add a line to the checkwiki.pl script to do something like content =~ s/(\{\{[Ff]amilytree[^\{\}]*)[\{]([^\{\}])/$1{$2/g;
content =~ s/(\{\{[Ff]amilytree[^\{\}]*)[\}]([^\{\}])/$1}$2/g;
ae
, so it's an easy replacement.
Frietjes (
talk)
21:10, 5 November 2014 (UTC)
For most of the life of
Shooting of Michael Brown, we have used
list-defined references and commented out unused refs rather than removing them. The commenting technique we have consistently used is to change <ref name=...>
to <!--ref name=...>
and change </ref>
to </ref-->
. This method requires the least amount of effort. This has not been a problem until
this bot edit, which used WCW according to its editsum.
We have no problem with the change to Vox.Feds, since it was commented incorrectly to begin with. For the remaining three refs, WCW apparently "fixed" the leading ref tags, despite the fact that they were inside valid comments. This requires us to (1) notice what the bot did, and (2) then clean up after it. We wonder why this has happened for the first time since we started using this technique in August, and we would like to know what we can do to prevent it from happening again. I'm watching, so no need to ping me. ‑‑ Mandruss ☎ 08:45, 13 December 2014 (UTC)
>
. This is why the bot arrived at the page.Mandruss Hi. I used my bot account but it was a manual edit. No unclosed comment tag are fixed in bot mode. Feel free to improve. -- Magioladitis ( talk) 13:12, 13 December 2014 (UTC)
Maybe it's time to add unclosed center tags as error #102? Errors 28 and 39 reduced and we need a need game to play with. -- Magioladitis ( talk) 08:24, 3 October 2014 (UTC)
Resolved
Have a look at this page - a page without a title is reported. ToSter ( talk) 07:55, 9 November 2014 (UTC)
Resolved
I don't understand how the whitelists are handled - is there any guide on this? At plwiki, there is a whitelist for #58 but checkwiki still reports pl:Remixes 81 - 04. ToSter ( talk) 21:31, 19 November 2014 (UTC)
Resolved
The presence of empty rows, as I removed for instance here. — TheDJ ( talk • contribs) 13:58, 25 November 2014 (UTC)
class="wikitable" class="wikitable sortable"
→ class="wikitable sortable"
or style="foo1" style="foo2"
→ style="foo2"
. basically, duplicate class or style declarations where the first one is ignored due to the presence of the second.
Frietjes (
talk)
16:55, 25 November 2014 (UTC) Resolved
Hi, is it possible to run an instance of the checkwiki tool (the lists of errors) on nonWMF wiki project? We would like to catch and be able to fix errors like you do. Thanks. -- Wesalius ( talk) 07:27, 5 December 2014 (UTC)
Bgwhite Is
this dump working? Its produced with php /var/www/wiki/maintenance/dumpBackup.php \ plugin=AbstractFilter:/var/www/wiki/extensions/ActiveAbstract/AbstractFilter.php \ --current \ --report=100 \ --output=gzip:/var/www/wiki/WSdump2.gz \ --filter=namespace:NS_MAIN \ --filter=noredirect \
.
Bgwhite How did it go?-- Wesalius ( talk) 17:49, 20 December 2014 (UTC)
Resolved
@ NicoV and Bgwhite: Something is wrong. WPCleaner doesn't list any errors on svwp, even though there are. Itworks with enwp, but not with svwp. ( t) Josve05a ( c) 18:49, 13 December 2014 (UTC)
Done
Hi, I wish you luck with Labs...
See Wikipedia talk:WPCleaner#CheckWikipedia_does_not_work_on_sv.wikipedia.org.5B....5D, svwiki has no errors reported on Labs, while some are reported on toolserver. -- NicoV ( Talk on frwiki) 06:55, 6 November 2013 (UTC)
Done
Would it be possible to configure a list of abbreviations for which it would be normal to have the reference just after a punctuation ? For example, etc.<ref>REF</ref>
is OK because etc.
is an abbreviation. WPCleaner uses error_067_abbreviations_..
to configure this list. --
NicoV (
Talk on frwiki)
13:50, 13 November 2013 (UTC)
Done
Moin Moin Bgwhite, at the german Wikipedia reached me a question. I set a DEFAULTSORT (german: SORTIERUNG), but before there was a DEAFULTSORT directly at the categorie ( see this link). In the article is the template "Disambiguation" and set automatically the categorie "Begriffsklärung". Could you say me, if its right or wrong to do so? Thanks. -- Crazy1880 ( talk) 09:03, 29 November 2013 (UTC)
Done
It seems like the tool assumes that all projects capitalize the first letter. That's not true for Wiktionaries, so those links usually point to the wrong entry. 18:54, 22 December 2013 (UTC) — Preceding unsigned comment added by Skalman ( talk • contribs)
Done
Error #64 needs to be corrected too. E.g. [[a|A]] is being reported, even though [[A]] does not point to the same page in Wiktionaries. Skalman ( talk) 00:11, 7 January 2014 (UTC)
Done
Below I will list some false possitives for the #89-error that I've/will encounter/d (right now there is only one, but I will find more...)
{{DEFAULTSORT:UTC-08:30}}
( t) Josve05a ( c) 18:29, 23 December 2013 (UTC)
2,5-Dimethoxy-4-chloroamphetamine, 1,4,6-Androstatriene-3,17-dione, 2-Phenyl-3,6-dimethylmorpholine etc. is false possitives since it does have a comma, but is not suposed to have a space between. -( t) Josve05a ( c) 20:49, 24 December 2013 (UTC)
Resolved
The program WPCleaner detects <small>
-tags as a #42-error. I belewe (of what I can understand, that that error is only there for reporting strike-tags and not small-tags. It might be a bug in the program or in the CHECKWIKI-coding.
<small>(television)</small>
and <small>(singing)</small>
as #42-errors.<small>(eliminated 2-4)</small>
as a #42-error.<small>(with [[Ike Turner]])</small>
as a #42-error.<small>[[UK Singles Chart]]</small>
as a #42-error.<small>Annual sales estimates reflect free admission for Wayne, Oakland, and Macomb county residents for millage years. Expenditures rise about 1.9% annually for inflation. Investments yield about 3.8% annually.</small>
as a #42-error.( t) Josve05a ( c) 18:49, 23 December 2013 (UTC)
Done
2 observations for the wmflabs version:
-- Steenth ( talk) 14:57, 7 January 2014 (UTC)
Resolved
If we know any more errors that can be implemented, list the here.
[http://example.com/ Website where [[Anders Smith]] is a writer.]
-(
t)
Josve05a (
c)
15:49, 8 January 2014 (UTC) Resolved
Moin Moin
Bgwhite, since the update to wmf10 the daily scan for new "errors" ins't running. Can you check this, please. Thank you and regards --
Crazy1880 (
talk)
18:22, 17 January 2014 (UTC)
Resolved
We specifically use DEFAULTSORT with special characters in order to put pages in our preferred order.
To clarify: [1] and [2] don't make sense for us. Skalman ( talk) 23:28, 6 January 2014 (UTC)
Resolved
NicoV, TMg, Josve05a, Matěj Suchánek and Kwami
New Unicode control characters and the entire Private Use Areas (PUA) are now being checked for enwiki only.
I'm not a Unicode expert or do I understand some things. Magioladitis knows more about this. Should any of the new control characters be ported to other wikis? Bgwhite ( talk) 21:35, 17 January 2014 (UTC)
I've gone through the PUA to Cao Hong, maybe 30% of the total. This is quite manageable. There are very few that are intentional, and most of those deal specifically with assignments to the PUA (such as the Apple logo). Those can be substituted with &#x...; and tagged with {{ PUA}} for future maintenance. Some are stray characters which can just be deleted. PUA within text is almost always due to copying and pasting. Often the original can be found by doing a Gsearch of the surrounding text and corrected. In relatively few cases do we need to alert someone familiar with the article to fix. Of the articles I reviewed (up to Cao Hong in BG's sandbox list), I skipped emoji as too much work, and left notes on the talk pages of IBM 1620 and Sakya. Multiply that by 3 or 4 and we really don't have much work to do, and once we take care of the backlog, it should be easy to keep up with the dump. — kwami ( talk) 00:51, 18 January 2014 (UTC)
Can't find the PUA in Nay Toe.
The Inner Mongolian govt and publishers use PUA rather than Unicode for classical Mongolian script, so we may want to handle these separately. We'd want to embed a supporting font in WP at least. But Mongolian WP uses Cyrillic, so it shouldn't be a problem to scan WP-mn for PUA. — kwami ( talk) 02:28, 18 January 2014 (UTC)
Sakya's been fixed. Ask user:BabelStone to convert Tibetan PUA. — kwami ( talk) 06:32, 18 January 2014 (UTC)
Resolved
{{Liste|Reason. --[[User:Example]]}}
is allowed in an article in the German Wikipedia. Do you think it's possible to add an "allow user signatures in whitelistes templates" feature? I'm not sure if it's worth the trouble. Maybe it's easier to disable the error in dewiki.-- TMg 18:20, 27 January 2014 (UTC)
error_095_templates_frwiki=
Utilisateur
Utilisatrice
Discussion Utilisateur
Discussion Utilisatrice
Discussion utilisatrice END
Resolved
Regarding edits like
this - they are unnecessary. The <br />
tag is perfectly valid HTML 5, and indeed,
HTML Tidy converts all <br>
to <br />
when a Wikipedia page is served. --
Redrose64 (
talk)
21:44, 28 January 2014 (UTC)
<br />
but <br/ >
and I believe they are incorrect (not 100% sure that whitespace is accepted between "/" and ">". --
NicoV (
Talk on frwiki)
22:04, 28 January 2014 (UTC)
Resolved
Hi,
I'm just starting this thread to be sure I'm not missing anything I need to do in WPCleaner to be coherent with the recent changes in Check Wiki. Feel free to edit directly the list below. -- NicoV ( Talk on frwiki) 10:42, 31 January 2014 (UTC)
<a>
(previous error #519 renumbered)<strike>
(previous error #517 renumbered)http://
(old error removed, new error added)http://
(new error added) Resolved
Hi Bgwhite,
Errors #96 and #97 have a _templates_ parameter in Wikipedia:WikiProject Check Wikipedia/Translation. How the parameters are used? For example, ABP is detected by #96 because there's {{ toc right}} (lowercase) in it, but in the parameter, there's only "TOC[ ]+right" (uppercase). -- NicoV ( Talk on frwiki) 15:29, 12 February 2014 (UTC)
[ ]+
) necessary in the _templates_ parameter ? There are no regular expressions in templates list for other errors (#3, #28). --
NicoV (
Talk on frwiki)
13:37, 13 February 2014 (UTC)
[ ]+
and do a simple template name comparison in WPCleaner. --
NicoV (
Talk on frwiki)
12:53, 16 February 2014 (UTC)
[ ]+
. You shouldn't have to change when I can just added the templates twice. It should be changed on my end and not yours.
Bgwhite (
talk)
07:13, 17 February 2014 (UTC)
Resolved
Hi, bug opened about WMFLabs being completely out again. -- NicoV ( Talk on frwiki) 12:51, 16 February 2014 (UTC)
Resolved
Hi, it seems that #3 doesn't take into account the list of templates that can be used instead of <references>
. On frwiki, the full scan has just run, and we end up with 400k articles listed in #3. I checked the first one in the list
fr:!!! which hasn't been modified for months, and has {{
références}} at the end of the article --
NicoV (
Talk on frwiki)
17:00, 2 March 2014 (UTC)
Done
The checkup for <math>-tags should disregard programming-tags like <math.h> header library that are mentioned in several articles. -- StreifiGreif ( talk) 16:37, 3 March 2014 (UTC)
<math.c>
tags should be between these three tags. There might be some unintended consequences, so give a yell if you see problems.
Bgwhite (
talk)
20:21, 3 March 2014 (UTC) Resolved
Hi, on frwiki, #90 is detecting
fr:Diplomatie (jeu) because of [http://fr.wikipedia.org/wiki/Allan_B._Calhamer?redirect=no Allan B. Calhamer]
. Should it be detected? Is there a wiki syntax that can be used to convert this external link into an internal link? --
NicoV (
Talk on frwiki)
12:13, 27 February 2014 (UTC)
I did not come across any articles with redirect=no. -- Magioladitis ( talk) 08:32, 28 February 2014 (UTC)
The twice monthly dump files are not being processed at the moment. WMFLabs has a problem with mounting various directories, including where the dumps are located. Problems have been going on for a few days. A bug report has been filed, but no action or acknowledgement of the bug report has happened. So, unknown when this will be fixed. Bgwhite ( talk) 21:57, 21 January 2014 (UTC)
Greetings Wikipedia checkers! I have a question.
Over at
the village pump I'm talking to people about the feasibility of cleaning up all the copy-and-pasted comments in template documentation that derive from {{
Documentation/preload}}. My reasoning is that they cause clutter and represent a low-quality form of documentation that can't be updated easily. Some editors have suggested that they're necessary to prevent inexperienced template editors from including template categories directly in templates, when our standard procedure is to place them in <includeonly>
blocks on template documentation pages. I think that this is not enough of a problem to merit thousands of copies of the same string of text being pasted into templates. Fixing occurrences of it is a task completely suited to a bot such as the ones you operate. What would you say about the feasibility of adding that as a task? My thinking is that the logic would be something like:
That doesn't strike me as being particularly complex by the standards of your project. If you think that it is a reasonable goal, that would be just great. Ideally, I'd like to rewrite the template documentation documentation template (try saying that five times in a row) to better explain how template categories should work, and then commission a one-off bot run to clean out all the variants of the copy-and-pasted comments.
What do you think? Thanks, — Scott • talk 13:42, 23 January 2014 (UTC)
Would you be interested in participating in a user study? We are a team at University of Washington studying methods for finding collaborators within a Wikipedia community. We are looking for volunteers to evaluate a new visualization tool. All you need to do is to prepare for your laptop/desktop, web camera, and speaker for video communication with Google Hangout. We will provide you with a Amazon gift card in appreciation of your time and participation. For more information about this study, please visit our wiki page ( http://meta.wikimedia.org/wiki/Research:Finding_a_Collaborator). If you would like to participate in our user study, please send me a message at Wkmaster ( talk) 13:07, 18 February 2014 (UTC).
The powers that be are in the process of moving everything at WMFLabs to a new data center. Checkwiki's move barfed. Checkwiki will be down until things get fixed. Bgwhite ( talk) 09:32, 5 March 2014 (UTC)
Done
@ Salix alba:, @ NicoV:, @ Magioladitis:
Salix alba asked a
question about mismatched <sub>
and <sup>
tags. He was guessing there are ~4,000 articles with problems. After doing a scan, he is wrong. There are 7,096 articles from February's dump file. Examples are:
Looking at the source code of the rendered web pages, it appears the MediaWiki software does convert the mismatched tags to the correct value. However, there are around ~400 articles where there are broken or missing tags and this does cause rendering problems.
However, the majority of problems come at the end of a table cell where it doesn't do damage.
Should this be added to Checkwiki? AWB doesn't currently warn or fix the problem, not sure about WPCleaner. Should these be added to AWB and/or WPCleaner?
Bgwhite (
talk)
08:25, 27 February 2014 (UTC)
Bgwhite I could fix the <sup/> and <sub/> if someone give me the list. -- Magioladitis ( talk) 09:36, 27 February 2014 (UTC)
<sup id="foo">ref</sup>
or a style attribute, this breaks my simple test. There seem to be a couple of different errors e<sup>x</sub>
and e<sub>x</sup>
in all the cases I've looked at its the first tag which is correct, and could probably be auto corrected. There is also a bunch of cases where there in just one tag, say a single <sup>
or </sub>
alone. Sports articles seem to have a lot of these. It seems fine to just strip these tags completely. Line by line checks seem to be ok as I've never seen then span multiple lines.Bgwhite I fixed everything in the two given lists. -- Magioladitis ( talk) 22:01, 27 February 2014 (UTC)
A<sup>-1</sub> normal text.
into A<sup>-1 normal text.</sup>
, discarding the </sub>
and fixing things by adding a </sup>
at the end of the line. You can see the effect at
Divergent series in the Zeta function regularization section at the end.--
Salix alba (
talk):
23:16, 27 February 2014 (UTC)<sub>
as #98 and <sup>
as #99.
Magioladitis, can you do a bot run to fix the mismatched tags now or will it better to wait till a fix is put into AWB? I'll get you the lists if you can do it now.
Bgwhite (
talk)
00:10, 28 February 2014 (UTC)Bgwhite how is AWB supposed to fix this? In casse of mixed tags (for instance <sup>50</sub>) how do we know which is the correct one? -- Magioladitis ( talk) 06:56, 28 February 2014 (UTC)
Bgwhite rev 9957 added fix for bad sup/sub tags. -- Magioladitis ( talk) 06:57, 28 February 2014 (UTC)
<sup>([^<]*)</sub>
→ <sup>$1</sup>
and similar for <sub>
. So far its 174 edits without problems.--
Salix alba (
talk):
08:03, 1 March 2014 (UTC)
Bgwhite rev 9958 added fix for bad center tags. We already had fix for bad small tags. -- Magioladitis ( talk) 22:41, 28 February 2014 (UTC)
Bgwhite, Rjwilmsi alerts for unclosed <math>, <source>, <ref>, <code>, <nowiki>, <small>, <pre> or <gallery> tags and comments. Should we update it for sub/sup tags? -- Magioladitis ( talk) 22:54, 28 February 2014 (UTC)
rev 9959 to fix more of <sup/>, </sup/> etc. -- Magioladitis ( talk) 09:37, 1 March 2014 (UTC)
Done
Hi, it seems that #3 detects a lot of false positives: 179 pages were detected during tonight scan, and when I checked the first 4 articles (
fr:Abdallah Naaman,
fr:Adda Daouéni,
fr:Adrien de Pauger,
fr:Agriculture étrusque), they all had a <references />
through {{
references}} (which is one of the templates for references). --
NicoV (
Talk on frwiki)
01:48, 13 March 2014 (UTC)
Resolved
Yobot keeps on changing "Related topics" to "See also"...sorry, Related topics isn't wrong and no policy discourages the use of that section title, no matter how many times Yobot persists to change it.-- ColonelHenry ( talk) 18:54, 24 March 2014 (UTC)
Done
Hi, I saw that - at least for the German WP - there's a huge list of ID#84. But on virtually all sites this is because of captions that are comment by <-- and --> Problem is that often the author did not put the opening commentary-tag in the same line as the caption or that he comment multiple captions thus the second and so on are missing "their" opening tag. See any chances to get a workaround for that? -- StreifiGreif ( talk) 17:37, 7 March 2014 (UTC)
Done
Hi, should we detect #48 (internal links to the title) when they are inside <includeonly>...</includeonly>
tags ? On frwiki, all articles in
fr:Catégorie:Effectif actuel de franchise de la LNH are included in other articles, so they have a link to themselves inside a <includeonly>...</includeonly>
. --
NicoV (
Talk on frwiki)
08:43, 13 April 2014 (UTC)
Done. Bgwhite ( talk) 21:21, 18 April 2014 (UTC)
Resolved
Why is #81 off for enwp, has there been a discussion in the past which I was not a part of or...why? ( t) Josve05a ( c) 00:01, 15 April 2014 (UTC)
Done
Hi, it seems that
#67 is detected only when there's no whitespace characters between the punctuation and the reference. It would be better if . <ref
was also detected. --
NicoV (
Talk on frwiki)
09:44, 16 April 2014 (UTC)
Resolved
A user used WP:WCW to fix a spelling and punctuation mistake in an article:
I was the next one to edit the article and made completely separate edits for content, yet the previous edits noted above were automatically reversed:
I was curious if anybody knows why this happened, has it happened elsewhere, and if there is something that can be done to fix it for users that employ this tool. Thanks. Wondering55 ( talk) 20:57, 16 April 2014 (UTC)
Resolved
The link in the tab that says WMFLabs at the top of this page is not working. it brings me to an 'Internal error'-page. ( t) Josve05a ( c) 21:27, 16 April 2014 (UTC)
Done
Hi, it seems that #54 detects false positives when the list element ends with a br followed by <math>...</math>
. The math tags are probably removed before analyzing.
Example on fr:Action de groupe (mathématiques):
**[[Théorème de Cayley|par translations à gauche]] ; cette action est [[#Action simplement transitive|simplement transitive]], c'est-à-dire [[#Action libre|libre]] et [[#Action transitive|transitive]] :<br /><math>G \times G \rightarrow G,\ (g,x) \mapsto gx</math>
Maybe, rather than removing math tags, just remove the contents of the math tags? -- NicoV ( Talk on frwiki) 04:32, 19 April 2014 (UTC)
Resolved
Not sure whether I'm allowed to change "Wikipedia:WikiProject Check Wikipedia/Participants" by myself. Therefore, I'm requesting...please add me to the "Participants" list on "Wikipedia:WikiProject Check Wikipedia". Thanks.
--
LukasMatt (
talk)
05:11, 22 April 2014 (UTC)
Resolved
Please, don't hit me ! ;-)
I spent quite some time in the last weeks to fix the ISBN errors reported by CW on frwiki, and I thought I had almost finished, but I found a whole bunch of articles that don't seem to be reported. For example, fr:Pont-canal de l'Argent-Double which I fixed today wasn't reported. I'm not entirely sure, because someone may have marked the article as fixed without fixing it... Do you have an easy way to check if the previous version was detected by #69 ?
Done
Sorry to bother you again... I was wondering why there was (almost) never errors detected for #1 on frwiki, so I looked at the code: apparently only {{template:
is detected, and not the localized names for template (like {{modèle:
). --
NicoV (
Talk on frwiki)
08:06, 22 April 2014 (UTC)
Is it possible to detect how many articles has been marked as 'done' using WPC? It could be "fun" to see. ( t) Josve05a ( c) 16:40, 19 April 2014 (UTC)
Done
Hi, what HTML named characters are excluded from the search in #11? I figure dagger, emdash and endash are excluded because they got their own error. But, are there other characters excluded? (like nbsp, emsp, ...). -- NicoV ( Talk on frwiki) 11:22, 13 April 2014 (UTC)
@ Bgwhite: @ Magioladitis: I tried to go through the list of existing HTML named entities to see which ones should be reported. What do you think of this list ? (I took the current list, added what seemed reasonable, and then removed the ones that are excluded by AWB.) -- NicoV ( Talk on frwiki) 23:00, 14 April 2014 (UTC)
Done. Updated list is now in checkwiki. Bgwhite ( talk) 21:21, 18 April 2014 (UTC)
@ NicoV and Bgwhite: Now I recall we discontinued this error. There were complains that html entities should not change especially in pages about math where math formulas are allowed not only in math tags but also in plain text. This is the reason AWB skips unicodification in pages with math tags. -- Magioladitis ( talk) 17:13, 20 April 2014 (UTC)
<math>
or {{
math}}?
Bgwhite (
talk)
22:30, 21 April 2014 (UTC)
Done
Hi, it would be nice to have the "notice" column filled for #94 (like the text just before the isolated closing ref tag). I'm trying to fix them on frwiki, and when WPCleaner doesn't find the problem I don't know if it has been fixed since it has been detected or if there's a discrepancy between WPCleaner and CheckWiki script. -- NicoV ( Talk on frwiki) 21:51, 2 April 2014 (UTC)
‡Hereford United deducted 3 points for fielding an unregistered player.</ref>[1]
Greeting, wiki checkers!!
I plan to propose a GSOC project through Wikimedia this year, based around the idea of Parsoid-based online-detection of broken wikitext. The original idea of the project is defined here, Which is to develop a tool that will use parsoid to fix broken wikitext found while parsing wiki pages and then develop a user interface for editors to fix broken wikitext. But after few discussions on the project with the parsoid team, We found out that we already have tool Check Wikipedia. But it lacks the fixup information that parsoid generates while parsing wiki pages. So through my GSOC project we plan to integrate this information with your tool.
After having discussions with parsoid devs, I have written an application draft under my username GSOC Application 2014. I would be really thankful, if I get some feedback and we can have some discussion on the same. Hardik95 ( talk) 21:30, 14 March 2014 (UTC)
Hi! It seems that now CheckWiki works parallel on 2 servers: toolserver.org and tools.wmflabs.org, and they are using:
Different language communities use different servers, but they translate the same descriptions, which do not always fit to the logic. It seems to be a problem.
So, e.g., error 042 searches errors with incorrect <small> tags on the one server and <strike> tags on the other. But they take description of the error from the same page, which should be translated from enwiki translation page. Another example is error 089, etc.
(I am from eowiki.) Yurij Karcev ( talk) 06:38, 14 March 2014 (UTC)
It was suggested to exclude all pages where adding DEFAULTSORT doesn't make a difference. Redirects are an example. If a page neither
[[Category:Ä]]
requires DEFAULTSORT but [[Category:Ä|A]]
does not)it can be skipped. The following line of code should do that (again, not tested). -- TMg 20:24, 20 January 2014 (UTC)
if ( index( $text, '{{' ) >= 0 or $text =~ /\[\[($cat_regex):[^[|\]]+\]\]/i ) {
# Do the check
}
Can you write the errors on the talk page of the appropriate article? Because in many cases the author of an article watches it and then can correct the ISBN. -- Tsor ( talk) 19:38, 14 January 2014 (UTC)
Since yesterday I cannot mark articles as "Done". Leads to an error message. -- Tsor ( talk) 10:45, 16 January 2014 (UTC)
{{U|Ts
Could not connect to database: Can't connect to MySQL server on 'tools-db' (111)
. (
t)
Josve05a (
c)
14:48, 16 January 2014 (UTC)
When I fix the error 16 on arwiki is just fix about 5% of all list, I try with WCP and AWB, where the problem. -- Zaher talk 13:42, 28 November 2013 (UTC)
Hi, I've made a lot of improvements in WPCleaner to help fixing ISBN errors #69, #70, #71, #72 and #73 (which account for about 10k errors for enwiki). Some of this improvements require configuration in WPCleaner configuration file or Check Wiki configuration file.
isbn=
), possibility to search in several web sites using an other parameter of the template (for example the title). This is configurable in
general_isbn_search_engines_templates, with no default configuration as it depends on the templates of the wiki. Example available in
frwiki configuration.If you have other ideas on how to help fixing those errors, I'm quite interested. -- NicoV ( Talk on frwiki) 23:21, 19 November 2013 (UTC)
Resolved
I object to a blanket replacement of HTML entities with the corresponding Unicode character on the basis of source code readability. The Wikipedia editor lacks any mechanism to identify the character at the cursor location. Also, the editor can direct the editor to use a variety of different fonts, and the casual editor probably does not know what font is in use. Thus there are many similar characters, such as −, -, – A, Α, Η, K, Κ, N, and Ν. When these are present in the source as Unicode rather than HTML entities it is difficult for editors to know which is which. Jc3s5h ( talk) 14:28, 5 May 2014 (UTC)
<math />
or {{
math}}. When working in manual mode, no automatic replacement is done, just a suggestion to replace them by their Unicode character. When working in bot mode, automatic replacement (not sure if I should keep this). --
NicoV (
Talk on frwiki)
13:24, 6 May 2014 (UTC)I insist these bots comply with MOS:MARKUP. Jc3s5h ( talk) 13:40, 6 May 2014 (UTC)
Done
Hi, with the last dump on frwiki, I see that several articles are detected by #67 but it's a <references>...</references>
not a <ref>...</ref>
... (
fr:2 février,
fr:23 février, ...). Maybe only detect if there's no letter after ref (white space, ">", ...) ? --
NicoV (
Talk on frwiki)
08:44, 6 May 2014 (UTC)
Done
For Homepage → enwiki → High priority (and all and middle and low), would you please make the "ID" column sortable?
--
LukasMatt (
talk)
07:17, 3 May 2014 (UTC)
Resolved
Hi, are multiple <ref>...</ref>
tags separated by commas (or other punctuations) detected by #61 or #67: like <ref>...</ref>
,<ref>...</ref>
? If not, it may be useful to create a new error for that, because on many wiki, references should not be separated by normal punctuation, but rather by things like
fr:Modèle:,. --
NicoV (
Talk on frwiki)
12:51, 12 May 2014 (UTC)
Not done
Hi, when fixing ISBN in frwiki, I found a few cases where the same ISBN was defined several times in one ISBN template: one time with the "-" separators, one time without. Do you think we should create a new error for this? -- NicoV ( Talk on frwiki) 09:57, 15 May 2014 (UTC)
Please note: This is an updated version of a previous post that I made.
Hi all,
My name is Adi Khajuria and I am helping out with Wikimania 2014 in London.
One of our initiatives is to create leaflets to increase the discoverability of various wikimedia projects, and showcase the breadth of activity within wikimedia. Any kind of project can have a physical paper leaflet designed - for free - as a tool to help recruit new contributors. These leaflets will be printed at Wikimania 2014, and the designs can be re-used in the future at other events and locations.
This is particularly aimed at highlighting less discoverable but successful projects, e.g:
• Active Wikiprojects: Wikiproject Medicine, WikiProject Video Games, Wikiproject Film
• Tech projects/Tools, which may be looking for either users or developers.
• Less known major projects: Wikinews, Wikidata, Wikivoyage, etc.
• Wiki Loves Parliaments, Wiki Loves Monuments, Wiki Loves ____
• Wikimedia thematic organisations, Wikiwomen’s Collaborative, The Signpost
The deadline for submissions is 1st July 2014
For more information or to sign up for one for your project, go to:
Project leaflets
Adikhajuria (
talk)
12:43, 25 June 2014 (UTC)
Are you looking to recruit more contributors to your project?
We are offering to design and print physical paper leaflets to be distributed at Wikimania 2014 for all projects that apply.
For more information, click the link below.
Project leaflets
Adikhajuria (
talk)
14:57, 22 May 2014 (UTC)
Adikhajuria Bgwhite I would be interested on that. -- Magioladitis ( talk) 17:30, 12 June 2014 (UTC)
Not possible - Wrong forum
Why this edit was claimed as a CHECKWIKI fix? Near as I can see - it moved the authorlink parameter from next to the author to later in the reference template and removed a space. This doesn't look like any sort of error to me.... and I really prefer to see authorlinks near the author parameter - makes more sense. I also like the space - there is no rule that it shouldn't exist and it makes it easier to edit and tell sections of templates. Ealdgyth - Talk 12:30, 14 May 2014 (UTC)
Moin Moin Bgwhite and NicoV, since this evening I got to see "404 Not Found" for the script https://tools.wmflabs.org/checkwiki/cgi-bin/checkwiki.cgi is there something wrong this evening? Regards -- Crazy1880 ( talk) 17:30, 3 June 2014 (UTC)
ca:Rent (musical) gives a false positive for issue #72 because of a URL which contains the string "/qisbn=1164910567/". Can you please check on it? -- Joutbis ( talk) 18:32, 14 July 2014 (UTC)
Is the old interface gone for good? If so, how come errors #30 and #79 don't get flagged in the new one? -- Joutbis ( talk) 18:37, 14 July 2014 (UTC)
Done
I suggest that we add "u00a0" (invisible nbsp) in the list of invisible unicode characters. -- Magioladitis ( talk) 06:53, 2 August 2014 (UTC)
Done Bgwhite ( talk) 07:42, 22 August 2014 (UTC)
Not possible - Wrong forum
I was linked her by es, but the word "hard space" (1970–1991?) does not appear on the page. Any serious (AWB) es should specify by Unicode, and maybe HTML entity when needed. - DePiep ( talk) 20:45, 26 July 2014 (UTC)
Hi, I use to fix ISBN codes listed in the itwiki page of the high priorities. Unfortunately, the preceding page of the toolserver was daily updated, while this new page seems not. Am I wrong? Or....? Thanks. -- Er Cicero ( talk) 21:38, 6 August 2014 (UTC)
Hi,
Don't worry, not a request for more work to do, just an announcement to make. I'm happy to announce WPCleaner v1.32, with the main addition being the ability to add/update/remove a warning about ISBN errors (#70, #71, #72, #73) on article talk page. This can work either on a given article (from the full analysis window), or on a big bunch of articles as a bot tool (members of Category:Pages with ISBN errors, articles listed in #70-73, articles with the warning on their talk page).
Some configuration is required before being able to use it on a wiki. I've configured it for frwiki, and used it this weekend :
With the addition of the automatic detection of ISBN errors in cite templates on frwiki, I hope that it will help reduce the number of ISBN errors.
If you wish to configure this for an other wiki, please check what WPC is doing on one article before trying the bot tool on large scale. -- NicoV ( Talk on frwiki) 21:28, 27 April 2014 (UTC)
Given that I was just working on ISBN errors last night, I feel entitled to spout my two halers worth...
On the page "→ Homepage → enwiki → middle priority → ISBN with wrong length", I wish the table contained an additional indication if the error occurs multiple times in the article. Surely, if the script can find the error once in an article, it can also find the error more than once and tell us rather that hording such information for itself.
--
LukasMatt (
talk)
01:48, 29 April 2014 (UTC)
Thanks, NicoV. One more request, please. In "→ Homepage → enwiki → middle priority → ISBN with wrong length", instead of only showing 25 articles per page, can we have something like
-- LukasMatt ( talk) 12:33, 29 April 2014 (UTC)
&limit=50
to the URL like
https://tools.wmflabs.org/checkwiki/cgi-bin/checkwiki.cgi?project=frwiki&view=only&id=12&limit=50 --
NicoV (
Talk on frwiki)
13:43, 29 April 2014 (UTC)
"List of all ISBN errors" is not going to happen. That information isn't stored in the database by design.
As for "View (previous 50) (next 50)", that is a good idea. Will add it to the list of things to do.
Bgwhite (
talk)
16:48, 29 April 2014 (UTC)
@ NicoV: I am very interested in this feature, thanks for it! Will be working on assimilating this with cswiki. Matěj Suchánek ( talk | cont.) 15:06, 30 April 2014 (UTC)
Done
Copied from the section "Showing ISBN errors to other editors"
Thanks, NicoV. One more request, please. In "→ Homepage → enwiki → middle priority → ISBN with wrong length", instead of only showing 25 articles per page, can we have something like
-- LukasMatt ( talk) 12:33, 29 April 2014 (UTC)
Bgwhite, would it be possible to do the same for the list of "done" articles ? Thanks -- NicoV ( Talk on frwiki) 09:43, 25 May 2014 (UTC)
Done
Moin Moin @ Bgwhite:, since today there is a problem with "more" in every ID. If an article has an special character you couldn't open "more". If there is no special character, there is no problem. Tip: Is this a Bug from #Homepage → enwiki? Regards -- Crazy1880 ( talk) 08:41, 10 May 2014 (UTC)
Moin Moin and sorry
Bgwhite and
Redrose64, but the problem is not done. Now I have the problem in every browser, that under "more" when there is a special character you couldn't click on "done" and set it as done.
And in the IE there is the problem, that I am not able to open "more" by articles with special character. Please check there again, thanks -- Crazy1880 ( talk) 05:43, 16 May 2014 (UTC)
Hi, it seems that false positives are detected when the closing ref tag is </ref >
(with the space at the end). For
Spahettification, CheckWiki reports the error being at <ref> pour une corde du même type de 8 m
. --
NicoV (
Talk on frwiki)
05:27, 10 July 2014 (UTC)
I did not remember that but AWB fixes the spacing inside close reg tag! -- Magioladitis ( talk) 07:52, 10 July 2014 (UTC)
Hi, on frwiki,
fr:Fièvre hémorragique Ebola is detected with the following notice </ref>. | width = 225 | icd1
. The notice is related to text in the infobox, but I don't see any problem there: there's a opening ref tag before. --
NicoV (
Talk on frwiki)
16:36, 22 July 2014 (UTC)
Hi
Bgwhite,
fr:Fièvre hémorragique Ebola is popping up almost daily, and there's also a false positive with
fr:Multiplicateur de tension, with the following notice <ref name="yuan">{{Harvnb|Yuan|2010|pp=1
, where I don't see any problem. --
NicoV (
Talk on frwiki)
09:36, 8 August 2014 (UTC)
<ref name="10.1002/(SICI)1096-9071(199911)59:3<341::AID-JMV14">
. I removed the offending <. Now for the sad part. AWB did pick up the error and the correct spot. Crap.Hi, I just found out that there were several Check Wiki main pages:
-- NicoV ( Talk on frwiki) 08:13, 14 August 2014 (UTC)
Done
Hi, when clicking on "Done", the list is displayed again and at the beginning of the page, there's the name of the article that has been marked as done. If this name contains accented characters, they are badly displayed. For example, in the list for #96, I clicked on Done for Liste des députés de la treizième législature par circonscription, the page is displayed with "Liste des députés de la treizième législature par circonscription" just after the Check Wikipedia title. -- NicoV ( Talk on frwiki) 12:09, 19 August 2014 (UTC)
Done
Hi, a suggestion for a prettier notice for #25 errors: instead of displaying a <br>
between the two titles, maybe put a real line break so that the two titles are one above an other. Just a suggestion to have a better display. --
NicoV (
Talk on frwiki)
22:00, 20 August 2014 (UTC)
Done
Moin Moin Bgwhite, at this morning I would like to open the Check Wikipedia an got the following massage: Cloud not connect to database: Host '10.68.17.174' is blocked because of many connection errors; unblock with 'mysqladmin flush-hosts'. Could you have a look at? Thanks -- Crazy1880 ( talk) 04:58, 21 August 2014 (UTC)
Down again... -- NicoV ( Talk on frwiki) 07:11, 23 August 2014 (UTC)
This edit [8] breaks the formatting, because (contrary to popular belief) a blank line is not always equivalent to <p>. Please fix your tools to operate only where you understand the effects of what you're doing and, ideally, stop "fixing" things that aren't broken in pursuit of some perfectionist ideal of what markup should look like. Thanks. EEng ( talk) 00:53, 8 August 2014 (UTC)
Resolved
Hello. I've spent some time fixing ISBN errors and came here as a result of the relocation of Wikipedia:WikiProject_Check_Wikipedia/ISBN_errors. Looking at Wikipedia:WikiProject_Check_Wikipedia/List_of_errors I'm a bit worried to see "ISBN with wrong checksum" marked as "Fixed in all cases" by WPC. This sounds like a tool "fixing" ISBNs that fail the checksum test by blindly applying a recalculated checksum. I would expect this to be the wrong action about 90% of the time. Hopefully I've misunderstood. Could someone please clarify what is actually going on? TuxLibNit ( talk) 19:10, 30 August 2014 (UTC)
Done
Would you please active fa translation? I want to start translating this tool in Farsi but it doesn't have any page for farsi Yamaha5 ( talk) 05:26, 11 July 2014 (UTC)
Resolved
It seems that people keep trying to correct this error on an article I've formatted that intentionally uses an HTML quirk to have one end tag closing off two start tags so one of the start tags can be removed at a later date to display some other text (effectively <!-- foo <!-- bar -->). People keep closing off the first tag at the wrong point because it appears to be unpaired when HTML ignores any open tags in between a pair of tags. The results are here, where if you scroll down to the bottom you see that content that would have been hidden is now displayed because of the "correction". I am tired of having to re-fix these pages because people use semi-automated tools to correct this false positive. I've even had to put "There is no need for another closing comment tag" into the hidden text to jump out at people who constantly break the page but no one notices.— Ryūlóng ( 琉竜) 14:14, 29 August 2014 (UTC)
Resolved
Please update the arwiki Last scanned dump 2014-04-07 (80 days old). -- Zaher talk 23:19, 26 June 2014 (UTC)
Done
Hi! I can't find where are double small tags here. There are 90k entries so I thought it's something in a template but I haven't found anything. Thanks for your help! -- AlessioMela ( talk) 08:40, 1 July 2014 (UTC)
People here might be interested in the thread Wikipedia:Village_pump_(technical)#Parsoid_Based_Linter.-- Salix alba ( talk): 02:38, 9 July 2014 (UTC)
-- Magioladitis ( talk) 20:47, 19 August 2014 (UTC)
Hi,
fr:Élément meta is reported by #92 with the notice "=== L'attribut ===". It seems that it's because there are several titles in the form L'attribut <code>something</code>
. I think contents of <code>...</code>
should be kept for analyzing #92. --
NicoV (
Talk on frwiki)
10:36, 14 August 2014 (UTC)
Done
Hi, I saw in CW main page that for frwikiversity links to project page and translation page are pointing to frwiki. There's a project page and a translation page, but I'm not sure if they're correct (I will try to update the translation page using what's in frwiki). -- NicoV ( Talk on frwiki) 09:21, 25 August 2014 (UTC)
Done
Hi, like in the past update I can't find double tag small in those 90k articles. -- AlessioMela ( talk) 17:54, 26 August 2014 (UTC)
The WikiProject Report would like to focus on WikiProject Check Wikipedia for a Signpost article. This is an excellent opportunity to draw attention to your efforts and attract new members to the project. Would you be willing to participate in an interview? If so, here are the questions for the interview. Just add your response below each question and feel free to skip any questions that you don't feel comfortable answering. Multiple editors will have an opportunity to respond to the interview questions, so be sure to sign your answers. If you know anyone else who would like to participate in the interview, please share this with them. Thanks, Rcsprinter123 (constabulary) @ 08:38, 29 August 2014 (UTC)
Discussion in User_talk:Frietjes#Infoboxes_to_take_of revealed that most probably Error #31 needs expansion to cover more HTML table tags. -- Magioladitis ( talk) 22:45, 31 May 2014 (UTC)
<table
. There are legitimate cases where <td>
can be used. Will first check the upcoming June dump file to see the lay of the land for tr and td tags.
Bgwhite (
talk)
06:47, 1 June 2014 (UTC)
<tr>
. I do expect articles to go onto the whitelist. A listing of articles can be found at
User:Bgwhite/Sandbox1.
Bgwhite (
talk)
00:25, 16 September 2014 (UTC)Hello! I'd like to propose to detect a new error type: sometimes there are an in-page interlanguage links written as a regular interlanguage links, i.e. without a starting colon. But they are obviously in-page links since they contain a pipe symbol. For example, this situation was on a page 男同性恋免疫缺乏症 of Chinese Wiki (I don't know such examples in En.Wiki), which contained two such links: [[en:Kaposi's sarcoma|卡波西氏肉瘤]] and [[en:Pneumocystis pneumonia|卡氏肺囊虫肺炎]]. A link part after the pipe symbol is obviously useless for the regular interwikis and this situation is undoubted error. -- Emaus ( talk) 14:35, 2 June 2014 (UTC)
@ Bgwhite and NicoV: [[[[foo]]]] is caught as #64 by CHECKWIKI but as #10 by WPCleaner. It is not fixed by AWB. -- Magioladitis ( talk) 06:51, 18 June 2014 (UTC)
OK. I am getting rusty. Sorry again. This one show that AWB did not fix 64. but this is maybe due to the order of how stuff is done. Same here. -- Magioladitis ( talk) 13:14, 20 June 2014 (UTC)
@ Bgwhite: After the last dump I realised that the whitelist for #48 never works. Same for the #101 whitelist. -- Magioladitis ( talk) 08:09, 18 June 2014 (UTC)
@
Bgwhite: Error 24 whitelist does not work. --
Magioladitis (
talk)
08:46, 21 September 2014 (UTC)
@ Bgwhite: Error 31 and 49 whitelists do not work. -- Magioladitis ( talk) 09:59, 21 September 2014 (UTC)
Done
We should exclude anything inside timeline tags. -- Magioladitis ( talk) 07:10, 19 June 2014 (UTC)
Done
We should exclude search inside {{ Not a typo}}. -- Magioladitis ( talk) 07:49, 20 June 2014 (UTC)
Hi @ Bgwhite:, I was wondering if we could enhance the integration between Check Wiki and tools like WPCleaner, by providing access to the direct analysis of an article in Check Wiki: I'd like to be able to send a request to Check Wiki script checkwiki_bots.cgi (with the following parameters: wiki, article title, article text) and receive an answer telling me which errors are still detected and where (character position ?). I don't know how much work that would be on your side, but that could be very helpful to users when WPCleaner doesn't detect the problem CW detected: we would know if CW thinks that the problem is still present and where, so I could tell the user where it is on their current version of the article. -- NicoV ( Talk on frwiki) 20:01, 10 August 2014 (UTC)
I can see that this wikiproject uses scripts and tools to assist work of the participants. I have a feeling that (usually) routinely done tasks are to be done server-side instead. What wiki software features would ease this work? Gryllida ( talk) 04:13, 17 September 2014 (UTC)
Hello there! As you may already know, most WikiProjects here on Wikipedia struggle to stay active after they've been founded. I believe there is a lot of potential for WikiProjects to facilitate collaboration across subject areas, so I have submitted a grant proposal with the Wikimedia Foundation for the "WikiProject X" project. WikiProject X will study what makes WikiProjects succeed in retaining editors and then design a prototype WikiProject system that will recruit contributors to WikiProjects and help them run effectively. Please review the proposal here and leave feedback. If you have any questions, you can ask on the proposal page or leave a message on my talk page. Thank you for your time! (Also, sorry about the posting mistake earlier. If someone already moved my message to the talk page, feel free to remove this posting.) Harej ( talk) 22:47, 1 October 2014 (UTC)
For the last few days Check Wikipedia reports no errors at all at the Polish Wikipedia. Please have a look. ToSter ( talk) 12:47, 16 October 2014 (UTC)
I saw a bot correction of a citation I posted the other day, and the edit summary referred me here to the description of error number 48, title linked in text. But the cite template documentation says that the title of a source can be wikilinked to an existing Wikipedia article, as I attempted to do. Did I throw the error with my citation because the span of text wikilinked was no letter-for-letter identical with the title of the book in the template title field? If so, I can fix the problem by setting up a redirect to the article. The citation I put in new articles the other day is shown here (the raw mark-up of this question in edit mode will show exactly how I coded the template).
Flynn, James R. (2009).
What Is Intelligence?: Beyond the Flynn Effect (expanded paperback ed.). Cambridge:
Cambridge University Press.
ISBN
978-0-521-74147-7. {{
cite book}}
: Unknown parameter |laydate=
ignored (
help); Unknown parameter |laysummary=
ignored (
help)
Thanks for any advice you have about this. -- WeijiBaikeBianji ( talk, how I edit) 18:06, 8 October 2014 (UTC)
NicoV Magioladitis After looking at some of the articles in a list of #39 errors not fixed by a bot, I've noticed some "false positives". I use quotation marks because it is actually errors with mediawiki that is causing the problem.
Newlines don't function in <blockquote>
, {{
quote}}, {{
cquote}} and {{
quotation}}. I have the checkwiki code skip these for error #39. After looking at the new list of articles, <ref>
, [[Image: and {{
bq}} also don't work.
<skip several hours>
I have the bug bookmarked and brought it up. Low and behold, the patch that was submitted in December 2011 was finally accepted. Final changes were made today on enwiki. Turns out Visual Editor was assuming newlines worked the same everywhere... silly VE. So, VE started the move to finally fix the problem. Hey, who knew, VE was actually helpful for the first time ever. According to the log, it only took 8 1/2 years to fix.
I've verified that {{
quote}}, {{
cquote}} and {{
quotation}}, <blockquote>
and {{
bq}} now treat newlines correctly.
I've verified that <ref>
and [[Image: still barfs on newlines.
I need to add the ref and various image tags to #39's code and remove the currently skipped templates in #39's code. Bgwhite ( talk) 05:22, 16 October 2013 (UTC)
a
b
c
We can re-enable search inside <blockquote>
since bug fixed. --
Magioladitis (
talk)
23:49, 15 September 2014 (UTC)
Time to start thinking about what new errors should be added to Checkwiki.
Ping: Magioladitis, NicoV, Meno25, Crazy1880, LindsayH, GoingBatty, Matěj Suchánek, Josve05a, ChrisGualtieri, Graham87. I think that is everybody. If not, add them to the list.
What should or should not be added will be determined by several factors:
Some examples:
<strike>
with <s>
. It would take a copy/paste to code up. WPCleaner finds and fixes the problem. It would be Low priority.Bgwhite ( talk) 01:34, 26 November 2013 (UTC)
A few suggestions:
The errors I suggested are covered by the Database reports on English Wikipedia. Database reports are updated regularly only on enwiki, Commons and Meta. Moving the errors to checkwiki means that the reports would get generated for other wikis too. So, maybe disable those errors for enwiki and enable them for other wikis. -- Meno25 ( talk) 06:43, 26 November 2013 (UTC)
CHECKWIKI is more about common syntax errors. We need to focus on that. If lists are already generated by other bots/projects we do not need to duplicate the job. Bgwhite's idea of unspaced DEFAULTSORT is a great example of what we are after. WPC's extended list is another good example. I have some minor suggestions:
I don't know if this is an error or maybe already monitored but:
{{
cite web}}
without access dates.<ref>http://exemple.com/</ref>
is used without title/description. This is to prevent link rot.|accessdate=
.( t) Josve05a ( c) 11:52, 26 November 2013 (UTC)
Hi, I think new errors should be generic enough to work on most wikis, so avoid very specific errors (for example: {{
cite web}}
without access dates should be dealt by the template itself: put the page in a maintenance category if access dates are missing). Otherwise, some of WPCleaner errors in the #5xx numbers:
{{Template:...}}
(low)<strike>...</strike>
<a>...</a>
Some of them are probably hard to develop or require access to a lot more information, so they will be difficult to add (non-existent templates / files, ...) -- NicoV ( Talk on frwiki) 12:57, 26 November 2013 (UTC)
A few more:
-( t) Josve05a ( c) 16:12, 26 November 2013 (UTC)
Ping: Magioladitis, NicoV, Meno25, Crazy1880, LindsayH, GoingBatty, Matěj Suchánek, Josve05a, ChrisGualtieri, Graham87.
Following is a list of errors that I think could be added. Some notes:
Description | Priority | Coding | Tools to detect | Tools to fix | Other |
---|---|---|---|---|---|
Useless "Template" in {{Template:...}} | low | Done | WPC, AWB | WPC, AWB | #1 (#502) |
Internal link written as an external link | medium | Done | WPC | WPC & Frescobot | #90 (#511) |
Interwiki link written as an external link | low | Done | WPC | WPC | #91 (#512) |
Internal link inside an external link | medium | WPC (#513) | WPC | ||
<strike>...</strike>
|
low | Done | WPC, AWB | WPC, AWB* | #42 (#517). Obsolete in HTML5. Use <s>...</s> instead
|
<a>...</a>
|
low | Done | WPC | WPC | #4 (#519) |
URL without http:// | high | Done | WPC, AWB | WPC, AWB | #62 |
Finding cases of url= http://http:// | medium | Done | WPC, AWB | WPC, AWB | #93 |
Blank lines in bulleted vertical lists | medium | Accessibility issue per Wikipedia:Accessibility#Blocked elements | |||
Putting the TOC in the standard position | medium | Done | WPC | #96 and #97. Accessibility issue per MOS Elements of the lead | |
No blank space after the comma in DEFAULTSORT | low | Done | WPC, AWB | WPC, AWB | #89 |
Unbalanced ref tags | medium | Done | WPC, AWB | WPC, AWB | #94 |
Detecting user signatures in articles | low | Done | WPC, AWB | WPC, AWB | #95 |
Detecting fat redirects (redirects obscuring page content) | low | ||||
<span class="plainlinks"> in articles | low | ||||
Pipe in external link [http:/www.wikipedia.org|Wikipedia] | low | ||||
Link to a year which has another description ([[2012|2013]]) | low | This error is often caused by VE. | |||
Cases of {{cite web|url=http://www.wikipedia.org| title= | medium | ||||
Move anchor in front title in heading | |||||
Detect non-existent files (red linked files) | |||||
Detect non-existent templates | WPC (#508) | ||||
Detect refs <ref name=> | low | easy | often detected as #56 | ||
Category with double colon | easy | AWB | |||
More same parameters in template | medium | medium |
Magioladitis, NicoV, Meno25, GoingBatty, Matěj Suchánek, Josve05a, ChrisGualtieri
<a>
<strike>
The script found an external link that should be replaced with a interwiki link. An example would be on enwiki [http://fr.wikipedia.org/wiki/Larry Wall] should be written as [[:fr:Larry Wall]]
so it says fr.wikipedia.org in the extrnal link and not en.wikipedia.org. -(
t)
Josve05a (
c)
21:07, 24 December 2013 (UTC)Bgwhite I've updated WPCleaner (version 1.31) for the following errors for all wikis: #1 (previously #502), #4 (previously #519), #42 (previously #517), #90 (previously #511), #91 (previously #512). Still have to do: #62, #89, #93, #94. Old #62 and #89 have been disabled. -- NicoV ( Talk on frwiki) 21:51, 22 January 2014 (UTC)
[http://www.imdb.com/name/nm0403424/ Hurley on the [[Internet Movie Database]]]
to [[:imdbname:0403424|Hurley on the]][[Internet Movie Database]]]
. I see multiple issues with this. It removes the blank space, it leaves 3 bracket at the end (without the WPCleander reporing it. (Found on
Colin Hurley). (
t)
Josve05a (
c)
10:52, 23 January 2014 (UTC)
NicoV and Matt S., in theory frwiki and cswiki should start seeing the new errors at the next 0z run.... if the database is up. Today's outage was caused by a disc getting full. Bgwhite ( talk) 07:38, 24 January 2014 (UTC)
Discussion
| ||||
---|---|---|---|---|
If a website is called "www.news.de" for example something like this is valid in the German Wikipedia: <ref>www.news.de: [http://www.news.de/article Article].</ref> <ref>www.news.de: ''[http://www.news.de/article Article]''.</ref> This shouldn't be reported as an error. Would be nice to have this excluded somehow. Disabling the check would also disable the check for /(?:<ref\b[^<>]*>|url\s*=)\s*www\w*\.(?![^<>[\]{|}]*\[\w*:?\/\/)/i
-- TMg 17:10, 19 January 2014 (UTC)
|
Resolved
Hi, on frwiki, there are 5 false positives for #2:
-- NicoV ( Talk on frwiki) 13:37, 17 November 2014 (UTC)
It seems to be happening again on frwiki ( fr:Antihéros, fr:Insulte, ...) but I don't find anything wrong in the articles, even somewhere else. -- NicoV ( Talk on frwiki) 09:21, 5 April 2014 (UTC)
Hi, I know that you're always looking for more work since it's so easy to use Labs ;-)
I'd like to suggest adding some statistics for Check Wiki to give us some information on how errors evolve on each wiki. Would it be possible to add a table with the following informations ?
-- NicoV ( Talk on frwiki) 10:21, 6 November 2013 (UTC)
Please include pages in namespace "ملحق" (NS:104) on Arabic Wikipedia (arwiki) in the lists generated by Checkwiki script. This namespace contains lists and years pages. Pages in that namespace are counted in the number of articles (magic word: {{NUMBEROFARTICLES}}) and AWB's Auto-Tagger already tags articles in that namespace. -- Meno25 ( talk) 12:11, 23 November 2013 (UTC)
Hi all. In Demons (novel) the section headed "Characters" employs paragraphs within a bulleted list. This has been coded per the advice given here, but Yobot (and, I think, other AWB-based robots) persists in making "corrections": [11] [12] [13] [14] [15] [16] and so on. Aside from destroying the logical structure of the section, this is also contrary to accessibility guidelines.
I note that the detection of error #39 has already been modified to accept the use of <p>
s within certain tags, such as <blockquote>
. Can this tolerance be extended to include <p>
s within lists?
(I was uncertain whether to raise this concern here, with Yobot, or with AWB. If I've chosen the wrong place, could you please let me know, and I'll try again.) In the meantime, thanks for your collective good work with checkwiki: fighting the good fight, and at scale! — Simon the Likable ( talk) 13:49, 10 February 2014 (UTC)
<p>
tags, with the : it appears as an one item list to a screen reader.
Bgwhite (
talk)
07:12, 11 February 2014 (UTC)
<p>
s), but have also taken on board
Graham87's point and removed the blank lines between list items. Thus, I think the
current version covers both visual and accessibility requirements, and follows recommended coding practices in
Help:List#Paragraphs_in_lists and now
WP:LISTGAP.<p>
s within lists? (Or perhaps there is some other solution?) —
Simon the Likable (
talk)
13:59, 11 February 2014 (UTC)
<li>
tags. If a blank line happens, the list ends. In Magioladitis' version, it starts as a list. When the first : happens, the list is ended. The HTML tags to produce the layout for the : consists of <dl>
and <dt>
tags. The use of the dl and dt tags is standard HTML practice when text needs varying indentation. The source for this talk page is full of dl and dt tags.
Bgwhite (
talk)
06:23, 12 February 2014 (UTC)<p>
error in the article.
Bgwhite (
talk)
06:23, 12 February 2014 (UTC)Bgwhite thanks to Frietjes we found a wonderful workaround called {{ paragraph break}}. -- Magioladitis ( talk) 08:30, 24 January 2015 (UTC)
Is the code (or list of regular expressions) available? I believe I could suggest some improvements for cutting down on false positives and/or the number of whitelisted articles for some of the lists. Frietjes ( talk) 15:35, 17 October 2014 (UTC)
$test_text =~ s/\{\{\{\|safesubst:\}\}\}//g;
<tr
' to '<tr[^a-z]
' in error_031_html_table_elements which would avoid matching '<transcript>
' and other non-table tags that start with tr.
Frietjes (
talk)
16:34, 17 October 2014 (UTC){{{|safesubst:}}}
, which is suboptimal :( I suppose the better thing would be to fix
Module:RfD, but it seems as though there was a logical reason for
adding it there. not sure if there is any other solution, but we shall see. it would be a shame to have to resort to such hacks since, technically, {{{|safesubst:}}}
is a programming element.
Frietjes (
talk)
21:08, 17 October 2014 (UTC)Frietjes, Bgwhite RfD changed the code used. Hopefully, this resolves are problem. -- Magioladitis ( talk) 08:32, 24 January 2015 (UTC)
Hi, with the latest full dump, there seems to be a lot of false positives for #87 (HTML entities without ;). Examples from the 25 first pages reported:
&intr
),
fr:Association malienne des droits de l'homme (&intr
),
fr:California Love (&interval
),&geocode
)<ref>...</ref>
tag:
fr:Avahi (&geissmann2000
),
fr:Avahi du Sambirano (&geissmann2000
),
fr:Ayurveda (&Rhodes
),
fr:Baryonyx (&Newsbury2004
),
fr:Biochar (&Lehmann2008
),
fr:Caraka Saṃhitā (&Rhodes
),
fr:Carnotaurus (&chiarelli2009
)<timeline>...</timeline>
tag:
fr:Canton de Steenvoorde (id:Blancs&Nuls
)&CentralTower
)&phis;
)&Gem
, probably matching &ge
),
fr:Aldo Cibic (&Partners
, probably matching &part
)-- NicoV ( Talk on frwiki) 20:54, 21 July 2014 (UTC)
how about a check for this? Frietjes ( talk) 16:27, 10 November 2014 (UTC)
<ref >
(changed to <ref>
by BG19bot) is not an error and should not be changed. Whitespace is permissible here and even has advantages, as giving word wrap a safe place to break lines without introducing either syntactic or legibility confusion.
Andy Dingley (
talk)
14:16, 14 November 2014 (UTC)
< ref>hi</ref>
is the same thing as <ref >hi</ref>
(in either XML or in wikicode parsing) then you really do need to fix your bot.<ref >hi</ref>
is well-formed and should not be messed with by 'bots.
Andy Dingley (
talk)
22:29, 14 November 2014 (UTC)
Resolved
Hi, on frwiki, there are 5 false positives for #2:
-- NicoV ( Talk on frwiki) 13:37, 17 November 2014 (UTC)
@ NicoV: After discussion with Bgwhite CHECKWIKI now checks for the following magic words too: "BASEPAGENAME", "FULLPAGENAME", "PAGENAME", "PAGESIZE", "PROTECTIONLEVEL", "Pagename", "SUBPAGENAME", "Subpagename". -- Magioladitis ( talk) 23:48, 2 January 2015 (UTC)
Is this template something that would be useful to you guys? That is, if users were educated to flag problems with it, would it help you find currently missed errors? The TfD discussion is at Wikipedia:Templates for discussion/Log/2014 November 17#Template:Coding. Comments are welcome! —PC -XT + 06:52, 22 November 2014 (UTC)
There are many false positives for error #43 which include usage of the {{ familytree}} template - at plwiki pl:Burbonowie might be an example. That's probably because the brace '}' can be used legally as a parameter. ToSter ( talk) 20:55, 3 November 2014 (UTC)
{{(}}
and {{)}}
. (3) add a line to the checkwiki.pl script to do something like content =~ s/(\{\{[Ff]amilytree[^\{\}]*)[\{]([^\{\}])/$1{$2/g;
content =~ s/(\{\{[Ff]amilytree[^\{\}]*)[\}]([^\{\}])/$1}$2/g;
ae
, so it's an easy replacement.
Frietjes (
talk)
21:10, 5 November 2014 (UTC)
For most of the life of
Shooting of Michael Brown, we have used
list-defined references and commented out unused refs rather than removing them. The commenting technique we have consistently used is to change <ref name=...>
to <!--ref name=...>
and change </ref>
to </ref-->
. This method requires the least amount of effort. This has not been a problem until
this bot edit, which used WCW according to its editsum.
We have no problem with the change to Vox.Feds, since it was commented incorrectly to begin with. For the remaining three refs, WCW apparently "fixed" the leading ref tags, despite the fact that they were inside valid comments. This requires us to (1) notice what the bot did, and (2) then clean up after it. We wonder why this has happened for the first time since we started using this technique in August, and we would like to know what we can do to prevent it from happening again. I'm watching, so no need to ping me. ‑‑ Mandruss ☎ 08:45, 13 December 2014 (UTC)
>
. This is why the bot arrived at the page.Mandruss Hi. I used my bot account but it was a manual edit. No unclosed comment tag are fixed in bot mode. Feel free to improve. -- Magioladitis ( talk) 13:12, 13 December 2014 (UTC)
Maybe it's time to add unclosed center tags as error #102? Errors 28 and 39 reduced and we need a need game to play with. -- Magioladitis ( talk) 08:24, 3 October 2014 (UTC)
Resolved
Have a look at this page - a page without a title is reported. ToSter ( talk) 07:55, 9 November 2014 (UTC)
Resolved
I don't understand how the whitelists are handled - is there any guide on this? At plwiki, there is a whitelist for #58 but checkwiki still reports pl:Remixes 81 - 04. ToSter ( talk) 21:31, 19 November 2014 (UTC)
Resolved
The presence of empty rows, as I removed for instance here. — TheDJ ( talk • contribs) 13:58, 25 November 2014 (UTC)
class="wikitable" class="wikitable sortable"
→ class="wikitable sortable"
or style="foo1" style="foo2"
→ style="foo2"
. basically, duplicate class or style declarations where the first one is ignored due to the presence of the second.
Frietjes (
talk)
16:55, 25 November 2014 (UTC) Resolved
Hi, is it possible to run an instance of the checkwiki tool (the lists of errors) on nonWMF wiki project? We would like to catch and be able to fix errors like you do. Thanks. -- Wesalius ( talk) 07:27, 5 December 2014 (UTC)
Bgwhite Is
this dump working? Its produced with php /var/www/wiki/maintenance/dumpBackup.php \ plugin=AbstractFilter:/var/www/wiki/extensions/ActiveAbstract/AbstractFilter.php \ --current \ --report=100 \ --output=gzip:/var/www/wiki/WSdump2.gz \ --filter=namespace:NS_MAIN \ --filter=noredirect \
.
Bgwhite How did it go?-- Wesalius ( talk) 17:49, 20 December 2014 (UTC)
Resolved
@ NicoV and Bgwhite: Something is wrong. WPCleaner doesn't list any errors on svwp, even though there are. Itworks with enwp, but not with svwp. ( t) Josve05a ( c) 18:49, 13 December 2014 (UTC)