This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 10 | Archive 11 | Archive 12 | Archive 13 | Archive 14 | Archive 15 | → | Archive 20 |
https://github.com/ms609/citation-bot/pull/1165
AManWithNoPlan (
talk) 18:53, 4 January 2019 (UTC)
See also
[4].
Headbomb {
t ·
c ·
p ·
b} 17:46, 5 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1172 Once implemented it should fix this.
AManWithNoPlan (
talk) 01:17, 6 January 2019 (UTC)
Let's have something like
https://tools.wmflabs.org/citations/list.php?linksonpage=User:Headbomb/Sandbox5
This would be super useful. We could be build lists of pages with crappy citations with AWB's database scanner or with clever insource:// search (e.g. pages with raw GoogleBooks links, pages with raw DOI links, ...), then put the list of pages to be edited somewhere (e.g. User:Headbomb/Sandbox5), then tell the bot to run against those pages (follow redirects if they exist). Headbomb { t · c · p · b} 14:45, 22 August 2018 (UTC)
https://tools.wmflabs.org/citations/list.php?linksonpage=Book:Canada
, that would find all links on the page (likely direct links for simplicity) and run the bot on those pages, that would be great.That is if you have [[Foobar|Barfoo]]
somewhere on the page, get
Foobar (follow redirects if there are any), and run the bot on that. Repeat for all other links it finds.
Headbomb {
t ·
c ·
p ·
b} 22:30, 26 August 2018 (UTC)
This is still something that would be incredibly useful. Headbomb { t · c · p · b} 15:02, 30 November 2018 (UTC)
This is basically a request that would allow any user to run a full-automated bot without needing WP:BRFA. Given this is a tool designed for manual watching of diffs, I wonder how wise it would be to turn the bot keys over. -- Green C 16:27, 5 January 2019 (UTC)
{{ fixed}} prints with pipes now. AManWithNoPlan ( talk) 13:43, 8 January 2019 (UTC)
The adsabs database seems to more generous with matches suddenly. I have already submitted two fixes.
https://github.com/ms609/citation-bot/pull/1174
https://github.com/ms609/citation-bot/pull/1169
AManWithNoPlan (
talk) 14:59, 6 January 2019 (UTC)
There something wrong with that one bibcode that redirects to another one. That makes us not expand it since one check we do is to make sure the bibcode we get back is the one we sent out. This is unfixable, since we will not remove the double check. The second issue is that the not currently rejects expansion of any book bibcodes since that rehires is to write code that we have not done yet. I might look into writing that code.
AManWithNoPlan (
talk) 22:53, 6 January 2019 (UTC)
Every handle resolver has to be added separately.
https://github.com/ms609/citation-bot/pull/1181
AManWithNoPlan (
talk) 18:48, 7 January 2019 (UTC)
Commonly seen:
{{cite news | url=https://www.questia.com/read/1G1-61177939 | title=Max hangs up his boots with £200m | work=[[The People]] | date=March 31, 1996 | accessdate=March 4, 2013 | author=Gunn, Cathy}}{{Subscription required|via=[[Questia Online Library]]}}
{{cite news |last=Heffer|first=Simon|title= Beaten by Eton: The Land of Lost Content: The biography of Anthony Chenevix-Trench by Mark Peel |date=27 July 1996 |accessdate= 3 December 2012 |location =London |newspaper=[[Daily Mail]] {{Subscription required|via=[[Questia Online Library]]}}|url=https://www.questia.com/read/1G1-111427463}}
On both subscription status is noted with the {{
subscription}}
template, which can be inside or outside the CS1|2.
The better format would be:
{{cite news | url=https://www.questia.com/read/1G1-61177939 | title=Max hangs up his boots with £200m | work=[[The People]] | url-access=subscription | via = [[Questia Online Library]] | date=March 31, 1996 | accessdate=March 4, 2013 | author=Gunn, Cathy}}
{{cite news |last=Heffer|first=Simon|title= Beaten by Eton: The Land of Lost Content: The biography of Anthony Chenevix-Trench by Mark Peel |date=27 July 1996 |accessdate= 3 December 2012 |location =London |newspaper=[[Daily Mail]] |url=https://www.questia.com/read/1G1-111427463 | url-access=subscription | via = [[Questia Online Library]] }}
The {{
subscription}}
is replaced with |url-access=
and if there is a |via=
argument, with a |via=
in the CS1|2. The |subscription=
template goes by
many names. --
Green
C 19:30, 7 January 2019 (UTC)
I will close this item as {{ wontfix}} and have moved a link to the discussion area above. AManWithNoPlan ( talk) 17:02, 8 January 2019 (UTC)
If you have something like
{{cite web |url=http://www.example.com/asdf.pdf |title=title}}
, giving{{cite journal |url=http://www.example.com/asdf.pdf |title=title}}
, giving{{
cite journal}}
: Cite journal requires |journal=
(
help)Citation templates automatically append (PDF) next to the link. So there's no point in having
{{cite journal |url=http://www.example.com/asdf.pdf |title=title |format=PDF}}
, giving{{
cite journal}}
: Cite journal requires |journal=
(
help)So if you find |format=PDF
or similar (e.g. |format=
pdf
/ |format=Portable Document Format
/ |format=
pdf
), remove it as pointless.
Headbomb {
t ·
c ·
p ·
b} 17:41, 5 January 2019 (UTC)
|format=
pdf
exist in case the URL does not have an apparent ".pdf", so this suggestion would only be done when the URL has a ".pdf". But I wonder if there is any other reason for using |format=
pdf
? --
Green
C 18:22, 5 January 2019 (UTC)Flag to archive {{ notabug}}. Moving link above. AManWithNoPlan ( talk) 14:03, 7 January 2019 (UTC)
|format=PDF
will STILL be displayed. b) This is the English Wikipedia. Unlike |language=English
other wikis can easily implement automatic PDF detection, and would be better off doing so.
Headbomb {
t ·
c ·
p ·
b} 16:44, 7 January 2019 (UTC)
This is the wrong bot for the initial cleanup. Something else needs to fix this and then we can play whack a mole on new ones. Assuming this is a good idea of course. AManWithNoPlan ( talk) 19:17, 7 January 2019 (UTC)
|postscript=.
or |url=<PMC-URL>
. It's simplifies the edit window and makes references easier/more consistent to edit.
Headbomb {
t ·
c ·
p ·
b} 21:03, 7 January 2019 (UTC)
Something with the bibcode database has gone wonky suddenly. Adding lots of data integrity checks. Obviously more needs done.
AManWithNoPlan (
talk) 19:08, 7 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1188
AManWithNoPlan (
talk) 18:19, 8 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1187
AManWithNoPlan (
talk) 18:13, 8 January 2019 (UTC)
It’s an internal php bug. Work around:
https://github.com/ms609/citation-bot/pull/1193
AManWithNoPlan (
talk) 22:14, 9 January 2019 (UTC)
{{
wontfix}} at this time. It does run, just too slowly.
AManWithNoPlan (
talk) 22:04, 10 January 2019 (UTC)
|url=
in favor of the |chapter-url=
doi, or at least add the |chapter-url=
doi as |doi=
instead of the |url=
doi.
I will have to think about this and all the possible combinations
AManWithNoPlan (
talk) 22:37, 26 December 2018 (UTC)
|bibcode=2001gpm..book.....L
and changes {{
cite journal}}
to {{
cite book}} for journal article.
|journal=Genetics
, which is a bug.
Link to old edit. I saw the script trying to make this change on a page a few moments before I reported this as well, so it is still doing it. (
t)
Josve05a (
c) 02:13, 11 January 2019 (UTC){{cite arXiv |author=Limin Lu |date=1998 |title=The Metal Contents of Very Low Column Density Lyman-alpha Clouds: Implications for the Origin of Heavy Elements in the Intergalactic Medium |eprint=astro-ph/9802189 |display-authors=etal}}</ref>
to
{{cite journal |author=Limin Lu |date=1998 |title=The Metal Contents of Very Low Column Density Lyman-alpha Clouds: Implications for the Origin of Heavy Elements in the Intergalactic Medium |arxiv=astro-ph/9802189 |display-authors=etal|bibcode=1998astro.ph..2189L }}</ref>
https://github.com/ms609/citation-bot/pull/1210 AManWithNoPlan ( talk) 06:55, 11 January 2019 (UTC)
This bug was previously reported at
User talk:Citation bot/Archive 7 § Don't change urls and
User talk:Citation bot/Archive 7 § Bot breaks URL in pages field of citation template by changing hyphen to en dash in URL but apparently was not completely fixed.
This bug may occur in this case because the link is a protocol-relative URL, which is a deprecated link format on Wikipedia. In such cases, citation bot should update the link format instead of breaking the URL with the unfortunate hyphen/dash exchange. Biogeographist ( talk) 16:14, 10 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1215
AManWithNoPlan (
talk) 17:39, 11 January 2019 (UTC)
We got it fixed so now it fails on all pages 🙄
AManWithNoPlan (
talk) 05:44, 16 January 2019 (UTC)
{{
wontfix}} so many links and so many that block us or time out that it does eventually finish (after a long-time), if you (and your web browser) will let it. Probably best to run section by section.
AManWithNoPlan (
talk) 16:49, 16 January 2019 (UTC)
Running it in the debugger I find that there are mostly pdf files, which have no usable metadata. Once this pull is implemented
https://github.com/ms609/citation-bot/pull/1229/ the bot will have a lot more cites on it "don't try to hard" list.
AManWithNoPlan (
talk) 17:00, 16 January 2019 (UTC)
There also is "first=SPIEGEL ONLINE, Hamburg|last=Germany" on the page already which also does not seem to be correct, however this was not added by the bot.
They have changed the format to be longer.
AManWithNoPlan (
talk) 19:45, 13 January 2019 (UTC)
The bibcode title does not match very well, so we reject it. Perhaps we are too picky.
AManWithNoPlan (
talk) 18:16, 17 January 2019 (UTC)
Mostly {{ fixed}}, but this bibcode is still too different of a title to match. AManWithNoPlan ( talk) 18:28, 18 January 2019 (UTC)
Why does the bot remove publisher and location from the "Cite journal" template? Especially for magazines that have been published for a long time, these things change and may perhaps be of interest? Mr.choppers | ✎ 04:20, 19 January 2019 (UTC)
Flagging for archiving since links exist above {{ notabug}}. The documentation is lacking considering the publisher location removal has been standard for a decade.
{{
notabug}} tell them to publish metadata. Seriously it is just a doi.org url.
AManWithNoPlan (
talk) 04:46, 21 January 2019 (UTC)
The maintainer of reFill is looking to pass the torch Wikipedia:Village_pump_(technical)#reFill_is_looking_for_a_maintainer. Is the functionality of reFill already part of Citation bot? I know this tool is very popular though it has a long list of bugs to be worked out and the code base is PhP. -- Green C 13:16, 8 January 2019 (UTC)
{{ notabug}} seems like others are taking it over and a 2.0 version is moving fast. AManWithNoPlan ( talk) 16:36, 21 January 2019 (UTC)
I know that some users are tirelessly working on converting bare links to journal articles into {{
cite journal}} calls (which then citation bot can clean up). What are your preferred ways? Do you have regular expressions or other aids to share for the purpose? I see that a simplistic regex search for DOI URLs in bare links, like insource:http insource:/\[http[^ ]+10\.[0-9]{4,5}\/[^ ]+ /
, finds several thousands of pages and I'm not sure what's the best way to help.
Nemo 18:07, 16 January 2019 (UTC)
insource:/\>https\:\/\/doi\.org\/10/
> or search for specific publisher links and try to "fix all" from that domain. (
t)
Josve05a (
c) 18:20, 16 January 2019 (UTC)
Here are 5000+ examples for whoever is interested: phabricator:P8007. Nemo 14:49, 18 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1236 Amy thoughts on this AManWithNoPlan ( talk) 05:49, 20 January 2019 (UTC)
{{ fixed}} bot now does more AManWithNoPlan ( talk) 16:35, 21 January 2019 (UTC)
10.1016/j.agee.2010.07.017
AManWithNoPlan (
talk) 18:43, 18 January 2019 (UTC)
This is a bit of a garbage DOI (someone at science made
doi:
10.1126/science.10.1126/SCIENCE.291.5501.24 the doi instead of
doi:
10.1126/SCIENCE.291.5501.24 like a sane person would), but it's a valid one nonetheless.
Headbomb {
t ·
c ·
p ·
b} 17:32, 13 January 2019 (UTC)
Is it just me, or is the bot considerable slower since about a week? We're talking 30 minutes + to run on articles. Sometimes several hours. Headbomb { t · c · p · b} 03:24, 21 January 2019 (UTC)
|citeseerx=<!--Copyvio: 10.1.1.whatever/foobar-->
, although the CiteSeerX page contains more than just the file and the metadata is gives is useful).
Headbomb {
t ·
c ·
p ·
b} 16:59, 7 November 2018 (UTC)
They have a takedown link on each page now, and they seem to be within the law as an NSF site http://vondranlegal.com/what-to-do-when-the-federal-government-infringes-your-copyright/ AManWithNoPlan ( talk) 15:40, 22 January 2019 (UTC)
Why is this bot constantly changing cite web to cite book here? I am using the online version of this dictionary, not the paper version. Peacemaker67 ( click to talk to me) 23:04, 22 January 2019 (UTC)
{{ fixed}} flag for archiving AManWithNoPlan ( talk) 05:09, 23 January 2019 (UTC)
It converts the place to location after the removal of location occurs.
AManWithNoPlan (
talk) 19:00, 22 January 2019 (UTC)
https://en.wikipedia.org/?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=879725824
|doi=
when it should have been added as a |jstor=
(as well/instead).|chapter=
should be used, not {{
cite journal}}( t) Josve05a ( c) 00:25, 23 January 2019 (UTC)
<ref>{{doi|10.1111/jep.12752}}</ref>
should be treated as |doi=10.1111/jep.12752
(
t)
Josve05a (
c) 00:34, 23 January 2019 (UTC)
Meanwhile, I've checked the result of the recent bare ref conversion change and I've not found any mistake, only good edits. Special:Diff/879655266, Special:Diff/879653934, Special:Diff/879649478, Special:Diff/879639416, Special:Diff/879626624, Special:Diff/879617148, Special:Diff/879616120, Special:Diff/879615613, Special:Diff/879614147, Special:Diff/879613874, Special:Diff/879611681, Special:Diff/879611148, Special:Diff/879609896, Special:Diff/879601520, Special:Diff/879598740, Special:Diff/879592444, Special:Diff/879590812, Special:Diff/879581261, Special:Diff/879574435, Special:Diff/879568947, Special:Diff/879566233. Nemo 17:32, 22 January 2019 (UTC)
"With the pre-existing ones and all the others?" Hardly so. Headbomb { t · c · p · b} 20:07, 22 January 2019 (UTC)
{{cite journal |last1=Benedict |first1=Ruth |title=Reviewed Work: An Apache Life-Way: The Economic, Social, and Religious Institutions of the Chiricahua Indians by Morris E. Opler |journal=American Anthropologist |series=New Series |date=October–December 1942 |volume=44 |issue=4, Part 1 |pages=692–693 |url=https://www-jstor-org.rp.nla.gov.au/stable/663315 |accessdate=17 January 2019 }}
{{cite journal |last1=Benedict |first1=Ruth |title=Reviewed Work: An Apache Life-Way: The Economic, Social, and Religious Institutions of the Chiricahua Indians by Morris E. Opler |journal=American Anthropologist |series=New Series |date=October–December 1942 |volume=44 |issue=4, Part 1 |pages=692–693 |url=
https://www-jstor-org.rp.nla.gov.au/stable/663315 |accessdate=17 January 2019 }}
They did not include proxy in their url, annoying.
AManWithNoPlan (
talk) 18:46, 24 January 2019 (UTC)
{{ notabug}}
https://github.com/ms609/citation-bot/pull/1262 when we know, we will pad now (after this on wikipedia of course).
AManWithNoPlan (
talk) 18:56, 27 January 2019 (UTC)
See bug report template. Both an access date and a complete URL were removed by Citation Bot from a "Cite journal" template.
RobDuch (
talk) 20:50, 28 January 2019 (UTC)
|date=1899 1899-1985
https://en.wikipedia.org/?title=Helmut_Karl_Buechner&diff=880655437&oldid=837624460
https://en.wikipedia.org/?title=Wallace_Roy_Ernst&diff=880655720&oldid=802573936
This is a possible placeholder / shorthand for no authors or N/A. Maybe.
Headbomb {
t ·
c ·
p ·
b} 05:31, 29 January 2019 (UTC)
Citation bot is making odd changes to references like this where it converts a {{ cite journal}} to a {{ cite book}} (when the reference in question very much is a journal, not a book) and removes valid publisher information. See also here where the bot simply removed parameters with no discernible reason. Can anyone explain why the bot is doing this? Parsecboy ( talk) 12:40, 29 January 2019 (UTC)
{{ notabug}} data from databases is not clear. AManWithNoPlan ( talk) 14:01, 30 January 2019 (UTC)
There are ten different DOI providers. We have always supported Crossref. We added more recently. Now even more are coming. We also are adding tests for the ones that don’t work so we know if they suddenly start working and can check for bugs. Who knew that movies had dois? And no, we don’t expand the black panther marvel movie doi even with the new code. https://github.com/ms609/citation-bot/pull/1253 AManWithNoPlan ( talk) 18:18, 26 January 2019 (UTC)
{{ fixed}} and running great. AManWithNoPlan ( talk) 14:48, 31 January 2019 (UTC)
|title=
in {{
cite book}} italicizes, but |title=
in {{
cite web}} does not, because the title of a web article should not be automatically italicized in its entirety.
Looks like this issue is similar/related to the previously reported bug where {{ cite web}} was changed to {{ cite journal}}. What are the criteria with which this bot is changing citation templates from one to another? I think we can assume that most of these templates have been specifically chosen by editors, what is the bot supposed to be "fixing"? Thanks.— TAnthony Talk 18:43, 30 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1275 Will not remove trailing period, if there is another period in the last word.
AManWithNoPlan (
talk) 03:31, 31 January 2019 (UTC)
Citoid usage discussion on MediaWiki.org
{{ wontfix}} — flag to archive
The DOI is valid and points to the correct journal, but you are write that these ISSN only DOIs are probablematic and should probably be 100% ignored.
AManWithNoPlan (
talk) 16:57, 29 January 2019 (UTC)
Much better now these too will help:
https://github.com/ms609/citation-bot/pull/1279 https://github.com/ms609/citation-bot/pull/1277 https://github.com/ms609/citation-bot/pull/1278 https://github.com/ms609/citation-bot/pull/1280
So many small improvements for such a rare promblem. AManWithNoPlan ( talk) 17:52, 31 January 2019 (UTC)
Actually this could apply to any URLs that resolve to the same place as the DOI.
Headbomb {
t ·
c ·
p ·
b} 01:24, 15 January 2019 (UTC)
Citation is complete The doi is not an ISSN-only doi (points to article not journal) The url hostname is on the list canonical publishers The url does not contain 'pdf', 'image', 'plate', 'figure', or 'picture' The doi resolves to something
AManWithNoPlan ( talk) 18:41, 27 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1294 Will raise the bar.
AManWithNoPlan (
talk) 05:01, 7 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1286 documenting improvements {{ fixed}}
I'm not convinced the citation in question belongs in the article at all, but that's beside the point. —
David Eppstein (
talk) 22:06, 7 February 2019 (UTC)
|title=
plus |title-link=
.
AManWithNoPlan (
talk) 22:24, 7 February 2019 (UTC)
That is annoying that their meta-data has the publisher listed as journal. I will investigate.
AManWithNoPlan (
talk) 21:30, 7 February 2019 (UTC)
According to the documentation, the bots actions are correct. {{
cite arxiv}} is an odd beast that does things its own way.
AManWithNoPlan (
talk) 21:42, 7 February 2019 (UTC)
|arxiv=
to |eprint=
could probably be removed at this point, since that dates back to a time where |arxiv=
was not supported. The addition of |class=
to a cite arxiv is fine though.
Headbomb {
t ·
c ·
p ·
b} 22:32, 7 February 2019 (UTC)https://github.com/ms609/citation-bot/pull/1306 AManWithNoPlan ( talk) 23:01, 7 February 2019 (UTC)
I don't understand how this one happened. Citation bot did correctly find a publication matching the arXiv preprint. To do so, it must have matched title and authors, because that's the only information in common between the arXiv preprint and the published version. When I ask for bibtex metadata from doi.org, I get
@incollection{Grier_2013, doi = {10.1007/978-3-642-39206-1_42}, url = { https://doi.org/10.1007%2F978-3-642-39206-1_42}, year = 2013, publisher = {Springer Berlin Heidelberg}, pages = {497--503}, author = {Daniel Grier}, title = {Deciding the Winner of an Arbitrary Finite Poset Game Is {PSPACE}-Complete}, booktitle = {Automata, Languages, and Programming} }
which does correctly include the title of the paper (but not the series). So the information was obviously there. But Citation bot chose to remove it. — David Eppstein ( talk) 21:47, 7 February 2019 (UTC)
<isbn type="print">978-3-642-39205-4</isbn> <isbn type="electronic">978-3-642-39206-1</isbn> <issn type="print">0302-9743</issn> <issn type="electronic">1611-3349</issn> <series_title>Lecture Notes in Computer Science</series_title> <volume_title>Automata, Languages, and Programming</volume_title> <volume>7965</volume> <contributors> <contributor sequence="first" contributor_role="author"> <given_name>Daniel</given_name> <surname>Grier</surname> </contributor> </contributors> <component_number>Chapter 42</component_number> <year media_type="print">2013</year> <first_page>497</first_page> <last_page>503</last_page> <doi type="book_content">10.1007/978-3-642-39206-1_42</doi> <publication_type>full_text</publication_type> <article_title> Deciding the Winner of an Arbitrary Finite Poset Game Is PSPACE-Complete </article_title>
Get a better url. DOIs have not access dates.
AManWithNoPlan (
talk) 02:23, 9 February 2019 (UTC)
|date=2004
to |date=30 April 2004
to be more specific.
Headbomb {
t ·
c ·
p ·
b} 02:50, 9 February 2019 (UTC)Not sure if this is still happening: [32]. Nemo 10:10, 10 February 2019 (UTC) {{ fixed}}
|(author|first|last)\d?=et\s*al\.?
, replace with |display-authors=etal
. Similar for |display-editors=
local function name_has_etal (name, etal, nocat)
.|authors=
where I see it, which are often used in combination. --
Izno (
talk) 04:43, 7 February 2019 (UTC)This will handle the simplest cases: https://github.com/ms609/citation-bot/pull/1302 AManWithNoPlan ( talk) 21:22, 7 February 2019 (UTC)
In general '/' should be treated the same way as ':' is.
Headbomb {
t ·
c ·
p ·
b} 20:47, 8 February 2019 (UTC)
The "dead" page contains "Deze pagina is niet gevonden" which means "this page was not found", While the archived copy is a pdf which does not seems to contain a specific title (other than the file name).
Redalert2fan (
talk) 22:25, 8 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1317
AManWithNoPlan (
talk) 01:31, 9 February 2019 (UTC)
Bad metadata for this is so common that we actually have a whole list of capitalization rules and exceptions . In fact it is so bad that we don’t trust the metadata and change the capitalization after we get it.
AManWithNoPlan (
talk) 14:11, 9 February 2019 (UTC)
Note: this happens with all aircraft pages like this from russianplanes.net. Thanks,
Redalert2fan (
talk) 20:56, 9 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1318
AManWithNoPlan (
talk) 23:15, 9 February 2019 (UTC)
The full page number is 89017-1–89017-5. So which is more useful?
AManWithNoPlan (
talk) 00:50, 11 February 2019 (UTC)
I cannot reproduce it. Very odd. AManWithNoPlan ( talk) 23:28, 9 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1329 AManWithNoPlan ( talk) 20:34, 11 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1330
AManWithNoPlan (
talk) 23:59, 11 February 2019 (UTC)
chapterurl
instead of the standard url
. The parameter can be used as a standalone, especially when citing things like legislative texts (as my example shows). This bug was
previously reported in 2015, but was withdrawn.
I wonder when that broke?
AManWithNoPlan (
talk) 00:43, 11 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1326
AManWithNoPlan (
talk) 17:05, 11 February 2019 (UTC)
I noticed we have some 1000 links to www3.interscience.wiley.com/cgi-bin/ which seem to all give an HTTP 403 error. Do they work for anyone? Should they be removed? Is it a job for a bot? For this bot or some other? Nemo 09:35, 8 February 2019 (UTC)
{{ wontfix}} by this bot. Some other bot should grab them all. Verify they are dead and then remove. AManWithNoPlan ( talk) 17:01, 11 February 2019 (UTC)
pages in the cited source containing the information that supports the article text.to quote Help:Citation Style 1#Pages, or
A range of pages in the source that supports the content.to quote Template:Cite journal.
|pages=
parameter, which is supposed to be a range, as appropriate for the full citation. And not the in-source specifier of where specific material is to be found, which is appropriate for individual (and multiple) short-cites within the article.should publisher be removed – discussion about the above discussion
{{ fixed}} - discussion above archives, so archive our link to it
merging subscription neeeded into cite templates
{{ notabug}} looks like they have it all under control.
|title=
/|work=
parameters (where the |title=
is a conference paper and the |work=
is the proceedings title) to |chapter=
/|title=
/|work=
(moving paper title to |chapter=
and conference proceedings title to |title=
but leaving |work=
in place. The original |title=
/|work=
is not the best coding but is a valid combination of parameters. The changed |chapter=
/|title=
/|work=
is an invalid combination, the citation template complains about it, and in addition it fails to display the chapter.
CS2 sucks. I think I have a solution, I can work on.
AManWithNoPlan (
talk) 02:29, 11 February 2019 (UTC)
|mode=cs1
. So it's not the style, but the all-in-one template parameterization that you're complaining about. But that has its advantages, too: for instance, that way you don't have quite as much of a problem with people using cite journal for conference papers. —
David Eppstein (
talk) 03:13, 11 February 2019 (UTC)
Urls that match the DOI are removed.
AManWithNoPlan (
talk) 21:23, 13 February 2019 (UTC)
...&q=%22House+&+garden%22+computer+Sutherland+1966&dq=...
is trimmed to:...&q=%22House+&dq=...
This is probably a Pale Moon browser fault which apparently doesn't encoded url properly. On SeaMonkey "&" is encoded as %26, and entering the full url with unencoded "&" trims it just like the bot did. (It apparently was a temporary browser glitch, because after testing in Pale Moon, url was properly encoded too) Cause found: automatic cite in Visual Editor decodes %26 in q= to "&" (
VisualEditor/Feedback). --
MarMi wiki (
talk) 19:53, 14 February 2019 (UTC)
{{
cite book}}
: |access-date=
requires |url=
(
help); External link in |chapterurl=
(
help); Unknown parameter |chapterurl=
ignored (|chapter-url=
suggested) (
help); Unknown parameter |editors=
ignored (|editor=
suggested) (
help)to
{{
cite book}}
: |journal=
ignored (
help); Unknown parameter |editors=
ignored (|editor=
suggested) (
help){{
cite book}}
: Unknown parameter |editors=
ignored (|editor=
suggested) (
help)
Most likely not fixable, will look at meta data
AManWithNoPlan (
talk) 02:34, 9 February 2019 (UTC)
|journal=Methods in Molecular Biology
→ |series=Methods in Molecular Biology
→ |journal=Methods in Molecular Biology (Clifton, N.j)
+ |series=Methods in Molecular Biology
→ |journal=<!-- -->
+ |series=Methods in Molecular Biology
cycle per dump.
Headbomb {
t ·
c ·
p ·
b} 02:55, 9 February 2019 (UTC)
That’s an interesting question. What should be done when a decade old consensus is challenged? Should we stop and wait or what. I don’t know.
AManWithNoPlan (
talk) 01:26, 10 February 2019 (UTC)
I don’t have strong opinion, I am here to code. Wow! That’s a lot a explanation! My one opinion is that people should remove publisher and location (which are almost always wrong sadly) and wiki link to a page about the journal-and make it if needed: a permanent fix that makes Wikipedia better and everyone happy. I just find it funny that pretty much every one who complains is pointing to journals with incorrect publishers listed or journals so obscure that even that information won’t help much. AManWithNoPlan ( talk) 14:06, 10 February 2019 (UTC)
Removing only when there's a unique identifier ( https://github.com/ms609/citation-bot/pull/1323) seems a good way to address everyone's concerns. Nemo 10:20, 11 February 2019 (UTC)
I have fixed this specific link with IABot and added the correct title myself.
Redalert2fan (
talk) 18:59, 13 February 2019 (UTC)
Note, also, in
an edit the bot made earlier this month, it altered the same citation but without changing the spacing... so I'm not sure why it made the change as a separate edit a couple of weeks later.
EdChem (
talk) 14:21, 15 February 2019 (UTC)
Really hard to see in that diff, but I think this will do it. At the very least, it will crank down the greediness. https://github.com/ms609/citation-bot/pull/1343 AManWithNoPlan ( talk)
This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 10 | Archive 11 | Archive 12 | Archive 13 | Archive 14 | Archive 15 | → | Archive 20 |
https://github.com/ms609/citation-bot/pull/1165
AManWithNoPlan (
talk) 18:53, 4 January 2019 (UTC)
See also
[4].
Headbomb {
t ·
c ·
p ·
b} 17:46, 5 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1172 Once implemented it should fix this.
AManWithNoPlan (
talk) 01:17, 6 January 2019 (UTC)
Let's have something like
https://tools.wmflabs.org/citations/list.php?linksonpage=User:Headbomb/Sandbox5
This would be super useful. We could be build lists of pages with crappy citations with AWB's database scanner or with clever insource:// search (e.g. pages with raw GoogleBooks links, pages with raw DOI links, ...), then put the list of pages to be edited somewhere (e.g. User:Headbomb/Sandbox5), then tell the bot to run against those pages (follow redirects if they exist). Headbomb { t · c · p · b} 14:45, 22 August 2018 (UTC)
https://tools.wmflabs.org/citations/list.php?linksonpage=Book:Canada
, that would find all links on the page (likely direct links for simplicity) and run the bot on those pages, that would be great.That is if you have [[Foobar|Barfoo]]
somewhere on the page, get
Foobar (follow redirects if there are any), and run the bot on that. Repeat for all other links it finds.
Headbomb {
t ·
c ·
p ·
b} 22:30, 26 August 2018 (UTC)
This is still something that would be incredibly useful. Headbomb { t · c · p · b} 15:02, 30 November 2018 (UTC)
This is basically a request that would allow any user to run a full-automated bot without needing WP:BRFA. Given this is a tool designed for manual watching of diffs, I wonder how wise it would be to turn the bot keys over. -- Green C 16:27, 5 January 2019 (UTC)
{{ fixed}} prints with pipes now. AManWithNoPlan ( talk) 13:43, 8 January 2019 (UTC)
The adsabs database seems to more generous with matches suddenly. I have already submitted two fixes.
https://github.com/ms609/citation-bot/pull/1174
https://github.com/ms609/citation-bot/pull/1169
AManWithNoPlan (
talk) 14:59, 6 January 2019 (UTC)
There something wrong with that one bibcode that redirects to another one. That makes us not expand it since one check we do is to make sure the bibcode we get back is the one we sent out. This is unfixable, since we will not remove the double check. The second issue is that the not currently rejects expansion of any book bibcodes since that rehires is to write code that we have not done yet. I might look into writing that code.
AManWithNoPlan (
talk) 22:53, 6 January 2019 (UTC)
Every handle resolver has to be added separately.
https://github.com/ms609/citation-bot/pull/1181
AManWithNoPlan (
talk) 18:48, 7 January 2019 (UTC)
Commonly seen:
{{cite news | url=https://www.questia.com/read/1G1-61177939 | title=Max hangs up his boots with £200m | work=[[The People]] | date=March 31, 1996 | accessdate=March 4, 2013 | author=Gunn, Cathy}}{{Subscription required|via=[[Questia Online Library]]}}
{{cite news |last=Heffer|first=Simon|title= Beaten by Eton: The Land of Lost Content: The biography of Anthony Chenevix-Trench by Mark Peel |date=27 July 1996 |accessdate= 3 December 2012 |location =London |newspaper=[[Daily Mail]] {{Subscription required|via=[[Questia Online Library]]}}|url=https://www.questia.com/read/1G1-111427463}}
On both subscription status is noted with the {{
subscription}}
template, which can be inside or outside the CS1|2.
The better format would be:
{{cite news | url=https://www.questia.com/read/1G1-61177939 | title=Max hangs up his boots with £200m | work=[[The People]] | url-access=subscription | via = [[Questia Online Library]] | date=March 31, 1996 | accessdate=March 4, 2013 | author=Gunn, Cathy}}
{{cite news |last=Heffer|first=Simon|title= Beaten by Eton: The Land of Lost Content: The biography of Anthony Chenevix-Trench by Mark Peel |date=27 July 1996 |accessdate= 3 December 2012 |location =London |newspaper=[[Daily Mail]] |url=https://www.questia.com/read/1G1-111427463 | url-access=subscription | via = [[Questia Online Library]] }}
The {{
subscription}}
is replaced with |url-access=
and if there is a |via=
argument, with a |via=
in the CS1|2. The |subscription=
template goes by
many names. --
Green
C 19:30, 7 January 2019 (UTC)
I will close this item as {{ wontfix}} and have moved a link to the discussion area above. AManWithNoPlan ( talk) 17:02, 8 January 2019 (UTC)
If you have something like
{{cite web |url=http://www.example.com/asdf.pdf |title=title}}
, giving{{cite journal |url=http://www.example.com/asdf.pdf |title=title}}
, giving{{
cite journal}}
: Cite journal requires |journal=
(
help)Citation templates automatically append (PDF) next to the link. So there's no point in having
{{cite journal |url=http://www.example.com/asdf.pdf |title=title |format=PDF}}
, giving{{
cite journal}}
: Cite journal requires |journal=
(
help)So if you find |format=PDF
or similar (e.g. |format=
pdf
/ |format=Portable Document Format
/ |format=
pdf
), remove it as pointless.
Headbomb {
t ·
c ·
p ·
b} 17:41, 5 January 2019 (UTC)
|format=
pdf
exist in case the URL does not have an apparent ".pdf", so this suggestion would only be done when the URL has a ".pdf". But I wonder if there is any other reason for using |format=
pdf
? --
Green
C 18:22, 5 January 2019 (UTC)Flag to archive {{ notabug}}. Moving link above. AManWithNoPlan ( talk) 14:03, 7 January 2019 (UTC)
|format=PDF
will STILL be displayed. b) This is the English Wikipedia. Unlike |language=English
other wikis can easily implement automatic PDF detection, and would be better off doing so.
Headbomb {
t ·
c ·
p ·
b} 16:44, 7 January 2019 (UTC)
This is the wrong bot for the initial cleanup. Something else needs to fix this and then we can play whack a mole on new ones. Assuming this is a good idea of course. AManWithNoPlan ( talk) 19:17, 7 January 2019 (UTC)
|postscript=.
or |url=<PMC-URL>
. It's simplifies the edit window and makes references easier/more consistent to edit.
Headbomb {
t ·
c ·
p ·
b} 21:03, 7 January 2019 (UTC)
Something with the bibcode database has gone wonky suddenly. Adding lots of data integrity checks. Obviously more needs done.
AManWithNoPlan (
talk) 19:08, 7 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1188
AManWithNoPlan (
talk) 18:19, 8 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1187
AManWithNoPlan (
talk) 18:13, 8 January 2019 (UTC)
It’s an internal php bug. Work around:
https://github.com/ms609/citation-bot/pull/1193
AManWithNoPlan (
talk) 22:14, 9 January 2019 (UTC)
{{
wontfix}} at this time. It does run, just too slowly.
AManWithNoPlan (
talk) 22:04, 10 January 2019 (UTC)
|url=
in favor of the |chapter-url=
doi, or at least add the |chapter-url=
doi as |doi=
instead of the |url=
doi.
I will have to think about this and all the possible combinations
AManWithNoPlan (
talk) 22:37, 26 December 2018 (UTC)
|bibcode=2001gpm..book.....L
and changes {{
cite journal}}
to {{
cite book}} for journal article.
|journal=Genetics
, which is a bug.
Link to old edit. I saw the script trying to make this change on a page a few moments before I reported this as well, so it is still doing it. (
t)
Josve05a (
c) 02:13, 11 January 2019 (UTC){{cite arXiv |author=Limin Lu |date=1998 |title=The Metal Contents of Very Low Column Density Lyman-alpha Clouds: Implications for the Origin of Heavy Elements in the Intergalactic Medium |eprint=astro-ph/9802189 |display-authors=etal}}</ref>
to
{{cite journal |author=Limin Lu |date=1998 |title=The Metal Contents of Very Low Column Density Lyman-alpha Clouds: Implications for the Origin of Heavy Elements in the Intergalactic Medium |arxiv=astro-ph/9802189 |display-authors=etal|bibcode=1998astro.ph..2189L }}</ref>
https://github.com/ms609/citation-bot/pull/1210 AManWithNoPlan ( talk) 06:55, 11 January 2019 (UTC)
This bug was previously reported at
User talk:Citation bot/Archive 7 § Don't change urls and
User talk:Citation bot/Archive 7 § Bot breaks URL in pages field of citation template by changing hyphen to en dash in URL but apparently was not completely fixed.
This bug may occur in this case because the link is a protocol-relative URL, which is a deprecated link format on Wikipedia. In such cases, citation bot should update the link format instead of breaking the URL with the unfortunate hyphen/dash exchange. Biogeographist ( talk) 16:14, 10 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1215
AManWithNoPlan (
talk) 17:39, 11 January 2019 (UTC)
We got it fixed so now it fails on all pages 🙄
AManWithNoPlan (
talk) 05:44, 16 January 2019 (UTC)
{{
wontfix}} so many links and so many that block us or time out that it does eventually finish (after a long-time), if you (and your web browser) will let it. Probably best to run section by section.
AManWithNoPlan (
talk) 16:49, 16 January 2019 (UTC)
Running it in the debugger I find that there are mostly pdf files, which have no usable metadata. Once this pull is implemented
https://github.com/ms609/citation-bot/pull/1229/ the bot will have a lot more cites on it "don't try to hard" list.
AManWithNoPlan (
talk) 17:00, 16 January 2019 (UTC)
There also is "first=SPIEGEL ONLINE, Hamburg|last=Germany" on the page already which also does not seem to be correct, however this was not added by the bot.
They have changed the format to be longer.
AManWithNoPlan (
talk) 19:45, 13 January 2019 (UTC)
The bibcode title does not match very well, so we reject it. Perhaps we are too picky.
AManWithNoPlan (
talk) 18:16, 17 January 2019 (UTC)
Mostly {{ fixed}}, but this bibcode is still too different of a title to match. AManWithNoPlan ( talk) 18:28, 18 January 2019 (UTC)
Why does the bot remove publisher and location from the "Cite journal" template? Especially for magazines that have been published for a long time, these things change and may perhaps be of interest? Mr.choppers | ✎ 04:20, 19 January 2019 (UTC)
Flagging for archiving since links exist above {{ notabug}}. The documentation is lacking considering the publisher location removal has been standard for a decade.
{{
notabug}} tell them to publish metadata. Seriously it is just a doi.org url.
AManWithNoPlan (
talk) 04:46, 21 January 2019 (UTC)
The maintainer of reFill is looking to pass the torch Wikipedia:Village_pump_(technical)#reFill_is_looking_for_a_maintainer. Is the functionality of reFill already part of Citation bot? I know this tool is very popular though it has a long list of bugs to be worked out and the code base is PhP. -- Green C 13:16, 8 January 2019 (UTC)
{{ notabug}} seems like others are taking it over and a 2.0 version is moving fast. AManWithNoPlan ( talk) 16:36, 21 January 2019 (UTC)
I know that some users are tirelessly working on converting bare links to journal articles into {{
cite journal}} calls (which then citation bot can clean up). What are your preferred ways? Do you have regular expressions or other aids to share for the purpose? I see that a simplistic regex search for DOI URLs in bare links, like insource:http insource:/\[http[^ ]+10\.[0-9]{4,5}\/[^ ]+ /
, finds several thousands of pages and I'm not sure what's the best way to help.
Nemo 18:07, 16 January 2019 (UTC)
insource:/\>https\:\/\/doi\.org\/10/
> or search for specific publisher links and try to "fix all" from that domain. (
t)
Josve05a (
c) 18:20, 16 January 2019 (UTC)
Here are 5000+ examples for whoever is interested: phabricator:P8007. Nemo 14:49, 18 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1236 Amy thoughts on this AManWithNoPlan ( talk) 05:49, 20 January 2019 (UTC)
{{ fixed}} bot now does more AManWithNoPlan ( talk) 16:35, 21 January 2019 (UTC)
10.1016/j.agee.2010.07.017
AManWithNoPlan (
talk) 18:43, 18 January 2019 (UTC)
This is a bit of a garbage DOI (someone at science made
doi:
10.1126/science.10.1126/SCIENCE.291.5501.24 the doi instead of
doi:
10.1126/SCIENCE.291.5501.24 like a sane person would), but it's a valid one nonetheless.
Headbomb {
t ·
c ·
p ·
b} 17:32, 13 January 2019 (UTC)
Is it just me, or is the bot considerable slower since about a week? We're talking 30 minutes + to run on articles. Sometimes several hours. Headbomb { t · c · p · b} 03:24, 21 January 2019 (UTC)
|citeseerx=<!--Copyvio: 10.1.1.whatever/foobar-->
, although the CiteSeerX page contains more than just the file and the metadata is gives is useful).
Headbomb {
t ·
c ·
p ·
b} 16:59, 7 November 2018 (UTC)
They have a takedown link on each page now, and they seem to be within the law as an NSF site http://vondranlegal.com/what-to-do-when-the-federal-government-infringes-your-copyright/ AManWithNoPlan ( talk) 15:40, 22 January 2019 (UTC)
Why is this bot constantly changing cite web to cite book here? I am using the online version of this dictionary, not the paper version. Peacemaker67 ( click to talk to me) 23:04, 22 January 2019 (UTC)
{{ fixed}} flag for archiving AManWithNoPlan ( talk) 05:09, 23 January 2019 (UTC)
It converts the place to location after the removal of location occurs.
AManWithNoPlan (
talk) 19:00, 22 January 2019 (UTC)
https://en.wikipedia.org/?title=User%3AJosve05a%2Fcite-sandbox&diff=prev&oldid=879725824
|doi=
when it should have been added as a |jstor=
(as well/instead).|chapter=
should be used, not {{
cite journal}}( t) Josve05a ( c) 00:25, 23 January 2019 (UTC)
<ref>{{doi|10.1111/jep.12752}}</ref>
should be treated as |doi=10.1111/jep.12752
(
t)
Josve05a (
c) 00:34, 23 January 2019 (UTC)
Meanwhile, I've checked the result of the recent bare ref conversion change and I've not found any mistake, only good edits. Special:Diff/879655266, Special:Diff/879653934, Special:Diff/879649478, Special:Diff/879639416, Special:Diff/879626624, Special:Diff/879617148, Special:Diff/879616120, Special:Diff/879615613, Special:Diff/879614147, Special:Diff/879613874, Special:Diff/879611681, Special:Diff/879611148, Special:Diff/879609896, Special:Diff/879601520, Special:Diff/879598740, Special:Diff/879592444, Special:Diff/879590812, Special:Diff/879581261, Special:Diff/879574435, Special:Diff/879568947, Special:Diff/879566233. Nemo 17:32, 22 January 2019 (UTC)
"With the pre-existing ones and all the others?" Hardly so. Headbomb { t · c · p · b} 20:07, 22 January 2019 (UTC)
{{cite journal |last1=Benedict |first1=Ruth |title=Reviewed Work: An Apache Life-Way: The Economic, Social, and Religious Institutions of the Chiricahua Indians by Morris E. Opler |journal=American Anthropologist |series=New Series |date=October–December 1942 |volume=44 |issue=4, Part 1 |pages=692–693 |url=https://www-jstor-org.rp.nla.gov.au/stable/663315 |accessdate=17 January 2019 }}
{{cite journal |last1=Benedict |first1=Ruth |title=Reviewed Work: An Apache Life-Way: The Economic, Social, and Religious Institutions of the Chiricahua Indians by Morris E. Opler |journal=American Anthropologist |series=New Series |date=October–December 1942 |volume=44 |issue=4, Part 1 |pages=692–693 |url=
https://www-jstor-org.rp.nla.gov.au/stable/663315 |accessdate=17 January 2019 }}
They did not include proxy in their url, annoying.
AManWithNoPlan (
talk) 18:46, 24 January 2019 (UTC)
{{ notabug}}
https://github.com/ms609/citation-bot/pull/1262 when we know, we will pad now (after this on wikipedia of course).
AManWithNoPlan (
talk) 18:56, 27 January 2019 (UTC)
See bug report template. Both an access date and a complete URL were removed by Citation Bot from a "Cite journal" template.
RobDuch (
talk) 20:50, 28 January 2019 (UTC)
|date=1899 1899-1985
https://en.wikipedia.org/?title=Helmut_Karl_Buechner&diff=880655437&oldid=837624460
https://en.wikipedia.org/?title=Wallace_Roy_Ernst&diff=880655720&oldid=802573936
This is a possible placeholder / shorthand for no authors or N/A. Maybe.
Headbomb {
t ·
c ·
p ·
b} 05:31, 29 January 2019 (UTC)
Citation bot is making odd changes to references like this where it converts a {{ cite journal}} to a {{ cite book}} (when the reference in question very much is a journal, not a book) and removes valid publisher information. See also here where the bot simply removed parameters with no discernible reason. Can anyone explain why the bot is doing this? Parsecboy ( talk) 12:40, 29 January 2019 (UTC)
{{ notabug}} data from databases is not clear. AManWithNoPlan ( talk) 14:01, 30 January 2019 (UTC)
There are ten different DOI providers. We have always supported Crossref. We added more recently. Now even more are coming. We also are adding tests for the ones that don’t work so we know if they suddenly start working and can check for bugs. Who knew that movies had dois? And no, we don’t expand the black panther marvel movie doi even with the new code. https://github.com/ms609/citation-bot/pull/1253 AManWithNoPlan ( talk) 18:18, 26 January 2019 (UTC)
{{ fixed}} and running great. AManWithNoPlan ( talk) 14:48, 31 January 2019 (UTC)
|title=
in {{
cite book}} italicizes, but |title=
in {{
cite web}} does not, because the title of a web article should not be automatically italicized in its entirety.
Looks like this issue is similar/related to the previously reported bug where {{ cite web}} was changed to {{ cite journal}}. What are the criteria with which this bot is changing citation templates from one to another? I think we can assume that most of these templates have been specifically chosen by editors, what is the bot supposed to be "fixing"? Thanks.— TAnthony Talk 18:43, 30 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1275 Will not remove trailing period, if there is another period in the last word.
AManWithNoPlan (
talk) 03:31, 31 January 2019 (UTC)
Citoid usage discussion on MediaWiki.org
{{ wontfix}} — flag to archive
The DOI is valid and points to the correct journal, but you are write that these ISSN only DOIs are probablematic and should probably be 100% ignored.
AManWithNoPlan (
talk) 16:57, 29 January 2019 (UTC)
Much better now these too will help:
https://github.com/ms609/citation-bot/pull/1279 https://github.com/ms609/citation-bot/pull/1277 https://github.com/ms609/citation-bot/pull/1278 https://github.com/ms609/citation-bot/pull/1280
So many small improvements for such a rare promblem. AManWithNoPlan ( talk) 17:52, 31 January 2019 (UTC)
Actually this could apply to any URLs that resolve to the same place as the DOI.
Headbomb {
t ·
c ·
p ·
b} 01:24, 15 January 2019 (UTC)
Citation is complete The doi is not an ISSN-only doi (points to article not journal) The url hostname is on the list canonical publishers The url does not contain 'pdf', 'image', 'plate', 'figure', or 'picture' The doi resolves to something
AManWithNoPlan ( talk) 18:41, 27 January 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1294 Will raise the bar.
AManWithNoPlan (
talk) 05:01, 7 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1286 documenting improvements {{ fixed}}
I'm not convinced the citation in question belongs in the article at all, but that's beside the point. —
David Eppstein (
talk) 22:06, 7 February 2019 (UTC)
|title=
plus |title-link=
.
AManWithNoPlan (
talk) 22:24, 7 February 2019 (UTC)
That is annoying that their meta-data has the publisher listed as journal. I will investigate.
AManWithNoPlan (
talk) 21:30, 7 February 2019 (UTC)
According to the documentation, the bots actions are correct. {{
cite arxiv}} is an odd beast that does things its own way.
AManWithNoPlan (
talk) 21:42, 7 February 2019 (UTC)
|arxiv=
to |eprint=
could probably be removed at this point, since that dates back to a time where |arxiv=
was not supported. The addition of |class=
to a cite arxiv is fine though.
Headbomb {
t ·
c ·
p ·
b} 22:32, 7 February 2019 (UTC)https://github.com/ms609/citation-bot/pull/1306 AManWithNoPlan ( talk) 23:01, 7 February 2019 (UTC)
I don't understand how this one happened. Citation bot did correctly find a publication matching the arXiv preprint. To do so, it must have matched title and authors, because that's the only information in common between the arXiv preprint and the published version. When I ask for bibtex metadata from doi.org, I get
@incollection{Grier_2013, doi = {10.1007/978-3-642-39206-1_42}, url = { https://doi.org/10.1007%2F978-3-642-39206-1_42}, year = 2013, publisher = {Springer Berlin Heidelberg}, pages = {497--503}, author = {Daniel Grier}, title = {Deciding the Winner of an Arbitrary Finite Poset Game Is {PSPACE}-Complete}, booktitle = {Automata, Languages, and Programming} }
which does correctly include the title of the paper (but not the series). So the information was obviously there. But Citation bot chose to remove it. — David Eppstein ( talk) 21:47, 7 February 2019 (UTC)
<isbn type="print">978-3-642-39205-4</isbn> <isbn type="electronic">978-3-642-39206-1</isbn> <issn type="print">0302-9743</issn> <issn type="electronic">1611-3349</issn> <series_title>Lecture Notes in Computer Science</series_title> <volume_title>Automata, Languages, and Programming</volume_title> <volume>7965</volume> <contributors> <contributor sequence="first" contributor_role="author"> <given_name>Daniel</given_name> <surname>Grier</surname> </contributor> </contributors> <component_number>Chapter 42</component_number> <year media_type="print">2013</year> <first_page>497</first_page> <last_page>503</last_page> <doi type="book_content">10.1007/978-3-642-39206-1_42</doi> <publication_type>full_text</publication_type> <article_title> Deciding the Winner of an Arbitrary Finite Poset Game Is PSPACE-Complete </article_title>
Get a better url. DOIs have not access dates.
AManWithNoPlan (
talk) 02:23, 9 February 2019 (UTC)
|date=2004
to |date=30 April 2004
to be more specific.
Headbomb {
t ·
c ·
p ·
b} 02:50, 9 February 2019 (UTC)Not sure if this is still happening: [32]. Nemo 10:10, 10 February 2019 (UTC) {{ fixed}}
|(author|first|last)\d?=et\s*al\.?
, replace with |display-authors=etal
. Similar for |display-editors=
local function name_has_etal (name, etal, nocat)
.|authors=
where I see it, which are often used in combination. --
Izno (
talk) 04:43, 7 February 2019 (UTC)This will handle the simplest cases: https://github.com/ms609/citation-bot/pull/1302 AManWithNoPlan ( talk) 21:22, 7 February 2019 (UTC)
In general '/' should be treated the same way as ':' is.
Headbomb {
t ·
c ·
p ·
b} 20:47, 8 February 2019 (UTC)
The "dead" page contains "Deze pagina is niet gevonden" which means "this page was not found", While the archived copy is a pdf which does not seems to contain a specific title (other than the file name).
Redalert2fan (
talk) 22:25, 8 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1317
AManWithNoPlan (
talk) 01:31, 9 February 2019 (UTC)
Bad metadata for this is so common that we actually have a whole list of capitalization rules and exceptions . In fact it is so bad that we don’t trust the metadata and change the capitalization after we get it.
AManWithNoPlan (
talk) 14:11, 9 February 2019 (UTC)
Note: this happens with all aircraft pages like this from russianplanes.net. Thanks,
Redalert2fan (
talk) 20:56, 9 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1318
AManWithNoPlan (
talk) 23:15, 9 February 2019 (UTC)
The full page number is 89017-1–89017-5. So which is more useful?
AManWithNoPlan (
talk) 00:50, 11 February 2019 (UTC)
I cannot reproduce it. Very odd. AManWithNoPlan ( talk) 23:28, 9 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1329 AManWithNoPlan ( talk) 20:34, 11 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1330
AManWithNoPlan (
talk) 23:59, 11 February 2019 (UTC)
chapterurl
instead of the standard url
. The parameter can be used as a standalone, especially when citing things like legislative texts (as my example shows). This bug was
previously reported in 2015, but was withdrawn.
I wonder when that broke?
AManWithNoPlan (
talk) 00:43, 11 February 2019 (UTC)
https://github.com/ms609/citation-bot/pull/1326
AManWithNoPlan (
talk) 17:05, 11 February 2019 (UTC)
I noticed we have some 1000 links to www3.interscience.wiley.com/cgi-bin/ which seem to all give an HTTP 403 error. Do they work for anyone? Should they be removed? Is it a job for a bot? For this bot or some other? Nemo 09:35, 8 February 2019 (UTC)
{{ wontfix}} by this bot. Some other bot should grab them all. Verify they are dead and then remove. AManWithNoPlan ( talk) 17:01, 11 February 2019 (UTC)
pages in the cited source containing the information that supports the article text.to quote Help:Citation Style 1#Pages, or
A range of pages in the source that supports the content.to quote Template:Cite journal.
|pages=
parameter, which is supposed to be a range, as appropriate for the full citation. And not the in-source specifier of where specific material is to be found, which is appropriate for individual (and multiple) short-cites within the article.should publisher be removed – discussion about the above discussion
{{ fixed}} - discussion above archives, so archive our link to it
merging subscription neeeded into cite templates
{{ notabug}} looks like they have it all under control.
|title=
/|work=
parameters (where the |title=
is a conference paper and the |work=
is the proceedings title) to |chapter=
/|title=
/|work=
(moving paper title to |chapter=
and conference proceedings title to |title=
but leaving |work=
in place. The original |title=
/|work=
is not the best coding but is a valid combination of parameters. The changed |chapter=
/|title=
/|work=
is an invalid combination, the citation template complains about it, and in addition it fails to display the chapter.
CS2 sucks. I think I have a solution, I can work on.
AManWithNoPlan (
talk) 02:29, 11 February 2019 (UTC)
|mode=cs1
. So it's not the style, but the all-in-one template parameterization that you're complaining about. But that has its advantages, too: for instance, that way you don't have quite as much of a problem with people using cite journal for conference papers. —
David Eppstein (
talk) 03:13, 11 February 2019 (UTC)
Urls that match the DOI are removed.
AManWithNoPlan (
talk) 21:23, 13 February 2019 (UTC)
...&q=%22House+&+garden%22+computer+Sutherland+1966&dq=...
is trimmed to:...&q=%22House+&dq=...
This is probably a Pale Moon browser fault which apparently doesn't encoded url properly. On SeaMonkey "&" is encoded as %26, and entering the full url with unencoded "&" trims it just like the bot did. (It apparently was a temporary browser glitch, because after testing in Pale Moon, url was properly encoded too) Cause found: automatic cite in Visual Editor decodes %26 in q= to "&" (
VisualEditor/Feedback). --
MarMi wiki (
talk) 19:53, 14 February 2019 (UTC)
{{
cite book}}
: |access-date=
requires |url=
(
help); External link in |chapterurl=
(
help); Unknown parameter |chapterurl=
ignored (|chapter-url=
suggested) (
help); Unknown parameter |editors=
ignored (|editor=
suggested) (
help)to
{{
cite book}}
: |journal=
ignored (
help); Unknown parameter |editors=
ignored (|editor=
suggested) (
help){{
cite book}}
: Unknown parameter |editors=
ignored (|editor=
suggested) (
help)
Most likely not fixable, will look at meta data
AManWithNoPlan (
talk) 02:34, 9 February 2019 (UTC)
|journal=Methods in Molecular Biology
→ |series=Methods in Molecular Biology
→ |journal=Methods in Molecular Biology (Clifton, N.j)
+ |series=Methods in Molecular Biology
→ |journal=<!-- -->
+ |series=Methods in Molecular Biology
cycle per dump.
Headbomb {
t ·
c ·
p ·
b} 02:55, 9 February 2019 (UTC)
That’s an interesting question. What should be done when a decade old consensus is challenged? Should we stop and wait or what. I don’t know.
AManWithNoPlan (
talk) 01:26, 10 February 2019 (UTC)
I don’t have strong opinion, I am here to code. Wow! That’s a lot a explanation! My one opinion is that people should remove publisher and location (which are almost always wrong sadly) and wiki link to a page about the journal-and make it if needed: a permanent fix that makes Wikipedia better and everyone happy. I just find it funny that pretty much every one who complains is pointing to journals with incorrect publishers listed or journals so obscure that even that information won’t help much. AManWithNoPlan ( talk) 14:06, 10 February 2019 (UTC)
Removing only when there's a unique identifier ( https://github.com/ms609/citation-bot/pull/1323) seems a good way to address everyone's concerns. Nemo 10:20, 11 February 2019 (UTC)
I have fixed this specific link with IABot and added the correct title myself.
Redalert2fan (
talk) 18:59, 13 February 2019 (UTC)
Note, also, in
an edit the bot made earlier this month, it altered the same citation but without changing the spacing... so I'm not sure why it made the change as a separate edit a couple of weeks later.
EdChem (
talk) 14:21, 15 February 2019 (UTC)
Really hard to see in that diff, but I think this will do it. At the very least, it will crank down the greediness. https://github.com/ms609/citation-bot/pull/1343 AManWithNoPlan ( talk)