This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page.
Possibly from garbage Pubmed metadata
Headbomb { t · c · p · b} 21:57, 2 October 2023 (UTC)
The issue here should be to recognize language=rup
Headbomb { t · c · p · b} 22:41, 13 October 2023 (UTC)
From discussions (1, 2, 3) on stopping useless cruft – for example this useless blank archive of a Jstor article – from semi-automated mass archiving, a number of editors have noted their support for a bot to parse Jstor and Worldcat URLs (e.g. https://www.jstor.org/stable/24432812) for their respective |jstor=24432812 and |oclc= parameters where relevant and purge URLs, archive URLs, and archive metadata for CS1 templates.
Is this something that can be done with citation bot? I will note that I'm not saying to purge all URLs – they can be useful if the full text is separately hosted elsewhere – just URLs and archives thereof (almost always useless blank pages) that are duplicative of the generated parameter URLs. Tagging GreenC. Ifly6 ( talk) 06:19, 22 September 2023 (UTC)
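For illustration only, the request above could be sketched roughly as follows. This is not Citation bot's code: the function names, the regular expression, and the set of fields treated as archive metadata are all assumptions, and it only handles plain numeric jstor.org/stable/ URLs.

```python
import re
from typing import Dict, Optional

# Assumption: only plain numeric "stable" URLs are treated as duplicates of |jstor=.
_JSTOR_STABLE = re.compile(r"^https?://(?:www\.)?jstor\.org/stable/(\d+)/?$")

# Assumption: these are the fields that only make sense alongside |url=.
_URL_DEPENDENT = {"url", "access-date", "archive-url", "archive-date", "url-status"}


def jstor_id_from_url(url: str) -> Optional[str]:
    """Return the numeric JSTOR id if the URL is a plain stable link, else None."""
    match = _JSTOR_STABLE.match(url.strip())
    return match.group(1) if match else None


def drop_duplicate_jstor_url(params: Dict[str, str]) -> Dict[str, str]:
    """Return a copy of the citation parameters with a duplicative |url= removed.

    The URL and its archive metadata are removed only when the URL carries the
    same id as |jstor= (or |jstor= is empty); anything else is left untouched.
    """
    jstor_id = jstor_id_from_url(params.get("url", ""))
    if jstor_id is None:
        return dict(params)
    if params.get("jstor") not in (None, "", jstor_id):
        return dict(params)  # conflicting ids: leave for a human to check
    cleaned = {k: v for k, v in params.items() if k not in _URL_DEPENDENT}
    cleaned["jstor"] = jstor_id
    return cleaned
```

URLs hosted elsewhere are left alone, which matches the caveat above about full texts that are separately hosted.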
|url=
|jstor=24432812
)Is there really nothing we can do on this without an RFC? Ifly6 ( talk) 17:13, 25 September 2023 (UTC)
|url=
entry would prevent "archive" links from being added. It is a dispute over whether a reference should look like this:
{{Cite journal |last=Steel |first=Catherine |date=2014 |title=The Roman senate and the post-Sullan "res publica" |journal=Historia: Zeitschrift für Alte Geschichte |volume=63 |issue=3 |pages=323–339 |doi=10.25162/historia-2014-0018 |jstor=24432812 |s2cid=151289863 |issn=0018-2311 }}
or like this:
{{Cite journal |last=Steel |first=Catherine |date=2014 |title=The Roman senate and the post-Sullan "res publica" |journal=Historia: Zeitschrift für Alte Geschichte |volume=63 |issue=3 |pages=323–339 |doi=10.25162/historia-2014-0018 |jstor=24432812 |s2cid=151289863 |issn=0018-2311 |url=https://www.jstor.org/stable/24432812 |access-date=26 May 2022 |archive-date=26 May 2022 |archive-url=https://web.archive.org/web/20220526152815/https://www.jstor.org/stable/24432812 |url-status=live }}
|url=
entirely duplicate existing links in the citation and regardless add nothing for the unprivileged reader while clogging up the markup and making it difficult to do the part of "editor". Even if I have Ivy League library access and am able to read all full texts through proxies (e.g. Penn Libraries), that doesn't mean that linking the proxy page whole (like https://www-jstor-org.wikipedialibrary.idm.oclc.org/) does any good for readers without Penn or Wikipedia library privileges. Ifly6 ( talk) 23:37, 25 September 2023 (UTC)
|url=
https://www.jstor.org/stable/24432812 is for the "average" reader this link does nothing because they will not have a Jstor subscription. If adding it is to help the "average" university student, the link also does nothing because they will have to go through their university proxy. If it is to help the privileged editor with WP:LIBRARY access, it also does nothing because we have to go through a proxy too. The only people it supports are those few who have direct access to Jstor (which ironically includes me via the Federal Reserve). Ifly6 ( talk) 23:43, 25 September 2023 (UTC)
|jstor= parameter lets them know that the article is on JSTOR and they will likely have access to it, and once they click on the link they can easily log in via whatever gives them access via whatever proxy, or if they're physically at their library just click the link and access it. The JSTOR link also provides metabibliographic information, a first page preview, and abstract. Plus JSTOR allows independent researchers 100 free articles each month, and if someone so chooses they have the option to buy it à la carte. Anything which helps a reader access a source is useful, and quite often JSTOR is the electronic place of record for a journal. [Edit: sorry I'm following more closely now, I still think it should be in |jstor= -- that's why we have that parameter; it does not also belong in |url=.] Umimmak ( talk) 23:49, 25 September 2023 (UTC)
What do you mean "the link does nothing"? The link to the native Jstor website in |url= is not the proper one and will not yield the full text unless you have direct Jstor access. If you access it through a proxy, you would have to copy the Jstor ID and paste it in after ../static/. Putting the direct URL in |url= is not very useful and largely facilitates WP:MEATBOTs crufting up articles with unnecessary markup pointing to blank archive pages. Ifly6 ( talk) 01:13, 26 September 2023 (UTC)
|jstor=. Apparently there was an RFD, ban, or something of the sort which has led to the maintainer(s) of the bot not being willing to re-enable that previously-present functionality. Ifly6 ( talk) 21:54, 27 September 2023 (UTC)
|access-date= in {{cite book}} and {{cite journal}} (and maybe others) should be removed, we'll have solved almost the entire problem of these kinds of URLs without needing to determine whether or not readers / editors will understand the alternative stable identifiers. Folly Mox ( talk) 04:44, 30 September 2023 (UTC)
Well that issue is why we're here at Citation bot. Do you think it's actually impossible to get a decision for Citation bot to remove those URLs? A bot to remove those archives would produce even more watchlist events, which people in the discussion below seem to be adamantly against, while also probably being impossible to implement per GreenC's comment above. Ifly6 ( talk) 19:36, 1 October 2023 (UTC)
permalive for IABot is also a reasonable first step. Folly Mox ( talk) 22:09, 1 October 2023 (UTC)
|url= including a "jstor.org/stable/XXXXX" to turn it into a JSTOR 10.5555/al.ap.person.bm000000658 this would not work; occasionally JSTOR the website gets cited instead of a book/article it is hosting, so bots should be aware of this. Umimmak ( talk) 21:20, 5 October 2023 (UTC)
{{wontfix}} because people are whiners. AManWithNoPlan ( talk) 20:41, 24 October 2023 (UTC)
The bot never respected {{in use}} because very often people who use {{in use}} will also use the bot to expand citations. Headbomb { t · c · p · b} 22:42, 13 October 2023 (UTC)
From my discussion page: Hi, I see that you have used citation bot to add dates to references to numismatics.org.uk webpages here. I am not familiar with the bot, so could you explain what the dates mean? The pages seem to be updated regularly.
I think the bot is wrong. Grimes2 ( talk) 14:41, 13 October 2023 (UTC)
meta property="article:published_time" instead of meta property="article:modified_time". Folly Mox ( talk) 16:00, 13 October 2023 (UTC)
[10] 2001:14BA:9CE5:8400:20AB:2C62:7318:4F88 ( talk) 04:35, 22 October 2023 (UTC)
elonet.finna.fi but also elonet.fi it seems: [13], [14] (probably some more). Please stop the bot from changing the citation templates of elonet.fi and finna.fi from "cite web" to "cite book", thank you. 2001:14BA:9CE5:8400:79D9:9129:F234:CDFA ( talk) 20:03, 24 October 2023 (UTC)
At the Kenny Clarke article in the oral history ref, the bot changes "Cite web" to "Cite journal" without changing any other parameters, causing this error message. While checking hidden categories on that page, I discovered that the bot did this in June 2022 and I reproduced the problem just now. Graham87 ( talk) 06:50, 23 October 2023 (UTC)
|work=
AManWithNoPlan ( talk) 20:56, 24 October 2023 (UTC)
Spinixster (chat!) 12:53, 30 September 2023 (UTC)
}}
.
The first instance, starting from my edit:
|access-date=February 13, 2011 |url-status=dead}}</ref> (by me) (and 3 more like it) (Not official, though I don't see why.) (Nannyware keeps me from viewing any "archive" websites and/or I didn't have time.)
|access-date=February 13, 2011 }}{{dead link|29 September 2023}}</ref> (by Citation bot, Misc citation tidying...) (Made it official, but wrong format) (only got 3 of 4 instances) (suboptimal - should be no space before the closing }}.)
|access-date=February 13, 2011 }}{{dead link|date=September 2023}}</ref> (by AnomieBOT, Dating maintenance tags...) (Corrected the 3 changed by Citation bot.)
|access-date=February 13, 2011 |archive-date=December 18, 2010 |archive-url=https://web.archive.org/web/20101218224918/http://makingitbigcareers.com/books/making-it-big-in-software/mark-russinovich/ |url-status=dead }} (by AManWithNoPlan, Rescuing 4 sources and tagging 0 as dead.) #IABot (v2.0.9.5)) (suboptimal - field order should be |access-date= |url-status= |archive-url= |archive-date=.)
Today I do not see "Expand citations" in my tools menu. I do not know what caused it to disappear.
Treat as if /pdf-vor or /pdf isn't there. Headbomb { t · c · p · b} 01:33, 2 November 2023 (UTC)
That is super odd.
AManWithNoPlan ( talk) 21:24, 2 November 2023 (UTC)
changing a date to today and breaking a ref in the process. I am sick and tired of Citation bot going around breaking sfn refs willy nilly. DuncanHill ( talk) 22:05, 2 November 2023 (UTC)
{{ fixed}} the bug that was causing extra book clean-ups. But, no idea where that date came from. AManWithNoPlan ( talk) 00:55, 5 November 2023 (UTC)
1. Bot seems to be replacing (only very occasionally) "year=[a year]" with "date=[today's date]" - there seems to be no particular pattern to which sources these are, and none of them are 2023 sources.
2. Where a journal is published in single-issue volumes (this particular one is Archaeological Reports), the bot has erroneously expanded "volume=14" to "volume=14|issue=14", though no issue 14 existed that year or ever.
(Edit:) 3. Replaced:
{{cite book| last=Gates| first=Charles| year=2004| chapter=The Adoption of Pictorial Imagery in Minoan Wall Painting: A Comparativist Perspective| series=Hesperia Supplements| title= ΧΑΡΙΣ: Essays in Honor of Sara A. Immerwahr| editor-last=Chapin| editor-first=Anne P.| publisher=American School of Classical Studies at Athens| place=Athens| isbn=978-0-87661-533-1| pages=27–46| jstor=1354061}}
with
{{cite book| last=Gates| first=Charles| title=The Adoption of Pictorial Imagery in Minoan Wall Painting: A Comparativist Perspective| series=Hesperia Supplements| date=4 November 2023| volume=33| editor-last=Chapin| editor-first=Anne P.| publisher=American School of Classical Studies at Athens| place=Athens| isbn=978-0-87661-533-1| pages=27–46| jstor=1354061}}
Overriding the (correct) description of this article as a chapter in a Festschrift and that Festschrift's title.
4. It added the page range 1-370 to a cite book template, where the whole book is being cited.
And similar wikilink parameters. Headbomb { t · c · p · b} 09:12, 17 October 2023 (UTC)
The following? (this would be compared after removing numbers and dashes):
authorlink chapterlink contributorlink editorlink episodelink interviewerlink inventorlink serieslink subjectlink titlelink translatorlink
AManWithNoPlan ( talk) 13:22, 25 October 2023 (UTC)
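A minimal sketch of the comparison described above (strip digits and dashes, then look the name up in the list). The function name is hypothetical, lower-casing is an added assumption, and what the bot then does with a matching parameter is not shown here.

```python
import re

# The list proposed above, already in "numbers and dashes removed" form.
LINK_PARAMS = {
    "authorlink", "chapterlink", "contributorlink", "editorlink",
    "episodelink", "interviewerlink", "inventorlink", "serieslink",
    "subjectlink", "titlelink", "translatorlink",
}


def is_link_param(name: str) -> bool:
    """True if the parameter name is one of the wikilink parameters.

    Digits and dashes are removed before comparing, so author-link,
    author-link2 and author2-link all normalise to "authorlink".
    """
    normalized = re.sub(r"[\d\-]", "", name.strip().lower())
    return normalized in LINK_PARAMS
```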
|chapter-link=
.
That is the title CrossRef has: https://search.crossref.org/?from_ui=yes&q=10.1063%2Fpt.6.4.20200327a AManWithNoPlan ( talk) 12:23, 5 November 2023 (UTC)
Below is a method to detect replacement characters. (Not pretty but works.) I added some inline comments because it's an obscure language.
isbinary() in Nim:
    #
    # Return true if string contains a 'replacement' or binary character (black diamond with ? in middle)
    #   Based on: https://unix.stackexchange.com/questions/474709/how-to-grep-for-unicode-in-a-bash-script/474812#474812
    #   Requires a secondary shell layer so UTF-8 works
    #     tcsh -s 'grep -axv ".*" <filename>'
    #
    proc isbinary*(s: string): bool {.discardable.} =
      result = false                                      # default return value
      let tmpfile = mktempname(GX.ramdir & "isbinary.")   # Generate a temporary and unique filename "isbinary.xxx" to be located in a ramdisk directory for speed
      s >* tmpfile                                        # Write the string to the tempfile
      let command1 = "tcsh -c 'grep -axv \".*\" \"" & tmpfile & "\" | wc -l'"   # need to use tcsh -c for UTF-8 to work. Bash with similar -c might also work.
      let c1 = runshellBasic(command1)                    # run the shell command and capture output to c1
      if strip(c1) !~ "^0$":                              # If the output is not "0" (only) then it contains a replacement character.
        result = true
      removeFile(tmpfile)                                 # Delete the temp file and return 'result'
— Preceding unsigned comment added by GreenC ( talk • contribs) 16:32, 5 November 2023 (UTC)
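For comparison, a rough Python equivalent that avoids the shell round-trip. It is a sketch, not what either bot runs, and the two function names are made up here. It separates two cases that are easy to conflate: text that already contains U+FFFD, and raw bytes that are not valid UTF-8 and would therefore be shown as U+FFFD once decoded leniently.

```python
REPLACEMENT_CHAR = "\ufffd"  # the "black diamond with ?" glyph


def has_replacement_char(text: str) -> bool:
    """True if already-decoded text contains U+FFFD."""
    return REPLACEMENT_CHAR in text


def is_invalid_utf8(raw: bytes) -> bool:
    """True if the raw bytes cannot be decoded as strict UTF-8."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return True
    return False
```

Which of the two checks applies depends on whether the metadata arrives as bytes or as an already-decoded string.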
As you know the "automatic" option in the Visual Editor cite button does not support any pdfs, perhaps because it would be too slow. As this bot is not constrained as much for time it would be great if the bot could expand more pdfs from major organisations. For example the second cite in Agriculture_in_Turkey#Subsidies namely https://www.g20.org/content/dam/gtwenty/gtwenty_new/document/G20_Report_on_Macroeconomic_impacts.pdf Chidgk1 ( talk) 09:08, 6 November 2023 (UTC)
{{ wontfix}} - PDF files, and the bot uses https://en.wikipedia.org/api/rest_v1/#/Citation/getCitation which is outside our control
{{citation}} is such a pain to |work=with
|chapter= to {{citation}} templates that have |work=. |work= (aliases: |journal=, |magazine=, |newspaper=, |periodical=, |website=) configures {{citation}} as a periodical template. |chapter= (aliases |contribution=, |entry=, |article=, |section=) is not supported in the cs1 periodical templates ({{cite journal}}, {{cite magazine}}, {{cite news}}, {{cite periodical}}, {{cite web}}) and not supported in the cs2 {{citation}} template when it is configured as a periodical template.
{{citation}} templates are best written as {{cite Australian Dictionary of Biography}} templates. In no case should the bot add a |chapter= alias to {{citation}} when there is a |work= alias with an assigned value present in {{citation}}.
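The rule in the comment above can be written as a small guard. This is only a sketch: the function name and the dict-of-parameters representation are assumptions, and the alias lists are copied from that comment rather than from the cs1|2 module.

```python
from typing import Dict

# Alias lists as given in the comment above.
WORK_ALIASES = ("work", "journal", "magazine", "newspaper", "periodical", "website")
PERIODICAL_TEMPLATES = {
    "cite journal", "cite magazine", "cite news", "cite periodical", "cite web",
}


def may_add_chapter(template_name: str, params: Dict[str, str]) -> bool:
    """Return False where adding |chapter= (or an alias) would not be supported."""
    name = template_name.strip().lower()
    if name in PERIODICAL_TEMPLATES:
        return False  # cs1 periodical templates do not support |chapter=
    if name == "citation":
        # {{citation}} acts as a periodical template once any work alias has a value.
        return not any(params.get(alias, "").strip() for alias in WORK_ALIASES)
    return True
```

An empty |work= does not count as "an assigned value", so it does not block the addition.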
Although this is a very useful bot I am struggling to understand what it can and cannot do and how it works.
I recently submitted a bug report, and a couple of enhancement requests asking if the bot could be run on pdf files and they were immediately closed because there is an api which is "outside our control".
Could the "function summary" be rewritten with a first paragraph to explain what the bot does in very simple terms and a second para to explain how it works in very simple terms and relegate the technical explanation to later paragraphs?
Also it would be useful if the "won't fix" could be left here for a couple of days for us to read rather than being immediately archived.
Chidgk1 ( talk) 12:21, 7 November 2023 (UTC)
And it should leave every other 'I' alone too. This is particularly annoying. The only 'I's that need capitalization are those from Part I, Section I, etc. Headbomb { t · c · p · b} 22:41, 10 November 2023 (UTC)
|title= parameter scraped from meta tags in the HTML header, and absolutely nothing else.
If the goal is to wrap a URL in a citation template so Internet Archive picks it up, and there are no good translators available for the domain, just set the |title=(URL) so it's obvious the citation is incomplete and needs work. This sort of lazy not-citation is essentially worthless, and encourages people to use scripts for tasks the scripts are not ready to handle, instead of putting in the one minute of work it takes to create a real citation by looking at the source.
If Citation bot can't figure out anything from the URL except the title, it should either leave the link alone, set the title to the URL, or tag its change with a template like {{ citation needs human review}} so this sort of garbage can be tracked.
Apologies for the strong language, but if we train a whole generation of editors to rely on pushbutton non-solutions, the maintenance burden of trash citations is going to outpace our capacity and never be fixed. Folly Mox ( talk) 12:06, 20 November 2023 (UTC)
Also Category:CS1 errors: dates Headbomb { t · c · p · b} 12:23, 26 November 2023 (UTC)
|last1=| (the pipe character |).
The garbage human-entered title prevented the full expansion. Wondering if we can't just yeet the title out when converting a cite web to a cite journal/book for ieeexplore links. It's a highly-reliable database. Either way, the website= parameter should be nuked. Headbomb { t · c · p · b} 00:02, 16 November 2023 (UTC)
and
[41] (here I manually switched |journal=
to |series=
"|work= ignored"
zero width space character in |title= at position 1
"
On Wikipedia, we follow MOS:TITLECAPS. If journals want to style themselves differently, that is up to them, but we're not bound to follow. Headbomb { t · c · p · b} 20:54, 2 December 2023 (UTC)
Per the update description in the category. Headbomb { t · c · p · b} 23:04, 2 December 2023 (UTC)
Fixed AManWithNoPlan ( talk) 02:13, 3 December 2023 (UTC)
|volume=, |issue=, and |journal= are already present in the citation. No isbn is given in the metadata of the target article. |journal= is altered to |series=, refreshingly avoiding the "periodical ignored" error, but this change doesn't appear to make any sense.
Not sure how to handle that, in general. AManWithNoPlan ( talk) 01:53, 4 December 2023 (UTC)
{{citation}} template to make a 'book' reference. Bot changed |title= to |chapter=, added a new |title= that more-or-less duplicated existing |journal=, and |volume=.
{{citation}} uses the work parameters (|journal=, |magazine=, |newspaper=, |periodical=, |website=, |work=) to switch from its default 'book' format (|title= rendered in an italic font) to its 'work' format (|title= rendered in an upright font with quote marks). When making these types of edits to {{citation}} templates (it desires to make a 'book' reference), the bot should remove work parameters (in this case |journal=).
That URL was already broken before Citation bot got to it. Truncating it at the dash glyph might have fixed it. Folly Mox ( talk) 01:19, 3 December 2023 (UTC)
{{citation}}: CS1 maint: location missing publisher (link). The series is an essential part of the entry: it provides the necessary context for the volume number, and shows that this was the 150th volume published by the society, whereas its absence suggests that the history is itself a work of 150 or more volumes.
See also User:Citation_bot/use#..._the_bot_made_a_mistake?. Headbomb { t · c · p · b} 00:18, 15 November 2023 (UTC)
|series=Publications of the Surtees Society rather than the shorter removed series name. No idea whether this would affect the bot's attempted removals. It also needs a publisher; following the same catalog entry, it looks like |publisher=Andrews & co. and B. Quaritch for the Surtees Society would be accurate. David Eppstein ( talk) 01:29, 15 November 2023 (UTC)
Failing to understand "OUP Academic" as "Oxford University Press" (already present, correctly in |publisher=) is one thing; adding an unsupported |journal= parameter to {{cite book}} is something I thought Citation bot was better than. Folly Mox ( talk) 20:28, 11 November 2023 (UTC)
doi:10.1186/s40645-015-0068-z and doi:10.1007/s00445-020-01384-6 are both of the new problematic "article number" type. |issue= is wrong, and |page= is less than ideal, but it is the best that CS1 and CS2 have for us at the moment. The journals clearly state not to use issue for these in the "how to cite" areas. AManWithNoPlan ( talk) 20:54, 7 November 2023 (UTC)
{{cite journal}} supports |article-number=:
{{cite journal |last1=Geshi |first1=N. |last2=Yamada |first2=I. |last3=Matsumoto |first3=K. |last4=Nishihara |first4=A. |last5=Miyagi |first5=I. |title=Accumulation of rhyolite magma and triggers for a caldera-forming eruption of the Aira Caldera, Japan |journal=Bulletin of Volcanology |volume=82 |article-number=44 |year=2020 |doi=10.1007/s00445-020-01384-6 |url=https://link.springer.com/article/10.1007/s00445-020-01384-6}}
|id={{Project MUSE|59700|type=book}} instead of URL https://muse.jhu.edu/book/59700
Maybe 5 years ago, IABot had a bug that added a "#" to the end of every archive URL, and sometimes the source URL. The bug is long fixed, and WaybackMedic has been removing the errant #'s, but it's a cosmetic edit that can only be done when making another edit to the page, so it's been a long process. There are a lot of them. An example: Special:Diff/1183493290/1185983128 (second change). My code below if interested, no edge cases, simply removing any trailing # from the URLs. It won't break the archive URL.
Extended content:
    # Fix trailing # in |url and |archive-url added by IABot 2.0 beta10
    psplit(GX.articlework, GX.cite2, p):
      if isarg("archive-url", "value", p.field[i]) and isarg("url", "value", p.field[i]):
        archiveurl = getarg("archive-url", "clean", p.field[i])
        sourceurl = getarg("url", "clean", p.field[i])
        j = 0
        if archiveurl ~ "[#]$":
          inc(j)
          sub("[#]$", "", archiveurl)
          p.field[i] = replacearg(p.field[i], "archive-url", archiveurl, "cosmetic1.1")
        if sourceurl ~ "[#]$":
          inc(j)
          sub("[#]$", "", sourceurl)
          p.field[i] = replacearg(p.field[i], "url", sourceurl, "cosmetic1.2")
        if j > 0:
          p.ok += inclog("cosmetic1.1", GX.esformat, Project.logiats, &"{archiveurl} ---- remove trailing #")

psplit() iterates over every cite template which are held in p.field[i]
Green C 17:35, 20 November 2023 (UTC)
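The same logic, restated as a self-contained Python sketch for anyone who wants to port it. The dict-of-parameters representation and the function name are assumptions, not part of WaybackMedic or Citation bot. As in the code above, nothing is touched unless both |url= and |archive-url= are present, and only a single trailing # is removed.

```python
from typing import Dict, Tuple


def strip_trailing_hash(params: Dict[str, str]) -> Tuple[Dict[str, str], int]:
    """Remove one trailing '#' from |archive-url= and |url=.

    Returns the cleaned copy and the number of fields changed, so a caller
    can treat the change as cosmetic and skip the save when it is the only edit.
    """
    cleaned = dict(params)
    changed = 0
    if "url" in cleaned and "archive-url" in cleaned:
        for key in ("archive-url", "url"):
            if cleaned[key].endswith("#"):
                cleaned[key] = cleaned[key][:-1]
                changed += 1
    return cleaned, changed
```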
Still needs fixed. Fix did not work. AManWithNoPlan ( talk) 00:41, 5 December 2023 (UTC)
Covers both Proceedings of Science and Journal of Science Communication, from SISSA. The other SISSA journals have different prefixes. Headbomb { t · c · p · b} 02:25, 5 December 2023 (UTC)
|title= → |chapter= and |journal= → |title=, hooray!). These two misidentifications don't – to my knowledge – even change the display of the article, but they are still incorrect. The first one I changed back to {{cite web}}. The second I did the same: although properly it should be {{cite dictionary}}, it rather obviously failed verification so I didn't bother.
WikiJournals Headbomb { t · c · p · b} 15:36, 10 December 2023 (UTC)
Unfortunately this journal is not preserved so there are no archived copies either. Nemo 11:19, 3 December 2023 (UTC)
|doi-broken-date=. Headbomb { t · c · p · b} 15:01, 8 December 2023 (UTC)
|citeseerx= and |s2cid=.
This was my fix: changing back to {{cite web}}, adding the url of the source, and an unrelated fix to |publisher=. I'm not sure this is really Citation bot's fault, or if maybe the parameter set supported by {{cite document}} ought to be expanded to allow for more stable identifiers. Pinging Trappist the monk as the template maintainer, to see if they have input. Folly Mox ( talk) 19:11, 9 December 2023 (UTC)
{{cite web}} to {{cite document}} is not going to work when |url=, |citeseerx=, and |s2cid= have assigned values. |s2cid= is excluded from {{cite document}} because links to readable copies of the source from that identifier are hit-or-miss at best (recall the plethora of complaints about the bot adding |s2cid= that have been voiced on this talk page). |citeseerx= is excluded because we have {{cite citeseerx}}.
|citeseerx=10.1.1.42.3374, an alternate fix might be:
{{cite citeseerx |last=Wirz |first=Marc |title=Characterizing the Grzegorczyk hierarchy by safe recursion |date=November 1999 |citeseerx=10.1.1.42.3374}}
{{cite document}} is a 'last resort' sort of template when absolutely none of the other cs1|2 templates apply. The bot should avoid using {{cite document}} because, almost always, there is a better choice.
Headbomb { t · c · p · b} 01:20, 11 December 2023 (UTC)
Fixed AManWithNoPlan ( talk) 13:27, 11 December 2023 (UTC)
Note to me for when I have time. AManWithNoPlan ( talk) 16:26, 10 December 2023 (UTC)
|title= parameter with incorrect characters, adding the article to Category:CS1 errors: invisible characters (e.g. this edit in Malayalam).
|title= (and maybe even |website=) (e.g. this edit).
Here's
another edit in Gujarati. — Preceding
unsigned comment added by
GoingBatty (
talk •
contribs)
20:11, 12 December 2023 (UTC)
The character is ​ (zero width space). Headbomb { t · c · p · b} 00:07, 13 December 2023 (UTC)
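A small sketch of a cleanup step for this case, assuming the title is available as a decoded string. The function names are hypothetical. Only U+200B is removed: the zero width joiner and non-joiner (U+200D, U+200C) are deliberately left alone because they can carry meaning in some scripts, so stripping them blindly could damage titles. The reported position is 1-based, on the assumption that this matches the position given in the CS1 error message.

```python
from typing import Optional

ZERO_WIDTH_SPACE = "\u200b"


def find_zero_width_space(value: str) -> Optional[int]:
    """Return the 1-based position of the first U+200B, or None if absent."""
    index = value.find(ZERO_WIDTH_SPACE)
    return index + 1 if index >= 0 else None


def strip_zero_width_space(value: str) -> str:
    """Remove every U+200B and leave all other characters untouched."""
    return value.replace(ZERO_WIDTH_SPACE, "")
```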
Headbomb { t · c · p · b} 01:22, 15 December 2023 (UTC)
There is a pull request that allows specifying name list style for newly-added name entries: https://github.com/ms609/citation-bot/pull/4236
It adds an option alongside the already existing style of first1/last1, first2/last2, etc.
This pull request introduces the following functionality. If a page contains {{Use vanc name-list-style}} template, then the bot will use |vauthors= and |veditors= attributes rather than firstN/lastN and editor-firstN/editor-lastN when adding name entries for a citation template if the names were not specified in this template. This is similar to {{Use dmy dates}} template when the bot uses date format as specified on the page. To reproduce this behaviour, edit a page on Wikipedia, add {{Use vanc name-list-style}} template (or {{Use vanc name-list-style|date=December 2023}}), delete author names (firstN/lastN) and run the bot. It will fill the names as vauthors. Maxim Masiutin ( talk) 16:48, 7 December 2023 (UTC)
{{Use vanc name-list-style}} exist? Was there any discussion that brought it into existence? cs1|2 doesn't know anything about that template but will understand {{CS1 config|name-list-style=vanc}}. Why create a new otherwise non-functional template?
{{use dmy dates}} and {{use mdy dates}} to control date formatting when cs1|2 templates are rendered. See Template:Use dmy dates § Auto-formatting citation template dates for example. I see no reason to keep {{Use vanc name-list-style}}.
{{use xxx dates}} templates came first (January 2009). Development of Module:Citation (the predecessor to Module:Citation/CS1) began August 2012. Auto date formatting was added to Module:Citation/CS1 April 2019. Support for {{CS1 config}} was added August 2023. {{CS1 config}} applies only to cs1|2 templates but the {{use xxx dates}} templates apply to both the article body and to article referencing (regardless of how referencing is implemented).
{{CS1 config}} because that template has functionality beyond being a simple flag template.
Would it be possible to expand from non-templated reference <ref>[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5553785/ Bar]</ref>, as long as |title= would be exactly the same (Bar) which already exists for the URL specified as if the bot would try to expand the bare URL (as long as there is no other content in the ref)? Jonatan Svensson Glad ( talk) 17:16, 24 July 2023 (UTC)
{{cite journal}} template. In that template: |journal=arXiv, |doi=10.48550/arXiv.2206.12231, and |doi-access=free. The only action that the bot took was to convert |doi=10.48550/arXiv.2206.12231 to |arxiv=2206.12231.
arXiv is not a journal so {{cite journal}} is the wrong template; should be changed to {{cite arxiv}}. When removing |doi=, the bot should always remove |doi-access=. Remember that {{cite arxiv}} supports a limited subset of the whole cs1|2 parameter set so other parameters in a {{cite journal}} → {{cite arxiv}} conversion may need to be removed. The limited parameter set is defined in Module:Citation/CS1/Whitelist lines 340–346.
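The |doi= and |doi-access= pairing could be handled in a single pass along these lines. This is a sketch under assumptions: the function name and dict representation are made up, and it does not attempt the {{cite journal}} → {{cite arxiv}} rename or the whitelist pruning mentioned above.

```python
import re
from typing import Dict

# DOIs registered by arXiv have the form 10.48550/arXiv.<id>.
_ARXIV_DOI = re.compile(r"^10\.48550/arxiv\.(.+)$", re.IGNORECASE)


def convert_arxiv_doi(params: Dict[str, str]) -> Dict[str, str]:
    """Replace an arXiv DOI with |arxiv= and drop |doi-access= with it."""
    out = dict(params)
    match = _ARXIV_DOI.match(out.get("doi", "").strip())
    if not match:
        return out
    out["arxiv"] = match.group(1)
    del out["doi"]
    out.pop("doi-access", None)  # meaningless once |doi= is gone
    return out
```

Removing both parameters together avoids needing a second bot run to clean up the orphaned |doi-access=.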
If you run the bot again, then it does clean up. I will look at having it not take two times. AManWithNoPlan ( talk) 15:10, 9 December 2023 (UTC)
According to Help:Citation Style, an author may be cited using separate parameters for the author's surname and given name by using |last= and |first=.
However, the bot replaces |last= and |first= with |last1= and |first1= even when there is just one author, which is contrary to the description of the CS1 Citation Style.
The bot probably need not change back the ones it has already replaced, but it should definitely avoid making this change in the future. Also, when no authors are specified and there is a single author, the bot should use |last= and |first=.
If you agree with that, I can try to submit a pull request. Maxim Masiutin ( talk) 15:38, 9 December 2023 (UTC)
Citation bot changed one of the source descriptions in the article James Hamilton (English Army officer) from:
{{Cite web|last=Smith |first=Geoffrey |date=May 2006 |title=Armorer, Sir Nicholas (c.1620–1686) |website=[[Oxford Dictionary of National Biography]] |doi=10.1093/ref:odnb/94686 |url=http://www.oxforddnb.com/index/94686/ |access-date=13 May 2023 |url-access=subscription}}
to:
{{Cite ODNB|last=Smith |first=Geoffrey |date=May 2006 |title=Armorer, Sir Nicholas (c.1620–1686) |doi=10.1093/ref:odnb/94686 |url=http://www.oxforddnb.com/index/94686/ |access-date=13 May 2023 |url-access=subscription}}
I wondered why. I read up on Template:ODNB. It says it is a wrapper around Template:Cite encyclopedia. Well, perhaps I should not have used "Cite web" but "Cite encyclopedia" and Citation bot should probably have corrected me to:
{{Cite encyclopedia|last=Smith |first=Geoffrey |date=May 2006 |title=Armorer, Sir Nicholas (c.1620–1686) |encyclopedia=[[Oxford Dictionary of National Biography]] |edition=online |publisher=[[Oxford University Press]] |doi=10.1093/ref:odnb/94686 |url=http://www.oxforddnb.com/index/94686/ |access-date=13 May 2023 |url-access=subscription}}
However, I do not understand why we should be forced to use a wrapper around Cite encyclopedia rather than the original. I thought the use of the ODNB template was voluntary and not obligatory. With thanks and best regards Johannes Schade ( talk) 13:10, 19 November 2023 (UTC)
|pages totales=
to |pages=
|pages totales=
, as the total number of pages is not supported in the English Wikipedia citation templates.
Note, it shouldn't remove the extra pipe.
Headbomb { t · c · p · b} 18:54, 17 December 2023 (UTC)
This edit changed a proceedings title from the version given by DBLP ("Proceedings of the 22nd Annual European Symposium on Algorithms (ESA 2014), Wroclaw, Poland, September 8–10, 2014") to a much more concise version from another source ("Algorithms - ESA 2014"), maybe the publisher or maybe MathSciNet (both list it that way). Note that the actual publisher page for the full proceedings lists it as having the more detailed title "Algorithms - ESA 2014: 22th Annual European Symposium, Wrocław, Poland, September 8-10, 2014. Proceedings". The DBLP title is more or less what you get if you put that into a more intelligible order. Curiously, the bot left the DBLP title in place for the other citation it touched, from WG '92. I think that the DBLP version is better and that this level of change (not the correction of any actual error in a citation) constitutes WP:CITEVAR. Please stop. David Eppstein ( talk) 06:46, 30 October 2023 (UTC)
with the MathML text
This conflicts with the maths extension and in turn causes a maths syntax error.
The title in CrossRef is "Measurement of lepton universality parameters in \n<mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\" display=\"inline\"><mml:msup><mml:mi>B<\/mml:mi><mml:mo>+<\/mml:mo><\/mml:msup><mml:mo stretchy=\"false\">\u2192<\/mml:mo><mml:msup><mml:mi>K<\/mml:mi><mml:mo>+<\/mml:mo><\/mml:msup><mml:msup><mml:mo>\u2113<\/mml:mo><mml:mo>+<\/mml:mo><\/mml:msup><mml:msup><mml:mo>\u2113<\/mml:mo><mml:mo>\u2212<\/mml:mo><\/mml:msup><\/mml:math>\n and \n<mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\" display=\"inline\"><mml:msup><mml:mi>B<\/mml:mi><mml:mn>0<\/mml:mn><\/mml:msup><mml:mo stretchy=\"false\">\u2192<\/mml:mo><mml:msup><mml:mi>K<\/mml:mi><mml:mrow><mml:mo>*<\/mml:mo><mml:mn>0<\/mml:mn><\/mml:mrow><\/mml:msup><mml:msup><mml:mo>\u2113<\/mml:mo><mml:mo>+<\/mml:mo><\/mml:msup><mml:msup><mml:mo>\u2113<\/mml:mo><mml:mo>\u2212<\/mml:mo><\/mml:msup><\/mml:math>\n decays" which makes it difficult to clean up. I tags that are not used are the annoying ones.
https://github.com/ms609/citation-bot/commit/36648e552b4bf9b4f1e7ff1c88383701e79c95e0 AManWithNoPlan ( talk) 21:02, 10 December 2023 (UTC)
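To show why titles like this are awkward, here is a deliberately lossy sketch that flattens the MathML in a JSON-decoded CrossRef title to plain text. It is not what the linked commit does (that code is not reproduced here), the function name is made up, and superscript structure is simply discarded, so the output would still need human review.

```python
import re

# Matches one <mml:math ...> ... </mml:math> block, including line breaks.
_MML_BLOCK = re.compile(r"<mml:math\b.*?</mml:math>", re.DOTALL)
_ANY_TAG = re.compile(r"<[^>]+>")


def flatten_mathml(title: str) -> str:
    """Strip tags inside MathML blocks and collapse whitespace.

    Markup outside the MathML blocks is left untouched. The result loses
    all structure (superscripts, grouping) and keeps only the characters.
    """
    flat = _MML_BLOCK.sub(lambda m: _ANY_TAG.sub("", m.group(0)), title)
    return " ".join(flat.split())
```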
<math> tags inside a <nowiki>. There is another citation in the same revision (actually the same paper) where this has been done. Probably anything with mathematics in it is going to need human attention. Some sort of tracking for these occurrences could be useful. -- Salix alba ( talk): 12:15, 12 December 2023 (UTC)
curl -LH "Accept: application/x-bibtex" https://doi.org/10.48550/arXiv.2212.09152
title = {Test of lepton universality in $b \rightarrow s \ell^+ \ell^-$ decays}
Bot rebooted to make sure no running jobs continue to use old code.
AManWithNoPlan (
talk)
15:31, 21 December 2023 (UTC)
Date/year/access-date/archive-date/etc.
X. 8 December 2022. {{cite book}}: Check date values in: |date= (help)
X. Monday, November 2, 1981. {{cite book}}: Check date values in: |date= (help)
X. 08 Dec 2023. {{cite book}}: Check date values in: |date= (help)
X. 08 December 2023. {{cite book}}: Check date values in: |date= (help)
AManWithNoPlan ( talk) 00:52, 22 December 2023 (UTC)
This edit changed links that consistently lead to the new Google books web pages to ones that do not. 50.47.144.129 ( talk) 19:49, 30 October 2023 (UTC)
The culprit is U+2010 : HYPHEN, which should be replaced with the standard U+002D : HYPHEN-MINUS.
Headbomb {
t ·
c ·
p ·
b}
03:56, 21 December 2023 (UTC)
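A minimal sketch of the substitution described above, in Python rather than the bot's own code. The function name `normalize_hyphens` and the sample value are hypothetical; which citation fields should receive this treatment is not stated in the thread and is left to the caller.

```python
HYPHEN = "\u2010"        # U+2010 HYPHEN
HYPHEN_MINUS = "\u002d"  # U+002D HYPHEN-MINUS (the ordinary ASCII "-")


def normalize_hyphens(value: str) -> str:
    """Replace every U+2010 HYPHEN with U+002D HYPHEN-MINUS.

    Hypothetical helper: other dash characters (en dash, minus sign, ...)
    are deliberately left alone.
    """
    return value.replace(HYPHEN, HYPHEN_MINUS)


if __name__ == "__main__":
    # Made-up sample value containing the problem character.
    print(normalize_hyphens("978\u20100\u2010123"))
```

Only the one character named in the report is touched, so page ranges that legitimately use an en dash would be unaffected.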
i.e. change url=www. to url=https://www. (same for chapter-url, archive-url, etc.)
Headbomb { t · c · p · b} 15:57, 16 December 2023 (UTC)
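One way the request could be sketched, assuming the citation is available as raw wikitext. The parameter tuple `URL_PARAMS` and the function `add_scheme` are illustrative names, not the bot's actual code, and the sketch does not check whether the site really serves https.

```python
import re

# Illustrative subset of URL-holding parameters; the real list is longer.
URL_PARAMS = ("url", "chapter-url", "archive-url")

# Match "|<param>=" (optional spaces) only when the value starts with "www.".
_SCHEMELESS = re.compile(
    r"(\|\s*(?:" + "|".join(map(re.escape, URL_PARAMS)) + r")\s*=\s*)(?=www\.)"
)


def add_scheme(wikitext: str) -> str:
    """Prefix scheme-less www. values of URL parameters with https://."""
    return _SCHEMELESS.sub(lambda m: m.group(1) + "https://", wikitext)


if __name__ == "__main__":
    print(add_scheme("{{cite web |url=www.example.com |title=X}}"))
```

Because the pattern requires a pipe immediately before the parameter name, a "url=www." that occurs in the middle of another URL's query string is not rewritten.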
|url=
in infoboxes (which should probably use {{
URL}}) and URLs that contain "url=" in the middle.
This modified search finds no articles to be fixed. However, if you find patterns in the category that bots can fix, please let me know.
GoingBatty (
talk)
23:11, 22 December 2023 (UTC)
There the bot is right though. The title is "Graph Drawing". As Springer themselves say, the suggested way to cite this is "Eppstein, D. (2009). Isometric Diamond Subgraphs. In: Tollis, I.G., Patrignani, M. (eds) Graph Drawing. GD 2008. Lecture Notes in Computer Science, vol 5417. Springer, Berlin, Heidelberg.
https://doi.org/10.1007/978-3-642-00219-9_37"
"16th International Symposium...." is the expanded subtitle of GD 2008. One could replace it with "Graph Drawing: 16th International Symposium..." instead of "Graph Drawing. GD 2008."
But the word "Proceedings" is nowhere in there, and shouldn't be. Headbomb { t · c · p · b} 00:42, 24 November 2023 (UTC)
If the CrossRef database contradicts the information in the article, the bot will stick with the data already in Wikipedia, and assume the error to be with CrossRef. This seems wise, and I'm wondering when the behaviour was changed, and where the consensus for the change arose. Folly Mox ( talk) 23:26, 24 November 2023 (UTC)
title-link
parameter or a wikilink in the title
itself, that seems like a pretty good sign that a human took the trouble to get the information right. A bot shouldn't override that.
XOR'easter (
talk)
18:57, 25 November 2023 (UTC)
|title-link=
is present, |title=
should not be altered outside of punctuation changes. If |periodical=
(or one of its aliases) is present, the wrapper template should not be changed to {{
cite book}}. If adding |chapter=
, and |journal=
or |issue=
is present, the wrapper template should be changed to {{
cite conference}} rather than {{
cite book}}. If none of |title=
and |chapter=
match the existing |title=
(delta punctuation), there's a mismatch between the database record and the work intending to be cited.
Folly Mox (
talk)
20:54, 25 November 2023 (UTC)
Ok, after seeing this keep going and going with no effort to fix or address the problem, I am going to start adding {{ bots|deny=Citation bot}} to all new articles I create, instead of merely the ones where I see this happening. — David Eppstein ( talk) 07:55, 3 December 2023 (UTC)
April-May 1995 to April–May 1995
December 7 2023 to December 7, 2023
AManWithNoPlan ( talk) 20:57, 23 December 2023 (UTC)
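The two fixes listed above can be sketched as follows. This is a hedged illustration only: `tidy_date` is a hypothetical name, it handles just these two patterns with full English month names, and it returns anything else unchanged.

```python
import re

MONTHS = ("January|February|March|April|May|June|July|"
          "August|September|October|November|December")

# "April-May 1995": month range written with a hyphen instead of an en dash.
_MONTH_RANGE = re.compile(rf"^({MONTHS})-({MONTHS}) (\d{{4}})$")
# "December 7 2023": month-day-year missing the comma after the day.
_MDY_NO_COMMA = re.compile(rf"^({MONTHS}) (\d{{1,2}}) (\d{{4}})$")


def tidy_date(value: str) -> str:
    """Apply the two date clean-ups; return other values untouched."""
    m = _MONTH_RANGE.match(value)
    if m:
        return f"{m.group(1)}\u2013{m.group(2)} {m.group(3)}"
    m = _MDY_NO_COMMA.match(value)
    if m:
        return f"{m.group(1)} {m.group(2)}, {m.group(3)}"
    return value


if __name__ == "__main__":
    for sample in ("April-May 1995", "December 7 2023", "8 December 2022"):
        print(repr(tidy_date(sample)))
```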
Unicode is one of the most useful tools ever invented that is also pure evil.
AManWithNoPlan (
talk)
23:36, 22 December 2023 (UTC)
It is just really dandy that the publisher reports that information as the editors.
https://api.crossref.org/works/10.1093/gmo/9781561592630.article.48611
AManWithNoPlan (
talk)
21:31, 25 December 2023 (UTC)
Isn't {{cite news}} and probably |work=The Christian Science Monitor a better choice? At The Christian Science Monitor we describe the organization as a 'nonprofit news organization that publishes daily articles both in electronic format and a weekly print edition' originally established 'as a daily newspaper'.
— Trappist the monk ( talk) 16:14, 26 December 2023 (UTC)
Seems to not be working ATM. Headbomb { t · c · p · b} 20:14, 5 January 2024 (UTC)
Would be useful to clear most of that category. Headbomb { t · c · p · b} 06:30, 5 January 2024 (UTC)
Same for
Headbomb { t · c · p · b} 07:09, 5 January 2024 (UTC)
Possibly from garbage Pubmed metadata
Headbomb {
t ·
c ·
p ·
b}
21:57, 2 October 2023 (UTC)
The issue here should be to recognize language=rup
Headbomb {
t ·
c ·
p ·
b}
22:41, 13 October 2023 (UTC)
From discussions (
1,
2,
3) on stopping useless cruft – for example
this useless blank archive of a Jstor article – from semi-automated mass archiving, a number of editors have noted their
support for a bot to parse Jstor and Worldcat URLs (eg https://www.jstor.org/stable/24432812
) for their respective |jstor=24432812
and |oclc=
parameters where relevant and purge URLs, archive URLs, and archive metadata for CS1 templates.
Is this something that can be done with citation bot? I will note that I'm not saying to purge all URLs – they can be useful if the full text is separately hosted elsewhere – just URLs and archives thereof (almost always useless blank pages) that are duplicative of the generated parameter URLs. Tagging GreenC. Ifly6 ( talk) 06:19, 22 September 2023 (UTC)
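A rough sketch of the clean-up being requested, assuming a citation has already been parsed into a plain dict of parameters. The function `dedupe_jstor_url` and the `ARCHIVE_PARAMS` tuple are hypothetical, and only purely numeric stable IDs are handled; anything else, including URLs hosted elsewhere, is left alone.

```python
import re

# Matches only plain numeric stable URLs such as
# https://www.jstor.org/stable/24432812
_JSTOR_STABLE = re.compile(r"^https?://(?:www\.)?jstor\.org/stable/(\d+)/?$")

# Parameters that only make sense while |url= is present (assumed list).
ARCHIVE_PARAMS = ("access-date", "archive-date", "archive-url", "url-status")


def dedupe_jstor_url(params: dict) -> dict:
    """Move a duplicative JSTOR |url= into |jstor= and drop archive metadata."""
    out = dict(params)
    m = _JSTOR_STABLE.match(out.get("url", ""))
    if not m:
        return out  # separately hosted full text etc. is kept as is
    jstor_id = m.group(1)
    if out.get("jstor", jstor_id) != jstor_id:
        return out  # conflicting identifiers: leave for a human
    out["jstor"] = jstor_id
    del out["url"]
    for name in ARCHIVE_PARAMS:
        out.pop(name, None)
    return out
```

Returning the dict untouched on any mismatch keeps the sketch conservative: it only removes a URL when the identifier parameter generates exactly the same link.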
|url=
|jstor=24432812
)Is there really nothing we can do on this without an RFC? Ifly6 ( talk) 17:13, 25 September 2023 (UTC)
|url=
entry would prevent "archive" links from being added. It is a dispute between whether a reference should look like this:{{Cite journal |last=Steel |first=Catherine |date=2014 |title=The Roman senate and the post-Sullan "res publica" |journal=Historia: Zeitschrift für Alte Geschichte |volume=63 |issue=3 |pages=323–339 |doi=10.25162/historia-2014-0018 |jstor=24432812 |s2cid=151289863 |issn=0018-2311 }}
{{Cite journal |last=Steel |first=Catherine |date=2014 |title=The Roman senate and the post-Sullan "res publica" |journal=Historia: Zeitschrift für Alte Geschichte |volume=63 |issue=3 |pages=323–339 |doi=10.25162/historia-2014-0018 |jstor=24432812 |s2cid=151289863 |issn=0018-2311 |url=https://www.jstor.org/stable/24432812 |access-date=26 May 2022 |archive-date=26 May 2022 |archive-url=https://web.archive.org/web/20220526152815/https://www.jstor.org/stable/24432812 |url-status=live }}
|url=
entirely duplicate existing links in the citation and regardless add nothing for the unprivileged reader while clogging up the markup and making it difficult to do the part of "editor". Even if I have Ivy League library access and am able to read all full texts through proxies (eg
Penn Libraries), that doesn't mean that linking the proxy page whole (like https://www-jstor-org.wikipedialibrary.idm.oclc.org/
) does any good for readers without Penn or Wikipedia library privileges.
Ifly6 (
talk)
23:37, 25 September 2023 (UTC)
|url=
https://www.jstor.org/stable/24432812
is for the "average" reader this link does nothing because they will not have a Jstor subscription. If adding it is to help the "average" university student, the link also does nothing because they will have to go through their university proxy. If it is to help the privileged editor with
WP:LIBRARY access, it also does nothing because we have to go through a proxy too. The only people it supports are those few who have direct access to Jstor (which ironically includes me via the Federal Reserve).
Ifly6 (
talk)
23:43, 25 September 2023 (UTC)
|jstor=
parameter lets them know that the article is on JSTOR and they will likely have access to it, and once they click on the link they can easily log in via whatever gives them access via whatever proxy, or if they're physically at their library just click the link and access it. The JSTOR link also provides metabibliographic information, a first page preview, and abstract. Plus JSTOR allows independent researchers 100 free articles each month, and if someone so chooses they have the option to buy it à la carte. Anything which helps a reader access a source is useful, and quite often JSTOR is the electronic place of record for a journal. [Edit: sorry I'm following more closely now, I still think it should be in |jstor=
-- that's why we have that parameter; it does not also belong in |url=
.]
Umimmak (
talk)
23:49, 25 September 2023 (UTC)
What do you mean "the link does nothing"? The link to the native Jstor website in
|url=
is not the proper one and will not yield the full text unless you have direct Jstor access. If you access it through a proxy, you would have to copy the Jstor ID and paste it in after ../static/
. Putting the direct URL in |url=
is not very useful and largely facilitates
WP:MEATBOTs crufting up articles with unnecessary mark up pointing to blank archive pages.
Ifly6 (
talk)
01:13, 26 September 2023 (UTC)
|jstor=
. Apparently there was an RFD, ban, or something of the sort which has led to the maintainer(s) of the bot not being willing to re-enable that previously-present functionality.
Ifly6 (
talk)
21:54, 27 September 2023 (UTC)
|access-date=
in {{
cite book}} and {{
cite journal}} (and maybe others) should be removed, we'll have solved almost the entire problem of these kinds of URLs without needing to determine whether or not readers / editors will understand the alternative stable identifiers.
Folly Mox (
talk)
04:44, 30 September 2023 (UTC)
Well that issue is why we're here at Citation bot. Do you think it's actually impossible to get a decision for Citation bot to remove those URLs? A bot to remove those archives would produce even more watchlist events, which people in the discussion below seem to be adamantly against, while also probably being impossible to implement per GreenC's comment above. Ifly6 ( talk) 19:36, 1 October 2023 (UTC)
permalive
for IABot is also a reasonable first step.
Folly Mox (
talk)
22:09, 1 October 2023 (UTC)
|url=
including a "jstor.org/stable/XXXXX" to turn it into a
JSTOR
10.5555/al.ap.person.bm000000658 this would not work; occasionally JSTOR the website gets cited instead of a book/article it is hosting so just bots should be aware of this.
Umimmak (
talk)
21:20, 5 October 2023 (UTC)
{{
wontfix}} because people are whiners.
AManWithNoPlan (
talk)
20:41, 24 October 2023 (UTC)
The bot never respected {{
in use}} because very often people who use {{
in use}} will also use the bot to expand citations.
Headbomb {
t ·
c ·
p ·
b}
22:42, 13 October 2023 (UTC)
From my discussion page: Hi, I see that you have used citation bot to add dates to references to numismatics.org.uk webpages here. I am not familiar with the bot, so could you explain what the dates mean? The pages seem to be updated regularly.
I think the bot is wrong. Grimes2 ( talk) 14:41, 13 October 2023 (UTC)
meta property="article:published_time"
instead of meta property="article: modified_time"
.
Folly Mox (
talk)
16:00, 13 October 2023 (UTC)
[10] 2001:14BA:9CE5:8400:20AB:2C62:7318:4F88 ( talk) 04:35, 22 October 2023 (UTC)
elonet.finna.fi
but also elonet.fi
it seems:
[13],
[14] (
probably some more). Please stop the bot from changing the citation templates of elonet.fi and finna.fi from "cite web" to "cite book", thank you.
2001:14BA:9CE5:8400:79D9:9129:F234:CDFA (
talk)
20:03, 24 October 2023 (UTC)
At the Kenny Clarke article in the oral history ref, the bot changes "Cite web" to "Cite journal" without changing any other parameters, causing this error message. While checking hidden categories on that page, I discovered that the bot did this in June 2022 and I reproduced the problem just now. Graham87 ( talk) 06:50, 23 October 2023 (UTC)
|work=
AManWithNoPlan (
talk)
20:56, 24 October 2023 (UTC)
Spinixster
(chat!)
12:53, 30 September 2023 (UTC)
}}
.
The first instance, starting from
my edit:
|access-date=February 13, 2011 |url-status=dead}}</ref> (by me) (and 3 more like it) (Not official, though I don't see why.) (Nannyware keeps me from viewing any "archive" websites and/or I didn't have time.)
|access-date=February 13, 2011 }}{{dead link|29 September 2023}}</ref> (by Citation bot, Misc citation tidying...) (Made it official, but wrong format) (only got 3 of 4 instances) (disoptimal - should be no space before the closing }}.)
|access-date=February 13, 2011 }}{{dead link|date=September 2023}}</ref> (by AnomieBOT, Dating maintenance tags...) (Corrected the 3 changed by Citation bot.)
|access-date=February 13, 2011 |archive-date=December 18, 2010 |archive-url=https://web.archive.org/web/20101218224918/http://makingitbigcareers.com/books/making-it-big-in-software/mark-russinovich/ |url-status=dead }} (by AManWithNoPlan), Rescuing 4 sources and tagging 0 as dead.) #IABot (v2.0.9.5)) (disoptimal - field order should be |access-date= |url-status= |archive-url= |archive-date=.)
Today I do not see "Expand citations" in my tools menu. I do not know what caused it to disappear.
Treat as if /pdf-vor or /pdf isn't there. Headbomb { t · c · p · b} 01:33, 2 November 2023 (UTC)
That is super odd.
AManWithNoPlan (
talk)
21:24, 2 November 2023 (UTC)
changing a date to today and breaking a ref in the process. I am sick and tired of Citation bot going around breaking sfn refs willy nilly. DuncanHill ( talk) 22:05, 2 November 2023 (UTC)
{{ fixed}} the bug that was causing extra book clean-ups. But, no idea where that date came from. AManWithNoPlan ( talk) 00:55, 5 November 2023 (UTC)
1. Bot seems to be replacing (only very occasionally) "year=[a year]" with "date=[today's date]" - there seems to be no particular pattern to which sources these are, and none of them are 2023 sources.
2. Where a journal is published in single-issue volumes (this particular one is Archaeological Reports), the bot has erroneously expanded "volume=14" to "volume=14|issue=14", though no issue 14 that year or ever existed.
(Edit:) 3. Replaced:
{{cite book| last=Gates| first=Charles| year=2004| chapter=The Adoption of Pictorial Imagery in Minoan Wall Painting: A Comparativist Perspective| series=Hesperia Supplements| title= ΧΑΡΙΣ: Essays in Honor of Sara A. Immerwahr| editor-last=Chapin| editor-first=Anne P.| publisher=American School of Classical Studies at Athens| place=Athens| isbn=978-0-87661-533-1| pages=27–46| jstor=1354061}}
with
{{cite book| last=Gates| first=Charles| title=The Adoption of Pictorial Imagery in Minoan Wall Painting: A Comparativist Perspective| series=Hesperia Supplements| date=4 November 2023| volume=33| editor-last=Chapin| editor-first=Anne P.| publisher=American School of Classical Studies at Athens| place=Athens| isbn=978-0-87661-533-1| pages=27–46| jstor=1354061}}
Overriding the (correct) description of this article as a chapter in a Festschrift and that Festschrift's title.
4. It added the page range 1-370 to a cite book template, where the whole book is being cited.
And similar wikilink parameters. Headbomb { t · c · p · b} 09:12, 17 October 2023 (UTC)
The following? (this would be compared after removing numbers and dashes):
authorlink chapterlink contributorlink editorlink episodelink interviewerlink inventorlink serieslink subjectlink titlelink translatorlink
AManWithNoPlan ( talk) 13:22, 25 October 2023 (UTC)
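The comparison "after removing numbers and dashes" could look roughly like this. The set is copied from the list above; the helper name `is_link_param` is an assumption for illustration.

```python
import re

# Names from the list above, already in their digit-free, dash-free form.
LINK_PARAMS = frozenset(
    "authorlink chapterlink contributorlink editorlink episodelink "
    "interviewerlink inventorlink serieslink subjectlink titlelink "
    "translatorlink".split()
)


def is_link_param(name: str) -> bool:
    """True if a parameter name is one of the wikilink parameters.

    Digits and dashes are stripped first, so author-link, author-link2,
    author2-link and authorlink3 all compare equal to "authorlink".
    """
    stripped = re.sub(r"[\d\-]", "", name.strip().lower())
    return stripped in LINK_PARAMS


if __name__ == "__main__":
    for sample in ("author-link1", "editor2-link", "title-link", "title"):
        print(sample, is_link_param(sample))
```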
|chapter-link=
.
That is the title CrossRef has
https://search.crossref.org/?from_ui=yes&q=10.1063%2Fpt.6.4.20200327a
AManWithNoPlan (
talk)
12:23, 5 November 2023 (UTC)
Below is a method to detect replacement characters. (Not pretty but works.) I added some inline comments because it's an obscure language
isbinary() in Nim

```nim
#
# Return true if string contains a 'replacement' or binary character (black diamond with ? in middle)
#  Based on: https://unix.stackexchange.com/questions/474709/how-to-grep-for-unicode-in-a-bash-script/474812#474812
#  Requires a secondary shell layer so UTF-8 works
#   tcsh -s 'grep -axv ".*" <filename>'
#
proc isbinary*(s: string): bool {.discardable.} =

  result = false  # default return value

  let tmpfile = mktempname(GX.ramdir & "isbinary.")  # Generate a temporary and unique filename "isbinary.xxx" to be located in a ramdisk directory for speed
  s >* tmpfile  # Write the string to the tempfile

  let command1 = "tcsh -c 'grep -axv \".*\" \"" & tmpfile & "\" | wc -l'"  # need to use tcsh -c for UTF-8 to work. Bash with similar -c might also work.
  let c1 = runshellBasic(command1)  # run the shell command and capture output to c1
  if strip(c1) !~ "^0$":  # If the output is not "0" (only) then it contains a replacement character.
    result = true

  removeFile(tmpfile)  # Delete the temp file and return 'result'
```
— Preceding unsigned comment added by GreenC ( talk • contribs) 16:32, 5 November 2023 (UTC)
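For comparison, a rough in-process equivalent in Python. Note that this is a different technique from the grep-based routine above: instead of shelling out, it checks for bytes that do not decode as UTF-8 and for a literal U+FFFD in already-decoded text. The function name is hypothetical.

```python
REPLACEMENT = "\ufffd"  # U+FFFD REPLACEMENT CHARACTER (black diamond with ?)


def has_replacement_char(data) -> bool:
    """Return True if data looks damaged.

    Accepts str or bytes. Bytes that are not valid UTF-8 count as damaged;
    text counts as damaged if it already contains U+FFFD.
    """
    if isinstance(data, bytes):
        try:
            data = data.decode("utf-8")
        except UnicodeDecodeError:
            return True
    return REPLACEMENT in data


if __name__ == "__main__":
    print(has_replacement_char("Wroc\u0142aw"))
    print(has_replacement_char(b"\xff\xfe broken"))
```

No temporary file or secondary shell is needed here, at the cost of not being a drop-in match for what `grep -axv` reports in every locale.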
As you know the "automatic" option in the Visual Editor cite button does not support any pdfs, perhaps because it would be too slow. As this bot is not constrained as much for time it would be great if the bot could expand more pdfs from major organisations. For example the second cite in Agriculture_in_Turkey#Subsidies namely https://www.g20.org/content/dam/gtwenty/gtwenty_new/document/G20_Report_on_Macroeconomic_impacts.pdf Chidgk1 ( talk) 09:08, 6 November 2023 (UTC)
{{ wontfix}} - PDF files, and the bot uses https://en.wikipedia.org/api/rest_v1/#/Citation/getCitation which is outside our control
{{citation}} is such a pain to |work=with
|chapter= to {{citation}} templates that have |work=. |work= (aliases: |journal=, |magazine=, |newspaper=, |periodical=, |website=) configures {{citation}} as a periodical template. |chapter= (aliases |contribution=, |entry=, |article=, |section=) is not supported in the cs1 periodical templates ({{cite journal}}, {{cite magazine}}, {{cite news}}, {{cite periodical}}, {{cite web}}) and not supported in the cs2 {{citation}} template when it is configured as a periodical template.
{{citation}} templates are best written as {{cite Australian Dictionary of Biography}} templates. In no case should the bot add a |chapter= alias to {{citation}} when there is a |work= alias with an assigned value present in {{citation}}.
Although this is a very useful bot I am struggling to understand what it can and cannot do and how it works.
I recently submitted a bug report, and a couple of enhancement requests asking if the bot could be run on pdf files and they were immediately closed because there is an api which is "outside our control".
Could the "function summary" be rewritten with a first paragraph to explain what the bot does in very simple terms and a second para to explain how it works in very simple terms and relegate the technical explanation to later paragraphs?
Also it would be useful if the "won't fix" could be left here for a couple of days for us to read rather than being immediately archived.
Chidgk1 ( talk) 12:21, 7 November 2023 (UTC)
And it should leave every other 'I' alone too. This is particularly annoying. The only 'I' that needs capitalization are those from Part I, Section I, etc... Headbomb { t · c · p · b} 22:41, 10 November 2023 (UTC)
|title=
parameter scraped from meta
tags in the HTML header, and absolutely nothing else.
If the goal is to wrap a URL in a citation template so Internet Archive picks it up, and there's no good translators available for the domain, just set the |title=(URL)
so it's obvious the citation is incomplete and needs work. This sort of lazy not-citation is essentially worthless, and encourages people to use scripts for tasks the scripts are not ready to handle, instead of putting in the one minute of work it takes to create a real citation by looking at the source.
If Citation bot can't figure out anything from the URL except the title, it should either leave the link alone, set the title to the URL, or tag its change with a template like {{ citation needs human review}} so this sort of garbage can be tracked.
Apologies for the strong language, but if we train a whole generation of editors to rely on pushbutton non-solutions, the maintenance burden of trash citations is going to outpace our capacity and never be fixed. Folly Mox ( talk) 12:06, 20 November 2023 (UTC)
Also Category:CS1 errors: dates Headbomb { t · c · p · b} 12:23, 26 November 2023 (UTC)
|last1=|
(the pipe character |
).
The garbage human-entered title prevented the full expansion. Wondering if we can't just yeet the title out when converting a cite web to a cite journal/book for ieeexplore links. It's a highly-reliable database. Either way, the website= parameter should be nuked. Headbomb { t · c · p · b} 00:02, 16 November 2023 (UTC)
and
[41] (here I manually switched |journal=
to |series=
"|work= ignored"
zero width space character in |title= at position 1
"
On Wikipedia, we follow
MOS:TITLECAPS. If journals want to style themselves differently, that's up to them, but we're not bound to follow.
Headbomb {
t ·
c ·
p ·
b}
20:54, 2 December 2023 (UTC)
Per the update description in the category. Headbomb { t · c · p · b} 23:04, 2 December 2023 (UTC)
Fixed
AManWithNoPlan (
talk)
02:13, 3 December 2023 (UTC)
|volume=
, |issue=
, and |journal=
are already present in the citation. No isbn is given in the metadata of the
target article. |journal=
is altered to |series=
, refreshingly avoiding the "periodical ignored" error, but this change doesn't appear to make any sense.
Not sure how to handle that, in general.
AManWithNoPlan (
talk)
01:53, 4 December 2023 (UTC)
{{citation}} template to make a 'book' reference. Bot changed |title= to |chapter=, added a new |title= that more-or-less duplicated existing |journal=, and |volume=.
{{citation}} uses the work parameters (|journal=, |magazine=, |newspaper=, |periodical=, |website=, |work=) to switch from its default 'book' format (|title= rendered in an italic font) to its 'work' format (|title= rendered in an upright font with quote marks). When making these types of edits to {{citation}} templates (it desires to make a 'book' reference), the bot should remove work parameters (in this case |journal=).
That URL was already broken before Citation bot got to it. Truncating it at the dash glyph might have fixed it.
Folly Mox (
talk)
01:19, 3 December 2023 (UTC)
{{
citation}}
: CS1 maint: location missing publisher (
link). The series is an essential part of the entry: it provides the necessary context for the volume number, and shows that this was the 150th volume published by the society, whereas its absence suggests that the history is itself a work of 150 or more volumes.
See also User:Citation_bot/use#..._the_bot_made_a_mistake?. Headbomb { t · c · p · b} 00:18, 15 November 2023 (UTC)
|series=Publications of the Surtees Society
rather than the shorter removed series name. No idea whether this would affect the bot's attempted removals. It also needs a publisher; following the same catalog entry, it looks like |publisher=Andrews & co. and B. Quaritch for the Surtees Society
would be accurate. —
David Eppstein (
talk)
01:29, 15 November 2023 (UTC)
Failing to understand "OUP Academic" as "Oxford University Press" (already present, correctly in |publisher=) is one thing; adding an unsupported |journal= parameter to {{cite book}} is something I thought Citation bot was better than.
Folly Mox ( talk) 20:28, 11 November 2023 (UTC)
doi:10.1186/s40645-015-0068-z and doi:10.1007/s00445-020-01384-6 are both of the new problematic "article number" type. |issue= is wrong, and |page= is less than ideal, but it is the best that CS1 and CS2 have for us at the moment. The journals clearly state to not use issue for these in the "how to cite" areas.
AManWithNoPlan ( talk) 20:54, 7 November 2023 (UTC)
{{cite journal}} supports |article-number=:
{{cite journal |last1=Geshi |first1=N. |last2=Yamada |first2=I. |last3=Matsumoto |first3=K. |last4=Nishihara |first4=A. |last5=Miyagi |first5=I. |title=Accumulation of rhyolite magma and triggers for a caldera-forming eruption of the Aira Caldera, Japan |journal=Bulletin of Volcanology |volume=82 |article-number=44 |year=2020 |doi=10.1007/s00445-020-01384-6 |url=https://link.springer.com/article/10.1007/s00445-020-01384-6}}
|id={{Project MUSE|59700|type=book}}
instead of URL
https://muse.jhu.edu/book/59700
Maybe 5 years ago, IABot had a bug that added a "#" to the end of every archive URL, and sometimes the source URL. The bug is long fixed, and WaybackMedic has been removing the errant #'s, but it's a cosmetic edit that can only be done when making another edit to the page, so it's been a long process. There are a lot of them. An example: Special:Diff/1183493290/1185983128 (second change). My code below if interested, no edge cases, simply removing any trailing # from the URLs. It won't break the archive URL.
Extended content

```nim
# Fix trailing # in |url and |archive-url added by IABot 2.0 beta10
psplit(GX.articlework, GX.cite2, p):
  if isarg("archive-url", "value", p.field[i]) and isarg("url", "value", p.field[i]):
    archiveurl = getarg("archive-url", "clean", p.field[i])
    sourceurl = getarg("url", "clean", p.field[i])
    j = 0
    if archiveurl ~ "[#]$":
      inc(j)
      sub("[#]$", "", archiveurl)
      p.field[i] = replacearg(p.field[i], "archive-url", archiveurl, "cosmetic1.1")
    if sourceurl ~ "[#]$":
      inc(j)
      sub("[#]$", "", sourceurl)
      p.field[i] = replacearg(p.field[i], "url", sourceurl, "cosmetic1.2")
    if j > 0:
      p.ok += inclog("cosmetic1.1", GX.esformat, Project.logiats, &"{archiveurl} ---- remove trailing #")
```

psplit() iterates over every cite template which are held in p.field[i]
Green C 17:35, 20 November 2023 (UTC)
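The same idea, sketched in Python for readers who do not use Nim. It assumes a citation already parsed into a dict of parameters; `strip_trailing_hash` is a hypothetical name and the change counter simply mirrors the `j` variable in the code above.

```python
def strip_trailing_hash(params: dict) -> tuple:
    """Remove one trailing '#' from |url= and |archive-url=.

    Mirrors the routine above: it only acts when both parameters are
    present, and returns (new_params, number_of_values_changed).
    """
    out = dict(params)
    changed = 0
    if "url" in out and "archive-url" in out:
        for name in ("archive-url", "url"):
            if out[name].endswith("#"):
                out[name] = out[name][:-1]
                changed += 1
    return out, changed


if __name__ == "__main__":
    cite = {
        "url": "http://example.com/page#",
        "archive-url": "https://web.archive.org/web/2018/http://example.com/page#",
    }
    print(strip_trailing_hash(cite))
```

Since the edit is cosmetic, the returned count lets a caller decide whether to piggyback it on another change rather than save it alone.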
Still needs fixed. Fix did not work. AManWithNoPlan ( talk) 00:41, 5 December 2023 (UTC)
Covers both Proceedings of Science and Journal of Science Communication, from SISSA. The other SISSA journals have different prefixes. Headbomb { t · c · p · b} 02:25, 5 December 2023 (UTC)
|title=
→ |chapter=
and |journal=
→ |title=
, hooray!). These two misidentifications don't – to my knowledge – even change the display of the article, but they are still incorrect. The first one I changed back to {{
cite web}}. The second I did the same: although properly it should be {{
cite dictionary}}, it rather obviously failed verification so I didn't bother.
WikiJournals Headbomb { t · c · p · b} 15:36, 10 December 2023 (UTC)
Unfortunately this journal is
not preserved so there are no archived copies either.
Nemo
11:19, 3 December 2023 (UTC)
|doi-broken-date=
.
Headbomb {
t ·
c ·
p ·
b}
15:01, 8 December 2023 (UTC)
|citeseerx=
and |s2cid=
.
This was my fix: changing back to {{
cite web}}, adding the url of the source, and an unrelated fix to |publisher=
. I'm not sure this is really Citation bot's fault, or if maybe the parameter set supported by {{
cite document}} ought be expanded to allow for more stable identifiers. Pinging
Trappist the monk as the template maintainer, to see if they have input.
Folly Mox (
talk)
19:11, 9 December 2023 (UTC)
{{
cite web}}
to {{
cite document}}
is not going to work when |url=
, |citeseerx=
, and |s2cid=
have assigned values. |s2cid=
is excluded from {{cite document}}
because links to readable copies of the source from that identifier are hit-or-miss at best (recall the plethora of complaints about the bot adding |s2cid=
that have been voiced on this talk page). |citeseerx=
is excluded because we have {{
cite citeseerx}}
.|citeseerx=10.1.1.42.3374
, an alternate fix might be:
{{cite citeseerx |last=Wirz |first=Marc |title=Characterizing the Grzegorczyk hierarchy by safe recursion |date=November 1999 |citeseerx=10.1.1.42.3374}}
{{
cite document}}
is a 'last resort' sort of template when absolutely none of the other cs1|2 templates apply. The bot should avoid using {{cite document}}
because, almost always, there is a better choice.
Headbomb { t · c · p · b} 01:20, 11 December 2023 (UTC)
Fixed
AManWithNoPlan (
talk)
13:27, 11 December 2023 (UTC)
Note to me for when I have time. AManWithNoPlan ( talk) 16:26, 10 December 2023 (UTC)
|title=
parameter with incorrect characters, adding the article to
Category:CS1 errors: invisible characters (e.g.
this edit in Malayalam).|title=
(and maybe even |website=
) (e.g.
this edit).
Here's
another edit in Gujarati. — Preceding
unsigned comment added by
GoingBatty (
talk •
contribs)
20:11, 12 December 2023 (UTC)
The character is ​ (zero width space). Headbomb { t · c · p · b} 00:07, 13 December 2023 (UTC)
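A small sketch of how a scraped title could be checked and cleaned before it is written into |title=. The helper names are hypothetical. The sketch deliberately touches only U+200B, since other invisible characters (joiners, for example) can be meaningful in some scripts.

```python
ZWSP = "\u200b"  # U+200B ZERO WIDTH SPACE


def zwsp_positions(value: str) -> list:
    """Return 1-based positions of zero width spaces in value."""
    return [i + 1 for i, ch in enumerate(value) if ch == ZWSP]


def strip_zwsp(value: str) -> str:
    """Remove every zero width space, leaving all other characters alone."""
    return value.replace(ZWSP, "")


if __name__ == "__main__":
    title = "\u200bExample title"
    print(zwsp_positions(title))
    print(repr(strip_zwsp(title)))
```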
Headbomb { t · c · p · b} 01:22, 15 December 2023 (UTC)
There is a pull request that allows specifying name list style for newly-added name entries: https://github.com/ms609/citation-bot/pull/4236
It adds an option alongside the already existing style of first1/last1, first2/last2, etc.
This pull request introduces the following functionality. If a page contains {{Use vanc name-list-style}} template, then the bot will use |vauthors= and |veditors= attributes rather than firstN/lastN and editor-firstN/editor-lastN when adding name entries for a citation template if the names were not specified in this template. This is similar to {{Use dmy dates}} template when the bot uses date format as specified on the page. To reproduce this behaviour, edit a page on Wikipedia, add {{Use vanc name-list-style}} template (or {{Use vanc name-list-style|date=December 2023}}), delete author names (firstN/lastN) and run the bot. It will fill the names as vauthors. Maxim Masiutin ( talk) 16:48, 7 December 2023 (UTC)
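The behaviour described in the pull request might be sketched like this. It is an illustration only, not the code in the pull request: the function names are made up, and real Vancouver rules (suffixes, limits on initials, non-Latin names) are not handled.

```python
import re

# Page-level flag template described in the pull request.
_USE_VANC = re.compile(
    r"\{\{\s*Use vanc name-list-style\s*(?:\|[^{}]*)?\}\}", re.IGNORECASE
)


def page_wants_vanc(page_text: str) -> bool:
    """True if the page carries {{Use vanc name-list-style}}."""
    return bool(_USE_VANC.search(page_text))


def initials(first: str) -> str:
    """'Jean-Baptiste' -> 'JB', 'N.' -> 'N' (simplified)."""
    parts = re.split(r"[\s.\-]+", first.strip())
    return "".join(p[0].upper() for p in parts if p)


def to_vauthors(names: list) -> str:
    """Build a |vauthors= value from (last, first) pairs."""
    return ", ".join(f"{last} {initials(first)}".strip() for last, first in names)


if __name__ == "__main__":
    page = "{{Use vanc name-list-style|date=December 2023}}\nArticle text"
    if page_wants_vanc(page):
        print("|vauthors=" + to_vauthors([("Geshi", "N."), ("Yamada", "I.")]))
```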
{{
Use vanc name-list-style}}
exist? Was there any discussion that brought it into existence? cs1|2 doesn't know anything about that template but will understand {{
CS1 config|name-list-style=vanc}}
. Why create a new otherwise non-functional template?{{
use dmy dates}}
and {{
use mdy dates}}
to control date formatting when cs1|2 templates are rendered. See
Template:Use dmy dates § Auto-formatting citation template dates for example. I see no reason to keep {{
Use vanc name-list-style}}
.{{use xxx dates}}
templates came first (January 2009). Development of
Module:Citation (the predecessor to
Module:Citation/CS1) began August 2012. Auto date formatting was added to Module:Citation/CS1 April 2019. Support for {{
CS1 config}}
was added August 2023. {{CS1 config}}
applies only to cs1|2 templates but the {{use xxx dates}}
templates apply to both the article body and to article referencing (regardless of how referencing is implemented).{{
CS1 config}}
because that template has functionality beyond being a simple flag template.
Would it be possible to expand from non-templated reference <ref>[https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5553785/ Bar]</ref>, as long as |title= would be exactly the same (Bar) which already exists for the URL specified as if the bot would try to expand the bare URL (as long as there is no other content in the ref)?
Jonatan Svensson Glad (
talk)
17:16, 24 July 2023 (UTC)
{{
cite journal}}
template. In that template: |journal=arXiv
, |doi=10.48550/arXiv.2206.12231
, and |doi-access=free
. The only action that the bot took was to convert |doi=10.48550/arXiv.2206.12231
to |arxiv=2206.12231
.arXiv
is not a journal so {{cite journal}}
is the wrong template; should be changed to {{
cite arxiv}}
. When removing |doi=
, the bot should always remove |doi-access=
. Remember that {{cite arxiv}}
supports a limited subset of the whole cs1|2 parameter set so other parameters in a {{cite journal}}
→ {{cite arxiv}}
conversion may need to be removed. The limited parameter set is defined in
Module:Citation/CS1/Whitelist lines
340–346.
If you run the bot again, then it does clean up. I will look at having it not take two times.
AManWithNoPlan (
talk)
15:10, 9 December 2023 (UTC)
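A hedged sketch of doing the identifier clean-up in a single pass, assuming parameters in a dict. `doi_to_arxiv` is a hypothetical name; the switch from {{cite journal}} to {{cite arxiv}} and the pruning of parameters outside that template's whitelist are not attempted here.

```python
import re

# arXiv-issued DOIs have the form 10.48550/arXiv.<identifier>
_ARXIV_DOI = re.compile(r"^10\.48550/arXiv\.(.+)$", re.IGNORECASE)


def doi_to_arxiv(params: dict) -> dict:
    """Replace an arXiv DOI with |arxiv= and drop |doi-access= with it."""
    out = dict(params)
    m = _ARXIV_DOI.match(out.get("doi", ""))
    if not m:
        return out
    out.setdefault("arxiv", m.group(1))  # keep an existing |arxiv= value
    del out["doi"]
    out.pop("doi-access", None)          # meaningless once |doi= is gone
    return out


if __name__ == "__main__":
    cite = {"journal": "arXiv", "doi": "10.48550/arXiv.2206.12231", "doi-access": "free"}
    print(doi_to_arxiv(cite))
```

Removing |doi-access= in the same step as |doi= is what avoids the need for a second bot run in this sketch.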
According to
Help:Citation Style, An author may be cited using separate parameters for the author's surname and given name by using
However, the bot changes |last= and |first= to |last1= and |first1= even when there is just one author, which is contrary to the description of the CS1 Citation Style.
The bot probably should not change existing occurrences back, but it should definitely avoid making this change in the future. Also, when no authors were specified and there is a single author, the bot should use |last= and |first=.
If you agree with that, I can try to submit a pull request. Maxim Masiutin ( talk) 15:38, 9 December 2023 (UTC)
Citation bot changed one of the source descriptions in the article James Hamilton (English Army officer) from:
{{Cite web|last=Smith |first=Geoffrey |date=May 2006 |title=Armorer, Sir Nicholas (c.1620–1686) |website=[[Oxford Dictionary of National Biography]] |doi=10.1093/ref:odnb/94686 |url=http://www.oxforddnb.com/index/94686/ |access-date=13 May 2023 |url-access=subscription}}
to:
{{Cite ODNB|last=Smith |first=Geoffrey |date=May 2006 |title=Armorer, Sir Nicholas (c.1620–1686) |doi=10.1093/ref:odnb/94686 |url=http://www.oxforddnb.com/index/94686/ |access-date=13 May 2023 |url-access=subscription}}
I wondered why. I read up on Template:ODNB. It says it is a wrapper around Template:Cite encyclopedia. Well, perhaps I should not have used "Cite web" but "Cite encyclopedia" and Citation bot should probably have corrected me to:
{{Cite encyclopedia|last=Smith |first=Geoffrey |date=May 2006 |title=Armorer, Sir Nicholas (c.1620–1686) |encyclopedia=[[Oxford Dictionary of National Biography]] |edition=online |publisher=[[Oxford University Press]] |doi=10.1093/ref:odnb/94686 |url=http://www.oxforddnb.com/index/94686/ |access-date=13 May 2023 |url-access=subscription}}
However, I do not understand why we should be forced to use a wrapper around Cite encyclopedia rather than the original. I thought the use of the ODNB template was voluntary and not obligatory. With thanks and best regards Johannes Schade ( talk) 13:10, 19 November 2023 (UTC)
|pages totales=
to |pages=
|pages totales=
, as the total number of pages is not supported in the English Wikipedia citation templates.
Note, it shouldn't remove the extra pipe.
Headbomb {
t ·
c ·
p ·
b}
18:54, 17 December 2023 (UTC)
This edit changed a proceedings title from the version given by DBLP ("Proceedings of the 22nd Annual European Symposium on Algorithms (ESA 2014), Wroclaw, Poland, September 8–10, 2014") to a much more concise version from another source ("Algorithms - ESA 2014"), maybe the publisher or maybe MathSciNet (both list it that way). Note that the actual publisher page for the full proceedings lists it as having the more detailed title "Algorithms - ESA 2014: 22th Annual European Symposium, Wrocław, Poland, September 8-10, 2014. Proceedings". The DBLP title is more or less what you get if you put that into a more intelligible order. Curiously, the bot left the DBLP title in place for the other citation it touched, from WG '92. I think that the DBLP version is better and that this level of change (not the correction of any actual error in a citation) constitutes WP:CITEVAR. Please stop. — David Eppstein ( talk) 06:46, 30 October 2023 (UTC)
with the MathML text
This conflicts with the maths extension and in turn causes a maths syntax error.
The title in CrossRef is "Measurement of lepton universality parameters in \n<mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\" display=\"inline\"><mml:msup><mml:mi>B<\/mml:mi><mml:mo>+<\/mml:mo><\/mml:msup><mml:mo stretchy=\"false\">\u2192<\/mml:mo><mml:msup><mml:mi>K<\/mml:mi><mml:mo>+<\/mml:mo><\/mml:msup><mml:msup><mml:mo>\u2113<\/mml:mo><mml:mo>+<\/mml:mo><\/mml:msup><mml:msup><mml:mo>\u2113<\/mml:mo><mml:mo>\u2212<\/mml:mo><\/mml:msup><\/mml:math>\n and \n<mml:math xmlns:mml=\"http:\/\/www.w3.org\/1998\/Math\/MathML\" display=\"inline\"><mml:msup><mml:mi>B<\/mml:mi><mml:mn>0<\/mml:mn><\/mml:msup><mml:mo stretchy=\"false\">\u2192<\/mml:mo><mml:msup><mml:mi>K<\/mml:mi><mml:mrow><mml:mo>*<\/mml:mo><mml:mn>0<\/mml:mn><\/mml:mrow><\/mml:msup><mml:msup><mml:mo>\u2113<\/mml:mo><mml:mo>+<\/mml:mo><\/mml:msup><mml:msup><mml:mo>\u2113<\/mml:mo><mml:mo>\u2212<\/mml:mo><\/mml:msup><\/mml:math>\n decays" which makes it difficult to clean up. I tags that are not used are the annoying ones.
https://github.com/ms609/citation-bot/commit/36648e552b4bf9b4f1e7ff1c88383701e79c95e0
AManWithNoPlan (
talk)
21:02, 10 December 2023 (UTC)
<math>
tags inside a <nowiki>
. There is another citation in the same revision (actually the same paper) where this has been done. Probably anything with mathematics in it is going to need human attention. Some sort of tracking for these occurrences could be useful. --
Salix alba (
talk):
12:15, 12 December 2023 (UTC)
curl -LH "Accept: application/x-bibtex"
https://doi.org/10.48550/arXiv.2212.09152
title = {Test of lepton universality in $b \rightarrow s \ell^+ \ell^-$ decays}
Bot rebooted to make sure no running jobs continue to use old code.
AManWithNoPlan (
talk)
15:31, 21 December 2023 (UTC)
Date/year/access-date/archive-date/etc.
X. 8 December 2022. {{
cite book}}
: Check date values in: |date=
(
help)
X. Monday, November 2, 1981. {{
cite book}}
: Check date values in: |date=
(
help)
X. 08 Dec 2023. {{
cite book}}
: Check date values in: |date=
(
help)
X. 08 December 2023. {{
cite book}}
: Check date values in: |date=
(
help)
AManWithNoPlan ( talk) 00:52, 22 December 2023 (UTC)
This edit changed links that consistently lead to the new Google books web pages to ones that do not. 50.47.144.129 ( talk) 19:49, 30 October 2023 (UTC)
The culprit is U+2010 : HYPHEN, which should be replaced with the standard U+002D : HYPHEN-MINUS.
Headbomb {
t ·
c ·
p ·
b}
03:56, 21 December 2023 (UTC)
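The substitution described above is a one-character replacement. A minimal sketch follows, assuming the affected parameter value is available as a Python string; `normalize_hyphen` is a hypothetical name used only for illustration.

```python
def normalize_hyphen(value: str) -> str:
    """Replace U+2010 HYPHEN with the standard U+002D HYPHEN-MINUS.

    Hypothetical helper: only U+2010 is touched, so en dashes (U+2013)
    in page ranges and the minus sign (U+2212) are left as they are.
    """
    return value.replace("\u2010", "\u002d")


# The escape is spelled out because the two characters look identical.
print(normalize_hyphen("10.1000/abc\u2010123"))
```

Other look-alike dashes are intentionally out of scope here, because an en dash is the correct character in page and date ranges.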
i.e. change url=www.
to
url= https://www.
(same for chapter-url, archive-url, etc...)
Headbomb { t · c · p · b} 15:57, 16 December 2023 (UTC)
|url=
in infoboxes (which should probably use {{
URL}}) and URLs that contain "url=" in the middle.
This modified search finds no articles to be fixed. However, if you find patterns in the category that bots can fix, please let me know.
GoingBatty (
talk)
23:11, 22 December 2023 (UTC)
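A rough sketch of the requested fix, assuming it runs over raw wikitext with a regular expression. The pattern and the name `add_scheme` are hypothetical. Requiring a leading pipe is one way to avoid matching "url=" in the middle of another URL, but it cannot tell an infobox |url= from a citation |url=, so that caveat still needs separate handling.

```python
import re

# Match |url=, |chapter-url= or |archive-url= whose value starts with "www."
# The leading pipe keeps "?url=www..." inside another URL from matching.
URL_PARAM = re.compile(
    r"(\|\s*(?:url|chapter-url|archive-url)\s*=\s*)(www\.)",
    re.IGNORECASE,
)


def add_scheme(wikitext: str) -> str:
    """Prefix https:// to scheme-less www. values of URL-holding parameters."""
    return URL_PARAM.sub(r"\1https://\2", wikitext)


print(add_scheme("{{cite web |url=www.example.org |title=X}}"))
```

Values that already carry a scheme are not matched, because the pattern requires "www." immediately after the equals sign.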
There the bot is right though. The title is "Graph Drawing". As Springer themselves say, the suggested way to cite this is "Eppstein, D. (2009). Isometric Diamond Subgraphs. In: Tollis, I.G., Patrignani, M. (eds) Graph Drawing. GD 2008. Lecture Notes in Computer Science, vol 5417. Springer, Berlin, Heidelberg.
https://doi.org/10.1007/978-3-642-00219-9_37"
"16th International Symposium...." is the expanded subtitle of GD 2008. One could replace it with "Graph Drawing: 16th International Symposium..." instead of "Graph Drawing. GD 2008."
But the word "Proceedings" is nowhere in there, and shouldn't be. Headbomb { t · c · p · b} 00:42, 24 November 2023 (UTC)
If the CrossRef database contradicts the information in the article, the bot will stick with the data already in Wikipedia, and assume the error to be with CrossRef. This seems wise, and I'm wondering when the behaviour was changed, and where the consensus for the change arose. Folly Mox ( talk) 23:26, 24 November 2023 (UTC)
title-link
parameter or a wikilink in the title
itself, that seems like a pretty good sign that a human took the trouble to get the information right. A bot shouldn't override that.
XOR'easter (
talk)
18:57, 25 November 2023 (UTC)
|title-link=
is present, |title=
should not be altered outside of punctuation changes. If |periodical=
(or one of its aliases) is present, the wrapper template should not be changed to {{
cite book}}. If adding |chapter=
, and |journal=
or |issue=
is present, the wrapper template should be changed to {{
cite conference}} rather than {{
cite book}}. If none of |title=
and |chapter=
match the existing |title=
(delta punctuation), there's a mismatch between the database record and the work intended to be cited.
Folly Mox (
talk)
20:54, 25 November 2023 (UTC)
Ok, after seeing this keep going and going with no effort to fix or address the problem, I am going to start adding {{ bots|deny=Citation bot}} to all new articles I create, instead of merely the ones where I see this happening. — David Eppstein ( talk) 07:55, 3 December 2023 (UTC)
April-May 1995 to April–May 1995
December 7 2023 to December 7, 2023
AManWithNoPlan ( talk) 20:57, 23 December 2023 (UTC)
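The two date fixes listed above could be sketched as follows, assuming each date value is examined on its own as a string. The name `fix_date` and both patterns are hypothetical and cover only the two cases shown: a hyphenated month range and a month-day-year date with no comma.

```python
import re

MONTHS = (
    "January|February|March|April|May|June|July|"
    "August|September|October|November|December"
)

# "April-May 1995": month range joined with a hyphen instead of an en dash.
MONTH_RANGE = re.compile(rf"^({MONTHS})-({MONTHS}) (\d{{4}})$")
# "December 7 2023": month-day-year date missing the comma.
MDY_NO_COMMA = re.compile(rf"^({MONTHS}) (\d{{1,2}}) (\d{{4}})$")


def fix_date(value: str) -> str:
    """Apply the two date repairs; any other value is returned unchanged."""
    value = MONTH_RANGE.sub("\\1\u2013\\2 \\3", value)  # U+2013 EN DASH
    value = MDY_NO_COMMA.sub(r"\1 \2, \3", value)
    return value


for raw in ("April-May 1995", "December 7 2023", "8 December 2022"):
    print(fix_date(raw))
```

Anchoring both patterns to the whole value keeps the sketch conservative: anything that does not look exactly like one of the two cases is left for a human to check.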
Unicode is one of the most useful tools ever invented that is also pure evil.
AManWithNoPlan (
talk)
23:36, 22 December 2023 (UTC)
It is just really dandy that the publisher reports that information as the editors.
https://api.crossref.org/works/10.1093/gmo/9781561592630.article.48611
AManWithNoPlan (
talk)
21:31, 25 December 2023 (UTC)
Isn't {{
cite news}}
and probably |work=The Christian Science Monitor
a better choice? At
The Christian Science Monitor we describe the organization as a 'nonprofit news organization that publishes daily articles both in electronic format and a weekly print edition' originally established 'as a daily newspaper'.
— Trappist the monk ( talk) 16:14, 26 December 2023 (UTC)
Seems to not be working ATM. Headbomb { t · c · p · b} 20:14, 5 January 2024 (UTC)
Would be useful to clear most of that category. Headbomb { t · c · p · b} 06:30, 5 January 2024 (UTC)
Same for
Headbomb { t · c · p · b} 07:09, 5 January 2024 (UTC)