This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page.
If expanding from raw JSTOR URL ( http://www.jstor.org/stable/3363372), the bot does not find doi's, while citoid does. Can we somehow call the same resources as Citoid?
"New Lights upon Old Tunes. "The Arethusa"". The Musical Times and Singing Class Circular. 35 (620): 666–668. 1894. JSTOR 3363372.
"New Lights upon Old Tunes. "The Arethusa"". The Musical Times and Singing Class Circular. 35 (620): 666–668. 1894. doi: 10.2307/3363372. JSTOR 3363372. ( t) Josve05a ( c) 21:18, 30 August 2018 (UTC)
{{ notabug}}
Looks like a false positive, but I can't reproduce from the citation alone. Did you get any clue from the bot's output as to what was happening here? Can you reproduce from the page?
Martin (
Smith609 –
Talk) 18:25, 24 August 2018 (UTC)
API gives...
Checking AdsAbs database > AdsAbs search 3476/50000: title:"Music and Connectionism" + Adding bibcode: 1994ASAJ...96.1218T + Adding journal: Acoustical Society of America Journal - Dropping parameter "publisher" + Adding volume: 96 + Adding issue: 2 + Adding pages: 1218 + Adding doi: 10.1121/1.410341
Headbomb { t · c · p · b} 18:30, 24 August 2018 (UTC)
|title=
)
This should apply to all such abbreviations (unspaced like S.H.I.E.L.D. or spaced like R. G.), plus a small list of words like "Inc., Ltd."
Headbomb {
t ·
c ·
p ·
b} 14:14, 27 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/703/files AManWithNoPlan ( talk) 00:27, 1 September 2018 (UTC)
That will take a special case for the journal name.
AManWithNoPlan (
talk) 13:28, 30 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/710
AManWithNoPlan (
talk) 02:45, 2 September 2018 (UTC)
(\s*)\|(\s*)volume(\s*)=(\s*)(\d+)\s*\((\d+(-|–|\–|\{\{ndash\}\})?\d*)\)
→ $1|$2volume$3=$4$5$1|$2issue$3=$4$6
However, if |issue=
is already set and ≠ $6, skip
See
[5] for a small sample of what is screwed up. The regex would catch more cases though.
Headbomb {
t ·
c ·
p ·
b} 23:53, 22 August 2018 (UTC)
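The replacement rule quoted above can be sketched roughly as follows. This is a minimal Python illustration, not the bot's actual PHP code; the function name `split_volume_issue` is hypothetical, the pattern is adapted from the regex in the request, and the sketch conservatively skips any citation whose existing |issue= differs from the parenthesised value (including an empty one, to avoid creating a duplicate parameter).

```python
import re

# Adapted from the regex quoted above: matches values such as "volume=35 (620)".
VOLUME_ISSUE = re.compile(
    r"(\s*)\|(\s*)volume(\s*)=(\s*)(\d+)\s*"
    r"\((\d+(?:-|–|\{\{ndash\}\})?\d*)\)"
)
# Finds an existing |issue= parameter and captures its value.
ISSUE = re.compile(r"\|\s*issue\s*=\s*([^|}]*)")


def split_volume_issue(template: str) -> str:
    """Move an issue written as 'volume=V (I)' into its own |issue= parameter."""
    match = VOLUME_ISSUE.search(template)
    if match is None:
        return template
    issue = match.group(6)
    existing = ISSUE.search(template)
    if existing is not None:
        if existing.group(1).strip() != issue:
            # A different (or empty) |issue= is present: skip the citation.
            return template
        # Same issue already present: only strip "(I)" from |volume=.
        return VOLUME_ISSUE.sub(r"\1|\2volume\3=\4\5", template, count=1)
    # No |issue= yet: reuse the whitespace style of |volume= for the new one.
    return VOLUME_ISSUE.sub(
        r"\1|\2volume\3=\4\5\1|\2issue\3=\4\6", template, count=1
    )
```

The conflicting-issue branch follows the "if |issue= is already set and ≠ $6, skip" condition from the request.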
If not set then look for ^([A-Z0-9]+)(\([0-9].\))$
Thus volumes and numbers and capitals. Issues start with numbers AManWithNoPlan ( talk) 03:45, 2 September 2018 (UTC)
|issue=
is already set and = $6, because I'm skipping on "if issue is set", which you presumably could do with Citation bot (if ≠ $6, it should be skipped, per above).
Headbomb {
t ·
c ·
p ·
b} 11:19, 2 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/713 AManWithNoPlan ( talk) 15:16, 2 September 2018 (UTC)
I had to do this to get the bot to do this. ( t) Josve05a ( c) 12:57, 23 August 2018 (UTC)
{{ fixed}}
|journal=eFoobar
→ |journal=EFoobar
|journal=eFoobar
as a pattern, keep it |journal=eFoobar
Well, eLife has about 375 uses on Wikipedia WP:JCW/E12, and eJournal / e-Journal appear a crap ton. (Note that they will often display as ELife / EJournal / E-Journal due to how JL-Bot presents that information.) So most could probably be handled with an exception for eLife / eJournal / e-Journal. Headbomb { t · c · p · b} 18:36, 24 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/699 AManWithNoPlan ( talk) 18:27, 1 September 2018 (UTC)
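A minimal sketch of the exception discussed above, in Python rather than the bot's PHP. The function name `capitalize_journal` and the exact pattern are assumptions for illustration: a leading lowercase "e" is kept when it is followed by an uppercase letter, optionally after a hyphen, which covers eLife, eJournal and e-Journal.

```python
import re

# Assumed pattern: lowercase "e", optional hyphen, then an uppercase letter.
E_PREFIX = re.compile(r"^e-?[A-Z]")


def capitalize_journal(name: str) -> str:
    """Capitalize the first letter unless the name follows the eFoobar pattern."""
    if E_PREFIX.match(name):
        return name  # e.g. "eLife" stays "eLife"
    return name[:1].upper() + name[1:]
```

A fixed allow-list (eLife, eJournal, e-Journal) would be the narrower alternative; the pattern trades some precision for coverage.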
|title=The Serving Soldier » Home
with |title=The Serving Soldier " Home
Unless there is a pair of « »
, we should not assume these are quotation marks, they may in fact be arrows, as here. (
t)
Josve05a (
c) 13:52, 27 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/720
AManWithNoPlan (
talk) 21:58, 3 September 2018 (UTC)
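The pairing check requested above could look roughly like this. It is an illustrative Python sketch, not the code from the linked pull request; `normalize_guillemets` is a hypothetical name, and "a pair" is interpreted here as exactly one « followed later by exactly one ».

```python
def normalize_guillemets(title: str) -> str:
    """Convert « » to straight quotes only when they form a single pair."""
    opening = title.find("«")
    closing = title.rfind("»")
    is_pair = (
        title.count("«") == 1
        and title.count("»") == 1
        and opening < closing
    )
    if is_pair:
        return title.replace("«", '"').replace("»", '"')
    # A lone « or » is more likely an arrow or separator: leave it alone.
    return title
```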
Not sure if this is a bug or not, but it feels odd to see an error message, so I wanted to confirm what it meant....
* Expand citation: ''Solanum perlongistylum'' and ''S. catilliflorum'', New Endemic Peruvian Species of Solanum, Section Basarthrum, Are Close Relatives of the Domesticated Pepino, ''S. muricatum'' > Extracting information from SICI > Found and used SICI [..> rifydoi] > Checking that DOI 10.3417/1055-3177(2006)16[161:SPASCN]2.0.CO;2 is operational... DOI ok. . Initial authors exist, skipping authorlink in tidy . Initial authors exist, skipping authorlink in tidy . Initial authors exist, skipping authorlink in tidy . Initial authors exist, skipping authorlink in tidy > Checking AdsAbs database > AdsAbs search 4720/50000: doi:"10.3417/1055-3177(2006)16[161:SPASCN]2.0.CO;2" > AdsAbs search 4721/50000: pub:"Novon: A Journal for Botanical Nomenclature" year:2006 issn:1055-3177 volume:"16" page:"161–167" [..> indpmid] > Searching PubMed... - Errors detected in PMID search (SimpleXMLElement Object ( [FieldNotFound] => 161:SPASCN ) ); abandoned. nothing found.
( t) Josve05a ( c) 14:33, 22 August 2018 (UTC)
What is preferable? Filing bug reports and feature requests here, or on GitHub (as issues)? ( t) Josve05a ( c) 14:13, 26 August 2018 (UTC)
I think that it is too big.
AManWithNoPlan (
talk) 20:18, 24 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/664
AManWithNoPlan (
talk) 16:35, 28 August 2018 (UTC)
Maybe live, maybe not: [8] Headbomb { t · c · p · b} 00:45, 2 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/716
https://github.com/ms609/citation-bot/pull/715
AManWithNoPlan (
talk) 00:22, 3 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/717
AManWithNoPlan (
talk) 01:27, 3 September 2018 (UTC)
|journal=
is added, any of its possible duplicates (like |work=
and |website=
) should be removed. That's a CS1 error, and the bot should not produce CS1 errors.
The decision to include the stupid generic |work=
in the citation templates is a bane to bots everywhere.
AManWithNoPlan (
talk) 14:28, 28 August 2018 (UTC)
|work=
for all citation templates? It is a global/generic parameter which works for all citation templates, and |journal=
, |website=
etc. are just synonyms. (
t)
Josve05a (
c) 16:27, 28 August 2018 (UTC)
|work=
is vague and unclear to most people. What's a work? It can be a book title, a conference proceeding titles, a journal title, a website, ... |journal=
or |website=
or whatever is clear and cannot be confused.
Headbomb {
t ·
c ·
p ·
b} 16:44, 28 August 2018 (UTC)
{{
cite journal}}
or {{
cite book}}
. It's much easier to use the parameter |work=
. It looks terrible when the bot changes "cite news [...] |work=BBC News
" to |website=BBC News
instead of changing it to {{
cite news}}
and keep work. Now it creates more work for editors, both to change form |website=
to |work=
or |newpaper=
, and to {{
cite news}}
, instead of just to {{
cite news}}
which still results in the same output. (
t)
Josve05a (
c) 17:08, 28 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/719 AManWithNoPlan ( talk) 03:30, 3 September 2018 (UTC)
In
this edit, all the bot does is replace the {{
cite web}} parameter work=
with website=
. The template's documentation says they are aliases. Even if the bot was also doing something useful, these changes clutter up the diff screen for a style that is not in any way preferable. Why is the bot changing these? —
Bilorv
(c)
(talk) 16:51, 5 September 2018 (UTC)
Is it possible to use this on MediaWiki installations that are not part of wikipedia.org? — Omegatron ( talk) 01:49, 7 September 2018 (UTC)
Much, much better. However, it could still be a bit better: When you start, you have
*** Processing page '{2018 FFA Cup preliminary rounds}' : 12:13:01 -------------------------------------------------------------------------- [12:13:02] Processing page '[[2018 FFA Cup preliminary rounds]]' — [[edit]]—[[history]]
This should be simplified to
-------------------------------------------------------------------------- [12:13:02] Processing page '[[2018 FFA Cup preliminary rounds]]' – [[edit]] – [[history]]
This eliminates redundancy; the spaces help, and it uses en dashes. When no changes are required, you have
# No changes required. # # #
This should be simplified to
# No changes required.
When you have a change, you have
# Writing to Peace Pledge Union... Written to [[Peace Pledge Union]]{{POV|date=October 2015}} The '''Peace Pledge Union (PPU)''' is a British [[pacifist]] ...
This should be simplified to (with a line break after "Writing to Peace Pledge Union...")
# Writing to [[Peace Pledge Union]]... {{POV|date=October 2015}} The '''Peace Pledge Union (PPU)''' is a British [[pacifist]] ...
And when you end with
[[history]] / [[last edit]]
This could be much simpler/clearer with
[[diff]]
Headbomb { t · c · p · b} 12:16, 21 August 2018 (UTC)
{{ wontfix}} not going to pollute code with all sorts of "if ( running a category) then " code. AManWithNoPlan ( talk) 21:56, 12 September 2018 (UTC)
Not sure if there is a reason why this isn't done yet, but would it be possible to add JSTOR links in cases where this isn't already added? For example, source 12 in Brachiosaurus has a doi, but I know it is also on JSTOR [11], so shouldn't the bot be able to cross check? FunkMonk ( talk) 00:22, 14 September 2018 (UTC)
{{ wontfix}} Nope, not searchable. JSTOR disabled that years ago. AManWithNoPlan ( talk)
https://www.jstor.org/openurl?doi=10.1671/0272-4634(2003)023[0344:teovpi]2.0.co;2. {{
cite web}}
: Missing or empty |title=
(
help)
AManWithNoPlan (
talk) 13:19, 14 September 2018 (UTC)
https://support.jstor.org/hc/en-us/articles/115005079047-JSTOR-OpenURL-Linking-
{{ Fixed}}
FYI, I noticed text-weight: bold
, which is not valid CSS. The property is font-weight
. --
Izno (
talk) 22:34, 9 September 2018 (UTC)
|format=Full text
, |format=Accepted manuscript
, |format=Submitted manuscript
are all inappropriate uses of |format=
(not a 'new' parameter); see
documentation.
diff
The bot continues to add |format=
with inappropriate values, in this case Full text
. The purpose of the 'format' parameters is to identify for the reader the file format of the linked source, PDF, XLS, DOC, etc. (see the
documentation).
But the just-as-inappropriate
case 'submittedVersion': $format = 'Submitted manuscript'; break;
case 'acceptedVersion': $format = 'Accepted manuscript'; break;
remain. Headbomb { t · c · p · b} 14:26, 9 August 2018 (UTC)
|format=Full text
, |format=Accepted manuscript
, |format=Submitted manuscript
are appropriate in |format=
ever. Read the template
documentation. The only thing that belongs in |format=
is the electronic file format: PDF, XLS, DOC, MP3, etc.|preprint-url=
that appends "
preprint" at the end of the template.
Headbomb {
t ·
c ·
p ·
b} 13:50, 11 August 2018 (UTC)
Unless it is a file format, nothing should be added by the bot in |format=
. Think what you want about the existence of such a parameter, but don't misuse it. (
t)
Josve05a (
c) 20:29, 24 August 2018 (UTC)
Should perhaps have been fixed with
https://github.com/ms609/citation-bot/pull/652 but in
https://en.wikipedia.org/?title=Vallejo_%28ferry%29&diff=prev&oldid=856647186 the bot converted raw Google Book URLs to {{
cite web}}
. (Is this just because I'm using the gadget tool now and not the user script, and that pull fixes are delayed, or is this a new error?) (
t)
Josve05a (
c) 18:17, 26 August 2018 (UTC)
{{ wontfix}} for now. We have some improvements coming. AManWithNoPlan ( talk) 17:08, 20 September 2018 (UTC)
Women's liberation movement in North America Don't even know where to start. Changing isbn numbers from those given in source viewed, changing publishing information or deleting publisher, removing publishing location, all distort and are incompatible with accuracy of citation. As a historian, this bot failed to improve any of the citations and distorted the accuracy of information about source and material. Improving citations is always welcome, but deleting information which identifies sourcing accurately is not worthwhile. SusunW ( talk) 04:16, 8 September 2018 (UTC)
|id=
since en.wikipedia.org has decided to remove publisher and location for all journals.
AManWithNoPlan (
talk) 03:35, 10 September 2018 (UTC)
The real solution is wiki linking the journal name and making an article about the journal. AManWithNoPlan ( talk) 15:47, 10 September 2018 (UTC)
significant attention by the world at large and over a period of time". So it is quite possible to have a journal (likely very specialized) that is well-known and highly respected in a narrow field of experts, and has published articles relevant to some topic, but which has not gained "significant attention by the world at large". WP:V does not require a WP:N source. ♦ J. Johnson (JJ) ( talk) 00:14, 11 September 2018 (UTC)
{{ notabug}} AManWithNoPlan ( talk) 14:04, 20 September 2018 (UTC)
|first=M. M
|first=M. M.
The meta data does not have them. That’s the problem.
AManWithNoPlan (
talk) 22:15, 27 August 2018 (UTC)
|first#=
in a citation you find the pattern (^| )[A-z]\.
, replace (^| )([A-z])( |$)
with $1$2.
in all other |first#=
found in the citation.|first#=
too. It's a really really widespread problem.
Headbomb {
t ·
c ·
p ·
b} 01:49, 28 August 2018 (UTC)
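The rule proposed above can be sketched as follows. This is a hedged Python illustration with a hypothetical function name, `harmonize_initials`. Two deliberate deviations from the quoted patterns: `[A-Za-z]` is used because `[A-z]` also matches the ASCII punctuation between "Z" and "a", and a lookahead replaces the trailing `( |$)` group so that the following space is not swallowed by the replacement.

```python
import re

# At least one initial in the citation is already written with a period.
HAS_DOTTED_INITIAL = re.compile(r"(?:^| )[A-Za-z]\.")
# A single letter standing alone (followed by a space or the end of the value).
BARE_INITIAL = re.compile(r"(^| )([A-Za-z])(?= |$)")


def harmonize_initials(first_names):
    """Add periods to bare initials if any |first#= value already uses them."""
    if not any(HAS_DOTTED_INITIAL.search(name) for name in first_names):
        return list(first_names)
    return [BARE_INITIAL.sub(r"\1\2.", name) for name in first_names]
```

For example, `harmonize_initials(["M. M", "John A"])` gives `["M. M.", "John A."]`, while a citation with no dotted initials at all is left unchanged.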
In
https://en.wikipedia.org/?title=Khanate_of_Kazan&diff=prev&oldid=856050147 the bot expands from an URL to a {{
cite journal}}
, which is amazing! However, the bot should mention this in the edit summary somehow. (
t)
Josve05a (
c) 14:38, 22 August 2018 (UTC)
this edit was triggered via
https://tools.wmflabs.org/citations/category.php?edit=toolbar&slow=1&user=Headbomb&cat=Livestock%20stubs
With the edit summary
This should instead be
When you run the page, you are presented with
Activated by Headbomb.
> Expanding 'Ununennium'; will commit edits.
Reading authentication tokens from tools.wmflabs.org.
[00:35:06] Processing page 'Ununennium' — edit—history
...
This would be much clearer/less intimidating if it was something like
Citation bot activated by Headbomb. The bot will automatically make edit(s) if it can.
>Bot logging on tools.wmflabs.org.
[00:35:06] Processing page 'Ununennium' — edit—history
...
Headbomb { t · c · p · b} 00:55, 30 August 2018 (UTC)
|url=
https://www.worldcat.org/oclc/873805659
→ |oclc=873805659
https://github.com/ms609/citation-bot/pull/741
AManWithNoPlan (
talk) 03:02, 9 September 2018 (UTC)
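A small sketch of the URL-to-identifier conversion shown above, assuming only the `worldcat.org/oclc/<number>` URL shape from the example. It is written in Python for illustration and is not taken from the linked pull request; `oclc_from_url` is a hypothetical name.

```python
import re

# Matches the WorldCat URL form from the example above.
WORLDCAT_OCLC = re.compile(r"^https?://(?:www\.)?worldcat\.org/oclc/(\d+)/?$")


def oclc_from_url(url: str):
    """Return the OCLC number embedded in a WorldCat URL, or None."""
    match = WORLDCAT_OCLC.match(url.strip())
    return match.group(1) if match else None
```

A caller would set |oclc= to the returned value and drop |url= only when the result is not `None`.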
Running the bot multiple times after each edit on the same page results in new edits being made. All possible edits should be done before saving the article.
( t) Josve05a ( c) 21:33, 26 August 2018 (UTC)
<ref>https://arxiv.org/ftp/arxiv/papers/1312/1312.7288.pdf</ref>
https://arxiv.org/abs/1312.7288
https://github.com/ms609/citation-bot/pull/711
AManWithNoPlan (
talk) 02:54, 2 September 2018 (UTC)
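The conversion in the example above could be approximated like this. It is a Python sketch under the assumption that only the `/ftp/arxiv/papers/YYMM/<id>.pdf` form needs handling; `arxiv_abs_url` is a hypothetical helper, not the bot's function.

```python
import re

# The FTP-style PDF path from the example: .../papers/1312/1312.7288.pdf
ARXIV_FTP_PDF = re.compile(
    r"^https?://arxiv\.org/ftp/arxiv/papers/(\d{4})/(\d{4}\.\d{4,5})\.pdf$"
)


def arxiv_abs_url(url: str):
    """Map an arXiv FTP PDF link to its abstract page, or return None."""
    match = ARXIV_FTP_PDF.match(url.strip())
    if match is None:
        return None
    folder, identifier = match.groups()
    if not identifier.startswith(folder):
        # The folder should repeat the YYMM prefix of the identifier.
        return None
    return "https://arxiv.org/abs/" + identifier
```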
I ran into the same issue. Sometimes I run the bot through an article twice because it appears in multiple reference cleanup required sections and I notice that the bot would add a page number with regular hyphen (-), then clean it up later with an en dash(–). Examples are
[14]
[15] and
[16]
[17]. If the intention of the bot is to have en dashes for page numbers, maybe it could do that when adding it so it does not have to make the subsequent edit again. --
AquaDTRS (
talk) 20:07, 6 September 2018 (UTC)
|work=Google.com
to a {{
cite book}} with |website=Google.com
|website=[Gg]oogle.com
from Google Books
Which is better via= or delete?
AManWithNoPlan (
talk) 22:32, 2 September 2018 (UTC)
|postcript=.
{{
cite journal}}
: Invalid |ref=harv
(
help)CS1 maint: postscript (
link)|postcript=.
{{
cite journal}}
: Invalid |ref=harv
(
help)
What are your thoughts on removing empty |postcript=
on {{
citation}} also, since it does nothing?
AManWithNoPlan (
talk) 21:35, 8 September 2018 (UTC)
|postcript=
{{
citation}}
: Invalid |ref=harv
(
help)|postcript=
{{
citation}}
: Invalid |ref=harv
(
help)
In case a DOI does not resolve (i.e. is broken/inactive), check if the DOI has more than one forward-slash. If it does, remove the second and all content after it.
Real example: 10.1111/ruso.12119/full
to 10.1111/ruso.12119
. If it resolves and gives matching metadata, replace the |doi=
field. (
t)
Josve05a (
c) 01:49, 6 September 2018 (UTC)
/full
and remove them, if at the end of a broken DOI. (
t)
Josve05a (
c) 01:50, 6 September 2018 (UTC)
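The repair step suggested above might be sketched like this in Python. The names `trim_doi` and `repair_doi` are hypothetical, and the `resolves` callback is an assumed stand-in for whatever resolution check the bot performs. The metadata comparison mentioned in the request is not implemented here.

```python
def trim_doi(doi: str) -> str:
    """Cut a DOI at its second forward slash, if it has one."""
    first = doi.find("/")
    if first == -1:
        return doi
    second = doi.find("/", first + 1)
    return doi if second == -1 else doi[:second]


def repair_doi(doi: str, resolves) -> str:
    """Return a working DOI if trimming helps, otherwise the original.

    `resolves` is a caller-supplied function (an assumption of this sketch)
    that reports whether a DOI currently resolves.
    """
    if resolves(doi):
        return doi  # Working DOIs are never touched, even with extra slashes.
    candidate = trim_doi(doi)
    if candidate != doi and resolves(candidate):
        return candidate
    return doi
```

Trimming only after the original fails matters because some valid DOIs legitimately contain more than one slash.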
Should we really remove |url=
simply because the URL has a known doi in it? I think if the URL is a PDF file, it is worth keeping since it is linking to the journal article directly (as open source). We don't always remove |url=
when |doi=
is present, only if that specific URL happens to have the DOI in its path. Either we should always delete the URL, or never in my own opinion, but if we should, we shouldn't do so when the URL is a PDF. (
t)
Josve05a (
c) 22:28, 27 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/704 AManWithNoPlan ( talk) 01:38, 1 September 2018 (UTC)
to
Gotta love bad meta data. The bibcode has a journal parameter.
AManWithNoPlan (
talk) 02:46, 28 August 2018 (UTC)
[numFound] => 1 [start] => 0 [docs] => Array ( [0] => stdClass Object ( [arxiv_class] => Array ( [0] => gr-qc ) [identifier] => Array ( [0] => 2001gcit.conf..195H [1] => 2001gr.qc.....3067H [2] => 2001LNP...562..195H [3] => 10.1007/3-540-40988-2_10 [4] => 2001gcit.conf..195H [5] => gr-qc/0103067 [6] => 10.1007/3-540-40988-2_10 [7] => 2001gr.qc.....3067H ) [year] => 2001 [page] => Array ( [0] => 195 ) [bibcode] => 2001LNP...562..195H [pubdate] => 2001-00-00 [author] => Array ( [0] => Haugan, Mark P. [1] => Lämmerzahl, C. ) [volume] => 562 [doi] => Array ( [0] => 10.1007/3-540-40988-2_10 ) [pub] => Gyros, Clocks, Interferometers ...: Testing Relativistic Gravity in Space [doctype] => inbook [title] => Array ( [0] => Principles of Equivalence: Their Role in Gravitation Physics and Experiments That Test Them ) ) )
In {{
cite web}}, since |work=
is an alias for |website=
which is the template native parameter. Might not be ideal for many things that should actually be {{
cite news}}.
AManWithNoPlan (
talk) 20:22, 5 September 2018 (UTC)
@
AManWithNoPlan:. This is very disruptive. Might suggest disabling the tool until you are able to find and fix the problem, particularly if this is not even a "biggest fish" bug. It should not be converting |work=
-> |website=
at the rate of 1,000s or 10's of thousands. --
Green
C 13:40, 16 September 2018 (UTC)
|doi=
. However, that doi is marked as broken/inactive.
Follow-up from
User_talk:Citation_bot/Archive_9#Broken_dois_and_removal_of_URLs, still not fixed. (
t)
Josve05a (
c) 01:46, 6 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/729
AManWithNoPlan (
talk) 15:10, 6 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/729
AManWithNoPlan (
talk) 15:11, 6 September 2018 (UTC)
Thanks for reporting the issue. I believe this might occur for a number of articles which I ran the bot through, although I won't know which ones until the list of articles with invalid dates gets populated again in the next cycle. Also, I was thinking maybe the bot could include a feature to check for an invalid year before it replaces it, just in case it finds a set of numbers that look like dates elsewhere again. --
AquaDTRS (
talk) 19:38, 6 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/733
AManWithNoPlan (
talk) 23:53, 6 September 2018 (UTC)
! No match for bibcode identifier: 2012PhLB..713...17I; 2014A&A...571A..16P + Adding url: http://www.jstor.org/stable/78071"
https://github.com/ms609/citation-bot/pull/732
AManWithNoPlan (
talk) 20:27, 6 September 2018 (UTC)
|title=The First Destroyer «
to |title=The First Destroyer "
Regression of
User_talk:Citation_bot/Archive_10#Arrows_and_not_always_quotes
|doi=
10.1002/(SICI)1097-0134(20000515)39:3<216::AID-PROT40>3.0.CO;2-#
as inactive/broken, despite it being alive and working
https://github.com/ms609/citation-bot/pull/751
AManWithNoPlan (
talk) 16:24, 11 September 2018 (UTC)
https://en.wikipedia.org/?title=Andragogy&diff=859059054&oldid=859058657
|at=
with non-specific |pages=
|at=pp.425–439, see Table 2 p. 426 for tempering temperatures
to |pages=425–439
https://github.com/ms609/citation-bot/pull/756
AManWithNoPlan (
talk) 20:36, 11 September 2018 (UTC)
�
in parameters instead of the proper Unicode characters ü
, ä
and ö
Sorry, but the data is wrong in CrossRef. We could detect it, but we cannot fix it.
AManWithNoPlan (
talk) 16:19, 11 September 2018 (UTC)
I think that's fixed in our GitHub development tree.
AManWithNoPlan (
talk) 01:05, 14 September 2018 (UTC)
|title=
.
! CrossRef server error loading headers for DOI 10.1002/(SICI)1520-6394(1998)8:1 <64::AID-DA10>3.0.CO;2-S: HTTP/1.0 400 Bad request DOI ok. ! No CrossRef record found for doi '10.1002/(SICI)1520-6394(1998)8:1 <64::AID-DA10>3.0.CO;2-S'; marking as broken
assuming php memory bug
AManWithNoPlan (
talk) 03:19, 18 September 2018 (UTC)
As a result, the link to the paper is on the proceedings name and not in the title. For instance "Proceedings of ISSTA, Demonstration Track" https://hal.archives-ouvertes.fr/hal-01321615/file/astor.pdf points to "ASTOR: A Program Repair Library for Java" (on /info/en/?search=Automatic_bug_fixing)
This is confusing both for human readers and for search engines.
This is
GIGO.
Headbomb {
t ·
c ·
p ·
b} 13:37, 18 September 2018 (UTC)
|deadurl=no/yes/whatever
when no URL is present.
{{
fixed}} removes this when removing url now
AManWithNoPlan (
talk) 14:25, 20 September 2018 (UTC)
Thank you for changing the bug title.
AManWithNoPlan (
talk) 18:24, 9 September 2018 (UTC)
{{ cite web}} should be the exception. Leave that one alone, unless it's converted. e.g. [23]. Headbomb { t · c · p · b} 23:18, 18 September 2018 (UTC)
|access-date=
removal at
User:GreenC_bot/Job_5 that was arrived at by lengthy community input over a 5 month period. --
Green
C 01:48, 19 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/810
AManWithNoPlan (
talk) 23:15, 20 September 2018 (UTC)
|authorlinux=
to |authorlink#=
|pages=8159
to |pages=astro-ph/9508159
Brandenberger, Robert H. (1995). "Formation of Structure in the Universe": 8159.
Bibcode:
1995astro.ph..8159B. {{
cite journal}}
: Cite journal requires |journal=
(
help)
{{cite book |author={{noitalic|{{lang|zh-hans|国务院人口普查办公室、国家统计局人口和社会科技统计司编}}}} |date=2012 |script-title=zh:中国2010年人口普查分县资料 |location=Beijing |publisher={{noitalic|{{lang|zh-hans|中国统计出版社}}}} [China Statistics Press] |page= |isbn=978-7-5037-6659-6 }}
to
{{cite book |author={{noitalic|{{lang|zh-hans|国务院人口普查办公室、国家统计局人口和社会科技统计司编}}}} |title=中国2010年人口普查分县资料 |date=2012 |script-title=zh:中国2010年人口普查分县资料 |location=Beijing |publisher={{noitalic|{{lang|zh-hans|中国统计出版社}}}} [China Statistics Press] |page= |isbn=978-7-5037-6659-6 }}
Making the title 中国2010年人口普查分县资料
appear twice.
|title=
if |script-title=
is the same (or includes the title in its string).
Same with История русского автомата
in
https://en.wikipedia.org/?title=7.62×39mm&oldid=860197219 (
t)
Josve05a (
c) 00:34, 19 September 2018 (UTC)
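The check requested above could be sketched as follows. This is illustrative Python with a hypothetical function name, `should_add_title`; parameters are modelled as a plain dict, which is an assumption of the sketch rather than the bot's data structure.

```python
def should_add_title(params: dict, candidate: str) -> bool:
    """Decide whether a fetched title may be added as |title=."""
    if params.get("title", "").strip():
        return False  # A title is already present.
    candidate = candidate.strip()
    if not candidate:
        return False
    # |script-title= usually carries a language prefix such as "zh:", so a
    # substring test covers both "is the same" and "includes the title".
    return candidate not in params.get("script-title", "")
```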
# # # citation_bot_placeholder_comment 0 # # #
{{cite web |<!--trans--->title = Beij}} unfixable without massive effort
AManWithNoPlan (
talk) 02:49, 22 September 2018 (UTC)
{{cite journal|last1=Kaye|first1=Steven|last2=Fox|first2=Joseph M.|last3=Hicks|first3=Frederick A.|last4=Buchwald|first4=Stephen L.|title=The Use of Catalytic Amounts of CuCl and Other Improvements in the Benzyne Route to Biphenyl-Based Phosphine Ligands|journal=Advanced Synthesis & Catalysis|date=31 December 2001|volume=343|issue=8|pages=789–794|doi=10.1002/1615-4169(20011231)343:83.0.CO;2-A|url=http://onlinelibrary.wiley.com/doi/10.1002/1615-4169(20011231)343:8%3C789::AID-ADSC789%3E3.0.CO;2-A/full|language=en|issn=1615-4169|doi-broken-date=2017-04-22}}
to
{{cite journal|last1=Kaye|first1=Steven|last2=Fox|first2=Joseph M.|last3=Hicks|first3=Frederick A.|last4=Buchwald|first4=Stephen L.|title=The Use of Catalytic Amounts of CuCl and Other Improvements in the Benzyne Route to Biphenyl-Based Phosphine Ligands|journal=Advanced Synthesis & Catalysis|date=31 December 2001|volume=343|issue=8|pages=789–794|doi=10.1002/1615-4169(20011231)343:83.0.CO;2-A|language=en|issn=1615-4169|doi-broken-date=2018-09-21}}
Perhaps even replace it with the new one, as in this case the DOI was missing a character.
AManWithNoPlan (
talk) 21:06, 21 September 2018 (UTC)
do not drop urls that point to .pdf even if they have doi AManWithNoPlan ( talk) 03:14, 23 September 2018 (UTC)
I don't know if this is the case (pretty sure it isn't), but the bot should convert
|publication-date=
→ |date=
|publication-place=
→ |location=
If |location=
/ |date=
aren't set / are empty
Headbomb {
t ·
c ·
p ·
b} 13:48, 12 August 2018 (UTC)
Only other one I can think of is
|orig-year=
/|origyear=
→ |year=
Headbomb { t · c · p · b} 16:19, 12 August 2018 (UTC)
|orig-year=
should be converted to |year=
only if (a) |year=
is empty and (b) |orig-year=
contains only a valid four-digit year. Both must be true. If |orig-year=
contains additional text, it should not be moved to |year=
; that will cause an error message to appear. –
Jonesey95 (
talk) 17:45, 12 August 2018 (UTC)
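The two conditions stated above can be written out as a short sketch. It is Python for illustration only; `promote_orig_year` is a hypothetical name and parameters are modelled as a dict. Also treating a non-empty |date= as a reason to skip is an extra assumption of this sketch, not part of the rule as stated.

```python
import re

FOUR_DIGIT_YEAR = re.compile(r"\d{4}")


def promote_orig_year(params: dict) -> dict:
    """Move |orig-year= to |year= only when it is safe to do so."""
    result = dict(params)
    # Condition (a): |year= must be empty (|date= is checked too, as an assumption).
    if result.get("year", "").strip() or result.get("date", "").strip():
        return result
    for key in ("orig-year", "origyear"):
        value = result.get(key, "").strip()
        # Condition (b): the value must be nothing but a four-digit year.
        if FOUR_DIGIT_YEAR.fullmatch(value):
            del result[key]
            result["year"] = value
            break
    return result
```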
|website=wayback.archive.org
, |publisher=wayback.archive.org
etc.|website=archive.org
, |publisher=archive.org
etc. if the (main) url has another domain than archive.org.
https://github.com/ms609/citation-bot/pull/822
AManWithNoPlan (
talk) 15:49, 25 September 2018 (UTC)
https://en.wikipedia.org/?title=CKMT1B&diff=858892445&oldid=858892287
https://en.wikipedia.org/?title=%CA%BBOumuamua&diff=prev&oldid=861195731
|publisher=Amazon.com
if removing Amazon URL in favor of ISBN.
https://github.com/ms609/citation-bot/pull/823
AManWithNoPlan (
talk) 15:54, 25 September 2018 (UTC)
|vauthors=
and the like for name-formatting or the use of CS1 versus CS2).|page=
and |pages=
are aliases.
|page=13-25
should be converted to |pages=13–25
|page=
/|pages=
/|at=
is set, remove the others (if they are empty / redundant)|year=
and |date=
are aliases.
|date=2008
should be converted to |year=2008
|year=
/|date=
is set, remove the other (if it is empty)|via=
online makes sense if a URL is provided, so remove it if there is no url provided.
So a citation like
{{cite journal |last=Smith |first=John |date=2007 |year= |title=Foobar |journal=Barfoo Journal |volume=3 |issue=4 |page=34-44 |pages= |via=}}
cleans up to
{{cite journal |last=Smith |first=John |year=2007 |title=Foobar |journal=Barfoo Journal |volume=3 |issue=4 |pages=34–44}}
Headbomb { t · c · p · b} 17:01, 25 September 2018 (UTC)
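The page-related part of this cleanup could be sketched as below. It is a Python illustration with a hypothetical function name, `tidy_pages`, and parameters modelled as a dict. The |date= / |year= part is deliberately left out, and the sketch assumes that a value like "34-44" is a range rather than a single hyphenated page number.

```python
import re

# A numeric range written with a hyphen or an en dash.
PAGE_RANGE = re.compile(r"(\d+)\s*[-–]\s*(\d+)")


def tidy_pages(params: dict) -> dict:
    """Drop empty page parameters and move a range from |page= to |pages=."""
    result = {
        key: value
        for key, value in params.items()
        if not (key in ("page", "pages", "at") and not value.strip())
    }
    match = PAGE_RANGE.fullmatch(result.get("page", "").strip())
    if match and "pages" not in result:
        del result["page"]
        result["pages"] = f"{match.group(1)}–{match.group(2)}"
    return result
```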
|date=
. Otherwise these are reasonable suggestions. --
Izno (
talk) 17:03, 25 September 2018 (UTC)
|date=
is not a true alias of |year=
– true aliases cause the 'more than one of param and param' error message as here with |work=
and |journal=
:
{{cite journal |title=Title |work=Work |journal=Journal}}
→ "Title". Journal. {{
cite journal}}
: More than one of |work=
and |journal=
specified (
help)|date=
and |year=
are required (or desired), they cannot be aliases. I agree with Editor Izno that |date=
should be preferred over |year=
when both are not required.|date=
and |year=
are required (or desired)" what would those occasions be? Having |date=2008-04-26
and |year=2008
just presents redundant information.
Headbomb {
t ·
c ·
p ·
b} 18:39, 25 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/829 AManWithNoPlan ( talk) 23:37, 26 September 2018 (UTC)
|year=
field when set to "n.d." thus causing there to be a cite date error|year=
in this case.
Add tests and soon code.
https://github.com/ms609/citation-bot/pull/825
AManWithNoPlan (
talk) 18:27, 26 September 2018 (UTC)
Unless the entire journal name is wiki-linked, the data is almost always wrong. Secondly, partial links corrupt the COINS data and should not be done that way.
AManWithNoPlan (
talk) 20:02, 27 September 2018 (UTC)
{{
fixed}}
AManWithNoPlan (
talk) 21:02, 27 September 2018 (UTC)
|date=
field causing a cite date error to occur such as "| date= 2011-05-10T06:34:00-0400"
{{
fixed}}
AManWithNoPlan (
talk) 21:01, 27 September 2018 (UTC)
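One way to avoid the cite date error described above is to strip the time portion before the value is written. This is a hedged Python sketch, not the actual fix; `strip_time` is a hypothetical name, and only ISO 8601 style timestamps like the one quoted are handled.

```python
import re

# Date, "T", a time, and an optional zone such as "Z", "-0400" or "+01:00".
ISO_TIMESTAMP = re.compile(
    r"(\d{4}-\d{2}-\d{2})T\d{2}:\d{2}(?::\d{2})?(?:Z|[+-]\d{2}:?\d{2})?"
)


def strip_time(date: str) -> str:
    """Reduce a full timestamp to its YYYY-MM-DD date; leave other values alone."""
    match = ISO_TIMESTAMP.fullmatch(date.strip())
    return match.group(1) if match else date
```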
For example
<ref>{{cite web |url=https://www.sciencedirect.com/science/article/pii/S0024379512004405 |title=Geometry of the Welch bounds}}</ref>
or
<ref>https://www.sciencedirect.com/science/article/pii/S0024379512004405</ref>
Those URLs are extremely common and if they can be parsed (similar to DOI urls), that would be fantastic. And then they could be removed since they'll be redundant with DOIs. Headbomb { t · c · p · b} 22:52, 29 August 2018 (UTC)
We would need to grab the HTML, parse it as XML, and read <meta name="citation_doi" content="10.1016/j.laa.2012.05.036" />. Or we could use: https://api.elsevier.com/content/object/pii/S0024379512004405 AManWithNoPlan ( talk) 14:31, 30 August 2018 (UTC)
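The first approach mentioned here, reading the `citation_doi` meta tag from the page, can be sketched with Python's standard `html.parser`. Fetching the page is left out; the function takes HTML that has already been downloaded. The names `_DoiMetaParser` and `doi_from_html` are hypothetical and this is not the bot's implementation.

```python
from html.parser import HTMLParser


class _DoiMetaParser(HTMLParser):
    """Collects the content of the first <meta name="citation_doi"> tag."""

    def __init__(self):
        super().__init__()
        self.doi = None

    def handle_starttag(self, tag, attrs):
        # Self-closing <meta ... /> tags are also routed through this method.
        if tag != "meta" or self.doi is not None:
            return
        attributes = dict(attrs)
        if attributes.get("name") == "citation_doi":
            self.doi = attributes.get("content")


def doi_from_html(html: str):
    """Return the DOI advertised in the page's metadata, or None."""
    parser = _DoiMetaParser()
    parser.feed(html)
    return parser.doi
```

Using a tolerant HTML parser rather than a strict XML one is a design choice of the sketch, since publisher pages are not guaranteed to be well-formed XML.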
I'd like the option to have a 'rebuild references' when they are so crappy we need to TNT them (for whatever reason), and start anew. Two options would be present
This would present things in a 'standardized' parameter order with 'standardized' whitespace
multiline | inline |
---|---|
<ref> {{cite arXiv |last1= |first1= |last2= |first2= |... |date= or |year= |title= |arxiv=<import> |class= }}</ref> |
<ref>{{cite arXiv |last1= |first1= |last2= |first2= |... |date= or |year= |title= |arxiv=<import> |class=}}</ref> |
multiline | inline |
---|---|
{{cite book |last1= |first1= |last2= |first2= |... |date= or |year= |chapter= |chapter-url=<import> |chapter-url-access=<import> |editor1-last= |editor1-first= |editor2-last= |editor2-first= |title= |trans-title=<import> |language=<import non-English> |script-title=<import> |url=<import> |url-access=<import> |access-date=<import> |format=<import, if valid> |archive-url=<import> |archive-date=<import> |dead-url=<import> |series= |volume= |pages= (or |page=) |location= |publisher= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import> }} |
{{cite book |last1= |first1= |last2= |first2= |... |date= or |year= |chapter= |chapter-url=<import> |chapter-url-access=<import> |editor1-last= |editor1-first= |editor2-last= |editor2-first= |title= |trans-title=<import> |language=<import non-English> |script-title=<import> |url=<import> |url-access=<import> |access-date=<import> |format=<import, if valid> |archive-url=<import> |archive-date=<import> |dead-url=<import> |series= |volume= |pages= (or |page=) |location= |publisher= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import>}} |
multiline | inline |
---|---|
{{cite journal |last1= |first1= |last2= |first2= |... |date= or |year= |title= |trans-title=<import> |language=<import non-English> |script-title=<import> |url=<import> |url-access=<import> |access-date=<import> |format=<import, if valid> |archive-url=<import> |archive-date=<import> |dead-url=<import> |journal= |series= |volume= |issue= |pages= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import> }} |
{{cite journal |last1= |first1= |last2= |first2= |... |date= or |year= |title= |trans-title=<import> |language=<import non-English> |script-title=<import> |url=<import> |url-access=<import> |access-date=<import> |format=<import, if valid> |archive-url=<import> |archive-date=<import> |dead-url=<import> |journal= |series= |volume= |issue= |pages= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import>}} |
multiline | inline |
---|---|
{{cite web |last1= |first1= |last2= |first2= |... |date= or |year= |editor1-last= |editor1-first= |editor2-last= |editor2-first= |title= |url=<import> |website= |series= |volume= |pages= |location= |publisher= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import> }} |
{{cite web |last1= |first1= |last2= |first2= |... |date= or |year= |editor1-last= |editor1-first= |editor2-last= |editor2-first= |title= |url=<import> |website= |series= |volume= |pages= |location= |publisher= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import>}} |
Whatever is marked <import>
would be carried over from the old citation, with URLs/identifiers used to rebuild the rest of the citation. The rest would be present (if the bot can/would fill them), or omitted (if the bot can't/wouldn't fill them).
Headbomb {
t ·
c ·
p ·
b} 17:59, 1 September 2018 (UTC)
See also bioRxiv at https://en.wikipedia.org/?title=Homo_sapiens&diff=prev&oldid=861469112
Fixed in
GitHub Pull 855
AManWithNoPlan (
talk) 21:37, 27 September 2018 (UTC)
?edit=toolbar&slow=1&zotero=0
(
t)
Josve05a (
c) 21:42, 27 September 2018 (UTC)
|date=scheme=dcterms.ISO8601; 2013-10-23
https://github.com/ms609/citation-bot/pull/858
AManWithNoPlan (
talk) 16:08, 28 September 2018 (UTC)
What do you suggest? The template supports two different title parameters and then shows them both. We added code that prevents duplicates, but in these cases the script title and title are very different (or maybe one is basically printed and the other is a cursive styling of the same words). Perhaps:
if (has script-title and new title is not all western characters) then ignore new title else add title end if
AManWithNoPlan ( talk) 16:12, 28 September 2018 (UTC)
|trans-title=
.if (has script-title) then ignore new title else add title end if
( t) Josve05a ( c) 16:18, 28 September 2018 (UTC)
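The two proposals above differ only in whether the script of the incoming title is inspected. A minimal Python sketch of the first variant follows; `should_add_title` and `is_western` are hypothetical names, and treating "western" as "every character is in the Latin-1 range" is an assumption made for illustration, not the bot's actual rule.

```python
def is_western(text: str) -> bool:
    """Rough test (assumption): every character is in the Latin-1 range."""
    return all(ord(ch) < 0x100 for ch in text)


def should_add_title(params: dict, new_title: str) -> bool:
    """Decide whether a newly found title should be written to |title=."""
    has_script_title = bool(params.get("script-title", "").strip())
    if has_script_title and not is_western(new_title):
        # The new title is probably the same non-Latin text again: ignore it.
        return False
    return True
```

The second variant would simply drop the `is_western` check and return `not has_script_title`.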
Activating the bot sends it into an endless loop of doing absolutely nothing. Can't really explain more save it just fails to run properly on any page you try to run it on. Headbomb { t · c · p · b} 14:39, 29 September 2018 (UTC)
{{ fixed}} AManWithNoPlan ( talk) 20:36, 29 September 2018 (UTC)
Why???? What do you expect the bot to do?
AManWithNoPlan (
talk) 22:48, 3 October 2018 (UTC)
{{cite LSA|url=https://www.ncbi.nlm.nih.gov/pubmed/4043876|year=1985|title=Kondous laventicus, a new ceboid primate from the Miocene of the La Venta, Colombia, South America}}
{{Cite LSA|last=Setoguchi|first=T.|date=1985|title=Kondous laventicus, a new ceboid primate from the Miocene of the La Venta, Colombia, South America|url=https://www.ncbi.nlm.nih.gov/pubmed/4043876|journal=Folia Primatologica; International Journal of Primatology|volume=44|pages=96–101|year=1985}}
http://dare.uva.nl/personal/pure/en/publications/functional-reconstruction-of-structurally-complex-epitopes-using-clips-technology(ce45bb5a-7823-4872-a0b1-e5e5a99a79e5).html |type=Submitted manuscrip
|url-access=free
Or, actually, in this case it should have added |hdl=11245/1.309707
instead of the URL, but in general when adding a free URL, it should add |url-access=free
. (
t)
Josve05a (
c) 22:42, 3 October 2018 (UTC)
|url-access=free
is not supported by cs1|2 because values in |url=
are presumed to be free-to-read.
http://dare.uva.nl/personal/pure/en/publications/functional-reconstruction-of-structurally-complex-epitopes-using-clips-technology(ce45bb5a-7823-4872-a0b1-e5e5a99a79e5).html |type=Submitted manuscrip
|hdl=11245/1.309707
instead
{{
wontfix}} the meta data is poor quality
AManWithNoPlan (
talk) 03:05, 4 October 2018 (UTC)
|url=
https://www.ncbi.nlm.nih.gov/books/NBK24662/
, nothing happens
The treasure trove of URL readers used by Citoid do actually parse this page:
https://github.com/zotero/translators
AManWithNoPlan (
talk) 18:37, 21 August 2018 (UTC)
<div class="publication-meta-secondary">DOI: 10.1136/jnnp.2008.144360 [...]
{{ wontfix}} they block us and anything that looks like scraping. AManWithNoPlan ( talk) 02:01, 5 October 2018 (UTC)
Those were not bare urls though. Headbomb { t · c · p · b} 13:43, 4 October 2018 (UTC)
This is a GIGO problem, since the info exists in the headers of the web page(s) in question, but it would be great if the tool could ignore this junk instead of inserting it. See the archives of
User talk:Zhaofeng Li/reFill, another tool that editors have been using to semi-automatically insert this junk for years. Gnomes remove it manually if tool-using editors fail to see it in Preview. –
Jonesey95 (
talk) 15:02, 29 September 2018 (UTC)
|journal=Ilr Review
|journal=ILR Review
https://github.com/ms609/citation-bot/pull/874
AManWithNoPlan (
talk) 03:07, 4 October 2018 (UTC)
https://github.com/ms609/citation-bot/pull/877
AManWithNoPlan (
talk) 03:04, 4 October 2018 (UTC)
Rasmussen, D. T. (2002). "The origin of Primates". In Hartwig, W. C. (ed.). The Primate Fossil Record. Cambridge: Cambridge University Press. pp. 5–9.
Bibcode:
2002prfr.book.....H. {{
cite book}}
: |journal=
ignored (
help)
Newer code avoids those book bibcodes
AManWithNoPlan (
talk) 03:08, 5 October 2018 (UTC)
I'm currently using https://tools.wmflabs.org/citations/category.php?cat=1980_births&slow=1 which makes one edit and then stops (this seems to be a bug from above discussions). I'd like to make the bot run on random pages and then stop when it has made an edit; I, however, don't want to specify a category. I'd love to be able to use a link such as https://tools.wmflabs.org/citations/random.php and just have the bot find a page where it will make an edit. ( t) Josve05a ( c) 20:59, 19 August 2018 (UTC)
The bot historically logged each page that it visited to a database, and could be run on the page that had been longest without a visit. The database didn't make the migration to ToolForge, but some of the code still exists. Something like what you suggest would be a good step towards the bot running unsupervised again (which had to be discontinued because I didn't have AManWithNoPlan to keep up with bug reports!) Martin ( Smith609 – Talk) 13:56, 21 August 2018 (UTC)
{{ wontfix}} too many other things to do AManWithNoPlan ( talk) 14:49, 8 October 2018 (UTC)
|website=Google Books
and |website=Books.google.es
Regression of
User talk:Citation bot/Archive 10#remove website=Google for books (
t)
Josve05a (
c) 23:06, 29 September 2018 (UTC)
|class=
to cite journal|class=
to cite journal, remove |class=
from cite journal.
https://github.com/ms609/citation-bot/pull/902
AManWithNoPlan (
talk) 15:05, 8 October 2018 (UTC)
|doi=10.1007/978-3-642-75924-6_15#page-1
.|publisher=Springer, Berlin, Heidelberg
|doi=10.1007/978-3-642-75924-6_15
|publisher=Springer
or SpringerLink
|location=Berlin, Heidelberg
https://github.com/ms609/citation-bot/pull/886
AManWithNoPlan (
talk) 04:36, 5 October 2018 (UTC)
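As a rough illustration of the cleanup requested above, here is a Python sketch. The function names and the `KNOWN_PUBLISHERS` set are hypothetical, and this is not the code from the pull request. Only a trailing `#page-N` fragment is removed, because some valid DOIs legitimately end in `#`.

```python
import re
from typing import Tuple

# Hypothetical allow-list: only split values whose first part is a known publisher.
KNOWN_PUBLISHERS = {"Springer"}


def clean_doi(doi: str) -> str:
    """Drop a trailing '#page-N' fragment copied along with the DOI."""
    return re.sub(r"#page-\d+$", "", doi.strip())


def split_publisher(publisher: str) -> Tuple[str, str]:
    """Split 'Springer, Berlin, Heidelberg' into (publisher, location)."""
    name, sep, rest = publisher.partition(",")
    if sep and name.strip() in KNOWN_PUBLISHERS:
        return name.strip(), rest.strip()
    # Anything else is returned unchanged, with no location.
    return publisher.strip(), ""
```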
https://github.com/ms609/citation-bot/pull/885 AManWithNoPlan ( talk) 04:36, 5 October 2018 (UTC)
{{ fixed}} AManWithNoPlan ( talk) 14:31, 8 October 2018 (UTC)
|date=Invalid date
{{
fixed}}
AManWithNoPlan (
talk) 14:30, 8 October 2018 (UTC)
This is very annoying.
Headbomb {
t ·
c ·
p ·
b} 06:13, 5 October 2018 (UTC)
-- AManWithNoPlan ( talk) 02:08, 7 October 2018 (UTC)
That website uses invalid SSL certs, so the bounces get stopped by the HTTPS libraries. I really do not want to turn that off.
AManWithNoPlan (
talk) 04:21, 8 October 2018 (UTC)
|jstor=j.ctt6wp6td.10
[
{
"itemType": "bookSection",
"notes": [],
"tags": [],
"title": "Underground Visions:: Strategies of Resistance along the Amsterdam Metro Lines",
"abstractNote": "The association between Amsterdam and the underground is rather ambiguous to say the least. On the one hand, the Netherlands, and Amsterdam in particular, are proud to present themselves as hospitable vis-à-vis alternative ‘underground’ cultures – a legacy from the 1960s and 1970s when feminist, gay, hippy, student, and squat movements were dominating the social and cultural scenes. The global tourist reputation of Amsterdam as the capital of sex, drugs, and rock ’n’ roll has largely been built on the legacy of a 1960s underground culture that was leftist and avant-garde. At the same time, however, the other notion of",
"publisher": "Amsterdam University Press",
"ISBN": [
"9789089645050"
],
"pages": "77–96",
"bookTitle": "Paris-Amsterdam Underground",
"series": "Essays on Cultural Resistance, Subversion, and Diversion",
"url": "http://www.jstor.org/stable/j.ctt6wp6td.10",
"date": "2013",
"libraryCatalog": "JSTOR",
"accessDate": "2018-09-09",
"shortTitle": "Underground Visions",
"author": [
[
"Ginette",
"Verstraete"
]
],
"seriesEditor": [
[
"Christoph",
"Lindner"
],
[
"Andrew",
"Hussey"
]
],
"source": [
"Zotero"
]
}
]
{{ fixed}} AManWithNoPlan ( talk) 15:15, 9 October 2018 (UTC)
|hdl=
We will probably add regexes to catch the more common ones. We do the same with PubMed.
AManWithNoPlan (
talk) 22:11, 30 September 2018 (UTC)
<div class="article-title">Exile Drama: The Translation of Ernst Toller's <i>Pastor Hall</i> (1939)</div>
as Exile Drama: The Translation of Ernst Toller's Pastor Hall(1939)
, stripping the space between the </i> and (1939)
Problem with euppublishing.com? (
t)
Josve05a (
c) 22:02, 7 October 2018 (UTC)
10.3366/
, ensure there is a space before any parenthesis. If not, consider adding one. Or something like that? Or, if there is no space, make the bot try and scrape the landing page and see if the HTML there has a space? (
t)
Josve05a (
c) 17:43, 8 October 2018 (UTC)
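The first suggestion above (add the missing space for DOIs under the `10.3366/` prefix) could look roughly like this in Python. The function name is hypothetical and the rule is only a sketch of the idea, not something the bot is known to implement.

```python
import re


def fix_missing_space_before_paren(title: str, doi: str) -> str:
    """For 10.3366/ DOIs, insert a space where a word is glued to '('."""
    if not doi.startswith("10.3366/"):
        return title
    # A word character immediately followed by '(' gets a space in between.
    return re.sub(r"(?<=\w)\(", " (", title)
```

Limiting the rule to one DOI prefix keeps it from touching titles where a glued parenthesis is intentional, such as mathematical notation.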
If expanding from raw JSTOR URL ( http://www.jstor.org/stable/3363372), the bot does not find doi's, while citoid does. Can we somehow call the same resources as Citoid?
"New Lights upon Old Tunes. "The Arethusa"". The Musical Times and Singing Class Circular. 35 (620): 666–668. 1894. JSTOR 3363372.
"New Lights upon Old Tunes. "The Arethusa"". The Musical Times and Singing Class Circular. 35 (620): 666–668. 1894. doi: 10.2307/3363372. JSTOR 3363372. ( t) Josve05a ( c) 21:18, 30 August 2018 (UTC)
{{ notabug}}
Looks like a false positive, but I can't reproduce from the citation alone. Did you get any clue from the bot's output as to what was happening here? Can you reproduce from the page?
Martin (
Smith609 –
Talk) 18:25, 24 August 2018 (UTC)
API gives...
Checking AdsAbs database > AdsAbs search 3476/50000: title:"Music and Connectionism" + Adding bibcode: 1994ASAJ...96.1218T + Adding journal: Acoustical Society of America Journal - Dropping parameter "publisher" + Adding volume: 96 + Adding issue: 2 + Adding pages: 1218 + Adding doi: 10.1121/1.410341
Headbomb { t · c · p · b} 18:30, 24 August 2018 (UTC)
|title=
)
This should apply to all such abbreviations (unspaced or S.H.I.E.L.D. or spaced R. G.), plus a small list of words like "Inc., Ltd."
Headbomb {
t ·
c ·
p ·
b} 14:14, 27 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/703/files AManWithNoPlan ( talk) 00:27, 1 September 2018 (UTC)
That will take a special case for the journal name.
AManWithNoPlan (
talk) 13:28, 30 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/710
AManWithNoPlan (
talk) 02:45, 2 September 2018 (UTC)
(\s*)\|(\s*)volume(\s*)=(\s*)(\d+)\s*\((\d+(-|–|\–|\{\{ndash\}\})?\d*)\)
→ $1|$2volume$3=$4$5$1|$2issue$3=$4$6
However, if |issue=
is already set and ≠ $6, skip
See
[5] for a small sample of what is screwed up. The regex would catch more cases though.
Headbomb {
t ·
c ·
p ·
b} 23:53, 22 August 2018 (UTC)
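For illustration, the same rule applied to already-parsed parameters rather than raw wikitext might look like the Python sketch below. The `split_volume_issue` name and the dict-of-parameters representation are assumptions; the pattern mirrors the volume/issue part of the regex above.

```python
import re

# Volume digits, then an issue (digits, optionally a dash and more digits) in parentheses.
VOLUME_ISSUE = re.compile(r"^(\d+)\s*\((\d+(?:(?:-|–|\{\{ndash\}\})\d*)?)\)$")


def split_volume_issue(params: dict) -> dict:
    """Move an issue embedded in |volume= into |issue=, unless it conflicts."""
    match = VOLUME_ISSUE.match(params.get("volume", "").strip())
    if not match:
        return params
    volume, issue = match.groups()
    existing = params.get("issue", "").strip()
    if existing and existing != issue:
        # |issue= is already set to something else: skip, per the rule above.
        return params
    updated = dict(params)
    updated["volume"] = volume
    updated["issue"] = issue
    return updated
```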
If not set then look for ^([A-Z0-9]+)(\([0-9].\))$
Thus volumes and numbers and capitals. Issues start with numbers AManWithNoPlan ( talk) 03:45, 2 September 2018 (UTC)
|issue=
is already set and = $6, because I'm skipping on "if issue is set", which you presumably could do with citation bot (if ≠ $6, should be skipped, per above).
Headbomb {
t ·
c ·
p ·
b} 11:19, 2 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/713 AManWithNoPlan ( talk) 15:16, 2 September 2018 (UTC)
I had to do this to get the bot to do this. ( t) Josve05a ( c) 12:57, 23 August 2018 (UTC)
{{ fixed}}
|journal=eFoobar
→ |journal=EFoobar
|journal=eFoobar
as a pattern, keep it |journal=eFoobar
Well, eLife has about 375 uses on Wikipedia WP:JCW/E12, and eJournal / e-Journal appear a crap ton. (Note that they will often display as ELife / EJournal / E-Journal due to how JL-Bot presents that information.) So most could probably be handled with an exception for eLife / eJournal / e-Journal. Headbomb { t · c · p · b} 18:36, 24 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/699 AManWithNoPlan ( talk) 18:27, 1 September 2018 (UTC)
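A sketch of the exception discussed above, in Python. `capitalize_journal` is a hypothetical name, and matching "lowercase e, optional hyphen, then a capital" is one possible reading of the eFoobar pattern, not necessarily what the pull request does.

```python
import re

# Lowercase 'e', optional hyphen, then a capital letter: eLife, eJournal, e-Journal.
E_PREFIX = re.compile(r"^e-?[A-Z]")


def capitalize_journal(name: str) -> str:
    """Upper-case the first letter unless the name follows the eFoobar pattern."""
    if E_PREFIX.match(name):
        return name
    return name[:1].upper() + name[1:]
```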
|title=The Serving Soldier » Home
with |title=The Serving Soldier " Home
Unless there is a pair of « »
, we should not assume these are quotation marks, they may in fact be arrows, as here. (
t)
Josve05a (
c) 13:52, 27 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/720
AManWithNoPlan (
talk) 21:58, 3 September 2018 (UTC)
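The "only when paired" rule could be sketched in Python as follows. The function name is hypothetical, and requiring exactly one « followed by one » is a deliberately strict assumption (it ignores the reversed »…« convention some languages use).

```python
def straighten_guillemets(title: str) -> str:
    """Replace « » with straight quotes only when they form one ordered pair."""
    if (
        title.count("«") == 1
        and title.count("»") == 1
        and title.index("«") < title.index("»")
    ):
        return title.replace("«", '"').replace("»", '"')
    # A lone « or » is more likely an arrow or separator: leave it alone.
    return title
```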
Not sure if this is a bug or not, but it feels odd to see an error message, so wanted to confirm what it meant....
* Expand citation: ''Solanum perlongistylum'' and ''S. catilliflorum'', New Endemic Peruvian Species of Solanum, Section Basarthrum, Are Close Relatives of the Domesticated Pepino, ''S. muricatum'' > Extracting information from SICI > Found and used SICI [..> rifydoi] > Checking that DOI 10.3417/1055-3177(2006)16[161:SPASCN]2.0.CO;2 is operational... DOI ok. . Initial authors exist, skipping authorlink in tidy . Initial authors exist, skipping authorlink in tidy . Initial authors exist, skipping authorlink in tidy . Initial authors exist, skipping authorlink in tidy > Checking AdsAbs database > AdsAbs search 4720/50000: doi:"10.3417/1055-3177(2006)16[161:SPASCN]2.0.CO;2" > AdsAbs search 4721/50000: pub:"Novon: A Journal for Botanical Nomenclature" year:2006 issn:1055-3177 volume:"16" page:"161–167" [..> indpmid] > Searching PubMed... - Errors detected in PMID search (SimpleXMLElement Object ( [FieldNotFound] => 161:SPASCN ) ); abandoned. nothing found.
( t) Josve05a ( c) 14:33, 22 August 2018 (UTC)
What is preferable? Filing bug reports and feature requests here, or on GitHub (as issues)? ( t) Josve05a ( c) 14:13, 26 August 2018 (UTC)
I think that it is too big.
AManWithNoPlan (
talk) 20:18, 24 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/664
AManWithNoPlan (
talk) 16:35, 28 August 2018 (UTC)
Maybe live, maybe not: [8] Headbomb { t · c · p · b} 00:45, 2 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/716
https://github.com/ms609/citation-bot/pull/715
AManWithNoPlan (
talk) 00:22, 3 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/717
AManWithNoPlan (
talk) 01:27, 3 September 2018 (UTC)
|journal=
is added, any of its possible duplicates (like |work=
and |website=
) should be removed. That's a CS1 error, and the bot should not produce CS1 errors.
The decision to include the stupid generic |work=
in the citation templates is a bane to bots everywhere.
AManWithNoPlan (
talk) 14:28, 28 August 2018 (UTC)
|work=
for all citation templates? It is a global/generic parameter which works for all citation templates, and |journal=
, |website=
etc. are just synonyms. (
t)
Josve05a (
c) 16:27, 28 August 2018 (UTC)
|work=
is vague and unclear to most people. What's a work? It can be a book title, a conference proceedings title, a journal title, a website, ... |journal=
or |website=
or whatever is clear and cannot be confused.
Headbomb {
t ·
c ·
p ·
b} 16:44, 28 August 2018 (UTC)
{{
cite journal}}
or {{
cite book}}
. It's much easier to use the parameter |work=
. It looks terrible when the bot changes "cite news [...] |work=BBC News
" to |website=BBC News
instead of changing it to {{
cite news}}
and keep work. Now it creates more work for editors, both to change from |website=
to |work=
or |newspaper=
, and to {{
cite news}}
, instead of just to {{
cite news}}
which still results in the same output. (
t)
Josve05a (
c) 17:08, 28 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/719 AManWithNoPlan ( talk) 03:30, 3 September 2018 (UTC)
In
this edit, all the bot does is replace the {{
cite web}} parameter work=
with website=
. The template's documentation says they are aliases. Even if the bot was also doing something useful, these changes clutter up the diff screen for a style that is not in any way preferable. Why is the bot changing these? —
Bilorv
(c)
(talk) 16:51, 5 September 2018 (UTC)
Is it possible to use this on MediaWiki installations that are not part of wikipedia.org? — Omegatron ( talk) 01:49, 7 September 2018 (UTC)
Much, much better. However, it could still be a bit better: When you start, you have
*** Processing page '{2018 FFA Cup preliminary rounds}' : 12:13:01 -------------------------------------------------------------------------- [12:13:02] Processing page '[[2018 FFA Cup preliminary rounds]]' — [[edit]]—[[history]]
This should be simplified to
-------------------------------------------------------------------------- [12:13:02] Processing page '[[2018 FFA Cup preliminary rounds]]' – [[edit]] – [[history]]
This eliminates redundancy, the spaces help, and it uses en dashes. When no changes are required, you have
# No changes required. # # #
This should be simplified to
# No changes required.
When you have a change, you have
# Writing to Peace Pledge Union... Written to [[Peace Pledge Union]]{{POV|date=October 2015}} The '''Peace Pledge Union (PPU)''' is a British [[pacifist]] ...
This should be simplified to (with a line break after "Writing to Peace Pledge Union...")
# Writing to [[Peace Pledge Union]]... {{POV|date=October 2015}} The '''Peace Pledge Union (PPU)''' is a British [[pacifist]] ...
And when you end with
[[history]] / [[last edit]]
This could be much simpler/clearer with
[[diff]]
Headbomb { t · c · p · b} 12:16, 21 August 2018 (UTC)
{{ wontfix}} not going to pollute code with all sorts of "if ( running a category) then " code. AManWithNoPlan ( talk) 21:56, 12 September 2018 (UTC)
Not sure if there is a reason why this isn't done yet, but would it be possible to add JSTOR links in cases where this isn't already added? For example, source 12 in Brachiosaurus has a doi, but I know it is also on JSTOR [11], so shouldn't the bot be able to cross check? FunkMonk ( talk) 00:22, 14 September 2018 (UTC)
{{ wontfix}} Nope. Not searchable. JSTOR disabled that years ago. AManWithNoPlan ( talk)
https://www.jstor.org/openurl?doi=10.1671/0272-4634(2003)023[0344:teovpi]2.0.co;2. {{
cite web}}
: Missing or empty |title=
(
help)
AManWithNoPlan (
talk) 13:19, 14 September 2018 (UTC)
https://support.jstor.org/hc/en-us/articles/115005079047-JSTOR-OpenURL-Linking-
{{ Fixed}}
FYI, I noticed text-weight: bold
, which is not valid CSS. The property is font-weight
. --
Izno (
talk) 22:34, 9 September 2018 (UTC)
|format=Full text
, |format=Accepted manuscript
, |format=Submitted manuscript
are all inappropriate uses of |format=
(not a 'new' parameter); see
documentation.
diff
The bot continues to add |format=
with inappropriate values, in this case Full text
. The purpose of the 'format' parameters is to identify for the reader the file format of the linked source, PDF, XLS, DOC, etc. (see the
documentation).
But the just-as-inappropriate
case 'submittedVersion': $format = 'Submitted manuscript'; break;
case 'acceptedVersion': $format = 'Accepted manuscript'; break;
remain. Headbomb { t · c · p · b} 14:26, 9 August 2018 (UTC)
|format=Full text
, |format=Accepted manuscript
, |format=Submitted manuscript
are appropriate in |format=
ever. Read the template
documentation. The only thing that belongs in |format=
is the electronic file format: PDF, XLS, DOC, MP3, etc.
|preprint-url=
that appends "
preprint" at the end of the template.
Headbomb {
t ·
c ·
p ·
b} 13:50, 11 August 2018 (UTC)
Unless it is a file format, nothing should be added by the bot in |format=
. Think what you want about the existence of such a parameter, but don't misuse it. (
t)
Josve05a (
c) 20:29, 24 August 2018 (UTC)
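A small Python sketch of the rule argued for above: a value is only written to |format= when it is an actual file format. The function name is hypothetical, and the set holds only the four formats named in this discussion; the "etc." is deliberately left unfilled.

```python
from typing import Optional

# Only the formats named in the discussion; a real list would be longer.
FILE_FORMATS = {"PDF", "XLS", "DOC", "MP3"}


def format_for_citation(candidate: str) -> Optional[str]:
    """Return a value suitable for |format=, or None if it is not a file format."""
    value = candidate.strip().upper()
    return value if value in FILE_FORMATS else None
```

Under this rule, "Full text", "Accepted manuscript" and "Submitted manuscript" all yield `None`, so nothing is added.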
Should perhaps have been fixed with
https://github.com/ms609/citation-bot/pull/652 but in
https://en.wikipedia.org/?title=Vallejo_%28ferry%29&diff=prev&oldid=856647186 the bot converted raw Google Book URLs to {{
cite web}}
. (Is this just because I'm using the gadget tool now and not the user script, and that pull fixes are delayed, or is this a new error?) (
t)
Josve05a (
c) 18:17, 26 August 2018 (UTC)
{{ wontfix}} for now. We have some improvements coming. AManWithNoPlan ( talk) 17:08, 20 September 2018 (UTC)
Women's liberation movement in North America Don't even know where to start. Changing isbn numbers from those given in source viewed, changing publishing information or deleting publisher, removing publishing location, all distort and are incompatible with accuracy of citation. As a historian, this bot failed to improve any of the citations and distorted the accuracy of information about source and material. Improving citations is always welcome, but deleting information which identifies sourcing accurately is not worthwhile. SusunW ( talk) 04:16, 8 September 2018 (UTC)
|id=
since en.wikipedia.org has decided to remove publisher and location for all journals.
AManWithNoPlan (
talk) 03:35, 10 September 2018 (UTC)
The real solution is wiki linking the journal name and making an article about the journal. AManWithNoPlan ( talk) 15:47, 10 September 2018 (UTC)
significant attention by the world at large and over a period of time". So it is quite possible to have a journal (likely very specialized) that is well-known and highly respected in a narrow field of experts, and has published articles relevant to some topic, but which has not gained "significant attention by the world at large". WP:V does not require a WP:N source. ♦ J. Johnson (JJ) ( talk) 00:14, 11 September 2018 (UTC)
{{ notabug}} AManWithNoPlan ( talk) 14:04, 20 September 2018 (UTC)
|first=M. M
|first=M. M.
The meta data does not have them. That’s the problem.
AManWithNoPlan (
talk) 22:15, 27 August 2018 (UTC)
|first#=
in a citation you find the pattern (^| )[A-z]\.
, replace (^| )([A-z])( |$)
with $1$2.
in all other |first#=
found in the citation.
|first#=
too. It's a really really widespread problem.
Headbomb {
t ·
c ·
p ·
b} 01:49, 28 August 2018 (UTC)
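The proposal above can be sketched in Python as below; the function name and the list-of-first-names input are assumptions. Two details differ from the patterns as written: `[A-z]` also matches a few punctuation characters that sit between `Z` and `a` in ASCII, so `[A-Za-z]` is used, and a lookahead keeps the space after a mid-name initial instead of consuming it.

```python
import re
from typing import List

HAS_DOTTED_INITIAL = re.compile(r"(?:^| )[A-Za-z]\.")
# A single letter standing alone, followed by a space or the end of the value.
BARE_INITIAL = re.compile(r"(^| )([A-Za-z])(?= |$)")


def normalize_initials(first_names: List[str]) -> List[str]:
    """If any first name uses dotted initials, add the dot to bare initials everywhere."""
    if not any(HAS_DOTTED_INITIAL.search(name) for name in first_names):
        return list(first_names)
    return [BARE_INITIAL.sub(r"\1\2.", name) for name in first_names]
```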
In
https://en.wikipedia.org/?title=Khanate_of_Kazan&diff=prev&oldid=856050147 the bot expands from an URL to a {{
cite journal}}
, which is amazing! However, the bot should mention this in the edit summary somehow. (
t)
Josve05a (
c) 14:38, 22 August 2018 (UTC)
this edit was triggered via
https://tools.wmflabs.org/citations/category.php?edit=toolbar&slow=1&user=Headbomb&cat=Livestock%20stubs
With the edit summary
This should instead be
When you run the page, you are presented with
Activated by Headbomb.
> Expanding 'Ununennium'; will commit edits.
Reading authentication tokens from tools.wmflabs.org.
[00:35:06] Processing page 'Ununennium' — edit—history
...
This would be much clearer/less intimidating if it was something like
Citation bot activated by Headbomb. The bot will automatically make edit(s) if it can.
>Bot logging on tools.wmflabs.org.
[00:35:06] Processing page 'Ununennium' — edit—history
...
Headbomb { t · c · p · b} 00:55, 30 August 2018 (UTC)
|url=
https://www.worldcat.org/oclc/873805659
→ |oclc=873805659
https://github.com/ms609/citation-bot/pull/741
AManWithNoPlan (
talk) 03:02, 9 September 2018 (UTC)
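The conversion shown above is a simple pattern match. A hedged Python sketch (hypothetical function name, not the pull request's code) that only recognises the plain `/oclc/<digits>` URL form:

```python
import re
from typing import Optional

WORLDCAT_OCLC = re.compile(r"^https?://(?:www\.)?worldcat\.org/oclc/(\d+)/?$")


def oclc_from_url(url: str) -> Optional[str]:
    """Return the OCLC number from a WorldCat /oclc/ URL, or None."""
    match = WORLDCAT_OCLC.match(url.strip())
    return match.group(1) if match else None
```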
Running the bot multiple times after each edit on the same page results in new edits being made. All possible edits should be done before saving the article.
( t) Josve05a ( c) 21:33, 26 August 2018 (UTC)
<ref>https://arxiv.org/ftp/arxiv/papers/1312/1312.7288.pdf</ref>
https://arxiv.org/abs/1312.7288
https://github.com/ms609/citation-bot/pull/711
AManWithNoPlan (
talk) 02:54, 2 September 2018 (UTC)
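To illustrate, both URL shapes above can be reduced to the same identifier. This Python sketch uses a hypothetical function name and handles new-style identifiers only; old-style ones such as gr-qc/0103067 are out of scope here.

```python
import re
from typing import Optional

# New-style arXiv identifiers only (YYMM.NNNN or YYMM.NNNNN).
ARXIV_URL = re.compile(
    r"^https?://arxiv\.org/"
    r"(?:abs/|pdf/|ftp/arxiv/papers/\d{4}/)"
    r"(\d{4}\.\d{4,5})(?:v\d+)?(?:\.pdf)?$"
)


def arxiv_id_from_url(url: str) -> Optional[str]:
    """Extract the arXiv identifier from an abs, pdf or ftp URL, or return None."""
    match = ARXIV_URL.match(url.strip())
    return match.group(1) if match else None
```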
I ran into the same issue. Sometimes I run the bot through an article twice because it appears in multiple reference cleanup required sections and I notice that the bot would add a page number with regular hyphen (-), then clean it up later with an en dash(–). Examples are
[14]
[15] and
[16]
[17]. If the intention of the bot is to have en dashes for page numbers, maybe it could do that when adding it so it does not have to make the subsequent edit again. --
AquaDTRS (
talk) 20:07, 6 September 2018 (UTC)
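Normalising the dash at the moment the page range is added would avoid the second edit. A minimal Python sketch, with a hypothetical function name; it only touches plain numeric ranges so that hyphenated page numbers such as "A-12" are left alone.

```python
import re


def normalize_page_range(pages: str) -> str:
    """Turn a numeric range like '96-101' into '96–101'; leave other values as-is."""
    return re.sub(r"^(\d+)\s*-\s*(\d+)$", r"\1–\2", pages.strip())
```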
|work=Google.com
to a {{
cite book}} with |website=Google.com
|website=[Gg]oogle.com
from Google Books
Which is better via= or delete?
AManWithNoPlan (
talk) 22:32, 2 September 2018 (UTC)
|postcript=.
{{
cite journal}}
: Invalid |ref=harv
(
help)CS1 maint: postscript (
link)
|postcript=.
{{
cite journal}}
: Invalid |ref=harv
(
help)
What are your thoughts on removing empty |postcript=
on {{
citation}} also, since it does nothing?
AManWithNoPlan (
talk) 21:35, 8 September 2018 (UTC)
|postcript=
{{
citation}}
: Invalid |ref=harv
(
help)
|postcript=
{{
citation}}
: Invalid |ref=harv
(
help)
In case a DOI does not resolve (i.e. is broken/inactive), check if the DOI has more than one forward-slash. If it does, remove the second and all content after it.
Real example: 10.1111/ruso.12119/full
to 10.1111/ruso.12119
. If it resolves and gives matching metadata, replace the |doi=
field. (
t)
Josve05a (
c) 01:49, 6 September 2018 (UTC)
/full
and remove them, if at the end of a broken DOI. (
t)
Josve05a (
c) 01:50, 6 September 2018 (UTC)
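The trimming rule proposed above might be sketched in Python like this. Both function names are hypothetical, and `resolves` is a caller-supplied callable standing in for whatever check the bot uses (including the "matching metadata" comparison); no real resolver API is assumed.

```python
from typing import Callable, Optional


def trim_doi_candidate(doi: str) -> Optional[str]:
    """Return the DOI cut at its second '/', or None if there is no second slash."""
    parts = doi.split("/")
    if len(parts) < 3:
        return None
    return "/".join(parts[:2])


def repair_doi(doi: str, resolves: Callable[[str], bool]) -> str:
    """Keep a working DOI; otherwise try the trimmed form and use it if it resolves."""
    if resolves(doi):
        return doi
    candidate = trim_doi_candidate(doi)
    if candidate and resolves(candidate):
        return candidate
    return doi
```

Because the trimmed DOI is only accepted when it resolves, valid DOIs that legitimately contain several slashes are never shortened.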
Should we really remove |url=
simply because the URL has a known doi in it? I think if the URL is a PDF file, it is worth keeping since it is linking to the journal article directly (as open source). We don't always remove |url=
when |doi=
is present, only if that specific URL happens to have the DOI in its path. Either we should always delete the URL, or never in my own opinion, but if we should, we shouldn't do so when the URL is a PDF. (
t)
Josve05a (
c) 22:28, 27 August 2018 (UTC)
https://github.com/ms609/citation-bot/pull/704 AManWithNoPlan ( talk) 01:38, 1 September 2018 (UTC)
to
Gotta love bad meta data. The bibcode has a journal parameter.
AManWithNoPlan (
talk) 02:46, 28 August 2018 (UTC)
[numFound] => 1 [start] => 0 [docs] => Array ( [0] => stdClass Object ( [arxiv_class] => Array ( [0] => gr-qc ) [identifier] => Array ( [0] => 2001gcit.conf..195H [1] => 2001gr.qc.....3067H [2] => 2001LNP...562..195H [3] => 10.1007/3-540-40988-2_10 [4] => 2001gcit.conf..195H [5] => gr-qc/0103067 [6] => 10.1007/3-540-40988-2_10 [7] => 2001gr.qc.....3067H ) [year] => 2001 [page] => Array ( [0] => 195 ) [bibcode] => 2001LNP...562..195H [pubdate] => 2001-00-00 [author] => Array ( [0] => Haugan, Mark P. [1] => Lämmerzahl, C. ) [volume] => 562 [doi] => Array ( [0] => 10.1007/3-540-40988-2_10 ) [pub] => Gyros, Clocks, Interferometers ...: Testing Relativistic Gravity in Space [doctype] => inbook [title] => Array ( [0] => Principles of Equivalence: Their Role in Gravitation Physics and Experiments That Test Them ) ) )
In {{
cite web}}, since |work=
is an alias for |website=
which is the template native parameter. Might not be ideal for many things that should actually be {{
cite news}}.
AManWithNoPlan (
talk) 20:22, 5 September 2018 (UTC)
@
AManWithNoPlan:. This is very disruptive. Might suggest disabling the tool until you are able to find and fix the problem, particularly if this is not even a "biggest fish" bug. It should not be converting |work=
-> |website=
at the rate of 1,000s or 10's of thousands. --
Green
C 13:40, 16 September 2018 (UTC)
|doi=
. However, that doi is marked as broken/inactive.
Follow-up from
User_talk:Citation_bot/Archive_9#Broken_dois_and_removal_of_URLs, still not fixed. (
t)
Josve05a (
c) 01:46, 6 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/729
AManWithNoPlan (
talk) 15:10, 6 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/729
AManWithNoPlan (
talk) 15:11, 6 September 2018 (UTC)
Thanks for reporting the issue. I believe this might occur for a number of articles which I ran the bot through, although I won't know which ones until the list of articles with invalid dates gets populated again in the next cycle. Also, I was thinking maybe the bot could include a feature to check for an invalid year before it replaces it, just in case it finds a set of numbers that look like dates elsewhere again. --
AquaDTRS (
talk) 19:38, 6 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/733
AManWithNoPlan (
talk) 23:53, 6 September 2018 (UTC)
! No match for bibcode identifier: 2012PhLB..713...17I; 2014A&A...571A..16P + Adding url: http://www.jstor.org/stable/78071"
https://github.com/ms609/citation-bot/pull/732
AManWithNoPlan (
talk) 20:27, 6 September 2018 (UTC)
|title=The First Destroyer «
to |title=The First Destroyer "
Regression of
User_talk:Citation_bot/Archive_10#Arrows_and_not_always_quotes
|doi=
10.1002/(SICI)1097-0134(20000515)39:3<216::AID-PROT40>3.0.CO;2-#
as inactive/broken, despite it being alive and working
https://github.com/ms609/citation-bot/pull/751
AManWithNoPlan (
talk) 16:24, 11 September 2018 (UTC)
https://en.wikipedia.org/?title=Andragogy&diff=859059054&oldid=859058657
|at=
with non-specific |pages=
|at=pp.425–439, see Table 2 p. 426 for tempering temperatures
to |pages=425–439
https://github.com/ms609/citation-bot/pull/756
AManWithNoPlan (
talk) 20:36, 11 September 2018 (UTC)
�
in parameters instead of the proper Unicode characters ü
, ä
and ö
Sorry, but the data is wrong in CrossRef. We could detect it, but we cannot fix it.
AManWithNoPlan (
talk) 16:19, 11 September 2018 (UTC)
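Detection, as opposed to repair, is straightforward: the bad values contain the Unicode replacement character U+FFFD. A Python sketch with hypothetical names, which simply drops affected values rather than guessing the lost letter:

```python
REPLACEMENT_CHAR = "\ufffd"  # what a decoder emits for bytes it could not decode


def has_replacement_char(value: str) -> bool:
    """True if the value contains U+FFFD, meaning the source data was corrupted."""
    return REPLACEMENT_CHAR in value


def drop_corrupted(metadata: dict) -> dict:
    """Keep only the metadata values that are free of U+FFFD."""
    return {key: value for key, value in metadata.items() if not has_replacement_char(value)}
```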
I think that's fixed in our GitHub development tree.
AManWithNoPlan (
talk) 01:05, 14 September 2018 (UTC)
|title=
.
! CrossRef server error loading headers for DOI 10.1002/(SICI)1520-6394(1998)8:1 <64::AID-DA10>3.0.CO;2-S: HTTP/1.0 400 Bad request DOI ok. ! No CrossRef record found for doi '10.1002/(SICI)1520-6394(1998)8:1 <64::AID-DA10>3.0.CO;2-S'; marking as broken
assuming php memory bug
AManWithNoPlan (
talk) 03:19, 18 September 2018 (UTC)
As a result, the link to the paper is on the proceedings name and not in the title. For instance "Proceedings of ISSTA, Demonstration Track" https://hal.archives-ouvertes.fr/hal-01321615/file/astor.pdf points to "ASTOR: A Program Repair Library for Java" (on /info/en/?search=Automatic_bug_fixing)
This is confusing both for human readers and for search engines.
This is
GIGO.
Headbomb {
t ·
c ·
p ·
b} 13:37, 18 September 2018 (UTC)
|deadurl=no/yes/whatever
when no url is present.
{{
fixed}} removes this when removing url now
AManWithNoPlan (
talk) 14:25, 20 September 2018 (UTC)
Thank you for changing the bug title.
AManWithNoPlan (
talk) 18:24, 9 September 2018 (UTC)
{{ cite web}} should be the exception. Leave that one alone, unless it's converted. e.g. [23]. Headbomb { t · c · p · b} 23:18, 18 September 2018 (UTC)
|access-date=
removal at
User:GreenC_bot/Job_5 that was arrived at by lengthy community input over a 5 month period. --
Green
C 01:48, 19 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/810
AManWithNoPlan (
talk) 23:15, 20 September 2018 (UTC)
|authorlinux=
to |authorlink#=
|pages=8159
to |pages=astro-ph/9508159
Brandenberger, Robert H. (1995). "Formation of Structure in the Universe": 8159.
Bibcode:
1995astro.ph..8159B. {{
cite journal}}
: Cite journal requires |journal=
(
help)
{{cite book |author={{noitalic|{{lang|zh-hans|国务院人口普查办公室、国家统计局人口和社会科技统计司编}}}} |date=2012 |script-title=zh:中国2010年人口普查分县资料 |location=Beijing |publisher={{noitalic|{{lang|zh-hans|中国统计出版社}}}} [China Statistics Press] |page= |isbn=978-7-5037-6659-6 }}
to
{{cite book |author={{noitalic|{{lang|zh-hans|国务院人口普查办公室、国家统计局人口和社会科技统计司编}}}} |title=中国2010年人口普查分县资料 |date=2012 |script-title=zh:中国2010年人口普查分县资料 |location=Beijing |publisher={{noitalic|{{lang|zh-hans|中国统计出版社}}}} [China Statistics Press] |page= |isbn=978-7-5037-6659-6 }}
Making the title 中国2010年人口普查分县资料
appear twice.
|title=
if |script-title=
is the same (or includes the title in its string).
Same with История русского автомата
in
https://en.wikipedia.org/?title=7.62×39mm&oldid=860197219 (
t)
Josve05a (
c) 00:34, 19 September 2018 (UTC)
# # # citation_bot_placeholder_comment 0 # # #
{{cite web |<!--trans--->title = Beij}} unfixable without massive effort
AManWithNoPlan (
talk) 02:49, 22 September 2018 (UTC)
{{cite journal|last1=Kaye|first1=Steven|last2=Fox|first2=Joseph M.|last3=Hicks|first3=Frederick A.|last4=Buchwald|first4=Stephen L.|title=The Use of Catalytic Amounts of CuCl and Other Improvements in the Benzyne Route to Biphenyl-Based Phosphine Ligands|journal=Advanced Synthesis & Catalysis|date=31 December 2001|volume=343|issue=8|pages=789–794|doi=10.1002/1615-4169(20011231)343:83.0.CO;2-A|url=http://onlinelibrary.wiley.com/doi/10.1002/1615-4169(20011231)343:8%3C789::AID-ADSC789%3E3.0.CO;2-A/full|language=en|issn=1615-4169|doi-broken-date=2017-04-22}}
to
{{cite journal|last1=Kaye|first1=Steven|last2=Fox|first2=Joseph M.|last3=Hicks|first3=Frederick A.|last4=Buchwald|first4=Stephen L.|title=The Use of Catalytic Amounts of CuCl and Other Improvements in the Benzyne Route to Biphenyl-Based Phosphine Ligands|journal=Advanced Synthesis & Catalysis|date=31 December 2001|volume=343|issue=8|pages=789–794|doi=10.1002/1615-4169(20011231)343:83.0.CO;2-A|language=en|issn=1615-4169|doi-broken-date=2018-09-21}}
Perhaps even replace it with a new one, as in this case the DOI was missing a character
AManWithNoPlan (
talk) 21:06, 21 September 2018 (UTC)
do not drop urls that point to .pdf even if they have doi AManWithNoPlan ( talk) 03:14, 23 September 2018 (UTC)
I don't know if this is the case (pretty sure it isn't), but the bot should convert
|publication-date=
→ |date=
|publication-place=
→ |location=
If |location=
/ |date=
aren't set / are empty
Headbomb {
t ·
c ·
p ·
b} 13:48, 12 August 2018 (UTC)
Only other one I can think of is
|orig-year=
/|origyear=
→ |year=
Headbomb { t · c · p · b} 16:19, 12 August 2018 (UTC)
|orig-year=
should be converted to |year=
only if (a) |year=
is empty and (b) |orig-year=
contains only a valid four-digit year. Both must be true. If |orig-year=
contains additional text, it should not be moved to |year=
; that will cause an error message to appear. –
Jonesey95 (
talk) 17:45, 12 August 2018 (UTC)
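The two conditions above can be sketched as a small check. This is only an illustration in Python, not the bot's actual code: the function name `promote_orig_year` and the dict-of-parameters representation are assumptions, and "valid four-digit year" is approximated here as exactly four digits.

```python
import re


def promote_orig_year(params: dict) -> dict:
    """Return a copy of params with orig-year moved to year, but only when safe.

    Hypothetical helper: params maps parameter names to their raw string values.
    """
    result = dict(params)
    # Condition (a): |year= must be empty or absent.
    if result.get("year", "").strip():
        return result
    for key in ("orig-year", "origyear"):
        value = result.get(key, "").strip()
        # Condition (b): the value must be four digits and nothing else.
        if re.fullmatch(r"\d{4}", value):
            result["year"] = value
            del result[key]
            break
    return result
```

Anything with extra text, such as "First published 1894", is left where it is, so no date error is introduced.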
|website=wayback.archive.org
, |publisher=wayback.archive.org
etc.|website=archive.org
, |publisher=archive.org
etc. if the (main) url has another domain than archive.org.
https://github.com/ms609/citation-bot/pull/822
AManWithNoPlan (
talk) 15:49, 25 September 2018 (UTC)
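A rough sketch of the request above, in Python rather than the bot's own language. The helper name `drop_archive_org_labels` and the dict representation are assumptions; only the two host names mentioned in the report are treated as archive labels.

```python
from urllib.parse import urlparse

# Values named in the report; the "etc." there is not expanded here.
ARCHIVE_LABELS = {"archive.org", "wayback.archive.org"}


def drop_archive_org_labels(params: dict) -> dict:
    """Remove website/publisher values naming archive.org when url points elsewhere."""
    result = dict(params)
    host = (urlparse(result.get("url", "")).hostname or "").lower()
    # Leave the citation alone if there is no url or the url really is on archive.org.
    if not host or host == "archive.org" or host.endswith(".archive.org"):
        return result
    for key in ("website", "publisher"):
        if result.get(key, "").strip().lower() in ARCHIVE_LABELS:
            del result[key]
    return result
```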
https://en.wikipedia.org/?title=CKMT1B&diff=858892445&oldid=858892287
https://en.wikipedia.org/?title=%CA%BBOumuamua&diff=prev&oldid=861195731
|publisher=Amazon.com
if removing Amazon URL in favor of ISBN.
https://github.com/ms609/citation-bot/pull/823
AManWithNoPlan (
talk) 15:54, 25 September 2018 (UTC)
|vauthors=
and the like for name-formatting or the use of CS1 versus CS2).|page=
and |pages=
are aliases.
|page=13-25
should be converted to |pages=13–25
|page=
/|pages=
/|at=
is set, remove the others (if they are empty / redundant)|year=
and |date=
are aliases.
|date=2008
should be converted to |year=2008
|year=
/|date=
is set, remove the other (if it is empty)|via=
online makes sense if a URL is provided, so remove it if there is no url provided.
So a citation like
{{cite journal |last=Smith |first=John |date=2007 |year= |title=Foobar |journal=Barfoo Journal |volume=3 |issue=4 |page=34-44 |pages= |via=}}
cleans up to
{{cite journal |last=Smith |first=John |year=2007 |title=Foobar |journal=Barfoo Journal |volume=3 |issue=4 |pages=34–44}}
Headbomb { t · c · p · b} 17:01, 25 September 2018 (UTC)
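For illustration, the uncontested parts of this cleanup (dropping empty parameters, dropping |via= without a URL, and turning a hyphenated |page= range into |pages= with an en dash) might look like the Python sketch below. The name `tidy_citation` and the dict representation are assumptions, and the |date= to |year= conversion is deliberately left out. A value such as 3-4 could also be a single hyphenated page number, so a real implementation would need more care.

```python
import re

# Two page numbers joined by a hyphen or an en dash.
PAGE_RANGE = re.compile(r"(\d+)\s*[-–]\s*(\d+)")


def tidy_citation(params: dict) -> dict:
    """Drop empty parameters and normalise a page range (hypothetical helper)."""
    # Step 1: remove parameters whose value is empty or whitespace only.
    cleaned = {name: value.strip() for name, value in params.items() if value.strip()}
    # Step 2: |via= only makes sense alongside a URL.
    if "url" not in cleaned:
        cleaned.pop("via", None)
    # Step 3: a range in |page= becomes |pages= with an en dash.
    result = {}
    for name, value in cleaned.items():
        match = PAGE_RANGE.fullmatch(value)
        if name == "page" and match and "pages" not in cleaned:
            name, value = "pages", f"{match.group(1)}–{match.group(2)}"
        result[name] = value
    return result
```

Run on the parameters of the example citation above, this keeps |date=2007 as it is and yields |pages=34–44.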
|date=
. Otherwise these are reasonable suggestions. --
Izno (
talk) 17:03, 25 September 2018 (UTC)
|date=
is not a true alias of |year=
– true aliases cause the 'more than one of param and param' error message as here with |work=
and |journal=
:
{{cite journal |title=Title |work=Work |journal=Journal}}
→ "Title". Journal. {{
cite journal}}
: More than one of |work=
and |journal=
specified (
help)|date=
and |year=
are required (or desired), they cannot be aliases. I agree with Editor Izno that |date=
should be preferred over |year=
when both are not required.|date=
and |year=
are required (or desired)" what would those occasions be? Having |date=2008-04-26
and |year=2008
just presents redundant information.
Headbomb {
t ·
c ·
p ·
b} 18:39, 25 September 2018 (UTC)
https://github.com/ms609/citation-bot/pull/829 AManWithNoPlan ( talk) 23:37, 26 September 2018 (UTC)
|year=
field when set to "n.d." thus causing there to be a cite date error|year=
in this case.
Add tests and soon code.
https://github.com/ms609/citation-bot/pull/825
AManWithNoPlan (
talk) 18:27, 26 September 2018 (UTC)
Unless the entire journal name is wiki-linked, the data is almost always wrong. Secondly, partial links corrupt the COINS data and should not be done that way.
AManWithNoPlan (
talk) 20:02, 27 September 2018 (UTC)
{{
fixed}}
AManWithNoPlan (
talk) 21:02, 27 September 2018 (UTC)
|date=
field causing a cite date error to occur such as "| date= 2011-05-10T06:34:00-0400"
{{
fixed}}
AManWithNoPlan (
talk) 21:01, 27 September 2018 (UTC)
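As an illustration of the kind of check involved (not the actual fix), a full timestamp can be reduced to its date part before it goes into |date=. This Python sketch assumes a hypothetical helper `date_from_timestamp` and only recognises the YYYY-MM-DDThh:mm[:ss][offset] shape shown in the report.

```python
import re
from datetime import date

# Date, a literal T, a time, and an optional Z or numeric UTC offset.
_TIMESTAMP = re.compile(
    r"(\d{4}-\d{2}-\d{2})T\d{2}:\d{2}(?::\d{2})?(?:Z|[+-]\d{2}:?\d{2})?"
)


def date_from_timestamp(value: str) -> str:
    """Return only the YYYY-MM-DD part of a timestamp; other values pass through."""
    match = _TIMESTAMP.fullmatch(value.strip())
    if not match:
        return value
    day = match.group(1)
    try:
        # Reject impossible dates such as month 13.
        date.fromisoformat(day)
    except ValueError:
        return value
    return day
```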
For example
<ref>{{cite web |url=https://www.sciencedirect.com/science/article/pii/S0024379512004405 |title=Geometry of the Welch bounds}}</ref>
or
<ref>https://www.sciencedirect.com/science/article/pii/S0024379512004405</ref>
Those URLs are extremely common and if they can be parsed (similar to DOI urls), that would be fantastic. And then they could be removed since they'll be redundant with DOIs. Headbomb { t · c · p · b} 22:52, 29 August 2018 (UTC)
We would need to grab the HTML, parse it as XML, and read <meta name="citation_doi" content="10.1016/j.laa.2012.05.036" />. Or we could use: https://api.elsevier.com/content/object/pii/S0024379512004405 AManWithNoPlan ( talk) 14:31, 30 August 2018 (UTC)
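A minimal sketch of the meta-tag approach mentioned above, written in Python with only the standard library. Fetching the page is left out, and the names `CitationDoiParser` and `extract_citation_doi` are made up for this example. It only shows how the citation_doi value could be pulled out of HTML that has already been downloaded.

```python
from html.parser import HTMLParser


class CitationDoiParser(HTMLParser):
    """Remember the content of the first <meta name="citation_doi"> tag seen."""

    def __init__(self):
        super().__init__()
        self.doi = None

    def handle_starttag(self, tag, attrs):
        # Self-closing <meta ... /> tags are routed here as well by HTMLParser.
        if tag != "meta" or self.doi is not None:
            return
        attributes = dict(attrs)
        if attributes.get("name") == "citation_doi":
            self.doi = attributes.get("content")


def extract_citation_doi(html_text: str):
    """Return the DOI from the page's meta tags, or None if there is none."""
    parser = CitationDoiParser()
    parser.feed(html_text)
    return parser.doi
```

Using an HTML parser rather than a strict XML parser avoids failing on pages that are not well-formed XML.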
I'd like the option to have a 'rebuild references' when they are so crappy we need to TNT them (for whatever reason), and start anew. Two options would be present
This would present things in a 'standardized' parameter order with 'standardized' whitespace
multiline | inline |
---|---|
<ref> {{cite arXiv |last1= |first1= |last2= |first2= |... |date= or |year= |title= |arxiv=<import> |class= }}</ref> |
<ref>{{cite arXiv |last1= |first1= |last2= |first2= |... |date= or |year= |title= |arxiv=<import> |class=}}</ref> |
multiline | inline |
---|---|
{{cite book |last1= |first1= |last2= |first2= |... |date= or |year= |chapter= |chapter-url=<import> |chapter-url-access=<import> |editor1-last= |editor1-first= |editor2-last= |editor2-first= |title= |trans-title=<import> |language=<import non-English> |script-title=<import> |url=<import> |url-access=<import> |access-date=<import> |format=<import, if valid> |archive-url=<import> |archive-date=<import> |dead-url=<import> |series= |volume= |pages= (or |page=) |location= |publisher= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import> }} |
{{cite book |last1= |first1= |last2= |first2= |... |date= or |year= |chapter= |chapter-url=<import> |chapter-url-access=<import> |editor1-last= |editor1-first= |editor2-last= |editor2-first= |title= |trans-title=<import> |language=<import non-English> |script-title=<import> |url=<import> |url-access=<import> |access-date=<import> |format=<import, if valid> |archive-url=<import> |archive-date=<import> |dead-url=<import> |series= |volume= |pages= (or |page=) |location= |publisher= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import>}} |
multiline | inline |
---|---|
{{cite journal |last1= |first1= |last2= |first2= |... |date= or |year= |title= |trans-title=<import> |language=<import non-English> |script-title=<import> |url=<import> |url-access=<import> |access-date=<import> |format=<import, if valid> |archive-url=<import> |archive-date=<import> |dead-url=<import> |journal= |series= |volume= |issue= |pages= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import> }} |
{{cite journal |last1= |first1= |last2= |first2= |... |date= or |year= |title= |trans-title=<import> |language=<import non-English> |script-title=<import> |url=<import> |url-access=<import> |access-date=<import> |format=<import, if valid> |archive-url=<import> |archive-date=<import> |dead-url=<import> |journal= |series= |volume= |issue= |pages= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import>}} |
multiline | inline |
---|---|
{{cite web |last1= |first1= |last2= |first2= |... |date= or |year= |editor1-last= |editor1-first= |editor2-last= |editor2-first= |title= |url=<import> |website= |series= |volume= |pages= |location= |publisher= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import> }} |
{{cite web |last1= |first1= |last2= |first2= |... |date= or |year= |editor1-last= |editor1-first= |editor2-last= |editor2-first= |title= |url=<import> |website= |series= |volume= |pages= |location= |publisher= |type= |arxiv=<import> |asin=<import> |asin-tld=<import> |bibcode=<import> |bibcode-access=<import> |biorxiv=<import> |citeseerx=<import> |doi=<import> |doi-access=<import> |doi-brokendate= |hdl=<import> |hdl-access=<import> |isbn=<import> |ismn=<import> |issn=<import> |jfm=<import> |jstor=<import> |jstor-access=<import> |lccn=<import> |mr=<import> |oclc=<import> |ol=<import> |ol-access=<import> |osti=<import> |osti-access=<import> |pmc=<import> |embargo=<import> |pmid=<import> |rfc=<import> |ssrn=<import> |zbl=<import> |id=<import> |quote=<import> |ref=<import>}} |
Whatever is marked <import>
would be carried over from the old citation, with URLs/Identifiers used to rebuild the rest of the citation. The rest would be present (if the bot can/would fill them), or omitted (if the bot can't/wouldn't fill them).
Headbomb {
t ·
c ·
p ·
b} 17:59, 1 September 2018 (UTC)
See also bioRxiv at https://en.wikipedia.org/?title=Homo_sapiens&diff=prev&oldid=861469112
Fixed in
GitHub Pull 855
AManWithNoPlan (
talk) 21:37, 27 September 2018 (UTC)
?edit=toolbar&slow=1&zotero=0
(
t)
Josve05a (
c) 21:42, 27 September 2018 (UTC)
|date=scheme=dcterms.ISO8601; 2013-10-23
https://github.com/ms609/citation-bot/pull/858
AManWithNoPlan (
talk) 16:08, 28 September 2018 (UTC)
What do you suggest? The template supports two different title parameters and then shows them both. We added code that prevents duplicates, but in these cases the script title and title are very different (or maybe one is basically printed and the other is a cursive styling of the same words). Perhaps:
if (has script-title and new title is not all western characters) then ignore new title else add title end if
AManWithNoPlan ( talk) 16:12, 28 September 2018 (UTC)
|trans-title=
.if (has script-title) then ignore new title else add title end if
( t) Josve05a ( c) 16:18, 28 September 2018 (UTC)
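The first of the two rules above (ignore the new title only when |script-title= is set and the new title is not all western characters) could be sketched as follows in Python. "Western characters" is interpreted here as "every letter is in the Latin script", which is an assumption, as are the function names.

```python
import unicodedata


def is_western(text: str) -> bool:
    """True if every alphabetic character in text belongs to the Latin script."""
    for ch in text:
        if ch.isalpha() and not unicodedata.name(ch, "").startswith("LATIN"):
            return False
    return True


def should_add_title(params: dict, new_title: str) -> bool:
    """Decide whether a newly found title should be added as |title=."""
    has_script_title = bool(params.get("script-title", "").strip())
    # Skip the new title when it would just repeat a non-Latin script-title.
    if has_script_title and not is_western(new_title):
        return False
    return True
```

The simpler second rule would drop the `is_western` test and return False whenever |script-title= is set.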
Activating the bot sends it into an endless loop of doing absolutely nothing. Can't really explain more save it just fails to run properly on any page you try to run it on. Headbomb { t · c · p · b} 14:39, 29 September 2018 (UTC)
{{ fixed}} AManWithNoPlan ( talk) 20:36, 29 September 2018 (UTC)
Why???? What do you expect the bot to do?
AManWithNoPlan (
talk) 22:48, 3 October 2018 (UTC)
{{cite LSA|url=https://www.ncbi.nlm.nih.gov/pubmed/4043876|year=1985|title=Kondous laventicus, a new ceboid primate from the Miocene of the La Venta, Colombia, South America}}
{{Cite LSA|last=Setoguchi|first=T.|date=1985|title=Kondous laventicus, a new ceboid primate from the Miocene of the La Venta, Colombia, South America|url=https://www.ncbi.nlm.nih.gov/pubmed/4043876|journal=Folia Primatologica; International Journal of Primatology|volume=44|pages=96–101|year=1985}}
http://dare.uva.nl/personal/pure/en/publications/functional-reconstruction-of-structurally-complex-epitopes-using-clips-technology(ce45bb5a-7823-4872-a0b1-e5e5a99a79e5).html |type=Submitted manuscrip
|url-access=free
Or, actually, in this case it should have added |hdl=11245/1.309707
instead of the URL, but in general when adding a free URL, it should add |url-access=free
. (
t)
Josve05a (
c) 22:42, 3 October 2018 (UTC)
|url-access=free
is not supported by cs1|2 because values in |url=
are presumed to be free-to-read.
http://dare.uva.nl/personal/pure/en/publications/functional-reconstruction-of-structurally-complex-epitopes-using-clips-technology(ce45bb5a-7823-4872-a0b1-e5e5a99a79e5).html |type=Submitted manuscrip
|hdl=11245/1.309707
instead
{{
wontfix}} the meta data is poor quality
AManWithNoPlan (
talk) 03:05, 4 October 2018 (UTC)
|url=
https://www.ncbi.nlm.nih.gov/books/NBK24662/
, nothing happens
The treasure trove of URL readers used by Citoid do actually parse this page:
https://github.com/zotero/translators
AManWithNoPlan (
talk) 18:37, 21 August 2018 (UTC)
<div class="publication-meta-secondary">DOI: 10.1136/jnnp.2008.144360 [...]
{{ wontfix}} they block us and anything that looks like scraping. AManWithNoPlan ( talk) 02:01, 5 October 2018 (UTC)
Those were not bare urls though. Headbomb { t · c · p · b} 13:43, 4 October 2018 (UTC)
This is a GIGO problem, since the info exists in the headers of the web page(s) in question, but it would be great if the tool could ignore this junk instead of inserting it. See the archives of
User talk:Zhaofeng Li/reFill, another tool that editors have been using to semi-automatically insert this junk for years. Gnomes remove it manually if tool-using editors fail to see it in Preview. –
Jonesey95 (
talk) 15:02, 29 September 2018 (UTC)
|journal=Ilr Review
|journal=ILR Review
https://github.com/ms609/citation-bot/pull/874
AManWithNoPlan (
talk) 03:07, 4 October 2018 (UTC)
https://github.com/ms609/citation-bot/pull/877
AManWithNoPlan (
talk) 03:04, 4 October 2018 (UTC)
Rasmussen, D. T. (2002). "The origin of Primates". In Hartwig, W. C. (ed.). The Primate Fossil Record. Cambridge: Cambridge University Press. pp. 5–9.
Bibcode:
2002prfr.book.....H. {{
cite book}}
: |journal=
ignored (
help)
Newer code avoids those book bibcodes
AManWithNoPlan (
talk) 03:08, 5 October 2018 (UTC)
I'm currently using https://tools.wmflabs.org/citations/category.php?cat=1980_births&slow=1 which makes one edit and then stops (this seems to be a bug from above discussions). I like to make the bot run on random pages and then stop when it has made an edit, I, however, don't want to specify a category. I'd love to be able to use a link such as https://tools.wmflabs.org/citations/random.php and just have the bot find a page where it will make an edit. ( t) Josve05a ( c) 20:59, 19 August 2018 (UTC)
The bot historically logged each page that it visited to a database, and could be run on the page that had been longest without a visit. The database didn't make the migration to ToolForge, but some of the code still exists. Something like what you suggest would be a good step towards the bot running unsupervised again (which had to be discontinued because I didn't have AManWithNoPlan to keep up with bug reports!) Martin ( Smith609 – Talk) 13:56, 21 August 2018 (UTC)
{{ wontfix}} too many other things to do AManWithNoPlan ( talk) 14:49, 8 October 2018 (UTC)
|website=Google Books
and |website=Books.google.es
Regression of
User talk:Citation bot/Archive 10#remove website=Google for books (
t)
Josve05a (
c) 23:06, 29 September 2018 (UTC)
|class=
to cite journal|class=
to cite journal, remove |class=
from cite journal.
https://github.com/ms609/citation-bot/pull/902
AManWithNoPlan (
talk) 15:05, 8 October 2018 (UTC)
|doi=10.1007/978-3-642-75924-6_15#page-1
.|publisher=Springer, Berlin, Heidelberg
|doi=10.1007/978-3-642-75924-6_15
|publisher=Springer
or SpringerLink
|location=Berlin, Heidelberg
https://github.com/ms609/citation-bot/pull/886
AManWithNoPlan (
talk) 04:36, 5 October 2018 (UTC)
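A rough illustration of the two cleanups requested above, in Python. This is not the code from the pull request; `clean_springer_fields` is a hypothetical name. Only a trailing "#page-N" is removed from the DOI, since a bare "#" can be a legitimate DOI character (see the 3.0.CO;2-# example earlier on this page).

```python
import re


def clean_springer_fields(params: dict) -> dict:
    """Strip a '#page-N' suffix from doi and split 'Springer, <place>' publishers."""
    result = dict(params)
    if "doi" in result:
        # Remove only the page anchor, never a '#' that is part of the DOI itself.
        result["doi"] = re.sub(r"#page-\d+$", "", result["doi"].strip())
    publisher = result.get("publisher", "").strip()
    # Move the place names to |location= unless a location is already given.
    if publisher.startswith("Springer, ") and not result.get("location", "").strip():
        name, place = publisher.split(", ", 1)
        result["publisher"] = name
        result["location"] = place
    return result
```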
https://github.com/ms609/citation-bot/pull/885 AManWithNoPlan ( talk) 04:36, 5 October 2018 (UTC)
{{ fixed}} AManWithNoPlan ( talk) 14:31, 8 October 2018 (UTC)
|date=Invalid date
{{
fixed}}
AManWithNoPlan (
talk) 14:30, 8 October 2018 (UTC)
This is very annoying.
Headbomb {
t ·
c ·
p ·
b} 06:13, 5 October 2018 (UTC)
-- AManWithNoPlan ( talk) 02:08, 7 October 2018 (UTC)
That website uses invalid SSL certs, and so the bounces get stopped by HTTPS libraries. I really do not want to turn that off.
AManWithNoPlan (
talk) 04:21, 8 October 2018 (UTC)
|jstor=j.ctt6wp6td.10
[
{
"itemType": "bookSection",
"notes": [],
"tags": [],
"title": "Underground Visions:: Strategies of Resistance along the Amsterdam Metro Lines",
"abstractNote": "The association between Amsterdam and the underground is rather ambiguous to say the least. On the one hand, the Netherlands, and Amsterdam in particular, are proud to present themselves as hospitable vis-à-vis alternative ‘underground’ cultures – a legacy from the 1960s and 1970s when feminist, gay, hippy, student, and squat movements were dominating the social and cultural scenes. The global tourist reputation of Amsterdam as the capital of sex, drugs, and rock ’n’ roll has largely been built on the legacy of a 1960s underground culture that was leftist and avant-garde. At the same time, however, the other notion of",
"publisher": "Amsterdam University Press",
"ISBN": [
"9789089645050"
],
"pages": "77–96",
"bookTitle": "Paris-Amsterdam Underground",
"series": "Essays on Cultural Resistance, Subversion, and Diversion",
"url": "http://www.jstor.org/stable/j.ctt6wp6td.10",
"date": "2013",
"libraryCatalog": "JSTOR",
"accessDate": "2018-09-09",
"shortTitle": "Underground Visions",
"author": [
[
"Ginette",
"Verstraete"
]
],
"seriesEditor": [
[
"Christoph",
"Lindner"
],
[
"Andrew",
"Hussey"
]
],
"source": [
"Zotero"
]
}
]
{{ fixed}} AManWithNoPlan ( talk) 15:15, 9 October 2018 (UTC)
|hdl=
we will probably add regex's to catch the more common ones. We do the same with pubmed.
AManWithNoPlan (
talk) 22:11, 30 September 2018 (UTC)
<div class="article-title">Exile Drama: The Translation of Ernst Toller's <i>Pastor Hall</i> (1939)</div>
as Exile Drama: The Translation of Ernst Toller's Pastor Hall(1939)
, stripping the space between the </i> (1939)
Problem with euppublishing.com? (
t)
Josve05a (
c) 22:02, 7 October 2018 (UTC)
10.3366/
, ensure there is a space before any parenthesis. If not, consider adding one. Or something like that? Or, if there is no space, make the bot try and scrape the landing page and see if the HTML there has a space? (
t)
Josve05a (
c) 17:43, 8 October 2018 (UTC)