From Wikipedia, the free encyclopedia

Please leave me a message below:

Your message here!

FYI: Wikipedia talk:De Gruyter and de:Wikipedia: De Gruyter Greets -- Informationswiedergutmachung ( talk) 23:40, 15 October 2014 (UTC) reply

Hey thanks for the heads up. Saehrimnir is coordinating there and it looks like signups are going well. They should be sent out soon, hopefully by New Year's. Best, Jake Ocaasi (WMF) ( talk) 21:20, 23 December 2014 (UTC) reply

Partnership with French paid journal sources

Good evening,

In France, there are ongoing experiments by journalists to free themselves of the advertisement business and to run paid pure player journals.

Mediapart uses this model since 2008 and has recently been joined by a project started on Kickstarter, by experimented French journalists from serious background, fr:Hexagones. The first has proven in several French affairs to be a valuable source. The last wants to provide a source more oriented to quality investigation than to buzz news.

I would like to approach these two medias to offer a Library partnership.

For that, I would first like to know if you have a grid of the arguments (better content and verifiability for Wikipedia, more external links and usage increase for the partner) to use for such approach.

Secondly, I would also like to know if I got a positive answer, if web newspapers enters in the scope of the Library project, or if only archives of web journal enters in scope. -- Dereckson ( talk) 22:44, 19 October 2014 (UTC) reply

Hi Dereckson! We've just updated our partner promotional messaging. You can view/use/copy/adapt it here: [1]. Any reliable sources are within the scope of the project, definitely including web newspapers. Thanks so much for inquiring and let me know if you have any questions or progress or success! Cheers, Jake Ocaasi (WMF) ( talk) 21:20, 23 December 2014 (UTC) reply

De Gruyter data protection

Hi can u answer us on German WP what are exactly the data protection conditions. Thx.-- Sanandros ( talk) 05:44, 21 October 2014 (UTC) reply

Thanks, I've replied there. Ocaasi (WMF) ( talk) 21:20, 23 December 2014 (UTC) reply

Wikipedia Library

Hi, I notice that Research coordinators are needed for the Wikipedia library and I would love to help by becoming one. I have access to Questia so Ibelieve I could do the job. Thanks. Uhlan talk 00:26, 18 November 2014 (UTC) reply

Hi Uhlan. Thanks so much for your offer. I'm excited to have you on as a new Research Coordinator. I'm excited to be working with you! Jake Ocaasi (WMF) ( talk) 21:20, 23 December 2014 (UTC) reply


A barnstar for you and your team!

The Special Barnstar
Your success in creating partnerships with publishers of high quality content is appreciated. Your work is essential! Happy Christmas JimRenge ( talk) 21:18, 23 December 2014 (UTC) reply
That is so kind User:JimRenge! It indeed takes a team and I want to share this wholeheartedly with User:Astinson (WMF), User:Nikkimaria, and User:PEarley (WMF). Best wishes from all of us this holiday. I hope you enjoy it. Best, Jake Ocaasi (WMF) ( talk) 21:22, 23 December 2014 (UTC) reply

London Times archival access

I don't know if anyone's approached Gale about access to their archive of the London Times, but I would really like access as it's a very good reference to British military and naval promotions and appointments. I believe that most editors in the UK can get access through their public libraries, but that leaves most non-academic users outside the UK in the lurch. I believe that the New York Public Library has a subscription so that might be another avenue to be explored.-- Sturmvogel 66 ( talk) 18:11, 13 January 2015 (UTC) reply

Thanks for the suggestion Sturmvogel. We have unsuccessfully tried to get LT before, but I will add it to our ongoing outreach list. We are also always tracking suggestions at Wikipedia:The_Wikipedia_Library/Journals/Requests in case other ideas cross your mind. Cheers! Jake Ocaasi (WMF) ( talk) 00:35, 15 January 2015 (UTC) reply

Partner/Outreach Coordinators

Hi Jake - I see that the Wikipedia Library needs Partner and Outreach coordinators. What do those jobs entail? - Thanks; LeoRomero ( talk) 19:00, 21 February 2015 (UTC) reply

Hi LeoRomero ( talk · contribs)! Thanks so much for expressing interest.
  • Partner coordinators help out with contacting potential donors--publishers, databases, university presses, journals, etc. We have pretty refined brochures and packaging about our programs but it still takes time to make contact and see those initial talks through to a donation. We are ready to train someone comfortable with professional, real-world outreach skills so we can broaden the number and scope of our partnerships.
  • Outreach coordinators help us connect with and communicate to the external world of institutions, organizations, groups, and events. This involves micro-tasks like reposting relevant twitter content from @WikiLibrary and on Facebook, medium tasks discovering stories from among journal access recipients for the Wikimedia Blog or working on the TWL Books & Bytes newsletter, high-impact conference coordination such as submitting proposals for major events where TWL would present, and advanced tasks such as writing long-form opinion pieces about the role of Wikipedia in a changing landscape of digital reference.
For both positions we would be ready to gradually introduce you to increasingly relevant and challenging projects so you could grow into them. We are learning as you are learning about how to scale our work and would be excited to experiment with you in a role that felt well-scoped and interesting for your background and leanings. Let me know what you think! Please feel free to email me at: jorlowitz@wikimedia.org if you'd like to chat off-wiki. Cheers! Ocaasi (WMF) ( talk) 18:25, 2 March 2015 (UTC) reply
Thanks Jake, email's on the way! LeoRomero ( talk) 19:36, 2 March 2015 (UTC) reply

ProQuest Congressional

Hi, Ocaasi. Wehwalt suggested that I leave a message here on your talk page. I was wondering if there is some way for me to gain access to the website ProQuest Congressional. Most of my work on Wikipedia involves coins and coin legislation, which is covered in-depth on ProQuest Congressional, including laws, proposed laws and Congressional hearings. Currently, Wehwalt has access to the website through his Wikipedia partnership with George Mason University, so he has been able to supply some absolutely invaluable information to me, but his access will expire shortly. Do you know if the Wikipedia Library will at some point begin a partnership with ProQuest Congressional, or if there is another way that I could gain access as an active editor? Thank you in advance for your time.- RHM22 ( talk) 20:17, 17 March 2015 (UTC) reply

Hi RHM22. Indeed, ProQuest is very valuable and also huge. We are starting with a pilot donation for their Literature Online program. If that is successful we will aim to receive a much broader 'package' including sources like Proquest Congressional. To be very honest, this will take up to a year, so your best bet for now is still WP:RX. Thanks for the feedback and happy researching! Cheers, Jake Ocaasi t | c 20:20, 17 March 2015 (UTC) reply
Thank you for the extremely fast reply, Ocaasi! I understand that it will take some time. ProQuest is indeed an enormous resource, as even just the Congressional archives are incredibly massive. I believe I have everything needed for now, but I will probably require Congressional records for future projects, so I will enquire at the noticeboard if so. Thanks again.- RHM22 ( talk) 20:23, 17 March 2015 (UTC) reply

fa.wiki

Hello dear Jake, I'm working on Farsi branch of The Wikipedia Library in fa.wiki. Please creat your account in our wiki. I'm translate and creat main and about page in this wiki. we are in touch with Patrick. Thank you Darafsh Kaviyani ( Talk) 09:19, 26 June 2015 (UTC) reply

It is a pleasure

It was a pleasure to talk to you in the video chat. You are the first WMF staff member, let alone Wikipedian, that I visually communicated with. :-). If I have any further questions regarding IA, I will let you know.— cyberpower Chat:Limited Access 17:10, 14 August 2015 (UTC) reply

Cyberpower, the please was all mine. It's great to have you in the movement contributing so much! I'll update you on our IA talks in late August. Cheers, Jake Ocaasi (WMF) ( talk) 17:25, 14 August 2015 (UTC) reply

A question

Reading the list of paid databases for which Wikipedians may request free accounts prompted me to consider a related thought. Does Wikipedia make available a list of free online resources for use in research? I have been writing and editing articles for more than a year, and I haven't seen one.

The donated accounts are a great help, and I appreciate the ones that I was fortunate to receive this year. The number of accounts, however, is limited. Meanwhile, some free resources offer specialized searches (such as The Official Academy Awards Database) and searches of publications not included in either newspaper databases or academic journals (such as Media History Digital Library).

As I have written, edited, and done research, I have compiled a list of free resources that have been useful to me to varying degrees. I suspect that many others who work on material on Wikipedia have done the same.

I wonder whether a directory-style list of such resources could be compiled and made available somewhere on Wikipedia. I think it would be especially helpful to newcomers who perhaps are familiar with little more than using basic search engines for online research. It could also be good for those of us who have some experience; perhaps we might pick up a few sources to add to our own favorites.

I thought that your running of The Wikipedia Library might make you the best person for considering this question/proposal. I am interested in your thoughts of whether my idea is feasible and/or desirable. Eddie Blick ( talk) 02:17, 9 October 2015 (UTC) reply

Hey Eddie Blick! Great question. Yes, we do: 1) WP:TWL/OA is an index of useful open access sources and projects; 2) WP:TWL/Resources is a more comprehensive directory of generally useful free (but not necessarily open acess) resources. We even have a team of Research coordinators working on that, and you can sign up to be one if you like :) I hope this helps, I like your ideas, and feel free to reach out any time! Cheers, Jake Ocaasi (WMF) ( talk) 16:00, 9 October 2015 (UTC) reply
Thanks for your reply. I wasn't aware of those two sites; I'll take a look and find out what is available on them. I will also look at the information about being a research coordinator and see whether I might be able to contribute in that way. I appreciate your help. Eddie Blick ( talk) 19:22, 9 October 2015 (UTC) reply

Request for assistance

Hello : Today I have sent you an e mail asking for you counseling and assistance. Please take the time to look at it. Thanks Rudy235 ( talk) 00:19, 24 October 2015 (UTC) reply

How long is the Pete Forsyth panel discussion tonight

Thanks. Cheers! {{u| Checkingfax}} { Talk} 20:55, 28 October 2015 (UTC) reply

6pm-7:30pm California time. Watch it here: https://www.youtube.com/watch?v=t-cF7433aT4 Cheers, Ocaasi (WMF) ( talk) 21:14, 28 October 2015 (UTC) reply
See you there. Cheers! {{u| Checkingfax}} { Talk} 22:42, 28 October 2015 (UTC) reply

Welcome to The Wikipedia Adventure!

Hi Ocaasi (WMF)! We're so happy you wanted to play to learn, as a friendly and fun way to get into our community and mission. I think these links might be helpful to you as you get started.

-- 11:04, Sunday, November 15, 2015 ( UTC)

Welcome to The Wikipedia Adventure!

Hi Ocaasi (WMF)! We're so happy you wanted to play to learn, as a friendly and fun way to get into our community and mission. I think these links might be helpful to you as you get started.

-- 22:58, Thursday, November 19, 2015 ( UTC)

Telegram

Hello, Ocaasi (WMF). Please check your email; you've got mail!
Message added 18:58, 4 January 2016 (UTC). It may take a few minutes from the time the email is sent for it to show up in your inbox. You can remove this notice at any time by removing the {{ You've got mail}} or {{ ygm}} template.

-- samtar whisper 18:58, 4 January 2016 (UTC) reply

Phabricator

Hi Ocaasi, I dropped you a line a little while back - just letting you know I got a project created on Phabricator for tracking bugs more effectively! I've added all the old ones on there - please become a member so you can get updates and notifications :) -- samtar whisper 19:02, 7 January 2016 (UTC) reply

A barnstar for you!

The Tireless Contributor Barnstar
Thank you so much for creating the Wikipedia adventure! I love it and it is so helpful! Elsa Enchanted ( talk) 17:44, 4 April 2016 (UTC) reply

My kudos to Nikkimaria and your thanks

Well, it is Nikkimaria and (I guess) you who should be praised for granting us the access, not me ;).

Zezen ( talk) 20:59, 4 April 2016 (UTC), who has already researched in depth 19th c. Liberia migration policies using this resource and found a fascinating long-forgotten book from 1903 about this very subject, when a spade was still called a spade... reply

APPRECIATION Hey I just love the Wikipedia Adventure. I wonder where do you get such ideas. Must tell me the place!

Amy2563 ( talk) 17:22, 23 May 2016 (UTC) reply

Thanks you for the award

I am taking up the work/pleasure of editing in wikipedia (en español/spanish) and I found The Cure Award. It's very motivating, now I will put more effort to continue in this knowledge sharing adventure. I'm studying now to validate my doctor of medicine degree in USA, and editing Wikipedia is part of my studying routine. Thanks again, greetings, -- Behemot leviatan ( talk) 02:01, 1 June 2016 (UTC) reply

Help needed for OAbot

Hi Jake! Can you please see Help_talk:Citation_Style_1#Adding_open_access_links_to_references and lead the relevant discussion at WP:TWL? Cheers, − Pintoch ( talk) 17:35, 10 June 2016 (UTC) reply

You've got mail!

Hello, Ocaasi (WMF). Please check your email; you've got mail! The subject is Re: The Wikipedia Library coordination.
Message added 01:42, 1 October 2016 (UTC). It may take a few minutes from the time the email is sent for it to show up in your inbox. You can remove this notice at any time by removing the {{ You've got mail}} or {{ ygm}} template.

-- Cameron11598 (Talk) 01:42, 1 October 2016 (UTC) reply

Teahouse question unanswered

Someone had an idea for The Wikipedia Library but didn't get a response here. Thanks.— Vchimpanzee • talk • contributions • 19:25, 13 October 2016 (UTC) reply

Thanks, I replied! Best, Jake Ocaasi (WMF) ( talk) 15:35, 14 October 2016 (UTC) reply
Okay, where is the response, because I checked the Teahouse archives and the talk page of the person who asked for help.— Vchimpanzee • talk • contributions •
Funny, I found another response I left and confused it for the question you asked about. Replied new now on the user's Talk page. Thanks Vchimpanzee! Jake Ocaasi (WMF) ( talk) 21:43, 18 October 2016 (UTC) reply

Page for Litwin Books, LLC

Hi Jake! Good meeting you at DLF in Milwaukee. As you suggested, I created a page for Litwin Books and Library Juice Press in my sandbox page. You had offered to review it. Thanks! Rlitwin ( talk) 16:23, 7 December 2016 (UTC) reply

Hi Rlitwin! I added the draft template to the top of your page. Just click the blue button to submit it for review. I can't review it myself because of my conflict of interest with WMF staff and library publishers, but we have a team of volunteers who do just that! Cheers, Jake Ocaasi (WMF) ( talk) 19:39, 7 December 2016 (UTC) reply
Great - Thank you. Rlitwin ( talk)

You've got mail!!!!!!!

Hello, Ocaasi (WMF). Please check your email; you've got mail!
It may take a few minutes from the time the email is sent for it to show up in your inbox. You can remove this notice at any time by removing the {{ You've got mail}} or {{ ygm}} template.{{u| Checkingfax}} { Talk} 06:13, 1 April 2017 (UTC) reply

Welcome to The Wikipedia Adventure!

Hi Ocaasi (WMF)! We're so happy you wanted to play to learn, as a friendly and fun way to get into our community and mission. I think these links might be helpful to you as you get started.

-- 18:10, Wednesday, April 19, 2017 ( UTC)

Facto Post – Issue 1 – 14 June 2017

Facto Post – Issue 1 – 14 June 2017

Editorial

This newsletter starts with the motto "common endeavour for 21st century content". To unpack that slogan somewhat, we are particularly interested in the new, post-Wikidata collection of techniques that are flourishing under the Wikimedia collaborative umbrella. To linked data, SPARQL queries and WikiCite, add gamified participation, text mining and new holding areas, with bots, tech and humans working harmoniously.

Scientists, librarians and Wikimedians are coming together and providing a more unified view of an emerging area. Further integration of both its community and its technical aspects can be anticipated.

While Wikipedia will remain the discursive heart of Wikimedia, data-rich and semantic content will support it. We'll aim to be both broad and selective in our coverage. This publication Facto Post (the very opposite of retroactive) and call to action are brought to you monthly by ContentMine.

Links
Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 09:33, 14 June 2017 (UTC) reply

Facto Post – Issue 2 – 13 July 2017

Facto Post – Issue 2 – 13 July 2017

Editorial: Core models and topics

Wikimedians interest themselves in everything under the sun — and then some. Discussion on "core topics" may, oddly, be a fringe activity, and was popular here a decade ago.

The situation on Wikidata today does resemble the halcyon days of 2006 of the English Wikipedia. The growth is there, and the reliability and stylistic issues are not yet pressing in on the project. Its Berlin conference at the end of October will have five years of achievement to celebrate. Think Wikimania Frankfurt 2005.

Progress must be made, however, on referencing "core facts". This has two parts: replacing "imported from Wikipedia" in referencing by external authorities; and picking out statements, such as dates and family relationships, that must not only be reliable but be seen to be reliable.

In addition, there are many properties on Wikidata lacking a clear data model. An emerging consensus may push to the front key sourcing and biomedical properties as requiring urgent attention. Wikidata's "manual of style" is currently distributed over thousands of discussions. To make it coalesce, work on such a core is needed.

Links


Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

Facto Post – Issue 3 – 11 August 2017

Facto Post – Issue 3 – 11 August 2017

Wikimania report

Interviewed by Facto Post at the hackathon, Lydia Pintscher of Wikidata said that the most significant recent development is that Wikidata now accounts for one third of Wikimedia edits. And the essential growth of human editing.

Internet-In-A-Box

Impressive development work on Internet-in-a-Box featured in the WikiMedFoundation annual conference on Thursday. Hardware is Raspberry Pi, running Linux and the Kiwix browser. It can operate as a wifi hotspot and support a local intranet in parts of the world lacking phone signal. The medical use case is for those delivering care, who have smartphones but have to function in clinics in just such areas with few reference resources. Wikipedia medical content can be served to their phones, and power supplied by standard lithium battery packages.

Yesterday Katherine Maher unveiled the draft Wikimedia 2030 strategy, featuring a picturesque metaphor, "roads, bridges and villages". Here "bridges" could do with illustration. Perhaps it stands for engineering round or over the obstacles to progress down the obvious highways. Internet-in-a-Box would then do fine as an example.

"Bridging the gap" explains a take on that same metaphor, with its human component. If you are at Wikimania, come talk to WikiFactMine at its stall in the Community Village, just by the 3D-printed display for Bassel Khartabil; come hear T Arrow talk at 3 pm today in Drummond West, Level 3.

Link

  • Plaudit for the Medical Wikipedia app, content that is loaded into Internet-In-A-Box with other material, such as per-country documentation.
Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:55, 12 August 2017 (UTC) reply

Facto Post – Issue 4 – 18 September 2017

Facto Post – Issue 4 – 18 September 2017

Editorial: Conservation data

The IUCN Red List update of 14 September led with a threat to North American ash trees. The International Union for Conservation of Nature produces authoritative species listings that are peer-reviewed. Examples used as metonyms for loss of species and biodiversity, and discussion of extinction rates, are the usual topics covered in the media to inform us about this area. But actual data matters.

Dorstenia elata, a critically endangered South American herb, contained in Moraceae, the family of figs and mulberries

Clearly, conservation work depends on decisions about what should be done, and where. While animals, particularly mammals, are photogenic, species numbers run into millions. Plant species lie at the base of typical land-based food chains, and vegetation is key to the habitats of most animals.

ContentMine dictionaries, for example as tabulated at d:Wikidata:WikiFactMine/Dictionary list, enable detailed control of queries about endangered species, in their taxonomic context. To target conservation measures properly, species listings running into the thousands are not what is needed: range maps showing current distribution are. Between the will to act, and effective steps taken, the services of data handling are required. There is now no reason at all why Wikidata should not take up the burden.

Links

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 14:46, 18 September 2017 (UTC) reply

Facto Post – Issue 5 – 17 October 2017

Facto Post – Issue 5 – 17 October 2017

Editorial: Annotations

Annotation is nothing new. The glossators of medieval Europe annotated between the lines, or in the margins of legal manuscripts of texts going back to Roman times, and created a new discipline. In the form of web annotation, the idea is back, with texts being marked up inline, or with a stand-off system. Where could it lead?

1495 print version of the Digesta of Justinian, with the annotations of the glossator Accursius from the 13th century

ContentMine operates in the field of text and data mining (TDM), where annotation, simply put, can add value to mined text. It now sees annotation as a possible advance in semi-automation, the use of human judgement assisted by bot editing, which now plays a large part in Wikidata tools. While a human judgement call of yes/no, on the addition of a statement to Wikidata, is usually taken as decisive, it need not be. The human assent may be passed into an annotation system, and stored: this idea is standard on Wikisource, for example, where text is considered "validated" only when two different accounts have stated that the proof-reading is correct. A typical application would be to require more than one person to agree that what is said in the reference translates correctly into the formal Wikidata statement. Rejections are also potentially useful to record, for machine learning.

As a contribution to data integrity on Wikidata, annotation has much to offer. Some "hard cases" on importing data are much more difficult than average. There are for example biographical puzzles: whether person A in one context is really identical with person B, of the same name, in another context. In science, clinical medicine require special attention to sourcing ( WP:MEDRS), and is challenging in terms of connecting findings with the methodology employed. Currently decisions in areas such as these, on Wikipedia and Wikidata, are often made ad hoc. In particular there may be no audit trail for those who want to check what is decided.

Annotations are subject to a World Wide Web Consortium standard, and behind the terminology constitute a simple JSON data structure. What WikiFactMine proposes to do with them is to implement the MEDRS guideline, as a formal algorithm, on bibliographical and methodological data. The structure will integrate with those inputs the human decisions on the interpretation of scientific papers that underlie claims on Wikidata. What is added to Wikidata will therefore be supported by a transparent and rigorous system that documents decisions.

An example of the possible future scope of annotation, for medical content, is in the first link below. That sort of detailed abstract of a publication can be a target for TDM, adds great value, and could be presented in machine-readable form. You are invited to discuss the detailed proposal on Wikidata, via its talk page.

Links

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 08:46, 17 October 2017 (UTC) reply

Facto Post – Issue 6 – 15 November 2017

Facto Post – Issue 6 – 15 November 2017

WikidataCon Berlin 28–9 October 2017

WikidataCon 2017 group photo

Under the heading rerum causas cognescere, the first ever Wikidata conference got under way in the Tagesspiegel building with two keynotes, One was on YAGO, about how a knowledge base conceived ten years ago if you assume automatic compilation from Wikipedia. The other was from manager Lydia Pintscher, on the "state of the data". Interesting rumours flourished: the mix'n'match tool and its 600+ datasets, mostly in digital humanities, to be taken off the hands of its author Magnus Manske by the WMF; a Wikibase incubator site is on its way. Announcements came in talks: structured data on Wikimedia Commons is scheduled to make substantive progress by 2019. The lexeme development on Wikidata is now not expected to make the Wiktionary sites redundant, but may facilitate automated compilation of dictionaries.

WD-FIST explained

And so it went, with five strands of talks and workshops, through to 11 pm on Saturday. Wikidata applies to GLAM work via metadata. It may be used in education, raises issues such as author disambiguation, and lends itself to different types of graphical display and reuse. Many millions of SPARQL queries are run on the site every day. Over the summer a large open science bibliography has come into existence there.

Wikidata's fifth birthday party on the Sunday brought matters to a close. See a dozen and more reports by other hands.

Links

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:02, 15 November 2017 (UTC) reply

Facto Post – Issue 7 – 15 December 2017

Facto Post – Issue 7 – 15 December 2017

A new bibliographical landscape

At the beginning of December, Wikidata items on individual scientific articles passed the 10 million mark. This figure contrasts with the state of play in early summer, when there were around half a million. In the big picture, Wikidata is now documenting the scientific literature at a rate that is about eight times as fast as papers are published. As 2017 ends, progress is quite evident.

Behind this achievement are a technical advance ( fatameh), and bots that do the lifting. Much more than dry migration of metadata is potentially involved, however. If paper A cites paper B, both papers having an item, a link can be created on Wikidata, and the information presented to both human readers, and machines. This cross-linking is one of the most significant aspects of the scientific literature, and now a long-sought open version is rapidly being built up.

The effort for the lifting of copyright restrictions on citation data of this kind has had real momentum behind it during 2017. WikiCite and the I4OC have been pushing hard, with the result that on CrossRef over 50% of the citation data is open. Now the holdout publishers are being lobbied to release rights on citations.

But all that is just the beginning. Topics of papers are identified, authors disambiguated, with significant progress on the use of the four million ORCID IDs for researchers, and proposals formulated to identify methodology in a machine-readable way. P4510 on Wikidata has been introduced so that methodology can sit comfortably on items about papers.

More is on the way. OABot applies the unpaywall principle to Wikipedia referencing. It has been proposed that Wikidata could assist WorldCat in compiling the global history of book translation. Watch this space.

And make promoting #1lib1ref one of your New Year's resolutions. Happy holidays, all!

November 2017 map of geolocated Wikidata items, made by Addshore

Links


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 14:54, 15 December 2017 (UTC) reply

Facto Post – Issue 8 – 15 January 2018

Facto Post – Issue 8 – 15 January 2018

Metadata on the March

From the days of hard-copy liner notes on music albums, metadata have stood outside a piece or file, while adding to understanding of where it comes from, and some of what needs to be appreciated about its content. In the GLAM sector, the accumulation of accurate metadata for objects is key to the mission of an institution, and its presentation in cataloguing.

Today Wikipedia turns 17, with worlds still to conquer. Zooming out from the individual GLAM object to the ontology in which it is set, one such world becomes apparent: GLAMs use custom ontologies, and those introduce massive incompatibilities. From a recent article by sadads, we quote the observation that "vocabularies needed for many collections, topics and intellectual spaces defy the expectations of the larger professional communities." A job for the encyclopedist, certainly. But the data-minded Wikimedian has the advantages of Wikidata, starting with its multilingual data, and facility with aliases. The controlled vocabulary — sometimes referred to as a "thesaurus" as term of art — simplifies search: if a "spade" must be called that, rather than "shovel", it is easier to find all spade references. That control comes at a cost.

SVG pedestrian crosses road
Zebra crossing/crosswalk, Singapore

Case studies in that article show what can lie ahead. The schema crosswalk, in jargon, is a potential answer to the GLAM Babel of proliferating and expanding vocabularies. Even if you have no interest in Wikidata as such, simply vocabularies V and W, if both V and W are matched to Wikidata, then a "crosswalk" arises from term v in V to w in W, whenever v and w both match to the same item d in Wikidata.

For metadata mobility, match to Wikidata. It's apparently that simple: infrastructure requirements have turned out, so far, to be challenges that can be met.

Links


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 12:38, 15 January 2018 (UTC) reply

Facto Post – Issue 9 – 5 February 2018

Facto Post – Issue 9 – 5 February 2018

m:Grants:Project/ScienceSource is the new ContentMine proposal: please take a look.

Wikidata as Hub

One way of looking at Wikidata relates it to the semantic web concept, around for about as long as Wikipedia, and realised in dozens of distributed Web institutions. It sees Wikidata as supplying central, encyclopedic coverage of linked structured data, and looks ahead to greater support for "federated queries" that draw together information from all parts of the emerging network of websites.

Another perspective might be likened to a photographic negative of that one: Wikidata as an already-functioning Web hub. Over half of its properties are identifiers on other websites. These are Wikidata's "external links", to use Wikipedia terminology: one type for the DOI of a publication, another for the VIAF page of an author, with thousands more such. Wikidata links out to sites that are not nominally part of the semantic web, effectively drawing them into a larger system. The crosswalk possibilities of the systematic construction of these links was covered in Issue 8.

Wikipedia:External links speaks of them as kept "minimal, meritable, and directly relevant to the article." Here Wikidata finds more of a function. On viaf.org one can type a VIAF author identifier into the search box, and find the author page. The Wikidata Resolver tool, these days including Open Street Map, Scholia etc., allows this kind of lookup. The hub tool by maxlath takes a major step further, allowing both lookup and crosswalk to be encoded in a single URL.

Links


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 11:50, 5 February 2018 (UTC) reply

Facto Post – Issue 10 – 12 March 2018

Facto Post – Issue 10 – 12 March 2018

Milestone for mix'n'match

Around the time in February when Wikidata clicked past item Q50000000, another milestone was reached: the mix'n'match tool uploaded its 1000th dataset. Concisely defined by its author, Magnus Manske, it works "to match entries in external catalogs to Wikidata". The total number of entries is now well into eight figures, and more are constantly being added: a couple of new catalogs each day is normal.

Since the end of 2013, mix'n'match has gradually come to play a significant part in adding statements to Wikidata. Particularly in areas with the flavour of digital humanities, but datasets can of course be about practically anything. There is a catalog on skyscrapers, and two on spiders.

These days mix'n'match can be used in numerous modes, from the relaxed gamified click through a catalog looking for matches, with prompts, to the fantastically useful and often demanding search across all catalogs. I'll type that again: you can search 1000+ datasets from the simple box at the top right. The drop-down menu top left offers "creation candidates", Magnus's personal favourite. m:Mix'n'match/Manual for more.

For the Wikidatan, a key point is that these matches, however carried out, add statements to Wikidata if, and naturally only if, there is a Wikidata property associated with the catalog. For everyone, however, the hands-on experience of deciding of what is a good match is an education, in a scholarly area, biographical catalogs being particularly fraught. Underpinning recent rapid progress is an open infrastructure for scraping and uploading.

Congratulations to Magnus, our data Stakhanovite!

Links

3D printing

To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 12:26, 12 March 2018 (UTC) reply

Facto Post – Issue 11 – 9 April 2018

Facto Post – Issue 11 – 9 April 2018

The 100 Skins of the Onion

Open Citations Month, with its eminently guessable hashtag, is upon us. We should be utterly grateful that in the past 12 months, so much data on which papers cite which other papers has been made open, and that Wikidata is playing its part in hosting it as "cites" statements. At the time of writing, there are 15.3M Wikidata items that can do that.

Pulling back to look at open access papers in the large, though, there is is less reason for celebration. Access in theory does not yet equate to practical access. A recent LSE IMPACT blogpost puts that issue down to "heterogeneity". A useful euphemism to save us from thinking that the whole concept doesn't fall into the realm of the oxymoron.

Some home truths: aggregation is not content management, if it falls short on reusability. The PDF file format is wedded to how humans read documents, not how machines ingest them. The salami-slicer is our friend in the current downloading of open access papers, but for a better metaphor, think about skinning an onion, laboriously, 100 times with diminishing returns. There are of the order of 100 major publisher sites hosting open access papers, and the predominant offer there is still a PDF.

Red onion cross section

From the discoverability angle, Wikidata's bibliographic resources combined with the SPARQL query are superior in principle, by far, to existing keyword searches run over papers. Open access content should be managed into consistent HTML, something that is currently strenuous. The good news, such as it is, would be that much of it is already in XML. The organisational problem of removing further skins from the onion, with sensible prioritisation, is certainly not insuperable. The CORE group (the bloggers in the LSE posting) has some answers, but actually not all that is needed for the text and data mining purposes they highlight. The long tail, or in other words the onion heart when it has become fiddly beyond patience to skin, does call for a pis aller. But the real knack is to do more between the XML and the heart.

Links


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 16:25, 9 April 2018 (UTC) reply

Facto Post – Issue 12 – 28 May 2018

Facto Post – Issue 12 – 28 May 2018

ScienceSource funded

The Wikimedia Foundation announced full funding of the ScienceSource grant proposal from ContentMine on May 18. See the ScienceSource Twitter announcement and 60 second video.

A medical canon?

The proposal includes downloading 30,000 open access papers, aiming (roughly speaking) to create a baseline for medical referencing on Wikipedia. It leaves open the question of how these are to be chosen.

The basic criteria of WP:MEDRS include a concentration on secondary literature. Attention has to be given to the long tail of diseases that receive less current research. The MEDRS guideline supposes that edge cases will have to be handled, and the premature exclusion of publications that would be in those marginal positions would reduce the value of the collection. Prophylaxis misses the point that gate-keeping will be done by an algorithm.

Two well-known but rather different areas where such considerations apply are tropical diseases and alternative medicine. There are also a number of potential downloading troubles, and these were mentioned in Issue 11. There is likely to be a gap, even with the guideline, between conditions taken to be necessary but not sufficient, and conditions sufficient but not necessary, for candidate papers to be included. With around 10,000 recognised medical conditions in standard lists, being comprehensive is demanding. With all of these aspects of the task, ScienceSource will seek community help.

Links

OpenRefine logo, courtesy of Google

To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM. ScienceSource pages will be announced there, and in this mass message.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:16, 28 May 2018 (UTC) reply

Facto Post – Issue 13 – 29 May 2018

Facto Post – Issue 13 – 29 May 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Respecting MEDRS

Facto Post enters its second year, with a Cambridge Blue (OK, Aquamarine) background, a new logo, but no Cambridge blues. On-topic for the ScienceSource project is a project page here. It contains some case studies on how the WP:MEDRS guideline, for the referencing of articles at all related to human health, is applied in typical discussions.

Close to home also, a template, called {{ medrs}} for short, is used to express dissatisfaction with particular references. Technology can help with patrolling, and this Petscan query finds over 450 articles where there is at least one use of the template. Of course the template is merely suggesting there is a possible issue with the reliability of a reference. Deciding the truth of the allegation is another matter.

This maintenance issue is one example of where ScienceSource aims to help. Where the reference is to a scientific paper, its type of algorithm could give a pass/fail opinion on such references. It could assist patrollers of medical articles, therefore, with the templated references and more generally. There may be more to proper referencing than that, indeed: context, quite what the statement supported by the reference expresses, prominence and weight. For that kind of consideration, case studies can help. But an algorithm might help to clear the backlog.

Evidence pyramid leading up to clinical guidelines, from WP:MEDRS
Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 18:19, 29 June 2018 (UTC) reply

Facto Post – Issue 14 – 21 July 2018

Facto Post – Issue 14 – 21 July 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Plugging the gaps – Wikimania report

Officially it is "bridging the gaps in knowledge", with Wikimania 2018 in Cape Town paying tribute to the southern African concept of ubuntu to implement it. Besides face-to-face interactions, Wikimedians do need their power sources.

Hackathon mentoring table wiring

Facto Post interviewed Jdforrester, who has attended every Wikimania, and now works as Senior Product Manager for the Wikimedia Foundation. His take on tackling the gaps in the Wikimedia movement is that "if we were an army, we could march in a column and close up all the gaps". In his view though, that is a faulty metaphor, and it leads to a completely false misunderstanding of the movement, its diversity and different aspirations, and the nature of the work as "fighting" to be done in the open sector. There are many fronts, and as an eventualist he feels the gaps experienced both by editors and by users of Wikimedia content are inevitable. He would like to see a greater emphasis on reuse of content, not simply its volume.

If that may not sound like radicalism, the Decolonizing the Internet conference here organized jointly with Whose Knowledge? can redress the picture. It comes with the claim to be "the first ever conference about centering marginalized knowledge online".

Plugbar buildup at the Hackathon
Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 06:10, 21 July 2018 (UTC) reply

Facto Post – Issue 15 – 21 August 2018

Facto Post – Issue 15 – 21 August 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Neglected diseases
Anti-parasitic drugs being distributed in Côte d'Ivoire
What's a Neglected Disease?, ScienceSource video

To grasp the nettle, there are rare diseases, there are tropical diseases and then there are "neglected diseases". Evidently a rare enough disease is likely to be neglected, but neglected disease these days means a disease not rare, but tropical, and most often infectious or parasitic. Rare diseases as a group are dominated, in contrast, by genetic diseases.

A major aspect of neglect is found in tracking drug discovery. Orphan drugs are those developed to treat rare diseases (rare enough not to have market-driven research), but there is some overlap in practice with the WHO's neglected diseases, where snakebite, a "neglected public health issue", is on the list.

From an encyclopedic point of view, lack of research also may mean lack of high-quality references: the core medical literature differs from primary research, since it operates by aggregating trials. This bibliographic deficit clearly hinders Wikipedia's mission. The ScienceSource project is currently addressing this issue, on Wikidata. Its Wikidata focus list at WD:SSFL is trying to ensure that neglect does not turn into bias in its selection of science papers.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 13:23, 21 August 2018 (UTC) reply

Facto Post – Issue 16 – 30 September 2018

Facto Post – Issue 16 – 30 September 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

The science publishing landscape

In an ideal world ... no, bear with your editor for just a minute ... there would be a format for scientific publishing online that was as much a standard as SI units are for the content. Likewise cataloguing publications would not be onerous, because part of the process would be to generate uniform metadata. Without claiming it could be the mythical free lunch, it might be reasonably be argued that sandwiches can be packaged much alike and have barcodes, whatever the fillings.

The best on offer, to stretch the metaphor, is the meal kit option, in the form of XML. Where scientific papers are delivered as XML downloads, you get all the ingredients ready to cook. But have to prepare the actual meal of slow food yourself. See Scholarly HTML for a recent pass at heading off XML with HTML, in other words in the native language of the Web.

The argument from real life is a traditional mixture of frictional forces, vested interests, and the classic irony of the principle of unripe time. On the other hand, discoverability actually diminishes with the prolific progress of science publishing. No, it really doesn't scale. Wikimedia as movement can do something in such cases. We know from open access, we grok the Web, we have our own horse in the HTML race, we have Wikidata and WikiJournal, and we have the chops to act.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 17:57, 30 September 2018 (UTC) reply

Facto Post – Issue 17 – 29 October 2018

Facto Post – Issue 17 – 29 October 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Wikidata imaged

Around 2.7 million Wikidata items have an illustrative image. These files, you might say, are Wikimedia's stock images, and if the number is large, it is still only 5% or so of items that have one. All such images are taken from Wikimedia Commons, which has 50 million media files. One key issue is how to expand the stock.

Indeed, there is a tool. WD-FIST exploits the fact that each Wikipedia is differently illustrated, mostly with images from Commons but also with fair use images. An item that has sitelinks but no illustrative image can be tested to see if the linked wikis have a suitable one. This works well for a volunteer who wants to add images at a reasonable scale, and a small amount of SPARQL knowledge goes a long way in producing checklists.

Gran Teatro, Cáceres, Spain, at night

It should be noted, though, that there are currently 53 Wikidata properties that link to Commons, of which P18 for the basic image is just one. WD-FIST prompts the user to add signatures, plaques, pictures of graves and so on. There are a couple of hundred monograms, mostly of historical figures, and this query allows you to view all of them. commons:Category:Monograms and its subcategories provide rich scope for adding more.

And so it is generally. The list of properties linking to Commons does contain a few that concern video and audio files, and rather more for maps. But it contains gems such as P3451 for "nighttime view". Over 1000 of those on Wikidata, but as for so much else, there could be yet more.

Go on. Today is Wikidata's birthday. An illustrative image is always an acceptable gift, so why not add one? You can follow these easy steps: (i) log in at https://tools.wmflabs.org/widar/, (ii) paste the Petscan ID 6263583 into https://tools.wmflabs.org/fist/wdfist/ and click run, and (iii) just add cake.

Birthday logo
Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 15:01, 29 October 2018 (UTC) reply

Facto Post – Issue 18 – 30 November 2018

Facto Post – Issue 18 – 30 November 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

WikiCite issue

GLAM ♥ data — what is a gallery, library, archive or museum without a catalogue? It follows that Wikidata must love librarians. Bibliography supports students and researchers in any topic, but open and machine-readable bibliographic data even more so, outside the silo. Cue the WikiCite initiative, which was meeting in conference this week, in the Bay Area of California.

Wikidata training for librarians at WikiCite 2018

In fact there is a broad scope: "Open Knowledge Maps via SPARQL" and the "Sum of All Welsh Literature", identification of research outputs, Library.Link Network and Bibframe 2.0, OSCAR and LUCINDA (who they?), OCLC and Scholia, all these co-exist on the agenda. Certainly more library science is coming Wikidata's way. That poses the question about the other direction: is more Wikimedia technology advancing on libraries? Good point.

Wikimedians generally are not aware of the tech background that can be assumed, unless they are close to current training for librarians. A baseline definition is useful here: " bash, git and OpenRefine". Compare and contrast with pywikibot, GitHub and mix'n'match. Translation: scripting for automation, version control, data set matching and wrangling in the large, are on the agenda also for contemporary library work. Certainly there is some possible common ground here. Time to understand rather more about the motivations that operate in the library sector.

Links

Account creation is now open on the ScienceSource wiki, where you can see SPARQL visualisations of text mining.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 11:20, 30 November 2018 (UTC) reply

Facto Post – Issue 19 – 27 December 2018

Facto Post – Issue 19 – 27 December 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Learning from Zotero

Zotero is free software for reference management by the Center for History and New Media: see Wikipedia:Citing sources with Zotero. It is also an active user community, and has broad-based language support.

Zotero logo

Besides the handiness of Zotero's warehousing of personal citation collections, the Zotero translator underlies the citoid service, at work behind the VisualEditor. Metadata from Wikidata can be imported into Zotero; and in the other direction the zotkat tool from the University of Mannheim allows Zotero bibliographies to be exported to Wikidata, by item creation. With an extra feature to add statements, that route could lead to much development of the focus list (P5008) tagging on Wikidata, by WikiProjects.

Zotero demo video

There is also a large-scale encyclopedic dimension here. The construction of Zotero translators is one facet of Web scraping that has a strong community and open source basis. In that it resembles the less formal mix'n'match import community, and growing networks around other approaches that can integrate datasets into Wikidata, such as the use of OpenRefine.

Looking ahead, the thirtieth birthday of the World Wide Web falls in 2019, and yet the ambition to make webpages routinely readable by machines can still seem an ever-retreating mirage. Wikidata should not only be helping Wikimedia integrate its projects, an ongoing process represented by Structured Data on Commons and lexemes. It should also be acting as a catalyst to bring scraping in from the cold, with institutional strengths as well as resourceful code.

Links

Diversitech, the latest ContentMine grant application to the Wikimedia Foundation, is in its community review stage until January 2.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 19:08, 27 December 2018 (UTC) reply

Facto Post – Issue 20 – 31 January 2019

Facto Post – Issue 20 – 31 January 2019

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Everything flows (and certainly data does)

Recently Jimmy Wales has made the point that computer home assistants take much of their data from Wikipedia, one way or another. So as well as getting Spotify to play Frosty the Snowman for you, they may be able to answer the question "is the Pope Catholic?" Possibly by asking for disambiguation ( Coptic?).

Amazon Echo device using the Amazon Alexa service in voice search showdown with the Google rival on an Android phone

Headlines about data breaches are now familiar, but the unannounced circulation of information raises other issues. One of those is Gresham's law stated as "bad data drives out good". Wikipedia and now Wikidata have been criticised on related grounds: what if their content, unattributed, is taken to have a higher standing than Wikimedians themselves would grant it? See Wikiquote on a misattribution to Bismarck for the usual quip about "law and sausages", and why one shouldn't watch them in the making.

Wikipedia has now turned 18, so should act like as adult, as well as being treated like one. The Web itself turns 30 some time between March and November this year, per Tim Berners-Lee. If the Knowledge Graph by Google exemplifies Heraclitean Web technology gaining authority, contra GIGO, Wikimedians still have a role in its critique. But not just with the teenage skill of detecting phoniness.

There is more to beating Gresham than exposing the factoid and urban myth, where WP:V does do a great job. Placeholders must be detected, and working with Wikidata is a good way to understand how having one statement as data can blind us to replacing it by a more accurate one. An example that is important to open access is that, firstly, the term itself needs considerable unpacking, because just being able to read material online is a poor relation of "open"; and secondly, trying to get Creative Commons license information into Wikidata shows up issues with classes of license (such as CC-BY) standing for the actual license in major repositories. Detailed investigation shows that "everything flows" exacerbates the issue. But Wikidata can solve it.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:53, 31 January 2019 (UTC) reply

Facto Post – Issue 21 – 28 February 2019

Facto Post – Issue 21 – 28 February 2019

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

What is a systematic review?

Systematic reviews are basic building blocks of evidence-based medicine, surveys of existing literature devoted typically to a definite question that aim to bring out scientific conclusions. They are principled in a way Wikipedians can appreciate, taking a critical view of their sources.

PRISMA flow diagram for a systematic review

Ben Goldacre in 2014 wrote (link below) "[...] : the "information architecture" of evidence based medicine (if you can tolerate such a phrase) is a chaotic, ad hoc, poorly connected ecosystem of legacy projects. In some respects the whole show is still run on paper, like it's the 19th century." Is there a Wikidatan in the house? Wouldn't some machine-readable content that is structured data help?

File:Schittny, Facing East, 2011, Legacy Projects.jpg
2011 photograph by Bernard Schittny of the "Legacy Projects" group

Most likely it would, but the arcana of systematic reviews and how they add value would still need formal handling. The PRISMA standard dates from 2009, with an update started in 2018. The concerns there include the corpus of papers used: how selected and filtered? Now that Wikidata has a 20.9 million item bibliography, one can at least pose questions. Each systematic review is a tagging opportunity for a bibliography. Could that tagging be reproduced by a query, in principle? Can it even be second-guessed by a query (i.e. simulated by a protocol which translates into SPARQL)? Homing in on the arcana, do the inclusion and filtering criteria translate into metadata? At some level they must, but are these metadata explicitly expressed in the articles themselves? The answer to that is surely "no" at this point, but can TDM find them? Again "no", right now. Automatic identification doesn't just happen.

Actually these questions lack originality. It should be noted though that WP:MEDRS, the reliable sources guideline used here for health information, hinges on the assumption that the usefully systematic reviews of biomedical literature can be recognised. Its nutshell summary, normally the part of a guideline with the highest density of common sense, allows literature reviews in general validity, but WP:MEDASSESS qualifies that indication heavily. Process wonkery about systematic reviews definitely has merit.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:02, 28 February 2019 (UTC) reply

Facto Post – Issue 22 – 28 March 2019

Facto Post – Issue 22 – 28 March 2019

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

When in the cloud, do as the APIs do

Half a century ago, it was the era of the mainframe computer, with its air-conditioned room, twitching tape-drives, and appearance in the title of a spy novel Billion-Dollar Brain then made into a Hollywood film. Now we have the cloud, with server farms and the client–server model as quotidian: this text is being typed on a Chromebook.

File:Cloud-API-Logo.svg
Logo of Cloud API on Google Cloud Platform

The term Applications Programming Interface or API is 50 years old, and refers to a type of software library as well as the interface to its use. While a compiler is what you need to get high-level code executed by a mainframe, an API out in the cloud somewhere offers a chance to perform operations on a remote server. For example, the multifarious bots active on Wikipedia have owners who exploit the MediaWiki API.

APIs (called RESTful) that allow for the GET HTTP request are fundamental for what could colloquially be called "moving data around the Web"; from which Wikidata benefits 24/7. So the fact that the Wikidata SPARQL endpoint at query.wikidata.org has a RESTful API means that, in lay terms, Wikidata content can be GOT from it. The programming involved, besides the SPARQL language, could be in Python, younger by a few months than the Web.

Magic words, such as occur in fantasy stories, are wishful (rather than RESTful) solutions to gaining access. You may need to be a linguist to enter Ali Baba's cave or the western door of Moria (French in the case of " Open Sesame", in fact, and Sindarin being the respective languages). Talking to an API requires a bigger toolkit, which first means you have to recognise the tools in terms of what they can do. On the way to the wikt:impactful or polymathic modern handling of facts, one must perhaps take only tactful notice of tech's endemic problem with documentation, and absorb the insightful point that the code in APIs does articulate the customary procedures now in place on the cloud for getting information. As Owl explained to Winnie-the-Pooh, it tells you The Thing to Do.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 11:45, 28 March 2019 (UTC) reply

Facto Post – Issue 23 – 30 April 2019

Facto Post – Issue 23 – 30 April 2019

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Completely clouded?
Cloud computing logo

Talk of cloud computing draws a veil over hardware, but also, less obviously but more importantly, obscures such intellectual distinction as matters most in its use. Wikidata begins to allow tasks to be undertaken that were out of easy reach. The facility should not be taken as the real point.

Coming in from another angle, the "executive decision" is more glamorous; but the "administrative decision" should be admired for its command of facts. Think of the attitudes ad fontes, so prevalent here on Wikipedia as "can you give me a source for that?", and being prepared to deal with complicated analyses into specified subcases. Impatience expressed as a disdain for such pedantry is quite understandable, but neither dirty data nor false dichotomies are at all good to have around.

Issue 13 and Issue 21, respectively on WP:MEDRS and systematic reviews, talk about biomedical literature and computing tasks that would be of higher quality if they could be made more "administrative". For example, it is desirable that the decisions involved be consistent, explicable, and reproducible by non-experts from specified inputs.

What gets clouded out is not impossibly hard to understand. You do need to put together the insights of functional programming, which is a doctrinaire and purist but clearcut approach, with the practicality of office software. Loopless computation can be conceived of as a seamless forward march of spreadsheet columns, each determined by the content of previous ones. Very well: to do a backward audit, when now we are talking about Wikidata, we rely on integrity of data and its scrupulous sourcing: and clearcut case analyses. The MEDRS example forces attention on purge attempts such as Beall's list.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 11:27, 30 April 2019 (UTC) reply

Facto Post – Issue 24 – 17 May 2019

Facto Post – Issue 24 – 17 May 2019
Text mining display of noun phrases from the US Presidential Election 2012

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.
Semantic Web and TDM – a ContentMine view

Two dozen issues, and this may be the last, a valediction at least for a while.

It's time for a two-year summation of ContentMine projects involving TDM ( text and data mining).

Wikidata and now Structured Data on Commons represent the overlap of Wikimedia with the Semantic Web. This common ground is helping to convert an engineering concept into a movement. TDM generally has little enough connection with the Semantic Web, being instead in the orbit of machine learning which is no respecter of the semantic. Don't break a taboo by asking bots "and what do you mean by that?"

The ScienceSource project innovates in TDM, by storing its text mining results in a Wikibase site. It strives for compliance of its fact mining, on drug treatments of diseases, with an automated form of the relevant Wikipedia referencing guideline MEDRS. Where WikiFactMine set up an API for reuse of its results, ScienceSource has a SPARQL query service, with look-and-feel exactly that of Wikidata's at query.wikidata.org. It also now has a custom front end, and its content can be federated, in other words used in data mashups: it is one of over 50 sites that can federate with Wikidata.

The human factor comes to bear through the front end, which combines a link to the HTML version of a paper, text mining results organised in drug and disease columns, and a SPARQL display of nearby drug and disease terms. Much software to develop and explain, so little time! Rather than telling the tale, Facto Post brings you ScienceSource links, starting from the how-to video, lower right.

ScienceSourceReview, introductory video: but you need run it from the original upload file on Commons
Links for participation

The review tool requires a log in on sciencesource.wmflabs.org, and an OAuth permission (bottom of a review page) to operate. It can be used in simple and more advanced workflows. Examples of queries for the latter are at d:Wikidata_talk:ScienceSource project/Queries#SS_disease_list and d:Wikidata_talk:ScienceSource_project/Queries#NDF-RT issue.

Please be aware that this is a research project in development, and may have outages for planned maintenance. That will apply for the next few days, at least. The ScienceSource wiki main page carries information on practical matters. Email is not enabled on the wiki: use site mail here to Charles Matthews in case of difficulty, or if you need support. Further explanatory videos will be put into commons:Category:ContentMine videos.


If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 18:52, 17 May 2019 (UTC) reply

Invitation to participate in a discussion about publicly disclosing subscribers of TWL resources

Hi Ocaasi (WMF), I have started a discussion over our Village pump with the aim of maintaining a public list of all editors who are granted access to any TWL resource. Your thoughts and opinions on the proposal are welcome:-) Regards, WBG converse

not sure if its of much value, but wrote an diversity and inclusion essay here, if youre interested

Any thoughts appreciated :) -- [E.3] [chat2] [me] 18:36, 13 August 2019 (UTC) reply

From Wikipedia, the free encyclopedia

Please leave me a message below:

Your message here!

FYI: Wikipedia talk:De Gruyter and de:Wikipedia: De Gruyter Greets -- Informationswiedergutmachung ( talk) 23:40, 15 October 2014 (UTC) reply

Hey thanks for the heads up. Saehrimnir is coordinating there and it looks like signups are going well. They should be sent out soon, hopefully by New Year's. Best, Jake Ocaasi (WMF) ( talk) 21:20, 23 December 2014 (UTC) reply

Partnership with French paid journal sources

Good evening,

In France, there are ongoing experiments by journalists to free themselves of the advertisement business and to run paid pure player journals.

Mediapart uses this model since 2008 and has recently been joined by a project started on Kickstarter, by experimented French journalists from serious background, fr:Hexagones. The first has proven in several French affairs to be a valuable source. The last wants to provide a source more oriented to quality investigation than to buzz news.

I would like to approach these two medias to offer a Library partnership.

For that, I would first like to know if you have a grid of the arguments (better content and verifiability for Wikipedia, more external links and usage increase for the partner) to use for such approach.

Secondly, I would also like to know if I got a positive answer, if web newspapers enters in the scope of the Library project, or if only archives of web journal enters in scope. -- Dereckson ( talk) 22:44, 19 October 2014 (UTC) reply

Hi Dereckson! We've just updated our partner promotional messaging. You can view/use/copy/adapt it here: [1]. Any reliable sources are within the scope of the project, definitely including web newspapers. Thanks so much for inquiring and let me know if you have any questions or progress or success! Cheers, Jake Ocaasi (WMF) ( talk) 21:20, 23 December 2014 (UTC) reply

De Gruyter data protection

Hi can u answer us on German WP what are exactly the data protection conditions. Thx.-- Sanandros ( talk) 05:44, 21 October 2014 (UTC) reply

Thanks, I've replied there. Ocaasi (WMF) ( talk) 21:20, 23 December 2014 (UTC) reply

Wikipedia Library

Hi, I notice that Research coordinators are needed for the Wikipedia library and I would love to help by becoming one. I have access to Questia so Ibelieve I could do the job. Thanks. Uhlan talk 00:26, 18 November 2014 (UTC) reply

Hi Uhlan. Thanks so much for your offer. I'm excited to have you on as a new Research Coordinator. I'm excited to be working with you! Jake Ocaasi (WMF) ( talk) 21:20, 23 December 2014 (UTC) reply


A barnstar for you and your team!

The Special Barnstar
Your success in creating partnerships with publishers of high quality content is appreciated. Your work is essential! Happy Christmas JimRenge ( talk) 21:18, 23 December 2014 (UTC) reply
That is so kind User:JimRenge! It indeed takes a team and I want to share this wholeheartedly with User:Astinson (WMF), User:Nikkimaria, and User:PEarley (WMF). Best wishes from all of us this holiday. I hope you enjoy it. Best, Jake Ocaasi (WMF) ( talk) 21:22, 23 December 2014 (UTC) reply

London Times archival access

I don't know if anyone's approached Gale about access to their archive of the London Times, but I would really like access as it's a very good reference to British military and naval promotions and appointments. I believe that most editors in the UK can get access through their public libraries, but that leaves most non-academic users outside the UK in the lurch. I believe that the New York Public Library has a subscription so that might be another avenue to be explored.-- Sturmvogel 66 ( talk) 18:11, 13 January 2015 (UTC) reply

Thanks for the suggestion Sturmvogel. We have unsuccessfully tried to get LT before, but I will add it to our ongoing outreach list. We are also always tracking suggestions at Wikipedia:The_Wikipedia_Library/Journals/Requests in case other ideas cross your mind. Cheers! Jake Ocaasi (WMF) ( talk) 00:35, 15 January 2015 (UTC) reply

Partner/Outreach Coordinators

Hi Jake - I see that the Wikipedia Library needs Partner and Outreach coordinators. What do those jobs entail? - Thanks; LeoRomero ( talk) 19:00, 21 February 2015 (UTC) reply

Hi LeoRomero ( talk · contribs)! Thanks so much for expressing interest.
  • Partner coordinators help out with contacting potential donors--publishers, databases, university presses, journals, etc. We have pretty refined brochures and packaging about our programs but it still takes time to make contact and see those initial talks through to a donation. We are ready to train someone comfortable with professional, real-world outreach skills so we can broaden the number and scope of our partnerships.
  • Outreach coordinators help us connect with and communicate to the external world of institutions, organizations, groups, and events. This involves micro-tasks like reposting relevant twitter content from @WikiLibrary and on Facebook, medium tasks discovering stories from among journal access recipients for the Wikimedia Blog or working on the TWL Books & Bytes newsletter, high-impact conference coordination such as submitting proposals for major events where TWL would present, and advanced tasks such as writing long-form opinion pieces about the role of Wikipedia in a changing landscape of digital reference.
For both positions we would be ready to gradually introduce you to increasingly relevant and challenging projects so you could grow into them. We are learning as you are learning about how to scale our work and would be excited to experiment with you in a role that felt well-scoped and interesting for your background and leanings. Let me know what you think! Please feel free to email me at: jorlowitz@wikimedia.org if you'd like to chat off-wiki. Cheers! Ocaasi (WMF) ( talk) 18:25, 2 March 2015 (UTC) reply
Thanks Jake, email's on the way! LeoRomero ( talk) 19:36, 2 March 2015 (UTC) reply

ProQuest Congressional

Hi, Ocaasi. Wehwalt suggested that I leave a message here on your talk page. I was wondering if there is some way for me to gain access to the website ProQuest Congressional. Most of my work on Wikipedia involves coins and coin legislation, which is covered in-depth on ProQuest Congressional, including laws, proposed laws and Congressional hearings. Currently, Wehwalt has access to the website through his Wikipedia partnership with George Mason University, so he has been able to supply some absolutely invaluable information to me, but his access will expire shortly. Do you know if the Wikipedia Library will at some point begin a partnership with ProQuest Congressional, or if there is another way that I could gain access as an active editor? Thank you in advance for your time.- RHM22 ( talk) 20:17, 17 March 2015 (UTC) reply

Hi RHM22. Indeed, ProQuest is very valuable and also huge. We are starting with a pilot donation for their Literature Online program. If that is successful we will aim to receive a much broader 'package' including sources like Proquest Congressional. To be very honest, this will take up to a year, so your best bet for now is still WP:RX. Thanks for the feedback and happy researching! Cheers, Jake Ocaasi t | c 20:20, 17 March 2015 (UTC) reply
Thank you for the extremely fast reply, Ocaasi! I understand that it will take some time. ProQuest is indeed an enormous resource, as even just the Congressional archives are incredibly massive. I believe I have everything needed for now, but I will probably require Congressional records for future projects, so I will enquire at the noticeboard if so. Thanks again.- RHM22 ( talk) 20:23, 17 March 2015 (UTC) reply

fa.wiki

Hello dear Jake, I'm working on Farsi branch of The Wikipedia Library in fa.wiki. Please creat your account in our wiki. I'm translate and creat main and about page in this wiki. we are in touch with Patrick. Thank you Darafsh Kaviyani ( Talk) 09:19, 26 June 2015 (UTC) reply

It is a pleasure

It was a pleasure to talk to you in the video chat. You are the first WMF staff member, let alone Wikipedian, that I visually communicated with. :-). If I have any further questions regarding IA, I will let you know.— cyberpower Chat:Limited Access 17:10, 14 August 2015 (UTC) reply

Cyberpower, the please was all mine. It's great to have you in the movement contributing so much! I'll update you on our IA talks in late August. Cheers, Jake Ocaasi (WMF) ( talk) 17:25, 14 August 2015 (UTC) reply

A question

Reading the list of paid databases for which Wikipedians may request free accounts prompted me to consider a related thought. Does Wikipedia make available a list of free online resources for use in research? I have been writing and editing articles for more than a year, and I haven't seen one.

The donated accounts are a great help, and I appreciate the ones that I was fortunate to receive this year. The number of accounts, however, is limited. Meanwhile, some free resources offer specialized searches (such as The Official Academy Awards Database) and searches of publications not included in either newspaper databases or academic journals (such as Media History Digital Library).

As I have written, edited, and done research, I have compiled a list of free resources that have been useful to me to varying degrees. I suspect that many others who work on material on Wikipedia have done the same.

I wonder whether a directory-style list of such resources could be compiled and made available somewhere on Wikipedia. I think it would be especially helpful to newcomers who perhaps are familiar with little more than using basic search engines for online research. It could also be good for those of us who have some experience; perhaps we might pick up a few sources to add to our own favorites.

I thought that your running of The Wikipedia Library might make you the best person for considering this question/proposal. I am interested in your thoughts of whether my idea is feasible and/or desirable. Eddie Blick ( talk) 02:17, 9 October 2015 (UTC) reply

Hey Eddie Blick! Great question. Yes, we do: 1) WP:TWL/OA is an index of useful open access sources and projects; 2) WP:TWL/Resources is a more comprehensive directory of generally useful free (but not necessarily open acess) resources. We even have a team of Research coordinators working on that, and you can sign up to be one if you like :) I hope this helps, I like your ideas, and feel free to reach out any time! Cheers, Jake Ocaasi (WMF) ( talk) 16:00, 9 October 2015 (UTC) reply
Thanks for your reply. I wasn't aware of those two sites; I'll take a look and find out what is available on them. I will also look at the information about being a research coordinator and see whether I might be able to contribute in that way. I appreciate your help. Eddie Blick ( talk) 19:22, 9 October 2015 (UTC) reply

Request for assistance

Hello : Today I have sent you an e mail asking for you counseling and assistance. Please take the time to look at it. Thanks Rudy235 ( talk) 00:19, 24 October 2015 (UTC) reply

How long is the Pete Forsyth panel discussion tonight

Thanks. Cheers! {{u| Checkingfax}} { Talk} 20:55, 28 October 2015 (UTC) reply

6pm-7:30pm California time. Watch it here: https://www.youtube.com/watch?v=t-cF7433aT4 Cheers, Ocaasi (WMF) ( talk) 21:14, 28 October 2015 (UTC) reply
See you there. Cheers! {{u| Checkingfax}} { Talk} 22:42, 28 October 2015 (UTC) reply

Welcome to The Wikipedia Adventure!

Hi Ocaasi (WMF)! We're so happy you wanted to play to learn, as a friendly and fun way to get into our community and mission. I think these links might be helpful to you as you get started.

-- 11:04, Sunday, November 15, 2015 ( UTC)

Welcome to The Wikipedia Adventure!

Hi Ocaasi (WMF)! We're so happy you wanted to play to learn, as a friendly and fun way to get into our community and mission. I think these links might be helpful to you as you get started.

-- 22:58, Thursday, November 19, 2015 ( UTC)

Telegram

Hello, Ocaasi (WMF). Please check your email; you've got mail!
Message added 18:58, 4 January 2016 (UTC). It may take a few minutes from the time the email is sent for it to show up in your inbox. You can remove this notice at any time by removing the {{ You've got mail}} or {{ ygm}} template.

-- samtar whisper 18:58, 4 January 2016 (UTC) reply

Phabricator

Hi Ocaasi, I dropped you a line a little while back - just letting you know I got a project created on Phabricator for tracking bugs more effectively! I've added all the old ones on there - please become a member so you can get updates and notifications :) -- samtar whisper 19:02, 7 January 2016 (UTC) reply

A barnstar for you!

The Tireless Contributor Barnstar
Thank you so much for creating the Wikipedia adventure! I love it and it is so helpful! Elsa Enchanted ( talk) 17:44, 4 April 2016 (UTC) reply

My kudos to Nikkimaria and your thanks

Well, it is Nikkimaria and (I guess) you who should be praised for granting us the access, not me ;).

Zezen ( talk) 20:59, 4 April 2016 (UTC), who has already researched in depth 19th c. Liberia migration policies using this resource and found a fascinating long-forgotten book from 1903 about this very subject, when a spade was still called a spade... reply

APPRECIATION Hey I just love the Wikipedia Adventure. I wonder where do you get such ideas. Must tell me the place!

Amy2563 ( talk) 17:22, 23 May 2016 (UTC) reply

Thanks you for the award

I am taking up the work/pleasure of editing in wikipedia (en español/spanish) and I found The Cure Award. It's very motivating, now I will put more effort to continue in this knowledge sharing adventure. I'm studying now to validate my doctor of medicine degree in USA, and editing Wikipedia is part of my studying routine. Thanks again, greetings, -- Behemot leviatan ( talk) 02:01, 1 June 2016 (UTC) reply

Help needed for OAbot

Hi Jake! Can you please see Help_talk:Citation_Style_1#Adding_open_access_links_to_references and lead the relevant discussion at WP:TWL? Cheers, − Pintoch ( talk) 17:35, 10 June 2016 (UTC) reply

You've got mail!

Hello, Ocaasi (WMF). Please check your email; you've got mail! The subject is Re: The Wikipedia Library coordination.
Message added 01:42, 1 October 2016 (UTC). It may take a few minutes from the time the email is sent for it to show up in your inbox. You can remove this notice at any time by removing the {{ You've got mail}} or {{ ygm}} template.

-- Cameron11598 (Talk) 01:42, 1 October 2016 (UTC) reply

Teahouse question unanswered

Someone had an idea for The Wikipedia Library but didn't get a response here. Thanks.— Vchimpanzee • talk • contributions • 19:25, 13 October 2016 (UTC) reply

Thanks, I replied! Best, Jake Ocaasi (WMF) ( talk) 15:35, 14 October 2016 (UTC) reply
Okay, where is the response, because I checked the Teahouse archives and the talk page of the person who asked for help.— Vchimpanzee • talk • contributions •
Funny, I found another response I left and confused it for the question you asked about. Replied new now on the user's Talk page. Thanks Vchimpanzee! Jake Ocaasi (WMF) ( talk) 21:43, 18 October 2016 (UTC) reply

Page for Litwin Books, LLC

Hi Jake! Good meeting you at DLF in Milwaukee. As you suggested, I created a page for Litwin Books and Library Juice Press in my sandbox page. You had offered to review it. Thanks! Rlitwin ( talk) 16:23, 7 December 2016 (UTC) reply

Hi Rlitwin! I added the draft template to the top of your page. Just click the blue button to submit it for review. I can't review it myself because of my conflict of interest with WMF staff and library publishers, but we have a team of volunteers who do just that! Cheers, Jake Ocaasi (WMF) ( talk) 19:39, 7 December 2016 (UTC) reply
Great - Thank you. Rlitwin ( talk)

You've got mail!!!!!!!

Hello, Ocaasi (WMF). Please check your email; you've got mail!
It may take a few minutes from the time the email is sent for it to show up in your inbox. You can remove this notice at any time by removing the {{ You've got mail}} or {{ ygm}} template.{{u| Checkingfax}} { Talk} 06:13, 1 April 2017 (UTC) reply

Welcome to The Wikipedia Adventure!

Hi Ocaasi (WMF)! We're so happy you wanted to play to learn, as a friendly and fun way to get into our community and mission. I think these links might be helpful to you as you get started.

-- 18:10, Wednesday, April 19, 2017 ( UTC)

Facto Post – Issue 1 – 14 June 2017

Facto Post – Issue 1 – 14 June 2017

Editorial

This newsletter starts with the motto "common endeavour for 21st century content". To unpack that slogan somewhat, we are particularly interested in the new, post-Wikidata collection of techniques that are flourishing under the Wikimedia collaborative umbrella. To linked data, SPARQL queries and WikiCite, add gamified participation, text mining and new holding areas, with bots, tech and humans working harmoniously.

Scientists, librarians and Wikimedians are coming together and providing a more unified view of an emerging area. Further integration of both its community and its technical aspects can be anticipated.

While Wikipedia will remain the discursive heart of Wikimedia, data-rich and semantic content will support it. We'll aim to be both broad and selective in our coverage. This publication Facto Post (the very opposite of retroactive) and call to action are brought to you monthly by ContentMine.

Links
Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 09:33, 14 June 2017 (UTC) reply

Facto Post – Issue 2 – 13 July 2017

Facto Post – Issue 2 – 13 July 2017

Editorial: Core models and topics

Wikimedians interest themselves in everything under the sun — and then some. Discussion on "core topics" may, oddly, be a fringe activity, and was popular here a decade ago.

The situation on Wikidata today does resemble the halcyon days of 2006 of the English Wikipedia. The growth is there, and the reliability and stylistic issues are not yet pressing in on the project. Its Berlin conference at the end of October will have five years of achievement to celebrate. Think Wikimania Frankfurt 2005.

Progress must be made, however, on referencing "core facts". This has two parts: replacing "imported from Wikipedia" in referencing by external authorities; and picking out statements, such as dates and family relationships, that must not only be reliable but be seen to be reliable.

In addition, there are many properties on Wikidata lacking a clear data model. An emerging consensus may push to the front key sourcing and biomedical properties as requiring urgent attention. Wikidata's "manual of style" is currently distributed over thousands of discussions. To make it coalesce, work on such a core is needed.

Links


Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

Facto Post – Issue 3 – 11 August 2017

Facto Post – Issue 3 – 11 August 2017

Wikimania report

Interviewed by Facto Post at the hackathon, Lydia Pintscher of Wikidata said that the most significant recent development is that Wikidata now accounts for one third of Wikimedia edits. And the essential growth of human editing.

Internet-In-A-Box

Impressive development work on Internet-in-a-Box featured in the WikiMedFoundation annual conference on Thursday. Hardware is Raspberry Pi, running Linux and the Kiwix browser. It can operate as a wifi hotspot and support a local intranet in parts of the world lacking phone signal. The medical use case is for those delivering care, who have smartphones but have to function in clinics in just such areas with few reference resources. Wikipedia medical content can be served to their phones, and power supplied by standard lithium battery packages.

Yesterday Katherine Maher unveiled the draft Wikimedia 2030 strategy, featuring a picturesque metaphor, "roads, bridges and villages". Here "bridges" could do with illustration. Perhaps it stands for engineering round or over the obstacles to progress down the obvious highways. Internet-in-a-Box would then do fine as an example.

"Bridging the gap" explains a take on that same metaphor, with its human component. If you are at Wikimania, come talk to WikiFactMine at its stall in the Community Village, just by the 3D-printed display for Bassel Khartabil; come hear T Arrow talk at 3 pm today in Drummond West, Level 3.

Link

  • Plaudit for the Medical Wikipedia app, content that is loaded into Internet-In-A-Box with other material, such as per-country documentation.
Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:55, 12 August 2017 (UTC) reply

Facto Post – Issue 4 – 18 September 2017

Facto Post – Issue 4 – 18 September 2017

Editorial: Conservation data

The IUCN Red List update of 14 September led with a threat to North American ash trees. The International Union for Conservation of Nature produces authoritative species listings that are peer-reviewed. Examples used as metonyms for loss of species and biodiversity, and discussion of extinction rates, are the usual topics covered in the media to inform us about this area. But actual data matters.

Dorstenia elata, a critically endangered South American herb, contained in Moraceae, the family of figs and mulberries

Clearly, conservation work depends on decisions about what should be done, and where. While animals, particularly mammals, are photogenic, species numbers run into millions. Plant species lie at the base of typical land-based food chains, and vegetation is key to the habitats of most animals.

ContentMine dictionaries, for example as tabulated at d:Wikidata:WikiFactMine/Dictionary list, enable detailed control of queries about endangered species, in their taxonomic context. To target conservation measures properly, species listings running into the thousands are not what is needed: range maps showing current distribution are. Between the will to act, and effective steps taken, the services of data handling are required. There is now no reason at all why Wikidata should not take up the burden.

Links

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 14:46, 18 September 2017 (UTC) reply

Facto Post – Issue 5 – 17 October 2017

Facto Post – Issue 5 – 17 October 2017

Editorial: Annotations

Annotation is nothing new. The glossators of medieval Europe annotated between the lines, or in the margins of legal manuscripts of texts going back to Roman times, and created a new discipline. In the form of web annotation, the idea is back, with texts being marked up inline, or with a stand-off system. Where could it lead?

1495 print version of the Digesta of Justinian, with the annotations of the glossator Accursius from the 13th century

ContentMine operates in the field of text and data mining (TDM), where annotation, simply put, can add value to mined text. It now sees annotation as a possible advance in semi-automation, the use of human judgement assisted by bot editing, which now plays a large part in Wikidata tools. While a human judgement call of yes/no, on the addition of a statement to Wikidata, is usually taken as decisive, it need not be. The human assent may be passed into an annotation system, and stored: this idea is standard on Wikisource, for example, where text is considered "validated" only when two different accounts have stated that the proof-reading is correct. A typical application would be to require more than one person to agree that what is said in the reference translates correctly into the formal Wikidata statement. Rejections are also potentially useful to record, for machine learning.

As a contribution to data integrity on Wikidata, annotation has much to offer. Some "hard cases" on importing data are much more difficult than average. There are for example biographical puzzles: whether person A in one context is really identical with person B, of the same name, in another context. In science, clinical medicine require special attention to sourcing ( WP:MEDRS), and is challenging in terms of connecting findings with the methodology employed. Currently decisions in areas such as these, on Wikipedia and Wikidata, are often made ad hoc. In particular there may be no audit trail for those who want to check what is decided.

Annotations are subject to a World Wide Web Consortium standard, and behind the terminology constitute a simple JSON data structure. What WikiFactMine proposes to do with them is to implement the MEDRS guideline, as a formal algorithm, on bibliographical and methodological data. The structure will integrate with those inputs the human decisions on the interpretation of scientific papers that underlie claims on Wikidata. What is added to Wikidata will therefore be supported by a transparent and rigorous system that documents decisions.

An example of the possible future scope of annotation, for medical content, is in the first link below. That sort of detailed abstract of a publication can be a target for TDM, adds great value, and could be presented in machine-readable form. You are invited to discuss the detailed proposal on Wikidata, via its talk page.

Links

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Opted-out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 08:46, 17 October 2017 (UTC) reply

Facto Post – Issue 6 – 15 November 2017

Facto Post – Issue 6 – 15 November 2017

WikidataCon Berlin 28–9 October 2017

WikidataCon 2017 group photo

Under the heading rerum causas cognescere, the first ever Wikidata conference got under way in the Tagesspiegel building with two keynotes, One was on YAGO, about how a knowledge base conceived ten years ago if you assume automatic compilation from Wikipedia. The other was from manager Lydia Pintscher, on the "state of the data". Interesting rumours flourished: the mix'n'match tool and its 600+ datasets, mostly in digital humanities, to be taken off the hands of its author Magnus Manske by the WMF; a Wikibase incubator site is on its way. Announcements came in talks: structured data on Wikimedia Commons is scheduled to make substantive progress by 2019. The lexeme development on Wikidata is now not expected to make the Wiktionary sites redundant, but may facilitate automated compilation of dictionaries.

WD-FIST explained

And so it went, with five strands of talks and workshops, through to 11 pm on Saturday. Wikidata applies to GLAM work via metadata. It may be used in education, raises issues such as author disambiguation, and lends itself to different types of graphical display and reuse. Many millions of SPARQL queries are run on the site every day. Over the summer a large open science bibliography has come into existence there.

Wikidata's fifth birthday party on the Sunday brought matters to a close. See a dozen and more reports by other hands.

Links

Editor Charles Matthews. Please leave feedback for him.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:02, 15 November 2017 (UTC) reply

Facto Post – Issue 7 – 15 December 2017

Facto Post – Issue 7 – 15 December 2017

A new bibliographical landscape

At the beginning of December, Wikidata items on individual scientific articles passed the 10 million mark. This figure contrasts with the state of play in early summer, when there were around half a million. In the big picture, Wikidata is now documenting the scientific literature at a rate that is about eight times as fast as papers are published. As 2017 ends, progress is quite evident.

Behind this achievement are a technical advance ( fatameh), and bots that do the lifting. Much more than dry migration of metadata is potentially involved, however. If paper A cites paper B, both papers having an item, a link can be created on Wikidata, and the information presented to both human readers, and machines. This cross-linking is one of the most significant aspects of the scientific literature, and now a long-sought open version is rapidly being built up.

The effort for the lifting of copyright restrictions on citation data of this kind has had real momentum behind it during 2017. WikiCite and the I4OC have been pushing hard, with the result that on CrossRef over 50% of the citation data is open. Now the holdout publishers are being lobbied to release rights on citations.

But all that is just the beginning. Topics of papers are identified, authors disambiguated, with significant progress on the use of the four million ORCID IDs for researchers, and proposals formulated to identify methodology in a machine-readable way. P4510 on Wikidata has been introduced so that methodology can sit comfortably on items about papers.

More is on the way. OABot applies the unpaywall principle to Wikipedia referencing. It has been proposed that Wikidata could assist WorldCat in compiling the global history of book translation. Watch this space.

And make promoting #1lib1ref one of your New Year's resolutions. Happy holidays, all!

November 2017 map of geolocated Wikidata items, made by Addshore

Links


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 14:54, 15 December 2017 (UTC) reply

Facto Post – Issue 8 – 15 January 2018

Facto Post – Issue 8 – 15 January 2018

Metadata on the March

From the days of hard-copy liner notes on music albums, metadata have stood outside a piece or file, while adding to understanding of where it comes from, and some of what needs to be appreciated about its content. In the GLAM sector, the accumulation of accurate metadata for objects is key to the mission of an institution, and its presentation in cataloguing.

Today Wikipedia turns 17, with worlds still to conquer. Zooming out from the individual GLAM object to the ontology in which it is set, one such world becomes apparent: GLAMs use custom ontologies, and those introduce massive incompatibilities. From a recent article by sadads, we quote the observation that "vocabularies needed for many collections, topics and intellectual spaces defy the expectations of the larger professional communities." A job for the encyclopedist, certainly. But the data-minded Wikimedian has the advantages of Wikidata, starting with its multilingual data, and facility with aliases. The controlled vocabulary — sometimes referred to as a "thesaurus" as term of art — simplifies search: if a "spade" must be called that, rather than "shovel", it is easier to find all spade references. That control comes at a cost.

SVG pedestrian crosses road
Zebra crossing/crosswalk, Singapore

Case studies in that article show what can lie ahead. The schema crosswalk, in jargon, is a potential answer to the GLAM Babel of proliferating and expanding vocabularies. Even if you have no interest in Wikidata as such, simply vocabularies V and W, if both V and W are matched to Wikidata, then a "crosswalk" arises from term v in V to w in W, whenever v and w both match to the same item d in Wikidata.

For metadata mobility, match to Wikidata. It's apparently that simple: infrastructure requirements have turned out, so far, to be challenges that can be met.

Links


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 12:38, 15 January 2018 (UTC) reply

Facto Post – Issue 9 – 5 February 2018

Facto Post – Issue 9 – 5 February 2018

m:Grants:Project/ScienceSource is the new ContentMine proposal: please take a look.

Wikidata as Hub

One way of looking at Wikidata relates it to the semantic web concept, around for about as long as Wikipedia, and realised in dozens of distributed Web institutions. It sees Wikidata as supplying central, encyclopedic coverage of linked structured data, and looks ahead to greater support for "federated queries" that draw together information from all parts of the emerging network of websites.

Another perspective might be likened to a photographic negative of that one: Wikidata as an already-functioning Web hub. Over half of its properties are identifiers on other websites. These are Wikidata's "external links", to use Wikipedia terminology: one type for the DOI of a publication, another for the VIAF page of an author, with thousands more such. Wikidata links out to sites that are not nominally part of the semantic web, effectively drawing them into a larger system. The crosswalk possibilities of the systematic construction of these links was covered in Issue 8.

Wikipedia:External links speaks of them as kept "minimal, meritable, and directly relevant to the article." Here Wikidata finds more of a function. On viaf.org one can type a VIAF author identifier into the search box, and find the author page. The Wikidata Resolver tool, these days including Open Street Map, Scholia etc., allows this kind of lookup. The hub tool by maxlath takes a major step further, allowing both lookup and crosswalk to be encoded in a single URL.

Links


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 11:50, 5 February 2018 (UTC) reply

Facto Post – Issue 10 – 12 March 2018

Facto Post – Issue 10 – 12 March 2018

Milestone for mix'n'match

Around the time in February when Wikidata clicked past item Q50000000, another milestone was reached: the mix'n'match tool uploaded its 1000th dataset. Concisely defined by its author, Magnus Manske, it works "to match entries in external catalogs to Wikidata". The total number of entries is now well into eight figures, and more are constantly being added: a couple of new catalogs each day is normal.

Since the end of 2013, mix'n'match has gradually come to play a significant part in adding statements to Wikidata. Particularly in areas with the flavour of digital humanities, but datasets can of course be about practically anything. There is a catalog on skyscrapers, and two on spiders.

These days mix'n'match can be used in numerous modes, from the relaxed gamified click through a catalog looking for matches, with prompts, to the fantastically useful and often demanding search across all catalogs. I'll type that again: you can search 1000+ datasets from the simple box at the top right. The drop-down menu top left offers "creation candidates", Magnus's personal favourite. m:Mix'n'match/Manual for more.

For the Wikidatan, a key point is that these matches, however carried out, add statements to Wikidata if, and naturally only if, there is a Wikidata property associated with the catalog. For everyone, however, the hands-on experience of deciding of what is a good match is an education, in a scholarly area, biographical catalogs being particularly fraught. Underpinning recent rapid progress is an open infrastructure for scraping and uploading.

Congratulations to Magnus, our data Stakhanovite!

Links

3D printing

To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 12:26, 12 March 2018 (UTC) reply

Facto Post – Issue 11 – 9 April 2018

Facto Post – Issue 11 – 9 April 2018

The 100 Skins of the Onion

Open Citations Month, with its eminently guessable hashtag, is upon us. We should be utterly grateful that in the past 12 months, so much data on which papers cite which other papers has been made open, and that Wikidata is playing its part in hosting it as "cites" statements. At the time of writing, there are 15.3M Wikidata items that can do that.

Pulling back to look at open access papers in the large, though, there is is less reason for celebration. Access in theory does not yet equate to practical access. A recent LSE IMPACT blogpost puts that issue down to "heterogeneity". A useful euphemism to save us from thinking that the whole concept doesn't fall into the realm of the oxymoron.

Some home truths: aggregation is not content management, if it falls short on reusability. The PDF file format is wedded to how humans read documents, not how machines ingest them. The salami-slicer is our friend in the current downloading of open access papers, but for a better metaphor, think about skinning an onion, laboriously, 100 times with diminishing returns. There are of the order of 100 major publisher sites hosting open access papers, and the predominant offer there is still a PDF.

Red onion cross section

From the discoverability angle, Wikidata's bibliographic resources combined with the SPARQL query are superior in principle, by far, to existing keyword searches run over papers. Open access content should be managed into consistent HTML, something that is currently strenuous. The good news, such as it is, would be that much of it is already in XML. The organisational problem of removing further skins from the onion, with sensible prioritisation, is certainly not insuperable. The CORE group (the bloggers in the LSE posting) has some answers, but actually not all that is needed for the text and data mining purposes they highlight. The long tail, or in other words the onion heart when it has become fiddly beyond patience to skin, does call for a pis aller. But the real knack is to do more between the XML and the heart.

Links


To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 16:25, 9 April 2018 (UTC) reply

Facto Post – Issue 12 – 28 May 2018

Facto Post – Issue 12 – 28 May 2018

ScienceSource funded

The Wikimedia Foundation announced full funding of the ScienceSource grant proposal from ContentMine on May 18. See the ScienceSource Twitter announcement and 60 second video.

A medical canon?

The proposal includes downloading 30,000 open access papers, aiming (roughly speaking) to create a baseline for medical referencing on Wikipedia. It leaves open the question of how these are to be chosen.

The basic criteria of WP:MEDRS include a concentration on secondary literature. Attention has to be given to the long tail of diseases that receive less current research. The MEDRS guideline supposes that edge cases will have to be handled, and the premature exclusion of publications that would be in those marginal positions would reduce the value of the collection. Prophylaxis misses the point that gate-keeping will be done by an algorithm.

Two well-known but rather different areas where such considerations apply are tropical diseases and alternative medicine. There are also a number of potential downloading troubles, and these were mentioned in Issue 11. There is likely to be a gap, even with the guideline, between conditions taken to be necessary but not sufficient, and conditions sufficient but not necessary, for candidate papers to be included. With around 10,000 recognised medical conditions in standard lists, being comprehensive is demanding. With all of these aspects of the task, ScienceSource will seek community help.

Links

OpenRefine logo, courtesy of Google

To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see below.
Editor Charles Matthews, for ContentMine. Please leave feedback for him. Back numbers are here.
Reminder: WikiFactMine pages on Wikidata are at WD:WFM. ScienceSource pages will be announced there, and in this mass message.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:16, 28 May 2018 (UTC) reply

Facto Post – Issue 13 – 29 May 2018

Facto Post – Issue 13 – 29 May 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Respecting MEDRS

Facto Post enters its second year, with a Cambridge Blue (OK, Aquamarine) background, a new logo, but no Cambridge blues. On-topic for the ScienceSource project is a project page here. It contains some case studies on how the WP:MEDRS guideline, for the referencing of articles at all related to human health, is applied in typical discussions.

Close to home also, a template, called {{ medrs}} for short, is used to express dissatisfaction with particular references. Technology can help with patrolling, and this Petscan query finds over 450 articles where there is at least one use of the template. Of course the template is merely suggesting there is a possible issue with the reliability of a reference. Deciding the truth of the allegation is another matter.

This maintenance issue is one example of where ScienceSource aims to help. Where the reference is to a scientific paper, its type of algorithm could give a pass/fail opinion on such references. It could assist patrollers of medical articles, therefore, with the templated references and more generally. There may be more to proper referencing than that, indeed: context, quite what the statement supported by the reference expresses, prominence and weight. For that kind of consideration, case studies can help. But an algorithm might help to clear the backlog.

Evidence pyramid leading up to clinical guidelines, from WP:MEDRS
Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 18:19, 29 June 2018 (UTC) reply

Facto Post – Issue 14 – 21 July 2018

Facto Post – Issue 14 – 21 July 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Plugging the gaps – Wikimania report

Officially it is "bridging the gaps in knowledge", with Wikimania 2018 in Cape Town paying tribute to the southern African concept of ubuntu to implement it. Besides face-to-face interactions, Wikimedians do need their power sources.

Hackathon mentoring table wiring

Facto Post interviewed Jdforrester, who has attended every Wikimania, and now works as Senior Product Manager for the Wikimedia Foundation. His take on tackling the gaps in the Wikimedia movement is that "if we were an army, we could march in a column and close up all the gaps". In his view though, that is a faulty metaphor, and it leads to a completely false misunderstanding of the movement, its diversity and different aspirations, and the nature of the work as "fighting" to be done in the open sector. There are many fronts, and as an eventualist he feels the gaps experienced both by editors and by users of Wikimedia content are inevitable. He would like to see a greater emphasis on reuse of content, not simply its volume.

If that may not sound like radicalism, the Decolonizing the Internet conference here organized jointly with Whose Knowledge? can redress the picture. It comes with the claim to be "the first ever conference about centering marginalized knowledge online".

Plugbar buildup at the Hackathon
Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 06:10, 21 July 2018 (UTC) reply

Facto Post – Issue 15 – 21 August 2018

Facto Post – Issue 15 – 21 August 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Neglected diseases
Anti-parasitic drugs being distributed in Côte d'Ivoire
What's a Neglected Disease?, ScienceSource video

To grasp the nettle, there are rare diseases, there are tropical diseases and then there are "neglected diseases". Evidently a rare enough disease is likely to be neglected, but neglected disease these days means a disease not rare, but tropical, and most often infectious or parasitic. Rare diseases as a group are dominated, in contrast, by genetic diseases.

A major aspect of neglect is found in tracking drug discovery. Orphan drugs are those developed to treat rare diseases (rare enough not to have market-driven research), but there is some overlap in practice with the WHO's neglected diseases, where snakebite, a "neglected public health issue", is on the list.

From an encyclopedic point of view, lack of research also may mean lack of high-quality references: the core medical literature differs from primary research, since it operates by aggregating trials. This bibliographic deficit clearly hinders Wikipedia's mission. The ScienceSource project is currently addressing this issue, on Wikidata. Its Wikidata focus list at WD:SSFL is trying to ensure that neglect does not turn into bias in its selection of science papers.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 13:23, 21 August 2018 (UTC) reply

Facto Post – Issue 16 – 30 September 2018

Facto Post – Issue 16 – 30 September 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

The science publishing landscape

In an ideal world ... no, bear with your editor for just a minute ... there would be a format for scientific publishing online that was as much a standard as SI units are for the content. Likewise cataloguing publications would not be onerous, because part of the process would be to generate uniform metadata. Without claiming it could be the mythical free lunch, it might be reasonably be argued that sandwiches can be packaged much alike and have barcodes, whatever the fillings.

The best on offer, to stretch the metaphor, is the meal kit option, in the form of XML. Where scientific papers are delivered as XML downloads, you get all the ingredients ready to cook. But have to prepare the actual meal of slow food yourself. See Scholarly HTML for a recent pass at heading off XML with HTML, in other words in the native language of the Web.

The argument from real life is a traditional mixture of frictional forces, vested interests, and the classic irony of the principle of unripe time. On the other hand, discoverability actually diminishes with the prolific progress of science publishing. No, it really doesn't scale. Wikimedia as movement can do something in such cases. We know from open access, we grok the Web, we have our own horse in the HTML race, we have Wikidata and WikiJournal, and we have the chops to act.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 17:57, 30 September 2018 (UTC) reply

Facto Post – Issue 17 – 29 October 2018

Facto Post – Issue 17 – 29 October 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Wikidata imaged

Around 2.7 million Wikidata items have an illustrative image. These files, you might say, are Wikimedia's stock images, and if the number is large, it is still only 5% or so of items that have one. All such images are taken from Wikimedia Commons, which has 50 million media files. One key issue is how to expand the stock.

Indeed, there is a tool. WD-FIST exploits the fact that each Wikipedia is differently illustrated, mostly with images from Commons but also with fair use images. An item that has sitelinks but no illustrative image can be tested to see if the linked wikis have a suitable one. This works well for a volunteer who wants to add images at a reasonable scale, and a small amount of SPARQL knowledge goes a long way in producing checklists.

Gran Teatro, Cáceres, Spain, at night

It should be noted, though, that there are currently 53 Wikidata properties that link to Commons, of which P18 for the basic image is just one. WD-FIST prompts the user to add signatures, plaques, pictures of graves and so on. There are a couple of hundred monograms, mostly of historical figures, and this query allows you to view all of them. commons:Category:Monograms and its subcategories provide rich scope for adding more.

And so it is generally. The list of properties linking to Commons does contain a few that concern video and audio files, and rather more for maps. But it contains gems such as P3451 for "nighttime view". Over 1000 of those on Wikidata, but as for so much else, there could be yet more.

Go on. Today is Wikidata's birthday. An illustrative image is always an acceptable gift, so why not add one? You can follow these easy steps: (i) log in at https://tools.wmflabs.org/widar/, (ii) paste the Petscan ID 6263583 into https://tools.wmflabs.org/fist/wdfist/ and click run, and (iii) just add cake.

Birthday logo
Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 15:01, 29 October 2018 (UTC) reply

Facto Post – Issue 18 – 30 November 2018

Facto Post – Issue 18 – 30 November 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

WikiCite issue

GLAM ♥ data — what is a gallery, library, archive or museum without a catalogue? It follows that Wikidata must love librarians. Bibliography supports students and researchers in any topic, but open and machine-readable bibliographic data even more so, outside the silo. Cue the WikiCite initiative, which was meeting in conference this week, in the Bay Area of California.

Wikidata training for librarians at WikiCite 2018

In fact there is a broad scope: "Open Knowledge Maps via SPARQL" and the "Sum of All Welsh Literature", identification of research outputs, Library.Link Network and Bibframe 2.0, OSCAR and LUCINDA (who they?), OCLC and Scholia, all these co-exist on the agenda. Certainly more library science is coming Wikidata's way. That poses the question about the other direction: is more Wikimedia technology advancing on libraries? Good point.

Wikimedians generally are not aware of the tech background that can be assumed, unless they are close to current training for librarians. A baseline definition is useful here: " bash, git and OpenRefine". Compare and contrast with pywikibot, GitHub and mix'n'match. Translation: scripting for automation, version control, data set matching and wrangling in the large, are on the agenda also for contemporary library work. Certainly there is some possible common ground here. Time to understand rather more about the motivations that operate in the library sector.

Links

Account creation is now open on the ScienceSource wiki, where you can see SPARQL visualisations of text mining.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 11:20, 30 November 2018 (UTC) reply

Facto Post – Issue 19 – 27 December 2018

Facto Post – Issue 19 – 27 December 2018

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Learning from Zotero

Zotero is free software for reference management by the Center for History and New Media: see Wikipedia:Citing sources with Zotero. It is also an active user community, and has broad-based language support.

Zotero logo

Besides the handiness of Zotero's warehousing of personal citation collections, the Zotero translator underlies the citoid service, at work behind the VisualEditor. Metadata from Wikidata can be imported into Zotero; and in the other direction the zotkat tool from the University of Mannheim allows Zotero bibliographies to be exported to Wikidata, by item creation. With an extra feature to add statements, that route could lead to much development of the focus list (P5008) tagging on Wikidata, by WikiProjects.

Zotero demo video

There is also a large-scale encyclopedic dimension here. The construction of Zotero translators is one facet of Web scraping that has a strong community and open source basis. In that it resembles the less formal mix'n'match import community, and growing networks around other approaches that can integrate datasets into Wikidata, such as the use of OpenRefine.

Looking ahead, the thirtieth birthday of the World Wide Web falls in 2019, and yet the ambition to make webpages routinely readable by machines can still seem an ever-retreating mirage. Wikidata should not only be helping Wikimedia integrate its projects, an ongoing process represented by Structured Data on Commons and lexemes. It should also be acting as a catalyst to bring scraping in from the cold, with institutional strengths as well as resourceful code.

Links

Diversitech, the latest ContentMine grant application to the Wikimedia Foundation, is in its community review stage until January 2.

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 19:08, 27 December 2018 (UTC) reply

Facto Post – Issue 20 – 31 January 2019

Facto Post – Issue 20 – 31 January 2019

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Everything flows (and certainly data does)

Recently Jimmy Wales has made the point that computer home assistants take much of their data from Wikipedia, one way or another. So as well as getting Spotify to play Frosty the Snowman for you, they may be able to answer the question "is the Pope Catholic?" Possibly by asking for disambiguation ( Coptic?).

Amazon Echo device using the Amazon Alexa service in voice search showdown with the Google rival on an Android phone

Headlines about data breaches are now familiar, but the unannounced circulation of information raises other issues. One of those is Gresham's law stated as "bad data drives out good". Wikipedia and now Wikidata have been criticised on related grounds: what if their content, unattributed, is taken to have a higher standing than Wikimedians themselves would grant it? See Wikiquote on a misattribution to Bismarck for the usual quip about "law and sausages", and why one shouldn't watch them in the making.

Wikipedia has now turned 18, so should act like as adult, as well as being treated like one. The Web itself turns 30 some time between March and November this year, per Tim Berners-Lee. If the Knowledge Graph by Google exemplifies Heraclitean Web technology gaining authority, contra GIGO, Wikimedians still have a role in its critique. But not just with the teenage skill of detecting phoniness.

There is more to beating Gresham than exposing the factoid and urban myth, where WP:V does do a great job. Placeholders must be detected, and working with Wikidata is a good way to understand how having one statement as data can blind us to replacing it by a more accurate one. An example that is important to open access is that, firstly, the term itself needs considerable unpacking, because just being able to read material online is a poor relation of "open"; and secondly, trying to get Creative Commons license information into Wikidata shows up issues with classes of license (such as CC-BY) standing for the actual license in major repositories. Detailed investigation shows that "everything flows" exacerbates the issue. But Wikidata can solve it.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:53, 31 January 2019 (UTC) reply

Facto Post – Issue 21 – 28 February 2019

Facto Post – Issue 21 – 28 February 2019

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

What is a systematic review?

Systematic reviews are basic building blocks of evidence-based medicine, surveys of existing literature devoted typically to a definite question that aim to bring out scientific conclusions. They are principled in a way Wikipedians can appreciate, taking a critical view of their sources.

PRISMA flow diagram for a systematic review

Ben Goldacre in 2014 wrote (link below) "[...] : the "information architecture" of evidence based medicine (if you can tolerate such a phrase) is a chaotic, ad hoc, poorly connected ecosystem of legacy projects. In some respects the whole show is still run on paper, like it's the 19th century." Is there a Wikidatan in the house? Wouldn't some machine-readable content that is structured data help?

File:Schittny, Facing East, 2011, Legacy Projects.jpg
2011 photograph by Bernard Schittny of the "Legacy Projects" group

Most likely it would, but the arcana of systematic reviews and how they add value would still need formal handling. The PRISMA standard dates from 2009, with an update started in 2018. The concerns there include the corpus of papers used: how selected and filtered? Now that Wikidata has a 20.9 million item bibliography, one can at least pose questions. Each systematic review is a tagging opportunity for a bibliography. Could that tagging be reproduced by a query, in principle? Can it even be second-guessed by a query (i.e. simulated by a protocol which translates into SPARQL)? Homing in on the arcana, do the inclusion and filtering criteria translate into metadata? At some level they must, but are these metadata explicitly expressed in the articles themselves? The answer to that is surely "no" at this point, but can TDM find them? Again "no", right now. Automatic identification doesn't just happen.

Actually these questions lack originality. It should be noted though that WP:MEDRS, the reliable sources guideline used here for health information, hinges on the assumption that the usefully systematic reviews of biomedical literature can be recognised. Its nutshell summary, normally the part of a guideline with the highest density of common sense, allows literature reviews in general validity, but WP:MEDASSESS qualifies that indication heavily. Process wonkery about systematic reviews definitely has merit.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 10:02, 28 February 2019 (UTC) reply

Facto Post – Issue 22 – 28 March 2019

Facto Post – Issue 22 – 28 March 2019

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

When in the cloud, do as the APIs do

Half a century ago, it was the era of the mainframe computer, with its air-conditioned room, twitching tape-drives, and appearance in the title of a spy novel Billion-Dollar Brain then made into a Hollywood film. Now we have the cloud, with server farms and the client–server model as quotidian: this text is being typed on a Chromebook.

File:Cloud-API-Logo.svg
Logo of Cloud API on Google Cloud Platform

The term Applications Programming Interface or API is 50 years old, and refers to a type of software library as well as the interface to its use. While a compiler is what you need to get high-level code executed by a mainframe, an API out in the cloud somewhere offers a chance to perform operations on a remote server. For example, the multifarious bots active on Wikipedia have owners who exploit the MediaWiki API.

APIs (called RESTful) that allow for the GET HTTP request are fundamental for what could colloquially be called "moving data around the Web"; from which Wikidata benefits 24/7. So the fact that the Wikidata SPARQL endpoint at query.wikidata.org has a RESTful API means that, in lay terms, Wikidata content can be GOT from it. The programming involved, besides the SPARQL language, could be in Python, younger by a few months than the Web.

Magic words, such as occur in fantasy stories, are wishful (rather than RESTful) solutions to gaining access. You may need to be a linguist to enter Ali Baba's cave or the western door of Moria (French in the case of " Open Sesame", in fact, and Sindarin being the respective languages). Talking to an API requires a bigger toolkit, which first means you have to recognise the tools in terms of what they can do. On the way to the wikt:impactful or polymathic modern handling of facts, one must perhaps take only tactful notice of tech's endemic problem with documentation, and absorb the insightful point that the code in APIs does articulate the customary procedures now in place on the cloud for getting information. As Owl explained to Winnie-the-Pooh, it tells you The Thing to Do.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 11:45, 28 March 2019 (UTC) reply

Facto Post – Issue 23 – 30 April 2019

Facto Post – Issue 23 – 30 April 2019

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.

Completely clouded?
Cloud computing logo

Talk of cloud computing draws a veil over hardware, but also, less obviously but more importantly, obscures such intellectual distinction as matters most in its use. Wikidata begins to allow tasks to be undertaken that were out of easy reach. The facility should not be taken as the real point.

Coming in from another angle, the "executive decision" is more glamorous; but the "administrative decision" should be admired for its command of facts. Think of the attitudes ad fontes, so prevalent here on Wikipedia as "can you give me a source for that?", and being prepared to deal with complicated analyses into specified subcases. Impatience expressed as a disdain for such pedantry is quite understandable, but neither dirty data nor false dichotomies are at all good to have around.

Issue 13 and Issue 21, respectively on WP:MEDRS and systematic reviews, talk about biomedical literature and computing tasks that would be of higher quality if they could be made more "administrative". For example, it is desirable that the decisions involved be consistent, explicable, and reproducible by non-experts from specified inputs.

What gets clouded out is not impossibly hard to understand. You do need to put together the insights of functional programming, which is a doctrinaire and purist but clearcut approach, with the practicality of office software. Loopless computation can be conceived of as a seamless forward march of spreadsheet columns, each determined by the content of previous ones. Very well: to do a backward audit, when now we are talking about Wikidata, we rely on integrity of data and its scrupulous sourcing: and clearcut case analyses. The MEDRS example forces attention on purge attempts such as Beall's list.

Links

If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 11:27, 30 April 2019 (UTC) reply

Facto Post – Issue 24 – 17 May 2019

Facto Post – Issue 24 – 17 May 2019
Text mining display of noun phrases from the US Presidential Election 2012

The Editor is Charles Matthews, for ContentMine. Please leave feedback for him, on his User talk page.
To subscribe to Facto Post go to Wikipedia:Facto Post mailing list. For the ways to unsubscribe, see the footer.
Semantic Web and TDM – a ContentMine view

Two dozen issues, and this may be the last, a valediction at least for a while.

It's time for a two-year summation of ContentMine projects involving TDM ( text and data mining).

Wikidata and now Structured Data on Commons represent the overlap of Wikimedia with the Semantic Web. This common ground is helping to convert an engineering concept into a movement. TDM generally has little enough connection with the Semantic Web, being instead in the orbit of machine learning which is no respecter of the semantic. Don't break a taboo by asking bots "and what do you mean by that?"

The ScienceSource project innovates in TDM, by storing its text mining results in a Wikibase site. It strives for compliance of its fact mining, on drug treatments of diseases, with an automated form of the relevant Wikipedia referencing guideline MEDRS. Where WikiFactMine set up an API for reuse of its results, ScienceSource has a SPARQL query service, with look-and-feel exactly that of Wikidata's at query.wikidata.org. It also now has a custom front end, and its content can be federated, in other words used in data mashups: it is one of over 50 sites that can federate with Wikidata.

The human factor comes to bear through the front end, which combines a link to the HTML version of a paper, text mining results organised in drug and disease columns, and a SPARQL display of nearby drug and disease terms. Much software to develop and explain, so little time! Rather than telling the tale, Facto Post brings you ScienceSource links, starting from the how-to video, lower right.

ScienceSourceReview, introductory video: but you need run it from the original upload file on Commons
Links for participation

The review tool requires a log in on sciencesource.wmflabs.org, and an OAuth permission (bottom of a review page) to operate. It can be used in simple and more advanced workflows. Examples of queries for the latter are at d:Wikidata_talk:ScienceSource project/Queries#SS_disease_list and d:Wikidata_talk:ScienceSource_project/Queries#NDF-RT issue.

Please be aware that this is a research project in development, and may have outages for planned maintenance. That will apply for the next few days, at least. The ScienceSource wiki main page carries information on practical matters. Email is not enabled on the wiki: use site mail here to Charles Matthews in case of difficulty, or if you need support. Further explanatory videos will be put into commons:Category:ContentMine videos.


If you wish to receive no further issues of Facto Post, please remove your name from our mailing list. Alternatively, to opt out of all massmessage mailings, you may add Category:Wikipedians who opt out of message delivery to your user talk page.
Newsletter delivered by MediaWiki message delivery

MediaWiki message delivery ( talk) 18:52, 17 May 2019 (UTC) reply

Invitation to participate in a discussion about publicly disclosing subscribers of TWL resources

Hi Ocaasi (WMF), I have started a discussion over our Village pump with the aim of maintaining a public list of all editors who are granted access to any TWL resource. Your thoughts and opinions on the proposal are welcome:-) Regards, WBG converse

not sure if its of much value, but wrote an diversity and inclusion essay here, if youre interested

Any thoughts appreciated :) -- [E.3] [chat2] [me] 18:36, 13 August 2019 (UTC) reply


Videos

Youtube | Vimeo | Bing

Websites

Google | Yahoo | Bing

Encyclopedia

Google | Yahoo | Bing

Facebook