The October drive reduced the backlog from 9,700 to an amazing 0! Congratulations to WaddlesJP13 who led with 2084 points. See
this page for further details. The queue is steadily rising again and is approaching 2,000. It would be great if <2,000 were the “new normal”. Please continue to help out even if it's only for a few or even one patrol a day.
2022 Awards
Onel5969 won the 2022 cup for 28,302 article reviews last year - that's an average of nearly 80/day. There was one Gold Award (5000+ reviews), 11 Silver (2000+), 28 Iron (360+) and 39 more for the 100+ barnstar. Rosguill led again for the 4th year by clearing 49,294 redirects. For the full details see the
Awards page and the
Hall of Fame. Congratulations everyone!
Minimum deletion time: The previous
WP:NPP guideline was to wait 15 minutes before tagging for deletion (including draftification and
WP:BLAR). Due to complaints, a consensus decided to raise the time to 1 hour. To illustrate this, very new pages in the
feed are now highlighted in red. (As always, this is not applicable to attack pages, copyvios, vandalism, etc.)
New draftify script: In response to feedback from AFC, the Move to Draft script now provides a choice of set messages that also link the creator to a new, friendly
explanation page. The script also warns reviewers if the creator is probably still developing the article. The former script is no longer maintained. Please edit your
common.js or vector.js file to change User:Evad37/MoveToDraft.js to User:MPGuy2824/MoveToDraft.js.
Redirects: Some of our redirect reviewers have reduced their activity and the backlog is up to 9,000+ (two months deep). If you are interested in this distinctly different task and need any help, see
this guide,
this checklist, and spend some time at
WP:RFD.
Discussions with the WMF: The
PageTriage open letter signed by 444 users is bearing fruit. The Growth Team has assigned some software engineers to work on PageTriage, the software that powers the NewPagesFeed and the Page Curation toolbar. WMF has submitted
dozens of patches in the last few weeks to modernize PageTriage's code, which will make it easier to write patches in the future. This work is helpful but is not very visible to the end user. For patches visible to the end user, volunteers such as Novem Linguae and MPGuy2824 have been writing patches for bug reports and feature requests. The Growth Team also
had a video conference with the NPP coordinators to discuss
revamping the landing pages that new users see.
Reminders
Newsletter feedback - please take this
short poll about the newsletter.
If you no longer wish to be a reviewer, please ask any admin to remove you from the group. If you want the tools back again, just ask at
PERM.
To opt out of future mailings, please remove yourself
here.
New Pages Patrol newsletter June 2023
Hello Jianhui67,
Backlog
Redirect drive: In response to an unusually high redirect backlog, we held a redirect backlog drive in May. The drive completed with 23851 reviews done in total, bringing the redirect backlog to 0 (momentarily). Congratulations to Hey man im josh who led with a staggering 4316 points, followed by Meena and Greyzxq with 2868 and 2546 points respectively. See
this page for more details. The redirect queue is rising again and is steadily approaching 4,000. Please continue to help out, even if it's only a few reviews, or even one, a day.
Redirect autopatrol: All administrators without autopatrol have now been added to the redirect autopatrol list. If you see any users who consistently create significant amounts of good quality redirects, consider requesting redirect autopatrol for them
here.
WMF work on PageTriage: The
WMF Moderator Tools team, consisting of Sam,
Jason and
Susana, with additional patches from Jon, has been hard at work
updating PageTriage. They are focusing their efforts on modernising the extension's code rather than on bug fixes or new features, though some user-facing work will be prioritised. This will help make sure that this extension is not deprecated, and is easier to work on in the future. In the next month or so, we will have an opt-in
beta test where new page patrollers can help test the rewrite of
Special:NewPagesFeed, to help find bugs. We will post more details at
WT:NPPR when we are ready for beta testers.
Articles for Creation (AFC): All new page reviewers are now automatically approved for Articles for Creation draft reviewing (you do not need to apply at
WT:AFCP as was previously required). To install the
AFC helper script, visit
Special:Preferences, visit the Gadgets tab, tick "Yet Another AFC Helper Script", then click "Save". To find drafts to review, visit
Special:NewPagesFeed, and at the top left, tick "Articles for Creation". To review a draft, visit a submitted draft, click on the "More" menu, then click "Review (AFCH)". You can also comment on and submit drafts that are unsubmitted using the script.
You can review the AFC workflow at
WP:AFCR. It is up to you whether you also mark your AFC accepts as NPP reviewed (this is allowed but optional, depending on whether you would like a second set of eyes on your accept). Don't forget that
draftspace is optional, so moves of drafts to mainspace (even if they are not ready) should not be reverted, except possibly if there is conflict of interest.
Pro tip: Did you know that visual artists such as painters have their own
SNG? The parts of these "creative professionals" criteria that most commonly apply to artists are
WP:ARTIST 4b (solo exhibition, not group exhibition, at a major museum) and 4d (being represented within the permanent collections of two museums).
Reminders
Newsletter feedback - please take this
short poll about the newsletter.
To opt out of future mailings, please remove yourself
here.
Wikidata weekly summary #608
Here's your quick overview of what has been happening around Wikidata over the last week.
Welcome to 2023’s Final Weekly Summary!
A big thank you to everyone who contributed to the newsletter this year!👏🙏 As we step into 2024, we'd love to hear what changes you would like to see in the newsletter. Share your wishlist here:
What changes would you like to see in the newsletter in 2024?
Discussions
Open request for adminship:
EPIC (RfP scheduled to end after 26 December 2023 20:34 UTC)
New requests for permissions/Bot:
Balyozbot. Tasks:
Import sitelinks, labels, descriptions from ku wikipedia pages which use the template
w:ku:Template:Înterwîkî etîket û danasîn. (There are over 1800 articles that use this template waiting to be connected to Wikidata at the moment.)
Add sitelinks to kuwiktionary / kuwikipedia categories / create an item for the category if necessary. I have been doing this manually for quite some time using Quickstatements but since I need to get permission for the first task, I will be handling them using a bot as well.
Upcoming:
Introducing WMF Wishathon for Wikimedia’s Community Wishlist! "focused on bringing together people who already contribute to technical aspects of the Wikimedia projects, who know how to find their way on the technical ecosystem, and who are able to work or collaborate on projects rather autonomously." March 15th to 17th, 2024.
African Librarians empowered to share knowledge and enhance information visibility through AfLIA Wikidata Online Course --> The "Promoting Open Knowledge Practices in African Libraries through Wikidata" project, executed by AfLIA with support from the Wikimedia Foundation, trained African librarians on using Wikidata to enhance the visibility of library collections and close the knowledge and gender gap on Africa. The course was facilitated by experienced African Wikimedian editors and included diverse strategies for learner engagement and support.
Papers:
Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs (Conia et al., 2023) --> This paper introduces a novel task of automatic Knowledge Graph Enhancement (KGE) to bridge the gap in the quantity and quality of textual information between English and non-English languages in Wikidata. It presents M-NTA, an unsupervised approach that combines Machine Translation, Web Search, and Large Language Models to generate high-quality textual information, and studies its impact on Entity Linking, Knowledge Graph Completion, and Question Answering tasks.
Videos
Wikidata, Wikisource and Wiktionary: Wikisource for DH (WiSe 2023) --> The lecture "Fundamentals and application-oriented methods of the Digital Humanities" by Kay-Michael Würzner is designed as a series of lectures in which teachers in the "Digital Humanities" course present their fields of work and key topics and present them for discussion.
Empowering Open-Source Generative AI by Integrating the Wikidata knowledge graph --> Generative AI has changed the information ecosystem, and open-source knowledge graphs like Wikidata can become invaluable assets, propelling a myriad of applications forward. Jonathan Fraine & Lydia Pintscher present the practical integration of Wikidata's open-source, open-access knowledge graph to empower Generative AI applications. Harnessing the real-time updated, structured data encapsulated within Wikidata, they explore automated content creation, data augmentation, and semantic analysis, underpinning the generative paradigms. Through a blend of theoretical insights and real-world applications, they elucidate how to leverage Wikidata to elevate generative AI applications, breaking down existing data silos, and fostering a collaborative ecosystem within our global community of developers and contributors.
Wiki Indaba 2023 - African content on Wikidata --> Discussion with Alice Kibombo, Georges Fodouop and Jesse Asiedu-Akrofi about Wikidata for African Librarians during the Wiki Indaba conference, which took place on 3-5 November 2023 in Agadir, Morocco.
No Time to Wait - S07E10 - ACMI // Wikidata - Paul Duchesne + Simon Loffler --> Report on a recent residency program to extensively link together collection data from ACMI with Wikidata. This work has allowed the organisation to import vast quantities of data and media to enrich their own internet collection experience, as well as enable writing information back to source and federating with other linked institutions.
Map of K-Pop Idols --> An interactive map where each red dot represents a K-pop Idol (a singer or musician in South Korean Pop music) you are able to click on.
Disney as the Mega Corporation it is Today --> Disney has greatly evolved from the simple animation company that first debuted in 1923 with its signature Steamboat Willie animation. This analysis details some of the major acquisitions Disney has chosen to help expand its reach as a media and entertainment company.
State of statues in the US --> Map of how many statues there are, who is depicted in the statues, their genders, and where the statues are concentrated.
An Analysis on Nepo Babies: Net Worths and Fame --> This work uses Wikidata to analyze the influence and success of children of famous actors (nepo babies) in the entertainment industry, and compares the careers and net worth of these children with their parents to understand the impact of nepotism on their success.
Tool of the week
Cersei - is a tool designed for importing or scraping data from various third-party sources, using source-specific Python code. It can use a "headless browser" to scrape complicated websites that rely on, e.g., JavaScript to navigate. It can therefore access data sources that cannot be accessed via, e.g., Mix'n'match. The data from sources can be updated regularly, either for everything, or just changed entries (if the source has a "recent changes" equivalent).
Wikidata:Zotero/Cita - is a Wikidata addon for Zotero that adds citations (i.e., what other items an item cites) metadata support to this open source reference management software, using
cites work (P2860) information available from Wikidata, and enabling users to easily contribute missing data.
production manager (manager that is responsible for the administration of a feature film or television production; oversees production plans, controls resources, initiates production, ensures ongoing operations, monitors schedules and expenditures, and creates a detailed production schedule and budget)
Newest
WikiProjects:
WikiProject Städel Museum Wikidata Clean-Up - This WikiProject from the Städel Museum aims to actively participate in the Wikimedia community by maintaining and updating the quality of its data. This includes their collection of public domain art, which has been digitized and made freely available for public use. The project focuses on ensuring that the most current and high-quality data, including high-resolution images and improved metadata, are available on platforms like Wikimedia Commons and Wikidata.
Upcoming: The next
Wikidata+Wikibase office hours will take place on Wednesday, 17th January 2024, at 17:00 UTC (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Papers:
Improving maintenance of community-based knowledge graphs. This paper by Nicolas Ferranti addresses the critical issue of data quality in open knowledge graphs, with a specific focus on Wikidata. It aims to formalize Wikidata's unique approaches to assess and resolve data inconsistencies, proposing a semi-automatic refinement pipeline to empower the Wikidata user community in maintaining and enhancing the reliability of this extensive collaborative knowledge graph.
Videos:
WikidataCon 2023 Day 1.5 - The past and future of Wikidata. In this video, Lydia Pintscher reviews the major events of Wikidata over the past few years, then looks ahead to predict what Wikidata's prospects will be over the next year.
Tool of the week
WICA: Wikidata's insights for created articles is an updated version of an old tool. It now includes many new features to analyse your list of created articles using Wikidata properties.
Nonprofit Status (indicates the legal and tax status of a non-profit organization (specific to the legal areas served, i.e. countries). Addition to legal form (P1454). Equivalent property (P1628): nonprofitStatus (https://schema.org/nonprofitStatus) from schema.org. Organizations can have multiple nonprofit statuses from different countries.)
creative director (person who makes high-level creative decisions, oversees the creation of creative assets such as adverts, products, events or logos and guides and directs the creative people who create the end result)
The next
Wikidata+Wikibase office hours will take place at 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Blogs:
PubChem on Wikidata – What is the state of coverage? by Tiago Lubiana. In summary, Wikidata has good coverage of the structured chemical data in PubChem, though there are improvement points. PubChem displays, and will always display, textual information and vendor-specific data that do not fit Wikidata, but they are complementary tools in the ecosystem of open chemical data.
LIS Journals’ Lack of Participation in Wikidata Item Creation by Eric Willey & Susan Radovsky discusses the gap in Wikidata items being created for scholarly articles by the scholars themselves, and whether this can lead to inconsistent or inaccurate data models.
Quantifying Americanization: Coverage of American Topics in Different Wikipedias: this paper asks whether there is an americanisation bias in the content created by the communities. By Piotr Konieczny & Włodzimierz Lewoniewski.
Videos
Map Kerala Initiative is an opendata portal geospatial map powered by Wikidata and OpenStreetMap, introduced by Manoj Karingamadathil.
Notebooks:
Wikipedia article as a timeline - This tool transforms a Wikipedia article into a timeline by parsing all internal links in a Wikipedia article and retrieving the date corresponding to each internal link using the
point in time (P585) property in Wikidata.
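The lookup described above could be sketched roughly as follows. This is only an illustration under stated assumptions, not the notebook's actual code: the function names `build_timeline_query` and `sort_timeline` are hypothetical, the article's internal links are assumed to have already been resolved to Wikidata item IDs, and the query relies on the `wd:` and `wdt:` prefixes being predefined by the query endpoint (as they are on the Wikidata Query Service).

```python
from datetime import date


def build_timeline_query(item_ids):
    """Build a SPARQL query fetching point in time (P585) for the given items.

    item_ids: iterable of Wikidata item IDs (strings starting with "Q").
    """
    values = " ".join(f"wd:{qid}" for qid in item_ids)
    return (
        "SELECT ?item ?date WHERE {\n"
        f"  VALUES ?item {{ {values} }}\n"
        "  ?item wdt:P585 ?date .\n"
        "}\n"
        "ORDER BY ?date"
    )


def sort_timeline(events):
    """Sort (label, date) pairs chronologically; entries without a date are dropped."""
    dated = [(label, when) for label, when in events if when is not None]
    return sorted(dated, key=lambda pair: pair[1])
```

Links whose items carry no point in time statement simply return no row from the query, which is why the sorting helper tolerates missing dates.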
Tool of the week
Map your list of created articles - a notebook display of geolocated articles on a map created by a user per chosen project and batch (featured/good article).
Other Noteworthy Stuff
Wikimedia Indonesia and Wikimedia Deutschland ended their partnership within the project
Software Collaboration for Wikidata prematurely. Read their joint statement
here.
IP masking/temporary accounts: We are adjusting Wikibase to be prepared for the upcoming changes to no longer expose IP addresses for non-logged-in users (
phab:T351968)
Dumps/lex. data: We’re adjusting how empty lists of Forms and Senses are represented in JSON dumps (
phab:T305660)
Wikibase REST API:
We finished the work on making it possible to get all sitelinks of an Item (
phab:T344041)
We are working on getting a sitelink for a given wiki (
phab:T344039)
Here's your quick overview of what has been happening around Wikidata over the last week.
Translations are available.
Discussions
New request for comments:
Domain name as data (Summary: How should Wikidata store the domain name associated with an item? There are many properties for URLs, but a domain name is a different value.)
PLW 2024: Provenance loves Wiki - Fri. 12th - Sun. 14th January. If you missed the event, catch up by reading the slides and notes and watching the recordings on the project page.
Next:
Linked Open Data in Heritage Workshop > Jan. 23rd, 13:00 - 15:00 CET. If you are in the Maastricht University Faculty and want to know how to enhance heritage research and improve data management, connectivity and visualisation,
register for the Workshop.
AskWikidata: Natural language queries to Wikidata, a naive prototype created by Senior Software Engineer for Wikidata, Robert Timm.
Want to try? (Google Colab)
IP Masking: We are continuing to adapt Wikibase to the upcoming IP Masking feature. We worked on hiding warnings about IP addresses being saved when they don’t apply (
phab:T353807,
phab:T352006) and creating temporary accounts when editing (
phab:T354730)
Wikibase REST API:
We continued working on the ability to get a sitelink for a given site (
phab:T344039)
We started working on the ability to remove a sitelink for a given wiki (
phab:T344685)
We worked on fixing a bug where the REST API PUT request does not handle statements on Items with lowercase statement IDs (
phab:T352644)
mul language code: We did user testing to find any remaining issues before release
Nubuke Wikidata Workshop - Sat. 27 April 1100 - 1400 GMT at the Nubuke Foundation, Lome Close Accra, Ghana.
There's still time for nominating your favorite tool in the
Coolest Tool Award! To nominate, follow this link. Deadline: 10 May 2024 - winners unveiled at Wikimania 2024
Coordinate Me 2024 - an international Wikidata competition around content with geodata. The competition starts on 1 May 2024 and ends on 31 May 2024.
Wikimedia Hackathon 2024 is just around the corner, taking place in Tallinn, Estonia from 02 - 06 May. We hope there will be lots of Wikidata hacking projects worked on.
Bringing PanglaoDB to 5-star Linked Open Data using Wikidata - This paper documents the experiences mapping PanglaoDB's free text cell types and genes to Wikidata to improve reusability and fairness. By Tiago Lubiana & João Vitor F. Cavalcante.
Exploring Wikidata for Social Science Research (Portuguese) - Highlights the relevance of open resources such as Wikidata for the Social Sciences, providing access to information that facilitates research in the social, economic and cultural areas. By Wenceslao Arroyo-Machado.
MWCon Spring 2024 (Day 3) - This session of the MediaWiki Users and Developers Conference explores the different extensions you can use with Wikibase. See the rest of the
program here
Getting Started with OpenRefine - Introducing OpenRefine, importing data, filtering and faceting and bulk editing, as well as how to connect to other data sources for reconciling your data. By Margaret Heller & Diana Rusch.
RTE substation ID (identifier of electrical substations operated by RTE in France)
API formatter URL (URI template from which "$1" can automatically be replaced with the effective property value on items; for API access and other machine-readable data)
leaf morphology (characterization of aspects of the shape of a plant’s leaves)
risk group (risk group of a biological agent guiding its initial handling in labs according to the risk group classification defined by the WHO laboratory biosafety manual)
timetable/schedule URL (link to the timetable or the schedule (in PDF, HTML, image format) for the given service or the event schedule. The current timetable/schedule should have preferred rank.)
deterioration (to indicate types of deterioration presented by an artwork, building, artifact, etc.)
Type of representation (Property to indicate the representation type as a qualifier for Wikimedia Commons SDC Depicts statements of such Wikidata items, indicating the media type of the media file as can be derived from its registered property in Wikidata, being different from P18 (image).)
is fake of (the kind (class) of elements this item falsifies / is a fake for)
tartan (item's tartan; Tartan is a Scottish cloth pattern symbolizing a clan, region, or group.)
hasFeldpostNumber (Property to link German military units to their respective Feldpost numbers, referencing the specific identifier used during the military communications in the world wars.)
total deposits (total value of deposits held by a bank or financial institution)
Total loans (total value of loans given out by a bank or financial institution)
source of transfer & destination of transfer (entity that a transferred item is initially associated with, before this process associates it with another entity (the destination of transfer) [aliases: source / sender])
rythme narratif (video game mechanic based on the rhythm of the player's actions)
event role (item that describes a role in an event class)
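The "$1" substitution described for the proposed API formatter URL property above can be illustrated with a minimal sketch. The function name `expand_formatter_url` and the example.org template are hypothetical, and percent-encoding the value is an assumption of this sketch rather than something the property proposal specifies.

```python
from urllib.parse import quote


def expand_formatter_url(template, value):
    """Replace the "$1" placeholder in a formatter URL with a property value."""
    if "$1" not in template:
        raise ValueError('formatter URL must contain the "$1" placeholder')
    # Assumption: the value is percent-encoded so that spaces or slashes
    # in an identifier do not break the resulting URL.
    return template.replace("$1", quote(value, safe=""))


# Hypothetical template and identifier value, for illustration only.
print(expand_formatter_url("https://api.example.org/items/$1.json", "AB 12"))
# prints https://api.example.org/items/AB%2012.json
```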
Explore data about Tabakalera participants - find biographical information on film directors who have participated in the San Sebastian Film Festival Tabakalera cultural centre.
Mothers on Wikidata - explore the mother and child relationships of Wikidata items.
Presidents of French Universities - curious about which individuals have presided at French Universities and Higher education institutions? This query explores that!
Showcase Items:
Jerzy Spława-Neyman - A Polish statistician, the first to introduce a confidence interval to statistical hypothesis testing.
The October drive reduced the backlog from 9,700 to an amazing 0! Congratulations to WaddlesJP13 who led with 2084 points. See
this page for further details. The queue is steadily rising again and is approaching 2,000. It would be great if <2,000 were the “new normal”. Please continue to help out even if it's only for a few or even one patrol a day.
2022 Awards
Onel5969 won the 2022 cup for 28,302 article reviews last year - that's an average of nearly 80/day. There was one Gold Award (5000+ reviews), 11 Silver (2000+), 28 Iron (360+) and 39 more for the 100+ barnstar. Rosguill led again for the 4th year by clearing 49,294 redirects. For the full details see the
Awards page and the
Hall of Fame. Congratulations everyone!
Minimum deletion time: The previous
WP:NPP guideline was to wait 15 minutes before tagging for deletion (including draftification and
WP:BLAR). Due to complaints, a consensus decided to raise the time to 1 hour. To illustrate this, very new pages in the
feed are now highlighted in red. (As always, this is not applicable to attack pages, copyvios, vandalism, etc.)
New draftify script: In response to feedback from AFC, the The Move to Draft script now provides a choice of set messages that also link the creator to a new, friendly
explanation page. The script also warns reviewers if the creator is probably still developing the article. The former script is no longer maintained. Please edit your edit your
common.js or vector.js file from User:Evad37/MoveToDraft.js to User:MPGuy2824/MoveToDraft.js
Redirects: Some of our redirect reviewers have reduced their activity and the backlog is up to 9,000+ (two months deep). If you are interested in this distinctly different task and need any help, see
this guide,
this checklist, and spend some time at
WP:RFD.
Discussions with the WMF The
PageTriage open letter signed by 444 users is bearing fruit. The Growth Team has assigned some software engineers to work on PageTriage, the software that powers the NewPagesFeed and the Page Curation toolbar. WMF has submitted
dozens of patches in the last few weeks to modernize PageTriage's code, which will make it easier to write patches in the future. This work is helpful but is not very visible to the end user. For patches visible to the end user, volunteers such as Novem Linguae and MPGuy2824 have been writing patches for bug reports and feature requests. The Growth Team also
had a video conference with the NPP coordinators to discuss
revamping the landing pages that new users see.
Reminders
Newsletter feedback - please take this
short poll about the newsletter.
If you no longer wish to be a reviewer, please ask any admin to remove you from the group. If you want the tools back again, just ask at
PERM.
To opt out of future mailings, please remove yourself
here.
New Pages Patrol newsletter June 2023
Hello Jianhui67,
Backlog
Redirect drive: In response to an unusually high redirect backlog, we held a redirect backlog drive in May. The drive completed with 23851 reviews done in total, bringing the redirect backlog to 0 (momentarily). Congratulations to Hey man im josh who led with a staggering 4316 points, followed by Meena and Greyzxq with 2868 and 2546 points respectively. See
this page for more details. The redirect queue is steadily rising again and is steadily approaching 4,000. Please continue to help out, even if it's only for a few or even one review a day.
Redirect autopatrol: All administrators without autopatrol have now been added to the redirect autopatrol list. If you see any users who consistently create significant amounts of good quality redirects, consider requesting redirect autopatrol for them
here.
WMF work on PageTriage: The
WMF Moderator Tools team, consisting of Sam,
Jason and
Susana, and also some patches from Jon, has been hard at work
updating PageTriage. They are focusing their efforts on modernising the extension's code rather than on bug fixes or new features, though some user-facing work will be prioritised. This will help make sure that this extension is not deprecated, and is easier to work on in the future. In the next month or so, we will have an opt-in
beta test where new page patrollers can help test the rewrite of
Special:NewPagesFeed, to help find bugs. We will post more details at
WT:NPPR when we are ready for beta testers.
Articles for Creation (AFC): All new page reviewers are now automatically approved for Articles for Creation draft reviewing (you do not need to apply at
WT:AFCP like was required previously). To install the
AFC helper script, visit
Special:Preferences, visit the Gadgets tab, tick "Yet Another AFC Helper Script", then click "Save". To find drafts to review, visit
Special:NewPagesFeed, and at the top left, tick "Articles for Creation". To review a draft, visit a submitted draft, click on the "More" menu, then click "Review (AFCH)". You can also comment on and submit drafts that are unsubmitted using the script.
You can review the AFC workflow at
WP:AFCR. It is up to you if you also want to mark your AFC accepts as NPP reviewed (this is allowed but optional, depends if you would like a second set of eyes on your accept). Don't forget that
draftspace is optional, so moves of drafts to mainspace (even if they are not ready) should not be reverted, except possibly if there is conflict of interest.
Pro tip: Did you know that visual artists such as painters have their own
SNG? The most common part of this "creative professionals" criteria that applies to artists is
WP:ARTIST 4b (solo exhibition, not group exhibition, at a major museum) or 4d (being represented within the permanent collections of two museums).
Reminders
Newsletter feedback - please take this
short poll about the newsletter.
To opt out of future mailings, please remove yourself
here.
Wikidata weekly summary #608
Here's your quick overview of what has been happening around Wikidata over the last week.
Welcome to 2023’s Final Weekly Summary!
A big thank you to everyone who contributed to the newsletter this year!👏🙏 As we step into 2024, we'd love to hear what changes you would like to see in the newsletter. Share your wishlist here:
What changes would you like to see in the newsletter in 2024?"
Discussions
Open request for adminship:
EPIC (RfP scheduled to end after 26 December 2023 20:34 UTC)
New requests for permissions/Bot:
Balyozbot. Tasks:
Import sitelinks, labels, descriptions from ku wikipedia pages which use the template
w:ku:Template:Înterwîkî etîket û danasîn. (There are over 1800 articles that use this template waiting to be connected to Wikidata at the moment.)
Add sitelinks to kuwiktionary / kuwikipedia categories / create an item for the category if necessary. I have been doing this manually for quite some time using Quickstatements but since I need to get permission for the first task, I will be handling them using a bot as well.
Upcoming:
Introducing WMF Wishathon for Wikimedia’s Community Wishlist! "focused on bringing together people who already contribute to technical aspects of the Wikimedia projects, who know how to find their way on the technical ecosystem, and who are able to work or collaborate on projects rather autonomously." March 15th to 17th, 2024.
African Librarians empowered to share knowledge and enhance information visibility through AfLIA Wikidata Online Course --> The "Promoting Open Knowledge Practices in African Libraries through Wikidata" project, executed by AfLIA with support from the Wikimedia Foundation, trained African librarians on using Wikidata to enhance the visibility of library collections and close the knowledge and gender gap on Africa. The course was facilitated by experienced African Wikimedian editors and included diverse strategies for learner engagement and support.
Papers:
Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs by (Conia et al, 2023) --> This paper introduces a novel task of automatic Knowledge Graph Enhancement (KGE) to bridge the gap in the quantity and quality of textual information between English and non-English languages in Wikidata. It presents M-NTA, an unsupervised approach that combines Machine Translation, Web Search, and Large Language Models to generate high-quality textual information, and studies its impact on Entity Linking, Knowledge Graph Completion, and Question Answering tasks.
Videos
Wikidata, Wikisource and Wiktionary: Wikisource for DH (WiSe 2023) --> The lecture "Fundamentals and application-oriented methods of the Digital Humanities" by Kay-Michael Würzner is designed as a series of lectures in which teachers in the "Digital Humanities" course present their fields of work and key topics and present them for discussion.
Empowering Open-Source Generative AI by Integrating the Wikidata knowledge graph --> Generative AI has changed the information ecosystem, and open-source knowledge graphs like Wikidata can become invaluable assets, propelling a myriad of applications forward. Jonathan Fraine & Lydia Pintscher present the practical integration of Wikidata's open-source, open-access knowledge graph to empower Generative AI applications. Harnessing the real-time updated, structured data encapsulated within Wikidata, they explore automated content creation, data augmentation, and semantic analysis, underpinning the generative paradigms. Through a blend of theoretical insights and real-world applications, they elucidate how to leverage Wikidata to elevate generative AI applications, breaking down existing data silos, and fostering a collaborative ecosystem within our global community of developers and contributors.
Wiki Indaba 2023 - African content on Wikidata --> Discussion with Alice Kibombo, Georges Fodouop and Jesse Asiedu-Akrofi about Wikidata for African Librarians during the Wiki Indaba conference, which took place 3-5 November 2023 in Agadir, Morocco.
No Time to Wait - S07E10 - ACMI // Wikidata - Paul Duchesne + Simon Loffler --> Report on a recent residency program to extensively link together collection data from ACMI with Wikidata. This work has allowed the organisation to import vast quantities of data and media to enrich their own internet collection experience, as well as to enable writing information back to source and federating with other linked institutions.
Map of K-Pop Idols --> An interactive map where each clickable red dot represents a K-pop idol (a singer or musician in South Korean pop music).
Disney as the Mega Corporation it is Today --> Disney has greatly evolved from the simple animation company that first debuted in 1923 with its signature Steamboat Willie animation. This analysis details some of the major acquisitions Disney has chosen to help expand its reach as a media and entertainment company.
State of statues in the US --> Map of how many statues there are, who is depicted in the statues, their genders, and where the statues are concentrated.
An Analysis on Nepo Babies: Net Worths and Fame --> This work uses Wikidata to analyze the influence and success of children of famous actors (nepo babies) in the entertainment industry, and compares the careers and net worth of these children with their parents to understand the impact of nepotism on their success.
Tool of the week
Cersei - a tool designed for importing or scraping data from various third-party sources, using source-specific Python code. It can use a "headless browser" to scrape complicated websites that rely on e.g. JavaScript to navigate. It can therefore access data sources that cannot be accessed via e.g. Mix'n'match. The data from sources can be updated regularly, either for everything, or just for changed entries (if the source has a "recent changes" equivalent).
Wikidata:Zotero/Cita - a Wikidata add-on for Zotero that adds support for citation metadata (i.e., which other items an item cites) to this open-source reference management software, using
cites work (P2860) information available from Wikidata and enabling users to easily contribute missing data.
production manager (manager that is responsible for the administration of a feature film or television production; oversees production plans, controls resources, initiates production, ensures ongoing operations, monitors schedules and expenditures, and creates a detailed production schedule and budget)
Newest
WikiProjects:
WikiProject Städel Museum Wikidata Clean-Up - This WikiProject from the Städel Museum aims to actively participate in the Wikimedia community by maintaining and updating the quality of its data. This includes their collection of public domain art, which has been digitized and made freely available for public use. The project focuses on ensuring that the most current and high-quality data, including high-resolution images and improved metadata, are available on platforms like Wikimedia Commons and Wikidata.
Upcoming: The next
Wikidata+Wikibase office hours will take place on Wednesday, 17th January 2024, at 17:00 UTC (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Papers:
Improving maintenance of community-based knowledge graphs. This paper by Nicolas Ferranti addresses the critical issue of data quality in open knowledge graphs, with a specific focus on Wikidata. It aims to formalize Wikidata's unique approaches to assess and resolve data inconsistencies, proposing a semi-automatic refinement pipeline to empower the Wikidata user community in maintaining and enhancing the reliability of this extensive collaborative knowledge graph.
Videos:
WikidataCon 2023 Day 1.5 - The past and future of Wikidata. In this video, Lydia Pintscher reviews the major events of Wikidata over the past few years, then looks ahead to Wikidata's prospects over the next year.
Tool of the week
WICA: Wikidata's insights for created articles is an updated version of an old tool. It now includes many new features to analyse your list of created articles using Wikidata properties.
Nonprofit Status (indicates the legal and tax status of a non-profit organization, specific to the legal area served, i.e. the country. Complements legal form (P1454). equivalent property (P1628): nonprofitStatus from schema.org, https://schema.org/nonprofitStatus. Organizations can have multiple Nonprofit Status values from different countries.)
creative director (person who makes high-level creative decisions, oversees the creation of creative assets such as adverts, products, events or logos and guides and directs the creative people who create the end result)
The next
Wikidata+Wikibase office hours will take place on Wednesday, 17th January 2024, at 17:00 UTC (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Blogs:
PubChem on Wikidata – What is the state of coverage? by Tiago Lubiana. In summary, Wikidata has good coverage of the structured chemical data in PubChem, though there are improvement points. PubChem displays, and will always display, textual information and vendor-specific data that do not fit Wikidata, but they are complementary tools in the ecosystem of open chemical data.
LIS Journals’ Lack of Participation in Wikidata Item Creation, by Eric Willey & Susan Radovsky, discusses the gap in Wikidata items being created for scholarly articles by the scholars themselves, and whether this can lead to an inconsistent or inaccurate data model.
Quantifying Americanization: Coverage of American Topics in Different Wikipedias: this paper asks whether there is an americanisation bias in the content created by the communities. By Piotr Konieczny & Włodzimierz Lewoniewski.
Videos
Map Kerala Initiative is an open-data geospatial map portal powered by Wikidata and OpenStreetMap, introduced by Manoj Karingamadathil.
Notebooks:
Wikipedia article as a timeline - This tool transforms a Wikipedia article into a timeline by parsing all internal links in the article and retrieving the date corresponding to each internal link using the
point in time (P585) property in Wikidata.
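The timeline idea described above can be sketched offline as follows. This is only an illustration, not the notebook's actual code: the function name build_timeline and the sample link-to-date mapping are hypothetical, and the dates are hard-coded stand-ins for values the real tool would read from point in time (P585) on Wikidata.

```python
from datetime import date

# Hypothetical sample data: internal link title -> point in time (P585) value.
# None marks a linked item that has no P585 statement.
SAMPLE_P585 = {
    "Storming of the Bastille": date(1789, 7, 14),
    "Battle of Waterloo": date(1815, 6, 18),
    "French Revolution": None,
    "Tennis Court Oath": date(1789, 6, 20),
}


def build_timeline(link_titles, p585_lookup):
    """Return (date, title) pairs in chronological order, skipping undated links."""
    dated = [
        (p585_lookup[title], title)
        for title in link_titles
        if p585_lookup.get(title) is not None
    ]
    return sorted(dated)


timeline = build_timeline(list(SAMPLE_P585), SAMPLE_P585)
for when, title in timeline:
    print(when.isoformat(), title)
```

Links without a date are dropped rather than guessed, which is one possible design choice; a real tool might fall back to other date properties instead.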
Tool of the week
Map your list of created articles - a notebook that displays a user's geolocated created articles on a map, per chosen project and batch (featured/good article).
Other Noteworthy Stuff
Wikimedia Indonesia and Wikimedia Deutschland ended their partnership within the project
Software Collaboration for Wikidata prematurely. Read their joint statement
here.
IP masking/temporary accounts: We are adjusting Wikibase to prepare for the upcoming changes that will no longer expose IP addresses for non-logged-in users (
phab:T351968)
Dumps/lex. data: We’re adjusting how empty lists of Forms and Senses are represented in JSON dumps (
phab:T305660)
Wikibase REST API:
We finished the work on making it possible to get all sitelinks of an Item (
phab:T344041)
We are working on getting a sitelink for a given wiki (
phab:T344039)
Here's your quick overview of what has been happening around Wikidata over the last week.
Translations are available.
Discussions
New request for comments:
Domain name as data (Summary: How should Wikidata store the domain name associated with an item? There are many properties for URLs, but a domain name is a different value.)
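To illustrate the distinction this request for comments draws, here is a minimal sketch of how a domain name differs from the URLs that contain it. It uses only Python's standard urllib.parse; the helper name domain_of is hypothetical, and the sketch does not reflect any data model proposed in the discussion.

```python
from urllib.parse import urlsplit


def domain_of(url):
    """Return the lowercased host name of a URL (no scheme, port or path)."""
    return urlsplit(url).hostname


# Two different URL values that share one domain name.
urls = [
    "https://www.wikidata.org/wiki/Q42",
    "https://WWW.Wikidata.org:443/wiki/Property:P856",
]
for url in urls:
    print(url, "->", domain_of(url))
```

Many distinct URL values can map to the same domain name, which is why a URL-typed property does not directly answer the question of which domain is associated with an item.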
PLW 2024: Provenance loves Wiki - Fri. 12th - Sun. 14th January. If you missed the event, catch up by reading the slides and notes and watching the recordings on the project page.
Next:
Linked Open Data in Heritage Workshop - Jan. 23rd, 13:00 - 15:00 CET. If you are on the Maastricht University faculty and want to know how to enhance heritage research and improve data management, connectivity and visualisation,
register for the Workshop.
AskWikidata: Natural language queries to Wikidata, a naive prototype created by Robert Timm, Senior Software Engineer for Wikidata.
Want to try? (Google Colab)
IP Masking: We are continuing to adapt Wikibase to the upcoming IP Masking feature. We worked on hiding warnings about IP addresses being saved when they don’t apply (
phab:T353807,
phab:T352006) and creating temporary accounts when editing (
phab:T354730)
Wikibase REST API:
We continued working on the ability to get a sitelink for a given site (
phab:T344039)
We started working on the ability to remove a sitelink for a given wiki (
phab:T344685)
We worked on fixing a bug where the REST API PUT request does not handle statements on Items with lowercase statement IDs (
phab:T352644)
mul language code: We did user testing to find any remaining issues before release.
Nubuke Wikidata Workshop - Sat. 27 April, 11:00 - 14:00 GMT at the Nubuke Foundation, Lome Close, Accra, Ghana.
There's still time for nominating your favorite tool in the
Coolest Tool Award! To nominate, follow this link. Deadline: 10 May 2024 - winners unveiled at Wikimania 2024
Coordinate Me 2024 - an international Wikidata competition around content with geodata. The competition starts on 1 May 2024 and ends on 31 May 2024.
Wikimedia Hackathon 2024 is just around the corner, taking place in Tallinn, Estonia from 02 - 06 May. We hope to see lots of Wikidata hacking projects there.
Bringing PanglaoDB to 5-star Linked Open Data using Wikidata - This paper documents the experiences mapping PanglaoDB's free text cell types and genes to Wikidata to improve reusability and fairness. By Tiago Lubiana & João Vitor F. Cavalcante.
Exploring Wikidata for Social Science Research (Portuguese) - Highlights the relevance of open resources such as Wikidata for the Social Sciences, providing access to information that facilitates research in the social, economic and cultural areas. By Wenceslao Arroyo-Machado.
MWCon Spring 2024 (Day 3) - This session of the MediaWiki Users and Developers Conference explores the different extensions you can use with Wikibase. See the rest of the
program here
Getting Started with OpenRefine - Introducing OpenRefine, importing data, filtering and faceting and bulk editing, as well as how to connect to other data sources for reconciling your data. By Margaret Heller & Diana Rusch.
RTE substation ID (identifier of electrical substations operated by RTE in France)
API formatter URL (URI template from which "$1" can automatically be replaced with the effective property value on items; for API access and other machine-readable data)
leaf morphology (characterization of aspects of the shape of a plant’s leaves)
risk group (risk group of a biological agent guiding its initial handling in labs according to the risk group classification defined by the WHO laboratory biosafety manual)
timetable/schedule URL (link to the timetable or the schedule (in PDF, HTML, image format) for the given service or the event schedule. The current timetable/schedule should have preferred rank.)
deterioration (to indicate types of deterioration presented by an artwork, building, artifact, etc.)
Type of representation (Property to indicate the representation type as a qualifier for Wikimedia Commons SDC Depicts statements of such Wikidata items, indicating the media type of the media file as can be derived from its registered property in Wikidata, being different from P18 (image).)
is fake of (the kind (class) of elements this item falsifies / is a fake for)
tartan (item's tartan; Tartan is a Scottish cloth pattern symbolizing a clan, region, or group.)
hasFeldpostNumber (Property to link German military units to their respective Feldpost numbers, referencing the specific identifier used during the military communications in the world wars.)
total deposits (total value of deposits held by a bank or financial institution)
Total loans (total value of loans given out by a bank or financial institution)
source of transfer & destination of transfer (entity that a transferred item is initially associated with, before this process associates it with another entity (the destination of transfer) [aliases: source / sender])
rythme narratif (video game mechanic based on the rhythm of the player's actions)
event role (item that describes a role in an event class)
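As a rough illustration of the "$1" substitution described for the API formatter URL proposal in the list above, the sketch below expands a URI template with a property value. The template URL and the function name expand_formatter are hypothetical, and percent-encoding the value is an assumption of this example rather than something the proposal specifies.

```python
from urllib.parse import quote

# Hypothetical URI template; "$1" marks where the property value goes.
TEMPLATE = "https://api.example.org/v1/records/$1.json"


def expand_formatter(template, value):
    """Replace "$1" in the template with the percent-encoded value."""
    # Assumption: the value is encoded so that spaces and slashes stay inside one path segment.
    return template.replace("$1", quote(value, safe=""))


print(expand_formatter(TEMPLATE, "AB 12/3"))
```

A consumer could apply the same expansion to any identifier value stored on an item to obtain a machine-readable endpoint for it.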
Explore data about Tabakalera participants - find biographical information about film directors who have participated in the San Sebastian Film Festival (Tabakalera cultural centre).
Mothers on Wikidata - explore the mother and child relationships of Wikidata items.
Presidents of French Universities - curious about which individuals have presided at French Universities and Higher education institutions? This query explores that!
Showcase Items:
Jerzy Spława-Neyman - A Polish statistician, the first to introduce the confidence interval into statistical hypothesis testing.