This page has archives. Sections older than 31 days may be automatically archived by Lowercase sigmabot III when more than 4 sections are present.
Wikidata weekly summary #608
Here's your quick overview of what has been happening around Wikidata over the last week.
Welcome to 2023’s Final Weekly Summary!
A big thank you to everyone who contributed to the newsletter this year!👏🙏 As we step into 2024, we'd love to hear what changes you would like to see in the newsletter. Share your wishlist here:
What changes would you like to see in the newsletter in 2024?"
Discussions
Open request for adminship:
EPIC (RfP scheduled to end after 26 December 2023 20:34 UTC)
New requests for permissions/Bot:
Balyozbot. Tasks:
Import sitelinks, labels, descriptions from ku wikipedia pages which use the template
w:ku:Template:Înterwîkî etîket û danasîn. (There are over 1800 articles that use this template waiting to be connected to Wikidata at the moment.)
Add sitelinks to kuwiktionary / kuwikipedia categories / create an item for the category if necessary. I have been doing this manually for quite some time using Quickstatements but since I need to get permission for the first task, I will be handling them using a bot as well.
Upcoming:
Introducing WMF Wishathon for Wikimedia’s Community Wishlist! "focused on bringing together people who already contribute to technical aspects of the Wikimedia projects, who know how to find their way on the technical ecosystem, and who are able to work or collaborate on projects rather autonomously." March 15th to 17th, 2024.
African Librarians empowered to share knowledge and enhance information visibility through AfLIA Wikidata Online Course --> The "Promoting Open Knowledge Practices in African Libraries through Wikidata" project, executed by AfLIA with support from the Wikimedia Foundation, trained African librarians on using Wikidata to enhance the visibility of library collections and close the knowledge and gender gap on Africa. The course was facilitated by experienced African Wikimedian editors and included diverse strategies for learner engagement and support.
Papers:
Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs by (Conia et al, 2023) --> This paper introduces a novel task of automatic Knowledge Graph Enhancement (KGE) to bridge the gap in the quantity and quality of textual information between English and non-English languages in Wikidata. It presents M-NTA, an unsupervised approach that combines Machine Translation, Web Search, and Large Language Models to generate high-quality textual information, and studies its impact on Entity Linking, Knowledge Graph Completion, and Question Answering tasks.
Videos
Wikidata, Wikisource and Wiktionary: Wikisource for DH (WiSe 2023) --> The lecture "Fundamentals and application-oriented methods of the Digital Humanities" by Kay-Michael Würzner is designed as a series of lectures in which teachers in the "Digital Humanities" course present their fields of work and key topics and present them for discussion.
Empowering Open-Source Generative AI by Integrating the Wikidata knowledge graph --> Generative AI has changed the information ecosystem, and open-source knowledge graphs like Wikidata can become invaluable assets, propelling a myriad of applications forward. Jonathan Fraine & Lydia Pintscher present the practical integration of Wikidata's open-source, open-access knowledge graph to empower Generative AI applications. Harnessing the real-time updated, structured data encapsulated within Wikidata, they explore automated content creation, data augmentation, and semantic analysis, underpinning the generative paradigms. Through a blend of theoretical insights and real-world applications, they elucidate how to leverage Wikidata to elevate generative AI applications, breaking down existing data silos, and fostering a collaborative ecosystem within our global community of developers and contributors.
Wiki Indaba 2023 - African content on Wikidata --> Discussion with Alice Kibombo, Georges Fodouop and Jesse Asiedu-Akrofi, about Wikidata for African Librarians during the Wiki Indaba conference, that took place between 3-5 November 2023 in Agadir, Morocco.
No Time to Wait - S07E10 - ACMI // Wikidata - Paul Duchesne + Simon Loffler --> Report on recent residency program to extensively link together collection data from ACMI with Wikidata. This work has allowed the organisation to import vast quantities of data and media to enrich their own internet collection experience, as well enable writing information back to source and federating with other linked institutions.
Map of K-Pop Idols --> An interactive map where each red dot represents a K-pop Idol (a singer or musician in South Korean Pop music) you are able to click on.
Disney as the Mega Corporation it is Today --> Disney has greatly evolved from the simple animation company that first debuted in 1923 with its signature Steamboat Willie animation. This analysis details some of the major acquisitions Disney has chosen to help expand its reach as a media and entertainment company.
State of statues in the US --> Map of how many statues there are, who is depicted in the statues, their genders, and where the statues are concentrated.
An Analysis on Nepo Babies: Net Worths and Fame --> This work uses Wikidata to analyze the influence and success of children of famous actors (nepo babies) in the entertainment industry, and compares the careers and net worth of these children with their parents to understand the impact of nepotism on their success.
Tool of the week
Cersei - is a tool designed for importing or scraping data from various third-party sources, using source-specific Python code. It can use a "headless browser" to scrape complicated websites that rely on eg JavaScript to navigate. It can therefore access data sources that can not be accessed via eg Mix'n'match. The data from sources can be updated regularly, either for everything, or just changed entries (if the source has a "recent changes" equivalent).
Wikidata:Zotero/Cita - is a Wikidata addon for Zotero that adds citations (i.e., what other items an item cites) metadata support to this open source reference management software, using
cites work (P2860) information available from Wikidata, and enabling users to easily contribute missing data.
production manager (manager that is responsible for the administration of a feature film or television production; oversees production plans, controls resources, initiates production, ensures ongoing operations, monitors schedules and expenditures, and creates a detailed production schedule and budget)
Newest
WikiProjects:
WikiProject Städel Museum Wikidata Clean-Up - This WikiProject from the Städel Museum aims to actively participate in the Wikimedia community by maintaining and updating the quality of its data. This includes their collection of public domain art, which has been digitized and made freely available for public use. The project focuses on ensuring that the most current and high-quality data, including high-resolution images and improved metadata, are available on platforms like Wikimedia Commons and Wikidata.
Upcoming: The next
Wikidata+Wikibase office hours will take place on Wednesday, 17:00 UTC, 17th January 2023 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Papers:
Improving maintenance of community-based knowledge graphs. This paper by Nicolas Ferranti addresses the critical issue of data quality in open knowledge graphs, with a specific focus on Wikidata. It aims to formalize Wikidata's unique approaches to assess and resolve data inconsistencies, proposing a semi-automatic refinement pipeline to empower the Wikidata user community in maintaining and enhancing the reliability of this extensive collaborative knowledge graph.
Videos:
WikidataCon 2023 Day 1.5 - The past and future of Wikidata. In this video Lydia Pintscher takes a moment to review the major events of Wikidata over the past few years. Then turns to look forward and predict what Wikidata's prospects will be over the next year.
Tool of the week
WICA: Wikidata's insights for created articles is an updated version of an old tool. It now includes many new features to analyse your list of created articles using Wikidata properties.
Nonprofit Status (Indicating the legal and tax status of a non-profit organization (specific to served legal areas, aka. Countries). Addition to {{P|1454}}. {{P|1628}} to [https://schema.org/nonprofitStatus nonprofitStatus] from schema.org. Organizations can have multiple Nonprofit Status from different countries.)
creative director (person who makes high-level creative decisions, oversees the creation of creative assets such as adverts, products, events or logos and guides and directs the creative people who create the end result)
The next
Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Blogs:
PubChem on Wikidata – What is the state of coverage? by Tiago Lubiana. In summary, Wikidata has good coverage of the structured chemical data in PubChem, though there are improvement points. PubChem displays, and will always display, textual information and vendor-specific data that do not fit Wikidata, but they are complementary tools in the ecosystem of open chemical data.
LIS Journals’ Lack of Participation in Wikidata Item Creation by Eric Willey & Susan Radovsky, discusses the gap of Wikidata items being created for scholarly articles by the scholar's themselves and if this can lead to inconsistent or inaccurate data model.
Quantifying Americanization: Coverage of American Topics in Different Wikipedias: this paper asks whether there is an americanisation bias in the content created by the communities. By Piotr Konieczny & Włodzimierz Lewoniewski.
Videos
Map Kerala Initiative is an opendata portal geospatial map powered by Wikidata and OpenStreetMap, introduced by Manoj Karingamadathil.
Notebooks:
Wikipedia article as a timeline - This tool transforms a Wikipedia article in a timeline by parsing all internal links in a Wikipedia article and retrieving the date corresponding to each internal link using the
point in time (P585) property in Wikidata.
Tool of the week Map your list of created articles - a notebook display of geolocated articles on a map created by a user per chosen project and batch (featured/good article).
Other Noteworthy Stuff
Wikimedia Indonesia and Wikimedia Deutschland ended their partnership within the project
Software Collaboration for Wikidata prematurely. Read their joint statement
here.
IP masking/temporary accounts: We are adjusting Wikibase to be prepared for the upcoming changes to no longer expose IP addresses for non-logged-in users (
phab:T351968)
Dumps/lex. data: We’re adjusting how empty lists of Forms and Senses are represented in JSON dumps (
phab:T305660)
Wikibase REST API:
We finished the work on making it possible to get all sitelinks of an Item (
phab:T344041)
We are working on getting a sitelink for a given wiki (
phab:T344039)
Here's your quick overview of what has been happening around Wikidata over the last week.
Translations are available.
Discussions
New request for comments:
Domain name as data (Summary: How should Wikidata store the domain name associated with an item? There are many properties for URLs, but a domain name is a different value.)
PLW 2024: Provenance loves Wiki - Fri. 12th - Sun. 14th January. If you missed the event, catch up by reading the slides, Notes and watching the recordings on the Project page
Next:
Linked Open Data in Heritage Workshop > Jan. 23rd, 13:00 - 15:00 CET. If you are in the Maastricht University Faculty and want to know enhance heritage research, improve data management, connectivity and visualisation,
register for the Workshop.
AskWikidata: Natural language queries to Wikidata, a naive prototype created by Senior Software Engineer for Wikidata, Robert Timm.
Want to try? (Google Colab)
IP Masking: We are continuing to adapt Wikibase to the upcoming IP Masking feature. We worked on hiding warnings about IP addresses being saved when they don’t apply (
phab:T353807,
phab:T352006) and creating temporary accounts when editing (
phab:T354730)
Wikibase REST API:
We continued working on the ability to get a sitelink for a given site (
phab:T344039)
We started working on the ability to remove a sitelink for a given wiki (
phab:T344685)
We worked on fixing a bug where the REST API PUT request does not handle statement on Items with lowercase statement IDs (
phab:T352644)
mul language code: We did user testing to find any remaining issue before release
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-06-24. Please help
Translate.
Upcoming:
Lexicodays, online event dedicated to Lexemes on Wikidata, will take place on June 28-30. It takes place across time zone and both in English and Indonesian. Check the program and find the access links on the event page.
Wikidata in Prometheus - Prometheus is a non-commercial image archive for art and cultural studies. It hosts images from a variety of image and media databases and now works can be connected with Wikidata.
New
Mix'n'match feature: For lists of (full or auto) matches where both the MnM entry and the Wikidata Item have coordinates, it now shows the distance between them in the description. (
source)
Other Noteworthy Stuff
The Wikidata development team at Wikimedia Deutschland is planning a brief survey to understand the various ways people contribute to the project and identify user contribution patterns. A request has been made for a CentralNotice banner to deploy the survey to a broad audience. Your feedback, comments, and questions on this request are welcome:
m:CentralNotice/Request/Wikidata Community Survey 2024
The second iteration of the
Wikidata:Open Online Course will begin from July 1 until August 11. Whether you're a beginner taking your first steps, an individual in need of a refresher on Wikidata concepts, or a seasoned trainer looking to level up your skills - this course is right for you.
Multilingualism - organizes work around achieving 100% Wikidata multilingualism for every language with MediaWiki internationalization support. It is initiated, developed & supported by Wikimedia Language Diversity community volunteers.
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-01. Please help
Translate.
Discussions
New requests for permissions/Bot:
DifoolBot 4 Task(s) - Split single references containing multiple reference URLs into multiple references.
Bot Bozze Task(s) - Add sitelinks to itwiki draft articles after they've been moved to the main namespace.
New request for comments:
Spelling convention for labels and descriptions in English - RfC started 2024-06-25. This RfC requests feedback and input for finding consistency in spelling convention as English has multiple regional variations.
Past: The
Lexicodays 2024 was an online event designed to offer a discussion space for the Wikidata community about Lexicographical Data. An archive of some of the slides and session recordings are here
c:Category:Lexicodays 2024. More will be added as they become available.
Upcoming:
The next
Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 10th July 2024 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Botany-focused Wikidata online workshop online as part of the #IBC2024. Date: Tuesday 9th July at 9pm NZST (GMT+12) / 11 am central Europe.
Register here!
Press, articles, blog posts, videos
Blogs
Querying for audio on Wikidata - This blog post discusses using SPARQL queries on Wikidata to find audio recordings, focusing on musical compositions and their associated genres.
Diff Blog: Imagining a Wikidata future for librarians together - the sixth and final blog post from the LD42023 conference. Silvia Gutiérrez (WMF) and Giovanna Fontenelle (WMF) document the results of the collaborative session on building a bridge between the Library-Wikidata community and WMF.
Library Knowledge as Linked Data: A Wikidata Approach: Contributing to a shared data commons. David Erlandson describes the experiences of using Wikidata for the pilot Program for Cooperative Cataloging to "accelerate the movement towards ubiquitous identifier creation and identity management at the network level".
User:Zvpunry/CreateNewItem - This is a User script to easily add a new Item while editing a Statement and noticing that the desired Item is missing.
Other Noteworthy Stuff
The second iteration of the
Wikidata:Open Online Course has begun. Class will continue until August 11. Whether you're a beginner taking your first steps, an individual in need of a refresher on Wikidata concepts, or a seasoned trainer looking to level up your skills - this course is right for you.
Newest
WikiProjects:
Inuktitut - This is the space to organize work to assure that the sum of all knowledge and the supporting infrastructure for necessary services are available in Inuktitut (ᐃᓄᒃᑎᑐᑦ, Inuktitut).
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-08. Please help
Translate.
The next
Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 10th July 2024 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Registration for Wikimania 2024 is open! In-person participants: please register until 26 July, 11:59 p.m., UTC. Virtual participants can register anytime. If you received a scholarship from the Wikimedia Foundation, you will receive an email with a registration code and instructions.
Tool of the week
User:Teester/EntityShape.js - a userscript that adds an input box to a Wikidata page wherein you can enter an EntitySchema (such as
E10). When you click "Check", checks whether each statement and property conforms to the schema. It then displays a summary at the top of the Item for each property indicating whether they conform or not. It also adds a badge to each statement and each property on the page indicating whether they conform or not.
Other Noteworthy Stuff
Wikidata teams' development goals for the third quarter of 2024 have been updated:
Wikidata:Development plan
indexer (entity responsible for compiling an index of a book, database, website or other forms of media publications in the form of a methodical arrangement of records designed to enable users to locate information quickly. Example: Hazel K. Bell (Q70226489))
WorldCyclingStats ID (identifier on the website WorldCyclingStats (www.worldcyclingstats.com))
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-15. Please help
Translate.
Discussions
New requests for permissions/Bot:
SpinachBot - Task/s: AI agent-enabled question answering through the creation and execution of complex SPARQL queries on Wikidata. Users tag the bog with wikidata-related questions, and it tries to come up with an answer by iteratively creating SPARQL queries.
An Intelligent System: What I learned through taking an introductory Wikidata course - The author, Anne-Christine Hoff, dispels misconceptions about Wikidata. She highlights that it is a relational communication system, not solely bot-driven, and allows users worldwide to add localized data in multiple languages, creating a self-structuring repository of information
Merge and diff - blog post by Magnus about adding new properties (taxon data speficially, NCBI, GBIF, and iINaturalist) to the AC2WD tool. If you have the user script installed on Wikidata (see tool below), AC2WD will automatically show up on relevant taxon items.
Online workshop Upskilling in Wikidata for maximum impact IBC 2024 - Recording of the virtual Wikidata workshop given on the 9th of July 2024. An onboarding and introductory event in anticipation of a full in-person workshop to be held at the International Botanical Congress 2024 in Madrid on the 21st of July 2024
User:Magnus Manske/ac2wd.js - This script adds an "AC2WD" link in the tools sidebar. When you click on it, it uses the
AC2WD tool to check the item for certain Authority Control IDs (eg VIAF). It then checks these AC datasets for statements (and more AC IDs). It will then add any new information it found as new statements, or add more references to existing statements where possible. A green checkmark will be appended to the link if data was added (reload the page to see), otherwise a "—" if no new data was available.
myfixguide.com (photos about how to disassemble hardware)
indexer (entity responsible for compiling an index of a book, database, website or other forms of media publications in the form of a methodical arrangement of records designed to enable users to locate information quickly)
Newest
WikiProjects:
Human Cells - dedicated to improve our coverage of information about these fundamental entities that compose organisms. The project focuses mainly on two kinds of cell classes: species-agnostic classes, such as
neutrophil (Q188417), and human-specific classes, such as
human neutrophil (Q101405102).
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-22. Please help
Translate.
Discussions
New requests for permissions/Bot:
DannyS712 bot - Task/s: I want to get approval for a bot with translation admin rights that will automatically mark pages for translations if and only if the latest version is identical to the version that is already in the translation system, i.e. only pages with no "net" changes in the pending edits.
DerIchBot - Task/s: Adding data about schools provided by the German and Austrian governments to wikidata.
DifoolBot 5 - Task/s: Change reference URLs into the related ID property and merge references with the same ID property.
AroundTheBot - Task/s: Automated import of Albanian nouns with IPA from Wiktionary, with the long-term goal of using this data to do pronunciation-based comparison/word evolution between languages.
Past: Wikimedia Indonesia hosted
the 2024 Data Visualization Competition from June 5 to 18. The event featured data visualizations (posters and graphics) and short essays using data from Wikidata. Visit
the competition page (in Indonesian) to view the winning entries.
Upcoming: Wikidata's 12th birthday decentralized events will take place in October and November 2024. Feel free to browse
the documentation pages to learn how to organize an event in your area, get funding, and get in touch with other organizers.
Press, articles, blog posts, videos
Papers:
Representación de datos abiertos con Wikidata Query Service (Q126917814). This paper details the Wikidata Query Service, for the creation of data visualizations. All visualization options available in the WQS are explored, accompanied by example queries that introduce the implementation of these visualizations. By Ángel Obregón-Sierra and Silvia Cecilia Anselmi.
Paulina, a new tool for exploring public domain works.
Other Noteworthy Stuff
WMF Product and Technology Advisory Council (PTAC) invites interested volunteers to apply. As part of the movement strategy recommendation for "Coordinating Across Stakeholders," the PTAC will bring technical contributors and the Wikimedia Foundation together to co-define a more resilient, future-proof technological platform.
indexer (entity responsible for compiling an index of a book, database, website or other forms of media publications in the form of a methodical arrangement of records designed to enable users to locate information quickly)
Reference Verification - a research and development project aimed at helping Wikidata editors check the quality of external references based on various types of AI/ML models.
NZWomenPhotographers - aims to improve information about New Zealand women photographers, based on a dataset provided by the Museum of New Zealand Te Papa Tongarewa.
Newest
database reports:
Items with P569=P570 - Items with instance of (P31) --> human (Q5) and the same year in date of birth (P569) and date of death (P570) (2024-07-22)
Showcase Lexemes:
water (L3302) - (S1) common liquid substance (S2) chemical compound of hydrogen and oxygen (H₂O) (S3) a body of water, usually a river, a lake, or an ocean
Development
mul: We fixed the last blocker for the limited MUL rollout to Wikidata on July 29th (
phab:T362917)
EntitySchemas:
We fixed a misplaced background color in EntitySchema (
phab:T369283)
We’re investigating how to make EntitySchemas searchable by label (
phab:T362005)
Query Service: preparation for the graph split is continuing by the Search Platform Team. We started looking into adapting the constraints checks for it (
phab:T369079)
This page has archives. Sections older than 31 days may be automatically archived by Lowercase sigmabot III when more than 4 sections are present.
Wikidata weekly summary #608
Here's your quick overview of what has been happening around Wikidata over the last week.
Welcome to 2023’s Final Weekly Summary!
A big thank you to everyone who contributed to the newsletter this year!👏🙏 As we step into 2024, we'd love to hear what changes you would like to see in the newsletter. Share your wishlist here:
What changes would you like to see in the newsletter in 2024?"
Discussions
Open request for adminship:
EPIC (RfP scheduled to end after 26 December 2023 20:34 UTC)
New requests for permissions/Bot:
Balyozbot. Tasks:
Import sitelinks, labels, descriptions from ku wikipedia pages which use the template
w:ku:Template:Înterwîkî etîket û danasîn. (There are over 1800 articles that use this template waiting to be connected to Wikidata at the moment.)
Add sitelinks to kuwiktionary / kuwikipedia categories / create an item for the category if necessary. I have been doing this manually for quite some time using Quickstatements but since I need to get permission for the first task, I will be handling them using a bot as well.
Upcoming:
Introducing WMF Wishathon for Wikimedia’s Community Wishlist! "focused on bringing together people who already contribute to technical aspects of the Wikimedia projects, who know how to find their way on the technical ecosystem, and who are able to work or collaborate on projects rather autonomously." March 15th to 17th, 2024.
African Librarians empowered to share knowledge and enhance information visibility through AfLIA Wikidata Online Course --> The "Promoting Open Knowledge Practices in African Libraries through Wikidata" project, executed by AfLIA with support from the Wikimedia Foundation, trained African librarians on using Wikidata to enhance the visibility of library collections and close the knowledge and gender gap on Africa. The course was facilitated by experienced African Wikimedian editors and included diverse strategies for learner engagement and support.
Papers:
Increasing Coverage and Precision of Textual Information in Multilingual Knowledge Graphs by (Conia et al, 2023) --> This paper introduces a novel task of automatic Knowledge Graph Enhancement (KGE) to bridge the gap in the quantity and quality of textual information between English and non-English languages in Wikidata. It presents M-NTA, an unsupervised approach that combines Machine Translation, Web Search, and Large Language Models to generate high-quality textual information, and studies its impact on Entity Linking, Knowledge Graph Completion, and Question Answering tasks.
Videos
Wikidata, Wikisource and Wiktionary: Wikisource for DH (WiSe 2023) --> The lecture "Fundamentals and application-oriented methods of the Digital Humanities" by Kay-Michael Würzner is designed as a series of lectures in which teachers in the "Digital Humanities" course present their fields of work and key topics and present them for discussion.
Empowering Open-Source Generative AI by Integrating the Wikidata knowledge graph --> Generative AI has changed the information ecosystem, and open-source knowledge graphs like Wikidata can become invaluable assets, propelling a myriad of applications forward. Jonathan Fraine & Lydia Pintscher present the practical integration of Wikidata's open-source, open-access knowledge graph to empower Generative AI applications. Harnessing the real-time updated, structured data encapsulated within Wikidata, they explore automated content creation, data augmentation, and semantic analysis, underpinning the generative paradigms. Through a blend of theoretical insights and real-world applications, they elucidate how to leverage Wikidata to elevate generative AI applications, breaking down existing data silos, and fostering a collaborative ecosystem within our global community of developers and contributors.
Wiki Indaba 2023 - African content on Wikidata --> Discussion with Alice Kibombo, Georges Fodouop and Jesse Asiedu-Akrofi, about Wikidata for African Librarians during the Wiki Indaba conference, that took place between 3-5 November 2023 in Agadir, Morocco.
No Time to Wait - S07E10 - ACMI // Wikidata - Paul Duchesne + Simon Loffler --> Report on recent residency program to extensively link together collection data from ACMI with Wikidata. This work has allowed the organisation to import vast quantities of data and media to enrich their own internet collection experience, as well enable writing information back to source and federating with other linked institutions.
Map of K-Pop Idols --> An interactive map where each red dot represents a K-pop Idol (a singer or musician in South Korean Pop music) you are able to click on.
Disney as the Mega Corporation it is Today --> Disney has greatly evolved from the simple animation company that first debuted in 1923 with its signature Steamboat Willie animation. This analysis details some of the major acquisitions Disney has chosen to help expand its reach as a media and entertainment company.
State of statues in the US --> Map of how many statues there are, who is depicted in the statues, their genders, and where the statues are concentrated.
An Analysis on Nepo Babies: Net Worths and Fame --> This work uses Wikidata to analyze the influence and success of children of famous actors (nepo babies) in the entertainment industry, and compares the careers and net worth of these children with their parents to understand the impact of nepotism on their success.
Tool of the week
Cersei - is a tool designed for importing or scraping data from various third-party sources, using source-specific Python code. It can use a "headless browser" to scrape complicated websites that rely on eg JavaScript to navigate. It can therefore access data sources that can not be accessed via eg Mix'n'match. The data from sources can be updated regularly, either for everything, or just changed entries (if the source has a "recent changes" equivalent).
Wikidata:Zotero/Cita - is a Wikidata addon for Zotero that adds citations (i.e., what other items an item cites) metadata support to this open source reference management software, using
cites work (P2860) information available from Wikidata, and enabling users to easily contribute missing data.
production manager (manager that is responsible for the administration of a feature film or television production; oversees production plans, controls resources, initiates production, ensures ongoing operations, monitors schedules and expenditures, and creates a detailed production schedule and budget)
Newest
WikiProjects:
WikiProject Städel Museum Wikidata Clean-Up - This WikiProject from the Städel Museum aims to actively participate in the Wikimedia community by maintaining and updating the quality of its data. This includes their collection of public domain art, which has been digitized and made freely available for public use. The project focuses on ensuring that the most current and high-quality data, including high-resolution images and improved metadata, are available on platforms like Wikimedia Commons and Wikidata.
Upcoming: The next
Wikidata+Wikibase office hours will take place on Wednesday, 17:00 UTC, 17th January 2023 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Papers:
Improving maintenance of community-based knowledge graphs. This paper by Nicolas Ferranti addresses the critical issue of data quality in open knowledge graphs, with a specific focus on Wikidata. It aims to formalize Wikidata's unique approaches to assess and resolve data inconsistencies, proposing a semi-automatic refinement pipeline to empower the Wikidata user community in maintaining and enhancing the reliability of this extensive collaborative knowledge graph.
Videos:
WikidataCon 2023 Day 1.5 - The past and future of Wikidata. In this video Lydia Pintscher takes a moment to review the major events of Wikidata over the past few years. Then turns to look forward and predict what Wikidata's prospects will be over the next year.
Tool of the week
WICA: Wikidata's insights for created articles is an updated version of an old tool. It now includes many new features to analyse your list of created articles using Wikidata properties.
Nonprofit Status (Indicating the legal and tax status of a non-profit organization (specific to served legal areas, aka. Countries). Addition to {{P|1454}}. {{P|1628}} to [https://schema.org/nonprofitStatus nonprofitStatus] from schema.org. Organizations can have multiple Nonprofit Status from different countries.)
creative director (person who makes high-level creative decisions, oversees the creation of creative assets such as adverts, products, events or logos and guides and directs the creative people who create the end result)
The next
Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 17th January 2024 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Blogs:
PubChem on Wikidata – What is the state of coverage? by Tiago Lubiana. In summary, Wikidata has good coverage of the structured chemical data in PubChem, though there are improvement points. PubChem displays, and will always display, textual information and vendor-specific data that do not fit Wikidata, but they are complementary tools in the ecosystem of open chemical data.
LIS Journals’ Lack of Participation in Wikidata Item Creation by Eric Willey & Susan Radovsky, discusses the gap of Wikidata items being created for scholarly articles by the scholar's themselves and if this can lead to inconsistent or inaccurate data model.
Quantifying Americanization: Coverage of American Topics in Different Wikipedias: this paper asks whether there is an americanisation bias in the content created by the communities. By Piotr Konieczny & Włodzimierz Lewoniewski.
Videos
Map Kerala Initiative is an opendata portal geospatial map powered by Wikidata and OpenStreetMap, introduced by Manoj Karingamadathil.
Notebooks:
Wikipedia article as a timeline - This tool transforms a Wikipedia article in a timeline by parsing all internal links in a Wikipedia article and retrieving the date corresponding to each internal link using the
point in time (P585) property in Wikidata.
Tool of the week Map your list of created articles - a notebook display of geolocated articles on a map created by a user per chosen project and batch (featured/good article).
Other Noteworthy Stuff
Wikimedia Indonesia and Wikimedia Deutschland ended their partnership within the project
Software Collaboration for Wikidata prematurely. Read their joint statement
here.
IP masking/temporary accounts: We are adjusting Wikibase to be prepared for the upcoming changes to no longer expose IP addresses for non-logged-in users (
phab:T351968)
Dumps/lex. data: We’re adjusting how empty lists of Forms and Senses are represented in JSON dumps (
phab:T305660)
Wikibase REST API:
We finished the work on making it possible to get all sitelinks of an Item (
phab:T344041)
We are working on getting a sitelink for a given wiki (
phab:T344039)
Here's your quick overview of what has been happening around Wikidata over the last week.
Translations are available.
Discussions
New request for comments:
Domain name as data (Summary: How should Wikidata store the domain name associated with an item? There are many properties for URLs, but a domain name is a different value.)
PLW 2024: Provenance loves Wiki - Fri. 12th - Sun. 14th January. If you missed the event, catch up by reading the slides, Notes and watching the recordings on the Project page
Next:
Linked Open Data in Heritage Workshop > Jan. 23rd, 13:00 - 15:00 CET. If you are in the Maastricht University Faculty and want to know enhance heritage research, improve data management, connectivity and visualisation,
register for the Workshop.
AskWikidata: Natural language queries to Wikidata, a naive prototype created by Senior Software Engineer for Wikidata, Robert Timm.
Want to try? (Google Colab)
IP Masking: We are continuing to adapt Wikibase to the upcoming IP Masking feature. We worked on hiding warnings about IP addresses being saved when they don’t apply (
phab:T353807,
phab:T352006) and creating temporary accounts when editing (
phab:T354730)
Wikibase REST API:
We continued working on the ability to get a sitelink for a given site (
phab:T344039)
We started working on the ability to remove a sitelink for a given wiki (
phab:T344685)
We worked on fixing a bug where the REST API PUT request does not handle statement on Items with lowercase statement IDs (
phab:T352644)
mul language code: We did user testing to find any remaining issue before release
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-06-24. Please help
Translate.
Upcoming:
Lexicodays, online event dedicated to Lexemes on Wikidata, will take place on June 28-30. It takes place across time zone and both in English and Indonesian. Check the program and find the access links on the event page.
Wikidata in Prometheus - Prometheus is a non-commercial image archive for art and cultural studies. It hosts images from a variety of image and media databases and now works can be connected with Wikidata.
New
Mix'n'match feature: For lists of (full or auto) matches where both the MnM entry and the Wikidata Item have coordinates, it now shows the distance between them in the description. (
source)
Other Noteworthy Stuff
The Wikidata development team at Wikimedia Deutschland is planning a brief survey to understand the various ways people contribute to the project and identify user contribution patterns. A request has been made for a CentralNotice banner to deploy the survey to a broad audience. Your feedback, comments, and questions on this request are welcome:
m:CentralNotice/Request/Wikidata Community Survey 2024
The second iteration of the
Wikidata:Open Online Course will begin from July 1 until August 11. Whether you're a beginner taking your first steps, an individual in need of a refresher on Wikidata concepts, or a seasoned trainer looking to level up your skills - this course is right for you.
Multilingualism - organizes work around achieving 100% Wikidata multilingualism for every language with MediaWiki internationalization support. It is initiated, developed & supported by Wikimedia Language Diversity community volunteers.
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-01. Please help
Translate.
Discussions
New requests for permissions/Bot:
DifoolBot 4 Task(s) - Split single references containing multiple reference URLs into multiple references.
Bot Bozze Task(s) - Add sitelinks to itwiki draft articles after they've been moved to the main namespace.
New request for comments:
Spelling convention for labels and descriptions in English - RfC started 2024-06-25. This RfC requests feedback and input for finding consistency in spelling convention as English has multiple regional variations.
Past: The
Lexicodays 2024 was an online event designed to offer a discussion space for the Wikidata community about Lexicographical Data. An archive of some of the slides and session recordings are here
c:Category:Lexicodays 2024. More will be added as they become available.
Upcoming:
The next
Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 10th July 2024 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Botany-focused Wikidata online workshop online as part of the #IBC2024. Date: Tuesday 9th July at 9pm NZST (GMT+12) / 11 am central Europe.
Register here!
Press, articles, blog posts, videos
Blogs
Querying for audio on Wikidata - This blog post discusses using SPARQL queries on Wikidata to find audio recordings, focusing on musical compositions and their associated genres.
Diff Blog: Imagining a Wikidata future for librarians together - the sixth and final blog post from the LD42023 conference. Silvia Gutiérrez (WMF) and Giovanna Fontenelle (WMF) document the results of the collaborative session on building a bridge between the Library-Wikidata community and WMF.
Library Knowledge as Linked Data: A Wikidata Approach: Contributing to a shared data commons. David Erlandson describes the experiences of using Wikidata for the pilot Program for Cooperative Cataloging to "accelerate the movement towards ubiquitous identifier creation and identity management at the network level".
User:Zvpunry/CreateNewItem - This is a User script to easily add a new Item while editing a Statement and noticing that the desired Item is missing.
Other Noteworthy Stuff
The second iteration of the
Wikidata:Open Online Course has begun. Class will continue until August 11. Whether you're a beginner taking your first steps, an individual in need of a refresher on Wikidata concepts, or a seasoned trainer looking to level up your skills - this course is right for you.
Newest
WikiProjects:
Inuktitut - This is the space to organize work to assure that the sum of all knowledge and the supporting infrastructure for necessary services are available in Inuktitut (ᐃᓄᒃᑎᑐᑦ, Inuktitut).
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-08. Please help
Translate.
The next
Wikidata+Wikibase office hours will take place on Wednesday, 16:00 UTC on Wednesday, 10th July 2024 (18:00 Berlin time) in the
Wikidata Telegram group. The Wikidata and Wikibase office hours are online events where the development team presents what they have been working on over the past quarter, and the community is welcome to ask questions and discuss important issues related to the development of Wikidata and Wikibase.
Registration for Wikimania 2024 is open! In-person participants: please register until 26 July, 11:59 p.m., UTC. Virtual participants can register anytime. If you received a scholarship from the Wikimedia Foundation, you will receive an email with a registration code and instructions.
Tool of the week
User:Teester/EntityShape.js - a userscript that adds an input box to a Wikidata page wherein you can enter an EntitySchema (such as
E10). When you click "Check", checks whether each statement and property conforms to the schema. It then displays a summary at the top of the Item for each property indicating whether they conform or not. It also adds a badge to each statement and each property on the page indicating whether they conform or not.
Other Noteworthy Stuff
Wikidata teams' development goals for the third quarter of 2024 have been updated:
Wikidata:Development plan
indexer (entity responsible for compiling an index of a book, database, website or other forms of media publications in the form of a methodical arrangement of records designed to enable users to locate information quickly. Example: Hazel K. Bell (Q70226489))
WorldCyclingStats ID (identifier on the website WorldCyclingStats (www.worldcyclingstats.com))
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-15. Please help
Translate.
Discussions
New requests for permissions/Bot:
SpinachBot - Task/s: AI agent-enabled question answering through the creation and execution of complex SPARQL queries on Wikidata. Users tag the bog with wikidata-related questions, and it tries to come up with an answer by iteratively creating SPARQL queries.
An Intelligent System: What I learned through taking an introductory Wikidata course - The author, Anne-Christine Hoff, dispels misconceptions about Wikidata. She highlights that it is a relational communication system, not solely bot-driven, and allows users worldwide to add localized data in multiple languages, creating a self-structuring repository of information
Merge and diff - blog post by Magnus about adding new properties (taxon data speficially, NCBI, GBIF, and iINaturalist) to the AC2WD tool. If you have the user script installed on Wikidata (see tool below), AC2WD will automatically show up on relevant taxon items.
Online workshop Upskilling in Wikidata for maximum impact IBC 2024 - Recording of the virtual Wikidata workshop given on the 9th of July 2024. An onboarding and introductory event in anticipation of a full in-person workshop to be held at the International Botanical Congress 2024 in Madrid on the 21st of July 2024
User:Magnus Manske/ac2wd.js - This script adds an "AC2WD" link in the tools sidebar. When you click on it, it uses the
AC2WD tool to check the item for certain Authority Control IDs (eg VIAF). It then checks these AC datasets for statements (and more AC IDs). It will then add any new information it found as new statements, or add more references to existing statements where possible. A green checkmark will be appended to the link if data was added (reload the page to see), otherwise a "—" if no new data was available.
myfixguide.com (photos about how to disassemble hardware)
indexer (entity responsible for compiling an index of a book, database, website or other forms of media publications in the form of a methodical arrangement of records designed to enable users to locate information quickly)
Newest
WikiProjects:
Human Cells - dedicated to improve our coverage of information about these fundamental entities that compose organisms. The project focuses mainly on two kinds of cell classes: species-agnostic classes, such as
neutrophil (Q188417), and human-specific classes, such as
human neutrophil (Q101405102).
Here's your quick overview of what has been happening around Wikidata over the last week. This is the Wikidata summary of the week before 2024-07-22. Please help
Translate.
Discussions
New requests for permissions/Bot:
DannyS712 bot - Task/s: I want to get approval for a bot with translation admin rights that will automatically mark pages for translations if and only if the latest version is identical to the version that is already in the translation system, i.e. only pages with no "net" changes in the pending edits.
DerIchBot - Task/s: Adding data about schools provided by the German and Austrian governments to wikidata.
DifoolBot 5 - Task/s: Change reference URLs into the related ID property and merge references with the same ID property.
AroundTheBot - Task/s: Automated import of Albanian nouns with IPA from Wiktionary, with the long-term goal of using this data to do pronunciation-based comparison/word evolution between languages.
Past: Wikimedia Indonesia hosted
the 2024 Data Visualization Competition from June 5 to 18. The event featured data visualizations (posters and graphics) and short essays using data from Wikidata. Visit
the competition page (in Indonesian) to view the winning entries.
Upcoming: Wikidata's 12th birthday decentralized events will take place in October and November 2024. Feel free to browse
the documentation pages to learn how to organize an event in your area, get funding, and get in touch with other organizers.
Press, articles, blog posts, videos
Papers:
Representación de datos abiertos con Wikidata Query Service (Q126917814). This paper details the Wikidata Query Service, for the creation of data visualizations. All visualization options available in the WQS are explored, accompanied by example queries that introduce the implementation of these visualizations. By Ángel Obregón-Sierra and Silvia Cecilia Anselmi.
Paulina, a new tool for exploring public domain works.
Other Noteworthy Stuff
WMF Product and Technology Advisory Council (PTAC) invites interested volunteers to apply. As part of the movement strategy recommendation for "Coordinating Across Stakeholders," the PTAC will bring technical contributors and the Wikimedia Foundation together to co-define a more resilient, future-proof technological platform.
indexer (entity responsible for compiling an index of a book, database, website or other forms of media publications in the form of a methodical arrangement of records designed to enable users to locate information quickly)
Reference Verification - a research and development project aimed at helping Wikidata editors check the quality of external references based on various types of AI/ML models.
NZWomenPhotographers - aims to improve information about New Zealand women photographers, based on a dataset provided by the Museum of New Zealand Te Papa Tongarewa.
Newest
database reports:
Items with P569=P570 - Items with instance of (P31) --> human (Q5) and the same year in date of birth (P569) and date of death (P570) (2024-07-22)
Showcase Lexemes:
water (L3302) - (S1) common liquid substance (S2) chemical compound of hydrogen and oxygen (H₂O) (S3) a body of water, usually a river, a lake, or an ocean
Development
mul: We fixed the last blocker for the limited MUL rollout to Wikidata on July 29th (
phab:T362917)
EntitySchemas:
We fixed a misplaced background color in EntitySchema (
phab:T369283)
We’re investigating how to make EntitySchemas searchable by label (
phab:T362005)
Query Service: preparation for the graph split is continuing by the Search Platform Team. We started looking into adapting the constraints checks for it (
phab:T369079)