|
||||
This page has archives. Sections older than 14 days may be automatically archived by Lowercase sigmabot III when more than 4 sections are present. |
This is a page for requesting one-off database queries for certain criteria. Users who are interested and able to perform SQL queries on the projects can provide results from the Quarry website.
You may also be interested in the following:
Quarry does not have access to page content, so queries which require checking wikitext cannot be answered with Quarry. However, someone may be able to assist by using Quarry in another way (e.g. checking the table of category links rather than the "Category:" text) or suggest an alternative tool.
I already have a query, albeit SPARQL, but been running into this issue and I thought somebody could help here. Thanks! Assem Khidhr ( talk) 05:11, 12 May 2024 (UTC)
Hey. I was wondering if this is even possible and if someone here might be able to help with this. The query in User:Jonesey95/self-transcluded-templates collects templates that either have zero transclusions or are only transcluded on the template itself. I'd like to extend this to also the sub-pages of the template.
The new logic would be:
Is this possible? Gonnym ( talk) 09:17, 29 May 2024 (UTC)
What you ask for in your first paragraph and in your bulleted list aren't the same thingsI know... The first paragraph is what we currently have at User:Jonesey95/self-transcluded-templates. What I asked in the bulleted list is what I hoped we could modify it to.
tl_from <> page_id
, tl_from
also has to differ from the page ids of all subpages and any talk page.
Certes (
talk) 17:23, 29 May 2024 (UTC)
Does anyone really know, definitively, what the distinction between (log_type = 'patrol' AND log_action = 'patrol)
and (log_type = 'pagetriage-curation' AND log_action IN ('reviewed', 'reviewed-article', 'reviewed-redirect'))
is? In particular, why do you variously get one, the other, or both even for patrols by the same person within minutes of each other (example:
both,
pagetriage only,
patrol only); exactly which on-wiki actions result in those cases; and is there a good way to distinguish genuine multiple reviews of the same title without doublecounting the various log_type/log_action combinations (perhaps group by log_page)?
(Context is Wikipedia:Requests for adminship/Elli#General comments, starting with User:Hey man im josh's comment at 21:42, 31 May 2024; my best guesses for the answers are in the description of quarry:query/83443.) — Cryptic 00:05, 1 June 2024 (UTC)
Any way to speed up this page table query? CirrusSearch does it fast, but maybe elasticsearch has its own indexing or something to help make it fast. – Novem Linguae ( talk) 18:08, 15 June 2024 (UTC)
WHERE page_namespace IN (0,1,3,4,5,6,7,9,10,11,12,13,14,15,100,101,118,119,710,711,828,829)
, but that doesn't work here. So you're going to have to partition it into several (well, lots of) queries that look small enough that the optimizer uses the index instead of a full table scan - WHERE page_namespace = 0 AND page_title < 'M'
, WHERE page_namespace =0 AND page_title >= 'M'
, and so on for the other possible namespaces. —
Cryptic 00:38, 22 June 2024 (UTC)
I realize PetScan can be used to show links that appear on two pages, but I have a need for it to show such a result sorted by the order in which they appear on the first page. Based on my limited understanding of the database, I don't think the order of the links on a page are tracked. But just in case I'm wrong, can someone show me a query to do this? Or suggest an alternative approach? Thanks. Stefen Towers among the rest! Gab • Gruntwerk 23:49, 22 June 2024 (UTC)
|
||||
This page has archives. Sections older than 14 days may be automatically archived by Lowercase sigmabot III when more than 4 sections are present. |
This is a page for requesting one-off database queries for certain criteria. Users who are interested and able to perform SQL queries on the projects can provide results from the Quarry website.
You may also be interested in the following:
Quarry does not have access to page content, so queries which require checking wikitext cannot be answered with Quarry. However, someone may be able to assist by using Quarry in another way (e.g. checking the table of category links rather than the "Category:" text) or suggest an alternative tool.
I already have a query, albeit SPARQL, but been running into this issue and I thought somebody could help here. Thanks! Assem Khidhr ( talk) 05:11, 12 May 2024 (UTC)
Hey. I was wondering if this is even possible and if someone here might be able to help with this. The query in User:Jonesey95/self-transcluded-templates collects templates that either have zero transclusions or are only transcluded on the template itself. I'd like to extend this to also the sub-pages of the template.
The new logic would be:
Is this possible? Gonnym ( talk) 09:17, 29 May 2024 (UTC)
What you ask for in your first paragraph and in your bulleted list aren't the same thingsI know... The first paragraph is what we currently have at User:Jonesey95/self-transcluded-templates. What I asked in the bulleted list is what I hoped we could modify it to.
tl_from <> page_id
, tl_from
also has to differ from the page ids of all subpages and any talk page.
Certes (
talk) 17:23, 29 May 2024 (UTC)
Does anyone really know, definitively, what the distinction between (log_type = 'patrol' AND log_action = 'patrol)
and (log_type = 'pagetriage-curation' AND log_action IN ('reviewed', 'reviewed-article', 'reviewed-redirect'))
is? In particular, why do you variously get one, the other, or both even for patrols by the same person within minutes of each other (example:
both,
pagetriage only,
patrol only); exactly which on-wiki actions result in those cases; and is there a good way to distinguish genuine multiple reviews of the same title without doublecounting the various log_type/log_action combinations (perhaps group by log_page)?
(Context is Wikipedia:Requests for adminship/Elli#General comments, starting with User:Hey man im josh's comment at 21:42, 31 May 2024; my best guesses for the answers are in the description of quarry:query/83443.) — Cryptic 00:05, 1 June 2024 (UTC)
Any way to speed up this page table query? CirrusSearch does it fast, but maybe elasticsearch has its own indexing or something to help make it fast. – Novem Linguae ( talk) 18:08, 15 June 2024 (UTC)
WHERE page_namespace IN (0,1,3,4,5,6,7,9,10,11,12,13,14,15,100,101,118,119,710,711,828,829)
, but that doesn't work here. So you're going to have to partition it into several (well, lots of) queries that look small enough that the optimizer uses the index instead of a full table scan - WHERE page_namespace = 0 AND page_title < 'M'
, WHERE page_namespace =0 AND page_title >= 'M'
, and so on for the other possible namespaces. —
Cryptic 00:38, 22 June 2024 (UTC)
I realize PetScan can be used to show links that appear on two pages, but I have a need for it to show such a result sorted by the order in which they appear on the first page. Based on my limited understanding of the database, I don't think the order of the links on a page are tracked. But just in case I'm wrong, can someone show me a query to do this? Or suggest an alternative approach? Thanks. Stefen Towers among the rest! Gab • Gruntwerk 23:49, 22 June 2024 (UTC)