This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page.
Have you considered moving STiki to an instance @ Wikimedia Labs? One advantage of doing this is that you can recruit other Labs users (stakeholders in the functioning of STiki) to help with maintenance and testing new features. If you're interested, I'd be happy to help out. Given Snuggle's reliance on STiki, I'd also volunteer to help with maintenance. -- EpochFail( talk • work) 15:48, 23 May 2013 (UTC)
Snuggle, the newcomer socialization tool I've been building, is finally ready for general use. All you need to do to get started is point your browser to https://snuggle.grouplens.org. Let me know if you run into any trouble. I'll be watching WT:Snuggle. Or you can also just contact me directly. Thanks for your patience.
-- EpochFail( talk • work) 19:46, 14 June 2013 (UTC)
I don't even know where to begin this week. I can only assume that we're being spammed, particularly by a redlink generator with a fondness for Polish. I don't even know if I can comfortably narrow it down. Serendipodous 06:03, 7 July 2013 (UTC)
Sorry, I misunderstood. My email addresses can easily be found at the bottom of my professional homepage. Thanks, West.andrew.g ( talk) 14:27, 11 July 2013 (UTC)
It hasn't been updated since the 23rd. This is potentially ruinous, as it means we can't check to see if a spike follows a human pattern. Serendipodous 06:17, 27 July 2013 (UTC)
I was thinking about getting the weekly WP:Topred (and a potential variant) some community traffic. To do this, I'll rehash an old (possibly bad) idea to make things newsworthy for the Signpost. Maybe you could publish one really long list? Maybe it would capture everything down to 10 page views per week? I remember you saying it could take a ridiculous amount of computational time to do something like this, so maybe this could be a once-a-year or biannual exercise. Maybe an op-ed encouraging the creation of articles (and redirects) that people want could be of interest to the community. Biosthmors ( talk) 16:06, 13 August 2013 (UTC)
Did you get any help from the analytics people? Serendipodous 08:38, 18 August 2013 (UTC)
Hey Doc. West, I'm contacting you about a study that I'm running with TheOriginalSoni exploring newcomer mentorship activities in Wikipedia. I'd like to ask you a few questions about your interactions with newcomers and to explore how a tool like WP:Snuggle might make mentoring work easier. The interview and demo session will take 30 minutes to an hour depending on how much time we spend discussing things. If you're interested, let me know.
Thanks for your consideration. -- EpochFail ( talk • contribs) 15:03, 31 August 2013 (UTC)
I created WP:WMF and then linked to m:Grants, which looks like it could use a link to flow funding? Biosthmors ( talk) 10:23, 1 September 2013 (UTC)
:Or if these flow funds are only to support the English Wikipedia, perhaps listing it at WP:WMF is better. Biosthmors ( talk) 10:23, 1 September 2013 (UTC)
Never mind. I see that flow funding was a pilot, so maybe it's not beneficial to add a link. Meanwhile, I'm curious which Wikipedia-space pages get the most hits. Could you publish a weekly 1000, 2000, or 5000 list, perhaps? Best regards. Biosthmors ( talk) 11:25, 5 September 2013 (UTC)
in that rant about doing the Top25 alone. It's just that, given the number of viewers, and the number of people roasting me over it, it would be nice if some were there to help instead of rant. Serendipodous 15:38, 11 September 2013 (UTC)
Hello. There is currently a discussion at Wikipedia:Administrators' noticeboard/Incidents regarding an issue with which you may have been involved. Thank you. Tariqmudallal ( talk) 01:00, 22 September 2013 (UTC)
I was looking at the list for most revised pages on Wikipedia and found you made 188,938 edits to a single page, West.andrew.g/Dead links ...but it looks like this project was discontinued in January 2013. I'm just curious what this page was for and how you could possibly carry out that many edits, even with the use of a bot.
Thanks for any information you can offer, West.andrew.g. Liz Read! Talk! 16:30, 30 September 2013 (UTC)
Hello West.andrew.g! I'm a big fan of your work on Wikipedia traffic stats (well, who isn't? ;-). Maybe you remember that some time ago I wrote to you a little about German WP article traffic and TV events: the second screen effect. The German journalist who made the second screen/Wikipedia analysis I mentioned has since built a Wikipedia traffic trends website, http://wikipedia.trending.eu/de/index.html, and he also wrote a blog post about your traffic analysis.
BTW, it seems that your Signpost WP traffic report has inspired the Spiegel (Germany's biggest news magazine and online news website) to publish Wikipedia traffic trend reports almost every week since June 2013. See:
(Try Google Translate, that should work OK for German -> English)
Another story that might interest you is that in September 2012, German Wikipedia put a link to the traffic stats on every single Wikipedia page (including user namespace etc.). It's the link "Abrufstatistik" at the bottom, and it links to the corresponding page views at stats.grok.se. A little later, an SEO presentation appeared online with new tips on how to successfully spam external links on deWP. Actually it's quite hard to get SEO links on deWP; Wikipedians are very vigilant about spam. And deWP has flagged revisions, so that the added link is only visible to readers if the edit was patrolled by an editor -> the edit will likely be reverted, unless the link goes to content that seems legitimate. The SEO how-to therefore recommends using the traffic stats to concentrate the effort on the most effective, popular target article. Workflow: identify a suitable Wikipedia article with lots of traffic on your topic -> create content (i.e. translate parts of the English WP article into German and embellish) and publish it on your website -> introduce the information in Wikipedia -> backlink from Wikipedia.
Anyway, I recently reread your Wikipedia:Wikipedia_Signpost/2013-02-04/Special_report. You wrote about the power law distribution of article traffic: "The top 25 most viewed pages represent 4% of all total views, and the top 5000 represent 19% of all views. Though the distribution has an extremely long tail, the top 5000 data provides an opportunity to locate popular but poorly written articles that need attention, as opposed to randomly selecting one of the 4.15 million remaining articles on the project." That's a great starting point for quality control. I guess the page view distribution is similar on German Wikipedia? Maybe with less power ;-) ? I wonder if you would be interested in making an analysis of the deWP traffic pattern and a comparison of enWP and deWP? I think the TV/second screen effect could also be worth another look to compare deWP/enWP. I suspect the effect is also happening on enWP, only it is diluted very much by the more global audience. What do you think? I could help and translate (also for the German Kurier) if you think this would be something for the Signpost. -- Atlasowa ( talk) 23:25, 7 October 2013 (UTC)
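The quoted shares (top 25 pages at 4% of views, top 5000 at 19%) are cumulative fractions of a ranked view count. A minimal Python sketch of that calculation, which could be run on deWP and enWP counts alike, follows. The function name `top_k_share` and the sample numbers are hypothetical; they are not taken from the Signpost report or from West.andrew.g's processing code.

```python
def top_k_share(view_counts, k):
    """Return the fraction of all views captured by the k most viewed pages."""
    # Rank pages from most to least viewed.
    ranked = sorted(view_counts, reverse=True)
    total = sum(ranked)
    if total == 0:
        return 0.0
    # Share of total traffic held by the top k entries.
    return sum(ranked[:k]) / total


# Hypothetical per-page view counts, for illustration only.
sample_views = [50, 30, 10, 5, 5]
print(top_k_share(sample_views, 2))
```

Comparing this share at the same k for two language editions would give a rough sense of whether one distribution is more top-heavy than the other.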
As I mention there in the last paragraph in italics, how do you get the WP:5000 to display quality assessments adjacent to the names? Thanks. Biosthmors ( talk) pls notify me (i.e. {{ U}}) while signing a reply, thx 22:27, 9 October 2013 (UTC)
Someone over at the Traffic report talk page suggested that bounce rate stats could determine whether website article views are due to mistaken clicks or genuine interest, but I'm not sure what bounce rate stats are. Have you heard of them? Serendipodous 19:58, 12 October 2013 (UTC)
Update's a bit late this week; just making sure everything's OK at your end. Serendipodous 10:52, 20 October 2013 (UTC)
debugging
Thank you for the "popular redlinks" list and for fixing the "+" bug in it. -- rybec 23:10, 4 November 2013 (UTC)
Hi Andrew. We are having a discussion on how to implement a local user permission system for a script that is used at WP:AfC. How do you do this with STiki, and is it independent of MediaWiki? Forgive my ignorance, but I am not a programmer. Regards, Kudpung กุดผึ้ง ( talk) 05:50, 31 October 2013 (UTC)
- Wikipedia:AFCR, instructions for budding AfC reviewer-candidates (general)
- Wikipedia:AFCH, instructions for budding AfC reviewer-candidates (the helper-gadget)
- Wikipedia:WikiProject_Articles_for_creation/Helper_script, helpdocs for the gadget
- MediaWiki:Gadget-afchelper-beta.js, source code for the gadget
I need to know because it will upend my schedule. Serendipodous 17:59, 27 November 2013 (UTC)
Hoi, I was told of your popular redlinks. I want to know a few things about the processing that you do. Is this something that can be done for every language? If so, could this become a tool that is available in the labs environment??
FYI this is the kind of information that really shows people what to concentrate on when they want to make a difference in the service we provide.
Thanks, GerardM ( talk) 16:30, 28 November 2013 (UTC)
A discussion concerning the creation of improbable redirects, related to a page or pages you created, has started at Wikipedia:Administrators' noticeboard/Archive257#Mass creation of very improbable redirects. Fram ( talk) 11:18, 29 November 2013 (UTC)
\x encoding is a redlink issue (despite non-technical consensus going otherwise). As I have demonstrated several times previously, the stats.grok.se page view tool has issues and is not the ground-truth everyone makes it out to be. (2) The fact I do not support the mass creation of redirects in order to patch the errors of misconfigured external software. West.andrew.g ( talk) 18:31, 29 November 2013 (UTC)
It's interesting to look at your popular pages list, but I'm wondering: why are there underscores in the list? It didn't use to be that way. Could this issue be fixed? It felt much easier to read without them. UsefulWikipedia ( talk) 04:16, 5 December 2013 (UTC)
I'm thinking about doing a year-end review for the Signpost. Is it possible to generate a list for all of 2013? Serendipodous 09:52, 2 December 2013 (UTC)
Hey, Andrew,
I was wondering if you had an archive of the popular pages chart... I looked into your subpages listing and I didn't see anything, but I thought I'd ask. Also, do you do a year-end type of list, for all of 2013? Or is this something that Wikipedia issues itself? I'm sure there is interest in it. Thanks for all of your work!
Liz Read! Talk! 12:45, 9 December 2013 (UTC)
Hi West.andrew.g.
I'm an admin at the Hebrew Wikivoyage and I am very interested in creating a list of the most popular articles in the Hebrew Wikipedia (I am hoping such a list would help our small community better decide which articles we should expand ASAP based on their popularity in the Hebrew Wikipedia). I understand that you are not interested in creating any such lists for other language editions of Wikipedia BUT that you are willing to share your processing code. I am very interested in trying to run it (I am hoping it won't be too complicated). ויקיג'אנקי ( talk) 06:30, 16 December 2013 (UTC)
Recently, on the STiki talk page you said, "Between STiki and ClueBotNG, we've got large parts of the problem space covered using some pretty intelligent machinery. That being said, both of these projects are going to appreciate any volunteer assistance they can receive." Apart from using STiki, I am interested to understand both processes better, and maybe contribute more. Do you have anything in mind? -- Greenmaven ( talk) 01:31, 20 December 2013 (UTC)
@ Jack Greenmaven: -- It's a more complicated set of interdependencies. The autonomous work of Cluebot feeds into the reputations of the "metadata" algorithm. The cumulative body of human effort (regardless of source queue) is used to retrain that "metadata" algorithm/queue. I have provided this set of human classifications to the CBNG folks (as a one-time dump) so they could use it in a similar fashion as they see appropriate. To what extent this is done, if at all, I am unsure. I know those folks aren't fond of the fact the STiki-classified set is non-representative of its bot workload -- a fair argument -- and why they have sought to create a representative corpus offline. West.andrew.g ( talk) 04:46, 24 December 2013 (UTC)
Greetings everyone. I've recently posted over User_talk:West.andrew.g/Popular_pages, but I realize some of my watchlisters might not follow both pages. After much processing has been brought to bear, I've aggregated all the page view statistics for 2013. I thought these would likely be of general interest, and I'd appreciate if others would re-post to relevant discussion pages and forums (on or off wiki; Reddit and some others picked up on my last effort in this vein).
ARTICLE | VIEWS
--------------------------------------
[[Main_Page]] | 3,895,581,597
[[Facebook]] | 30,608,777
[[Deaths_in_2013]] | 21,246,624
[[Breaking_Bad]] | 17,389,161
[[Google]] | 16,759,294
[[World_War_II]] | 16,676,636
[[Wiki]] | 16,285,560
[[YouTube]] | 15,938,076
ARTICLE | UTC DATE | VIEWS | REASON
----------------------------------------------------------------------
[[Jorge_Bergoglio]] | March 13, 2013 | 1,460,586 | Papal ascension
[[Shakuntala_Devi]] | November 4, 2013 | 766,256 | Google Doodle
[[Paul_Walker]] | December 1, 2013 | 752,770 | Death
[[Grace_Hopper]] | December 9, 2013 | 621,694 | Google Doodle
[[Nelson_Mandela]] | December 5, 2013 | 484,966 | Death
[[Jodie_Foster]] | January 14, 2013 | 451,270 | Came out at Golden Globes
[[Beyonc%C3%A9_Knowles]] | February 4, 2013 | 378,923 | Super bowl halftime
[[Nicolaus_Copernicus]] | February 19, 2013 | 336,836 | Google Doodle
[[Seth_MacFarlane]] | February 25, 2013 | 320,999 | Hosted the Oscars
[[Daniel_Day-Lewis]] | February 25, 2013 | 318,839 | Oscars
[[Society_of_Jesus]] | March 13, 2013 | 287,568 | Papal ascension
[[Mindy_McCready]] | February 18, 2013 | 282,679 | Death
[[Hermann_Rorschach]] | November 8, 2013 | 276,072 | Google Doodle
[[Edith_Head]] | October 28, 2013 | 263,915 | Google Doodle
[[Raymond_Loewy]] | November 5, 2013 | 258,301 | Google Doodle
[[Margaret_Thatcher]] | April 8, 2013 | 252,906 | Death
[[Pope_Francis]] | March 13, 2013 | 248,753 | Papal ascension
[[Peter_Capaldi]] | August 4, 2013 | 244,667 | Announced as next Dr. Who
Thanks everyone. West.andrew.g ( talk) 17:15, 13 January 2014 (UTC)
Hi there, looks like #360 on your list should be linking to Cancún but the accented character has caused the link to fail. I have seen such boxes before with Czech diacritics, I don't know how to fix it but hoping you may! Thanks, C 679 19:22, 3 February 2014 (UTC)
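A hedged illustration of the likely cause: page-view data can carry titles in percent-encoded form with underscores (the 2013 list above includes [[Beyonc%C3%A9_Knowles]]), so a link to a title such as Cancún breaks if the escape sequence is not decoded as UTF-8. The Python sketch below shows one way to normalise such a title; `display_title` is a hypothetical helper and is not part of the actual report generator.

```python
from urllib.parse import unquote


def display_title(raw_title: str) -> str:
    """Turn a raw, percent-encoded page title into a readable one."""
    # unquote() decodes %XX escapes as UTF-8 by default;
    # underscores are then swapped for spaces for display.
    return unquote(raw_title).replace("_", " ")


print(display_title("Canc%C3%BAn"))
print(display_title("Beyonc%C3%A9_Knowles"))
```

Titles that contain no escapes pass through unchanged apart from the underscore replacement.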
WikiAudit cannot be used on Windows 7. 9shi ( talk) 07:10, 4 February 2014 (UTC) (9shi on zh wiki)
It's great! All us sight-deprived people thank you. Coretheapple ( talk) 17:14, 13 February 2014 (UTC)
... for fixing the silly barnstar mistake on Flyer22's talk page. Further proof that I need new reading glasses. Widr ( talk) 18:43, 17 February 2014 (UTC)
First paragraph. :-) I'd be very interested if you'd like to write another analysis of the trends. Ed [talk] [majestic titan] 23:33, 28 January 2014 (UTC)
See at WP:STiki/milestones - Ugog Nizdast ( talk) 07:27, 23 March 2014 (UTC)
Hello,
Thanks for the welcome message. I have actually used Stiki before but under old account names. I just noticed on the Stiki leaderboard that those two account names are listed separately: "Gold Standard" and "Athleek123". Could you put these two together (and three once the leaderboard updates my latest uses with my current username)?
Thanks,
The Cascadian 04:08, 25 April 2014 (UTC)
Hello! I think the WP:TOPRED tool might be particularly helpful for Wiktionaries... Can you please include data from the Wiktionaries in your top list? Is your algorithm for extracting redlinks open source? Can I see it somewhere? Thank you. -- Grenadine ( talk) 20:27, 29 April 2014 (UTC)
Hello there, a proposal regarding pre-adminship review has been raised at Village pump by Anna Frodesiak. Your comments there are very much appreciated. Many thanks. Jim Carter through MediaWiki message delivery ( talk) 06:47, 28 May 2014 (UTC)
Just checking. :-) Serendipodous 09:43, 1 June 2014 (UTC)
Hello Andrew, in the top 5000 articles list and stats.grok.se, I'm seeing weird articles I don't expect to see in that list. For example, Le Cordon Bleu College of Culinary Arts Atlanta, Alexandria, Virginia, foods and other topics aren't that popular; they've been placed incorrectly. Is the counter counting pageviews correctly? I searched for this issue and it says it includes non-human views. In the Wikipedia article traffic statistics, I'm seeing some glitchy page names with nonsense characters. Why has this been happening for several months, and can it be fixed? It didn't use to be that way. I noticed this did not happen before 2013.
And also, does your list support articles with colons in their name, such as Call of Duty: Advanced Warfare? I'm sure that topic will be up there. A Great Catholic Person ( talk) 06:41, 8 June 2014 (UTC)
Alright, I understand that now. I'll check this weekend. By the way, can stats.grok.se and stats-classic.grok.se (the old version) be reverted to December 2010 data, rather than the 2014 data? Visitors are going to wonder what those weird articles and glitches are, at least until the DoS attacks get fixed. I don't mind how outdated it is. I can't ask Henrik because he is not responding. He once said it can't be done because it needs code changes, but I don't mind a code change one last time. I don't mind outdated data, and I forgot to check 2010 rankings of articles I wanted to check.
And... is it possible to generate the list of 2013 popular pages, but with the colons? Also, could every report you've generated from October 2012 to now be recreated but with colons? I don't want to wait until 2015 for data without colons, and I want to see what some titles with colons ranked in the top 10,000 last year.
And, also, because technology is growing, can this start counting pageviews from mobile devices and other machines? A Great Catholic Person ( talk) 02:58, 11 June 2014 (UTC)
Alright, I read all that, but both Henrik and Killiondude (another one who has an FAQ page) are both down. I don't care about how outdated the stats-classic.grok.se is, I want the old December 2010 data back. Plus, Killiondude's FAQ has a link to October 2009 data for Michael Jackson. It won't work. Just one code change is good enough, the design should have been "phased out" already with 2010. The link comes up with 0 pageviews, plus I prefer the old design for viewing data from any time. The top 1000 list date I can take will be from December 2009 (it's fine!) to December 2011. I also forgot to check at least a lot of articles' rankings in the 2010 version, and I want to know them badly. I'm very curious because the classic has 2014 data. I don't need an updated one. The old one is okay. Should I wait until Henrik is back? Because it will take so long. A Great Catholic Person ( talk) 22:23, 21 June 2014 (UTC)
I'm not meaning generate those reports, I just want the stats classic version's top 1000 list to change from January 2014 to December 2010. A Great Catholic Person ( talk) 16:59, 22 June 2014 (UTC)
Apologies for being late in posting here (there are about 1.24 days left in the conference), but I am at Wikimania in London. User:Jmh649 gave a presentation about WP:MED and utilized some of the recent statistical work I've done for him (and an academic paper is in progress). Besides that, I've largely been hanging out in the research/analytical/"social machines" tracks. If you're a user/supporter/fan of WP:STiki, WP:5000, or anything else I've done -- I'd love to meet you, so don't hesitate to reach out. West.andrew.g ( talk) 15:27, 9 August 2014 (UTC)
I know all what you said above, but how about you try to tune your popular pages lists to not include the popular redlinks and non-human pageviews to your lists? Also, blackout articles ( Lycos, Ddd, Alexandria, Virginia, etc and others being attacked) to your lists, especially for a 2014 top list. I understand, but they are okay to be on other lists. I like seeing what are the top articles of the week are, but now they are replaced with BS articles. If this continues, I can't trust your data anymore. A Great Catholic Person ( talk) 01:00, 4 August 2014 (UTC)
I haven't seen the top 5000 or any other of your reports updating since August 14. A Great Catholic Person ( talk) 16:15, 17 August 2014 (UTC)
A significant statistical issue has come to my attention. Quite simply, the WMF does not record/report per-article mobile views, and thus they are unavailable for my aggregation....
The complete write-up is at User_talk:West.andrew.g/Popular_pages#STICKY:_On_the_Non-Reporting_of_Mobile_Views.
Please consolidate all discussion at that location. Thanks, West.andrew.g ( talk) 18:42, 4 September 2014 (UTC)
Ed [talk] [majestic titan] 16:24, 9 September 2014 (UTC)
This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 5 | Archive 6 | Archive 7 | Archive 8 | Archive 9 | Archive 10 |
Have you considered moving STiki to an instance @ Wikimedia Labs? One advantage of doing this is that you can recruit other Labs users (stakeholders in the functioning of STiki) to help with maintenance and testing new features. If you're interested, I'd be happy to help out. Given Snuggle's reliance on STiki, I'd also volunteer to help with maintenance. -- EpochFail( talk • work) 15:48, 23 May 2013 (UTC)
Snuggle, the newcomer socialization tool I've been building, is finally ready for general use. All you need to do to get started is point your browser to https://snuggle.grouplens.org. Let me know if you run into any trouble. I'll be watching WT:Snuggle. Or you can also just contact me directly. Thanks for your patience.
See also:
-- EpochFail( talk • work) 19:46, 14 June 2013 (UTC)
I don't even know where to begin this week. I can only assume that we're being spammed, particularly by a redlink generator with a fondness for Polish. I don't even know if I can comfortably narrow it down. Serendi pod ous 06:03, 7 July 2013 (UTC)
Sorry, I misunderstood. My email addresses can easily be found at the bottom of my professional homepage. Thanks, West.andrew.g ( talk) 14:27, 11 July 2013 (UTC)
It hasn't been updated since the 23rd. This is potentially ruinous, as it means we can't check to see if a spike follows a human pattern. Serendi pod ous 06:17, 27 July 2013 (UTC)
I was thinking about getting the weekly WP:Topred – and a potential variant – community traffic. To do this, I'll rehash an old (possibly bad) idea to make things newsworthy for the Signpost. Maybe you could publish one really long list? Maybe it would capture everything down to 10 page views per week? I remember you saying it could take a ridiculous amount of computational time to do something like this, so maybe this could be a once a year or a bi-annual exercise. Maybe an op-ed to the community to encourage the creation of articles (and redirects) that people want could be of interest to the community. Biosthmors ( talk) 16:06, 13 August 2013 (UTC)
Did you get any help from the analytics people? Serendi pod ous 08:38, 18 August 2013 (UTC)
Hey Doc. West, I'm contacting you about a study that I'm running with TheOriginalSoni exploring newcomer mentorship activities in Wikipedia. I'd like to ask you a few questions about your interactions with newcomers and to explore how a tool like WP:Snuggle might make mentoring work easier. The interview and demo session will take 30 minutes to an hour depending on how much time we spend discussing things. If you're interested, let me know.
Thanks for your consideration. -- EpochFail ( talk • contribs) 15:03, 31 August 2013 (UTC)
I created
WP:WMF and then linked to
m:Grants, which looks like it could use a link to flow funding?
Biosthmors (
talk) 10:23, 1 September 2013 (UTC)
:Or if these flow funds are only to support the English Wikipedia, perhaps listing it at WP:WMF is better.
Biosthmors (
talk) 10:23, 1 September 2013 (UTC)
Nevermind. I see that flow funding was a pilot so maybe it's not beneficial to add a link. Meanwhile, I'm curious which Wikipedia space pages get the most hits. Could you publish a weekly 1000, 2000, or 5000 list, perhaps? Best regards. Biosthmors ( talk) 11:25, 5 September 2013 (UTC)
in that rant about doing the Top25 alone. It's just that, given the number of viewers, and the number of people roasting me over it, it would be nice if some were there to help instead of rant. Serendi pod ous 15:38, 11 September 2013 (UTC)
Hello. There is currently a discussion at Wikipedia:Administrators' noticeboard/Incidents regarding an issue with which you may have been involved. Thank you. Tariqmudallal ( talk) 01:00, 22 September 2013 (UTC)
I was looking at the list for most revised pages on Wikipedia and found you made 188,938 edits to a single page,
West.andrew.g/Dead links ...but it looks like this project was discontinued in January 2013. I'm just curious what this page was for and how you could possibly carry out that many edits, even with the use of a bot.
Thanks for any information you can offer,
West.andrew.g.
Liz
Read!
Talk! 16:30, 30 September 2013 (UTC)
Hello West.andrew.g! I'm a big fan of your work on Wikipedia traffic stats (well, who isn't? ;-). Maybe you remember that some time ago i wrote to you a little about german WP article traffic and TV events: the second screen effect. The german journalist that made the second screen/Wikipedia analysis i mentioned has since built a Wikipedia traffic trends website http://wikipedia.trending.eu/de/index.html and he also wrote a blogpost about your traffic analysis.
BTW, it seems that your Signpost WP traffic report has inspired the Spiegel (Germany's biggest news magazine and online news website) to publish Wikipedia traffic trend reports almost every week since june 2013. See:
(Try google translate, that should work OK for german -> english)
Another story that might interest you is that in September 2012, german Wikipedia put a link to the traffic stats on every single Wikipedia page (including user namespace etc.). It's the link "Abrufstatistik" at the bottom, and it links to the corresponding page views at stat.grok.se. A little later, a SEO presentation appeared online with new tips how to successfully spam external links on deWP. Actually it's quite hard to get SEO links on deWP, Wikipedians are very vigilant about spam. And deWP has flagged revisions, so that the added link is only visible to readers if this edit was patrolled by an editor -> edit will likely be reverted, unless the link goes to content that seems legitimate. The SEO how-to therefor recommends to use the traffic stats to concentrate the effort on the most effective, popular target article. Work-Flow: identify suitable Wikipedia article with lots of traffic and your topic -> Create content (i.e. translate english WP article parts in german and embellish) and publish on your website -> introduce the information in Wikipedia -> Backlink from Wikipedia.
Anyway, I recently reread your Wikipedia:Wikipedia_Signpost/2013-02-04/Special_report. You wrote about the power law distribution of article traffic: "The top 25 most viewed pages represent 4% of all total views, and the top 5000 represent 19% of all views. Though the distribution has an extremely long tail, the top 5000 data provides an opportunity to locate popular but poorly written articles that need attention, as opposed to randomly selecting one of the 4.15 million remaining articles on the project." That's a great starting point for quality control. I guess the page view distribution is similar on german Wikipedia? Maybe with less power ;-) ? I wonder if you would be interested in making an analysis of deWP traffic pattern and a comparison of enWP and deWP? I think the TV/second screen effect could also be worth another look to compare deWP/enWP. I suspect the effect is also happening on enWP, only it is diluted very much by the more global audience. What do you think? I could help and translate (also for the german Kurier) if you think this would be something for the Signpost. -- Atlasowa ( talk) 23:25, 7 October 2013 (UTC)
As I mention there in the last paragraph in italics, how do you get the WP:5000 to display quality assessments adjacent to the names? Thanks. Biosthmors ( talk) pls notify me (i.e. {{ U}}) while signing a reply, thx 22:27, 9 October 2013 (UTC)
Someone over at the Traffic report talk page suggested that bounce rate stats could determine whether website article views are due to mistaken clicks or genuine interest, but I'm not sure what bounce rate stats are. Have you heard of them? Serendi pod ous 19:58, 12 October 2013 (UTC)
Update's a bit late this week, making sure everything's OK at your end. Serendi pod ous 10:52, 20 October 2013 (UTC)
debugging | |
Thank you for the "popular redlinks" list and for fixing the "+" bug in it. — rybec 23:10, 4 November 2013 (UTC) |
Hi Andrew. We are having a discussion on how to implent a local user permission system for a script that is used at WP:AfC. How do you do this with Stiki, and is it independent of MedWiki? Forgive my ignorance, but I am not a programmer. Regards, Kudpung กุดผึ้ง ( talk) 05:50, 31 October 2013 (UTC)
- Wikipedia:AFCR, instructions for budding AfC reviewer-candidates (general)
- Wikipedia:AFCH, instructions for budding AfC reviewer-candidates (the helper-gadget)
- Wikipedia:WikiProject_Articles_for_creation/Helper_script, helpdocs for the gadget
- MediaWiki:Gadget-afchelper-beta.js, source code for the gadget
I need to know because it will upend my schedule. Serendi pod ous 17:59, 27 November 2013 (UTC)
Hoi, I was told of your popular redlinks. I want to know a few things about the processing that you do. Is this something that can be done for every language? If so, could this become a tool that is available in the labs environment??
FYI this is the kind of information that really shows people what to concentrate on when they want to make a difference in the service we provide.
Thanks, GerardM ( talk) 16:30, 28 November 2013 (UTC)
A discussion concerning the creation of improbable redirects, related to a page or pages you created, has started at Wikipedia:Administrators' noticeboard/Archive257#Mass creation of very improbable redirects. Fram ( talk) 11:18, 29 November 2013 (UTC)
\x
encoding is a redlink issue (despite non-technical consensus going otherwise). As I have demonstrated several times previously, the stats.grok.se page view tool has issues and is not the ground-truth everyone makes it out to be. (2) The fact I do not support the mass creation of redirects in order to patch the errors of misconfigured external software.
West.andrew.g (
talk) 18:31, 29 November 2013 (UTC)
It's interesting to look at your popular pages list, but, I'm wondering, why are there underscores in the list, it didn't use to be that way. Could this issue be fixed? It felt much easier to read without them. UsefulWikipedia ( talk) 04:16, 5 December 2013 (UTC)
I'm thinking about doing a year end review for the Signpost. Is it possible to generate a list for all of 2013? Serendi pod ous 09:52, 2 December 2013 (UTC)
Hey, Andrew,
I was wondering if you had an archive of the popular pages chart... I looked into your subpages listing and I didn't see anything, but I thought I'd ask. Also, do you do a year-end type of list, for all of 2013? Or is this something that Wikipedia issues itself? I'm sure there is interest in it. Thanks for all of your work!
Liz Read! Talk! 12:45, 9 December 2013 (UTC)
Hi West.andrew.g.
I'm an admin at the Hebrew Wikivoyage and I am very interested in creating a list of the most popular articles in the Hebrew Wikipedia (I am hoping such a list would help our small community better decide which articles we should expand ASAP based on their popularity in the Hebrew Wikipedia). I understand that you are not interested in creating any such lists for other language editions of Wikipedia, but that you are willing to share your processing code. I am very interested in trying to run it (I am hoping it won't be too complicated). ויקיג'אנקי ( talk) 06:30, 16 December 2013 (UTC)
Recently, on the STiki talk page you said, "Between STiki and ClueBotNG, we've got large parts of the problem space covered using some pretty intelligent machinery. That being said, both of these projects are going to appreciate any volunteer assistance they can receive." Apart from using STiki, I am interested to understand both processes better, and maybe contribute more. Do you have anything in mind? -- Greenmaven ( talk) 01:31, 20 December 2013 (UTC)
@ Jack Greenmaven: -- It's a more complicated set of interdependencies. The autonomous work of Cluebot feeds into the reputations of the "metadata" algorithm. The cumulative body of human effort (regardless of source queue) is used to retrain that "metadata" algorithm/queue. I have provided this set of human classifications to the CBNG folks (as a one-time dump) so they could use it in a similar fashion as they see appropriate. To what extent this is done, if at all, I am unsure. I know those folks aren't fond of the fact the STiki-classified set is non-representative of its bot workload -- a fair argument -- and why they have sought to create a representative corpus offline. West.andrew.g ( talk) 04:46, 24 December 2013 (UTC)
Greetings everyone. I've recently posted over User_talk:West.andrew.g/Popular_pages, but I realize some of my watchlisters might not follow both pages. After much processing has been brought to bear, I've aggregated all the page view statistics for 2013. I thought these would likely be of general interest, and I'd appreciate if others would re-post to relevant discussion pages and forums (on or off wiki; Reddit and some others picked up on my last effort in this vein).
ARTICLE            | VIEWS
--------------------------------------
[[Main_Page]]      | 3,895,581,597
[[Facebook]]       |    30,608,777
[[Deaths_in_2013]] |    21,246,624
[[Breaking_Bad]]   |    17,389,161
[[Google]]         |    16,759,294
[[World_War_II]]   |    16,676,636
[[Wiki]]           |    16,285,560
[[YouTube]]        |    15,938,076
ARTICLE                  | UTC DATE          | VIEWS     | REASON
----------------------------------------------------------------------
[[Jorge_Bergoglio]]      | March 13, 2013    | 1,460,586 | Papal ascension
[[Shakuntala_Devi]]      | November 4, 2013  |   766,256 | Google Doodle
[[Paul_Walker]]          | December 1, 2013  |   752,770 | Death
[[Grace_Hopper]]         | December 9, 2013  |   621,694 | Google Doodle
[[Nelson_Mandela]]       | December 5, 2013  |   484,966 | Death
[[Jodie_Foster]]         | January 14, 2013  |   451,270 | Came out at Golden Globes
[[Beyonc%C3%A9_Knowles]] | February 4, 2013  |   378,923 | Super bowl halftime
[[Nicolaus_Copernicus]]  | February 19, 2013 |   336,836 | Google Doodle
[[Seth_MacFarlane]]      | February 25, 2013 |   320,999 | Hosted the Oscars
[[Daniel_Day-Lewis]]     | February 25, 2013 |   318,839 | Oscars
[[Society_of_Jesus]]     | March 13, 2013    |   287,568 | Papal ascension
[[Mindy_McCready]]       | February 18, 2013 |   282,679 | Death
[[Hermann_Rorschach]]    | November 8, 2013  |   276,072 | Google Doodle
[[Edith_Head]]           | October 28, 2013  |   263,915 | Google Doodle
[[Raymond_Loewy]]        | November 5, 2013  |   258,301 | Google Doodle
[[Margaret_Thatcher]]    | April 8, 2013     |   252,906 | Death
[[Pope_Francis]]         | March 13, 2013    |   248,753 | Papal ascension
[[Peter_Capaldi]]        | August 4, 2013    |   244,667 | Announced as next Dr. Who
Thanks everyone. West.andrew.g ( talk) 17:15, 13 January 2014 (UTC)
Hi there, looks like #360 on your list should be linking to Cancún but the accented character has caused the link to fail. I have seen such boxes before with Czech diacritics, I don't know how to fix it but hoping you may! Thanks, C 679 19:22, 3 February 2014 (UTC)
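For readers following the accented-title and underscore reports above, here is a minimal sketch of how a raw title such as Beyonc%C3%A9_Knowles could be turned into a readable link. It assumes the raw titles are UTF-8 percent-encoded with underscores standing in for spaces; the helper names readable_title and wikilink are hypothetical and are not taken from the actual report-generation code.

```python
from urllib.parse import unquote


def readable_title(raw_title: str) -> str:
    """Decode UTF-8 percent-escapes, then turn underscores into spaces.

    Assumption: raw titles look like 'Beyonc%C3%A9_Knowles'. Bytes that are
    not valid UTF-8 are replaced rather than raising an error.
    """
    decoded = unquote(raw_title, encoding="utf-8", errors="replace")
    return decoded.replace("_", " ")


def wikilink(raw_title: str) -> str:
    """Wrap the readable title in wikilink brackets (hypothetical helper)."""
    return "[[" + readable_title(raw_title) + "]]"


if __name__ == "__main__":
    for raw in ("Beyonc%C3%A9_Knowles", "Canc%C3%BAn", "Main_Page"):
        print(wikilink(raw))
```

Decoding before replacing underscores keeps the two steps independent. Titles that were logged in a different encoding would need separate handling, which this sketch does not attempt.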
WikiAudit cannot be used on Windows 7. 9shi ( talk) 07:10, 4 February 2014 (UTC) (on zh wiki: 9shi)
It's great! All us sight-deprived people thank you. Coretheapple ( talk) 17:14, 13 February 2014 (UTC)
... for fixing the silly barnstar mistake on Flyer22's talk page. Further proof that I need new reading glasses. Widr ( talk) 18:43, 17 February 2014 (UTC)
First paragraph. :-) I'd be very interested if you'd like to write another analysis of the trends. Ed [talk] [majestic titan] 23:33, 28 January 2014 (UTC)
See at WP:STiki/milestones - Ugog Nizdast ( talk) 07:27, 23 March 2014 (UTC)
Hello,
Thanks for the welcome message. I have actually used Stiki before but under old account names. I just noticed on the Stiki leaderboard that those two account names are listed separately: "Gold Standard" and "Athleek123". Could you put these two together (and three once the leaderboard updates my latest uses with my current username)?
Thanks,
The Cascadian 04:08, 25 April 2014 (UTC)
Hello! I think that the tool WP:TOPRED might be particularly helpful for Wiktionaries... Can you please include data from the Wiktionaries in your top list? Is your algorithm for extracting redlinks open source? Can I see it somewhere? Thank you. -- Grenadine ( talk) 20:27, 29 April 2014 (UTC)
Hello there, a proposal regarding pre-adminship review has been raised at Village pump by Anna Frodesiak. Your comments here are very much appreciated. Many thanks. Jim Carter through MediaWiki message delivery ( talk) 06:47, 28 May 2014 (UTC)
Just checking. :-) Serendi pod ous 09:43, 1 June 2014 (UTC)
Hello Andrew, in the top 5000 articles list and Stats.grok.se, I'm seeing weird articles I don't expect to see in that list. For example, Le Cordon Bleu College of Culinary Arts Atlanta, Alexandria, Virginia, foods and other topics aren't that popular; they've been placed incorrectly. Is the counter counting pageviews correctly? I searched for this issue and it says the counts include non-human views. In the Wikipedia article traffic statistics, I'm seeing some glitchy page names with nonsense characters. Why has this been happening for several months, and can it be fixed? It didn't use to be that way. I noticed that before 2013 this did not happen.
And also, does your list support articles with colons in their name, such as Call of Duty: Advanced Warfare? I'm sure that topic will be up there. A Great Catholic Person ( talk) 06:41, 8 June 2014 (UTC)
Alright, I understand that now. I'll check this weekend. By the way, can the stats.grok.se and stats-classic.grok.se (old version) of Wikipedia stats be reverted to December 2010 data, rather than the 2014 data? Visitors are going to wonder what those weird articles and glitches are, at least until the DoS attacks get fixed. I don't mind how outdated it is. I can't ask Henrik because he is not responding. He once said it can't be done because it needs code changes, but I don't mind a code change one last time. I don't mind outdated data, and I forgot to check the 2010 rankings of articles I wanted to check.
And... is it possible to generate the list of 2013 popular pages, but with the colons? Also, could every report you've generated from October 2012 to now be recreated but with colons? I don't want to wait until 2015 for data without colons, and I want to see what some titles with colons ranked in the top 10,000 last year.
And, also, because technology is growing, can this start counting pageviews from mobile devices and other machines? A Great Catholic Person ( talk) 02:58, 11 June 2014 (UTC)
Alright, I read all that, but both Henrik and Killiondude (another one who has an FAQ page) are both down. I don't care about how outdated the stats-classic.grok.se is, I want the old December 2010 data back. Plus, Killiondude's FAQ has a link to October 2009 data for Michael Jackson. It won't work. Just one code change is good enough, the design should have been "phased out" already with 2010. The link comes up with 0 pageviews, plus I prefer the old design for viewing data from any time. The top 1000 list date I can take will be from December 2009 (it's fine!) to December 2011. I also forgot to check at least a lot of articles' rankings in the 2010 version, and I want to know them badly. I'm very curious because the classic has 2014 data. I don't need an updated one. The old one is okay. Should I wait until Henrik is back? Because it will take so long. A Great Catholic Person ( talk) 22:23, 21 June 2014 (UTC)
I'm not meaning generate those reports, I just want the stats classic version's top 1000 list to change from January 2014 to December 2010. A Great Catholic Person ( talk) 16:59, 22 June 2014 (UTC)
Apologies for being late in posting here (there are about 1.24 days left in the conference), but I am at Wikimania in London. User:Jmh649 gave a presentation about WP:MED and utilized some of the recent statistical work I've done for him (and an academic paper is in progress). Besides that, I've largely been hanging out in the research/analytical/"social machines" tracks. If you're a user/supporter/fan of WP:STiki, WP:5000, or anything else I've done -- I'd love to meet you, so don't hesitate to reach out. West.andrew.g ( talk) 15:27, 9 August 2014 (UTC)
I know all of what you said above, but how about tuning your popular pages lists so that they exclude the popular redlinks and non-human pageviews? Also, could you black out articles ( Lycos, Ddd, Alexandria, Virginia, and others being attacked) from your lists, especially for a 2014 top list? I understand, but they are okay to be on other lists. I like seeing what the top articles of the week are, but now they are replaced with BS articles. If this continues, I can't trust your data anymore. A Great Catholic Person ( talk) 01:00, 4 August 2014 (UTC)
I haven't seen the top 5000 or any other of your reports updating since August 14. A Great Catholic Person ( talk) 16:15, 17 August 2014 (UTC)
A significant statistical issue has come to my attention. Quite simply, the WMF does not record/report per-article mobile views, and thus they are unavailable for my aggregation....
The complete write-up is at User_talk:West.andrew.g/Popular_pages#STICKY:_On_the_Non-Reporting_of_Mobile_Views.
Please consolidate all discussion at that location. Thanks, West.andrew.g ( talk) 18:42, 4 September 2014 (UTC)
Ed [talk] [majestic titan] 16:24, 9 September 2014 (UTC)