footballnation.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Someone seems to be spamming a non-notable American football blog across the articles of many American Football players. -- Jayron 32 04:55, 17 May 2013 (UTC)
wikinewstime.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
If I remember correctly there were quite a few more IP addresses spamming this site, but these were the only three I could find. - Sudo Ghost 17:36, 28 February 2013 (UTC)
soccerdatabase.eu: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
This was recently discussed at ANI - Wikipedia:Administrators' noticeboard/Incidents#Mass removal of references to soccerdatabase.eu website. soccerdatabase.eu/ is a mirror site of www.playerhistory.com/ - the latter is defunct and the owner is launching legal action against the former. Giant Snowman 18:26, 8 May 2013 (UTC)
So nobody seems to care that this website is a massive copyvio? Giant Snowman 16:38, 30 May 2013 (UTC)
telerik.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
$ whois 82.103.64.57 inetnum: 82.103.64.0 - 82.103.64.255 netname: TELERIK descr: Telerik Corp.
This is only the recent abuse, See WikiProject Spam report for the full story. MER-C 11:44, 17 May 2013 (UTC)
tr.im: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
This URL shortener has been used recently:
And sometime in the past - I removed one use here. Deli nk ( talk) 13:31, 19 May 2013 (UTC)
MER-C 13:00, 20 May 2013 (UTC)
secure.vivid.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Affiliate site for Vivid Entertainment.
Trivialist ( talk) 21:57, 20 May 2013 (UTC)
Adsense google_ad_client = pub-2363916027311907 (
Track -
Report -
reverseinternet.com • meta:
Track -
Report)
Google Analytics ID: UA-37869191 - (
Track -
Report -
reverseinternet.com • Meta:
Track -
Report)
Google Analytics ID: UA-37867698 - (
Track -
Report -
reverseinternet.com • Meta:
Track -
Report)
MER-C 10:00, 2 June 2013 (UTC)
See the discussion initiated by the website's owner at Wikipedia:External_links/Noticeboard#Link_to_ProWresBlog. Although there is a clear COI in this editor pushing his/her own website, the real problem is the huge amount of copyvio content - screencaps taken directly from Sky Sports in the UK as well as other TV channels, plus videos and animated gifs. -- Biker Biker ( talk) 20:05, 4 June 2013 (UTC)
According to the owner of the website here, parstimes.com is a personal page with economical purpose. At the moment there are more than 99 links mainly on EL section to this website. Farhikht ( talk) 13:23, 6 June 2013 (UTC)
A webcam website that's been aggressively spammed by multiple IPs and users. For diffs of spam, see the contibs for the spammers listed. Spencer T♦ C 17:44, 6 June 2013 (UTC)
Spam is coming out of Hungary; spammer adds this link, or replaces valid company URL with their own, for instance here. Blacklist please; cleanup is a drag. Drmies ( talk) 17:03, 1 May 2013 (UTC)
MER-C 12:25, 3 June 2013 (UTC)
checkmarx.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Long-term problem with multiple accounts promoting the software company Checkmarx and its products. Behaviour includes persistent spamming of external links to checkmarx.com. Some diffs are provided in the list above; see Wikipedia:Sockpuppet investigations/Grenoble jojo for further details. — Psychonaut ( talk) 09:54, 12 June 2013 (UTC)
roichecker.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Persistant reference spamming across city related articles. After being blocked as 207 came back immediately as the 223 IP address to continue spamming. The articles they are adding these to are being added in alphabetical order, which seems to suggest that they fully intend to spam all 400 of these articles. - Sudo Ghost 19:27, 25 June 2013 (UTC)
The edits mentioned here have been obviously done without proper knowledge of the Wikipedia rules and departed from earlier contributions were cities were mentioned because of being added in the top 400 list. Such mentions in the past had a different structure (i.e. "[City] was added to the Top 400 business investments destinations with high return potential."), which comply with citation standards. We are still researching on how the additions cited here occurred but are anyway committed to add content only when it is compliant. Please, see that adding the above mentioned references to articles of cities has had a positive impact on the localities as they are always looking for more attention and opportunities to receive investors to create jobs. Blacklisting the site affects more than just the site, it also takes merit off those localities. May you please inform me on how to request the delisting? May you take care of it? Or I should proceed to request inclusion in the whitelist? Sincerely, -- Mba lwall ( talk) 16:30, 26 June 2013 (UTC)
If one uses {{User:Faizhaider}} which should show the user page one gets a spam filter warning saying that one is trying to link to http://www.smfaizhaider.co.nr and http://www.faizhaider.co.nr. This issue creates problem for users when they try to include my username in any discussion. I searched the blacklist logs but was not able to find any entry for the two sites. -- Sayed Mohammad Faiz Haider t c s 16:21, 19 June 2013 (UTC)
{{
user|Faizhaider}}
as
Faizhaider (
talk ·
contribs), as you can see by this reply. The template {{User:Faizhaider}} (with a colon in stead of a vertical bar) would transclude your entire user page, and nobody should be doing that. ~
Amatulić (
talk) 22:15, 25 June 2013 (UTC)This link on the Jesus Christians wikipage has been blacklisted for some reason. The Jesus Christians wikipage was recently modified in a biased way and when trying to revert we discovered that this link was "blacklisted", however when checking the blacklist for May 2013 the domain was not listed. We would like to restore the link because of historic significance. Thanks.
libertyreserve.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Liberty Reserve is in the news as it has been shut down by authorities. Liberty Reserve dot com currently shows a seizure notice. A link to this would be of encyclopaedic interest for the Liberty Reserve article. LukeSurl t c 22:59, 29 May 2013 (UTC)
Not sure why this is blocked. I checked
en.wikipedia.org/wiki/MediaWiki:Titleblacklist en.wikipedia.org/wiki/MediaWiki:Spam-blacklist
and this site is not listed. I tried sitecheck.sucuri.net/scanner/
All referenced security checking sites showed guard-soft.com as OK, except
that McAfee seems to give a 'warning' but fails to explain why. I requested McAfee to recheck there assessment. Note: I discovered wikipedia blocking this site when I attempted to edit: en.wikipedia.org/wiki/Guilloche
There's a question at RSN about a possible malware site. Could someone take a look at Wikipedia:Reliable_sources/Noticeboard#Please_check_the_source? WhatamIdoing ( talk) 06:01, 12 February 2011 (UTC)
Where are the guidelines for what is blacklisted and what is not? Is it based upon the content of the sites, i.e. unmanageable advertising, or upon the action of editors in adding dubious citations? Or both? Could the guidelines be linked in a header from this Interface page? -- Bejnar ( talk) 17:13, 3 June 2013 (UTC)
Do we have the technical capability of designating links as blocked in articles only? In an IFD or PUI discussion, occasionally I get blocked by the spam filter for trying to link to a site as the source for a copyvio image. For example, examiner.com should obviously never ever in a million years be linked to from an article, but if we had the technical capability of doing so, it would be nice to be able to link to it at IFD/PUI. -- B ( talk) 00:46, 15 April 2013 (UTC)
\*.onion was recently added, but the article Tor (anonymity network) still has a link of .onion type as per a year or two old consensus. There is no whitelisting of that link, so what impact will the blacklisting of *.onion have on that article? Belorn ( talk) 08:45, 18 April 2013 (UTC)
Over on the whitelist page there's a whitelist request for shoe-shop.com. I had offered to whitelist the 'about' page, as is standard practice for blacklisted sites that have their own Wikipedia article, until I noticed that it isn't blacklisted. Locally or globally. It isn't in any logfile. I can't find any record of prior discussion in the archives of this page or on Wikipedia talk:WikiProject Spam.
That tells me it's collateral damage. However, I can't find any wildcard entries in either blacklist that would trigger on it. And yet, a URL containing shoe-shop.com triggers the blacklist error message. I'm sure there's some pattern that I'm not seeing. How would I locate the problem blacklist entry? ~ Amatulić ( talk) 15:14, 8 May 2013 (UTC)
(?:boot|shoe|ugg)[a-z0-9-]*(?:buy|cheap|mall|mart|outlet|shop|store|sale)[a-z0-9-]*\.(?:biz|c[no]|info|u[ks]|hk|jp|org|net)
wget -O- /info/en/?search=MediaWiki:Spam-blacklist | grep -o '^\\b[^ ]*' \
| while read regex
do echo "shoe-shop.com" | grep -P -q "$regex" && echo "$regex"
done
Is the blacklist sometimes used for unhelpful reference URLs that aren't actually spam? Should it be?--
Elvey (
talk) 18:10, 11 June 2013 (UTC)
example below:
Based on a small sample I looked at, it seems Wikipedia has many apparently dead links (like this intended to be to PDFs of the form ebscohost.com...pdfviewer...: All 7 of the 323 pages containing ebscohost and pdfviewer] I looked at had dead EBSCO links. These are NOT links that hit a paywall (like this. Rather, they bring up 404-like server error messages.
A second problematic type of EBSCO link are the three added by a user's (sole ever) edit that are of the form hxxp://0-web.ebscohost.com.sculib.scu.edu/ehost/pdfviewer/pdfviewer?sid=[hex string]@sessionmgr13&vid=4&hid=13. (Note the bold portion!) Presumably, these links work ONLY for subscribers that are ALSO at SCU. We shouldn't allow such links, and perhaps the blacklist (or a similarly functioning parallel system) would be a good solution? Or maybe there's a formula that can be used to fix all such ebscohost.com.[foo].edu and ebscohost.com...ca links?
PS Since I started writing this, I've noticed that EBSCO staff is heavily editing their own article. On the plus side, maybe that means they'd be available, willing, and able to help fix these links or suggest ways to deal with them systematically. note posted. -- Elvey ( talk) 18:10, 11 June 2013 (UTC)
I am tempted to see these sites as redirects, which will be location-dependent whether they work. I would consider that these should typically be converted to direct links to the object (within educational institutions, one can generally use a web-proxy to get to literature - a direct link would either be the link on the server where the literature resides, or the DOI. If someone then has to change that to go through the proxy, that is then something that that person needs to do (we can't anticipate that by any definition). Links through proxy servers have no place whatsoever. I am somewhat tempted to say that these need blanket blacklisting on meta, as they could possibly be abused to circumvent other blacklistings (for a relatively open proxy), and serve no function whatsoever to most readers except for the (few) ones that have access through the proxy - I doubt even if the url can be understood well enough to be able to figure out a real link from it. It is however going to be very obnoxious for the users that in good faith insert the proxy url they copy from their web-browser and then they can't save, and one could think of cases where it is appropriate (if information is only available to people who can pass the proxy and no-where else in the world, it could still a good reference for certain information - think of it of a book of which the single copy is in an nearly inaccessible library (the library in the Vatican), it is still verifiable by proxying through people who do have access to the library (ask the pope)).
Note, that with creative regex rule-writing, we could blacklist the two 'bad' examples of Nurg (the non-persistent link and the institution proxies), still enabling good ones (the permalinks). -- Dirk Beetstra T C 09:30, 12 June 2013 (UTC)
NOTE: Discussion continues at https://meta.wikimedia.org/?title=Talk:Spam_blacklist&action=edit§ion=11.
I'm getting no response there. Can we consider adding this here for now? :
ebscohost\.com(\.|.*(pdfviewer|EbscoContent))
-- Elvey ( talk) 22:38, 26 June 2013 (UTC)
blacklist starts with some comments and the last line of these comments is
This is plainly untrue. (Counterexample: \bbooks\.google\.com/books\?vid=ISBN0521009464\b) Let's replace it with a true statement, e.g.:
or
-- Elvey ( talk) 19:48, 22 June 2013 (UTC)
This
edit request has been answered. Set the |answered= or |ans= parameter to no to reactivate your request. |
Would you please fix the above documentation error? -- Elvey ( talk) 22:38, 26 June 2013 (UTC)
footballnation.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Someone seems to be spamming a non-notable American football blog across the articles of many American Football players. -- Jayron 32 04:55, 17 May 2013 (UTC)
wikinewstime.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
If I remember correctly there were quite a few more IP addresses spamming this site, but these were the only three I could find. - Sudo Ghost 17:36, 28 February 2013 (UTC)
soccerdatabase.eu: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
This was recently discussed at ANI - Wikipedia:Administrators' noticeboard/Incidents#Mass removal of references to soccerdatabase.eu website. soccerdatabase.eu/ is a mirror site of www.playerhistory.com/ - the latter is defunct and the owner is launching legal action against the former. Giant Snowman 18:26, 8 May 2013 (UTC)
So nobody seems to care that this website is a massive copyvio? Giant Snowman 16:38, 30 May 2013 (UTC)
telerik.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
$ whois 82.103.64.57 inetnum: 82.103.64.0 - 82.103.64.255 netname: TELERIK descr: Telerik Corp.
This is only the recent abuse, See WikiProject Spam report for the full story. MER-C 11:44, 17 May 2013 (UTC)
tr.im: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
This URL shortener has been used recently:
And sometime in the past - I removed one use here. Deli nk ( talk) 13:31, 19 May 2013 (UTC)
MER-C 13:00, 20 May 2013 (UTC)
secure.vivid.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Affiliate site for Vivid Entertainment.
Trivialist ( talk) 21:57, 20 May 2013 (UTC)
Adsense google_ad_client = pub-2363916027311907 (
Track -
Report -
reverseinternet.com • meta:
Track -
Report)
Google Analytics ID: UA-37869191 - (
Track -
Report -
reverseinternet.com • Meta:
Track -
Report)
Google Analytics ID: UA-37867698 - (
Track -
Report -
reverseinternet.com • Meta:
Track -
Report)
MER-C 10:00, 2 June 2013 (UTC)
See the discussion initiated by the website's owner at Wikipedia:External_links/Noticeboard#Link_to_ProWresBlog. Although there is a clear COI in this editor pushing his/her own website, the real problem is the huge amount of copyvio content - screencaps taken directly from Sky Sports in the UK as well as other TV channels, plus videos and animated gifs. -- Biker Biker ( talk) 20:05, 4 June 2013 (UTC)
According to the owner of the website here, parstimes.com is a personal page with economical purpose. At the moment there are more than 99 links mainly on EL section to this website. Farhikht ( talk) 13:23, 6 June 2013 (UTC)
A webcam website that's been aggressively spammed by multiple IPs and users. For diffs of spam, see the contibs for the spammers listed. Spencer T♦ C 17:44, 6 June 2013 (UTC)
Spam is coming out of Hungary; spammer adds this link, or replaces valid company URL with their own, for instance here. Blacklist please; cleanup is a drag. Drmies ( talk) 17:03, 1 May 2013 (UTC)
MER-C 12:25, 3 June 2013 (UTC)
checkmarx.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Long-term problem with multiple accounts promoting the software company Checkmarx and its products. Behaviour includes persistent spamming of external links to checkmarx.com. Some diffs are provided in the list above; see Wikipedia:Sockpuppet investigations/Grenoble jojo for further details. — Psychonaut ( talk) 09:54, 12 June 2013 (UTC)
roichecker.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Persistant reference spamming across city related articles. After being blocked as 207 came back immediately as the 223 IP address to continue spamming. The articles they are adding these to are being added in alphabetical order, which seems to suggest that they fully intend to spam all 400 of these articles. - Sudo Ghost 19:27, 25 June 2013 (UTC)
The edits mentioned here have been obviously done without proper knowledge of the Wikipedia rules and departed from earlier contributions were cities were mentioned because of being added in the top 400 list. Such mentions in the past had a different structure (i.e. "[City] was added to the Top 400 business investments destinations with high return potential."), which comply with citation standards. We are still researching on how the additions cited here occurred but are anyway committed to add content only when it is compliant. Please, see that adding the above mentioned references to articles of cities has had a positive impact on the localities as they are always looking for more attention and opportunities to receive investors to create jobs. Blacklisting the site affects more than just the site, it also takes merit off those localities. May you please inform me on how to request the delisting? May you take care of it? Or I should proceed to request inclusion in the whitelist? Sincerely, -- Mba lwall ( talk) 16:30, 26 June 2013 (UTC)
If one uses {{User:Faizhaider}} which should show the user page one gets a spam filter warning saying that one is trying to link to http://www.smfaizhaider.co.nr and http://www.faizhaider.co.nr. This issue creates problem for users when they try to include my username in any discussion. I searched the blacklist logs but was not able to find any entry for the two sites. -- Sayed Mohammad Faiz Haider t c s 16:21, 19 June 2013 (UTC)
{{
user|Faizhaider}}
as
Faizhaider (
talk ·
contribs), as you can see by this reply. The template {{User:Faizhaider}} (with a colon in stead of a vertical bar) would transclude your entire user page, and nobody should be doing that. ~
Amatulić (
talk) 22:15, 25 June 2013 (UTC)This link on the Jesus Christians wikipage has been blacklisted for some reason. The Jesus Christians wikipage was recently modified in a biased way and when trying to revert we discovered that this link was "blacklisted", however when checking the blacklist for May 2013 the domain was not listed. We would like to restore the link because of historic significance. Thanks.
libertyreserve.com: Linksearch en (insource) - meta - de - fr - simple - wikt:en - wikt:fr • Spamcheck • MER-C X-wiki • gs • Reports: Links on en - COIBot - COIBot-Local • Discussions: tracked - advanced - RSN • COIBot- Link, Local, & XWiki Reports - Wikipedia: en - fr - de • Google: search • meta • Domain: domaintools • AboutUs.com
Liberty Reserve is in the news as it has been shut down by authorities. Liberty Reserve dot com currently shows a seizure notice. A link to this would be of encyclopaedic interest for the Liberty Reserve article. LukeSurl t c 22:59, 29 May 2013 (UTC)
Not sure why this is blocked. I checked
en.wikipedia.org/wiki/MediaWiki:Titleblacklist en.wikipedia.org/wiki/MediaWiki:Spam-blacklist
and this site is not listed. I tried sitecheck.sucuri.net/scanner/
All referenced security checking sites showed guard-soft.com as OK, except
that McAfee seems to give a 'warning' but fails to explain why. I requested McAfee to recheck there assessment. Note: I discovered wikipedia blocking this site when I attempted to edit: en.wikipedia.org/wiki/Guilloche
There's a question at RSN about a possible malware site. Could someone take a look at Wikipedia:Reliable_sources/Noticeboard#Please_check_the_source? WhatamIdoing ( talk) 06:01, 12 February 2011 (UTC)
Where are the guidelines for what is blacklisted and what is not? Is it based upon the content of the sites, i.e. unmanageable advertising, or upon the action of editors in adding dubious citations? Or both? Could the guidelines be linked in a header from this Interface page? -- Bejnar ( talk) 17:13, 3 June 2013 (UTC)
Do we have the technical capability of designating links as blocked in articles only? In an IFD or PUI discussion, occasionally I get blocked by the spam filter for trying to link to a site as the source for a copyvio image. For example, examiner.com should obviously never ever in a million years be linked to from an article, but if we had the technical capability of doing so, it would be nice to be able to link to it at IFD/PUI. -- B ( talk) 00:46, 15 April 2013 (UTC)
\*.onion was recently added, but the article Tor (anonymity network) still has a link of .onion type as per a year or two old consensus. There is no whitelisting of that link, so what impact will the blacklisting of *.onion have on that article? Belorn ( talk) 08:45, 18 April 2013 (UTC)
Over on the whitelist page there's a whitelist request for shoe-shop.com. I had offered to whitelist the 'about' page, as is standard practice for blacklisted sites that have their own Wikipedia article, until I noticed that it isn't blacklisted. Locally or globally. It isn't in any logfile. I can't find any record of prior discussion in the archives of this page or on Wikipedia talk:WikiProject Spam.
That tells me it's collateral damage. However, I can't find any wildcard entries in either blacklist that would trigger on it. And yet, a URL containing shoe-shop.com triggers the blacklist error message. I'm sure there's some pattern that I'm not seeing. How would I locate the problem blacklist entry? ~ Amatulić ( talk) 15:14, 8 May 2013 (UTC)
(?:boot|shoe|ugg)[a-z0-9-]*(?:buy|cheap|mall|mart|outlet|shop|store|sale)[a-z0-9-]*\.(?:biz|c[no]|info|u[ks]|hk|jp|org|net)
wget -O- /info/en/?search=MediaWiki:Spam-blacklist | grep -o '^\\b[^ ]*' \
| while read regex
do echo "shoe-shop.com" | grep -P -q "$regex" && echo "$regex"
done
Is the blacklist sometimes used for unhelpful reference URLs that aren't actually spam? Should it be?--
Elvey (
talk) 18:10, 11 June 2013 (UTC)
example below:
Based on a small sample I looked at, it seems Wikipedia has many apparently dead links (like this intended to be to PDFs of the form ebscohost.com...pdfviewer...: All 7 of the 323 pages containing ebscohost and pdfviewer] I looked at had dead EBSCO links. These are NOT links that hit a paywall (like this. Rather, they bring up 404-like server error messages.
A second problematic type of EBSCO link are the three added by a user's (sole ever) edit that are of the form hxxp://0-web.ebscohost.com.sculib.scu.edu/ehost/pdfviewer/pdfviewer?sid=[hex string]@sessionmgr13&vid=4&hid=13. (Note the bold portion!) Presumably, these links work ONLY for subscribers that are ALSO at SCU. We shouldn't allow such links, and perhaps the blacklist (or a similarly functioning parallel system) would be a good solution? Or maybe there's a formula that can be used to fix all such ebscohost.com.[foo].edu and ebscohost.com...ca links?
PS Since I started writing this, I've noticed that EBSCO staff is heavily editing their own article. On the plus side, maybe that means they'd be available, willing, and able to help fix these links or suggest ways to deal with them systematically. note posted. -- Elvey ( talk) 18:10, 11 June 2013 (UTC)
I am tempted to see these sites as redirects, which will be location-dependent whether they work. I would consider that these should typically be converted to direct links to the object (within educational institutions, one can generally use a web-proxy to get to literature - a direct link would either be the link on the server where the literature resides, or the DOI. If someone then has to change that to go through the proxy, that is then something that that person needs to do (we can't anticipate that by any definition). Links through proxy servers have no place whatsoever. I am somewhat tempted to say that these need blanket blacklisting on meta, as they could possibly be abused to circumvent other blacklistings (for a relatively open proxy), and serve no function whatsoever to most readers except for the (few) ones that have access through the proxy - I doubt even if the url can be understood well enough to be able to figure out a real link from it. It is however going to be very obnoxious for the users that in good faith insert the proxy url they copy from their web-browser and then they can't save, and one could think of cases where it is appropriate (if information is only available to people who can pass the proxy and no-where else in the world, it could still a good reference for certain information - think of it of a book of which the single copy is in an nearly inaccessible library (the library in the Vatican), it is still verifiable by proxying through people who do have access to the library (ask the pope)).
Note, that with creative regex rule-writing, we could blacklist the two 'bad' examples of Nurg (the non-persistent link and the institution proxies), still enabling good ones (the permalinks). -- Dirk Beetstra T C 09:30, 12 June 2013 (UTC)
NOTE: Discussion continues at https://meta.wikimedia.org/?title=Talk:Spam_blacklist&action=edit§ion=11.
I'm getting no response there. Can we consider adding this here for now? :
ebscohost\.com(\.|.*(pdfviewer|EbscoContent))
-- Elvey ( talk) 22:38, 26 June 2013 (UTC)
blacklist starts with some comments and the last line of these comments is
This is plainly untrue. (Counterexample: \bbooks\.google\.com/books\?vid=ISBN0521009464\b) Let's replace it with a true statement, e.g.:
or
-- Elvey ( talk) 19:48, 22 June 2013 (UTC)
This
edit request has been answered. Set the |answered= or |ans= parameter to no to reactivate your request. |
Would you please fix the above documentation error? -- Elvey ( talk) 22:38, 26 June 2013 (UTC)