Spam: Difference between revisions



== Links ==
=== General ===
* [http://en.wikipedia.org/wiki/Spam_(electronic) Spam (electronic)] (Wikipedia)
* [http://en.wikipedia.org/wiki/Spamdexing Spamdexing] (Wikipedia)
* [http://www.sixapart.com/pronet/comment_spam.html Six Apart Guide to Comment Spam] (good reading for web log owners)
* [http://www.gearhack.com/Articles/FightSpam/ Fight Comment Spam, Ban IP's] A large list of banned IP addresses by Chieh Cheng. (Other such lists exist.)
=== General wiki spamming ===


* [http://meta.wikimedia.org/wiki/Wiki_Spam Wiki Spam] (Wikimedia)
* [http://www.gearhack.com/Forums/DisplayComments.php?file=Computer/Network/Internet/Protecting_Your_Wiki_From_Spam.html Protecting Your Wiki From Spam]


=== Mediawiki ===

Revision as of 14:06, 6 July 2009

Draft

Lookup IP addresses and domain names

Looking up the IP addresses and domain names that spam comes from may allow you to block whole domains (e.g. in the httpd.conf file or at the system level). Wikis are sometimes spammed manually, and in that case blocking can help a bit.

Mediawiki spamming

There are several strategies:

Registered users

To fight spamming, only registered users should be able to edit. Edit LocalSettings.php:

$wgGroupPermissions['*']['edit']            = false;
$wgGroupPermissions['*']['createaccount']   = true;
$wgGroupPermissions['*']['read']            = true;
Light-weight user creation that requires some math

This can defeat some scripts.

Making user creation more difficult with captcha

This can defeat more scripts.
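
Both the arithmetic question and a harder captcha can be had from one extension. The fragment below is a minimal sketch for LocalSettings.php, assuming the ConfirmEdit extension is installed under extensions/ConfirmEdit (the page does not say which extension it has in mind). The image directory and the secret are hypothetical placeholders, and the exact loading lines vary between releases, so check the extension's manual for your version.

```php
// Assumption: ConfirmEdit is installed under extensions/ConfirmEdit.
// On MediaWiki releases before 1.25, load it with
//   require_once "$IP/extensions/ConfirmEdit/ConfirmEdit.php";
wfLoadExtension( 'ConfirmEdit' );

// Option 1: a small arithmetic question (ConfirmEdit's default class).
$wgCaptchaClass = 'SimpleCaptcha';

// Option 2: an image captcha. It needs pre-generated images, so it is
// left commented out here; the path and secret are hypothetical.
// wfLoadExtension( 'ConfirmEdit/FancyCaptcha' );
// $wgCaptchaClass     = 'FancyCaptcha';
// $wgCaptchaDirectory = '/path/to/captcha/images';
// $wgCaptchaSecret    = 'CHANGE-ME';

// Show the challenge whenever an account is created.
$wgCaptchaTriggers['createaccount'] = true;
```

Only account creation is triggered here, which matches the two strategies above. ConfirmEdit has further triggers (for example on edits that add external links) if that turns out not to be enough.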

Filtering edits and page names

Prevent creation of pages with bad words in the title and/or the text.

The built-in $wgSpamRegex variable

Mediawiki includes a $wgSpamRegex variable. The goal is to prevent three things: (a) bad words, (b) links to bad web sites and (c) CSS tricks to hide contents.

Insert something like the following in LocalSettings.php:

$wgSpamRegex = "/badword1|badword2|abcdefghi-website\.com|display_remove_:none|overflow_remove_:\s*auto;\s*height:\s*[0-4]px;/i";

I will not show ours here since I can't include it in this page ;)

Read the manual page for details. It includes a longer regular expression that you may adopt.
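
A long alternation is easier to maintain when it is built from a list. The fragment below is a sketch of that idea for LocalSettings.php, not the expression used on this wiki. It assumes the _remove_ markers in the example above are placeholders to be deleted, and $spamParts is a hypothetical helper variable; every pattern is a placeholder.

```php
// Hypothetical helper list; replace the entries with your own patterns.
$spamParts = [
    // (a) bad words
    'badword1',
    'badword2',
    // (b) links to bad web sites (the dot is escaped so it matches literally)
    'abcdefghi-website\.com',
    // (c) CSS tricks that hide contents
    'display\s*:\s*none',
    'overflow\s*:\s*auto;\s*height\s*:\s*[0-4]px;',
];

// Join the alternatives into one case-insensitive pattern.
// None of the parts may contain an unescaped "/", since that is the delimiter.
$wgSpamRegex = '/' . implode( '|', $spamParts ) . '/i';
```

With this layout, adding a new bad word or site means adding one line to the list instead of editing a long string.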

Don't forget to edit MediaWiki:Spamprotectiontext, the message shown to users whose edit is rejected.

Spam blacklist extensions (an alternative)

The SpamBlacklist extension prevents edits that contain URL hosts that match regular expression patterns defined in specified files or wiki pages.
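
A possible setup in LocalSettings.php is sketched below. It assumes the extension is installed under extensions/SpamBlacklist and a MediaWiki release that reads $wgBlacklistSettings (older releases used require_once and $wgSpamBlacklistFiles instead). The remote list is the one maintained on Wikimedia's Meta-Wiki; whether it suits a given wiki is a judgment call.

```php
// Assumption: SpamBlacklist is installed under extensions/SpamBlacklist.
wfLoadExtension( 'SpamBlacklist' );

$wgBlacklistSettings = [
    'spam' => [
        'files' => [
            // Raw list of URL host patterns maintained on Meta-Wiki.
            'https://meta.wikimedia.org/w/index.php?title=Spam_blacklist&action=raw&sb_ver=1',
        ],
    ],
];
```

Local additions and exceptions can then be kept on the wiki pages MediaWiki:Spam-blacklist and MediaWiki:Spam-whitelist, which administrators can edit without touching the server.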
