ForumScan script: useful for catching spammers

A forum for the discussion and development of programs to assist working on or playing with the Vegastrike engine and data sets.
Post Reply
geoscope
Bounty Hunter
Bounty Hunter
Posts: 205
Joined: Thu Jun 29, 2006 1:58 am
Contact:

ForumScan script: useful for catching spammers

Post by geoscope »

Announcing ForumScan.py ver 0.5 -- The Forum Spam Catching Utility --
It can also be used to download Forum User Profiles, but does not currently display contact information, only interests.

ForumScan.py now consists of 3 files, plus 2 generated:
  • ForumScan.py -- the script
    ForumScan.lst -- list of users to checkout
    WordList.lst -- Word list to use for scoring
Generated:
  • ForumScan.wrk -- the user post summaries and profile information
    ForumScan.log -- log of activities and errors, very verbose right now
I'm thinking of breaking the log into 2, once for activities, another for errors, or just reducing the excessive logging of users skipped. Errors have useful info in them: If a profile does not exist -- assuming user account was deleted. If post summary does not exist -- user posts perhaps already deleted, or user has not yet posted any messages.

Can anyone think of a better name for the script or extensions?

And GOOD NEWS, I've almost got the scoring done. Latest output from the script...

Code: Select all

~/prog/SpamFilter$ ./ForumScan.py
Total File Score:  3622
About 20-30 more lines of code, and it will score individual users, and print a report to screen of anyone with a score below specified amount.

You guys who have been finding the spam posters, have you been keeping a copy of the spam you found, or a list of words that set off your "spammy-senses"? The code for scoring both the relevant wording and spam wording is in place, I just need a list for Spam words.

BTW, next update either later tonight or sometime tommorrow will be the complete, fully functional script... Minus a few helper functions, like
  • Import a user list, to watch # right now, just append the names to the END of the ForumScan.lst file.
    Autodelete of watched names from ForumScan.lst and
    Autodelete of data from ForumScan.wrk
    Can't figure out how to do that yet, maybe commandline option, feed it names that the script user has checked out and dealt with.
    If you have any ideas let me know in the new thread.
The first attachment is the script and helper files. The second is sample output from ForumScan ver 0.5 for those that can't run an python script, but would like to help review and eliminate spam, or that have limited download speed... If you want to help get rid of Forum spam See The Off-Topic Forum Thread: http://vegastrike.sourceforge.net/forum ... php?t=7305
Last edited by geoscope on Sat Aug 26, 2006 3:34 am, edited 1 time in total.
geoscope
Bounty Hunter
Bounty Hunter
Posts: 205
Joined: Thu Jun 29, 2006 1:58 am
Contact:

Post by geoscope »

ForumScan ver 0.6
Now scores users! Stores the Score! Prints the user and score to screen!
Saves the world, and still has time to do laundry!

Edit: Also fixed a bug. If user profile or summary does not exit, that error message is now saved in the workfile. Before, no information for that user was found in the workfile.
You do not have the required permissions to view the files attached to this post.
Post Reply