The Archive of Official vBulletin Modifications Site. It is not a VB3 engine, just a parsed copy!
|
#1
Spam Buster / Killer to Merge.
UK Jimbo and I have decided to merge our projects together. We'll be using this thread for development purposes. Any comments or input on what you'd like to see in antispam software would be appreciated.
|
#2
PHP Code:
|
#3
Genius. A merge may be the best way to avoid duplicates, and it will help concentrate effort on the problem (two heads are always better than one!)
|
#4
Moved from Beta hacks to this forum as this is not a hack release but a hack collaboration discussion.
|
#5
K Erwin. Thanks.
|
#6
OK guys, fine, but remember the K.I.S.S. method and don't get it so complicated that it takes a programming degree to install and maintain it.
The prime need is to keep JOEBLOW from registering and posting "visit our website for our new promotional marketing campaign where you can earn thousands. Click here: www.joeblowsspamsite.com". That means keeping a running list of joeblowsspamsite and his favorite key words and phrases so that they never see the light of day. I'm sure you know the way, and we'll be looking for your posts.
|
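The "running list" idea in the post above can be sketched roughly as follows. This is only an illustration in Python, not the PHP that would actually run inside vBulletin; the list contents and the function name `find_spam_terms` are assumptions made up for the example.

```python
# Hypothetical running lists; a real board would maintain its own.
BLOCKED_DOMAINS = {"joeblowsspamsite.com"}
BLOCKED_PHRASES = ["promotional marketing campaign", "earn thousands"]


def find_spam_terms(text):
    """Return the blocked domains and phrases that occur in text."""
    lowered = text.lower()  # compare case-insensitively
    hits = [d for d in sorted(BLOCKED_DOMAINS) if d in lowered]
    hits += [p for p in BLOCKED_PHRASES if p in lowered]
    return hits


if __name__ == "__main__":
    post = "Click here: www.joeblowsspamsite.com and earn thousands"
    print(find_spam_terms(post))
```

Plain substring matching keeps things in the K.I.S.S. spirit; it will also match a blocked phrase inside a longer word, which may or may not be wanted.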
#7
Ok, several changes here.
1) More code comments.
2) Blacklist is now called from a file instead of a template.
3) File is included from PHPINCLUDE_START.
This is currently the PHPINCLUDE_START section:
PHP Code:
The other file is antispam itself for the moment. If it isn't clear from the code comments, let me know.
PHP Code:
|
#8
Hi Michael,
I couldn't get hold of you on MSN Messenger so I've penned down some thoughts/questions here:
New code
I like the style, the comments and the fact that it's called from PHPINCLUDE_START. Good work.
Blacklists
What do you think is the best way of storing the blacklist(s) and making them editable? I wonder if a phrase would be a good plan (even if it was managed through a custom part of the admincp rather than the phrase manager). If we wanted to store them as files and be able to write to them via the web interface, the files would have to be world or web user writable. It'd be easy to set up a scheduled task to drag down the latest copy of the MT blacklist.
Multiple fields
What are your thoughts on breaking down the fields passed into the "spam engine"? I'm thinking along the lines of the way that the second version of spamBuster was able to have rules relating to the body text or the subject. Username might be another field worth matching against - lots of spammers seem to use the recipe [username][number], like robby34. Perhaps something to worry about later.
Are you happy with me going ahead and writing a lower level library that does the spam processing and leaving some of the vBulletin integration (admincp code) to you?
Made some changes to the code in the file handling and character substitution:
PHP Code:
The comments should be stripped out using preg_replace(). The main regular expression tests are done using preg_match(), which I think in a lot of cases is faster than eregi(). Looking at those changes, who'd guess I've developed in Perl a fair bit? :ermm:
I've not tested this code; the line I'm most dubious about is the comment-removing code.
|
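The code from this post was not preserved in the archive. As a rough illustration of the ideas it describes (stripping comments from a blacklist file, running regular expression tests, and matching separate fields), here is a minimal sketch in Python, with `re.sub` and `re.search` standing in for PHP's preg_replace() and preg_match(). The `#` comment syntax, the one-pattern-per-line layout, and all function names are assumptions for the example, not the authors' code.

```python
import re


def parse_blacklist(raw):
    """Strip '#' comments and blank lines, return compiled patterns."""
    patterns = []
    for line in raw.splitlines():
        # Comment removal: drop everything from '#' to end of line.
        line = re.sub(r"\s*#.*$", "", line).strip()
        if line:
            patterns.append(re.compile(line, re.IGNORECASE))
    return patterns


def match_fields(fields, patterns):
    """Map each field name to the pattern strings that matched it."""
    result = {}
    for name, value in fields.items():
        matched = [p.pattern for p in patterns if p.search(value)]
        if matched:
            result[name] = matched
    return result


# Hypothetical rule for the [username][number] recipe, e.g. robby34.
USERNAME_RULE = re.compile(r"^[a-z]+\d+$", re.IGNORECASE)

if __name__ == "__main__":
    raw = "# sample list\njoeblowsspamsite\\.com  # test entry\ncheap-pills\n"
    fields = {"subject": "Hello", "body": "Visit www.joeblowsspamsite.com"}
    print(match_fields(fields, parse_blacklist(raw)))
    print(bool(USERNAME_RULE.match("robby34")))
```

Two caveats with this sketch: a pattern that legitimately contains a `#` would be truncated by the comment stripping, and an invalid pattern in the file would raise an error at compile time, so a real loader would want to handle both.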
#9
Quote:
Quote:
The local list needs to be a template for quick accessibility. At 50K and growing, I don't think it's gonna fit in the phrase system. At some point we need a cron job to go to Jay Allen's site and pull down the updates to the list (he's given his permission for this). To spare his bandwidth, we need to get the system to only do a full refresh when requested from the admincp. Most of the time the system should download the latest 100 additions about once every 3 to 5 days.
Quote:
What would be ideal is an algorithm to iterate over the user's signature, post, and title, extract all URLs and put them in arrays. Then compare these arrays for a match. Depending on the number of matches we can extract domain names from the arrays and add them to the local list. The message itself should be scanned for spamminess: repeated use of the $ character, FREE in all caps, and maybe an unusual words list (user definable) for words that shouldn't occur on a normal basis - viagra for example.
Quote:
As far as functions go - right now the code here has the actions taken inside the searching loop. To be honest these need to be separate. Set up some kind of static variable to count matches and return it, and put the ban action in a separate function (or look into the possibility of using the existing ban functions). BTW, I noticed that in spam buster you wrote a routine to send mail - there's already a mail function in vbulletin: vbmail. It's defined in the functions library, which is included on all executions of the vbulletin code.
Quote:
|
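The URL comparison and "spamminess" scan suggested in the post above might look something like this. It is a hedged sketch in Python rather than vBulletin PHP; the URL regex, the `UNUSUAL_WORDS` list, the threshold of 3, and every function name are assumptions chosen for illustration. Counting is kept apart from the decision, following the point about separating matching from the ban action.

```python
import re
from urllib.parse import urlparse

# Simplified URL pattern (an assumption; real posts may need BBCode handling).
URL_RE = re.compile(r"https?://[^\s\"'<>\[\]]+", re.IGNORECASE)
UNUSUAL_WORDS = {"viagra"}  # hypothetical, user-definable list


def extract_domains(text):
    """Return the set of host names found in URLs within text."""
    return {urlparse(u).hostname for u in URL_RE.findall(text)} - {None}


def shared_domains(signature, post, title):
    """Domains that appear in at least two of the three fields."""
    seen, shared = set(), set()
    for field in (signature, post, title):
        domains = extract_domains(field)
        shared |= domains & seen
        seen |= domains
    return shared


def count_matches(message):
    """Count spam indicators only; takes no action itself."""
    count = message.count("$")                      # each $ character
    count += len(re.findall(r"\bFREE\b", message))  # FREE in all caps
    words = re.findall(r"[a-z]+", message.lower())
    count += sum(1 for w in words if w in UNUSUAL_WORDS)
    return count


def should_flag(match_count, threshold=3):
    """Decision kept apart from counting; threshold is arbitrary here."""
    return match_count >= threshold


if __name__ == "__main__":
    sig = "My site: http://spam.example.com/buy"
    post = "FREE $$$ viagra at http://spam.example.com/now"
    print(shared_domains(sig, post, "Hello"), should_flag(count_matches(post)))
```

Domains returned by `shared_domains` are the candidates the post suggests adding to the local list; whether to add them automatically or queue them for review is left open here.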
#10
Quote:
Quote:
Quote:
A little wrapper script in ./includes/cron, called via the scheduled tasks, is probably the best way to go.
Quote:
Quote:
Quote:
Quote:
The sb_send_mail function was just there to build the mail; vbmail() was used to do the sending.
PHP Code:
Quote:
Right - there's plenty to be getting on with now.
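The update policy discussed in the thread (a full refresh only when requested from the admincp, otherwise the latest 100 additions every 3 to 5 days) could be decided by a small helper that the scheduled-task wrapper calls. The sketch below is in Python and does no downloading; `plan_update`, `merge_additions` and the 3-day default are assumptions for illustration only.

```python
from datetime import datetime, timedelta

INCREMENTAL_LIMIT = 100  # "latest 100 additions", per the thread


def plan_update(now, last_update, full_requested, interval_days=3):
    """Return 'full', 'incremental', or 'skip' for this scheduled run."""
    if full_requested:  # only an explicit admin request triggers a full refresh
        return "full"
    if last_update is None or now - last_update >= timedelta(days=interval_days):
        return "incremental"
    return "skip"


def merge_additions(local_entries, additions):
    """Append new entries to the local list, keeping order, skipping duplicates."""
    known = set(local_entries)
    merged = list(local_entries)
    for entry in additions[:INCREMENTAL_LIMIT]:
        if entry not in known:
            known.add(entry)
            merged.append(entry)
    return merged


if __name__ == "__main__":
    now = datetime.now()
    print(plan_update(now, now - timedelta(days=4), full_requested=False))
    print(merge_additions(["a.example.com"], ["a.example.com", "b.example.com"]))
```

Keeping the decision in a pure function means the wrapper in ./includes/cron only has to fetch and store, and the bandwidth-sparing rule can be checked without touching the network.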