Go Back   vb.org Archive > vBulletin Modifications > Archive > vB.org Archives > vBulletin 3.7 > vBulletin 3.7 Add-ons
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools
Automatic Thread Tagger Details »»
Automatic Thread Tagger
Version: 1.2.0, by Phalynx Phalynx is offline
Developer Last Online: Nov 2015 Show Printable Version Email this Page

Category: New Posting Features - Version: 3.7.x Rating:
Released: 07-15-2008 Last Update: 01-08-2009 Installs: 841
DB Changes Uses Plugins Auto-Templates
Additional Files Translations  
No support by the author.

Automatic Thread Tagger


Description
When a user submits a new thread this modification will automatically take keywords from the thread title and use these as tags. You can use Automatic Thread Tagger to propose the user AJAX tags for his new thread, or it assigns new tags after saving the new thread. It can add the translated thread prefix to the tags.
Additionally, you can tag existing threads via maintenance and also scheduled tasks.

This modification is a successor to the terminated Automatic Thread Tagger by MrEyes:
https://vborg.vbsupport.ru/showthread.php?t=179927

As an example, if a user submits a thread with a title of:
"Fish Food for Cats!"

The thread will be automatically tagged with:

- Fish
- Food
- Cats

If the user also submits an actual tag of "Fish" this will not be duplicated. Any rules you have setup for tagging will be respected.
If you choose to do so this product will also automatically tag threads created by incoming RSS feeds.

Demo
I cannot show you the process of creation, but here is a list of tags generated by Auto Thread Tagger:
http://www.insideearth.net/tags.php?langid=5
http://www.insidesupcom.de/tags.php?langid=1


Automatic Tagging of existing threads
You can tag existing threads via maintenance or scheduled task/cron. They will be created with a special flag so they can be easily identified and deleted. Manual assigned tags are not touched. Maintenance is also working if Automatic Tagging is disabled via settings. Great if you want to test some settings. Automatic Tagging will take the date of the thread creation and also the userid of the creator. This process can be automated by running a scheduled job once a night.

Please keep in mind that tags that were proposed via AJAX are not tagged as auto tagged and therefore cannot be identified as such (and therefore not deleted automatically). If you want to retain the auto tagged flag you should disable AJAX and enable the tagging after the thread has been saved. As an alternative way you can also disable this and let new threads be tagged in the night from the scheduled job.


Installation / Upgrade
1. Upload all files from "upload" to your server, take care of the directory structure
2. Import "product-auto_thread_tagger110.xml" as a product, overwrite if it's already installed
3. Check settings
4. Run maintenance / Auto Tag Threads to tag existing threads (needed if you want to use the cron)

After install, and by default the modification is disabled, this will allow you to play around with configuration before switching it on.


Troubleshooting
If you report a bug please post the thread title that created it, without this I cannot test it and improve the language parsers.

* If no threads are tagged you will have to check the following:
- Is the modification enabled? Is the action you are testing enabled? (vBulletin tagging, whole auto thread tagger system, AJAX, new threads)
- Are the words you are using badwords or filtered out?

* Cron/Scheduled Task is not tagging all threads.
- The cron is limited to 500 (you can change this via settings) threads per run to avoid heavy impact on server. Make sure you run maintenance auto tagger before this to tag old threads. You can check the scheduled tasks log to see if it is running correctly.
Important: If a thread title does not meet minimum requirements to be included in tags (f.e. one word thread titles, too short words), it will be forever in this queue.

* I'm using polish, arabic, turkish, etc.. language and the tagger is not working like it should.
- If not already replaced, replace the filter replacement '&'=>'and' with ' & '=>'and' (a space before and after &)



Todo
What comes next? You decide. Tell me what you are missing and I'll look if it can be integrated.


Why thread title and not thread text?
Parsing the thread text for tags is an extremely unlikely addition as this would require some fairly heavy processing to ensure quality of tags.


What are Stopwords?
Stopwords is the name given to words which are filtered out prior to processing of tags.
The user Hostboard on vBulletin.org posted some resources regardings this:
https://vborg.vbsupport.ru/showpost....&postcount=380



History
1.2.0, 9th August 2008
- Fixed error with missing threadid's
- Fixed error with AJAX and prefix
- Fixed error with not indexing tags via cron
- Added polish, spain, english stopwords
- Compatibel with vBulletin 3.8

Show Your Support

  • This modification may not be copied, reproduced or published elsewhere without author's permission.

Comments
  #62  
Old 07-18-2008, 12:17 AM
Rik Brown's Avatar
Rik Brown Rik Brown is offline
 
Join Date: Nov 2005
Location: St. Louis, Missouri, USA
Posts: 132
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Excellent mod! Really love it. Thanks!

Wish list:

1) Remember the last thread # tagged and start from there ("resume mode") just tagging new threads rather than having to start over from scratch (which takes a very long time with 100,000+ threads) each time the mod is run.

2) Then, being able run the mod either by vb's scheduler or the server's cron (daily would be fine for me)

3) Would like to be able to ban tags with non-alphabetical characters unless they are in the vbulletin "Thread tag allowed list."

Thanks again. -- Rik
Reply With Quote
  #63  
Old 07-18-2008, 02:06 AM
needaltuna needaltuna is offline
 
Join Date: Sep 2007
Posts: 57
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Would somebody -- anybody! -- please tell me whether this mod improves search engine rankings. Thank you.
Reply With Quote
  #64  
Old 07-18-2008, 02:12 AM
glorify's Avatar
glorify glorify is offline
 
Join Date: Aug 2004
Posts: 376
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

It gives you more pages, but it may take awhile to get saturation unless the php is rewritten to html. The best way to get Google to visit the pages without rewriting or waiting is to add adsense in your navbar. That way, adsense bot will look at each page and theoretically, you'll get a tad quicker results.
Reply With Quote
  #65  
Old 07-18-2008, 04:22 AM
needaltuna needaltuna is offline
 
Join Date: Sep 2007
Posts: 57
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Thank you for that very informative answer.
Reply With Quote
  #66  
Old 07-18-2008, 07:13 AM
Phalynx Phalynx is offline
 
Join Date: Feb 2004
Location: Erlangen, Germany
Posts: 2,747
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by redlabour View Post
Example: bluewin.ch or aol.com should tagged as bluewin.ch and aol.com and not bluewinch and aolcom
Only if you remove the filter '.'=>''

Quote:
Originally Posted by popowich View Post
I think I see the remaining part of the duplicates problem.

There are leading spaces in front of some the tags.

For example "tag" and " tag".
Thanks, I'll take a deeper look. If thats it, I will include the fix in the next version.

Quote:
Originally Posted by popowich View Post
Is there a way to build phrases? "new york" instead of "new" and "york". I don't care if it's not perfect, just looking to teach it some common ones for my site.
Use "Composite Tags".

Quote:
Originally Posted by Rik Brown View Post
Wish list:

1) Remember the last thread # tagged and start from there ("resume mode") just tagging new threads rather than having to start over from scratch (which takes a very long time with 100,000+ threads) each time the mod is run.

2) Then, being able run the mod either by vb's scheduler or the server's cron (daily would be fine for me)

3) Would like to be able to ban tags with non-alphabetical characters unless they are in the vbulletin "Thread tag allowed list."
Great ideas, will try to implement them in next version.
Reply With Quote
  #67  
Old 07-18-2008, 08:45 AM
6piston's Avatar
6piston 6piston is offline
 
Join Date: Apr 2005
Posts: 139
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

hi phalynx

thanks for the plugin. a real classic.
works like charm!

my only question is,
i've got 213156 threads to tag.

i ran maintenence and the thing is, it freezes at random numbers of posts. sometimes 1640 sometimes 12XXX posts. i've set the minium number of post tagging to 99999999999999 as well.

i also tried turning the board off but it still happens.

i understand that this could be a server issue too

but anyone out there with the same problem, please tell me what to do to get all posts tagged?

thanks!
Tom
Reply With Quote
  #68  
Old 07-18-2008, 11:29 AM
Charlie98902 Charlie98902 is offline
 
Join Date: Dec 2006
Posts: 1,156
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

So remove this : '-'=>'' to allow the use of this: -?
Reply With Quote
  #69  
Old 07-18-2008, 12:04 PM
Phalynx Phalynx is offline
 
Join Date: Feb 2004
Location: Erlangen, Germany
Posts: 2,747
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Yes
Reply With Quote
  #70  
Old 07-18-2008, 12:56 PM
Britmovie Britmovie is offline
 
Join Date: Mar 2008
Posts: 154
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Exclude Search Words?
Is it best to set to yes or no? Little confused on this one?
Reply With Quote
  #71  
Old 07-18-2008, 03:53 PM
tfw2005 tfw2005 is offline
 
Join Date: Sep 2004
Posts: 58
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

I had an issue, not sure what if anything can be done.

Had the default vbulletin max tags by thread starter at 5, and user at 2.

When the auto tagger ran on long thread titles by Registerd Users, it tried to insert 10,15 words, etc.

They kept getting "thread tags were exceeded by X amount" error messages.

Pissing people off, because they don't even know its happening, or what to do to fix it.

Any way we can get a work around saying "if auto-tagger, allow unlimited tags -> else use tagging default"

My temp work around is allowing a super-high amount of tags for thread starter.
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 01:25 AM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2024, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.09605 seconds
  • Memory Usage 2,321KB
  • Queries Executed 25 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (4)bbcode_quote
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)modsystem_post
  • (1)navbar
  • (6)navbar_link
  • (120)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (4)pagenav_pagelink
  • (2)pagenav_pagelinkrel
  • (11)post_thanks_box
  • (11)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (11)post_thanks_postbit_info
  • (10)postbit
  • (11)postbit_onlinestatus
  • (11)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • pagenav_page
  • pagenav_complete
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete