  #1  
08-09-2004, 10:41 AM
TosaInu
Join Date: Jul 2004
Posts: 256
Searchlog

Hello,

Our site has about 500,000 posts. It's not the biggest site, but some posts are pages long, so it's pretty important to have an efficient searchlog (all the more so because we want to add other hacks). I read about fulltext search: in some ways it's interesting for us, in others it's not.

- It's possible to exclude some forums from the searchlog, but SQL fulltext is all or nothing (as far as I understand it). Our board has an Off Topic forum whose content is 'volatile'. The topics shouldn't be deleted, but storing all those posts in the searchlog, where they only get in the way of searching for real content, isn't a good idea either.

A post database with an SQL fulltext search index is about as large as one with a searchlog. The searchlog can be made smaller, though (people with access to the SQL configuration can probably gain something there). An optimized searchlog is better for storage, and I guess it will beat SQL fulltext in speed.

- The SQL search omits short words, whereas with the searchlog it's easy (and for us necessary) to add some site-specific ones. I estimate we have 50 words shorter than 3 letters that are prime subjects of our site. The searchlog allows us to index them.

The searchlog lacks some options, though, that would make it the perfect solution for us. One is a badword list. Storing tens of thousands of records for variants of $@#!, cowstuff, horsetool, dollar and pound sterling prices of products, words merged with & entities like &34my, numbers, yes..yes, yeh, yes, yes? and dozens of their variants, hello, hallo, ciao, current, altogether, nice, mine, yours and so on is not efficient. The word "mine" alone has 3631 records.
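To make the wish list above concrete, here is a minimal sketch of the kind of filter it describes: keep a whitelist of short site-specific words, and skip badwords, numbers, prices and entity-merged tokens. It is written in Python for illustration only; `MIN_WORD_LENGTH`, `SHORT_WORD_WHITELIST`, `BADWORDS` and the function names are hypothetical and are not taken from vBulletin.

```python
# Hypothetical settings: none of these names come from vBulletin itself.
MIN_WORD_LENGTH = 3                    # assumed cut-off for "short" words
SHORT_WORD_WHITELIST = {"tv", "pc"}    # placeholder site-specific short words
BADWORDS = {"hello", "hallo", "ciao", "yes", "yeh", "nice", "mine", "yours"}


def normalize(token):
    """Lowercase a token and strip common punctuation from both ends."""
    return token.lower().strip(".,;:!?\"'()[]")


def should_index(token):
    """Decide whether a token deserves a row in the search index."""
    word = normalize(token)
    if not word:
        return False
    if word in SHORT_WORD_WHITELIST:
        return True        # site-specific short words are always kept
    if len(word) < MIN_WORD_LENGTH:
        return False       # other short words are dropped
    if word in BADWORDS:
        return False       # greetings, filler words and the like
    if not word.isalpha():
        return False       # numbers, prices, "&34my", "yes..yes", "$@#!"
    return True


def indexable_words(text):
    """Return the normalized words of a post that would be indexed."""
    return [normalize(t) for t in text.split() if should_index(t)]


sample = "Hello, my TV costs $300 yes..yes &34my katana"
print(indexable_words(sample))
```

The whitelist is checked before the length test so that the short "prime subject" words survive while every other short token is dropped.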

A tool to delete such entries from an existing searchlog would also be great. I know it's possible to run SQL queries in, say, phpMyAdmin, but that's error-prone and time-consuming.

A PHP script that lists the wordlog and lets you select the words you want to strip would be convenient. The script would store the selected word IDs in an array and delete the corresponding records from the postindex:

-- 123 is a placeholder for the ID of a word selected for removal
DELETE FROM vb_postindex
WHERE wordid = 123;
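The requested tool could work roughly as sketched below: list the words with the most index records, then delete the rows for the selected word IDs in one parameterized query. This is a sketch, not the requested PHP script. It uses Python with an in-memory SQLite database as a stand-in for MySQL so that it runs anywhere. Only `vb_postindex` and `wordid` appear in this thread; the `vb_word` table and the `title` and `postid` columns are assumptions about the schema.

```python
import sqlite3


def make_demo_db():
    """Build a tiny in-memory database with an assumed word/postindex layout."""
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE vb_word (wordid INTEGER PRIMARY KEY, title TEXT);
        CREATE TABLE vb_postindex (wordid INTEGER, postid INTEGER);
        INSERT INTO vb_word VALUES (1, 'mine'), (2, 'hello'), (3, 'katana');
        INSERT INTO vb_postindex VALUES (1, 10), (1, 11), (1, 12), (2, 10), (3, 11);
    """)
    return conn


def top_words(conn, limit=20):
    """Return (wordid, title, records) for the words with the most index rows."""
    return conn.execute(
        """
        SELECT w.wordid, w.title, COUNT(*) AS records
        FROM vb_word AS w
        JOIN vb_postindex AS p ON p.wordid = w.wordid
        GROUP BY w.wordid, w.title
        ORDER BY records DESC, w.wordid
        LIMIT ?
        """,
        (limit,),
    ).fetchall()


def delete_words(conn, word_ids):
    """Delete all postindex rows for the given word IDs; return rows deleted."""
    ids = [int(i) for i in word_ids]      # force integers before querying
    if not ids:
        return 0
    placeholders = ", ".join("?" for _ in ids)
    with conn:                            # commit on success, roll back on error
        cur = conn.execute(
            f"DELETE FROM vb_postindex WHERE wordid IN ({placeholders})", ids
        )
    return cur.rowcount


conn = make_demo_db()
before = top_words(conn)                  # candidates, most frequent first
deleted = delete_words(conn, [1, 2])      # strip 'mine' and 'hello'
after = top_words(conn)
print(before, deleted, after)
```

Showing the record count next to each word makes it easier to see which removals actually shrink the index, and running the delete inside one transaction avoids a half-finished clean-up if something fails.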

I lack the knowledge to write even the most basic PHP script. I think such a tool would be a great help in optimizing the searchlog, and I would certainly use it. Could someone help, please?
  #2  
08-09-2004, 12:52 PM
TosaInu
Join Date: Jul 2004
Posts: 256

It turns out there is a badword list: vb\includes\searchwords.php. That's nice. Would it be possible to make a cleaning tool that automatically appends the words it removes to the badword list? Train it, so to speak.
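The "training" step could look something like the sketch below: every word stripped from the index is merged into the existing badword list, de-duplicated and sorted. The format of searchwords.php is not shown in this thread, so the sketch only renders PHP single-quoted entries that could be pasted into such a list by hand. The function names are hypothetical, and Python is used purely for illustration.

```python
def merge_badwords(existing, removed):
    """Return the badword list extended with the removed words.

    Words are lowercased, stripped of surrounding whitespace,
    de-duplicated and sorted.
    """
    merged = {w.strip().lower() for w in existing if w.strip()}
    merged.update(w.strip().lower() for w in removed if w.strip())
    return sorted(merged)


def php_quote(word):
    """Quote a word as a PHP single-quoted string literal."""
    return "'" + word.replace("\\", "\\\\").replace("'", "\\'") + "'"


def render_entries(words):
    """Render words as comma-separated PHP string literals."""
    return ", ".join(php_quote(w) for w in words)


current = ["the", "and"]                  # placeholder for the existing list
stripped = ["Hello", "mine", "and "]      # words just removed from the index
updated = merge_badwords(current, stripped)
print(render_entries(updated))
```

Keeping the merge separate from the rendering means the same word list could be written out in whatever format the real file turns out to use.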
  #3  
08-09-2004, 12:59 PM
Liquid1ce
Join Date: Feb 2004
Location: /root
Posts: 196

TosaInu,

Totally off topic, but maybe you should change your forum default skin to your own.
  #4  
08-09-2004, 01:08 PM
TosaInu
Join Date: Jul 2004
Posts: 256

Hello Liquid1ce,

You mean on this board? I'm lost again, what benefit does that give?
  #5  
08-09-2004, 03:08 PM
TosaInu
Join Date: Jul 2004
Posts: 256

I ran some manual queries and stripped some greetings and the names of political factions that members currently use to insult each other:

SQL query:
DELETE FROM `anvb3_postindex` WHERE wordid = '5972'; # Affected rows: 1944
DELETE FROM `anvb3_postindex` WHERE wordid = '5974'; # Affected rows: 5270
DELETE FROM `anvb3_postindex` WHERE wordid = '5986'; # Affected rows: 350
DELETE FROM `anvb3_postindex` WHERE wordid = '6009'; # Affected rows: 1215
DELETE FROM `anvb3_postindex` WHERE wordid = '6016'; # Affected rows: 1654
DELETE FROM `anvb3_postindex` WHERE wordid = '6024'; # Affected rows: 11481
DELETE FROM `anvb3_postindex` WHERE wordid = '6025'; # Affected rows: 18
DELETE FROM `anvb3_postindex` WHERE wordid = '6039'; # Affected rows: 1
DELETE FROM `anvb3_postindex` WHERE wordid = '6040'; # Affected rows: 607
DELETE FROM `anvb3_postindex` WHERE wordid = '6041'; # Affected rows: 27
DELETE FROM `anvb3_postindex` WHERE wordid = '6102'; # Affected rows: 761
DELETE FROM `anvb3_postindex` WHERE wordid = '6104'; # Affected rows: 13805
DELETE FROM `anvb3_postindex` WHERE wordid = '6123'; # Affected rows: 3631
DELETE FROM `anvb3_postindex` WHERE wordid = '6130'; # Affected rows: 158
DELETE FROM `anvb3_postindex` WHERE wordid = '6152'; # Affected rows: 137
DELETE FROM `anvb3_postindex` WHERE wordid = '6174'; # Affected rows: 10
DELETE FROM `anvb3_postindex` WHERE wordid = '6175'; # Affected rows: 14
DELETE FROM `anvb3_postindex` WHERE wordid = '6189'; # Affected rows: 3046
DELETE FROM `anvb3_postindex` WHERE wordid = '6190'; # Affected rows: 776
DELETE FROM `anvb3_postindex` WHERE wordid = '6194'; # Affected rows: 2
DELETE FROM `anvb3_postindex` WHERE wordid = '6195'; # Affected rows: 16
DELETE FROM `anvb3_postindex` WHERE wordid = '6209'; # Affected rows: 2
DELETE FROM `anvb3_postindex` WHERE wordid = '6211'; # Affected rows: 18
DELETE FROM `anvb3_postindex` WHERE wordid = '6222'; # Affected rows: 289
DELETE FROM `anvb3_postindex` WHERE wordid = '6238'; # Affected rows: 13
DELETE FROM `anvb3_postindex` WHERE wordid = '6242'; # Affected rows: 704
DELETE FROM `anvb3_postindex` WHERE wordid = '6256'; # Affected rows: 897
DELETE FROM `anvb3_postindex` WHERE wordid = '6261'; # Affected rows: 304

"Affected rows" ("Getroffen rijen" in the Dutch query output) is the number of rows each query deleted.

In total, roughly 47,000 rows were deleted.
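The per-query counts in the log above can be totalled mechanically rather than by eye. The short Python check below copies the 28 (word ID, deleted rows) pairs from the log; they add up to 47,150 rows, and the two most frequent words alone account for more than half of that.

```python
# (wordid, deleted rows) pairs copied from the query log above
deleted_rows = [
    (5972, 1944), (5974, 5270), (5986, 350), (6009, 1215),
    (6016, 1654), (6024, 11481), (6025, 18), (6039, 1),
    (6040, 607), (6041, 27), (6102, 761), (6104, 13805),
    (6123, 3631), (6130, 158), (6152, 137), (6174, 10),
    (6175, 14), (6189, 3046), (6190, 776), (6194, 2),
    (6195, 16), (6209, 2), (6211, 18), (6222, 289),
    (6238, 13), (6242, 704), (6256, 897), (6261, 304),
]

# Total number of index rows removed by all queries together
total = sum(rows for _, rows in deleted_rows)

# The three word IDs that freed the most rows
largest = sorted(deleted_rows, key=lambda item: item[1], reverse=True)[:3]

# Share of the total contributed by the two biggest words
top_two_share = sum(rows for _, rows in largest[:2]) / total

print(total)                      # 47150
print(largest)
print(round(top_two_share, 3))
```

A handful of very common words dominate the savings, which suggests that sorting the word list by record count is the quickest way to find worthwhile candidates.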
All times are GMT.

