The Arcive of Official vBulletin Modifications Site.It is not a VB3 engine, just a parsed copy! |
|
#751
|
|||
|
|||
16.5 mil posts
488k threads vb 3.6 no plans to move to vb 4 until everyone else does it, too. boolean and phrase search are needed. been missing them. spending for a search solution - no problem. spending 2k - no way. Quote:
|
#752
|
||||
|
||||
Okay, I've been working slowly but surely... Here's the following constraints thus far:
1. New threads/posts added when you run your delta cron job (most run every 2-5 min)... 2. Changes in # views, last poster, deleted threads / posts, etc should be real time updates. 3. Edits to the title or post text will not be updated until next full re-index (usually nightly) unless it is within the delta file. Will have boolean searching, phrase, etc... |
#753
|
|||
|
|||
Quote:
|
#754
|
|||
|
|||
Apart from using Sphinx to search for the similar threads, you can also use it to generate the post excerpts with search keywords highlighted when in the "Show search results as posts" mode.
Our stats: almost 14 mln posts, 1.1 mln threads, 300k users, vB 3.8. We're using our own Sphinx implementation since it predates the hack in this thread. We got rid of the obscure search and sort modes though (such as sorting by the number of views or replies), and there was not a single complaint from our members. I don't think you should focus too much on 100% compliance with the default search. Having too many document attributes will inflate the index size, resulting in more I/O and more sluggish performance. If you are worried about the need to edit the default search form template, you could always clone it, make the necessary changes and ship it with the product. |
#755
|
||||
|
||||
Thanks for the feedback guys. Another thing I'm pondering on is instead of trying to work off just a main + delta index is to break the total post count up and constantly rotate smaller indexes...
I.E. If a site has 10,000,000 posts... Have 10 indexes each with 1,000,000 threads. Then have each of the indexes rotate say hourly. This would be a shift from the typical one massive re-index nightly (or however often you do it). In theory too, the last index would contain the most recent posts and could be re-indexed more often. I dunno, that's just a thought... My concern right now is the core code for searching, the indexes themselves can be manipulated differently at a later time as that is transparent to everything else. |
#756
|
|||
|
|||
Quote:
|
#757
|
||||
|
||||
kmike - thanks for that info, I must over looked over that in the docs...
Just curious, how much of a performance difference did you see using the distributed process? I kind of got sidetracked today... One of my good friend's wife just got out of the hospital, so I was there for a while today. Then I was coding some anti-spammer measures for my forum registration process... |
#758
|
|||
|
|||
We have 2 post indexes, one or our live post table, and one for our archived post table. They each have 30 million posts each. I don't see a point in sharding the post indexes aside from being able to take advantage of multiple CPUs when indexing.
The way I see it, if I can keep the old indexes online while I do a full reindex, I don't really care how long the full reindex takes since (at least in our case), the search server is just a slave database server and not our primary. |
#759
|
||||
|
||||
The only thing I am waiting on before converting to vB4 is sphinx (or a working search alternative). The rest of the little stuff I modded I can do with or without until those developers get upgrades.
1.3 million threads 18 million posts |
#760
|
|||
|
|||
mute, can you share how did you archive post table ? What changes did you do in code and MySQL ? I want to move my old posts to another post_archive table but I am not sure how can I join those tables from vbulletin code.
eoc_Jason my forum is 200k threads and 10mil posts, vb 3.8.4. I have only one database (no slave), nginx webserver, Core I7 with 12GB RAM. I installed sphinx on server and from ssh it works great but from moded search.php it works very strange, sometimes when I want to find some keywords with option "show results as posts" it returns "no results" message but if I change search options to "show results as thread" with same keywords, I got good numbers of results showen as threads. Users posts search does not works at all, search.php?do=finduser&u=xxx always gives blank screen no php errors in log or anywhere just blank screen and thats it. |
Thread Tools | |
Display Modes | |
|
|
X vBulletin 3.8.12 by vBS Debug Information | |
---|---|
|
|
More Information | |
Template Usage:
Phrase Groups Available:
|
Included Files:
Hooks Called:
|