Go Back   vb.org Archive > vBulletin Modifications > Premium Modifications > vBulletin Open Source Products > Sphinx
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools
Sphinx: WARNING: duplicate document ids found Details »»
Sphinx: WARNING: duplicate document ids found
Version: , by FractalizeR FractalizeR is offline
Developer Last Online: Nov 2014 Show Printable Version Email this Page

Version: Unknown Rating:
Released: 05-04-2010 Last Update: Never Installs: 0
 
No support by the author.

The following is the output of cronjob /usr/local/sphinx/cron/delta.sh:

Code:
Sphinx 0.9.8-id64-release (r1371)
Copyright (c) 2001-2008, Andrew Aksyonoff

using config file '/usr/local/sphinx/etc/vbulletin-sphinx.php'...
indexing index 'ForumDelta'...
collected 43 docs, 0.0 MB
collected 1 attr values
sorted 0.0 Mvalues, 100.0% done
sorted 0.0 Mhits, 100.0% done
total 43 docs, 5030 bytes
total 0.014 sec, 361480.03 bytes/sec, 3090.19 docs/sec
rotating indices: succesfully sent SIGHUP to searchd (pid=11570).
Sphinx 0.9.8-id64-release (r1371)
Copyright (c) 2001-2008, Andrew Aksyonoff

using config file '/usr/local/sphinx/etc/vbulletin-sphinx.php'...
indexing index 'ThreadPostDelta'...
collected 2966 docs, 1.2 MB
collected 588 attr values
sorted 0.0 Mvalues, 100.0% done
sorted 0.1 Mhits, 84.8% done
WARNING: duplicate document ids found
total 2966 docs, 1159212 bytes
total 122.929 sec, 9429.94 bytes/sec, 24.13 docs/sec
rotating indices: succesfully sent SIGHUP to searchd (pid=11570).
Sphinx 0.9.8-id64-release (r1371)
Copyright (c) 2001-2008, Andrew Aksyonoff

using config file '/usr/local/sphinx/etc/vbulletin-sphinx.php'...
indexing index 'DiscussionMessageDelta'...
collected 0 docs, 0.0 MB
collected 1 attr values
sorted 0.0 Mvalues, 100.0% done
total 0 docs, 0 bytes
total 0.034 sec, 0.00 bytes/sec, 0.00 docs/sec
rotating indices: succesfully sent SIGHUP to searchd (pid=11570).
Sphinx 0.9.8-id64-release (r1371)
Copyright (c) 2001-2008, Andrew Aksyonoff

using config file '/usr/local/sphinx/etc/vbulletin-sphinx.php'...
indexing index 'SocialGroupDelta'...
collected 0 docs, 0.0 MB
collected 1 attr values
sorted 0.0 Mvalues, 100.0% done
total 0 docs, 0 bytes
total 0.010 sec, 0.00 bytes/sec, 0.00 docs/sec
rotating indices: succesfully sent SIGHUP to searchd (pid=11570).
Sphinx 0.9.8-id64-release (r1371)
Copyright (c) 2001-2008, Andrew Aksyonoff

using config file '/usr/local/sphinx/etc/vbulletin-sphinx.php'...
indexing index 'VisitorMessageDelta'...
collected 0 docs, 0.0 MB
collected 1 attr values
sorted 0.0 Mvalues, 100.0% done
total 0 docs, 0 bytes
total 0.014 sec, 0.00 bytes/sec, 0.00 docs/sec
rotating indices: succesfully sent SIGHUP to searchd (pid=11570).
Sphinx 0.9.8-id64-release (r1371)
Copyright (c) 2001-2008, Andrew Aksyonoff

using config file '/usr/local/sphinx/etc/vbulletin-sphinx.php'...
indexing index 'BlogEntryDelta'...
collected 0 docs, 0.0 MB
collected 0 attr values
sorted 0.0 Mvalues, nan% done
total 0 docs, 0 bytes
total 0.046 sec, 0.00 bytes/sec, 0.00 docs/sec
rotating indices: succesfully sent SIGHUP to searchd (pid=11570).
Sphinx 0.9.8-id64-release (r1371)
Copyright (c) 2001-2008, Andrew Aksyonoff

using config file '/usr/local/sphinx/etc/vbulletin-sphinx.php'...
indexing index 'BlogCommentDelta'...
collected 0 docs, 0.0 MB
collected 1 attr values
sorted 0.0 Mvalues, 100.0% done
total 0 docs, 0 bytes
total 0.010 sec, 0.00 bytes/sec, 0.00 docs/sec
rotating indices: succesfully sent SIGHUP to searchd (pid=11570).
Sphinx 0.9.8-id64-release (r1371)
Copyright (c) 2001-2008, Andrew Aksyonoff

using config file '/usr/local/sphinx/etc/vbulletin-sphinx.php'...
indexing index 'CMSArticlesDelta'...
collected 0 docs, 0.0 MB
collected 1 attr values
sorted 0.0 Mvalues, 100.0% done
total 0 docs, 0 bytes
total 0.011 sec, 0.00 bytes/sec, 0.00 docs/sec
rotating indices: succesfully sent SIGHUP to searchd (pid=11570).
Please look at ThreadPostDelta indexing:

WARNING: duplicate document ids found message appears. Is that a normal behavior of Sphinx? What is the document id used?

Show Your Support

  • This modification may not be copied, reproduced or published elsewhere without author's permission.

Comments
  #2  
Old 05-04-2010, 03:41 PM
sung sung is offline
 
Join Date: Feb 2002
Posts: 36
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

I got the warning as well (so glad it isn't just me), which I've reported in the vbulletin.com forums.

It can cause all sorts of nasty problems with Sphinx.

Quote:
There are a few different restrictions imposed on the source data which is going to be indexed by Sphinx, of which the single most important one is:

ALL DOCUMENT IDS MUST BE UNIQUE UNSIGNED NON-ZERO INTEGER NUMBERS (32-BIT OR 64-BIT, DEPENDING ON BUILD TIME SETTINGS).

If this requirement is not met, different bad things can happen. For instance, Sphinx can crash with an internal assertion while indexing; or produce strange results when searching due to conflicting IDs. Also, a 1000-pound gorilla might eventually come out of your display and start throwing barrels at you. You've been warned.
Reply With Quote
  #3  
Old 05-04-2010, 08:42 PM
FractalizeR's Avatar
FractalizeR FractalizeR is offline
 
Join Date: Oct 2005
Location: Russia, Moscow
Posts: 368
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

The following combination is used in configuration file to make so-called Document ID, that MUST be unique:

Code:
SELECT (c.contenttypeid << 32) | (p.postid) AS id
On some reason, it appears non-unique. However, I don't see how it can be other than really duplicating rows are returned by complete query
Reply With Quote
  #4  
Old 06-19-2010, 11:58 PM
graham_w graham_w is offline
 
Join Date: Apr 2005
Posts: 46
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Did you ever sort this out - i'm noticing the same error.

Cheers
Reply With Quote
  #5  
Old 06-20-2010, 06:49 AM
FractalizeR's Avatar
FractalizeR FractalizeR is offline
 
Join Date: Oct 2005
Location: Russia, Moscow
Posts: 368
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

No, but it looks like it doesn't affect search quality.
Reply With Quote
  #6  
Old 06-20-2010, 07:53 AM
graham_w graham_w is offline
 
Join Date: Apr 2005
Posts: 46
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Thanks for the reply - yeah I did find a thread saying similar on the sphinx website.

Cheers
Reply With Quote
  #7  
Old 06-20-2010, 05:35 PM
JesterP JesterP is offline
 
Join Date: Jun 2007
Posts: 92
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by graham_w View Post
Thanks for the reply - yeah I did find a thread saying similar on the sphinx website.

Cheers
I recieved in my inbox this morning:

--->8---

### SAVE ORDERED IDS TO SEARCH CACHE ###;

MySQL Error : Duplicate entry '92f3f32f09b269797e91242ce55639a6-lastpost-DESC' for key 2
Error Number : 1062
Request Date : Sunday, June 20th 2010 @ 10:44:01 AM

---8<---
Everything is still running and I am not seeing anything bad happening. No errors since.
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 08:53 AM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2024, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.06516 seconds
  • Memory Usage 2,265KB
  • Queries Executed 20 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (2)bbcode_code
  • (2)bbcode_quote
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)modsystem_post
  • (1)navbar
  • (5)navbar_link
  • (120)option
  • (7)post_thanks_box
  • (7)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (7)post_thanks_postbit_info
  • (6)postbit
  • (7)postbit_onlinestatus
  • (7)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete