vb.org Archive

I suck at regexp! (read: HELP!) (https://vborg.vbsupport.ru/showthread.php?t=61683)

Cloudrunner 02-18-2004 02:20 AM

I suck at regexp! (read: HELP!)
 
So I need a little guidance.

This is for my latest idea for a hack and will be released soon.

BUT

I have set up the SQL to pull all the posts from the DB that contain a URL link via
Code:

SELECT pagetext FROM posts WHERE pagetext LIKE '%[ URL ]%[ /URL ]%';
That being the case, $row['pagetext'] will be the entire post.

From that string I need to extract each and EVERY instance of '[ URL ]blah[ /URL ]' from within that string.

Now I understand I could use the explode() function etc., but that only gives me the first instance of '[ URL ]blah[ /URL ]'. I need each and every instance, because my users have a habit of adding multiple URLs to their posts.

That being said, anyone who can lend a hand on this will get full credit when the hack is released.

Any takers, or even suggestions?

I've been fighting this for a few days trying to find the correct way to do it, and have failed, for I suck at regexp!

Thank you in advance for any help that you may give.

)O( Cloudrunner )O(
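
For reference, the "every instance" requirement above is what preg_match_all is for: it returns all matches in the string rather than just the first. The following is only a minimal sketch, assuming the stored posts use the usual [URL]...[/URL] and [URL=target]label[/URL] forms (written without the spaces shown in the post). The function name extract_url_tags and the example.com addresses are made up for illustration and are not part of vBulletin.

```php
<?php
// Hypothetical helper (not part of vBulletin): returns every [URL] tag found in $pagetext.
function extract_url_tags($pagetext)
{
    // Matches [URL]target[/URL] and [URL=target]label[/URL], case-insensitively.
    // Group 1 is the optional "=target" value, group 2 is the text between the tags.
    $pattern = '#\[url(?:=["\']?([^\]"\']+)["\']?)?\](.*?)\[/url\]#is';

    $found = array();
    if (preg_match_all($pattern, $pagetext, $matches, PREG_SET_ORDER)) {
        foreach ($matches as $m) {
            $label = $m[2];
            // Without "=target", the text between the tags is the URL itself.
            $target = ($m[1] !== '') ? $m[1] : $label;
            $found[] = array('url' => $target, 'label' => $label);
        }
    }
    return $found;
}

// Example post text with two links (assumed sample data).
$post = 'See [URL]http://example.com/a[/URL] and [url=http://example.com/b]this one[/url].';
print_r(extract_url_tags($post));
```

The lazy (.*?) keeps each match from running on into the next tag, which matters when a post contains several links.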

AndrewD 02-18-2004 08:45 AM

I had the same need, and this is what I came up with. Rather than looking for the URLs directly, I pass the post through parse_bbcode first, because there may be HTML in there as well. It dumps the link text and the link targets into $titles and $links (which are arrays; check the documentation on preg_match).

PHP Code:

// Fetch every post together with the title of its thread, oldest first.
$selectpost = $DB_site->query("
        SELECT " .
            TABLE_PREFIX . "post.postid as postid, " .
            TABLE_PREFIX . "post.username as username, " .
            TABLE_PREFIX . "post.userid as userid, " .
            TABLE_PREFIX . "post.threadid as threadid, " .
            TABLE_PREFIX . "post.title as title, " .
            TABLE_PREFIX . "post.pagetext as pagetext, " .
            TABLE_PREFIX . "post.dateline as dateline, " .
            TABLE_PREFIX . "thread.title as threadtitle
        FROM " . TABLE_PREFIX . "post LEFT JOIN " . TABLE_PREFIX . "thread
        ON " . TABLE_PREFIX . "post.threadid = " . TABLE_PREFIX . "thread.threadid
        ORDER BY " . TABLE_PREFIX . "post.dateline
    ");

$urllist = array();

while ($postrec = $DB_site->fetch_array($selectpost)) {
   // Turn the BBCode into HTML so that [URL] tags and raw HTML links look the same.
   $p = parse_bbcode2($postrec['pagetext'], 0, 0, 0, 1);

   // Work through the post one line at a time.
   $lines = preg_split('/(\Z|<br \/>)/', $p, -1, PREG_SPLIT_NO_EMPTY);
   foreach ($lines as $line) {
      // $i is the number of <a>...</a> elements found on this line.
      $i = preg_match_all("/<a.*?>.*?<\/a.*?>/", $line, $url, PREG_SET_ORDER);

      $k = 0;
      while ($k < $i) {
         // $titles[1] is the link text, $links[1] is the href value.
         preg_match("/>(.*?)</", $url[$k][0], $titles);
         preg_match("/<a *href *= *\"*(.*?)\"*( |>)/", $url[$k][0], $links);
         $k++;
      }
   }
}
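
The loop above runs two preg_match calls per link and leaves the results in $titles and $links without storing them anywhere. As a rough sketch only, the same idea can be done in one preg_match_all pass that captures the href and the link text together and appends them to a list. The function name collect_links, the array keys, and the sample HTML are assumptions made for this example, not part of the original code or of vBulletin.

```php
<?php
// Hypothetical helper: collects the href and link text of every <a> tag in $html.
function collect_links($html)
{
    $urllist = array();
    // Group 1: href value (double-quoted, single-quoted, or bare); group 2: link text.
    $pattern = '#<a\s[^>]*?href\s*=\s*["\']?([^"\'\s>]+)["\']?[^>]*>(.*?)</a\s*>#is';
    if (preg_match_all($pattern, $html, $matches, PREG_SET_ORDER)) {
        foreach ($matches as $m) {
            $urllist[] = array(
                'link'  => $m[1],
                // Drop any markup nested inside the link text.
                'title' => trim(strip_tags($m[2])),
            );
        }
    }
    return $urllist;
}

// Sample HTML standing in for the output of the BBCode parser (assumed data).
$html = 'Try <a href="http://example.com/a" target="_blank">the <b>first</b> site</a><br />'
      . 'or <a href=\'http://example.com/b\'>the second</a>.';
print_r(collect_links($html));
```

Because the s modifier lets a match cross line breaks, this version does not need the post split on <br /> first. For badly broken markup a DOM parser is likely to be more forgiving than any regular expression.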



