Go Back   vb.org Archive > vBulletin Modifications > vBulletin 4.x Modifications > vBulletin 4.x Add-ons
FAQ Community Calendar Today's Posts Search

Reply
 
Thread Tools
VBPG HTML Extractor - Convert URL to CMS Article Details »»
VBPG HTML Extractor - Convert URL to CMS Article
Version: 1.0.0, by Dylanblitz Dylanblitz is offline
Developer Last Online: Nov 2023 Show Printable Version Email this Page

Category: Administrative and Maintenance Tools - Version: 4.0.8 Rating:
Released: 11-22-2010 Last Update: Never Installs: 7
Uses Plugins
Additional Files Is in Beta Stage  
No support by the author.

Not sure if anyone will need this. A friend asked me to make it for him so he could convert a lot of static pages from his old site into CMS Articles. So I figured I'd release it here in case it'll help anyone else.

This will take a url and convert it into a Static Page CMS Article. Depending on the layout of the original page and your understanding of regex you might get away with minor editing. If you are pulling from a Drupal site or something like it, it can be very easy to pull just the data. On the sample site I show I had to do no editing of the body text.

I am not the greatest with regex so please don't ask me to figure out the regex expression to get your content into the page. If you can't figure it out, I have the default set to use the body tags and you'll have to edit out the extra menu/banner/etc content.

I ran into one issue, not sure what causes it. When I was working with it on my site it worked no problem. When I installed a vanilla vB with the sample CMS data it wouldn't show any of the extracted articles on the All Sections view. Once I clicked the Verify and Repair Node Table it worked fine.

Download Now

File Type: zip html_extractor.zip (6.1 KB, 80 views)

Screenshots

File Type: jpg 01 - original.jpg (112.5 KB, 0 views)
File Type: jpg 02 - admin.jpg (51.0 KB, 0 views)
File Type: jpg 03 - admin1.jpg (63.0 KB, 0 views)
File Type: png 04 - admin2.png (7.8 KB, 0 views)
File Type: png 05 - admin3.png (39.1 KB, 0 views)
File Type: jpg 06 - front.jpg (58.7 KB, 0 views)
File Type: jpg 07 - front2.jpg (93.1 KB, 0 views)

Supporters / CoAuthors

Show Your Support

  • This modification may not be copied, reproduced or published elsewhere without author's permission.

Comments
  #2  
Old 11-23-2010, 08:06 PM
Dylanblitz Dylanblitz is offline
 
Join Date: Oct 2005
Location: OC, California
Posts: 732
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

**Reserved**
Reply With Quote
  #3  
Old 11-23-2010, 09:47 PM
Alfa1's Avatar
Alfa1 Alfa1 is offline
 
Join Date: Dec 2005
Location: Netherlands
Posts: 3,537
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Thanks for releasing this!

Does this convert it page for page or can I use this to convert my 200 html pages to vbCMS?

If so then how does this handle category structure?
Reply With Quote
  #4  
Old 11-23-2010, 10:08 PM
Dylanblitz Dylanblitz is offline
 
Join Date: Oct 2005
Location: OC, California
Posts: 732
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by Alfa1 View Post
Thanks for releasing this!

Does this convert it page for page or can I use this to convert my 200 html pages to vbCMS?

If so then how does this handle category structure?
It just does one page at a time, I'm not sure how I would be able to loop it without knowing the title of each page to lookup.
Reply With Quote
  #5  
Old 11-23-2010, 11:27 PM
wottech wottech is offline
 
Join Date: May 2006
Posts: 122
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

In Alfa1's case, I think he has his static pages in specific folders, and each folder is named for a specific category. With that said, could you loop it to convert every file in a specified directory?

{runs and hides}
Reply With Quote
  #6  
Old 11-24-2010, 12:28 AM
Dylanblitz Dylanblitz is offline
 
Join Date: Oct 2005
Location: OC, California
Posts: 732
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Quote:
Originally Posted by wottech View Post
In Alfa1's case, I think he has his static pages in specific folders, and each folder is named for a specific category. With that said, could you loop it to convert every file in a specified directory?

{runs and hides}
If it's on the same server and accessible with the current account it would be fairly easy. Walking a remote directory is harder, especially if you don't have ftp access.

So it depends really on how the server is setup.

Another way is to have a textbox where you can add several links at once, say a list of 200 of them, and cycle through those. But that's just easier on the coder

I'll play around and see if I can make it have several different options depending on how the site/server is setup.
In the case of multiple though I'd have to go with defaults on most of the options with the admin selecting the section to enter the new articles into. Then the admin would have to go through and modify the articles for their preview text, settings, etc.
Reply With Quote
  #7  
Old 11-25-2010, 07:59 PM
M@rc M@rc is offline
 
Join Date: Oct 2010
Posts: 19
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Great mod. Tagged for later.
Reply With Quote
  #8  
Old 06-27-2011, 07:51 PM
final kaoss final kaoss is offline
 
Join Date: Apr 2006
Posts: 1,314
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Interesting, I'm wondering how this would work with the new editor and also is there a way to make it post the static pages as threads instead of articles?
Reply With Quote
  #9  
Old 02-02-2012, 04:00 PM
bootytag bootytag is offline
 
Join Date: Jun 2011
Posts: 2
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Can it do RSS FEEDS and update it
Reply With Quote
  #10  
Old 04-26-2012, 01:24 PM
avitor avitor is offline
 
Join Date: Jun 2007
Posts: 119
Благодарил(а): 0 раз(а)
Поблагодарили: 0 раз(а) в 0 сообщениях
Default

Your criteria or the body tags did not return any valid content. Please press the back button and try again
any help what the reason?
thanks cool mod
Reply With Quote
Reply


Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 03:32 PM.


Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2024, vBulletin Solutions Inc.
X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.04562 seconds
  • Memory Usage 2,326KB
  • Queries Executed 25 (?)
More Information
Template Usage:
  • (1)SHOWTHREAD
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)ad_showthread_beforeqr
  • (2)bbcode_quote
  • (1)footer
  • (1)forumjump
  • (1)forumrules
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (1)modsystem_post
  • (1)navbar
  • (4)navbar_link
  • (120)option
  • (1)pagenav
  • (1)pagenav_curpage
  • (1)pagenav_pagelink
  • (10)post_thanks_box
  • (10)post_thanks_button
  • (1)post_thanks_javascript
  • (1)post_thanks_navbar_search
  • (10)post_thanks_postbit_info
  • (9)postbit
  • (8)postbit_attachment
  • (10)postbit_onlinestatus
  • (10)postbit_wrapper
  • (1)spacer_close
  • (1)spacer_open
  • (1)tagbit_wrapper 

Phrase Groups Available:
  • global
  • inlinemod
  • postbit
  • posting
  • reputationlevel
  • showthread
Included Files:
  • ./showthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/functions_bigthree.php
  • ./includes/class_postbit.php
  • ./includes/class_bbcode.php
  • ./includes/functions_reputation.php
  • ./includes/functions_post_thanks.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • showthread_start
  • showthread_getinfo
  • forumjump
  • showthread_post_start
  • showthread_query_postids
  • showthread_query
  • bbcode_fetch_tags
  • bbcode_create
  • showthread_postbit_create
  • postbit_factory
  • postbit_display_start
  • post_thanks_function_post_thanks_off_start
  • post_thanks_function_post_thanks_off_end
  • post_thanks_function_fetch_thanks_start
  • post_thanks_function_fetch_thanks_end
  • post_thanks_function_thanked_already_start
  • post_thanks_function_thanked_already_end
  • fetch_musername
  • postbit_imicons
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • postbit_attachment
  • postbit_display_complete
  • post_thanks_function_can_thank_this_post_start
  • pagenav_page
  • pagenav_complete
  • tag_fetchbit_complete
  • forumrules
  • navbits
  • navbits_complete
  • showthread_complete