vb.org Archive

vb.org Archive (https://vborg.vbsupport.ru/index.php)
-   Forum and Server Management (https://vborg.vbsupport.ru/forumdisplay.php?f=232)
-   -   Importing from old archive html files? (https://vborg.vbsupport.ru/showthread.php?t=328177)

Tim Wheatley 08-14-2020 04:51 PM

Importing from old archive html files?
 
I ran a site 20 years ago that was lost and recently acquired the entire archive or everything that is on archive.org - it's a ton of posts, attachments, etc.

Is there any way at all to import from the archive (for example index.php?t-125231.html or whatever) into a real post in a database? Obviously expecting some data loss, non-recovery of users, etc. But is ANYTHING possible?

If not I will likely just upload the html archive and let search engines crawl it, maybe remove the links to the 'real' forum that no longer exists.

Dave 08-14-2020 05:00 PM

If you have the .html files in a relatively organized structure and format then it's possible by creating a custom PHP script that iterates over all the files.

This PHP script should utilize either the DOMDocument PHP class or regular expressions to get the content and to insert it properly into a database.

Tim Wheatley 08-15-2020 01:09 AM

Quote:

Originally Posted by Dave (Post 2603727)
If you have the .html files in a relatively organized structure and format then it's possible by creating a custom PHP script that iterates over all the files.

This PHP script should utilize either the DOMDocument PHP class or regular expressions to get the content and to insert it properly into a database.

Thanks very much for the reply. I'm really interested to hear it may be possible. The backups were downloaded from archivarix_com, so the formatting is quite nice I think. I'll see what I can do in terms of using find/replace and uploading it as an archive at this point...


All times are GMT. The time now is 01:16 AM.

Powered by vBulletin® Version 3.8.12 by vBS
Copyright ©2000 - 2025, vBulletin Solutions Inc.

X vBulletin 3.8.12 by vBS Debug Information
  • Page Generation 0.01151 seconds
  • Memory Usage 1,709KB
  • Queries Executed 10 (?)
More Information
Template Usage:
  • (1)ad_footer_end
  • (1)ad_footer_start
  • (1)ad_header_end
  • (1)ad_header_logo
  • (1)ad_navbar_below
  • (1)bbcode_quote_printable
  • (1)footer
  • (1)gobutton
  • (1)header
  • (1)headinclude
  • (6)option
  • (1)post_thanks_navbar_search
  • (1)printthread
  • (3)printthreadbit
  • (1)spacer_close
  • (1)spacer_open 

Phrase Groups Available:
  • global
  • postbit
  • showthread
Included Files:
  • ./printthread.php
  • ./global.php
  • ./includes/init.php
  • ./includes/class_core.php
  • ./includes/config.php
  • ./includes/functions.php
  • ./includes/class_hook.php
  • ./includes/modsystem_functions.php
  • ./includes/class_bbcode_alt.php
  • ./includes/class_bbcode.php
  • ./includes/functions_bigthree.php 

Hooks Called:
  • init_startup
  • init_startup_session_setup_start
  • init_startup_session_setup_complete
  • cache_permissions
  • fetch_threadinfo_query
  • fetch_threadinfo
  • fetch_foruminfo
  • style_fetch
  • cache_templates
  • global_start
  • parse_templates
  • global_setup_complete
  • printthread_start
  • bbcode_fetch_tags
  • bbcode_create
  • bbcode_parse_start
  • bbcode_parse_complete_precache
  • bbcode_parse_complete
  • printthread_post
  • printthread_complete