Quote:
Originally Posted by sabret00the
wouldn't it just be a matter of looking for certain strings?
Nope. I know for a fact that the HTML output from LiveJournal (for example) is incredibly awful and has NO useful identifiers whatsoever that you could use in a regexp to gather the desired data (i.e. the journal content itself).
I've not yet encountered an online journal site whose HTML markup is clean enough that we could fetch the data automatically. If you can show me one that does it consistently, I could write an importer for that specific site, but I don't know if it's worth the hassle...
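To illustrate what I mean, here is a rough Python sketch of the "look for certain strings" approach. The class name "entry-content" and both sample snippets are made up for the example; I'm not claiming any real journal site emits them. The point is only that the regexp needs something consistent to hook onto, and without it you get nothing back.

```python
import re

# Hypothetical identifier: "entry-content" is an assumption for illustration,
# not markup that any particular journal site is known to produce.
ENTRY_RE = re.compile(
    r'<div class="entry-content">(.*?)</div>',
    re.DOTALL | re.IGNORECASE,
)


def extract_entries(html):
    """Return the contents of every block carrying the hypothetical identifier."""
    return [match.strip() for match in ENTRY_RE.findall(html)]


# Made-up page with a consistent identifier around the journal text.
well_marked = '<div class="entry-content">Went hiking today.</div>'

# Made-up page with layout-only markup and nothing to anchor a pattern on.
unmarked = '<table><tr><td><font size="2">Went hiking today.</font></td></tr></table>'

print(extract_entries(well_marked))
print(extract_entries(unmarked))
```

The first call finds the entry; the second returns an empty list, because the same text is there but nothing in the markup says which cell is the journal content. Even in the good case a regexp like this would break on nested divs, so a real importer would probably want a proper HTML parser.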