Posted on 06/16/2024 4:00:43 PM PDT by Paul R.
A relatively small web forum I am a member of is shutting down soon. There are a lot of posts on it that other members and I would like to archive to a hard drive for reference. Saving individual pages is VERY time consuming, and we only have until the end of the month. Do any of our FReeper computer gurus have any experience with this?
So far, at least, the forum "master" has not raised any objection to saving of the material for personal use, but is also not assisting and may not have any time or ideas anyway.
Do any FReepers have any experience with something like this? Maybe try some other archival program?
Thanks in advance!
Is this a database driven website?
We need more info.
Paul,
I’d be happy to help. I’ve been doing website stuff for over 30 years.
The simplest solution is to just use SiteSucker. Easy to use, but make sure it’s configured right or it could end up saving webpages from other sites connected to your site.
Is it a WordPress based site, or some other kind of forum that consists of files and a database?
Let me know how I can help.
+1 for Sitesucker
I don’t think so. It’s just posts about fishing, fishing gear, etc. Plenty of pics.
It’s my go-to source for info and advice on non-Chinese gear. I avoid Chinese-made gear when possible these days.
This is not my site. But, again, the webmaster has not expressed any objection to saving the pages of the site for personal use, and he HAS been involved in the thread about the shutdown. Multiple members have been saving individual pages with copy/paste, etc. But even for a small forum, that’s not a practical way to save the whole thing or keep any of its structure.
It’s ALMOST as if I heard FR was shutting down and I wanted to archive it, except that FR is MUCH, MUCH bigger. (Probably 10,000x if I had to guess.)
SFAIK it’s not a WordPress type site, but I am not sure. It IS a forum.
If it’s a forum, there’s a 99.9% chance it has a database for storing posts. Most any website scraper/downloader will just grab the HTML that a browser gets from that and save it as HTML pages, plus images.
I’m on Linux/Ubuntu and I’ve used httrack for that, but as someone mentioned about a different tool, you have to be careful what depth of links you grab. You might need 2-3 levels depending on how the site works. If the forum allows embedded YouTube videos, the archive could get big unless you figure out how to filter those out.
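To make the "depth of links" idea concrete, here is a rough Python sketch of a crawler that stops after a set number of link hops and never leaves the starting host, so links to YouTube or other outside sites are skipped. It is not how httrack or SiteSucker work internally. The function names, the one-second delay, and the example URL are all made up for illustration, and it only collects HTML, not images.

```python
import time
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse, urldefrag
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag fed to it."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html, base_url):
    """Return absolute links from html, with any #fragment removed."""
    parser = LinkCollector()
    parser.feed(html)
    return [urldefrag(urljoin(base_url, href)).url for href in parser.links]


def same_site(url, start_url):
    """True only for http(s) URLs on the same host as start_url."""
    a, b = urlparse(url), urlparse(start_url)
    return a.scheme in ("http", "https") and a.netloc == b.netloc


def crawl(start_url, max_depth=2, fetch=None, delay=1.0):
    """Breadth-first crawl, at most max_depth hops from start_url.

    Returns {url: html}. `fetch` is a hypothetical hook so the page
    download can be swapped out; by default it uses urlopen.
    """
    if fetch is None:
        fetch = lambda u: urlopen(u, timeout=30).read().decode("utf-8", "replace")
    pages = {}
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        try:
            html = fetch(url)
        except OSError:
            continue  # skip pages that fail to download
        pages[url] = html
        time.sleep(delay)  # be gentle with the server
        if depth >= max_depth:
            continue  # depth limit reached: keep the page, ignore its links
        for link in extract_links(html, url):
            if link not in seen and same_site(link, start_url):
                seen.add(link)
                queue.append((link, depth + 1))
    return pages
```

Something like crawl("http://forum.example/", max_depth=3) would return a dictionary of page HTML keyed by URL. A real mirroring tool also rewrites links and downloads images, which this sketch leaves out.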
Like everything in tech, the answer is: it depends on pesky things like variables.
Plug the url into builtwith.com and you might get an idea what it’s ‘built with’. At any rate, you’re only going to be able to scrape the html pages that get rendered for the browser.
I guess I’d better add that I’m using Windows 10 or 11...
Go to the URL of your site and add “/wp-admin” to the end. For example, if your site is www.example.com, you would go to www.example.com/wp-admin. If you get a login page, it is a WordPress site. If it is, there are WordPress plugins that do full backups.
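If you would rather script that check, a minimal Python sketch might look like the following. The helper names are made up, and the test for a login page (a redirect to wp-login.php or a form with id="loginform") is a heuristic based on how stock WordPress usually behaves, so treat any result as a hint rather than proof.

```python
from urllib.error import HTTPError, URLError
from urllib.parse import urlsplit, urlunsplit
from urllib.request import urlopen


def wp_admin_url(site_url):
    """Build the /wp-admin URL for a site, adding http:// if no scheme is given."""
    if "://" not in site_url:
        site_url = "http://" + site_url
    parts = urlsplit(site_url)
    return urlunsplit((parts.scheme, parts.netloc, "/wp-admin", "", ""))


def looks_like_wp_login(final_url, body):
    """Heuristic (an assumption): stock WordPress redirects to wp-login.php
    and serves a form with id="loginform"."""
    return "wp-login.php" in final_url or 'id="loginform"' in body


def check_wordpress(site_url):
    """Request /wp-admin and describe what came back."""
    url = wp_admin_url(site_url)
    try:
        with urlopen(url, timeout=30) as resp:
            body = resp.read().decode("utf-8", "replace")
            final_url = resp.geturl()  # URL after any redirects
    except HTTPError as err:
        return f"HTTP {err.code}: no login page shown"
    except URLError as err:
        return f"could not connect: {err.reason}"
    if looks_like_wp_login(final_url, body):
        return "login page found: probably WordPress"
    return "page loaded, but no WordPress login form"
```

Calling check_wordpress("www.example.com") returns a short description of the outcome. An error code only tells you that no login page was shown at that address.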
One quick thing to check is how much if any of the site has already been archived by archive.org’s wayback machine.
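For anyone who wants to check more than a handful of pages, archive.org has a simple availability lookup that returns the closest saved snapshot of a URL as JSON. Here is a small Python sketch of using it. The function names are made up, and the response layout handled here (an "archived_snapshots" object with a "closest" entry) is what that lookup returns as far as I know, so verify against a real response before relying on it.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Wayback Machine availability lookup endpoint.
API = "https://archive.org/wayback/available"


def availability_url(page_url):
    """Build the lookup URL for one page."""
    return API + "?" + urlencode({"url": page_url})


def closest_snapshot(response):
    """Pull (snapshot_url, timestamp) out of a decoded response, or None."""
    snap = response.get("archived_snapshots", {}).get("closest")
    if snap and snap.get("available"):
        return snap["url"], snap["timestamp"]
    return None


def lookup(page_url):
    """Ask archive.org whether it holds a snapshot of page_url."""
    with urlopen(availability_url(page_url), timeout=30) as resp:
        return closest_snapshot(json.load(resp))
```

Running lookup() on a thread's address gives either None or the snapshot's address and its timestamp. Looping over a list of thread URLs would show quickly which ones still need to be saved by hand.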
Ok, I tried that (adding the “/wp-admin”) and got a “forbidden access” error message.
So I guess that confirms it’s not a WordPress site.
I found it, but it seems to be bits and pieces. I am NOT sure I’m using the archive.org site correctly. It seems almost like another planet.
Anyone considered offering to buy the site and content from him?
Bkmk
What is the URL?
A few of the members have been talking about that possibility, but, apparently it isn’t going anywhere. The site is one of ~60 forums owned by the same company and apparently the whole thing is going down shortly.