The original post: /r/datahoarder by /u/itsnotnotbacon on 2025-01-03 01:06:21.
So the forum at https://forum.bradleysmoker.com/index.php is being shut down by the company. They’ve created a FB group and want all discussion to happen there, I’m guessing to cut costs and maintenance. The forum has been active since 2006, contains a wealth of information like recipes and techniques, and has an awesome community. When it was announced recently that they’d be shutting down the forum in January, a lot of people were upset, and even more so when Bradley said they weren’t planning on archiving anything.
My plan is to archive certain sections of the forum (there are a lot of sections I don’t want), store the files in the cloud, and share the address in the forum before it’s shut down. I’ve tried SiteSucker, Wget and HTTrack, all with some success, but none did what I want. I did manage to get the complete Fish section with HTTrack using a command I asked ChatGPT to help me write, but when I took the same command and changed it to download the Sausage Making section, it crapped out after page 5.
Before the site gets taken offline, does anyone have a recommendation or advice for how I can successfully save only certain portions of this site? THANK YOU!
**EDIT to clarify what I’m looking for**
There are certain sections of the site (e.g. fish, sausage making, jerky, meat, etc.) that I would like to preserve. I’m trying to get each entire section, including all threads, posts, and replies. With SiteSucker I specified the path to the section I wanted (the fish section, for example) and the paths I didn’t want (everything else), and selected Check All Links. When downloading, it was saving everything, not just the fish section.
A few days ago I’d never used Wget or HTTrack, so I’ve been explaining to ChatGPT what I want and getting it to write the terminal commands for me. I think this is the last command I tried with HTTrack, but it was also grabbing everything, not just what I specified.
httrack "https://forum.bradleysmoker.com/index.php?board=24.0" -O ~/Documents/Sausage\ Making --depth=123 --ext-depth=123 --mirror --stay-on-same-domain --accept "index.php?board=24*"
So I’m still trying to figure out how to save just parts of the forum, not the entire thing. If you have any suggestions, I’m all ears! Thank you
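
For anyone following along, here is a rough, untested sketch of one way the scoping could be done with Wget instead of HTTrack. A filter on `board=24` alone can never match the threads themselves, because on this kind of forum thread links usually look like `index.php?topic=...` and don't carry the board number. The sketch therefore allows both shapes. The board number 24 and the output folder come from the command above. Everything else is an assumption: the URL patterns are guesses, and `in_scope` and `mirror_board` are made-up helper names.

```shell
# Sketch only: keep a crawl inside one board of the forum.
BOARD=24
BASE="https://forum.bradleysmoker.com/index.php"

# Assumed URL shapes (not verified against the live site):
#   board listing pages: index.php?board=24.0, index.php?board=24.20, ...
#   thread pages:        index.php?topic=1234.0, index.php?topic=1234.15, ...
# The trailing $ rejects variants with extra parameters appended.
SCOPE_REGEX="index\.php\?(board=${BOARD}\.[0-9]+|topic=[0-9]+\.[0-9]+)$"

# Dry-run helper: succeeds if a URL would be kept by the regex.
in_scope() {
  printf '%s\n' "$1" | grep -Eq "$SCOPE_REGEX"
}

# Not run automatically; call mirror_board by hand to start the download.
mirror_board() {
  wget --recursive --level=inf \
       --accept-regex "$SCOPE_REGEX" \
       --adjust-extension --convert-links \
       --wait=1 --random-wait \
       --directory-prefix="$HOME/Documents/Sausage Making" \
       "${BASE}?board=${BOARD}.0"
}
```

Checking a few URLs with `in_scope` before running `mirror_board` is a cheap way to see whether the pattern fits what the forum actually links to. Two caveats: a thread in this board that links to a thread in another board would pull that one in too, and as far as I know the regex also filters out images and stylesheets, so this would likely save the text of the pages rather than a pixel-perfect copy.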