It's A Digital Disease!


This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
3776
 
 
The original post: /r/datahoarder by /u/BesterFriend on 2025-02-06 15:29:18.

Managing our vast collections can be quite the task. Recently, I explored GetResponse, initially known for email marketing, and discovered it offers substantial file storage with intuitive organizational features. I've started using it to back up my archives, and the ease of categorizing and retrieving files has been impressive. The platform also emphasizes security, ensuring our precious data remains protected. If you're seeking a reliable method to streamline your backup process, this might be a tool to consider.

3777
 
 
The original post: /r/datahoarder by /u/kodark on 2025-02-06 15:25:18.

Long time lurker, first time poster. I usually just like seeing people’s drive setups but I have to jump in for this. Is there an ongoing effort here? How can I help?

3778
 
 
The original post: /r/datahoarder by /u/Castelunan on 2025-02-06 15:17:21.

Basically, I want to build a home server that will do pretty much everything, including serving as a NAS that can be accessed from outside the house. I'm looking at 14TB HDDs right now, but the cost for multiple of them is pretty steep. I'm thinking about forgoing any sort of RAID and instead using a NUC as a second machine dedicated to backups for the NAS. Does that sound reasonable? I'd also like suggestions on where to purchase the drives if possible. I'm going refurb and have already checked out Goharddrive and ServerPartDeals. From what I've seen so far, there don't seem to be many alternatives, if any, with sane prices and 5-year warranties like those two offer.

Looking at a 9700X for the CPU, 32GB of registered ECC RAM, and a 1TB NVMe for boot + cache. I'm reusing a 750W SFF PSU I had lying around, gonna install TrueNAS. Feedback welcome & appreciated, thanks.

3779
 
 
The original post: /r/datahoarder by /u/alphaplay09 on 2025-02-06 14:26:35.

Hello guys, I am a fan of Snap2HTML. It does the job right, it is open source, and it has every feature I need except media file metadata such as the length of music and movies. Is there a free alternative that produces an HTML file and has all the features of Snap2HTML plus media file metadata? I really appreciate your help, and thank you in advance.

3780
 
 
The original post: /r/datahoarder by /u/cheater00 on 2025-02-06 08:27:15.

This seems to be the only reupload of a video that just got taken down and I would very much appreciate it if someone managed to help me download it to my local disk:

https://vk.com/wall-227673890_679

as you can see, it's pretty fun and interesting and I would like to be able to show it to people in the future. The original uploader got told to take it down, probably due to commercial music or materials.

I tried downloading it with yt-dlp, but that didn't find any videos and says "Downloading 0 items":

https://vk.com/wall-227673890_679

Extracting cookies from firefox

Extracted 1508 cookies from firefox

[vk:wallpost] Extracting URL: https://vk.com/wall-227673890_679

[vk:wallpost] -227673890_679: Downloading JSON metadata

[download] Downloading playlist: YOUTUBE ЮТУБ всё лучшее - Wall post -227673890_679

[vk:wallpost] Playlist YOUTUBE ЮТУБ всё лучшее - Wall post -227673890_679: Downloading 0 items

[download] Finished downloading playlist: YOUTUBE ЮТУБ всё лучшее - Wall post -227673890_679

3781
 
 
The original post: /r/datahoarder by /u/ranoa_peasant on 2025-02-06 07:58:52.

I was wondering if anyone has already bought from HMCW-Deals on eBay.

While browsing for some new drives, I found this offer (https://www.ebay.de/itm/156245926181), which looks good as far as I can tell.

240€ for a 20 TB recertified drive with a 2-year warranty sounds a bit too good to be true compared to other prices in Germany / Europe.

I mean, once it's tested thoroughly, I would probably use it in a RAID.

PS: I ordered one and will check out if it is ok :)

3782
 
 
The original post: /r/datahoarder by /u/CantLeaveYet on 2025-02-06 07:27:10.

I've tried everything I could find on this, but the most I got was only parts of my boards. Does anybody know how to do it properly?

3783
 
 
The original post: /r/datahoarder by /u/clementchw on 2025-02-06 05:55:01.

I recently needed to transform some of my Google Drive links into direct download links for a project, so I developed this program in Python. You're welcome to take a look and suggest any improvements. I hope it might help someone else who needs this too!

This tool converts sharing links into the direct download link format for Google Drive.

https://github.com/clementtech/GoogleDriveDirectDownload
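For readers curious what such a conversion involves, here is a minimal sketch. It is not the code from the linked repository. It assumes the two sharing-link shapes commonly seen (".../file/d/<ID>/view" and ".../open?id=<ID>") and the widely used "uc?export=download" direct form; the function names are made up for illustration.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Assumed link shape: https://drive.google.com/file/d/<ID>/view?usp=sharing
_FILE_ID_IN_PATH = re.compile(r"/file/d/([A-Za-z0-9_-]+)")


def extract_file_id(share_url: str) -> Optional[str]:
    """Return the file ID embedded in a sharing link, or None if not found."""
    parsed = urlparse(share_url)
    match = _FILE_ID_IN_PATH.search(parsed.path)
    if match:
        return match.group(1)
    # Assumed alternative shape: https://drive.google.com/open?id=<ID>
    ids = parse_qs(parsed.query).get("id")
    return ids[0] if ids else None


def to_direct_link(share_url: str) -> str:
    """Build a direct-download style URL from a sharing link (hypothetical helper)."""
    file_id = extract_file_id(share_url)
    if file_id is None:
        raise ValueError(f"no Google Drive file ID found in {share_url!r}")
    return f"https://drive.google.com/uc?export=download&id={file_id}"
```

Large files may still land on an intermediate confirmation page rather than downloading straight away, so treat the result as a starting point.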

3784
 
 
The original post: /r/datahoarder by /u/planepoint101 on 2025-02-06 05:15:00.

NOAA (more specifically, the National Centers for Environmental Information) has ~25 years of climate reports (monthly + annual) at https://www.ncei.noaa.gov/access/monitoring/monthly-report/national, and I'd like to download them. I've never done anything like this before.

I tried using HTTrack; I used the GUI, and got a ~700 MB folder (set of files) which was basically a kind of 'empty-looking' version of the above webpage, without any working links to the reports. I also tried the CLI version, with a little help from GPT-4o (at duck.ai), but got results similar to the above:

$ httrack "https://www.ncei.noaa.gov/access/monitoring/monthly-report/" -O "/home/af/web_copies/" -N "*.*" -%P -%e0 -%k -%s

(Actually, the above command was supposed to give me the total size without actually downloading anything...didn't work).

Lastly I tried wget...

$ wget -m https://www.ncei.noaa.gov/access/monitoring/monthly-report/ --convert-links --page-requisites --no-parent

...and did get a folder in which was buried an html file that, when opened, displayed a stripped-down version of the webpage in question, but none of the links work and none of the reports can be summoned up (so none of that stuff saved -- I turned wifi off to test it out).

I'm also wondering how I might figure out the total size of the files before I go ahead and download.

Can anyone at least point me in the right direction? Which tool(s) are best, is there some online resource on how to use it? (This -- https://www.httrack.com/html/fcguide.html -- was very detailed, but too technical for me to get much out of at this point).

Thanks!
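On the question of estimating total size before downloading, one possible approach is to send an HTTP HEAD request per file and add up the Content-Length values. The sketch below assumes you already have a list of report URLs (it does not discover them), and that the server answers HEAD and reports a length, which is not guaranteed for dynamically generated pages. All function names are hypothetical.

```python
import urllib.request
from typing import Iterable, Optional, Tuple


def remote_size(url: str, timeout: float = 30.0) -> Optional[int]:
    """Ask the server for headers only and return Content-Length, if reported."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request, timeout=timeout) as response:
        length = response.headers.get("Content-Length")
    return int(length) if length is not None else None


def summarize(sizes: Iterable[Optional[int]]) -> Tuple[int, int]:
    """Return (total of known sizes in bytes, count of files with unknown size)."""
    total = 0
    unknown = 0
    for size in sizes:
        if size is None:
            unknown += 1
        else:
            total += size
    return total, unknown


def human(num_bytes: float) -> str:
    """Format a byte count using binary units."""
    for unit in ("B", "KiB", "MiB", "GiB"):
        if num_bytes < 1024:
            return f"{num_bytes:.1f} {unit}"
        num_bytes /= 1024
    return f"{num_bytes:.1f} TiB"
```

Usage would look like `total, unknown = summarize(remote_size(u) for u in urls)`, where `urls` is your own list; a non-zero `unknown` means the total is only a lower bound.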

3785
 
 
The original post: /r/datahoarder by /u/NoRecognition8007 on 2025-02-06 04:57:12.

It is running on my server. No clue how long it'll take.

That is all. Keep an eye out.

3786
 
 
The original post: /r/datahoarder by /u/Massive_Writing4511 on 2025-02-06 04:33:38.

Edit: *using

Hi All,

I am trying to mirror all available files from the census, TIGER/Line, and LEHD FTP servers. However, I am unsure if I am doing it correctly, or if the FTP servers are still down. I checked far and wide across the internet for help, however, I could not find anything that helped me. At least the HTTP versions of the servers are running and accessible from browsers. However, it is not possible to download them all easily from the browser.

For instance, I am trying to use FileZilla using ftp://ftp2.census.gov/ and user and password as "anonymous" as per a document I found from census website. However, I get the following result.

Status: Resolving address of ftp2.census.gov

Status: Connecting to [2610:20:2010:a09:1000:0:9481:4b23]:21...

Status: Connection established, waiting for welcome message...

Error: Could not connect to server

Status: Waiting to retry...

Status: Resolving address of ftp2.census.gov

Status: Connecting to [2610:20:2010:a09:1000:0:9481:4b23]:21...

Status: Connection established, waiting for welcome message...

Error: Could not connect to server

I also want to mirror the following but none of them are working. Is it something I am doing wrong or is it down for everyone?

https://lehd.ces.census.gov/data/j2j/

https://lehd.ces.census.gov/data/lodes/

https://lehd.ces.census.gov/data/pseo/

https://lehd.ces.census.gov/data/qwi/

PS. I am new-ish to using FTP. I have used it a couple of times in the past, but none had this issue.
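Since the HTTP versions are reachable, one fallback is to walk the directory listings over HTTP instead of FTP. The sketch below only shows the parsing step: it splits the links on one listing page into files and subdirectories. It assumes the pages are plain auto-generated directory indexes with relative links, which I have not verified for these servers; the class and function names are made up.

```python
from html.parser import HTMLParser
from typing import List, Tuple
from urllib.parse import urljoin


class IndexLinks(HTMLParser):
    """Collect relative links from a directory index page."""

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.files: List[str] = []
        self.subdirs: List[str] = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        # Skip column-sort links, parent/absolute links, anchors and external URLs.
        if not href or href.startswith(("?", "/", "#", "..")) or "://" in href:
            return
        target = urljoin(self.base_url, href)
        if href.endswith("/"):
            self.subdirs.append(target)
        else:
            self.files.append(target)


def split_index(base_url: str, html_text: str) -> Tuple[List[str], List[str]]:
    """Return (file URLs, subdirectory URLs) found on one index page."""
    parser = IndexLinks(base_url)
    parser.feed(html_text)
    return parser.files, parser.subdirs
```

A full mirror would fetch each subdirectory in turn and repeat; wget's recursive mode does the same job if the listings are in this shape.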

3787
 
 
The original post: /r/datahoarder by /u/Sufficient_Level_238 on 2025-02-06 03:37:15.

I have two external hard drives. I love them dearly, they can hold so much of my stuff that I can't get rid of. But I am very simple and know almost nothing about technology so that is something I will have to start teaching myself. For now I just go by what my dumb brain understands which is "external hard drive big? Good, that good storage". They are SanDisk G-Drives. Idk how many bytes, I went with the highest that I could afford, which was somewhere in the middle and seemed like a lot.

I got these guys because I had one gdrive originally but with no silicone shell for protection. That thing lasted me for twelve years. It was beautiful. That is all I need to feel safe to use these things. But I am open to other suggestions of what drives, also am lurking around figuring out myself what's good from other folks here. Drives are all I understand. I don't understand internal harddrives at this time. And I have a mistrust of mainstream storage drives on the internet like icloud, google drive, and dropbox. When I was in college I heard stories about the dropbox being hacked twice. I was relieved that I didn't have anything precious saved in there.

But I do feel very dumb for just storing things on hard drives and nothing else. It's precarious business. My first g drive died when I accidentally knocked it off the table two years ago. It was a devastating loss. I was never able to fix it because the only business available that could do that was out of my budget. So I got two new ones to replace it. But I want more. MORE. I am so paranoid of losing shit and I also have some paranoia about mainstream tech being hacked and everything being lost or stolen that way.

If you have found an online storage drive that you trust, do you have recommendations? I only know of one called Stash which I hear is popular for storing away nsfw stuff. I hope you're safe and well and doing okay and if not I hope it gets better soon.

3788
 
 
The original post: /r/datahoarder by /u/DivyangTandel on 2025-02-06 02:47:05.

Hi buddies, I am very frustrated with how Resilio Sync treats file operations in the mobile app with selective sync and read/write permission.

As you all know, Resilio shows placeholders in selective sync. On a Windows computer these appear system-wide, so any file operation replicates to all synced devices. In the mobile app, however, the placeholders appear only within the Resilio Sync app and not in the phone's built-in file manager or any third-party file manager. So any file operation made outside the Resilio Sync app behaves in the weird ways outlined below.

Device-A has a folder with all the files, which it has shared with Device-B with read and write permission, and which Device-B has added as selective sync.

My use case is as follows. Device-A has huge storage, so I want all files there. Any file I add on Device-B will copy to Device-A, as it is in full sync mode. I want to organise files and folders on Device-B and have the changes sync to Device-A. I will do this organizing with a third-party file manager or through the gallery app, as that is most convenient and user-friendly. Since Device-B has read/write permission, all changes should sync back to Device-A.

But the following happens, which drives me crazy. Device-B to Device-A:

File/folder added -> copies to Device-A (expected)

File/folder Rename -> rename at Device-A (expected)

File/folder copy within main shared folder -> replicate at Device-A (expected)

File/folder delete -> no delete at Device-A (not expected, but it is what I want: once organised, I will delete all files from Device-B to free space while retaining them at Device-A for future download)

File/folder MOVE -> the original file remains at Device-A, and the moved file at Device-B is treated as a new file and copied from there to Device-A (creating a double file) - NOT EXPECTED

As you can see, any move operation at Device-B creates a duplicate file at Device-A.

TL;DR: Please suggest the best solution where I can back up all my data from Device-B to Device-A automatically within the same network, without needing any intervention, and have every file operation except delete carried over to Device-A.

I need to continuously add new files and keep organizing them on Device-B.

Please do not suggest Syncthing, as it does not support browsing files on Device-A and downloading only the needed ones. I can use FTP, WebDAV, and the FolderSync app to replicate similar behaviour, but then the folders are not bound to each other, so I cannot know whether a file is already available on Device-B before syncing or downloading it from Device-A.

Resilio is the best match, but the move behaviour in selective sync makes it useless for my use case.
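This does not fix the move behaviour itself, but the duplicates it leaves on Device-A can at least be found after the fact by comparing file contents. A minimal sketch, assuming Device-A can run Python over the shared folder; the function names are hypothetical and nothing is deleted automatically.

```python
import hashlib
from collections import defaultdict
from pathlib import Path
from typing import List


def file_digest(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large files do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def find_duplicates(root) -> List[List[Path]]:
    """Group files under root that have identical content."""
    by_digest = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            by_digest[file_digest(path)].append(path)
    # Only groups with more than one path are duplicates.
    return [paths for paths in by_digest.values() if len(paths) > 1]
```

Each returned group lists the original and the moved copy side by side, so you can decide by hand which path to keep.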

3789
 
 
The original post: /r/datahoarder by /u/ClayGirl84 on 2025-02-05 22:21:55.

Hi Everyone,

I just lost my Mother unexpectedly back on January 26th, 2025. She and I shared the family house together as we took care of my Father through a Traumatic Brain Injury, Brain Surgery, Dementia, and eventually a final diagnosis of Parkinson’s. I took care of him at home all the way through hospice. Most likely the Parkinson’s caused the fall that caused the Traumatic Brain Injury. So I have spent my entire life in my childhood home with my parents, who happened to be slightly older than the average couple when my older brothers and I came into being. My father had been out of the workforce for 8 years before he passed and honestly he never had much of a digital, online presence. Not that he was a troglodyte; in fact, date night in the earlier days of their marriage, and when we were tots, was building and upgrading our first family computer. Both my parents, even though they were slightly older than my friends’ parents, adored technology. My Dad got in on the ground floor with the University in one area, and my Mother, unknowingly at the time, was at the same university and helped with the digitization of the entire university library and with the interlibrary network too, which went from university to university.

So when they married, my Dad knew that when my Mom finally finished her second Master’s, she would absolutely go on to her PhD, and that she was going to need a machine that could handle her chosen field of geology and environmental studies/engineering. So they built the family computer to handle the advanced computations of the time, and to speak with the university mainframe to run at night, when people were allowed access to large data-crunching machines that would run the programs they simply had to invent (and were able to scientifically prove later on). This was all the 1980s and 1990s.

All of this being said, my father didn’t have a digital presence to remember him by. And that saddened me, as we had few pictures of him too. He and Mom mostly took the pictures, but that tapered off as we grew up. I know my Mom was on Facebook and possibly other social networking sites. What I wasn’t aware of, until a colleague of hers showed me yesterday, is that on YouTube there are quite a few interviews with her, acceptance speeches for scientific/environmental awards, lectures, community outreach meetings and more.

I’m not asking anyone here to do the work, but I deeply want to archive all of this data before these videos are lost. And I didn’t know she had used social networking systems for work/research and sweetly used Facebook as a public online journal. I have found stories from her about her life before us kids, I have found family stories and extended family stories, and even her recipes. I desperately want to capture all of that information. I know it will be a great deal of indexing and tagging entries, but I don’t want to lose even more of her, than my sometimes sieve like mind does to me.

Can someone point me in the right direction of a spreadsheet or flowchart that will walk me through how to save all of this information?

Thank you for your time. She was amazing and I was so thankful to find out that I’ll still get to see her and hear her voice. She had a brilliant mind (they both did). This just hurts so much. I lost my Mom and my Friend.

Thank You, ClayGirl84

TL;DR - I want to capture my Mom’s digital life before it is gone, what pre set up worksheets/workflow charts/spreadsheets are there that I can follow? I read through all the subreddit rules and fyi, and googled my question. Best lead was you all. Also thank you for what you do; especially recently it is deeply appreciated.

3790
 
 
The original post: /r/datahoarder by /u/Smooth_Whole_7250 on 2025-02-05 21:12:40.

I am not super tech savvy, looking for a scanner for documents and receipts.

Would like:

Duplex copying

Auto doc feeder

Receipt friendly

Removes blank pages

Easy and dependable

Scans to PDF

Easy to use software, or can use 3rd party easy to use software

Produces searchable docs

Easy to copy from scanner, do not have to do something on PC to get a good scan

Any guidance you can offer, or other attributes you recommend I look for, is much appreciated.

3791
 
 
The original post: /r/datahoarder by /u/Efficient-Active-720 on 2025-02-05 20:12:47.

My used storage capacity increased a lot recently and I'm planning to get my setup - especially my backup setup - in order. As backup basically means copying data from one drive to others, which involves a lot of disks and targets, I'm asking myself if anyone can suggest a good way (or tool?) to document the backup plan: what is stored where, and what is/should be transferred when. I can't believe you all just keep this in your heads. ;-)
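One low-tech option is to keep the plan as structured data in a small script or file under version control, so it can be printed as a table and sanity-checked. The sketch below is only an illustration: the field names, dataset names, and disk labels are all made up, and a spreadsheet with the same columns would serve equally well.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class BackupJob:
    dataset: str   # what is being backed up
    source: str    # where the primary copy lives
    target: str    # where the copy goes
    schedule: str  # when the transfer should happen


def copies_per_dataset(jobs: List[BackupJob]) -> Dict[str, int]:
    """Count how many backup targets each dataset has."""
    counts: Dict[str, int] = {}
    for job in jobs:
        counts[job.dataset] = counts.get(job.dataset, 0) + 1
    return counts


def to_table(jobs: List[BackupJob]) -> str:
    """Render the plan as a fixed-width text table."""
    header = f"{'dataset':<12}{'source':<12}{'target':<16}{'schedule'}"
    rows = [f"{j.dataset:<12}{j.source:<12}{j.target:<16}{j.schedule}" for j in jobs]
    return "\n".join([header] + rows)


# Hypothetical example plan; replace with your own disks and datasets.
PLAN = [
    BackupJob("photos", "nas-disk1", "usb-disk-a", "weekly"),
    BackupJob("photos", "nas-disk1", "offsite-disk", "monthly"),
    BackupJob("documents", "nas-disk1", "usb-disk-a", "daily"),
]
```

`copies_per_dataset(PLAN)` makes it easy to spot a dataset that has only one backup target.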

3792
 
 
The original post: /r/datahoarder by /u/Efficient-Active-720 on 2025-02-05 20:02:18.

I often read about backup to Blu-ray disc. This seems to me to be the most expensive, slowest and most complex option. What is the advantage of burning Blu-ray discs? What kind of data do you use Blu-ray for?

3793
 
 
The original post: /r/datahoarder by /u/gilletvertigo on 2025-02-06 08:38:27.

Has anyone else noticed that the amount of archive.ph captures is reducing day by day? For example, about a month ago the amount of https://imgur.com//* captures was 1988. Yesterday it was only 1930 and today 1924. I noticed that the oldest captures are disappearing. The admin hasn't posted to the blog for over a year and now it seems that the storage is full and the oldest captures are being overwritten. Is the site abandoned?

3794
 
 
The original post: /r/datahoarder by /u/MathResponsibly on 2025-02-06 06:44:23.

Just posting this here:

https://web.archive.org/web/20250206004334/https://www.youtube.com/watch?v=Cse3pUxvecY

I saw the video yesterday when it was first released, and now it was "removed from the public domain" for some reason. I managed to snag a 480p version of it from YouTube before it was changed to private, and the Internet Archive also only has a 480p version. Did anyone manage to snag the 1080p version??

3795
 
 
The original post: /r/datahoarder by /u/pansapiens on 2025-02-06 06:43:14.

3796
 
 
The original post: /r/datahoarder by /u/AgentArks on 2025-02-06 06:12:36.

With the recent subreddit bans and Reddit cracking down on communities they claim are “unmoderated” (even when they are), what’s the best way to archive subreddits before they disappear?

Also, for subreddits that are already banned, is there any way to archive their content retroactively? Or are there existing archives where I can find this information?

Would love to hear what tools and methods you all recommend.

3797
 
 
The original post: /r/datahoarder by /u/yuhyuhAYE on 2025-02-06 04:27:29.

As the title says- the FTP server is back up. I know there were ongoing efforts to archive everything that were halted by the server being taken down.

3798
 
 
The original post: /r/datahoarder by /u/unengaged_crayon on 2025-02-06 03:12:32.

Hello all,

I'm hoping to get a text download of the top (n) posts of ar slash transgender_surgeries - it's a huge resource for trans people, and the fact that it disappeared was a scare for a lot of them. There are a lot of photos, but I'm not super interested in them (maybe once I get the text, but one thing at a time). I understand text is extremely efficient for file size, so that's what I'm looking at first (I only have a little more than 1 TB to play with). I've already tried my hand at URS, but it's sort of being a pain with Python versions, Poetry, etc.

any help is appreciated :D
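If the heavier tools are a pain to install, the text part can be handled with the standard library once you have a listing in hand. The sketch below assumes you have already saved a listing as JSON in the shape Reddit's listing endpoints return (posts under data.children, each with title, selftext and score); fetching it, and any authentication or rate limiting, is left out. The function name is made up.

```python
from typing import Any, Dict


def listing_to_text(listing: Dict[str, Any], limit: int) -> str:
    """Turn a saved listing into plain text, highest-scored posts first."""
    posts = [child["data"] for child in listing["data"]["children"]]
    posts.sort(key=lambda post: post.get("score", 0), reverse=True)
    chunks = []
    for post in posts[:limit]:
        body = post.get("selftext", "").strip()
        chunks.append(f"{post['title']}\n(score {post.get('score', 0)})\n\n{body}\n")
    # Separate posts with a rule so the file stays readable.
    return "\n---\n".join(chunks)
```

Load the saved file with `json.load`, pass the result in, and write the returned string to a `.txt` file. Even thousands of posts should come to a few megabytes at most.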

3799
 
 
The original post: /r/datahoarder by /u/CowMaterial6539 on 2025-02-06 00:37:37.

Original Title: PSA: The Canadian Data Center is a secure and sustainable back-up for the entire Internet Archive. It’s a full, second live copy preserved outside the US. It even looks like a small version of the building they have in San Francisco, which their logo is based on!

3800
 
 
The original post: /r/datahoarder by /u/Logical_Marionberry4 on 2025-02-06 01:56:55.

Looks like the Environmental Justice Mapping tool EJScreen was taken down. I have the gdb saved if someone is able to load them.
