It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
4676
 
 
The original post: /r/datahoarder by /u/MathematicianLess793 on 2025-01-14 08:23:50.

Love "myfaveTT" for downloading Tiktok videos but thats just it, its only for videos. What about my favorite audios?

4677
 
 
The original post: /r/datahoarder by /u/Being_Parzival on 2025-01-14 07:42:10.
4678
 
 
The original post: /r/datahoarder by /u/Kai_ on 2025-01-14 05:43:40.

Hi all. Like many I've self-taught over many years on /r/buildapc and am fairly confident with consumer hardware. Now that I'm approaching building a NAS though I'm realising that there is a lot of knowledge in the enterprise domain that starts to become necessary.

Are there any recommended resources for approaching a rack-mounted NAS/SAN build?

Some questions so far:

  • I assume ECC memory is recommended?
  • Are there motherboards better suited to this application rather than typical gaming mobos? If the intent is to have a number of PCI.e or SAS/SATA SSDs, are there mobos that just have 10-20 of these?
  • Never explored SAS, is this recommended over SATA for the disk's, or just used as a breakout intermediary? I understand a SAS controller can be split into multiple SATA ports
  • Are the form factors of rack mounted machines still the standard ATX / mATX / ITX etc? Or would we be looking at something else like a blade/backplane board with a totally different arrangement.
  • Are we usually looking at the same CPU sockets and chipsets? I've seen references to dedicated disk controllers but assume that that wouldn't be applicable if the goal is to present all disks to the OS kernel directly?
4679
 
 
The original post: /r/datahoarder by /u/BlueeWaater on 2025-01-14 05:16:49.

Looking to download the transcripts of some videos, what can I use?

4680
 
 
The original post: /r/datahoarder by /u/Beavisguy on 2025-01-14 04:29:49.

Are old Imgur gifs that good deleted like 1 1/2 years ago archived on Wayback Machine or Archive.org?? If the are archived how do I search for them??

4681
 
 
The original post: /r/datahoarder by /u/Leading-Geologist-39 on 2025-01-13 18:37:02.

Found an older cheap consumer Z370 mainboard with a TB header from times where they handed those out on the Intel platforms like candy. Added a cheap same brand TB3 PCIe card (Asus ThunderboltEX 3) with USB-C. It was an old pre-2020 build with an 8700 with a cooler and RAM still on it. Booted linux off a USB drive, installed Samba and plugged in my laptop that has a TB3 USB-C port. Gave the TB3 network link some private IP address on each side and tested performance over that Thunderbolt 10GbE link.

The maximum I could get out of this old Z370 platform with few PCIe lanes was 5 NVMe M.2 drives for a RAIDz1 config. Used 5x2TB Crucial T500 and reading and writing large files from the laptop's very fast NVMe maxes out the link continuously at 1.25GB/s.

Even if you just add a couple old HDDs with a RAIDz1/2 transferring large files will still be around 500-800MB/s easily. The SSDs are definitely a bit wasted due to the 10GbE limitation of Thunderbolt 3.

(I set ashift to 14 to match the 16KB SSD page size spec, scrubs only at 2.5GB/s. I am certain it's entirely limited by the overall PCIe lane constraints, these are old PCIe 3.0 slots where the 2 onboard M.2 SSDs share 4 PCIe 3.0 lanes which is just sad for a modern PCIe 4.0 SSD...)

Plugged in a Macbook with USB-C for fun and ran a Timemachine backup over SMB (with fruit plugin) just to see how stable it is for a variety of workloads. Came back about 20 minutes later to see the 1TB backup already completed. No hiccups, no disconnects, and even if you accidentally unplug it the file system remains unaffected. At most you gotta restart any transfers already in progress.

An actual DAS that doesn't run its own OS with ZFS and just exposes the drives over USB3 or something can certainly run these SSDs at higher speeds than 10GbE but I don't wanna deal with "safely ejecting" and whatnot, and I already had the mainboard sitting in storage ever since I switched to AM4.

The best part is that I can ZFS send/receive between my main storage server (good for backups) and as that runs with snapshots over the regular NIC on the mainboard it's entirely independent of whether I have the Thunderbolt cable plugged into any computer.

Does it have ECC RAM? No. Is it power efficient? With 30W 24/7 not particularly. But my main storage server is sitting in its rack far away and all I get is 1Gbps wired or at best around the same over Wifi to a laptop. Browsing my 4k prores video files where each one is 100GB+ in size that connection is just no longer sufficient in 2025. Having a couple TB of fast temporary space available at my workspace is sweet.

If I had instead added some SSDs to my desktop computer I wouldn't have had a fun project setting up another server with ZFS. This was at first an experiment to see how stable a TB3 direct link really is and now it's a part of my workflow as I found it to be more reliable than any actual DAS I had sitting on my desk in the past.

With a very expensive SFX build and M.2 SSDs you could definitely get the footprint on your desk down to actual DAS size, but I have a 6 foot TB3 cable so I could put the cheap bulky computer case in a corner under the desk where I don't have to look at it.

4682
 
 
The original post: /r/datahoarder by /u/Objective-Inside-219 on 2025-01-13 18:34:34.

There was a lot of controversy over Verbatim's new non carbon mdisc formula, claiming reduced life. What happened with thar?

I understand Ritek did glass platter mdiscs? Are they still around?

How do I tell which mdisc is which when buying, and avoid fake mdiscs?

4683
 
 
The original post: /r/datahoarder by /u/Luvenis on 2025-01-13 16:49:19.

Right now I have a tiny desktop pc with a 8tb hdd that I use for plex.

I can't really find anything useful for them as there's no external cases that can house them all in a practical way.

My idea is to either sell them or see if someone wants to trade for an 8tb ssd.

4684
 
 
The original post: /r/datahoarder by /u/WispofSnow on 2025-01-13 16:45:10.

Intro

Good day everyone! I found a way to bulk download TikTok videos for the impending ban in the United States. This is going to be a guide for those who want to archive either their own videos, or anyone who wants copies of the actual video files. This guide is for a Windows base device.

This guide is going to use 3 components:

  1. Your exported Tiktok data to get your video links
  2. YT-DLP to download the actual videos
  3. Notepad++ to edit your text files from your tiktok data

Prep and Installing Programs

Request your Tiktok data. They make take a few hours to compile it, but once available, download it.

Press the Windows key and type "Powershell" into the search bar. Open powershell. Copy and paste the below into it and press enter:

Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser

Now enter the below and press enter:

Invoke-RestMethod -Uri https://get.scoop.sh/ | Invoke-Expression

Press the Windows key and type CMD into the search bar. Open CMD(commad prompt) on your computer. Copy and paste the below into it and press enter:

scoop install yt-dlp

You will see the program begin to install. This may take some time. While that is installing, we're going to download and installNotepad++. Just download the most recent release and double click the downloaded .exe file to install. Follow the steps on screen and the program will install itself.

Downloading Videos

Link Extraction

Once you have your tiktok data, unzip the file and you will see all of your data. You're going to want to look in the Activity folder. There you will see .txt (text) files. For this guide we're going to download the "Favorite Videos" but this will work for any file as they're formatted the same.

Open Notepad++. On the top left, click "file" then "open" from the drop down menu. Find your tiktok folder, then the file you're wanting to download vidoes from.

We have to isolate the links, so we're going to remove anything not related to the links.

Press the Windows key and type "notepad", open Notepad. Not Notepad++ which is already open, plain normal notepad. (You can use Notepad++ for this, but to keep everything separated for those who don't use a computer often, we're going to use a separate program to keep everything clear.)

Paste what is below into Notepad.

https?://[^\s]+

Go back to Notepad++ and click "CTRL+F", a new menu will pop up. From the tabs at the top, select "Mark", then paste https?://[^\s]+ into the "find" box. At the bottom of the window you will see a "search mode" section. Click the bubble next to "regular expression", then select the "mark text" button. This will select all your links. Click the "copy marked text" button then the "close" button to close your window.

Go back to the "file" menu on the top left, then hit "new" to create a new document. Paste your links in the new document. Click "file" then "save as" and place the document in an easily accessible location. I named my document "download" for this guide. If you named it something else, use that name instead of "download".

Downloading Videos using .txt file

Go to your file manager and decide where you want your videos to be saved. I went to my "videos" file and made a folder called "TikTok" for this guide. You can place your items anywhere, but if you're not use to using a PC, I would recommend following the guide exactly.

Right click your folder (for us its "Tiktok") and select "copy as path" from the popup menu.

Paste this into your notepad, in the same window that we've been using. You should see something similar to:

"C:\Users[Your Computer Name]\Videos\TikTok"

Find your TikTok download.txt file we made in the last step, and copy and paste the path for that as well. It should look similar to:

"C:\Users[Your Computer Name]\Downloads\download.txt"

Copy and paste this into the same .txt file:

yt-dlp

We're now going to make a command prompt using all of the information in our Notepad.

yt-dlp -P "C:\Users[Your Computer Name]\Videos\TikTok" -a "C:\Users[Your Computer Name]\Downloads\download.txt"

yt-dlp tells the computer what program we're going to be using. -P tells the program where to download the files to. -a tells the program where to pull the links from.

Now paste your newly made command into Command Prompt and hit enter! All videos linked in the text file will download.

Done!

Congrats! The program should now be downloading all of the videos. Reminder that sometimes videos will fail, but this is much easier than going through and downloading them one by one.

If you run into any errors, a quick Google search should help, or comment here and I will try to help.

4685
 
 
The original post: /r/datahoarder by /u/maplesyrup987 on 2025-01-14 03:56:19.

Considering digitizing 100s of 4x6 photos originally taken on 35mm film. Understand there are purpose built scanners such as the Epson FF-680.

My 4x6 photos have the typical orange timestamp in the corner. Is there software available that can read/OCR and automatically insert the date into the scanned photo file detail? Similar to EXIF data for digital photos. Essentially want to scan 4x6 photos (along with any notes on back side) and add to Adobe Lightroom catalog by date automatically.

I saw this post (https://www.reddit.com/r/computervision/s/gOsmBGhQ2c) but was wondering if anyone had a solution for the full workflow. Also do not want to upload my personal photos to some website.

Thanks.

4686
 
 
The original post: /r/datahoarder by /u/Myfirstreddit124 on 2025-01-14 01:50:18.

When I move a file by drag-and-drop in File Explorer on Windows 11, the destination file has a Modified time attribute that is 1-2 seconds later than the source file. Created time is preserved.

When I copy with robocopy copy:dato in Command Prompt, the destination file also has a Modified time that is 1-2 seconds later than the source file. Created is preserved.

When I copy by drag-and-drop in File Explorer, Modified is preserved but Created is updated to current.

When I move by drag-and-drop on MacOS 15, both Modified and Created times are preserved.

When I copy on Mac, Modified is preserved and Created is updated to current.

Source and destination are separate external ExFAT drives.

How can I move/copy files on Windows/Mac while preserving the exact Modified/Created times?

4687
 
 
The original post: /r/datahoarder by /u/Myfirstreddit124 on 2025-01-14 01:50:17.

When I move a file by drag-and-drop in File Explorer on Windows 11, the destination file has a Modified time attribute that is 1-2 seconds later than the source file. Created time is preserved.

When I copy with robocopy copy:dato in Command Prompt, the destination file also has a Modified time that is 1-2 seconds later than the source file. Created is preserved.

When I copy by drag-and-drop in File Explorer, Modified is preserved but Created is updated to current.

When I move by drag-and-drop on MacOS 15, both Modified and Created times are preserved.

When I copy on Mac, Modified is preserved and Created is updated to current.

Source and destination are separate external ExFAT drives.

How can I move/copy files on Windows/Mac while preserving the exact Modified/Created times?

4688
 
 
The original post: /r/datahoarder by /u/luxfc on 2025-01-14 01:28:30.

What software do you guys recommend for splitting large files/arquives into smaller parts on Mac? Used to use 7-Zip or Winrar on Windows but am looking for what else is out there and recommended for a Mac based workflow.

4689
 
 
The original post: /r/datahoarder by /u/MasterSatyr on 2025-01-14 00:52:58.

Hi all,

I'm currently creating a dedicated Plex server. I've always run this from my Gaming PC but electricity cost and longevity of this PC have made me decide to run it off of a NUC. I recently purchased a GMK Tec NUC from Amazon (link to listing here) which says it comes with 4 USB 3.2 ports. I am housing my hard drives in a Mediasonic Hard Drive Enclosure that is equipped with a USB 3.2 port. I noticed that my transfer speeds were around 40 MB/s when I'm copying files from the hard drive enclosure to the M.2 drive in my NUC. I know there are a lot of variables, but I feel like it should be faster. I then looked at the Device Manager for my NUC and it lists a USB Root Hub (USB 3.0) and an Intel USB 3.10 eXtensible Host Controller - 1.20.

How do I verify that my included ports are USB 3.2? Is there anything I'm missing in my setup that could increase transfer speeds?

https://preview.redd.it/oedg179dxuce1.jpg?width=844&format=pjpg&auto=webp&s=00babf092d163935f4cbcfa2b8b4a366363012d2

https://preview.redd.it/eaonwbydxuce1.jpg?width=448&format=pjpg&auto=webp&s=cac3dfcd07b232c5bb26d1ef5c4afae5134a6f19

4690
 
 
The original post: /r/datahoarder by /u/Agreeable_Repeat_568 on 2025-01-14 00:24:12.

I don't know why it took this long to find this but I had always thought HBAs were super power hungry so I had avoided them and tried things like the ASM1166 cards for extra sata ports.

I came across a posting that had the data sheet for the 9300 and listed the power consumption, I had been looking for this data so I searched through the the 9200 -9600 series data sheets and the 9500 seems awesome for power efficiency compared to other cards. I don't really see it mentioned very often so I am wondering why? Any issues with it? I have read it can be a bit of a pain to flash?

LSI SAS 9200-8e, dual port, host bus adapter

9300 8 and 4-port, 12Gb/s SAS host bus adapter family

SAS 9311 8 and 4-port, 12Gb/s SAS host bus adapter family

9400 Series Tri-Mode Storage HBAs

9500 Series PCIe Gen 4.0 Tri-Mode Storage HBAs

9600 Series 24G PCIe 4.0 Tri-Mode RAID Adapters and eHBAs

FYI I did cross post this.

4691
 
 
The original post: /r/datahoarder by /u/Enigma343 on 2025-01-13 23:33:50.

A month or two ago, Twitter removed the button to Delete All Bookmarks.

Perhaps the way I harvest tweets could use some refinement, so to summarize my current approach: I use WFDownloader to download off my bookmarks page, and then I move those files elsewhere for further categorization. As a result, I delete my bookmarks between data pulls, as it will otherwise download the same files again.

I'd like to either delete all my bookmarks with the click of a button, or, alternatively, find an approach that doesn't require me to delete my Twitter bookmarks one by one.

Thanks in advance!

4692
 
 
The original post: /r/datahoarder by /u/d33roq on 2025-01-13 23:25:55.

So, I have a bunch of dvd and bluray cartoon collection discs (Looney Tunes Golden Collection, etc) which I've been ripping so that they're all available through my server. The problem is that a disc might contain 30 cartoons all of which I have to title one-by-one - a massive pain in the ass when there are 40-50 discs of these. I've been using makemkv then splitting and naming the chapters with mkvtoolnix but wondered if there was an easier way. CD ripping software uses CDDB to automatically name each song, is there a dvd database equivalent?

4693
 
 
The original post: /r/datahoarder by /u/ZVH1 on 2025-01-13 23:12:11.
4694
 
 
The original post: /r/datahoarder by /u/Professional-Swim745 on 2025-01-13 22:29:12.

Hi all. Trying to backup my tik tok account using the myfavett chrome extension. I’m running into some issues though where the local mp4 file section and the downloading file section will stall. Any idea why this is happening? If it has happened to you any recommendations to fix it? Thanks

4695
 
 
The original post: /r/datahoarder by /u/Timur18769 on 2025-01-13 22:08:32.

I’m looking for advice on how to secure my SSD against unauthorized access, including by authorities, and methods to ensure that data is completely and irretrievably deleted, even with recovery tools. Additionally, I’d like to know how I can remotely reset my PC to erase everything securely if I lose physical access to it. Any recommendations or best practices?

4696
 
 
The original post: /r/datahoarder by /u/AlexNavajero on 2025-01-13 21:59:45.

I have a Toshiba DT01ACA100. Works fine except one thing. While connected to a USB 3.0 dock, the read speed is 46 MB/s, while connected to SATA the read speed is 14 MB/s - tested on two different motherboards. Write speed on both is ~180 MB/s. Surface tests are good, smart is good except 3 UDMA errors.

What could be the cause of such low read speed?

4697
 
 
The original post: /r/datahoarder by /u/Joeygrtgamer on 2025-01-13 21:57:05.

I was just wondering, what are some good affordable DVD and Blu-ray rippers and burners.

4698
 
 
The original post: /r/datahoarder by /u/mattbrow89 on 2025-01-13 21:52:52.
4699
 
 
The original post: /r/datahoarder by /u/Happybeaver2024 on 2025-01-13 21:33:36.

I recently upgraded to a homebuilt TrueNAS machine, and now I have my old Synology ds1813+ that I would like to sell. Does anyone have an idea of what a fair price for this unit is? In Canada if possible. Not looking to make a great deal of money, just a fair price.

Is $200 CAD a fair price ($150 USD)?

4700
 
 
The original post: /r/datahoarder by /u/AggressiveReview6694 on 2025-01-13 21:06:39.

Hi everyone,

I’m considering getting the Icy Box IB-RD3802-C31 enclosure as storage for my home server. The server would be running Immich 24/7, so reliability and performance are quite important.

Has anyone here used this enclosure before? Would you recommend it for this kind of setup? Any insights or alternatives would be greatly appreciated!

Thanks in advance!

view more: ‹ prev next ›