this post was submitted on 03 Aug 2025
1 points (100.0% liked)

It's A Digital Disease!

23 readers
1 users here now

This is a sub that aims at bringing data hoarders together to share their passion with like minded people.

founded 2 years ago
MODERATORS
 
The original post: /r/datahoarder by /u/SnooDogs8806 on 2025-08-03 14:25:03+00:00.

Hi hoarders, I need help scraping the whole website/domain at https://www.tpcvietnam.com/ with wget

I'm working on a dataset about the specifications of these powertools, so I need the text from all their product pages. Been reading the cheatsheet at https://scrapingant.com/blog/wget-cheatsheet but all the tech jargon is not helping at all.

Any help/hint is much appreciated. I'm in a rush for the commands, but would like to learn how to do this again when they update their product catalogue.

Example needed information:

https://www.tpcvietnam.com/product/may-ban-dinh-u-total-tcsnli6008/

Specification of a TOTAL brand powertool

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here