uyetch
uyetch

Reputation: 2190

How to download all links to .zip files on a given web page using wget/curl?

A page contains links to a set of .zip files, all of which I want to download. I know this can be done by wget and curl. How is it done?

Upvotes: 89

Views: 83379

Answers (3)

M Lindblad
M Lindblad

Reputation: 71

For other scenarios with some parallel magic I use:

curl [url] | grep -i [filending] | sed -n 's/.*href="\([^"]*\).*/\1/p' |  parallel -N5 wget -

Upvotes: 7

creaktive
creaktive

Reputation: 5210

The command is:

wget -r -np -l 1 -A zip http://example.com/download/

Options meaning:

-r,  --recursive          specify recursive download.
-np, --no-parent          don't ascend to the parent directory.
-l,  --level=NUMBER       maximum recursion depth (inf or 0 for infinite).
-A,  --accept=LIST        comma-separated list of accepted extensions.

Upvotes: 139

K.-Michael Aye
K.-Michael Aye

Reputation: 5605

Above solution does not work for me. For me only this one works:

wget -r -l1 -H -t1 -nd -N -np -A.mp3 -erobots=off [url of website]

Options meaning:

-r            recursive
-l1           maximum recursion depth (1=use only this directory)
-H            span hosts (visit other hosts in the recursion)
-t1           Number of retries
-nd           Don't make new directories, put downloaded files in this one
-N            turn on timestamping
-A.mp3        download only mp3s
-erobots=off  execute "robots.off" as if it were a part of .wgetrc

Upvotes: 95

Related Questions