Why does my Curl command fail to download a file most of the time, but sometimes works?

Question

I have a strange problem with Curl on Ubuntu 16.04. Each night i Curl a large file from a remote API endpoint and save it to a folder, i use the curl command inside a bash script, i have noticed that my Curl command fails to save the output most of the time, so i created a script that checks if the file has saved, and if not it attempts to download again.

Each night i check the logs and i can see see that most of the time the curl fails at-least 5 times before saving the file:

file is empty - retrying download
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  4768    0  4768    0     0     47      0 --:--:--  0:01:40 --:--:--  1236
file is empty - retrying download
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  4768    0  4768    0     0     47      0 --:--:--  0:01:40 --:--:--  1266
file is empty - retrying download
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed

It always stops after 1 minute and 40 seconds, i suspect this is when Curl gives up.

The curl command i am using is:

curl -s "https://*.com/api/v2.0/*.*./?apitoken=123" > file.txt

The strange thing is when i run curl without the apostrophes round the URL and don't save it to a file:

  curl -s https://*.com/api/v2.0/*.*./?apitoken=123

I see the output straight away

When i run it without apostrophes and pipe into a file:

  curl -s "https://*.com/api/v2.0/*.*./?apitoken=123 > file.txt

I see the file downloads, but the command output does not go into file.txt it just appears in the shell.

If i use -o with Curl to specify the output i get the same timeout issue as above and it often fails multiple times before working.

All my other curl commands work so i suspect this is an issue because its a big file.

When i run Curl with -v i can see that after it fails i receive a 524 error from cloudflare: HTTP/1.1 524 Origin Time-out

What i don't understand is - Why my original command fails 90% of the time before eventually suceeding - Why after removing the apostrophes from the URL it downloads straight away - Why without the apostrophes it doesn't save to a file with " curl > file"

Can anybody shed light on this and show a proper method of curling a large file? (preferably piping the output to a file rather then using -o)

Why does my Curl command fail to download a file most of the time, but sometimes works?

Answers (1)

Related Questions