Reputation: 401
How can I compare a tar
file (already compressed) of the original folder with the original folder?
First I created archive file using
tar -kzcvf directory_name.zip directory_name
Then I tried to compare using
tar -diff -vf directory_name.zip directory_name
But it didn't work.
Upvotes: 40
Views: 35901
Reputation: 5070
I recently needed a better compare than what "tar --diff" produced so I made this short script:
#!/bin/bash
tar tf "$1" | while read ; do
if [ "${REPLY%/}" = "$REPLY" ] ; then
tar xOf "$1" "$REPLY" | diff -u - "$REPLY"
fi
done
Upvotes: 1
Reputation: 53
The method of pix is way slow for large compressed tar files, because it extracts each file individually. I use the tar --diff method loking for files with different modification time and extract and diff only these. The files are extracted into a folder base.orig where base is either the top level folder of the tar file or teh given comparison folder. This results in diffs including the date of the original file.
Here is the script:
#!/bin/bash
set -o nounset
# Print usage
if [ "$#" -lt 1 ] ; then
echo 'Diff a tar (or compressed tar) file with a folder'
echo 'difftar-folder.sh <tarfile> [<folder>] [strip]'
echo default for folder is .
echo default for strip is 0.
echo 'strip must be 0 or 1.'
exit 1
fi
# Parse parameters
tarfile=$1
if [ "$#" -ge 2 ] ; then
folder=$2
else
folder=.
fi
if [ "$#" -ge 3 ] ; then
strip=$3
else
strip=0
fi
# Get path prefix if --strip is used
if [ "$strip" -gt 0 ] ; then
prefix=`tar -t -f $tarfile | head -1`
else
prefix=
fi
# Original folder
if [ "$strip" -gt 0 ] ; then
orig=${prefix%/}.orig
elif [ "$folder" = "." ] ; then
orig=${tarfile##*/}
orig=./${orig%%.tar*}.orig
elif [ "$folder" = "" ] ; then
orig=${tarfile##*/}
orig=${orig%%.tar*}.orig
else
orig=$folder.orig
fi
echo $orig
mkdir -p "$orig"
# Make sure tar uses english output (for Mod time differs)
export LC_ALL=C
# Search all files with a deviating modification time using tar --diff
tar --diff -a -f "$tarfile" --strip $strip --directory "$folder" | grep "Mod time differs" | while read -r file ; do
# Substitute ': Mod time differs' with nothing
file=${file/: Mod time differs/}
# Check if file exists
if [ -f "$folder/$file" ] ; then
# Extract original file
tar -x -a -f "$tarfile" --strip $strip --directory "$orig" "$prefix$file"
# Compute diff
diff -u "$orig/$file" "$folder/$file"
fi
done
Upvotes: 4
Reputation: 197
To ignore differences in some or all of the metadata (user, time, permissions), you can pipe the result to awk
:
tar --compare --file=archive-file.tar -C /some/where/ | awk '!/Mode/ && !/Uid/ && !/Gid/ && !/time/'
That should output only the true differences between the tar and the directory /some/where/
Upvotes: 8
Reputation: 209
You may use diff's --compare (-diff, d) option. You have to take some care because diff compares only files specified on the command line, and only those that simultaneously exist inside the archive. For example, new existing files are not reported. Often I prefer the approach of pix to take more control.
However, unlike pix and Michael Soegtrop, I do not think you have to extract any file.
The following code test diff's ability to compare files.
touch refF; setTM12 () { touch -r refF F1 F2; };
# create the files
echo a1a > F1; echo a2a > F2; echo a3a>F3; echo a4a>F4; setTM12;
tar cf tarF F1 F2 F3 F4;
# do not change times of F1 F2
# modify F1 F2 F3, change the mtime of F4
echo mod > F1; echo longer > F2; setTM12;
sleep 2; echo XXX > F3; touch F4;
tar -df tarF F1 F2 F3 F4
F1: Contents differ
F2: Size differs
F3: Mod time differs
F3: Contents differ
F4: Mod time differs
You may need to know that Size differs
implicitly tags files whose contents differs, for example F2
.
-v
is handy option, two.
tar -vdf tarF F1 F2 F3 F4
F1
F1: Contents differ
F2
F2: Size differs <--- Means that the Contents differ, too !
F3
F3: Mod time differs
F3: Contents differ
F4
F4: Mod time differs
Upvotes: 0
Reputation: 1
The easy way is to write:
tar df file
This compares the file with the current working directory, and tell us about if any of the files has been removed.tar df file -C path/folder
This compares the file with the folder.Upvotes: -1
Reputation: 5636
--compare (-d)
is more handy for that.
tar --compare --file=archive-file.tar
works if archive-file.tar is in the directory it was created. To compare archive-file.tar against a remote target (eg if you have moved archive-file.tar to /some/where/) use the -C
parameter:
tar --compare --file=archive-file.tar -C /some/where/
If you want to see tar working, use -v
without -v
only errors (missing files/folders) are reported.
Tipp: This works with compressed tar.bz/ tar.gz archives, too.
Upvotes: 58
Reputation: 389
It should be --diff
Try this (without the last directory_name):
tar --diff -vf directory_name.zip
The problem is that the --diff command only looks for differences on the existing files among the tar file and the folder. So, if a new file is added to the folder, the diff command does not report this.
Upvotes: 13