Sunday 1 September 2019

Parallel file copy from single source to multiple targets?


I have a several large files on optical media I would like to copy to multiple targets - in this case I have two hard drives attached to the same computer. Is there a utility that can function like:


copy source target1 target2 ... targetN

Answer



For single files you can use tee to copy to multiple places:


cat  | tee   > 

or if you prefer the demoggified version:


tee   >  < 

Note that as Dennis points out in the comments tee outputs to stdout as well as the listed files, hence using redirect to point to file 3 in the above examples. You could also redirect this to /dev/null as below - this has the advantage of keeping the file list more consistent on the command line (which may make it easier to script up a solution for variable numbers of files) but is a little less efficient (though the efficiency difference is small: about the same as the difference between using the cat version or the version without cat):


cat  | tee    > /dev/null

You could probably combine one of the above with find quite easily to operate on multiple files in one directory and less easily to operate on files spread over a directory structure. Otherwise you might just have to set the multiple copy operations off in parallel as separate tasks and hope that the OS disk cache is bright and/or big enough that each of the parallel tasks used cached read data from the first instead of causing drive-head thrashing.


AVAILABILITY: tee is commonly available on standard Linux setups and other unix or unix-alike systems, usually as part of the GNU "coreutils" package. If you are using Windows (your question doesn't specify) then you should find it in the various Windows ports such as Cygwin.


PROGRESS INFORMATION: As copying a large file off optical media may take some time (or over slow network, or an even larger file from even local fast media), progress information can be useful. On the command line I tend to use pipe viewer (available in most Linux distros & many Windows port collections and easy to compile yourself where not available directly) for this - just replace cat with pv like so:


pv  | tee   > 

No comments:

Post a Comment

How can I VLOOKUP in multiple Excel documents?

I am trying to VLOOKUP reference data with around 400 seperate Excel files. Is it possible to do this in a quick way rather than doing it m...