Tuesday, 4 June 2019

hardlink - What is the difference between NTFS hard links and directory junctions?


The title says it all... what's the difference? When do I use one and when do I use the other?


Added: Note that Junction points, Hard links and Symbolic (soft) links are three separate things on NTFS.



Answer



NTFS data files are stored with all their attributes as an "inode" with an ID, and filenames point to the inode. All files have one+ hardlink. some tools can create extra names (hardlinks) which point to the same inode, which has a counter of the number of names pointing to it. If the number goes to zero, the OS deletes the file, the inode.


Special tools, like LN can create additional filenames which point to existing inodes, using a filename that points to them. Note that all the file attributes, including dates and times are stored with the inode, not the filename.


(Extra) hardlinks are useful when you want different names to point to the same data, read or write. They save space and overhead. However, they clobber programs which assume different names point to different data. e.g, if you replace references to duplicate files by hardlinks to a single file, then you have removed any backup you have! The windows OS is one program which assumes different names point to different sets of bits.


Hardlinks have to be on the same physical and logical volume.


They are useful for:




  • reducing path lengths since Windows just supports 256 characters or so.




  • reducing storage space of duplicate data files - eg, you could have multiple backups of a partition with multiple complete directory trees, but with hardlinks for files which have not changed since the previous backup.




Junction points are messier and use Reparse Points (which I do not understand). They can point to different physical or logical volumes but they have to be on the same computer. They apparently can be used like symlinks for directories, allowing a second name for the same destination, but limited to a single computer, rather than just a single volume.


Useful for reducing path lengths, or simplifying working with several programs with hardcoded directory names.


Same caveats, too! Many programs assume different names = different files.


And then there are symbolic links (which can go across networks), shortcuts and PIFs and .....


Also some information from Microsoft itself:


Basically, they're saying to use hard links only for files within the same volume (disk drive letter) and junction points only for directories on the same computer (they can span across different local volumes).


No comments:

Post a Comment

How can I VLOOKUP in multiple Excel documents?

I am trying to VLOOKUP reference data with around 400 seperate Excel files. Is it possible to do this in a quick way rather than doing it m...