Database Reference
In-Depth Information
[
28
]
For an exposition of the scalability limits of HDFS, see Konstantin V. Shvachko,
“HDFS Scalability:
The Limits to Growth”
,
April 2010.
property names, and deprecated the old ones (see
Which Properties Can I Set?
)
. This topic uses the new
property names.
[
30
]
In Hadoop 2 and later, there is a new filesystem interface called
FileContext
with better handling of
multiple filesystems (so a single
FileContext
can resolve multiple filesystem schemes, for example) and
a cleaner, more consistent interface.
FileSystem
is still more widely used, however.
[
34
]
Even for a single file copy, the
distcp
variant is preferred for large files since
hadoop fs -cp
copies
the file via the client running the command.