|
|
|
|
|
- Overview
- How to programmatically manipulate files on an HDFS server
HDFS (Hadoop Distributed File System) is one of the components of the Hadoop framework. HDFS is a distributed file system, which means that it uses a cluster of servers to create a storage space. HDFS can store any number of files. The files stored in the cluster are divided into data blocks and distributed among the different nodes of the HDFS cluster. To guarantee data resilience, each data block is duplicated on several nodes (three by default). Therefore, the file system is not affected if a node fails. How to programmatically manipulate files on an HDFS server To handle files on a HDFS server, you can: - Connect to the HDFS server (hdfsOpenConnection).
- Perform one of the following operations:
- Close the connection to the server (hdfsCloseConnection).
Please note: HDFS functions are only available for Windows Vista or Windows Server 2008.
This page is also available for…
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|