Hadoop 2 Shell Operations in Detail

Source: Internet
Author: User
Tags hdfs dfs hadoop fs

FS Shell

File system (FS) shell commands are invoked in the form bin/hadoop fs <args>. The usage lines below use the hdfs dfs <args> form, which runs the same commands against HDFS. All FS shell commands take URI paths as arguments. The URI format is scheme://authority/path. For HDFS the scheme is hdfs, and for the local file system the scheme is file. The scheme and authority are optional; if they are not specified, the default scheme specified in the configuration is used. An HDFS file or directory such as /parent/child can be written as hdfs://namenode:namenodeport/parent/child, or more simply as /parent/child (assuming that the default in your configuration file is namenode:namenodeport). Most FS shell commands behave like the corresponding Unix shell commands; differences are noted in the description of each command. Error messages are written to stderr, and all other output is written to stdout.
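To make the path rule above concrete, here is a minimal shell sketch of how a bare path relates to its fully qualified form. The qualify_path function is a hypothetical helper written for this illustration and is not part of Hadoop; the real resolution is done by the FS shell itself using the default file system from the configuration.

```shell
# qualify_path DEFAULT_FS PATH  (hypothetical helper, not a Hadoop command)
# Prints PATH unchanged if it already carries a scheme (hdfs://, file://, ...);
# otherwise prepends DEFAULT_FS to an absolute path.
qualify_path() {
  case "$2" in
    *://*) printf '%s\n' "$2" ;;        # already fully qualified
    /*)    printf '%s\n' "${1%/}$2" ;;  # absolute path: add scheme and authority
    *)     printf '%s\n' "$2" ;;        # relative path: left as is in this sketch
  esac
}

qualify_path hdfs://namenode:namenodeport /parent/child
# prints hdfs://namenode:namenodeport/parent/child
qualify_path hdfs://namenode:namenodeport file:///tmp/localfile
# prints file:///tmp/localfile
```

The sketch deliberately ignores relative paths, which the FS shell resolves against the user's home directory.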

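appendToFile--New Command

Usage: hdfs dfs -appendToFile <localsrc> ... <dst>

Appends a single source, or multiple sources, from the local file system to the destination file system. It can also read input from stdin and append it to the destination file system.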

    • hdfs dfs -appendToFile localfile /user/hadoop/hadoopfile
    • hdfs dfs -appendToFile localfile1 localfile2 /user/hadoop/hadoopfile
    • hdfs dfs -appendToFile localfile hdfs://nn.example.com/hadoop/hadoopfile
    • hdfs dfs -appendToFile - hdfs://nn.example.com/hadoop/hadoopfile (reads the input from stdin)

Return value:
Returns 0 on success and -1 on error.
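As a sketch of the stdin form, the following wrapper appends one line of text to an HDFS file by piping it to -appendToFile with - as the source. The name append_line and the path in the usage comment are assumptions for illustration; the sketch also assumes that the hdfs command is on the PATH and that the target file system permits appends.

```shell
# append_line DST TEXT...  (hypothetical wrapper)
# Pipes TEXT, followed by a newline, into `hdfs dfs -appendToFile - DST`.
# The function's exit status is the status of the hdfs command.
append_line() {
  dst="$1"
  shift
  printf '%s\n' "$*" | hdfs dfs -appendToFile - "$dst"
}

# Example call (path is illustrative):
#   append_line /user/hadoop/hadoopfile "job finished"
```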

cat

Usage: hdfs dfs -cat URI [URI ...]

Copies the contents of the files at the specified paths to stdout.

Example:

    • hdfs dfs -cat hdfs://host1:port1/file1 hdfs://host2:port2/file2
    • hdfs dfs -cat file:///file3 /user/hadoop/file4

Return value:
Returns 0 on success and -1 on error.
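A calling shell only sees exit statuses in the range 0 to 255, so a status of -1 would appear as 255. Scripts therefore usually test for a non-zero status instead of comparing against -1. A minimal sketch, assuming hdfs is on the PATH; cat_or_warn is a hypothetical name:

```shell
# cat_or_warn URI...  (hypothetical helper)
# Runs `hdfs dfs -cat` and reports a failure on stderr when the exit
# status is non-zero.
cat_or_warn() {
  if ! hdfs dfs -cat "$@"; then
    echo "hdfs dfs -cat failed for: $*" >&2
    return 1
  fi
}

# Example call (path is illustrative):
#   cat_or_warn /user/hadoop/file4 > file4.local
```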

chgrp

Usage: hdfs dfs -chgrp [-R] GROUP URI [URI ...]

Changes the group association of files. With -R, the change is made recursively through the directory structure. The user must be the owner of the files, or else a superuser. For more information, see the HDFS Permissions User Guide.

chmod

Usage: hdfs dfs -chmod [-R] <MODE[,MODE]... | OCTALMODE> URI [URI ...]

Changes the permissions of files. With -R, the change is made recursively through the directory structure. The user must be the owner of the file, or else a superuser. For more information, see the HDFS Permissions User Guide.

chown

Usage: hdfs dfs -chown [-R] [OWNER][:[GROUP]] URI [URI ...]

Changes the owner of files. With -R, the change is made recursively through the directory structure. The user must be a superuser. For more information, see the HDFS Permissions User Guide.

copyFromLocal

Usage: hdfs dfs -copyFromLocal <localsrc> URI

Similar to the put command, except that the source is restricted to a local file reference.

copyToLocal

Usage: hdfs dfs -copyToLocal [-ignorecrc] [-crc] URI <localdst>

Similar to the get command, except that the destination is restricted to a local file reference.

count--New Command

Usage: hdfs dfs -count [-q] <paths>

Counts the number of directories, files, and bytes under the paths that match the specified file pattern. The output columns with -count are: DIR_COUNT, FILE_COUNT, CONTENT_SIZE, FILE_NAME

The output columns with -count -q are: QUOTA, REMAINING_QUOTA, SPACE_QUOTA, REMAINING_SPACE_QUOTA, DIR_COUNT, FILE_COUNT, CONTENT_SIZE, FILE_NAME

Example:

    • hdfs dfs -count hdfs://nn1.example.com/file1 hdfs://nn2.example.com/file2
    • hdfs dfs -count -q hdfs://nn1.example.com/file1

Return value:

Returns 0 on success and -1 on error.
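Because plain -count output (without -q) starts with the directory count, file count, and content size columns, it is easy to total with awk. The following is a minimal sketch under that assumption; summarize_count is a hypothetical helper and the paths in the usage comment are illustrative.

```shell
# summarize_count  (hypothetical helper)
# Reads plain `hdfs dfs -count` output (no -q) on stdin and totals the
# first three columns: directory count, file count, content size.
summarize_count() {
  awk '{ dirs += $1; files += $2; bytes += $3 }
       END { printf "dirs=%d files=%d bytes=%d\n", dirs, files, bytes }'
}

# Example call (paths are illustrative):
#   hdfs dfs -count /user/hadoop/dir1 /user/hadoop/dir2 | summarize_count
```

With -q the counts are preceded by four quota columns, so the field numbers in the awk program would have to be shifted accordingly.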

cp

Usage: hdfs dfs -cp URI [URI ...] <dest>

Copies files from source to destination. This command allows multiple sources, in which case the destination must be a directory.
Example:

    • hdfs dfs -cp /user/hadoop/file1 /user/hadoop/file2
    • hdfs dfs -cp /user/hadoop/file1 /user/hadoop/file2 /user/hadoop/dir

Return value:

Returns 0 on success and -1 on error.

du

Usage: hdfs dfs -du URI [URI ...]

Displays the sizes of the files contained in the given directory, or the size of the file if the argument is a single file.
Example:
hdfs dfs -du /user/hadoop/dir1 /user/hadoop/file1 hdfs://host:port/user/hadoop/dir1
Return value:
Returns 0 on success and -1 on error.

dus

Usage: hdfs dfs -dus <args>

Displays a summary of file sizes.

expunge

Usage: hdfs dfs -expunge

Empties the trash. Refer to the HDFS design documentation for more information on the trash feature.

get

Usage: hdfs dfs -get [-ignorecrc] [-crc] <src> <localdst>

Copies files to the local file system. Files that fail the CRC check can still be copied with the -ignorecrc option. Use the -crc option to copy files together with their CRC information.

Example:

    • hdfs dfs -get /user/hadoop/file localfile
    • hdfs dfs -get hdfs://host:port/user/hadoop/file localfile

Return value:

Returns 0 on success and -1 on error.

getmerge

Usage: hdfs dfs -getmerge <src> <localdst> [addnl]

Takes a source directory and a destination file as input and concatenates all the files in the source directory into the local destination file. The optional addnl argument adds a newline character at the end of each file.
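To illustrate what the merged output looks like, the following sketch performs the same kind of concatenation on a local directory, writing a newline after each file as the optional newline argument is described to do. It is a local stand-in only and says nothing about how getmerge is implemented; merge_local is a hypothetical name, and the destination file is assumed to live outside the source directory.

```shell
# merge_local SRC_DIR DST_FILE  (hypothetical, local-only illustration)
# Concatenates every regular file in SRC_DIR into DST_FILE, in the
# shell's glob order, writing a newline after each file.
merge_local() {
  : > "$2"                     # create or truncate the destination
  for f in "$1"/*; do
    [ -f "$f" ] || continue    # skip subdirectories and unmatched globs
    cat "$f" >> "$2"
    printf '\n' >> "$2"
  done
}

# Example call (paths are illustrative):
#   merge_local ./parts ./merged.txt
```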

ls

Usage: hdfs dfs -ls <args>

For a file, returns the file's status in the following format:
permissions number_of_replicas userid groupid filesize modification_date modification_time filename
For a directory, it returns a list of its direct children, as in Unix. A directory is listed in the following format:
permissions userid groupid modification_date modification_time dirname
Example:
hdfs dfs -ls /user/hadoop/file1 /user/hadoop/file2 hdfs://host:port/user/hadoop/dir1 /nonexistentfile
Return value:
Returns 0 on success and -1 on error.

lsr

Usage: hdfs dfs -lsr <args>
The recursive version of the ls command. Similar to ls -R in Unix.

mkdir

Usage: hdfs dfs -mkdir [-p] <paths>

Takes path URIs as arguments and creates the directories. With the -p option it behaves like Unix mkdir -p, creating parent directories along the path.

Example:

    • hdfs dfs -mkdir /user/hadoop/dir1 /user/hadoop/dir2
    • hdfs dfs -mkdir hdfs://host1:port1/user/hadoop/dir hdfs://host2:port2/user/hadoop/dir

Return value:

Returns 0 on success and -1 on error.

moveFromLocal

Usage: hdfs dfs -moveFromLocal <localsrc> <dst>

Similar to the put command, except that the local source is deleted after it has been copied.

mv

Usage: hdfs dfs -mv URI [URI ...] <dest>

Moves files from source to destination. This command allows multiple sources, in which case the destination must be a directory. Moving files across file systems is not permitted.
Example:

    • hdfs dfs -mv /user/hadoop/file1 /user/hadoop/file2
    • hdfs dfs -mv hdfs://host:port/file1 hdfs://host:port/file2 hdfs://host:port/file3 hdfs://host:port/dir1

Return value:

Returns 0 on success and -1 on error.

put

Usage: hdfs dfs -put <localsrc> ... <dst>

Copies a single source, or multiple sources, from the local file system to the destination file system. It can also read input from stdin and write it to the destination file system.

    • hdfs dfs -put localfile /user/hadoop/hadoopfile
    • hdfs dfs -put localfile1 localfile2 /user/hadoop/hadoopdir
    • hdfs dfs -put localfile hdfs://nn.example.com/hadoop/hadoopfile
    • hdfs dfs -put - hdfs://nn.example.com/hadoop/hadoopfile (reads the input from stdin)

Return value:

Returns 0 on success and -1 on error.

rm

Usage: hdfs dfs -rm URI [URI ...]

Deletes the specified files. Non-empty directories are not deleted by this command; refer to the rmr command for recursive deletion.
Example:

    • hdfs dfs -rm hdfs://host:port/file /user/hadoop/emptydir

Return value:

Returns 0 on success and -1 on error.

rmr

Usage: hdfs dfs -rmr URI [URI ...]

The recursive version of rm.
Example:

    • hdfs dfs -rmr /user/hadoop/dir
    • hdfs dfs -rmr hdfs://host:port/user/hadoop/dir

Return value:

Returns 0 on success and -1 on error.

setrep

Usage: hdfs dfs -setrep [-R] [-w] <rep> <path>

Changes the replication factor of a file. If the path is a directory, the command recursively changes the replication factor of all files under that directory.

Options:

    • -w requests that the command wait for the replication to complete. This can take a very long time.
    • -R is accepted for backward compatibility. It has no effect.

Example:

    • hdfs dfs -setrep -w 3 /user/hadoop/dir1

Return value:

Returns 0 on success and -1 on error.

stat

Usage: hdfs dfs -stat URI [URI ...]

Returns the stat information for the specified path.

Example:

    • hdfs dfs -stat path

Return value:
Returns 0 on success and -1 on error.

tail

Usage: hdfs dfs -tail [-f] URI

Displays the last kilobyte of the file on stdout. The -f option is supported and behaves as in Unix.

Example:

    • hdfs dfs -tail pathname

Return value:
Returns 0 on success and -1 on error.
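For comparison, the local Unix counterpart of "the last kilobyte of a file" is tail -c 1024. The sketch below wraps it so the two can be compared side by side; last_kb is a hypothetical name and it works on local files only.

```shell
# last_kb FILE  (hypothetical helper, local files only)
# Prints the final 1024 bytes of FILE, or the whole file if it is shorter.
last_kb() {
  tail -c 1024 "$1"
}

# Example call (file name is illustrative):
#   last_kb ./application.log
```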

test

Usage: hdfs dfs -test -[ezd] URI

Options:
-e checks whether the file exists. Returns 0 if it exists.
-z checks whether the file is zero length. Returns 0 if it is.
-d checks whether the path is a directory. Returns 0 if it is.
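Since the existence check reports its result only through the exit status, it is typically used in a shell conditional. A minimal sketch, assuming hdfs is on the PATH; path_exists is a hypothetical name and the path in the usage comment is illustrative:

```shell
# path_exists URI  (hypothetical helper)
# Succeeds (status 0) when `hdfs dfs -test -e` reports that URI exists.
path_exists() {
  hdfs dfs -test -e "$1"
}

# Example use:
#   if path_exists /user/hadoop/file1; then
#     echo "found"
#   else
#     echo "missing"
#   fi
```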

Example:

    • hdfs dfs -test -e filename

text

Usage: hdfs dfs -text <src>

Takes a source file and outputs it in text format. The allowed formats are zip and TextRecordInputStream.

touchz

Usage: hdfs dfs -touchz URI [URI ...]

Creates a file of zero length.

Example:

    • hdfs dfs -touchz pathname

Return value:
Returns 0 on success and -1 on error.

Reference: http://hadoop.apache.org/docs/r2.2.0/hadoop-project-dist/hadoop-common/FileSystemShell.html
