Merge Linux files and remove duplicates

Source: Internet
Author: User

 

(1) Union, intersection, and difference of two files
Prerequisite: neither file may contain duplicate rows of its own.
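To see why this prerequisite matters, here is a minimal sketch using hypothetical sample data (the file contents and the names naive.txt, correct.txt, file1.dedup, and file2.dedup are made up for illustration). A row repeated inside one file looks the same to uniq -d as a row shared by both files, so each file is deduplicated with sort -u first.

```shell
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical data: "apple" is repeated inside file1 and absent from file2.
printf '%s\n' apple apple banana > file1
printf '%s\n' banana cherry > file2

# Without deduplication, "apple" is wrongly reported as common to both files.
cat file1 file2 | sort | uniq -d > naive.txt

# Deduplicate each file first, then run the same pipeline.
sort -u file1 > file1.dedup
sort -u file2 > file2.dedup
cat file1.dedup file2.dedup | sort | uniq -d > correct.txt
```

With this data, naive.txt contains apple and banana, while correct.txt contains only banana.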
1. Extract the union of the two files (rows that appear in both are kept only once)
cat file1 file2 | sort | uniq > file3
2. Extract the intersection of the two files (only rows that exist in both files are kept)
cat file1 file2 | sort | uniq -d > file3
3. Delete the intersection and keep the remaining rows (those that appear in only one of the files)
cat file1 file2 | sort | uniq -u > file3
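The three operations above can be tried on a small example. This is only a sketch: the sample rows and the output names union.txt, intersection.txt, and difference.txt are assumptions chosen for illustration, and neither input file repeats a row internally.

```shell
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical sample data; neither file repeats a line internally.
printf '%s\n' apple banana cherry > file1
printf '%s\n' banana cherry date > file2

# Union: every distinct line from either file.
cat file1 file2 | sort | uniq > union.txt

# Intersection: lines that occur in both files.
cat file1 file2 | sort | uniq -d > intersection.txt

# Difference: lines that occur in exactly one of the files.
cat file1 file2 | sort | uniq -u > difference.txt
```

Here union.txt holds apple, banana, cherry, date; intersection.txt holds banana, cherry; and difference.txt holds apple, date. The sort step is needed because uniq only compares adjacent lines.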
(2) Merge two files
One file on top and the other below (vertical concatenation)
cat file1 file2 > file3
One file on the left and the other on the right (rows joined side by side)
paste file1 file2 > file3
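The difference between the two merges is easiest to see on a tiny example. The sketch below uses made-up file contents and the hypothetical output names stacked.txt and side_by_side.txt.

```shell
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical two-line inputs.
printf '%s\n' a b > file1
printf '%s\n' 1 2 > file2

# Vertical merge: the lines of file2 follow the lines of file1.
cat file1 file2 > stacked.txt

# Horizontal merge: line N of file1 and line N of file2 are joined by a tab.
paste file1 file2 > side_by_side.txt
```

stacked.txt has four lines (a, b, 1, 2), while side_by_side.txt has two lines, each holding a letter and a digit separated by a tab. A different separator can be chosen with paste -d, for example paste -d, file1 file2 for commas.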
(3) Remove duplicate rows from a file
sort file | uniq
Note: each group of repeated rows is collapsed into a single row. The repeated content is still present in the output, but only one copy of it.
sort file | uniq -u
This command drops every row that occurs more than once, leaving only the rows that appear exactly once in the file.
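A short sketch contrasting the two commands, using hypothetical sample rows and the made-up output names distinct.txt, distinct_short.txt, and singletons.txt:

```shell
# Work in a scratch directory so no real files are touched.
workdir=$(mktemp -d)
cd "$workdir" || exit 1

# Hypothetical data: "red" occurs three times, "blue" twice, "green" once.
printf '%s\n' red blue red green blue red > file

# Collapse each group of repeated lines to a single copy.
sort file | uniq > distinct.txt

# sort -u gives the same result in one step.
sort -u file > distinct_short.txt

# Keep only the lines that occur exactly once.
sort file | uniq -u > singletons.txt
```

distinct.txt and distinct_short.txt both contain blue, green, red, whereas singletons.txt contains only green. To see how often each row occurs, sort file | uniq -c prefixes every distinct row with its count.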
