Pandas. dataframe. drop_duplicates usage instructions

Source: Internet
Author: User
Dataframe. drop_duplicates (subset = none, keep = 'first', inplace = false)

SubsetTo determine which column duplicate occurs, all columns are considered by default.
KeepContains three parametersFirst,Last,False,FirstIt indicates that the first repeat data retrieved is retained and all subsequent data are deleted;LastIndicates that the last retrieved duplicate data is retained and all previously searched duplicate data is deleted,FalseThis means that all searched duplicate data is deleted and none is retained. That is, if two rows of data are duplicated, both rows of data are deleted instead of one row. The default parameter isFirst.

Pandas. dataframe. drop_duplicates usage instructions

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.