There are several identical records in thousands of records. How can I use SQL statements to delete duplicates? Thank you!
1. Search for redundant duplicate records in the Table. duplicate records are determined based on a single field (peopleId ).
Select * from people
Where peopleId in (select peopleId from people group by peopleId having count (peopleId)> 1)
2. Delete unnecessary duplicate records in the Table. Repeat records are determined based on a single field (eagleid), leaving only the records with the smallest rowid
Delete from people
Where peopleId in (select peopleId from people group by peopleId having count (peopleId)> 1)
And rowid not in (select min (rowid) from people group by peopleId having count (peopleId)> 1)
3. Search for redundant duplicate records in the table (multiple fields)
Select * from vitae
Where (a. peopleId, a. seq) in (select peopleId, seq from vitae group by peopleId, seq having count (*)> 1)
4. Delete redundant record (multiple fields) in the table, leaving only the records with the smallest rowid
Delete from vitae
Where (a. peopleId, a. seq) in (select peopleId, seq from vitae group by peopleId, seq having count (*)> 1)
And rowid not in (select min (rowid) from vitae group by peopleId, seq having count (*)> 1)
5. Search for redundant duplicate records (multiple fields) in the table, excluding records with the smallest rowid
Select * from vitae
Where (a. peopleId, a. seq) in (select peopleId, seq from vitae group by peopleId, seq having count (*)> 1)
And rowid not in (select min (rowid) from vitae group by peopleId, seq having count (*)> 1)
For example, there is A field "name" in Table A, and the "name" values may be the same between different records,
Now, you need to query items with duplicate "name" values between records in the table;
Select Name, Count (*) From A Group By Name Having Count (*)> 1
If the gender is also the same, the statement is as follows:
Select Name, sex, Count (*) From A Group By Name, sex Having Count (*)> 1
2. group by Method
Query data:
Select count (num), max (name) from student -- lists the number of repeated records and its name attribute
Group by num
Having count (num)> 1 -- group by num to find the num column in the table, that is, more than once
Delete data:
Delete from student
Group by num
Having count (num)> 1
In this way, all duplicates are deleted.
3. Use the distinct method-useful for small tables
Create table table_new as select distinct * from table1 minux
Truncate table table1;
Insert into table1 select * from table_new;
Delete select a. * from FLRK1 a inner join FLRK1 B on a. Record Number = B. Record Number and
(A. [ID] = B. [ID] and a. warehouse receiving date = B. warehouse receiving date and a. operation time = B. Operation Time)
Delete from FLRK1 where record number in
(Select min (Record Number) from FLRK1 group by record number having count (Record Number)> 1)
Table A structure:
ID RQ SJ C
--------------------------------------------
1 2005-07-14 14:20:50 A1
2 05:12:23 A1
3 2005-07-14 14:20:50 A1
4 2005-06-16 16:16:16 A2
5 16:16:16 A2
6 05:10:35 A3
7 05:12:23 A1
--------------------------------------------
Find one SQL statement and delete the same record for the RQ, SJ, and C fields in table.
Result:
ID RQ SJ C
--------------------------------------------
1 2005-07-14 14:20:50 A1
2 2005-02-15 05:12:23 A1
4 2005-06-16 16:16:16 A2
6 05:10:35 A3
--------------------------------------------
Delete from A Where ID Not In (Select Min (ID) from A Group By RQ, SJ, C)
Delete a from tb a inner join tb as B on. fid <B. fid and. c = B. c and. rq = B. rq and. sj = B. sj
Delete from A t
Where exists (select 1 from A where ID <A. ID and SJ = t. SJ and RQ = t. RQ and C = t. c)
Method 1
Declare @ max integer, @ id integer
Declare cur_rows cursor local for select Main field, count (*) from table name group by main field having count (*)> 1
Open cur_rows
Fetch cur_rows into @ id, @ max
While fetch_status = 0
Begin
Select @ max = @ max-1
Set rowcount @ max
Delete from table name where primary field = @ id
Fetch cur_rows into @ id, @ max
End
Close cur_rows
Set rowcount 0
Method 2
There are two Repeated Records. One is a completely repeated record, that is, records with all fields being repeated, and the other is records with duplicate key fields, such as duplicate Name fields, other fields are not necessarily repeated or can be ignored.
1. For the first type of repetition, it is easier to solve.
Select distinct * from tableName
You can get the result set without repeated records.
If the table needs to delete duplicate records (one record is retained), you can delete the record as follows:
Select distinct * into # Tmp from tableName
Drop table tableName
Select * into tableName from # Tmp
Drop table # Tmp
The reason for this repetition is that the table design is not weekly. You can add a unique index column.
2. Repeat problems usually require that the first record in the repeat record be retained. The procedure is as follows:
Assume that the duplicate fields are Name and Address. You must obtain the unique result set of the two fields.
Select identity (int, 1, 1) as autoID, * into # Tmp from tableName
Select min (autoID) as autoID into # Tmp2 from # Tmp group by Name, autoID
Select * from # Tmp where autoID in (select autoID from # tmp2)
The last select command gets the result set with no duplicate Name and Address (but an autoID field is added, which can be omitted in the select clause when writing)