There is a list of English names in the heap. é rimaadiyogamehmettun çerguoyouzhangjoseph carlosmariateguicesaramor ósannepyburnbruced... there is a list of English names in the heap.
Sjoerd van Tuinen
Ulisses Caramaschi
Ulisses Caramaschi
Pankaj Sekhsaria
Brandon Balleng é e
Prima Adi Yoga
Mehmet Tun çer
Guoyou Zhang
Jose Carlos Mariategui
Cesar Amor ós
Anne Pyburn
Bruce D Patterson
Nancy Sevcenko
Dwirienna Adiyanti
Ulises Chavez Jimenez
It contains Chinese English names, such as Guoyou Zhang. How can I use a program to quickly filter Chinese names.
You can have a list of Chinese surnames and commonly used Chinese characters. I don't know how to write it using a php program?
Reply content:
There is a list of English names
Sjoerd van Tuinen
Ulisses Caramaschi
Ulisses Caramaschi
Pankaj Sekhsaria
Brandon Balleng é e
Prima Adi Yoga
Mehmet Tun çer
Guoyou Zhang
Jose Carlos Mariategui
Cesar Amor ós
Anne Pyburn
Bruce D Patterson
Nancy Sevcenko
Dwirienna Adiyanti
Ulises Chavez Jimenez
It contains Chinese English names, such as Guoyou Zhang. How can I use a program to quickly filter Chinese names.
You can have a list of Chinese surnames and commonly used Chinese characters. I don't know how to write it using a php program?
Only one database can be matched.
If this is what your product manager has proposed, please let him give up.
There are a lot of links online.
See the excel recognition method.
It is easier to distinguish Chinese Characters
How to distinguish English and Pinyin
My idea is to find a pinyin database, and then compare it to identify whether it is a Chinese name.
You can check the link below to see if the source code can be used as a pinyin library.
Http://www.oschina.net/code/snippet_862384_25415
A Chinese pinyin dictionary is required !!! Otherwise, it cannot be implemented !!!
I don't know how to deal with php. For Java, I can use pinyin4j to generate a last name pinyin table from the hundred surnames and then match it.
There is no very accurate way, because there is no way to strictly differentiate. Even if you can organize a pinyin library, there are many problems. For example, a lady married to a foreigner switched to her husband's surname, and the comparison database could not handle the problem. The names of ethnic minorities such as Tibet and Mongolia cannot be processed. In South Korea, Thailand, and other countries, surnames are spelled in the same way as Chinese characters.
The essence is to distinguish pinyin from English words and match them with regular expressions:
1. Commonly UsedThe end of the last name.The rule matches all names to obtain the initial screening data.
2. Most of the male pinyin characters are composed of the initials and the finals. This rule is used to further test the initial screening data.
Do you mean Marry Zhang is a Chinese or a foreigner?
The premise of matching is that Chinese people do not have to give themselves a foreign name like foreign companies.
What about Mike in English or Mike in Chinese? This is not a big problem.