I recently worked on a Weibo workstation, but I was unable to find it. I had to write one as needed.
Recognize Chinese characters and spaces as ending symbols
The short address recognition regular expression is as follows:
(HTTP | HTTPS | FTP | ftps ):\/\/)? ([A-zA-Z0-9-] + \.) {1, 5} (COM | CN | net | org | HK | tw) (\/(\ w |-) + (\. ([A-Za-Z] + ))?) + )? (\/)? (\?? ([\. %: A-zA-Z0-9 _-] + = [# \. %: a-zA-Z0-9 _-] + (& amp ;)?) + )?
Explanation:
(HTTP | HTTPS | FTP | ftps ):\/\/)?
Indicates whether the request header is contained.
([A-zA-Z0-9-] + \.) {} (COM | CN | net | org | HK | tw)
Indicates the domain name itself, including the 5th pole and domain name suffix
(\/(\ W |-) + (\. ([A-Za-Z] + ))?) + )? (\/)?
The domain name address followed by the domain name.
It contains the following information: "/", "/***. ***"
(\/)? (\?? ([\. %: A-zA-Z0-9 _-] + = [# \. %: a-zA-Z0-9 _-] + (& amp ;)?) + )?
Indicates the encoded URL address.
Contains "? * ** = ***","? * ** = *** & *** = *** # Sdsd"
Disadvantages:
Like Sina Weibo, it is easy to use the # symbol of a topic as part of a short address.