Handling HTML entity characters in URLs in Java



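The method below fetches a page, extracts the link that follows the "previous page" (上一頁) anchor, and decodes HTML entities such as &amp; in the resulting URL with StringEscapeUtils.unescapeHtml4 from commons-lang3.

<!-- https://mvnrepository.com/artifact/org.apache.commons/commons-lang3 -->
<dependency>
    <groupId>org.apache.commons</groupId>
    <artifactId>commons-lang3</artifactId>
    <version>3.4</version>
</dependency>

import java.net.URI;
import java.net.URLDecoder;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import org.apache.commons.lang3.StringEscapeUtils;
import org.springframework.http.client.HttpComponentsClientHttpRequestFactory;
import org.springframework.web.client.RestTemplate;

public static String getNextPage(String web) throws Exception {
    HttpComponentsClientHttpRequestFactory factory = new HttpComponentsClientHttpRequestFactory();
    // factory.setConnectTimeout(60000);
    // factory.set
    // Matches the "previous page" link followed by the next anchor tag.
    String regx = "(上一頁</a>)(<a.*?href=[\\\"']?(((http|https)?://)?/?[^\\\"']+)[\\\"']?.*?>(.+)</a>)";
    RestTemplate template = new RestTemplate();
    URI uri = new URI(URLDecoder.decode(web, "utf-8"));
    String stri = template.getForObject(uri, String.class);
    Pattern pattern = Pattern.compile(regx);
    Matcher matcher = pattern.matcher(stri);
    if (!matcher.find()) {
        throw new IllegalStateException("No next-page link found in " + web);
    }
    String group = matcher.group();
    // Keep the path between href="/ and the title attribute.
    group = group.substring(group.indexOf("href=\"/") + 7, group.indexOf("\" title=\""));
    group = "http://www.youbianku.com/" + group;
    // Decode HTML entities (for example &amp; to &) so the URL can be requested.
    group = StringEscapeUtils.unescapeHtml4(group);
    return group;
}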
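To show why the unescape step matters, here is a minimal sketch that uses only the JDK. An href copied out of HTML source usually carries `&amp;` where the real URL has `&`, so it has to be decoded before it is requested. The class `EntityUrlDemo` and the helpers `unescapeBasicEntities` and `firstHref` are hypothetical names introduced for illustration. The hand-written decoder covers only five common entities and is a simplified stand-in for `StringEscapeUtils.unescapeHtml4`, not a replacement for it.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class EntityUrlDemo {

    // Hypothetical helper: decodes only five entities often seen in href
    // attributes. StringEscapeUtils.unescapeHtml4 handles the full HTML 4 set.
    public static String unescapeBasicEntities(String s) {
        return s.replace("&lt;", "<")
                .replace("&gt;", ">")
                .replace("&quot;", "\"")
                .replace("&#39;", "'")
                .replace("&amp;", "&"); // last, so "&amp;lt;" becomes "&lt;", not "<"
    }

    // Hypothetical helper: returns the decoded href of the first <a> tag,
    // or null when the input contains no quoted href.
    public static String firstHref(String html) {
        Matcher m = Pattern.compile("<a\\s[^>]*?href=[\"']([^\"']+)[\"']").matcher(html);
        return m.find() ? unescapeBasicEntities(m.group(1)) : null;
    }

    public static void main(String[] args) {
        // Assumed sample markup, not taken from any real page.
        String html = "<a href=\"/list?page=2&amp;size=10\" title=\"next\">next</a>";
        System.out.println(firstHref(html));
    }
}
```

Reading the URL from a capture group (`m.group(1)`) avoids the index arithmetic on `href="/` and `" title="`, which fails if the attribute order on the page changes.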

 
