頭條搜索站長(zhǎng)平臺(tái)-適配關(guān)系正則說明
PC站點(diǎn)下以目錄站點(diǎn)作為移動(dòng)端站點(diǎn)時(shí),作這種行為對(duì)搜索引擎極不友好,主流搜索引擎一直不贊成不鼓勵(lì)這種建方式。為了滿足該需求,頭條站長(zhǎng)平臺(tái)移動(dòng)適配工具提供滿足此需求的功能。移動(dòng)端適配功能的添加有助站點(diǎn)鏈接被收錄。
適配規(guī)則:
1)規(guī)則適配:PC地址和移動(dòng)端地址存在對(duì)應(yīng)關(guān)系時(shí)候,可以添加PC和移動(dòng)端的適配規(guī)則表達(dá)式,進(jìn)行適配(如PC頁(yè)面https://www.toutiao.com/i7001293156146840067/ 移動(dòng)頁(yè)面https://m.toutiao.com/i7001293156146840067可提交適配關(guān)系)。頭條站長(zhǎng)平臺(tái)推薦站長(zhǎng)使用規(guī)則適配進(jìn)行提交,對(duì)于新增同類型URL可以持續(xù)生效,該方式處理周期相對(duì)URL處理更短。
2)URL適配:當(dāng)時(shí)站點(diǎn)URL不滿足適配規(guī)則時(shí),站長(zhǎng)可以通過URL適配進(jìn)行URL規(guī)則批量提交。文件格式為:每行前后兩個(gè)URL,分別是PC鏈接和移動(dòng)鏈接,中間用空格分隔,一個(gè)文件最多可以提交5萬(wàn)對(duì)url,可進(jìn)行多個(gè)文件提交。
規(guī)則適配說明:
數(shù)字規(guī)則
url對(duì)應(yīng)關(guān)系:
https://www.tt.com/123456.html-> https://m.tt.com/123456.html
規(guī)則:
https://www.tt.com/([0-9]+).html-> https://m.tt.com/${1}.html
url對(duì)應(yīng)關(guān)系:
https://www.tt.com/b123456.html-> https://m.tt.com/26299483.html
規(guī)則:
https://www.tt.com/b([0-9]+).html-> https://m.tt.com/${1}.html
字母規(guī)則:
url對(duì)應(yīng)關(guān)系:
https://www.tt.com/news/ -> https://m.tt.com/news/
規(guī)則:
https://www.tt.com/([a-zA-Z]+)/ -> https://m.tt.com/${1}/
字母和數(shù)字混合規(guī)則(字母和數(shù)字混合字符串,字母和數(shù)字出現(xiàn)多次):
url對(duì)應(yīng)關(guān)系:
https://www.tt.com/a1b2c3d4e5f6/ -> https://m.tt.com/a1b2c3d4e5f6/
規(guī)則:
https://www.tt.com/((?:[a-zA-Z]+[0-9]+|[0-9]+[a-zA-Z]+)[a-zA-Z0-9]+)/ -> https://m.tt.com/${1}/
url對(duì)應(yīng)關(guān)系:
https://by.tt.com/01/02/03/a1b2c3d4e5f6.html-> https://m.tt.com/by/01/02/03/a1b2c3d4e5f6.html
規(guī)則:
https://news.tt.com/([0-9]+)/([0-9]+)/([0-9]+)/([ a-zA-Z0-9]+).html-> https://m.tt.com/news/${1}/${2}/${3}/${4}.html
字母和數(shù)字混合規(guī)則(字母和數(shù)字混合字符串,字母和數(shù)字出現(xiàn)1次)
url對(duì)應(yīng)關(guān)系:
https://www.tt.com/az123/ -> https://m.tt.com/az123/
規(guī)則:
https://www.tt.com/([a-zA-Z]+)([0-9]+)/-> https://m.tt.com/${1}${2}/
中文字符串規(guī)則:
url對(duì)應(yīng)關(guān)系:
https://www.tt.com/站長(zhǎng)平臺(tái)/ -> https://m.tt.com/站長(zhǎng)平臺(tái)/
規(guī)則:
https://www.tt.com/((?:%[a-zA-Z0-9]{2,})+)/-> https://m.tt.com/${1}/
鏈接字符'-'或者'_'連接的數(shù)字或者字母規(guī)則
url對(duì)應(yīng)關(guān)系:
https://www.tt.com/by-a1-by/-> https://m.tt.com/by-a1-by/
規(guī)則:
https://www.tt.com/([a-zA-Z]+)-([a-zA-Z]+)([0-9]+)-([a-zA-Z]+)/->https://m.tt.com/${1}-${2}${3}/
對(duì)參數(shù)部分進(jìn)行正則替換生成pattern的例子:
url對(duì)應(yīng)關(guān)系:
http://www.tt.com/news.html?id=123 -> http://m.tt.com/news.html?id=123
規(guī)則:
http://www.abc.com/article\.html?id=([^&]+) -> http://m.abc.com/article.html?id=${2}
來源:頭條搜索站長(zhǎng)平臺(tái)