后浪云Python教程：python如何匹配中文

2021-11-26

中文字符的编码范围是：

\u4e00-\u9fa5

使用正则匹配中文

# -*- coding:utf-8 -*-

import re

'''
python 3.5版本
正则匹配中文，固定形式：\u4E00-\u9FA5
'''

words = 'study in 山海大学'
regex_str = ".*?([\u4E00-\u9FA5]+大学)"
match_obj = re.match(regex_str, words)
if match_obj:
    print(match_obj.group(1))


结果：山海大学

作者：后浪云

链接：https://www.idc.net/help/178302/

文章版权归作者所有，未经允许请勿转载。

THE END