正则表达式 - Python 中如何用正则匹配中文词组

Question

情景如下，网页中有一段： {代码...} 用 BeautifulSoup4 和 Requests 抓取一段网页内容，如果匹配到有“没有复本” 字样，就抛出异常。 如何实现用正则匹配特定的中文词组呢？ （PS 问：如何在 BeautifulSoup4 中搜...

伊谢尔伦 · Answer

Code

#! /usr/bin/env python
# -*- coding: utf-8 -*-

content = """

    此书刊没有复本


      此书刊可能正在订购中或者处理中



get_text() Get all the text content from the tag, but it is unicode encoded. After encoding it with utf-8, you can directly search with regular expressions.