from lxml import etree
html = '''
<li class="tag_1">需要的内容1
<a>需要的内容2</a>
</li>
'''
selector = etree.HTML(html )
contents = selector.xpath ( '//li[@class = "tag_1"]')
contents1 = selector.xpath ( '//li[@class = "tag_1"]')[0]
contents2 = contents1.xpath('string(.)')
contents3 = selector.xpath ( '//li[@class = "tag_1"]/text()')
print(contents) #[<Element li at 0x2c55e88>]
print(contents1) # <Element li at 0x2c55e88>
print(contents2)
print(contents3)
输出
D:\Python\venv\Scripts\python.exe D:/Python/venv/test9.py
[<Element li at 0x2c75ec8>]
<Element li at 0x2c75ec8>
需要的内容1
需要的内容2
['需要的内容1\n ', '\n ']
Process finished with exit code 0
string(.)可以用于提取标签嵌套标签的内容。
免责声明:本站发布的内容(图片、视频和文字)以原创、转载和分享为主,文章观点不代表本网站立场,如果涉及侵权请联系站长邮箱:is@yisu.com进行举报,并提供相关证据,一经查实,将立刻删除涉嫌侵权内容。