本篇文章给大家分享的是有关Python中怎么保存搜索引擎结果,小编觉得挺实用的,因此分享给大家学习,希望大家阅读完这篇文章后可以有所收获,话不多说,跟着小编一起来看看吧。
#!/usr/bin/env python #-*- encoding:utf-8 -*- import sys import re import httplib def request_and_save(conn, query_str, f): conn.request(”GET”, query_str) r1 = conn.getresponse() line = r1.read() p1 = re.compile(”<a href=”) p2 = re.compile(”</a>”) match_start = 0 match_end = 0 while line: m = p1.search(line) if m is None: break mmatch_start = m.start() lineline = line[match_start:] m = p2.search(line) if m is None: break mmatch_end = m.end() item = line[:match_end] if item.find(”class=l”) == -1: lineline = line[match_end:] continue f.write(item) f.write(”<br>\n”); lineline = line[match_end:] # end function if len(sys.argv) < 2: print “Usage: google.py words” sys.exit(0) query_str = “/search?aq=f\&complete=1\&hl=zh-CN\&newwindow=1\&q=” + sys.argv[1] query_str2 = “/search?q=” + sys.argv[1] + “\&complete=1\&hl=zh-CN\&newwindow=1\&start=10\&sa=N” query_str3 = “/search?q=” + sys.argv[1] + “\&complete=1\&hl=zh-CN\&newwindow=1\&start=20\&sa=N” query_str4 = “/search?q=” + sys.argv[1] + “\&complete=1\&hl=zh-CN\&newwindow=1\&start=30\&sa=N” query_str5 = “/search?q=” + sys.argv[1] + “\&complete=1\&hl=zh-CN\&newwindow=1\&start=40\&sa=N” query_str6 = “/search?q=” + sys.argv[1] + “\&complete=1\&hl=zh-CN\&newwindow=1\&start=50\&sa=N” query_str7 = “/search?q=” + sys.argv[1] + “\&complete=1\&hl=zh-CN\&newwindow=1\&start=60\&sa=N” html_header = “<html><head><meta http-equiv=\”content-type\” content=\”text/html;charset=gb2312\”><title>kf701 python search tool</title></head><body>\n” html_header += “<p align=center><font size=3>kf701 python search tool</font></p>” html_end = “</body></html>” conn = httplib.HTTPConnection(”www.google.cn“) print ‘Search ‘ + sys.argv[1] + ‘, Save result in ‘ + sys.argv[1] + ‘-search.html’ f = file( sys.argv[1] + “-search.html”, “w”) f.write(html_header); request_and_save(conn, query_str, f) request_and_save(conn, query_str2, f) request_and_save(conn, query_str3, f) request_and_save(conn, query_str4, f) request_and_save(conn, query_str5, f) request_and_save(conn, query_str6, f) request_and_save(conn, query_str7, f) f.write(html_end) f.close() conn.close()
以上就是Python中怎么保存搜索引擎结果,小编相信有部分知识点可能是我们日常工作会见到或用到的。希望你能通过这篇文章学到更多知识。更多详情敬请关注亿速云行业资讯频道。
亿速云「云服务器」,即开即用、新一代英特尔至强铂金CPU、三副本存储NVMe SSD云盘,价格低至29元/月。点击查看>>
免责声明:本站发布的内容(图片、视频和文字)以原创、转载和分享为主,文章观点不代表本网站立场,如果涉及侵权请联系站长邮箱:is@yisu.com进行举报,并提供相关证据,一经查实,将立刻删除涉嫌侵权内容。