

Python Crawler Example: Querying the National List of Dishonest Judgment Debtors

2020-02-22 23:58:19
Contributed by: a site user

This article shows, by example, how to query the national list of dishonest judgment debtors with a Python crawler. It is shared here for reference; the details follow.

1. Requirements

Use Baidu's interface to implement a lookup against the national list of dishonest judgment debtors: given a name, check whether that name appears on the list.

2. Python implementation

Version 1:

# -*- coding: utf-8 -*-
# Python 2.7 script (print statements, reload(sys)).
import sys
reload(sys)
sys.setdefaultencoding('utf-8')
import time
import json
import requests
import pandas as pd

time1 = time.time()
iname = []
icard = []


def person_executed(name):
    # Request up to 30 pages of 10 records each.
    for i in range(0, 30):
        try:
            url = "https://sp0.baidu.com/8aQDcjqpAAV3otqbppnN2DJv/api.php?resource_id=6899" \
                  "&query=%E5%A4%B1%E4%BF%A1%E8%A2%AB%E6%89%A7%E8%A1%8C%E4%BA%BA%E5%90%8D%E5%8D%95" \
                  "&cardNum=&" \
                  "iname=" + str(name) + \
                  "&areaName=" \
                  "&pn=" + str(i * 10) + \
                  "&rn=10" \
                  "&ie=utf-8&oe=utf-8&format=json"
            html = requests.get(url).content
            html_json = json.loads(html)
            html_data = html_json['data']
            for each in html_data:
                k = each['result']
                for record in k:
                    print record['iname'], record['cardNum']
                    iname.append(record['iname'])
                    icard.append(record['cardNum'])
        except Exception:
            # Skip any page that fails to download or parse.
            pass


if __name__ == '__main__':
    name = "郭**"
    person_executed(name)
    print len(iname)
    # Organize the data into a DataFrame.
    data = pd.DataFrame({"name": iname, "IDCard": icard})
    # Drop duplicate rows.
    data1 = data.drop_duplicates()
    print data1
    print len(data1)
    # Write the data to an Excel file.
    data1.to_excel("F://iname_icard_query.xlsx", header=True, encoding='gbk', index=False)
    time2 = time.time()
    print u'ok, crawl finished!'
    print u'Total time: ' + str(time2 - time1) + 's'
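The request URL in version 1 is assembled by string concatenation, with the page offset `pn` advancing in steps of `rn=10`. As a minimal sketch of the same idea in Python 3, the parameters can be passed to `urllib.parse.urlencode`, which also percent-encodes the name. The helper name `build_query_url` and the constants are illustrative assumptions, not part of the original script; the endpoint and parameter names are copied from the article, and whether the service still responds this way is not verified here.

```python
from urllib.parse import urlencode

# Endpoint and parameter names are copied from the article's URL.
# Assumption: the service still accepts them; this sketch sends no request.
BASE_URL = "https://sp0.baidu.com/8aQDcjqpAAV3otqbppnN2DJv/api.php"
PAGE_SIZE = 10


def build_query_url(name, page):
    """Return the request URL for one page of results (page starts at 0)."""
    params = [
        ("resource_id", "6899"),
        ("query", "失信被执行人名单"),  # decoded form of the article's query value
        ("cardNum", ""),
        ("iname", name),
        ("areaName", ""),
        ("pn", str(page * PAGE_SIZE)),  # record offset, not page number
        ("rn", str(PAGE_SIZE)),         # records per page
        ("ie", "utf-8"),
        ("oe", "utf-8"),
        ("format", "json"),
    ]
    return BASE_URL + "?" + urlencode(params)


if __name__ == "__main__":
    # Print the URLs for the first three pages of a lookup.
    for page in range(3):
        print(build_query_url("郭**", page))
```

Keeping the parameters in a list makes it easier to see which fields are left blank (`cardNum`, `areaName`) and which one changes between pages (`pn`).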

3. Results

"D:/Program Files/Python27/python.exe" D:/PycharmProjects/learn2017/全国失信被执行人查询.py
郭** 34122319790****5119
郭** 32032119881****2419
郭** 32032119881****2419
3
                IDCard name
0  34122319790****5119  郭**
1  32032119881****2419  郭**
2
ok, crawl finished!
Total time: 7.72000002861s
Process finished with exit code 0
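The output above shows three records collected and two rows left after `drop_duplicates()`, which keeps the first occurrence of each row by default. As a rough illustration of that step, the following Python 3 sketch does the same thing in plain Python, without pandas. The function name `drop_duplicate_rows` is an illustrative assumption; the rows are the masked values from the output above.

```python
def drop_duplicate_rows(rows):
    """Keep the first occurrence of each row, preserving input order."""
    seen = set()
    unique = []
    for row in rows:
        if row not in seen:
            seen.add(row)
            unique.append(row)
    return unique


# (name, masked ID number) pairs taken from the output shown above.
rows = [
    ("郭**", "34122319790****5119"),
    ("郭**", "32032119881****2419"),
    ("郭**", "32032119881****2419"),
]
unique_rows = drop_duplicate_rows(rows)
print(len(rows), len(unique_rows))  # prints: 3 2
```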

Version 2:

# -*- coding: utf-8 -*-
# Python 2.7 script (print statements, reload(sys)).
import sys
reload(sys)
sys.setdefaultencoding('utf-8')
import time
import json
import requests
import pandas as pd

time1 = time.time()
iname = []
icard = []
courtName = []
areaName = []
caseCode = []
duty = []
performance = []
disruptTypeName = []
publishDate = []


def person_executed(name):
    # Request up to 30 pages of 10 records each.
    for i in range(0, 30):
        try:
            url = "https://sp0.baidu.com/8aQDcjqpAAV3otqbppnN2DJv/api.php?resource_id=6899" \
                  "&query=%E5%A4%B1%E4%BF%A1%E8%A2%AB%E6%89%A7%E8%A1%8C%E4%BA%BA%E5%90%8D%E5%8D%95" \
                  "&cardNum=&" \
                  "iname=" + str(name) + \
                  "&areaName=" \
                  "&pn=" + str(i * 10) + \
                  "&rn=10" \
                  "&ie=utf-8&oe=utf-8&format=json"
            html = requests.get(url).content
            html_json = json.loads(html)
            html_data = html_json['data']
            for each in html_data:
                k = each['result']
                for record in k:
                    print record['iname'], record['cardNum'], record['courtName'], \
                        record['areaName'], record['caseCode'], record['duty'], \
                        record['performance'], record['disruptTypeName'], record['publishDate']
                    iname.append(record['iname'])
                    icard.append(record['cardNum'])
                    courtName.append(record['courtName'])
                    areaName.append(record['areaName'])
                    caseCode.append(record['caseCode'])
                    duty.append(record['duty'])
                    performance.append(record['performance'])
                    disruptTypeName.append(record['disruptTypeName'])
                    publishDate.append(record['publishDate'])
        except Exception:
            # Skip any page that fails to download or parse.
            pass


if __name__ == '__main__':
    name = "郭**"
    person_executed(name)
    print len(iname)
    # Organize the data into a DataFrame.
    # data = pd.DataFrame({"name": iname, "IDCard": icard})
    detail_data = pd.DataFrame({"name": iname, "IDCard": icard, "courtName": courtName,
                                "areaName": areaName, "caseCode": caseCode, "duty": duty,
                                "performance": performance,
                                "disruptTypeName": disruptTypeName,
                                "publishDate": publishDate})
    # Drop duplicate rows.
    # data1 = data.drop_duplicates()
    # print data1
    # print len(data1)
    detail_data1 = detail_data.drop_duplicates()
    # print detail_data1
    # print len(detail_data1)
    # Write the data to an Excel file.
    detail_data1.to_excel("F://iname_icard_query.xlsx", header=True, encoding='gbk', index=False)
    time2 = time.time()
    print u'ok, crawl finished!'
    print u'Total time: ' + str(time2 - time1) + 's'
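Version 2 keeps nine parallel lists in step with each other, one per field. A possible alternative, sketched here in Python 3, is to collect one dictionary per record from the same `data` → `result` structure that the script reads. The function name `extract_records` and the sample payload are hypothetical: the payload contains placeholder values only and merely mirrors the nesting the article's code assumes, not a real API response.

```python
# Field names are the keys read by the article's version 2 script.
FIELDS = ["iname", "cardNum", "courtName", "areaName", "caseCode",
          "duty", "performance", "disruptTypeName", "publishDate"]


def extract_records(payload, fields=FIELDS):
    """Flatten payload['data'][*]['result'][*] into a list of dicts."""
    records = []
    for block in payload.get("data", []):
        for item in block.get("result", []):
            # Missing fields become empty strings instead of raising KeyError.
            records.append({field: item.get(field, "") for field in fields})
    return records


# Hypothetical payload with placeholder values, shaped like the JSON
# that the article's code expects.
sample = {"data": [{"result": [
    {"iname": "A", "cardNum": "0000", "courtName": "Example Court"},
    {"iname": "B", "cardNum": "1111", "areaName": "Example Area"},
]}]}

records = extract_records(sample)
print(len(records))  # prints: 2
```

A list of dictionaries like `records` can be passed directly to `pd.DataFrame(records)`, so the deduplication and Excel export steps would stay the same.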