午夜剧场伦理_日本一道高清_国产又黄又硬_91黄色网战_女同久久另类69精品国产_妹妹的朋友在线

您的位置:首頁技術(shù)文章
文章詳情頁

python - seleium 爬網(wǎng)頁數(shù)據(jù),只能怕當前頁,如果我輸入兩頁的話,會出現(xiàn)初始頁數(shù)據(jù)下載兩次的情況

瀏覽:203日期:2022-07-16 15:35:58

問題描述

import requestsfrom lxml import html,etreefrom selenium import webdriverimport time, json#how many page do you want to scanpage_numnotint = input('how many page do you want to scan')page_num = int(page_numnotint)file_name = ’jd_goods_data.json’url = ’https://list.jd.com/list.html?cat=1713,3264,3414&page=1&delivery=1&sort=sort_totalsales15_desc&trans=1&JL=4_10_0#J_main ’driver = webdriver.Chrome()driver.get(url)base_html = driver.page_sourceselctor = etree.HTML(base_html)date_info = []name_data, price_data = [], []jd_goods_data = {}for q in range(page_num): i = int(1) while True:name_string = ’//*[@id='plist']/ul/li[%d]/p/p[3]/a/em/text()’ %(i)price_string = ’//*[@id='plist']/ul/li[%d]/p/p[2]/strong[1]/i/text()’ %(i)if i == 60: breakelse: i += 1name = selctor.xpath(name_string)[0]name_data.append(name)price = selctor.xpath(price_string)[0]price_data.append(price)jd_goods_data[name] = priceprint(name_data)with open(file_name, ’w’) as f: json.dump(jd_goods_data, f) time.sleep(2) driver.find_element_by_xpath(’//*[@id='J_bottomPage']/span[1]/a[10]’).click() time.sleep(2)# for k, v in jd_goods_data.items(): # print(k,v) # with open(file_name, ’w’) as f: # json.dump(jd_goods_data, f)

問題解答

回答1:

import requestsfrom lxml import html,etreefrom selenium import webdriverimport time, json#how many page do you want to scanpage_numnotint = input('how many page do you want to scan')page_num = int(page_numnotint)file_name = ’jd_goods_data.json’driver = webdriver.Chrome()date_info = []name_data, price_data = [], []jd_goods_data = {}for q in range(page_num): url = ’https://list.jd.com/list.html?cat=1713,3264,3414&page={page}&delivery=1&sort=sort_totalsales15_desc&trans=1&JL=4_10_0#J_main’.format(page=q) driver.get(url) base_html = driver.page_source selctor = etree.HTML(base_html) i = 1 while True:name_string = ’//*[@id='plist']/ul/li[%d]/p/p[3]/a/em/text()’ %(i)price_string = ’//*[@id='plist']/ul/li[%d]/p/p[2]/strong[1]/i/text()’ %(i)if i == 60: breakelse: i += 1name = selctor.xpath(name_string)[0]name_data.append(name)price = selctor.xpath(price_string)[0]price_data.append(price)jd_goods_data[name] = priceprint(name_data)with open(file_name, ’w’) as f: json.dump(jd_goods_data, f)driver.quit()

標簽: Python 編程
主站蜘蛛池模板: 中文字幕在线观看网站 | 日本中文在线 | 东方伊人免费在线观看 | 亚洲欧美一二三区 | www.久久久久久久久久 | 成人免费观看在线视频 | 看免费毛片 | 四虎永久免费在线 | 99精品在线免费观看 | 日韩在线欧美 | 亚洲爱爱视频 | 成年人网站在线观看视频 | 国产伦一区二区三区 | 成人欧美一区二区三区在线观看 | 日韩毛片一级 | 日韩综合激情 | 超级碰在线视频 | 一区二区三区免费在线视频 | 日韩网站免费观看高清 | 放几个免费的毛片出来看 | 国产精品88 | 午夜视频网址 | 大陆av片| 欧美一区二区影院 | 黄色a大片 | 黄色小视频免费 | 啪啪日韩 | 国产视频分类 | 欧美激情第三页 | 久久成年视频 | 久久久久爱| 91免费视频观看 | 亚洲tv在线观看 | 美女色视频| 欧美在线a | 国内91视频| 手机av网站| www.日韩一区 | 欧美日韩精品免费观看 | 香蕉视频入口 | 午夜黄 |