Crawling - Pokemon image collection

화이티 ·2023년 12월 18일

Crawling

목록 보기
4/7

How to collect Pokemon img:

1. import library

from selenium import webdriver as wb
from bs4 import BeautifulSoup as bs
import requests as req

import pandas as pd

from urllib.request import urlretrieve
  • 2. get link
driver = wb.Chrome()
url = 'https://www.pokemonkorea.co.kr/pokedex#pokedex_1'
driver.get(url)

head_option = {
'user-agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/118.0.0.0 Safari/537.36'}

res= req.get(url, headers = head_option)
html = bs(res.text,'lxml')
  • 3. get the img link
img= html.select('img.img-fluid')

urlretrieve(img[0]['src'],'C:\\Users\\dd\\AppData\\Local\\Temp\\Pokemon\\pokemon1.jpg')

data = []
for i in range(18):
data.append(urlretrieve(img[i]['src'],f"C:\\Users\\dd\\AppData\\Local\\Temp\\Pokemon\\pokemon{i}.jpg"))
data

Method2:

imgs = driver.find_elements(By.CLASS_NAME,'img-fluid')
len(imgs)

imgs[0].get_attribute('src')

for i in range(len(imgs)):
	      urlretrieve(imgs[i].get_attribute('src'),f"C:\\\\Users\\\\dd\\\\AppData\\\\Local\\\\Temp\\\\Pokemon\\\\pokemon{i}.jpg")
profile
열심히 공부합시다! The best is yet to come! 💜

0개의 댓글