참조:
https://blog.naver.com/gomdol1551/222872778308
https://blog.naver.com/happy-ds/222335771358
https://blog.naver.com/codef_api/222721006534
(영문 해석)
The short answer is that web scraping is about extracting the data from one or more websites. While crawling is about finding or discovering URLs or links on the web.
Usually, in web data extraction projects, you need to combine crawling and scraping. So you first crawl - or discover - the URLs, download the HTML files, and then scrape the data from those files. This means you extract data and do something with it, like storing it in a database or further processing it.
- In web scraping, it's all about the data. The data fields you want to extract from specific websites. And it's a big difference because with scraping you usually know the target websites, you may not know the specific page URLs, but you know the domains at least.
- With crawling, you probably don't know the specific URLs and you probably don't know the domains either. And this is the reason you crawl: you want to find the URLs. So that you can do something with them later. For example, search engines crawl the web so they can index pages and display them in the search results.
Source:
https://www.zyte.com/learn/difference-between-web-scraping-and-web-crawling/
https://brightdata.com/blog/leadership/web-crawling-vs-web-scraping