Tuesday, March 5, 2019

Python web-crawling framework

https://scrapy.org/

Scrapy is a free and open-source web-crawling framework written in Python. Originally designed for web scraping, it can also be used to extract data using APIs or as a general-purpose web crawler. It is currently maintained by Scrapinghub Ltd., a web-scraping development and services company.



Beautiful Soup is a Python package for parsing HTML and XML documents. It creates a parse tree for parsed pages that can be used to extract data from HTML, which is useful for web scraping. It is available for Python 2.7 and Python 3. 

https://www.crummy.com/software/BeautifulSoup/
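As a rough illustration of the parse tree idea, the sketch below parses a small inline HTML string and pulls out the page title and the links. The HTML snippet and the variable names are made up for this example, and it assumes the `beautifulsoup4` package is installed; `html.parser` is the parser bundled with Python's standard library.

```python
from bs4 import BeautifulSoup

# A small, made-up HTML document used only for this example.
html = """
<html><head><title>Example page</title></head>
<body>
  <a href="https://scrapy.org/">Scrapy</a>
  <a href="https://www.crummy.com/software/BeautifulSoup/">Beautiful Soup</a>
</body></html>
"""

# Build the parse tree with the standard-library HTML parser.
soup = BeautifulSoup(html, "html.parser")

# Navigate the tree: the <title> text and every <a> tag's text and href.
title = soup.title.string
links = [(a.get_text(), a["href"]) for a in soup.find_all("a")]

print(title)
for text, href in links:
    print(text, href)
```

In a real scraper the HTML would typically come from an HTTP response rather than a literal string, but the parsing and navigation steps stay the same.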
