# WebScraping

**Repository Path**: ghostsheep2020/WebScraping

## Basic Information

- **Project Name**: WebScraping
- **Description**: Source code for the book *Web Scraping with Python* (Chinese title: 《用python写网络爬虫》).
- **Primary Language**: Python
- **License**: Apache-2.0
- **Default Branch**: master
- **Homepage**: None
- **GVP Project**: No

## Statistics

- **Stars**: 0
- **Forks**: 13
- **Created**: 2022-11-18
- **Last Updated**: 2022-11-18

## Categories & Tags

**Categories**: Uncategorized

**Tags**: None

## README

![WebScraping][id]

[id]: http://git.oschina.net/liinux/WebScraping/attach_files/download?i=68892&u=http%3A%2F%2Ffiles.git.oschina.net%2Fgroup1%2FM00%2F00%2F93%2FPaAvDFgiqJuAH0LmAAKohomzzqo630.png%3Ftoken%3Dc1e0f83dc08884149e6e3a6a50aec4d5%26ts%3D1478666343%26attname%3D%E7%94%A8python%E5%86%99%E7%BD%91%E7%BB%9C%E7%88%AC%E8%99%AB.png "WebScraping"

## 1. This repository contains the source code of the examples from the book *Web Scraping with Python*, published by Packt Publishing.

## 2. The examples have been tested with Python 2.7 and depend on:

* [BeautifulSoup](http://www.crummy.com/software/BeautifulSoup/) (Ch 2)
* [lxml](http://lxml.de/) (Ch 2-9)
* [pymongo](http://api.mongodb.org/python/current/) (Ch 3-5, 9)
* [PyQt](http://www.riverbankcomputing.co.uk/software/pyqt/intro) / [PySide](https://pypi.python.org/pypi/PySide) (Ch 5)
* [ghost](http://jeanphix.me/Ghost.py/) (Ch 5)
* [Selenium WebDriver](http://www.seleniumhq.org/projects/webdriver/) (Ch 5, 9)
* [mechanize](http://wwwsearch.sourceforge.net/mechanize/) (Ch 6)
* [PIL](http://www.pythonware.com/products/pil/) / [Pillow](https://python-pillow.github.io/) (Ch 7)
* [pytesseract](https://github.com/madmaze/pytesseract) (Ch 7)
* [scrapy](http://scrapy.org/) (Ch 8)
* [portia](https://github.com/scrapinghub/portia) (Ch 8)
* [scrapely](https://github.com/scrapy/scrapely) (Ch 8)

## 3. These examples will break in the future as websites change and dependencies are updated, so [bug reports and patches](https://bitbucket.org/wswp/code/issues?status=new&status=open) are welcome.

## 4. Discussion

Open-source web crawler QQ discussion group: 322937592
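
Since the examples depend on a fairly long list of third-party packages, it may help to check which ones are importable before running a chapter. The snippet below is a minimal sketch, not part of the book's code: the function `missing_modules` and the list `ASSUMED_IMPORT_NAMES` are hypothetical, and the import names are assumptions, because the name you import often differs from the package name (for example, BeautifulSoup 4 is usually imported as `bs4`). PyQt/PySide, ghost and portia are left out because their import names vary by version.

```python
import importlib

# Assumed import names for some of the dependencies listed in this README.
# These are illustrative guesses; adjust them to the versions you install.
ASSUMED_IMPORT_NAMES = [
    "bs4",          # BeautifulSoup 4 (assumption)
    "lxml",
    "pymongo",
    "selenium",
    "mechanize",
    "PIL",          # provided by PIL or Pillow
    "pytesseract",
    "scrapy",
    "scrapely",
]


def missing_modules(names):
    """Return the entries of `names` that fail to import."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            # Raised when the module is absent or one of its own imports fails.
            missing.append(name)
    return missing


if __name__ == "__main__":
    for name in missing_modules(ASSUMED_IMPORT_NAMES):
        print("missing: %s" % name)
```

The check imports each module, so it can be slow for large packages. It is written to run on both Python 2.7 and Python 3.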