- BeautifulSoup, doc
- lxml
- html5lib
- html.parser (standard module)
- ElementTree (standard module)
- xml.dom
- xml.dom.minidom
-
ghost.py - webkit web client
- Xvfb- virtual framebuffer for headless browsing
https://realpython.com/blog/python/headless-selenium-testing-with-python-and-phantomjs/
https://github.com/dhamaniasad/HeadlessBrowsers
https://code.google.com/archive/p/pywebkitgtk/
https://github.com/teddziuba/stanislaw
http://n1k0.github.io/casperjs/
https://github.com/makinacorpus/spynner
https://www.gnu.org/software/pythonwebkit/
https://www.blog.pythonlibrary.org/2012/05/06/website-automation-with-python-firefox-and-selenium/
http://mydataprovider.com/2016/06/28/compare-web-scraping-services-by-price/
https://www.import.io/builder/
https://scrapinghub.com/crawlera/
https://scrapinghub.com/scrapy-cloud/
https://blog.scrapinghub.com/2016/09/28/how-to-run-python-scripts-in-scrapy-cloud/
http://stackoverflow.com/questions/2861/options-for-html-scraping
https://pythonhosted.org/pyquery/
See also