GitHub - web-user/python-scrapy: Framework for extracting the data you need from websites.

##Project to scrapy Tripadvisor.com

Consist of three spiders:

$ tripadvisor_hotel_url - first run this spider, it will scrape all urls from

target city

$ tripadvisor_hotel scrape main hotel info

$ tripadvisor_rating scrape main hotel rating info

$ tripadvisor_review scrape all reviews from hotel page

Project uses:

-python 3v
-mongodb 4.0
-pip3
-chromdriver (for selenium)

to launch spider:
   'scrapy crawl <spider_name> -a start_url=https://www.tripadvisor.com.sg/Hotels-g293916-Bangkok-Hotels.html -a city=Bangkok'

all spider stores data in mongodb

Name	Name	Last commit message	Last commit date
Latest commit History 4 Commits 4 Commits
trip	trip
.gitignore	.gitignore
6+eG5HuzYy	6+eG5HuzYy
LICENSE	LICENSE
README.md	README.md
main.py	main.py
scrapy.cfg	scrapy.cfg

Provide feedback