ContentService

A video crawler that keeps track of the most recent videos on the Internet.

This project grew out of work I did about two years ago. It is a tool that crawls information about the newest videos to provide a content service for web pages.

The whole system runs on the Django framework and is written in Python. All crawlers are written in advance; when the system runs, it creates multiple processes to instantiate the crawlers in ContentService/contentservice/crawlerimpl/ and then executes the crawling tasks.
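As a rough illustration of the "one process per crawler" idea, here is a minimal sketch using Python's standard `multiprocessing` module. The names `BaseCrawler`, `ExampleListCrawler`, `ExampleContentCrawler`, `run_crawler` and `run_all` are hypothetical and are not taken from this repository; the real crawlers in crawlerimpl/ may be loaded and scheduled differently.

```python
import multiprocessing


class BaseCrawler:
    """Hypothetical base class; not the project's actual crawler interface."""
    name = "base"

    def crawl(self):
        raise NotImplementedError


class ExampleListCrawler(BaseCrawler):
    name = "example_list"

    def crawl(self):
        # A real crawler would fetch a site here; this one returns fixed data.
        return ["video-1", "video-2"]


class ExampleContentCrawler(BaseCrawler):
    name = "example_content"

    def crawl(self):
        return ["details-for-video-1"]


def run_crawler(crawler_cls):
    """Instantiate one crawler and return (name, result)."""
    crawler = crawler_cls()
    return crawler.name, crawler.crawl()


def _worker(crawler_cls, results):
    # Runs inside the child process and reports back through the queue.
    results.put(run_crawler(crawler_cls))


def run_all(crawler_classes):
    """Start one process per crawler class and collect what each returns."""
    results = multiprocessing.Queue()
    workers = [
        multiprocessing.Process(target=_worker, args=(cls, results))
        for cls in crawler_classes
    ]
    for worker in workers:
        worker.start()
    # Drain the queue before joining so children never block on a full pipe.
    collected = dict(results.get() for _ in workers)
    for worker in workers:
        worker.join()
    return collected


if __name__ == "__main__":
    print(run_all([ExampleListCrawler, ExampleContentCrawler]))
```

Keeping the process start-up under the `__main__` guard matters on platforms where child processes re-import the main module.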

There are two kinds of crawlers: ListCrawler and ContentCrawler. The ListCrawler crawls the video list and saves each task into MongoDB, ignoring the details of the video. The ContentCrawler fetches the tasks that the ListCrawler created from MongoDB and executes the detailed crawl.
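The hand-off between the two crawler kinds can be sketched as follows. This is only an illustration under assumptions: `InMemoryTaskStore` is a hypothetical stand-in for the MongoDB collection so the example runs without a database, and the method names, task fields and `fake_fetch_details` helper are invented for the sketch rather than copied from the project.

```python
class InMemoryTaskStore:
    """Stand-in for the MongoDB task collection (assumption, not project code)."""

    def __init__(self):
        self._tasks = []

    def save(self, task):
        # Store a copy of the task, flagged as not yet processed.
        self._tasks.append(dict(task, done=False))

    def fetch_pending(self):
        return [task for task in self._tasks if not task["done"]]

    def mark_done(self, task):
        task["done"] = True


class ListCrawler:
    """Records which videos exist, without fetching their details."""

    def __init__(self, store):
        self.store = store

    def crawl(self, listing):
        # `listing` mimics a parsed list page: only ids and URLs.
        for video_id, url in listing:
            self.store.save({"video_id": video_id, "url": url})


class ContentCrawler:
    """Picks up the saved tasks and fetches the details for each one."""

    def __init__(self, store, fetch_details):
        self.store = store
        self.fetch_details = fetch_details

    def crawl(self):
        videos = []
        for task in self.store.fetch_pending():
            videos.append(self.fetch_details(task))
            self.store.mark_done(task)
        return videos


def fake_fetch_details(task):
    # A real implementation would download and parse the page at task["url"].
    return {"video_id": task["video_id"], "title": "Title of " + task["video_id"]}
```

A possible usage: run `ListCrawler(store).crawl(...)` first to fill the store, then `ContentCrawler(store, fake_fetch_details).crawl()` to drain it. Separating the two steps lets the cheap listing pass run often while the slower detail pass works through the backlog.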

If you have any questions while using this crawling tool, I'm glad to help O(∩_∩)O~

hitflame/ContentService
