Salamer Zhihu Crawler Save

a crawler for zhihu

Project README

#Zhihu_Crawler


this is web crawler for zhihu.com

the crawler use Redis for checking the url has been crawled or not,and use mongodb for storing data.

if you wanna print out the data,run:

python engine.py --mongo

the crawler would store the data in mongodb

but if just run :

python engine.py

you will see

************************************************************
用户名:Mingo鸣哥

用户性别:female

用户地址:香港

被同意:59960

被感谢:14474

被关注:39055

关注了:806

工作:记者/
教育:香港中文大学 (Chinese University of Hong Kong)/新媒体
************************************************************
Open Source Agenda is not affiliated with "Salamer Zhihu Crawler" Project. README Source: salamer/Zhihu_Crawler
Stars
95
Open Issues
2
Last Commit
7 years ago
License

Open Source Agenda Badge

Open Source Agenda Rating