Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Appearance settings

learnerada/zhihu

Open more actions menu
 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

知乎爬虫————实现验证码自动识别

1.
使用keras框架搭建小型VGG卷积神经网络:
网络分为四个卷积层(卷积核3X3),两个采样层。一个全连接层
2.
从www.zhihu.com/captcha.gif爬取验证码图片作为训练样本
3.
分别训练出一个切割器和一个识别器
4.
网络模型和网络权重保存在.h5文件中可直接读取
5.
只需修改zhihu.py文件中的登录邮箱及密码,运行zhihu.py即可 
当然前提是要先安装好keras:)
6.
keras的安装可参考https://keras-cn.readthedocs.io/en/latest/

由于验证码图片较小,VGG的运算量也不会很大,不影响实现
具体结构为(由于图片样本较小,所以卷积核特特征通道数较小):
--------conv 3X3, 4通道---------卷基层
--------conv 3X3, 4通道---------卷基层
--------pool 2X2--------------------采样层
--------conv 3X3, 8通道---------卷基层
--------conv 3X3, 8通道---------卷基层
--------pool 2X2--------------------采样层
--------64神经元-------------------全连接隐藏层
--------16神经元-------------------全连接隐藏层
--------分类--------------------------全连接输出层


About

知乎爬虫(验证码自动识别)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%
Morty Proxy This is a proxified and sanitized view of the page, visit original site.