forked from ferventdesert/Hawk

visualized crawler & ETL IDE written with C#/WPF

fundoop/Hawk

Hawk: Advanced ETL & Crawler Stream Engine


Welcome to Hawk! Hawk lets you crawl web pages and clean, process, and save data graphically, without programming. It is written in C#/WPF and is open source under the GPL license.

Introduction

Hawk is named after the bird of prey, which kills its prey efficiently and accurately. Its design is inspired by the Lisp language, and its functionality mimics awk in Unix.

The key features are as follows:

  • Intelligent analysis of web content without programming.
  • WYSIWYG conversion, filtering & storage via visual drag and drop.
  • Parallel processing & high speed.
  • Support for multiple file formats & databases: XML, CSV, SQLite, MongoDB...
  • Tasks can be saved, paused, restarted & reused.
  • Focused on crawling, but its capabilities go far beyond that.

[Image: splash]

Fast & smart webpage crawling: [animated demo, 1.gif]

WYSIWYG ETL: [animated demo, 2.gif]


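Welcome to Hawk! Hawk is a graphical, WYSIWYG tool for data collection and cleaning that requires no programming, open source under the GPL license.

Introduction

The name Hawk refers to the bird, which hunts its prey efficiently and accurately. Its ideas come from the Lisp language, and its functionality mimics the Linux tool awk.

Its features are as follows:

  • Intelligent analysis of web content, with no programming required.
  • WYSIWYG, visual drag and drop for quickly building data-cleaning operations such as conversion and filtering.
  • Import from and export to a wide range of databases and files.
  • Tasks can be saved and reused.
  • It is best suited to crawling and data cleaning, but its power goes far beyond that.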
