Group: Wanghaomiao
1. JsoupXpath (12 usages)
cn.wanghaomiao » JsoupXpath (License: Apache)
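An easy-to-use, powerful XPath-based HTML parser. The HTML DOM tree is built with Jsoup. The lexer and parser are based on Antlr4 and support the complete W3C XPath 1.0 standard syntax. W3C specification: http://www.w3.org/TR/1999/REC-xpath-19991116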
Last Release on Mar 7, 2023
3. SeimiCrawler Package Plugin
cn.wanghaomiao » maven-seimicrawler-plugin
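Packages a SeimiCrawler project so that it can be deployed quickly as a standalone application. It is a modified version of maven-war-plugin.
A Maven release tool built specifically for SeimiCrawler projects, intended to simplify the release and deployment workflow for developers. The plugin is derived from Apache's maven-war-plugin and is still released under the Apache License, Version 2.0.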
Last Release on Dec 8, 2016
4. SeimiCrawler
cn.wanghaomiao » SeimiCrawler (License: Apache)
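A crawler framework that supports distributed operation and is efficient both to develop with and to run. Its design combines the strengths of Spring and Scrapy. A powerful, agile, distributed crawler framework.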
Last Release on Apr 25, 2023