Crawler4j
Crawler4j is a Java library that provides a simple interface for crawling the web. With it, you can set up a multi-threaded web crawler in 5 minutes. It is also efficient: it has downloaded and parsed 200 pages per second on a quad-core PC with a cable connection.
Category: Crawlers | License: Apache Software License | HomePage: http://code.google.com/p/crawler4j/
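
To make the idea of a multi-threaded crawler concrete, here is a minimal sketch of the general pattern using only the Java standard library. It does not use Crawler4j's API. The class `CrawlSketch`, the methods `fetchLinks` and `crawl`, and the in-memory `web` map that stands in for real HTTP downloads are all hypothetical names chosen for illustration.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Illustrative only: a breadth-first crawl over an in-memory link graph.
// None of these names come from Crawler4j.
class CrawlSketch {

    // Hypothetical stand-in for "download a page and extract its links".
    static List<String> fetchLinks(Map<String, List<String>> web, String url) {
        return web.getOrDefault(url, List.of());
    }

    // Visits every page reachable from the seed, fetching each level of
    // the frontier in parallel on a fixed-size thread pool.
    static Set<String> crawl(Map<String, List<String>> web, String seed, int threads) {
        // Only the calling thread touches this set, so a plain HashSet is enough.
        Set<String> visited = new HashSet<>();
        visited.add(seed);
        List<String> frontier = List.of(seed);
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            while (!frontier.isEmpty()) {
                // One fetch task per URL in the current frontier.
                List<Callable<List<String>>> tasks = new ArrayList<>();
                for (String url : frontier) {
                    tasks.add(() -> fetchLinks(web, url));
                }
                // invokeAll blocks until every task in this level has finished.
                List<String> next = new ArrayList<>();
                for (Future<List<String>> result : pool.invokeAll(tasks)) {
                    for (String link : result.get()) {
                        if (visited.add(link)) { // true only for unseen URLs
                            next.add(link);
                        }
                    }
                }
                frontier = next;
            }
        } catch (InterruptedException | ExecutionException e) {
            throw new IllegalStateException("crawl interrupted or a fetch failed", e);
        } finally {
            pool.shutdown();
        }
        return visited;
    }

    public static void main(String[] args) {
        Map<String, List<String>> web = Map.of(
            "a", List.of("b", "c"),
            "b", List.of("c", "d"),
            "c", List.of("a"),
            "d", List.of());
        System.out.println(crawl(web, "a", 4).size()); // prints 4
    }
}
```

A real crawler would replace `fetchLinks` with an HTTP fetch and an HTML parser, and would add politeness delays and robots.txt handling. A library such as Crawler4j exists to handle those concerns for you.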