Apache Nutch

Apache Nutch
Original author(s)Doug Cutting, Mike Cafarella
Developer(s)Apache Software Foundation
Stable release
1.x1.20 / 24 April 2024; 6 months ago (2024-04-24)[1]
2.x2.4 / 11 October 2019; 5 years ago (2019-10-11)[1]
RepositoryNutch Github Repository
Written inJava
Operating systemCross-platform
TypeWeb crawler
LicenseApache License 2.0
Websitenutch.apache.org

Apache Nutch is a highly extensible and scalable open source web crawler software project.

  1. ^ a b "Apache Nutch™ - Downloads". Retrieved 11 June 2024.