| Name: | nutch |
|---|---|
| Version: | 1.0 |
| Release: | 0.16.20081201040121nightly.el6 |
| Architecture: | noarch |
| Group: | Development/Tools |
| Size: | 25547424 |
| License: | ASL 2.0 |
| RPM: | nutch-1.0-0.16.20081201040121nightly.el6.noarch.rpm |
| Source RPM: | nutch-1.0-0.16.20081201040121nightly.el6.src.rpm |
| Build Date: | Sat Aug 31 2013 |
| Build Host: | ca-build44.us.oracle.com |
| Vendor: | Oracle America |
| URL: | http://lucene.apache.org/nutch/index.html |
| Summary: | Open source web-search software |
| Description: | Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc. |
- rebuild nutch from git
- Fixing nutch buildroot symlink issue
- We also need the nutch lib directory to contian the nutch jar
- updating bin directory permissions
- fixing nutch executable permissions
- we need scripts in bin for spacewalk-doc-indexes
- shrinking nutch rpm from 70M to 22M
- Correcting URL of the tarball in Nutch pkg (lzap+git@redhat.com)
- Removing unnecessary files - Erasing empty lines
- dropping unnecessary files
- rebuild
- Rebuild for new build tools.
- initial