| Name: | nutch |
|---|---|
| Version: | 1.0 |
| Release: | 0.19.20081201040121nightly.el7 |
| Architecture: | noarch |
| Group: | Unspecified |
| Size: | 25474152 |
| License: | ASL 2.0 |
| RPM: | nutch-1.0-0.19.20081201040121nightly.el7.noarch.rpm |
| Source RPM: | nutch-1.0-0.19.20081201040121nightly.el7.src.rpm |
| Build Date: | Fri May 10 2019 |
| Build Host: | x86-ol7-builder-03.us.oracle.com |
| Vendor: | Oracle America |
| URL: | http://lucene.apache.org/nutch/index.html |
| Summary: | Open source web-search software |
| Description: | Nutch is open source web-search software. It builds on Lucene Java, adding web-specifics, such as a crawler, a link-graph database, parsers for HTML and other document formats, etc. |
- removed %%defattr from specfile - remove install/clean section initial cleanup - removed Group from specfile - removed BuildRoot from specfiles
- 1483503 - move hadoop logs to /var/log
- recompile all packages with the same (latest) version of java - fixed tito build warning - replace legacy name of Tagger with new one
- rebuild nutch from git
- Fixing nutch buildroot symlink issue
- We also need the nutch lib directory to contian the nutch jar
- updating bin directory permissions
- fixing nutch executable permissions
- we need scripts in bin for spacewalk-doc-indexes
- shrinking nutch rpm from 70M to 22M