Web-Harvest Changelog

What's new in Web-Harvest 2.001

Mar 31, 2010

Plug-in mechanism enabled - users may develop their own processors and seamlessly integrate them to Web-Harvest.
New processors developed:
database: perform select/insert/update/delete operations against specified database (JDBC driver is required on classpath).
mail: send emails with optional attachments.
zip: crate ZIP archives with specified content.
ftp: access FTP server and perform common operations: list, get, put, del, etc.
tokenize: split text to list of elements.
json-to-xml: convert JSON formatted value to XML.
xml-to-json: convert XML to JSON formatted value.
file processor updated with action to list files with specified name filter.
http processor updated to support multipart forms (enabling file uploads).
charset and delimiter attributes added to text processor.
empty attribute added to loop and while processors in order to prevent accumulating of large results that may produce memory leaks. This is replacement for putting empty processor inside the loop body.
Several new attributes added to regexp processor to enable regular expression fine-tunning.
Complete access to http response headers.
GUI improvements:
Simple debugging added: user may define breakpoints where execution pauses and runtime values can be seen.
Charset selection enabled in settings dialog for configuration files.
Editor auto-completion improvements - auto-completion is available for attribute values wherever possible.
Editor improvements: copying lines/selection, deleting lines, (un)commenting xml fragments.
List of recently opened files added to File menu.
Dependency libraries updated:
HtmlCleaner updated to version 2.1.
Saxon updated to version 9 (XSLT 2.0, XQuery 1.0, XPath 2.0).
Number of new attributes supported in html-to-xml processor.
Number of bug fixes.
Java 1.4 is no more supported - JRE 1.5 or higher is required.

Web-Harvest Changelog

What's new in Web-Harvest 2.001

New in Web-Harvest 1.0 (Mar 31, 2010)

New in Web-Harvest 0.5 (Mar 31, 2010)