May 18th, 2012· Feature - Added the possibility to add [Ordinal] in the file rename pattern of Preferences>Exports to include the sequential number of the explored page during a browse or a dig. (This simplifies the matching of downloaded images with extracted data, for instance.)
· Feature - Added the #ORDINAL# replacement variable to scrapers, which allows you to add the sequential number of the explored page to a scraped record, during a browse or a dig. (This simplifies the matching of downloaded images with extracted data, for instance.)
· Fix - The search engine management panel is operational again.
· Fix - File downloads were failing if 'ignore' was chosen in the file exist menu of the macro editor. This was fixed.
· Enhancement - Removed useless disabled options in the action menu of the job editor (they will be implemented some day).
May 18th, 2012· Enhancement - Quicker HTML export.
· Enhancement - Global performance improved when automatic images extraction is on.
· Enhancement - Added #nextPage!# scraper rule to force next page to be taken into account even if the URL has already been visited in a browse or in a dig.
· Fix - Link extraction bug that was blocking browse or dig process in some specific dyamic pages.
· Fix - Matrix generation was looping if the "Matrix" checkbox was checked although the string was not a matrix.
May 18th, 2012· Fix - Minor fixes in macros.
· Fix - Fix problem with some Windows computers on Firefox 12.
· Enhancement - Works with Firefox 4 to 14.
May 18th, 2012· Feature - Can now directly receive and execute macros/jobs generated by OutWit Images.
· Feature - Portrait/Landscape options were added in the bottom panel of the 'images' view.
· Feature - The "select if" text box in the 'images' view now allows to type '800x600', '800x', 'x600', 'x800x' to compare respectively: width AND height, width, height and width OR height.
· Feature - A renaming preference was added to the Export Preferences, to set renaming patterns for downloaded files and exported files. In the pattern a succession of "#" means the addition of a numerical increment (with leading zeros) and "[OriginalName]" allows to include the original base name of the file.
· Fix - Open was failing is some cases when the file extension was in capital.
· Fix - Several minor fixes in macros, jobs.
· Enhancement - Small interface changes, including the possibility to preview the macros and jobs interface in light version.
· Enhancement - Works with Firefox 4 to 12.
April 30th, 2012· Feature - Can now directly receive and execute macros/jobs generated by OutWit Images.
· Feature - Portrait/Landscape options were added in the bottom panel of the 'images' view.
· Feature - The "select if" text box in the 'images' view now allows to type '800x600', '800x', 'x600', 'x800x' to compare respectively: width AND height, width, height and width OR height.
· Feature - A renaming preference was added to the Export Preferences, to set renaming patterns for downloaded files and exported files. In the pattern a succession of "#" means the addition of a numerical increment (with leading zeros) and "[OriginalName]" allows to include the original base name of the file.
· Fix - Open was failing is some cases when the file extension was in capital.
· Fix - Several minor fixes in macros, jobs.
· Enhancement - Small interface changes, including the possibility to preview the macros and jobs interface in light version.
April 24th, 2012· Feature - Significantly improved the HTML export layout, especially when data includes image URLs (display of Hires images, row highlighting, full text display on hover...).
· Feature - Added a new #checkIf# directive, allowing to set the values of the OK checkboxes in a scraper if a condition matches. (Example: Description:#checkIf#, Format:Detail, Replace:01001 - the OK checkboxes of the five first lines of the scraper will be respectively set to false, true, false, false and true, if the string "" is found in the source code of the page.) You can put as many #checkIf# directives as you want in a scraper.
· Fix - Minor fixes in update check.
· Fix - Corrected a bug that prevented the Dig function to work in some cases.
· Fix - Removed error messages than appeared in the console in certain cases, when quitting the application.
· Fix - Upgrade/Downgrade uninstall problems were still happening in certain cases. They were fixed.
· Enhancement - More Hi-Res images should be found in common galleries.
· Enhancement - Minor fixes were made to the 'Guess' algorithm in cases of very small numbers of rows.
March 29th, 2012· Feature - Dragging data from the catch to the Page now works.
· Fix - Multiple selection for exporting automators was not working on Windows. This was corrected.
· Enhancement - Works with Firefox 4 to 13.
March 13th, 2012· Fix - Corrected reload button sometimes hidden on Windows.
· Fix - Fixed a regression in "Browse All Pages".
· Fix - In very rare cases, sites with javascript generated popups could crash the application. This was fixed.
· Enhancement - Works with Firefox 4 to 11.
· Enhancement - Very large directories of queries are better handled. Usually 50k items are working smoothly. We have tested up to 500k. At these levels, the application requires very large RAM space.
· Enhancement - Moving very large numbers of rows to the catch could be extremely long. This was optimized further.
· Enhancement - A confirm dialog was added to the datasheet Empty buttons.
March 5th, 2012· Fix - History/typeahead fixed in the search bar textbox.
· Fix - Corrected a bug that prevented exports in certain cases for the document and news views.
· Fix - Minor fixes and enhancements in the query manager.
February 13th, 2012· Feature - Added more options to the browse popup menu.
· Fix - Corrected issue with the "active" and "matrix" checkboxes of the "queries" view.
· Enhancement - Optimized the loading and of very large directories of queries.
· Enhancement - Optimized the launch time before a browse or a fast scrape on very large numbers of URLs.
February 13th, 2012· Enhancement - Kernel update for Firefox 10.
· Enhancement - Improvements in the upgrade/downgrade system.
January 27th, 2012· Feature - Added a preference to make popup links open within the Hub instead of a new Firefox Window.
· Feature - Added a preference to prevent an alert to be displayed 10 seconds before the execution of a job.
· Feature - First implementation of POST queries in the Hub (to automatically fill forms). POST data must be added to the URL like ordinary parameters (after ? and separated by &, but the name of each POST parameter must simply be prefixed with #POST#
January 19th, 2012· Feature - Added #replace(value,findPattern,replacePattern)# function in scrapers.
· Feature - Added javascript link support to #nextToVisit()# function.
· Fix - Corrected glitches in update/downgrade processes.
· Enhancement - Now filtering out records that are empty or containing only empty fields from scrape results.
January 10th, 2012· Enhancement - The Hub is now available on all platforms, both as a standalone application and as a Firefox extension.
· Enhancement - Number of fixes and enhancements in the scraper engine.
· Enhancement - The ctrl-S, ctrl-F / cmd-S, cmd-F (save & find) commands are now working on the page even if the browser doesn't have the focus.
· Enhancement - A count of visited and remaining pages is now displayed in the status bar during automatic exploration.
· Enhancement - A throbber was added in the address bar while loading a page.
· Enhancement - A list of available tutorials is displayed at first run. It can then be accessed with the Tutorials item of the Help menu.
· Fix - The addition of a "i" flag is now working properly in regexps, when clicking on the "ignore case" checkbox of the find bar.
· Fix - Openning a Help window from a tutorial works again
· Fix - A bug was corrected in the clean text function which was trimming +=- characters at the end of a string in the extracted data.
· Fix - The #replace# directive could lead to an infinite loop in some cases. This was corrected. browser doesn't have the focus.
· Known Issue - There are occasional problems in the renaming of queries. (Workaround for now: edit the cell and hit return.)
December 18th, 2011· Fix - Fast Scraping a series of URLs, the process doesn't stop any more when the domain of one of the URLs doesn't exist.
· Fix - A recent regression prevented macros to export data and download files at the same time. It is working agin.
· Fix - The #nextToVisit()# function was not working with URLs containing spaces. This was corrected.
· Fix - Exports respect again the 'Include id column' preference.
· Fix - Columns that have been hidden in the column picker are not exported.
· Fix - Various other minor fixes.
· Enhancement - Works with Firefox 3.6 to 10. Note that this is the last version supporting Firefox 3.6 and that several features are already not working with this version. You should update your browser to enjoy all features.
December 2nd, 2011· Fix - The #replace# scraper directive now applies the replacement recursively.
· Enhancement - Enhanced figure normalization functions.
· Enhancement - Now resets all views to default and preferences to original values after execution of tutorials and wizards.
December 2nd, 2011· Fix - minor corrections in the update system.
· Enhancement - version management in WitScript.
December 2nd, 2011· Fix - Corrected problem displaying some records with very large fields in the Detail panel.
· Fix - Fast scraping works on very large selections of datasheet or catch rows.
· Fix - The export button at the bottom of the datasheets now works even if the datasheet doesn't have the focus.
· Fix - Fixed the overwriting of existing files when manually saving export files.
· Fix - Running several macros in a row during the same session could fail in some cases. This was corrected.
December 2nd, 2011· Feature - The right-click menu of the datasheets has changed: Automatic browsing and Scraper application are now gathered under "Auto-Explore Selected Links". In this submenu, you will find the exploration and scraping functions: 'Browse','Dig', Fast Scrape selected URLs (with the new possibility for the latter, to include the selected data fields in the scraping results) as well as the possibility to apply generic macros (i.e. macros that are not specific to a given URL) to the selected links.
· Feature - You can now set the preferences so that FF runs OutWit Hub automatically on launch.
· Feature - Empty/Export/Download buttons were added to the datasheet bottom panels depending on the context, to export selected content.
· Feature - You can now access and scrape either the "original" source code as downloaded by your browser or the modified "dynamic" source code as altered by Javascript scripts. This allows you to scrape AJAX pages that couldn't be scraped before.
· Feature - It is now possible to completely disable images and plugins (like flash) in the Hub, for faster browsing: Use the right-click menu on 'page' in the side bar.
· Feature - New settings were added to the time preferences, including temporization and pauses at set intervals for the fast scraping mode (XHR queries).
· Feature - First implementation of the WitScript engine to run tutorials, complex scripts and wizards.
· Enhancement - Scrapers were optimized, should be easier to produce and more forgiving.
· Feature - One major change is that scrapers can now reuse a same marker in several lines and use overlapping markers, which was not possible in the previous version.
· Feature - Many new directives were also added to scrapers to help the debugging: #showSource#, #showMatches#, #showResults#, #showDelimiter#, #showScraperErrors#, #showServerErrors#, #simulate#, #showAlert#...
· Feature - Added the new directive #variable#myVariableName#. The occurrences of the variable (#myVariableName#) are replaced, at application time, by the scraped value in all other lines of the scraper.
· Feature - Added a #nextPage# directive in scrapers, allowing you to tell OutWit Hub how to find the link to the next page in a series when it doesn't find it automatically. For the moment, this is only applied when the scraped view is active (bottom panel not at default settings).
· Feature - Added the #nextToVisit(#myURL#)# function which, in the 'Replace' field, instructs the Hub to give the variable #myURL# the next scaped value which is not found in the list of visited URLs. This means that, used in conjunction with #nextPage# and #BACK# you can create complex scraping workflows. You can, in particular, create multi-level scraping processes.
· Feature - Added new replacement variables in scrapers: #URL#, #BASEURL#, #DOMAIN#.
· Feature - Added new replacement functions: #(term1 operator term2)# and #if(condition,valueIfTrue,valueIfFalse)# or #if(condition;valueIfTrue;valueIfFalse)#. Works with the following operators: (comparison operators); a=A (case-insensitive comparison); a==a (case-sensitive comparison); a!=b (not equal, case insensitive); a!==b (not equal, case sensitive); a+b (addition of integers: 1+3=4; concatenation of strings: out+wit=outwit; incrementing characters: c+3=f), a-b (subtraction of integers: 5-2=3 or decrementing chars: e-3=b ), a*b (multiplication), a/b (division) and a^b (power). The terms can be literals, variables or functions.
· Feature - When using equality operators on strings (=, !=, ==, !==), you can now use the wildcard % in the second term to replace any string. (ex. these three statements are true: headstart = Head% ; homeland == h%d ; lighthouse = %HOUSE).
· Feature - Added URL alteration functions: #getParam(url,parameterName)# and #setParam(url,parameterName,parameterValue)#. When used with #URL# in the #nextPage# directive line, you can easily set the value of the next page url in some cases. ex.: #setParam(#URL#,page,#(#getParam(#URL#,page)#+1)#)# in the replacement field will generate the next url, incrementing the parameter 'page'.
· Feature - Added Lookup list replacement: #lookUp(value,listOfValuesToFind,listOfReplacementValues)# or #lookUp(value;listOfValuesToFind;listOfReplacementValues)# for replacing lists of values. The elements of the first list will be respectively replaced by those of the second.
· Feature - The right-click menu on a scraper field in the scraper editor now allows you to highlight the matches in the source code. The same feature used on the description field, highlights matches for the whole scraper line.
· Feature - In the scraper fields right-click menu was also added a small library of sample scraper lines and directives.
· Feature - Added 'insert line' to the right-click menu of all datasheets, with a keyboard shortcut. This allows easier data manipulation in all datasheet but is particularly useful in the 'queries' view.
· Feature - Regular expressions can now be used in the find bar (ctrl-F or cmd-F) of the 'page', 'source' and 'scrapers' views! Just begin and end your patterns with "/" (i.e.: /yourRegularExpression/ ).
· Fix - The slideshow function now works even in on-demand images mode.
· Fix - Large number of fixes and performance optimizations throughout the code.
· Fix - Corrected encoding problems in the Dynamic Source that could happen if the meta declaration was not UTF-8.
· Enhancement - Complete refactoring of all datasheets in the application (views and catch): they are now able to handle hundreds of thousands of rows.
· Enhancement - Refactoring of the export functions to be able to handle very large volumes of data as well.
· Enhancement - When downloading files, an increment or a date was added to those with duplicate filenames. It is still the case, but when the url contains parameters, OutWit now adds these to the filename before testing for duplicates. This is more effective when the images come from databases.
· Enhancement - Some cosmetic work was done on the User Interface.
· Enhancement - More next page links are found, in more languages.
· Enhancement - Fully compatible with Firefox 3.6 to 9.
December 2nd, 2011· Feature - updater to version 2.0.
· Fix - some fixes were backported from v.2.0.
October 28th, 2011· Fix - Several fixes in scrapers.
· Enhancement - Compatible with FF3.6 to 9.0
August 22nd, 2011· Feature - A few debug directives were added to scrapers: #showSource#, #showResults#, #showErrors#.
· Enhancement - Works with Firefox 3.6 to 7.
August 22nd, 2011· Enhancement - Algorithm of exploration for high resolution images was optimized.
· Enhancement - Some minor enhancements and fixes in the scraper application algorithm.
August 22nd, 2011· Fix - Corrected compatibility problem with version 4 of Firefox.
· Fix - Corrected problem in scrapers which could happen when the marker before contained a single character.
July 4th, 2011· Fix - Corrected a problem happening in the recognition of URL with UTF-8 characters in text files.
· Enhancement - Now compatible with all versions of Firefox between 3.6 and 6.
July 4th, 2011· Fix - Corrected error in console when dragging an item to the view list.
· Fix - Fixed minor remaining problems in notification dialogs.
July 4th, 2011· Fix - The 'Empty' checkbox was useless in the history view. It was removed. (A way will soon be provided to disable the history view.)
· Fix - In version 1.0.6.9, notification dialogs couldn't be closed with Firefox v3.6. This has been fixed (sorry).
· Enhancement - Bug report, suggestion and version history pages are now open as separate Firefox windows so that calling them doesn't disrupt your current work in the Hub.
· Enhancement - High resolution image extraction was improved.
July 4th, 2011· Fix - A bug was fixed that prevented to drag elements from the page to the catch when the 'save incoming files' checkbox was checked.
· Fix - The current scraper was not saved when clicking on 'Execute' directly when a cell was still being edited. This was fixed.
· Enhancement - Several fine-tuning corrections were made to the scraper application. They should not result in any noticeable differences in the extracted results, but will bring performance enhancements in some cases.
· Enhancement - Works with Firefox 3.6 to 5.x on all platforms.
· Enhancement - The default extension for Excel exports is back to '.xls' which works best on all platforms. (This can be changed in the preferences).
July 4th, 2011· Fix - Fixed a bug on the #stop# directive.
· Fix - Solved rare problems occurring at startup on some systems.
· Fix - Several minor fixes.
· Enhancement - Compatible with Firefox 5 Beta.
May 20th, 2011· Fix - Additional fixes and enhancements to jobs, covering a larger range of possible contexts (going to waking from sleep, inside the application, in another application, leaving and restarting the Hub, other processes already running...)
· Fix - "cleared scope" errors were fixed
· Enhancement - Digging through local files and folders has been dramatically enhanced.
May 20th, 2011· Fix - An unwanted space character was still added in some cases before \0 in the replacement string of scrapers. This was fixed.
· Fix - The opening of multiple html files was fixed.
· Fix - Fixes and enhancements in Job scheduling.
· Fix - The Dig depth problem when the option was entered through the advanced settings dialog was fixed.
· Enhancement - The Next Page function recognizes more series.
· Enhancement - Long texts are not truncated anymore in the HTML export, only partly hidden. This means that the data is there and, although not fully visible, can be copied and pasted.
May 20th, 2011· Feature - Added the #replace# directive to scrapers.
· Fix - Corrected problem occurring when multiple directives were used in a scraper.
· Fix - File>Open multiple files now works with .htm extension.
· Fix - Corrected a Dig depth issue that appeared when set through Dig's Advanced settings dialog.
· Fix - Corrected a bug in the randomize function of jobs.
· Enhancement - Further enhancements in Next Page link recognition algorithm.
· Enhancement - Added the possibility to set Separator & Labels in the #repeat# directive.
· Enhancement - In the HTML export, the data that was previously truncated for layout purposes is now only hidden, which means that it is present in the export page source code and can be selected and copied.
May 20th, 2011· Feature - Browse through selected URLs now accepts "javascript:xxx" strings.
· Fix - Labels ending with a digit are handled properly again in scrapers.
· Fix - The extra space was removed before \0 replacement in scrapers.
May 20th, 2011· Feature - Added a preference to set the minimum number of rows an HTML table must have to be extracted by the 'tables' view.
· Feature - Added an escape/unescape function in the right click menu of the scraper editor, which helps switching between literal content and a regular expression pattern.
· Fix - Corrected cell cleaning of repeated fields in scrapers.
· Fix - Several fixes in number normalization functions.
· Fix - Minor fixes in advanced date parser.
· Fix - In the 'tables' view, corrected a bug that made the factorization of cell labels into a column header fail when labels contained some characters (including $ or €).
· Fix - The index increment function was not working properly when generating a new automator version number. This was fixed.
· Fix - Corrected a bug when downloading files with strange characters, in particular, starting with a ".".
· Fix - Corrected problems in the CSV export of some files containing strings of the form "#some characters:".
· Enhancement - Scraped lines are now reordered when using fast scraping mode.
· Enhancement - Automators can now be imported in several ways: opening several files at once, clicking on the link of an automator, etc.
· Enhancement - Enhanced extended email recognition.
· Known Issue - An unwanted space character is inserted before the value when \0 is used in scraper replacement field.
· Known Issue - Scraper labels ending with a digit end up in cell content as "#LabelXX.:".
May 20th, 2011· Fix - The download issue for files with the same name was corrected.
· Fix - The 'Unresponsive script' issue in line 744 of overlay.js has been fixed.
May 20th, 2011· Feature - A submenu now gives the choice of scraper to be appled to selected URLs. If no specific scraper is chosen, the scraper to apply is selected automatically, as before.
· Fix - Fixed some problems in the the check for updates function.
· Fix - Fixed a problem under Windows XP and Vista with Firefox 4 beta.
· Fix - Fixed minor problems in the URL filter for browse and dig.
· Fix - The automatic inclusion of hidden fields (like "Source url") if used in a macro is now working.
· Fix - Minor bugs were corrected in Jobs.
· Enhancement - Delete/Select duplicates now work for multiple selections.
· Enhancement - Select duplicates now ignores blank cells by default. (This can be changed in the preferences.)
· Enhancement - The "TAB" key is now ignored in the scraper editor.
· Known Issue - Problems can appear when downloading files of the same name. The files may be overwritten or ignored. Please download the next version if you are experiencing this problem.
· Known Issue - In this version, some users have experienced 'Unresponsive script' messages. Please download the next version if you are experiencing this problem.
February 24th, 2011· Fix - Corrected bug in macros which prevented to save exported files when the overwrite option was selected.
· Fix - It is now possible to set tabs as separator in the preferences for csv export.
· Fix - Apply scraper on a generation matrix now generates the URLs then does the apply scraper.
· Known Issue - The enhancement of v1.0.2.5. concerning the hidden columns in the macros is not working properly. You still need to select the wanted column in the column picker for now.
February 16th, 2011· Feature - Lines can now be duplicated in the scraper editor (via the right click menu).
· Fix - Some locales were modified in English and French.
· Fix - A bug was corrected that prevented new records to be created in some cases when the first line of a scraper included a Separator and a list of labels.
· Fix - All deduplication functions are working again in the right-click menu on the datasheets.
· Fix - When a filter is set to the exploration or URLs (dig/browse), the filter doesn't apply to the URL of the current page any more, but only to the links within the start page.
· Fix - RSS feeds are now found not only if links to feeds are present but on the feed pages themselves.
· Fix - The local help files were removed. All help pages are now online.
· Fix - The Execute button of the scraper editor didn't work when the layout was set to side by side windows. This has now been fixed.
· Fix - The messages in the info bar are now more explicit when no scrapers have been found for the URLs.
· Enhancement - In macros, the program now ensures that a column is shown if used in a macro, even if hidden with the column picker. This means that if, for instance, you want the 'source URL' column to always be present in the exported data, use it in the 'Select if' or 'Sort by' zones of the macro definition.
· Enhancement - Additional cases are now handled for the next page recognition in series of results.
· Enhancement - Right-clicking on a generation matrix now allows to directly apply a scraper to all generated URLs.
February 16th, 2011· Feature - A few instructions were added to the generation matrix format (groups and steps in ranges, #RANDOM[x:y]#).
· Fix - A regression was noted in the links view of version 1.0.1.9, please update to this version.
· Fix - A bug was corrected in the links view on a specific type of double encoded links.
· Fix - A few bugs were corrected in generation matrices.
· Fix - Some regressions were fixed in next page link recognition.
· Fix - Advanced settings bugs were corrected in the Dig function.
· Fix - A problem was fixed that occurred when a macro and a scraper had the same name.
· Fix - An interface problem was corrected in the scraper editor that left a cell highlighted after editing.
· Enhancement - The next page recognition algorithm was further enhanced to recognize series, reducing the risk of false positives.
· Enhancement - Now compatible with Firefox 4 beta 10.
· Enhancement - The on-demand image extraction mode now has a setting in preferences .
January 3rd, 2011· Feature - A split function for exported files was added to macros. (For now, the only criterion is the number of exported rows. We will add other criteria in future versions.)
· Feature - An option was added in the preferences to include the ID column when exporting data to a file.
· Feature - Scrapers: added #repeat#, #start# and #stop# directives for value repetition, hierarchical extraction in some cases and extraction start/stop.
· Feature - Scrapers: added time variables in the replacement field.
· Feature - A min / max setting was added to the time preferences for automatic exploration features, in order to allow random temporization in dig and browse.
· Fix - A bug was fixed that appeared in some cases saving and deleting query directories.
· Fix - The slideshow is working again when the automatic processing of images is disabled.
· Fix - The "Save incoming files" checkbox of the catch is working again
· Fix - Corrected a bug in scrapers happening with field names ending with a digit.
· Fix - Corrected a problem in macros altering the settings in some rare cases.
· Enhancement - Macros: changes were made in the dig conditions, for consistency in the MAU. The syntax is now the same as in the "select if" criteria.
· Enhancement - The kernel is now compatible with Firefox 4 b8.
· Enhancement - Small changes and enhancements were made in macros.
· Enhancement - Some significant changes and enhancements were made in scrapers.
· Enhancement - A checkbox was added to the catch section of the macro editor, to empty the catch without saving it.
· Enhancement - Most of the useless multiple log messages have been removed.
· Enhancement - Query generation matrices: added a few functions to the time variables.
· Enhancement - Added the possibility to select the whole content of the address bar when pressing the esc key.
December 10th, 2010· Feature - A new option in the right click menu on the images view name in the side panel allows you to deactivate the automatic exploration for images in order to save time in processes that do not require images. In this mode, images are not completely disabled but will only be processed on demand.
· Feature - Additional options can be found in the datasheet right click menu under "Duplicates". Submenu options allow to select or delete partial or total duplicates.
· Fix - "Fast Scraping" and "Apply Scraper" could slow down and eventually stall after large numbers of queries or when queries failed because of bandwidth or server response problems. This was fixed.
· Enhancement - The catch was periodically backed up even when not changed. It is now only done if necessary.
· Enhancement - The Next Page function was enhanced and should now recognize the navigation links more often and with more pertinence.
December 10th, 2010· Fix - Corrected a regression introduced in version 1.0.0.25 on email address extraction in some pages.
November 19th, 2010· Fix - Corrected problems in macros when the start page is https protocol.
· Fix - Corrected problem rare issue with emails found in hex encoded strings.
· Enhancement - Compatible with Firefox 4b7 (considerably faster on this one).
· Known Issue - A few minor interface glitches left in latest beta (mainly contextual menu style).
November 19th, 2010· Feature - The "Advanced..." item was added to the Dig button menu (hold button down), opening an advanced setting dialog where URLs to explore can be filtered in different ways.
· Enhancement - the Next Page function was enhanced and finds the following page link in more cases.
· Enhancement - More interface glitches were corrected.
November 3rd, 2010· Fix - Minor bug fixes
· Enhancement - Remaining graphic interface glitches solved for Firefox 4 beta
November 3rd, 2010· Feature - A 'Keep Order' option was added in the scraped view's bottom panel to force the columns to be extracted in the order of the scraper
· Fix - In some cases, return characters were doubled when copied or dragged from the source code to the scraper editor. This was corrected
· Fix - Fixed a problem that appeared in rare cases in table extraction
October 20th, 2010· Enhancement - Support for Macintosh PPC was restored for Firefox 3 (It will not work, however, with Firefox 4).
October 20th, 2010· Feature - Delimiter, BOM, text identifier can now be set for CSV export.
· Feature - The catch file is now backed up at regular intervals.
· Feature - A character escaping system was added to matrices ("\" like in the regExp syntax).
· Fix - In some cases, the text cleaning of scraped data could add a unwanted space character. This was fixed.
· Fix - About box was fixed for Firefox 4.
· Fix - Some localization fixes.
· Enhancement - Preferences can now be found in the Tools menu for all platforms, including Mac, which leaves acces to Firefox preferences when in the Hub.
· Enhancement - Drag and drop was enhanced in the queries view.
· Enhancement - Better handling of dynamic pages, handling DOM nodes when they are added in AJAX pages.
· Enhancement - Preferences have been updated with catch autosave and export settings.
· Enhancement - A few shortcuts were changed for consistency with OS and other apps practices.
· Enhancement - Registration email is now case insensitive.
· Enhancement - The maximum timeout value for the loading of a page was raised to 10mn.
September 13th, 2010· Fix - Now works with Firefox 4b5 on all platforms.
September 2nd, 2010· Feature - The sort order is now saved for each queries directory. Reordering is possible by dragging queries.
· Feature - The query sorting (click on column header) now has 3 states: ascending, descending and user defined.
· Fix - The 'sort by' and 'limit' functions are now working in macros.
· Fix - The 'send to catch function' was not working in some cases from the images view. This was fixed.
· Fix - Sending data to catch with empty columns was removing last columns in certain cases.
· Fix - The page and images throbbers were spinning endlessly in some pages on Firefox 4.
· Enhancement - Removed the alert on redirection errors.
· Enhancement - When a processes is finished, the time of the end of execution is displayed in the info bar.
· Enhancement - The program recognizes more next page links.
· Enhancement - 'Select similar' and 'Select identical' functions were dramatically optimized.
· Fix - Several minor bugs were corrected in the scraper application. Please check that your previous scrapers are still behaving the same way.
· Fix - Firefox restart problems were corrected in the kernel compatibility check and update process.
· Fix - In rare cases, the numbering of columns was incorect in tables. The indexes now correspond to the column number, if no header is set for the column.
· Enhancement - Preferences are applied when the preferences window is closed. The OK and Cancel buttons now work in Windows.
· Enhancement - The 'Next Page' function was enhanced and now recognizes a larger number of links.
· Fix - The images view throbber was spinning forever when loading a file without images, a non-HTML document or a local file. This was fixed.
· Fix - A bug was corrected in the 'Invert Selection' function.
· Enhancement - The Chinese locale was removed for now. It was too partial to be useable. (Spanish and German locales were kept for now although they are not fully completed. In version 1.0, only fully translated locales will be available.)
· Enhancement - More separators are recognized in the list of labels field in the Scraper Editor. Labels can be separated by: ,;:/|# or TAB. (The TAB separation allows to simply copy and paste headers from a table of the page.)
· Enhancement - The update and compatibility check processes were enhanced.
· Enhancement - More minor fixes and enhancements were added to this version.
· Feature - Now works with the latest beta of Firefox 4.
· Feature - A first implementation of export to HTML was added. More layout flexibility will be added later.
· Feature - The Words view now finds frequent groups of words. (This feature is being tested and will probably not remain as is in the following versions.)
· Feature - A preference was added to choose whether the program should load or edit URLs on double click in datasheets
· Feature - The content of the Text view can now be moved to catch or exported.
· Fix - Send to catch on page load bug was fixed in data views.
· Fix - A bug was corrected that could offset cells when moved to the catch from tables.
· Fix - Some complex pages could generate Unresponsive Script alerts. This should not happen anymore.
· Fix - Fixed a bug opening catch files with spaces in the name.
· Fix - A data type was added in the SQL export, for broader compatibility.
· Enhancement - Queries were sometime sent too fast in 'fast scraping' mode for some sites. The frequency was reduced. (A preference will soon allow direct control over this parameter.)
· Enhancement - The management and execution of automators was enhanced in several parts of the code.
· Enhancement - The editing, moving and duplicating of query directory content was enhanced.
· Enhancement - Many other small fixes and enhancements were made.
· Feature - When opening an XML file, the program now recognizes if it is an OutWit automator and imports it directly to the user database.
· Feature - Select If function was not working on 'equal to' and 'not equal to' with string. this was corrected.
· Feature - It is now possible to split a scraped field into several fields with the new separator features of the scrapers (pro version).
· Fix - Send to catch on page load bug was fixed in several views.
· Known Issue - The 'send to catch on page load' function is still not working in data views. (Corrected in the coming update.)
· Fix - Export was not working for some extractors when used in macros. This was corrected.
· Fix - Insert rows in an empty datasheet is now working properly.
· Fix - Downloaded files with hexa encoded characters in the filename are now saved with the proper name on the disk.
· Fix - Dig and browse were not working properly on some pages, creating duplicated catch data, and not downloading files. This is fixed. Please use the bug report form if you find problematic web sites.
· Enhancement - Improved loading time of some web pages.
· Enhancement - Next and Up buttons now show their URL in the status bar.
· Enhancement - New enhancements were made to the scraper application function. They might alter the way old scrapers will behave. Please test thoroughly.
· Enhancement - When exporting a selection of rows to Excel, if labels are identical in all rows, the header will be set to that label in the exported file.
· Enhancement - In queries the program now recognizes if a query is a matrix when dragged and after it has been edited.
· Enhancement - Registration system was updated.
· Enhancement - Images extraction and dynamic node processing was enhanced for large AJAX pages.
· Enhancement - Online help is now... online.
· Fix - A bug was corrected that prevented exports from macros on Windows.
· Fix - A bug was corrected that made the Catch export from a macro generate a new column for each value.
· Fix - A few glitches were addressed in the saving of jobs when changes are made in the manager or in a macro.
· Fix - A refresh problem was solved in the last/next execution field of the job editor.
· Fix - Deleting the last lines of a table now removes its values from the detail panel or the queries editor.
· Enhancement - A finer access to timeout setting was added in the preferences panel.
· Enhancement - The name of the Jobs view is set to bold when a job is scheduled and active.
August 20th, 2010· Feature - Pro version features were added to this version for beta test purposes. These features include new views (queries, macros, jobs, documents), as well as new features in the previous views (replace in scrapers, adjacent, limit...). Please check the inline help for more info on these.
· Feature - A serial number must be entered to access these features. Pre-registration is open. We will send a temporary key to users who wish to beta test the version, as soon as the test program starts.
· Feature - An info bar was added on top of the main panel with info on processes in progress and on the content of the current view.
· Feature - The info bar can be moved up and down to reveal or hide a log of the program actions. The number of messages is limited at this point, but additional info will come in future versions.
· Fix - A whole list of fixes and enhancements were made in this version.
August 10th, 2009· Fix - Dragging and dropping text and images from the page to the catch wasn't working with Firefox 3.5. This was corrected.
July 13th, 2009· Feature - The 'Insert line(s)' feature was added to the right-click menu in most views.
· Feature - Through this 'Insert line(s)' feature, it is now possible to insert generated sequences of strings in most views (links, images, emails, tables, lists, guess and scraper). This can be used to automatically create incremented strings, URLs to explore, sequential file names to download, etc.. In the current version, this feature is limited to 15 items.
· Feature - A new checkbox in the bottom panel of the 'Images' view allows to ask the program to look for possible neighboring images in series on the same server. In the current version, the program only searches for directly adjacent images in sequences.
· Enhancement - Some images were filtered out if the aspect ratio of the high resolution image was different from the one of the thumbnail. The reason was that it usually prevented from getting images for which the full resolution was unavailable. However, this also prevented from finding the high res images in many galleries. This filter was removed.
· Enhancement - When openning a Catch file, the program now asks if the user wants to add to the existing items of the catch or replace them.
· Fix - Bookmarks and History are now accessible through the Navigation menu on Macintosh.
· Fix - Several minor bugs and locale issues were fixed.
· Feature - Added access to the history of visited pages in the Navigation menu.
· Feature - Added possibility to remember the state of the Save Incoming Files mode between sessions.
· Feature - Now keeps path of the destination folder in preferences.
· Fix - Corrected a bug that sometimes occurred when importing a scraper.
· Enhancement - Some Kernel code refactoring and commenting.
· Enhancement - Finds more high-resolution images in galleries.
· Enhancement - Finds images in html files saved on the hard disk.
· Enhancement - Finds images in simple text files.
· Fix - Some more bugs were fixed in the scraper application.
· Enhancement - Now compatible with Firefox 3.5.
April 15th, 2009· Now compatible with the latest beta of Firefox.
November 17th, 2008· Performance optimization. Enhancement in lists recognition: now handles DL lists. Minor bug fixes.
November 10th, 2008· Minor fixes in the first run process and in the Tables view.
October 24th, 2008· Complete re-writing of the slideshow module allowing to navigate back and forth with the arrow keys and optimizing the image loading process.
· Added advanced selection features in the bottom panel of each view. The extracted data can now be filtered using criteria such as contains, begins with, greater than...
· Refactoring of the code and addition of image management features, for the coming release of OutWit Images.