OutWit Hub Light Changelog

New in OutWit Hub Light 9.0.0.8 (Sep 9, 2020)

  • Feature - Data refining tools: #decode()# several algorithms were added to this function to decode an obfuscated or encrypted string extracted by the scraper line into plain text.
  • Feature - Data refining tools: Compute: performs basic operations on selected numerical cells of a column.
  • Feature - Data refining tools: Delete columns > to the right: deletes the selected column and all additional columns to its right.
  • Feature - Export: It is now possible to export each scraped row to a separate file.
  • Feature - Scrapers: #clickOnNodes# instructs the scraper to click on page elements matching a css selector.
  • Feature - Scrapers: #decode()# several algorithms were added to this function to decode an obfuscated or encrypted string extracted by the scraper line into plain text.
  • Feature - Scrapers: #EARLIEST# and #LATEST# return the first/last date matching the scraper line.
  • Feature - Scrapers: #enableNodes# and #disableNodes# directly change the state of page elements matching a css selector.
  • Feature - Scrapers: #ifURLContains# and #ifURLDoesNotContain# execute or skip a scraper line depending on the URL being scraped.
  • Feature - Scrapers: #ignoreIfField# instructs the scraper to ignore this page or record if a field has a certain value.
  • Feature - Scrapers: #lowerCase()#, #upperCase()#, #properCase()#, #sentenceCase()# alter the case of the string extracted by a scraper line.
  • Feature - Scrapers: #PAGESTATUS# replacement function returns info on the current page (errors, title...).
  • Feature - Scrapers: #pressKey# allows the scraper to simulate a key press in certain cases.
  • Feature - Scrapers: #select# adds elements matching a css selector to the selection in the current page.
  • Feature - Scrapers: #setValue# can now also check radio buttons, checkboxes, etc.
  • Feature - Scrapers: A ^ suffix in the description (myURLFieldName^) of a scraper line that extracts a URL returns only the "top" URL in the hierarchy (out of example.com/products/shoes and example.com/products/, only the latter is returned).
  • Feature - Scrapers: Multiple required fields (descriptions ending with "!") can now be interpreted as AND or OR conditions.
  • Enhancement - Contact recognition & filtering: Implementation of job title recognition (debug stage, mostly English for now). Better elimination of example/bogus email addresses and phone numbers. Enhancements in name, company, address and copyright field recognition. Enhancements throughout the program in splitting first/last names and physical addresses. Better handling of obfuscated email addresses.
  • Enhancement - Data refining tools: Clean up > Normalize All Figures: enhanced and optimized.
  • Enhancement - Enhancement of date recognition, including dates without a year.
  • Enhancement - Export: performance enhancements in export functions.
  • Enhancement - The application window is resized at launch if it exceeds the dimensions of the screen.
  • Enhancement - Updated the list of User Agents in the advanced preferences.
  • Fix - Fixed source colorizing problems in the case of line breaks inside HTML tags.
  • Fix - Multiple fixes and enhancements in String Generation functions.
  • Fixes - Many more enhancements and fixes throughout the code.
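
The "^" suffix entry above describes keeping only the "top" URL in a hierarchy. The following Python sketch illustrates that idea only; it is not OutWit Hub code, and the function name top_urls and its trailing-slash handling are assumptions made for this example.

```python
def top_urls(urls):
    """Keep only URLs that are not nested under another URL in the list.

    Hypothetical helper: a URL counts as nested when it starts with
    another URL (trailing slash ignored) followed by a "/".
    """
    kept = []
    for url in urls:
        path = url.rstrip("/")
        is_nested = any(
            path.startswith(other.rstrip("/") + "/")
            for other in urls
            if other.rstrip("/") != path
        )
        # Preserve the original spelling and skip exact repeats.
        if not is_nested and url not in kept:
            kept.append(url)
    return kept


if __name__ == "__main__":
    print(top_urls(["example.com/products/shoes", "example.com/products/"]))
```

With the two URLs from the changelog entry, only example.com/products/ is kept.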

New in OutWit Hub Light 8.0.0.46 (May 31, 2019)

  • Feature - #emptyDirectory# Empties the first directory of the queries view matching the passed name.
  • Feature - #splitField# Splits the passed field as a post-process, using the values in the separator and labels columns. (Can allow consecutive splits.)
  • Feature - #decodeEntities# Decodes HTML entities (like &amp; or &gt;) to their plain text equivalent.
  • Feature - #decodeURL# Decodes URL encoded characters (like %20) to their plain text equivalent.
  • Feature - #save# Saves the string extracted by the scraper line to a separate text file.
  • Feature - #screenshot# Saves a screenshot of the current page into a file using the passed file name.
  • Feature - #hideNodes# Makes the nodes matching the passed css selector invisible.
  • Feature - #scrollBy# Scrolls the page loaded in the OutWit Hub browser by the passed number of pixels.
  • Feature - #resetPrefOnStop# Resets the passed preference to its default value at the end of the scrape process.
  • Feature - #uniqueField# Makes sure that no duplicate values are extracted for the specified field(s) during the same exploration. (An alternative to deduplication while scraping, in case volumes are too large to post-process it.)
  • Feature - #setValue# Sets the value of the <select> or <input> HTML block matching the format column, to the value passed in the replace column.
  • Feature - #restartEvery# Sets 'auto-explore on startup' flag to true and restarts the application, every n pages or seconds.
  • Feature - #uncheckURLInQuery# Unchecks the 'OK' checkbox of the first line containing the current URL in the passed query directory.
  • Feature - #uncheckItemInQuery# Unchecks the 'OK' checkbox of the first line containing the string extracted by the scraper line in the passed query directory.
  • Enhancement - It is now possible to set the field name with a variable in the #default# directive.
  • Feature - #readFromQueries# Reads the next active string from the passed query directory and stores its value in the passed variable, then unchecks the line in the query directory.
  • Feature - #switchTo# Changes the current view to the value set in the replace column.
  • Feature - #reapply# now accepts parameters for the number of applications and the delay between them.
  • Feature - #adler32()# Used in the replacement column, allows you to generate a short hash from the string extracted by the scraper line. (This can be useful for deduplication although it is not 100% reliable as, even if it is unlikely, two different strings can result in the same hash.)
  • Feature - #encodeBase64()#, #decodeBase64()# Converts the string extracted by the scraper line into a base64 encoded string or decodes it into plain text.
  • Feature - #decode()# Decodes the string extracted by the scraper line into plain text, trying several algorithms.
  • Feature - #unique()# Only returns the string extracted by the scraper line if the value is unique during the same exploration. (An alternative to deduplication while scraping, in case volumes are too large to post-process it.)
  • Feature - #WEEK# was added to the time variables. Returns the week number in the year.
  • Feature - #LAST-POST-QUERY# returns the last POST query sent. #LAST-POST-QUERY#param# returns the value of the passed parameter in the last POST query sent.
  • Feature - Several tools were added to the right-click menu on datasheets: Insert Index Column, Duplicate Column, Indexed Duplicate Column, Copy from Column..., Select if in...
  • Feature - When scraping a self-updating AJAX page, the #reapply# directive can now run the extraction n times at the frequency you choose.
  • Enhancement - Faster start preparation and end of process cleaning in large volume Fast Scrapes.
  • Enhancement - The contact recognition module and its dictionary were enhanced; lax recognition and the elimination of dummy email addresses were improved.
  • Enhancement - Improved dictionary of multilingual words, acronyms and roots frequently used in company names, addresses, etc., to enhance recognition.
  • Fixes - Many more enhancements and fixes throughout the code.
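
Several functions in this release (#decodeEntities#, #decodeURL#, #encodeBase64()#, #decodeBase64()#, #adler32()#) correspond to standard transformations. The Python sketch below shows the equivalent standard-library operations as an illustration only; it does not reproduce OutWit Hub's implementation, and the helper names adler32_hex and deduplicate are assumptions made for this example.

```python
import base64
import html
import zlib
from urllib.parse import unquote


def adler32_hex(text):
    """Return the Adler-32 checksum of a string as 8 hex digits."""
    return format(zlib.adler32(text.encode("utf-8")), "08x")


def deduplicate(values):
    """Keep the first value for each Adler-32 hash.

    As the changelog notes, two different strings can share a hash,
    so a distinct value may occasionally be dropped.
    """
    seen = set()
    unique = []
    for value in values:
        digest = adler32_hex(value)
        if digest not in seen:
            seen.add(digest)
            unique.append(value)
    return unique


if __name__ == "__main__":
    print(html.unescape("Tom &amp; Jerry"))          # HTML entity decoding
    print(unquote("hello%20world"))                  # URL decoding
    encoded = base64.b64encode(b"OutWit")            # base64 round trip
    print(encoded, base64.b64decode(encoded))
    print(adler32_hex("Wikipedia"))                  # short hash
    print(deduplicate(["a", "b", "a"]))
```

The hash is short and fast to compute, which is why it suits deduplication on large volumes, at the cost of rare collisions.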

New in OutWit Hub Light 7.0.0.56 (May 7, 2018)

  • Feature - (Expert & Enterprise) Added 'Duplicate Column', 'Insert Index Column', etc., to the right-click menu on datasheets.
  • Fixes - Many minor fixes and optimizations.
  • Fix - There was a regression in 7.0.0.55 that could prevent correct scraping in Fast mode. This was fixed in 7.0.0.56.

New in OutWit Hub Light 7.0.0.36 (Mar 2, 2018)

  • Features - Hundreds of new features and fixes. (detail coming)

New in OutWit Hub Light 6.0.0.72 (Apr 7, 2017)

  • Feature - The text size control from the View menu now increases or decreases the size of the page text as well as the extracted data.
  • Fixes - Many fixes, in particular on inline editing in sorted datasheets and managers and in the scrollToEnd function.

New in OutWit Hub Light 6.0.0.51 (Apr 7, 2017)

  • Editions - New Expert Edition: OutWit Hub now comes in three different editions: Pro, Expert and Enterprise. Expert is single user and contains all features that were reserved to the Enterprise edition until version 5.0. Enterprise now allows several users or instances to share common automators.
  • Feature - (Expert & Enterprise editions) #suspend#n#, #suspendIf#n#, #suspendIfNot#n#: added a parameter to wait for n seconds before resuming when the OK button is clicked. (Useful to give the user time to interact with the page, solve a captcha, etc.)
  • Feature - (Expert & Enterprise editions) #firstName(string)#, #lastName(string)#, #firstLastName(string)#, #gender(string)#: try to find the most likely first, last, or first & last name, or the gender, in the passed full name string.
  • Feature - Pro users can now organize their automators (scrapers, macros, jobs, queries), grouping them by projects.
  • Features - A large series of directives and functions was added to the pro version: #autoEmpty#, #autoCatch#, #emptyOnDemand#, #deduplicate#, #default#, #default#fieldName#, #pauseBefore#, #checkIfURL# and #checkIfNotURL#, #encodeURL()#, #SECOND# ... #FIFTH#, #LOCALIP#.
  • Features - New Directives were added to Expert & Enterprise editions: #scope# (outside or within domain, all links or with a depth of 1 or 2), #deduplicateOnStop#criterionColumnName#, #deduplicateWithinPage#, #scrollToEnd#cssSelector#...
  • Feature - (Expert & Enterprise editions) Added preference to create additional Gender column when using the Insert First/Last Name function in the right-click menu. The column contains the string defined in the preference (like "Dear Mr", "Dear Ms") when the gender is recognized and a fallback value (like "Dear Customer") otherwise.
  • Feature - (Expert & Enterprise editions) The words view now includes a text box where to type or paste the words to count in the page.
  • Enhancement - The Scroll to End directive was enhanced to work in more AJAX pages.
  • Enhancement - The email recognition module now allows for diacritic characters, eliminates more dummy email addresses, and handles lax recognition (jackie at mysite dot com...) much more efficiently.
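
The lax form mentioned in the email recognition entry (jackie at mysite dot com) can be illustrated with a small Python sketch. This is a deliberately simple assumption of how such text might be normalized, not the program's actual algorithm; the pattern LAX_EMAIL and the function normalize_lax_emails are hypothetical names.

```python
import re

# Hypothetical pattern: "<name> at <label> dot <label> [dot <label> ...]".
LAX_EMAIL = re.compile(
    r"\b([\w.+-]+)\s+at\s+([\w-]+(?:\s+dot\s+[\w-]+)+)\b",
    re.IGNORECASE,
)


def normalize_lax_emails(text):
    """Rewrite 'name at domain dot tld' sequences as name@domain.tld."""
    def rebuild(match):
        local = match.group(1)
        # Collapse every " dot " separator in the domain part into ".".
        domain = re.sub(r"\s+dot\s+", ".", match.group(2), flags=re.IGNORECASE)
        return f"{local}@{domain}"

    return LAX_EMAIL.sub(rebuild, text)


if __name__ == "__main__":
    print(normalize_lax_emails("write to jackie at mysite dot com today"))
```

A pattern this loose also matches ordinary phrases that happen to contain "at" and "dot", which is why real recognition needs extra filtering.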

New in OutWit Hub Light 5.0.1.57 (Apr 7, 2017)

  • Feature - added #checkIfURL# and #checkIfNotURL# scraping directives for extraction conditions on the current URL.
  • Fix - Fixes in abortIf, abortIfNot and abortAfter.

New in OutWit Hub Light 5.0.1.42 (Mar 6, 2017)

  • Feature - It is now possible to use a multiple character string as the CONCAT separator.
  • Feature - Added preference to name the fields in the queries of SQL exports.
  • Feature - Added #MaxColumns# directive to limit the number of columns in the extracted data.
  • Fix - fixed stalling explorations in certain cases when the server did not answer.
  • Fix - #REQUESTED-URL# works in more cases.
  • Fixes - several fixes and optimizations in contact extractions on large lists of URLs.
  • Enhancement - Enhancements and fixes in #suspendIf# and #formatDate()#.

New in OutWit Hub Light 3.0.2.15 (Jan 8, 2013)

  • Fix - Several fixes and enhancements in the job execution engine, preferences and management.
  • Fix - Splitting cells to columns could fail in the case of untitled columns. It should now be working in all cases.
  • Fix - The #addToQueue# directive and #nextToVisit()# function now also work with javascript links.
  • Enhancement - Checking/unchecking OKs on a multiple selection is now possible in the query editor, as in the managers.

New in OutWit Hub Light 3.0.2.5 (Jan 8, 2013)

  • Feature - Scrapers: added #FIRST#, #LAST#, #UNIQUE#, #STRICTLY-UNIQUE#, #DISTINCT#, #STRICTLY-DISTINCT#, #HAPAX#, #SHORTEST#, #LONGEST# replacement functions to the scraper engine.
  • Fix - Scrapers: corrected a problem that could cause a browse to stop before the queue of URLs to visit had been completely explored. This could happen when a #nextPage# directive was setting a next page link that had already been visited.
  • Feature - More User Gear management features were added to the File menu.
  • Feature - Regular expressions are now allowed in the datasheet Right-Click Edit>Replace... functions.
  • Fix - First name recognition was reactivated (colorizing, separation First/Last names are operational).
  • Fix - Enhancements and fixes in the export layout management.
  • Fix - The links extractor now properly handles 'Area' node links (image maps).
  • Enhancement - If data was grabbed from several pages of the same site, the default HTML export title only displays the domain name instead of the full URL.
  • Enhancement - Works with Firefox 4 to 18.
  • Enhancement - The underscore character (_) is now allowed in datasheet column headers and scraper field names.
  • Enhancement - Added a direct access to the FAQ from the Help menu.

New in OutWit Hub Light 3.0.1.47 (Jan 8, 2013)

  • Fix - Corrected a problem causing partial document extraction.

New in OutWit Hub Light 3.0.1.45 (Jan 8, 2013)

  • Feature - It is now possible to import the automators to the current User Gear from a backup User Gear file.
  • Fix - Corrected the slow saving of jobs that was happening in some cases.

New in OutWit Hub Light 3.0.1.40 (Dec 18, 2012)

  • Fix - Drag and drop is working again in the query editor (right panel).
  • Fix - Column order and numbering is now working in 3.0 in the same way as in 2.1.
  • Enhancement - Added Preferences to add DROP and CREATE instructions before the INSERTs in SQL Export.
  • Enhancement - It is now possible to run a Job immediately, by choosing 0 minutes in the 'start in' menu.
  • Enhancement - By default, the program asks to execute missed jobs at startup. A preference was added to prevent this.

New in OutWit Hub Light 3.0.1.34 (Dec 18, 2012)

  • Fix - The option checkboxes were inactive in views bottom panels. This was corrected.
  • Enhancement - The combined Browse & Dig process was fixed and optimized.
  • Feature - It is now possible to put catch/columnName in the start page textbox of the Macro editor.

New in OutWit Hub Light 3.0.1.33 (Dec 18, 2012)

  • Enhancement - Compatibility: Version 17 of Firefox dropped E4X, an important technology used to handle XML data. Several parts of the code were refactored to replace this library. This should not generate any changes in the behavior. Please notify us of any regression you may observe in the reading or saving of automators or in the execution of tutorials.
  • Enhancement - Automators: The User Gear (the file containing your scrapers, macros etc.) is now backed up at launch each time its size has changed.
  • Enhancement - Automators: Trying to import automators with an & in the path was failing. This was fixed.
  • Feature - Automators: The User Gear containing your automators, export layouts and thesaurus can now be saved/backed up on the hard disk and loaded into the application. This allows you to easily transfer your scrapers, macros... from one profile to the other or from the addon to the standalone version.
  • Fix - Automators: Fixed activation/deactivation of automators when clicking on an active checkbox in a multiple selection.
  • Fix - Command Line: Macros based on a local URL now also work when launched in command line.
  • Features - Datasheets: The right-click menu of all datasheets in the application was reorganized and now includes a number of additional features.
  • Feature - Datasheets: A new "Delete Unselected" item was added to the datasheets right-click menu. (Instead of doing Invert Selection then Delete.)
  • Feature - Datasheets: A new "Select Block" item was added to the datasheets right-click menu. It will select all the rows of the table/list/rss feed (depending on the view) where the selected row is located.
  • Feature - Datasheets: A series of features (colorization button, people name recognition & split, etc) were added to the datasheets. They are destined to be used with a simple user thesaurus feature which will be added in a later version. We already included them as they may prove interesting separately.
  • Feature - Datasheets: New options of the datasheet right-click menu allow you to delete columns with few populated cells. (This is interesting after large scrapes, when useless columns have been created by marginal data fields.)
  • Features - Datasheets: Replacement functions were added to the right-click menu on datasheets.
  • Enhancement - Datasheets: Multiple optimizations were brought to datasheets behavior and coloring when selecting / modifying cells or rows.
  • Enhancement - Datasheets: The Normalize Figures function was enhanced and can now be applied on a whole datasheet.
  • Enhancement - Datasheets: The number of columns allowed in a row of any datasheet is now limited to 120. This value is arbitrary; you can change it in the preference panel. Note, however, that performance decreases dramatically with several hundred columns.
  • Fix - Datasheets: Column headers are now cleared when the data is deleted from a datasheet.
  • Enhancement - Detail: The Detail panel was modified to be able to display even very large fields in very large records.
  • Enhancement - Dig: Added sql, csv to the list of diggable file types.
  • Feature - Export: An Export Preview panel is available in all views on the left of the datasheet.
  • Enhancement - Export: The SQL export was corrected and enhanced. It is now possible to define an export file name that includes a . (dot), allowing you to address a database as well as the table: if you name the file myDB.myTable.sql, the insert statements will be done to the database myDB and the table myTable.
  • Enhancement - Export: Added thead/tbody to table export.
  • Feature - Export: The Edit button at the top right of the Export Preview panel gives access to the Export Layout Editor, where you can choose which column should be exported, in what order, with what separator and in what style.
  • Fix - Exports: Preferences now have the priority over the default export layout for separators in CSV export.
  • Enhancement - Fast Scraping: Added a cache in fast scraping, which means that the application will not send the same query twice during a fast scrape.
  • Enhancement - Fast Scraping: The Fast Scrape function was dramatically optimized for large numbers of queries. This should not be noticeable for casual use, but if you scrape tens of thousands of URLs or more in one batch, it will.
  • Enhancement - Guess: Import / recognition of separated or tabulated text files (CSV, TXT, SQL...) by the guess view was enhanced.
  • Enhancement - Guess: Improved date & people name recognition.
  • Enhancement - Guess: The guess view understands more data file formats and should better handle field contents that include separators.
  • Fix - Help: Fixed navigation buttons in the Help Panel.
  • Enhancement - Install: It is now possible to install or update the Hub directly by clicking on the link of an .xpi file.
  • Enhancement - Install: The Mac version is now digitally signed (like the Windows version) to avoid alerts in Mountain Lion.
  • Enhancement - Install: Works again on 32bit Macs as well as 64bits.
  • Enhancement - Jobs: Jobs with a past date are now automatically unchecked.
  • Feature - Jobs: Jobs can now contain multiple macros. Several actions can also be included in a job (reload, reapply, quit, pause...).
  • Fix - Jobs: Fixes in job execution after quitting and restarting the application.
  • Feature - Lists: and , ... tags can now be included in the lists extraction result, when the "Add Titles" option is checked in the "lists" bottom panel.
  • Enhancement - Log: It is now possible to copy data from the log panel.
  • Enhancement - Log: The Log Panel at the top of each view can slide towards the right and give room to the current page so that you see in the same screen the original page and the data being extracted.
  • Enhancement - Macros: Cosmetic work on the Macro editor (serious refactoring is still overdue, but this is already better).
  • Fix - Macros: Activation/deactivation of scrapers in the execution of macros is behaving slightly differently from 2.1. Please look for side effects in your previous macros.
  • Fix - Macros: Editable menu lists are back in the macro editor to select dynamic columns in data extractors.
  • Fix - Macros: In rare cases, macros could stall randomly during execution, due to redirections in social network iFrames (facebook likes, etc.). A monitoring module was added which reactivates the macro in these cases.
  • Features - Ordinal: A new 'Ordinal' column was added to all datasheets (hidden by default, check it in the column picker in the upper right corner to see it). This column contains an id composed of three numbers separated by dots. The first number is incremented each time a new extraction is done when the 'empty' checkbox is not checked. The second number is only used in lists, tables, scraped and news: it is incremented for each new list, table or rss feed in the document, the third number is the order number of the occurrence of the item in the document. This can be very useful to select specific items or ranges of items with the 'selectIf' textbox.
  • Enhancement - Ordinal: The ordinal ID of fast scrape results has changed to make it match the format of the other views. What was the ordinal ID in previous versions is now the second group of digits in the new format.
  • Feature - Page: The 'Workshop' option of the Navigation menu gives access to a blank page where you can paste data to be processed by the Hub. When you right-click in the page, note the 'Paste Text' option of the menu: it allows you to paste data from Excel, Word or other applications without formatting. (Guess should do a better job with data pasted this way from spreadsheets.)
  • Enhancement - Page: Added sql to the list of accepted file extensions. SQL files can now be opened and processed with OutWit Hub.
  • Enhancement - Performances: Link extraction now takes almost half the time on page load. This is noticeable on very large pages like the United States article on Wikipedia.
  • Enhancement - Performances: Improved loading time when the Catch contains rows with large numbers of columns.
  • Enhancement - Performances: The addition of large blocks of data to a sorted datasheet has also been made dramatically faster.
  • Enhancement - Preferences: A download rename pattern (preference) was added: filename truncation option [original name,10] (first 10 chars) or [original name, -30] (last 30 chars).
  • Enhancement - Preferences: The default setting (ignoring non-latin characters, in the words view) was modified to work better with Central European languages.
  • Feature - Preferences: A new preference was added to the Advanced panel, allowing the 'words' views to extract words in non-latin characters. (This will be refined as we go, following our users' feedback.)
  • Feature - Preferences: Added preference for replacement of return characters in 'Clean Text'.
  • Feature - Preferences: Added preference to ignore HTTP errors while Fast Scraping.
  • Fix - Preferences: Corrected temporization scale imprecisions in time preference.
  • Fix - Preferences: The 'Bypass Cache' preference is working again (it could not be changed and remained set to 'always').
  • Feature - Proxy: Proxy setting can now be defined within the Hub.
  • Fix - Registration: With profiles containing hundreds of automators, the program could start in light mode and switch to pro after a few seconds. This was corrected.
  • Feature - Scrapers: New replacement functions were added to scrapers: #MAX#, #MIN#, #AVERAGE#, #SUM#, #CONCAT# and #COUNT#. (see tutorial.) These should be entered in the replacement field of a scraper line to get one cumulated/computed value instead of all the matching values.
  • Enhancement - Scrapers: It is now possible to set the value of a scraper variable with a variable or function.
  • Enhancement - Scrapers: Several enhancements and fixes were brought to the scraping engine (please test your old scrapers).
  • Enhancement - Scrapers: The Scraper code has changed (please test your old scrapers), mostly in the way cell text is cleaned. Performances should be better in many cases.
  • Enhancement - Scrapers: When a scraper has a higher priority than the one you are trying to use, the dialog now allows you to disable it and run the one you chose.
  • Feature - Scrapers: added #check# directive (same as #checkIf# and #checkIfNot#, but without a condition).
  • Feature - Scrapers: added #keepOrder# directive which allows you to set this option in the scraper rather than manually or in a macro.
  • Feature - Scrapers: added #setAnchorRow# #useAnchorRow# #useCurrentRow# directives which allow you to scrape columns in HTML pages that are made using left or right floating blocks.
  • Feature - Scrapers: New directives were added to scrapers. #caseSensitive#, #checkIf#, #checkIfNot#, #setDLRenamePattern#, #setExportRenamePattern#, #insertIf#, #insertIfNot#, #setPauseInterval#, #setPauseDuration#...
  • Features - Scrapers: A #pause# directive was added to the scrapers (enter the pause duration in seconds in the replacement field).
  • Features - Scrapers: The directive #addToQueue# was added to the scrapers. It lets you grab URLs to be visited without using a variable, and use #nextToVisit()# without a parameter.
  • Features - Scrapers: The directive #showNextPageCandidates# was added to the scrapers, for debug purposes.
  • Fix - Scrapers: #TIME#, #DATE#, #DATETIME# are now working even if 'before', 'after' and 'format' fields are empty in a scraper line.
  • Fix - Scrapers: Activating / deactivating scrapers was not working properly in the light version.
  • Fix - Scrapers: Corrected problem happening in some cases when using \0 in a #replace# scraper directive.
  • Fix - Scrapers: Fixed a recent bug with #URL# and #BASEURL# in scrapers.
  • Enhancement - Source: Source code extraction and colorization was optimized. The source display time should be divided by three or more compared to v2.1.
  • Enhancement - Source: Source colorization has changed slightly. Image links are now appearing in dark yellow.
  • Feature - String Generation Patterns: A more efficient editor was added for creating string generation patterns (matrices).
  • Feature - String Generation Patterns: In enumerations like [a;b;c][1;2;3;4] added modulo fill when group patterns do not have the same length (instead of showing error).
  • Feature - String Generation Patterns: It is now possible to generate alternated strings for cases where you may need to go back to a menu between pages, for instance.
  • Feature - String Generation: Added hexadecimal and alphanumerical string generation (like in [00:FF]).
  • Enhancement - Tables: In the table extractor, the algorithm deciding which link to put in the first column was modified to select the most interesting link (in most cases).
  • Enhancement - Tutorials: Since Firefox 11, on Macintosh, a bug was preventing UI element animations in tutorials and wizards. To highlight interface items, we have now replaced the lens effect on Mac by the circling of designated elements and reactivated the feature.
  • Fix - Tutorials: Fixed highlighting and animations in tutorials.
  • Enhancement - UI: A time estimate was added in the status bar during automatic explorations.
  • Enhancement - UI: The program interface was somewhat altered, in particular to allow for Export Layout personalization.
  • Fix - More fixing, cleaning, debugging, optimizing, refactoring, beautifying and all that sort of thing.
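
The String Generation entries above mention hexadecimal ranges such as [00:FF] and a "modulo fill" for enumerations like [a;b;c][1;2;3;4]. The Python sketch below shows one plausible reading of both, under stated assumptions: that a range is expanded with zero padding, and that groups are combined position by position with shorter groups cycling. The changelog does not spell out these semantics, and the function names hex_range and expand_groups are hypothetical.

```python
from itertools import cycle, islice


def hex_range(start, end):
    """Expand a range such as 00:FF into zero-padded uppercase hex strings."""
    width = max(len(start), len(end))
    first, last = int(start, 16), int(end, 16)
    return [format(n, f"0{width}X") for n in range(first, last + 1)]


def expand_groups(groups):
    """Combine groups position by position, cycling the shorter ones.

    Assumption: "modulo fill" means the item at position i of a group
    of length n is taken at index i % n.
    """
    longest = max(len(group) for group in groups)
    padded = [list(islice(cycle(group), longest)) for group in groups]
    return ["".join(parts) for parts in zip(*padded)]


if __name__ == "__main__":
    print(hex_range("00", "FF")[:4])
    print(expand_groups([["a", "b", "c"], ["1", "2", "3", "4"]]))
```

Under this reading, the three-item group wraps around once so that the fourth generated string reuses "a".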

New in OutWit Hub Light 2.1.4.41 (Nov 21, 2012)

  • Enhancement - Compatibility update.
  • Enhancement - Upgrade system to OutWit Kernel 3.0.

New in OutWit Hub Light 2.1.4.37 (Nov 21, 2012)

  • Enhancement - Compatibility update for Firefox 16.0.1 on Windows.

New in OutWit Hub Light 2.1.4.32 (Sep 15, 2012)

  • Enhancement - Compatibility update for Firefox 15.0. - Works with FF 4.0 to 17.

New in OutWit Hub Light 2.1.4.31 (Sep 3, 2012)

  • Fix - Checkboxes were not visible anymore in the macro editor with Firefox 15.0. This was corrected.
  • Enhancement - Compatibility update for Firefox 15.0. - Works with FF 4.0 to 16.

New in OutWit Hub Light 2.1.4.28 (Aug 29, 2012)

  • Fix - fixed a rare error in the words view extraction.
  • Enhancement - removed critical update alert (follow version history if you deactivate auto update in Firefox).
  • Enhancement - compatible with Firefox 4 to 16.

New in OutWit Hub Light 2.1.4.22 (Jun 20, 2012)

  • Fix - The naming of files when exporting multiple automators didn't display the automator type properly. This was corrected.
  • Fix - Removed an alert that prevented typing a query name in the Macro As URL field.
  • Fix - Corrected a source code loading error that could occur during the exploration of dynamic pages with a very short temporization.
  • Enhancement - This version is compatible with Firefox 4 to 13 (as well as Beta 14 and Aurora 15).
  • Enhancement - Smoother animations in tutorials.
  • Enhancement - Blocking alerts from sites are disabled during automatic explorations.

New in OutWit Hub Light 2.1.4.3 (May 18, 2012)

  • Feature - Added the possibility to add [Ordinal] in the file rename pattern of Preferences>Exports to include the sequential number of the explored page during a browse or a dig. (This simplifies the matching of downloaded images with extracted data, for instance.)
  • Feature - Added the #ORDINAL# replacement variable to scrapers, which allows you to add the sequential number of the explored page to a scraped record, during a browse or a dig. (This simplifies the matching of downloaded images with extracted data, for instance.)
  • Fix - The search engine management panel is operational again.
  • Fix - File downloads were failing if 'ignore' was chosen in the file exist menu of the macro editor. This was fixed.
  • Enhancement - Removed useless disabled options in the action menu of the job editor (they will be implemented some day).

New in OutWit Hub Light 2.1.4.1 (May 12, 2012)

  • Enhancement - Quicker HTML export.
  • Enhancement - Global performance improved when automatic images extraction is on.
  • Enhancement - Added #nextPage!# scraper rule to force next page to be taken into account even if the URL has already been visited in a browse or in a dig.
  • Fix - Fixed a link extraction bug that was blocking the browse or dig process in some specific dynamic pages.
  • Fix - Matrix generation was looping if the "Matrix" checkbox was checked although the string was not a matrix.

New in OutWit Hub Light 2.1.3.16 (May 8, 2012)

  • Fix - Minor fixes in macros.
  • Fix - Fix problem with some Windows computers on Firefox 12.
  • Enhancement - Works with Firefox 4 to 14.

New in OutWit Hub Light 2.1.3.11 (Apr 30, 2012)

  • Feature - Can now directly receive and execute macros/jobs generated by OutWit Images.
  • Feature - Portrait/Landscape options were added in the bottom panel of the 'images' view.
  • Feature - The "select if" text box in the 'images' view now accepts '800x600', '800x', 'x600', 'x800x' to compare, respectively: width AND height, width, height, and width OR height.
  • Feature - A renaming preference was added to the Export Preferences, to set renaming patterns for downloaded files and exported files. In the pattern, a succession of "#" means the addition of a numerical increment (with leading zeros) and "[OriginalName]" inserts the original base name of the file.
  • Fix - Open was failing in some cases when the file extension was in capital letters.
  • Fix - Several minor fixes in macros, jobs.
  • Enhancement - Small interface changes, including the ability to preview the macros and jobs interface in the light version.
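
The renaming pattern described above can be illustrated with a short Python sketch. This is not OutWit Hub's own code: the function name apply_rename_pattern and the example file names are hypothetical, and the sketch assumes that the number of "#" characters in a run sets the width of the zero padding, which the changelog entry does not state explicitly.

```python
import re


def apply_rename_pattern(pattern, original_name, index):
    """Expand a renaming pattern (hypothetical helper, for illustration only).

    Assumption: a run of N '#' characters becomes the increment padded
    with leading zeros to at least N digits.
    """
    # Replace every run of '#' with the zero-padded increment.
    numbered = re.sub(
        r"#+",
        lambda match: str(index).zfill(len(match.group(0))),
        pattern,
    )
    # Insert the original base name of the file where requested.
    return numbered.replace("[OriginalName]", original_name)


print(apply_rename_pattern("img_###_[OriginalName]", "photo", 7))
# prints: img_007_photo
```

The "#" runs are expanded before "[OriginalName]" is substituted, so a "#" that happens to appear in the original file name is left untouched.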

New in OutWit Hub Light 2.1.2.32 (Mar 29, 2012)

  • Feature - Dragging data from the catch to the Page now works.
  • Fix - Multiple selection for exporting automators was not working on Windows. This was corrected.
  • Enhancement - Works with Firefox 4 to 13.

New in OutWit Hub Light 2.1.2.14 (Mar 13, 2012)

  • Fix - Corrected reload button sometimes hidden on Windows.
  • Fix - Fixed a regression in "Browse All Pages".
  • Fix - In very rare cases, sites with JavaScript-generated popups could crash the application. This was fixed.
  • Enhancement - Works with Firefox 4 to 11.
  • Enhancement - Very large directories of queries are better handled. Directories of 50k items usually work smoothly, and we have tested up to 500k. At these levels, the application requires a very large amount of RAM.
  • Enhancement - Moving very large numbers of rows to the catch could take an extremely long time. This was optimized further.
  • Enhancement - A confirm dialog was added to the datasheet Empty buttons.

New in OutWit Hub Light 2.1.1.26 (Mar 3, 2012)

  • Fix - History/typeahead fixed in the search bar textbox.
  • Fix - Corrected a bug that prevented exports in certain cases for the document and news views.
  • Fix - Minor fixes and enhancements in the query manager.

New in OutWit Hub Light 2.1.1.20 (Feb 13, 2012)

  • Feature - Added more options to the browse popup menu.
  • Fix - Corrected issue with the "active" and "matrix" checkboxes of the "queries" view.
  • Enhancement - Optimized the loading of very large directories of queries.
  • Enhancement - Optimized the launch time before a browse or a fast scrape on very large numbers of URLs.

New in OutWit Hub Light 2.1.1.9 (Feb 13, 2012)

  • Enhancement - Kernel update for Firefox 10.
  • Enhancement - Improvements in the upgrade/downgrade system.

New in OutWit Hub Light 2.1.1.5 (Feb 13, 2012)

  • Fix - Removed time limitation that was added during beta period (sorry for the inconvenience).
  • Fix - Various fixes.

New in OutWit Hub Light 2.1.1.3 (Jan 28, 2012)

  • Feature - Added a preference to make popup links open within the Hub instead of in a new Firefox window.
  • Feature - Added a preference to prevent an alert from being displayed 10 seconds before the execution of a job.
  • Feature - First implementation of POST queries in the Hub (to automatically fill forms). POST data must be added to the URL like ordinary parameters (after ? and separated by &), but the name of each POST parameter must be prefixed with #POST#.
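
As an illustration of the #POST# convention described above, the following Python sketch builds such a URL. The helper name build_hub_post_url, the example URL and the parameter names are all hypothetical. The changelog does not say whether values must be URL-encoded, so the sketch leaves them as given.

```python
def build_hub_post_url(base_url, post_params):
    """Append POST parameters to a URL using the #POST# name prefix.

    Hypothetical helper, for illustration only. Values are inserted
    as-is (encoding is not covered by the changelog entry).
    """
    pairs = [f"#POST#{name}={value}" for name, value in post_params.items()]
    # Use '?' for the first parameter, '&' if the URL already has a query.
    separator = "&" if "?" in base_url else "?"
    return base_url + separator + "&".join(pairs)


print(build_hub_post_url("http://example.com/login", {"user": "alice", "lang": "en"}))
# prints: http://example.com/login?#POST#user=alice&#POST#lang=en
```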

New in OutWit Hub Light 2.1.0.51 (Jan 20, 2012)

  • Enhancement - The Hub is now available on all platforms, both as a standalone application and as a Firefox extension.
  • Enhancement - A number of fixes and enhancements in the scraper engine.
  • Enhancement - The ctrl-S, ctrl-F / cmd-S, cmd-F (save & find) commands are now working on the page even if the browser doesn't have the focus.
  • Enhancement - A count of visited and remaining pages is now displayed in the status bar during automatic exploration.
  • Enhancement - A throbber was added in the address bar while loading a page.
  • Enhancement - A list of available tutorials is displayed at first run. It can then be accessed with the Tutorials item of the Help menu.
  • Fix - The addition of an "i" flag is now working properly in regexps, when clicking on the "ignore case" checkbox of the find bar.
  • Fix - Opening a Help window from a tutorial works again.
  • Fix - A bug was corrected in the clean text function which was trimming +=- characters at the end of a string in the extracted data.
  • Fix - The #replace# directive could lead to an infinite loop in some cases. This was corrected.
  • Known Issue - There are occasional problems in the renaming of queries. (Workaround for now: edit the cell and hit return.)