What's new in dtSearch Network with Spider 7.66.7922

Oct 15, 2010
  • dtSearch Engine:
  • Added .NET 4.0 versions of the .NET API (dtSearchNetApi4.dll, dtSearch.Spider4.dll) and sample code for C# .NET 4.0 and VB.NET 4.0
  • Added dtsSearchFastSearchFilterOnly search flag to enable much faster, optimized generation of a SearchFilter from a search when no other output is required from the search.
  • Added WordListBuilder.GetLastError to the C++, Java, and .NET APIs to provide better reporting of errors resulting from WordListBuilder calls.
  • Added new flag to enable caching of field values in WordListBuilder to make ListFieldValues calls faster. The flag is dtsWordListEnableFieldValuesCache (in the WordListBuilderFlags enumeration) and is passed to WordListBuilder using the new SetFlags method.
  • Added new .NET method Server.SetEnginePath to allow ASP.NET application deployment without administrative access
  • Added new .NET sample application, AzureDemo, demonstrating use of the dtSearch Engine in an Azure instance. For documentation explaining how to deploy in Azure, see:
  • Overviews > Installing the dtSearch Engine > Deployment steps: Azure 64-bit (in dtSearchApiRef.chm).
  • Added a way to disable file parsers using the file type table (filetype.xml) by setting the TypeId to the id of the parser to disable and the Flags value to 2.
  • All Products:
  • Faster indexing of binary data using the filtering algorithm
  • Fixes and minor enhancements:
  • In dtSearch Desktop, added SizeK, IndexRetrievedFrom, SearchDate, ReportDate variables to SearchReportTemplate.rtf and SearchListTemplate.rtf
  • Java and .NET API: Fixed IIndexStatusHandler bug causing PercentDone to remain zero during compressiong of an index
  • Added docId of document being removed from an index to IndexFileInfo reporting through IIndexStatusHandler
  • File parser bug fixes affecting Office XML drawings embedded in Word, PowerPoint, and Excel files; interpretation of OEM character codes (_x00NN_) in Excel 2007 files.
  • Added SearchJob.UnindexedSearchFlags in the .NET API and SearchJob.setUnindexedSearchFlags in the Java API to enable case and accent-sensitive unindexed searches in these APIs
  • Added .NET SearchFilter.GetItems() to provide access to an array of the doc ids selected in a SearchFilter

New in dtSearch Network with Spider 7.66.7918 (Sep 30, 2010)

  • dtSearch Engine:
  • Added .NET 4.0 versions of the .NET API (dtSearchNetApi4.dll, dtSearch.Spider4.dll) and sample code for C# .NET 4.0 and VB.NET 4.0
  • Added dtsSearchFastSearchFilterOnly search flag to enable much faster, optimized generation of a SearchFilter from a search when no other output is required from the search.
  • Added WordListBuilder.GetLastError to the C++, Java, and .NET APIs to provide better reporting of errors resulting from WordListBuilder calls.
  • Added new flag to enable caching of field values in WordListBuilder to make ListFieldValues calls faster. The flag is dtsWordListEnableFieldValuesCache (in the WordListBuilderFlags enumeration) and is passed to WordListBuilder using the new SetFlags method.
  • Added new .NET method Server.SetEnginePath to allow ASP.NET application deployment without administrative access
  • Added new .NET sample application, AzureDemo, demonstrating use of the dtSearch Engine in an Azure instance. For documentation explaining how to deploy in Azure, see:
  • Overviews Installing the dtSearch Engine Deployment steps: Azure 64-bit (in dtSearchApiRef.chm).
  • All Products:
  • Faster indexing of binary data using the filtering algorithm
  • Fixes and minor enhancements:
  • In dtSearch Desktop, added SizeK, IndexRetrievedFrom, SearchDate, ReportDate variables to SearchReportTemplate.rtf and SearchListTemplate.rtf
  • Java and .NET API: Fixed IIndexStatusHandler bug causing PercentDone to remain zero during compressiong of an index
  • Added docId of document being removed from an index to IndexFileInfo reporting through IIndexStatusHandler
  • File parser bug fixes affecting Office XML drawings embedded in Word, PowerPoint, and Excel files; interpretation of OEM character codes (_x00NN_) in Excel 2007 files.
  • Added SearchJob.UnindexedSearchFlags in the .NET API and SearchJob.setUnindexedSearchFlags in the Java API to enable case and accent-sensitive unindexed searches in these APIs

New in dtSearch Network with Spider 7.65.7907 (Sep 15, 2010)

  • Fixed bug in dtSearch Desktop indexer causing it to forget index caching setting if the "Clear index before adding documents" box is checked in the Update Index dialog box
  • Fixed file parser bug affecting indexing of QuickBooks backup (*.qbb) files

New in dtSearch Network with Spider 7.64.7876 (Apr 20, 2010)

  • Enhancements (dtSearch Engine):
  • Added dtsSearchLanguageAnalyzerSynonyms flag to enable using a language analyzer to generate morphological variations on a search term at search time. When this flag is set, the language analyzer is called for each word or phrase in the search request. The flag dtsLaInputIsSearchTerm is passed to the language analyzer in dtsLaJob.flags, so the language analyzer knows why it is being called.
  • Added dtssGetWordBreaker API function to provide direct access to the dtSearch Engine's internal word breaker using the language analyzer API. For sample code demonstrating how to use this API, see the WordBreak example in examples\vc8\WordBreak.
  • Added more structural information to the output generated by conversion to the it_ContentAsXml file format.
  • Added to COM interface: WordListBuilder.ListFieldValues, WordListBuilder.SetFilter, and IndexJob.EnumerableFields.
  • Added dtsListIndexSkipNoiseWords flag for ListIndexJob to list words in an index without including any noise words.
  • Added dtsoFfSkipDataSourceFields flag for Options.FieldFlags to prevent DocFields values from appearing in FileConverter output
  • Fixes and minor enhancements:
  • Fixed incorrect display of CreationDate and ModDate properties in PDF files
  • Fixed incorrect hit highlighting when Unicode Filtering options at search time different from options used to index a file. To ensure consistent options, Unicode Filtering options are stored in the index when the index is created, in the index_a.ix file.
  • Fixed error updating index when directory specified for temporary files is inaccessible.
  • Fixed index merge bug causing "Inconsistent doc ids from target index" error during merge.
  • Fixed two search report bugs causing incorrect hit highlighting.
  • Improved formatting of documents converted from Ami Pro and Quattro Pro to HTML
  • Added automatic detection of gb2312 and JIS encoding.
  • Added automatic detection of XyWrite, XBase, WordStar 3.x, and WordPerfect 4.2 and TAR files.
  • Improved reporting of file types by FileConverter.DetectedTypeId, providing much more specific information about Microsoft Word versions and adding type detection for additional file formats
  • Added support for text extraction from Adobe Framemaker MIF, XFA form templates in PDF files, and Visio XML files
  • Fixed "Excessive nesting" error indexing OpenOffice document due to bug parsing table structure
  • Fixed RTF file parser bug affecting handling of the \upr tag
  • Other file parser bug fixes affecting Multimate, Lotus 1-2-3, PDF, Word, PowerPoint

New in dtSearch Network with Spider 7.63.7836 (Apr 20, 2010)

  • Fixes:
  • Fixed problem running dtSearch.exe on some systems after installing the kb973923 patch from Windows Update.
  • Fixed missing checkboxes in dtWebSetup64.exe under Windows Server 2008
  • Compatibility notes for developers working with the .NET 2.0 API only:
  • In dtSearch 7.63, the DLL dependencies for dtSearchNetApi2.dll have changed due to the release of the Visual Studio .NET 2005 Service Pack 1 Security Update for ATL. Because dtSearchNetApi2.dll is built with the updated version of Visual Studio .NET 2005, it requires the updated MFC and CRT DLLs that are included with that version.
  • This issue does not affect any other dtSearch Engine API.
  • This Microsoft redistributable program will install the required components:
  • Microsoft Visual C++ 2005 Service Pack 1 Redistributable Package ATL Security Update (July 28, 2009)
  • http://www.microsoft.com/downloads/details.aspx?familyid=766A6AF7-EC73-40FF-B072-9112BAB119C2&displaylang=en

New in dtSearch Network with Spider 7.63.7835 (Apr 20, 2010)

  • dtSearch Engine:
  • Added IndexFileInfo.UserFields in .NET API to provide access to stored fields through the IIndexStatusHandler callback interface during indexing.
  • Added dtsnIndexDeletedFileRemoved, dtsnIndexListedFileRemoved, and dtsnIndexListedFileNotRemoved notifications to the indexing status callbacks to notify the calling application when files are removed from the index during indexing or when an attempt to remove a listed file fails.
  • Compatibility note for developers working with the .NET 2.0 API only: The DLL dependencies for dtSearchNetApi2.dll have changed due to the release of the Visual Studio .NET 2005 Service Pack 1 Security Update for ATL. Because dtSearchNetApi2.dll is built with the updated version of Visual Studio .NET 2005, it requires the updated MFC and CRT DLLs that are included with that version. This Microsoft redistributable program will install the required components:
  • Microsoft Visual C++ 2005 Service Pack 1 Redistributable Package ATL Security Update (July 28, 2009)
  • http://www.microsoft.com/downloads/details.aspx?familyid=766A6AF7-EC73-40FF-B072-9112BAB119C2&displaylang=en
  • This issue does not affect any other dtSearch Engine API.
  • Fixes and minor enhancements:
  • Fixed bug in search report generation causing text in adjacent table cells to be run together in output
  • dtSearchw.exe: Fixed "Invalid character" error in dtSearch Desktop opening document
  • dtSearchw.exe: Fixed bug affecting use of drag-and-drop to re-order columns in search results, causing the wrong column order to result
  • Fixed memory leak (in version 7.62 only) when using regular expressions in File Segmentation Rules to split documents or in Text Fields definitions.
  • File parser bug fixes affecting: MS Word 2007, PDF, ZIP, OpenOffice.
  • dtSearchw.exe: Added option to suppress automatic correction of hit highlighting when a document was indexed with a different version or the document was modified since it was last indexed (in Options > Preferences > Document Display).
  • Fixed missing filename associations (*.ilb, *.dtSearch) in 7.62 setup program.
  • Fixed truncation of very long search report when generated using dtsReportWholeFile from cached text.

New in dtSearch Network with Spider 7.62.7804 (Apr 20, 2010)

  • All products:
  • Regular expression searching extended to support TR1 regular expressions
  • dtSearch Engine:
  • Java API: Added IIndexStatusHandler to Java API for monitoring of IndexJobs
  • Java API: Added IndexInfo object for more efficient retrieval of index properties from an index
  • Java API: Added SearchFilter.SelectItems() with array of doc ids
  • .NET API: Added SearchFilter.SelectItems() with array of doc ids
  • Java API: Added SearchJob.WantResultsAsFilter
  • FieldFlags: Added dtsoFfHtmlSkipImgAlt and dtsoFfHtmlSkipInputValues
  • Language Analyzer API: Added dtsLaBlockWasSkipped to LanguageAnalyzerWordFlags, providing a way for a language analyzer to request that the internal dtSearch word breaker handle a block of text from the input.
  • C++ API: Added userFields to dtsIndexProgressInfo, providing a way to access stored fields from a document as it is indexed
  • Added dtsConvertAutoUpdateSearch flag to ensure consistent hit highlighting when a document was modified since it was indexed or was indexed by an older version of dtSearch than is used to search it.
  • Fixes and minor enhancements:
  • dten600.exe: Faster generation of search results with dtsSearchWantHitDetails enabled
  • dtSearchw.exe: Fixed error causing "Enter Serial Number" dialog box to appear under Vista due to UAC problem
  • dten600.dll: Fixed error causing corrupt index with message referencing !zd and int vector error when automatic recognition of dates is enabled, hyphen processing is set to dtsoHyphenAll, and the last word of a document ends with a hyphen.
  • Added new cmap files for PDF text extraction.
  • Improved speed for "not (something)" and pure xfilter searches.
  • Reduced memory use for searches that retrieve large numbers of documents with a relatively small MaxFilesToRetrieve value.
  • dtSearchw.exe: Fixed RTF output generated from Search Report that would not open in WordPad
  • dtSearchw.exe: Fixed hit navigation error caused when text with highlight markings was pasted into a Word 2007 document from dtSearch search results, and then the resulting document indexed and searched again.
  • dten600.exe: Several improvements to the automatic detection of MIME-encoded files
  • Other file parser bug fixes affecting: MS Word, PDF, MSG, ZIP, SWF, RTF, WordPerfect

New in dtSearch Network with Spider 7.61.7769 (Apr 20, 2010)

  • dtSearch Desktop/Network:
  • Added "Search within these results"
  • Added improved zoom-in/zoom-out for document windows. To use, hold down the Ctrl key and roll the mouse wheel forward or backward, or press Ctrl+PLUS to zoom in and Ctrl+MINUS to zoom out.
  • dtSearch Engine:
  • Added it_ContentAsXml output format for FileConverter. This format organizes document content, metadata, and attachments into a standard XML format for easier automated processing. It does not currently support hit highlighting and is designed for automated content extraction only.
  • Fixes and minor enhancements:
  • dten600.dll: Added workaround for invalid records created in PowerPoint files when the same file is edited by PowerPoint 2003 and PowerPoint 2007
  • dten600.dll: Fixed missed hard page breaks in Word 2007 files
  • dten600.dll: Other file parser bug fixes: PDF, MIME, QPW, SWF
  • dtSearch.exe: Fixed error restoring window position on multi-monitor systems
  • dten600.dll: Added diagnostic information to history.ix (records error messages generated during index updates, and logs index directory contents)
  • dten600.dll: Fixed error handling encoding of file when filetype.xml specifies a format inconsistent with automatically-detected UTF-8 encoding
  • dten600.dll: Fixed bug causing "Inconsistent doc ids from target index" error merging indexes with the dtsIndexKeepExistingDocIds flag set.
  • dten600.dll: Fixed bug affecting text fields extraction in very long text files with no line breaks and text fields defined to end at end-of-line using the $$$ mark
  • dtsearchw.exe: Fixed bug in Edit > Copy File causing last access time not to be transferred when source file was read-only
  • dtSearchNetApi2.dll: Fixed bug affecting processing of the " character in FileConverter.InputFields when highlighting hits

New in dtSearch Network with Spider 7.60.7739 (Apr 20, 2010)

  • dtSearch Engine:
  • Added Visual C++ 2008 sample applications
  • Added IndexJob.EnumerableFields and WordListBuilder.ListFieldValues to provide a quick way to list all values of a field
  • Added WordListBuilder.SetSearchFilter to limit output to documents specified by a SearchFilter
  • dtSearch Publish:
  • Added CopyFileExtensions option providing a way to designate filename extensions for files to automatically copy from the CD documents folder when clicked
  • dtSearch Desktop/Network:
  • Added new user interface appearance options and updated toolbar icons
  • Fixes and minor enhancements:
  • lbvprot.dll: Fixed slow PDF file opening with Adobe Reader 9
  • dten600.dll: Added metadata extraction for M4A files
  • dten600.dll: Fixed FileConverter bug causing extra copies of comments to be generated in HTML files containing an

New in dtSearch Network with Spider 7.55 Build 7700 (Oct 15, 2008)

  • 64 bit products now include: dtSearch Web with Spider; dtSearch Engine for Win & .NET; dtSearch Engine for Linux

New in dtSearch Network with Spider 7.30 (Oct 30, 2006)

  • new terabyte indexer