What's new in dtSearch Desktop with Spider 7.84.8404

Aug 26, 2016
  • Fixes and minor enhancements:
  • dtSearch Desktop indexer now reports image-only PDF files in the index update log. (Metadata in these files is still indexed. This change only affects reporting in the index update log.)
  • Improved 32-bit indexing performance in low-memory conditions.
  • Fixed indexing crash in mso20win32client.dll in the latest update to Office 2016 when indexing Outlook messages.
  • Added API to change the score of a document in the dtsnSearchFound notification (C++) or ISearchStatusHandler.OnFound callback (.NET)
  • File parser bug fixes affecting: .pdf, .rtf, .emf, .doc, .xls, .ppt, .pst
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.83.8353 (Aug 26, 2016)

  • Fixes and minor enhancements:
  • Improved formatting of WordPerfect documents.
  • Fixed file parsing bug causing extra line breaks to appear between paragraphs in .msg files.
  • File parser bug fixes affecting: .pdf, .msg, .xlsx, .jtd, .one
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.82.8339 (Aug 26, 2016)

  • Fixes and minor enhancements:
  • All executables are code signed using SHA-2 in addition to SHA-1 (dual signed). All MSI files are signed using SHA-2 only, because MSI files do not support dual signing. Please see this Microsoft article for more information on SHA-1 deprecation.
  • Added option in dtSearch Desktop's Edit > Copy File function to automatically shorten excessively long filenames.
  • Added new C# sample, AjaxWordListBuilder, demonstrating how to use the dtSearch Engine's WordListBuilder object on a web page.
  • Added ixStepCommittingUpdate and ixStepRemovingDeletedFiles to the IndexingStep enumeration to separately identify these steps during an index update.
  • Added file parser support for a OneNote file format variant created by certain Microsoft online services.
  • Added experimental option in dtSearch Desktop to use the standard dtSearch Desktop "Next Hit" toolbar button to navigate hits in PDF files displayed in Adobe Reader (otherwise only the Ctrl+Shift+Space hotkey can be used). This option is in Options > Preferences > PDF View Options.
  • File parser bug fixes affecting: .pdf, .xlsx, .xlsb
  • Other bug fixes.
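
The filename-shortening option for Edit > Copy File listed above can be pictured with a small sketch. The Python function below is an illustration only, not dtSearch's implementation: the name shorten_filename, the 100-character limit, and the hash-suffix scheme are all assumptions chosen for this example. It keeps the extension, truncates the stem, and appends a short hash so that two long names sharing a prefix do not collide.

```python
import hashlib
import os


def shorten_filename(name: str, max_len: int = 100) -> str:
    """Return name unchanged if short enough, else a truncated variant.

    Hypothetical scheme (not taken from dtSearch): keep the extension,
    truncate the stem, and append an 8-character hash of the full name.
    """
    if len(name) <= max_len:
        return name
    stem, ext = os.path.splitext(name)
    digest = hashlib.sha1(name.encode("utf-8")).hexdigest()[:8]
    # Room left for the stem after reserving "~", the hash, and the extension.
    keep = max_len - len(ext) - len(digest) - 1
    if keep < 1:
        raise ValueError("max_len is too small for this extension")
    return f"{stem[:keep]}~{digest}{ext}"
```

Hashing the full original name keeps the result deterministic, so copying the same file twice yields the same shortened name.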

New in dtSearch Desktop with Spider 7.81.8281 (Aug 26, 2016)

  • Fixes and minor enhancements:
  • File parser bug fixes affecting: .doc, .pdf, .rar, .docx, .msg, .one, .pages, .qpw, .ppt
  • Fixed bug preventing "view as report" in dtSearch Desktop from working with PDF files opened in Adobe Reader
  • Fixed error reporting bug causing "Unable to access index %2" error message (without the index path) in dtSearch Desktop when an index could not be accessed to search.
  • Tested and compatible with Windows 10.
  • Added support for highlighting hits in Adobe Reader DC. An updated version of the dtSearch PDF Search Highlighter plug-in is also needed for Adobe Reader DC.
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.80.8253 (Aug 26, 2016)

  • Added support for indexing PDF files with 128-bit RC4, 128-bit AES, and 256-bit AES encryption, as long as the file does not require a password to open and does not have the "copy text" permission disabled.
  • Fixes and minor enhancements:
  • Fixed dtSearch Desktop bug causing some PDF files to be opened in a separate Adobe Reader window when the file is located on a network share.
  • Fixed extra "PBrush" and "Adobe Photoshop Image" captions in some Word documents with embedded images.
  • In the Linux version, the dtSearch Engine library (.so) files are installed in the dtsearch/bin and dtsearch/bin64 folders instead of the lib and lib64 folders.
  • Added RAR file parser (dtv_rar.so) to the Linux version of the dtSearch Engine
  • Fixed incorrect parsing of some .docx, .xlsx, and .pptx documents when document has missing or incorrect filename extension.
  • Other file parser bug fixes affecting: .mdb, .pdf, WordPerfect 4.2, WordStar, KeyNote, .tar
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.79.8233 (Aug 26, 2016)

  • Added support for indexing Apple iWork 2009 Pages, Numbers, and Keynote files
  • Fixes and minor enhancements:
  • Fixed bug affecting cancellation of file conversion after either expiration of FileConverter.TimeoutSeconds or when OutputStringMaxSize exceeded when processing large binary input files with the dtsConvertInlineContainer flag.
  • File parser bug fixes affecting: .xlsx, .pdf, .doc, .msg, .rtf, .wps
  • Added FileConverter.SetIndexCache() API to allow an IndexCache to be used with file conversion.
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.78.8215 (Aug 26, 2016)

  • Fixes and minor enhancements:
  • Fixed incorrectly rounded display of numeric value in Excel with conditional formatting
  • Fixed bug affecting cancellation of file conversion after either expiration of FileConverter.TimeoutSeconds or when OutputStringMaxSize exceeded when processing large binary input files with the dtsConvertInlineContainer flag.
  • Fixed dtSearch Web search form bug causing "undefined" to appear in Filename field in Internet Explorer 8
  • Added IndexInfo.TotalDataSize to COM and .NET APIs
  • File parser bug fixes affecting: .docx (incorrect display of paragraph style; error handling non-breaking hyphens), .zip (hang indexing file deleted by antivirus software during indexing), .html
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.77.8205 (Aug 26, 2016)

  • Fixes and minor enhancements:
  • Fixed security issue reported in a third-party component, imgman32.dll, used in the dtimage.exe utility. See http://support.dtsearch.com/faq/dts0235.htm for more information.
  • Added support for indexing Outlook 2013 and 2010 OST files. Note: Microsoft has not officially documented the OST file format specification, so this support is based on unofficial non-Microsoft information about the OST file format.
  • Added support for indexing metadata in Adobe Photoshop images
  • Fixed bug causing "~dtpdf.tmp" filenames to appear in tabs in dtSearch Publish
  • Fixed PDF hit highlighting error in dtSearch Publish on systems with Adobe Reader versions 7-9 and Internet Explorer 10
  • Added "Images" field at the end of MIME messages listing names of inline image files.
  • Fixed incorrect handling of filename-only indexing option causing "Unsupported file format" errors
  • In the API, the flag dtsIndexCreateVersion6 is now ignored, so indexes will always be created in the current index format.
  • Fixed high-DPI scaling error in dtSearch Desktop causing checkbox lists to be drawn incorrectly
  • Fixed bug causing filename-only indexing option to instead report all files as inaccessible.
  • File parser bug fixes affecting: .xls, .doc, .msg
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.76.8193 (Aug 26, 2016)

  • Fixes and minor enhancements:
  • Fixed bug causing incorrect XML conversion output from conversion of Word document with Ole10Native stream to it_ContentAsXml format
  • File parser bug fixes affecting: .xls, .doc, .rar, .pst
  • Fixed dtSearch Desktop indexer bug in "Update Multiple Indexes" dialog box causing case/accent sensitivity to be transferred between indexes when the "Clear index before adding documents" box was checked.
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.75.8178 (Aug 26, 2016)

  • Fixes and minor enhancements:
  • Updated RAR file parser to support RAR 5
  • Reduced stack use when indexing very deeply-nested containers
  • dtSearch Web: Fixed bug in Build Search Form when generating a search form containing a custom field name
  • dtSearch Web: Fixed bug affecting highlighting of the selected hit when clicking Next Hit in Internet Explorer 8
  • Fixed bug causing some Microsoft Photo Editor 3.0 objects embedded in Office documents not to be recognized as image data
  • File parser bug fixes affecting: .mdb (Access 2003), .msg embedded in .rtf, .xls, .xlsx, .xlsb, .pdf
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.74.8166 (Aug 26, 2016)

  • Added support for indexing iCalendar (*.ics) files
  • Added support for indexing Outlook emails and other content using 64-bit versions of Microsoft Office. A 64-bit version of mapitool is also included.
  • Fixes and minor enhancements:
  • dtSearch Desktop: fixed keyboard navigation problem affecting PgUp, PgDn, and the cursor keys.
  • dtSearch Web: fixed bug causing "Document information could not be retrieved from the index" error message when trying to open some documents after a search.
  • API enhancement: Options.UserThesaurusFile can be set to an XML string containing the user thesaurus, instead of the name of a file containing the thesaurus data.
  • dtSearch Web/Publish: Fixed bug affecting multiple selection (using Ctrl+Click) of indexes to search on search forms
  • dtSearch Web Setup 64-bit: Fixed error launching help
  • File parser bug fixes affecting .msg, .doc, .docx, .xlsx, .emf, .pdf
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.73.8140 (Aug 26, 2016)

  • Fixed dtSearch Desktop bug affecting keyboard navigation in retrieved documents.
  • Fixed dtSearch Web/Publish bugs affecting hit navigation in some browser versions.
  • Fixed dtSearch Web/Publish PDF highlighting browser compatibility issue affecting Internet Explorer 11.

New in dtSearch Desktop with Spider 7.73.8139 (Aug 26, 2016)

  • Fixed browser compatibility bug in dtSearch Desktop 7.73.8138 only causing problems with the "Next Hit" button on some systems.

New in dtSearch Desktop with Spider 7.73.8138 (Aug 26, 2016)

  • dtSearch Desktop:
  • Added DocStyles.css in the dtSearch templates folder to control the formatting of property tables and headings in retrieved files.
  • Fixes and minor enhancements:
  • FileConverter API - fixed missing line breaks when converting from HTML to .txt
  • FileConverter API - fixed missing ... tags around HTML metadata when converting from HTML to it_ContentAsXml
  • Java API - faster garbage collection of strings passed through IIndexStatusHandler API reduces memory use during indexing
  • Java API - added Options.storedFieldDelimiterChar
  • Reduced memory use when indexing .msg files with very large numbers of recipients or attachments
  • SearchReportJob API - fixed slow detection of timeout
  • dtSearch Desktop Indexer - fixed "not responding" message when indexing some very large documents
  • File parser bug fixes affecting .ppt, .pdf, .xls, .xlsx, .one, .xps, .msg, DocFile property sets
  • Other bug fixes.

New in dtSearch Desktop with Spider 7.72.8186 (Sep 13, 2013)

  • All products:
  • Support for indexing OneNote 2010 and OneNote 2007 *.one files. Display and extraction of images and embedded documents are supported for both formats. Ink is displayed as OCR output (text) only.
  • dtSearch Engine:
  • Added FileConverter.InputStream in the .NET API to allow the input document to be passed as a .NET Stream object
  • Fixes and minor enhancements:
  • Added dtsoFfSkipEmailProperties flag in Options.FieldFlags to suppress display of email properties such as sender, subject, etc.
  • Fixed XML structure errors in XML generated for it_ContentAsXml output by FileConverter
  • File parser bug fixes affecting .msg, .docx, .xfa, .pdf, .rtf, .xlsb, .xlsx, .ppt
  • Added detection of Windows PE and NE executables and Linux ELF executables (these formats are still indexed according to the binary files setting, with content either filtered or skipped)
  • Fixed bug causing use of dtsExoDoNotConvertAttachments in FileConverter.ExtractionOptions to generate an incorrect "File Encrypted" error for some documents during file conversion (not indexing or searching).
  • Fixed bug causing email headers to be indexed even when the dtsoSkipEmailHeaders flag is set, if filetype.xml is set up to index message bodies separately from attachments.
  • Added support for metadata extraction from HDPhoto images.
  • Fixed bug causing extra whitespace to be generated in conversion to plain text (UTF-8) format.
  • Mapitool.exe utility uses message delivered date property instead of modified date property to set the file date for extracted message items.
  • dtSearch Developer installer updated May 18, 2013 to add missing MemAllocator.h header file.
  • Fixed bug affecting nested proximity searches where a matching set of the terms exists exactly once and is preceded (by the exact proximity range) by two or more instances of the proximity terms that do not satisfy the proximity criteria.
  • Other bug fixes.
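
The executable detection listed above can rely on well-known file signatures. As a rough sketch (not dtSearch's own code; the function name detect_executable is made up for this example), ELF files start with the bytes 0x7F 'E' 'L' 'F', while Windows PE and NE files start with an "MZ" DOS header whose 32-bit field at offset 0x3C points to a "PE\0\0" or "NE" signature:

```python
import struct
from typing import Optional


def detect_executable(data: bytes) -> Optional[str]:
    """Return "ELF", "PE", "NE", or None based on header signatures."""
    if data[:4] == b"\x7fELF":
        return "ELF"
    # PE and NE files begin with a DOS stub; offset 0x3C holds a 32-bit
    # little-endian pointer to the newer header.
    if data[:2] == b"MZ" and len(data) >= 0x40:
        (offset,) = struct.unpack_from("<I", data, 0x3C)
        signature = data[offset:offset + 4]
        if signature == b"PE\x00\x00":
            return "PE"
        if signature[:2] == b"NE":
            return "NE"
    return None
```

A file that starts with "MZ" but has neither signature (a plain DOS executable, for example) returns None in this sketch.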

New in dtSearch Desktop with Spider 7.71.8080 (Dec 4, 2012)

  • dtSearch Engine:
  • New support for highlighting hits using different colors for each search term. For API documentation on this feature, please see the article "Highlighting each term using different attributes" in dtSearchApiRef.chm.
  • Fixes and minor enhancements:
  • Added "Find Indexes" button in dtSearch Desktop's Index Library Manager to locate all indexes in a folder tree.
  • Fixed CSV file parser bug that caused "duplicate field id" error during index merge.
  • In dtSearch Web Setup, improved detection and reporting of IIS configuration problems such as missing IIS components
  • File parser bug fixes affecting MSG, WordPerfect 4.2, PPTX, XBase, PPT, XLSB
  • Fixed bug in Language Analyzer API causing dtsLaJob.indexRetrievedFrom and dtsLaJob.alphabetLocation not to be set during searches

New in dtSearch Desktop with Spider 7.70.8063 (Dec 4, 2012)

  • dtSearch Desktop:
  • New support for displaying images embedded in Office documents (DOC, DOCX, PPT, PPTX, XLS, XLSX, RTF, EML). To enable display of images in dtSearch Desktop, click Options > Preferences > Document display, and check the box to "Display images in documents".
  • Added new options in dtSearch Desktop to (1) hide MIME headers in emails, (2) show properties of images embedded in documents, and (3) control whether paths are indexed along with filenames when the "Index filenames as text" option is enabled. These options are in the Options > Preferences > Indexing Options dialog box.
  • dtSearch Engine:
  • Embedded attachments, objects and images in documents can be extracted using dtsExtractionOptions (C++) or ExtractionOptions (Java and .NET), which specify output locations and rules for filename generation. Currently the following are supported:
  • Attachments in EML, MSG, DBX, TNEF (winmail.dat), PDF, MDB and ACCDB (Access);
  • objects in DOC, DOCX, XLS, XLSX, PPT, PPTX, RTF;
  • images in DOC, DOCX, PPT, PPTX, XLS, XLSX, RTF, EML, MDB and ACCDB (Access).
  • New single-document option for indexing Access (*.mdb, *.accdb), XBase (*.dbf), and Comma-separated values (*.csv) files.
  • By default, dtSearch indexes each record of database files (*.mdb, *.accdb, *.csv, *.dbf) as a separate document. This new option provides a way to index all records in a database file as a single document. For more information, see dtSearchApiRef.chm (Overviews > Databases and Fields > Database files (*.mdb, *.dbf, *.csv))
  • Added dtsoFfShowImageProperties flag in Options.FieldFlags to display image properties (such as EXIF data) for images embedded in documents. Image properties are always indexed for images in separate files. This flag only affects images embedded in documents, such as a .jpg embedded in a Word file. A related change, made for consistency, affects the handling of image files embedded in .eml email files. Previously these properties were always extracted. Now they will only be extracted when the dtsoFfShowImageProperties flag is set, so .eml files will be handled consistently with other file formats.
  • Fixes and minor enhancements:
  • Eliminated use of FILE_FLAG_RANDOM_ACCESS, which could cause excessive memory consumption under Windows Server 2008 because of what appears to be a bug in Windows caching behavior (see http://support.microsoft.com/kb/2549369 for more information).
  • Zlib version updated to 1.2.7
  • dtSearch.Spider2.dll and dtSearch.Spider4.dll have new dependencies on zlib DLLs zlib_wapidll_{VC8/VC10}_{32/64}.dll to handle gzipped sitemap.xml files.
  • Added file parsers for Ichitaro word processor versions 5 and later.
  • File parser bug fixes affecting MSG, PDF, DOCX, PPTX, Excel 2, RTF
  • Message attachments to MIME emails are now indexed as attachments (so they can be handled consistently with other attachments in the new attachment-related features described above) rather than being merged with the text of the message.
  • Added reporting of PDF files that do not contain any page text. In dtSearch Desktop, these will appear in the index log with "Image Only" after the type name (click View Log in the Update Index dialog box to see the log of indexed files). In the API, the flag fiImageOnly will be set in IndexFileInfo (.NET, Java) or dtsIndexProgressInfo.fileInfoFlags (C++) during indexing.
  • Removed extra path information from headers in containers converted to text using FileConverter.exe or FileConvertJob with the dtsConvertInlineContainer flag
  • Removed "Document Properties" caption from Word, PowerPoint, and Excel 2003 file properties. For applications that require this caption for backward compatibility, use the new flag dtsoFfIncludeDocumentPropertiesCaption in Options.FieldFlags
  • Added new values to SearchReportJob.Header and SearchReportTemplate.rtf: %%Ordinal%%, %%DocId%%, %%Type%%
  • Added new dtsConvertIncludeBOM flag to FileConverter.Flags to add UTF-8 BOM to UTF-8 output
  • FileConverter with dtsConvertJustDetectType produces more specific type ids for image, music, and video files instead of it_Media
  • Fixed search/highlighting error affecting the pre/N and w/N operators
  • Added new dtsnIndexFolderInaccessible callback notification in IndexJob, logging in indexlog.dat, and logging in HTML index log of inaccessible folders during indexing
  • Fixed incorrect time zone adjustment of PDF built-in creation and modification date fields
  • Fixed too-long filenames generated for items extracted from PST files (names could be too long for some file systems when copied using Edit > Copy File in dtSearch Desktop)
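
The gzipped sitemap.xml support mentioned in the list above concerns sitemaps delivered in gzip-compressed form. As a minimal illustration in Python (unrelated to the actual implementation in the Spider DLLs; sitemap_urls is a hypothetical helper), the function below accepts sitemap bytes that may or may not be gzip-compressed and returns the URLs they list:

```python
import gzip
import xml.etree.ElementTree as ET
from typing import List

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def sitemap_urls(raw: bytes) -> List[str]:
    """Return the <loc> values from a plain or gzip-compressed sitemap."""
    # gzip streams start with the magic bytes 0x1f 0x8b.
    if raw[:2] == b"\x1f\x8b":
        raw = gzip.decompress(raw)
    root = ET.fromstring(raw)
    return [loc.text.strip() for loc in root.iter(f"{SITEMAP_NS}loc") if loc.text]
```

Checking the magic bytes rather than the filename extension lets the same function handle both sitemap.xml and sitemap.xml.gz.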

New in dtSearch Desktop with Spider 7.68.8025 (Jun 12, 2012)

  • Includes new (August 9, 2011) security updates from Microsoft Bulletin MS11-025. For more information on this Microsoft security update, including updated dependencies for dtSearch Engine developer components, please see this article.
  • Added dtsoFfIndexArchiveFileLists flag. This option adds a searchable file named ArchiveFileList.html to ZIP and RAR archives during indexing. The original file is not modified but the ArchiveFileList.html file is searchable as if it were part of the ZIP or RAR file. The file consists of a list of the names of the files inside the archive.
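
To make the idea of an archive file list concrete, here is a rough Python sketch that generates this kind of listing for a ZIP file. It is not dtSearch's implementation, and the HTML layout is an assumption, since the note above does not describe the actual layout of ArchiveFileList.html; archive_file_list_html is a hypothetical helper.

```python
import html
import io
import zipfile


def archive_file_list_html(zip_bytes: bytes) -> str:
    """Build a simple HTML list of the member names inside a ZIP archive."""
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        names = archive.namelist()
    # Escape each name so characters such as "&" or "<" stay valid HTML.
    items = "\n".join(f"<li>{html.escape(name)}</li>" for name in names)
    return f"<html><body><ul>\n{items}\n</ul></body></html>"
```

As in the feature described above, the sketch only reads the archive; the original file is not modified.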

New in dtSearch Desktop with Spider 7.67.7973 (Jun 12, 2012)

  • Added support for highlighting hits in PDF files in Adobe Reader X and Adobe Acrobat X.
  • As a result of the Microsoft MFC MS11-025 update, dtSearch Desktop 7.67 requires Windows XP or newer and will not run under Windows 2000.

New in dtSearch Desktop with Spider 7.67.7959 (Apr 1, 2011)

  • All products
  • Added support for indexing PST files directly.
  • dtSearch Engine
  • .NET API: Added DataSource.DocStream to allow a document to be passed through the DataSource API as a stream
  • .NET API: Added sample code demonstrating indexing of Azure blob data (examples\cs4\AzureBlobDemo)
  • Fixes and minor enhancements
  • Fixed hit navigation error in the 64-bit version of dtSearch Desktop

New in dtSearch Desktop with Spider 7.66.7924 (Nov 12, 2010)

  • dtSearch Engine:
  • Added .NET 4.0 versions of the .NET API (dtSearchNetApi4.dll, dtSearch.Spider4.dll) and sample code for C# .NET 4.0 and VB.NET 4.0
  • Added dtsSearchFastSearchFilterOnly search flag to enable much faster, optimized generation of a SearchFilter from a search when no other output is required from the search.
  • Added WordListBuilder.GetLastError to the C++, Java, and .NET APIs to provide better reporting of errors resulting from WordListBuilder calls.
  • Added new flag to enable caching of field values in WordListBuilder to make ListFieldValues calls faster. The flag is dtsWordListEnableFieldValuesCache (in the WordListBuilderFlags enumeration) and is passed to WordListBuilder using the new SetFlags method.
  • Added new .NET method Server.SetEnginePath to allow ASP.NET application deployment without administrative access
  • Added new .NET sample application, AzureDemo, demonstrating use of the dtSearch Engine in an Azure instance. For documentation explaining how to deploy in Azure, see:
  • Overviews > Installing the dtSearch Engine > Deployment steps: Azure 64-bit (in dtSearchApiRef.chm).
  • Added a way to disable file parsers using the file type table (filetype.xml) by setting the TypeId to the id of the parser to disable and the Flags value to 2.
  • All Products:
  • Faster indexing of binary data using the filtering algorithm
  • Fixes and minor enhancements:
  • In dtSearch Desktop, added SizeK, IndexRetrievedFrom, SearchDate, ReportDate variables to SearchReportTemplate.rtf and SearchListTemplate.rtf
  • Java and .NET API: Fixed IIndexStatusHandler bug causing PercentDone to remain zero during compression of an index
  • Added docId of document being removed from an index to IndexFileInfo reporting through IIndexStatusHandler
  • File parser bug fixes affecting Office XML drawings embedded in Word, PowerPoint, and Excel files; interpretation of OEM character codes (_x00NN_) in Excel 2007 files.
  • Added SearchJob.UnindexedSearchFlags in the .NET API and SearchJob.setUnindexedSearchFlags in the Java API to enable case and accent-sensitive unindexed searches in these APIs
  • Added .NET SearchFilter.GetItems() to provide access to an array of the doc ids selected in a SearchFilter

New in dtSearch Desktop with Spider 7.65.7907 (Oct 15, 2010)

  • Fixes and minor enhancements:
  • Fixed bug in dtSearch Desktop indexer causing it to forget index caching setting if the "Clear index before adding documents" box is checked in the Update Index dialog box
  • Fixed file parser bug affecting indexing of QuickBooks backup (*.qbb) files

New in dtSearch Desktop with Spider 7.65.7906 (Oct 15, 2010)

  • Enhancements (dtSearch Engine):
  • Added dtsoFfSkipEmailHeaders flag for Options.FieldFlags to suppress searching and display of headers in MIME and Outlook messages
  • Fixes and minor enhancements:
  • Reduced memory requirements for parsing very large XLS files
  • Fixed bug that allowed XML output from saved search results and XML generated by conversion to it_ContentAsXml to contain the colon (":") character in tag names, which caused the generated XML to fail validation.
  • Fixed PDF hit highlighting error affecting documents using ActualText parameter.
  • Automatic date recognition has been changed to limit the scope of automatically recognized entities so they will not cross a field boundary.
  • Fixed error in HTML conversion causing some output to fail to word-wrap when displayed in a browser.
  • Fixed memory leak in searches that use the dtsSearchLanguageAnalyzerSynonyms flag.
  • ZIP file parser applies default encoding in filetype.xml when interpreting ambiguous ZIP filenames, and applies automatic encoding detection if no default encoding is specified.
  • File parser bug fixes affecting HTML, XLS, DOC (paragraph numbering error), DOCX, PPT.
  • Reduced memory use when merging very large indexes
  • Fixed PDF hit highlighting errors in certain types of corrupt PDF files
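
The ZIP filename handling described in the list above follows a precedence order: a default encoding from filetype.xml if one is specified, otherwise automatic detection. The Python sketch below illustrates that order only. The function name, the utf8_flag parameter (standing for bit 11 of the ZIP general-purpose flags, which marks a name as UTF-8 and therefore unambiguous), and the detection heuristic (try UTF-8, then fall back to CP437) are assumptions for this example; the note does not say how dtSearch's detection works.

```python
from typing import Optional


def decode_zip_name(raw: bytes, utf8_flag: bool,
                    default_encoding: Optional[str] = None) -> str:
    """Decode a ZIP member name using a default-then-detect precedence."""
    if utf8_flag:
        # Bit 11 of the general-purpose flags: the name is UTF-8.
        return raw.decode("utf-8")
    if default_encoding:
        # A configured default wins over detection for ambiguous names.
        return raw.decode(default_encoding, errors="replace")
    try:
        # Stand-in for automatic detection: valid UTF-8 is taken as UTF-8.
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # CP437 is the historical ZIP default and can decode any byte.
        return raw.decode("cp437")
```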

New in dtSearch Desktop with Spider 7.64.7876 (Apr 20, 2010)

  • Enhancements (dtSearch Engine):
  • Added dtsSearchLanguageAnalyzerSynonyms flag to enable using a language analyzer to generate morphological variations on a search term at search time. When this flag is set, the language analyzer is called for each word or phrase in the search request. The flag dtsLaInputIsSearchTerm is passed to the language analyzer in dtsLaJob.flags, so the language analyzer knows why it is being called.
  • Added dtssGetWordBreaker API function to provide direct access to the dtSearch Engine's internal word breaker using the language analyzer API. For sample code demonstrating how to use this API, see the WordBreak example in examples\vc8\WordBreak.
  • Added more structural information to the output generated by conversion to the it_ContentAsXml file format.
  • Added to COM interface: WordListBuilder.ListFieldValues, WordListBuilder.SetFilter, and IndexJob.EnumerableFields.
  • Added dtsListIndexSkipNoiseWords flag for ListIndexJob to list words in an index without including any noise words.
  • Added dtsoFfSkipDataSourceFields flag for Options.FieldFlags to prevent DocFields values from appearing in FileConverter output
  • Fixes and minor enhancements:
  • Fixed incorrect display of CreationDate and ModDate properties in PDF files
  • Fixed incorrect hit highlighting when Unicode Filtering options at search time differ from the options used to index a file. To ensure consistent options, Unicode Filtering options are stored in the index when the index is created, in the index_a.ix file.
  • Fixed error updating index when directory specified for temporary files is inaccessible.
  • Fixed index merge bug causing "Inconsistent doc ids from target index" error during merge.
  • Fixed two search report bugs causing incorrect hit highlighting.
  • Improved formatting of documents converted from Ami Pro and Quattro Pro to HTML
  • Added automatic detection of gb2312 and JIS encoding.
  • Added automatic detection of XyWrite, XBase, WordStar 3.x, and WordPerfect 4.2 and TAR files.
  • Improved reporting of file types by FileConverter.DetectedTypeId, providing much more specific information about Microsoft Word versions and adding type detection for additional file formats
  • Added support for text extraction from Adobe Framemaker MIF, XFA form templates in PDF files, and Visio XML files
  • Fixed "Excessive nesting" error indexing OpenOffice document due to bug parsing table structure
  • Fixed RTF file parser bug affecting handling of the \upr tag
  • Other file parser bug fixes affecting Multimate, Lotus 1-2-3, PDF, Word, PowerPoint

New in dtSearch Desktop with Spider 7.64.7850 (Oct 24, 2009)

  • Fixes and minor enhancements
  • Fixed incorrect display of CreationDate and ModDate properties in PDF files
  • Fixed incorrect hit highlighting when Unicode Filtering options at search time differ from the options used to index a file. To ensure consistent options, Unicode Filtering options are stored in the index when the index is created, in the index_a.ix file.

New in dtSearch Desktop with Spider 7.63.7835 (Oct 24, 2009)

  • dtSearch Engine
  • Added IndexFileInfo.UserFields in .NET API to provide access to stored fields through the IIndexStatusHandler callback interface during indexing.
  • Added dtsnIndexDeletedFileRemoved, dtsnIndexListedFileRemoved, and dtsnIndexListedFileNotRemoved notifications to the indexing status callbacks to notify the calling application when files are removed from the index during indexing or when an attempt to remove a listed file fails.
  • Compatibility note for developers working with the .NET 2.0 API only: The DLL dependencies for dtSearchNetApi2.dll have changed due to the release of the Visual Studio .NET 2005 Service Pack 1 Security Update for ATL. Because dtSearchNetApi2.dll is built with the updated version of Visual Studio .NET 2005, it requires the updated MFC and CRT DLLs that are included with that version. This Microsoft redistributable program will install the required components:
  • Microsoft Visual C++ 2005 Service Pack 1 Redistributable Package ATL Security Update (July 28, 2009)
  • http://www.microsoft.com/downloads/details.aspx?familyid=766A6AF7-EC73-40FF-B072-9112BAB119C2&displaylang=en
  • This issue does not affect any other dtSearch Engine API.
  • Fixes and minor enhancements
  • Fixed bug in search report generation causing text in adjacent table cells to be run together in output
  • dtSearchw.exe: Fixed "Invalid character" error in dtSearch Desktop when opening a document
  • dtSearchw.exe: Fixed bug affecting use of drag-and-drop to re-order columns in search results, causing the wrong column order to result
  • Fixed memory leak (in version 7.62 only) when using regular expressions in File Segmentation Rules to split documents or in Text Fields definitions.
  • File parser bug fixes affecting: MS Word 2007, PDF, ZIP, OpenOffice.
  • dtSearchw.exe: Added option to suppress automatic correction of hit highlighting when a document was indexed with a different version or the document was modified since it was last indexed (in Options > Preferences > Document Display).
  • Fixed missing filename associations (*.ilb, *.dtSearch) in 7.62 setup program.
  • Fixed truncation of very long search report when generated using dtsReportWholeFile from cached text.

New in dtSearch Desktop with Spider 7.62.7804 (Oct 24, 2009)

  • All products
  • Regular expression searching extended to support TR1 regular expressions
  • dtSearch Engine
  • Java API: Added IIndexStatusHandler to Java API for monitoring of IndexJobs
  • Java API: Added IndexInfo object for more efficient retrieval of index properties from an index
  • Java API: Added SearchFilter.SelectItems() with array of doc ids
  • .NET API: Added SearchFilter.SelectItems() with array of doc ids
  • Java API: Added SearchJob.WantResultsAsFilter
  • FieldFlags: Added dtsoFfHtmlSkipImgAlt and dtsoFfHtmlSkipInputValues
  • Language Analyzer API: Added dtsLaBlockWasSkipped to LanguageAnalyzerWordFlags, providing a way for a language analyzer to request that the internal dtSearch word breaker handle a block of text from the input.
  • C++ API: Added userFields to dtsIndexProgressInfo, providing a way to access stored fields from a document as it is indexed
  • Added dtsConvertAutoUpdateSearch flag to ensure consistent hit highlighting when a document was modified since it was indexed or was indexed by an older version of dtSearch than is used to search it.
  • Fixes and minor enhancements
  • dten600.exe: Faster generation of search results with dtsSearchWantHitDetails enabled
  • dtSearchw.exe: Fixed error causing "Enter Serial Number" dialog box to appear under Vista due to UAC problem
  • dten600.dll: Fixed error causing corrupt index with message referencing !zd and int vector error when automatic recognition of dates is enabled, hyphen processing is set to dtsoHyphenAll, and the last word of a document ends with a hyphen.
  • Added new cmap files for PDF text extraction.
  • Improved speed for "not (something)" and pure xfilter searches.
  • Reduced memory use for searches that retrieve large numbers of documents with a relatively small MaxFilesToRetrieve value.
  • dtSearchw.exe: Fixed RTF output generated from Search Report that would not open in WordPad
  • dtSearchw.exe: Fixed hit navigation error caused when text with highlight markings was pasted into a Word 2007 document from dtSearch search results, and then the resulting document indexed and searched again.
  • dten600.exe: Several improvements to the automatic detection of MIME-encoded files
  • Other file parser bug fixes affecting: MS Word, PDF, MSG, ZIP, SWF, RTF, WordPerfect

New in dtSearch Desktop with Spider 7.61 Build 7764 (Mar 27, 2009)

  • All products
  • New file parser added for RAR (*.rar) archives.
  • dtSearch Desktop/Network
  • Added "Search within these results"
  • Added improved zoom-in/zoom-out for document windows. To use, hold down the Ctrl key and roll the mouse wheel forward or backward, or press Ctrl+PLUS to zoom in and Ctrl+MINUS to zoom out.
  • dtSearch Engine
  • Added it_ContentAsXml output format for FileConverter. This format organizes document content, metadata, and attachments into a standard XML format for easier automated processing. It does not currently support hit highlighting and is designed for automated content extraction only.
  • Fixes and minor enhancements
  • dten600.dll: Added workaround for invalid records created in PowerPoint files when the same file is edited by PowerPoint 2003 and PowerPoint 2007
  • dten600.dll: Fixed missed hard page breaks in Word 2007 files
  • dten600.dll: Other file parser bug fixes: PDF, MIME, QPW, SWF
  • dtSearch.exe: Fixed error restoring window position on multi-monitor systems
  • dten600.dll: Added diagnostic information to history.ix (records error messages generated during index updates, and logs index directory contents)
  • dten600.dll: Fixed error handling encoding of file when filetype.xml specifies a format inconsistent with automatically-detected UTF-8 encoding

New in dtSearch Desktop with Spider 7.60 Build 7737 (Jan 14, 2009)

  • dtSearch Engine
  • Added Visual C++ 2008 sample applications
  • Added IndexJob.EnumerableFields and WordListBuilder.ListFieldValues to provide a quick way to list all values of a field
  • Added WordListBuilder.SetSearchFilter to limit output to documents specified by a SearchFilter
  • dtSearch Publish
  • Added CopyFileExtensions option providing a way to designate filename extensions for files to automatically copy from the CD documents folder when clicked
  • Fixes and minor enhancements
  • dten600.dll: Fixed FileConverter bug causing extra copies of comments to be generated in HTML files containing an

New in dtSearch Desktop with Spider 7.60 Build 7733 (Dec 29, 2008)

  • dtSearch Engine
  • Added Visual C++ 2008 sample applications
  • Added IndexJob.EnumerableFields and WordListBuilder.ListFieldValues to provide a quick way to list all values of a field
  • Added WordListBuilder.SetSearchFilter to limit output to documents specified by a SearchFilter
  • dtSearch Publish
  • Added CopyFileExtensions option providing a way to designate filename extensions for files to automatically copy from the CD documents folder when clicked
  • Fixes and minor enhancements
  • dten600.dll: Fixed FileConverter bug causing extra copies of comments to be generated in HTML files containing an

New in dtSearch Desktop with Spider 7.60 Build 7720 (Oct 15, 2008)

  • Added CopyFileExtensions option providing a way to designate filename extensions for files to automatically copy from the CD documents folder when clicked

New in dtSearch Desktop with Spider 7.55 Build 7700 (Oct 14, 2008)

  • dten600.dll: Fixed Microsoft Access parser bug causing some MDB files to be indexed very slowly
  • dten600.dll: Fixed PDF file parser word breaking error affecting PDF files created using MacroMedia FlashPaper
  • dten600.dll: Fixed error in MS Word 6.0 file parser affecting footnote extraction
  • dten600.dll: Other file parser bug fixes (XML, MIME, Outlook MSG, Access)
  • dtSearch.exe: Fixed formatting error and added diagnostic information to error message when file could not be opened
  • dtSearch.exe: Added support in Edit > Copy File for copying container files other than archives

New in dtSearch Desktop with Spider 7.30 (Oct 30, 2006)

  • New terabyte indexer