dtSearch Changelog

What's new in dtSearch 2023.02 Beta

Oct 25, 2023

Updated RAR file parser to the current version of the Rarlab source (6.2.10 released August 1, 2023). dtSearch uses source code from Rarlab to implement content extraction from RAR archives.
Note: Rarlab reports that the updated source fixes two security vulnerabilities. Based on the information available about these vulnerabilities, we do not believe they affect any dtSearch product.
(1) CVE-2023-40477 (out of bounds write) affects RAR recovery volumes. RAR recovery volumes have a .rev extension and a different binary header from RAR archives, so dtSearch will not invoke the RAR file processing code if it encounters a RAR4 recovery volume. Additionally, the code to process recovery volumes is disabled in the RAR extraction code that dtSearch uses. Rarlab states that unrar.dll is not affected by the vulnerability, and the unrar.dll source code is what dtSearch uses in its RAR file parser.
(2) CVE-2023-38831 (launching of an incorrect file) is associated with the WinRAR user interface and does not affect dtSearch.
The dtSearch Desktop Index Manager index properties window now displays the open error message, instead of blank index properties, when an index cannot be opened.
Fixed search performance bug that could cause reduced search speed during phrase searches involving extremely large documents
Fixed incorrect handling of the Unicode soft hyphen character (U+00AD), which should be ignored during searches because it is not indexed.
Fixed bug that could cause duplicate field name errors when documents include field names with certain CJK diacritical characters. Verifying an index will indicate if an index is affected. Affected indexes should be rebuilt.
Text log files history.ix, dtSearchIndexingHistory.log, and indexlog.dat now format dates YYYY/MM/DD instead of MM/DD/YYYY
File parser bug fixes affecting: DOCX, PDF
In dtSearch Desktop the Options > Create group policy dialog creates an MSI file that places the registry keys under HKMU instead of HKCU or HKLM, so you can control at installation time whether the installation is per-user or per-machine. To install per-machine, specify ALLUSERS=2 and run the .msi with administrator permissions.
Fixed time zone bug in the Linux indexer causing documents to be reindexed unnecessarily during an incremental index update.
Added option in dtSearch Desktop in Options > Preferences > Indexing Resources to "Ask Windows to keep computer awake during indexing" (to prevent automatic sleep from blocking scheduled updates).
Other bug fixes
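
Scripts that read dates out of history.ix, dtSearchIndexingHistory.log, or indexlog.dat may see both date formats if logs written before and after this release are mixed. The following is a minimal sketch, not part of dtSearch, of a parser that accepts either format. The function name parse_log_date is hypothetical, and the sketch assumes the date has already been isolated from the rest of the log line, since the full line layout is not described here.

```python
from datetime import date, datetime


def parse_log_date(text: str) -> date:
    """Parse a log date written as YYYY/MM/DD (new) or MM/DD/YYYY (old)."""
    # The four-digit year makes the two formats unambiguous, so the
    # order of the attempts does not change the result.
    for fmt in ("%Y/%m/%d", "%m/%d/%Y"):
        try:
            return datetime.strptime(text.strip(), fmt).date()
        except ValueError:
            continue
    raise ValueError(f"unrecognized log date: {text!r}")
```

Both parse_log_date("2023/10/25") and parse_log_date("10/25/2023") return the same date object.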

New in dtSearch 2022.02 (Build 8775) (Dec 2, 2022)

New in dtSearch 2021.02 (Build 8730) (Dec 7, 2021)

Index Groups:
Index groups provide a way to organize your indexes in the Search dialog box to make very large numbers of indexes easier to manage. To enable this option, click Options > Preferences > Search Options, and check the box to "Show indexes by group". When groups are enabled, if an index name contains a colon, then the part before the colon is considered to be the group. For example, if an index is named "Business: Records", then the group is "Business". In the Search dialog box, "Records" would appear under a collapsible "Business" group heading.
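
The naming rule can be illustrated with a short sketch. This is not dtSearch code: the function name split_index_group is hypothetical, and splitting at the first colon and trimming surrounding spaces are assumptions drawn from the "Business: Records" example.

```python
from typing import Optional, Tuple


def split_index_group(index_name: str) -> Tuple[Optional[str], str]:
    """Return (group, display_name); group is None when there is no colon."""
    # Assumption: only the first colon separates the group from the name.
    group, sep, rest = index_name.partition(":")
    if not sep:
        return None, index_name.strip()
    return group.strip(), rest.strip()
```

With this sketch, "Business: Records" yields the group "Business" and the display name "Records", and a name without a colon has no group.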
"Find indexes":
The new "find indexes" control in the Search dialog box lets you filter the list of indexes by name and quickly select indexes to search using the keyboard.
To enable it, click Options > Preferences > Search Options, and check the box to "Show index finder".
To use "find indexes", press Ctrl+F in the Search dialog box and type any part of an index name. As you type, the index list updates to show only matching index names. You can also use the * and ? wildcard characters to find indexes using wildcard matches.
Press Ctrl+Q to quickly check only the first listed index and return to the search request box, or press ENTER to just return to the search request box.
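
The filtering behavior of "find indexes" can be approximated as follows. This is only an illustrative sketch, not the dtSearch implementation: the function name find_indexes is hypothetical, and both case-insensitive matching and treating wildcard patterns as unanchored (matching anywhere in the name) are assumptions.

```python
from fnmatch import fnmatchcase
from typing import Iterable, List


def find_indexes(index_names: Iterable[str], typed: str) -> List[str]:
    """Filter index names by typed text, honoring * and ? wildcards."""
    names = list(index_names)
    needle = typed.strip().lower()
    if not needle:
        return names  # nothing typed: show every index
    # fnmatch also gives "[" a special meaning; escape it so that only
    # * and ? act as wildcards.
    needle = needle.replace("[", "[[]")
    # Surrounding the text with * makes it match any part of the name.
    pattern = f"*{needle}*"
    return [n for n in names if fnmatchcase(n.lower(), pattern)]
```

For example, with the names "Business: Records", "Business: Invoices", and "Personal: Mail", typing "rec" keeps only the first name and typing "ma?l" keeps only the last.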
Better handling of font scaling on systems with multiple monitors.
Added sort direction arrows to the search results column headers.
Added Ctrl+Q hotkey for checking/unchecking checkboxes to select items in search results.
Added support for using 64-bit Adobe Acrobat/Adobe Reader to display PDF files. The 64-bit dtSearch PDF Search Highlighter plug-in is also needed for hit-highlighting to work.
The dtSearch Desktop/Network search shortcut now launches the dtSearch Desktop/Network version that corresponds to the Adobe Reader/Acrobat version installed, so if you have the 64-bit version of Adobe Acrobat or Adobe Reader, the shortcut will automatically launch the 64-bit version of dtSearch Desktop/Network.
Added option in Options > Preferences > Indexing resources to "Store a copy of indexed Outlook items in the index". Checking this option eliminates the need to run the 32-bit version of dtSearch Desktop/Network if you are using the 32-bit version of Outlook and the 64-bit version of dtSearch Desktop/Network if you are using the 64-bit version of Outlook, because dtSearch can display retrieved Outlook items by extracting them from the index.

New in dtSearch 7.96.8668 (Jun 10, 2020)

New in dtSearch 7.95.8633 (Jan 30, 2020)

New in dtSearch 7.84.8404 (Aug 26, 2016)

New in dtSearch 7.83.8353 (Aug 26, 2016)

New in dtSearch 7.82.8339 (Feb 10, 2016)

New in dtSearch 7.81.8281 (Feb 10, 2016)

New in dtSearch 7.80.8253 (Feb 10, 2016)

New in dtSearch 7.79.8235 (Mar 18, 2015)

New in dtSearch 7.78.8215 (Nov 5, 2014)

New in dtSearch 7.77.8205 (Sep 3, 2014)

New in dtSearch 7.76.8193 (May 21, 2014)

New in dtSearch 7.75.8178 (Mar 17, 2014)

New in dtSearch 7.74.8153 (Oct 15, 2013)

New in dtSearch 7.74.8152 (Oct 12, 2013)

New in dtSearch 7.74.8150 (Oct 5, 2013)

New in dtSearch 7.73.8128 (Aug 13, 2013)

New in dtSearch 7.73.8126 (Jun 29, 2013)

New in dtSearch 7.73.8123 (Jun 8, 2013)

New in dtSearch 7.73.8121 (May 28, 2013)

New in dtSearch 7.73.8120 (May 14, 2013)

New in dtSearch 7.72.8095 (Mar 7, 2013)

New in dtSearch 7.72.8091 (Feb 7, 2013)

New in dtSearch 7.72.8089 (Feb 2, 2013)

New in dtSearch 7.71.8080 (Dec 4, 2012)

New in dtSearch 7.70.8063 (Dec 4, 2012)

dtSearch Desktop:
New support for displaying images embedded in Office documents (DOC, DOCX, PPT, PPTX, XLS, XLSX, RTF, EML). To enable display of images in dtSearch Desktop, click Options > Preferences > Document display, and check the box to "Display images in documents".
Added new options in dtSearch Desktop to (1) hide MIME headers in emails, (2) show properties of images embedded in documents, and (3) control whether paths are indexed along with filenames when the "Index filenames as text" option is enabled. These options are in the Options > Preferences > Indexing Options dialog box.
dtSearch Engine:
Embedded attachments, objects and images in documents can be extracted using dtsExtractionOptions (C++) or ExtractionOptions (Java and .NET), which specify output locations and rules for filename generation. Currently the following are supported:
Attachments in EML, MSG, DBX, TNEF (winmail.dat), PDF, MDB and ACCDB (Access);
objects in DOC, DOCX, XLS, XLSX, PPT, PPTX, RTF;
images in DOC, DOCX, PPT, PPTX, XLS, XLSX, RTF, EML, MDB and ACCDB (Access).
New single-document option for indexing Access (*.mdb, *.accdb), XBase (*.dbf), and Comma-separated values (*.csv) files.
By default, dtSearch indexes each record of database files (*.mdb, *.accdb, *.csv, *.dbf) as a separate document. This new option provides a way to index all records in a database file as a single document. For more information, see dtSearchApiRef.chm (Overviews > Databases and Fields > Database files (*.mdb, *.dbf, *.csv))
Added dtsoFfShowImageProperties flag in Options.FieldFlags to display image properties (such as EXIF data) for images embedded in documents. Image properties are always indexed for images in separate files. This flag only affects images embedded in documents, such as a .jpg embedded in a Word file. A related change, made for consistency, affects the handling of image files embedded in .eml email files. Previously these properties were always extracted. Now they will only be extracted when the dtsoFfShowImageProperties flag is set, so .eml files will be handled consistently with other file formats.
Fixes and minor enhancements:
Eliminated use of FILE_FLAG_RANDOM_ACCESS, which could cause excessive memory consumption under Windows Server 2008 because of what appears to be a bug in Windows caching behavior (see http://support.microsoft.com/kb/2549369 for more information).
Zlib version updated to 1.2.7
dtSearch.Spider2.dll and dtSearch.Spider4.dll have new dependencies on zlib DLLs zlib_wapidll_{VC8/VC10}_{32/64}.dll to handle gzipped sitemap.xml files.
Added file parsers for Ichitaro word processor versions 5 and later.
File parser bug fixes affecting MSG, PDF, DOCX, PPTX, Excel 2, RTF
Message attachments to MIME emails are now indexed as attachments (so they can be handled consistently with other attachments in the new attachment-related features described above) rather than being merged with the text of the message.
Added reporting of PDF files that do not contain any page text. In dtSearch Desktop, these will appear in the index log with "Image Only" after the type name (click View Log in the Update Index dialog box to see the log of indexed files). In the API, the flag fiImageOnly will be set in IndexFileInfo (.NET, Java) or dtsIndexProgressInfo.fileInfoFlags (C++) during indexing.
Removed extra path information from headers in containers converted to text using FileConverter.exe or FileConvertJob with the dtsConvertInlineContainer flag
Removed "Document Properties" caption from Word, PowerPoint, and Excel 2003 file properties. For applications that require this caption for backward compatibility, use the new flag dtsoFfIncludeDocumentPropertiesCaption in Options.FieldFlags
Added new values to SearchReportJob.Header and SearchReportTemplate.rtf: %%Ordinal%%, %%DocId%%, %%Type%%
Added new dtsConvertIncludeBOM flag to FileConverter.Flags to add UTF-8 BOM to UTF-8 output
FileConverter with dtsConvertJustDetectType produces more specific type ids for image, music, and video files instead of it_Media
Fixed search/highlighting error affecting the pre/N and w/N operators
Added new dtsnIndexFolderInaccessible callback notification in IndexJob, logging in indexlog.dat, and logging in HTML index log of inaccessible folders during indexing
Fixed incorrect time zone adjustment of PDF built-in creation and modification date fields
Fixed too-long filenames generated for items extracted from PST files (names could be too long for some file systems when copied using Edit > Copy File in dtSearch Desktop)
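
For readers consuming converted output, the dtsConvertIncludeBOM entry above refers to the UTF-8 byte order mark, the three bytes EF BB BF placed at the start of a file. The sketch below does not call the dtSearch API. It only shows, with hypothetical helper names, what the mark looks like and how a consumer can read text that may or may not carry it.

```python
import codecs


def with_utf8_bom(text: str) -> bytes:
    """Encode text as UTF-8 with a leading byte order mark."""
    return codecs.BOM_UTF8 + text.encode("utf-8")


def has_utf8_bom(data: bytes) -> bool:
    """Report whether the data starts with the UTF-8 byte order mark."""
    return data.startswith(codecs.BOM_UTF8)


def read_utf8(data: bytes) -> str:
    """Decode UTF-8 bytes, dropping a leading BOM if one is present."""
    # The "utf-8-sig" codec accepts input with or without the mark.
    return data.decode("utf-8-sig")
```

Decoding with "utf-8-sig" lets the same reader handle output produced with or without the flag.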

New in dtSearch 7.68.8025 (Jun 14, 2012)

New in dtSearch 7.67.7973 (Jun 14, 2012)

New in dtSearch 7.66.7936 (Jun 14, 2012)

Added .NET 4.0 versions of the .NET API (dtSearchNetApi4.dll, dtSearch.Spider4.dll) and sample code for C# .NET 4.0 and VB.NET 4.0
Added dtsSearchFastSearchFilterOnly search flag to enable much faster, optimized generation of a SearchFilter from a search when no other output is required from the search.
Added WordListBuilder.GetLastError to the C++, Java, and .NET APIs to provide better reporting of errors resulting from WordListBuilder calls.
Added new flag to enable caching of field values in WordListBuilder to make ListFieldValues calls faster. The flag is dtsWordListEnableFieldValuesCache (in the WordListBuilderFlags enumeration) and is passed to WordListBuilder using the new SetFlags method.
Added new .NET method Server.SetEnginePath to allow ASP.NET application deployment without administrative access
Added new .NET sample application, AzureDemo, demonstrating use of the dtSearch Engine in an Azure instance. For documentation explaining how to deploy in Azure, see:
Overviews > Installing the dtSearch Engine > Deployment steps: Azure 64-bit (in dtSearchApiRef.chm).
Added a way to disable file parsers using the file type table (filetype.xml) by setting the TypeId to the id of the parser to disable and the Flags value to 2.
Added a mechanism for a dtsInputStream to simulate an I/O error by returning a negative value from read() of less than 10,000. When this occurs, dtSearch will interpret it as an I/O error and halt processing of the current input file immediately, reporting an I/O error through the API.
Java and .NET API: Fixed IIndexStatusHandler bug causing PercentDone to remain zero during compression of an index
Added docId of document being removed from an index to IndexFileInfo reporting through IIndexStatusHandler
Fixed FileConverter bug that caused invalid XML to be generated from some conversions due to output of character code 128.
Added SearchJob.UnindexedSearchFlags in the .NET API and SearchJob.setUnindexedSearchFlags in the Java API to enable case and accent-sensitive unindexed searches in these APIs
Added .NET SearchFilter.GetItems() to provide access to an array of the doc ids selected in a SearchFilter
File parser bug fixes affecting Office XML drawings embedded in Word, PowerPoint, and Excel files; interpretation of OEM character codes (_x00NN_) in Excel 2007 files; dates prior to 1970 in MDB files; performance and memory use parsing MIME files; Word auto-numbering; PDF

New in dtSearch 7.65.7907 (Jun 14, 2012)

New in dtSearch 7.65.7906 (Jun 14, 2012)

New in dtSearch 7.64.7876 (Jun 14, 2012)

New in dtSearch 7.63.7836 (Jun 14, 2012)

New in dtSearch 7.63.7835 (Jun 14, 2012)

New in dtSearch 7.62.7804 (Jun 14, 2012)

New in dtSearch 7.61.7769 (Jun 14, 2012)

New in dtSearch 7.54 Build 7680 (Mar 27, 2009)