CCExtractor Changelog

What's new in CCExtractor 0.94

Dec 16, 2021

BOM is no longer enabled by default on Windows platforms
CEA-708: Rust decoder is now default instead of C decoder
CEA-708 subs are now extracted by default
New: Add check for Minimum Supported Rust Version (MSRV) (#1387)
Fix: Fix CEA-708 Carriage Return command implementation
Fix: Fix bug with startat/endat parameter (#1396)
Fix: Mac Build processes (#1390)
Fix: Fix bug with negative delay parameter (#1365)
Pin Rust to 1.56.0 due to bug in 1.57.0
Changes in release artifacts:
Reintroduction of a minimal CCExtractor source package for Linux (omits the Windows and git folders)
Add a portable version for Windows

New in CCExtractor 0.93 (Aug 17, 2021)

New in CCExtractor 0.92 (Aug 10, 2021)

New in CCExtractor 0.91 (Jul 26, 2021)

New in CCExtractor 0.90 (Jul 14, 2021)

New in CCExtractor 0.88 (May 22, 2019)

New in CCExtractor 0.87 (Jan 22, 2019)

New: Upgrade libGPAC to 0.7.1.
New: mp4 tx3g & multitrack subtitles.
New: Guide to update dependencies (docs/Updating_Dependencies.txt).
New: Add LICENSE File (#959).
New: Display quantisation mode in info box (#954).
New: Add instruction required to build ccextractor with HARDSUBX support (#946).
New: Added version no. of libraries to --version.
New: Added -quant (OCR quantization function).
New: Python API now compatible with Python 3.
Fix: linux/builddebug: Added non-local directories to the include search path so we don't
require a locally compiled tesseract or leptonica.
Fix: Correct -HARDSUBX Bug In CMake, allow build with hardsubx using cmake (#966).
Fix: possible segfaults in hardsubx_classifier.c due to strdup (#963).
Fix: Improve the start and end timestamps of extracted burned in captions (#962).
Fix: Update COMPILATION.md (#960).
Fix: Fixed crash with "-out=report" and "-out=null".
Fix: -nocf not working with OCR'ing (#958).
Fix: segfault in add_cc_sub_text and initialize to NULL in init_encoder (#950).
Fix: ccx_decoders_common.c: Copy data type when creating a copy of the subtitle structure.
Fix: Implicit declaration of these functions throws warning during build (#948).
Fix: ccx_decoders_common.c: Properly release allocated resources on free_subtitle().
Fix: Added a datatype member to struct cc_subtitle - needed so we can properly free all
memory when void *data points to a structure that has its own pointers.
Fix: dvb_subtitle_decoder.c: When combining image regions verify that the offset is
never negative.
Fix: Updated travis.yml to fix osx build (#947).
Fix: Add utf8proc src file to cmake, updated header file (#944).
Fix: Added required pointers on freep() calls.
Fix: Removed dvb_debug_traces_to_stdout and used the usual dbg_print instead.
Fix: Additional debug traces for DVB.
Fix: Fix minor memory leak in ocr.c.
Fix: Fix issue with displaying utf8proc version.
Fix: Fix failing cmake due to liblept/tesseract header files.
Fix: Added missing \n in params.c.
Fix: builddebug: Use -fsanitize=address -fno-omit-frame-pointer.
Fix: ccx_decoders_common.c: Removed trivial memory leak.
Fix: ccx_encoders_srt.c: Made sure a pointer is non-NULL before dereferencing.
Fix: dvb_subtitle_decoder.c: Initialize pointer members to NULL when creating a structure.
Fix: lib_ccx.c: Initialize (memset 0) structure cc_subtitle after memory allocation.
Fix: Added verboseness to error/warnings in dvb_subtitle_decoder.c.
Fix: dvb_subtitle_decoder.c: Work on passing invalid streams errors upstream (plus some
warning messages) so we can eventually recover from this situation instead of crashing.
Fix: telxcc.c: Currently setting a colour doesn't necessarily add a space even though the
specifications mandate it. (#930).
Fix: dvb_subtitle_decoder.c: Fix null pointer dereference when region==NULL in write_dvb_sub.
Fix: DVB Teletext subtitle incomplete.
Fix: replace all 0xA characters within startbox with 0x20.
Fix: DVB Teletext subtitle incomplete (#922).
Fix: Add missing return value to one of the returns in process_tx3g().
Fix: Typos and other minor bugs.
Fix: Tidy CMakeLists & vcxproj (#920).
Fix: Added m2ts and -mxf to help screen.
Fix: Added MKV to demuxer_print_cfg.
Fix: Added MXF to demuxer_print_cfg.
Fix: "Out of order packets" error had wrong print() parameters.
Fix: Updated Python documentation.
Fix: Fix incorrect path in XML (#904).
Fix: linux build script (non-debug): Don't hide warnings from compiler.
Fix: linux build script (debug): Display which step of the build script we're in.
Fix: Make the build reproducible (#976).
Fix: Remove instance of o1 and o2 from help.
Fix: Colors of DVB subtitles with depth 2 broken due to a missing break.
Fix: CEA-708: Caption loss due to CW command (#991).
Fix: CEA-708: Update patch for windows priority with functions (#990).

New in CCExtractor 0.75 (Apr 25, 2016)

New in CCExtractor 0.74 (Apr 25, 2016)

New in CCExtractor 0.73 (Apr 25, 2016)

New in CCExtractor 0.72 (Apr 25, 2016)

New in CCExtractor 0.71 (Apr 25, 2016)

New in CCExtractor 0.70 (Apr 25, 2016)

New in CCExtractor 0.69 (Apr 25, 2016)

New in CCExtractor 0.68 (Apr 25, 2016)

New in CCExtractor 0.67 (Apr 25, 2016)

New in CCExtractor 0.66 (Apr 25, 2016)

Fixed bug in auto detection code that triggered a message about file being out of sync.
Added -investigate_packets
The PMT is used to select the most promising elementary stream to get captions from. Sometimes captions are where you least expect them, so -datapid allows you to select an elementary stream manually, in case the CC location is not obvious from the PMT contents. To assist in looking for the right stream, the parameter "-investigate_packets" will have CCExtractor look inside each stream, looking for CC markers, and report the streams that are likely to contain CC data even if it can't be determined from their PMT entry.
Added -datastreamtype to manually select a stream based on its type instead of its PID. Useful if your recording program always hides the caption under the same stream type.
Added -streamtype so if an elementary stream is selected manually for processing the streamtype can be selected too. This can be needed if you process for example a stream that is declared as "private MPEG" in the PMT, so CCExtractor can't tell what it is.
Usually you'll want -streamtype 2 (MPEG video) or -streamtype 6 (MPEG private data).
PMT content listing improved; it now shows the stream type for more types.
Fixes in roll-up, cursor was being moved to column 1 if a RU2, RU3 or RU4 was received even if already in roll-up mode.
Added -autoprogram. If a multiprogram TS is processed and autoprogram is used CCExtractor will analyze all PMTs and use the first program that has a suitable data stream.
Timed transcript (ttxt) now also exports the caption mode (roll-up, paint-on, etc.) next to each line, as it's useful to detect things like commercials.
Content Advisory information from XDS is now decoded if it's transmitted in "US TV parental guidelines" or "MPA".
Other encodings, such as Canada's, are not supported yet due to lack of samples.
Copy Management information from XDS is now decoded.
Added -xds. If present (and export format is timed transcript only), XDS information will be saved to file (same file as the transcript, with XDS being clearly marked). Note that for now all XDS data is exported even if it doesn't change, so the transcript file will be significantly larger.
Added some PaintOn support, at least enough to prevent it from breaking things when the other modes are used.
Removed afd_data() warning. AFD doesn't carry any caption related data. AFD is still detected in code in case we want to do something with it later anyway.
Ported last changes from Petr Kutalek's telxcc. Current version is 2.4.4.
In teletext mode when exporting to transcript (not .srt), an effort is made to detect and merge line duplicates. This is done by using the Levenshtein distance, which is the number of changes required to convert one string to another. To simplify things, strings are compared up to the length of the shortest one.
There are 3 parameters that can be used to tweak the thresholds:
deblev: Enable debug so the calculated distance for each two strings is displayed. The output includes both strings, the calculated distance, the maximum allowed distance, and whether the strings are ultimately considered equivalent or not, i.e. whether the calculated distance is less than or equal to the max allowed.
levdistmincnt value: Minimum distance we always allow regardless of the length of the strings. Default 2. This means that if the calculated distance is 0, 1 or 2, we consider the strings to be equivalent.
levdistmaxpct value: Maximum distance we allow, as a percentage of the shortest string length. Default 10%. For example, consider a comparison of one string of 30 characters and one of 60 characters. We want to determine whether the first 30 characters of the longer string are more or less the same as the shortest string, i.e. whether the longest string is the shortest one plus new characters and maybe some corrections. Since the shortest string is 30 characters and the default percentage is 10%, we would allow a distance of up to 3 between the first 30 characters.
Added -lf : Use UNIX line terminator (LF) instead of Windows (CRLF).
Added -noautotimeref: Prevent UTC reference from being auto set from the stream data.
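
The teletext duplicate check described in the entries above can be illustrated with a short Python sketch. This is not CCExtractor's code (the tool itself is written in C); the function names are hypothetical, and two details are assumptions that the changelog does not state: the percentage is rounded down with integer division, and the allowed distance is the larger of the levdistmincnt and levdistmaxpct thresholds.

```python
def levenshtein(a, b):
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            cur.append(min(prev[j] + 1,          # deletion
                           cur[j - 1] + 1,       # insertion
                           prev[j - 1] + cost))  # substitution
        prev = cur
    return prev[-1]


def max_allowed_distance(short_len, min_cnt=2, max_pct=10):
    """Allowed distance for a string of short_len characters.

    Assumption: the effective limit is the larger of the fixed minimum
    (levdistmincnt) and the percentage of the length (levdistmaxpct),
    with the percentage rounded down.
    """
    return max(min_cnt, short_len * max_pct // 100)


def lines_equivalent(s1, s2, min_cnt=2, max_pct=10):
    """Compare two lines up to the length of the shorter one."""
    n = min(len(s1), len(s2))
    distance = levenshtein(s1[:n], s2[:n])
    return distance <= max_allowed_distance(n, min_cnt, max_pct)
```

With the defaults, a 30-character line gets an allowed distance of 3, matching the example in the levdistmaxpct entry, while a 10-character line falls back to the fixed minimum of 2.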

New in CCExtractor 0.65 (Apr 25, 2016)

New in CCExtractor 0.64 (Oct 29, 2012)

New in CCExtractor 0.63 (Aug 17, 2012)

New in CCExtractor 0.59 (Oct 7, 2011)

New in CCExtractor 0.54 (Apr 17, 2009)

New in CCExtractor 0.53 (Feb 24, 2009)

New in CCExtractor 0.52 (Dec 22, 2008)

New in CCExtractor 0.50 (Dec 13, 2008)

New in CCExtractor 0.49 (Dec 10, 2008)

New in CCExtractor 0.46 (Nov 25, 2008)

New in CCExtractor 0.45 (Nov 17, 2008)

New in CCExtractor 0.44 (Sep 10, 2008)

New in CCExtractor 0.41 (Jun 16, 2008)

New in CCExtractor 0.40 (May 21, 2008)

New in CCExtractor 0.30 (May 25, 2007)