Audiveris Changelog

What's new in Audiveris 5.3.1

Jul 4, 2023
  • This release had to be quickly published to fix a bug on small void head templates in 5.3.0 release.
  • Setting the "Small Heads" processing switch led to lots of false positives during HEADS step, due to corrupted templates built for small void heads.

New in Audiveris 5.3.0 (Jun 28, 2023)

  • Long-awaited support of drums unpitched notation
  • Support for various music and text font families
  • User management of logical parts
  • User edition of staff geometry
  • Support for multi-measure rests, measure repeats, octave shifts, fingering, plucking, etc.
  • Use of Java 17, Tesseract 5.x, MusicXML 4.0

New in Audiveris 5.2.5 (Jan 18, 2022)

  • Fixed incompatibility between dummy-rests created on-the-fly at export time and potential related beams
  • Fixed user prompt on staff selection, not applicable to brace drop or assignment

New in Audiveris 5.2.4 (Sep 15, 2021)

  • Better handling of vertical alignment within a time slot
  • Several minor bug fixes

New in Audiveris 5.2.3 (Aug 5, 2021)

  • Support for rest chords interleaved within beam head chords
  • Bug fixes on links between heads and stems
  • Better version upgrade mechanism

New in Audiveris 5.2.2 (Jul 9, 2021)

  • Chords popup menu offers "Next In Voice" relation to better guide voice/slot assignments
  • Improved tie slurs detection on staff, across systems and across pages
  • Fixed regression on some head/stem configurations in 5.2.1
  • Minor bug fixe

New in Audiveris 5.2.1 (Jun 21, 2021)

  • Focus on poor-quality scores
  • Detection of tablatures
  • Support for 1-line percussion staves, for merged grand staff
  • Support for cross heads, implicit tuplets, non measure-long whole rests
  • Ability to manually edit any Inter instance
  • Ability to insert compound notes, key signature cancel
  • Ability to guide voice and time slot for chords
  • Ability to limit book processing on selected sheets
  • Handbook hosted on GitHub pages

New in Audiveris 4.2 (Dec 14, 2012)

  • Distribution Changes:
  • Installation: Major OSes Windows and Linux and machine architectures x86 and x64 are supported via dedicated installers. The installers take care of all Audiveris dependencies (Java runtime, C++ runtime, musical font, etc).
  • Nota: Support for MacOS has been dropped from the scope of this release to avoid further delays.
  • OCR: A few selected languages are pre-installed with Audiveris distribution (deu, eng, fra, ita). Additional languages can be supported by downloading the related trained data from the dedicated Tesseract web page.
  • NetBeans: A pre-populated nbproject folder provides NetBeans support out-of-the-box.
  • New Features:
  • OCR: Tesseract V3.02 has been integrated in place of oldish V2.04 version. This much more powerful engine has led to a global redesign of text handling within Audiveris. There is now a dedicated TEXTS step which performs a layout analysis on each whole system image and transcribes the identified textual items. Note also that several languages can be selected at the same time.
  • Binarization: Extracting foreground pixels from the background has long been performed using a global threshold on pixel gray value. Even images with non-uniform illumination can now be processed with an adaptive filter which takes into account the neighborhood of the current pixel.
  • Glyph recognition: The major part of neural network input consists in moments which capture glyph key characteristics. Former Hu geometric moments have been replaced by ART moments (Angular Radial Transform, as used by MPEG-7) which are less sensible to noise.
  • Plugins: Audiveris MusicXML output can be "piped" to external softwares such as score editors or MIDI sequencers, through a flexible plugin mechanism. Consequently, these features have been removed from Audiveris application.
  • Bug Fixes:
  • PDF input: Several free Java libraries have been tested (PDFRenderer for a long time, then JPedal and PDFBox) but none was really satisfactory. Hence support for PDF input is now delegated to a Ghostscript sub-process, a fully functional and perennial solution.
  • Other Changes:
  • Doc: A comprehensive Handbook is now available from Audiveris web page, as well as the API JavaDoc of the current release. The former installation tab is now merged with the first chapter of the handbook.
  • Wiki: The online Audiveris Wiki contains detailed documentation about how to process each score of the set of examples available on MakeMusic/Recordare site. It is also used to gather information about evolutions being considered for Audiveris software.

New in Audiveris 4.1 (Dec 14, 2012)

  • Distribution Changes:
  • Several installation files have been published, all using the 4.1beta core name. This reflects the status of continuous development rather than stable release of the software.
  • New Features:
  • Filaments: They are long glyphs representing the core of either horizontal or vertical lines (staff lines candidates and barlines candidates respectively). These filaments are formalized in natural splines, which are sequences of Bézier curves with continuity up to the second derivative.
  • Grid: The staff lines and barlines are connected into a grid of sometimes rather wavy lines. The grid itself is taken as the referential for all the other glyphs, whatever the potential skew or other distortion of the image, and thus saving the need for any pre-processing. Moreover, one can on demand easily build and save a "dewarped" version of the initial image.
  • Scale: Additional key informations are derived from run length histograms (jitter on staff line thickness and spacing, typical beam height, whether the image is music or not).
  • Systems: The boundary between two consecutive systems is now a broken line, resulting from the incremental inclusion of glyphs into their nearest system.
  • Training: Besides full sheets taken as training samples, the user can select a mode that takes every manual assignment as a new training sample.
  • Symbols: The HEAD_AND_FLAG family of compound symbols no longer exists, thanks to an aggressive strategy in glyph split pattern.
  • Known Issues:
  • OCR: We are still stuck to the old Tesseract version (2.04). The new Tesseract generation (3.x) has been out for more than one year now but still lacks a Java connection under Windows.
  • Other Changes:
  • Time: All time values, such as offsets within a measure, are computed using rational values, which makes them independent of the score divisions value.

New in Audiveris 4.0 (Dec 14, 2012)

  • New Features:
  • Display: The main application window has been simplified. Only two views are now shown for each sheet: Picture (focused on input image) and Data (focused on items detected). We no longer have separate windows for sheet and score. The score elements are displayed in a translucid manner on top of the sheet glyphs they represent, in order to visually catch any discrepency. Separate voices can be displayed each in a specific color
  • Every other window (Log, Errors, Boards) can be displayed or hidden, and each individual board can be selected at will
  • Font: Former symbol bitmaps have been dropped for the use of a TrueType music font (Stoccata.ttf then MusicalSymbols.ttf). This allows endless zooming of displays and printouts with no loss of quality
  • The font is even used to build artificial symbols used for initial training of the neural network
  • Print: Ability to print the resulting score into a PDF file
  • Multi-page: Multi-page images (using PDF or TIFF format) can be transcribed to multi-page scores in memory
  • A disk-based prototype version, using a map/reduce approach, allows to combine existing MusicXML pages into a single score

New in Audiveris 3.4 (Dec 14, 2012)

  • Distribution Changes:
  • Libraries: All the external jars (23 as of this writing) needed to rebuild and/or run Audiveris are now provided in a dedicated /lib folder available in the download area. A developer can still pick up a newer jar version from the Internet.
  • Player: The XenoPlay MusicXML player has been replaced by a better player, named Zong!
  • New Features:
  • Bench data: To allow the analysis of multiple batch runs, and compare the recognition efficiency, each sheet processing can log key data in a dedicated file. For the same purpose, time-out values can be defined for script or step processing.
  • Bar Lines: The user can now interactively assign / deassign a bar line that defines parts, thus recreating the systems from scratch.
  • Constants: All application constants can now be set from the CLI with the -option keyword. This complements the ability to set them from the Tools | Options UI menu.
  • Dots: Support for double dots, ability to assign the role of any dot (augmentation, repeat bar line, staccato)
  • Horizontals: Horizontal entities (ledgers, endings) can now be forced or deassigned manually.
  • MIDI Player: The MIDI playback is now driven from a separate console window, borrowed from Zong! player.
  • OCR: Tesseract OCR is now available under both Windows and Linux.
  • Score: From a dedicated Shape palette, the user can Drag n' Drop a (virtual) glyph to either the score view or the sheet view, thus injecting entities directly into the score structure.
  • ScoreView: The zoom of the score view can now be adjusted at will, thanks to a slider and better symbol bitmap definitions. A next version will replace them with the use of Stocatta true-type font.
  • Time Signature: The user can now enter any custom time signature, defining numerator and denominator values explicitly.
  • Time Slots: Within a measure, the time slots are meant to gather notes that begin at the same moment in time. The user can now choose at the score level the policy for determining the time slots, either through stem alignment or through note head alignment.
  • Tuplets: 6-tuplets are now supported, as well as tuplets mixing beamed notes with other notes (flagged notes, rests, ...).
  • UI: A new board (Shape palette) is available. It allows drag n' drop for entity injection, easier navigation through shape ranges, and shape assignment by double-click.
  • UI: All boards now have an expand / collapse mechanism, thus allowing to save room in the column of boards.
  • Bug Fixes:
  • Player: The Zong! Player is now more tolerant with respect to measure defects. It no longer throws an exception whenever the notes durations within a measure are not consistent with the measure expected duration.
  • Player Data: The data part of Zong player is now provided as a resource in a dedicated jar file, thus allowing the launching of Audiveris from any location of your computer.
  • Exception handler have been removed from all unitary tests, so that the results are clearly seen as successes or failures
  • Known Issues:
  • Virtual Glyphs: For the time being, the (virtual) glyphs created by direct injection cannot be moved or resized once they have been dropped from the Shape palette to their target view. However, they can be deleted and re-injected (this workaround addresses a move but not a resize).
  • Other Changes:
  • Images: Support for most pixel sizes.
  • Lyrics: Much better handling of lyric text pieces, with the ability for the user to enter extension sign or to split words with a space. The OCR can process several text lines as a whole, which often leads to better results.
  • Symbols: The symbols bitmap definitions (in the /symbol folder) have been refined with at least a 16-pixel interline definition, resulting in better display notably in score view.
  • Tiff: Images are forwarded to Tesseract OCR by memory, avoiding temporary files

New in Audiveris 3.3 (Jul 6, 2010)

  • Distribution Changes:
  • Libraries: All the external jars (23 as of this writing) needed to rebuild and/or run Audiveris are now provided in a dedicated /lib folder available in the download area. A developer can still pick up a newer jar version from the Internet.
  • Player: The XenoPlay MusicXML player has been replaced by a better player, named Zong!, still developed by Andreas Wenger.
  • New Features:
  • Bench data: To allow the analysis of multiple batch runs, and compare the recognition efficiency, each sheet processing can log key data in a dedicated file. For the same purpose, time-out values can be defined for script or step processing.
  • Bar Lines: The user can now interactively assign / deassign a bar line that defines parts, thus recreating the systems from scratch.
  • Constants: All application constants can now be set from the CLI with the -option keyword. This complements the ability to set them from the Tools | Options UI menu.
  • Dots: Support for double dots, ability to assign the role of any dot (augmentation, repeat bar line, staccato, ...)
  • Horizontals: Horizontal entities (ledgers, endings) can now be forced or deassigned manually.
  • MIDI Player: The MIDI playback is now driven from a separate console window, borrowed from Zong! player.
  • OCR: Tesseract OCR is now available under both Windows and Linux.
  • Score: From a dedicated Shape palette, the user can Drag n' Drop a (virtual) glyph to either the score view or the sheet view, thus injecting entities directly into the score structure.
  • ScoreView: The zoom of the score view can now be adjusted at will, thanks to a slider and better symbol bitmap definitions. A next version will replace them with the use of Stocatta true-type font.
  • Time Signature: The user can now enter any custom time signature, defining numerator and denominator values explicitly.
  • Time Slots: Within a measure, the time slots are meant to gather notes that begin at the same moment in time. The user can now choose at the score level the policy for determining the time slots, either through stem alignment or through note head alignment.
  • Tuplets: 6-tuplets are now supported, as well as tuplets mixing beamed notes with other notes (flagged notes, rests, ...).
  • UI: A new board (Shape palette) is available. It allows drag n' drop for entity injection, easier navigation through shape ranges, and shape assignment by double-click.
  • UI: All boards now have an expand / collapse mechanism, thus allowing to save room in the column of boards.
  • Bug Fixes:
  • Player Data: The data part of Zong player is now provided as a resource in a dedicated jar file, thus allowing the launching of Audiveris from any location of your computer.
  • Exception handler have been removed from all unitary tests, so that the results are clearly seen as successes or failures
  • Other Changes:
  • Images: Support for most pixel sizes.
  • Lyrics: Much better handling of lyric text pieces, with the ability for the user to enter extension sign or to split words with a space. The OCR can process several text lines as a whole, which often leads to better results.
  • Symbols: The symbols bitmap definitions (in the /symbol folder) have been refined with at least a 16-pixel interline definition, resulting in better display notably in score view.
  • Tiff: Images are forwarded to Tesseract OCR by memory, avoiding temporary files