Hadoop-BAM Changelog

What's new in Hadoop-BAM 7.0

Sep 11, 2014

Switching from Picard/Samtools to HTSJDK
First release to OSS Sonatype
Renaming of packages
Changes to VariantContextCodec: encoding of genotypes generated by other means than VCF import (thanks to Joel Thibault)
Change of JDK version requirements

New in Hadoop-BAM 6.2 (Apr 4, 2014)

New in Hadoop-BAM 6.1 (Mar 26, 2014)

New in Hadoop-BAM 6.0 (Jul 8, 2013)

New in Hadoop-BAM 5.1 (Nov 26, 2012)

New in Hadoop-BAM 5.0 (Oct 18, 2012)

New in Hadoop-BAM 4.0 (Oct 18, 2012)

SAM input and output support. AnySAMInputFormat handles transparent support of both SAM and BAM inputs even in the same Hadoop job. For output, there is no SAMOutputFormat; only AnySAMOutputFormat, which can be used to output either SAM or BAM. BAMOutputFormat will be deprecated in
the future.
Fix longstanding regression in the embedded Picard library causing end-of-file markers to be written into BAM files by every reduce task. For this reason e.g. 'samtools view' refused to show the contents of BAM files output by Hadoop-BAM.
Fix crash on some inputs caused by a bug in fi.tkk.ics.hadoop.bam.custom.hadoop.InputSampler.
Fix possible crash-on-valid situations in heuristic BAM splitting.
Various I/O classes from the Seal project are now incorporated. This includes input formats for FASTQ and QSEQ and an output format for QSEQ.
Unmapped reads are now ordered after, not before, all other reads.
Allow using Hadoop's "-libjars" command line argument instead of
HADOOP_CLASSPATH to specify the Picard .jars. This ended up being fiendishly complicated and somewhat fragile.
Partitioning files are now saved in the output, not input, directory.
'sort' plugin version 3.0:
Important bug fix for merging: conflicting IDs from different files
weren't being properly corrected.
SAM input and output support. Can input SAM and BAM files at the same time and output to either format.
When not using -o, each reducer now outputs headers into the BAM files.
'view' plugin version 1.1, with SAM input support.
Add new 'cat' plugin version 1.0, for concatenating SAM/BAM files. The main intended use case is joining the output of 'sort' when it is used without -o.
'summarize' plugin version 2.0, with SAM input support.
SplittingBAMIndexer can now be used from within the library as well as a command line tool and can index files directly in HDFS.
Various minor bug fixes.
Lots of documentation updates.
Various clarifications in the README.
Much quieter error messages when plugin loading fails.
build.xml now looks in the HADOOP_HOME environment variable for Hadoop
jars. As a result, the required minimum version of Ant is now 1.7.1.
fi.tkk.ics.hadoop.bam.custom is now compiled with warnings off, for less noisy builds.

Hadoop-BAM Changelog

What's new in Hadoop-BAM 7.0

New in Hadoop-BAM 6.2 (Apr 4, 2014)

New in Hadoop-BAM 6.1 (Mar 26, 2014)

New in Hadoop-BAM 6.0 (Jul 8, 2013)

New in Hadoop-BAM 5.1 (Nov 26, 2012)

New in Hadoop-BAM 5.0 (Oct 18, 2012)

New in Hadoop-BAM 4.0 (Oct 18, 2012)

New in Hadoop-BAM 3.3 (Feb 23, 2012)

New in Hadoop-BAM 3.2 (Feb 23, 2012)

New in Hadoop-BAM 3.1 (Feb 23, 2012)

New in Hadoop-BAM 3.0 (Feb 23, 2012)

New in Hadoop-BAM 2.0 (Feb 23, 2012)