HTSlib is a library for reading and writing bioinformatics file formats. CRAM is a compressed format which stores DNA sequence alignment data. As one method of removing redundant data, CRAM uses reference-based compression so that instead of storing the full sequence for each alignment record it stores a location in an external reference sequence along with a list of differences to the reference at that location as a sequence of "features". When decoding CRAM records, the reference data is stored in a char array, and parts matching the alignment record sequence are copied over as necessary. Due to insufficient validation of the feature data series, it was possible to make the cram_decode_seq() function copy data from either before the start, or after the end of the stored reference either into the buffer used to store the output sequence for the cram record, or into the buffer used to build the SAM MD tag. This allowed arbitrary data to be leaked to the calling function.
Impact
This bug may allow information about program state to be leaked. It may also cause a program crash through an attempt to access invalid memory.
Patches
Versions 1.23.1, 1.22.2 and 1.21.1 include fixes for this issue.
Workarounds
There is no workaround for this issue.
References
4a5ef25
22ec523
2a45eb1
HTSlib is a library for reading and writing bioinformatics file formats. CRAM is a compressed format which stores DNA sequence alignment data. As one method of removing redundant data, CRAM uses reference-based compression so that instead of storing the full sequence for each alignment record it stores a location in an external reference sequence along with a list of differences to the reference at that location as a sequence of "features". When decoding CRAM records, the reference data is stored in a char array, and parts matching the alignment record sequence are copied over as necessary. Due to insufficient validation of the feature data series, it was possible to make the
cram_decode_seq()function copy data from either before the start, or after the end of the stored reference either into the buffer used to store the output sequence for the cram record, or into the buffer used to build the SAMMDtag. This allowed arbitrary data to be leaked to the calling function.Impact
This bug may allow information about program state to be leaked. It may also cause a program crash through an attempt to access invalid memory.
Patches
Versions 1.23.1, 1.22.2 and 1.21.1 include fixes for this issue.
Workarounds
There is no workaround for this issue.
References
4a5ef25
22ec523
2a45eb1