Out-of-bounds read in HTSlib CRAM reader due to improper validation of input

HTSlib is a library for reading and writing bioinformatics file formats. CRAM is a compressed format which stores DNA sequence alignment data. As one method of removing redundant data, CRAM uses reference-based compression so that instead of storing the full sequence for each alignment record it stores a location in an external reference sequence along with a list of differences to the reference at that location as a sequence of "features". When decoding CRAM records, the reference data is stored in a char array, and parts matching the alignment record sequence are copied over as necessary. Due to insufficient validation of the feature data series, it was possible to make the cram_decode_seq() function copy data from either before the start, or after the end of the stored reference either into the buffer used to store the output sequence for the cram record, or into the buffer used to build the SAM MD tag. This allowed arbitrary data to be leaked to the calling function.

Impact

This bug may allow information about program state to be leaked. It may also cause a program crash through an attempt to access invalid memory.

Patches

Versions 1.23.1, 1.22.2 and 1.21.1 include fixes for this issue.

Workarounds

There is no workaround for this issue.

References

4a5ef25
22ec523
2a45eb1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Out-of-bounds read in HTSlib CRAM reader due to improper validation of input

Package

Affected versions

Patched versions

Description

Impact

Patches

Workarounds

References

Severity

CVSS overall score

CVSS v4 base metrics

Exploitability Metrics

Vulnerable System Impact Metrics

Subsequent System Impact Metrics

CVSS v4 base metrics

Exploitability Metrics

Vulnerable System Impact Metrics

Subsequent System Impact Metrics

CVE ID

Weaknesses

Out-of-bounds Read

Improper Validation of Array Index