ParsableByteArray

@UnstableApi
@CheckReturnValue
class ParsableByteArray

Wraps a byte array, providing a set of methods for parsing data from it. Numerical values are parsed with the assumption that their constituent bytes are in big endian order, unless a method's name or description says otherwise (for example, the readLittleEndian methods).

Summary

Constants
`const Int`	`INVALID_CODE_POINT = 1114112` A value that is outside the valid range of Unicode code points.

Public constructors
`ParsableByteArray()` Creates a new instance that initially has no backing data.
`ParsableByteArray(data: ByteArray!)` Creates a new instance wrapping `data`, and sets the limit to `data.length`.
`ParsableByteArray(limit: Int)` Creates a new instance backed by a newly allocated array of `limit` bytes, with the limit set to the same value.
`ParsableByteArray(data: ByteArray!, limit: Int)` Creates a new instance that wraps an existing array.

Public functions
`Int`	`bytesLeft()` Returns the number of bytes yet to be read.
`Int`	`capacity()` Returns the capacity of the array, which may be larger than the limit.
`Unit`	`ensureCapacity(requiredCapacity: Int)` Ensures the backing array is at least `requiredCapacity` long.
`Int`	`limit()` Returns the limit.
`Char`	`peekChar()` Peeks at the next two bytes and interprets them as a big-endian char.
`Char`	`peekChar(charset: Charset!)` This function is deprecated. Either use `peekChar` to peek the next two bytes (big-endian) or `peekCodePoint` to peek in a `Charset`-aware way.
`Int`	`peekCodePoint(charset: Charset!)` Peeks at the code point starting at `getPosition` as interpreted by `charset`.
`Int`	`peekInt()` Peeks the next four bytes as a signed value.
`Int`	`peekUnsignedByte()` Peeks at the next byte as an unsigned value.
`Int`	`peekUnsignedInt24()` Peeks the next three bytes as an unsigned value.
`Unit`	`readBytes(bitArray: ParsableBitArray!, length: Int)` Reads the next `length` bytes into `bitArray`, and resets the position of `bitArray` to zero.
`Unit`	`readBytes(buffer: ByteBuffer!, length: Int)` Reads the next `length` bytes into `buffer`.
`Unit`	`readBytes(buffer: ByteArray!, offset: Int, length: Int)` Reads the next `length` bytes into `buffer` at `offset`.
`String?`	`readDelimiterTerminatedString(delimiter: Char)` Reads up to the next delimiter byte (or the limit) as UTF-8 characters.
`Double`	`readDouble()` Reads the next eight bytes as a 64-bit floating point value.
`Float`	`readFloat()` Reads the next four bytes as a 32-bit floating point value.
`Int`	`readInt()` Reads the next four bytes as a signed value.
`Int`	`readInt24()` Reads the next three bytes as a signed value.
`String?`	`readLine()` Reads a line of text in UTF-8.
`String?`	`readLine(charset: Charset!)` Reads a line of text in `charset`.
`Int`	`readLittleEndianInt()` Reads the next four bytes as a signed value in little endian order.
`Int`	`readLittleEndianInt24()` Reads the next three bytes as a signed value in little endian order.
`Long`	`readLittleEndianLong()` Reads the next eight bytes as a signed value in little endian order.
`Short`	`readLittleEndianShort()` Reads the next two bytes as a signed value in little endian order.
`Long`	`readLittleEndianUnsignedInt()` Reads the next four bytes as an unsigned value in little endian order.
`Int`	`readLittleEndianUnsignedInt24()` Reads the next three bytes as an unsigned value in little endian order.
`Int`	`readLittleEndianUnsignedIntToInt()` Reads the next four bytes as a little endian unsigned integer into an integer, if the top bit is a zero.
`Int`	`readLittleEndianUnsignedShort()` Reads the next two bytes as an unsigned value in little endian order.
`Long`	`readLong()` Reads the next eight bytes as a signed value.
`String?`	`readNullTerminatedString()` Reads up to the next NUL byte (or the limit) as UTF-8 characters.
`String!`	`readNullTerminatedString(length: Int)` Reads the next `length` bytes as UTF-8 characters.
`Short`	`readShort()` Reads the next two bytes as a signed value.
`String!`	`readString(length: Int)` Reads the next `length` bytes as UTF-8 characters.
`String!`	`readString(length: Int, charset: Charset!)` Reads the next `length` bytes as characters in the specified `Charset`.
`Int`	`readSynchSafeInt()` Reads a Synchsafe integer.
`Int`	`readUnsignedByte()` Reads the next byte as an unsigned value.
`Int`	`readUnsignedFixedPoint1616()` Reads the next four bytes, returning the integer portion of the fixed point 16.16 integer.
`Long`	`readUnsignedInt()` Reads the next four bytes as an unsigned value.
`Int`	`readUnsignedInt24()` Reads the next three bytes as an unsigned value.
`Int`	`readUnsignedIntToInt()` Reads the next four bytes as an unsigned integer into an integer, if the top bit is a zero.
`Int`	`readUnsignedLeb128ToInt()` Reads an unsigned variable-length LEB128 value into an int.
`Long`	`readUnsignedLeb128ToLong()` Reads an unsigned variable-length LEB128 value into a long.
`Long`	`readUnsignedLongToLong()` Reads the next eight bytes as an unsigned long into a long, if the top bit is a zero.
`Int`	`readUnsignedShort()` Reads the next two bytes as an unsigned value.
`Long`	`readUtf8EncodedLong()` Reads a long value encoded by UTF-8 encoding.
`Charset?`	`readUtfCharsetFromBom()` Reads a UTF byte order mark (BOM) and returns the UTF `Charset` it represents.
`Unit`	`reset(data: ByteArray!)` Updates the instance to wrap `data`, and resets the position to zero and the limit to `data.length`.
`Unit`	`reset(limit: Int)` Resets the position to zero and the limit to the specified value.
`Unit`	`reset(data: ByteArray!, limit: Int)` Updates the instance to wrap `data`, and resets the position to zero and the limit to `limit`.
`Unit`	`setLimit(limit: Int)` Sets the limit.
`java-static Unit`	`@VisibleForTesting setShouldEnforceLimitOnLegacyMethods(enforceLimit: Boolean)` Sets whether all read/peek methods should enforce that `getPosition` never exceeds `limit`.
`Unit`	`skipBytes(bytes: Int)` Moves the reading offset by `bytes`.
`Unit`	`skipLeb128()` Skips a variable-length LEB128 value.

Public properties
`ByteArray!`	`data`
`Int`	`position`

Constants

INVALID_CODE_POINT

const val INVALID_CODE_POINT: Int = 1114112

A value that is outside the valid range of Unicode code points.

Public constructors

ParsableByteArray

ParsableByteArray()

Creates a new instance that initially has no backing data.

ParsableByteArray

ParsableByteArray(data: ByteArray!)

Creates a new instance wrapping data, and sets the limit to data.length.

Parameters
`data: ByteArray!`	The array to wrap.

ParsableByteArray

ParsableByteArray(limit: Int)

Creates a new instance backed by a newly allocated array of limit bytes, with the limit set to the same value.

Parameters
`limit: Int`	The limit to set.

ParsableByteArray

ParsableByteArray(data: ByteArray!, limit: Int)

Creates a new instance that wraps an existing array.

Parameters
`data: ByteArray!`	The data to wrap.
`limit: Int`	The limit to set.

Public functions

bytesLeft

fun bytesLeft(): Int

Returns the number of bytes yet to be read.

capacity

fun capacity(): Int

Returns the capacity of the array, which may be larger than the limit.

ensureCapacity

fun ensureCapacity(requiredCapacity: Int): Unit

Ensures the backing array is at least requiredCapacity long.

position, limit, and all data in the underlying array (including that beyond limit) are preserved.

This might replace the underlying array with a larger copy, potentially invalidating any local references to the previous array.

limit

fun limit(): Int

Returns the limit.

peekChar

fun peekChar(): Char

Peeks at the next two bytes and interprets them as a big-endian char.

peekChar

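fun peekChar(charset: Charset!): Char

This function is deprecated. Either use peekChar to peek the next two bytes (big-endian) or peekCodePoint to peek in a Charset-aware way.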

peekCodePoint

fun peekCodePoint(charset: Charset!): Int

Peeks at the code point starting at getPosition as interpreted by charset.

The exact behaviour depends on charset:

- US_ASCII: Returns the byte at getPosition if it's valid ASCII (less than 0x80), otherwise returns INVALID_CODE_POINT.
- UTF-8: If getPosition is at the start of a UTF-8 encoded code point, the whole byte sequence is decoded and returned. Otherwise INVALID_CODE_POINT is returned.
- UTF-16 (all endiannesses):
  - If getPosition is at the start of a high surrogate code unit and the following two bytes are a low surrogate code unit, the combined code point is returned.
  - Otherwise the single code unit starting at getPosition is returned directly.
  - UTF-16 has no support for byte-level synchronization, so if getPosition is not aligned with the start of a UTF-16 code unit then the result is undefined.

Throws
`java.lang.IllegalArgumentException`	if charset is not supported. Only US_ASCII, UTF-8, UTF-16, UTF-16BE, and UTF-16LE are supported.
`java.lang.IndexOutOfBoundsException`	if `bytesLeft` doesn't allow reading the smallest code unit in `charset` (1 byte for ASCII and UTF-8, 2 bytes for UTF-16).
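
The UTF-16 rules above can be illustrated with a minimal, standalone sketch. It is not the library's implementation: `peekUtf16BeCodePoint` and `readUtf16BeUnit` are hypothetical helpers that work on a plain `ByteArray`, and only the big-endian case is covered.

```kotlin
// Hypothetical helper: reads one big-endian UTF-16 code unit at `offset`.
private fun readUtf16BeUnit(data: ByteArray, offset: Int): Char {
    val high = data[offset].toInt() and 0xFF
    val low = data[offset + 1].toInt() and 0xFF
    return ((high shl 8) or low).toChar()
}

// Hypothetical helper: returns the code point starting at `offset` in UTF-16BE data.
fun peekUtf16BeCodePoint(data: ByteArray, offset: Int): Int {
    require(data.size - offset >= 2) { "Need at least one 2-byte code unit" }
    val first = readUtf16BeUnit(data, offset)
    if (Character.isHighSurrogate(first) && data.size - offset >= 4) {
        val second = readUtf16BeUnit(data, offset + 2)
        if (Character.isLowSurrogate(second)) {
            // High + low surrogate: combine into one supplementary code point.
            return Character.toCodePoint(first, second)
        }
    }
    // Otherwise the single code unit is returned directly.
    return first.code
}
```

With the bytes D8 3D DE 00 this sketch returns 0x1F600, and with 00 41 it returns 0x41.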

peekInt

fun peekInt(): Int

Peeks the next four bytes as a signed value.

peekUnsignedByte

fun peekUnsignedByte(): Int

Peeks at the next byte as an unsigned value.

peekUnsignedInt24

fun peekUnsignedInt24(): Int

Peeks the next three bytes as an unsigned value.

readBytes

fun readBytes(bitArray: ParsableBitArray!, length: Int): Unit

Reads the next length bytes into bitArray, and resets the position of bitArray to zero.

Parameters
`bitArray: ParsableBitArray!`	The `ParsableBitArray` into which the bytes should be read.
`length: Int`	The number of bytes to write.

readBytes

fun readBytes(buffer: ByteBuffer!, length: Int): Unit

Reads the next length bytes into buffer.

Parameters
`buffer: ByteBuffer!`	The `ByteBuffer` into which the read data should be written.
`length: Int`	The number of bytes to read.

See also
`ByteBuffer.put`

readBytes

fun readBytes(buffer: ByteArray!, offset: Int, length: Int): Unit

Reads the next length bytes into buffer at offset.

Parameters
`buffer: ByteArray!`	The array into which the read data should be written.
`offset: Int`	The offset in `buffer` at which the read data should be written.
`length: Int`	The number of bytes to read.

See also
`System.arraycopy`

readDelimiterTerminatedString

fun readDelimiterTerminatedString(delimiter: Char): String?

Reads up to the next delimiter byte (or the limit) as UTF-8 characters.

Returns
`String?`	The string not including any terminating delimiter byte, or null if the end of the data has already been reached.

readDouble

fun readDouble(): Double

Reads the next eight bytes as a 64-bit floating point value.

readFloat

fun readFloat(): Float

Reads the next four bytes as a 32-bit floating point value.

readInt

fun readInt(): Int

Reads the next four bytes as a signed value.

readInt24

fun readInt24(): Int

Reads the next three bytes as a signed value.
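
A signed 24-bit read differs from an unsigned one only in sign extension. The following is a minimal sketch in plain Kotlin, assuming big endian byte order; `signedInt24BigEndian` is a hypothetical helper, not part of this class.

```kotlin
// Hypothetical helper: interprets three big-endian bytes at `offset` as a signed 24-bit value.
fun signedInt24BigEndian(data: ByteArray, offset: Int): Int {
    val b0 = data[offset].toInt() and 0xFF
    val b1 = data[offset + 1].toInt() and 0xFF
    val b2 = data[offset + 2].toInt() and 0xFF
    // Pack the bytes into the top 24 bits of the Int, then shift back
    // arithmetically so the 24-bit sign bit is extended to 32 bits.
    return ((b0 shl 24) or (b1 shl 16) or (b2 shl 8)) shr 8
}
```

For the bytes FF FF FE this yields -2, whereas an unsigned interpretation of the same bytes would be 16777214.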

readLine

fun readLine(): String?

Reads a line of text in UTF-8.

Equivalent to passing UTF_8 to readLine.

readLine

fun readLine(charset: Charset!): String?

Reads a line of text in charset.

A line is considered to be terminated by any one of a carriage return ('\r'), a line feed ('\n'), or a carriage return followed immediately by a line feed ('\r\n'). This method discards leading UTF byte order marks (BOM), if present.

The position is advanced to the start of the next line (i.e. any line terminators are skipped).

Parameters
`charset: Charset!`	The charset used to interpret the bytes as a `String`.

Returns
`String?`	The line not including any line-termination characters, or null if the end of the data has already been reached.

Throws
`java.lang.IllegalArgumentException`	if charset is not supported. Only US_ASCII, UTF-8, UTF-16, UTF-16BE, and UTF-16LE are supported.
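
The line-termination rule described above can be sketched on an already decoded string. This is only an illustration of the rule, under the assumption that an empty line yields an empty string; `splitLines` is a hypothetical helper and does not handle charsets or byte order marks.

```kotlin
// Hypothetical helper: splits text on '\r', '\n' or "\r\n", treating "\r\n" as one terminator.
fun splitLines(text: String): List<String> {
    val lines = mutableListOf<String>()
    var start = 0
    var i = 0
    while (i < text.length) {
        val c = text[i]
        if (c == '\r' || c == '\n') {
            lines += text.substring(start, i)
            // A carriage return followed immediately by a line feed is a single terminator.
            if (c == '\r' && i + 1 < text.length && text[i + 1] == '\n') i++
            start = i + 1
        }
        i++
    }
    // Text after the last terminator forms a final, unterminated line.
    if (start < text.length) lines += text.substring(start)
    return lines
}
```

For the input "a\r\nb\rc\n\nd" the sketch produces the lines "a", "b", "c", "" and "d".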

readLittleEndianInt

fun readLittleEndianInt(): Int

Reads the next four bytes as a signed value in little endian order.
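
To make the difference between the default big endian reads and the little endian variants concrete, here is a small sketch built on `java.nio.ByteBuffer` rather than on this class. `readSignedInt` is a hypothetical helper name.

```kotlin
import java.nio.ByteBuffer
import java.nio.ByteOrder

// Hypothetical helper: reads the first four bytes as a signed Int in the given byte order.
fun readSignedInt(bytes: ByteArray, order: ByteOrder): Int =
    ByteBuffer.wrap(bytes).order(order).getInt()
```

For the bytes 01 02 03 04, big endian order gives 0x01020304 and little endian order gives 0x04030201.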

readLittleEndianInt24

fun readLittleEndianInt24(): Int

Reads the next three bytes as a signed value in little endian order.

readLittleEndianLong

fun readLittleEndianLong(): Long

Reads the next eight bytes as a signed value in little endian order.

readLittleEndianShort

fun readLittleEndianShort(): Short

Reads the next two bytes as a signed value in little endian order.

readLittleEndianUnsignedInt

fun readLittleEndianUnsignedInt(): Long

Reads the next four bytes as an unsigned value in little endian order.

readLittleEndianUnsignedInt24

fun readLittleEndianUnsignedInt24(): Int

Reads the next three bytes as an unsigned value in little endian order.

readLittleEndianUnsignedIntToInt

fun readLittleEndianUnsignedIntToInt(): Int

Reads the next four bytes as a little endian unsigned integer into an integer, if the top bit is a zero.

Throws
`java.lang.IllegalStateException`	Thrown if the top bit of the input data is set.

readLittleEndianUnsignedShort

fun readLittleEndianUnsignedShort(): Int

Reads the next two bytes as an unsigned value in little endian order.

readLong

fun readLong(): Long

Reads the next eight bytes as a signed value.

readNullTerminatedString

fun readNullTerminatedString(): String?

Reads up to the next NUL byte (or the limit) as UTF-8 characters.

Returns
`String?`	The string not including any terminating NUL byte, or null if the end of the data has already been reached.

readNullTerminatedString

fun readNullTerminatedString(length: Int): String!

Reads the next length bytes as UTF-8 characters. A terminating NUL byte is discarded, if present.

Parameters
`length: Int`	The number of bytes to read.

Returns
`String!`	The string, not including any terminating NUL byte.
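
The fixed-length variant always consumes length bytes and only drops a single trailing NUL. A minimal sketch of that decoding step, assuming a plain `ByteArray` and a hypothetical helper named `decodeFixedLengthField`:

```kotlin
// Hypothetical helper: decodes `length` bytes at `offset` as UTF-8,
// discarding one terminating NUL byte if the field ends with it.
fun decodeFixedLengthField(data: ByteArray, offset: Int, length: Int): String {
    if (length == 0) return ""
    val hasTrailingNul = data[offset + length - 1] == 0.toByte()
    val stringLength = if (hasTrailingNul) length - 1 else length
    return String(data, offset, stringLength, Charsets.UTF_8)
}
```

A four-byte field holding 61 62 63 00 decodes to "abc", while 61 62 63 64 decodes to "abcd". In both cases a caller would advance by four bytes.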

readShort

fun readShort(): Short

Reads the next two bytes as a signed value.

readString

fun readString(length: Int): String!

Reads the next length bytes as UTF-8 characters.

Parameters
`length: Int`	The number of bytes to read.

Returns
`String!`	The string encoded by the bytes.

readString

fun readString(length: Int, charset: Charset!): String!

Reads the next length bytes as characters in the specified Charset.

Parameters
`length: Int`	The number of bytes to read.
`charset: Charset!`	The character set of the encoded characters.

Returns
`String!`	The string encoded by the bytes in the specified character set.

readSynchSafeInt

fun readSynchSafeInt(): Int

Reads a Synchsafe integer.

Synchsafe integers keep the highest bit of every byte zeroed. A 32 bit synchsafe integer can store 28 bits of information.

Returns
`Int`	The parsed value.
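
As an illustration of the 7-bits-per-byte layout, the following standalone sketch decodes four synchsafe bytes. `decodeSynchSafeInt` is a hypothetical helper, and masking each byte with 0x7F is an assumption of this sketch rather than documented behaviour of the method.

```kotlin
// Hypothetical helper: decodes a 32-bit synchsafe integer (28 payload bits) at `offset`.
fun decodeSynchSafeInt(data: ByteArray, offset: Int): Int {
    var value = 0
    for (i in 0 until 4) {
        // Only the low 7 bits of each byte carry information.
        val payload = data[offset + i].toInt() and 0x7F
        value = (value shl 7) or payload
    }
    return value
}
```

The bytes 00 00 02 01 decode to 257, and 7F 7F 7F 7F decodes to 268435455, the largest 28-bit value.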

readUnsignedByte

fun readUnsignedByte(): Int

Reads the next byte as an unsigned value.

readUnsignedFixedPoint1616

fun readUnsignedFixedPoint1616(): Int

Reads the next four bytes, returning the integer portion of the fixed point 16.16 integer.
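
In a 16.16 fixed point value the first two bytes hold the integer part and the last two hold the fraction. A minimal sketch, assuming big endian order and a hypothetical helper named `fixedPoint1616IntegerPart`:

```kotlin
// Hypothetical helper: returns the integer portion of a big-endian unsigned 16.16 value at `offset`.
fun fixedPoint1616IntegerPart(data: ByteArray, offset: Int): Int {
    val high = data[offset].toInt() and 0xFF
    val low = data[offset + 1].toInt() and 0xFF
    // Bytes at offset + 2 and offset + 3 hold the fractional part and are ignored here.
    return (high shl 8) or low
}
```

The bytes 00 01 80 00 encode 1.5, so the sketch returns 1.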

readUnsignedInt

fun readUnsignedInt(): Long

Reads the next four bytes as an unsigned value.

readUnsignedInt24

fun readUnsignedInt24(): Int

Reads the next three bytes as an unsigned value.

readUnsignedIntToInt

fun readUnsignedIntToInt(): Int

Reads the next four bytes as an unsigned integer into an integer, if the top bit is a zero.

Throws
`java.lang.IllegalStateException`	Thrown if the top bit of the input data is set.
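
The top-bit condition exists because an unsigned 32-bit value with its top bit set does not fit in a signed Int. A standalone sketch of the check, with `unsignedIntToInt` as a hypothetical helper operating on a plain `ByteArray`:

```kotlin
// Hypothetical helper: reads four big-endian bytes as an unsigned value that must fit in an Int.
fun unsignedIntToInt(data: ByteArray, offset: Int): Int {
    val value = ((data[offset].toInt() and 0xFF) shl 24) or
        ((data[offset + 1].toInt() and 0xFF) shl 16) or
        ((data[offset + 2].toInt() and 0xFF) shl 8) or
        (data[offset + 3].toInt() and 0xFF)
    // A negative result means the top bit was set; check() throws IllegalStateException.
    check(value >= 0) { "Top bit not zero: $value" }
    return value
}
```

The bytes 00 00 01 00 yield 256, while 80 00 00 00 causes an IllegalStateException in this sketch.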

readUnsignedLeb128ToInt

fun readUnsignedLeb128ToInt(): Int

Reads an unsigned variable-length LEB128 value into an int.

Returns
`Int`	integer value

Throws
`java.lang.IllegalArgumentException`	if the read value is greater than `Int.MAX_VALUE` or less than `Int.MIN_VALUE`
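
Unsigned LEB128 stores 7 payload bits per byte, least significant group first, with the top bit of each byte marking that more bytes follow. The sketch below shows that decoding and the range check in plain Kotlin. Both function names are hypothetical, and the sketch does no bounds checking against a limit.

```kotlin
// Hypothetical helper: decodes an unsigned LEB128 value starting at `offset`.
fun decodeUnsignedLeb128(data: ByteArray, offset: Int): Long {
    var value = 0L
    var index = offset
    var shift = 0
    do {
        val b = data[index++].toInt() and 0xFF
        // Low 7 bits are payload, least significant group first.
        value = value or ((b and 0x7F).toLong() shl shift)
        shift += 7
        // The top bit set means another byte follows.
    } while ((b and 0x80) != 0 && shift < 64)
    return value
}

// Hypothetical helper: same decoding, but rejects values that do not fit in an Int.
fun decodeUnsignedLeb128ToInt(data: ByteArray, offset: Int): Int {
    val value = decodeUnsignedLeb128(data, offset)
    require(value >= Int.MIN_VALUE && value <= Int.MAX_VALUE) { "Value out of Int range: $value" }
    return value.toInt()
}
```

The bytes E5 8E 26 decode to 624485. The five bytes FF FF FF FF 0F decode to 4294967295, which the Int variant of this sketch rejects with an IllegalArgumentException.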

readUnsignedLeb128ToLong

fun readUnsignedLeb128ToLong(): Long

Reads an unsigned variable-length LEB128 value into a long.

Returns
`Long`	long value

Throws
`java.lang.IllegalStateException`	if the byte to be read is over the limit of the parsable byte array

readUnsignedLongToLong

fun readUnsignedLongToLong(): Long

Reads the next eight bytes as an unsigned long into a long, if the top bit is a zero.

Throws
`java.lang.IllegalStateException`	Thrown if the top bit of the input data is set.

readUnsignedShort

fun readUnsignedShort(): Int

Reads the next two bytes as an unsigned value.

readUtf8EncodedLong

fun readUtf8EncodedLong(): Long

Reads a long value encoded by UTF-8 encoding.

Returns
`Long`	Decoded long value

Throws
`java.lang.NumberFormatException`	if there is a problem with decoding

readUtfCharsetFromBom

fun readUtfCharsetFromBom(): Charset?

Reads a UTF byte order mark (BOM) and returns the UTF Charset it represents. Returns null without advancing position if no BOM is found.
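
For reference, BOM detection can be sketched as a comparison against the well-known UTF-8 and UTF-16 marker bytes. `detectUtfBom` is a hypothetical helper. It returns the charset together with the BOM length, whereas the method above returns only the charset and advances the position itself.

```kotlin
import java.nio.charset.Charset

// Hypothetical helper: returns the charset indicated by a leading BOM and the BOM's length in bytes,
// or null if the data does not start with a UTF-8 or UTF-16 BOM.
fun detectUtfBom(data: ByteArray): Pair<Charset, Int>? = when {
    data.size >= 3 && data[0] == 0xEF.toByte() && data[1] == 0xBB.toByte() && data[2] == 0xBF.toByte() ->
        Pair(Charsets.UTF_8, 3)
    data.size >= 2 && data[0] == 0xFE.toByte() && data[1] == 0xFF.toByte() ->
        Pair(Charsets.UTF_16BE, 2)
    data.size >= 2 && data[0] == 0xFF.toByte() && data[1] == 0xFE.toByte() ->
        Pair(Charsets.UTF_16LE, 2)
    else -> null
}
```

Data beginning with EF BB BF is reported as UTF-8 with a three-byte BOM, and data beginning with ordinary text such as 61 62 yields null.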

reset

fun reset(data: ByteArray!): Unit

Updates the instance to wrap data, and resets the position to zero and the limit to data.length.

Parameters
`data: ByteArray!`	The array to wrap.

reset

fun reset(limit: Int): Unit

Resets the position to zero and the limit to the specified value. This might replace or wipe the underlying array, potentially invalidating any local references.

Parameters
`limit: Int`	The limit to set.

reset

fun reset(data: ByteArray!, limit: Int): Unit

Updates the instance to wrap data, and resets the position to zero and the limit to limit.

Parameters
`data: ByteArray!`	The array to wrap.
`limit: Int`	The limit to set.

setLimit

fun setLimit(limit: Int): Unit

Sets the limit.

Parameters
`limit: Int`	The limit to set.

setShouldEnforceLimitOnLegacyMethods

@VisibleForTesting
java-static fun setShouldEnforceLimitOnLegacyMethods(enforceLimit: Boolean): Unit

Sets whether all read/peek methods should enforce that getPosition never exceeds limit.

Setting this to true in tests can help catch cases of accidentally reading beyond limit but still within the bounds of the underlying data array.

Some (newer) methods will always enforce the invariant, even when this is set to false.

Defaults to false (this may change in a later release).

skipBytes

fun skipBytes(bytes: Int): Unit

Moves the reading offset by bytes.

Parameters
`bytes: Int`	The number of bytes to skip.

Throws
`java.lang.IllegalArgumentException`	Thrown if the new position is neither in nor at the end of the array.

skipLeb128

fun skipLeb128(): Unit

Skips a variable-length LEB128 value.

Public properties

data

val data: ByteArray!

position

var position: Int