Low-level bindings to the zstd library.
Structs
- ZDICT_params_t
- ZSTD_CCtx_s
- ZSTD_CDict_s
- ZSTD_DCtx_s
- ZSTD_DDict_s
- ZSTD_bounds
- ZSTD_inBuffer_s - Streaming
- ZSTD_outBuffer_s
Enums
- ZSTD_EndDirective
- ZSTD_ErrorCode
- ZSTD_ResetDirective
- ZSTD_cParameter
- ZSTD_dParameter - Advanced decompression API (Requires v1.4.0+)
- ZSTD_strategy - Advanced compression API (Requires v1.4.0+)
Constants
- ZSTD_BLOCKSIZELOG_MAX
- ZSTD_BLOCKSIZE_MAX
- ZSTD_CLEVEL_DEFAULT
- ZSTD_CONTENTSIZE_ERROR
- ZSTD_CONTENTSIZE_UNKNOWN
- ZSTD_MAGICNUMBER
- ZSTD_MAGIC_DICTIONARY
- ZSTD_MAGIC_SKIPPABLE_MASK
- ZSTD_MAGIC_SKIPPABLE_START
- ZSTD_VERSION_MAJOR
- ZSTD_VERSION_MINOR
- ZSTD_VERSION_NUMBER
- ZSTD_VERSION_RELEASE
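The magic-number constants above are typically used to tell apart a regular zstd frame, a skippable frame, and a dictionary from the first four bytes of a buffer. The following is a minimal sketch of that check, not part of this crate. It assumes the constant values from upstream zstd.h, copied locally so the example compiles without the bindings, and the names `MagicKind` and `classify_magic` are hypothetical helpers introduced only for illustration.

```rust
// Values copied from upstream zstd.h so the sketch is self-contained;
// real code would use the constants exported by the bindings instead.
const ZSTD_MAGICNUMBER: u32 = 0xFD2F_B528;
const ZSTD_MAGIC_DICTIONARY: u32 = 0xEC30_A437;
const ZSTD_MAGIC_SKIPPABLE_START: u32 = 0x184D_2A50;
const ZSTD_MAGIC_SKIPPABLE_MASK: u32 = 0xFFFF_FFF0;

/// Hypothetical classification of a buffer by its leading magic number.
#[derive(Debug, PartialEq)]
enum MagicKind {
    Frame,
    /// Skippable frame; the payload is the low nibble of the magic (0..=15).
    Skippable(u8),
    Dictionary,
    Unknown,
    TooShort,
}

/// Reads the first four bytes as a little-endian u32 and classifies them.
fn classify_magic(bytes: &[u8]) -> MagicKind {
    let magic = match bytes.get(..4) {
        Some(h) => u32::from_le_bytes([h[0], h[1], h[2], h[3]]),
        None => return MagicKind::TooShort,
    };
    if magic == ZSTD_MAGICNUMBER {
        MagicKind::Frame
    } else if magic & ZSTD_MAGIC_SKIPPABLE_MASK == ZSTD_MAGIC_SKIPPABLE_START {
        // The mask clears the low 4 bits, so 16 skippable variants match.
        MagicKind::Skippable((magic & 0xF) as u8)
    } else if magic == ZSTD_MAGIC_DICTIONARY {
        MagicKind::Dictionary
    } else {
        MagicKind::Unknown
    }
}
```

For instance, `classify_magic(&[0x28, 0xB5, 0x2F, 0xFD])` reports a regular frame, because the magic number is stored little-endian at the start of the frame.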
Functions
- ZDICT_finalizeDictionary ⚠ - ZDICT_finalizeDictionary(): Given a custom content as a basis for dictionary, and a set of samples, finalize dictionary by adding headers and statistics according to the zstd dictionary format.
- ZDICT_getDictHeaderSize ⚠
- ZDICT_getDictID ⚠
- ZDICT_getErrorName ⚠
- ZDICT_isError ⚠
- ZDICT_trainFromBuffer ⚠ - ZDICT_trainFromBuffer(): Train a dictionary from an array of samples. Redirects to ZDICT_optimizeTrainFromBuffer_fastCover(), single-threaded, with d=8, steps=4, f=20, and accel=1. Samples must be stored concatenated in a single flat buffer samplesBuffer, supplied with an array of sizes samplesSizes, providing the size of each sample, in order. The resulting dictionary will be saved into dictBuffer. @return: size of dictionary stored into dictBuffer (<= dictBufferCapacity) or an error code, which can be tested with ZDICT_isError(). Note: Dictionary training will fail if there are not enough samples to construct a dictionary, or if most of the samples are too small (< 8 bytes being the lower limit). If dictionary training fails, you should use zstd without a dictionary, as the dictionary would have been ineffective anyway. If you believe your samples would benefit from a dictionary, please open an issue with details, and we can look into it. Note: ZDICT_trainFromBuffer()’s memory usage is about 6 MB. Tips: In general, a reasonable dictionary has a size of ~100 KB. It’s possible to select a smaller or larger size just by specifying dictBufferCapacity. In general, it’s recommended to provide a few thousand samples, though this can vary a lot. It’s recommended that the total size of all samples be about 100 times the target size of the dictionary.
- ZSTD_CCtx_loadDictionary ⚠ - ZSTD_CCtx_loadDictionary() : Requires v1.4.0+ Create an internal CDict from dict buffer. Decompression will have to use the same dictionary. @result : 0, or an error code (which can be tested with ZSTD_isError()). Special: Loading a NULL (or 0-size) dictionary invalidates previous dictionary, meaning “return to no-dictionary mode”. Note 1 : Dictionary is sticky, it will be used for all future compressed frames, until parameters are reset, a new dictionary is loaded, or the dictionary is explicitly invalidated by loading a NULL dictionary. Note 2 : Loading a dictionary involves building tables. It’s also a CPU consuming operation, with non-negligible impact on latency. Tables are dependent on compression parameters, and for this reason, compression parameters can no longer be changed after loading a dictionary. Note 3 : dict content will be copied internally. Use experimental ZSTD_CCtx_loadDictionary_byReference() to reference content instead. In such a case, dictionary buffer must outlive its users. Note 4 : Use ZSTD_CCtx_loadDictionary_advanced() to precisely select how dictionary content must be interpreted. Note 5 : This method does not benefit from LDM (long distance mode). If you want to employ LDM on some large dictionary content, prefer employing ZSTD_CCtx_refPrefix() described below.
- ZSTD_CCtx_refCDict ⚠ - ZSTD_CCtx_refCDict() : Requires v1.4.0+ Reference a prepared dictionary, to be used for all future compressed frames. Note that compression parameters are enforced from within CDict, and supersede any compression parameter previously set within CCtx. The parameters ignored are labelled as “superseded-by-cdict” in the ZSTD_cParameter enum docs. The ignored parameters will be used again if the CCtx is returned to no-dictionary mode. The dictionary will remain valid for future compressed frames using the same CCtx. @result : 0, or an error code (which can be tested with ZSTD_isError()). Special : Referencing a NULL CDict means “return to no-dictionary mode”. Note 1 : Currently, only one dictionary can be managed. Referencing a new dictionary effectively “discards” any previous one. Note 2 : CDict is just referenced, its lifetime must outlive its usage within CCtx.
- ZSTD_CCtx_refPrefix ⚠ - ZSTD_CCtx_refPrefix() : Requires v1.4.0+ Reference a prefix (single-usage dictionary) for next compressed frame. A prefix is only used once. Tables are discarded at end of frame (ZSTD_e_end). Decompression will need the same prefix to properly regenerate data. Compressing with a prefix is similar in outcome to performing a diff and compressing it, but performs much faster, especially during decompression (compression speed is tunable with compression level). This method is compatible with LDM (long distance mode). @result : 0, or an error code (which can be tested with ZSTD_isError()). Special: Adding any prefix (including NULL) invalidates any previous prefix or dictionary. Note 1 : Prefix buffer is referenced. It must outlive compression. Its content must remain unmodified during compression. Note 2 : If the intention is to diff some large src data blob with some prior version of itself, ensure that the window size is large enough to contain the entire source. See ZSTD_c_windowLog. Note 3 : Referencing a prefix involves building tables, which are dependent on compression parameters. It’s a CPU consuming operation, with non-negligible impact on latency. If there is a need to use the same prefix multiple times, consider loadDictionary instead. Note 4 : By default, the prefix is interpreted as raw content (ZSTD_dct_rawContent). Use experimental ZSTD_CCtx_refPrefix_advanced() to alter dictionary interpretation.
- ZSTD_CCtx_reset ⚠ - ZSTD_CCtx_reset() : There are 2 different things that can be reset, independently or jointly :
- ZSTD_CCtx_setParameter ⚠ - ZSTD_CCtx_setParameter() : Set one compression parameter, selected by enum ZSTD_cParameter. All parameters have valid bounds. Bounds can be queried using ZSTD_cParam_getBounds(). Providing a value beyond bound will either clamp it, or trigger an error (depending on parameter). Setting a parameter is generally only possible during frame initialization (before starting compression). Exception : when using multi-threading mode (nbWorkers >= 1), the following parameters can be updated during compression (within the same frame): compressionLevel, hashLog, chainLog, searchLog, minMatch, targetLength and strategy. New parameters will be active for the next job only (after a flush()). @return : an error code (which can be tested using ZSTD_isError()).
- ZSTD_CCtx_setPledgedSrcSize ⚠ - ZSTD_CCtx_setPledgedSrcSize() : Total input data size to be compressed as a single frame. Value will be written in frame header, unless explicitly forbidden using ZSTD_c_contentSizeFlag. This value will also be checked at end of frame, and trigger an error if not respected. @result : 0, or an error code (which can be tested with ZSTD_isError()). Note 1 : pledgedSrcSize==0 actually means zero, aka an empty frame. In order to mean “unknown content size”, pass constant ZSTD_CONTENTSIZE_UNKNOWN. ZSTD_CONTENTSIZE_UNKNOWN is the default value for any new frame. Note 2 : pledgedSrcSize is only valid once, for the next frame. It’s discarded at the end of the frame, and replaced by ZSTD_CONTENTSIZE_UNKNOWN. Note 3 : Whenever all input data is provided and consumed in a single round, for example with ZSTD_compress2(), or by immediately invoking ZSTD_compressStream2(,,,ZSTD_e_end), this value is automatically overridden by srcSize instead.
- ZSTD_CStreamInSize ⚠
- ZSTD_CStreamOutSize ⚠
- ZSTD_DCtx_loadDictionary ⚠ - ZSTD_DCtx_loadDictionary() : Requires v1.4.0+ Create an internal DDict from dict buffer, to be used to decompress all future frames. The dictionary remains valid for all future frames, until explicitly invalidated, or a new dictionary is loaded. @result : 0, or an error code (which can be tested with ZSTD_isError()). Special : Adding a NULL (or 0-size) dictionary invalidates any previous dictionary, meaning “return to no-dictionary mode”. Note 1 : Loading a dictionary involves building tables, which has a non-negligible impact on CPU usage and latency. It’s recommended to “load once, use many times”, to amortize the cost. Note 2 : dict content will be copied internally, so dict can be released after loading. Use ZSTD_DCtx_loadDictionary_byReference() to reference dictionary content instead. Note 3 : Use ZSTD_DCtx_loadDictionary_advanced() to take control of how dictionary content is loaded and interpreted.
- ZSTD_DCtx_refDDict ⚠ - ZSTD_DCtx_refDDict() : Requires v1.4.0+ Reference a prepared dictionary, to be used to decompress next frames. The dictionary remains active for decompression of future frames using same DCtx.
- ZSTD_DCtx_refPrefix ⚠ - ZSTD_DCtx_refPrefix() : Requires v1.4.0+ Reference a prefix (single-usage dictionary) to decompress next frame. This is the reverse operation of ZSTD_CCtx_refPrefix(), and must use the same prefix as the one used during compression. Prefix is only used once. Reference is discarded at end of frame. End of frame is reached when ZSTD_decompressStream() returns 0. @result : 0, or an error code (which can be tested with ZSTD_isError()). Note 1 : Adding any prefix (including NULL) invalidates any previously set prefix or dictionary. Note 2 : Prefix buffer is referenced. It must outlive decompression. Prefix buffer must remain unmodified up to the end of frame, reached when ZSTD_decompressStream() returns 0. Note 3 : By default, the prefix is treated as raw content (ZSTD_dct_rawContent). Use ZSTD_DCtx_refPrefix_advanced() to alter dictMode (Experimental section). Note 4 : Referencing a raw content prefix has almost no CPU or memory cost. A full dictionary is more costly, as it requires building tables.
- ZSTD_DCtx_reset ⚠ - ZSTD_DCtx_reset() : Return a DCtx to clean state. Session and parameters can be reset jointly or separately. Parameters can only be reset when no active frame is being decompressed. @return : 0, or an error code, which can be tested with ZSTD_isError()
- ZSTD_DCtx_setParameter ⚠ - ZSTD_DCtx_setParameter() : Set one decompression parameter, selected by enum ZSTD_dParameter. All parameters have valid bounds. Bounds can be queried using ZSTD_dParam_getBounds(). Providing a value beyond bound will either clamp it, or trigger an error (depending on parameter). Setting a parameter is only possible during frame initialization (before starting decompression). @return : 0, or an error code (which can be tested using ZSTD_isError()).
- ZSTD_DStreamInSize ⚠
- ZSTD_DStreamOutSize ⚠
- ZSTD_cParam_getBounds ⚠ - ZSTD_cParam_getBounds() : All parameters must belong to an interval with lower and upper bounds, otherwise they will either trigger an error or be automatically clamped. @return : a structure, ZSTD_bounds, which contains - an error status field, which must be tested using ZSTD_isError() - lower and upper bounds, both inclusive
- ZSTD_compress ⚠ - Simple API. Compresses src content as a single zstd compressed frame into already allocated dst. NOTE: Providing dstCapacity >= ZSTD_compressBound(srcSize) guarantees that zstd will have enough space to successfully compress the data. @return : compressed size written into dst (<= dstCapacity), or an error code if it fails (which can be tested using ZSTD_isError()).
- ZSTD_compress2 ⚠ - ZSTD_compress2() : Behaves the same as ZSTD_compressCCtx(), but compression parameters are set using the advanced API. (Note that this entry point doesn’t even expose a compression level parameter.) ZSTD_compress2() always starts a new frame. Should cctx hold data from a previously unfinished frame, everything about it is forgotten.
- ZSTD_compressBound ⚠
- ZSTD_compressCCtx ⚠ - ZSTD_compressCCtx() : Same as ZSTD_compress(), using an explicit ZSTD_CCtx. Important : in order to mirror ZSTD_compress() behavior, this function compresses at the requested compression level, ignoring any other advanced parameter. If any advanced parameter was set using the advanced API, they will all be reset. Only compressionLevel remains.
- ZSTD_compressStream ⚠ - Alternative for ZSTD_compressStream2(zcs, output, input, ZSTD_e_continue). NOTE: The return value is different. ZSTD_compressStream() returns a hint for the next read size (if non-zero and not an error). ZSTD_compressStream2() returns the minimum nb of bytes left to flush (if non-zero and not an error).
- ZSTD_compressStream2 ⚠ - ZSTD_compressStream2() : Requires v1.4.0+ Behaves about the same as ZSTD_compressStream, with additional control on end directive.
- ZSTD_compress_usingCDict ⚠ - ZSTD_compress_usingCDict() : Compression using a digested Dictionary. Recommended when same dictionary is used multiple times. Note : compression level is decided at dictionary creation time, and frame parameters are hardcoded (dictID=yes, contentSize=yes, checksum=no)
- ZSTD_compress_usingDict ⚠ - Simple dictionary API. Compression at an explicit compression level using a Dictionary. A dictionary can be any arbitrary data segment (also called a prefix), or a buffer with specified information (see zdict.h). Note : This function loads the dictionary, resulting in significant startup delay. It’s intended for a dictionary used only once. Note 2 : When dict == NULL || dictSize < 8 no dictionary is used.
- ZSTD_createCCtx ⚠
- ZSTD_createCDict ⚠ - ZSTD_createCDict() : When compressing multiple messages or blocks using the same dictionary, it’s recommended to digest the dictionary only once, since it’s a costly operation. ZSTD_createCDict() will create a state from digesting a dictionary. The resulting state can be used for future compression operations with very limited startup cost. ZSTD_CDict can be created once and shared by multiple threads concurrently, since its usage is read-only. @dictBuffer can be released after ZSTD_CDict creation, because its content is copied within CDict. Note 1 : Consider experimental function ZSTD_createCDict_byReference() if you prefer to not duplicate @dictBuffer content. Note 2 : A ZSTD_CDict can be created from an empty @dictBuffer, in which case the only thing that it transports is the @compressionLevel. This can be useful in a pipeline featuring ZSTD_compress_usingCDict() exclusively, expecting a ZSTD_CDict parameter with any data, including those without a known dictionary.
- ZSTD_createCStream ⚠
- ZSTD_createDCtx ⚠
- ZSTD_createDDict ⚠ - ZSTD_createDDict() : Create a digested dictionary, ready to start decompression operation without startup delay. dictBuffer can be released after DDict creation, as its content is copied inside DDict.
- ZSTD_createDStream ⚠
- ZSTD_dParam_getBounds ⚠ - ZSTD_dParam_getBounds() : All parameters must belong to an interval with lower and upper bounds, otherwise they will either trigger an error or be automatically clamped. @return : a structure, ZSTD_bounds, which contains - an error status field, which must be tested using ZSTD_isError() - both lower and upper bounds, inclusive
- ZSTD_decompress ⚠ - ZSTD_decompress() : compressedSize : must be the exact size of some number of compressed and/or skippable frames. dstCapacity is an upper bound of originalSize to regenerate. If user cannot imply a maximum upper bound, it’s better to use streaming mode to decompress data. @return : the number of bytes decompressed into dst (<= dstCapacity), or an errorCode if it fails (which can be tested using ZSTD_isError()).
- ZSTD_decompressDCtx ⚠ - ZSTD_decompressDCtx() : Same as ZSTD_decompress(), requires an allocated ZSTD_DCtx. Compatible with sticky parameters (see below).
- ZSTD_decompressStream ⚠ - ZSTD_decompressStream() : Streaming decompression function. Call repeatedly to consume the full input, updating it as necessary. The function will update both input and output pos fields, exposing current state via these fields:
- ZSTD_decompress_usingDDict ⚠ - ZSTD_decompress_usingDDict() : Decompression using a digested Dictionary. Recommended when same dictionary is used multiple times.
- ZSTD_decompress_usingDict ⚠ - ZSTD_decompress_usingDict() : Decompression using a known Dictionary. Dictionary must be identical to the one used during compression. Note : This function loads the dictionary, resulting in significant startup delay. It’s intended for a dictionary used only once. Note : When dict == NULL || dictSize < 8 no dictionary is used.
- ZSTD_defaultCLevel ⚠
- ZSTD_endStream ⚠ - Equivalent to ZSTD_compressStream2(zcs, output, &emptyInput, ZSTD_e_end).
- ZSTD_findFrameCompressedSize ⚠ - ZSTD_findFrameCompressedSize() : Requires v1.4.0+ src should point to the start of a ZSTD frame or skippable frame. srcSize must be >= first frame size. @return : the compressed size of the first frame starting at src, suitable to pass as srcSize to ZSTD_decompress or similar, or an error code if input is invalid
- ZSTD_flushStream ⚠ - Equivalent to ZSTD_compressStream2(zcs, output, &emptyInput, ZSTD_e_flush).
- ZSTD_freeCCtx ⚠
- ZSTD_freeCDict ⚠ - ZSTD_freeCDict() : Function frees memory allocated by ZSTD_createCDict(). If a NULL pointer is passed, no operation is performed.
- ZSTD_freeCStream ⚠
- ZSTD_freeDCtx ⚠
- ZSTD_freeDDict ⚠ - ZSTD_freeDDict() : Function frees memory allocated with ZSTD_createDDict(). If a NULL pointer is passed, no operation is performed.
- ZSTD_freeDStream ⚠
- ZSTD_getDecompressedSize ⚠ - ZSTD_getDecompressedSize() : NOTE: This function is now obsolete, in favor of ZSTD_getFrameContentSize(). Both functions work the same way, but ZSTD_getDecompressedSize() blends “empty”, “unknown” and “error” results to the same return value (0), while ZSTD_getFrameContentSize() gives them separate return values. @return : decompressed size of src frame content if known and not empty, 0 otherwise.
- ZSTD_getDictID_fromCDict ⚠ - ZSTD_getDictID_fromCDict() : Requires v1.5.0+ Provides the dictID of the dictionary loaded into cdict. If @return == 0, the dictionary is not conformant to Zstandard specification, or empty. Non-conformant dictionaries can still be loaded, but as content-only dictionaries.
- ZSTD_getDictID_fromDDict ⚠ - ZSTD_getDictID_fromDDict() : Requires v1.4.0+ Provides the dictID of the dictionary loaded into ddict. If @return == 0, the dictionary is not conformant to Zstandard specification, or empty. Non-conformant dictionaries can still be loaded, but as content-only dictionaries.
- ZSTD_getDictID_fromDict ⚠ - ZSTD_getDictID_fromDict() : Requires v1.4.0+ Provides the dictID stored within dictionary. If @return == 0, the dictionary is not conformant with Zstandard specification. It can still be loaded, but as a content-only dictionary.
- ZSTD_getDictID_fromFrame ⚠ - ZSTD_getDictID_fromFrame() : Requires v1.4.0+ Provides the dictID required to decompress the frame stored within src. If @return == 0, the dictID could not be decoded. This could be for one of the following reasons :
- ZSTD_getErrorCode ⚠ - ZSTD_getErrorCode() : convert a size_t function result into a ZSTD_ErrorCode enum type, which can be used to compare with enum list published above
- ZSTD_getErrorName ⚠
- ZSTD_getErrorString ⚠
- ZSTD_getFrameContentSize ⚠
- ZSTD_initCStream ⚠ - Equivalent to:
- ZSTD_initDStream ⚠ - ZSTD_initDStream() : Initialize/reset DStream state for new decompression operation. Call before new decompression operation using same DStream.
- ZSTD_isError ⚠
- ZSTD_maxCLevel ⚠
- ZSTD_minCLevel ⚠
- ZSTD_sizeof_CCtx ⚠ - ZSTD_sizeof_*() : Requires v1.4.0+ These functions give the current memory usage of selected object. Note that object memory usage can evolve (increase or decrease) over time.
- ZSTD_sizeof_CDict ⚠
- ZSTD_sizeof_CStream ⚠
- ZSTD_sizeof_DCtx ⚠
- ZSTD_sizeof_DDict ⚠
- ZSTD_sizeof_DStream ⚠
- ZSTD_versionNumber ⚠ - ZSTD_versionNumber() : Return runtime library version, the value is (MAJOR*100*100 + MINOR*100 + RELEASE).
- ZSTD_versionString ⚠ - ZSTD_versionString() : Return runtime library version, like “1.4.5”. Requires v1.3.0+.
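The packed value returned by ZSTD_versionNumber() (and stored in ZSTD_VERSION_NUMBER) follows the formula MAJOR*100*100 + MINOR*100 + RELEASE, so it can be split back into its components with integer division and remainder. The sketch below shows that arithmetic only; `decode_version` is a hypothetical helper name, and the input is passed as a plain `u32` rather than obtained from the unsafe binding.

```rust
/// Splits a packed zstd version number into (major, minor, release),
/// inverting MAJOR*100*100 + MINOR*100 + RELEASE.
/// Hypothetical helper, not part of the bindings.
fn decode_version(packed: u32) -> (u32, u32, u32) {
    let major = packed / 10_000;
    let minor = (packed / 100) % 100;
    let release = packed % 100;
    (major, minor, release)
}

/// Formats the decoded version in the same dotted style that
/// ZSTD_versionString() is documented to return.
fn format_version(packed: u32) -> String {
    let (major, minor, release) = decode_version(packed);
    format!("{major}.{minor}.{release}")
}
```

With the documented example version “1.4.5”, the packed value is 10405, and `format_version(10405)` yields the same dotted string.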
Type Aliases
- ZSTD_CCtx - Explicit context
- ZSTD_CDict - Bulk processing dictionary API
- ZSTD_CStream
- ZSTD_DCtx
- ZSTD_DDict
- ZSTD_DStream
- ZSTD_inBuffer - Streaming
- ZSTD_outBuffer
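The streaming buffer types carry a position field that the streaming calls advance, and the caller loops until the input is fully consumed. The sketch below models only that control flow and calls no zstd function. `InBuf`, `OutBuf`, `step`, and `drive` are hypothetical stand-ins: the real ZSTD_inBuffer and ZSTD_outBuffer hold a raw pointer plus size and pos fields, while slices are used here to keep the example safe and self-contained, and `step` simply copies bytes where a real streaming call would compress or decompress.

```rust
/// Hypothetical safe mirror of an input buffer: data plus a read position.
struct InBuf<'a> {
    src: &'a [u8],
    pos: usize,
}

/// Hypothetical safe mirror of an output buffer: space plus a write position.
struct OutBuf<'a> {
    dst: &'a mut [u8],
    pos: usize,
}

/// Stand-in for one streaming call: moves as many bytes as fit and advances
/// both `pos` fields. Returns the number of input bytes still unconsumed.
fn step(input: &mut InBuf<'_>, output: &mut OutBuf<'_>) -> usize {
    let n = (input.src.len() - input.pos).min(output.dst.len() - output.pos);
    output.dst[output.pos..output.pos + n]
        .copy_from_slice(&input.src[input.pos..input.pos + n]);
    input.pos += n;
    output.pos += n;
    input.src.len() - input.pos
}

/// Drives `step` with a fixed-size scratch buffer until all input is consumed,
/// collecting whatever each call wrote (the first `pos` bytes of the scratch).
fn drive(data: &[u8], chunk: usize) -> Vec<u8> {
    assert!(chunk > 0, "scratch buffer must not be empty");
    let mut result = Vec::new();
    let mut input = InBuf { src: data, pos: 0 };
    let mut scratch = vec![0u8; chunk];
    loop {
        // A fresh output view per call: pos restarts at 0 for the scratch space.
        let mut output = OutBuf { dst: &mut scratch[..], pos: 0 };
        let remaining = step(&mut input, &mut output);
        let written = output.pos;
        result.extend_from_slice(&scratch[..written]);
        if remaining == 0 {
            break;
        }
    }
    result
}
```

The design point is that the caller never guesses how much was consumed or produced; it reads both positions back after every call and reuses the same scratch space.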