* Compressor saves the most recently used Huffman table and reuses it
if it produces better results; a sketch of this selection logic is
given after the benchmark table below.
* I attempted to preserve the CPU usage profile and intentionally left
all of the existing heuristics in place, so there is only a speed
difference from the second block onward.
When compressing large enough blocks (say >= 4 KiB) there is
no significant difference in compression speed.
Dictionary compression of a single block is the same speed for blocks
with <= 1 KiB of literals, and beyond that the difference is not
very significant.
* On the synthetic data, with blocks 10 KB or smaller, most blocks
can't reuse the previous table because the previous block is missing
at least one symbol that the current block contains.
Once blocks reach about 12 KB, most previous blocks have Huffman
tables that are valid for the current block, and the compression
ratio and decompression speed jump.
* On silesia, blocks as small as 4 KB can frequently reuse the
previous Huffman table (it is valid about 85% of the time), but reuse
isn't as profitable there, so the previous table is only actually
chosen about 3% of the time.
* Microbenchmarks show that `HUF_validateCTable()` takes ~55 ns
and `HUF_estimateCompressedSize()` takes ~35 ns.
They are decently well optimized; the first versions took 90 ns
and 120 ns respectively. `HUF_validateCTable()` could be twice as
fast if we cast the `HUF_CElt*` to a `U32*` and compare to 0.
However, `U32` has an alignment of 4 instead of 2, so I think that
might be undefined behavior.
* I've run `zstreamtest` compiled normally, with UASAN, and with MSAN
for 4 hours each.
The worst case for the speed difference is a bunch of small blocks
in the same frame. I modified `bench.c` to compress the input in a
single frame but with blocks of the given block size, set by `-B`.
Benchmarks on level 1:
| Program   | Block size (bytes) | Corpus    | Ratio | Compression MB/s | Decompression MB/s |
|-----------|--------------------|-----------|-------|------------------|--------------------|
| zstd.base | 256 | synthetic | 2.364 | 110.0 | 297.0 |
| zstd | 256 | synthetic | 2.367 | 108.9 | 297.0 |
| zstd.base | 256 | silesia | 2.204 | 93.8 | 415.7 |
| zstd | 256 | silesia | 2.204 | 93.4 | 415.7 |
| zstd.base | 512 | synthetic | 2.594 | 144.2 | 420.0 |
| zstd | 512 | synthetic | 2.599 | 141.5 | 425.7 |
| zstd.base | 512 | silesia | 2.358 | 118.4 | 432.6 |
| zstd | 512 | silesia | 2.358 | 119.8 | 432.6 |
| zstd.base | 1024 | synthetic | 2.790 | 192.3 | 594.1 |
| zstd | 1024 | synthetic | 2.794 | 192.3 | 600.0 |
| zstd.base | 1024 | silesia | 2.524 | 148.2 | 464.2 |
| zstd | 1024 | silesia | 2.525 | 148.2 | 467.6 |
| zstd.base | 4096 | synthetic | 3.023 | 300.0 | 1000.0 |
| zstd | 4096 | synthetic | 3.024 | 300.0 | 1010.1 |
| zstd.base | 4096 | silesia | 2.779 | 223.1 | 623.5 |
| zstd | 4096 | silesia | 2.779 | 223.1 | 636.0 |
| zstd.base | 16384 | synthetic | 3.131 | 350.0 | 1150.1 |
| zstd | 16384 | synthetic | 3.152 | 350.0 | 1630.3 |
| zstd.base | 16384 | silesia | 2.871 | 296.5 | 883.3 |
| zstd | 16384 | silesia | 2.872 | 294.4 | 898.3 |
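
For reference, here is a minimal, self-contained sketch of the selection
logic described in the first bullet. The helper names here are hypothetical
stand-ins for zstd's internal `HUF_validateCTable()` and
`HUF_estimateCompressedSize()`; the real compressor's cost accounting
(table header size, thresholds) differs.

```c
#include <stddef.h>

#define MAX_SYMBOL 255

/* A previous table is only reusable if every symbol present in the
 * current block already has a code assigned (nonzero code length). */
static int table_is_valid(const unsigned char bits[MAX_SYMBOL + 1],
                          const unsigned count[MAX_SYMBOL + 1])
{
    int s;
    for (s = 0; s <= MAX_SYMBOL; ++s)
        if (count[s] != 0 && bits[s] == 0) return 0;
    return 1;
}

/* Estimated payload size in bytes when coding the block with the given
 * code lengths (ignores the cost of transmitting the table itself). */
static size_t estimate_size(const unsigned char bits[MAX_SYMBOL + 1],
                            const unsigned count[MAX_SYMBOL + 1])
{
    size_t totalBits = 0;
    int s;
    for (s = 0; s <= MAX_SYMBOL; ++s)
        totalBits += (size_t)count[s] * bits[s];
    return (totalBits + 7) / 8;
}

/* Reuse the previous table when it is valid for the current histogram
 * and no larger than building and transmitting a fresh one. */
static int should_reuse_prev_table(const unsigned char prevBits[MAX_SYMBOL + 1],
                                   const unsigned char newBits[MAX_SYMBOL + 1],
                                   const unsigned count[MAX_SYMBOL + 1],
                                   size_t newTableHeaderSize)
{
    if (!table_is_valid(prevBits, count)) return 0;
    return estimate_size(prevBits, count)
        <= estimate_size(newBits, count) + newTableHeaderSize;
}
```
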
When ZSTD_decompressStream() detects that there is enough space in dst
to complete decompression in a single pass, it delegates to
ZSTD_decompress(), for an extra ~5% speed boost.
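
For context, here is a minimal usage sketch (not part of the change) of the
case that triggers this fast path: the caller passes the whole compressed
frame and a dst buffer at least as large as the decoded content, so a single
call can complete the frame.

```c
#include <zstd.h>

/* Returns the number of decompressed bytes, or an error code.
 * Assumes src holds one complete frame and dstCapacity is at least
 * the frame's decompressed size. */
static size_t decompress_in_one_pass(void* dst, size_t dstCapacity,
                                     const void* src, size_t srcSize)
{
    ZSTD_DStream* const zds = ZSTD_createDStream();
    ZSTD_inBuffer  in  = { src, srcSize, 0 };
    ZSTD_outBuffer out = { dst, dstCapacity, 0 };
    size_t hint;
    if (zds == NULL) return (size_t)(-1);
    ZSTD_initDStream(zds);
    /* With the whole frame in `in` and enough room in `out`, one call is
     * enough (return value 0 means the frame is finished); this is the
     * situation where the internal delegation to ZSTD_decompress() applies. */
    hint = ZSTD_decompressStream(zds, &out, &in);
    ZSTD_freeDStream(zds);
    return ZSTD_isError(hint) ? hint : out.pos;
}
```
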
There used to be a (very small) chance that loading a prefix from a
previous segment would be confused with a real zstd dictionary.
For that to happen, the prefix needs to start with the same value as
the dictionary magic number.
That's 1 chance in 4 billion if all values have equal probability.
In fact, since some values are more common (0x00000000 for example)
and others are less common, and the dictionary magic was selected to be
one of the less common ones, the probability is likely even lower.
Anyway, this risk is now down to zero
thanks to a new CCtx parameter: ZSTD_p_forceRawDict.
Current parameter policy: the parameter "sticks" to its CCtx,
so any dictionary loaded after ZSTD_p_forceRawDict is set
will be loaded in "raw" ("content only") mode,
even if the CCtx is re-used multiple times with multiple different
dictionaries.
It's up to the user to reset this value if later loads should behave
differently.
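
As a rough illustration of that policy, the sketch below assumes the
experimental `ZSTD_setCCtxParameter()` entry point of that era together with
`ZSTD_compress_usingDict()`; treat the setter name as an assumption, since
advanced parameters moved to `ZSTD_CCtx_setParameter()` in later releases.

```c
#include <zstd.h>

/* Sketch only: ZSTD_setCCtxParameter() / ZSTD_p_forceRawDict belong to the
 * experimental API of that era and may be absent in other versions. */
static size_t compress_with_raw_prefix(ZSTD_CCtx* cctx,
                                       void* dst, size_t dstCapacity,
                                       const void* src, size_t srcSize,
                                       const void* prefix, size_t prefixSize)
{
    /* Once set, the flag sticks to the CCtx: every subsequent dictionary
     * load is treated as raw content, even if the prefix happens to start
     * with the dictionary magic number. */
    ZSTD_setCCtxParameter(cctx, ZSTD_p_forceRawDict, 1);
    /* It is up to the caller to reset the flag to 0 if later dictionary
     * loads on this CCtx should be interpreted normally again. */
    return ZSTD_compress_usingDict(cctx, dst, dstCapacity, src, srcSize,
                                   prefix, prefixSize, 1 /* compression level */);
}
```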