zstd/compress at a3c3a38b9b956d2689a019b1b29482e86fd98836 - zstd

mirror of https://github.com/facebook/zstd.git synced 2025-10-09 00:05:28 -04:00


Nick Terrell a3c3a38b9b [lazy] Skip over incompressible data

For every 256 bytes the lazy match finders process without finding a
match, they increase their step size by 1. So for bytes [0, 256) they
search every position, for bytes [256, 512) they search every other
position, and so on. However, they currently still insert every position
into their hash tables. This is different from fast & dfast, which only
insert the positions they search.
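
The step schedule described above can be sketched as a small C helper. This is only an illustration of the arithmetic in this message, not the code in zstd_lazy.c: the names `SEARCH_STRENGTH_LOG`, `step_for`, and `check_step_schedule` are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical constant: the step grows by 1 for every 2^8 = 256 bytes
 * processed without finding a match. */
#define SEARCH_STRENGTH_LOG 8

/* Step size to use after `unmatched` bytes have gone by without a match. */
static size_t step_for(size_t unmatched)
{
    return (unmatched >> SEARCH_STRENGTH_LOG) + 1;
}

/* Spot checks of the schedule: bytes [0, 256) use step 1,
 * bytes [256, 512) use step 2, and so on. */
static void check_step_schedule(void)
{
    assert(step_for(0) == 1);
    assert(step_for(255) == 1);
    assert(step_for(256) == 2);
    assert(step_for(511) == 2);
    assert(step_for(512) == 3);
}
```

Calling `check_step_schedule()` exercises the boundaries named in the paragraph above.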

This PR changes that: once we've searched 2KB without finding any
matches, at which point we're only searching one in 9 positions, we stop
inserting every position and only insert the positions we search. The
exact cutoff of 2KB isn't terribly important; I've just selected a
cutoff that is reasonably large, to minimize the impact on "normal"
data.

This PR only adds skipping to greedy, lazy, and lazy2, but does not
touch btlazy2.
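
To make the effect of the cutoff concrete, here is a simplified model of a scan over data in which no match is ever found. It is a sketch under stated assumptions, not the actual match finder: `scan_incompressible`, `scan_stats`, and `SKIP_THRESHOLD_STEP` are hypothetical names, and the model only counts searches and hash-table insertions rather than performing them.

```c
#include <assert.h>
#include <stddef.h>

#define SEARCH_STRENGTH_LOG 8  /* step grows every 256 unmatched bytes */
#define SKIP_THRESHOLD_STEP 8  /* hypothetical: skip inserts once step > 8,
                                  i.e. after 2KB without a match */

typedef struct {
    size_t searches;  /* positions at which a match search was attempted */
    size_t inserts;   /* positions inserted into the hash table */
} scan_stats;

/* Simulate scanning `len` bytes in which no match is ever found.
 * With skip_enabled == 0, every position is inserted (old behavior).
 * With skip_enabled != 0, only searched positions are inserted once
 * the step exceeds SKIP_THRESHOLD_STEP (behavior described above). */
static scan_stats scan_incompressible(size_t len, int skip_enabled)
{
    scan_stats s = { 0, 0 };
    size_t pos = 0;
    while (pos < len) {
        size_t const step = (pos >> SEARCH_STRENGTH_LOG) + 1;
        size_t const next = (pos + step < len) ? pos + step : len;
        s.searches += 1;
        if (skip_enabled && step > SKIP_THRESHOLD_STEP) {
            s.inserts += 1;           /* only the searched position */
        } else {
            s.inserts += next - pos;  /* every position in the stride */
        }
        pos = next;
    }
    assert(s.inserts <= len);
    return s;
}
```

In this model, a 2048-byte run inserts all 2048 positions even with skipping enabled, because the step only exceeds 8 from byte 2048 onward. Longer runs insert fewer positions than they contain, while the number of searches is unchanged.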

| Dataset | Level | Compiler     | CSize ∆ | Speed ∆ |
|---------|-------|--------------|---------|---------|
| Random  |     5 | clang-14.0.6 |    0.0% |   +704% |
| Random  |     5 | gcc-12.2.0   |    0.0% |   +670% |
| Random  |     7 | clang-14.0.6 |    0.0% |   +679% |
| Random  |     7 | gcc-12.2.0   |    0.0% |   +657% |
| Random  |    12 | clang-14.0.6 |    0.0% |  +1355% |
| Random  |    12 | gcc-12.2.0   |    0.0% |  +1331% |
| Silesia |     5 | clang-14.0.6 | +0.002% |  +0.35% |
| Silesia |     5 | gcc-12.2.0   | +0.002% |  +2.45% |
| Silesia |     7 | clang-14.0.6 | +0.001% |  -1.40% |
| Silesia |     7 | gcc-12.2.0   | +0.007% |  +0.13% |
| Silesia |    12 | clang-14.0.6 | +0.011% | +22.70% |
| Silesia |    12 | gcc-12.2.0   | +0.011% |  -6.68% |
| Enwik8  |     5 | clang-14.0.6 |    0.0% |  -1.02% |
| Enwik8  |     5 | gcc-12.2.0   |    0.0% |  +0.34% |
| Enwik8  |     7 | clang-14.0.6 |    0.0% |  -1.22% |
| Enwik8  |     7 | gcc-12.2.0   |    0.0% |  -0.72% |
| Enwik8  |    12 | clang-14.0.6 |    0.0% | +26.19% |
| Enwik8  |    12 | gcc-12.2.0   |    0.0% |  -5.70% |

The speed difference for clang at level 12 is real, but is probably
caused by some sort of alignment or codegen issue. clang is
significantly slower than gcc before this PR, but reaches parity with
it afterwards.

I also measured the ratio difference for the HC match finder, and it
looks basically the same as for the row-based match finder. The speedup
on random data is similar. Otherwise, performance is about neutral,
without the big difference at level 12 for either clang or gcc.

2023-03-20 11:18:29 -07:00

clevels.h

initial commit

2023-01-04 13:01:54 -08:00

fse_compress.c

[fse] Delete unused functions

2023-01-27 13:15:07 -08:00

hist.c

initial commit

2023-01-04 13:01:54 -08:00

hist.h

initial commit

2023-01-04 13:01:54 -08:00

huf_compress.c

Replace Huffman boolean args with flags bit set

2023-01-20 14:12:53 -08:00

zstd_compress_internal.h

[lazy] Skip over incompressible data

2023-03-20 11:18:29 -07:00

zstd_compress_literals.c

Replace Huffman boolean args with flags bit set

2023-01-20 14:12:53 -08:00

zstd_compress_literals.h

fix root cause of #3416

2023-01-12 15:41:08 -08:00

zstd_compress_sequences.c

initial commit

2023-01-04 13:01:54 -08:00

zstd_compress_sequences.h

initial commit

2023-01-04 13:01:54 -08:00

zstd_compress_superblock.c

Replace Huffman boolean args with flags bit set

2023-01-20 14:12:53 -08:00

zstd_compress_superblock.h

initial commit

2023-01-04 13:01:54 -08:00

zstd_compress.c

[lazy] Skip over incompressible data

2023-03-20 11:18:29 -07:00

zstd_cwksp.h

Add init once memory (#3528) (#3529)

2023-03-13 13:20:49 -07:00

zstd_double_fast.c

initial commit

2023-01-04 13:01:54 -08:00

zstd_double_fast.h

initial commit

2023-01-04 13:01:54 -08:00

zstd_fast.c

initial commit

2023-01-04 13:01:54 -08:00

zstd_fast.h

initial commit

2023-01-04 13:01:54 -08:00

zstd_lazy.c

[lazy] Skip over incompressible data

2023-03-20 11:18:29 -07:00

zstd_lazy.h

Cap hashLog & chainLog to ensure that we only use 32 bits of hash

2023-01-20 14:05:26 -08:00

zstd_ldm_geartab.h

initial commit

2023-01-04 13:01:54 -08:00

zstd_ldm.c

initial commit

2023-01-04 13:01:54 -08:00

zstd_ldm.h

initial commit

2023-01-04 13:01:54 -08:00

zstd_opt.c

Add init once memory (#3528) (#3529)

2023-03-13 13:20:49 -07:00

zstd_opt.h

initial commit

2023-01-04 13:01:54 -08:00

zstdmt_compress.c

Deprecated bufferless and block level APIs

2023-03-16 10:04:15 -07:00

zstdmt_compress.h

initial commit

2023-01-04 13:01:54 -08:00