Do some include shuffling for `**.h` files within lib, programs, tests, and zlibWrapper.
`lib/legacy` and `lib/deprecated` are untouched.
`#include`s within `extern "C"` blocks in .cpp files are untouched.
todo: shuffling for `xxhash.h`
The NDK cross-compiler declares the target as __linux (which is
not technically incorrect), which triggers the enablement of _GNU_SOURCE
in the newly added code that requires the presence of qsort_r() used
in the COVER dictionary code.
Even though the NDK uses llvm/libc, it doesn't declare qsort_r()
in the stdlib.h header.
The build fix is to only activate the _GNU_SOURCE macro if the OS is
*not* Android; in that case we fall back to the C90-compliant code.
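A minimal sketch of the guard described above (the actual zstd conditions differ in detail):

```c
/* Sketch only: define _GNU_SOURCE, and thus expose qsort_r(),
 * when targeting Linux but not Android. Android targets fall
 * back to the C90-compliant code path instead. */
#if defined(__linux) && !defined(__ANDROID__)
#  ifndef _GNU_SOURCE
#    define _GNU_SOURCE   /* enables qsort_r() in <stdlib.h> on glibc */
#  endif
#endif
#include <stdlib.h>
```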
This patch should solve the reported issue #4103.
The two main functions used for dictionary training with the COVER
algorithm require initialization of a COVER_ctx_t, during which a call
to qsort() is performed.
The issue is that the standard C99 qsort() function doesn't offer
a way to pass an extra parameter to the comparison function callback
(e.g. a pointer to a context), and currently zstd relies on a *global*
static variable to hold a pointer to the context needed to perform
the sort operation.
If a zstd library user invokes either ZDICT_trainFromBuffer_cover or
ZDICT_optimizeTrainFromBuffer_cover from multiple threads, the
global context may be overwritten before or during the call to qsort()
in the initialization of the COVER_ctx_t, leading to crashes
and other bad things (TM) as reported in issue #4045.
Enter qsort_r(): it was designed to address precisely this situation.
To quote from the documentation [1]: "the comparison function does not need to
use global variables to pass through arbitrary arguments, and is therefore
reentrant and safe to use in threads."
It is available with small variations for multiple OSes (GNU, BSD [2],
Windows [3]), and the ISO C11 standard [4] features qsort_s() in Annex B-21 as
part of <stdlib.h>. Let's hope that compilers eventually catch up
with it.
For now, we have to handle the small variations in function parameters
for each platform.
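An illustrative sketch of those variations (identifiers like `sort_ctx` and `ctx_sort` are hypothetical, not the actual zstd code); in every branch the context travels with the call, so no global state is needed:

```c
#include <stddef.h>
#include <stdlib.h>

typedef struct { int descending; } sort_ctx;  /* stands in for COVER_ctx_t */

#if defined(_GNU_SOURCE)   /* glibc: context is the *last* comparator argument */
static int cmp_glibc(const void* a, const void* b, void* opaque)
{
    const sort_ctx* ctx = (const sort_ctx*)opaque;
    int diff = *(const int*)a - *(const int*)b;
    return ctx->descending ? -diff : diff;
}
static void ctx_sort(int* base, size_t n, sort_ctx* ctx)
{
    qsort_r(base, n, sizeof(int), cmp_glibc, ctx);
}
#elif defined(_MSC_VER)    /* Windows qsort_s(): context comes *first* */
static int __cdecl cmp_win(void* opaque, const void* a, const void* b)
{
    const sort_ctx* ctx = (const sort_ctx*)opaque;
    int diff = *(const int*)a - *(const int*)b;
    return ctx->descending ? -diff : diff;
}
static void ctx_sort(int* base, size_t n, sort_ctx* ctx)
{
    qsort_s(base, n, sizeof(int), cmp_win, ctx);
}
#else                      /* C90 fallback: hand-rolled reentrant sort */
static void ctx_sort(int* base, size_t n, sort_ctx* ctx)
{
    size_t i, j;
    for (i = 1; i < n; i++) {   /* insertion sort: O(n^2) but portable */
        int v = base[i];
        for (j = i; j > 0; j--) {
            int diff = base[j-1] - v;
            if ((ctx->descending ? -diff : diff) <= 0) break;
            base[j] = base[j-1];
        }
        base[j] = v;
    }
}
#endif
```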
The current fix solves the problem by allowing each executing thread to
pass its own COVER_ctx_t instance to qsort_r(), removing the use of
a global pointer and making the code reentrant.
Unfortunately for *BSD, we cannot leverage qsort_r(), given that its API
has changed in newer versions of FreeBSD (14.0) and the other BSD variants
(e.g. NetBSD, OpenBSD) don't implement it.
For such cases we provide a fallback that only requires a compiler
with C90 support.
[1] https://man7.org/linux/man-pages/man3/qsort_r.3.html
[2] https://man.freebsd.org/cgi/man.cgi?query=qsort_r
[3] https://learn.microsoft.com/en-us/cpp/c-runtime-library/reference/qsort-s?view=msvc-170
[4] https://www.open-std.org/jtc1/sc22/wg14/www/docs/n1548.pdf
This PR introduces no functional changes. It attempts to change all
macros currently using `{ }` (or some variant of that) to
`do { } while (0)`, and introduces trailing `;` where necessary.
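For context, an illustrative example (not taken from the diff) of why the idiom matters: a brace-only macro followed by `;` breaks `if`/`else` chains.

```c
#define BAD_SWAP(x, y)  { int t = (x); (x) = (y); (y) = t; }
#define GOOD_SWAP(x, y) do { int t = (x); (x) = (y); (y) = t; } while (0)

void demo(int a, int b, int cond)
{
    if (cond)
        GOOD_SWAP(a, b);  /* expands to one statement; trailing `;` is fine */
    else                  /* with BAD_SWAP, the `;` ends the `if`, so this   */
        b = a;            /* `else` would fail to compile                    */
}
```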
There were no bugs found during this migration.
The Visual Studio bug that caused it to warn on this pattern has been
fixed since VS2015.
Additionally, we have had several instances of `do { } while (0)`
present for several releases, so we don't have to worry about
breaking people's builds.
Fixes Issue #3830.
The flexArray in the FSE_DecompressWksp structure
is just a way to derive a pointer easily,
without the risk/complexity of calculating it manually.
Not sure if this change is good enough to avoid UBSAN warnings, though.
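For illustration, a sketch of the shape involved (not the exact FSE_DecompressWksp layout): the trailing flexible array member lets the compiler derive a correctly aligned pointer to the buffer that follows the fixed fields.

```c
#include <stddef.h>

typedef struct {
    unsigned tableLog;
    unsigned fastMode;
    short workspace[];   /* C99 flexible array member */
} Wksp;

/* wksp->workspace == (short*)((char*)wksp + offsetof(Wksp, workspace)),
 * but the offset arithmetic and its alignment are the compiler's problem. */
```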
* Remove all pointer-overflow suppressions from our UBSAN builds/tests.
* Add `ZSTD_ALLOW_POINTER_OVERFLOW_ATTR` macro to suppress
pointer-overflow at a per-function level (see the sketch after this
list). This is a superior approach because it also applies to users
who build zstd with UBSAN.
* Add `ZSTD_wrappedPtr{Diff,Add,Sub}()` that use these suppressions.
The end goal is to only tag these functions with
`ZSTD_ALLOW_POINTER_OVERFLOW`. But we can start by annotating functions
that rely on pointer overflow, and gradually transition to using
these.
* Add `ZSTD_maybeNullPtrAdd()` to simplify pointer addition when the
pointer may be `NULL`.
* Fix all the fuzzer issues that came up. I'm sure there will be a lot
more, but these are the ones that came up within a few minutes of
running the fuzzers, and while running GitHub CI.
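A sketch of the mechanism described above (the real zstd definitions may differ in detail): Clang's `no_sanitize` attribute disables the pointer-overflow check for a single function instead of suppressing it build-wide.

```c
#include <stddef.h>

#if defined(__clang__)
#  define ZSTD_ALLOW_POINTER_OVERFLOW_ATTR \
       __attribute__((no_sanitize("pointer-overflow")))
#else
#  define ZSTD_ALLOW_POINTER_OVERFLOW_ATTR
#endif

/* Overflowing arithmetic is intentional here, so only this helper is
 * exempted from UBSAN; all of its callers remain fully checked. */
ZSTD_ALLOW_POINTER_OVERFLOW_ATTR
static void* ZSTD_wrappedPtrAdd_sketch(void* ptr, ptrdiff_t add)
{
    return (char*)ptr + add;
}
```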
The Huffman repeat mode checker assumed that the CTable was zeroed in the region `[maxSymbolValue + 1, 256)`.
This assumption didn't hold for tables built in dictionaries, because dictionary building didn't go through the same codepath.
Since this code was originally written, we added a header to the CTable that specifies the `tableLog`.
Add `maxSymbolValue` to that header, and check that the table's `maxSymbolValue` is at least the block's `maxSymbolValue`.
This solution is cleaner because we write this header for every CTable we build, so it can't be missed in any code path.
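A hypothetical sketch of the check described above (field and function names are illustrative, not the actual zstd identifiers):

```c
typedef struct {
    unsigned tableLog;
    unsigned maxSymbolValue;   /* newly recorded in the CTable header */
} CTableHeader_sketch;

static int canReuseCTable(const CTableHeader_sketch* h,
                          unsigned blockMaxSymbolValue)
{
    /* The table is reusable only if it covers every symbol that can
     * appear in the block; no zeroed-tail assumption is needed. */
    return h->maxSymbolValue >= blockMaxSymbolValue;
}
```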
Credit to OSS-Fuzz
The sequence section starts with a number, which tells how many sequences are present in the section.
If this number is 0, the section automatically ends.
The number 0 can be represented using either the 1-byte or the 2-byte format,
because the 2-byte format fully overlaps the 1-byte format.
However, when 0 is represented using the 2-byte format,
the decoder was expecting the sequence section to continue,
and was looking for FSE tables, which is incorrect.
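A simplified sketch of the Number_of_Sequences parsing per the Zstandard format specification (no bounds checks); note that 0 can be encoded either as the single byte 0x00 or as the two bytes 0x80 0x00, and both must terminate the sequences section:

```c
#include <stddef.h>

static size_t readNbSeq(const unsigned char* ip, size_t* nbSeq)
{
    if (ip[0] < 128) {    /* 1-byte format: 0..127 */
        *nbSeq = ip[0];
        return 1;
    }
    if (ip[0] < 255) {    /* 2-byte format: 0..0x7EFF (overlaps 1-byte) */
        *nbSeq = ((size_t)(ip[0] - 128) << 8) + ip[1];
        return 2;
    }
    /* 3-byte format: 0x7F00 and up */
    *nbSeq = ip[1] + ((size_t)ip[2] << 8) + 0x7F00;
    return 3;
}
```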
Fixed this behavior, in both the reference decoder and the educational decoder.
In practice, this behavior never happens,
because the encoder will always select the 1-byte format to represent 0,
since this is more efficient.
Completed the fix with a new golden sample for tests,
a clarification of the specification,
and a decoder errata paragraph.
Inlining `BIT_reloadDStream` provided a >3% decompression speed improvement for
clang PGO-optimized zstd binary, measured using the Silesia corpus with
compression level 1. The win comes from improved register allocation which leads
to fewer spills and reloads. Take a look at this comparison of
profile-annotated hot assembly before and after this change:
https://www.diffchecker.com/UjDGIyLz/. The diff is a bit messy, but notice three
fewer moves after inlining.
In general LLVM's register allocator works better when it can see more code. For
example, when the register allocator sees a call instruction, it partitions the
registers into caller registers and callee registers, and it is not free to do
whatever it wants with all the registers for the current function. Inlining the
callee lets the register allocator access all registers and use them more
flexibly.
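A generic sketch of the kind of change involved (the macro shown is not zstd's exact FORCE_INLINE definition): forcing the hot helper to inline so the register allocator sees caller and callee as one function body, avoiding the caller-saved/callee-saved partition at the call site.

```c
#if defined(__GNUC__) || defined(__clang__)
#  define FORCE_INLINE static inline __attribute__((always_inline))
#elif defined(_MSC_VER)
#  define FORCE_INLINE static __forceinline
#else
#  define FORCE_INLINE static inline
#endif

/* hypothetical hot helper, always inlined into the decode loop */
FORCE_INLINE unsigned hotHelper(unsigned x) { return x * 2654435761u; }
```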
Part 2 of #3528
Adds a hash salt that helps to avoid regressions where consecutive compressions use the same tag space with similar data (running `zstd -b5e7 enwik8 -B128K` reproduces this regression).
- Adds a memory type that is guaranteed to have been initialized at least once in the workspace's lifetime.
- Changes the tag space in the row hash to be based on this init-once memory.
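Illustrative only (not the actual row-hash code): mixing a per-compression salt into the 8-bit tag derived from the hash, so that consecutive compressions of similar data land in different tag spaces even though the hashes repeat.

```c
typedef unsigned long long U64;

static unsigned char tagFromHash(U64 hash, unsigned char salt)
{
    return (unsigned char)(hash >> 56) ^ salt;   /* salted 8-bit tag */
}
```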
* Fixes the zstd-dll build (https://github.com/facebook/zstd/issues/3492):
- Adds pool.o and threading.o dependencies to the zstd-dll target
- Moves custom allocation functions into a header to avoid needing a dependency on common.o
- Adds a test target for zstd-dll
- Adds a GitHub workflow that builds zstd-dll