On GitHub runners, VS 2019 16.11 (MSVC 19.29.30158) results in
test failures. VS 2022 17.13 (MSVC 19.43.34808) works.
In xz 5.6.x there was a #pragma-based workaround for MSVC builds for
32-bit x86. Another method was thought to work with the newly rewritten
CLMUL CRC. Apparently it doesn't. Keep it simple and disable CLMUL CRC
with any non-recent MSVC when building for 32-bit x86.
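The idea, roughly, is a preprocessor check like the following (the
macro name and the exact _MSC_VER cutoff are illustrative, not the
exact ones used in liblzma):

    /* Sketch only: skip the CLMUL code path when building 32-bit x86
     * with an MSVC older than VS 2022. */
    #if defined(_MSC_VER) && _MSC_VER < 1930 && defined(_M_IX86)
    #   undef CRC_X86_CLMUL
    #endif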
Fixes: 54eaea5ea49b ("liblzma: x86 CLMUL CRC: Rewrite")
Fixes: https://github.com/tukaani-project/xz/issues/171
Reported-by: Andrew Murray
This reverts commit dc03f6290f5b9bd3d50c7e12e58dee870889d599.
OpenBSD 7.6 will support elf_aux_info(3), and the detection code used
on FreeBSD will work on OpenBSD 7.6 too. Keep things simpler and drop
the OpenBSD-specific sysctl() method.
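For reference, a minimal sketch of the shared elf_aux_info(3) based
check (simplified; the exact headers and HWCAP flag names can vary
between systems):

    #include <sys/auxv.h>
    #include <stdbool.h>

    static bool
    is_arch_extension_supported(void)
    {
        /* On failure hwcap stays zero and the generic code is used. */
        unsigned long hwcap = 0;
        elf_aux_info(AT_HWCAP, &hwcap, sizeof(hwcap));
        return (hwcap & HWCAP_CRC32) != 0;
    }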
Thanks to Christian Weisgerber.
The crc.w.{b/h/w/d}.w instructions in LoongArch can calculate the CRC32
result for 1/2/4/8 bytes in a single operation. Using these is much
faster compared to the generic method.
Optimized CRC32 is enabled unconditionally on 64-bit LoongArch because
the LoongArch specification says that CRC32 instructions shall be
implemented for 64-bit processors. Optimized CRC32 isn't enabled for
32-bit LoongArch processors because not enough information is available
about them.
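As a rough sketch of the idea (not the actual liblzma code; the helper
name and the use of inline assembly instead of the <larchintrin.h>
intrinsics are only for illustration):

    #include <stddef.h>
    #include <stdint.h>
    #include <string.h>

    static uint32_t
    crc32_loongarch(const uint8_t *buf, size_t size, uint32_t crc)
    {
        crc = ~crc;

        /* crc.w.d.w processes 8 bytes per instruction. */
        while (size >= 8) {
            uint64_t chunk;
            memcpy(&chunk, buf, 8);
            __asm__("crc.w.d.w %0, %1, %0" : "+r"(crc) : "r"(chunk));
            buf += 8;
            size -= 8;
        }

        /* Finish the remaining bytes with crc.w.b.w. */
        while (size-- > 0)
            __asm__("crc.w.b.w %0, %1, %0"
                    : "+r"(crc) : "r"((uint32_t)*buf++));

        return ~crc;
    }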
Co-authored-by: Lasse Collin <lasse.collin@tukaani.org>
Closes: https://github.com/tukaani-project/xz/pull/86
Prefix ARM64_RUNTIME_DETECTION with CRC_ and reorder it to be with
the other ARM64-specific lines. That macro isn't used outside this
file.
An ARM64 CLMUL implementation doesn't exist yet, so CRC64_ARM64_CLMUL
isn't used anywhere yet.
It's not ideal that the single-letter CRC utility macros are here
as they pollute the namespace of the LZ encoder files. Those could
be moved to their own crc_macros.h like they were in 5.2.x, but in practice
this is fine enough already.
LZ encoder needs lzma_crc32_table[0] but otherwise those tables
are private to the CRC code. In contrast, the other things in
check.h are needed in several places.
Now runtime detection of CLMUL support can pick between the CLMUL and
the generic assembly implementations. Whatever overhead this has for
builds that omit CLMUL completely isn't important because builds for
any non-ancient system are likely to include the CLMUL code too.
Handle the CRC tables in crcXX_fast.c files because now these files
are built even when assembly code is used.
If 32-bit x86 assembly is enabled then it will always be built even
if compiler flags were such that CLMUL would be allowed unconditionally.
That is, runtime detection will be used anyway. This keeps the build
rules simpler.
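Schematically, the runtime dispatch works like this (an illustrative
sketch; the real code caches the detection result and may go through
ifunc instead of a branch):

    #include <stdbool.h>
    #include <stddef.h>
    #include <stdint.h>

    extern uint32_t crc32_generic(const uint8_t *buf, size_t size,
            uint32_t crc);
    extern uint32_t crc32_arch_optimized(const uint8_t *buf, size_t size,
            uint32_t crc);
    extern bool is_arch_extension_supported(void);

    uint32_t
    lzma_crc32(const uint8_t *buf, size_t size, uint32_t crc)
    {
        /* Both implementations are in the build; the CLMUL one is
         * picked when the processor supports it. */
        return is_arch_extension_supported()
                ? crc32_arch_optimized(buf, size, crc)
                : crc32_generic(buf, size, crc);
    }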
In the LZ encoder, build and use lzma_lz_hash_table[256] if the CLMUL CRC
is used without runtime detection. Previously this wasn't needed
because crc32_table.c included the lzma_crc32_table[][] in the build
unless encoder support had been disabled. Including an 8 KiB table
was silly when only 1 KiB is actually used. So now liblzma is 7 KiB
smaller if CLMUL is enabled without runtime detection.
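To illustrate the sizes (declarations shown schematically; only the
array dimensions matter here):

    #include <stdint.h>

    /* The table-based CRC32 needs the full 8 x 256 x 4-byte table
     * (8 KiB), but the LZ encoder's hash reads only 256 entries
     * (1 KiB), so it now gets its own small table. */
    extern const uint32_t lzma_crc32_table[8][256];
    extern const uint32_t lzma_lz_hash_table[256];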
On E2K the function compiles only due to compiler emulation but the
function is never used. It's cleaner to omit the function when it's
not needed even though it's a "static inline" function.
Thanks to Ilya Kurdyukov.
It's faster with both tiny and large buffers and doesn't require
disabling any sanitizers. With large buffers the extra speed is
from folding four 16-byte chunks in parallel.
The 32-bit x86 with MSVC reportedly still needs a workaround.
Now the simpler "__asm mov ebx, ebx" trick is enough but it
needs to be in lzma_crc64() instead of crc64_arch_optimized().
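In other words, roughly the following (heavily simplified; the real
lzma_crc64() has more logic around the call):

    #include <stddef.h>
    #include <stdint.h>

    extern uint64_t crc64_arch_optimized(const uint8_t *buf, size_t size,
            uint64_t crc);

    uint64_t
    lzma_crc64(const uint8_t *buf, size_t size, uint64_t crc)
    {
    #if defined(_MSC_VER) && defined(_M_IX86)
        /* Dummy instruction; what matters is that it is here and not
         * in crc64_arch_optimized(). */
        __asm mov ebx, ebx
    #endif
        return crc64_arch_optimized(buf, size, crc);
    }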
Thanks to Iouri Kharon for testing and the fix.
Thanks to Ilya Kurdyukov for testing the speed with aligned and
unaligned buffers on a few x86 processors and on E2K v6.
Thanks to Sam James for general feedback.
Fixes: https://github.com/tukaani-project/xz/issues/112
Fixes: https://github.com/tukaani-project/xz/issues/122
Now it refers to crc_clmul_consts_gen.c. vfold8 was renamed to mu_p
and the p no longer has the lowest bit set (it makes no difference
as the output bits it affects are ignored).
It's not enough to silence the address sanitizer. Also memory and
thread sanitizers would need to be silenced. They, at least currently,
aren't smart enough to see that the extra bytes are discarded from
the xmm registers by later instructions.
Valgrind is smarter, possibly because this kind of code isn't weird
to write in assembly. Agner Fog's optimizing_assembly.pdf even mentions
this idea of doing an aligned read and then discarding the extra
bytes. The sanitizers don't instrument assembly code but Valgrind
checks all code.
It's better to change the implementation to avoid the sanitization
attributes, which also look scary in the code. (Somehow they can look
scarier than __asm__, which is implicitly unsanitized.)
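For context, the aligned-read idea looks roughly like this
(illustrative only, not the code that was in crc_x86_clmul.h):

    #include <emmintrin.h>
    #include <stdint.h>

    static __m128i
    load_first_chunk(const uint8_t *buf)
    {
        /* Round the address down to a 16-byte boundary and load the
         * whole aligned block. The load may touch bytes before the
         * start of the buffer; later instructions discard them, but
         * sanitizers flag the load itself. */
        const uintptr_t addr = (uintptr_t)buf & ~(uintptr_t)15;
        return _mm_load_si128((const __m128i *)addr);
    }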
See also:
https://github.com/tukaani-project/xz/issues/112
https://github.com/tukaani-project/xz/issues/122
The C code is from Christian Weisgerber; I merely reordered the OSes.
Then I added the build system checks without testing them.
Also thanks to Brad Smith who submitted a similar patch on GitHub
a few hours after Christian had sent his via email.
Co-authored-by: Christian Weisgerber <naddy@mips.inka.de>
Closes: https://github.com/tukaani-project/xz/pull/125
The __builtin_bswapXX builtins from GCC and Clang are preferred when
they are available. This can allow compilers to emit the x86 MOVBE
instruction instead of doing a load + byteswap as two instructions
(which would happen if the byteswapping is done in inline asm).
bswap16, bswap32, and bswap64 exist in system headers on *BSDs
and Darwin. #defining bswap16 on NetBSD results in a warning about
macro redefinition. It's safest to avoid this namespace conflict
completely.
No OS supported by tuklib_integer.h uses byteswapXX names and
a web search doesn't immediately find any obvious danger of
namespace conflicts. So let's try these still-pretty-short names
for the macros.
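A simplified sketch of the selection (the real tuklib_integer.h has
more fallbacks and uses configure-time checks instead of this compiler
check; only byteswap32 is shown):

    #include <stdint.h>

    #if defined(__GNUC__) || defined(__clang__)
        /* Lets the compiler emit e.g. x86 MOVBE for a load + swap. */
    #   define byteswap32(n) __builtin_bswap32(n)
    #else
        /* Portable fallback. */
    #   define byteswap32(n) \
            ((((uint32_t)(n) & 0x000000FFU) << 24) \
            | (((uint32_t)(n) & 0x0000FF00U) << 8) \
            | (((uint32_t)(n) & 0x00FF0000U) >> 8) \
            | (((uint32_t)(n) & 0xFF000000U) >> 24))
    #endif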
Thanks to Sam James for pointing out the compiler warning on
NetBSD 10.0.
This is *NOT* done for security reasons even though the backdoor
relied on the ifunc code. Instead, the reason is that in this
project ifunc provides little benefit but requires quite a bit of
extra code to support it. The only case where ifunc *might* matter
for performance is if the CRC functions are used directly by an
application. In normal compression use it's completely irrelevant.
While the backdoor was inactive (and thus harmless) unless a small
trigger code was inserted into the build system when the source package
was created, it's good to remove it anyway:
- The executable payloads were embedded as binary blobs in
the test files. This was a blatant violation of the
Debian Free Software Guidelines.
- On machines that see lots of bots poking at the SSH port, the backdoor
noticeably increased CPU load, resulting in degraded user experience
and thus overwhelmingly negative user feedback.
- The maintainer who added the backdoor has disappeared.
- Backdoors are bad for security.
This reverts the following without making any other changes:
6e636819 Tests: Update two test files.
a3a29bbd Tests: Test --single-stream can decompress bad-3-corrupt_lzma2.xz.
0b4ccc91 Tests: Update RISC-V test files.
8c9b8b20 liblzma: Fix typos in crc32_fast.c and crc64_fast.c.
82ecc538 liblzma: Fix false Valgrind error report with GCC.
cf44e4b7 Tests: Add a few test files.
3060e107 Tests: Use smaller dictionary size in RISC-V test files.
e2870db5 Tests: Add two RISC-V Filter test files.
The RISC-V test files also have real content that tests the filter
but the real content would fit into much smaller files. A generator
program would need to be available as well.
Thanks to Andres Freund for finding and reporting it and making
it public quickly so others could act without a delay.
See: https://www.openwall.com/lists/oss-security/2024/03/29/4
With GCC and a certain combination of flags, Valgrind will falsely
trigger an invalid write. This appears to be due to the omission of
instructions to properly save, set up, and restore the frame pointer.
The IFUNC resolver is a leaf function since it only calls a function
that is inlined. So sometimes GCC omits the frame pointer instructions
in the resolver unless this optimization is explicitly disabled.
This fixes https://bugzilla.redhat.com/show_bug.cgi?id=2267598.
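One way to force the frame pointer code in the resolver is a function
attribute like the one below (shown with GCC's optimize attribute as
one possibility; the declarations are placeholders and the actual fix
may differ in detail):

    #include <stdbool.h>
    #include <stddef.h>
    #include <stdint.h>

    typedef uint64_t (*crc64_func_type)(const uint8_t *buf, size_t size,
            uint64_t crc);

    extern uint64_t crc64_generic(const uint8_t *buf, size_t size,
            uint64_t crc);
    extern uint64_t crc64_arch_optimized(const uint8_t *buf, size_t size,
            uint64_t crc);
    extern bool is_arch_extension_supported(void);

    /* Equivalent to building this one function with
     * -fno-omit-frame-pointer. */
    __attribute__((__optimize__("no-omit-frame-pointer")))
    static crc64_func_type
    crc64_resolve(void)
    {
        return is_arch_extension_supported()
                ? &crc64_arch_optimized : &crc64_generic;
    }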
Perhaps the generated files aren't even copyrightable but
using the same license for them as for the rest of liblzma
keeps things more consistent for tools that look for license info.
The initial commit 5d018dc03549c1ee4958364712fb0c94e1bf2741
in 2007 had a comment in sha256.c that the code is based on
Crypto++ Library 5.5.1. In 2009 the Authors list in sha256.c
and the AUTHORS file was updated with information that the
code had come from Crypto++ but via 7-Zip. I know I had viewed
7-Zip's SHA-256 code, but back then the C code was identical
enough to Crypto++'s, so I don't know why I thought the author info
would need that extra step via 7-Zip for this single file.
Another error is that I had mixed up the sha.* and shacal2.* files
when checking for author info in Crypto++. The shacal2.* files
aren't related to liblzma's sha256.c and thus Kevin Springle's
code in Crypto++ isn't either.
If liblzma is configured with --disable-clmul-crc
CFLAGS="-msse4.1 -mpclmul", then it will fail to compile because the
generic version must be used but the CRC tables were not included.
The code was using HAVE_FUNC_ATTRIBUTE_IFUNC instead of CRC_USE_IFUNC.
ifunc is incompatible with ARM64 because the runtime detection there
requires non-inline function calls.
Even though the proper name for the architecture is aarch64, this
project uses ARM64 throughout. So the rename is for consistency.
Additionally, crc32_arm64.h was slightly refactored for the following
changes:
* Added MSVC, FreeBSD, and macOS support in
is_arch_extension_supported().
* crc32_arch_optimized() now checks the size when aligning the
buffer.
* crc32_arch_optimized() loop conditions were slightly modified to
avoid both decrementing the size and incrementing the buffer
pointer.
* Use the intrinsic wrappers defined in <arm_acle.h> because GCC and
Clang name them differently.
* Minor spacing and comment changes.
CRC_GENERIC is now split into CRC32_GENERIC and CRC64_GENERIC since
the ARM64 optimizations will differ between CRC32 and CRC64.
For the same reason, CRC_ARCH_OPTIMIZED is split into
CRC32_ARCH_OPTIMIZED and CRC64_ARCH_OPTIMIZED.
ifunc will only be used with x86-64 CLMUL because the runtime detection
methods needed with ARM64 are not compatible with ifunc.
The CRC32 instructions in ARM64 can calculate the CRC32 result
for 8 bytes in a single operation, making them much faster than
the generic CRC32 code.
The optimized CRC32 is enabled on Linux when the ARM64 CRC32
extension is available.
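The core of the optimized loop is roughly the following (simplified;
the function name matches later commits and the real code also handles
buffer alignment):

    /* Build with the CRC32 extension enabled, e.g. -march=armv8-a+crc. */
    #include <arm_acle.h>
    #include <stddef.h>
    #include <stdint.h>
    #include <string.h>

    static uint32_t
    crc32_arch_optimized(const uint8_t *buf, size_t size, uint32_t crc)
    {
        crc = ~crc;

        /* __crc32d processes 8 bytes per instruction. */
        while (size >= 8) {
            uint64_t chunk;
            memcpy(&chunk, buf, 8);
            crc = __crc32d(crc, chunk);
            buf += 8;
            size -= 8;
        }

        while (size-- > 0)
            crc = __crc32b(crc, *buf++);

        return ~crc;
    }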
Signed-off-by: Chenxi Mao <chenxi.mao2013@gmail.com>
Now that crc_simd_body() in crc_x86_clmul.h is only called once
per translation unit, we no longer need to be so cautious about
ensuring the always-inline behavior.