root/xz - xz - Root on GIT

root/xz

mirror of https://git.tukaani.org/xz.git synced 2026-04-08 00:58:00 +00:00

Author	SHA1	Message	Date
Lasse Collin	ee44863ae8	liblzma: Add ifunc implementation to crc64_fast.c. The ifunc method avoids indirection via the function pointer crc64_func. This works on GNU/Linux and probably on FreeBSD too. The previous __attribute((__constructor__)) method is kept for compatibility with ELF platforms which do support ifunc. The ifunc method has some limitations, for example, building liblzma with -fsanitize=address will result in segfaults. The configure option --disable-ifunc must be used for such builds. Thanks to Hans Jansen for the original patch. Closes: https://github.com/tukaani-project/xz/pull/53	2023-06-27 23:55:59 +08:00
Lasse Collin	b473a92891	Change a few HTTP URLs to HTTPS. The xz man page timestamp was intentionally left unchanged.	2023-03-18 15:56:07 +02:00
Lasse Collin	718b22a6c5	liblzma: Silence a warning from MSVC. It gives C4146 here since unary minus with unsigned integer is still unsigned (which is the intention here). Doing it with substraction makes it clearer and avoids the warning. Thanks to Nathan Moinvaziri for reporting this.	2023-02-16 17:59:50 +02:00
Lasse Collin	6c886cc5b3	Fix warnings from clang -Wdocumentation.	2023-01-12 03:11:40 +02:00
Lasse Collin	0b8fa310cf	liblzma: CLMUL CRC64: Work around a bug in MSVC, second attempt. This affects only 32-bit x86 builds. x86-64 is OK as is. I still cannot easily test this myself. The reporter has tested this and it passes the tests included in the CMake build and performance is good: raw CRC64 is 2-3 times faster than the C version of the slice-by-four method. (Note that liblzma doesn't include a MSVC-compatible version of the 32-bit x86 assembly code for the slice-by-four method.) Thanks to Iouri Kharon for figuring out a fix, testing, and benchmarking.	2023-01-10 22:15:55 +02:00
Lasse Collin	cfabb62a48	Revert "liblzma: CLMUL CRC64: Workaround a bug in MSVC (VS2015-2022)." This reverts commit 36edc65ab4cf10a131f239acbd423b4510ba52d5. It was reported that it wasn't a good enough fix and MSVC still produced (different kind of) bad code when building for 32-bit x86 if optimizations are enabled. Thanks to Iouri Kharon.	2023-01-10 12:47:16 +02:00
Lasse Collin	36edc65ab4	liblzma: CLMUL CRC64: Workaround a bug in MSVC (VS2015-2022). I haven't tested with MSVC myself and there doesn't seem to be information about the problem online, so I'm relying on the bug report. Thanks to Iouri Kharon for the bug report and the patch.	2023-01-09 12:22:05 +02:00
Lasse Collin	f644473a21	liblzma: Add fast CRC64 for 32/64-bit x86 using SSSE3 + SSE4.1 + CLMUL. It also works on E2K as it supports these intrinsics. On x86-64 runtime detection is used so the code keeps working on older processors too. A CLMUL-only build can be done by using -msse4.1 -mpclmul in CFLAGS and this will reduce the library size since the generic implementation and its 8 KiB lookup table will be omitted. On 32-bit x86 this isn't used by default for now because by default on 32-bit x86 the separate assembly file crc64_x86.S is used. If --disable-assembler is used then this new CLMUL code is used the same way as on 64-bit x86. However, a CLMUL-only build (-msse4.1 -mpclmul) won't omit the 8 KiB lookup table on 32-bit x86 due to a currently-missing check for disabled assembler usage. The configure.ac check should be such that the code won't be built if something in the toolchain doesn't support it but --disable-clmul-crc option can be used to unconditionally disable this feature. CLMUL speeds up decompression of files that have compressed very well (assuming CRC64 is used as a check type). It is know that the CLMUL code is significantly slower than the generic code for tiny inputs (especially 1-8 bytes but up to 16 bytes). If that is a real-world problem then there is already a commented-out variant that uses the generic version for small inputs. Thanks to Ilya Kurdyukov for the original patch which was derived from a white paper from Intel [1] (published in 2009) and public domain code from [2] (released in 2016). [1] https://www.intel.com/content/dam/www/public/us/en/documents/white-papers/fast-crc-computation-generic-polynomials-pclmulqdq-paper.pdf [2] https://github.com/rawrunprotected/crc	2022-11-14 23:05:46 +02:00
Lasse Collin	eb0f1450ad	liblzma: Use __attribute__((__constructor__)) if available. This uses it for CRC table initializations when using --disable-small. It avoids mythread_once() overhead. It also means that then --disable-small --disable-threads is thread-safe if this attribute is supported.	2022-11-14 16:00:52 +02:00
Lasse Collin	48dde3bab9	liblzma: Silence -Wconversion warning from crc64_fast.c.	2022-10-31 11:54:44 +02:00
Ed Maste	865e0a3689	liblzma: Use non-executable stack on FreeBSD as on Linux	2022-02-22 01:23:34 +02:00
H.J. Lu	4fd79b90c5	liblzma: Enable Intel CET in x86 CRC assembly codes When Intel CET is enabled, we need to include <cet.h> in assembly codes to mark Intel CET support and add _CET_ENDBR to indirect jump targets. Tested on Intel Tiger Lake under CET enabled Linux.	2020-12-23 17:13:33 +02:00
Lasse Collin	b8e12f5ab4	Typo fixes from fossies.org. https://fossies.org/linux/misc/xz-5.2.5.tar.xz/codespell.html	2020-03-23 18:07:50 +02:00
Lasse Collin	5e78fcbf2e	Rename read32ne to aligned_read32ne, and similarly for the others. Using the aligned methods requires more care to ensure that the address really is aligned, so it's nicer if the aligned methods are prefixed. The next commit will remove the unaligned_ prefix from the unaligned methods which in liblzma are used in more places than the aligned ones.	2019-12-31 00:29:48 +02:00
Lasse Collin	a12b13c5f0	liblzma: Silence clang -Wmissing-variable-declarations.	2019-06-24 23:45:21 +03:00
Lasse Collin	ac398c3baf	liblzma: Disable external SHA-256 by default. This is the sane thing to do. The conflict with OpenSSL on some OSes and especially that the OS-provided versions can be significantly slower makes it clear that it was a mistake to have the external SHA-256 support enabled by default. Those who want it can now pass --enable-external-sha256 to configure. INSTALL was updated with notes about OSes where this can be a bad idea. The SHA-256 detection code in configure.ac had some bugs that could lead to a build failure in some situations. These were fixed, although it doesn't matter that much now that the external SHA-256 is disabled by default. MINIX >= 3.2.0 uses NetBSD's libc and thus has SHA256_Init in libc instead of libutil. Support for the libutil version was removed.	2016-03-13 20:21:49 +02:00
Lasse Collin	c6bf438ab3	liblzma: Fix a build failure related to external SHA-256 support. If an appropriate header and structure were found by configure, but a library with a usable SHA-256 functions wasn't, the build failed.	2015-11-02 18:16:51 +02:00
Lasse Collin	5dcffdbcc2	liblzma: SHA-256: Optimize the Maj macro slightly. The Maj macro is used where multiple things are added together, so making Maj a sum of two expressions allows some extra freedom for the compiler to schedule the instructions. I learned this trick from <http://www.hackersdelight.org/corres.txt>.	2014-08-03 21:32:25 +03:00
Lasse Collin	a9477d1e0c	liblzma: SHA-256: Optimize the way rotations are done. This looks weird because the rotations become sequential, but it helps quite a bit on both 32-bit and 64-bit x86: - It requires fewer instructions on two-operand instruction sets like x86. - It requires one register less which matters especially on 32-bit x86. I hope this doesn't hurt other archs. I didn't invent this idea myself, but I don't remember where I saw it first.	2014-08-03 21:08:12 +03:00
Lasse Collin	5a76c7c8ee	liblzma: SHA-256: Remove the GCC #pragma that became unneeded. The unrolling in the previous commit should avoid the situation where a compiler may think that an uninitialized variable might be accessed.	2014-08-03 20:38:13 +03:00
Lasse Collin	9a096f8e57	liblzma: SHA-256: Unroll a little more. This way a branch isn't needed for each operation to choose between blk0 and blk2, and still the code doesn't grow as much as it would with full unrolling.	2014-08-03 20:33:38 +03:00
Lasse Collin	bc7650d87b	liblzma: SHA-256: Do the byteswapping without a temporary buffer.	2014-08-03 19:56:43 +03:00
Lasse Collin	e28528f1c8	liblzma: Remove a useless C99ism from sha256.c. Unsurprisingly it makes no difference in compiled output.	2014-01-12 12:50:30 +02:00
Lasse Collin	3e62c68d75	Fix typos in comments.	2014-01-12 12:11:36 +02:00
Lasse Collin	ab22562066	A few typo fixes to comments and the xz man page. Thanks to Jim Meyering.	2012-08-24 16:27:31 +03:00
Lasse Collin	b94aa0c838	liblzma: Try to use SHA-256 from the operating system. If the operating system libc or other base libraries provide SHA-256, use that instead of our own copy. Note that this doesn't use OpenSSL or libgcrypt or such libraries to avoid creating dependencies to other packages. This supports at least FreeBSD, NetBSD, OpenBSD, Solaris, MINIX, and Darwin. They all provide similar but not identical SHA-256 APIs; everyone is a little different. Thanks to Wim Lewis for the original patch, improvements, and testing.	2011-05-21 15:08:44 +03:00
Lasse Collin	b34c5ce4b2	liblzma: Use TUKLIB_GNUC_REQ to check GCC version in sha256.c.	2011-04-05 22:41:33 +03:00
Lasse Collin	4785f2021a	Fix jl -> jb in ASM files.	2010-02-12 12:41:20 +02:00
Lasse Collin	6b50c9429b	Use __APPLE__ instead of __MACH__ in ASM files. This allows the files to work on HURD. Thanks to Jonathan Nieder.	2010-02-12 12:31:22 +02:00
Lasse Collin	f1a28b96c9	Add missing consts to pointer casts.	2009-11-22 12:05:33 +02:00
Lasse Collin	ebfb2c5e1f	Use a tuklib module for integer handling. This replaces bswap.h and integer.h. The tuklib module uses <byteswap.h> on GNU, <sys/endian.h> on *BSDs and <sys/byteorder.h> on Solaris, which may contain optimized code like inline assembly.	2009-10-04 22:57:12 +03:00
Lasse Collin	c5f68b5cc7	Make liblzma produce the same output on both endiannesses. Seems that it is a problem in some cases if the same version of XZ Utils produces different output on different endiannesses, so this commit fixes that problem. The output will still vary between different XZ Utils versions, but I cannot avoid that for now. This commit bloatens the code on big endian systems by 1 KiB, which should be OK since liblzma is bloated already. ;-)	2009-10-02 11:03:26 +03:00
Lasse Collin	655457b9ad	Revert 43f44160b1ddcbf7e5205c37db09b3bebe7226f9 and use a fix that works on all systems using GNU assembler. Maybe the assembler code is used e.g. on Solaris x86 but let's worry about it if this doesn't work on it.	2009-08-31 21:59:25 +03:00
Lasse Collin	43f44160b1	Fix x86 assembler on GCC 3. Thanks to Karl Berry.	2009-08-29 13:35:23 +03:00
Lasse Collin	f42ee98166	Build system fixes Don't use libtool convenience libraries to avoid recently discovered long-standing subtle but somewhat severe bugs in libtool (at least 1.5.22 and 2.2.6 are affected). It was found when porting XZ Utils to Windows <http://lists.gnu.org/archive/html/libtool/2009-06/msg00070.html> but the problem is significant also e.g. on GNU/Linux. Unless --disable-shared is passed to configure, static library built from a set of convenience libraries will contain PIC objects. That is, while libtool builds non-PIC objects too, only PIC objects will be used from the convenience libraries. On 32-bit x86 (tested on mobile XP2400+), using PIC instead of non-PIC makes the decompressor 10 % slower with the default CFLAGS. So while xz was linked against static liblzma by default, it got the slower PIC objects unless --disable-shared was used. I tend develop and benchmark with --disable-shared due to faster build time, so I hadn't noticed the problem in benchmarks earlier. This commit also adds support for building Windows resources into liblzma and executables.	2009-06-30 17:09:57 +03:00
Lasse Collin	b2b1f86753	Hopefully improved portability of the assembler code in Autotools based builds on Windows.	2009-06-27 00:43:06 +03:00
Lasse Collin	390a640856	Basic support for building with Cygwin and MinGW using the Autotools based build system. It's not good yet, more fixes will follow.	2009-06-26 15:37:53 +03:00
Lasse Collin	1c9360b7d1	Fix @variables@ to $(variables) in Makefile.am files. Fix the ordering of libgnu.a and LTLIBINTL on the linker command line and added missing LTLIBINTL to tests/Makefile.am.	2009-06-26 14:47:31 +03:00
Lasse Collin	a6f43e6412	Use a GCC-specific #pragma instead of GCC-specific -Wno-uninitialized to silence a bogus warning.	2009-05-02 16:16:28 +03:00
Lasse Collin	02ddf09bc3	Put the interesting parts of XZ Utils into the public domain. Some minor documentation cleanups were made at the same time.	2009-04-13 11:27:40 +03:00
Lasse Collin	96c46df7de	Improve support for DOS-like systems. Here DOS-like means DOS, Windows, and OS/2.	2009-02-13 17:29:02 +02:00
Lasse Collin	a3bbbe05d3	Let the user specify custom CFLAGS on the make command line. Previously custom CFLAGS worked only when they were passed to configure.	2009-02-09 14:54:31 +02:00
Lasse Collin	0e27028d74	Add a separate internal function to initialize the CRC32 table, which is used also by LZ encoder. This was needed because calling lzma_crc32() and ignoring the result is a no-op due to lzma_attr_pure.	2009-02-08 18:24:50 +02:00
Lasse Collin	bfd91198e4	Support LZMA_API_STATIC in assembler files to avoid __declspec(dllexport) equivalent.	2009-02-07 15:55:47 +02:00
Lasse Collin	99c1c2abfa	Updated the x86 assembler code: - Use call/ret pair to get instruction pointer for PIC. - Use PIC only if PIC or __PIC__ is #defined. - The code should work on MinGW and Darwin in addition to GNU/Linux and Solaris.	2009-02-02 21:19:01 +02:00
Lasse Collin	22a0c6dd94	Modify LZMA_API macro so that it works on Windows with other compilers than MinGW. This may hurt readability of the API headers slightly, but I don't know any better way to do this.	2009-02-02 20:14:03 +02:00
Lasse Collin	f54bcf6f80	Remove dangling crc64_init.c.	2009-01-30 00:29:58 +02:00
Lasse Collin	449b8c832b	Regenerate the CRC tables without trailing blanks.	2009-01-26 20:09:17 +02:00
Jim Meyering	850f740042	remove trailing blanks from all but .xz files	2009-01-26 20:01:51 +02:00
Lasse Collin	7ed9d943b3	Remove lzma_init() and other init functions from liblzma API. Half of developers were already forgetting to use these functions, which could have caused total breakage in some future liblzma version or even now if --enable-small was used. Now liblzma uses pthread_once() to do the initializations unless it has been built with --disable-threads which make these initializations thread-unsafe. When --enable-small isn't used, liblzma currently gets needlessly linked against libpthread (on systems that have it). While it is stupid for now, liblzma will need threads in future anyway, so this stupidity will be temporary only. When --enable-small is used, different code CRC32 and CRC64 is now used than without --enable-small. This made the resulting binary slightly smaller, but the main reason was to clean it up and to handle the lack of lzma_init_check(). The pkg-config file lzma.pc was renamed to liblzma.pc. I'm not sure if it works correctly and portably for static linking (Libs.private includes -pthread or other operating system specific flags). Hopefully someone complains if it is bad. lzma_rc_prices[] is now included as a precomputed array even with --enable-small. It's just 128 bytes now that it uses uint8_t instead of uint32_t. Smaller array seemed to be at least as fast as the more bloated uint32_t array on x86; hopefully it's not bad on other architectures.	2008-12-31 00:30:49 +02:00

1 2 3

112 Commits