root/xz - xz - Root on GIT

root/xz

mirror of https://git.tukaani.org/xz.git synced 2026-04-09 09:38:01 +00:00

Author	SHA1	Message	Date
Lasse Collin	5ae7958dbc	Windows: Disable MinGW-w64's stdio functions in size-optimized builds This only affects builds with UCRT. With legacy MSVCRT, the replacement functions are always enabled. Omitting the MinGW-w64 replacements saves over 20 KiB per executable. The downside is that --enable-small or XZ_SMALL=ON disables thousand separator support in xz messages. If someone is OK with the slower speed of slightly smaller builds, lack of thousand separators won't matter. Don't override __USE_MINGW_ANSI_STDIO if it is already defined (via CPPFLAGS or such method). (cherry picked from commit 4eae859ae8ad7072eaa74aeaee79a2c3c12c55cb)	2025-01-22 15:39:57 +02:00
Lasse Collin	f4d988bc04	liblzma: Fix the encoder breakage on big endian ARM64 When the 8-byte method was enabled for ARM64, a check for endianness wasn't added. This broke the LZMA/LZMA2 encoder. Test suite caught it. Fixes: cd64dd70d5665b6048829c45772d08606f44672e Co-authored-by: Marcus Comstedt <marcus@mc.pp.se> (cherry picked from commit 150356207c8d6a3e0af465b676430d19d62f884c)	2025-01-12 13:11:07 +02:00
Lasse Collin	e22c4fb259	Windows: Update manifest comments about long UTF-8 filenames (cherry picked from commit b01b0958025a2da284b53a583f313f8140636cb5)	2025-01-12 13:11:06 +02:00
Lasse Collin	8eaf2cb58e	liblzma: Always validate the first digit of a preset string lzma_str_to_filters() may call parse_lzma12_preset() in two ways. The call from str_to_filters() detects the string type from the first character(s) and as a side-effect it validates the first digit of the preset string. So this change makes no difference there. However, the call from parse_options() doesn't pre-validate the string. parse_lzma12_preset() will return an invalid value which is passed to lzma_lzma_preset() which safely rejects it. The bug still affects the the error message: $ xz --filters=lzma2:preset=X xz: Error in --filters=FILTERS option: xz: lzma2:preset=X xz: ^ xz: Unsupported preset After the fix: $ xz --filters=lzma2:preset=X xz: Error in --filters=FILTERS option: xz: lzma2:preset=X xz: ^ xz: Unsupported preset The ^ now correctly points to the X and not past it because the X itself is the problematic character. Fixes: cedeeca2ea6ada5b0411b2ae10d7a859e837f203 (cherry picked from commit 75107217670a97b7b772833669d88c3c2f188e37)	2025-01-05 12:58:46 +02:00
Lasse Collin	9b5a5c39f4	xz: Fix getopt_long argument type in --filters* Forgetting the argument (or not using = to separate the option from the argument) resulted in lzma_str_to_filters() being called with NULL as input string argument. The function handles it fine but xz passes the NULL to printf() too: $ xz --filters xz: Error in --filters=FILTERS option: xz: (null) xz: ^ xz: Unexpected NULL pointer argument(s) to lzma_str_to_filters() Now it's correct: $ xz --filters xz: option '--filters' requires an argument The --filters-help option doesn't take any arguments. Fixes: 9ded880a0221f4d1256845fc4ab957ffd377c760 Fixes: d6af7f347077b22403133239592e478931307759 Fixes: a165d7df1964121eb9df715e6f836a31c865beef (cherry picked from commit 52ff32433734d03befd85a5bf00fba77d6501455)	2025-01-05 11:42:02 +02:00
Lasse Collin	4ddf32c92b	xzdec: Don't leave Landlock file descriptor open for no reason This fix is similar to 48ff3f06521ca326996ab9a04d1b342098960427. Fixes: d74fb5f060b76db709b50f5fd37490394e52f975 (cherry picked from commit 2655c81b5e92278b0fd51f6537c1116f8349b02a)	2025-01-04 20:09:42 +02:00
Lasse Collin	1a5b93ed57	liblzma: Silence warnings from "clang -Wimplicit-fallthrough" (cherry picked from commit 672da29bb3a209a727ae46c0df948d7eea69f2e2)	2025-01-02 15:48:57 +02:00
Lasse Collin	33899ee86d	xzdec: Fix language in a comment (cherry picked from commit e34dbd6a0ae7a560a5508d51fc0bd142c5a320dc)	2025-01-02 15:48:31 +02:00
Lasse Collin	5a208b0c92	Windows: Make NLS require UCRT and gettext-runtime >= 0.23.1 Also remove the recently-added workaround from tuklib_gettext.h. Requiring a new enough gettext-runtime is cleaner. I guess it's mostly MSYS2 where xz is built with translation support, so once MSYS2 has Gettext >= 0.23.1, this requirement shouldn't be a problem in practice. (cherry picked from commit 16821252c504071f5c2012e415e59cbf5fb79820)	2025-01-02 15:42:13 +02:00
Lasse Collin	89db6aacbf	xz man page: Describe the source file deletion in -z and -d options The DESCRIPTION section always explained it, and the OPTIONS section only described the differences to the default behavior. However, new users in a hurry may skip reading DESCRIPTION. The default behavior is a bit dangerous, thus it's good to repeat in --compress and --decompress docs that source file is removed after successful operation. Fixes: https://github.com/tukaani-project/xz/issues/150 (cherry picked from commit 653732bd6f06d8f465bf353bf6e1c16f1405b906)	2024-12-30 10:52:44 +02:00
Lasse Collin	3324ea3576	xz: Fix comments (cherry picked from commit 260d5d36203955a7148ae1ab05d0931c942028d5)	2024-12-27 09:15:33 +02:00
Lasse Collin	8a7d922fb8	Windows: Workaround a UTF-8 issue in Gettext's libintl_setlocale() See the comment. In this package, locale is set at program startup and not changed later, so the point (2) in the comment isn't a problem. Fixes: 46ee0061629fb075d61d83839e14dd193337af59 (cherry picked from commit b40e3321a7fb9dfdf8ffb30e7e0788c2f0abc941)	2024-12-20 16:35:13 +02:00
Lasse Collin	dcdd40cacc	Revert "Windows: Use UTF-8 locale when active code page is UTF-8" This reverts commit 0d0b574cc45045d6150d397776340c068df59e2a. (cherry picked from commit bc4165da92b56668ddd1b7014b3488a0fad1733a)	2024-12-20 16:35:13 +02:00
Lasse Collin	f8e42ed44d	xzdec: Use setlocale() instead of tuklib_gettext_setlocale() xzdec isn't translated and doesn't need libintl on Windows even when NLS is enabled, thus libintl_setlocale() cannot interfere with the locale settings. Thus, standard setlocale() works perfectly. In the commit 78868b6e, the explanation in the commit message is wrong. Fixes: 78868b6ed63fa4c89f73e3dfed27abfb8b0d46db (cherry picked from commit d6796f9ce5359faaaed82926c1735aee3694430f)	2024-12-20 16:35:13 +02:00
Lasse Collin	3ed40b9f87	Windows: Revert the setlocale(LC_ALL, ".UTF8") documentation Only leave the FindFileFirstA() notes from 20dfca81, reverting the incorrect setlocale() notes. On Windows, Gettext's <libintl.h> overrides setlocale() with libintl_setlocale() wrapper. I hadn't noticed this, and thus my conclusions were wrong. Fixes: 20dfca8171dad4c64785ac61d5b68972c444877b (cherry picked from commit e607329a615759f1519016595dd38df7c89208f2)	2024-12-20 16:35:12 +02:00
Lasse Collin	4e0ebbabe4	tuklib_mbstr_width: Change the behavior when wcwidth() is not available If wcwidth() isn't available (Windows), previously it was assumed that one byte == one column in the terminal. Now it is assumed that one multibyte character == one column. This works better with UTF-8. Languages that only use single-width characters without any combining characters should work correctly with this. In xz, none of po/*.po contain combining characters and only ko.po, zh_CN.po, and zh_TW.po contain fullwidth characters. Thus, "only" those three translations in xz are broken on Windows with the UTF-8 code page. Broken means that column headings in xz -lvv and (only in the master branch) strings in --long-help are misaligned, so it's not a huge problem. I don't know if those three languages displayed perfectly before the UTF-8 change because I hadn't tested translations with native Windows builds before. Fixes: 46ee0061629fb075d61d83839e14dd193337af59 (cherry picked from commit b797c44c42ea54fe1c52722a2fca0c9618575598)	2024-12-18 19:22:01 +02:00
Lasse Collin	4ff609adb0	xzdec: Use setlocale() via tuklib_gettext_setlocale() xzdec isn't translated and didn't have locale-specific behavior in the past. On Windows with UTF-8 in the application manifest, setting the locale makes a difference though: - Without any setlocale() call, non-ASCII filenames don't display properly in Command Prompt unless one first uses "chcp 65001" to set the console code page to UTF-8. - setlocale(LC_ALL, "") is enough to make non-ASCII filenames print correctly in Command Prompt without using "chcp 65001", assuming that the non-UTF-8 code page (like 850) supports those non-ASCII characters. - setlocale(LC_ALL, ".UTF8") is even better because then mbrtowc() and such functions use an UTF-8 locale instead of a legacy code page. The tuklib_gettext_setlocale() macro takes care of this (without enabling any translations). Fixes: 46ee0061629fb075d61d83839e14dd193337af59 (cherry picked from commit 78868b6ed63fa4c89f73e3dfed27abfb8b0d46db)	2024-12-18 19:22:00 +02:00
Lasse Collin	4e7a48bf15	Windows: Use UTF-8 locale when active code page is UTF-8 XZ Utils 5.6.3 set the active code page to UTF-8 to fix CVE-2024-47611. This wasn't paired with UCRT-specific setlocale(LC_ALL, ".UTF8"), thus non-ASCII characters from translations became mojibake. Fixes: 46ee0061629fb075d61d83839e14dd193337af59 (cherry picked from commit 0d0b574cc45045d6150d397776340c068df59e2a)	2024-12-18 19:22:00 +02:00
Lasse Collin	d20e4115e1	Windows: Document the need for setlocale(LC_ALL, ".UTF8") Also warn about unpaired surrogates and (somewhat UTF-8-specific) MAX_PATH issue in FindFirstFileA(). Fixes: 46ee0061629fb075d61d83839e14dd193337af59 (cherry picked from commit 20dfca8171dad4c64785ac61d5b68972c444877b)	2024-12-18 19:22:00 +02:00
Lasse Collin	f9f0cdae8a	xzdec: Call tuklib_progname_init() early enough If the early pledge() call on OpenBSD fails, it calls my_errorf() which requires the "progname" variable. Fixes: d74fb5f060b76db709b50f5fd37490394e52f975 (cherry picked from commit 4e936f234056e5831013ed922145b666b04bb1e3)	2024-12-18 19:22:00 +02:00
Dexter Castor Döpping	d86fa15b72	liblzma: Fix incorrect macro name in a comment Fixes: 33b8a24b6646a9dbfd8358405aec466b13078559 Closes: https://github.com/tukaani-project/xz/pull/155 (cherry picked from commit bee0c044d30a6ad3b3d94901c27e7519f6f46e27)	2024-12-18 19:22:00 +02:00
Mark Wielaard	d9c2e7572b	xz: Landlock: Fix a file descriptor leak (cherry picked from commit 48ff3f06521ca326996ab9a04d1b342098960427)	2024-12-18 19:21:59 +02:00
Lasse Collin	9331ce4009	Bump version and soname for 5.6.3	2024-10-01 12:50:28 +03:00
Lasse Collin	bf518b9ba4	Windows: Embed an application manifest in the EXE files IMPORTANT: This includes a security fix to command line tool argument handling. Some toolchains embed an application manifest by default to declare UAC-compliance. Some also declare compatibility with Vista/8/8.1/10/11 to let the app access features newer than those of Vista. We want all the above but also two more things: - Declare that the app is long path aware to support paths longer than 259 characters (this may also require a registry change). - Force the code page to UTF-8. This allows the command line tools to access files whose names contain characters that don't exist in the current legacy code page (except unpaired surrogates). The UTF-8 code page also fixes security issues in command line argument handling which can be exploited with malicious filenames. See the new file w32_application.manifest.comments.txt. Thanks to Orange Tsai and splitline from DEVCORE Research Team for discovering this issue. Thanks to Vijay Sarvepalli for reporting the issue to me. Thanks to Kelvin Lee for testing with MSVC and helping with the required build system fixes. (cherry picked from commit 46ee0061629fb075d61d83839e14dd193337af59)	2024-10-01 12:16:29 +03:00
Lasse Collin	5718ce932e	Windows: Set DLL name accurately in StringFileInfo on Cygwin and MSYS2 Now the information in the "Details" tab in the file properties dialog matches the naming convention of Cygwin and MSYS2. This is only a cosmetic change. (cherry picked from commit dad153091552b52a41b95ec4981c6951f1cae487)	2024-10-01 12:16:28 +03:00
Lasse Collin	e77c0ca61d	common_w32res.rc: White space edits LANGUAGE and VS_VERSION_INFO begin new statements so put an empty line between them. (cherry picked from commit 8940ecb96fe9f0f2a9cfb8b66fe9ed31ffbea904)	2024-10-01 12:16:28 +03:00
Tobias Stoeckmann	aef9a25b32	lzmainfo: Avoid integer overflow The MB output can overflow with huge numbers. Most likely these are invalid .lzma files anyway, but let's avoid garbage output. lzmadec was adapted from LZMA Utils. The original code with this bug was written in 2005, over 19 years ago. Co-authored-by: Lasse Collin <lasse.collin@tukaani.org> Closes: https://github.com/tukaani-project/xz/pull/144 (cherry picked from commit 76cfd0a9bb33ae8e534b1f73f6359dc825589f2f)	2024-09-18 20:53:11 +03:00
Tobias Stoeckmann	40a7f163f5	xzdec: Remove unused short option -M "xzdec -M123" exited with exit status 1 without printing any messages. The "M:" entry should have been removed when the memory usage limiter support was removed from xzdec. Fixes: 792331bdee706aa852a78b171040ebf814c6f3ae Closes: https://github.com/tukaani-project/xz/pull/143 [ Lasse: Commit message edits ] (cherry picked from commit 78355aebb7fb654302e5e33692ba109909dacaff)	2024-09-18 20:53:11 +03:00
Yifeng Li	3a4a05d75e	liblzma: Fix x86-64 movzw compatibility in range_decoder.h Support for instruction "movzw" without suffix in "GNU as" was added in commit [1] and stabilized in binutils 2.27, released in August 2016. Earlier systems don't accept this instruction without a suffix, making range_decoder.h's inline assembly unable to build on old systems such as Ubuntu 16.04, creating error messages like: lzma_decoder.c: Assembler messages: lzma_decoder.c:371: Error: no such instruction: `movzw 2(%r11),%esi' lzma_decoder.c:373: Error: no such instruction: `movzw 4(%r11),%edi' lzma_decoder.c:388: Error: no such instruction: `movzw 6(%r11),%edx' lzma_decoder.c:398: Error: no such instruction: `movzw (%r11,%r14,4),%esi' Change "movzw" to "movzwl" for compatibility. [1] https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=c07315e0c610e0e3317b4c02266f81793df253d2 Suggested-by: Lasse Collin <lasse.collin@tukaani.org> Tested-by: Yifeng Li <tomli@tomli.me> Signed-off-by: Yifeng Li <tomli@tomli.me> Fixes: 3182a330c1512cc1f5c87b5c5a272578e60a5158 Fixes: https://github.com/tukaani-project/xz/issues/121 Closes: https://github.com/tukaani-project/xz/pull/136 (cherry picked from commit 6cd7c8607843c337edfe2c472aa316602a393754)	2024-09-06 19:33:20 +03:00
Lasse Collin	9edddda563	liblzma: Tweak a comment (cherry picked from commit 7c292dd0bf23cefcdf4b1509f3666322e08a7ede)	2024-09-06 19:33:20 +03:00
Lasse Collin	0f47db18d0	xz: Remove the TODO comment about --recursive It won't be implemented. find + xargs is more flexible, for example, it allows compressing small files in parallel. An example for that has been included in the xz man page since 2010. (cherry picked from commit baecfa142644eb5f5c6dd6f8e2f531c362fa3747)	2024-09-06 19:31:12 +03:00
Lasse Collin	ff697eb154	liblzma: CRC CLMUL: Omit is_arch_extension_supported() when not needed On E2K the function compiles only due to compiler emulation but the function is never used. It's cleaner to omit the function when it's not needed even though it's a "static inline" function. Thanks to Ilya Kurdyukov. (cherry picked from commit 30a2d5d51006301a3ddab5ef1f5ff0a9d74dce6f)	2024-09-06 19:00:30 +03:00
Lasse Collin	a44493ec41	xz: Fix white space (cherry picked from commit c7164b1927e3fe7cdba70ee4687e1a590a81043b)	2024-09-06 18:56:17 +03:00
Lasse Collin	5e74a6a813	liblzma: Fix a typo in a comment Thanks to Sam James for spotting it. Fixes: f644473a211394447824ea00518d0a214ff3f7f2 (cherry picked from commit 0a32d2072c598de281058b26dc08920fbf0cd2a1)	2024-09-06 18:56:17 +03:00
Lasse Collin	3f7edc673c	liblzma: Fix a comment indentation (cherry picked from commit afd9b4d282a10186808c3331dad4caf79c02d55f)	2024-09-06 18:56:16 +03:00
Lasse Collin	8a9cc7ca08	liblzma: Fix white space (cherry picked from commit 50e6bff274568c568930e15094da8217e7d47d28)	2024-09-06 18:56:16 +03:00
RainRat	b29b13082f	Fix typos Closes: https://github.com/tukaani-project/xz/pull/124 (cherry picked from commit 9e73918a4f14be754a23f74dda45ca431939a4a0)	2024-09-06 18:51:59 +03:00
Lasse Collin	6f66155e01	tuklib_integer: Fix building on OpenBSD/sparc64 that uses GCC 4.2 GCC 4.2 doesn't have __builtin_bswap16() and friends so tuklib_integer.h tries to use OS-specific byte swap methods instead. On OpenBSD those macros are swap16/32/64 instead of bswap16/32/64 like on other BSDs and Darwin. An alternative to "#ifdef __OpenBSD__" could be "#ifdef swap16" as it is a macro. But since OpenBSD seems to be a special case under this special case of "BSDs and Darwin", checking for __OpenBSD__ seems the more conservative choice now. Thanks to Christian Weisgerber and Brad Smith who both submitted the same patch a few hours apart. Co-authored-by: Christian Weisgerber <naddy@mips.inka.de> Co-authored-by: Brad Smith <brad@comstyle.com> Closes: https://github.com/tukaani-project/xz/pull/126 (cherry picked from commit 04b23addf3733873667675df2439725f076c2f36)	2024-09-06 18:51:59 +03:00
Sam James	dc6b6011b4	xz: list: suppress -Wformat-nonliteral for Solaris Solaris' GCC can't understand that our use is fine, unlike modern compilers: ``` list.c: In function 'print_totals_basic': list.c:1191:4: error: format not a string literal, argument types not checked [-Werror=format-nonliteral] uint64_to_str(totals.files, 0)); ^~~~~~~~~~~~~ cc1: all warnings being treated as errors ``` It's presumably because of older gettext missing format attributes. This is with `gcc (GCC) 7.3.0`. (cherry picked from commit b69768c8bd1a34fde311935c551d061ba52d9a3f)	2024-09-06 18:51:56 +03:00
Lasse Collin	3ec664d3f6	Bump version and soname for 5.6.2	2024-05-29 18:03:51 +03:00
Lasse Collin	8fda5ce872	Fix typos Thanks to xx on #tukaani. (cherry picked from commit 4e9023857d287f624562156b60dc23d2b64c0f10)	2024-05-23 11:36:05 +03:00
Lasse Collin	2729079bcb	liblzma: Fix white space Thanks to xx on #tukaani. (cherry picked from commit b14d08fbbc254485ace9ccfe7908674f608a62ae)	2024-05-23 11:36:05 +03:00
Lasse Collin	a289c4dfeb	xz: Document the static function get_chains_memusage() (cherry picked from commit 142e670a413a7bce1a2647f1cf1f33f8ee2dbe88)	2024-05-23 11:28:20 +03:00
Lasse Collin	6f0db31713	xz: Rename filters_memusage_max() to get_chains_memusage() (cherry picked from commit 78e984399a64bfee5d11e7308e0bdbc1006db2ca)	2024-05-23 11:28:20 +03:00
Lasse Collin	d7e2bf7e2d	xz: Rename filter_memusages to chains_memusages (cherry picked from commit 54c3db0a83d3e67d89aba92a0957f2dce9b111a7)	2024-05-23 11:28:20 +03:00
Lasse Collin	58f200b6d1	xz: Simplify the memory usage scaling code This is closer to what it was before the --filtersX support was added, just extended to support for scaling all filter chains. The method before this commit was an extended version of the original too but it was done in a more complex way for no clear reason. In case of an error, the complex version printed fewer informative messages (a good thing) but it's not a sigificant benefit. In the limit is too low even for single-threaded mode, the required amount of memory is now reported like in 5.4.x instead of like in 5.5.1alpha - 5.6.1 which showed the original non-scaled usage. It had been a FIXME in the old code but it's not clear what message makes the most sense. Fixes: 5f0c5a04388f8334962c70bc37a8c2ff8f605e0a (cherry picked from commit d9e1ae79ec90d6a7eafeaceaf0ece4f0c83d4417)	2024-05-23 11:28:20 +03:00
Lasse Collin	41bdc9fa5c	xz: Edit comments (cherry picked from commit 0ee56983d198b776878432703de664049b1be32e)	2024-05-23 11:28:20 +03:00
Lasse Collin	52e40c1912	xz: Rename chain_idx to chain_num (cherry picked from commit ec82a49c3553f7206104582dbfb8b64fa433b491)	2024-05-23 11:28:20 +03:00
Lasse Collin	8a01963331	xz: Edit coding style (cherry picked from commit a731a6993c34bbbd55abaf9c166718682b1da24f)	2024-05-23 11:28:20 +03:00
Lasse Collin	e3ad7eda74	xz: Edit comments Fixes: 5f0c5a04388f8334962c70bc37a8c2ff8f605e0a (cherry picked from commit 32eb176b89243fce3112347fe43a8ad14a9fd2be)	2024-05-23 11:28:20 +03:00

1 2 3 4 5 ...

1371 Commits