serenity

mirror of https://github.com/RGBCube/serenity synced 2025-10-29 11:22:07 +00:00

Author	SHA1	Message	Date
Timothy Flynn	eed956b473	AK: Increase LittleEndianOutputBitStream's buffer size and remove loops This is very similar to the LittleEndianInputBitStream bit buffer change from `8e834d4bb2`. We currently buffer one byte of data for the underlying stream. And when we put bits onto that buffer, we do so 1 bit at a time. This replaces the u8 buffer with a u64. And instead of looping at all, we perform bitwise operations to write the desired number of bits. Using the "enwik8" file as a test (100MB uncompressed, commonly used in benchmarks: https://www.mattmahoney.net/dc/enwik8.zip), compression time decreases from: 13.62s to 10.9s on Serenity (cold); 13.62s to 9.22s on Serenity (warm); 2.93s to 2.32s on Linux. One caveat is that this requires explicitly flushing any leftover bits when the caller is done with the stream. The byte buffer implementation implicitly flushed its data every time the buffer was byte-aligned, as doing so would always fill the byte. This is no longer the case. But for now, this should be fine as the one user of this class, DEFLATE, already has a "flush everything now that we're done" finalizer.	2023-04-02 10:54:37 +02:00
Nico Weber	85d0637058	LibCompress: Make CanonicalCode::from_bytes() return ErrorOr<> No intended behavior change.	2023-04-02 06:19:46 +02:00
Nico Weber	bc70d7bb77	LibCompress: Reduce indentation in CompressedBlock::try_read_more() ...by removing `else` after `return`. No behavior change.	2023-04-01 13:57:39 +02:00
Nico Weber	c3b8b3124c	LibCompress: Remove two needless heap allocations	2023-03-31 08:44:30 -06:00
Timothy Flynn	8b56d82865	AK+LibCompress: Remove the Deflate back-reference intermediate buffer Instead of reading bytes from the output stream into a buffer, just to immediately write them back out, we can skip the middle-man and copy the bytes directly into the output buffer.	2023-03-31 06:56:11 +02:00
Timothy Flynn	9f238793e0	gunzip+LibCompress: Increase buffer sizes used by Deflate and gunzip Co-authored-by: Andreas Kling <kling@serenityos.org>	2023-03-31 06:56:11 +02:00
Timothy Flynn	7447a91d7e	LibCompress: Decode non-self-referencing back-references in one shot We currently decode back-references one byte at a time, while writing that byte back out to the output buffer. This is only necessary when the back-reference refers to itself, i.e. when the back-reference distance is less than its length. In other cases, we can read the entire back-reference block in one shot. Using the "enwik8" file as a test (100MB uncompressed, commonly used in benchmarks: https://www.mattmahoney.net/dc/enwik8.zip), decompression time decreases from: 5.8s to 4.89s on Serenity (cold); 2.3s to 1.72s on Serenity (warm); 1.6s to 1.06s on Linux.	2023-03-29 13:22:11 +01:00
Timothy Flynn	5aaefe4e62	LibCompress: Use prefix tables to decode Huffman codes up to 8 bits long Huffman codes have a useful property in that they are prefix codes. That is, the set of bits representing a Huffman-coded symbol is never a prefix of another symbol's code. This allows us to create a table in which every index whose leading bits match a Huffman code maps to that code's symbol. With Deflate, we can have codes up to 16 bits in length, which would require a prefix table with 2^16 entries. So instead of creating a table to fit all possible codes, we use a cutoff of 8-bit codes. Codes larger than 8 bits fall back to the binary search method. Using the "enwik8" file as a test (100MB uncompressed, commonly used in benchmarks: https://www.mattmahoney.net/dc/enwik8.zip), decompression time decreases from 3.527s to 2.585s on Linux.	2023-03-29 07:19:14 +02:00
Timothy Flynn	20aaab47f9	LibCompress: Use a bit stream for the entire GZIP decompression process We currently mix normal and bit streams during GZIP decompression, where the latter is a wrapper around the former. This isn't causing issues now as the underlying bit stream buffer is a byte, so the normal stream can pick up where the bit stream left off. In order to increase the size of that buffer though, the normal stream will not be able to assume it can resume reading after the bit stream. The buffer can easily contain more bits than it was meant to read, so when the normal stream resumes, there may be N bits leftover in the bit stream that the normal stream was meant to read. To avoid weird behavior when mixing streams, this changes the GZIP decompressor to always read from a bit stream.	2023-03-29 07:19:14 +02:00
Andreas Kling	aeb8224ec8	LibCompress: Speed up deflate decompression by ~11% ...simply by using LittleEndianInputBitStream::read_bit() instead of read_bits(1). This puts us on the fast path for single-bit reads. There's still lots of money on the table for bigger optimizations to claim here, just picking an embarrassingly low-hanging fruit. :^)	2023-03-24 17:08:35 +01:00
Tim Schumacher	ae51c1821c	Everywhere: Remove unintentional partial stream reads and writes	2023-03-13 15:16:20 +00:00
Tim Schumacher	ecd1862859	AK: Rename Stream::write_entire_buffer to Stream::write_until_depleted No functional changes.	2023-03-13 15:16:20 +00:00
Tim Schumacher	a3f73e7d85	AK: Rename Stream::read_entire_buffer to Stream::read_until_filled No functional changes.	2023-03-13 15:16:20 +00:00
Tim Schumacher	d5871f5717	AK: Rename Stream::{read,write} to Stream::{read_some,write_some} Similar to POSIX read, the basic read and write functions of AK::Stream do not have a lower limit of how much data they read or write (apart from "none at all"). Rename the functions to "read some [data]" and "write some [data]" (with "data" being omitted, since everything here is reading and writing data) to make them sufficiently distinct from the functions that ensure to use the entire buffer (which should be the go-to function for most usages). No functional changes, just a lot of new FIXMEs.	2023-03-13 15:16:20 +00:00
Nico Weber	dfb45705e6	LibCompress: Make DeflateCompressor::write() use loop, not recursion This is performance-neutral, but Instruments.app had a hard time visualizing the very deeply nested stack frames here. No behavior change.	2023-03-13 06:32:56 +00:00
Tim Schumacher	43f98ac6e1	Everywhere: Remove the `AK::` qualifier from Stream usages	2023-02-13 00:50:07 +00:00
Tim Schumacher	220fbcaa7e	AK: Remove the fallible constructor from `FixedMemoryStream`	2023-02-08 17:44:32 +00:00
Tim Schumacher	8b2f23d016	AK: Remove the fallible constructor from `LittleEndianOutputBitStream`	2023-02-08 17:44:32 +00:00
Tim Schumacher	093cf428a3	AK: Move memory streams from `LibCore`	2023-01-29 19:16:44 -07:00
Tim Schumacher	2470dd3bb5	AK: Move bit streams from `LibCore`	2023-01-29 19:16:44 -07:00
Tim Schumacher	8464da1439	AK: Move `Stream` and `SeekableStream` from `LibCore` `Stream` will be qualified as `AK::Stream` until we remove the `Core::Stream` namespace. `IODevice` now reuses the `SeekMode` that is defined by `SeekableStream`, since defining its own would require us to qualify it with `AK::SeekMode` everywhere.	2023-01-29 19:16:44 -07:00
Tim Schumacher	5f2ea31816	AK: Move `Handle` from `LibCore` and name it `MaybeOwned` The new name should make it abundantly clear what it does.	2023-01-29 19:16:44 -07:00
Tim Schumacher	1ca62de558	LibCore: Return `EBADF` on unsupported stream operations	2023-01-19 11:41:56 +01:00
Tim Schumacher	46a53dc6e0	LibCompress: Switch the deflate seekback buffer to `CircularBuffer`	2023-01-13 17:34:45 -07:00
Tim Schumacher	0d69fbd19f	LibCompress: Remove `DuplexMemoryStream` from `DeflateDecompressor`	2023-01-13 17:34:45 -07:00
Tim Schumacher	d23f0a7405	LibCompress: Switch `DeflateDecompressor` to a fallible constructor We don't have anything fallible in there yet, but we will soon switch the seekback buffer to the new `CircularBuffer`, which has a fallible constructor. We have to do the same for the internal `GzipDecompressor::Member` class, as it needs to construct a `DeflateCompressor` from its received stream.	2023-01-13 17:34:45 -07:00
Tim Schumacher	f4afee4278	LibCompress: Switch `DeflateCompressor` to a fallible constructor	2023-01-10 10:28:26 +01:00
Tim Schumacher	8cd2cf2b77	LibCompress: Port `DeflateCompressor` to `Core::Stream`	2023-01-10 10:28:26 +01:00
Nico Weber	c96e663b0a	LibCompress: Add two missing return statements	2022-12-26 13:56:13 -05:00
Tim Schumacher	c6d71ca727	LibCore: Rename `MemoryStream` to `FixedMemoryStream` This is to differentiate between the upcoming `AllocatingMemoryStream`, which automatically allocates memory as needed instead of operating on a static memory area.	2022-12-15 13:28:29 +00:00
Tim Schumacher	30abd47099	LibCompress: Port `DeflateDecompressor` to `Core::Stream`	2022-12-12 16:21:39 +00:00
Lenny Maiorani	9afc7d5379	LibCompress: Change DeflateSpecialCodeLengths to constexpr variables	2022-04-03 17:36:48 +01:00
Idan Horowitz	086969277e	Everywhere: Run clang-format	2022-04-01 21:24:45 +01:00
mjz19910	3102d8e160	Everywhere: Fix many spelling errors	2022-01-07 10:56:59 +01:00
Andreas Kling	80d4e830a0	Everywhere: Pass AK::ReadonlyBytes by value	2021-11-11 01:27:46 +01:00
Linus Groh	649d2faeab	Everywhere: Use "the SerenityOS developers." in copyright headers We had some inconsistencies before: - Sometimes "The", sometimes "the" - Sometimes trailing ".", sometimes no trailing "." I picked the most common one (lowercase "the", trailing ".") and applied it to all copyright headers. By using the exact same string everywhere we can ensure nothing gets missed during a global search (and replace), and that these inconsistencies are not spread any further (as copyright headers are commonly copied to new files).	2021-04-29 00:59:26 +02:00
Idan Horowitz	1c512a702a	AK+Userland: Use idan.horowitz@serenityos.org for my copyright headers	2021-04-22 22:42:38 +02:00
Brian Gianforcaro	1682f0b760	Everything: Move to SPDX license identifiers in all files. SPDX License Identifiers are a more compact / standardized way of representing file license information. See: https://spdx.dev/resources/use/#identifiers This was done with the `ambr` search and replace tool. ambr --no-parent-ignore --key-from-file --rep-from-file key.txt rep.txt *	2021-04-22 11:22:27 +02:00
Linus Groh	2b0c361d04	Everywhere: Fix a bunch of typos	2021-04-18 10:30:03 +02:00
Idan Horowitz	974a981ded	LibCompress: Convert DeflateDecompressor from recursive to iterative This way a deflate blob that contains a large number of small blocks won't cause a stack overflow.	2021-03-19 23:03:04 +01:00
Idan Horowitz	ea7bdf02b8	LibCompress: Fail gracefully on invalid symbols in DeflateDecompressor	2021-03-17 21:57:16 +01:00
Idan Horowitz	071ee7c6f4	LibCompress: Check for impossible back references in DeflateDecompressor This commit makes sure that we fail if an encoded lz77 back reference references bytes that are outside our sliding window, instead of just silently failing, which triggers an assertion down the line.	2021-03-17 21:57:16 +01:00
Idan Horowitz	8533cceed5	LibCompress: Fail gracefully on missing huffman codes in DeflateDecompressor	2021-03-16 21:57:44 +01:00
Idan Horowitz	be5a8d9c7f	LibCompress: Check and fail for input stream errors in DeflateDecompressor Since we were not checking for error flags set by read_bits we would just always read 0 as the bits' value, which in some edge cases could lead to an infinite loop.	2021-03-16 21:57:44 +01:00
Idan Horowitz	ea5f83616e	LibCompress+AK: Don't short-circuit error handling propagation In the case that both the stream and the wrapped substream had errors to be handled, only one of the two would be resolved due to boolean short-circuiting. This commit ensures both are handled regardless of one another.	2021-03-16 14:56:50 +01:00
Idan Horowitz	a955fd4156	LibCompress+AK: Propagate error handling to wrapped streams This ensures that when a DeflateCompressor stream is cleared of any errors its underlying wrapped streams (InputBitStream/InputMemoryStream) will be cleared as well and won't fail a VERIFY on destruction.	2021-03-15 21:35:48 +01:00
Idan Horowitz	7e587a615e	LibCompress: Handle literal only lz77 streams in DeflateCompressor Very incompressible data could sometimes produce no backreferences which would result in no distance huffman code being created (as it was not needed), so VERIFY the code exists only if it is actually needed for writing the stream.	2021-03-14 11:05:35 +01:00
Idan Horowitz	b1e3176f9f	LibCompress: Replace goto with simple recursion in DeflateCompressor This is just a bit easier on the eyes :^)	2021-03-13 23:50:07 +01:00
Idan Horowitz	bcbfa7db62	LibCompress: Implement DEFLATE compression This commit adds a fully functional DEFLATE compression implementation that can be used to implement compression for higher level formats like gzip, zlib or zip. A large part of this commit is based on Hans Wennborg's great article about the DEFLATE and zip specifications: https://www.hanshq.net/zip.html	2021-03-13 20:07:25 +01:00
Andreas Kling	ef1e5db1d0	Everywhere: Remove klog(), dbg() and purge all LogStream usage :^) Good-bye LogStream. Long live AK::Format!	2021-03-12 17:29:37 +01:00
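
Commits `7447a91d7e` and `071ee7c6f4` describe two rules for LZ77 back-references: a reference that reaches outside the already-decoded window must be rejected, and a reference whose distance is at least its length can be copied in one shot, while a self-referencing one (distance less than length) has to be copied byte by byte. The following is a minimal sketch of that idea only. `copy_back_reference` and the use of `std::vector` as the output buffer are assumptions made for illustration; this is not the LibCompress implementation, which works on a circular buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical helper (not the LibCompress API): appends `length` bytes that
// start `distance` bytes before the current end of `output`.
// Returns false for an impossible back-reference, i.e. one that points
// before the start of the data decoded so far.
bool copy_back_reference(std::vector<unsigned char>& output, std::size_t distance, std::size_t length)
{
    if (distance == 0 || distance > output.size())
        return false;

    std::size_t const old_size = output.size();
    std::size_t const source = old_size - distance;
    output.resize(old_size + length);

    if (distance >= length) {
        // Source and destination ranges do not overlap, so one bulk copy is safe.
        std::memcpy(output.data() + old_size, output.data() + source, length);
        return true;
    }

    // Self-referencing: later bytes depend on bytes written earlier in this
    // same copy, so they must be produced in order, one at a time.
    for (std::size_t i = 0; i < length; ++i)
        output[old_size + i] = output[source + i];
    return true;
}
```

Starting from the two bytes `ab`, a reference with distance 2 and length 5 takes the byte-by-byte path and yields `abababa`; a following reference with distance 7 and length 3 takes the bulk path.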

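
The prefix-table approach from commit `5aaefe4e62` can be illustrated with a small sketch. Because no code is a prefix of another, every 8-bit window whose leading bits equal a short code can be mapped to that code's symbol and length, so decoding becomes a single table lookup. All names here (`CodeSketch`, `TableEntry`, `build_prefix_table`) are hypothetical, and the sketch assumes codes are matched against the most significant bits of the window; Deflate's actual bit ordering and the LibCompress data structures differ.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical types for illustration only.
struct CodeSketch {
    std::uint16_t symbol;
    std::uint16_t code;   // code bits, right-aligned
    std::uint8_t length;  // number of bits in `code`
};

struct TableEntry {
    std::uint16_t symbol;
    std::uint8_t length;  // 0 means "no code of at most 8 bits matches"
};

constexpr std::size_t max_table_bits = 8;
using PrefixTable = std::array<TableEntry, (std::size_t { 1 } << max_table_bits)>;

PrefixTable build_prefix_table(std::vector<CodeSketch> const& codes)
{
    PrefixTable table {}; // zero-initialized: every entry starts with length 0
    for (auto const& c : codes) {
        if (c.length == 0 || c.length > max_table_bits)
            continue; // longer codes would use a slower fallback search

        // Fill every 8-bit window whose top `length` bits equal the code.
        std::size_t const shift = max_table_bits - c.length;
        std::size_t const first = static_cast<std::size_t>(c.code) << shift;
        std::size_t const last = first + (std::size_t { 1 } << shift);
        for (std::size_t i = first; i < last; ++i)
            table[i] = { c.symbol, c.length };
    }
    return table;
}
```

With the codes `0` for A, `10` for B, and `11` for C, the window `10110000` looks up B with length 2, so the decoder would consume two bits and repeat the lookup on the next window.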

54 commits
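
The buffering change described in commit `eed956b473` can be sketched as follows: bits are merged into a 64-bit buffer with a single shift and OR instead of one bit at a time, and any leftover partial byte has to be flushed explicitly when the caller is done. `BitWriterSketch` is a hypothetical class written for this illustration, not `LittleEndianOutputBitStream`; in particular, it drains whole bytes after every write and limits a write to 56 bits, which are simplifications assumed here rather than behavior taken from AK.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical little-endian bit writer for illustration only.
class BitWriterSketch {
public:
    // Writes the low `count` bits of `value`, least significant bit first.
    void write_bits(std::uint64_t value, std::size_t count)
    {
        // Fewer than 8 bits are pending between calls, so 56 new bits always fit.
        assert(count <= 56);
        value &= (std::uint64_t { 1 } << count) - 1;

        // Merge all bits at once instead of looping over them one by one.
        m_buffer |= value << m_bit_count;
        m_bit_count += count;

        // Emit every complete byte; a partial byte stays buffered.
        while (m_bit_count >= 8) {
            m_bytes.push_back(static_cast<std::uint8_t>(m_buffer & 0xFF));
            m_buffer >>= 8;
            m_bit_count -= 8;
        }
    }

    // Must be called once at the end; pads the final partial byte with zero bits.
    void flush()
    {
        if (m_bit_count == 0)
            return;
        m_bytes.push_back(static_cast<std::uint8_t>(m_buffer & 0xFF));
        m_buffer = 0;
        m_bit_count = 0;
    }

    std::vector<std::uint8_t> const& bytes() const { return m_bytes; }

private:
    std::uint64_t m_buffer = 0;
    std::size_t m_bit_count = 0;
    std::vector<std::uint8_t> m_bytes;
};
```

Writing the 3-bit value `0x5` followed by the 5-bit value `0x1F` produces the single byte `0xFD`. A further 1-bit write produces no output until `flush()` is called, which mirrors the caveat in the commit message about flushing leftover bits.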