serenity

mirror of https://github.com/RGBCube/serenity synced 2025-05-31 08:38:12 +00:00

Author	SHA1	Message	Date
Rodrigo Tobar	3588709986	LibPDF: Load Type1C fonts when found Now that our CFF parser is working we can load Type1C fonts in PDF, which are backed by a CFF stream.	2023-01-25 15:40:11 +01:00
Rodrigo Tobar	c4b45a82cd	LibPDF: Add initial CFF parsing The Compat Font Format specification (Adobe's Technical Note #5176) is used by PDF's Type1C fonts to store their data. While being similar in spirit to PS1 Type 1 Font Programs, it was designed for a more compact representation and thus space reduction (but an increment on complexity). It also shares most of the charstring encoding logic, which is why the CFF class also inherits from Type1FontProgram. This initial implementation is still lacking many details, e.g.: * It doesn't include all the built-in CFF SIDs * It doesn't support CFF-provided SIDs (defaults those glyphs to the space character) * More checks in general	2023-01-25 15:40:11 +01:00
Rodrigo Tobar	1ec4ad5eb6	LibPDF: Add name -> char code conversion in Encoding This is an operation that was already being done (sub-optimally) in PS1FontProgram, so we are replacing that. We will use this during CFF parsing too.	2023-01-25 15:40:11 +01:00
Rodrigo Tobar	1b90ea7d3a	LibPDF: Augment Type11FontProgram with Type2 capabilities The Type1FontProgram logic was based on the Adobe Type 1 Font Format; in particular, it implemented the CharStrings Dictionary section (charstring decoding, and most commands). In the case of Type1, these charstrings are read from a PS1 diciontary, with one entry per character in the font's charset. This has served us well for Type1 font rendering. When implementing Type1C font rendering, this wasn't enough. Type1C PDF fonts are specified in embedded CFF (Compact Font File) streams, which also contain a charstring dictionary with an entry for each character in the font's charset. These entries can be slightly different from those in a PS1 Font Program though: depending on a flag in the CFF, the entries will be encoded either in the original charstring format from the Adobe Type 1 Font Format, or in the "Type 2 Charstring Format" (Adobe's Technical Note #1577). This new format is for the most part a super-set of the original, with small differences, all in the name of making the representation as compact as possible: * The glyph's width is not specified via a separate command; instead it's an optional additional argument to the first command of the charstring stream (and even then, it's only the difference to a nominal character width specified in the CFF). * The interpretation of a 4-byte number is different from Type 1: in Type 1 this is a 4-byte unsigned integer, whereas in Type 1 it's a fixed decimal with 16 bits of fractional part. * Many commands accept a variable set of arguments, so they can draw more than one line/curve on a single go. These are all retro-compatible with Type 1's commands. All these changes are implemented in this patch in a backwards-compatible way. To ensure Type 1/2 behavior is accessed, a new parameter indicates which behavior is desired when decoding the charstring stream. I also took the chance to centralise some logic that was previously duplicated across the parse_glyph function. Common lambdas capture the logic for moving to, or drawing a line/curve to a given point and updating the glyph state. Similarly, some command logic, including reading parameters, are shared by several commands. Finally, I've re-organised the cases in the main switch to group together related commands.	2023-01-25 15:40:11 +01:00
Rodrigo Tobar	f06de0fa07	LibPDF: Remove unused member	2023-01-25 15:40:11 +01:00
Rodrigo Tobar	416585f75a	LibPDF: Add new Type1FontProgram base class We are planning to add support for CFF fonts to read Type1 fonts, and therefore much of the logic already found in PS1FontProgram will be useful for representing the Type1 fonts read from CFF. This commit moves the PS1-independent bits of PS1FontProgram into a new Type1FontProgram base class that can be used as the base for CFF-based Type1 fonts in the future. The Type1Font class uses this new type now instead of storing a PS1FontProgram pointer. While doing this refactoring I also took care of making some minor adjustments to the PS1FontProgram API, namely: * Its create() method is static and returns a NonnullRefPtr<Type1FontProgram>. * Many (all?) of the parse_* methods are now static. * Added const where possible. Notably, the Type1FontProgram also contains at the moment the code that parses the CharString data from the PS1 program. This logic is very similar in CFF files, so after some minor adjustments later on it should be possible to reuse most of it.	2023-01-25 15:40:11 +01:00
Rodrigo Tobar	e751ec2089	LibPDF: Avoid reading fields from moved-from Data object This might not be an issue at the moment, but moved-from objects are usually in a unspecifed but valid state, meaning that we shouldn't read from them.	2023-01-25 15:40:11 +01:00
Rodrigo Tobar	bfeca4ebb3	LibPDF: Record base font name read from document This will be useful for debugging, or if we later on want to show all the fonts found in the document in an organised manner.	2023-01-25 15:40:11 +01:00
Timothy Flynn	f3db548a3d	AK+Everywhere: Rename FlyString to DeprecatedFlyString DeprecatedFlyString relies heavily on DeprecatedString's StringImpl, so let's rename it to A) match the name of DeprecatedString, B) write a new FlyString class that is tied to String.	2023-01-09 23:00:24 +00:00
Julian Offenhäuser	2a70ea3ee7	LibPDF: Propagate errors in PDFFont::create()	2023-01-09 22:54:36 +00:00
Julian Offenhäuser	ac31b1bda3	LibPDF: Make glyphs from standard 14 fonts show up in Type1Font Previously, we would assume that all standard 14 fonts use a TrueTypeFont dictionary. Now we render them in Type1Font as well, given that it doesn't contain a PostScript font program.	2023-01-09 22:54:36 +00:00
MacDue	eeb6072f15	LibGfx+LibPDF: Apply subpixel offset in affine transformation	2023-01-05 13:50:26 +01:00
MacDue	91db49f7b3	LibPDF: Use subpixel accurate text rendering This just enables the new tricks from LibGfx with the same nice improvements :^)	2023-01-05 12:09:35 +01:00
Andreas Kling	f982400063	LibGfx: Rename TTF/TrueType to OpenType OpenType is the backwards-compatible successor to TrueType, and the format we're actually parsing in LibGfx. So let's call it that.	2022-12-21 08:44:22 +01:00
Rodrigo Tobar	a1af79dca6	LibPDF: Follow a FontFile's Length values These can be references (at least from what I've found in some documents), so we want to resolve them before using them.	2022-12-16 01:24:43 -07:00
Rodrigo Tobar	41bd304a7f	LibPDF: Ignore seac PS1 commands for now This command is meant to print an Standard Encoding Accented Character. It's not critical to implement it yet, but if we want to render more documents we need to handle the instruction, even if simply ignore it.	2022-12-16 01:24:43 -07:00
Andreas Kling	d6a3be1615	LibPDF: Add missing character quirk for WinAnsiEncoding fonts Fonts with the encoding name "WinAnsiEncoding" should render missing characters above character code 040 (octal) as a "bullet" character. This patch adds Encoding::should_map_to_bullet(char_code) which is then called by char_code_to_code_point() to check if the given char code should be displayed as a bullet instead. I didn't have a good way to test this, so I've only verified that it works by manually overriding inputs to the function during the rendering stage. This takes care of a FIXME in the Annex D part of the PDF specification.	2022-12-08 09:54:20 +01:00
MacDue	7be0b27dd3	Meta+Userland: Pass Gfx::IntPoint by value This is just two ints or 8 bytes or the size of the reference on x86_64 or AArch64.	2022-12-07 11:48:27 +01:00
Linus Groh	6e19ab2bbc	AK+Everywhere: Rename String to DeprecatedString We have a new, improved string type coming up in AK (OOM aware, no null state), and while it's going to use UTF-8, the name UTF8String is a mouthful - so let's free up the String name by renaming the existing class. Making the old one have an annoying name will hopefully also help with quick adoption :^)	2022-12-06 08:54:33 +01:00
Julian Offenhäuser	e06a065594	LibPDF: Override Type 1 character mappings by encoding in font dict If the font dictionary includes an "Encoding" entry, it will be used instead of the PS1FontProgram's built-in encoding.	2022-11-25 22:44:47 +01:00
Julian Offenhäuser	65ff80e8a5	LibPDF: Add alternative names to is_standard_latin_font() helper	2022-11-25 22:44:47 +01:00
Julian Offenhäuser	9cb3b23377	LibPDF: Move all font handling to Type1Font and TrueTypeFont classes It was previously the job of the renderer to create fonts, load replacements for the standard 14 fonts and to pass the font size back to the PDFFont when asking for glyph widths. Now, the renderer tells the font its size at creation, as it doesn't change throughout the life of the font. The PDFFont itself is now responsible to decide whether or not it needs to use a replacement font, which still is Liberation Serif for now. This means that we can now render embedded TrueType fonts as well :^) It also makes the renderer's job much more simple and leads to a much cleaner API design.	2022-11-25 22:44:47 +01:00
Julian Offenhäuser	e748a94f80	LibPDF: Introduce loading of common font data in PDFFont base class This font data is shared between Type 1 and TrueType fonts, which is why we can now load it in the base class that they both use.	2022-11-25 22:44:47 +01:00
Julian Offenhäuser	dd82a026f8	LibPDF: Pass PDFFont::draw_glyph() a char code instead of a code point We would previously pass this function a unicode code point, which is not actually what we want here. Instead, we want the "raw" code point, with the font itself deciding whether or not it needs to be re-mapped. This same mistake in terminology applied to PS1FontProgram.	2022-11-25 22:44:47 +01:00
Julian Offenhäuser	7c4f5b58be	LibPDF: Use Gfx::PathRasterizer for Adobe Type 1 font rendering This gives much better visual results than painting the path directly. It also has the nice side effect that Type 1 fonts will now look much more similar to TrueType fonts, which use the same class :^) In addition, we can now cache glyph bitmaps for repeated use.	2022-11-19 11:04:34 +01:00
Julian Offenhäuser	b14f0950a5	LibPDF: Add very basic support for Adobe Type 1 font rendering Previously we would draw all text, no matter what font type, as Liberation Serif, which results in things like ugly character spacing. We now have partial support for drawing Type 1 glyphs, which are part of a PostScript font program. We completely ignore hinting for now, which results in ugly looking characters at low resolutions, but gain support for a large number of typefaces, including most of the default fonts used in TeX.	2022-10-16 17:44:54 +02:00
Julian Offenhäuser	e6f29302a7	LibPDF: Add glyph drawing and type info methods to PDFFont A PDFFont can now be asked for its specific type and whether it is part of the standard 14 fonts. It now also contains a method to draw a glyph, which is stubbed-out for now. This will be useful for the renderer to take into consideration when drawing text, since we don't include replacements for the standard set of fonts yet, but still want to make use of embedded fonts when available.	2022-10-16 17:44:54 +02:00
Julian Offenhäuser	36828f1385	LibPDF: Don't expect glyph width arrays to contain integers They might also contain floats, in which case we convert them to int before use.	2022-09-17 10:07:14 +01:00
Simon Wanner	5136c5ae1a	LibGfx: Move ScaledFont and new base class VectorFont out of TTF	2022-04-09 23:48:18 +02:00
Simon Wanner	6f8fd91f22	LibGfx: Move TTF files from TrueTypeFont/ to Font/TrueType/	2022-04-09 23:48:18 +02:00
Matthew Olsson	4d0f74a15c	LibPDF: Add Type0 and TrueType fonts	2022-03-31 18:10:45 +02:00
Matthew Olsson	e831c374f4	LibPDF: Abstract Type1 font data TTF font types will use the same data	2022-03-31 18:10:45 +02:00
Matthew Olsson	058cf5f7f7	LibPDF: Accept font size in PDFFont::get_char_width This will be required for TTF fonts	2022-03-31 18:10:45 +02:00
Matthew Olsson	5f9d35909d	LibPDF: Move font files into their own directory	2022-03-31 18:10:45 +02:00

34 commits