SWHIG Seminar Ian Kim 1/18/12
General overview of printing technology ◦ Background information: Vector vs. raster graphics How inkjet printers work ◦ Application: PostScript and PDF How PostScript and PDF are understood by printers Understanding printing requires a change in perspective ◦ Not the same thing as pixels on a monitor!
Key idea in understanding printing ◦ Font rendering ◦ Graphic design Most work with raster graphics ◦ JPEGs, PNGs, GIFs… ◦ The disadvantage? Limited resolution Vector graphics: scalable to any resolution ◦ Extensively utilized in printing for this reason: images stay crisp even when printed at high res ◦ Font rendering is done with vector graphics Inexact scaling: font hints
Vector graphicsRaster graphics Picture is described by mathematical functions Can be scaled to any resolution Scaling up image does not change its look Geometric transforms do not reduce level of detail in image Picture is described by an array of pixels Has one limited, “native” resolution Scaling up the image results in stretching Geometric transforms can cause generation loss
Left: a comparison between vector and raster graphics. When zoomed-in, the raster version loses detail whereas the vector version does not. Above: an example of a Bézier curve, a common primitive used in vector graphics. Source: Wikimedia Commons
Computer pixels do not have a direct correspondence with ink dots on a page ◦ Measure of pixels on a screen: pixels per inch (PPI) ◦ For example, 96 PPI is standard in Windows Reason? Limited inkjet printer color palette ◦ Ever heard of CMYK? (But sometimes more) ◦ Use dithering to get more colors ◦ This requires many ink dots for a single pixel ◦ Measure of this ink dot density: dots per inch (DPI) Don’t confuse DPI and PPI!
Above: Differences between pixels on a computer screen and how they are printed by a typical inkjet printer. The method printers use to output the full array of colors of a computer pixel using just the CMYK colors is called dithering. The image uses just red and blue, but as the density increases the pattern is interpreted as solid purple. Often this requires quite a few dots to pull off. The example at left measures ~0.25”x0.25”, and is about 150 DPI. The individual ink dots are visible. Source: Wikimedia Commons
PostScript: a programming language used to describe the layout of a page ◦ Contains control structures, is Turing-complete… ◦ Heavily stack based Functions pop arguments off stack, push return values onto stack Even objects are pushed onto the stock and popped off all at once to display the page ◦ These characteristics will be important later! De facto standard for page description ◦ Slowly being phased out in favor of native PDF
Originally each printer had its own printer language ◦ Primitive example: ASCII with control characters to tell the printer how to print the plaintext ◦ Not portable, and hard to guarantee desired output ◦ Each language was device-specific PostScript allowed portability & reliability ◦ Device-independent page description ◦ Consistency of results across devices ◦ Similar to the motivation behind development of high-level programming languages
PostScript input is interpreted by a raster image processor (RIP) ◦ Originally, this used to be contained within printers Powerful processors Very expensive (1984 LaserWriter: $3500) ◦ Nowadays, pages can be rendered on the computer then sent to a “dumb” printer (cheaper!) Name comes from rasterization: rendering vector graphics into a raster image ◦ Convert PostScript input (vectors) to printed output (raster)
%!PS /Courier % name the desired font 20 selectfont % choose the size in points and establish % the font as the current one moveto % position the current point at % coordinates 72, 500 (the origin is at the % lower-left corner of the page) (Hello world!) show % stroke the text in parentheses showpage % print all on the page Displays “Hello world!” on a new page.
PostScript, as a programming language, makes rendering pages difficult ◦ Implicit global state: can’t jump pages ◦ Variables, control flow statements Stack-based computation ◦ No instant feedback to changes ◦ Need to push everything onto the stack, then render page Computationally expensive ◦ Printers had better processors than the PCs!
Encapsulated PostScript (EPS) was introduced to address these problems ◦ Contains an encapsulated bitmap “preview” of the accompanying PS code ◦ Forbids the use of global-state modification and control flow statements ◦ Has a well-defined BoundingBox element Allowed PCs to see PS code output without rendering it Predecessor to Portable Document Format (PDF)
Based on PostScript technology ◦ Both were developed by Adobe Not a replacement for PostScript! ◦ But is a replacement for EPS files ◦ Example: PostScript was used to print PDF output Now, native PDF printers exist PostScript is a programming language; PDF is a generalized container format ◦ Hyperlinks, images, fonts… ◦ You can even embed movies into PDF documents! (Why you would want to is another question)
Embedded fonts ◦ Fixes font compatibility issues Transparency support PDF objects are PostScript objects that have already been interpreted by a RIP ◦ Eliminates need for preview images ◦ Objects can be more intuitively manipulated ◦ Pages can be accessed in arbitrary order ◦ More consistency from screen-to-paper PDF provides greater reliability and consistency ◦ Mistakes in the printing world are expensive!
Printing: conceptual differences ◦ Drawn in terms of vectors, then converted to raster ◦ A screen pixel doesn’t equal a dot PostScript: “high-level” printing ◦ Brought together device-specific printing languages ◦ Still used today by printers to print documents Next step: PDF ◦ Eliminates need for Encapsulated PostScript ◦ Extensible container for other kinds of content ◦ Not just confined to a physical page anymore