Skip to Content
77 Mass Ave

Reading Closed Books

New technology makes it possible to decipher text printed on a stack of pages.
October 18, 2016

MIT researchers and their colleagues are developing an imaging method that can read closed books.

In September, in Nature Communications, the researchers reported a new method that they tested on a stack of papers, each with one letter printed on it. The method was able to correctly identify the letters on the top nine sheets.

“The Metropolitan Museum in New York showed a lot of interest in this, because they want to, for example, look into some antique books that they don’t even want to touch,” says Barmak Heshmat, a research scientist in Ramesh Raskar’s Camera Culture group at the MIT Media Lab and corresponding author on the new paper. He adds that the system could be used to analyze a large range of materials organized in thin layers, such as coatings on machine parts or pharmaceuticals.

The MIT researchers developed the algorithms that acquire images from individual sheets in stacks of paper; researchers at Georgia Tech developed the algorithm that interprets the often distorted or incomplete images as individual letters.

The system uses terahertz radiation, a band of electromagnetic radiation between microwaves and infrared light. Terahertz radiation can distinguish between ink and blank paper, in a way that x-rays can’t, and it can be emitted in such short bursts that the distance it has traveled can be accurately measured. The system exploits the fact that air pockets about 20 micrometers wide separate a book’s pages naturally. The difference in the degree to which air and paper bend light means that the boundary between them will reflect terahertz radiation back to a detector.

In the researchers’ setup, a standard terahertz spectrometer emits ultrashort bursts of radiation, and the camera’s built-in sensor detects their reflections. From the reflections’ time of arrival, the MIT researchers’ algorithm can gauge the distance to the individual pages of the book.

While most of the radiation is either absorbed or reflected by the book, some of it bounces around between pages before returning to the sensor, producing a spurious signal. The sensor’s electronics also produce a background hum. One of the tasks of the MIT researchers’ algorithm is to filter out all this “noise.”

The information about the pages’ distance helps: it allows the algorithm to home in on just the terahertz signals whose arrival times suggest that they are true reflections. Then it relies on two different measures of the reflections’ energy, and assumptions about both the energy profiles of true reflections and the statistics of noise, to extract information about the chemical properties of the reflecting surfaces. This allows researchers to identify letters by determining the exact shapes of ink printed on each individual page.

Keep Reading

Most Popular

Large language models can do jaw-dropping things. But nobody knows exactly why.

And that's a problem. Figuring it out is one of the biggest scientific puzzles of our time and a crucial step towards controlling more powerful future models.

OpenAI teases an amazing new generative video model called Sora

The firm is sharing Sora with a small group of safety testers but the rest of us will have to wait to learn more.

Google’s Gemini is now in everything. Here’s how you can try it out.

Gmail, Docs, and more will now come with Gemini baked in. But Europeans will have to wait before they can download the app.

This baby with a head camera helped teach an AI how kids learn language

A neural network trained on the experiences of a single young child managed to learn one of the core components of language: how to match words to the objects they represent.

Stay connected

Illustration by Rose Wong

Get the latest updates from
MIT Technology Review

Discover special offers, top stories, upcoming events, and more.

Thank you for submitting your email!

Explore more newsletters

It looks like something went wrong.

We’re having trouble saving your preferences. Try refreshing this page and updating them one more time. If you continue to get this message, reach out to us at customer-service@technologyreview.com with a list of newsletters you’d like to receive.