• April 16, 2014

Information Overload, Then and Now

Illustrations by Julie Delton for The Chronicle Review

Feeling overwhelmed by too much information? What else is new? The amount of digital data available on the Web every day reaches records of mind-boggling proportions: now more than a zettabyte (10^21 bytes) and presumably accumulating at an ever-increasing rate, estimated at 30-percent growth per year from 1999 to 2002.

But such figures—often presented as evidence of unprecedented and stress-inducing overload—don't mean much. After all, it takes only one or two pages of Google hits to overwhelm the average reader. Does it really matter whether there are hundreds or thousands more pages after those?

More important, information overload was experienced long before the appearance of today's digital gadgets. Complaints about "too many books" echo across the centuries, from when books were papyrus rolls, parchment manuscripts, or hand printed. The complaint is also common in other cultural traditions, like the Chinese, built on textual accumulation around a canon of classics.

Writing was very likely the first culprit, making possible the accumulation of texts beyond what a single mind could master, even a mind trained to memorize Homer or biblical texts. Writing on durable surfaces (like parchment or paper), with a high level of redundancy (when multiple copies were produced, whether manuscript or printed), also made it possible to recover texts after they had fallen into oblivion, so that being in continuous active use was no longer essential to a text's transmission, as is the case in an oral culture.

Reactions to overload have often been emotional, whether hostile or enthusiastic. Early negative responses include Ecclesiastes 12:12 ("Of making books there is no end," probably from the fourth or third century BC) and Seneca's "distringit librorum multitudo" ("the abundance of books is distraction," first century AD). But we also find enthusiasm for accumulation—of papyri at the Library of Alexandria (founded in the early third century BC) or of the 20,000 "facts" that Pliny the Elder accumulated in Historia naturalis (completed in AD 77). Though we no longer care especially about ancient precedent, we hear the same doom and praise today.

Overload has also triggered pragmatic responses, as generations have done their best to locate and use the information they seek, under inevitable constraints of time, energy, and other resources. Typically we select from collective storage facilities, like libraries and the Internet, and not only books and Web pages but also specific parts of them (like arguments, quotations, or facts). If we wish to revisit results, we need to store them so that they are retrievable. Electronic media have prompted attempts (as in Microsoft's MyLifeBits) to store the entirety of an individual's experiences, but among scholars a more conventional method is to take notes and store selections or summaries.

Tools for searching for and retrieving that information have a long history, too, although it is obscured by the fact that working texts are often not considered worth preserving. Notes could be taken on temporary surfaces like reusable wax tablets; even when they were written on more-permanent materials (like the 160 papyrus rolls bequeathed by Pliny the Elder to his nephew), they were typically not copied for other people. Although annotations survive in the margins of medieval manuscripts, significant collections of working papers and free-standing reading notes come down to us only from the start of the Renaissance, when paper became widely available. Humanists taught the art of note-taking under topical headings called "commonplaces," generating reams of excerpts selected for rhetorical, historical, or moral interest.

In doing so, they looked back to ancient calls for note-taking, but they were also deeply indebted to medieval practices of text management. The oldest was the florilegium: collected bits, or "flowers," from authoritative texts, sorted by topic (often the same headings that the humanists pick up; for example, vices and virtues). Starting in the 13th century, the scholastics devised powerful new tools, notably the alphabetical index (including biblical concordances and subject indexes) and the structuring and layout of a text into numbered sections and subsections, so that it could be consulted rather than read through.

Printing, which spread rapidly after its invention, circa 1450, triggered only a few genuine innovations in the presentation of texts: title page, pagination, and the use of white space instead of color to highlight section breaks. Above all printing created the conditions under which a broad reading public was able to use tools that had previously been limited to a small, specialized elite. Early printed books boasted of their indexes. Printed reference books, most of them in genres that had existed in the Middle Ages, became steady sellers, with frequent and frequently enlarged new editions.

Printing also hugely magnified the perception of overload. Books were produced and accumulated in unprecedented numbers, and, given their drop in cost, many more readers than before had access to more books than they could read. Especially after the mid-16th century, contemporaries frequently complained of the overabundance of books. Of course complaints, made in print, typically were leveled at an excess of "bad books," offering a new, good book as a solution, the goodness of which might stem from greater inclusiveness or selectiveness, depending on the kind of book.

The genres promoted as solutions to overload in the 16th century included bibliographies (some universal, others selective) and many kinds of compilations that gathered the best bits from all those books one couldn't manage to read. The last essentially offered ready-made reading notes of the kind humanist pedagogues recommended taking oneself. The periodical and the book review were also advertised as remedies to overabundance when they appeared, in the late 17th century. And during the 18th century, contemporaries commented on the explosion of dictionaries and encyclopedias. Those vernacular works, focused on recent writing, borrowed features from Latin reference genres even as they abandoned the Latin works' focus on classical culture. The new volumes, including Encyclopaedia Britannica, remained the staple of reference rooms until recently. But bear this in mind: Early reference works were not only valued but also decried as shortcuts that threatened to lure students and scholars away from learning. Thus the anxieties surrounding Wikipedia and other shortcuts available today have historical antecedents.

Electronic media have made overload seem universal, spreading well beyond scholarly fields already familiar with the phenomenon and into almost every activity, including shopping and entertainment, and visible to anyone doing a basic Internet search. We have all acquired various coping skills, often without giving our methods much thought. My study of information management in the era of humanist note-taking and early printed reference books has left me wondering about what we risk losing in academic scholarship as we move more of our work to electronic media:

Storing. It has been a long time since scholars have been concerned, as the humanists were in the Renaissance, about being unable to recover long-forgotten texts. Historians routinely look at old manuscripts, preserved but then forgotten in a personal or institutional library or archive. Although the handwriting may require some deciphering, ink on paper remains legible for centuries. But despite the rise of standards to make digital coding durable, the inevitable obsolescence of hardware and software creates major barriers to reading electronic texts after they have fallen into disuse. True, the Internet and electronic files offer great redundancy, which raises the chances that information will be preserved. (Indeed, book history suggests that redundancy is more effective than the durability of the medium in ensuring preservation—for example, ancient stone inscriptions often survive only in the printed transcriptions made of them.) But computers preserve only what has been upgraded to match their ever-changing specifications. Documents without anyone interested in using them and upgrading them to new platforms may become inaccessible. What are the odds of being able to read one's own or someone else's digital notes in 20 years, let alone a few hundred?

Sorting. Early printed indexes were notoriously difficult to use, even in their day. One needed to search under different synonyms for material on a topic, and users complained that they could never find what they were looking for. By the late 19th century, the professionalization of library cataloging and indexing had brought some relief in the form of a controlled vocabulary, a set of agreed-upon subject headings. Although subject and index headings perforce change over time, as do the categories by which we remember and manage our own notes, they remain a uniquely powerful tool. Today search engines can track the keywords chosen by individual users and writers, but we still need library catalogers and indexers who can identify relevant category terms that do not appear explicitly in the text and who can group related topics under consistent subject headings.

Selecting and summarizing. Keyword searches and data mining offer tempting alternatives to earlier methods used by readers and authors to select the "best bits" to store and later refer to. Microsoft Word and Web sites like tldr.it (for "too long, didn't read it") even offer automated summarizing (which seems to operate by selecting a few complete sentences from a text, without very convincing selection criteria). As we have seen, it is precisely amid great overabundance that selection and summarizing become all the more valuable. Shortcuts like indexes or reference books have long been available—as well as people whom the wealthy and powerful have relied on to take notes. Now we have Wikipedia, and we can rely on computers for many tasks.

But making and using shortcuts skillfully and responsibly requires judgment, too. I hope that such judicium, a central value in education since the Renaissance, will continue to define intellectual work and to spur demand for high-quality information, contextual understanding, and methods for building on previous reading and experience, so that we are not reduced to searching for everything anew.

It's important to remember that information overload is not unique to our time, lest we fall into doomsaying. At the same time, we need to proceed carefully in the transition to electronic media, lest we lose crucial methods of working that rely on and foster thoughtful decision making. Like generations before us, we need all the tools for gathering and assessing information that we can muster—some inherited from the past, others new to the present. Many of our technologies will no doubt rapidly seem obsolete, but, we can hope, not human attention and judgment, which should continue to be the central components of thoughtful information management.

Ann Blair is a professor of history at Harvard University and author of Too Much to Know: Managing Scholarly Information Before the Modern Age, just published by Yale University Press.


1. brightedge - November 29, 2010 at 10:20 am

Although most readers will catch this typo, a zettabyte is 10^21 bytes or one thousand million million million bytes. Now isn't that a relief!

2. ctmathewes - November 29, 2010 at 12:31 pm

This piece has substantial overlap with a piece published this weekend in the Boston Globe Ideas section:


Oddly, not noted here. Why?

3. no1curr - November 30, 2010 at 01:38 am

This is how things work, in my experience. My guess is that in support of her new book (see the byline), her agent was successful at placing pieces in several venues. Hence the synchronous appearance of these pieces.

4. drjilliantweiss - November 30, 2010 at 06:58 am

I think this is a very important issue. I believe that one of the most important skills I learned in graduate school is how to skim a source and understand its skeleton: the main point, methodology, theoretical framework, and evidence. While it is also important to learn how to read deeply, without learning to read broadly, it is impossible to do good research. I myself am overwhelmed daily by hundreds of sources, including The Chronicle, filling up my email box. But, of course, that's the function of article titles and email subject lines and email folders -- I get to choose which items to skim, which to read, which to re-read, and to which to respond. No one would argue that I must fully read each email and each article put in front of my face. I explicitly teach my thesis students how to skim books and articles. I think the idea of tldr.it is brilliant, and I can't wait to learn more about it. Of course, skimming and suchlike is only one skill. I'm not arguing it substitutes for careful reading of sources when appropriate.

5. dank48 - November 30, 2010 at 08:47 am

So, the internet is like the real world, sort of. Heavy, man.

6. cwinton - November 30, 2010 at 09:22 am

Rest easy, I believe the archivists have finally managed to get through Woodrow Wilson's presidency.

7. mercuria - November 30, 2010 at 12:01 pm

Deride, deride, #6, but you may be surprised by the pressures put upon archivists in these Internet-euphoric times. If we bow to Administrative cost-benefit ratios, we'd pulp entire collections in lieu of Google Books copies. Then everyone would say, "Where's the paper? What the hell do archivists do anyway?" If we stay on the traditional path, preserving paper first, then people can't easily access manuscript collections online. So everyone still says, "What the hell do archivists do anyway?"

In climates of budget slashing and general "What the hell do we do anyway?" type soul-searching, discussions seen in this article are valid and timely. We rarely get to make the final call in these matters, due to money and competing priorities. So careful weighing and presentation of these factors is often vital to access the information you seem to take for granted.

8. cshunt312 - November 30, 2010 at 04:44 pm

I am the founder of a professional community called Social Media in Organizations (SMinOrgs), and our tagline is "new tools for doing old things." I always appreciate articles that help put what seems novel and daunting and unprecedented today into historical perspective.

Thanks for the link to the Boston Globe article, @ctmathewes. I'm sharing that with the SMinOrgs Community (since access to this article is restricted).

Courtney Hunt

9. drjeff - December 02, 2010 at 06:13 pm

FYI: Ecclesiastes, according to the traditional view, was written by King Solomon. Indeed, it's hard to imagine the first paragraph could have been written by anyone else with a straight face. That would put it in the mid or late 10th century BCE (950 or later). (See Wikipedia on King Solomon for a fascinating read.)

10. smalecxl - December 06, 2010 at 11:08 pm

