So the guy arrested for stealing 4,000,000 documents from JSTOR also downloaded 20,000,000 pages from PACER

July 20th, 2011

From the Times in 2009:

To Mr. Malamud, putting the nation’s legal system behind a wall of cash and kludge separates the people from what he calls the “operating system for democracy.” So, using $600,000 in contributions in 2008, he bought a 50-year archive of papers from the federal appellate courts and placed them online. By this year, he was ready to take on the larger database of district courts.

Those courts, with the help of the Government Printing Office, had opened a free trial of Pacer at 17 libraries around the country. Mr. Malamud urged fellow activists to go to those libraries, download as many court documents as they could, and send them to him for republication on the Web, where Google could get to them.

Aaron Swartz, a 22-year-old Stanford dropout and entrepreneur who read Mr. Malamud’s appeal, managed to download an estimated 20 percent of the entire database: 19,856,160 pages of text.

Then on Sept. 29, all of the free servers stopped serving. The government, it turns out, was not pleased.

A notice went out from the Government Printing Office that the free Pacer pilot program was suspended, “pending an evaluation.” A couple of weeks later, a Government Printing Office official, Richard G. Davis, told librarians that “the security of the Pacer service was compromised. The F.B.I. is conducting an investigation.”

Lawyers for Mr. Malamud and Mr. Swartz told them that they appeared to have broken no laws, noting nonetheless that it was impossible to say what angry government officials might do.

At the administrative office of the courts, a spokeswoman, Karen Redmond, said she could not comment on the fate of the free trial of Pacer, or whether there had been a criminal investigation into the mass download.

The free program “is not terminated,” Ms. Redmond said. “We’ll just have to see what happens after the evaluation.” As for the system’s cost, she said: “We’re about as cheap as we can get it. We’re talking pennies a page.”

Swartz’s indictment is available here.

If you’re interested in better access to PACER and court documents, check out RecapTheLaw.org and this free Firefox extension.

RECAP is an extension (or “add on”) for the Firefox web browser that improves the PACER experience while helping PACER users build a free and open repository of public court records. RECAP users automatically donate the documents they purchase from PACER into a public repository hosted by the Internet Archive. And RECAP saves users money by alerting them when a document they are searching for is already available from this repository. RECAP also makes other enhancements to the PACER experience, including more user-friendly file names.