Acrobat for Legal Professionals

May 17, 2008

Generating TIFF and Text files from PDF for Concordance and Summation

Adobe is the custodian for both PDF and TIFF (Tagged Image File Format) formats.

While PDF is superior in many ways, TIFF remains a popular format for use in large case litigation support systems such as Concordance and Summation.

If your production contains a lot of PDFs, working with these systems can be a challenge: they do not robustly support PDF, so conversion is necessary. These systems want to ingest a . . .

  • TIFF file to represent each individual document page
  • TEXT file of the text of each page

Processing several hundred documents to individual TEXT and TIFF files is a candidate for some serious automation!

Fortunately, repetitive tasks like this can be easily accomplished using Acrobat Professional. Since Acrobat can be automated using JavaScript, it is possible to string together several steps and save a lot of time.
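To give a flavor of what "stringing together several steps" can look like, here is a minimal sketch. It is not the batch script offered in this article, just an illustration under a few assumptions. The helper names (pad, pageText, planPages, exportPageAsTiff) and the BASENAME_0001 naming scheme are hypothetical. The Acrobat calls used (numPages, getPageNumWords, getPageNthWord, extractPages, saveAs with the "com.adobe.acrobat.tiff" conversion ID, closeDoc) are from the Acrobat JavaScript API as I understand it, and the save step needs a privileged context such as a batch sequence.

```javascript
// Hypothetical helper: zero-pad a page number, e.g. 7 -> "0007".
function pad(n, width) {
  var s = String(n);
  while (s.length < width) {
    s = "0" + s;
  }
  return s;
}

// Gather the text of one page (0-based) word by word.
// Passing false as the third argument asks Acrobat to keep each word's
// trailing punctuation and whitespace, so joining with "" approximates
// the original text.
function pageText(doc, pageNum) {
  var count = doc.getPageNumWords(pageNum);
  var words = [];
  for (var i = 0; i < count; i++) {
    words.push(doc.getPageNthWord(pageNum, i, false));
  }
  return words.join("");
}

// Build a plan with one TIFF name and one text file name per page.
// The naming scheme is an assumption, not a Concordance or Summation rule.
function planPages(doc, baseName) {
  var plan = [];
  for (var p = 0; p < doc.numPages; p++) {
    var stem = baseName + "_" + pad(p + 1, 4);
    plan.push({
      page: p,
      tiff: stem + ".tif",
      textFile: stem + ".txt",
      text: pageText(doc, p)
    });
  }
  return plan;
}

// Pull a single page into a temporary document and save it as TIFF.
// Only meaningful inside Acrobat; tiffPath is assumed to be a full,
// device-independent path.
function exportPageAsTiff(doc, pageNum, tiffPath) {
  var pageDoc = doc.extractPages({ nStart: pageNum, nEnd: pageNum });
  pageDoc.saveAs({ cPath: tiffPath, cConvID: "com.adobe.acrobat.tiff" });
  pageDoc.closeDoc(true); // close the temporary document without saving changes
}
```

Inside a batch sequence, where this refers to the document being processed, a script along these lines would call planPages(this, someBaseName), then loop over the plan, calling exportPageAsTiff for each entry and writing each entry's text to its own file.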

In this article, I've included a downloadable Tiff-Text Processing Batch Script that handles all of this conversion automatically. Here are the results:

Sample Results

More…

02:56 AM

May 08, 2008

Extracting Non-Sequential Pages: Another Method

In my last entry, I discussed one method to Extract a Non-Sequential Range of Pages from a PDF.

One of my bosses always told us that our customers are smarter than we are.

I was reminded of this recently when I received this note from James Strupp of the Federal Office of the Public Defender:

A far easier and more direct way to do this, in my view, is to create a new PDF from "multiple files", even if you are just extracting pages from one file.

Doh! That's a great idea! James even sent along his detailed instructions. I've added some screen captures and additional detail of my own.

Read on to learn about this alternate method to extract pages.

More…

05:25 AM