Legal
The views expressed in this blog are my own and do not necessarily reflect the views of Adobe Systems Incorporated.
Search
May
| S | M | T | W | T | F | S |
|---|---|---|---|---|---|---|
| 1 | 2 | 3 | ||||
| 4 | 5 | 6 | 7 | 8 | 9 | 10 |
| 11 | 12 | 13 | 14 | 15 | 16 | 17 |
| 18 | 19 | 20 | 21 | 22 | 23 | 24 |
| 25 | 26 | 27 | 28 | 29 | 30 | 31 |
Archives
- June 2008
- May 2008
- April 2008
- March 2008
- February 2008
- January 2008
- December 2007
- November 2007
- October 2007
- September 2007
- August 2007
- July 2007
- June 2007
- May 2007
- April 2007
- March 2007
- February 2007
- January 2007
- December 2006
- November 2006
- October 2006
- September 2006
- August 2006
- June 2006
- May 2006
- February 2006
- January 2006
- December 2005
- November 2005
- October 2005
May 17, 2008
Generating TIFF and Text files from PDF for Concordance and Summation
Adobe is the custodian for both PDF and TIFF (Tagged Image File Format) formats.
While PDF is superior in many ways, TIFF remains a popular format for use in large case litigation support systems such as Concordance and Summation.
If you have a lot of PDFs in your production it can be a challenge to work with these systems as they do not robustly support PDF and conversion is necessary. These systems want to ingest a . . .
- TIFF file to represent each individual document page
- TEXT file of the text of each page
Processing several hundred documents to individual TEXT and TIFF files is a candidate for some serious automation!
Fortunately, repetitive tasks like this can be easily accomplished using Acrobat Professional. Since Acrobat can be automated using JavaScript, it is possible to string together several steps and save a lot of time.
In this article, I've included a Tiff-Text Processing Batch Script to download which handles all of this conversion automatically. Here are the results:
May 08, 2008
Extracting Non-Sequential Pages- Another Method
In my last entry, I discussed one method to Extract a Non-Sequential Range of Pages from a PDF.
One of my bosses always told us that our customers are smarter than we are.
I was reminded of this recently when I received this note from James Strupp of the Federal Office of the Public Defender:
A far easier and more direct way to do this, in my view, is to create a new PDF from "multiple files", even if you are just extracting pages from one file.
Doh! That's a great idea! Jame even sent along his detailed instructions. I've added some screen captures and additional detail of my own.
Read on to learn about this alternate method to extract pages.