Acrobat for Legal Professionals

June 28, 2007

Full Text Search of PDF using Adobe Acrobat

Lately, everyone’s been asking me to help them find themselves…

After a talk at the Missouri Solo and Small Firm conference, I chatted with a solo real estate attorney who asked for my advice on developing a searchable article archive from the materials he had collected over the years. “How do I find the articles I need?” he asked.

I also talked to a lawyer who took on a probono criminal defense case. “How can I find where my client is mentioned in all the police records I was sent?” she asked.

And, at the 2007 LegalTech West show, a workman’s compensation investigator asked how to search medical records. “How can I apply notes to these handwritten medical records and find them later?” he asked.

In this article, I’ll discuss how to use Acrobat Professional to create a full-text index so you can find what you need… fast!

Read on to learn more…

Searching Beyond Text of the Document

Acrobat can find text in the following parts of a PDF:

  • Text of the document (regular or OCR)
  • Title, Subject, Author, Keyword (metadata)
  • Notes and Annotations
  • Bookmarks
  • PDF Attachments

So, what does this mean for legal professionals?

  1. You can find words or phrases across multiple documents quickly to help you find key facts, names, places, etc. that are contained within the text of documents.
  2. You can capture your thinking about a document—in the PDF—while reviewing it using bookmarks and comment tools.
  3. You can later find documents by the notes and knowledge you’ve applied to them.

That’s powerful.

Setting up for Search

Step 1: Make sure your documents are text searchable by Acrobat

  • Use Acrobat Optical Character Recognition (OCR) if you have paper documents or image-only PDFs in your document collection. See my article on Batch OCR.
  • Convert electronic files such as word processing, spreadsheets, etc. to PDF

Step 2: Locate and Segregate Documents

Depending on the type of project you have, you may wish to move similar documents to individual directories.

For example, let’s say you have accumulated several years of legal research on trusts. You may wish to segregate the documents by state or issue.

If you are indexing client files, you may wish to index by client or perhaps even by matter.

Granularity Illustration

There’s no right or wrong way to organize your documents, but you do need to strike a balance between how much time you spend organizing your files and how easy it is to find what you need.

Create an Index

Follow these steps to create a full-text search index using Acrobat 8 Professional:

  1. Choose Advanced —>Document Processing —>Full Text Index with Catalog

    Catalog Window
    Click the New Index button
  2. The Build Index window will appear:
    Catalog Building Window1) Give the index a name
    2) Enter a description of the index
    3) Choose the directory that will be indexed. All sub-directories will be indexed.
    4) Click the Build button
  3. Acrobat will create a .pdx (index) file at the top level of the directory you specified.
    Saving the Index fileClick the Save button.
  4. The Index Progress window will appear:
    Indexing Progress Window Note that Acrobat will skip any documents which are secured with an Open password.

Attaching to the Index and Searching

Follow these steps to attach to the index you created:

  1. Choose Edit—>Search

    —or type—

    Windows: Control-Shift-F
    Macintosh: Command-Shift-F

    Acrobat will split your screen between the Search window and the Document window.
  2. In the Search window on the left, click on Advanced Search at the bottom:

    Clicking for the Advanced Search options
  3. In the Advanced Search panel, click on the Look In menu and choose Select Index.

    Attaching to the index
  4. The Index Selection window will appear.
    Click the Add button
    Attach to the Index
    Locate the the index file (.pdx) that you created earlier. Normally, Acrobat will automatically find it for you.
  5. Click OK

Searching the Index

Once you select an index, Acrobat will keep it selected so you can search against it.

In the example below, we are searching for Donald, the first name of one of the parties to the case:
Using the Advanced Search options

Click the Search button.

Acrobat will present a list of documents that match the search criteria:

Search Results List

Interpreting and Using Results

Remember that Acrobat can search not only the text of documents, but comments and bookmarks, too.

Click the + sign to view the hits on each document.

Acrobat gives you useful visual indicators in the search window:

Acrobat will select and perform a different action depending on what you double-click:

When you click on a . . .

This is what happens

Text of Document

Word is highlighted in the document

Bookmark

The bookmark panel is opened and the bookmark is highlighted

Comment

The comment is selected and opened. E.g. a text note opens

Search Tip

You can save a step searching by changing Acrobat’s defaults to always use the Advanced Search options. Go to Edit—> Preferences—> Search and click the checkbox for Always use Advanced Search Options.

Final Thoughts

In this article, you learned how to create an index to search across multiple PDF documents.

Acrobat 8 offers new indexing capabilities by allowing you to embed a full-text index in:

  • A single PDF document
  • PDF package

This gives you the ability to have an index that travels with a PDF wherever you send it.

For example, you could create a trial notebook package complete with index and give it to your colleague to take to court or mediation.

I plan to cover these capabilities in a future blog article.

Posted by Rick Borstein at 5:26 AM on June 28, 2007

Comments

Copyright © 2009 Adobe Systems Incorporated. All rights reserved.
Use of this website signifies your agreement to the Terms of Use and Online Privacy Policy (updated 07-14-2009).