Index PDF documents on SharePoint using Adobe PDF IFilter 9

Using the SharePoint Search you can find documents based on their filename, metadata or content within a document. By default the content of Office documents is indexed by the SharePoint crawler, but PDF files are not crawled. To add support for PDF files you have to add an IFilter which the SharePoint crawler uses to read through PDF files and add the information to the search index.

To obtain an IFilter for PDF you can purchase the Foxit PDF IFilter from Foxit Software. There is also a free PDF IFilter available from Adobe which does exactly what you need and what this post is all about.

 

Getting Adobe IFilter 9 to work with SharePoint

In earlier days you were required to download Adobe's IFilter as a separate file. Since Adobe 8.0 it is included within the Adobe Acrobat and Adobe Reader products. The current version of Adobe Reader is 9.0 includes an IFilter that is compatible with the latest PDF implementations.

To enable PDF indexing use the following steps:

  • Download Adobe Reader 9.0, which includes IFilter 9.0.0.0, from http://www.adobe.com/products/acrobat/
  • Download the Acrobat PDF Picture, to display in front of PDF search result items, from http://www.adobe.com/misc/linking.html
  • Add the PDF file type to the Extensions List for WSS search by editing the registry
    • Start regedit
    • Open the key HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Applications\{Random GUID}\Gather\Search\Extensions\ExtensionList
    • Add PDF to the list as a new String Value. Use a new high value e.g. if 37 is the highest value, use "38" as the key with the value "pdf"
  • Add the Acrobat PDF picture to the SharePoint templates directory. Copy the Acrobat PDF picture called pdficon_small.gif in the 12 Hive\TEMPLATE\IMAGES folder, e.g. %programfiles%\Common Files\Microsoft Shared\Web Server Extensions\12\TEMPLATE\IMAGES.
  • Bind the Acrobat PDF picture to the PDF file type
    • Open the 12 Hive\TEMPLATE\XML\DOCICON.XML file
    • Find the <DocIcons.ByExtension> part
    • Add the following mapping:
      <mapping Key="pdf" Value="pdficon_small.gif" OpenControl="" />
  • Change IFilter mapping in registry
    • Start regedit
    • Open the key HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\
    • Add (or modify) the .pdf key
    • Add a Multi-String value with value {E8978DA6-047F-4E3D-9C78-CDBE46041603} or modify if another GUID value already exists.
    • Open the key HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Office Server\12.0\Search\Setup\ContentIndexCommon\Filters\Extension\
    • Add (or modify) the .pdf key
    • Add a Multi-String value with value {E8978DA6-047F-4E3D-9C78-CDBE46041603} or modify if another GUID value already exists.
  • Add the Adobe Reader folder to the environment path variable
    • Right Click on My Computer
    • Open Properties
    • Open the Advanced tab
    • Go to the Environment variables
    • Edit the Path variable
    • Add your Reader folder to the Path list, e.g. C:\Program Files\Adobe\Reader 9.0\Reader
  • Restart the Search service by restarting your server or executing the following commands:
    • Run: net stop osearch
    • Run: net start osearch
  • Crawl the PDF documents
    • Existing PDF documents that were crawled before the Adobe PDF IFilter has been installed are not indexed during an incremental crawl. You have to edit each existing PDF file to trigger the crawler to reindex the file during an incremental crawl. It´s easier to run a full crawl after you have installed the Adobe PDF IFilter.

Now all PDF documents are crawled you can query on content inside a PDF document.

Published Thu, Oct 2 2008 3:30 PM by Harold van de Kamp

Comments

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, October 13, 2008 2:24 PM by David Weber

Beste Harold,

We hebben overlappende interesses op zakelijk niveau. Zou je mij willen contacteren via "davidweber at telfort punt nl"

Groet,

David.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, November 12, 2008 9:41 PM by Rocky

I followed these steps, and then I tried these labs.adobe.com/.../PDF_iFilter_8_-_64-bit_Support but still can't index pdf files. The Crawl logs show a message saying the file couldn't be crawled because the filter is missing. Any Ideas? Thanks.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Saturday, November 22, 2008 7:45 PM by John Kowalczyk

After searching for 3 days, this was the best most current information on how to integrate the lastest Acrobat v9 iFilter into SharePoint 2007. I kept finding old information on v5/v6 and SharePoint 2003, with everything I found stating that since v7 the iFilter is included in the Reader download, there was no information on how to implement. And although I've only completed the steps about an hour ago and cannot confirm that the PDF content is properly indexed, I can say that there are no obvious errors in the suggested steps contained in this post.

# SharePoint Adbobe PDF IFilter Index Configuration | tomfusion.com

Pingback from  SharePoint Adbobe PDF IFilter Index Configuration | tomfusion.com

# Couldn't get it going used IFilter 6 instead

Tuesday, December 02, 2008 3:07 AM by Will

Hi, I couldn't get it working, then I found MS KB 927675 for Adobe IFilter 6 and got it to work.  It is a bit simpler to configure. It does not show how to do the icon as described above though.

Searching through pdf's in SP is a beautiful thing!

Keep up the good work Kamp!

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Tuesday, December 02, 2008 6:15 PM by LarryH

I completed the steps above on my MOSS2007 32-bit instance and it worked.  I would suggest adding an iisreset to the last step after the full crawl.  The pdf icon was not showing up until the iisreset.

Thanks for the great instructions and research on this!

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, December 11, 2008 5:13 PM by Tom

Thanks, worked great for me with sharepoint server 2007

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, December 15, 2008 8:16 PM by Tinch

going to be taking care of this later this week! that's for the great information

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, January 08, 2009 11:15 PM by rod bergren

the iisreset is important. It didn't quite work as expected until I did that.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, January 22, 2009 2:11 PM by onlyme

All I can say is.... what a royal pain in the ass to have to go through!

In this day and age we should all be able to expect a simple donwload/run/finish type of operation instead of this hassle. Why can't these software giants actually pull their asses into gear and do something right for a change !!

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, January 22, 2009 2:12 PM by onlyme

oh, ps.. thanks for this info :)

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, February 05, 2009 11:37 AM by Michael

PDF Search is working fine. Very good documentation.

Thank you :)

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, February 09, 2009 7:10 PM by Matt

I tried this and adobe's documentation.  Although it indexes the file name, it does not index the content of the document.  Guess I'll go back to using the IFilter 6.0...

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, March 04, 2009 8:14 PM by Charan V

This works like a magic....Very good documentation!!!

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Friday, March 06, 2009 8:30 PM by Ariel

This works great.  Thanks very much for posting this.  Although this worked great for me on one server, for some reason the new string value in:

HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Applications\{Random GUID}\Gather\Search\Extensions\ExtensionList

... keeps dissapearing after a reboot on another server.  The result being pdf search won't work.  Any ideas?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, March 09, 2009 8:36 AM by Simon

Works for me but like one of the other comments, I loose part of it after a reboot, ie Applications\{Random GUID}\Gather\Search\Extensions\ExtensionList

Also

the search says it found a <PDF icon> "Microsoft Word" <Valid filename> ".doc" , but with the hyperlink being to the correct pdf

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, March 09, 2009 10:48 PM by aditi

I only followed these 3 steps and pdf crawling is successful:

• Downloaded Adobe Reader 9.0

• Go to Search Settings -> File Types – Add pdf

       (Note: Pdf image shows up automatically with the pdf extension.)

• Added the PDF file type to the Extensions List:

o Start regedit

o Open the key HKEY_LOCAL_MACHINE\SOFTWARE\Microsoft\Shared Tools\Web Server Extensions\12.0\Search\Applications\{GUID}\Gather\Search\Extensions\ExtensionList

o Added PDF to the list as a new String Value. Used a new high value e.g. used "38" as the key with the value "pdf".

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Tuesday, May 05, 2009 1:47 PM by Freek

Hi Harold,

The PDF icon did not show up until I changed 'mapping' into 'Mapping': "<Mapping Key="pdf" Value="pdficon_small.gif" OpenControl="" />" and did an IISRESET.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, May 18, 2009 8:53 PM by jr

I appreciate the info and have followed the steps and have done a full crawl but I still cannot search within a pdf.  I can find no errors in the logs.  I have tried rebooting and reinstalling but the pdf search still doesn't work.  Any ideas why that may be?  Envio: WindowsServer 2008, MOSS 2007 Enterprise Edition, Adobe 9.0

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, May 20, 2009 9:46 PM by Geoff

After chasing my tail for a few hours and also not being able to get the content of PDFs to show up, it turned out the PDFs I was indexing were larger than the default sizes allowed by Search Server. Once I adjusted the MaxDownloadSize and MaxGrowFactor reg keys they started showing up.

Check out:

support.microsoft.com/.../318747

support.microsoft.com/.../927675

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Saturday, May 23, 2009 5:49 AM by Nerble

I can confirm that the difference in GUIDs between the Adobe PDF filter earlier versions and the one used by version 9 (E8978DA6-047F-4E3D-9C78-CDBE46041603) is indeed the cause of my particular manifestation of this problem.  I write this comment to hopefully save some of you other sysadmins some pain by pointing out that a full server bounce is required before the fix takes effect... just cycling the services involved doesn't do the trick.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Friday, June 05, 2009 3:40 PM by Lise

I've followed the instructions and my pdfs are now indexed and searchable. But the document properties for pdfs are not indexed. I want to show the last-modified date of the documents in my search result site, but how to find it? In the crawled properties section I see several folders, among others office and web, and I was told that crawled properties of pdfs were supposed to be in a folder named pdf. There are no such folder... Any ideas?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, June 18, 2009 10:24 AM by alanc

What do you mean by:

"Add a Multi-String value with value ... or modify if another GUID value already exists."

?

When I look in my registry keys for .pdf, they already have 4 entries inside, including a Multi-String.

So should I rename the existing Multi-String or create a new one, and either way, what am I supposed to call it?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Friday, June 19, 2009 8:05 AM by alanc

Doh! Was looking in the wrong key, which was inconveniently close to the one you specified.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Tuesday, July 07, 2009 11:16 PM by Pankaj

You are the best. I knew all the steps earlier and tried them exactly in the order last week but couldnt get it to work. I did them again today following youur article and voila!! it worked..i guess this blog's my lucky charm..Thanks a lot!!keep blogging!!!

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Friday, July 17, 2009 12:43 AM by Dave Chung

I am trying to get the metadata to populate in my searches after installing Adobe PDF iFilter v6.  I can see the file along with the icon but cannot see any of the metadata surrounding the pdf file.  I have installed in my QA Environment and it actually shows metadata....so I don't know how to explain it.  Can you help please?!!!!

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, September 07, 2009 12:48 PM by johnm

I followed these instructions (thanks!) and most of my pdfs are now searchable.

Unfortunately even though it says it has crawled all of them some pdfs do not appear in the search results.

All I can gather so far is that in the non-searchable ones Properties - PDF - PDF version = 1.3

While the searchable ones have PDF version = 1.4

Anyhing I can do about this?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, September 16, 2009 8:41 AM by Manasi

Please can anyone tell whether Adobe IFilter 9.0 searches metadata properties of PDF?? Please help..

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, September 16, 2009 11:30 AM by Manasi

Can anyone please tell if Adobe IFilter 9.0 supports searches for metadada of PDF

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, September 16, 2009 11:42 AM by Manasi

Can you please tell if Adobe IFilter 9.0 supports xmp metadata search of PDF?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, September 23, 2009 1:10 PM by Manasi

Hi,

Thanx for a great blog.I tried the above steps but unfortunately, couldnt search through PDF. can anyone please guide to resolve this issue?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Tuesday, December 08, 2009 7:15 PM by stan

Thanks very much!

Worked great for me with one little modification.

At the end, instead of "net stop osearch" and net start osearch, I need to run "net stop spsearch" and "net start spsearch".  

# Sharepoint 2007 MOSS PDF Indexing

Sunday, January 17, 2010 2:07 PM by Sharepoint @ decatec

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, January 25, 2010 3:45 AM by Dileep Jose

this is just what I wanted. Great blog

bitofthought.blogspot.com

Dj

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, March 18, 2010 1:39 PM by Joyce Slavik

Terrific post. Worked great on my WSS 3.0 SharePoint server running Windows 2008.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Friday, March 26, 2010 3:37 PM by Mike Jackson

While the indexing is working, I am still having trouble extracting the metadata from pdf to populate SharePoint.  Any ideas?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, April 19, 2010 4:01 PM by pxpx

I am still not able to go this working.  I am using Moss 2007 32bit, so I am using the Adobe Ifilter 6.0 because 9.0 is only for 64bit.  I followed all of the instructions that you had as well as Microsoft's KB article support.microsoft.com/.../927675

In the Microsoft KB Article i think the GUID they give is incorrect.  When I scanned the registry for acrordif.dll the GUID they give was not there but one very close.  

{8215BA54-B69F-4275-AE11-31CB6359EB09} instead of the listed {8315BA54-B69F-4275-AE11-31CB6359EB09}

I have rebooted numerous times any assistance is greatly appreciated.  

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, May 05, 2010 5:30 PM by HedRat

In answer to the messages asking for metadata from PDF files for Sharepoint, you can connect the script here (www.dur.ac.uk/.../index.html) directly to your Sharepoint store and get a text file of everything.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, May 20, 2010 1:26 AM by Adnan Ahmed

Hi All,

I have published an article at www.mossgurus.com/.../default.aspx explaining How to Install and configure PDF iFilter (64 bit) for SharePoint Server 2010 or Search Server 2010 Express with screen shots of each important steps.

I do hope that you will find this article useful. Please leave your comments.

Thanks

Adnan Ahmed

Senior MS Solutions Consultant

Owner: www.mossgurus.com

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, July 14, 2010 6:26 PM by doitright

grrrreeeeatt! thank you, all steps clear and right

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, August 05, 2010 4:26 PM by Jake

I also had to do the following:

regsvr32 AcroRdIF.dll (for Adobe Reader 9)

Add "pdf" to file types in search administration (for search server)

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, August 11, 2010 1:19 PM by elliptical for sale

I find it difficult to subscribe RSS feeds, bookmark this site anyway I have is a very useful and complete information.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, August 11, 2010 1:21 PM by elliptical for sale

I find it difficult to subscribe RSS feeds, bookmark this site anyway I have is a very useful and complete information.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, August 12, 2010 10:22 AM by Ivan Wilson

Great stuff - this worked for me on a Windows 2003 32-bit server, SharePoint 2007 and Adobe Reader 9.0

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, October 11, 2010 3:00 PM by Tina

On one farm this works like charm, but on the other it doesn't. Do I need to restart the server, I have done iisreset, but still can't get the pdf crawled.

ekhichdi.com/.../filtering-process-could-not-process-this-item-WSS-3.0-107.html

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, October 28, 2010 7:34 AM by Download windows 7 themes free

Kudos to you! This is a really good blog here and I love your style of writing. How did you get so good at blogging?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Friday, December 24, 2010 5:32 AM by school management system

There is obviously a lot for me to discover outside of my books. Thanks for the great read and merry xmas. :)

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, January 05, 2011 11:23 PM by Keith V

This is very well written and helpful.  I am having an issue with searching content within the .pdf files, though.  They aren't being searched.  They're small files, simple.  And search returns content from Word docs.

Possibility: I got iFilter 10 from Adobe, and that's what I installed.  Would that have a different GUID from the ones listed for Version 9?

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Wednesday, January 19, 2011 11:09 AM by PsYkO

I have the same problem than Keith V.

I can't find anymore the acrobat reader v9.0 so i installed the v10.0 but it doesn't work... (i think the problem is coming from the GUID like he said...)

Please help...

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Saturday, February 05, 2011 5:40 PM by foods with fiber

thanks, loved it. It really added to my knowledgei'll surely visit back again later.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Saturday, February 05, 2011 5:46 PM by foods with fiber

thanks, loved it. It really added to my knowledgei'll surely visit back again later.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Tuesday, March 01, 2011 9:20 PM by Bob

Has anyone been able to search using keywords meta property?

Also, Subject property is crawled and searchable, but can not map the property in Central Admin and subsequently to include it in search results or use as a facet.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Sunday, March 06, 2011 7:35 PM by wedding invitations

thanks for such a wonderful blog

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Sunday, March 06, 2011 7:36 PM by sell annuity

Can you suggest me the template you are using in your blog, and appreciatable content.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Monday, March 07, 2011 12:26 PM by sniperspy review

Thanks for such a wonderful blog

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Tuesday, March 08, 2011 8:44 PM by Griggster

Worked brilliantly thank you. I just wanted to mention I did orginally have Adobe Reader X installed and reverted back to Adobe Reader 9.4 for pdf content to be crawled inside of documents on a x86 platform.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, March 24, 2011 11:14 PM by AngryAdmin

This procedure worked for me first try.  I am on a MS 2003 server using Sharepoint 7.  Great information.  

One curiousity though... is the GUID you use {E8978DA6-047F-4E3D-9C78-CDBE46041603}  of any significance or is it just random?

Thanks again for the fix.

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Tuesday, September 13, 2011 9:09 AM by cheap necklaces

Buy cheap 14K Gold Chains,Women's Jewely,Men's Gold Jewelry,Children's Jewelry,Bracelet,Pendant,Earring. www.gogulive.com

<a href="http://www.gogulive.com"target="_blank">cheap gold chain</a>

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Tuesday, September 20, 2011 10:26 PM by miro

It is also possible with the recognized text indexing of PDF document. PDF is indexed as TXT file.

eg. www.miromr.admis.sk/pdftosp

Miro

# re: Index PDF documents on SharePoint using Adobe PDF IFilter 9

Thursday, May 31, 2012 8:19 AM by Anonymous

to get good windows 7 ,office activation code here:

http://www.keyyeah.com/

# wholesale jordans

Tuesday, August 07, 2012 5:29 AM by wholesale jordans

Hey, I just hopped over to your site via StumbleUpon.  Not somthing I would normally read, but I liked your thoughts none the less.  Thanks for making something worth reading.

# Pdf Converter

Wednesday, April 24, 2013 8:12 PM by Vinod Manammal

We pick deskUNPDF PDF Converter as the best PDF converter software because it does exactly what you’d expect it to do. More information please visit pdf-to-image-converter.net

Leave a Comment

(required) 
(required) 
(optional)
(required) 
Please add 1 and 6 and type the answer here: