Search Adobe Portable Document Format (PDF) documents files in WSS 3.0 and MOSS 2007

This is a very common scenario; you search a Microsoft Windows SharePoint Services 3.0 or MOSS 2007 Web site for content that is located in Adobe Portable Document Format (PDF) documents. However, no PDF documents are returned in the search results, even though the content that you search for is located in PDF documents in the Windows SharePoint Services 3.0 or MOSS 2007 Web site.

This issue occurs as Windows SharePoint Services Search service in Windows SharePoint Services 3.0 or MOSS 2007 is not configured to crawl PDF documents. By default, WSS 3.0 or MOSS 2007 will not index content located in Acrobat PDF files, so you need to set up the IFilter (IFilters are plugins that allow the Windows SharePoint Indexing Service to index different various file formats so that they become searchable).  Also the PDF files uploaded to the Windows SharePoint Services 3.0 or MOSS 2007 Web site prior to the installation of the IFilter won’t be re-crawled automatically.

Configuring WSS 3.0 or MOSS 2007 to search pdf documents:

  1. Download the Adobe PDF IFilter from the following Adobe Web site:

    http://www.adobe.com/support/downloads/detail.jsp?ftpID=2611

    2.       Stop the IIS Admin service. To do this, follow these steps:

                                                                   i.      Click Start, point to Administrative Tools, and then click Services.
                                                                 ii.      Right-click IIS Admin Service, and then click Stop.

    3.       Run the Adobe PDF IFilter Setup program to install the filter on the server.

    For WSS 3.0, install the PDF iFilter on servers running the WSS [SPSearch] search service.

    For MOSS 2007, install the PDF iFilter on the Index servers.

    4.       Register the Adobe PDF IFilter. To do this, follow these steps:

    Click Start, and then click Run.

    In the Open box, type cmd, and then click OK.

    Switch to the folder that contains the Adobe PDF Filter 6.0 files. For example, switch to the following folder:

    E.g. Drive: Program FilesAdobePDF IFilter 6.0

    Type regsvr32.exe pdffilt.dll and then press ENTER.

    If you receive a message that states that the operation was successful, click OK.

    Type exit and then press ENTER to exit the command prompt.

    5.       Perform this step for MOSS 2007 only (Optional for WSS 3.0)

    Start -> Program -> Microsoft Office Server -> SharePoint 3.0 Central Administration  -> <Name of Shared Service Provider> -> Search Settings -> File Types -> New File Type (Add extension  pdf here)

    6.       Add the following registry entry, and then set the registry entry value to pdf:

    To do this, follow these steps:

    Click Start, click Run, type regedit, and then click OK.

    Locate and then click the following registry subkey:

    HKEY_LOCAL_MACHINESOFTWAREMicrosoftShared ToolsWeb Server Extensions12.0SearchApplicationsGUIDGatherSearchExtensionsExtensionList

    On the Edit menu, point to New, and then click String Value.

    Type 38, and then press ENTER. Right-click the registry entry that you created, and then click Modify.

    In the Value data box, type pdf, and then click OK.

    7.       Verify that the following two registry subkeys are present and that they contain the appropriate values.

    Note: These registry subkeys and the values that they contain are created when you installed the Adobe PDF IFilter on the server.

    For WSS 3.0

    1.       HKEY_LOCAL_MACHINESOFTWAREMicrosoftShared ToolsWeb Server Extensions12.0SearchSetupContentIndexCommonFiltersExtension.pdf
    This registry subkey must contain the following registry entry:
    Name: Default
    Type: REG_MULTI_SZ
    Data: {
     E8978DA6-047F-4E3D-9C78-CDBE46041603}
    2.       HKEY_LOCAL_MACHINESOFTWAREMicrosoftShared ToolsWeb Server Extensions12.0SearchSetupFilters.pdf
    This registry subkey must contain the following registry entries:
                                                                                                             i.            Name: Default
    Type: REG_SZ
    Data: (value not set)
                                                                                                           ii.            Name: Extension
    Type: REG_SZ
    Data: pdf
                                                                                                          iii.            Name: FileTypeBucket
    Type: REG_DWORD
    Data: 0x00000001 (1)
                                                                                                         iv.            Name: MimeTypes
    Type: REG_SZ
    Data: application/pdf

    For MOSS 2007

    Modify the following Registry keys by changing their “Default” value to the new CLSID of the Adobe IFilter:  {E8978DA6-047F-4E3D-9C78-CDBE46041603}

    HKEY_LOCAL_MACHINESOFTWAREMicrosoftOffice                  

    server12.0SearchSetupContentIndexCommonFiltersExtension.pdf

    Default –> {E8978DA6-047F-4E3D-9C78-CDBE46041603}

                        HKEY_LOCAL_MACHINESOFTWAREMicrosoftShared ToolsWeb Server                          

Extensions12.0SearchSetupContentIndexCommonFiltersExtension.pdf
Default –> {E8978DA6-047F-4E3D-9C78-CDBE46041603}
8.       Start the IIS Admin service.
9.       Recycle the search service:
Click Start, click Run, type cmd, and then click OK.
For WSS 3.0
net stop spsearch at the command prompt, and then press ENTER.
net start spsearch at the command prompt, and then press ENTER.
For MOSS 2007
net stop osearch at the command prompt, and then press ENTER.
net start osearch at the command prompt, and then press ENTER.
Note: If the Windows SharePoint Services 3.0 or MOSS 2007 Web site already had PDF documents before you installed the Adobe PDF IFilter on the server, you may have to perform an additional step. After you install and configure the Adobe PDF IFilter on the server, modify a property of the existing PDF documents on the server. For example, modify a metadata property such as the title, or upload the PDF documents to the Windows SharePoint Services Web site again. When you do this, the Windows SharePoint Services Search service crawls the existing PDF documents again.
OR
                 Start a full crawl of the content so that the files get crawled again with the IFilter.

Related Post

3 thoughts on “Search Adobe Portable Document Format (PDF) documents files in WSS 3.0 and MOSS 2007”

Leave a Reply

Your email address will not be published. Required fields are marked *