Showing posts with label The filtering process could not be initialized. Verify that the file extension is a known type and is correct. Show all posts
Showing posts with label The filtering process could not be initialized. Verify that the file extension is a known type and is correct. Show all posts

15 November, 2011

The filtering process could not be initialized. Verify that the file extension is a known type and is correct

Presently I am working on one search issue which is severity-1 and working towards the resolution. Here is my analysis and research that I did on this issue. If anybody has any suggestions then please append the same. I would be more than HAPPY to implement and discuss the same so that SharePoint community will benefit the same. Thank you

This is the site link: URL which is affected and reflecting in the crawl logs: /sites/ccas/applications/secore/shared%20documents/settlements/projects/2010/r1/20091005-%20ccas-so1%20owni%20gw%20correction/project%20plan/200919590_plan_owni_correction.xls

Error Message: The filtering process could not be initialized. Verify that the file extension is a known type and is correct

Problem Description: Search is not working and getting the above mentioned error message. Its happening only for one files i.e. .XLS

Action Taken:
1) Checked the file extension is added or not (How?-Please refer the following steps)
+Central administration
+Shared Service provided
+Search settings
+File Extensions
Reference: http://technet.microsoft.com/en-us/library/cc262700(office.12).aspx


2) Checked file name has any invalid characters but its proper one and no issues with it.

3) Checked IFilter which was released after the initial release of SP2007
reference: http://www.microsoft.com/downloads/en/details.aspx?FamilyId=60C92A37-719C-4077-B5C6-CAC34F4227CC&displaylang=en


4) This is not PPTX/any 2007 document so thats why i haven't checked this(Link to Microsoft 2010 Filter Packs): http://www.microsoft.com/downloads/en/details.aspx?FamilyID=5cd4dcd7-d3e6-4970-875e-aba93459fbee&displaylang=en

5) The document does not consist of any embedded links as per this reference:
http://blogs.msdn.com/b/russmax/archive/2010/11/11/sharepoint-2007-search-trying-to-crawl-office-documents-that-contain-embedded-links.aspx


6) Resetting the crawl index and run a full crawl is not a feasible solution as we cannot do this in production environment.

7) As per this link: http://social.msdn.microsoft.com/Forums/en-US/sharepoint2010general/thread/2906b07f-291d-4ac0-8b3f-d47166cb309d/
We cannot disable the search service and reset & configure everything

The only thing that we can try at this moment is: convert the document to .XLSX and do incremental crawl and check the results
reference: http://social.technet.microsoft.com/Forums/en-US/sharepointsearch/thread/9ee200bd-94c9-43c9-b329-e15b09c6f0f3/