Forum

Posted by
Martin Pedersen  -  April 2010
Hi guys,


We are experiencing some problems with PDF's on our intranet.

As the search crawler attempts to index PDF-files we get this error in our debuglog:

10:12:29  : Crawler 6136: Error: May not be a PDF file (continuing anyway)
 0d 1h11m : Crawler 6136: Error: PDF file is damaged - attempting to reconstruct xref table...
          : Crawler 6136: Error: Couldn't find trailer dictionary
          : Crawler 6136: Error: Couldn't read xref table

I know there is a patch for this posted for version 5.0.403, I assume that this patch is included in ..449? Is there any other way around this problem?


cheers
/martin
 
Posted by
Martin Pedersen  -  April 2010

I know there is a patch for this posted for version 5.0.403, I assume that this patch is included in ..449? Is there any other way around this problem?


Just a note: Sadly we are running Roxen on Windows servers.
 
Posted by
Erik Allemann  -  April 2010
Hi guys,


We are experiencing some problems with PDF's on our intranet.

As the search crawler attempts to index PDF-files we get this error in our debuglog:

10:12:29  : Crawler 6136: Error: May not be a PDF file (continuing anyway)
 0d 1h11m : Crawler 6136: Error: PDF file is damaged - attempting to reconstruct xref table...
          : Crawler 6136: Error: Couldn't find trailer dictionary
          : Crawler 6136: Error: Couldn't read xref table


The error is probably harmless. We've experienced this type of error message on our testbox outputted from pdf2html, but the files get indexed anyway.

Are some PDFs not being indexed?


I know there is a patch for this posted for version 5.0.403, I assume that this patch is included in ..449? Is there any other way around this problem?


The PDF patch in release2 is included in release3.
 
Posted by
Martin Pedersen  -  April 2010
Thanks for the swift answer Eric. Good to know that it's harmless. :)


Are some PDFs not being indexed?


I will look into that. Our biggest problem right now is that the server is running out of memory and restarting on a regular basis. Typicly a couple of times a day. Probably more related to my other post in this forum though..
 
1
Search this thread: