From: sklett on
I need to bang out a quick application to extract CCITT compressed TIF
images from a ton of PDFs. I've used PDFSharp in the past to work with PDFs
but it doesn't have support for the PDF /CCITTFaxDecode filter.

I've googled for the obvious terms to try to find some code samples or
information about how to accomplish what I want but haven't found anything
at all.
If anyone here has experience working with PDFs and extracting TIFFs,
could you possibly help with the following questions? (Some are very newbie
questions.)

1) Does PDF store image data in a special PDF format or wrapped in any
other objects? In other words, I thought I would just be able to write the
image stream from the PDF to disk and it would result in a TIFF image (hah!)
but this isn't the case. Is the image data wrapped in an additional format?

2) Anyone know of any (free) libraries that can decompress the CCITT codec?
I'm not sure if it's group 3 or 4, I imagine that is in the header of the
image data?

Any info greatly appreciated,
Steve


From: sklett on
I've found and read the relevant sections of the PDF specification.
It appears that the data is NOT wrapped in any additional structures, that
it's basically a stream of CCITT G3/G4 encoded data.
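
On the Group 3 vs. Group 4 question: the raw stream carries no header, so the
variant has to come from the /K entry of the stream's /DecodeParms dictionary
(negative means Group 4, zero means one-dimensional Group 3, positive means
mixed Group 3; the default is 0). Here is a minimal sketch of that mapping onto
the TIFF Compression and T4Options tag values. The function name is made up
for illustration, and Python is used only to keep the sketch short.

```python
def tiff_compression_for_k(k=0):
    """Map a PDF /K value to (TIFF Compression value, T4Options value).

    Hypothetical helper for illustration; /K defaults to 0 in PDF.
    """
    if k < 0:
        return 4, 0   # CCITT Group 4 (T.6), pure two-dimensional coding
    if k == 0:
        return 3, 0   # CCITT Group 3 (T.4), one-dimensional coding
    return 3, 1       # CCITT Group 3 (T.4), mixed coding: T4Options bit 0 set


print(tiff_compression_for_k(-1))
```

Other /DecodeParms entries such as /EncodedByteAlign and /BlackIs1 can also
affect how the data must be read; they are not handled here.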

My challenge now is how to handle that encoded data. I'm wondering if I can
create an Image object from a MemoryStream (stream from PDF) then save with
the proper encoding?
If anyone has experience decoding CCITT-coded data I would still really
appreciate any tips or help.
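
One approach that might avoid decoding altogether: since TIFF can hold CCITT
data as-is, the raw stream can be prefixed with a small TIFF header and saved
to disk. Below is a rough sketch under several assumptions: the stream is a
single strip, width and height come from the image dictionary (/Width and
/Height), and the photometric value 0 (WhiteIsZero) suits the data. If the
result looks inverted, /BlackIs1 is the first thing to check. The function
name and parameters are invented for this example.

```python
import struct

SHORT, LONG = 3, 4  # TIFF field type codes


def wrap_ccitt_in_tiff(data, width, height, compression=4,
                       photometric=0, t4_options=0):
    """Prefix raw CCITT data with a minimal little-endian TIFF header.

    Hypothetical helper: compression is 3 (Group 3) or 4 (Group 4).
    """
    tags = [
        [256, LONG, width],         # ImageWidth
        [257, LONG, height],        # ImageLength
        [258, SHORT, 1],            # BitsPerSample: bilevel
        [259, SHORT, compression],  # Compression
        [262, SHORT, photometric],  # PhotometricInterpretation
        [273, LONG, 0],             # StripOffsets: filled in below
        [278, LONG, height],        # RowsPerStrip: whole image in one strip
        [279, LONG, len(data)],     # StripByteCounts
    ]
    if compression == 3:
        tags.append([292, LONG, t4_options])  # T4Options, Group 3 only

    # Header (8) + entry count (2) + 12 bytes per entry + next-IFD offset (4)
    data_offset = 8 + 2 + 12 * len(tags) + 4
    tags[5][2] = data_offset

    out = struct.pack("<2sHI", b"II", 42, 8)   # byte order, magic, IFD offset
    out += struct.pack("<H", len(tags))
    for tag, field_type, value in tags:
        out += struct.pack("<HHII", tag, field_type, 1, value)
    out += struct.pack("<I", 0)                # no further IFDs
    return out + data


# Usage with placeholder bytes standing in for a real PDF image stream
tiff_bytes = wrap_ccitt_in_tiff(b"\x00\x01\x02", width=1728, height=100)
print(len(tiff_bytes))
```

The same byte layout should port directly to a BinaryWriter over a
MemoryStream in .NET; whether a given viewer accepts the result depends on
the stream's other decode parameters.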

I will post back if I can get this working.



