> Computers > Peripherals > Scanners
Various Topics Home | Disclaimer | Report Adult Posts

Various Topics on Scanners



Scanners - "Best common file format to use to create PDFs?" in Peripherals


Old 05-30-2006   #1
..k
 
Default Best common file format to use to create PDFs?

> "Zak" <duff@nomail.invalid> schreef in bericht
>>
>>I use XP.
>>
>> I scan a do***ent to a graphics file (eg jpg, gif, bmp, etc)
>> Then I make a PDF from the graphics file.
>>
>> I see that if I save a large bitmap (BMP) and then create a PDF
>> then the PDF is no bigger than if I had compressed the image to a
>> GIF and used that to create a PDF file.
>>
>> In order to preserve quality, avoid artefacts and interference
>> patterns, would it be better to save to a large and detailed
>> intermediate file like a BMP (or even a jpeg) or to save to a
>> small lossless file like a GIF or TIF?



On 26 May 2006, Nils<bla@bla.com> wrote:
>
> Afaik, GIF is not used as a graphical format inside PDF. It is
> probably compressed as a "TIFF" with Cittfax or LZW compression.
> "TIFF" is between brackets, because the embedded stream is also not
> a complete TIFF file, just contains the compressed graphics and
> some extra information like scansize, colordepth, color channels,
> etc.
>
> What happens under the hood in your PDF creation really depends on
> the PDF engine you're using. Many engines actually resize your
> graphics to match the PDF DPI resolution. If you're an experienced
> programmer you could try to generate the PDF yourself, with the
> images in full resolution. The PDF specification is open and can be
> found on the Adobe website.
>
> Nils



Hi Nils and others. I understand now that when I create a PDF from a
image file that the format of the image file is not used inside the
PDF. Instead some other format is used in the PDF (which Nils kindly
suggests may be a specialized form of TIFF).

It is this conversion from my image file format to the internal PDF
format which I want to be done smoothly. I am on XP and I am
wondering if it is better to start with a GIF or a JPG or BMP or
whatever to feed into my PDF creation utility.

I should say that I am starting with a hard copy of a do***ent
created on a word processor. I want to avoid artefacts, unecessarily
jagged lines, moire effects and all that stuff which might come from
transforming from an "awkward2 image format to a PDF.

My PDFs will be for public distribution. I have preferred to scan to
a GIF file rather than a TIFF because I have ***umed that when I
circulate the basic image file among certain people that the best
balance between image size and the best chance of them being able to
see the file is a GIF.

To me TIFF feels a bit specialized. For example, I never see a web
page with TIFF images but I see lots of pages with GIFs.

Also there seem to be various compression options for a TIFF (group 3
or 4, LZW, JPEG deflate, none) which might makes it even harder for
me to know what to choose as a common format! The Wikipedia says
do***ents are often scanned to TIFF group 4 but is that something
which has the best chance of being seen on various PCs in various
organisations that I might need to send it to?
 
Old 05-30-2006   #2
....
 
Default Re: Best common file format to use to create PDFs?

"Zak" <duff@nomail.invalid> wrote in message
news:Xns97D3A7EDC887F64A18E@127.0.0.1...
>> "Zak" <duff@nomail.invalid> schreef in bericht
>>>
>>>I use XP.
>>>
>>> I scan a do***ent to a graphics file (eg jpg, gif, bmp, etc)
>>> Then I make a PDF from the graphics file.
>>>
>>> I see that if I save a large bitmap (BMP) and then create a PDF
>>> then the PDF is no bigger than if I had compressed the image to a
>>> GIF and used that to create a PDF file.
>>>
>>> In order to preserve quality, avoid artefacts and interference
>>> patterns, would it be better to save to a large and detailed
>>> intermediate file like a BMP (or even a jpeg) or to save to a
>>> small lossless file like a GIF or TIF?

>
>
> On 26 May 2006, Nils<bla@bla.com> wrote:
>>
>> Afaik, GIF is not used as a graphical format inside PDF. It is
>> probably compressed as a "TIFF" with Cittfax or LZW compression.
>> "TIFF" is between brackets, because the embedded stream is also not
>> a complete TIFF file, just contains the compressed graphics and
>> some extra information like scansize, colordepth, color channels,
>> etc.
>>
>> What happens under the hood in your PDF creation really depends on
>> the PDF engine you're using. Many engines actually resize your
>> graphics to match the PDF DPI resolution. If you're an experienced
>> programmer you could try to generate the PDF yourself, with the
>> images in full resolution. The PDF specification is open and can be
>> found on the Adobe website.
>>
>> Nils

>
>
> Hi Nils and others. I understand now that when I create a PDF from a
> image file that the format of the image file is not used inside the
> PDF. Instead some other format is used in the PDF (which Nils kindly
> suggests may be a specialized form of TIFF).
>
> It is this conversion from my image file format to the internal PDF
> format which I want to be done smoothly. I am on XP and I am
> wondering if it is better to start with a GIF or a JPG or BMP or
> whatever to feed into my PDF creation utility.
>
> I should say that I am starting with a hard copy of a do***ent
> created on a word processor. I want to avoid artefacts, unecessarily
> jagged lines, moire effects and all that stuff which might come from
> transforming from an "awkward2 image format to a PDF.


You can create a very clean PDF directly from a Microsoft Word Do***ent
(.doc).
There are programs that act like a printer that creates a PDF, just by
"printing a PDF".

PDF Create! is one such program.

Just search Google for "microsoft word print pdf" without the quotes.
You will get lots of responses.

--
CSM1
http://www.carlmcmillan.com
--

>
> My PDFs will be for public distribution. I have preferred to scan to
> a GIF file rather than a TIFF because I have ***umed that when I
> circulate the basic image file among certain people that the best
> balance between image size and the best chance of them being able to
> see the file is a GIF.
>
> To me TIFF feels a bit specialized. For example, I never see a web
> page with TIFF images but I see lots of pages with GIFs.
>
> Also there seem to be various compression options for a TIFF (group 3
> or 4, LZW, JPEG deflate, none) which might makes it even harder for
> me to know what to choose as a common format! The Wikipedia says
> do***ents are often scanned to TIFF group 4 but is that something
> which has the best chance of being seen on various PCs in various
> organisations that I might need to send it to?

--
CSM1
http://www.carlmcmillan.com
--


 
Old 05-30-2006   #3
..k
 
Default Re: Best common file format to use to create PDFs?

On 30 May 2006, CSM1<nomoremail@nomail.com> wrote:

> You can create a very clean PDF directly from a Microsoft Word
> Do***ent (.doc).
> There are programs that act like a printer that creates a PDF, just
> by "printing a PDF".
>
> PDF Create! is one such program.
>
> Just search Google for "microsoft word print pdf" without the
> quotes. You will get lots of responses.



The do***ents are not written by me. They have been sent to me so they
are in hard copy form and need scanning.
 
Old 05-30-2006   #4
.... ..Track..
 
Default Re: Best common file format to use to create PDFs?

Why not let a program like PDF-Tools take care of the problems for you -
this will scan direct to PDF for you without the intermediate image
process - all you need to do is make you decision's regarding
optimisation/compression.

you can try it here within the PDF-XChange PRO package (not standard or Lite
versions) - until licensed you will get demo watermarks in the top
right/left corner of each page which do add about 4kb to each page.

http://www.docu-track.com/downloads/users/

--
Best Regards

John Verbeeten
Tracker Software Products
PDF-XChange & SDK, Image-XChange SDK,
PDF-Tools & SDK, TIFF-XChange & SDK, DocuTrack.
Email : johnV@docu-track.com
Support: http://www.docu-track.com/forum/index.php
Web site : http://www.docu-track.com
"Zak" <duff@nomail.invalid> wrote in message
news:Xns97D3A7EDC887F64A18E@127.0.0.1...
>> "Zak" <duff@nomail.invalid> schreef in bericht
>>>
>>>I use XP.
>>>
>>> I scan a do***ent to a graphics file (eg jpg, gif, bmp, etc)
>>> Then I make a PDF from the graphics file.
>>>
>>> I see that if I save a large bitmap (BMP) and then create a PDF
>>> then the PDF is no bigger than if I had compressed the image to a
>>> GIF and used that to create a PDF file.
>>>
>>> In order to preserve quality, avoid artefacts and interference
>>> patterns, would it be better to save to a large and detailed
>>> intermediate file like a BMP (or even a jpeg) or to save to a
>>> small lossless file like a GIF or TIF?

>
>
> On 26 May 2006, Nils<bla@bla.com> wrote:
>>
>> Afaik, GIF is not used as a graphical format inside PDF. It is
>> probably compressed as a "TIFF" with Cittfax or LZW compression.
>> "TIFF" is between brackets, because the embedded stream is also not
>> a complete TIFF file, just contains the compressed graphics and
>> some extra information like scansize, colordepth, color channels,
>> etc.
>>
>> What happens under the hood in your PDF creation really depends on
>> the PDF engine you're using. Many engines actually resize your
>> graphics to match the PDF DPI resolution. If you're an experienced
>> programmer you could try to generate the PDF yourself, with the
>> images in full resolution. The PDF specification is open and can be
>> found on the Adobe website.
>>
>> Nils

>
>
> Hi Nils and others. I understand now that when I create a PDF from a
> image file that the format of the image file is not used inside the
> PDF. Instead some other format is used in the PDF (which Nils kindly
> suggests may be a specialized form of TIFF).
>
> It is this conversion from my image file format to the internal PDF
> format which I want to be done smoothly. I am on XP and I am
> wondering if it is better to start with a GIF or a JPG or BMP or
> whatever to feed into my PDF creation utility.
>
> I should say that I am starting with a hard copy of a do***ent
> created on a word processor. I want to avoid artefacts, unecessarily
> jagged lines, moire effects and all that stuff which might come from
> transforming from an "awkward2 image format to a PDF.
>
> My PDFs will be for public distribution. I have preferred to scan to
> a GIF file rather than a TIFF because I have ***umed that when I
> circulate the basic image file among certain people that the best
> balance between image size and the best chance of them being able to
> see the file is a GIF.
>
> To me TIFF feels a bit specialized. For example, I never see a web
> page with TIFF images but I see lots of pages with GIFs.
>
> Also there seem to be various compression options for a TIFF (group 3
> or 4, LZW, JPEG deflate, none) which might makes it even harder for
> me to know what to choose as a common format! The Wikipedia says
> do***ents are often scanned to TIFF group 4 but is that something
> which has the best chance of being seen on various PCs in various
> organisations that I might need to send it to?



 
Old 05-30-2006   #5
..S
 
Default Re: Best common file format to use to create PDFs?

In article <Xns97D3AF497897B64A18E@127.0.0.1>,
Zak <duff@nomail.invalid> wrote:

>
> The do***ents are not written by me. They have been sent to me so they
> are in hard copy form and need scanning.


If the do***ents are in single-sheet form and can be fed thru a
sheet-feed scanner, the fairly new Fujitsu "ScanSnap" can automatically
produce PDF output (or other formats, at user's option).

It's a bit pricey (circa $400) but it's a pretty nice unit, small, fast,
easy to use, can do both sides at once, auto-select for B&W or color,
and so on.
 
Old 05-30-2006   #6
..nc.. .... ..o..
 
Default Re: Best common file format to use to create PDFs?

["Followup-To:" header set to comp.periphs.scanners.]
On Tue, 30 May 2006 16:30:29 +0100, Zak staggered into the Black Sun and
said:
>> "Zak" <duff@nomail.invalid> schreef in bericht

> On 26 May 2006, Nils<bla@bla.com> wrote:
>>> In order to preserve quality, avoid artefacts and interference
>>> patterns, would it be better to save to a large and detailed
>>> intermediate file like a BMP (or even a jpeg) or to save to a small
>>> lossless file like a GIF or TIF?

>> Afaik, GIF is not used as a graphical format inside PDF. It is
>> probably compressed as a "TIFF" with Cittfax or LZW compression.
>> "TIFF" is between brackets, because the embedded stream is also not a
>> complete TIFF file.
>>
>> What happens under the hood in your PDF creation really depends on
>> the PDF engine you're using. Many engines actually resize your
>> graphics to match the PDF DPI resolution. If you're an experienced
>> programmer you could try to generate the PDF yourself


Zak is obviously not a programmer, let alone an experienced one. Using
PDFlib from C isn't that difficult if you have some experience in C,
though.

>> The PDF specification is open and can be found on the Adobe website.


And creating a PDF using only the spec would take a bunch of experienced
programmers a while. The PDF spec is really, really complex. Its
complexity is one reason why PDFlib and ps2pdf and OpenOffice's "print
to PDF" functionality exist.

> I understand now that when I create a PDF from a image file that the
> format of the image file is not used inside the PDF. Instead some
> other format is used in the PDF (which Nils kindly suggests may be a
> specialized form of TIFF).


Using tiff2ps -> ps2pdf says that a grayscale TIFF ends up converted to
a stream object that can be decoded by the FlateDecode filter.
YPDFEngineMV, obviously.

> It is this conversion from my image file format to the internal PDF
> format which I want to be done smoothly. I am wondering if it is
> better to start with a GIF or a JPG or BMP or whatever to feed into my
> PDF creation utility.


Depends on what you want. Get a good scan, and convert it to
black-and-white if you can do that without losing important info;
that'll make the PDF smaller. JPEG may introduce artifacts, so you
probably don't want to use that. TIFF G4 and TIFF LZW are lossless, so
you may want to use those.

> I should say that I am starting with a hard copy of a do***ent created
> on a word processor.


Yuck. The original WordPerfect or whatever file would've been a much
better place to start from. PDFs with just text in them tend to be
smaller, display faster, and can look good at any zoom level. PDFs made
from images take a longer time to display, are larger, and look terrible
at high zoom levels.

> I have preferred to scan to a GIF file rather than a TIFF because I
> have ***umed that when I circulate the basic image file among certain
> people that the best balance between image size and the best chance of
> them being able to see the file is a GIF.


? You're creating a PDF, not distributing a series of image files.

> To me TIFF feels a bit specialized. For example, I never see a web
> page with TIFF images but I see lots of pages with GIFs.


This is because of Hysterical Raisins in the history of web browsers,
and because of Unisys being ***es. JPEG compresses better than TIFF-LZW
for lossy color images, and smaller images are preferred, especially when
you're on dialup. TIFF-LZW gives the best lossless compression for
color images, but TIFF-LZW is usually used where losslessness is more
important than file size (like in prepress.) Also, Unisys said they'd
sue anyone who made a TIFF-LZW compressor unless they paid Unisys a
license fee.[0] These things combined made it so that the earliest GUI
browsers didn't support viewing TIFFs, just JPEGs and GIFs. And this
has persisted to the present day... even though TIFF-G4 compresses
better than *anything* else, and does so losslessly, iff your image is
black-and-white.

> Also there seem to be various compression options for a TIFF (group 3
> or 4, LZW, JPEG deflate, none) which might makes it even harder for me
> to know what to choose as a common format! The Wikipedia says
> do***ents are often scanned to TIFF group 4 but is that something
> which has the best chance of being seen on various PCs in various
> organisations that I might need to send it to?


....what? If somebody can't figure out how to view a Group4 TIFF,
they're probably computer-illiterate. Anyway, aren't you making a PDF
here? It doesn't matter what the original image format was if it's been
PDFed. Acrobrat Reader can decode the image data within a PDF, as long
as the PDF library/PDF writer/whatever that created that PDF wasn't
smoking crack. Anyway, HTH,

[0] Fortunately, their patent (on a *mathematical method*!) expired a
couple of years ago, so all the Free stuff can write LZW now, which is a
win for everybody.

--
Matt G|There is no Darkness in eternity/But only Light too dim for us to see
Screw up your courage! You've screwed up everything else.
 

Thread Tools
Display Modes





Powered by vBulletin®
Copyright ©2000 - 2009, Jelsoft Enterprises Ltd.
Search Engine Friendly URLs by vBSEO 3.3.0