Extract text and images from pdf file

Hi guys, can you guide me to tutorials, examples and scripts where I
can learn how to extract text, images, etc from pdf file.

Thanks
Undbund
undbund [ Mo, 31 März 2008 08:42 ] [ ID #1933044 ]

Re: Extract text and images from pdf file

On Mar 31, 3:42 am, undbund <undb... [at] gmail.com> wrote:
> Hi guys, can you guide me to tutorials, examples and scripts where I
> can learn how to extract text, images, etc from pdf file.
>
> Thanks
> Undbund
start here
http://www.php.net/pdf
SrSilveira [ Mo, 31 März 2008 19:32 ] [ ID #1933066 ]

Re: Extract text and images from pdf file

SrSilveira <srsilveira [at] gmail.com> wrote:
>On Mar 31, 3:42 am, undbund <undb... [at] gmail.com> wrote:
>
>> Hi guys, can you guide me to tutorials, examples and scripts where I
>> can learn how to extract text, images, etc from pdf file.
>
>start here
>http://www.php.net/pdf

That's an interesting suggestion, but it doesn't do anything to solve his
problem. The PDF functions are used to CREATE PDFs, but they don't do
anything about READING PDFs.

To extract stuff from a PDF file, you need a PDF rendering library. I'm
not aware of any PHP packages that do that (although I'm sure someone will
correct me), but you might look into xpdf or poppler.
--
Tim Roberts, timr [at] probo.com
Providenza & Boekelheide, Inc.
Tim Roberts [ Di, 01 April 2008 07:48 ] [ ID #1933072 ]

Re: Extract text and images from pdf file

On Apr 1, 5:48 pm, Tim Roberts <t... [at] probo.com> wrote:
> SrSilveira <srsilve... [at] gmail.com> wrote:
> >On Mar 31, 3:42 am, undbund <undb... [at] gmail.com> wrote:
>
> >> Hi guys, can you guide me to tutorials, examples and scripts where I
> >> can learn how to extract text, images, etc from pdf file.
>
> >start here
> >http://www.php.net/pdf
>
> That's an interesting suggestion, but it doesn't do anything to solve his
> problem. The PDF functions are used to CREATE PDFs, but they don't do
> anything about READING PDFs.
>
> To extract stuff from a PDF file, you need a PDF rendering library. I'm
> not aware of any PHP packages that do that (although I'm sure someone will
> correct me), but you might look into xpdf or poppler.
> --
> Tim Roberts, t... [at] probo.com
> Providenza & Boekelheide, Inc.

I have looked over for such libraries, but they cost too much and I
found non for PHP. Can this be done in any other programming language?

Thanks for all your replies
undbund [ Mi, 02 April 2008 02:35 ] [ ID #1933993 ]

Re: Extract text and images from pdf file

undbund wrote:
> On Apr 1, 5:48 pm, Tim Roberts <t... [at] probo.com> wrote:
>> SrSilveira <srsilve... [at] gmail.com> wrote:
>>> On Mar 31, 3:42 am, undbund <undb... [at] gmail.com> wrote:
>>>> Hi guys, can you guide me to tutorials, examples and scripts where I
>>>> can learn how to extract text, images, etc from pdf file.
>>> start here
>>> http://www.php.net/pdf
>> That's an interesting suggestion, but it doesn't do anything to solve his
>> problem. The PDF functions are used to CREATE PDFs, but they don't do
>> anything about READING PDFs.
>>
>> To extract stuff from a PDF file, you need a PDF rendering library. I'm
>> not aware of any PHP packages that do that (although I'm sure someone will
>> correct me), but you might look into xpdf or poppler.
>> --
>> Tim Roberts, t... [at] probo.com
>> Providenza & Boekelheide, Inc.
>
> I have looked over for such libraries, but they cost too much and I
> found non for PHP. Can this be done in any other programming language?
>
> Thanks for all your replies
>

Who knows? Ask in another language newsgroup.

But I don't know of anything for PHP.

--
==================
Remove the "x" from my email address
Jerry Stuckle
JDS Computer Training Corp.
jstucklex [at] attglobal.net
==================
Jerry Stuckle [ Mi, 02 April 2008 03:47 ] [ ID #1933994 ]

Re: Extract text and images from pdf file

"undbund" <undbund [at] gmail.com> wrote in message
news:9dba2711-a98a-4a52-be0e-049399bb4e9d [at] s37g2000prg.google groups.com...
> On Apr 1, 5:48 pm, Tim Roberts <t... [at] probo.com> wrote:
>> SrSilveira <srsilve... [at] gmail.com> wrote:
>> >On Mar 31, 3:42 am, undbund <undb... [at] gmail.com> wrote:
>>
>> >> Hi guys, can you guide me to tutorials, examples and scripts
>> >> where I
>> >> can learn how to extract text, images, etc from pdf file.
>>
>> >start here
>> >http://www.php.net/pdf
>>
>> That's an interesting suggestion, but it doesn't do anything to
>> solve his
>> problem. The PDF functions are used to CREATE PDFs, but they don't
>> do
>> anything about READING PDFs.
>>
>> To extract stuff from a PDF file, you need a PDF rendering library.
>> I'm
>> not aware of any PHP packages that do that (although I'm sure
>> someone will
>> correct me), but you might look into xpdf or poppler.
>> --
>> Tim Roberts, t... [at] probo.com
>> Providenza & Boekelheide, Inc.
>
> I have looked over for such libraries, but they cost too much and I
> found non for PHP. Can this be done in any other programming
> language?
>
> Thanks for all your replies

Hi,

have a look at Perl and the PDF::Reuse module.

R.
Csaba [ Mi, 02 April 2008 03:54 ] [ ID #1934004 ]
PHP » comp.lang.php » Extract text and images from pdf file

Vorheriges Thema: MySQL -> PHP Encoding / Decoding problem
Nächstes Thema: PHP/Apache setup for localhost