HELP: Convert PDF to HTML

Hello all Programmers!

I want to write a software myself that can Convert PDF file's content to HTML. But I don't know where to start:

Is there any valid ActiveX for this

Any exist Function that can extract PDF contents and add it to an HTML file

Or anything else that a newbie like me don't know

So wonderful if I can receive any reply of yours... Thanks a lot!

So many things to learn in this world, hihi : - )




Answer this question

HELP: Convert PDF to HTML

  • jumpKnot0

    I have surfed through the Internet, and I found a proper solution for my app, that is: Convert PDF file to Image file.

    I think this way can give a better result than my first idea. Cos' when I used a professional software that convert PDF to HTML, I got a fail result when that program convert a table in a PDF file to HTML: the text in each cell does not stay in its right place :( And I think there will be no error like that when I convert PDF content to Image (at least, as I think, cos I only just a newbie in this subject,hihi : - p )

    Now I have the source code for my 2nd idea, and I'm downloading the Acrobat SDK to use it in .NET.

    Here is the link to that relative topic: http://www.developerfusion.co.uk/show/5091/2/

    It is about Creating a thumbnail of a PDF file, but I think I can use it, too... just a little change in the code, maybe...

    But I still want many many helps from our friendly public.

    Please tell me what u think about my new solution. Is it really a better way Oops, 'bout me, i think yes...



  • kuntushi

    Thank-you :-) We appreciate your valuable feedback and certainly hope you gain benefit from this place. We do try our best and it makes us feel great that you are finding it useful/to your benefit. That's all that matters

  • TwoSixTwo

    you need to use some PDF SDK that maybe available then generate html from it in some way. in .NET there is no direct way since PDF's are not a Microsoft product really but Adobe as well as having no relevance to the .NET Framework itself.

    I believe there are a couple of open source PDF projects you can look at and use and customize the way you want to use the application to your needs

    maybe take a look at this:

    http://www.codeproject.com/csharp/MgPDFReader.asp

    commercial/SDK:

    http://www.pdfonline.com/

    im sure there are more somewhere



  • rp1933

    Thank you very very much!

    I'm following your instructions! Hope to see the light... hihi!!!

    I have only just posted this subject and I can't believe that there will be a reply as fast as this!

    I feel really surprise and happy, cause this is the first time I post a question to MSDN forum. Thanks you and MSDN!

    I believe that I can receive more n more helps in this friendly Forum!



  • HELP: Convert PDF to HTML