» Archive for June, 2008

Batch Extract XMP from PDF to XML

Saturday, June 28th, 2008 by rubypdf

Here is a requirement that want to batch dump xmp from PDF to xml file or Database,

I’d like to know if you have developed or if you can develop an application for extracting customize XMP from PDF documents.

I’ll try to be more relevant: I customized a specific card for additional metadata in Acrobat Professional. If I save the xmp properties in xml format, I obtain the value that I inserted, after that I import xml file in Database. I’d like to know if is possible to develop an application that can extract xmp customized value from a group of PDF files.

And what is XMP?

Adobe’s Extensible Metadata Platform (XMP) is a labeling technology that allows you to embed data about a file, known as metadata, into the file itself. With XMP, desktop applications and back-end publishing systems gain a common method for capturing, sharing, and leveraging this valuable metadata — opening the door for more efficient job processing, workflow automation, and rights management, among many other possibilities. With XMP, Adobe has taken the “heavy lifting” out of metadata integration, offering content creators an easy way to embed meaningful information about their projects and providing industry partners with standards-based building blocks to develop optimized workflow solutions.

Finally, I used iTextSharp(of course iText also ok) to batch extract XMP from PDF, and save it to XML.

Found super fast tools to divide A4 to 2 A5 pages

Saturday, June 28th, 2008 by rubypdf

It is just a article talks about my two softwares posted by JimmyZou on http://www.mobileread.com/forums/archive/index.php/t-10515.html

After google a lot, I find super fast tool to cut A4 pdf into double pages A5 file.

I did one, and attach it behide, you guys can check it out.

Using DOS command to do it, and it done very fast, I split the 210K size pdf in 2 seconds, and the outcome file is 240K only, very effective!

You can find the software here:
http://www.rubypdf.com/

There are two softwares needed: PDFRotate and PDFDivide

First copy all software and the PDFs into one directory, the 2 simple steps:
1.First use PDFRoate
\PDFRoate 1-A4.pdf 1-A4-90.pdf 90
Rotate pdf 90 degree first, prepare to divide it.
2.USE PDFDivide
\Divide 1-A4-90.PDF 1-2A5.PDF

The only none-beatiful thing is that it cut directly, so in some pages there are 1 line letters been cut into 2 parts.

But anyway, it’s nice and fast.

A Requirement about Document File Viewer

Friday, June 27th, 2008 by rubypdf

I think it is a interesting requirement about PDF and ebook, so I quote it here,

Many computer users have thousands of digital photos (aka pictures, images) and so, they need a tool like ACDSee or IrfanView or XnView, this is, a Photo/Picture/Image quick/rapid viewer/browser/explorer <=> Rather than manually click on each picture to launch a new instance of a viewer (what would be thousands of loads), this tools allow an extremely rapid, easy and comfortable viewing, sparing a lot of clicks, time consumed, CPU/RAM loads.

My project is not about photos, it is about eBooks/eArticles… I have thousands and thousands of PDF, CHM, DJVU, RGO, PDB files, and I need to see them rapidly (to delete the junk, and store the good ones). Well, I cannot imagine myself loading on Adobe Reader 10000 times, nor Microsoft hh.exe (Windows CHM Viewer) nor WinDjView (DJVU Viewer) nor RepliGo nor iSilo… Thus, I need something like ACDSee/IrfanView/XnView but for PDF, CHM, DJVU, RGO, PDB; rather than photos (such as JPG, GIF, TIFF, BMP)…

My idea is NOT to build an app “from zero”/”from scratch”; my idea is to build an easy/simple app which acts as a “front-end”/”wrapper” to launch automatically instances of already-built viewers. So, I attach the portable versions of the viewers:

      Foxit PDF Reader(as PDF viewer)
      WinDjView (as DJVU viewer)
      xCHM (as CHM viewer)
      RepliGo Viewer (as RGO viewer)
      iSilo (as PDB viewer)

It is bid on RAC, and if you have interesting, you can try it, Document File Viewer