Batch Extract XMP from PDF to XML

Here is a requirement that want to batch dump xmp from PDF to xml file or Database,

I’d like to know if you have developed or if you can develop an application for extracting customize XMP from PDF documents.

I’ll try to be more relevant: I customized a specific card for additional metadata in Acrobat Professional. If I save the xmp properties in xml format, I obtain the value that I inserted, after that I import xml file in Database. I’d like to know if is possible to develop an application that can extract xmp customized value from a group of PDF files.

And what is XMP?

Adobe’s Extensible Metadata Platform (XMP) is a labeling technology that allows you to embed data about a file, known as metadata, into the file itself. With XMP, desktop applications and back-end publishing systems gain a common method for capturing, sharing, and leveraging this valuable metadata — opening the door for more efficient job processing, workflow automation, and rights management, among many other possibilities. With XMP, Adobe has taken the “heavy lifting” out of metadata integration, offering content creators an easy way to embed meaningful information about their projects and providing industry partners with standards-based building blocks to develop optimized workflow solutions.

Finally, I used iTextSharp(of course iText also ok) to batch extract XMP from PDF, and save it to XML.

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.