March 31, 2004

Parsing Blind

I needed to parse some PDF's yesterday and today and found the PJ library which is seemingly doing a good job. Unfortunately, the docs assume you have some idea of the inner workings of PDF which I don't. Adobe has put the details behind their developer program and this isn't important enough to shell out some money to join.

So, basically, I had to start walking through the parsed page and see if I could find the form variables I needed. It is kind of silly though to have to print out all of the name-value pairs of a Map in order to find the one you are looking for. Hopefully it will be the same across all of the PDF's I need to parse.

Posted by Josh at March 31, 2004 10:40 AM | TrackBack
Comments
Post a comment