Hi all!
I’m using Informatica DT with PDF source.
While I was loading the file in the DT studio , using a PDFToTxt_4 preprocessor, the textual output shows PDF tables with columns that are not aligned, making it very difficult to parse it .
Is there a way to fix it?
DT offers an internal tool in order to deal with such a use case. Using this tool, you can define tables and columns in the PDF document, and the tool will produce well aligned PDF tables, which will simplify further parsing. In DT studio under preprocessor you can click the value option, then new screen will open with text editor like in Excel, there you can align the rows according to columns. I have attached the screen shot of this option in UDT.

