The general concept is to use existing Python libraries to parse a PDF into text, image, layout, and style data, and then use that data to rebuild an IDML file that InDesign can read and edit. IDML (InDesign Markup Language) files are ZIP archives (Adobe calls them packages) that essentially store XML files. Adobe did a decent job here, because those files can completely express the content of the native (binary) documents. That makes IDML a promising target format: other formats (PDF, PPT, EPUB) could be translated into it and then modified easily. Within this general concept, there are two roads to take. The biggest difference between them is that PDF2DTP parses the text into paragraphs, while PDF2ID parses the text into single lines.
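Because an IDML package is a ZIP archive of XML files, it can be inspected with nothing more than the Python standard library. The sketch below is only an illustration: it builds a tiny, simplified package in memory (not a document InDesign would open), and the helper names `build_sample_package`, `list_xml_parts`, and `story_text` are made up for this example. Real story files wrap their content in more elements than shown here.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def build_sample_package() -> bytes:
    """Build a tiny IDML-like ZIP in memory (simplified, for illustration only)."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        # A real package also carries spreads, resources, and more.
        zf.writestr("mimetype", "application/vnd.adobe.indesign-idml-package")
        zf.writestr("designmap.xml", "<Document/>")
        zf.writestr(
            "Stories/Story_u1.xml",
            "<Story><Content>Hello IDML</Content></Story>",
        )
    return buf.getvalue()


def list_xml_parts(package_bytes: bytes) -> list[str]:
    """Return the names of all XML files stored in the package."""
    with zipfile.ZipFile(io.BytesIO(package_bytes)) as zf:
        return sorted(name for name in zf.namelist() if name.endswith(".xml"))


def story_text(package_bytes: bytes, story_name: str) -> list[str]:
    """Return the text of every <Content> element in one story file."""
    with zipfile.ZipFile(io.BytesIO(package_bytes)) as zf:
        root = ET.fromstring(zf.read(story_name))
    return [element.text or "" for element in root.iter("Content")]


if __name__ == "__main__":
    package = build_sample_package()
    for name in list_xml_parts(package):
        print(name)
    print(story_text(package, "Stories/Story_u1.xml"))
```

To look inside a real file, the same `zipfile.ZipFile` call can be pointed at the path of an `.idml` file exported from InDesign instead of the in-memory bytes.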