m (fmt) |
('''This page is for exploring how to speed this process.) |
||
Line 1: | Line 1: | ||
'''This page is for exploring how to speed this process of porting PDF documents. The main help page (which needs updating) is at: [[Help:Porting content from PDF format]].''' | |||
Acrobat has an "export as text" function which makes this easier - save to RTF or Word format. If you open this in OpenOffice v 2.3 or greater, you should be able to export it as MediaWiki format. (Does this work smoothly?) | Acrobat has an "export as text" function which makes this easier - save to RTF or Word format. If you open this in OpenOffice v 2.3 or greater, you should be able to export it as MediaWiki format. (Does this work smoothly?) | ||
Revision as of 18:03, 3 November 2007
This page is for exploring how to speed this process of porting PDF documents. The main help page (which needs updating) is at: Help:Porting content from PDF format.
Acrobat has an "export as text" function which makes this easier - save to RTF or Word format. If you open this in OpenOffice v 2.3 or greater, you should be able to export it as MediaWiki format. (Does this work smoothly?)
Useful links:
- Export PDFs as (formated) text - help.adobe.com. Note: "Images in the PDF are saved by default in JPEG format." Is their location saved? Is there a way of smoothing the process of locating the image in the document?
- User talk:LeissKG - discussion on alternative techniques and issues in porting PDFs - OCR, text export.
Images
Are images saved automatically during file export? It appears so for exporting to HTML/XML, at least. Do any of the formats include tags to indicate image location?