(→‎Choose a name and create your article: added some words to this section)
(→‎Prepare for migration: added some text here)
Line 29: Line 29:


===Prepare for migration===
===Prepare for migration===
This can be a simple matter of opening the source document at the same time as keeping the destination window available.  That may mean using two browser windows, or you may prefer to download the PDF document to your personal machine and open the doc without the browser.  Another alternative may be to work offline by transfering the PDF textual content to a text file or other word processing format.  (This can be a good approach if you have a slow or somehow flaky link.  There's less chance of losing a lot of work!)  In that case, you wouldn't have any browsers open.


===Transfer the text first===
===Transfer the text first===

Revision as of 06:41, 30 November 2006

Adobe's PDF (Portable Document Format) was designed to provide an application and OS independent format that could be viewed freely (in both the "no cost" and "no constraint" senses) by anyone. The encoding of PDF documents does not always lend itself to easy migration to the "wiki text" format (or even to other formats). The steps below are offered as a relatively straightforward, low stress process to approach the porting process.

The steps below assume that you have downloaded a recent version of Adobe Acrobat, which includes some features that the earlier versions did not include.

Please also review Help:How to port pages to be sure that you handle copyright permissions and Appropedia conventions correctly.

Steps in porting PDF to Wikitext

  1. Choose a reasonable source document
  2. Choose a name and create your article
  3. Prepare for migration
  4. Migrate the text and format it
  5. Extract the images
  6. Upload and embed the images
  7. Do a final compare between the article and the original content
  8. Update the templates and categorization to show completion

Choosing a reasonable source document

Before you begin the porting process, be sure that your wikitext skills are compatible with the document content that you wish to port. If you have a collection of documents that are available for porting (as often happens), begin by porting some simpler documents. Content features that require a bit of experience are listed below, along with a link to the WikiMedia help for the content:

  1. Tables. See Wikimedia:Help:table
  2. Images. See Wikipedia:Wikipedia:Extended image syntax
  3. Web links. See Wikimedia:Help:Link
  4. Footnotes. See Wikimedia:Help:Footnotes

Once you're comfortable with the wiki formatting side of things, the tricky bit is often in extracting the content from the PDF document. The best approach is to be patient, break it down in to phases, and move through it step by step. You can port a fairly simple 5 page PDF document in 15 minutes, but complexity will add time; 5 pages could be an hour or more if there are lots of tables, footnotes, images and links.

Choose a name and create your article

Choosing a name is usually just a matter of taking the title of the original work, then making sure it's unique at Appropedia. Enter the name in the search box at left and click "Go". If the title is already used, or if for some reason it is not a clear reference to the subject, you'll need to modify the name. Or it may be that all articles ported from a particular source will get a certain tag in their names. In any case, choose a reasonable name. There is no perfect name, and the article can be moved to a different name later if necessary.

When you have entered a unique name and clicked "Go", the screen will say There is no page titled "XYZ". You can create this page. Click on the "create this page" to begin editing the article.

Prepare for migration

This can be a simple matter of opening the source document at the same time as keeping the destination window available. That may mean using two browser windows, or you may prefer to download the PDF document to your personal machine and open the doc without the browser. Another alternative may be to work offline by transfering the PDF textual content to a text file or other word processing format. (This can be a good approach if you have a slow or somehow flaky link. There's less chance of losing a lot of work!) In that case, you wouldn't have any browsers open.

Transfer the text first

Transfering the text is pretty straightforward, but there are pitfalls. The basic approach is to copy, paste. Should be obvious, but there can be surprises. You can copy/paste directly into an Appropedia edit window, or into an intermediate document (such as Word) if you prefer.

Selecting

In Acrobat, to mark a section of text you must first "pick" (click on) the "select text" tool (this step is not always required in other applications). With the text-select tool chosen, select the text by clicking the mouse at the beginning of a text block (say, a paragraph) and dragging your mouse to the end of the text block. This should be obvious, but frequently PDF documents are produced with complex layouts (multiple columns, embedded text boxes and images, etc) that can result in strange text sequences, so it's best to keep alert for strange selection behavior, since you may not get what you expect.

Copy and paste

Once the text is selected, use "copy" function (there are many ways to invoke this function) to put the selected text on a "clipboard", then use the "paste" function to put the text into your destination. As soon as you have pasted, it's a good practice to check a few places in the destination to see that you actually transferred what you expected to. If not, you can usually use the "undo" feature to back up and try again.

Format the text

Removing "new lines"

Formating tables

Footnotes

Transfer the images

Extracting images from PDF documents

Loading images into Appropedia

Embedding images into the article

See also

Cookies help us deliver our services. By using our services, you agree to our use of cookies.