PDF Optimization Guide | Purdue | 5.16.13 PDF Optimization Guide Intrapromote LLC / 866.570.1785 / www.intrapromote.com PDF Optimization Guide | Purdue | 5.16.13 PDF Optimization SEO for PDF Files The following is a guide to help optimize PDF files located on Purdue's web assets. Making a crawlable PDF When making a PDF, it is best to use a text program, such as Microsoft Word or Adobe PageMaker. This creates a crawlable text file. On the other hand, steer clear of using a program like Photoshop, because you actually end up creating an image file. Additionally, if your PDF has images, we recommend creating captions for those images so that they too are crawlable. Directions for setting up PDF Files for SEO The big three search engines are all capable of crawling PDF's in search of relevant information. The following fields are critical to making your PDF documents discoverable. 1. When in Adobe Acrobat, go to File > Properties 2. In the popup, you will see Title, Author, and Subject fields. 1. In the Title, insert your keyword title, as you would with a web page. 2. For Author, insert the document author's name. 3. For Subject, incorporate 1-2 keyword-rich sentences, clearly explaining the article, but be careful not to keyword stuff. 3. Finally, be sure to save the PDF with a name including a keyword that would make it relevant in a search. (see next page) Intrapromote LLC / 866.570.1785 / www.intrapromote.com PDF Optimization Guide | Purdue | 5.16.13 After setting up a PDF file using the aforementioned instructions, your PDF will look like the following to users in a Google search results page: Regarding this example, “The Anatomy of a Search Engine” is the Title, “S Brin” is the Author, and the text beginning “In this paper...” is the description entered into the Subject field. Additional, more in depth instruction PDF optimization is similar to optimization for a regular content page. We recommend good use of keyword phrases, appropriate headings and solid content that reads well. If the PDF includes images, include a targeted keyword phrase in the caption underneath each image. How is the PDF is created? Don’t use Photoshop to make your PDF, because when you do that, you’re actually making a big image file, not a true PDF and search engine spiders cannot crawl or "read" the text from that image file. The PDF should be created with a text-based program, like MS Word or Adobe PageMaker, so that the final product is text-based and can be crawled. Intrapromote LLC / 866.570.1785 / www.intrapromote.com PDF Optimization Guide | Purdue | 5.16.13 PDFs can reside anywhere on the site, but the same rule about search engine spiders not being likely to crawl content that’s too deep applies. The safest thing to do is to put it as close to the root directory as possible. When publishing a PDF on your site, you should very visibly link to the PDF from the home page, or from some page that gets crawled regularly. You have to lead the crawler along so it finds the new content as quickly as possible. Use keywords when naming the files, such as "keyword-phrase.pdf". The title of the document or page could be considered as the name for the pdf. Complete the document properties. The Title property represents the words that will be displayed as the heading of the search result. It’s the equivalent of the html title tag. If you don’t complete the Title property, the search engine is going to generate a title from the PDF’s content. There are other document properties (meta data) you can supply, including Author, Subject, and Keywords. Optimize the copy. Copy in text-based PDFs is no different than web-page copy. Build links into PDFs. Make sure you include links in your PDFs and pay attention to the anchor text used. Search engines do recognize these links. Pay attention to the version. While search engines do "read" and index PDFs, search engines’ capabilities tend to lag new versions of Acrobat. Save your PDFs as version 1.6 (Acrobat 7) or lower to ensure search engines can index the content. Optimize the file size for search. Don’t post a huge PDF for download. Not only is this annoying and unnecessary for site visitors, it’s also burdensome for the search engines. If it’s too big, the search engines may abandon the PDF before even getting access to its content. Using the full version of Acrobat, select Advanced > PDF Optimizer to "right-size" the document. Another option is to enable the "Optimize for Fast Web View" option in the Preferences > General Settings panel. This allows the PDF to be "loaded" a page at a time, rather than waiting for the whole PDF to download. Pay attention to placement. Don't bury links to PDFs deep within your site’s file structure, they’re less likely to get indexed. For higher ranking search results, links to PDFs should be on web pages closer to the root level of the site’s file structure. Influence meta descriptions for PDFs. For web pages, the meta description is what is displayed under the title in a search result. With PDFs, the search engines search the copy of the PDF and select something to display. With PDFs you have less control of what is displayed as the description to the search result, but you can still influence this. The best way to do this is to Intrapromote LLC / 866.570.1785 / www.intrapromote.com PDF Optimization Guide | Purdue | 5.16.13 make sure that you have a good, optimized sentence or two near the start of your PDF. If these sentences correspond to the search term used, it’s likely that these sentences are the ones that will be displayed as the description under the search result’s heading. Specify the reading order. Search engines search the copy of the PDF and select something to display as a description under the search result’s heading. Depending on how the reading order of your PDF is specified, this may lead the search engine to select some pretty strange stuff to display. Every PDF has a reading order. Similar to properly optimized web pages, you want to make sure that valuable content is read first. How to determine the reading order? With the PDF open and while using the full version of Acrobat, select Advanced > Accessibility > Add Tags to Document. Then select Advanced > Accessibility > Touch Up Reading Order. Then, the reading order of the PDF will be displayed. Add tags to your PDFs, similar to html tags. With the PDF open and while using the full version of Acrobat, select Advanced > Accessibility > Add Tags to Document. Acrobat will give you a document report and recommend things you may want to consider changing. You’ll have the ability to tag headings, alternate text for images, etc. Every time you open a PDF, make even a small change, and save it once again, major unseen things may change. The reading order may change automatically. You may inadvertently save it as a higher version. It may get saved using the default size setting instead of a properly optimized size. If you’re going to further optimize existing PDFs, make sure you check all of these things before posting a new version of the PDF. Intrapromote LLC / 866.570.1785 / www.intrapromote.com