<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=us-ascii"><meta name=Generator content="Microsoft Word 12 (filtered medium)"><!--[if !mso]><style>v\:* {behavior:url(#default#VML);}
o\:* {behavior:url(#default#VML);}
w\:* {behavior:url(#default#VML);}
.shape {behavior:url(#default#VML);}
</style><![endif]--><style><!--
/* Font Definitions */
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:Tahoma;
        panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0cm;
        margin-bottom:.0001pt;
        font-size:11.0pt;
        font-family:"Calibri","sans-serif";}
h3
        {mso-style-priority:9;
        mso-style-link:"Heading 3 Char";
        mso-margin-top-alt:auto;
        margin-right:0cm;
        mso-margin-bottom-alt:auto;
        margin-left:0cm;
        font-size:13.5pt;
        font-family:"Times New Roman","serif";
        font-weight:bold;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:blue;
        text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
        {mso-style-priority:99;
        color:purple;
        text-decoration:underline;}
p
        {mso-style-priority:99;
        mso-margin-top-alt:auto;
        margin-right:0cm;
        mso-margin-bottom-alt:auto;
        margin-left:0cm;
        font-size:12.0pt;
        font-family:"Times New Roman","serif";}
code
        {mso-style-priority:99;
        font-family:"Courier New";}
pre
        {mso-style-priority:99;
        mso-style-link:"HTML Preformatted Char";
        margin:0cm;
        margin-bottom:.0001pt;
        font-size:10.0pt;
        font-family:"Courier New";}
p.MsoAcetate, li.MsoAcetate, div.MsoAcetate
        {mso-style-priority:99;
        mso-style-link:"Balloon Text Char";
        margin:0cm;
        margin-bottom:.0001pt;
        font-size:8.0pt;
        font-family:"Tahoma","sans-serif";}
span.EmailStyle17
        {mso-style-type:personal-compose;
        font-family:"Calibri","sans-serif";
        color:windowtext;}
span.BalloonTextChar
        {mso-style-name:"Balloon Text Char";
        mso-style-priority:99;
        mso-style-link:"Balloon Text";
        font-family:"Tahoma","sans-serif";}
span.Heading3Char
        {mso-style-name:"Heading 3 Char";
        mso-style-priority:9;
        mso-style-link:"Heading 3";
        font-family:"Times New Roman","serif";
        font-weight:bold;}
span.HTMLPreformattedChar
        {mso-style-name:"HTML Preformatted Char";
        mso-style-priority:99;
        mso-style-link:"HTML Preformatted";
        font-family:"Courier New";}
.MsoChpDefault
        {mso-style-type:export-only;}
@page WordSection1
        {size:612.0pt 792.0pt;
        margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
        {page:WordSection1;}
/* List Definitions */
@list l0
        {mso-list-id:876624915;
        mso-list-template-ids:-448762310;}
@list l0:level1
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7;
        mso-level-tab-stop:36.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
@list l1
        {mso-list-id:1529954445;
        mso-list-template-ids:970336368;}
@list l1:level1
        {mso-level-number-format:bullet;
        mso-level-text:\F0B7;
        mso-level-tab-stop:36.0pt;
        mso-level-number-position:left;
        text-indent:-18.0pt;
        mso-ansi-font-size:10.0pt;
        font-family:Symbol;}
ol
        {margin-bottom:0cm;}
ul
        {margin-bottom:0cm;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="2050" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]--></head><body lang=EN-ZA link=blue vlink=purple><div class=WordSection1><p class=MsoNormal>Setting up sitemaps in DSpace: <a href="http://www.dspace.org/1_6_1Documentation/ch03.html">http://www.dspace.org/1_6_1Documentation/ch03.html</a> <o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><h3>3.4.5. <a name=docbook-install.html-sitemaps></a>Google and HTML sitemaps<o:p></o:p></h3><p>To help web crawlers index the content within your repository, you can make use of sitemaps. There are currently two forms of sitemaps included in DSpace: Google sitemaps and HTML sitemaps.<o:p></o:p></p><p>Sitemaps allow DSpace to expose its content without crawlers having to index every page. HTML sitemaps provide a list of all items, collections and communities in HTML format, whilst Google sitemaps provide the same information in gzipped XML format.<o:p></o:p></p><p>To generate the sitemaps, run <code><span style='font-size:10.0pt'>[dspace]/bin/generate-sitemaps</span></code>. This creates the sitemaps in <code><span style='font-size:10.0pt'>[dspace]/sitemaps/</span></code>.<o:p></o:p></p><p>The Google sitemaps can be accessed from the following URLs:<o:p></o:p></p><p style='margin-left:36.0pt;text-indent:-18.0pt;mso-list:l1 level1 lfo1'><![if !supportLists]><span style='font-size:10.0pt;font-family:Symbol'><span style='mso-list:Ignore'>·<span style='font:7.0pt "Times New Roman"'>         </span></span></span><![endif]>http://dspace.example.com/dspace/sitemap - Index sitemap<o:p></o:p></p><p style='margin-left:36.0pt;text-indent:-18.0pt;mso-list:l1 level1 lfo1'><![if !supportLists]><span style='font-size:10.0pt;font-family:Symbol'><span style='mso-list:Ignore'>·<span style='font:7.0pt "Times New Roman"'>         </span></span></span><![endif]>http://dspace.example.com/dspace/sitemap?map=0 - First list of items (up to 50,000)<o:p></o:p></p><p style='margin-left:36.0pt;text-indent:-18.0pt;mso-list:l1 level1 lfo1'><![if !supportLists]><span 
style='font-size:10.0pt;font-family:Symbol'><span style='mso-list:Ignore'>·<span style='font:7.0pt "Times New Roman"'>         </span></span></span><![endif]>http://dspace.example.com/dspace/sitemap?map=n - Subsequent lists of items (e.g. 50,001 to 100,000)<o:p></o:p></p><p>HTML sitemaps follow the same URL scheme: <o:p></o:p></p><p style='margin-left:36.0pt;text-indent:-18.0pt;mso-list:l0 level1 lfo2'><![if !supportLists]><span style='font-size:10.0pt;font-family:Symbol'><span style='mso-list:Ignore'>·<span style='font:7.0pt "Times New Roman"'>         </span></span></span><![endif]>http://dspace.example.com/dspace/htmlmap - Index sitemap<o:p></o:p></p><p style='margin-left:36.0pt;text-indent:-18.0pt;mso-list:l0 level1 lfo2'><![if !supportLists]><span style='font-size:10.0pt;font-family:Symbol'><span style='mso-list:Ignore'>·<span style='font:7.0pt "Times New Roman"'>         </span></span></span><![endif]>etc...<o:p></o:p></p><p>Each time it runs, <code><span style='font-size:10.0pt'>[dspace]/bin/generate-sitemaps</span></code> informs Google that the sitemaps have been updated. For this update to register correctly, you must first register your Google sitemap index page (<code><span style='font-size:10.0pt'>/dspace/sitemap</span></code>) with Google at <a href="http://www.google.com/webmasters/sitemaps/" target="_top">http://www.google.com/webmasters/sitemaps/</a>. 
If your DSpace server requires the use of an HTTP proxy to connect to the Internet, ensure that you have set <code><span style='font-size:10.0pt'>http.proxy.host</span></code> and <code><span style='font-size:10.0pt'>http.proxy.port</span></code> in <code><span style='font-size:10.0pt'>[dspace]/config/dspace.cfg</span></code>.<o:p></o:p></p><p>The URL for pinging Google and, in future, other search engines is configured in <code><span style='font-size:10.0pt'>[dspace]/config/dspace.cfg</span></code> using the <code><span style='font-size:10.0pt'>sitemap.engineurls</span></code> setting, where you can provide a comma-separated list of URLs to 'ping'.<o:p></o:p></p><p>You can generate the sitemaps automatically every day using an additional cron job:<o:p></o:p></p><pre># Generate sitemaps every day at 06:00<o:p></o:p></pre><pre>0 6 * * * [dspace]/bin/generate-sitemaps<o:p></o:p></pre><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal><b>Ina Smith<o:p></o:p></b></p><p class=MsoNormal style='line-height:115%'><span lang=EN-GB style='font-size:8.0pt;line-height:115%;color:black'>E-Research Repository Manager | </span><span style='font-size:8.0pt;line-height:115%;color:black'>Library and Information Service | University of Stellenbosch | Private Bag X5036, 7599 | South Africa<o:p></o:p></span></p><p class=MsoNormal style='line-height:115%'><b><span style='font-size:8.0pt;line-height:115%;color:red'>http://scholar.sun.ac.za</span></b><span style='font-size:8.0pt;line-height:115%;color:black'> | </span><b><span style='font-size:8.0pt;line-height:115%;color:red'>http://oa.sun.ac.za</span></b><span style='font-size:8.0pt;line-height:115%;color:black'> </span><span style='font-size:8.0pt;line-height:115%;color:#002060'>| </span><span style='font-size:8.0pt;line-height:115%;color:black'>E-mail: </span><b><span style='font-size:8.0pt;line-height:115%;color:#002060'><a 
href="mailto:ismith@sun.ac.za"><span style='line-height:115%;color:#002060'>ismith@sun.ac.za</span></a> </span></b><span style='font-size:8.0pt;line-height:115%;color:#002060'>|<b> </b></span><span style='font-size:8.0pt;line-height:115%;color:black'>Tel:  +27 21 808 9139 | </span><b><span style='font-size:8.0pt;line-height:115%;color:#002060'>Skype: smith.ina </span></b><span style='font-size:8.0pt;line-height:115%;color:#002060'>| </span><span style='font-size:8.0pt;line-height:115%;color:black'>Office hours: Mo-Fr: 08h00-16h30</span><span style='font-size:8.0pt;line-height:115%;color:black'><o:p></o:p></span></p></div></body></html>