View Single Post
  #4  
Old 02-15-2007, 07:45 PM
Connie
Guest
 
Posts: n/a
#3 how to control Bots with Meta-Tags

last but not least, these tags

Mother Wikipedia says: Meta elements are HTML elements used to provide structured metadata about a web page. Such elements are placed as tags in the head section of an HTML document.

They are helpful to stop robots and crawlers, at least the good behaving ones, as they are part of Web-Standard.

For Pixelpost, Meta-Tags must be placed in the head section of the templates which you activated in the admin section.

The head-section of, f.e., the simple-template comes like this:
Quote:
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en">
<head>
<title><SITE_TITLE></title>
<!-- Link for ATOM feed autodiscovery -->
<ATOM_AUTODETECT_LINK>
<!-- Link for RSS feed autodiscovery -->
<RSS_AUTODETECT_LINK>
<!-- META -->
<meta http-equiv="Content-Type" content="text/html;charset=utf-8" />
<meta http-equiv="content-type" name="keywords" content="PhotoBlog,<SITE_TITLE>,<IMAGE_TITLE>,Pixe lpost" />
<meta http-equiv="content-type" name="description" content="<SITE_TITLE>-PhotoBlog: <IMAGE_TITLE>, <IMAGE_NOTES_CLEAN>" />
<!-- CSS -->
<link rel="stylesheet" type="text/css" href="templates/simple/styles/light.css" title="light" />
<link rel="alternate stylesheet" type="text/css" href="templates/simple/styles/dark.css" title="dark" />
<!-- SCRIPTS -->
<script type="text/javascript" src="templates/simple/scripts/styleswitcher.js"></script>
</head>
so why not add these lines to the header?
Quote:
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" lang="en" xml:lang="en">
<head>
....
<!-- Spider-Control -->
<meta name="robots" content="noimageindex,nomediaindex" />
<meta name="robots" content="noarchive" />
<meta http-equiv="pragma" content="no-cache" />
<meta http-equiv="imagetoolbar" content="false" />
</head>
what do these directives mean?

Microsoft itself recomments to use this tag to stop indexing of Image-Files and Media-Files (I did not find a specification which mediafiles they will stop to index...)
Quote:
<meta name="robots" content="noimageindex,nomediaindex" />
This will stop the Bots to show your page as "archived version", especially useful for dynamic content, and Pixelpost is dynamically generated content!
Quote:
<meta name="robots" content="noarchive" />
Proxy-Agents should not cache your content at Proxy-Servers:
Quote:
<meta http-equiv="pragma" content="no-cache" />
and the last one, especially for our good friend, the Internet Explorer, this one: Do not show this molesting toolbar whenever an image is included at the site:
Quote:
<meta http-equiv="imagetoolbar" content="false" />

add these lines to the head-sections of image_template.html, about_template.html, browse_template.html, comment_template.html .. to all your template-files in your template-folder

this was the third and last part of the small tutorial, when I find time (after adding all this code to all my .htaccess, robots.txt and template-files ), I will add this to the Pixelpost Wiki as well