<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>BPWrap &#187; Popular</title>
	<atom:link href="http://www.bpwrap.com/category/popular/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.bpwrap.com</link>
	<description>Just another WordPress weblog</description>
	<lastBuildDate>Tue, 18 May 2010 22:29:52 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Content May Be The SEO King But Check The Palace Architecture</title>
		<link>http://www.bpwrap.com/2009/07/content-may-be-the-seo-king-but-check-the-palace-architecture/</link>
		<comments>http://www.bpwrap.com/2009/07/content-may-be-the-seo-king-but-check-the-palace-architecture/#comments</comments>
		<pubDate>Sat, 01 Aug 2009 02:30:16 +0000</pubDate>
		<dc:creator>Barry Welford</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Popular]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Google Webmaster Tools]]></category>
		<category><![CDATA[Search engine optimization]]></category>
		<category><![CDATA[website architecture]]></category>
		<category><![CDATA[Xenu Link Sleuth]]></category>

		<guid isPermaLink="false">http://www.bpwrap.com/?p=1120</guid>
		<description><![CDATA[Where should effort go to be highly visible in keyword searches with Google? A precise answer probably demands an evaluation of all the Google Search Engine Ranking Factors like that done by SEOmoz. However the Google search debate normally seems to revolve around Content or Links. Mark Jackson stated quite categorically The Golden Rule of [...]<p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/07/content-may-be-the-seo-king-but-check-the-palace-architecture/">Content May Be The SEO King But Check The Palace Architecture</a></p>
]]></description>
			<content:encoded><![CDATA[<!-- sphereit start --><p>Where should effort go to be highly visible in keyword searches with Google?  A precise answer probably demands an evaluation of all the <a href="http://www.seomoz.org/article/search-ranking-factors">Google Search Engine Ranking Factors</a> like that done by <strong>SEOmoz</strong>.  However the Google search debate normally seems to revolve around Content or Links.</p>
<p><strong>Mark Jackson</strong> stated quite categorically <a href="http://searchenginewatch.com/3625720">The Golden Rule of SEO: Content is King</a>.  <strong>Aaron Wall</strong> also commented that although there are those who push for linking, <a href="http://www.seobook.com/archives/000183.shtml">Content is King</a> when you get down to it.</p>
<p>The debate is ongoing and there are even those who stay on the fence and declare both <a href="http://www.btobonline.com/apps/pbcs.dll/article?AID=/20090729/FREE/907299992/1112/FREE">content and inbound linking is king</a>.</p>
<p>How should you budget your effort then in trying to improve rankings in Google keyword searches?  Should it be equal effort on <strong>content</strong> and <strong>links</strong> or some other ratio? </p>
<p>That is a false dichotomy since there is another equally important website dimension that does not even feature in this debate.  Indeed if you look at the hot issues in the past two months in SEO, that third dimension has been the one the experts have been talking about.  The third leg of the tripod is the website architecture.   It deserves equal effort as illustrated in the following table, which is derived from a chart in an article on <a href="http://www.smmbc.ca/seokey.htm">Key SEO Services</a>.  In this table, the Plus value is a reminder of the Upsides that can be created by working on this aspect.  The Minus value indicates the Downsides if this aspect is handled poorly.</p>
<style type="text/css">
	.yellow	{	background-color:#ffffe1;width:167px;	}
	.pink	{	background-color:#ffe1ff;width:166px;	}
	.blue	{	background-color:#e1ffff;width:167px;	}
	.bold		{	font-weight:bold;text-align:center;line-height:1.4em;	}
</style>
<div style="margin: 10px 0pt; width: 500px;">
<table border="1" width="500px">
<tbody>
<tr>
<td colspan="3" class="bold">
<h2 style="margin-top: 10px;">Triple Power SEO</h2>
</td>
</tr>
<tr>
<td class="bold yellow">Architecture</td>
<td class="bold pink">Content</td>
<td class="bold blue">Links</td>
</tr>
<tr>
</tr>
<tr>
<td colspan="3"><img src="http://www.bpwrap.com/images/seotrio.jpg" alt="SEO Trio" height="139px" width="498px"></td>
</tr>
<tr>
<td class="bold yellow">Plus &#8211; Visibility</td>
<td class="bold pink">Plus &#8211; Appeal</td>
<td class="bold blue">Plus &#8211; Authority</td>
</tr>
<tr>
<td class="bold yellow">Minus &#8211; Barriers to search engines</td>
<td class="bold pink">Minus &#8211; Spam</td>
<td class="bold blue">Minus &#8211; Penalties</td>
</tr>
<tr>
<td colspan="3"><img src="http://www.bpwrap.com/images/eyes.jpg" alt="eyes" height="82px" width="498px"></td>
</tr>
<tr>
<td colspan="3" class="bold">Monitor&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&gt;&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Measure&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&gt;&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Manage&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&gt;&gt;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Improve</td>
</tr>
</tbody>
</table>
</div>
<p>It is somewhat surprising website architecture gets so little air time given its importance.  It can be complex but a great deal of improvement can be obtained with very simple methods.  It is likely that the vast majority of websites that do not perform well in Google keyword searches suffer from website architecture problems.  Yet much progress can be made with some of the most basic tools.</p>
<p>For example, the <a href="http://home.snafu.de/tilman/xenulink.html">Xenu Link Sleuth (TM)</a> will find broken links on web sites, which can be a major problem for the search engine spiders.  It crawls the website in exactly the way a search engine spider does.  Google itself provides a great deal of information through its <a href="https://www.google.com/webmasters/tools/">Google Webmaster Tools</a> website.  If you have <a href="https://www.google.com/analytics/">Google Analytics</a> or <a href="https://www.google.com/adsense/">Google Adsense</a> ads on your website, then both give statistics that often pinpoint problems in the website architecture.  Efforts put into correction efforts will have clear and certain results, unlike those hoped for when efforts go into content or links.</p>
<p>The aim of this article was to discuss the high level allocation of effort rather than to provide a compendium of website architecture problem areas and solutions. For the latter we can recommend an article by <strong>Richard Baxter</strong>, <a href="http://www.searchenginejournal.com/seo-website-architecture-issues/9760/">Diagnose Critical Website Architecture Issues for SEO</a>.  He also has <a href="http://www.slideshare.net/richardbaxterseo/smx-london-2009-diagnosing-website-architecture-issues-richard-baxter">a slide show of the presentation</a> he gave at SMX London 2009 on the same topic.  It&#8217;s very good value if you want pointers on how to put effort into this priority SEO dimension.</p>
<div style="margin-top: 10px; height: 15px;" class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/e2c4bcb5-abb9-428a-8ea9-9ea7c7a7a759/" title="Reblog this post [with Zemanta]"><img style="border: medium none ; float: right;" class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=e2c4bcb5-abb9-428a-8ea9-9ea7c7a7a759" alt="Reblog this post [with Zemanta]"></a><span class="zem-script more-related pretty-attribution"><script type="text/javascript" src="http://static.zemanta.com/readside/loader.js" defer="defer"></script></span></div>
<div id="st200810011297" class="st-taf"><script src="http://cdn.socialtwist.com/200810011297/script.js"></script><img style="border:0;margin:0;padding:0;" src="http://images.socialtwist.com/200810011297/button.png" onmouseout="hideHoverMap(this)" onmouseover="showHoverMap(this, '200810011297',  'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F07%2Fcontent-may-be-the-seo-king-but-check-the-palace-architecture%2F', 'Content+May+Be+The+SEO+King+But+Check+The+Palace+Architecture')" onclick="cw(this, {id:'200810011297',link: 'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F07%2Fcontent-may-be-the-seo-king-but-check-the-palace-architecture%2F', title: '+Content+May+Be+The+SEO+King+But+Check+The+Palace+Architecture+' })"/></div><hr/>Copyright &copy; 2010 <strong><a href="http://www.bpwrap.com">BPWrap</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact admin@bpwrap.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/07/content-may-be-the-seo-king-but-check-the-palace-architecture/">Content May Be The SEO King But Check The Palace Architecture</a></p>
<!-- sphereit end --><span style="margin-bottom:40px; border-bottom:none;"><a class="iconsphere" title="Sphere: Related Content" onclick="return Sphere.Widget.search('http://www.bpwrap.com/2009/07/content-may-be-the-seo-king-but-check-the-palace-architecture/')" href="http://www.sphere.com/search?q=sphereit:http://www.bpwrap.com/2009/07/content-may-be-the-seo-king-but-check-the-palace-architecture/">Sphere: Related Content</a></span><br/><br/><h3  class="related_post_title">Related Posts</h3><ul class="related_post"><li>October 10, 2009 -- <a href="http://www.bpwrap.com/2009/10/users-or-search-engines-is-a-false-seo-debate/" title="Users Or Search Engines Is A False SEO Debate">Users Or Search Engines Is A False SEO Debate</a> (10)</li><li>August 20, 2009 -- <a href="http://www.bpwrap.com/2009/08/natural-links-bring-search-engine-visibility/" title="Natural Links Bring Search Engine Visibility">Natural Links Bring Search Engine Visibility</a> (9)</li></ul>]]></content:encoded>
			<wfw:commentRss>http://www.bpwrap.com/2009/07/content-may-be-the-seo-king-but-check-the-palace-architecture/feed/</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>PageRank Calculation &#8211; Null Hypothesis</title>
		<link>http://www.bpwrap.com/2009/06/pagerank-calculation-null-hypothesis/</link>
		<comments>http://www.bpwrap.com/2009/06/pagerank-calculation-null-hypothesis/#comments</comments>
		<pubDate>Tue, 09 Jun 2009 17:46:55 +0000</pubDate>
		<dc:creator>Barry Welford</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Popular]]></category>
		<category><![CDATA[algorithm]]></category>
		<category><![CDATA[Andy Beard]]></category>
		<category><![CDATA[Matt Cutts]]></category>
		<category><![CDATA[PageRank]]></category>

		<guid isPermaLink="false">http://blog.cre8asite.net/bwelford/?p=1044</guid>
		<description><![CDATA[Summary The SEO world continues to be shaken by the hints offered by Matt Cutts last week on the nofollow tag and PageRank Sculpting. Many seem shaken but perhaps people have made assumptions about PageRank that are not true. It is difficult to prove how things really work since Google remains cagey on what really [...]<p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/06/pagerank-calculation-null-hypothesis/">PageRank Calculation &#8211; Null Hypothesis</a></p>
]]></description>
			<content:encoded><![CDATA[<!-- sphereit start --><h2>Summary</h2>
<p>The SEO world continues to be shaken by the hints offered by <strong>Matt Cutts</strong> last week on the nofollow tag and PageRank Sculpting.  Many seem shaken but perhaps people have made assumptions about PageRank that are not true.  It is difficult to prove how things really work since Google remains cagey on what really happens.  Here we offer a Null Hypothesis, a simple explanation, that people may wish to consider.  This should only be replaced by a more complex view, when that can be proven to be better.</p>
<h2>Introduction</h2>
<p><strong>Andy Beard</strong> is a keen observer of the Google scene and poses the key question following the Matt Cutts revelations last week: <a href="http://andybeard.eu/1865/pagerank-sculpting-dead.html">Is PageRank Sculpting Dead &#038; Can Comments Kill Your PageRank?</a></p>
<blockquote><p>Has Google in one quick swipe removed all benefit of Dynamic Linking (old school term) or PageRank sculpting (when it became &#8220;trendy&#8221;), and potentially caused massive penalties for sites nofollowing links for user generated content and comments?</p></blockquote>
<p>Important articles he cites on this are:</p>
<ul>
<li><a href="http://searchengineland.com/google-loses-backwards-compatibility-on-paid-link-blocking-pagerank-sculpting-20408">Google Loses “Backwards Compatibility” On Paid Link Blocking &amp; PageRank Sculpting</a> </li>
<li><a href="http://www.seomoz.org/blog/google-maybe-changes-how-the-pagerank-algorithm-handles-nofollow">Google (Maybe) Changes How the PageRank Algorithm Handles Nofollow</a> </li>
<li><a href="http://www.seomoz.org/blog/no-clarification-forthcoming-from-google-on-nofollow-pagerank-flow">No Clarification Forthcoming from Google on Nofollow &amp; PageRank Flow</a> </li>
<li><a href="http://www.aimclearblog.com/2009/06/03/is-whats-good-for-google-good-for-seo/">Is What&#8217;s Good For Google, Good For SEO</a></li>
</ul>
<p>Not everyone is so concerned and <strong>Andrew Goodman</strong> frankly states, <a href="http://www.traffick.com/2009/06/pagerank-sculpting-is-dead-good.asp">PageRank Sculpting is Dead? Good Riddance</a>.</p>
<h2>How Is PageRank Calculated</h2>
<p>The Google website is strangely obscure on how PageRank is calculated.  There are a few explanations in Google Answers (no longer supported) by others on such issues as <a href="http://answers.google.com/answers/threadview?id=223807">My Page Rank</a> and <a href="http://answers.google.com/answers/threadview?id=379557">Page Rank Definition &#8211; Proof of convergence and uniqueness</a>.  <strong>Phil Craven</strong> offers a reasonably straightforward account in <a href="http://www.webworkshop.net/pagerank.html">Google&#8217;s PageRank Explained and how to make the most of it</a>.  That includes the following:</p>
<blockquote><p><strong>Notes:</strong><br />
Not all links are counted by Google. For instance, they filter out links from known link farms. Some links can cause a site to be penalized by Google. They rightly figure that webmasters cannot control which sites link to their sites, but they can control which sites they link out to. For this reason, links into a site cannot harm the site, but links from a site can be harmful if they link to penalized sites.</p></blockquote>
<h2>The  KISS principle (Keep It Simple, Sweetheart)</h2>
<p>The problem in trying to estimate how the Google PageRank process works is that PageRank is only one of over 100 or more factors involved in how web pages rank in keyword searches.  Any attempt to prove a given assumption about PageRank involves a typical statistical analysis where one tries to infer an explanation from somewhat fuzzy data.  That is where the <a href="http://www.bpwrap.com/2008/05/microsoft-should-kiss-more-often/">KISS principle</a> comes in.  In such a situation, we should perhaps rely on the approach favored by some great minds.</p>
<dl style="margin:0 40px 0 20px;font-size:small;">
<dt><em>Of two competing theories or explanations, all other things being equal, the simpler one is to be preferred.</em></dt>
<dd><strong>Occam</strong> of Occam&#8217;s Razor</dd>
<dt><em>A scientific theory should be as simple as possible, but no simpler.</em></dt>
<dd><strong>Albert Einstein</strong></dd>
<dt><em>The Null Hypothesis is presumed true until statistical evidence indicates otherwise.</em></dt>
<dd><strong>Sir Roland Fisher</strong></dd>
</dl>
<p>Given the Matt Cutts suggestions that some find perplexing, what is the simplest explanation of the Google PageRank process that could explain what is involved?</p>
<h2>The Null Hypothesis</h2>
<p>What the PageRank process attempts is mind-boggling.  It aims to attach a PageRank value to any hyperlink in the total space of web pages and their inter links. It involves an iterative calculation since the values are interdependent.  This PageRank can be considered as the probability that a random surfer (human or robot) will pass down that link as compared with all the other links in the total space (graph).</p>
<p>As Matt Cutts reminded us last week, even if a web page is excluded by its owner using a robots.txt file, it may still get into consideration if it is included in a sitemap file or has a link coming in from another external web page.  Having an extra indicator on each link indicating whether it is influential in passing PageRank would increase the complexity of the data enormously.  Given that, we propose the following Null Hypothesis <span style="font-size:0.9em;"><em>(See foot of article for definition of this expression)</em></span>.  This of course could be abandoned in favor of a more complex explanation if that could be proven statistically with sufficient confidence or if Google chose to provide a more correct explanation of what is done.</p>
<p>The Null Hypothesis runs as follows.  The whole process splits into two phases.  The first phase looks at all web pages (URLs) and all associated links to determine the PageRank of each web page and thus the contributions that would flow down each link. There are no exclusions and this calculation process handles all URLs in the total Internet space (graph).  Modifiers that website owners may have applied such as robots.txt files or tags such as noindex, nofollow, etc. do not get involved at this stage.  For all URLs and links without exception, values such as those illustrated below would be calculated.</p>
<div style="clear:both;margin:10px 40px;">
<img src="http://www.bpwrap.com/images/pagerankchart.jpg" width="397" height="323" alt="PageRank Chart" />
</div>
<p>The second phase of the process involves how these PageRank values are then used within the search algorithms.  Here whatever is specified via robots.txt or nofollow tags would apply.  The PageRank contribution from nofollow-ed links thus would not be included in the calculation.  Also any filtering factors that Google may wish to apply for bad neighborhoods, etc. would only apply in this second phase.  The underlying web pages and links would still have a first-phase PageRank calculated but this would in no way influence the second phase results.</p>
<p>This two-phase approach would seem to square with what we have been hearing recently.  It is offered very much as a Null Hypothesis, so if someone has an Alternative Hypothesis, we look forward to hearing it.  Over to you.</p>
<h2>Implications of this Null Hypothesis</h2>
<p>In the mean time, if this explanation is true, some obvious considerations apply.  The basic PageRank calculation is determined by the total set of URLs and links.  The PageRank value for a URL is never changed by modifications that apply in the second phase.  All that happens in the second phase is that some of the PageRank contributions are ignored.  So the effective PageRank that a URL has in the second phase is always less than or equal to what it had in the first phase.  A URL can be more prominent solely because others become less prominent.</p>
<p>Significant changes can only come about through influences that affect the first phase.  These relate to the site architecture rather than the interlinkages.  That is after all what Matt Cutts was recommending.</p>
<p><strong>Footnote:</strong><br />
See this link for <a href="http://www.null-hypothesis.co.uk/science//item/what_is_a_null_hypothesis">an explanation of Null Hypothesis</a>.</p>
<div id="st200810011297" class="st-taf"><script src="http://cdn.socialtwist.com/200810011297/script.js"></script><img style="border:0;margin:0;padding:0;" src="http://images.socialtwist.com/200810011297/button.png" onmouseout="hideHoverMap(this)" onmouseover="showHoverMap(this, '200810011297',  'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F06%2Fpagerank-calculation-null-hypothesis%2F', 'PageRank+Calculation+%26%238211%3B+Null+Hypothesis')" onclick="cw(this, {id:'200810011297',link: 'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F06%2Fpagerank-calculation-null-hypothesis%2F', title: '+PageRank+Calculation+%26%238211%3B+Null+Hypothesis+' })"/></div><hr/>Copyright &copy; 2010 <strong><a href="http://www.bpwrap.com">BPWrap</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact admin@bpwrap.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/06/pagerank-calculation-null-hypothesis/">PageRank Calculation &#8211; Null Hypothesis</a></p>
<!-- sphereit end --><span style="margin-bottom:40px; border-bottom:none;"><a class="iconsphere" title="Sphere: Related Content" onclick="return Sphere.Widget.search('http://www.bpwrap.com/2009/06/pagerank-calculation-null-hypothesis/')" href="http://www.sphere.com/search?q=sphereit:http://www.bpwrap.com/2009/06/pagerank-calculation-null-hypothesis/">Sphere: Related Content</a></span><br/><br/><h3  class="related_post_title">Related Posts</h3><ul class="related_post"><li>April 17, 2010 -- <a href="http://www.bpwrap.com/2010/04/higher-search-engine-rankings-without-a-home-page/" title="Higher Search Engine Rankings Without A Home Page">Higher Search Engine Rankings Without A Home Page</a> (5)</li><li>June 19, 2009 -- <a href="http://www.bpwrap.com/2009/06/seo-gets-simpler-in-2009/" title="SEO Gets Simpler In 2009">SEO Gets Simpler In 2009</a> (23)</li><li>February 22, 2009 -- <a href="http://www.bpwrap.com/2009/02/deep-web-now-web-more-headaches-for-google/" title="Deep Web, NOW Web &#8211; more headaches for Google">Deep Web, NOW Web &#8211; more headaches for Google</a> (13)</li><li>May 24, 2008 -- <a href="http://www.bpwrap.com/2008/05/someone-wrong-internet/" title="Someone is wrong on the Internet">Someone is wrong on the Internet</a> (0)</li><li>December 27, 2007 -- <a href="http://www.bpwrap.com/2007/12/forget-we-ever-mentioned-a-supplemental-index-google/" title="Forget We Ever Mentioned A Supplemental Index &#8211; Google">Forget We Ever Mentioned A Supplemental Index &#8211; Google</a> (6)</li></ul>]]></content:encoded>
			<wfw:commentRss>http://www.bpwrap.com/2009/06/pagerank-calculation-null-hypothesis/feed/</wfw:commentRss>
		<slash:comments>29</slash:comments>
		</item>
		<item>
		<title>Hanging / Dangling Web Pages Can Be PageRank Black Holes</title>
		<link>http://www.bpwrap.com/2009/06/hanging-dangling-web-pages-can-be-pagerank-black-holes/</link>
		<comments>http://www.bpwrap.com/2009/06/hanging-dangling-web-pages-can-be-pagerank-black-holes/#comments</comments>
		<pubDate>Sun, 07 Jun 2009 18:47:56 +0000</pubDate>
		<dc:creator>Barry Welford</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Popular]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[canonical]]></category>
		<category><![CDATA[dangling]]></category>
		<category><![CDATA[follow]]></category>
		<category><![CDATA[hanging]]></category>
		<category><![CDATA[links]]></category>
		<category><![CDATA[nofollow]]></category>
		<category><![CDATA[noindex]]></category>
		<category><![CDATA[web page]]></category>

		<guid isPermaLink="false">http://blog.cre8asite.net/bwelford/?p=1026</guid>
		<description><![CDATA[Summary Search Engine Optimization (SEO) for blogs is often not done effectively and posts rank below where they should be in keyword searches.&#160; One particular problem can be hanging/dangling web pages created by the blogging software coupled with inappropriate use of robots.txt files and tags.&#160; Such hanging web pages can act as sinks or black [...]<p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/06/hanging-dangling-web-pages-can-be-pagerank-black-holes/">Hanging / Dangling Web Pages Can Be PageRank Black Holes</a></p>
]]></description>
			<content:encoded><![CDATA[<!-- sphereit start --><h2>Summary </h2>
<p>Search Engine Optimization (SEO) for blogs is often not done effectively and posts rank below where they should be in keyword searches.&#160; One particular problem can be hanging/dangling web pages created by the blogging software coupled with inappropriate use of robots.txt files and tags.&#160; Such hanging web pages can act as sinks or black holes for PageRank, a key factor in the Google search algorithm.&#160; This article provides a simple explanation of the issues involved and appropriate solutions. </p>
<h2>Introduction </h2>
<p>&quot;<em>You are creating hanging/dangling pages</em>&quot;, wrote <strong>Andy Beard</strong> in a recent comment on a post on <a href="http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/">Avoiding WordPress Duplicate Content</a>. After an e-mail exchange, I could understand his concern.&#160; It is a potential problem that robots.txt files could create.&#160; As Andy wrote some time back, it is one of the <a href="http://andybeard.eu/1121/seo-linking-gotchas-even-the-pros-make.html">SEO Linking Gotchas Even The Pros Make</a>.&#160; </p>
<p>More recently, <strong>Rand Fishkin</strong> has pointed out that you should not <a href="http://www.seomoz.org/blog/headsmacking-tip-13-dont-accidentally-block-link-juice-with-robotstxt">Accidentally Block Link Juice with Robots.txt</a>.&#160; Rand advised doing the following: </p>
<ol>
<li>Conserve link juice by using nofollow when linking to a URL that is robots.txt disallowed </li>
<li>If you know that disallowed pages have acquired link juice (particularly from external links), consider using meta noindex, follow instead so they can pass their link juice on to places on your site that need it. </li>
</ol>
<p>Link juice is just another term for PageRank.&#160; This PageRank value for any web page is an important element in how well it will rank in any keyword search.&#160; It may be one of over 100 factors but it probably is the most important in the Google keyword search process. Avoiding losing PageRank that a web page could amass is an important function that SEOs should pursue. </p>
<p>After doing some research, it turns out to be a somewhat more complex issue requiring an understanding of some weighty articles.&#160; Anyone involved in doing SEO or hiring an SEO consultant should be aware of the potential problem to ensure things are done correctly.&#160; I also realized that there was no simple explanation of the issues so this post will attempt to rectify that omission. </p>
<h2>Research on Hanging / Dangling Web Pages</h2>
<p>If you want to do some of your own research, before checking out the later explanations, I found the following useful: </p>
<ul>
<li><a href="http://www.webproworld.com/search-engine-optimization-forum/85010-dangling-pages.html#post440559">Dangling Pages</a> &#8211; WebProWorld SEO Forum </li>
<li><a href="http://www.hobo-web.co.uk/seo-blog/index.php/robots-txt-files/">What Do SEO/SEM People Put In Robots.txt Files?</a> &#8211; <strong>Shaun Anderson</strong> </li>
<li><a href="http://www.askapache.com/seo/updated-robotstxt-for-wordpress.html">WordPress robots.txt SEO</a> &#8211; AskApache Web Development </li>
<li><a href="http://ez-onlinemoney.com/blog/search-engine-optimization/internal-linking-meta-nofollow-rel-nofollow-robotstxt-confusion/">Internal Linking &#8211; META nofollow, rel nofollow, robots.txt Confusion thereon</a> &#8211; <strong>Josh Spaulding</strong> </li>
</ul>
<p>Of course with search engine algorithms, things are always in evolution.&#160; The official word on the Google website gives the following information on <a href="http://www.google.com/support/webmasters/bin/answer.py?hl=en&amp;answer=96569">rel=&quot;nofollow&quot;</a>. </p>
<blockquote><p><strong>How does Google handle nofollowed links?</strong> </p>
<p><em>We don&#8217;t follow them. This means that Google does not transfer PageRank or anchor text across these links. Essentially, using nofollow causes us to drop the target links from our overall graph of the web. However, the target pages may still appear in our index if other sites link to them without using nofollow, or if the URLs are submitted to Google in a Sitemap. Also, it&#8217;s important to note that other search engines may handle nofollow in slightly different ways.</em> </p>
</blockquote>
<p>That lead to the practice of PageRank sculpting, whereby people try to manage how PageRank is distributed among the web pages in a website.&#160; More recently <strong>Matt Cutts</strong> of Google in a Q&amp;A session at SMX Advanced 2009 in Seattle, WA, provided the <a href="http://outspokenmedia.com/internet-marketing-conferences/chat-with-matt-cutts/">current thinking on nofollow</a> as recorded by <strong>Lisa Barone</strong>: </p>
<blockquote><p>Q: It seems like you supported PageRank sculpting a year ago and now it seems like you don’t support it anymore. Why is that and will it become a negative indicator? </p>
<p>A: <em>No, it won’t hurt your site. You can do your links however you want. You can use it to eliminate links to sign in forms and whatnot, but it is a better use of your time to fix your site architecture and fix the problem from the core. Suppose you have 10 links and 5 of them are nofollowed. There is this assumption that the other 5 links get ALL that PageRank and that may not be as true anymore (your leftover PageRank will now “evaporate”, says Matt.). You can’t shunt your PageRank where you want it to go. It’s not a penalty. It’s not going to get you in trouble. However, it’s not as effective. It’s a better use of your time to go make new content and do all the other things. If you’re using nofollow to change how PageRank flows, it’s like a band-aid. It’s better to build your site how you want PageRank to flow from the beginning.</em> </p>
</blockquote>
<p>Let us now try to pull all that together in a short number of simple explanations covering the important issues involved.</p>
<h2>How PageRank is calculated </h2>
<p>Google is not always completely open on what is involved in its search algorithms for obvious reasons.&#160; The algorithms also evolve as the Q&amp;A quote above shows.&#160; The following is a best judgment on what is involved, but if anyone has corrections or modifications to what is shown, they are encouraged to add a comment. </p>
<p>The following diagram illustrates how PageRank is calculated for any web page and how fractions of the PageRank flow to and from linked web pages.&#160; PageRank here is not the value that appears in the &#8216;thermometer&#8217; in the Google Toolbar, and which goes from 0 to 10.&#160; Instead this PageRank is the mathematical value used in the Google keyword search algorithm.&#160; It is calculated for any web page and represents the probability that a random visitor would visit the given web page as opposed to visiting other web pages. </p>
<p>Here we have multiplied this mathematical value by a huge multiplier to give values that are easier to talk about.&#160; We will use the term, PageRank factor, for this derived number.&#160; The resulting number would normally be a value like 5.6 or 16.2 but here we have simplified yet again to round off to whole numbers.&#160; This illustrates a typical web page (but with very few links).&#160; Some links are external links involving other web pages on other websites (domains).&#160; Some are internal links from web pages on the same website (domain).&#160; The inlinks are hyperlinks on other web pages leading to this web page.&#160; The outlinks are hyperlinks on the given web page to other web pages. </p>
<div style="clear:both;margin:10px auto;"><img src="http://www.bpwrap.com/images/pagerank.jpg" width="480" height="320" alt="PageRank Illustration" /></div>
<p>What the image illustrates is that the PageRank factor of this web page (16) is determined by the sum of the PageRank factor contributions flowing through the inlinks.&#160; This PageRank factor then flows out via the 4 outlinks with an equal PageRank factor contribution (4) on each link. </p>
<p>You can imagine this particular web page as being only one among the whole set of web pages on the Internet.&#160; For the technically inclined, we should mention that these PageRank values all are interdependent so they are developed by a process of iteration starting with starting values and repeatedly recalculating to determine what the values are. That goes beyond the scope of this article. </p>
<h2>How a robots.txt file changes the picture </h2>
<div style="clear:right;float:right;margin:35px auto;"><img src="http://www.bpwrap.com/images/pagerankwithrobotstxt.jpg" width="240" height="160" alt="PageRank with robots.txt file  Illustration" /></div>
<p>If a robots.txt file disallows this web page for crawl visits by the search engine spiders, then provided they obey the robots.txt file, they would record the values and links shown in this image.&#160;These PageRank values are the same, whether or not the web page is blocked to crawlers by the robots.txt file. The record is indexed because there is an external inlink that the Google robots are crawling and they would also note the outlink going to another domain.&#160; The outlinks to other web pages on the same domain (internal links) would not be recorded so these PageRank contributions are lost.&#160; In this sense the web page has become a sink or black hole for these PageRank contributions.&#160; They can no longer contribute to the PageRank of these other web pages. </p>
<p>Note that the PageRank factor values on the remaining links are the same as they were when the other links were being included.  Merely saying the links should not be crawled, does not necessarily mean they should be assumed not to exist.  This is in line with Matt Cutt&#8217;s most recent pronouncements.</p>
<h2>How nofollow changes the calculation </h2>
<p>Even if this web page was not excluded by a robots.txt file, a similar effect is created if all outlinks from the web page carry an attribute, rel=nofollow.&#160; Again this assumes that the search engine correctly observes this attribute.&#160; If on the other hand the links are assigned a follow attribute, then the PageRank contribution would flow through to all such links. </p>
<h2>How to get only one web page that counts for any specific content </h2>
<p>As Rand Fishkin suggested above, if more than one web page contains the same content, you can use a meta tag on all the secondary ones to signal noindex.&#160; Then only the primary web page is in the search database, provided the meta tags are being observed.&#160; Coupling this with a follow attribute in the meta tag, then assures that the PageRank contributions still flow out to the other web pages. </p>
<p>A better approach according to <strong>John Mueller</strong>, a Google representative, is to <a href="http://www.seroundtable.com/archives/020151.html">use a Rel=Canonical Tag rather than NoIndex</a>.&#160; Here is how Google describes this <a href="http://googlewebmastercentral.blogspot.com/2009/02/specify-your-canonical.html">canonical tag</a>. </p>
<blockquote><p>We now support a format that allows you to publicly specify your preferred version of a URL. If your site has identical or vastly similar content that&#8217;s accessible through multiple URLs, this format provides you with more control over the URL returned in search results. It also helps to make sure that properties such as link popularity are consolidated to your preferred version. </p>
</blockquote>
<p>Apparently Google treats this as a hint rather than a standard so it is not fool-proof. Others see <a href="http://www.conversationmarketing.com/2009/02/3-reasons-to-use-rel-canonical.htm">Reasons to use rel=canonical, and reasons not to</a>. </p>
<h2>Best Practices</h2>
<p>As Matt Cutts recommended, given the wooliness in some of the above, the preferred approach is to develop the website architecture so that duplicate web pages do not arise.&#160; Then one does not have to rely on the canonical tag or the noindex follow combination.&#160; In this way one avoids the hanging / dangling web pages problem. </p>
<p>The exact methods will depend on the architecture. One very useful approach is to show only an initial excerpt on the blog Home Page with a &#8230; more link to the full post as a single web page.&#160;For category or tag archive pages, you can show only the titles of items so this again avoids the duplicate content problem. The important thing is to be vigilant and look out for essentially duplicate web pages as revealed by a full website scan using the equivalent of a search engine robot such as Xenu.</p>
<div id="st200810011297" class="st-taf"><script src="http://cdn.socialtwist.com/200810011297/script.js"></script><img style="border:0;margin:0;padding:0;" src="http://images.socialtwist.com/200810011297/button.png" onmouseout="hideHoverMap(this)" onmouseover="showHoverMap(this, '200810011297',  'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F06%2Fhanging-dangling-web-pages-can-be-pagerank-black-holes%2F', 'Hanging+%2F+Dangling+Web+Pages+Can+Be+PageRank+Black+Holes')" onclick="cw(this, {id:'200810011297',link: 'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F06%2Fhanging-dangling-web-pages-can-be-pagerank-black-holes%2F', title: '+Hanging+%2F+Dangling+Web+Pages+Can+Be+PageRank+Black+Holes+' })"/></div><hr/>Copyright &copy; 2010 <strong><a href="http://www.bpwrap.com">BPWrap</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact admin@bpwrap.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/06/hanging-dangling-web-pages-can-be-pagerank-black-holes/">Hanging / Dangling Web Pages Can Be PageRank Black Holes</a></p>
<!-- sphereit end --><span style="margin-bottom:40px; border-bottom:none;"><a class="iconsphere" title="Sphere: Related Content" onclick="return Sphere.Widget.search('http://www.bpwrap.com/2009/06/hanging-dangling-web-pages-can-be-pagerank-black-holes/')" href="http://www.sphere.com/search?q=sphereit:http://www.bpwrap.com/2009/06/hanging-dangling-web-pages-can-be-pagerank-black-holes/">Sphere: Related Content</a></span><br/><br/><h3  class="related_post_title">Related Posts</h3><ul class="related_post"><li>April 17, 2010 -- <a href="http://www.bpwrap.com/2010/04/higher-search-engine-rankings-without-a-home-page/" title="Higher Search Engine Rankings Without A Home Page">Higher Search Engine Rankings Without A Home Page</a> (5)</li><li>October 16, 2009 -- <a href="http://www.bpwrap.com/2009/10/hyperlinks-get-even-more-respect/" title="Hyperlinks Get Even More Respect">Hyperlinks Get Even More Respect</a> (4)</li><li>June 30, 2009 -- <a href="http://www.bpwrap.com/2009/06/seo-factories-lose-their-edge/" title="SEO Factories Lose Their Edge ">SEO Factories Lose Their Edge </a> (10)</li><li>June 19, 2009 -- <a href="http://www.bpwrap.com/2009/06/seo-gets-simpler-in-2009/" title="SEO Gets Simpler In 2009">SEO Gets Simpler In 2009</a> (23)</li><li>December 9, 2008 -- <a href="http://www.bpwrap.com/2008/12/visitors-bounce/" title="Visitors Bounce">Visitors Bounce</a> (18)</li></ul>]]></content:encoded>
			<wfw:commentRss>http://www.bpwrap.com/2009/06/hanging-dangling-web-pages-can-be-pagerank-black-holes/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Avoid WordPress Duplicate Content Problems With Google</title>
		<link>http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/</link>
		<comments>http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/#comments</comments>
		<pubDate>Thu, 04 Jun 2009 03:10:40 +0000</pubDate>
		<dc:creator>Barry Welford</dc:creator>
				<category><![CDATA[Popular]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[content]]></category>
		<category><![CDATA[duplicate]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Wo]]></category>
		<category><![CDATA[WordPress]]></category>
		<category><![CDATA[Xenu]]></category>

		<guid isPermaLink="false">http://blog.cre8asite.net/bwelford/?p=1012</guid>
		<description><![CDATA[The best way to ensure a web page ranks well in Google keyword searches is to make sure it is the only one on the web that includes the content on the page. In this way you avoid several web pages all having a somewhat equal possibility of being judged relevant for the particular keyword [...]<p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/">Avoid WordPress Duplicate Content Problems With Google</a></p>
]]></description>
			<content:encoded><![CDATA[<!-- sphereit start --><p>The best way to ensure a web page ranks well in Google keyword searches is to make sure it is the only one on the web that includes the content on the page.  In this way you avoid several web pages all having a somewhat equal possibility of being judged relevant for the particular keyword search.  This increases the chance that this unique page will outrank other quite independent web pages that cover the same topic.  That&#8217;s the theory and it seems to work out well in practice.</p>
<p>WordPress is a great software for producing blogs but out-of-the-box the WordPress content management system produces a series of pages that all contain the same content.   Just see the concerns expressed in this WebmasterWorld thread about <a href=" http://www.webmasterworld.com/google/3097706.htm">WordPress And Google: Avoiding Duplicate Content Issues</a> where several coding suggestions were offered to avoid the problems.  More recently, David Bradley has suggested that something called the canonical link element can be the solution to <a href="http://www.sciencetext.com/avoiding-duplicate-content-penalties.html">Avoiding Duplicate Content Penalties</a>.</p>
<p>We should quickly add that this is not an inherent weakness of WordPress alone since many other CMSs will suffer from similar problems.  It is a well known problem and you can find an excellent article on how to  <a href="http://www.addicottweb.com/2009/05/avoid-duplicate-content-on-wordpress-websites/">Avoid Duplicate Content on WordPress Websites</a>, which gives the appropriate steps to take.  The most important step of all is to have the right robots.txt file.</p>
<p>I wondered how well people were grappling with this duplicate content problem and decided to check out some of the <a href="http://technorati.com/pop/blogs/">Technorati&#8217;s Blogger Central / top 100 blogs</a>. In particular I thought a check of their robots.txt files would give an indication on whether they had tried to solve the problem.  Here is what I found for the robots.txt files for the most popular 8 blogs.</p>
<ol>
<li><a href="http://www.huffingtonpost.com/">The Huffington Post</a></li>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
# All robots will spider the domain
User-agent: *
Disallow:

# Disallow directory /backstage/
User-agent: *
Disallow: /backstage/
</pre>
</div>
<li><a href="http://www.techcrunch.com/">TechCrunch</a></li>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
User-agent: *
Disallow: /*/feed/
Disallow: /*/trackback/
</pre>
</div>
<li><a href="http://www.engadget.com/">Engadget</a></li>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
(empty)
</pre>
</div>
<li><a href="http://www.boingboing.net/">Boing Boing</a></li>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
User-agent: *
Disallow: /cgi-bin
</pre>
</div>
<li><a href="http://mashable.com/">Mashable!</a></li>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
User-agent: *
Disallow: /feed
Disallow: /*/feed/
Disallow: /*/trackback/

Disallow: /adcentric
Disallow: /adinterax
Disallow: /atlas
Disallow: /doubleclick
Disallow: /eyereturn
Disallow: /eyewonder
Disallow: /klipmart
Disallow: /pointroll
Disallow: /smartadserver
Disallow: /unicast
Disallow: /viewpoint

Disallow: /LiveSearchSiteAuth.xml
Disallow: /mashableadvertising2.xml
Disallow: /rpc_relay.html

Disallow: /browser.html
Disallow: /canvas.html

User-agent: Fasterfox
Disallow: /
</pre>
</div>
<li><a href="http://lifehacker.com/">Lifehacker</a></li>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
User-Agent: Googlebot
Disallow: /index.xml$
Disallow: /excerpts.xml$
Allow: /sitemap.xml$
Disallow: /*view=rss$
Disallow: /*?view=rss$
Disallow: /*format=rss$
Disallow: /*?format=rss$
Disallow: /*?mailto=true
</pre>
</div>
<li><a href="http://arstechnica.com/">Ars Technica</a></li>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
User-agent: *
Disallow: /kurt/
Disallow: /errors/
</pre>
</div>
<li><a href="http://stuffwhitepeoplelike.wordpress.com/">Stuff White People Like</a></li>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
User-agent: IRLbot
Crawl-delay: 3600

User-agent: *
Disallow: /next/

# har har
User-agent: *
Disallow: /activate/

User-agent: *
Disallow: /signup/

User-agent: *
Disallow:
</pre>
</div>
</ol>
<p>As you may notice, the most popular blogs seem to have a singular disregard for this issue with minimal robots.txt files.  As you come down the list, it would seem that even these top blogs realize the importance of limiting what the search engine robots crawl and index.</p>
<p>The impetus for exploring this issue came after noticing an additional complication that results if you put <a href="http://www.seo-scoop.com/2009/05/20/wordpress-blog-elegant-face/">An Elegant Face On Your WordPress Blog</a> by using <a href="http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/">Multiple WordPress Loops</a>.</p>
<p>This could have resulted in many extra web pages that humans would likely not see but search engine spiders would certainly crawl.  Changes were made in the site architecture to avoid this.  To avoid other potential duplicate content problems, the current robots.txt file for this blog appears as follows:</p>
<div style="margin:10px 40px;font-size:0.6em;line-height:0.9em;border:1px solid;padding:10px;width:300px;background:#fffff1;">
<pre>
User-agent: *
Disallow: /wp-login.php
Disallow: /wp-admin/
Disallow: /wp-register.php
Disallow: /wp-login.php?action=lostpassword
Disallow: /index.php?paged
Disallow: /?m
Disallow: /test/
Disallow: /feed/
Disallow: /?feed=comments-rss2
Disallow: /?feed=atom
Disallow: /?s=
Disallow: /index.php?s
Disallow: /wp-trackback
Disallow: /xmlrpc
Disallow: /?feed=rss2&#038;p
</pre>
</div>
<h3>Conclusion</h3>
<p>Getting the robots.txt file correct is one of the easiest ways of increasing the visibility of your blog pages in search engine keyword searches.  Leaving two essentially similar web pages means that the two divide up the &#8216;relevance&#8217; that a single web page would have. That means approaching a 50% reduction in potential keyword ranking.  Perhaps the top blogs can ignore such improvements but most of us should not.   Check out what the spiders may crawl by doing an evaluation of your website with <a href="http://home.snafu.de/tilman/xenulink.html">Xenu Link Sleuth</a>.  We should carefully consider our robots.txt files and make sure they are doing an effective job.  Is yours?<br />
<i></p>
<h3>Update</h3>
<p>Andy Beard added a comment that he has concerns about using the robots.txt file as a solution to the WordPress Duplicate Content problem.  He explained these in a post some time ago called <a href="http://andybeard.eu/1121/seo-linking-gotchas-even-the-pros-make.html">SEO Linking Gotchas Even The Pros Make</a>.  There is much food for thought there and we will follow up in a subsequent post.</i></p>
<div id="st200810011297" class="st-taf"><script src="http://cdn.socialtwist.com/200810011297/script.js"></script><img style="border:0;margin:0;padding:0;" src="http://images.socialtwist.com/200810011297/button.png" onmouseout="hideHoverMap(this)" onmouseover="showHoverMap(this, '200810011297',  'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F06%2Favoiding-wordpress-duplicate-content-problems-with-google%2F', 'Avoid+WordPress+Duplicate+Content+Problems+With+Google')" onclick="cw(this, {id:'200810011297',link: 'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F06%2Favoiding-wordpress-duplicate-content-problems-with-google%2F', title: '+Avoid+WordPress+Duplicate+Content+Problems+With+Google+' })"/></div><hr/>Copyright &copy; 2010 <strong><a href="http://www.bpwrap.com">BPWrap</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact admin@bpwrap.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/">Avoid WordPress Duplicate Content Problems With Google</a></p>
<!-- sphereit end --><span style="margin-bottom:40px; border-bottom:none;"><a class="iconsphere" title="Sphere: Related Content" onclick="return Sphere.Widget.search('http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/')" href="http://www.sphere.com/search?q=sphereit:http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/">Sphere: Related Content</a></span><br/><br/><h3  class="related_post_title">Related Posts</h3><ul class="related_post"><li>May 3, 2010 -- <a href="http://www.bpwrap.com/2010/05/seth-godin-thinks-bigger-is-better/" title="Seth Godin Thinks Bigger Is Better">Seth Godin Thinks Bigger Is Better</a> (2)</li><li>October 16, 2009 -- <a href="http://www.bpwrap.com/2009/10/hyperlinks-get-even-more-respect/" title="Hyperlinks Get Even More Respect">Hyperlinks Get Even More Respect</a> (4)</li><li>July 19, 2009 -- <a href="http://www.bpwrap.com/2009/07/wot-no-google-local-search/" title="Wot No Google Local Search">Wot No Google Local Search</a> (5)</li><li>May 17, 2009 -- <a href="http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/" title="Multiple WordPress Loops For A New Blog Look">Multiple WordPress Loops For A New Blog Look</a> (21)</li><li>September 18, 2008 -- <a href="http://www.bpwrap.com/2008/09/foolish-footers/" title="Foolish Footers">Foolish Footers</a> (11)</li></ul>]]></content:encoded>
			<wfw:commentRss>http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/feed/</wfw:commentRss>
		<slash:comments>23</slash:comments>
		</item>
		<item>
		<title>Arial Font No Longer</title>
		<link>http://www.bpwrap.com/2009/05/arial-font-no-longer/</link>
		<comments>http://www.bpwrap.com/2009/05/arial-font-no-longer/#comments</comments>
		<pubDate>Mon, 25 May 2009 16:44:32 +0000</pubDate>
		<dc:creator>Barry Welford</dc:creator>
				<category><![CDATA[Popular]]></category>
		<category><![CDATA[Web Design]]></category>
		<category><![CDATA[arial]]></category>
		<category><![CDATA[font]]></category>
		<category><![CDATA[line height]]></category>
		<category><![CDATA[size]]></category>
		<category><![CDATA[verdana]]></category>

		<guid isPermaLink="false">http://blog.cre8asite.net/bwelford/?p=999</guid>
		<description><![CDATA[Or perhaps that should read &#8211; Arial Font &#8211; No &#8211; Wider. Our ever sharp-eyed readers may notice a subtle change in the appearance of the SMM online properties, in other words the website and the blogs. It is perhaps a long overdue change and indeed many others may wish to adopt this change so [...]<p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/05/arial-font-no-longer/">Arial Font No Longer</a></p>
]]></description>
			<content:encoded><![CDATA[<!-- sphereit start --><p>Or perhaps that should read &#8211; Arial Font &#8211; No &#8211; Wider. Our ever sharp-eyed readers may notice a subtle change in the appearance of the SMM online properties, in other words the website and the blogs. It is perhaps a long overdue change and indeed many others may wish to adopt this change so we offer a few words of explanation.  Graphic designers will find nothing new here but this is a case where a little knowledge can be helpful.</p>
<p>This is hardly a new topic since back in December 2001 Mark Simonson wrote an excellent article on <a href="http://www.ms-studio.com/articles.html">The Scourge of Arial</a>.</p>
<blockquote><p>Arial is everywhere. If you don’t know what it is, you don’t use a modern personal computer. Arial is a font that is familiar to anyone who uses Microsoft products, whether on a PC or a Mac. It has spread like a virus through the typographic landscape and illustrates the pervasiveness of Microsoft’s influence in the world.</p>
<p>Arial’s ubiquity is not due to its beauty. It’s actually rather homely. Not that homeliness is necessarily a bad thing for a typeface. With typefaces, character and history are just as important. Arial, however, has a rather dubious history and not much character. In fact, Arial is little more than a shameless impostor.</p></blockquote>
<p>He then goes on to lay out the history of how Arial evolved from a font much used in print media, Helvetica.</p>
<blockquote><p>Throughout the latter half of the twentieth century, one of the most popular typefaces in the western world was Helvetica. It was developed by the Haas Foundry of Switzerland in the 1950s. Later, Haas merged with Linotype and Helvetica was heavily promoted. More weights were added and it really began to catch on.</p>
<p>An icon of the Swiss school of typography, Helvetica swept through the design world in the ’60s and became synonymous with modern, progressive, cosmopolitan attitudes. With its friendly, cheerful appearance and clean lines, it was universally embraced for a time by both the corporate and design worlds as a nearly perfect typeface to be used for anything and everything. “When in doubt, use Helvetica” was a common rule.</p></blockquote>
<p>Arial is an example of a font without serifs. Serifs are those small projections that are added to letters in a font like Times Roman, which is a standard font used in MS Word.  They look fine in print but do not work particularly well on a computer screen.  Arial is just one example of a sans-serif font and there are now many others.</p>
<p>If Arial is so popular why would anyone wish to change.  One reason is to stand out from the crowd.  The other reason is that Arial can have a slightly cramped look and others spread out the letters a little more.  This helps readability particularly with smaller letters.</p>
<p>Our choice is Verdana, which is now a very popular font.  In consequence, the CSS style sheets now start off with the following lines of code:</p>
<div style="clear:both;margin:10px 50px;background:#f1f1ff;font-size:90%;padding:10px;">body{<br />
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;	font-family: verdana, arial, sans-serif;<br />
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;font-size: 0.9em;<br &nbsp;&nbsp;&nbsp;/><br />
}</div>
<p>The font-family line means that verdana is used if available.  If not, then arial is used if available.  If not then whatever sans-serif font is available is used.  No surprises there.</p>
<p>The <span style="padding:5px;background:#f1f1ff;">font-size: 0.9em;</span> introduces a very important point.  It could equally have been written as <span style="padding:5px;background:#f1f1ff;">font-size: 90%;</span> If a font-size is not specified, then each browser uses whatever it has designated as the default size for that font.  Here are the default sizes for Arial and Verdana in Firefox and in this case Internet Explorer looks very similar.</p>
<div style="clear:both;margin:10px 88px;">
<img src="http://www.bpwrap.com/images/fontdefault.png" width="267" height="225" alt="font default sizes">
</div>
<p>In short, Verdana is slightly bigger.  To have approximately the same height for Verdana as appears with Arial the font-size must be set at 90%.  Here is how that appears:</p>
<div style="clear:both;margin:10px 88px;">
<img src="http://www.bpwrap.com/images/fontverdana90.png" width="307" height="226" alt="verdana font 90%">
</div>
<p>The letters are spread out a little more, which gives improved visibility.  This is more apparent at smaller character sizes.  This is shown in the following comparison where both are half the size of the previous version.</p>
<div style="clear:both;margin:10px 88px;">
<img src="http://www.bpwrap.com/images/fontverdana45.png" width="305" height="178" alt="verdana font 45%">
</div>
<p>The other spacing consideration is the distance between the lines and this is governed by line-height.  More and more people seem to be moving to line-height set at 140% or 1.4em.  The difference is illustrated in the following image:</p>
<div style="clear:both;margin:10px 88px;">
<img src="http://www.bpwrap.com/images/fontverdanalineheight.png" width="305" height="413" alt="verdana font line height">
</div>
<p>As you will see, setting line-height at 100% reduces the default spacing which is equivalent to a line-height of 120%.  That is now our standard &#8211; Verdana with a line-height of 140%.  We hope you find that more legible and welcome your comments on any thoughts you may have on this subject.</p>
<p><strong>Update</strong><br />
Just for balance, I should mention the very fine article by <strong>Sander Baumann</strong> entitled, <a href="http://www.designworkplan.com/typography-fonts/arial-is-everywhere.htm">Font Series: Arial is everywhere</a>.  It has some very fine examples of large signs in Arial that are really very striking.</p>
<div id="st200810011297" class="st-taf"><script src="http://cdn.socialtwist.com/200810011297/script.js"></script><img style="border:0;margin:0;padding:0;" src="http://images.socialtwist.com/200810011297/button.png" onmouseout="hideHoverMap(this)" onmouseover="showHoverMap(this, '200810011297',  'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F05%2Farial-font-no-longer%2F', 'Arial+Font+No+Longer')" onclick="cw(this, {id:'200810011297',link: 'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F05%2Farial-font-no-longer%2F', title: '+Arial+Font+No+Longer+' })"/></div><hr/>Copyright &copy; 2010 <strong><a href="http://www.bpwrap.com">BPWrap</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact admin@bpwrap.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/05/arial-font-no-longer/">Arial Font No Longer</a></p>
<!-- sphereit end --><span style="margin-bottom:40px; border-bottom:none;"><a class="iconsphere" title="Sphere: Related Content" onclick="return Sphere.Widget.search('http://www.bpwrap.com/2009/05/arial-font-no-longer/')" href="http://www.sphere.com/search?q=sphereit:http://www.bpwrap.com/2009/05/arial-font-no-longer/">Sphere: Related Content</a></span><br/><br/><h3  class="related_post_title">Posts from the Archives</h3><ul class="related_post"><li>July 31, 2009 -- <a href="http://www.bpwrap.com/2009/07/content-may-be-the-seo-king-but-check-the-palace-architecture/" title="Content May Be The SEO King But Check The Palace Architecture">Content May Be The SEO King But Check The Palace Architecture</a> (14)</li><li>May 23, 2005 -- <a href="http://www.bpwrap.com/2005/05/search-engine-optimizations-three-false-truths/" title="Search Engine Optimization&#039;s Three False Truths">Search Engine Optimization&#039;s Three False Truths</a> (0)</li><li>March 15, 2005 -- <a href="http://www.bpwrap.com/2005/03/metoo-in-the-payperclick-war-wont-work/" title="Me-Too In The Pay-Per-Click War Won&#039;t Work">Me-Too In The Pay-Per-Click War Won&#039;t Work</a> (0)</li><li>November 8, 2007 -- <a href="http://www.bpwrap.com/2007/11/world-usability-day-2007/" title="World Usability Day 2007">World Usability Day 2007</a> (0)</li><li>January 3, 2007 -- <a href="http://www.bpwrap.com/2007/01/will-google-start-marketing/" title="Will Google Start Marketing?">Will Google Start Marketing?</a> (1)</li></ul>]]></content:encoded>
			<wfw:commentRss>http://www.bpwrap.com/2009/05/arial-font-no-longer/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Multiple WordPress Loops For A New Blog Look</title>
		<link>http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/</link>
		<comments>http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/#comments</comments>
		<pubDate>Sun, 17 May 2009 23:04:18 +0000</pubDate>
		<dc:creator>Barry Welford</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Popular]]></category>
		<category><![CDATA[Web Design]]></category>
		<category><![CDATA[rewind]]></category>
		<category><![CDATA[template tag]]></category>
		<category><![CDATA[The Loop]]></category>
		<category><![CDATA[WordPress]]></category>

		<guid isPermaLink="false">http://blog.cre8asite.net/bwelford/?p=986</guid>
		<description><![CDATA[The WordPress blogging software is quite rightly immensely popular. The Open-Source community that supports it has developed a whole raft of supporting code and applications that make it a pleasure to use. This blog and the other SMM blogs now have a slightly different look. This is because we are now using a feature of [...]<p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/">Multiple WordPress Loops For A New Blog Look</a></p>
]]></description>
			<content:encoded><![CDATA[<!-- sphereit start --><p>The WordPress blogging software is quite rightly immensely popular.  The Open-Source community that supports it has developed a whole raft of supporting code and applications that make it a pleasure to use.</p>
<p>This blog and the other SMM blogs now have a slightly different look.  This is because we are now using a feature of WordPress, Multiple Loops.  The beauty of WordPress is that the concepts are all very easy to understand.  This particular feature will hopefully give much more reader satisfaction.</p>
<p>There are two elements to the change.  The first is that you will now only see one blog post when you visit.  Although Google tries to be helpful, it can occasionally provide more from a blog that you wish to see.  Now you will see precisely the blog post that is relevant to your keyword query.</p>
<p>The other element is that at the bottom of this single blog post you will now see a list of the most popular blog posts as determined by our visitors.  If you like what you see in the initial post then perhaps some of the other blog posts may tweak your interest.</p>
<p>The changes to the theme required for this new look are very straightforward.  Hopefully this simple explanation will help you apply this same approach to your blog if you so wish.</p>
<h2>The WordPress Loop</h2>
<p>Web pages in a blog are produced by WordPress templates.  These are files with names such as index.php or single.php.  Within these template files, the heart of the WordPress software is code which is called <a href="http://codex.wordpress.org/The_Loop">The Loop</a>.    The WordPress Codex gives a more detailed explanation of <a href="http://codex.wordpress.org/The_Loop_in_Action">The Loop In Action</a>.</p>
<p>The content of all blog posts is held in an associated mySQL database.  When a blog web page is displayed, the web page will show certain information for every blog post entry that fits the particular database query implied by your selection of web page.  For example, if you want the most recent blog post, then the loop picks up only the information on that blog post.  This is illustrated by the following image of a WordPress blog page.</p>
<div style="clear:both;margin:10px 30px;"><img src="http://www.bpwrap.com/images/blog0.jpg" width="450" height="600" alt="WordPress blog for single post" /></div>
<p>When you visit the home page of the blog, you are often seeing a series of blog posts that are created via the file named index.php.  The number of blog posts is selected by the blog webmaster.  For example if it is arranged that the blog would normally show the 3 most recent blog posts, then this is illustrated by the following image.</p>
<div style="clear:both;margin:10px 30px;"><img src="http://www.bpwrap.com/images/blog1.jpg" width="450" height="600" alt="WordPress blog for three posts" /></div>
<p>When Google sends you to a single blog post to fit your keyword query then you are often seeing what a file named single.php produces by applying the loop.  Normally in the index.php and the single.php files, the loop starts with the following code (note that a space has been inserted after < in the following code samples to ensure the code could be printed.):</p>
<div style="clear:both;background:#f1f1f1;font-size:1em;">
<pre>
< !-- start of the loop -->
  < ?php  if (have_posts()) : ?>
</pre>
</div>
<p>The Loop is then all the code before a final < ?php endif; ?> terminates the Loop  (again as mentioned a space has been inserted between < and ? to allow this code to display correctly).</p>
<h2>Multiple Loops</h2>
<p>As mentioned the new feature is that we are now using <a href="http://codex.wordpress.org/The_Loop#Multiple_Loops_Example_1">Multiple Loops</a> in the WordPress software. The following image shows what is being displayed.  The first post is produced by a first Loop.  The list of Most Popular Posts is then produced by a second Loop.</p>
<div style="clear:both;margin:10px 30px;"><img src="http://www.bpwrap.com/images/blog2.jpg" width="450" height="600" alt="WordPress blog for post and popular list" /></div>
<p>In order to have only one post appear in the first loop, we use a <a href="http://codex.wordpress.org/Template_Tags/query_posts">Template Tag, query_posts</a>.  This query_posts tag can modify what the Loop accesses from the database.  In this case we wish to have only one blog post to appear so the first Loop starts off as follows:</p>
<div style="clear:both;background:#f1f1f1;font-size:1em;">
<pre>
< !-- start of first loop -->
  < ?php  query_posts('showposts=1');
      if (have_posts()) : ?>
</pre>
</div>
<p> The only complication here is that as entries are taken from database in The Loop, the database software maintains a count of the data that has been used.  Normally other applications of the Loop will access only new data since past data has already been accessed.  In order to see the posts which are of interest, we use another WordPress Template Tag as the second Loop starts to reset the count.  This tag is the rewind_posts tag.</p>
<p>In order for the list of Most Popular Posts to be displayed, a new category, popular, has been created.  When visitor traffic confirms that a blog post is popular, then it is assigned to this category in addition to whatever category may have been appropriate.</p>
<p>The code to start the second Loop to achieve the effect shown here is as follows:</p>
<div style="clear:both;background:#f1f1f1;font-size:1em;">
<pre>
< !-- start of second loop -->

< ?php  rewind_posts();
    query_posts('category_name=popular&#038;showposts=-1');
    if (have_posts()) : ?>
</pre>
</div>
<h2>Other Benefits Of This Type of Display</h2>
<p>In addition to displaying popular posts for greater visitor satisfaction, this also ensures that such posts have more links pointing to them.  This should ensure greater search engine visibility for these posts.  In all, this format would seem to have only benefits.  It will be interesting to see what reactions blog visitors have to this new format.</p>
<div id="st200810011297" class="st-taf"><script src="http://cdn.socialtwist.com/200810011297/script.js"></script><img style="border:0;margin:0;padding:0;" src="http://images.socialtwist.com/200810011297/button.png" onmouseout="hideHoverMap(this)" onmouseover="showHoverMap(this, '200810011297',  'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F05%2Fmultiple-wordpress-loops-for-a-new-blog-look%2F', 'Multiple+WordPress+Loops+For+A+New+Blog+Look')" onclick="cw(this, {id:'200810011297',link: 'http%3A%2F%2Fwww.bpwrap.com%2F2009%2F05%2Fmultiple-wordpress-loops-for-a-new-blog-look%2F', title: '+Multiple+WordPress+Loops+For+A+New+Blog+Look+' })"/></div><hr/>Copyright &copy; 2010 <strong><a href="http://www.bpwrap.com">BPWrap</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact admin@bpwrap.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/">Multiple WordPress Loops For A New Blog Look</a></p>
<!-- sphereit end --><span style="margin-bottom:40px; border-bottom:none;"><a class="iconsphere" title="Sphere: Related Content" onclick="return Sphere.Widget.search('http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/')" href="http://www.sphere.com/search?q=sphereit:http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/">Sphere: Related Content</a></span><br/><br/><h3  class="related_post_title">Related Posts</h3><ul class="related_post"><li>June 3, 2009 -- <a href="http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/" title="Avoid WordPress Duplicate Content Problems With Google">Avoid WordPress Duplicate Content Problems With Google</a> (23)</li><li>March 9, 2008 -- <a href="http://www.bpwrap.com/2008/03/write-seo-titles-for-high-rankings/" title="Headlines Are For Humans, Titles Are For Robots">Headlines Are For Humans, Titles Are For Robots</a> (69)</li><li>February 19, 2008 -- <a href="http://www.bpwrap.com/2008/02/how-wordpress-blogs-are-hacked/" title="How Wordpress Blogs Are Hacked">How Wordpress Blogs Are Hacked</a> (17)</li><li>February 7, 2008 -- <a href="http://www.bpwrap.com/2008/02/guarding-your-wordpress-blog/" title="Guarding Your Wordpress Blog">Guarding Your Wordpress Blog</a> (18)</li><li>February 5, 2008 -- <a href="http://www.bpwrap.com/2008/02/wordpress-blog-hacked/" title="Wordpress Blog Hacked">Wordpress Blog Hacked</a> (46)</li></ul>]]></content:encoded>
			<wfw:commentRss>http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/feed/</wfw:commentRss>
		<slash:comments>21</slash:comments>
		</item>
		<item>
		<title>How WordPress Blogs Are Hacked</title>
		<link>http://www.bpwrap.com/2008/02/how-wordpress-blogs-are-hacked/</link>
		<comments>http://www.bpwrap.com/2008/02/how-wordpress-blogs-are-hacked/#comments</comments>
		<pubDate>Wed, 20 Feb 2008 01:27:15 +0000</pubDate>
		<dc:creator>Barry Welford</dc:creator>
				<category><![CDATA[Blogs]]></category>
		<category><![CDATA[Popular]]></category>
		<category><![CDATA[hacker]]></category>
		<category><![CDATA[security]]></category>
		<category><![CDATA[upgrade]]></category>
		<category><![CDATA[vigilance]]></category>
		<category><![CDATA[WordPress]]></category>

		<guid isPermaLink="false">http://blog.cre8asite.net/bwelford/2008/02/how-wordpress-blogs-are-hacked/</guid>
		<description><![CDATA[The previous two articles in this series have suggested ways to combat the ever-increasing hack attacks that WordPress blogs are receiving. In this final article, we will discuss some real-life examples and what can be learned from them. As a disclaimer, it should be noted that some hackers are very skilled and are continually improving [...]<p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2008/02/how-wordpress-blogs-are-hacked/">How WordPress Blogs Are Hacked</a></p>
]]></description>
			<content:encoded><![CDATA[<!-- sphereit start --><p>The previous two articles in this series have suggested ways to combat the ever-increasing hack attacks that WordPress blogs are receiving.  In this final article, we will discuss some real-life examples and what can be learned from them.  As a disclaimer, it should be noted that some hackers are very skilled and are continually improving their methods.  These are anecdotes from the past and the future will undoubtedly be very different.</p>
<h2>Typical Hacking Exploits</h2>
<div style="float:right;clear:right;width:336px;margin:0 0 0 10px;">
<script type="text/javascript"><!--
google_ad_client = "pub-0853354960558559";
/* BPW_above_the_fold_336x280 */
google_ad_slot = "7449271641";
google_ad_width = 336;
google_ad_height = 280;
//-->
</script><br />
<script type="text/javascript"
src="http://pagead2.googlesyndication.com/pagead/show_ads.js">
</script>
</div>
<p>For specific details of typical hacking exploits, the following accounts are particularly good:</p>
<ul>
<li><a href="http://www.earnersblog.com/wordpress-hacked/">Is your WordPress Installation Compromised? Al Gore&#8217;s is</a> &#8211; by <strong>Stuart McKeown</strong>  (12. Nov 2007)</li>
<li><a href="http://chaoskaizer.wordpress.com/2007/12/03/matt-heaton-bluehost-and-hostmoster-ceo-wordpress-blog-got-hacked-by-mick-jagger/">Matt Heaton (Bluehost and Hostmoster CEO) WordPress blog Hacked by Mick Jagger from Moscow</a> &#8211; by <strong>Noah</strong> (3 Dec 2007)</li>
<li><a href="http://blog.kakkoi.net/wordpress/blackhat-seo-spammer-target-high-pr-wordpress-blog/">Blackhat SEO Spammer targeting High PR WordPress Blog</a> &#8211; by <strong>Noah</strong> (14 Feb 2008)</li>
</ul>
<p>The methods used in these cases are probably all the work of one hacker, by nickname <strong>goro</strong>, who may well have been one of the commenters on the first of these three posts.</p>
<p>We will not go into the specific details here (since they will undoubtedly evolve), but rather discuss the bigger picture associated with these exploits.  In the case of the hacking done on the SMM blogs, there were some clever refinements.  The mechanism inserted on the domain generated hundreds of random, unique blog post web pages, which included links to online pharmaceutical web pages.  Since the websites were well ranked in Google, many of these hundreds of blog posts were served to the search engine spiders as they made their visits.  After a period of hours, the mechanism then stopped.  This may have been to avoid a huge spike in traffic, which would have been more easily detected.</p>
<h2>How Google May Have Rewarded Their Efforts</h2>
<p>During the last two or three months, Google has been giving much more rapid visibility and higher ranking to blog posts in its regular web search.  In the latter part of January, blog posts appropriate for particular keyword searches would appear within a small number of hours in the regular web search. The algorithm may well be using the RSS news feeds associated with the blogs. This gave particular prominence to the blog posts generated by the hacking mechanism.  They would almost always appear among the top five positions on a search for particular online pharmaceuticals and often in the first position. Presumably this gave a significant economic advantage to the hacker.</p>
<p>Although the hacking mechanism was removed within 36 hours, the false and now non-existent blog posts still persist in the Google index over 3 weeks later.  In some cases the cached versions of the false blog posts are still available.</p>
<p>An interesting parallel development during this time is that Google Blogsearch now has a delay of a few days in displaying new blog posts. Until recently such a new blog post might have appeared within an hour or two, since it was triggered by the pinging of the RSS news feed.  Whether this is a reaction to a large volume of blog posts generated by hackers one can only surmise.</p>
<h2>How To Repair The Damage</h2>
<p>Hopefully this series of articles has sensitized you to the dangers of hacking.  This should prompt you to maintain a constant vigil so that any hacker intrusions will be spotted rapidly.  You should also as <strong>Wayne Liew</strong> suggests regard <a href="http://www.wayneliew.com/wordpress-upgrade-update-benefit-tips/">WordPress Upgrades as a Must</a>.  The continuing improvement in security may not serve to keep out hackers but at least it may encourage them to attack an easier prey.</p>
<p>If your WordPress blog is hacked, it can be quite a challenge to find out what has been changed. Sometimes the hacker may have modified files deep within folders that are not normally touched in upgrading, such as the images folder or the wp-content folder.  Checking the size in bytes of particular files compared with versions in the most recent backup will reveal suspicious differences.  Sometimes the .htaccess file may have been modified to create additional and inappropriate mechanisms.  In such cases, you&#8217;ve got to make sure that you eliminate all such additions to the website.  If you have backed-up a clean version of the website recently, it might be better to take down the website and replace it with a clean version.</p>
<p><strong>Related</strong>:<br />
<a href="http://www.bpwrap.com/2008/02/blogs-take-center-stage-for-marketers-and-for-google/">Blogs Take Center Stage For Marketers And For Google</a><br />
<a href="http://blog.kakkoi.net/wordpress/how-to-removed-wordpress-net-in-spam-injection-infected-by-mike-jagger-goro-class-mailphp/">How to Remove WordPress.net.in Spam Injection</a></p>
<p><strong>Previous articles in this series</strong>:<br />
<a href="http://www.bpwrap.com/2008/02/wordpress-blog-hacked/">WordPress Blog Hacked</a><br />
<a href="http://www.bpwrap.com/2008/02/guarding-your-wordpress-blog/">Guarding Your WordPress Blog</a></p>
<div id="st200810011297" class="st-taf"><script src="http://cdn.socialtwist.com/200810011297/script.js"></script><img style="border:0;margin:0;padding:0;" src="http://images.socialtwist.com/200810011297/button.png" onmouseout="hideHoverMap(this)" onmouseover="showHoverMap(this, '200810011297',  'http%3A%2F%2Fwww.bpwrap.com%2F2008%2F02%2Fhow-wordpress-blogs-are-hacked%2F', 'How+WordPress+Blogs+Are+Hacked')" onclick="cw(this, {id:'200810011297',link: 'http%3A%2F%2Fwww.bpwrap.com%2F2008%2F02%2Fhow-wordpress-blogs-are-hacked%2F', title: '+How+WordPress+Blogs+Are+Hacked+' })"/></div><hr/>Copyright &copy; 2010 <strong><a href="http://www.bpwrap.com">BPWrap</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact admin@bpwrap.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2008/02/how-wordpress-blogs-are-hacked/">How WordPress Blogs Are Hacked</a></p>
<!-- sphereit end --><span style="margin-bottom:40px; border-bottom:none;"><a class="iconsphere" title="Sphere: Related Content" onclick="return Sphere.Widget.search('http://www.bpwrap.com/2008/02/how-wordpress-blogs-are-hacked/')" href="http://www.sphere.com/search?q=sphereit:http://www.bpwrap.com/2008/02/how-wordpress-blogs-are-hacked/">Sphere: Related Content</a></span><br/><br/><h3  class="related_post_title">Related Posts</h3><ul class="related_post"><li>February 7, 2008 -- <a href="http://www.bpwrap.com/2008/02/guarding-your-wordpress-blog/" title="Guarding Your Wordpress Blog">Guarding Your Wordpress Blog</a> (18)</li><li>February 5, 2008 -- <a href="http://www.bpwrap.com/2008/02/wordpress-blog-hacked/" title="Wordpress Blog Hacked">Wordpress Blog Hacked</a> (46)</li><li>June 17, 2009 -- <a href="http://www.bpwrap.com/2009/06/opera-unite-for-your-own-web-hosting/" title="Opera Unite For Your Own Web Hosting">Opera Unite For Your Own Web Hosting</a> (11)</li><li>June 3, 2009 -- <a href="http://www.bpwrap.com/2009/06/avoiding-wordpress-duplicate-content-problems-with-google/" title="Avoid WordPress Duplicate Content Problems With Google">Avoid WordPress Duplicate Content Problems With Google</a> (23)</li><li>May 17, 2009 -- <a href="http://www.bpwrap.com/2009/05/multiple-wordpress-loops-for-a-new-blog-look/" title="Multiple WordPress Loops For A New Blog Look">Multiple WordPress Loops For A New Blog Look</a> (21)</li></ul>]]></content:encoded>
			<wfw:commentRss>http://www.bpwrap.com/2008/02/how-wordpress-blogs-are-hacked/feed/</wfw:commentRss>
		<slash:comments>17</slash:comments>
		</item>
		<item>
		<title>Favicons &#8211; How To Make Them Work For You</title>
		<link>http://www.bpwrap.com/2006/10/favicons-how-to-make-them-work-for-you/</link>
		<comments>http://www.bpwrap.com/2006/10/favicons-how-to-make-them-work-for-you/#comments</comments>
		<pubDate>Sun, 29 Oct 2006 15:04:58 +0000</pubDate>
		<dc:creator>Barry Welford</dc:creator>
				<category><![CDATA[Marketing]]></category>
		<category><![CDATA[Popular]]></category>
		<category><![CDATA[Web Design]]></category>
		<category><![CDATA[favicon]]></category>
		<category><![CDATA[Firefox]]></category>
		<category><![CDATA[icon]]></category>
		<category><![CDATA[IE]]></category>
		<category><![CDATA[Internet Explorer]]></category>

		<guid isPermaLink="false">http://blog.cre8asite.net/bwelford/2006/10/favicons-how-to-make-them-work-for-you/</guid>
		<description><![CDATA[A previous post, The Favicons Are Coming, provides a short introduction to favicons. It was written in the fond hope that Internet Explorer Version 7, in trying to emulate Mozilla Firefox as it so often does, would perhaps make favicons a robust feature of Internet surfing. In Firefox, you&#8217;ll find these small 16 x 16 [...]<p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2006/10/favicons-how-to-make-them-work-for-you/">Favicons &#8211; How To Make Them Work For You</a></p>
]]></description>
			<content:encoded><![CDATA[<!-- sphereit start --><p>A previous post, <a href="http://www.bpwrap.com/2006/09/the-favicons-are-coming/">The Favicons Are Coming</a>, provides a short introduction to favicons.  It was written in the fond hope that Internet Explorer Version 7, in trying to emulate Mozilla Firefox as it so often does, would perhaps make favicons a robust feature of Internet surfing.  In Firefox, you&#8217;ll find these small 16 x 16 pixels icon images for a given web page appear in several places, namely the address window, on a tab and also in your Favorites or Bookmarks list.  They provide instant recognition of a web site and for the ones I know well are all I display in my Bookmarks toolbar in my default Firefox browser.  Here for example are the ones associated with this and related websites: <img src="http://www.bpwrap.com/favicon.png" alt="BPWrap" width="16" height="16" /> <a href="http://www.smmbc.ca/"><img src="http://www.smmbc.ca/favicon.png" alt="Strategic Marketing Montreal" width="16" height="16" /></a> <a href="http://www.otherbb.com/"><img src="http://www.smmbc.ca/img/andyfavicon.png" alt="The Other Bloke's Blog" width="16" height="16" /></a> <a href="http://www.staygolinks.com/"><img src="http://www.staygolinks.com/favicon.png" alt="StayGoLinks" width="16" height="16" /></a></p>
<div style="float:right;clear:right;width:336px;margin:0 0 0 10px;">
<script type="text/javascript"><!--
google_ad_client = "pub-0853354960558559";
/* BPW_above_the_fold_336x280 */
google_ad_slot = "7449271641";
google_ad_width = 336;
google_ad_height = 280;
//-->
</script><br />
<script type="text/javascript"
src="http://pagead2.googlesyndication.com/pagead/show_ads.js">
</script>
</div>
<p>Well now Internet Explorer Version 7 is officially released.  After some thorough testing, the IE v.7 way of handling favicons seems no better than previous versions. You can get the sense of frustration of many others in a forum entry, <a href="http://www.killersites.com/mvnforum/mvnforum/printthread?thread=3329&#038;printall=yes">Keeping The Favicon in IE Favorites</a>, which started in November 2005 and still has not produced an answer. In The Favicons Are Coming, we paid tribute to Microsoft as the inventor of the .ico favicon file and set out the advice accordingly.  It was largely correct, but here we re-present the information in a more practical fashion.  The first part will detail how to make favicons work in Firefox.  The second part (optional and provided only for the really keen) will detail how you can try to make them work somewhat in IE and other browsers.</p>
<h3>Making Favicons Work In Mozilla Firefox</h3>
<p>1.  Make your 16 x 16 pixel icon image as a .png or a .jpg file.  For example you might call it myfavicon.jpg  Load it into a suitable place on your domain, say http://www.mysite.com/images/myfavicon.jpg<br />
2.  Add the following code in the HEAD section of any web page for which you wish the favicon to appear.<br />
<code style="font-size:x-small;">&lt;link rel="shortcut icon"<br />
href="http://www.mysite.com/images/myfavicon.jpg" type="image/x-icon"&gt;</code><br />
3.  The favicon image will appear correctly in all the places it should.</p>
<h3>Trying To Make Favicons Work In Internet Explorer</h3>
<p>1.  The IE way involves an icon image file with the extension .ico   A .ico file is not just a .bmp file renamed with the extension changed to .ico.  It is a much more complex file that holds two icon images, a 32 x 32 pixel file and a 16 x 16 pixel file.  You can if you wish only include the 16 x 16 pixel image.  A somewhat old article, <a href="http://www.december14.net/ways/rest/favicon.html">Making a Favicon</a>, describes the process well.  You normally can start with a .bmp file or .jpg file and convert it to a .ico file.  The <a href="http://www.axiomx.com/PixelToolbox/index.htm">AxiomX Pixel Toolbox 1.1</a> is free software that will handle this conversion, usually without too much trouble.<br />
2.  You then load up the .ico image file to the root folder of the website, i.e. at http://www.mysite.com/<br />
3.  The favicon image will sometimes appear the very first time you visit a website and sometimes will persist in your Favorites list.  More often it disappears after a first appearance and is replaced by the generic IE favicon.  Since Firefox will correctly handle such a .ico file, this is a worthwhile thing to do if you have the time.  It may also be helpful for other browsers.</p>
<h3>How Well Will Your Favicons Work?</h3>
<div class="divr">
<div class="divrin">Always in Firefox: rarely in Internet Explorer</div>
</div>
<p>However much effort you put into creating and loading favicons, other browsers than Firefox may still handle favicons unreliably.  For example, Opera will sometimes display them and sometimes not.  It will also use the favicon.ico image in the root file if it exists for some purposes and will use the favicon identified by the HEAD link tag for other purposes.  Other websites may also use favicons in an unpredictable way too.  Normally Bloglines should show favicons correctly but even when viewed in Firefox will not always do so.</p>
<p>The favicon is such a useful identifier for those who see it that the effort is certainly worthwhile.  Hopefully as time goes on, more and more browsers will come in line with the Firefox approach to favicons.</p>
<div id="st200810011297" class="st-taf"><script src="http://cdn.socialtwist.com/200810011297/script.js"></script><img style="border:0;margin:0;padding:0;" src="http://images.socialtwist.com/200810011297/button.png" onmouseout="hideHoverMap(this)" onmouseover="showHoverMap(this, '200810011297',  'http%3A%2F%2Fwww.bpwrap.com%2F2006%2F10%2Ffavicons-how-to-make-them-work-for-you%2F', 'Favicons+%26%238211%3B+How+To+Make+Them+Work+For+You')" onclick="cw(this, {id:'200810011297',link: 'http%3A%2F%2Fwww.bpwrap.com%2F2006%2F10%2Ffavicons-how-to-make-them-work-for-you%2F', title: '+Favicons+%26%238211%3B+How+To+Make+Them+Work+For+You+' })"/></div><hr/>Copyright &copy; 2010 <strong><a href="http://www.bpwrap.com">BPWrap</a></strong>. This Feed is for personal non-commercial use only. If you are not reading this material in your news aggregator, the site you are looking at is guilty of copyright infringement. Please contact admin@bpwrap.com so we can take legal action immediately.<br/><span style="float: right;font-size: 7pt"><a href="http://blog.taragana.com/index.php/archive/wordpress-plugins-provided-by-taraganacom/">Plugin</a> by <a href="http://www.taragana.com/">Taragana</a></span><p>Post from: <a href="http://www.bpwrap.com">BPWrap</a><br/><br/><a href="http://www.bpwrap.com/2006/10/favicons-how-to-make-them-work-for-you/">Favicons &#8211; How To Make Them Work For You</a></p>
<!-- sphereit end --><span style="margin-bottom:40px; border-bottom:none;"><a class="iconsphere" title="Sphere: Related Content" onclick="return Sphere.Widget.search('http://www.bpwrap.com/2006/10/favicons-how-to-make-them-work-for-you/')" href="http://www.sphere.com/search?q=sphereit:http://www.bpwrap.com/2006/10/favicons-how-to-make-them-work-for-you/">Sphere: Related Content</a></span><br/><br/><h3  class="related_post_title">Related Posts</h3><ul class="related_post"><li>July 25, 2007 -- <a href="http://www.bpwrap.com/2007/07/internet-explorer-dilemmas/" title="Internet Explorer Dilemmas">Internet Explorer Dilemmas</a> (5)</li><li>January 20, 2009 -- <a href="http://www.bpwrap.com/2009/01/a-fluid-google-favicon/" title="A Fluid Google Favicon">A Fluid Google Favicon</a> (13)</li><li>June 28, 2008 -- <a href="http://www.bpwrap.com/2008/06/ultimate-simplicity-for-firefox-3-full-screen/" title="Ultimate Simplicity For Firefox 3 Full Screen">Ultimate Simplicity For Firefox 3 Full Screen</a> (11)</li></ul>]]></content:encoded>
			<wfw:commentRss>http://www.bpwrap.com/2006/10/favicons-how-to-make-them-work-for-you/feed/</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
	</channel>
</rss>
