<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:creativeCommons="http://backend.userland.com/creativeCommonsRssModule"
	>
<channel>
	<title>Comments on: Strange corners of the Web</title>
	<atom:link href="http://www.robweir.com/blog/2009/02/strange-corners-of-web.html/feed" rel="self" type="application/rss+xml" />
	<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=strange-corners-of-web</link>
	<description>Thinking the unthinkable, pondering the imponderable, effing the ineffable and scruting the inscrutable</description>
	<lastBuildDate>Tue, 07 Feb 2012 11:20:47 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: Dave Leigh</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2193</link>
		<dc:creator>Dave Leigh</dc:creator>
		<pubDate>Tue, 03 Mar 2009 18:18:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2193</guid>
		<description>Hi, Rob,&lt;br/&gt;&lt;br/&gt;I used to maintain the particular network that generated those &quot;Sky King&quot; messages. In fact, I helped to build two of the stations. Your rumor mill is not entirely inaccurate.</description>
		<content:encoded><![CDATA[<p>Hi, Rob,</p>
<p>I used to maintain the particular network that generated those &#8220;Sky King&#8221; messages. In fact, I helped to build two of the stations. Your rumor mill is not entirely inaccurate.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Răzvan Sandu</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2170</link>
		<dc:creator>Răzvan Sandu</dc:creator>
		<pubDate>Sun, 22 Feb 2009 16:28:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2170</guid>
		<description>Here&#039;s an actual example on which I fell upon - not about ODF, but about Linux:&lt;br/&gt;&lt;br/&gt;http://www.howtogeek.com/howto/ubuntu/keyboard-shortcuts-for-bash-command-shell-for-ubuntu-debian-suse-redhat-linux-etc/ro/&lt;br/&gt;&lt;br/&gt;&quot;Shell&quot; is translated in Romanian as &quot;raft&quot;, which means is not the CLI, but the piece of furniture in a store, holding and exposing the goods for sale.  ;-)&lt;br/&gt;&lt;br/&gt;The test is almost unreadable in Romanian: an attempt to do an automated translation of a HOWTO?&lt;br/&gt;&lt;br/&gt;&lt;br/&gt;Regards,&lt;br/&gt;Răzvan</description>
		<content:encoded><![CDATA[<p>Here&#8217;s an actual example on which I fell upon &#8211; not about ODF, but about Linux:</p>
<p><a href="http://www.howtogeek.com/howto/ubuntu/keyboard-shortcuts-for-bash-command-shell-for-ubuntu-debian-suse-redhat-linux-etc/ro/" rel="nofollow">http://www.howtogeek.com/howto/ubuntu/keyboard-shortcuts-for-bash-command-shell-for-ubuntu-debian-suse-redhat-linux-etc/ro/</a></p>
<p>&#8220;Shell&#8221; is translated in Romanian as &#8220;raft&#8221;, which means is not the CLI, but the piece of furniture in a store, holding and exposing the goods for sale.  ;-)</p>
<p>The test is almost unreadable in Romanian: an attempt to do an automated translation of a HOWTO?</p>
<p>Regards,<br />Răzvan</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Gerard</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2169</link>
		<dc:creator>David Gerard</dc:creator>
		<pubDate>Sun, 22 Feb 2009 13:49:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2169</guid>
		<description>It&#039;s just blog spam. Word salad (generated by a Markov chain) is too obvious, so they grab any old text and run it through a translator and back again. The result is &lt;i&gt;almost&lt;/i&gt; sensible, but, importantly, isn&#039;t word-for-word identical to the original, so doesn&#039;t get penalised in Google.&lt;br/&gt;&lt;br/&gt;Appropriate response: report as spam blog.</description>
		<content:encoded><![CDATA[<p>It&#8217;s just blog spam. Word salad (generated by a Markov chain) is too obvious, so they grab any old text and run it through a translator and back again. The result is <i>almost</i> sensible, but, importantly, isn&#8217;t word-for-word identical to the original, so doesn&#8217;t get penalised in Google.</p>
<p>Appropriate response: report as spam blog.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Anonymous</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2168</link>
		<dc:creator>Anonymous</dc:creator>
		<pubDate>Sun, 22 Feb 2009 09:04:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2168</guid>
		<description>I think its #4, intentional pollution.&lt;br/&gt;Porn purveyors suffered this fate.  Probably religiously-motivated folk polluted the Internet with phony sites so that Googling for an on-topic site is damned difficult.  Try particularly searching for a particular niche, e.g. tentacle porn, and you&#039;ll have to wade through a few hundred pollution sites deliberately designed as interference before you find the good stuff.</description>
		<content:encoded><![CDATA[<p>I think its #4, intentional pollution.<br />Porn purveyors suffered this fate.  Probably religiously-motivated folk polluted the Internet with phony sites so that Googling for an on-topic site is damned difficult.  Try particularly searching for a particular niche, e.g. tentacle porn, and you&#8217;ll have to wade through a few hundred pollution sites deliberately designed as interference before you find the good stuff.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2167</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Sat, 21 Feb 2009 23:08:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2167</guid>
		<description>That is weird.  But I at least can understand the motivation.  Whether Dadaist art, the poetry of Gertrude Stein, the music of John Cage, or the teachings of the Church of the SubGenius, there are times when the unexpected is expected, the inappropriate is de rigueur.  &lt;br/&gt;&lt;br/&gt;But these Microsoft/ODF web sites are not nearly bad enough to be interesting from an artistic standpoint.  They to make them truly bad.  &lt;br/&gt;&lt;br/&gt;And I&#039;m not convinced it is a commercial motivation either.  If you want to get links to sell illegal OEM copies of Vista, then you load up a page with words like &quot;Brittany Spears&quot; or &quot;free clip art&quot;.  If your most distinguishing key words are Microsoft and ODF, then you are not going to get many clicks.  So I don&#039;t think the OEM software link explains this.  &lt;br/&gt;&lt;br/&gt;Similarly, when the police come across a body shot in the back, and notice his wallet is gone, they don&#039;t automatically assume that means he was shot during a robbery.  It could just as well be premeditated murder by someone who took the wallet to make it look like a robbery.&lt;br/&gt;&lt;br/&gt;So I&#039;m not sure we&#039;re any closer to a solution.  Is it: 1. Search engine optimization, 2.Art, 3. selling OEM software to people who happen to be searching for Microsoft and ODF, 4. an attempt to make searches for these keywords less effective by decreasing the signal/noise ratio, or 5. All of the above ?</description>
		<content:encoded><![CDATA[<p>That is weird.  But I at least can understand the motivation.  Whether Dadaist art, the poetry of Gertrude Stein, the music of John Cage, or the teachings of the Church of the SubGenius, there are times when the unexpected is expected, the inappropriate is de rigueur.  </p>
<p>But these Microsoft/ODF web sites are not nearly bad enough to be interesting from an artistic standpoint.  They to make them truly bad.  </p>
<p>And I&#8217;m not convinced it is a commercial motivation either.  If you want to get links to sell illegal OEM copies of Vista, then you load up a page with words like &#8220;Brittany Spears&#8221; or &#8220;free clip art&#8221;.  If your most distinguishing key words are Microsoft and ODF, then you are not going to get many clicks.  So I don&#8217;t think the OEM software link explains this.  </p>
<p>Similarly, when the police come across a body shot in the back, and notice his wallet is gone, they don&#8217;t automatically assume that means he was shot during a robbery.  It could just as well be premeditated murder by someone who took the wallet to make it look like a robbery.</p>
<p>So I&#8217;m not sure we&#8217;re any closer to a solution.  Is it: 1. Search engine optimization, 2.Art, 3. selling OEM software to people who happen to be searching for Microsoft and ODF, 4. an attempt to make searches for these keywords less effective by decreasing the signal/noise ratio, or 5. All of the above ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Christian Walde</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2166</link>
		<dc:creator>Christian Walde</dc:creator>
		<pubDate>Sat, 21 Feb 2009 22:30:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2166</guid>
		<description>If you&#039;re looking for actual noise on the internet, here&#039;s a fine example: http://ricedoutyugo.com/</description>
		<content:encoded><![CDATA[<p>If you&#8217;re looking for actual noise on the internet, here&#8217;s a fine example: <a href="http://ricedoutyugo.com/" rel="nofollow">http://ricedoutyugo.com/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Răzvan Sandu</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2165</link>
		<dc:creator>Răzvan Sandu</dc:creator>
		<pubDate>Sat, 21 Feb 2009 22:06:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2165</guid>
		<description>Hello Rob,&lt;br/&gt;&lt;br/&gt;Happy to hear that someone did the same shortwave radio listening there, on the other coast of the Atlantic Ocean. I personally did it for many years here, in Romania... (I still do, from time to time)  ;-)&lt;br/&gt;&lt;br/&gt;As a person interested in the ODF/OOXML battle here, in Romania, I&#039;ve noticed the same phenomenon here, about the Web materials written IN ROMANIAN on the OOXML matter.&lt;br/&gt;&lt;br/&gt;Namely, the blogs of some proeminent Romanian free software advocates, that blogged frequently about ODF, OOXML and their situation here, got superceded (in Google rankings) by this type of &quot;automatic translations&quot; you&#039;ve mentioned.&lt;br/&gt;&lt;br/&gt;It seems to me like an effort to &quot;bury&quot; those blogs in a lot of &quot;noise&quot;, trying to make them less visible in search engines...&lt;br/&gt;&lt;br/&gt;Regards,&lt;br/&gt;Răzvan</description>
		<content:encoded><![CDATA[<p>Hello Rob,</p>
<p>Happy to hear that someone did the same shortwave radio listening there, on the other coast of the Atlantic Ocean. I personally did it for many years here, in Romania&#8230; (I still do, from time to time)  ;-)</p>
<p>As a person interested in the ODF/OOXML battle here, in Romania, I&#8217;ve noticed the same phenomenon here, about the Web materials written IN ROMANIAN on the OOXML matter.</p>
<p>Namely, the blogs of some proeminent Romanian free software advocates, that blogged frequently about ODF, OOXML and their situation here, got superceded (in Google rankings) by this type of &#8220;automatic translations&#8221; you&#8217;ve mentioned.</p>
<p>It seems to me like an effort to &#8220;bury&#8221; those blogs in a lot of &#8220;noise&#8221;, trying to make them less visible in search engines&#8230;</p>
<p>Regards,<br />Răzvan</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alex</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2164</link>
		<dc:creator>Alex</dc:creator>
		<pubDate>Sat, 21 Feb 2009 21:37:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2164</guid>
		<description>If meaningfully mutilated articles about Open Standards adoption hidden in the dark corners of the blogosphere are not yet being used as modern day numbers stations someone is going to have to start. Because that&#039;s so much more exciting than some idiotic SEO tactic to sell pirated software.</description>
		<content:encoded><![CDATA[<p>If meaningfully mutilated articles about Open Standards adoption hidden in the dark corners of the blogosphere are not yet being used as modern day numbers stations someone is going to have to start. Because that&#8217;s so much more exciting than some idiotic SEO tactic to sell pirated software.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rob</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2163</link>
		<dc:creator>Rob</dc:creator>
		<pubDate>Sat, 21 Feb 2009 19:54:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2163</guid>
		<description>Hmmm... I think I see the technique.  The syntax of new articles is identical to the original text.  But it appears that they are have a thesaurus and are doing a random substitution of synonyms.  No attention is given to the grammatical context.  So &quot;handle&quot; used as a verb is replaced by &quot;knob&quot;.&lt;br/&gt;&lt;br/&gt;The net effect of this is that the article is not immediately traceable to the source document.  Also, terms that are not in the dictionary, such as &#039;ODF&#039;, &#039;Microsoft&#039; and proper names are not changed.  Maybe they are trying to get hits from people searching on these proper names?&lt;br/&gt;&lt;br/&gt;The first article I mention doesn&#039;t seem to have any outgoing links, however.  Maybe the idea is to set it up, wait a few weeks for the PageRank of the page to go up, then auction off outgoing links?  But that will only work if the blog post has high PageRank links to it.  And right now I don&#039;t see anyone linking to these pseudo-posts.  Uh... except me...  Doh!</description>
		<content:encoded><![CDATA[<p>Hmmm&#8230; I think I see the technique.  The syntax of new articles is identical to the original text.  But it appears that they are have a thesaurus and are doing a random substitution of synonyms.  No attention is given to the grammatical context.  So &#8220;handle&#8221; used as a verb is replaced by &#8220;knob&#8221;.</p>
<p>The net effect of this is that the article is not immediately traceable to the source document.  Also, terms that are not in the dictionary, such as &#8216;ODF&#8217;, &#8216;Microsoft&#8217; and proper names are not changed.  Maybe they are trying to get hits from people searching on these proper names?</p>
<p>The first article I mention doesn&#8217;t seem to have any outgoing links, however.  Maybe the idea is to set it up, wait a few weeks for the PageRank of the page to go up, then auction off outgoing links?  But that will only work if the blog post has high PageRank links to it.  And right now I don&#8217;t see anyone linking to these pseudo-posts.  Uh&#8230; except me&#8230;  Doh!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Marbux</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2162</link>
		<dc:creator>Marbux</dc:creator>
		<pubDate>Sat, 21 Feb 2009 19:39:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2162</guid>
		<description>I&#039;ve been getting a ton of these returned by Google Blogs Alert set with terms like &quot;ODF&quot; and &quot;OOXML&quot;. I&#039;d guess that it&#039;s only been over the last couple of weeks, if that. &lt;br/&gt;&lt;br/&gt;One last night was a site with Corel&#039;s name up in lights that had Amazon links to buy Corel programs. One product per page. The pages on the site each looked like they had begun as Corel pages and maybe run through a translator program to another human language and then back to English. &lt;br/&gt;&lt;br/&gt;All of the others I have seen that I bothered to check have one page of the kind you describe but the other pages on the same sites looked like link farms intended to up the linked sites in web search engine rankings. I didn&#039;t bother to follow any of the links to see where they led.&lt;br/&gt;&lt;br/&gt;But this kind of thing is outnumbering legitimate blog hits returned by Google Blogs Alert for me, roughly 2-1.</description>
		<content:encoded><![CDATA[<p>I&#8217;ve been getting a ton of these returned by Google Blogs Alert set with terms like &#8220;ODF&#8221; and &#8220;OOXML&#8221;. I&#8217;d guess that it&#8217;s only been over the last couple of weeks, if that. </p>
<p>One last night was a site with Corel&#8217;s name up in lights that had Amazon links to buy Corel programs. One product per page. The pages on the site each looked like they had begun as Corel pages and maybe run through a translator program to another human language and then back to English. </p>
<p>All of the others I have seen that I bothered to check have one page of the kind you describe but the other pages on the same sites looked like link farms intended to up the linked sites in web search engine rankings. I didn&#8217;t bother to follow any of the links to see where they led.</p>
<p>But this kind of thing is outnumbering legitimate blog hits returned by Google Blogs Alert for me, roughly 2-1.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alex</title>
		<link>http://www.robweir.com/blog/2009/02/strange-corners-of-web.html#comment-2161</link>
		<dc:creator>Alex</dc:creator>
		<pubDate>Sat, 21 Feb 2009 18:41:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.robweir.com/blog/2009/02/strange-corners-of-the-web.html#comment-2161</guid>
		<description>Rob hi&lt;br/&gt;&lt;br/&gt;Isn&#039;t this just an SEO tactic? At the end of these pages there is a telltale link, so at the end of the &quot;knob documents in contrary technology format&quot; one (hah) there is a link to &quot;buy openpim&quot;: the target site has &quot;oem downloads&quot; and Vista Ultimate is only $99 there. Hmmmmm.&lt;br/&gt;&lt;br/&gt;It does look like the pages are auto-generated; maybe some software selects the text based on known terms which are traffic drivers (for a time &#039;Microsoft&#039; and &#039;ODF&#039; together would have been hot tickets here).&lt;br/&gt;&lt;br/&gt;There might be an idea that these pages will directly drive traffic to the target sites (in which case, they need to improve their software); but in any case the inbound links should improve the rank of the targetted pages in search engines ...&lt;br/&gt;&lt;br/&gt;- Alex.</description>
		<content:encoded><![CDATA[<p>Rob hi</p>
<p>Isn&#8217;t this just an SEO tactic? At the end of these pages there is a telltale link, so at the end of the &#8220;knob documents in contrary technology format&#8221; one (hah) there is a link to &#8220;buy openpim&#8221;: the target site has &#8220;oem downloads&#8221; and Vista Ultimate is only $99 there. Hmmmmm.</p>
<p>It does look like the pages are auto-generated; maybe some software selects the text based on known terms which are traffic drivers (for a time &#8216;Microsoft&#8217; and &#8216;ODF&#8217; together would have been hot tickets here).</p>
<p>There might be an idea that these pages will directly drive traffic to the target sites (in which case, they need to improve their software); but in any case the inbound links should improve the rank of the targetted pages in search engines &#8230;</p>
<p>- Alex.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Dynamic Page Served (once) in 0.540 seconds -->

