<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments on: Relevance in Mini and GSA searches</title>
	<atom:link href="http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/</link>
	<description>Google Search Appliance and Google Mini development</description>
	<pubDate>Wed, 09 Jul 2008 01:26:41 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>By: Joel</title>
		<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-5969</link>
		<dc:creator>Joel</dc:creator>
		<pubDate>Thu, 10 Jan 2008 16:25:14 +0000</pubDate>
		<guid isPermaLink="false">http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-5969</guid>
		<description>Currently nutch-IICE open source project is similar with Google GSA. You can take a look at it.

http://nutch-iice.sourceforge.net/</description>
		<content:encoded><![CDATA[<p>Currently nutch-IICE open source project is similar with Google GSA. You can take a look at it.</p>
<p><a href="http://nutch-iice.sourceforge.net/" >http://nutch-iice.sourceforge.net/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul</title>
		<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-16</link>
		<dc:creator>Paul</dc:creator>
		<pubDate>Fri, 24 Mar 2006 22:34:15 +0000</pubDate>
		<guid isPermaLink="false">http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-16</guid>
		<description>Hi Jim,

Nope, I'd have to put a software update in to change something like that and there haven't been any for the Mini recently. Funnily enough, I've just done a software update on the GSA I work on and RK is still there and acting like it did before. 

The Mini &#038; GSA don't use the same ranking algorithm as big Google, and you can use RK to give an indication of how good a result is for a search (i.e. turn it in to stars or something) so I doubt they'll turn it off in the search appliances. 

It's rather interesting that they have within the main API though. If it wasn't doing something, you'd have thought they'd leave it on (and indeed if it wasn't doing something, what's it doing there in the first place?)</description>
		<content:encoded><![CDATA[<p>Hi Jim,</p>
<p>Nope, I&#8217;d have to put a software update in to change something like that and there haven&#8217;t been any for the Mini recently. Funnily enough, I&#8217;ve just done a software update on the GSA I work on and RK is still there and acting like it did before. </p>
<p>The Mini &#038; GSA don&#8217;t use the same ranking algorithm as big Google, and you can use RK to give an indication of how good a result is for a search (i.e. turn it in to stars or something) so I doubt they&#8217;ll turn it off in the search appliances. </p>
<p>It&#8217;s rather interesting that they have within the main API though. If it wasn&#8217;t doing something, you&#8217;d have thought they&#8217;d leave it on (and indeed if it wasn&#8217;t doing something, what&#8217;s it doing there in the first place?)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jim Westergren</title>
		<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-15</link>
		<dc:creator>Jim Westergren</dc:creator>
		<pubDate>Fri, 24 Mar 2006 20:32:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-15</guid>
		<description>Hi,

Google has now put all RK values to zero for all URLs.

Either some temporary glitch OR Google didn't like that value to be public ...

Is it the same for the Google mini??</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>Google has now put all RK values to zero for all URLs.</p>
<p>Either some temporary glitch OR Google didn&#8217;t like that value to be public &#8230;</p>
<p>Is it the same for the Google mini??</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul</title>
		<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-13</link>
		<dc:creator>Paul</dc:creator>
		<pubDate>Wed, 08 Mar 2006 16:17:39 +0000</pubDate>
		<guid isPermaLink="false">http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-13</guid>
		<description>There's no doubt 'RK' is a way of scoring a page in the Mini and GSA. What it may be in the main Google API is another matter entirely.

Working out the ranking system in the Mini is one of the things on my 'to do' list, which is unfortunately filled with other stuff as well. 

From what I've seen, on page factors have a much greater effect than they do in big Google, but interlinking does still have an effect. I'm not sure about effects of where a document is in an overall site or directory tree yet, it can be difficult to assess that without outputting large amounts of test pages, which I haven't had time for.

Of course, it might be that interlinking doesn't have much effect because there isn't much interlinking in the relatively small datasets that I'm working with. That's another thing I'm going to have to test. It could be that a relatively few links will have a very large effect, because there generally wouldn't be a lot of linking to the same place on an intranet - which is what the search appliances were generally made to search.</description>
		<content:encoded><![CDATA[<p>There&#8217;s no doubt &#8216;RK&#8217; is a way of scoring a page in the Mini and GSA. What it may be in the main Google API is another matter entirely.</p>
<p>Working out the ranking system in the Mini is one of the things on my &#8216;to do&#8217; list, which is unfortunately filled with other stuff as well. </p>
<p>From what I&#8217;ve seen, on page factors have a much greater effect than they do in big Google, but interlinking does still have an effect. I&#8217;m not sure about effects of where a document is in an overall site or directory tree yet, it can be difficult to assess that without outputting large amounts of test pages, which I haven&#8217;t had time for.</p>
<p>Of course, it might be that interlinking doesn&#8217;t have much effect because there isn&#8217;t much interlinking in the relatively small datasets that I&#8217;m working with. That&#8217;s another thing I&#8217;m going to have to test. It could be that a relatively few links will have a very large effect, because there generally wouldn&#8217;t be a lot of linking to the same place on an intranet - which is what the search appliances were generally made to search.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dayo_UK</title>
		<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-12</link>
		<dc:creator>Dayo_UK</dc:creator>
		<pubDate>Wed, 08 Mar 2006 14:45:02 +0000</pubDate>
		<guid isPermaLink="false">http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-12</guid>
		<description>Ok, thanks.

So there is little doubt that rk tag is a way of scoring a page - obv the way of scoring a page in Google Mini is a lot different to Big Google. 

Is it all on-page factors ?

or would Google Mini recognize a more important document by number of references to it or where it sites in the directory tree ?</description>
		<content:encoded><![CDATA[<p>Ok, thanks.</p>
<p>So there is little doubt that rk tag is a way of scoring a page - obv the way of scoring a page in Google Mini is a lot different to Big Google. </p>
<p>Is it all on-page factors ?</p>
<p>or would Google Mini recognize a more important document by number of references to it or where it sites in the directory tree ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Paul</title>
		<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-11</link>
		<dc:creator>Paul</dc:creator>
		<pubDate>Wed, 08 Mar 2006 12:01:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-11</guid>
		<description>Yup, the results are shown in relevancy order by default. You can get several of the same RK rating, so it must have a decimal level internally, or something else it sorts by as well, so you can get...

First - 6
Second - 6
Third - 5
Fourth - 5
Fifth - 5
Sixth - 2
Seventh - 2
Eighth - 0
Ninth - 0
etc.

This is consistent from what I've seen, it's not like PageRank in big Google where you can have a PR2 page come higher up than a PR5 page in a set of results.

NB: The top result doesn't always have an RK of 10, with the rest decreasing from that, so the Mini must have some sort of relevancy algorithm that says "this is the most relevant page for the search, but I still only give it a 5/10 for relevancy for the term."</description>
		<content:encoded><![CDATA[<p>Yup, the results are shown in relevancy order by default. You can get several of the same RK rating, so it must have a decimal level internally, or something else it sorts by as well, so you can get&#8230;</p>
<p>First - 6<br />
Second - 6<br />
Third - 5<br />
Fourth - 5<br />
Fifth - 5<br />
Sixth - 2<br />
Seventh - 2<br />
Eighth - 0<br />
Ninth - 0<br />
etc.</p>
<p>This is consistent from what I&#8217;ve seen, it&#8217;s not like PageRank in big Google where you can have a PR2 page come higher up than a PR5 page in a set of results.</p>
<p>NB: The top result doesn&#8217;t always have an RK of 10, with the rest decreasing from that, so the Mini must have some sort of relevancy algorithm that says &#8220;this is the most relevant page for the search, but I still only give it a 5/10 for relevancy for the term.&#8221;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dayo_UK</title>
		<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-10</link>
		<dc:creator>Dayo_UK</dc:creator>
		<pubDate>Wed, 08 Mar 2006 11:44:05 +0000</pubDate>
		<guid isPermaLink="false">http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-10</guid>
		<description>Oops - the comments does not expect tags - before the numbers in the above posts should be the rk tag. EG RK of 5, 4, 2 and 4.

Cheers

Dayo</description>
		<content:encoded><![CDATA[<p>Oops - the comments does not expect tags - before the numbers in the above posts should be the rk tag. EG RK of 5, 4, 2 and 4.</p>
<p>Cheers</p>
<p>Dayo</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dayo_UK</title>
		<link>http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-9</link>
		<dc:creator>Dayo_UK</dc:creator>
		<pubDate>Wed, 08 Mar 2006 11:42:47 +0000</pubDate>
		<guid isPermaLink="false">http://www.gsadeveloper.com/2006/03/07/relevance-in-mini-and-gsa-searches/#comment-9</guid>
		<description>Hi, So results are returned in a relevacy order by default.

So if you choose to display the  value you will get results like this ?:-

First Title
First Desc
First URL -  5

Second Title
Second Desc
Second Url -  4

etc

So the  seems to directly reflect the relevancy in a relevancy search ? - or can an  of 2 show higher than an  of 4 in a relevancy ordered search ?</description>
		<content:encoded><![CDATA[<p>Hi, So results are returned in a relevacy order by default.</p>
<p>So if you choose to display the  value you will get results like this ?:-</p>
<p>First Title<br />
First Desc<br />
First URL -  5</p>
<p>Second Title<br />
Second Desc<br />
Second Url -  4</p>
<p>etc</p>
<p>So the  seems to directly reflect the relevancy in a relevancy search ? - or can an  of 2 show higher than an  of 4 in a relevancy ordered search ?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
