<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet href="http://feeds.feedburner.com/~d/styles/rss2full.xsl" type="text/xsl" media="screen"?><?xml-stylesheet href="http://feeds.feedburner.com/~d/styles/itemcontent.css" type="text/css" media="screen"?><!-- generator="wordpress/2.3.2" --><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:feedburner="http://rssnamespace.org/feedburner/ext/1.0" version="2.0">

<channel>
	<title>Pranam Kolari</title>
	<link>http://pranamkolari.com</link>
	<description>Search, Spam and Social Media</description>
	<pubDate>Thu, 23 Oct 2008 04:54:23 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.3.2</generator>
	<language>en</language>
			<geo:lat>37.412539</geo:lat><geo:long>-121.944618</geo:long><atom10:link xmlns:atom10="http://www.w3.org/2005/Atom" rel="self" href="http://feeds.feedburner.com/PranamKolari" type="application/rss+xml" /><item>
		<title>Siri</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/429270105/</link>
		<comments>http://pranamkolari.com/2008/10/22/siri/#comments</comments>
		<pubDate>Thu, 23 Oct 2008 04:54:23 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[socialmedia]]></category>

		<category><![CDATA[startup]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/10/22/siri/</guid>
		<description><![CDATA[Siri might just turn out to be a perfectly timed AI startup. Via hchen1.
Siri is a new Silicon Valley start-up that attempts to change to the way people use the internet. I joined Siri in Sept. 2008, but I was unable to talk about it until this week. Siri (previously known as stealth-company.com) is an [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://siri.com" onclick="javascript:urchinTracker ('/outbound/article/siri.com');">Siri</a> might just turn out to be a perfectly timed AI startup. Via <a href="http://harry.hchen1.com/2008/10/14/724" onclick="javascript:urchinTracker ('/outbound/article/harry.hchen1.com');">hchen1</a>.</p>
<blockquote><p>Siri is a new Silicon Valley start-up that attempts to change to the way people use the internet. I joined Siri in Sept. 2008, but I was unable to talk about it until this week. Siri (previously known as stealth-company.com) is an SRI spin-off company armed with $8.5M VC funding. The company inherits  technology innovations resulted from many years of AI research (e.g., the DARPA-funded CALO project).</p></blockquote>
<p>I am quite excited by a recent sneak preview.</p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/429270105" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/10/22/siri/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/10/22/siri/</feedburner:origLink></item>
		<item>
		<title>HYPERTEXT 2009</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/418965608/</link>
		<comments>http://pranamkolari.com/2008/10/12/hypertext-2009/#comments</comments>
		<pubDate>Sun, 12 Oct 2008 23:21:10 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[conferences]]></category>

		<category><![CDATA[research]]></category>

		<category><![CDATA[socialmedia]]></category>

		<category><![CDATA[society]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/10/12/hypertext-2009/</guid>
		<description><![CDATA[HYPERTEXT 2009, will be held at Torino, Italy between June 29th and July 1st next year. Perhaps, the evolution of HYPERTEXT conference reflects the increasing scope and influence of the Web over the past decade.

The Web, the Semantic Web, the Web 2.0, and Social Networks are all manifestations of the success of the link. The [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.ht2009.org/" onclick="javascript:urchinTracker ('/outbound/article/www.ht2009.org');">HYPERTEXT 2009</a>, will be held at Torino, Italy between June 29th and July 1st next year. Perhaps, the evolution of HYPERTEXT conference reflects the increasing scope and influence of the Web over the past decade.</p>
<p><a href="http://pranamkolari.com/wp-content/uploads/2008/10/turin3-final2.jpg" ><img style="border-top-width: 0px; border-left-width: 0px; border-bottom-width: 0px; border-right-width: 0px" height="93" alt="turin3-final" src="http://pranamkolari.com/wp-content/uploads/2008/10/turin3-final-thumb2.jpg" width="239" align="right" border="0" /></a></p>
<blockquote><p>The Web, the Semantic Web, the Web 2.0, and Social Networks are all manifestations of the success of the link. The Hypertext Conference provides the forum for all research concerning links: their semantics, their presentation, the applications they have been put to, the knowledge that can be derived from their analysis, and their effect on society.</p>
</blockquote>
<p>Main themes in <a href="http://www.informatik.uni-trier.de/~ley/db/conf/ht/ht96.html" onclick="javascript:urchinTracker ('/outbound/article/www.informatik.uni-trier.de');">HYPERTEXT 1996</a> included:</p>
<ul>
<li>Spatial Hypertexts </li>
<li>Autonomous Hypertext Systems and Link Discovery </li>
<li>Hypertext Rhetoric and Criticism </li>
<li>Models of Hypermedia Design and Evaluation </li>
<li>Open Hypermedia </li>
<li>Navigation in the World-Wide Web </li>
<li>Systems and Infrastructure </li>
<li>Extending the World-Wide Web </li>
</ul>
<p>With many of the above questions, now answered, researchers are moving towards the more &quot;social aspects&quot;. <a href="http://www.informatik.uni-trier.de/~ley/db/conf/ht/ht2008.html" onclick="javascript:urchinTracker ('/outbound/article/www.informatik.uni-trier.de');">HYPERTEXT 2008</a> themes included:</p>
<ul>
<li>Information linking: new models and techniques for interacting with information, automating the &quot;trailblazer&quot; </li>
<li>Social linking: link inference, analysis and modeling, similarity and retrieval, applications </li>
<li>Hypertext, culture, and communication </li>
<li>Applications of hypertext </li>
</ul>
<p>Submission deadline is February 2009. If you are interested in this area, please consider participating.</p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/418965608" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/10/12/hypertext-2009/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/10/12/hypertext-2009/</feedburner:origLink></item>
		<item>
		<title>Political Streams</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/416589808/</link>
		<comments>http://pranamkolari.com/2008/10/10/political-streams/#comments</comments>
		<pubDate>Fri, 10 Oct 2008 08:23:45 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[blogs]]></category>

		<category><![CDATA[research]]></category>

		<category><![CDATA[socialmedia]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/10/10/political-streams/</guid>
		<description><![CDATA[Political Streams from LiveLabs, is now open to business. From the FAQ,
Political Streams is an application which mines social media content in real time for political discussion. It surfaces the news articles and documents that are being discussed as well as the people and places that appear in those articles. In addition, it provides related [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://socialstreams.livelabs.com/politics/" onclick="javascript:urchinTracker ('/outbound/article/socialstreams.livelabs.com');">Political Streams</a> from LiveLabs, is <a href="http://datamining.typepad.com/data_mining/2008/10/political-streams-released.html" onclick="javascript:urchinTracker ('/outbound/article/datamining.typepad.com');">now open to business</a>. <a href="http://livelabs.com/social-streams/faq/" onclick="javascript:urchinTracker ('/outbound/article/livelabs.com');">From the FAQ</a>,</p>
<blockquote><p>Political Streams is an application which mines social media content in real time for political discussion. It surfaces the news articles and documents that are being discussed as well as the people and places that appear in those articles. In addition, it provides related information for any news article, weblog post, person or place. This related information gives a broader context, allowing the user to understand how both the mainstream and social media are discussing an issue, person or place.&#160;&#160; </p>
</blockquote>
<p>It is this last part that makes this tool interesting. </p>
<p> <center>
<p><a href="http://pranamkolari.com/wp-content/uploads/2008/10/trend1.png" ><img style="border-top-width: 0px; border-left-width: 0px; border-bottom-width: 0px; border-right-width: 0px" height="184" alt="trend" src="http://pranamkolari.com/wp-content/uploads/2008/10/trend-thumb1.png" width="314" border="0" /></a></p>
<p> </center>
<p>Very impressive and promising start. Clearly designed to &quot;scale across verticals&quot;, this is a result of work by some very smart researchers at Live Labs. I look forward to many of these &quot;yet to be uncovered&quot; verticals. A &quot;200 OK&quot; from <a href="http://socialstreams.livelabs.com" onclick="javascript:urchinTracker ('/outbound/article/socialstreams.livelabs.com');">social streams</a> is keenly awaited.</p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/416589808" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/10/10/political-streams/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/10/10/political-streams/</feedburner:origLink></item>
		<item>
		<title>Crawling Blogs</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/416589809/</link>
		<comments>http://pranamkolari.com/2008/10/09/crawling-blogs/#comments</comments>
		<pubDate>Fri, 10 Oct 2008 02:54:30 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[Spam]]></category>

		<category><![CDATA[blogs]]></category>

		<category><![CDATA[research]]></category>

		<category><![CDATA[socialmedia]]></category>

		<category><![CDATA[splogs]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/crawling-blogs/</guid>
		<description><![CDATA[Through a period when my blog was updated only once, this is how Feedburner viewed bots. 
 
Note that crawling blogs is an interesting problem: 

Recency is critical 
Ping servers are available, albeit with incomplete coverage 

Crawling blogs is also highly resource intensive: 

Network latency 
Disk access/write latency 

How do your numbers look?
]]></description>
			<content:encoded><![CDATA[<p>Through a period when my blog was updated only once, this is how <a href="http://feedburner.com" onclick="javascript:urchinTracker ('/outbound/article/feedburner.com');">Feedburner</a> viewed bots. </p>
<p><a href="http://pranamkolari.com/wp-content/uploads/2008/10/blogcrawler.jpg" ><img style="border-right: 0px; border-top: 0px; border-left: 0px; border-bottom: 0px" height="382" alt="blogcrawler" src="http://pranamkolari.com/wp-content/uploads/2008/10/blogcrawler-thumb.jpg" width="496" border="0" /></a> </p>
<p>Note that crawling blogs is an interesting problem: </p>
<ul>
<li>Recency is critical </li>
<li>Ping servers are available, albeit with incomplete coverage </li>
</ul>
<p>Crawling blogs is also highly resource intensive: </p>
<ul>
<li>Network latency </li>
<li>Disk access/write latency </li>
</ul>
<p>How do your numbers look?</p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/416589809" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/10/09/crawling-blogs/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/10/09/crawling-blogs/</feedburner:origLink></item>
		<item>
		<title>aaaTestppp</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/415356639/</link>
		<comments>http://pranamkolari.com/2008/10/08/aaatestppp/#comments</comments>
		<pubDate>Thu, 09 Oct 2008 02:34:46 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/10/08/aaatestppp/</guid>
		<description><![CDATA[A test post, to see who is indexing/aggregating this content, and how fast.
]]></description>
			<content:encoded><![CDATA[<p>A test post, to see who is indexing/aggregating this content, and how fast.</p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/415356639" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/10/08/aaatestppp/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/10/08/aaatestppp/</feedburner:origLink></item>
		<item>
		<title>ICWSM 2009</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/412110448/</link>
		<comments>http://pranamkolari.com/2008/09/24/icwsm-2009/#comments</comments>
		<pubDate>Wed, 24 Sep 2008 06:56:36 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[conferences]]></category>

		<category><![CDATA[icwsm2009]]></category>

		<category><![CDATA[research]]></category>

		<category><![CDATA[socialmedia]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/09/24/icwsm-2009/</guid>
		<description><![CDATA[CFP now open. This is an excellent event, in its third year and hosted right here in San Jose.
The social and community driven aspects of our digital lives continue to rapidly increase, resulting in transformative behaviours and, significantly, publishing and distributing huge amounts of fascinating data. The International Conference on Weblogs and Social Media will [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://icwsm.org/2009/cfp.shtml" onclick="javascript:urchinTracker ('/outbound/article/icwsm.org');">CFP now open</a>. This is an excellent event, in its third year and hosted right here in San Jose.</p>
<blockquote><p>The social and community driven aspects of our digital lives continue to rapidly increase, resulting in transformative behaviours and, significantly, publishing and distributing huge amounts of fascinating data. The International Conference on Weblogs and Social Media will meet once more in 2009 to discuss the latest research analyzing and leveraging this resource. As with previous meetings, we will bring together a wide range of researchers and industry practitioners from many disciplines providing a unique opportunity for sharing ideas and collaboration in this space.</p>
</blockquote>
<p><a href="http://en.wikipedia.org/wiki/Jon_Kleinberg" onclick="javascript:urchinTracker ('/outbound/article/en.wikipedia.org');">John Kleinberg</a> is one of the invited speakers. I wasn&#8217;t aware of the &quot;Rebel King&quot; anagram:</p>
<p> <center> <embed src="http://www.youtube.com/v/_bJ0dfp13Tg&amp;hl=en&amp;fs=1" width="425" height="344" type="application/x-shockwave-flash" allowfullscreen="true" /> </center>
<p>Prof. Kleinberg needs no introduction, but how so apt that the above piece of fascinating data is courtesy <strong>*social media*.</strong></p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/412110448" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/09/24/icwsm-2009/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/09/24/icwsm-2009/</feedburner:origLink></item>
		<item>
		<title>The Numerati</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/412110449/</link>
		<comments>http://pranamkolari.com/2008/09/17/the-numerati/#comments</comments>
		<pubDate>Wed, 17 Sep 2008 06:58:54 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[books]]></category>

		<category><![CDATA[privacy]]></category>

		<category><![CDATA[research]]></category>

		<category><![CDATA[socialmedia]]></category>

		<category><![CDATA[society]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/09/17/the-numerati/</guid>
		<description><![CDATA[Stephen Baker&#8217;s Take on Life and Technology. 
I recently came across Stephen Baker&#8217;s book via the Wall Street Review. This note on splogs got me interested:
A splog, though unreadable, is seeded with words that will attract Google ads. A computer-user may be annoyed at finding himself staring at a screen full of gibberish but click [...]]]></description>
			<content:encoded><![CDATA[<p>Stephen Baker&#8217;s Take on Life and Technology. </p>
<p>I recently came across Stephen Baker&#8217;s book via the <a href="http://online.wsj.com/article/SB122143747437734337.html?mod=2_1167_1" onclick="javascript:urchinTracker ('/outbound/article/online.wsj.com');">Wall Street Review.</a> This note on <a href="http://en.wikipedia.org/wiki/Spam_blog" onclick="javascript:urchinTracker ('/outbound/article/en.wikipedia.org');">splogs</a> got me interested:</p>
<blockquote><p>A splog, though unreadable, is seeded with words that will attract Google ads. A computer-user may be annoyed at finding himself staring at a screen full of gibberish but click on an ad anyway, allowing the robot blogger to harvest revenue. This sleight of hand has the Numerati hard at work getting their software to distinguish between a blog and a splog. Mr. Baker gives a helpful sketch of the math involved, each blog reduced to a vector in a space of several dozen dimensions.</p>
</blockquote>
<p>The problem of splogs, is one case study, through which Baker shares the positive side of the &quot;Numeratis&quot;. So what/who is a Numerati anyway? <a href="http://pranamkolari.com/wp-content/uploads/2008/09/ed-ai225-book09-dv-20080914184328.jpg" ><img style="border-right: 0px; border-top: 0px; border-left: 0px; border-bottom: 0px" height="244" alt="ED-AI225_book09_DV_20080914184328" src="http://pranamkolari.com/wp-content/uploads/2008/09/ed-ai225-book09-dv-20080914184328-thumb.jpg" width="164" align="right" border="0" /></a> <a href="http://www.thenumerati.net/index.cfm?postID=30" onclick="javascript:urchinTracker ('/outbound/article/www.thenumerati.net');">According to Baker</a>:</p>
<blockquote><p>They&#8217;re members of a global elite, and are busy analyzing our every move. They&#8217;re rummaging through mountains of data, looking for patterns of our behavior so that they can predict what we might want to buy, who we&#8217;re likely to vote for, what job we&#8217;d do better than our colleagues. Some are even matching us with potential lovers&#8230;</p>
</blockquote>
<p>Baker, through his book, uncovers the &quot;numerati cult&quot;, who they are, the positives, negatives, and the unknown. Overall, his attempt is to share what these Numeratis mean to, well, a non-Numerati. Yahoo!, Google, and IBM appear to feature prominently, so do many Numeratis. </p>
<p>Elsewhere, both positives and negatives highlighted:</p>
<p><a href="http://blogs.bnet.com/ceo/?p=1321" onclick="javascript:urchinTracker ('/outbound/article/blogs.bnet.com');">The Corner Office:</a></p>
<blockquote><p>The &quot;Numerati&quot;<strong> </strong>are an evolving class of quant-humping, algorithm experts who will be playing an enormous role in shaping our society, our economy and our lives. They are the types who founded Google and Yahoo<strong> </strong>but they are going beyond simple searching to manipulating and massaging the tremendous mass of data that we generate from Web clicks and cell phones.</p>
</blockquote>
<p><a href="http://sentimine.com/2008/were-now-the-numerati-according-to-the-wall-street-journal-and-steven-baker/" onclick="javascript:urchinTracker ('/outbound/article/sentimine.com');">Sentimine</a>:</p>
<blockquote><p>I have already ordered my copy&#8230;How could I resist when we&#8217;re mining the blogoisphere for sentiment and about to test our own home-grown splog detector?</p>
</blockquote>
<p><a href="http://baconsrebellion.blogspot.com/2008/09/numerati-how-number-crunchers-change.html" onclick="javascript:urchinTracker ('/outbound/article/baconsrebellion.blogspot.com');">Bacon Rebellion:</a></p>
<blockquote><p>&#8230;&#8220;The Numerati,&#8221; a class of math experts who quietly orchestrate the massaging of the zillions of bits of data about us. We generate the stuff every time we use our cell phones or search Google, use a grocery loyalty card or whisk through a toll booth using a Smarttag.</p>
</blockquote>
<p><a href="http://www.thinkor.org/2008/09/numerati-casting-or-folks-in-evil-light.html" onclick="javascript:urchinTracker ('/outbound/article/www.thinkor.org');">ThinkOR:</a></p>
<blockquote><p>I think it is great that operations research is getting some publicity with <a href="http://www.thinkor.org/2008/08/numerati-new-book-managing-with-math.html" onclick="javascript:urchinTracker ('/outbound/article/www.thinkor.org');">The Numerati</a>. However, there can be such a thing as a bad publicity. Is it just me or does it seem to everybody (OR folks) that this book is casting us in a rather negative light?</p>
</blockquote>
<p><a href="http://mat.tepper.cmu.edu/blog/?p=339" onclick="javascript:urchinTracker ('/outbound/article/mat.tepper.cmu.edu');">Michael Trick:</a></p>
<blockquote><p>This is a book primarily about what I would call data mining and clustering, so there are wide swathes of the &#8220;numerati&#8221; field that are not covered.&#160; But for a popular look on how our mathematics is used to characterize and predict human behavior, <em>The Numerati</em> is an extremely interesting book.</p>
</blockquote>
<p>I hope to see this book influence, and promote the positives. The target audience are the non-Numerati&#8217;s. But still, this has piqued my curiosity, <strong>ordered</strong>.</p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/412110449" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/09/17/the-numerati/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/09/17/the-numerati/</feedburner:origLink></item>
		<item>
		<title>Scott Huffman on Search Evaluation at Google</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/412110450/</link>
		<comments>http://pranamkolari.com/2008/09/16/scott-huffman-on-search-evaluation-at-google/#comments</comments>
		<pubDate>Tue, 16 Sep 2008 17:54:00 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[Spam]]></category>

		<category><![CDATA[ir]]></category>

		<category><![CDATA[research]]></category>

		<category><![CDATA[search]]></category>

		<category><![CDATA[sigir2008]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/09/16/scott-huffman-on-search-evaluation-at-google/</guid>
		<description><![CDATA[Search continues to present many interesting problems, some focusing on new parameters, some others rewiring existing parameters, and a few others shielding these parameters from adversaries (e.g. spam). Through a blog post, Scott Huffman shares how Google evaluates improvements. 
..but we are constantly evaluating everything, which can include:      - proposed [...]]]></description>
			<content:encoded><![CDATA[<p>Search continues to present many interesting problems, some focusing on new parameters, some others rewiring existing parameters, and a few others shielding these parameters from adversaries (e.g. spam). Through a <a href="http://googleblog.blogspot.com/2008/09/search-evaluation-at-google.html" onclick="javascript:urchinTracker ('/outbound/article/googleblog.blogspot.com');">blog post</a>, Scott Huffman shares how Google evaluates improvements. </p>
<blockquote><p>..but we are constantly evaluating everything, which can include:      <br />- proposed improvements to segmentation of Chinese queries       <br />- new approaches to fight spam       <br />- techniques for improving how we handle compound Swedish words       <br />- changes to how we handle links and anchortext       <br />- and everything in between</p>
</blockquote>
<p>Though spam and the web graph feature prominently, it is interesting to note how &quot;internationalization&quot; features in many of these evaluation examples, reflecting Google&#8217;s overall push in this direction. Evaluation is through click-through improvements, and statistically sound relevance metrics.</p>
<p>Evaluation metrics, I think, is one of those areas where academia can greatly influence and impact search. This is one of the more theory centric problems in search, not limited by the lack of large information retrieval data sets. Indeed, the recently concluded <a href="http://www.sigir2008.org" onclick="javascript:urchinTracker ('/outbound/article/www.sigir2008.org');">SIGIR</a> conference featured many papers in this direction.</p>
<p><strong>Score Standardization for Inter-Collection Comparison of Retrieval Systems [<a href="http://www.cs.mu.oz.au/~jz/fulltext/adcs07.pdf" onclick="javascript:urchinTracker ('/outbound/article/www.cs.mu.oz.au');">PDF</a>]</strong>     <br />W. Webber, A. Moffat and J. Zobel&#160; (University of Melbourne)</p>
<p><strong>The Good and the Bad System: Does the Test Collection Predict Users&#8217; Effectiveness? [<a href="http://dis.shef.ac.uk/mark/publications/my_papers/fp440-almaskari.pdf" onclick="javascript:urchinTracker ('/outbound/article/dis.shef.ac.uk');">PDF</a>]</strong>     <br />A. Al-Maskari, M. Sanderson and P. Clough&#160; (University of Sheffield)</p>
<p><strong>Retrieval Sensitivity Under Training Using Different Measures [<a href="http://terrierteam.blogspot.com/2008/08/sigir-2008.html" onclick="javascript:urchinTracker ('/outbound/article/terrierteam.blogspot.com');">blog</a>]</strong>     <br />B. He, C. Macdonald and I. Ounis&#160; (University of Glasgow)</p>
<p><strong>Evaluation Over Thousands of Queries [<a href="http://ciir-publications.cs.umass.edu/getpdf.php?id=809" onclick="javascript:urchinTracker ('/outbound/article/ciir-publications.cs.umass.edu');">PDF</a>]</strong>     <br />B. Carterette, V. Pavlu, E. Kanoulas, J. Allan, and J. A. Aslam&#160; (University of Massachusetts Amherst/Northeastern University)</p>
<p><strong>Novelty and Diversity in Information Retrieval Evaluation [<a href="http://plg.uwaterloo.ca/~gvcormac/novelty.pdf" onclick="javascript:urchinTracker ('/outbound/article/plg.uwaterloo.ca');">PDF</a>]</strong>     <br />C. Clarke, M. Kolla, G. Cormack, O. Vechtomova, A. Ashkan, S. B&#252;ttcher, and I. MacKinnon&#160; (University of Waterloo)</p>
<p><strong>Relevance Assessment: Are Judges Exchangeable and Does it Matter [<a href="http://es.csiro.au/pubs/bailey_sigir08.pdf" onclick="javascript:urchinTracker ('/outbound/article/es.csiro.au');">PDF</a>]</strong>     <br />P. Bailey, N. Craswell, I. Soboroff, P. Thomas, A. de Vries and E. Yilmaz&#160; (NIST/Northeastern University/Microsoft/CWI/CSIRO ICT Centre)<strong></strong></p>
<p><strong>Intuition-Supporting Visualization of User&#8217;s Performance Based on Explicit Negative Higher-Order Relevance [<a href="http://portal.acm.org/citation.cfm?doid=1390334.1390448" onclick="javascript:urchinTracker ('/outbound/article/portal.acm.org');">link</a>]</strong>     <br />H. Keskustalo, K. Jarvelin, A. Pirkola and J. Kekalainen&#160; (University of Tampere)</p>
<p>Elsewhere, comments on the article&#160; &#8211;</p>
<p><a href="http://seodialect.com/2008/09/16/google-discusses-search-evaluation-process/" onclick="javascript:urchinTracker ('/outbound/article/seodialect.com');">Seo Dialect</a>:</p>
<blockquote><p>The rest of the points are things we&#8217;ve been hearing from Google for a long time. We know they&#8217;re progressing on universal and personalization search efforts, all in their famous intent to create the best user experience.</p>
</blockquote>
<p><a href="http://webtribution.com/2008/09/16/search-evaluation-at-google/" onclick="javascript:urchinTracker ('/outbound/article/webtribution.com');">Webtribution</a>:</p>
<blockquote><p>Anyone remotely involved in SEO or digital marketing should always take advantage of any information / insight Google opens to the public.</p>
</blockquote>
<p><a href="http://www.searchenginecaffe.com/2008/09/beyond-relevance.html" onclick="javascript:urchinTracker ('/outbound/article/www.searchenginecaffe.com');">SearchEngineCaffe</a>:</p>
<blockquote><p>One of my biggest issues with <a href="http://trec.nist.gov/" onclick="javascript:urchinTracker ('/outbound/article/trec.nist.gov');">TREC</a> and similar environments is the single focus on relevance &#8230; for example, a spam post that is relevant to a topic would be acceptable, even if you would never want to read it in real life. It&#8217;s time we move beyond the basics and find ways to tackle the more challenging retrieval quality aspects&#8230;</p>
</blockquote>
<p>Also, at <a href="http://www.webmasterworld.com/google/3745339.htm" onclick="javascript:urchinTracker ('/outbound/article/www.webmasterworld.com');">webmasterworld</a>.</p>
<p>For readers interested in the overall problem of IR evaluation, a paper by Kalervo &amp; Jaana on &quot;<a href="http://www.info.uta.fi/tutkimus/fire/archive/KJJKSIGIR00.pdf" onclick="javascript:urchinTracker ('/outbound/article/www.info.uta.fi');"><strong>IR evaluation methods for retrieving highly relevant documents</strong></a>&quot; offers an excellent introduction.</p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/412110450" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/09/16/scott-huffman-on-search-evaluation-at-google/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/09/16/scott-huffman-on-search-evaluation-at-google/</feedburner:origLink></item>
		<item>
		<title>LinkedIn Hacked? No, Just Down</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/412110451/</link>
		<comments>http://pranamkolari.com/2008/09/06/linkedin-hacked-no-just-down/#comments</comments>
		<pubDate>Sun, 07 Sep 2008 03:47:03 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[socialmedia]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/09/06/linkedin-hacked-no-just-down/</guid>
		<description><![CDATA[Just noticed LinkedIn is down. Downtimes remind us how important these social sites have grown to be. Lloyd Taylor, LinkedIn&#8217;s VP of Technical Operations clarifies.
Update: Site Now Carries this message:
LinkedIn is currently unavailable while we make upgrades to improve our service to you.  We’ll return around 12:00am (PT) September 7th, 2008.
We apologize for the [...]]]></description>
			<content:encoded><![CDATA[<p>Just noticed LinkedIn is down. Downtimes remind us how important these social sites have grown to be. Lloyd Taylor, LinkedIn&#8217;s VP of Technical Operations <a href="http://kameir.com/linkedin-hacked.html#IDComment5770733" onclick="javascript:urchinTracker ('/outbound/article/kameir.com');">clarifies</a>.</p>
<p>Update: Site Now Carries this message:</p>
<blockquote><p><img src="http://pranamkolari.com/wp-content/uploads/2008/09/pic_li_wizard_411x389.thumbnail.gif" alt="pic_li_wizard_411x389.gif" align="right" />LinkedIn is currently unavailable while we make upgrades to improve our service to you.  We’ll return around 12:00am (<abbr title="Pacific Time">PT</abbr>) September 7th, 2008.</p>
<p>We apologize for the inconvenience and appreciate your patience. Thank you for using LinkedIn!</p></blockquote>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/412110451" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/09/06/linkedin-hacked-no-just-down/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/09/06/linkedin-hacked-no-just-down/</feedburner:origLink></item>
		<item>
		<title>Wikimatix: Engagement around Buzzy Keywords</title>
		<link>http://feeds.feedburner.com/~r/PranamKolari/~3/412110452/</link>
		<comments>http://pranamkolari.com/2008/08/16/wikimatix-engagement-around-buzzy-keywords/#comments</comments>
		<pubDate>Sat, 16 Aug 2008 09:09:02 +0000</pubDate>
		<dc:creator>pranamkolari</dc:creator>
		
		<category><![CDATA[social]]></category>

		<category><![CDATA[socialmedia]]></category>

		<guid isPermaLink="false">http://pranamkolari.com/2008/08/16/wikimatix-engagement-around-buzzy-keywords/</guid>
		<description><![CDATA[Wikimatix is a uber-cool app developed by Akshay Java, from UMBC. The tool mashes up buzzy keywords, with Wikipedia to bootstrap conservations. Go check it out.
Btw, Akshay Java, and UMBC sounds very familiar. I wonder why.
]]></description>
			<content:encoded><![CDATA[<p><a href="http://wikimatix.com/" onclick="javascript:urchinTracker ('/outbound/article/wikimatix.com');">Wikimatix</a> is a uber-cool app developed by Akshay Java, from UMBC. The tool mashes up buzzy keywords, with Wikipedia to bootstrap conservations. Go check it out.</p>
<p>Btw, Akshay Java, and UMBC sounds very familiar. I wonder why.</p>
<img src="http://feeds.feedburner.com/~r/PranamKolari/~4/412110452" height="1" width="1"/>]]></content:encoded>
			<wfw:commentRss>http://pranamkolari.com/2008/08/16/wikimatix-engagement-around-buzzy-keywords/feed/</wfw:commentRss>
		<feedburner:origLink>http://pranamkolari.com/2008/08/16/wikimatix-engagement-around-buzzy-keywords/</feedburner:origLink></item>
	</channel>
</rss>
