<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Exalead Blog &#187; Tips and tricks</title>
	<atom:link href="http://blog.exalead.com/category/tips-and-tricks/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.exalead.com</link>
	<description>The blog of Exalead</description>
	<lastBuildDate>Fri, 20 Nov 2009 16:29:57 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Microsoft and Google Play Catch Up</title>
		<link>http://blog.exalead.com/2009/06/04/microsoft-and-google-play-catch-up/</link>
		<comments>http://blog.exalead.com/2009/06/04/microsoft-and-google-play-catch-up/#comments</comments>
		<pubDate>Thu, 04 Jun 2009 14:44:14 +0000</pubDate>
		<dc:creator>Paul</dc:creator>
				<category><![CDATA[On the personal side...]]></category>
		<category><![CDATA[Powered by Exalead]]></category>
		<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/?p=635</guid>
		<description><![CDATA[With names like Wonder Wheel  and Bing xRank, it’s easy to see how someone can get caught up in the marketing fanfare of these recent technology announcements.
Fortunately, for our customers, these capabilities (and more) are already available to them.
For example, Miiget  is a technology that we&#8217;ve had since 2008 and is equivalent to Wonder Wheel [...]]]></description>
			<content:encoded><![CDATA[<p>With names like <a href="http://blogs.zdnet.com/BTL/?p=17842" target="_blank">Wonder Wheel </a> and <a href="http://techfragments.com/news/839/Tech/Microsoft_Bing_Search_Launches_Early_Preview.html" target="_blank">Bing xRank</a>, it’s easy to see how someone can get caught up in the marketing fanfare of these recent technology announcements.</p>
<p>Fortunately, for <a href="http://www.exalead.com/software/customers" target="_blank">our customers</a>, these capabilities (and more) are already available to them.</p>
<p>For example, <a href="http://miiget.labs.exalead.com" target="_blank">Miiget</a>  is a technology that we&#8217;ve had since 2008 and is equivalent to Wonder Wheel in concept.</p>
<p> <a href="http://www.tweepz.com">www.tweepz.com</a> was built using Exalead technology as a small evening project by an exalead consultant and is similar to xRank. </p>
<p>But to be fair, Microsoft had to do something and Google had to respond (or preempt?). Yet, the battle isn&#8217;t just between Microsoft and Google, but between these two and other web businesses as well.</p>
<p>These new offerings raise the user experience bar for every business that depends upon web traffic. As a provider to these other businesses, we&#8217;re keenly aware of that. We have been from our beginning. That&#8217;s why customers such as <a href="www.yakaz.com " target="_blank">Yakaz</a>,  <a href="http://www.hometrader.ca" target="_blank">Hometrader</a>, <a href="http://www.118218.fr" target="_blank">118218</a>, and <a href="http://mashable.com/2006/08/09/skyblog-and-skyrock-the-french-myspace" target="_blank">Skyblog</a> use our technology. To improve user experience and user traffic and, in some cases, dramatically so.</p>
<p>Exalead believes in the concept of the long tail. That is, not everyone is satisfied with services provided by Google and Microsoft. In fact we find many users rely on multiple search sites, each for a different purpose. It&#8217;s a best-of-breed idea. This makes sense to us, since the time taken to go to a specific site is negligible compared to the length of the conversation with the site. So why not go to the best?</p>
<p>At least that&#8217;s how we see it. So in support of our customers, Exalead will continue to lead search technology performance and innovation. Current work in the area of semantics for increased recall, precision and insight will enable our customers to be a step ahead of their competition and provide better information with less cost and effort.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2009/06/04/microsoft-and-google-play-catch-up/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Exalead’s Morgan Zimmermann to Discuss Search Opportunities for Online Publishers</title>
		<link>http://blog.exalead.com/2009/03/02/exalead%e2%80%99s-morgan-zimmermann-to-discuss-search-opportunities-for-online-publishers/</link>
		<comments>http://blog.exalead.com/2009/03/02/exalead%e2%80%99s-morgan-zimmermann-to-discuss-search-opportunities-for-online-publishers/#comments</comments>
		<pubDate>Mon, 02 Mar 2009 16:03:06 +0000</pubDate>
		<dc:creator>Paul</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Tips and tricks]]></category>
		<category><![CDATA[Webinar]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/?p=435</guid>
		<description><![CDATA[Since Exalead started as a web search company in 2000, we’ve gained insights into the kind of search system that users need to locate specific information across a complex set of media and data types on the web. Many online publishers are finding that standard built-in search solutions don’t fit the bill when success with [...]]]></description>
			<content:encoded><![CDATA[<p>Since Exalead started as a web search company in 2000, we’ve gained insights into the kind of search system that users need to locate specific information across a complex set of media and data types on the web. Many online publishers are finding that standard built-in search solutions don’t fit the bill when success with readers and users depends in large part on the ease with which they can navigate through content within the website and beyond.</p>
<p>For instance, with the growth of video and audio as outlets for content on the web, advanced search is necessary to cull this data and make it readily available and integrated with more structured data types. In addition, users are joining the information creation process with social reviews and rankings, and online publishers need search that effectively tracks and analyzes these interactions.</p>
<p>Our customers have also found that highly scalable search architecture is important as the web is becoming increasingly interconnected in interesting ways. From the perspective of online publishers, there’s a great opportunity for “mash-ups” between data from the local site itself and internal databases, and useful contextual data from the web and outside applications. </p>
<p>On Thursday March 5th at 8:00am PT, Morgan Zimmermann, Exalead VP of Business Development, will continue our discussion about the business opportunities that advanced search presents to online publishers looking to do more with their content. Morgan will discuss how online publishers can:</p>
<p>-	Regain complete control over their content and transform it into a long-term, organic, profitable business<br />
-	Achieve strategic independence from content aggregators and advertisers<br />
-	Secure brand positioning across a spectrum of innovative user experiences<br />
-	Use &#8216;mash up&#8217; and Hybrid search to improve profitability</p>
<p><a href="http://go-mb.exalead.com/pages/start/onlinepubweb1template/index.html?Campaign_Id=801&#038;Activity_Id=1121">You can register for the webinar entitled “Online Publishers: Is Content the Only Key to Success?” here.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2009/03/02/exalead%e2%80%99s-morgan-zimmermann-to-discuss-search-opportunities-for-online-publishers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>FAST&#8217;s Performance Slowdown</title>
		<link>http://blog.exalead.com/2008/11/27/fasts-performance-slowdown/</link>
		<comments>http://blog.exalead.com/2008/11/27/fasts-performance-slowdown/#comments</comments>
		<pubDate>Thu, 27 Nov 2008 14:34:15 +0000</pubDate>
		<dc:creator>Paul</dc:creator>
				<category><![CDATA[Events]]></category>
		<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/2008/11/27/fasts-performance-slowdown/</guid>
		<description><![CDATA[Heard something notable at the Butler Group Enterprise Search Strategy Briefing in late November.
A rep from Scotland&#8217;s National Health Service talked through a case study of their use of FAST and offered up some &#8230; interesting &#8230; metrics.
The customer indicated that they were anticipating growing their system from 11 million documents to 18 million documents [...]]]></description>
			<content:encoded><![CDATA[<p>Heard something notable at the <strong><a href="http://www.butlergroup.com/" target="_blank">Butler Group</a> Enterprise Search Strategy Briefing</strong> in late November.</p>
<p>A rep from <strong>Scotland&#8217;s National Health Service</strong> talked through a case study of their use of FAST and offered up some &#8230; interesting &#8230; metrics.</p>
<p>The customer indicated that they were anticipating growing their system from 11 million documents to 18 million documents &#8230; but that this growth would require <strong>22 servers</strong>.  Considering that NHS employes a staff of roughly 150,000, and assuming all these staff run 10 searches a day for a maximum of &#8230; say &#8230; 16 hours per day, this is roughly 1 query per second.</p>
<p>This means <strong>FAST</strong>, for this implementation, needs 22 servers to run 1 query per second across 18 million docs. Without going into all the technical detail, this isn&#8217;t entirely surprising given <strong>FAST</strong>&#8217;s dependence on a slew of different technologies (which adds to the complexity of their deployment) and their need to distribute to more and more servers as the amount of content that needs to be located, searched and indexed grows (which presents a challenge for companies whose data pools are increasing &#8230;  i.e. all of them).</p>
<p>Just for the sake of comparison, <strong>Exalead customers get 20 queries per second across 20 million docs with only 1 server</strong> &#8212; less cumbersome, more efficient and <strong>greener</strong> than the 22 servers described by NHS.</p>
<p>Especially in this time of economic downturn and budget belt-tightening, it&#8217;s even more crucial that businesses get the most IT bang for their buck.   Make sure you make the right choice for your information access so you can utilize your important data and preserve your corporate resources.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2008/11/27/fasts-performance-slowdown/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Map the Web with Gephi</title>
		<link>http://blog.exalead.com/2008/11/03/map-the-web-with-gephi/</link>
		<comments>http://blog.exalead.com/2008/11/03/map-the-web-with-gephi/#comments</comments>
		<pubDate>Mon, 03 Nov 2008 17:15:27 +0000</pubDate>
		<dc:creator>Carole</dc:creator>
				<category><![CDATA[Programming]]></category>
		<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/2008/11/03/map-the-web-with-gephi/</guid>
		<description><![CDATA[Innovation is a leading priority for Exalead. That is why the company often gives its support to external initiatives like this project set up by students from U.T.C. that developed Gephi, in collaboration with WebAtlas association. Gephi is an open source software under GPL3 license that enables 3D networks graphics manipulation, exploration and visualization.

What is [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Innovation</strong> is a leading priority for <strong>Exalead</strong>. That is why the company often gives its support to external initiatives like this project set up by students from <a href="http://www.utc.fr/the_university/index.php" target="_blank">U.T.C.</a> that developed <a href="http://gephi.org/" target="_blank"><strong>Gephi</strong></a>, in collaboration with <strong><a href="http://webatlas.fr/" target="_blank">WebAtlas</a></strong> association. <strong>Gephi</strong> is an <strong>open source software</strong> under GPL3 license that enables <strong>3D networks graphics manipulation, exploration and visualization.</strong></p>
<p style="text-align: center"><a href="http://web-mining.fr/files/droit_auteur/carto_droit_auteur_generale.pdf" target="_blank"><img src="http://web-mining.fr/files/droit_auteur/Map1.png" alt="Carte DPI" /></a></p>
<p><em>What is this graphic about?<br />
</em>It represents a <strong>semantic analysis</strong> of the relationship between terms used on the Web to speak about <strong>Intellectual Property Rights</strong> in the French language.  Each <strong>node</strong> symbolizes a word or a group of words and each <strong>edge</strong> connects two expressions when these are <u>co-cited in more than 120 000 web pages</u>.  Each color refers to a <strong>&#8220;semantic cluster&#8221;</strong>, which is a bunch of words than concern the same topic.</p>
<p><em>How can I get this type of graphic?</em><br />
After an <strong>extraction of related terms found on Exalead databases</strong> and a manual filtering phase, the project team receives a <strong><a href="http://gephi.org/wp-content/uploads/2008/10/ipr-semantic-graphe.gdf" target="_blank">GDF file with ordered data</a></strong>.  Then, the exploitation of this file by <strong>Gephi</strong> combined with a specific algorithm leads to the <strong>data “spatialization”</strong>. Then color filters highlight different semantic clusters.</p>
<p>Here is one of the first demonstrations of <strong>Gephi</strong> with <strong>real-time spatialization of several keyword clusters.</strong> In this video, the blue color refers to a “genetics” cluster, orange nodes relate to terms about biology and laboratories, green ones concern words speaking about controversy in the domain of GMOs and purple nodes relate to innovation and research development in biotechnology.</p>
<p><center><a href="http://blog.exalead.fr/wp-content/uploads/2008/10/processus_expansion_raffinement1.JPG" alt="processus_expansion_raffinement" width="382" height="270"><br />
<object width="400" height="251"><param name="allowfullscreen" value="true"></param><param name="allowscriptaccess" value="always"></param><param name="movie" value="http://vimeo.com/moogaloop.swf?clip_id=2035117&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1"></param>	<embed src="http://vimeo.com/moogaloop.swf?clip_id=2035117&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" type="application/x-shockwave-flash" allowfullscreen="true" allowscriptaccess="always" width="400" height="251"></embed></object><br />
</a><a href="http://vimeo.com/2035117?pg=embed&amp;sec=2035117">Gephi &#8211; Dynamic demo</a> from <a href="http://vimeo.com/user861314?pg=embed&amp;sec=2035117">gephi</a> on <a href="http://vimeo.com?pg=embed&amp;sec=2035117">Vimeo</a></center><center> </center><strong>Congratulations</strong> to the project team for this <strong>great web mapping tool!</strong><br />
Do not hesitate to visit the <a href="http://gephi.org/" target="_blank">Gephi website</a> to obtain more information and <a href="http://gephi.org/support/demo/" target="_blank">test this software.<br />
</a>If you are interested in this subject, you should know that <strong>the team continues to recruit</strong>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2008/11/03/map-the-web-with-gephi/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Exalead : Right on target !</title>
		<link>http://blog.exalead.com/2008/05/19/exalead-right-on-target/</link>
		<comments>http://blog.exalead.com/2008/05/19/exalead-right-on-target/#comments</comments>
		<pubDate>Mon, 19 May 2008 13:08:21 +0000</pubDate>
		<dc:creator>Carole</dc:creator>
				<category><![CDATA[Kudos]]></category>
		<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/2008/05/19/exalead-right-on-target/</guid>
		<description><![CDATA[ 
Exalead has been part of the server revolution, providing faster and more efficient service over the years.
This is not the first time nor the last time you will hear about our server improvements.  In fact, we will be providing regular updates to address the evolution of traffic, the increase in the number of [...]]]></description>
			<content:encoded><![CDATA[<p style="margin: 10px 5px 5px; float: right; padding-left: 2px"> <a href="http://blog.exalead.com/2008/05/19/exalead-right-on-target/bondjpg/" rel="attachment wp-att-233" title="bond.JPG"><img src="http://blog.exalead.com/wp-content/uploads/2008/05/bond.JPG" alt="bond.JPG" border="0" height="126" width="87" /></a></p>
<p>Exalead has been part of the server revolution, providing faster and more efficient service over the years.</p>
<p>This is not the first time nor the last time you will hear about our server improvements.  In fact, we will be providing regular updates to address the evolution of traffic, the increase in the number of indexed pages and our improvements in service.</p>
<p>Here is a brief summary of the stages that have affected the life of our production center.</p>
<p>To begin, Exalead installed some machines in the offices of our service providers.  But considering our growth, it was necessary to give them dedicated homes that did not use our equipment.</p>
<p>March 2005: We installed the first dedicated room with the opening of our Site 1, consisting of more than 10 machines shared in more than 6 racks.  Yes, they were big machines!  This allowed us to index 1 billion pages.</p>
<p>August 2005: We added around 30 servers to address the traffic, with the capability of indexing more than 2 billion pages.</p>
<p>March 2006:  Then things really heated up, and we opened a second site and added more than 50 servers (10 racks) that enabled us to index more than 8 billion pages.</p>
<p>January 2007: As a result of the abundance of services and ideas that leave our laboratories, we had to add more servers to Site 1.</p>
<p>2007 to Present: Our laboratories continue to work and prepare for an upgrade to enrich our architecture, improve speed, and become more robust and efficient.  But we had to add 20 machines to Site 1 in august 2007.</p>
<p>Since then, we have been actively working to put these improvements on line, so you can see the evolution, but this is not the calm before the storm&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2008/05/19/exalead-right-on-target/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Guide for Webmasters: Part 1, Making the Most of Your Content</title>
		<link>http://blog.exalead.com/2008/03/17/guide-for-webmasters-part-1-making-the-most-of-your-content/</link>
		<comments>http://blog.exalead.com/2008/03/17/guide-for-webmasters-part-1-making-the-most-of-your-content/#comments</comments>
		<pubDate>Mon, 17 Mar 2008 10:39:44 +0000</pubDate>
		<dc:creator>Sébastien</dc:creator>
				<category><![CDATA[Programming]]></category>
		<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/2008/03/17/guide-for-webmasters-part-1-making-the-most-of-your-content/</guid>
		<description><![CDATA[Interested in improving the visibility of your site on our engine? Hopefully this series of posts will help.
First up: answers to the two most frequently posed webmaster questions:
1) Why doesn’t my site appear (or why does it only partially appear) when I do a site search (i.e., typing “site: mysitename.com” in the search box)?
  [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal"><span lang="EN-GB">Interested in improving the visibility of your site on our engine? Hopefully this series of posts will help.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><o:p></o:p>First up: answers to the two most frequently posed webmaster questions:<o:p></o:p></span></p>
<p class="MsoNormal" style="margin-left: 36pt; text-indent: -18pt"><!----><strong><span lang="EN-GB"><span>1) </span></span></strong><span lang="EN-GB"><strong>Why doesn’t my site appear (or why does it only partially appear) when I do a site search (i.e., typing “site: mysitename.com” in the search box)?</strong><o:p></o:p></span></p>
<p class="MsoNormal" style="margin-left: 18pt"><span lang="EN-GB"><o:p> </o:p>        All or part of your site may be inaccessible to our robots. Try the following to improve your performance:<o:p></o:p></span></p>
<ul>
<li><!----><span style="font-family: Symbol" lang="EN-GB"><span></span></span><span lang="EN-GB">Make sure that all pages are accessible by at least one static link.<o:p></o:p></span></li>
<li><!----><span style="font-family: Symbol" lang="EN-GB"><span></span></span><span lang="EN-GB">Place links to your most important content on every page of your site.<o:p></o:p></span></li>
<li><!----><span style="font-family: Symbol" lang="EN-GB"><span></span></span><span lang="EN-GB">Keeping in mind that certain dynamic pages can’t be accessed by our robots, move content as needed to static (or simply more accessible) pages (see “<a href="http://blog.exalead.com/2007/07/11/the-road-to-better-site-indexing-%e2%80%93-introduction-and-episode-1/">The Road to Better Site Indexing – Introduction and Episode 1</a>”)<o:p></o:p></span></li>
<li><!----><span style="font-family: Symbol" lang="EN-GB"><span></span></span><span lang="EN-GB">Be sure the robots.txt file in your root directory is not blocking access to our crawler (use our <a href="http://www.exalead.com/search?action=displayRobotCheckerForm">robot checker form</a> to test accessibility).<o:p></o:p></span></li>
<li><!----><span style="font-family: Symbol" lang="EN-GB"><span><span style="font-family: 'Times New Roman'; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal"></span></span></span><span lang="EN-GB">Create a site map (see “<a href="http://blog.exalead.com/2007/08/28/episode-4-sitemaps-based-on-a-true-story/">The Road to Better Site Indexing: Episode 3, Sitemaps</a>”) and <a href="http://www.exalead.com/search/submitYourSitePage">submit it on our site</a>. <o:p></o:p></span></li>
</ul>
<p class="MsoNormal" style="margin-left: 36pt; text-indent: -18pt"><!----><strong><span lang="EN-GB"><span>2) </span></span></strong><span lang="EN-GB"><strong>Why doesn’t my site appear for a given keyword?</strong><o:p></o:p></span></p>
<ul>
<li><!----><span style="font-family: Symbol" lang="EN-GB"><span></span></span><span lang="EN-GB">First, check to see that the keyword is in our index for your site. Enter the keyword in the search field, along with “site:mysitename.com” to limit the search for that keyword to just your site (replacing “mysitename.com” with your domain name, of course). If it is not indexed, follow the steps for question 1 above.<o:p></o:p></span></li>
<li><!----><span style="font-family: Symbol" lang="EN-GB"><span></span></span><span lang="EN-GB">Refine the keywords in your site so they are as specific as possible. It could be the keyword you are checking is too general, and sites that larger, more relevant and/or more popular are ranking ahead of your site for that keyword.<o:p></o:p></span></li>
<li><!----><span style="font-family: Symbol" lang="EN-GB"><span></span></span><span lang="EN-GB">Verify that the content of your site corresponds well to the keyword. It’s not enough for a keyword to simply appear, it must be integrally related to the rest of the site content. <o:p></o:p></span></li>
</ul>
<p class="MsoNormal"><span lang="EN-GB">        You&#8217;ll find further info on keyword relevancy in <a href="http://blog.exalead.com/2007/06/04/search-engine-optimization-seo-more-old-school-than-you-think/">Search Engine Optimization (SEO): More Old-School Than You Think</a>.”<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB">        And be careful out there! Stick to keeping your content fresh and relevant for your target audience. Reverting to tricks like hidden text, duplicate content, spam link exchanges or other such tactics to improve your ranking could get you banned from our index (for more info, see “<a href="http://blog.exalead.com/2007/07/11/the-road-to-better-site-indexing-%e2%80%93-episode-2/">The Road to Better Site Indexing – Episode 2</a>”).<o:p></o:p></span></p>
<p><span lang="EN-GB">You’ll also find <a href="http://www.exalead.com/about/document/53#22">general webmaster tips</a> in our site’s help pages.<o:p></o:p></span></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2008/03/17/guide-for-webmasters-part-1-making-the-most-of-your-content/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Video Search Update, Part 3: Preview &amp; Refine Results</title>
		<link>http://blog.exalead.com/2008/02/01/video-search-update-part-3-preview-refine-results/</link>
		<comments>http://blog.exalead.com/2008/02/01/video-search-update-part-3-preview-refine-results/#comments</comments>
		<pubDate>Fri, 01 Feb 2008 09:54:59 +0000</pubDate>
		<dc:creator>Carole</dc:creator>
				<category><![CDATA[New products & features]]></category>
		<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/2008/02/01/video-search-update-part-3-preview-refine-results/</guid>
		<description><![CDATA[Now that we’ve updated you about new platforms added to the index (Part 2), and told you how you can add your own videos, let’s take a closer look at the structure of the search results. 
Enter for example ‘Daft Punk’ in the video search engine:
http://www.exalead.com/video/results?q=daft+punk
When you click on a video’s thumbnail image, you can [...]]]></description>
			<content:encoded><![CDATA[<p>Now that we’ve updated you about new platforms added to the index (Part 2), and told you how you can add your own videos, let’s take a closer look at the structure of the search results. <o:p></o:p></p>
<p><span lang="EN-GB">Enter for example ‘Daft Punk’ in the video search engine:<br />
<a href="http://www.exalead.com/video/results?q=daft+punk" target="_blank">http://www.exalead.com/video/results?q=daft+punk</a><o:p></o:p></span></p>
<p><span lang="EN-GB">When you click on a video’s thumbnail image, you can preview the video without leaving the search results page. Handy, huh?<o:p></o:p></span></p>
<p><span lang="EN-GB">You can also refine your results by confining them to a particular source, a specific video duration, or even a specific topical category and descriptive keyword.<o:p></o:p></span></p>
<p class="MsoNormal">Happy video hunting!</p>
<p class="MsoNormal"><img src="http://blog.exalead.com/wp-content/uploads/2008/02/exalead-video-results2.jpg" alt="Refining Exalead Video Search Results" /></p>
<p class="MsoNormal">&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2008/02/01/video-search-update-part-3-preview-refine-results/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Search secrets: searching like a pro with regular expressions</title>
		<link>http://blog.exalead.com/2007/09/13/search-secrets-searching-like-a-pro-with-regular-expressions/</link>
		<comments>http://blog.exalead.com/2007/09/13/search-secrets-searching-like-a-pro-with-regular-expressions/#comments</comments>
		<pubDate>Thu, 13 Sep 2007 13:27:01 +0000</pubDate>
		<dc:creator>Carole</dc:creator>
				<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/2007/09/13/search-secrets-searching-like-a-pro-with-regular-expressions/</guid>
		<description><![CDATA[    Well known to computer programmers, regular expressions (“regex” or “regexp” to insiders) are also a secret search weapon of librarians around the globe. A regular expression is simply a text pattern that can be used to find matching text strings. Regular expressions use wildcards and special shorthand notations to describe these [...]]]></description>
			<content:encoded><![CDATA[<p>    Well known to computer programmers, <strong>regular expressions</strong> (“regex” or “regexp” to insiders) are also a secret search weapon of librarians around the globe. A regular expression is simply a text pattern that can be used to find matching text strings. Regular expressions use wildcards and special shorthand notations to describe these patterns. Regular expressions are not available in most search engines, but they are part of <strong>Exalead’s Advanced Search options</strong> (which is one reason hard-core info-geeks are so fond of Exalead!).</p>
<p><em>What does a regular expression look like?</em> Let’s look at an example using a period (“.”), the regular expression wildcard representing all letters of the alphabet. If you wanted to use this wildcard within a regular expression in the Exalead engine, you would first frame your query with forward-slash marks “/” to indicate it’s a regular expression, then place the period wherever you wanted variations of a single letter to appear. Thus, the regular expression “<a href="http://www.exalead.fr/search/results?q=%2Fc.p%2F&amp;x=399&amp;y=8&amp;%24mode=allweb">/c.p/</a>” would return matches where the “.” is replaced by any single letter, as in &#8220;cop,&#8221; &#8220;cup&#8221; and &#8220;cap&#8221;.</p>
<p>Now one would be hard pressed to imagine a practical reason for running a search that would return both &#8220;cop&#8221; and &#8220;cup,&#8221; but using regular expressions to search for potentially misspelled proper names, product codes or technical terms can be very handy.</p>
<p>Imagine for instance you’re doing some research on Exalead. To make sure you haven’t missed an important document in which Exalead has been misspelled, you might try something like “<a href="http://www.exalead.fr/search/results?q=%2Fex.lead%2F&amp;x=389&amp;y=5&amp;%24mode=allweb">/ex.lead/</a>” to catch variants such as “exelead” or “exilead”.</p>
<p>You could also try “<a href="http://www.exalead.fr/search/results?q=%2Fexa*lead%2F&amp;x=378&amp;y=8&amp;%24mode=allweb">/exa*lead/</a>”, with the asterisk (“*”) being a regex wildcard that indicates the preceding letter can be repeated 0 or more times. A search on “/exa*lead/” would therefore return variants like “exalead”, “exaalead” and “exaaalead”.</p>
<p>If you wanted to exclude documents in which Exalead was correctly spelled, you could simply add “-exalead” to your query, i.e. “<a href="http://www.exalead.fr/search/results?q=%2Fexa*lead%2F+-exalead&amp;x=0&amp;y=0&amp;%24mode=allweb">/exa*lead/ -exalead</a>”, returning only matches like “exaalead” and “exaaalead”.  (The minus sign is an Exalead Advanced Search option that lets you exclude words from the results for <em>any</em> query. Looking for company names containing &#8220;Einstein&#8221; but no time to wade through a zillion articles on Albert Einstein? Try &#8220;<a href="http://www.exalead.fr/search/results?q=einstein+-albert&amp;x=0&amp;y=0&amp;%24mode=allweb">einstein -albert</a>&#8220;!).</p>
<p>Sometimes, you may not be using regular expressions to hunt for misspellings but rather to include legitimate spelling variations, like “color” (American English) and “colour” (British English).  Here, you could use a vertical bar (“|”) between alternative characters or words, which is regex ‘shorthand’ for “or”.  For example, entering &#8220;<a href="http://www.exalead.fr/search/results?q=%2Fgr%28a%7Ce%29y%2F+whale&amp;x=0&amp;y=0&amp;%24mode=allweb">/gr(a|e)y/ whale</a>” would tell ExaBot to find all matches for either “gray whale” <em>or</em> “grey whale.”</p>
<p>To learn more about regular expressions, take a look at <a href="http://en.wikipedia.org/wiki/Regular_expression">the regex Wikipedia article</a>. Be sure to also look over all of <a href="http://www.exalead.com/about/document/24">Exalead’s Advanced Search options</a>. Used alone or in combination (as with the “/exa*lead/ -exalead” example), <strong>they offer an easy way to inject some high-octane fuel into your next query.</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2007/09/13/search-secrets-searching-like-a-pro-with-regular-expressions/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Exalead: A New Addition to the Prediction Research Toolbox?</title>
		<link>http://blog.exalead.com/2007/08/29/exalead-a-new-addition-to-the-prediction-research-toolbox/</link>
		<comments>http://blog.exalead.com/2007/08/29/exalead-a-new-addition-to-the-prediction-research-toolbox/#comments</comments>
		<pubDate>Wed, 29 Aug 2007 12:46:38 +0000</pubDate>
		<dc:creator>Carole</dc:creator>
				<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/2007/08/29/exalead-a-new-addition-to-the-prediction-research-toolbox/</guid>
		<description><![CDATA[
Formulating predictions, such as the movements of the stock market or the likelihood of a movie&#8217;s success, have traditionally been costly, and unevenly successful, endeavors.  Prediction research often involves labor-intensive efforts to understand geographically localized social trends and “on-the-ground” conditions.  Now, as reported in Knowledge@Wharton , two Wharton professors, Albert Saiz and Uri [...]]]></description>
			<content:encoded><![CDATA[<p><img src="wp-content/imported/images/en_US/wharton.jpg" title="Wharton" alt="Wharton" style="margin: 0px 0px 5px 5px; float: right" border="0" /><br />
Formulating predictions, such as the movements of the stock market or the likelihood of a movie&#8217;s success, have traditionally been costly, and unevenly successful, endeavors.  Prediction research often involves labor-intensive efforts to understand geographically localized social trends and “on-the-ground” conditions.  Now, as reported in <a href="http://knowledge.wharton.upenn.edu/article.cfm?articleid=1786">Knowledge@Wharton</a> , two Wharton professors, Albert Saiz and Uri Simonsohn, have found a cheaper way to deliver some of the same benefits as this type of resource-intensive research: an Internet search.</p>
<p>Using Exalead as their Internet search tool of choice, they chose to study political corruption as a test case. They found that the Internet search results for this topic on Exalead showed a strong correlation to ‘real world’ facts regarding corruption, namely, the frequency and proximity of the word ‘corruption’ alongside various locality names and socioeconomic indicators matched known ‘real-world’ corruption linkages.</p>
<p>This reliable correlation means social scientists are likely to use Internet search statistics as a proxy for measuring local social trends that are otherwise difficult to assess (such as measurements within relatively closed societies), and certainly astute market researchers will be adding Internet search results analysis to their arsenal in determining the best markets for product launches or the best geographical distribution for campaign election funds.</p>
<p>Of course at Exalead, we’re as interested in innovative ways to use Internet search as we are pleased that these two professors assessed all the major search engines over the course of their research, and selected Exalead as the most reliable (giving high marks on reliability to Ask.com as well). The others, Simonsohn stated, either couldn’t support a single automated search or were simply too unreliable, producing radically different results from week to week. You can <a href="http://papers.ssrn.com/sol3/papers.cfm?abstract_id=990021">download the complete paper</a> from the <a href="http://www.ssrn.com/">Social Science Research Network site</a>.</p>
<p>Carole&amp;Co</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2007/08/29/exalead-a-new-addition-to-the-prediction-research-toolbox/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Road to Better Site Indexing: Episode 3, Sitemaps (based on a true story)</title>
		<link>http://blog.exalead.com/2007/08/28/episode-4-sitemaps-based-on-a-true-story/</link>
		<comments>http://blog.exalead.com/2007/08/28/episode-4-sitemaps-based-on-a-true-story/#comments</comments>
		<pubDate>Tue, 28 Aug 2007 14:21:43 +0000</pubDate>
		<dc:creator>Sébastien</dc:creator>
				<category><![CDATA[Programming]]></category>
		<category><![CDATA[Tips and tricks]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/2007/08/28/episode-4-sitemaps-based-on-a-true-story/</guid>
		<description><![CDATA[
In our prior episodes:
The crawler known as “Bot” travels across the web, moving from page to page and site to site by following links he discovers along the way. But Bot isn’t the type to let himself be led about aimlessly. He tries to imitate his hero Humphrey Bogart, who never shied away from a [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.exalead.com/wikipedia/results?q=Humphrey%20Bogart" target="_blank"><img src="wp-content/imported/images/en_US/humphreybogart.jpg" title="Humphreybogart" alt="Humphrey Bogart" style="margin: 0px 0px 5px 5px; float: right" border="0" /></a><em><br />
In our prior episodes:<br />
The crawler known as “Bot” travels across the web, moving from page to page and site to site by following links he discovers along the way. But Bot isn’t the type to let himself be led about aimlessly. He tries to imitate his hero Humphrey Bogart, who never shied away from a tangled web yet always managed to stay on the right track.</em></p>
<p>But being a perfectionist, Bot wasn’t entirely satisfied with his own method. Was he overlooking a significant thread? Leaving an important page unturned? He had a hunch he could do better.</p>
<p>Leaving important content in the dustbin of unindexed pages was just the sort of slip-up that really peeved Bot’s equally perfectionist client Betty, a.k.a. “The Webmaster.” Betty had specifically called on Bot to crawl her entire site, and Bot had missed several pages.</p>
<p>To get their relationship back on the right track, Bot had an idea: he would ask Betty to tell him flat out everything she wanted him to know about her site. And being a guy always in the know, Bot knew just what tool Betty could use to set the record straight: a sitemap.<br />
He proposed; she accepted.</p>
<p>Now Betty can rest easy knowing all the content she wants to share with the world will be indexed. And just what is this handy tool known as a sitemap?<br />
It’s actually not much more than a laundry list of links. Constructing one is a snap. You simply create a text file listing the URLs you want indexed, along with any key facts you want Bot to know (like how often a file is updated), and place it anywhere you’d like, giving Bot the location in your robots.txt file, for example at the root of your web site: http://www.example.com/sitemap.xml.</p>
<p>Sitemaps can be written in XML (the preferred method), or communicated via syndication feeds or simple text files. A sitemap in XML looks something like this:</p>
<p>&lt;urlset xmlns=&#8221;http://www.sitemaps.org/schemas/sitemap/0.9&#8243;&gt;<br />
&lt;url&gt;<br />
&lt;loc&gt;http://www.example.com/&lt;/loc&gt;<br />
&lt;lastmod&gt;2005-01-01&lt;/lastmod&gt;<br />
&lt;changefreq&gt;monthly&lt;/changefreq&gt;<br />
&lt;priority&gt;0.8&lt;/priority&gt;<br />
&lt;/url&gt;<br />
&lt;url&gt;<br />
&lt;loc&gt;http://www.example.com/catalog?item=12&amp;desc=vacation_hawaii&lt;/loc&gt;<br />
&lt;changefreq&gt;weekly&lt;/changefreq&gt;<br />
&lt;/url&gt;<br />
&lt;url&gt;<br />
&lt;loc&gt;http://www.example.com/catalog?item=83&amp;desc=vacation_usa&lt;/loc&gt;<br />
&lt;/url&gt;<br />
&lt;/urlset&gt;</p>
<p>You can visit <a href="http://www.sitemaps.org/" target="_blank">http://www.sitemaps.org/</a> for all the details. It’s the official site of the Sitemaps protocol, which was first proposed by Google, then fleshed out through discussions with MSN, Yahoo and Ask. It’s now the standard adopted by Google, Yahoo, Ask, and, as of July 2007, Exalead.<br />
But bad guys consider yourselves forewarned: Bot knows not every webmaster is not as straight up as Betty. He stays a step ahead of all nefarious sitemap tricks, checking out every list of links spun his way and skipping right over bum lists.</p>
<p>Sébastien</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2007/08/28/episode-4-sitemaps-based-on-a-true-story/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
