<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Exalead Blog &#187; Exalabs</title>
	<atom:link href="http://blog.exalead.com/category/exalabs/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.exalead.com</link>
	<description>The blog of Exalead</description>
	<lastBuildDate>Wed, 10 Mar 2010 13:33:17 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Transforming a demo into a full-scale production-ready application</title>
		<link>http://blog.exalead.com/2009/11/10/transforming-a-demo-into-a-full-scale-production-ready-application/</link>
		<comments>http://blog.exalead.com/2009/11/10/transforming-a-demo-into-a-full-scale-production-ready-application/#comments</comments>
		<pubDate>Tue, 10 Nov 2009 09:18:17 +0000</pubDate>
		<dc:creator>Sébastien</dc:creator>
				<category><![CDATA[Exalabs]]></category>
		<category><![CDATA[New products & features]]></category>
		<category><![CDATA[Products]]></category>
		<category><![CDATA[Programming]]></category>
		<category><![CDATA[Technology]]></category>
		<category><![CDATA[chromatik]]></category>
		<category><![CDATA[color search]]></category>
		<category><![CDATA[image search]]></category>
		<category><![CDATA[labs]]></category>
		<category><![CDATA[production]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/?p=1053</guid>
		<description><![CDATA[Jean Marc brought you  a very delightful post about Chromatik last week with a lot of beautiful images. I will now describe in more detail how it was built. As with the DVD you perhaps watched last night, I am afraid there will be fewer big special effects in this blog post than in [...]]]></description>
			<content:encoded><![CDATA[<p>Jean Marc brought you  <a href="../2009/10/27/chromatik-adds-color-to-exalead%E2%80%99s-image-search/" target="_blank">a very delightful post about Chromatik</a> last week with a lot of beautiful images. I will now describe in more detail how it was built. As with the DVD you perhaps watched last night, I am afraid there will be fewer big special effects in this blog post than in Jean Marc’s post, but I hope to give you an insightful view of what happened behind the scenes.</p>
<p><a href="http://chromatik.labs.exalead.com/" target="_blank">Chromatik</a> was an elaborate demo, the result of a long effort on both the back-end and the front-end. It indexes one million images. For each image, a unique color signature was built and indexed. Our current intuitive user interface, exploits this index to help you filter and select images by choosing a combination of colors, luminosity or text.</p>
<p>A large number of people tried and liked the Chromatik demo so much that we received several requests to integrate it into the official Exalead search site. And because the demo ran relatively bug free and smoothly, our friends thought it was a piece of cake. Of course, it was a bit more work than we initially expected. So where are the challenges?</p>
<p><strong>1)  The front-end side</strong></p>
<p>A lot of questions needed to be answered:</p>
<ul>
<li>How will I adapt the GUI of my      application to integrate the new features?</li>
<li>Are all these new features      necessary?</li>
<li>What is the feedback we’ve      received on the different features?</li>
<li>What is the added value of      these features?</li>
</ul>
<p>The answers to these questions will impact the total amount of space on the GUI we will take for surfacing them.</p>
<p><strong>2) The back-end side</strong></p>
<p>Let’s begin with a little theory:</p>
<p><strong>Theorem of the factor 10 effect:</strong><br />
<em>No matter how good a developer you are, if  non-trivial code has been designed and tested with only N elements, it won’t work without modifications when applied to 10 * N elements.<br />
</em><br />
<strong>Demonstration: </strong>Rather simple: if you don’t believe it, try it yourself…</p>
<p>In this case we wanted a factor 1000, so we knew it would need some adjustments. When you know this theorem, the advantage is that you can anticipate potential problems. And the experience we have accumulated from similar situations at Exalead help us predict most of the bottlenecks.</p>
<p><strong>Example 1:</strong> Chromatik needed 300MB RAM, which is quite good for 1M images. But, if you multiply this number by 2000, you have 600GB RAM, which is quite large, even if  the final index is distributed over multiple machines.<br />
We therefore decided to reduce the richness of the colors, while maintaining usability, migrate from version 4.6 to version 5.0 of Exalead CloudView, and use a more compressed encoding. In the end, it now only costs 9GB.</p>
<p><strong>Example 2:</strong> When you want to analyze two billion images, you need to have a robust code, which means that’s able to handle all sort of images even those which do not have a valid RFC. It’s not that easy, when even the most used library in the world for basic image manipulation can crash on some images <a href="http://bugs.libgd.org/?do=details&amp;task_id=86" target="_blank">as we reported</a>.<br />
The result was that this run spotted some bugs in our code we hadn’t seen before and therefore had to fix.</p>
<p><strong>Example 3:</strong> The demo was initially a single machine application. We needed to use the distributed system framework included in the CloudView technology to be able to run the whole process of extracting, crawling, and indexing in only a few weeks. This framework really helped us transform the single machine demo to a fully load-balanced and monitored application. This use case is a little different than our standard www.exalead.com chain, so we discovered and tweaked a few cumbersome points in the code.</p>
<p>The purpose of this integration was to offer a new service to the users of the exalead.com search engine and improve the robustness of the Chromatik technology. We now better understand the impact of different tweaks on color indexing.</p>
<p>Transforming a demo into a real product is not as easy as it seems. I hope this post helps you understand why a lot of companies only show you demos but never real live applications.</p>
<p>At Exalead, we don’t sell demos to our customers; we sell tested and robust solutions. We make sure we work hard to test and uncover all the issues so our customers’ implementations go smoothly.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2009/11/10/transforming-a-demo-into-a-full-scale-production-ready-application/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Will all information be video in the future?</title>
		<link>http://blog.exalead.com/2009/11/05/will-all-information-be-video-in-the-future/</link>
		<comments>http://blog.exalead.com/2009/11/05/will-all-information-be-video-in-the-future/#comments</comments>
		<pubDate>Thu, 05 Nov 2009 14:16:15 +0000</pubDate>
		<dc:creator>Gregory</dc:creator>
				<category><![CDATA[Exalabs]]></category>
		<category><![CDATA[Technology]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/?p=1083</guid>
		<description><![CDATA[It is hard to detect trends in what is happening all around you, but the following major institution clearly shows a world moving from text and fixed images to video.
Press photography is about to undergo fundamental changes in its business model. Up until now, it has been “buy photo content, store content, wait for client, [...]]]></description>
			<content:encoded><![CDATA[<p>It is hard to detect trends in what is happening all around you, but the following major institution clearly shows a world moving from text and fixed images to video.</p>
<p>Press photography is about to undergo fundamental changes in its business model. Up until now, it has been “buy photo content, store content, wait for client, sell photo”. They have seen the signs, or possibly heard Clay Shirky’s <a href="http://www.shirky.com/weblog/2009/03/newspapers-and-thinking-the-unthinkable/">message about the death of old business models in the newspaper industry</a></p>
<blockquote><p>&#8230;there is one possible answer to the question “If the old model is broken, what will work in its place?” The answer is: Nothing will work, but everything might. Now is the time for experiments, lots and lots of experiments&#8230;</p></blockquote>
<p>Press photography is soon to experiment with a new business model: moving from content owner to content producer and distributor, moving from photos to videos.</p>
<p>They plan (I learned this last night from a friend in the photography world) to no longer wait for customers to come buy their photos, but rather to harvest the <a href="http://www.ted.com/talks/lang/eng/clay_shirky_how_cellphones_twitter_facebook_can_make_history.html"> vast amount of locally produced, real time news images</a> and to use their photojournalistic experience to filter and edit and package to produce their own content: story, video, editing. They plan to sell or give away a packaged widget to (online) news outlets. Activating the widget will send control back to the press photography site, which then streams its story and manages it own advertising revenue from the stream.</p>
<p>What does this shift portend for information retrieval?</p>
<p>Information retrieval began as text based storage, gradually <a href="http://www.exalead.com/software/common/pdfs/products/cloudview/Exalead-Connectors-and-Formats.pdf">incorporating other formats, including structured textual information from databases</a>.</p>
<p>Then search engines began to index photos and videos. Google added its Image tab to its home page around <a href="http://web.archive.org/web/20011009024651/http://images.google.com/">October 9, 2001</a>, and its Video tab around <a href="http://">August 14, 2006</a>. We added our Exalead Video search tab about <a href="http://web.archive.org/web/20070703175954/http://www.exalead.com/search/">a year later</a>.</p>
<p>These first video search systems used the same methods as with image search, using the text found around the raw data sources, and the text in file names, and in links pointing to the multimedia file, to serve as index terms.</p>
<p>But as more and more information may only be presented in video form, it will be necessary to <a href="http://www.vecsysresearch.fr/technos.html">index the content of videos</a>, too. We do this in our experimental  <a href="http://voxaleadnews.labs.exalead.com/">video search system, VoxaleadNews</a>.</p>
<p>But what I didn&#8217;t realize until last night is that systems like Voxalead might not just be some cute add-on to a search engine, but that this might actually be what future search engines are: that information might only come in video, and that things like text and even still images, will become some quaint, outdated media, like engravings are now.</p>
<p><img class="aligncenter size-medium wp-image-1123" src="http://blog.exalead.com/wp-content/uploads/2009/11/Grko_engraving_1885-300x233.jpg" alt="Grko_engraving_1885" width="300" height="233" /></p>
<p><span id="more-1083"></span></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2009/11/05/will-all-information-be-video-in-the-future/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Exalead’s Voxalead Video Search Named “Most Practical” Innovation</title>
		<link>http://blog.exalead.com/2009/10/30/exalead%e2%80%99s-voxalead-video-search-named-%e2%80%9cmost-practical%e2%80%9d-innovation/</link>
		<comments>http://blog.exalead.com/2009/10/30/exalead%e2%80%99s-voxalead-video-search-named-%e2%80%9cmost-practical%e2%80%9d-innovation/#comments</comments>
		<pubDate>Fri, 30 Oct 2009 14:22:02 +0000</pubDate>
		<dc:creator>Carole</dc:creator>
				<category><![CDATA[Exalabs]]></category>
		<category><![CDATA[News]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/?p=1023</guid>
		<description><![CDATA[It’s official! Exalead’s Voxalead News video search has been named “Most Practical” Innovation in ACM’s Multimedia Grand Challenge 2009.



An innovation produced by Exalead’s Exalabs research laboratory, Voxalead News lets you search for keywords inside videos, rather than simply searching limited external information like titles or descriptions. A tremendous timesaver, Voxalead further lets you jump right [...]]]></description>
			<content:encoded><![CDATA[<p style="text-align: left;">It’s official! Exalead’s <a href="http://voxaleadnews.labs.exalead.com/">Voxalead News</a> video search has been named “Most Practical” Innovation in <a href="http://comminfo.rutgers.edu/conferences/mmchallenge/2009/10/26/winners-of-the-multimedia-grand-challenge-2009/">ACM’s Multimedia Grand Challenge 2009</a>.</p>
<p style="text-align: left;">
<p style="text-align: center;"><a href="http://blog.exalead.com/wp-content/uploads/2009/10/diploma.jpg"><img class="size-medium wp-image-1043 aligncenter" title="diploma" src="http://blog.exalead.com/wp-content/uploads/2009/10/diploma-300x217.jpg" alt="diploma" width="300" height="217" /></a></p>
<p style="text-align: left;">
<p style="text-align: left;">An innovation produced by Exalead’s <a href="http://labs.exalead.com/">Exalabs</a> research laboratory, <a href="http://voxaleadnews.labs.exalead.com/">Voxalead News</a> lets you search for keywords <strong><em>inside </em></strong>videos, rather than simply searching limited external information like titles or descriptions. A tremendous timesaver, Voxalead further lets you jump right to the point in the video in which your search term is used! The Voxalead demonstration currently offers search in four languages (English, French, Mandarin Chinese and Arabic) across a select set of news sources.</p>
<p style="text-align: left;">In addition to captivating news aficionados, this ‘most practical’ innovation is capturing the attention of business professionals in sectors such as Media, Publishing, eDiscovery, Competitive Intelligence, and Social Networking and Content Sharing.</p>
<p style="text-align: left;">For more information, visit the <a href="http://comminfo.rutgers.edu/conferences/mmchallenge/2009/10/26/winners-of-the-multimedia-grand-challenge-2009/">Multimedia Grand Challenge 2009</a> Web page. You can also <a href="http://voxaleadnews.labs.exalead.com/">test drive Voxalead News</a> and other Exalead innovations at the <a href="http://labs.exalead.com/">Exalabs</a> site.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2009/10/30/exalead%e2%80%99s-voxalead-video-search-named-%e2%80%9cmost-practical%e2%80%9d-innovation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sourcier: A New Exalabs &#8216;Geo-Search&#8217; Demo</title>
		<link>http://blog.exalead.com/2009/10/20/sourcier-a-new-exalabs-geo-search-demo/</link>
		<comments>http://blog.exalead.com/2009/10/20/sourcier-a-new-exalabs-geo-search-demo/#comments</comments>
		<pubDate>Tue, 20 Oct 2009 08:55:36 +0000</pubDate>
		<dc:creator>Carole</dc:creator>
				<category><![CDATA[Exalabs]]></category>
		<category><![CDATA[New products & features]]></category>
		<category><![CDATA[brgm]]></category>
		<category><![CDATA[demo]]></category>
		<category><![CDATA[geo search]]></category>
		<category><![CDATA[sourcier]]></category>

		<guid isPermaLink="false">http://blog.exalead.com/?p=987</guid>
		<description><![CDATA[More Mappable Data, and Map-Hungry Users
More and more information on the Internet is ‘mappable’:

The physical addresses of offices or stores


Wikipedia articles (country/city info, geo-science articles, etc.)


Photos and videos with content of local interest on “Web 2.0” content-sharing sites


POI (Point Of Interest) services like tourism portals (attractions, hotels, etc.)

Naturally enough, numerous services have recently appeared allowing [...]]]></description>
			<content:encoded><![CDATA[<p><strong>More Mappable Data, and Map-Hungry Users</strong><br />
More and more information on the Internet is ‘mappable’:</p>
<ul>
<li>The physical addresses of offices or stores</li>
</ul>
<ul>
<li>Wikipedia articles (country/city info, geo-science articles, etc.)</li>
</ul>
<ul>
<li>Photos and videos with content of local interest on “Web 2.0” content-sharing sites</li>
</ul>
<ul>
<li>POI (Point Of Interest) services like tourism portals (attractions, hotels, etc.)</li>
</ul>
<p>Naturally enough, numerous services have recently appeared allowing Net surfers to view search results on a map, bridging the gap between the ‘real world’ and Web content. No longer satisfied with viewing mappable information presented in an old-fashioned laundry list of search results, users today are hungry to search for information within and through a map. And Exalead is eager to join users in exploring the boundaries of map-based search. We’ve therefore added a new geo-spatial search demo, <strong>Sourcier</strong>, to our <a href="http://labs.exalead.com/" target="_blank">Exalabs</a> site.</p>
<p><strong>Map-Based Search with Exalead</strong><br />
Sourcier was produced in collaboration with France’s leading public earth science institution, BRGM (Bureau of Geological and Mining Research). It provides map-centered search for metropolitan France groundwater resources using publicly available data from the <a href="http://ades.eaufrance.fr/" target="_blank">ADES site</a>.</p>
<p>Sourcier enables users to search water resources and water-related points of interest (along with associated scientific data) according to:</p>
<ul>
<li> A chemical component, environmental factor, or micro-biological element</li>
</ul>
<ul>
<li> A concentration range</li>
</ul>
<ul>
<li> A time period</li>
</ul>
<ul>
<li> A geographical area, by navigating the map using drawing tools, or specifying an administrative area (region, county or municipality)</li>
</ul>
<p>Results are displayed directly on the map along with markers for exploring related data in a graphical form.</p>
<p style="text-align: center;"><a href="http://blog.exalead.fr/wp-content/uploads/sourcier1.jpg"><img class="aligncenter" title="Points of interest for the city of Lyon and the Brittany region " src="http://blog.exalead.fr/wp-content/uploads/sourcier1.jpg" alt="" width="300" height="180" /></a></p>
<p style="text-align: center;"><a href="http://blog.exalead.fr/wp-content/uploads/sourcier2.jpg"><img class="aligncenter" title="Points of interest for the town of Meung-sur-Loire with a graph of related data" src="http://blog.exalead.fr/wp-content/uploads/sourcier2.jpg" alt="" width="300" height="183" /></a><strong></strong></p>
<p style="text-align: left;"><strong>To Come&#8230;</strong><br />
Forthcoming indexation and geo-search experiments using Exalead technology will involve support for more complex geometric objects and distance calculations to support extrapolation to different geometric shapes (for example, using a specified radius to manipulate maximum distance).</p>
<p>The indexation of geometric objects  such as polygons would permit one, for example, to search London parks and reuse their geo-spatial ‘fingerprints’ on the map as search criteria, and to retrieve related Web data like geo-tagged photos or videos.
</p>
<p style="text-align: center;"><a href="http://blog.exalead.fr/wp-content/uploads/londres.jpg"><img class="aligncenter" title="Search of London parks" src="http://blog.exalead.fr/wp-content/uploads/londres.jpg" alt="" width="300" height="182" /></a></p>
<p>Sourcier is available at: <a href="http://sourcier.labs.exalead.com/" target="_blank">http://sourcier.labs.exalead.com/</a></p>
<p>(Note: the service is currently available in French.)</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.exalead.com/2009/10/20/sourcier-a-new-exalabs-geo-search-demo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
