<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Well-formed data &#187; statistics</title>
	<atom:link href="http://well-formed-data.net/archives/tag/statistics/feed" rel="self" type="application/rss+xml" />
	<link>http://well-formed-data.net</link>
	<description>Moritz Stefaner / Visualization</description>
	<lastBuildDate>Wed, 11 Jan 2012 20:44:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Elastic lists</title>
		<link>http://well-formed-data.net/archives/54/elastic-lists</link>
		<comments>http://well-formed-data.net/archives/54/elastic-lists#comments</comments>
		<pubDate>Sun, 11 Mar 2007 22:04:40 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[elastic-lists]]></category>
		<category><![CDATA[facet-browser]]></category>
		<category><![CDATA[facets]]></category>
		<category><![CDATA[nobel-prize]]></category>
		<category><![CDATA[sparklines]]></category>
		<category><![CDATA[statistics]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/54/elastic-lists</guid>
		<description><![CDATA[Just a short post, but another demo is online. It is a demonstration of the “elastic list” principle for browsing multi-facetted data structures. Click any number of list entries to query the database for a combination of the selected attributes. If you create an “impossible” configuration, your selection will be reduced until a match is [...]]]></description>
			<content:encoded><![CDATA[<p>Just a short post, but <a href="http://well-formed-data.net/experiments/elastic_lists/">another demo is online</a>. </p>

<p><a href="http://well-formed-data.net/experiments/elastic_lists/"><img src='http://well-formed-data.net/wp-content/uploads/2007/03/nobel_480.png' alt='nobel_480.png' /></a></p>

<p>It is a demonstration of the “elastic list” principle for browsing multi-facetted data structures. Click any number of list entries to query the database for a combination of the selected attributes. If you create an “impossible” configuration, your selection will be reduced until a match is possible.</p>

<p>The example data is based on the Noble prize winners dataset used in the <a href="http://flamenco.berkeley.edu/">Flamenco facet browser</a>.</p>

<p>Elastic lists enhance traditional facet browsing approaches by 
• visualizing relative proportions (weights) of metadata values by size
• visualizing unusualness of a metadata weight by brightness
• and animated filtering transitions.</p>

<p>In unfiltered view, the brightness shows a trend measure, indicating a rising number of prices of the last years. </p>

<p>In filtered views, a brighter background indicates a higher weight of the metadata value compared to the overall distribution. </p>

<p><img src='http://well-formed-data.net/wp-content/uploads/2007/03/peace.png' alt='peace.png' /></p>

<p>If, for instance, you click “peace” as in the example above, you will see that “female” and “Switzerland” are much brighter, indicating that the proportion of women and Swiss is much higher in this context than compared to the whole data set. That’s interesting information and could also be used to characterize the result set of a keyword query or any other collection in terms of its “characteristic” metadata values. Besides that, it fosters understanding of how metadata values are correlated with each other, which is often interesting information itself. </p>

<p>You can also switch on little sparklines to see the temporal distribution of each metadata value:
<img src='http://well-formed-data.net/wp-content/uploads/2007/03/picture-7.png' alt='picture-7.png' /></p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=54&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/54/elastic-lists/feed</wfw:commentRss>
		<slash:comments>38</slash:comments>
		</item>
		<item>
		<title>Tag maps update</title>
		<link>http://well-formed-data.net/archives/42/tag-maps-update</link>
		<comments>http://well-formed-data.net/archives/42/tag-maps-update#comments</comments>
		<pubDate>Mon, 22 Jan 2007 21:49:59 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[CCA]]></category>
		<category><![CDATA[elastic]]></category>
		<category><![CDATA[PCA]]></category>
		<category><![CDATA[similarity]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[tagMaps]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/42/tag-maps-update</guid>
		<description><![CDATA[As promised, here is an update to the tag maps application I introduced below along with some explanations. For the impatient: HERE’S THE LINK (Update again: The latest version can be found here) And for the curious: Here’s the explanations: What are tags? Tags are personally chosen, free-form keywords assigned to digital contents. So instead [...]]]></description>
			<content:encoded><![CDATA[<p>As promised, here is an update to the tag maps application I introduced <a href="http://well-formed-data.net/archives/31/tag-clouds">below</a> along with some explanations.</p>

<p><a id="p41" href="http://well-formed-data.net/experiments/tag_maps/" title="Elastic tag maps experiment"><img id="image41" src="http://well-formed-data.net/wp-content/uploads/2007/01/tag_maps.jpg" alt="tag_maps.jpg" /></a></p>

<p>For the impatient: <a href="http://well-formed-data.net/experiments/tag_maps/">HERE’S THE LINK</a></p>

<p>(Update again: <a href="http://well-formed-data.net/archives/84/tag-maps-update-again"><em>The latest version can be found here</em></a>)</p>

<p>And for the curious: Here’s the explanations:<span id="more-42"></span></p>

<div class="line"></div>

<p><strong>What are tags?</strong>
Tags are personally chosen, free-form keywords assigned to digital contents. So instead of putting a bookmark into a folder, you might assign it the tags “photography nature germany lomo”. If later you are looking for sites about photography, you will find it again under this term, but also if you are searching for “nature”. Another nice aspect is that  your tagging creates an annotation to the existing content. If you share these with others (e.g. via a public bookmarking service such as <a href="http://del.icio.us">delicious</a>), everybody benefits by discovering new sites and getting better matches for their searches. </p>

<p>You can find more info about tagging <a href="http://en.wikipedia.org/wiki/Tag_%28metadata%29">e.g. at Wikipedia.</a></p>

<div class="line"></div>

<p><strong>What are tags clouds?</strong>
Tag clouds represent a whole bunch of tags as weighted lists. The more often a tag has been used, the larger it will be displayed in the list. This can be used to both characterize single users, webpages, as well as whole communities.</p>

<p>As an example, see my delicious tag cloud here:</p>

<script type="text/javascript" src="http://del.icio.us/feeds/js/tags/der_mo?icon;count=156;size=10-26;color=cccccc-000000;title=my%20del.icio.us%20tags;name;showadd"></script>

<p>Tag clouds can also be used for navigation: Click one of the tags and you will come to a web page displaying all of my bookmarks matching this tag.</p>

<div class="line"></div>

<p><strong>What’s the problem with tags clouds?</strong></p>

<p>Tag clouds are nice and really popular, but still there is quite some room for improvement:
* <strong>Tag clouds are not suited for long tail navigation:</strong> By simply adding up over time, a certain pattern consistently emerges: there will be a some dominating tags (the “big head”) and a vast number of rarely used tags (the “long tail”). Whilst the “big head” tags remain pretty constant over time and broadly characterize your interests, the “long tail” contains all the variety of things you encounter. Tag clouds visually prioritize the “big head”. However, both for browsing and for searching, access to the long tail is vital, since this is where the real information is contained. 
* <strong>Summing up over time does not represent the dynamics of interests:</strong>Additionally, it can be questioned if merely summing up tags is the right approach in general. How about topics you <em>were</em> interested in, but now you aren’t anymore? Or conversely, very recent interests, which are pretty important to you but haven’t been tagged often enough to show up in the cloud? To solve this problem, <a href="http://chir.ag/">Chirag Mehta</a> had the nice idea of implementing <a href="http://chir.ag/phernalia/preztags/">tag clouds with a time slider</a>. However, if you look at these, another problem becomes evident:
*  <strong>Tag clouds are not suitable for animation:</strong> This is due to their alphabetical list order and visual messiness. Since every tag’s position in a tag cloud is defined by its predecessor’s size and position, things start jumping around once you start scaling tags. So tag clouds are not really suited to display the dynamical nature of tagging structures — how tags appear and disappear.
* <strong> Tag clouds are ordered the wrong way:</strong> Tags denote concepts. As such, they have meaningful relations to each other. Tag clouds are ordered alphabetically or by size — it would be much more effective, if tags that belong together could also be presented together. Some of these relations can be deduced automatically, by observing how tags are used: Some tags might always appear together, others sometimes and others never. If tags co-occur frequently or have many common “neighbors”, you can be sure the concepts denoted will be related in some manner. </p>

<div class="line"></div>

<p><strong>So whatcha you gonna do about it?</strong></p>

<p>All these issues lead me to developing a mapping algorithm to analyse and display tag structures based on <strong>how tags occur together</strong>. Technically, it is based on a vector-space model, where each tagging action is assigned a point in a high-dimensional vector space. By applying the dimensionality reduction algorithms PCA (Principal Components Analysis) and CCA (Curvilinear Component Analysis), I calculate a two-dimensional map, which places frequently co-coccurring tags close together. Additionally, covariance values for all tags are stored, so I know exactly how “related” each tag is to the others. </p>

<p>This information can be used to display both maps as well as lists, which are ordered by “relatedness”. You can play around with some of the maps in the <a href="http://well-formed-data.net/experiments/tag_maps/">interactive demo</a>. </p>

<p><a id="p41" rel="attachment" class="imagelink" href="http://well-formed-data.net/archives/42/tag-maps-update/tag_mapsjpg/" title="tag_maps.jpg"><img id="image41" src="http://well-formed-data.net/wp-content/uploads/2007/01/tag_maps.jpg" alt="tag_maps.jpg" /></a></p>

<p>In its initial state, all tags are scaled according to their frequency. Clicking a tag will transform the map or list and will bring all related tags to the front according to their degree of “relatedness”.
Tags are colored according to their “freshness”. Tags are considered fresh if their average usage has increased over the last 30 days. </p>

<p>To see how that freshness measure changes over time, there is an <a href="http://well-formed-data.net/experiments/emerging_topics">animated version</a> as well described in an <a href="http://well-formed-data.net/archives/38/emerging-topics">earlier post</a>. </p>

<p>When clicking the 1D tab, you will find a list representation of the same information. The interaction principle is the same.</p>

<p><a id="p44" rel="attachment" class="imagelink" href="http://well-formed-data.net/archives/42/tag-maps-update/tag_map_1djpg/" title="tag_map_1d.jpg"><img id="image44" src="http://well-formed-data.net/wp-content/uploads/2007/01/tag_map_1d.jpg" alt="tag_map_1d.jpg" /></a></p>

<p>Additionally, I added two little diagrams depicting the “freshness” distribution and the number of occurrences for each tag. These will be filtered once you click a tag.</p>

<p><a id="p43" rel="attachment" class="imagelink" href="http://well-formed-data.net/archives/42/tag-maps-update/mini_graphsjpg/" title="mini_graphs.jpg"><img id="image43" src="http://well-formed-data.net/wp-content/uploads/2007/01/mini_graphs.jpg" alt="mini_graphs.jpg" /></a></p>

<div class="line"></div>

<p><strong>Insights gained and how to turn these experiments into usable interfaces</strong></p>

<p>I would love to write a little bit about these issues, but this post is already long enough. I hope I will find the time in the next couple of days to write a follow-up!</p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=42&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/42/tag-maps-update/feed</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
	</channel>
</rss>

