<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Well-formed data &#187; tagging</title>
	<atom:link href="http://well-formed-data.net/archives/tag/tagging/feed" rel="self" type="application/rss+xml" />
	<link>http://well-formed-data.net</link>
	<description>Moritz Stefaner / Visualization</description>
	<lastBuildDate>Wed, 11 Jan 2012 20:44:13 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>I’m in yr bookz…</title>
		<link>http://well-formed-data.net/archives/124/im-in-yr-bookz%e2%80%a6</link>
		<comments>http://well-formed-data.net/archives/124/im-in-yr-bookz%e2%80%a6#comments</comments>
		<pubDate>Thu, 24 Jan 2008 14:24:39 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Recognition]]></category>
		<category><![CDATA[Reviews]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[atomiq]]></category>
		<category><![CDATA[book]]></category>
		<category><![CDATA[tagging]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/124/im-in-yr-bookz%e2%80%a6</guid>
		<description><![CDATA[… visualizing your tags :) The “Tagging” book by Gene Smith is out. I am still awaiting a paper copy, but had a chance to look at the online version already. It looks really comprehensive, concise and covering all important tagging concepts. Which is not easy for such a moving target topic. Besides, my elastic [...]]]></description>
			<content:encoded><![CDATA[<p>… visualizing your tags :)</p>

<p><img src='http://well-formed-data.net/wp-content/uploads/2008/01/41bm4kxez8l_ss500_.jpg' alt='41bm4kxez8l_ss500_.jpg' />
<br />
The <a href="http://atomiq.org/archives/2008/01/tagging_the_book.html">“Tagging” book</a> by Gene Smith is out. I am still awaiting a paper copy, but had a chance to look at the online version already. It looks really comprehensive, concise and covering all important tagging concepts. Which is not easy for such a moving target topic. </p>

<p>Besides, my <a href="http://well-formed-data.net/experiments/tag_maps_v5/">elastic tag maps visualization</a> is featured on p.102f. Nice!</p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=124&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/124/im-in-yr-bookz%e2%80%a6/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Weinberger at the Well</title>
		<link>http://well-formed-data.net/archives/85/weinberger-at-the-well</link>
		<comments>http://well-formed-data.net/archives/85/weinberger-at-the-well#comments</comments>
		<pubDate>Thu, 31 May 2007 12:23:41 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Thesis]]></category>
		<category><![CDATA[folksonomies]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[weinberger]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/85/weinberger-at-the-well</guid>
		<description><![CDATA[There is an interesting discussion with David Weinberger (author of “Everything is miscellaneous”) going on at the Well about folksonomies and metadata.]]></description>
			<content:encoded><![CDATA[<p>There is an interesting <a href="http://www.well.com/conf/inkwell.vue/topics/300/David-Weinberger-Everything-is-M-page01.html">discussion</a> with <a href="http://www.hyperorg.com/blogger/">David Weinberger</a> (author of “<a href="http://www.everythingismiscellaneous.com/">Everything is miscellaneous</a>”) going on at the <a href="http://www.well.com/">Well</a> about folksonomies and metadata. </p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=85&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/85/weinberger-at-the-well/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Tag maps update again</title>
		<link>http://well-formed-data.net/archives/84/tag-maps-update-again</link>
		<comments>http://well-formed-data.net/archives/84/tag-maps-update-again#comments</comments>
		<pubDate>Mon, 28 May 2007 20:58:47 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[similarity]]></category>
		<category><![CDATA[tag-cloud]]></category>
		<category><![CDATA[tag-map]]></category>
		<category><![CDATA[tagging]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/84/tag-maps-update-again</guid>
		<description><![CDATA[&#62; Tag maps v5 PS: 12 days to go, wish me luck!]]></description>
			<content:encoded><![CDATA[<p><a href="http://well-formed-data.net/experiments/tag_maps_v5"><img src='http://well-formed-data.net/wp-content/uploads/2007/05/tag_maps_update.png' alt='tag_maps_update.png' /></a>
<a href="http://well-formed-data.net/experiments/tag_maps_v5">&gt; Tag maps v5</a></p>

<p>PS: 12 days to go, wish me luck!</p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=84&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/84/tag-maps-update-again/feed</wfw:commentRss>
		<slash:comments>10</slash:comments>
		</item>
		<item>
		<title>Emerging topics update</title>
		<link>http://well-formed-data.net/archives/77/emerging-topics-update</link>
		<comments>http://well-formed-data.net/archives/77/emerging-topics-update#comments</comments>
		<pubDate>Wed, 23 May 2007 21:31:38 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[emerging-topics]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[temporal-dynamics]]></category>
		<category><![CDATA[vocabulary-evolution]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/77/emerging-topics-update</guid>
		<description><![CDATA[My thesis is due pretty soon, so I am currently writing A LOT and make some on–the–go beautifications to my experiments. First one is the emerging topics histogram. I followed my own advice and vertically centered the stacked histogram. Additionally, I never liked those sharp edges, so now I do not only “fade out” tags [...]]]></description>
			<content:encoded><![CDATA[<p>My thesis is due pretty soon, so I am currently writing A LOT and make some on–the–go beautifications to my experiments.</p>

<p>First one is the emerging topics histogram. I followed my own advice and vertically centered the stacked histogram. Additionally, I never liked those sharp edges, so now I do not only “fade out” tags visually, but also fade them in, resulting in a much more organic picture, and largely improved readability of the chart. New color scheme: old tags are cold, freshly introduced ones in warm color. This is all very much inspired by the fabulous <a href="http://megamu.com/lastfm/">last.fm charts by Lee Byron</a> — thanks!</p>

<blockquote>
  <p><a href="http://well-formed-data.net/experiments/emerging_topics_v2/">new interactive version here</a></p>
</blockquote>

<p>and some pix:
<img src='http://well-formed-data.net/wp-content/uploads/2007/05/picture-12_480x229shkl.png' alt='picture-12_480×229shkl.png' />
<img src='http://well-formed-data.net/wp-content/uploads/2007/05/picture-3_480x354shkl.png' alt='picture-3_480×354shkl.png' />
<img src='http://well-formed-data.net/wp-content/uploads/2007/05/picture-13_480x388shkl.png' alt='picture-13_480×388shkl.png' />
<img src='http://well-formed-data.net/wp-content/uploads/2007/05/picture-6_480x361shkl.png' alt='picture-6_480×361shkl.png' /></p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=77&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/77/emerging-topics-update/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>You say… We say…</title>
		<link>http://well-formed-data.net/archives/72/you-say-we-say</link>
		<comments>http://well-formed-data.net/archives/72/you-say-we-say#comments</comments>
		<pubDate>Thu, 10 May 2007 18:15:38 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[intersubjectivity]]></category>
		<category><![CDATA[long-tail]]></category>
		<category><![CDATA[Personal]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[visualization]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/72/you-say%e2%80%a6-we-say%e2%80%a6</guid>
		<description><![CDATA[Another visual experiment on tagging: How do individuals use tags — compared to the community? Do you use private language or are you a mainstream tagger? When you tag with “design” — do the others think it is “art”? On the left: the personal tags for the personal bookmarks — ordered by frequency — the [...]]]></description>
			<content:encoded><![CDATA[<p>Another visual experiment on tagging: 
How do individuals use tags — compared to the community? Do you use private language or are you a mainstream tagger? When you tag with “design” — do the others think it is “art”?</p>

<div class="line"></div>

<p><img src='http://well-formed-data.net/wp-content/uploads/2007/05/ysws_04.jpg' alt='ysws_04.jpg' /></p>

<p><strong>On the left</strong>: the personal tags for the personal bookmarks — ordered by frequency — the containing box per is log-scaled so you get an impression of the long–tail position of a tag. Which means: Often used tags are large, bright and go to the top.</p>

<p><strong>On the right</strong>: community tags for the tagged ressources. </p>

<p><strong>In the middle</strong>: tags with the same name are connected. If a line is horizontal, the indvidual and the community essentially agree on the relevance of the tag for the ressources. The steeper it is — the larger the disagreement. If no line starts at a tag, it means it is not present in the other list </p>

<p>So in the picture above, you can see my tags. Same facts you can read from the picture:
• “m.a.thesis” is a very often used, but private tag of mine.
• The ressources I tag in general are mostly tagged with “design” by the community. I, however, use the tag “design” much less often.
• “news”, “semantic web”, “webdev” are tags I use often, but not the community.
etc. </p>

<div class="line"></div>

<p><img src='http://well-formed-data.net/wp-content/uploads/2007/05/ysws_01.jpg' alt='ysws_01.jpg' /></p>

<p>And often course, you can click individual tags to see what the comparison is like for subsets of the bookmarks. That’s especially interesting for obscure tags like “guru” — you can see what the tagger “means” by looking at the distribution of the community bookmarks (in this case “design — art — programmer — artist”). Interesting!</p>

<div class="line"></div>

<p>Some more shots:</p>

<p><img src='http://well-formed-data.net/wp-content/uploads/2007/05/ysws_03.jpg' alt='ysws_03.jpg' /></p>

<p><img src='http://well-formed-data.net/wp-content/uploads/2007/05/ysws_02.jpg' alt='ysws_02.jpg' /></p>

<p>I wish I could say “click here for the interactive version” as usual — but unfortunately, del.icio.us offers a JSON API, but did not put a crossdomain.xml file on their server. Which means the visualization (which runs nicely on my harddisc) cannot load data when put in the web. Bummer. I hope I can figure something out. </p>

<p>So for now — I can only offer a <a href="http://well-formed-data.net/experiments/YouSayWeSay/wfd_YouSayWeSay.zip">download link</a>. Click the index.html. You might have to adjust you Flash player security settings in order to load the community tags. Caveat: The application is still a bit buggy and pretty heavy concerning processor ressources.</p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=72&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/72/you-say-we-say/feed</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Emerging topics v2</title>
		<link>http://well-formed-data.net/archives/51/emerging-topics-v2</link>
		<comments>http://well-formed-data.net/archives/51/emerging-topics-v2#comments</comments>
		<pubDate>Sun, 18 Feb 2007 23:55:18 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[stack-chart]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[temporalDynamics]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/51/emerging-topics-v2</guid>
		<description><![CDATA[I am currently working on trends in individual tagging behaviour. You might have seen a first, animated version of my studies based on tag maps. The original animation shows the emergence of previously rarely used tags over time. Now I dug deeper and made a richer visualization for investigating this topic. For the impatient: » [...]]]></description>
			<content:encoded><![CDATA[<p>I am currently working on trends in individual tagging behaviour. You might have seen a <a href="http://well-formed-data.net/archives/38/emerging-topics">first, animated version of my studies</a> based on <a href="http://well-formed-data.net/archives/42/tag-maps-update">tag maps</a>. The original animation shows the emergence of previously rarely used tags over time. Now I dug deeper and made a richer visualization for investigating this topic.</p>

<p>For the impatient:
<a href="http://well-formed-data.net/experiments/tag_dynamics">» Check out the interactive version here</a></p>

<p>And here’s the explanation:</p>

<p>It has been <a href="http://www.citeulike.org/user/MoritzStefaner/article/984374">shown before</a> (for a plausibility argument, also check out the <a href="http://cloudalicio.us/">marvellous cloudalicious tool</a>, where you can track tag proportions for any website on the web), that tag proportions for ressources stabilize over time. Which means that the tag cloud representing a tag profile for a resource does not change much, once a sufficient number of tags has been collected. In a folksonomy, this is generally considered a good sign, since this indicates a certain agreement on how to judge a certain ressource and what vocabulary to use.  </p>

<p>For tagging individuals, and communities, this might — at first glance — hold true as well. Consider the following the visualization of a <a href="http://incom.org">tagging community</a>’s evolution, for example:</p>

<p><img src='http://well-formed-data.net/wp-content/uploads/2007/02/picture-8_480x250shkl.png' alt='picture-8_480×250shkl.png' /></p>

<p>Each tag is assigned a band, with the thickness indicating the overall summed usage of a tag over time (time runs left to right). Thus, a vertical cut through the graph corresponds to taking a tag cloud snapshot at this time point. The vertical order is based on the overall frequency of the tags. The color is used to to give an impression of the long tail distortion — if all tags would appear equally often, you would see a linear transition from red to green instead of the skewed distribution. 
So — what do we see? Apparently, most of the bands seem to grow in parallel, indicating a stable growth proportion for all tags. Of course, we cannot see much for the smaller tags, and there are some edgy parts of the graph which might indicate different behavior at specific time points, but overall — pretty stable impression. </p>

<p>However, this does not make much sense. For individuals and communities, the topics of interest evolve over time, so there must be some hidden variability not captured by the visualization and the underlying linear model.</p>

<p>So I decided to provide an alternative visualization for the data based on a decay model, where tags “age” over time and finally get “forgotten” if they are not used anymore. This idea is loosely based on the Yules-Simon memory model for tag generation presented in <a href="http://www.citeulike.org/user/MoritzStefaner/article/695889">this paper</a>.  </p>

<p><img src='http://well-formed-data.net/wp-content/uploads/2007/02/picture-7_480x266shkl.png' alt='picture-7_480×266shkl.png' />
A radically different picture emerges. Not only does the overall shape now nicely display phases of community activity over time, but also the life cycle of single tags is much more transparent. You can rollover single layers highlight it and display the corresponding tag name. Great fun. </p>

<p><a href="http://well-formed-data.net/experiments/tag_dynamics">» Check out the interactive version here</a></p>

<p>What I am now curious about:
– Is there a correlation between time-dependency and over-all frequency of tags? In other words, are frequent tags more evely distributed over time, whilst the low frequency tags tend to be more variable over time?
– Is there a correlation between temporal synchronization and general co-occurrence? Which means — do related tags also appear and disappear together over time?</p>

<p>I think the answer is YES to both questions, but that would definitely need some statistical analysis (any bored neuroscientists around to help me? ;)</p>

<p>To-dos for the visualization:
– Implement a slider, so you can see how a linear and decayed tag cloud would have looked like at a specific time point.
– <a href="http://stamen.com/projects/trace">Stamen got it right</a>: Maybe I should have plotted from the vertical center. Or at least provide an optional inversion of the sorting. Because right now, all the top (green) layers are really distorted, making visual analysis really hard.
– Put some numbers on the axis 
– Show single tagging events on rollover. Or even “unfold” the layer to improve readability and avoid misconceptions.</p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=51&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/51/emerging-topics-v2/feed</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Husserl and tagging</title>
		<link>http://well-formed-data.net/archives/45/husserl-and-tagging</link>
		<comments>http://well-formed-data.net/archives/45/husserl-and-tagging#comments</comments>
		<pubDate>Wed, 31 Jan 2007 19:30:39 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Reviews]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[community]]></category>
		<category><![CDATA[husserl]]></category>
		<category><![CDATA[intersubjectivity]]></category>
		<category><![CDATA[Personal]]></category>
		<category><![CDATA[semantic-web]]></category>
		<category><![CDATA[tagging]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/45/husserl-and-tagging</guid>
		<description><![CDATA[A very nice paper on the “laissez-faire librarianship” often associated with tagging vs. more structured semantic web approaches. Most notable is that the discussion is put in the context of Husserl’s theory of reflections, intentionality and intersubjectivity. A PHENOMENOLOGICAL FRAMEWORK FOR THE RELATIONSHIP BETWEEN THE SEMANTIC WEB AND USER-CENTERED TAGGING SYSTEMS D. Grant Campbell Faculty [...]]]></description>
			<content:encoded><![CDATA[<p>A very nice paper on the “laissez-faire librarianship” often associated with tagging vs. more structured semantic web approaches. Most notable is that the discussion is put in the context of <a href="http://plato.stanford.edu/entries/husserl/">Husserl’s</a> theory of reflections, intentionality and intersubjectivity. </p>

<p><a href="http://www.slais.ubc.ca/users/sigcr/sigcr-06campbell.pdf">A PHENOMENOLOGICAL FRAMEWORK FOR THE RELATIONSHIP BETWEEN THE 
SEMANTIC WEB AND USER-CENTERED TAGGING SYSTEMS</a></p>

<p>D. Grant Campbell <gcampbel @uwo.ca> 
Faculty of Information and Media Studies 
University of Western Ontario<br />
London, Ontario N6A 5B7, Canada </gcampbel></p>

<blockquote>Abstract 
 
This paper uses Husserl’s theory of phenomenology to provide a model for the relationship 
between user-centered tagging systems, such as del.icio.us, and the more highly structured 
systems of the Semantic Web.  Using three aspects of phenomenological theory—the movement 
of the mind out towards an entity and then back in an act of reflection, multiplicities within unity, 
and the sharing of intentionalities within a community—the discussion suggests that both tagging 
systems and the Semantic Web foster an intersubjective domain for the sharing and use of 
information resources.  The Semantic Web, however, resembles traditional library systems, in 
that it relies for this intersubjective domain on the conscious implementation of domain-centered 
standards which are then encoded for machine processing, while tagging systems work on 
implied principles of emergence. </blockquote>

<p></p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=45&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/45/husserl-and-tagging/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Tag maps update</title>
		<link>http://well-formed-data.net/archives/42/tag-maps-update</link>
		<comments>http://well-formed-data.net/archives/42/tag-maps-update#comments</comments>
		<pubDate>Mon, 22 Jan 2007 21:49:59 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[CCA]]></category>
		<category><![CDATA[elastic]]></category>
		<category><![CDATA[PCA]]></category>
		<category><![CDATA[similarity]]></category>
		<category><![CDATA[statistics]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[tagMaps]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/42/tag-maps-update</guid>
		<description><![CDATA[As promised, here is an update to the tag maps application I introduced below along with some explanations. For the impatient: HERE’S THE LINK (Update again: The latest version can be found here) And for the curious: Here’s the explanations: What are tags? Tags are personally chosen, free-form keywords assigned to digital contents. So instead [...]]]></description>
			<content:encoded><![CDATA[<p>As promised, here is an update to the tag maps application I introduced <a href="http://well-formed-data.net/archives/31/tag-clouds">below</a> along with some explanations.</p>

<p><a id="p41" href="http://well-formed-data.net/experiments/tag_maps/" title="Elastic tag maps experiment"><img id="image41" src="http://well-formed-data.net/wp-content/uploads/2007/01/tag_maps.jpg" alt="tag_maps.jpg" /></a></p>

<p>For the impatient: <a href="http://well-formed-data.net/experiments/tag_maps/">HERE’S THE LINK</a></p>

<p>(Update again: <a href="http://well-formed-data.net/archives/84/tag-maps-update-again"><em>The latest version can be found here</em></a>)</p>

<p>And for the curious: Here’s the explanations:<span id="more-42"></span></p>

<div class="line"></div>

<p><strong>What are tags?</strong>
Tags are personally chosen, free-form keywords assigned to digital contents. So instead of putting a bookmark into a folder, you might assign it the tags “photography nature germany lomo”. If later you are looking for sites about photography, you will find it again under this term, but also if you are searching for “nature”. Another nice aspect is that  your tagging creates an annotation to the existing content. If you share these with others (e.g. via a public bookmarking service such as <a href="http://del.icio.us">delicious</a>), everybody benefits by discovering new sites and getting better matches for their searches. </p>

<p>You can find more info about tagging <a href="http://en.wikipedia.org/wiki/Tag_%28metadata%29">e.g. at Wikipedia.</a></p>

<div class="line"></div>

<p><strong>What are tags clouds?</strong>
Tag clouds represent a whole bunch of tags as weighted lists. The more often a tag has been used, the larger it will be displayed in the list. This can be used to both characterize single users, webpages, as well as whole communities.</p>

<p>As an example, see my delicious tag cloud here:</p>

<script type="text/javascript" src="http://del.icio.us/feeds/js/tags/der_mo?icon;count=156;size=10-26;color=cccccc-000000;title=my%20del.icio.us%20tags;name;showadd"></script>

<p>Tag clouds can also be used for navigation: Click one of the tags and you will come to a web page displaying all of my bookmarks matching this tag.</p>

<div class="line"></div>

<p><strong>What’s the problem with tags clouds?</strong></p>

<p>Tag clouds are nice and really popular, but still there is quite some room for improvement:
* <strong>Tag clouds are not suited for long tail navigation:</strong> By simply adding up over time, a certain pattern consistently emerges: there will be a some dominating tags (the “big head”) and a vast number of rarely used tags (the “long tail”). Whilst the “big head” tags remain pretty constant over time and broadly characterize your interests, the “long tail” contains all the variety of things you encounter. Tag clouds visually prioritize the “big head”. However, both for browsing and for searching, access to the long tail is vital, since this is where the real information is contained. 
* <strong>Summing up over time does not represent the dynamics of interests:</strong>Additionally, it can be questioned if merely summing up tags is the right approach in general. How about topics you <em>were</em> interested in, but now you aren’t anymore? Or conversely, very recent interests, which are pretty important to you but haven’t been tagged often enough to show up in the cloud? To solve this problem, <a href="http://chir.ag/">Chirag Mehta</a> had the nice idea of implementing <a href="http://chir.ag/phernalia/preztags/">tag clouds with a time slider</a>. However, if you look at these, another problem becomes evident:
*  <strong>Tag clouds are not suitable for animation:</strong> This is due to their alphabetical list order and visual messiness. Since every tag’s position in a tag cloud is defined by its predecessor’s size and position, things start jumping around once you start scaling tags. So tag clouds are not really suited to display the dynamical nature of tagging structures — how tags appear and disappear.
* <strong> Tag clouds are ordered the wrong way:</strong> Tags denote concepts. As such, they have meaningful relations to each other. Tag clouds are ordered alphabetically or by size — it would be much more effective, if tags that belong together could also be presented together. Some of these relations can be deduced automatically, by observing how tags are used: Some tags might always appear together, others sometimes and others never. If tags co-occur frequently or have many common “neighbors”, you can be sure the concepts denoted will be related in some manner. </p>

<div class="line"></div>

<p><strong>So whatcha you gonna do about it?</strong></p>

<p>All these issues lead me to developing a mapping algorithm to analyse and display tag structures based on <strong>how tags occur together</strong>. Technically, it is based on a vector-space model, where each tagging action is assigned a point in a high-dimensional vector space. By applying the dimensionality reduction algorithms PCA (Principal Components Analysis) and CCA (Curvilinear Component Analysis), I calculate a two-dimensional map, which places frequently co-coccurring tags close together. Additionally, covariance values for all tags are stored, so I know exactly how “related” each tag is to the others. </p>

<p>This information can be used to display both maps as well as lists, which are ordered by “relatedness”. You can play around with some of the maps in the <a href="http://well-formed-data.net/experiments/tag_maps/">interactive demo</a>. </p>

<p><a id="p41" rel="attachment" class="imagelink" href="http://well-formed-data.net/archives/42/tag-maps-update/tag_mapsjpg/" title="tag_maps.jpg"><img id="image41" src="http://well-formed-data.net/wp-content/uploads/2007/01/tag_maps.jpg" alt="tag_maps.jpg" /></a></p>

<p>In its initial state, all tags are scaled according to their frequency. Clicking a tag will transform the map or list and will bring all related tags to the front according to their degree of “relatedness”.
Tags are colored according to their “freshness”. Tags are considered fresh if their average usage has increased over the last 30 days. </p>

<p>To see how that freshness measure changes over time, there is an <a href="http://well-formed-data.net/experiments/emerging_topics">animated version</a> as well described in an <a href="http://well-formed-data.net/archives/38/emerging-topics">earlier post</a>. </p>

<p>When clicking the 1D tab, you will find a list representation of the same information. The interaction principle is the same.</p>

<p><a id="p44" rel="attachment" class="imagelink" href="http://well-formed-data.net/archives/42/tag-maps-update/tag_map_1djpg/" title="tag_map_1d.jpg"><img id="image44" src="http://well-formed-data.net/wp-content/uploads/2007/01/tag_map_1d.jpg" alt="tag_map_1d.jpg" /></a></p>

<p>Additionally, I added two little diagrams depicting the “freshness” distribution and the number of occurrences for each tag. These will be filtered once you click a tag.</p>

<p><a id="p43" rel="attachment" class="imagelink" href="http://well-formed-data.net/archives/42/tag-maps-update/mini_graphsjpg/" title="mini_graphs.jpg"><img id="image43" src="http://well-formed-data.net/wp-content/uploads/2007/01/mini_graphs.jpg" alt="mini_graphs.jpg" /></a></p>

<div class="line"></div>

<p><strong>Insights gained and how to turn these experiments into usable interfaces</strong></p>

<p>I would love to write a little bit about these issues, but this post is already long enough. I hope I will find the time in the next couple of days to write a follow-up!</p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=42&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/42/tag-maps-update/feed</wfw:commentRss>
		<slash:comments>14</slash:comments>
		</item>
		<item>
		<title>Emerging topics</title>
		<link>http://well-formed-data.net/archives/38/emerging-topics</link>
		<comments>http://well-formed-data.net/archives/38/emerging-topics#comments</comments>
		<pubDate>Sun, 10 Dec 2006 22:46:48 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Experiments]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[clouds]]></category>
		<category><![CDATA[elastic]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[tagMaps]]></category>
		<category><![CDATA[temporalDynamics]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/38/emerging-topics</guid>
		<description><![CDATA[You might have seen the tag clouds posted below. I calculate tag positions based on co-occurrence, such that tags used together are placed closer to each other. Additionally, tags are scaled áccording to frequency. A general problem I have with the resulting representation (and common tag clouds as well) is the fact, that every tag [...]]]></description>
			<content:encoded><![CDATA[<p><a id="p37" href="http://well-formed-data.net/experiments/emerging_topics" ><img id="image37" src="http://well-formed-data.net/wp-content/uploads/2006/12/picture-8_480x336shkl.png" alt="picture-8_480x336shkl.png" width=480 height=336 /></a>
You might have seen the <a href="http://der-mo.net/tagClouds2/">tag clouds</a> posted below. I calculate tag positions based on co-occurrence, such that tags used together are placed closer to each other. Additionally, tags are scaled áccording to frequency.<br />
A general problem I have with the resulting representation (and common tag clouds as well) is the fact, that every tag occurrence is weighted equally. As a result, these tag clouds never represent the current state of interest, but a very sluggishly changing summary of your archive. However, your interests and the corresponding vocabulary keeps moving on.
So I am currently investigating trends in tag clouds and how groups of related tags emerge and disappear again. </p>

<p><a href="http://well-formed-data.net/experiments/emerging_topics">A first glimpse into the dynamical nature of tag structures.</a></p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=38&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/38/emerging-topics/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Papers on tagging</title>
		<link>http://well-formed-data.net/archives/36/papers-on-tagging</link>
		<comments>http://well-formed-data.net/archives/36/papers-on-tagging#comments</comments>
		<pubDate>Fri, 08 Dec 2006 11:37:37 +0000</pubDate>
		<dc:creator>Moritz Stefaner</dc:creator>
				<category><![CDATA[Reviews]]></category>
		<category><![CDATA[Thesis]]></category>
		<category><![CDATA[papers]]></category>
		<category><![CDATA[review]]></category>
		<category><![CDATA[tagging]]></category>

		<guid isPermaLink="false">http://well-formed-data.net/archives/36/papers-on-tagging</guid>
		<description><![CDATA[I am currently organizing my literature on tagging. When I started with research on that topic about a year ago, you could count the number of substantial contributions on one hand. Over the current year, however, the number of papers on this topic has sky-rocketed, which makes the whole area hard to oversee at the [...]]]></description>
			<content:encoded><![CDATA[<p>I am currently organizing my literature on tagging. When I started with research on that topic about a year ago, you could count the number of substantial contributions on one hand. Over the current year, however, the number of papers on this topic has sky-rocketed, which makes the whole area hard to oversee at the moment.</p>

<p>I will start with an overview of my subjective must-reads for now (all linked to citeulike):</p>

<p><span id="more-36"></span>
<a href="http://www.citeulike.org/user/MoritzStefaner/article/984374" target="_blank">Position Paper, Tagging, Taxonomy, Flickr, Article, ToRead </a>
Marlow C, Naaman M, Boyd D, Davis M 
Produced for the WWW06 workshop, this paper was an essential contribution to characterizing different tagging systems and the resulting tagging dynamics. </p>

<p><a href="http://www.citeulike.org/user/MoritzStefaner/article/305755" target="_blank">The Structure of Collaborative Tagging Systems </a>
Golder S, Huberman BA 
A seminal reference paper introducing tagging in the scientific community. The most notable contribution are the insights the authors provide into the emergence of stable patterns in tag proportions over time. </p>

<p><a href="%0Ahttp://www.citeulike.org/user/MoritzStefaner/article/468899" target="_blank">Folksonomies: Tidying up Tags? </a>
Guy M, Tonkin E 
A discussion of principle shortcomings of tagging and possible resolutions. </p>

<p><a href="http://www.citeulike.org/user/MoritzStefaner/article/117727" target="_blank">The vocabulary problem in human-system communication </a>
Furnas GW, Landauer TK, Gomez LM, Dumais ST 
Although not directly applicable to tagging, this paper from the late 80s is the baseline for experiments with free-form naming and annotation in the digital realm. </p>

<p><a href="http://www.citeulike.org/user/MoritzStefaner/article/984400" target="_blank">Magic Words: How Language Augments Human Computation</a> 
Clark A 
A great discussion of words as cognitive tools from a cognitive science and philosophy perspective. </p>

<p><a href="http://www.citeulike.org/user/MoritzStefaner/article/695889" target="_blank">Collaborative Tagging and Semiotic Dynamics </a>
Cattuto C, Loreto V, Pietronero L 
An inspiring discussion of the temporal dynamics of tagging systems. The core contribution is a very simple probabilistic model based on  Yule-Simon memory model. Although many questions remain open, this model has some direct consequences on dealing with tags in user interfaces, which I will elaborate on in a later post.</p>
<img src="http://well-formed-data.net/?ak_action=api_record_view&id=36&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://well-formed-data.net/archives/36/papers-on-tagging/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

