It’s alive! Cloudalicio.us is back online

November 16th, 2006

In preparing slides for my talk earlier today (770kb pdf), I was grabbing some screenshots from del.icio.us and rummaging through some of my old Cloudalicio.us graphs from before we went offline in March.

In clicking around at del.icio.us, I realized that the /url history pages are making available (again) the full post history of the users who have bookmarked that URL.

Before the shutdown in March, the /url history pages were displaying full historical posting data, providing the three entities that make up folksonomy (person, item, tags) as well as the date on which the tagging occurred. Cloudalicio.us would grab that data, parse it, and then display a graph allowing the visitor to see how the del.icio.us-using-public’s opinion of that page had changed over time.

Today, I realized that the full folksonomy data was again available (person, item, tags) but the date data is now compressed into postings by month. And that’s okay - it’s actually less points to graph, so the graphs should generate a little more quickly now. Additionally, I’ve set the cache to 15 days instead of 1 day as daily updates wouldn’t look that different on these new graphs anyways.

So, Behold, we’re back! Everybody say “Welcome back, Cloudy”!

Oh, and I’ve updated the greasemonkey script (v3) as well.


we’re offline - del.icio.us site structure has changed

March 10th, 2006

I am sad to report today that del.icio.us has changed the format and content of its /url information. This has rendered Cloudalicious largely inoperable as the historical posting data for a particular URL is no longer available.

I am hoping the value of Cloudalicious is large enough that either an agreement can be worked out with del.icio.us/Yahoo! to access the datafeed or Cloudalicious will need to find another source of tagging data. Any ideas?

Off to SXSW this morning. Perhaps I’ll see some of you there.


claimID Invite Key - Public Beta!

February 25th, 2006

ClaimID has gone into public beta. If you want to see what it’s all about - please use the key below to register for an account.

Skip the long lines at the beta signup form. Shhh. Don’t tell anyone.

Invite Key = a8dcaec7945a44b915cf81c1a5253070a70d81d6

Happy Saturday!


claimID is coming…

January 18th, 2006

claimID is coming…

This is kind of off topic as it’s not directly about tagclouds - but I like to cross promote.

The site Fred and I are building is almost ready. We’re taking emails of interested parties now.

claimID

claimID will allow users to track, classify, annotate, prioritize and share the information that is about them online.

It’s about staking your claim to the things out there about you. It’s about drawing a line in the sand over the things that aren’t about you. It’s about managing the facts and the image you want. Take back some of the control that’s been lost to the search engines. Use the search engines to help you reclaim who you are.

- Terrell Russell


Tag Dissolve - Give your users the power to forget

November 22nd, 2005

Humans remember things in a fuzzy way. The connections we make between disparate objects happen because of what they share in common - in time, in space, in smell and sound. Burning leaves? Grandparents’ house. Trampoline? 8th grade pool party. Florida? Hanging chads.

I’ve read about techniques of running through your rolodex, or address book, occasionally and without reason, just to see the names and visualize each person. This has been shown to be very effective in helping to remember people’s names since we are more likely to remember what we have seen recently - and forget what we have not.

Reinforcement plays a large role in memory and recall.

If websites begin to implement a tag cloud epoch, they can begin to “forget” the stale tags in their system. Sites can begin to have their most visible tags dissolve in an organic, human way. As people do not continue to tag a certain thing a certain way, this thing should fade slowly from view. It should still be findable (and re-findable) through search and browse, but the tags describing it should count less and less when considering what hot lists to put the item on.

This arbitrary date of oldness, this epoch, should be customizable, of course. But it should be available and it should be prominent. Allow a user to define how far back the tags should be counted. Allow a user to define how old is too old and how recent is recent enough.

Folksonomy will be around for a while, I suspect. It has proven itself useful in many ways - probably some yet to be seen. Some of yesterday’s simple tags will seem quaint tomorrow. Give the user the power to decide whether quaint is signal or quaint is noise. Go implement the “Since” date filter on all your tag clouds today.


A Tag Cloud Epoch for Freshness

November 20th, 2005

Tag clouds are young. The term folksonomy itself was coined just over a year ago by Thomas Vander Wal on the IA Institute’s Members’ Discussion List in a response to Gene Smith and has yet to be a word even a medium-sized minority has heard, much less come to understand.

However, here on Cloudalicio.us, we can already see that many items being tagged since the beginning of del.icio.us have stabilized. New tags being added to these URLs are no longer visibly affecting the overall weights as each tag’s marginal weight has decreased since the first tag was assigned.

We have a better view of what everyone “has thought” of an item by looking at it’s full, cumulative tag cloud, but we miss what everyone “is thinking” of an item when we allow ourselves to look back into time and see possibly “stale” tags from long ago intermingled with more contemporary, more recent tags.

This is not to say that tags from long ago are not valid or no longer important. They are. It’s just that if we’re going to be looking at tag clouds as tea leaves into what a population thinks about something, we need to recognize that unless we silence the voices from long ago, they’ll continue to speak to us and influence our counts. Opinion polls for presidential elections are considered old almost before they get to press. Yesterday’s numbers don’t mean anything. Having snapshots in time is what allows trending to be done.

Trending in tags, here on Cloudalicio.us and everywhere else, has been cumulative only. We do not yet have the tools to slice all this new data and make it more meaningful than to look at it in toto, in summary form.

And so, we need to define a tag cloud epoch.

Tag clouds should be viewed within a framework of five points in time.

  1. system epoch
  2. tag cloud epoch
  3. zoom window start
  4. zoom window end
  5. now

tag cloud epoch

As it currently stands, most systems that have tagging enabled count all the tags between the two outermost points in time (#1 and #5). Cloudalicio.us currently allows four of these to be defined (#1, #3, #4, #5) when used with Date Zooming. Unfortunately, there is no current way to define the point in time from which tags should start to be counted. The addition of point #2 is what will let us see any arbitrary slice in time. This will allow us to see what a population was thinking during a specific week or month about a specific item.

Hopefully, it will appear here soon…


another xhtml tweak

September 14th, 2005

del.icio.us changed the /url pages again today.

parser has been updated and should function normally.


It’s not about exhaustivity

September 7th, 2005

Ian Davis talks about how tagging is too expensive and will not, in the end, be quite the revolution…

I’ve come to the conclusion that this simple technology is hobbled by being expensive. This worries me. I’m not convinced that tagging will persist in the long term and I have a feeling that five years from now we’ll be looking back at the fad that was tagging and shaking our heads over the vast inpenetrable databases of tagged content. Then we’ll get back to Google and to our librarians and get on with finding the things that matter to us.

The advantage classification holds is that you can lookup the classification you need and be confident that you have found all that is available. With tagging there is no way to exhaustively search all the possible tags that people might have used, in all possible languages and spellings.

But, it’s not about exhaustivity.

It’s about the ease of finding enough to satisfy the searcher’s need for information. If the need for information is very deep, then it’s up to the searcher to continue her research. If the need for information is fairly shallow (or rather, anything other than deep), the searcher can feel fairly confident that others have been interested in the topic area before and done enough prior heavy lifting on the classification and categorization of resources to satisfy her search.

‘All that is available’ seems to be a strawman in this discussion. Very very few searches could ever truthfully be exhaustively answered. In fact, it’s the professional researcher’s opinion we usually trust when it comes to whether something is related or not. With raw information, as opposed to physical artifacts, I’d even venture that an objectively exhaustive search result (regardless of how it was organized/returned) is impossible.

I think Clay has made a very strong argument and I think he’s hedged his statements where appropriate. Tagging has shifted the cost of classification and categorization. The tools for elegant retrieval aren’t here yet.

But they’re coming.

That’s what I hear him saying.

I also think, in the long term, professionals will still be professionals. They provide information on the resources available and how best to use them. Before, this consisted of the actual sorting and managing of the hierarchy. In the future, this will be much more based in managing the many voices and scrubbing the messiness that bubbles up from below.

Embrace the messiness.


Date Zooming

September 5th, 2005

Date Zooming has been added.

You can now zoom in on a section of a particularly interesting graph. This should alleviate some of the forward-moving pressure whereby older/established sites will eventually become boring and flat.

One nice effect is the ability to go ‘back’ in time to more precisely determine where and when a site has ‘leveled out’ and become ‘mature’. Over time, I would expect some statistics to begin to fall out concerning how many taggers are required before a site’s Tag Cloud matures.

Some of this, of course, is determined by the community doing the tagging. But over time, as more of a cross-section of society is using these tools, the true averages will begin to show themselves.


del.icio.us date_added formatting change

June 25th, 2005

Made a small parsing change today.

del.icio.us /url/ pages changed from having the date_added listed for each entry to grouping the entries under a single date_added.

Edit: 6/27 - del.icio.us changed again slightly - ‘theday’ -> ‘date’