Sign in | Display Options

Completeness

Conversations tagged with 'completeness'

FriendFeed
Robert Scoble posted an entry
May 22, 2010 1:08 PM - Sign in to comment - Link

There are a slew of social relationship managers that have been released in the past year and the space is really heating up.

There’s Gist, here’s CEO T.A. McCann talking about it in front of the Google IO conference.

But there’s Xobni, Rapportive, eTacts, and others.

I’m trying them all.

So far Gist is my favorite. Why? Because of its completeness of information about other people it brings into my view and because of its web centricity. Xobni is nice if you use Outlook, for instance, but I don’t use that anymore.

Rapportive is nice because it works on regular old Gmail. Gist just shipped support for Gmail, but only if you are using Google Apps. I don’t use that either.

But which one is your favorite? Have you considered using a relationship manager? They aren’t for everyone, but if you need to keep up to date on a large number of people, which usually means executives, PR people, or salespeople, then these things are really great. It’s interesting that everyday Gist sends me an email and presents information about the people I’m emailing with the most. That often lets me learn about people in a new way that I miss on Twitter, LinkedIn or Facebook.

For instance, you can see my public profile on Gist and get an idea of the kinds of things it presents about the people in your life. Now, imagine you were headed to a meeting with a VC, or trying to sell something. Wouldn’t having all this data about that person be useful? Also, Gist has a nice iPhone app that shows you info and it just got an update today that lets you import your iPhone’s contacts.

By the way, I’ve created a Twitter list of “weapons for entrepreneurs” like these and other tools, people, and services that help entrepreneurs do their jobs better. Why aren’t you following that?

Blog: what's your favorite social relationship contact manager? http://bit.ly/dnKQK5 Mine is @gist and @rapportive, but @xobni is in there 2

- Robert Scoble
FriendFeed
Adam Sherk shared an item on Google Reader
February 16, 2010 1:43 PM - Sign in to comment - Link

Posted by Nick Gerner

Since the launch of Open Site Explorer and our API update, Chas, Ben and I have invested a lot of time and energy into improving the freshness and completeness of Linkscape's data.  I'm pleased to announce that we've updated the Linkcape index with crawl data that's between two and five weeks old—the freshest it's ever been.  We've also changed how we select pages, in order to get deeper coverage on important domains and waste less time on prolific but unimportant domains. 

You may recall Rand's recent post about prioritizing the best pages to crawl, and mine about churn in the web. We've applied some of the principles from these posts to our own crawling and indexing. Rand discussed how crawlers might discover good content on a domain by selecting well-linked-to entry points:


In the past, we've selected pages to crawl based purely on mozRank.  That turned out to favor some unsavory elements (you know who you are :P).  Now, we look at each domain and determine how authoritative it is.  From there we select pages using the principle illustrated above:  Highly linked-to pages—the homepage, category pages, important pieces of deep content—link to other important pages we should crawl.  From intuition and experience we believe this gives the right behavior to crawl like a search engine would.

In a past post, I discussed the importance of fresh data.  After all, if 25% of pages on the web disappear after one month, data collected two or more months ago just isn't actionable.

From now on, we're focusing on that first bar in the graph above. By the time our data approaches that second bar (meaning most of it is out of date), we should have an index update for you.  If and when we show you historical data, we'll mark it as such.

What this means for you is that all our tools powered by Linkscape will provide fresher, more relevant data, and we'll have better coverage than ever.  This includes things like:
As well as products and tools developed outside SEOmoz using either the free or paid API: There are plenty more.  In fact, you could build one too!

Because I know how much everyone likes numbers, here are some stats from our latest index:
  • URLs: 43,813,674,337
  • Subdomains: 251,428,688
  • Root Domains: 69,881,887
  • Links: 9,204,328,536,611
Our last index update was on January 17th.  You might recall some bigger numbers in the last update.  Because of the changes to our crawl selection, our latest index should exclude a lot of duplicate content, spam pages, link farms, and spider traps while keeping high quality content.

Our next update is scheduled for March 11. But we'll update the index before then if the data is ready early :)

As always, keep the feedback coming.  With our own toolset relying on this data, and dozens of partners using our API to develop their own applications, it's critical that we hear what you guys think.

NOTE: we're still updating the top 500 list at the moment.  We'll tweet when that's ready.

Do you like this post? Yes No

Please choose your display preferences:

CLOSE [ X ]