meebo rooms

we launched meebo rooms tonight. 

the result of about 4 months work, many meals of takeout, lots of afternoon 4 square, and I have to think about 5,000 testing cycles.

Sandy had a great blog post on it here.

Danny and I worked for the last month solid putting the partners together, and I think the final tally was 14 for launch, including CNET, Capital Records, NBC and a bunch of other great content partners.

Below is a room on fast cars - my previous hobby before I became a father and started commuting to mountain view (100 miles/day).  I used to have a 1971 porsche 911T, then a 1975 aston martin V8, and second to last, a 1994 porsche RS America. Now, I drive a 2007 prius...and I love it.

Feel free to contribute.

http://www.meebo.com/rooms

UI words to live by

UI words of wisdom here

Iphonefluidity

Mobile Revolution needed ASAP

These good gentlemen are happily chatting away on their cells in Nepal. 

Image hosted by Webshots.com
by daske

Nice.  I'm happy for them.

Everyone else on the planet is able to watch full length feature films, check the latest news, pay for their groceries, text with their children and play world of warcraft on their phone.

Here in the nest of global innovation: California USA, I lose cell reception on my drive home in Marin County, California - *EVERY SINGLE* time.

Sprint's generated something like three quarters of a billion dollars in EBITDA last quarter.  You'd think they could find a few bucks to put a cell tower in a major commute area like 101 North out of San Francisco around Sausalito?  After all, there are only 38 million crossings per year.

Global Neighborhoods

My friend Shel is writing a new book - the main subject centers around Global Neighborhoods - and he just posted a blog interview with me on the topic.

In case you don't know Shel, he and Robert Scoble wrote the book on business blogs, Naked Conversations - a must read.  Shel did some consulting for me in the spring - I highly recommend him.

The concepts that are being explored by Shel will contribute to a more human web and one that is ultimately more personal and useful, as well as socially rewarding.  He's a very thoughtful guy, and a good storyteller, so the book should be very good.

Many of the ideas we discussed are being applied to the new and improved Webshots, which relaunches in August.

under the radar

Just came back from the under the radar conference, where I had the pleasure of "judging" about 10 start ups from podcasting to video remixing.  A few of my fellow CNET'ers were there, Rafe moderated one of panels and Dan Farber was there with digital camera in hand, and each had insights galore.  I met Debbie Landa, who runs the conference, and have to say how impressed I was - a real human being, smart, down to earth, opinionated - like to see her again.  I will be back, and hope to bring more people from the CNET Networks business side.  We all don't get out as much as we should.

More on Haystack

Haystack powers allyoucanupload and will soon power all of Webshots (Webshots is a photo sharing community with 19,000,000 members who have uploaded over 375,000,000 photos).  Allyoucanupload is an image hosting service that we built to run alongside Webshots.

Haystack is designed to provide a very scalable, reliable and cost effective platform for object storage and delivery to the Internet.  It just went live 2 weeks ago - we are currently using it for the allyoucanupload Webshots image hosting service (gif, jpeg and png).   In the very near future, it will serve all Webshots photos, and soon, video. 

I'm doing a long post because I'm very proud of what our technical team has accomplished, but also to give some insight into how finance, user and technical strategy intersect in our group.  Building a large and sustainable (aka profitable) business in social media requires a balancing act between the delivering on a great user promise, a revenue model and keeping your costs under control.  Haystack will allow us to deliver very robust storage solutions for users at a very low marginal cost.

Disclaimer/Credit: Almost all of the technical content in this blog post was written by Paul O, who runs CNET Networks' data center services, including database architecture, network systems and operations and the actual data center.  Paul, Jim, Rodolphe, Marcus, and Matthew built Haystack - they don't (yet) have blogs so I'm doing this post.  Please do not attribute any technical props to me because of this post.  My only contribution to Haystack was to approve its development and cheerlead along the way.

Haystacks' content (social media files) has several interesting characteristics: it grows without bound; it tends toward write-once, read-many; the most recent content tends to be the most frequently accessed.  Haystack's design leverage these characteristics. 

The challenge:
A big financial and therefore technical issue is the relationship of storage to delivery.  It's relatively easy to deliver a small number of files to a lot of people.  And a large number of files to a small number of people.  A large number of files to a large number of people gets more complicated and can get very expensive very quickly.

Haystack gives us the ability to finely match the raw storage capacity of the system with it's overall IO capacity.  Haystack grows very naturally through incremental addition of capacity.  Haystack is designed to handle failures automatically and to keep reliability constant as the system grows.  Haystack uses commodity hardware and software.

The promise of Haystack is that we can handle reliability at scale at a very low (perhaps the lowest) marginal operating cost. Reliability means that we never have to say we're sorry - we lost your photos.  Scale is scale.  Low marginal cost directly goes to our ability to give users as much storage as we can, and run the least intrusive ads, while running Webshots with a sustainable profit margin - and keeping the data center team focused on talent vs hands and well paid:)  You'll see the effects of Haystack on Webshots in our upcoming redesign and soon to be revised storage limits. 

BSU
  Haystack consists of many Basic Storage Units (BSU), which are just servers with a lot of disks.  Content is scattered more or less randomly across all BSU and spindles to maximize the IO throughput of the system.  Multiple copies of the content are maintained on disparate equipment so that no single failure can loose all copies of an object.

Failure: In the event of a component failure, Haystack immediately begins a process to copy the "missing" content from one of the redundant sources to the available components.  Because the content is scattered across all available components, recovery time is on the order of 1/N.  Of course, the failure rate is on the order of N, so the overall availability is on the order of N * 1/N or a constant.  Recovery is also has very little impact on the overall performance of the system.

As new capacity is added, existing content is migrated to the new capacity to rebalance the storage and IO across all available units.  The rebalancing time is approximately constant.

Separation of Church and State
To minimize the overhead in tracking the location of any given object, Haystack puts objects into buckets and needs to track only the location of the buckets.  The applications using Haystack must independently track metadata about each object including it's bucket.  At some level, a bucket is really just a directory and each BSU knows what buckets it contains.  Haystack maintains a proper cache of the bucket locations and each BSU checks in periodically to report the state of its buckets.  The proper cache can easily be rebuilt and the overall system is very tolerant of data inconsistencies between clients, the cache and the BSUs.

Processes
Various processes monitor the overall condition of Haystack and initiate actions as needed to maintain the health.  For example, when new capacity is added, these monitoring processes detect the availability of "under-utilized" capacity and begin a bucket migration process to bring the new capacity up to the same levels as the old capacity.

Some of the jobs have names...they are:
leon's job is to identify and remove extra instances
jeopardy makes more copies of data that has too few copies
scalpel removes outdated copies of data
optimist moves data to fill up less-full nodes
pessimist moves data from busy nodes to less busy nodes

Changing the drives over time
Currently we use 400 GB sata drives.  As time goes by, the average age of the content will grow and the average number of access per object will decrease.  This will allow us to introduce larger capacity disk drives as the system grows, helping to keep the overall hosting and depreciations costs low.  The number of sata drives attached to a given BSU is determined by the overall network throughput of the BSU and the ability of the BSU to effectively use the file system cache.

Caching
Haystack uses various content caching and redirection mechanisms to both hide the complexity of the system from clients and to leverage the raw IO capacity of the spindles.  The system is very loosely coupled and designed to be quite tolerant of failures.  This means that failures are very localized.  In addition, the types of failures that can have the largest negative impact are with very proven technologies, such as file systems and disk arrays.  Thus the overall reliability is high and the operational costs are low.

Rant - lots of private social media properties are venture funded and lose tons of money storing/serving content to get big to either get bought or change the model. 

Finale
The challenge at public companies like CNET Networks is to build a social media business model as well as a user & technical model from the "get go".  That operating plan must be good for user and cheap enough to operate so we can return a healthly return on invested capital to our shareholders.  Haystack is a significant innovation that should help us to build a better product for users, scale for marketing partners, and generate operating cash flow for shareholders.

Whew - if you're still reading then you are likely one of the Webshots or CNET Networks engineers - nice work folks :)

picture in picture

Steve Rubel just posted on how marketers could build widgets to participate in the conversation. 

Great idea.  And it's very relevant to sustaining communities, forums, and blogs -  a good revenue model and a nice complement to the direct response nature of ad sense.

Widgets built by interesting brands would be useful.  And maybe entertaining.

And since users are in control they could choose to align with those that fit their personal brand.

Brian McAndrews came in last month to speak to some people at our company.  He's CEO of aQuantive, the largest digital marketing company.  I asked him about marketers using widgets to extend their brand and content into community sites and he said he'd not heard of any brands doing it.  They really should - and aQuantive has the technical resources to do this for them.    I know a bunch of people who work here who'd put a coke zero widget on their blog.   I would.

You'd need to figure out the money angle though - you'd want to help incent true fans to align themselves with the brands.

My Photo

Recent Comments

Recent Posts

Blog powered by TypePad