Fork me on GitHub

When software embraces failure

I recently heard someone talk about software as being operational. The context was large scale systems and how it is important not just to run hardware systems that operate at scale but also software that operates at scale. And having spent a lot of time lately in a world where operational excellence is paramount as well as the Hadoop world, the importance of operational software is becoming only more evident. As we work with very large data sets that need to be processed across 100′s, even 1000′s of commodity compute nodes, you need to run software that accepts and embraces failure. What makes the Hadoop framework work isn’t just the part that you can run map-reduce jobs, but the part where data is replicated in multiple locations, where the filesystem is distributed across a number of nodes, where hardware failure is expected. As we go towards multiple terabytes per workload, maybe someday even petabytes of data in the life sciences, understanding and appreciating operational software is going to become critical. Most of us aren’t used to doing so and it’s going to require a bit of a cultural shift to get there, but it will be fun when people figure it out.

In the meantime follow the work Rajarshi is doing with Hadoop and of course work coming from people like Mike Schatz.

Reblog this post [with Zemanta]

This entry was posted in BioIT, Computing, Informatics, Software & Internet. Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

blog comments powered by Disqus
  • Archives

  • Disclaimer

    All opinions on this blog are my own and do not reflect those of my employers, past or present