Fork me on GitHub

PubChem, CouchDB and data pipelines

Rich Apodaca has a great set of blog posts on using PubCouch, the CouchDB interface for PubChem. The series is great in itself, but I was especially intrigued by the title of the third installment, PubCouch: Streams Aren’t Just for Pipeline Pilot. In the post Rich describes how PubCouch makes it possible to work with the PubChem FTP archive like it was a single large SD file.

Recently, I’ve started believing that modern programming paradigm and increasing awareness of RESTful architectures, distributed data processing, messaging etc makes systems like Pipeline Pilot look somewhat dated. They aren’t going to go anywhere soon, but I believe that the power of being able to deliver services to the end user is becoming the norm and developing those services is getting easier while the quality of developers in the biopharma/life sciences world is getting better.

Reblog this post [with Zemanta]

This entry was posted in Bytes, Chemistry, Informatics, Molecules, Programming. Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.

One Trackback

  1. By Programming the cloud « GenomeQuest Industry on January 31, 2010 at 13:23

    [...] final remark: Deepak Singh from business|bytes|genes|molecules wonders aloud what is the role of Pipeline Pilot in this new programming paradigm? I’m guessing within a domain, the value proposition might [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

blog comments powered by Disqus
  • Archives

  • Disclaimer

    All opinions on this blog are my own and do not reflect those of my employers, past or present