Fork me on GitHub

PubChem, CouchDB and data pipelines

Rich Apodaca has a great set of blog posts on using PubCouch, the CouchDB interface for PubChem. The series is great in itself, but I was especially intrigued by the title of the third installment, PubCouch: Streams Aren’t Just for Pipeline Pilot. In the post Rich describes how PubCouch makes it possible to work with the PubChem FTP archive like it was a single large SD file.

Recently, I’ve started believing that modern programming paradigm and increasing awareness of RESTful architectures, distributed data processing, messaging etc makes systems like Pipeline Pilot look somewhat dated. They aren’t going to go anywhere soon, but I believe that the power of being able to deliver services to the end user is becoming the norm and developing those services is getting easier while the quality of developers in the biopharma/life sciences world is getting better.

Reblog this post [with Zemanta]

This entry was posted in Bytes, Chemistry, Informatics, Molecules, Programming. Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.
blog comments powered by Disqus
  • Archives

  • Disclaimer

    All opinions on this blog are my own and do not reflect those of my employers, past or present