PubChem, CouchDB and data pipelines
Rich Apodaca has a great set of blog posts on using PubCouch, the CouchDB interface for PubChem. The series is worth reading in itself, but I was especially intrigued by the title of the third installment, PubCouch: Streams Aren’t Just for Pipeline Pilot. In that post, Rich describes how PubCouch makes it possible to work with the PubChem FTP archive as if it were a single large SD file.
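To make the "single large SD file" idea concrete, here is a minimal sketch in Python. It is not PubCouch's code or API; the function names `iter_sd_records` and `stream_archive` are hypothetical, and the sketch assumes the archive has already been downloaded as a set of gzipped SD files in which each record ends with a `$$$$` line.

```python
import gzip
from typing import Iterable, Iterator, TextIO


def iter_sd_records(handle: TextIO) -> Iterator[str]:
    """Yield one SD record at a time from an open text handle.

    A record is everything up to (but not including) a '$$$$' line.
    Trailing text with no closing '$$$$' is silently dropped.
    """
    lines = []
    for line in handle:
        if line.rstrip("\r\n") == "$$$$":
            yield "".join(lines)
            lines = []
        else:
            lines.append(line)


def stream_archive(paths: Iterable[str]) -> Iterator[str]:
    """Chain many gzipped SD files into one lazy stream of records.

    Only one file is open at a time and only one record is held in
    memory, so the caller sees the archive as a single large SD file.
    """
    for path in paths:
        with gzip.open(path, "rt", encoding="utf-8", errors="replace") as handle:
            yield from iter_sd_records(handle)
```

A consumer could then write `for record in stream_archive(paths): ...` without caring where one file ends and the next begins, which is the property that makes a stream interface attractive here.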
Recently, I’ve started to believe that modern programming paradigms and the increasing awareness of RESTful architectures, distributed data processing, messaging, etc. make systems like Pipeline Pilot look somewhat dated. Those systems aren’t going away anytime soon, but I believe that delivering services directly to the end user is becoming the norm, that developing those services is getting easier, and that the quality of developers in the biopharma/life sciences world is getting better.