Fork me on GitHub

ProteomeCommons.org Tranche – A commons for proteomics data

proteomecommons.orgAt the recent US HUPO conference, I happened to walk by a poster entitled “ProteomeCommons.org Tranche: Publicly storing and sharing large amounts of proteomics data”. I ended up having a short chat with the developer of Tranche, Jayson Falkner and was quite impressed with the effort. What is Tranche? Let me quote the poster introduction.

The ProteomeCommons.org Tranche project is designed to allow publication of all raw data relating to proteomics experiments. The system allows investigators to securely share files with collaborators, reviewers of manuscripts, and to publicly disseminate their data. Encryption and digital signatures allow authentication of data files, so provenance and integrity are unambiguous. The system provides for distributed downloads, background file compression, and allows file replication across the network to minimize the chance of file loss. While the system is intended to solve the raw data distribution problem, it is independent of file type so it can be used for peak lists, search results, images, and any other form of digital information.

The idea is that when people publish papers using mass spectroscopic methods for proteomics, the raw data are rarely included. The Tranche project essentially provides a resource for researchers to upload their raw data, share it, make it available to referees, etc. But the developers have gone one step further than just providing a repository. The framework looks quite solid, allowing data replication, self hosting, network hosting, etc. According to Jason they started off by implementing a BitTorrent, but a lot of people were apprehensive of the negative connotations that BitTorrent generates, so they ended up developing their own framework.

The ProteomeCommons.org moniker is somewhat unfortunate, since the organization is not affiliated with Science Commons. However, there has been a dialog between the two organizations recently and there is a possibility that ProteomeCommons.org might adopt Creative Commons licensing. That certainly makes sense and hopefully will not meet any resistance from the user community.

There is a Google Group for the effort where you can get a copy of the HUPO poster (which has more implementation details). I was just pleased to note that there were people in the community who understood the role internet technology can play in creating community and of the value of data sharing and transparency. There seems to be a fair degree of adoption among users with servers at some of the top proteomics facilities around the world. Another database project, the Human Proteinpedia, from the Pandey Group targeted for storing annotations and metadata related to proteomics experiments also plans to (perhaps already is) using the Tranche project as a repository for all raw data.


Update:
According to Jayson Falkner (see comments), ProteomeCommons.org is in the process of moving from Apache2 licensing to CC licensing.

Technorati Tags: , , ,

This entry was posted in BioIT, Informatics, Omics, Open Science, Software & Internet. Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.

6 Comments

  1. Posted March 14, 2007 at 20:26 | Permalink

    Thanks for the writeup Deepak! It was a pleasure to meet you in Seattle.

    Just a note. We (ProteomeCommons.org) are in fact in the process of converting most all of the ProteomeCommons.org code and related projects from the Apache 2 license to the Creative Commons Attribution license. Cheers.

  2. Posted March 14, 2007 at 20:45 | Permalink

    Jayson,

    It was good to meet you too and good to see the approach you were taking.

    Thanks for the clarification. I’ve updated the original post

  3. Posted March 15, 2007 at 00:26 | Permalink

    Thanks for the writeup Deepak! It was a pleasure to meet you in Seattle.

    Just a note. We (ProteomeCommons.org) are in fact in the process of converting most all of the ProteomeCommons.org code and related projects from the Apache 2 license to the Creative Commons Attribution license. Cheers.

  4. Posted March 15, 2007 at 00:45 | Permalink

    Jayson,

    It was good to meet you too and good to see the approach you were taking.

    Thanks for the clarification. I've updated the original post

  5. Posted May 26, 2007 at 08:10 | Permalink

    Jayson,

    This would be a valuable resource, but I don’t think enough people know about it.

    Is there a way Sage-N Research can help publicize it?

  6. Posted May 26, 2007 at 12:10 | Permalink

    Jayson,

    This would be a valuable resource, but I don't think enough people know about it.

    Is there a way Sage-N Research can help publicize it?

3 Trackbacks

  1. [...] Tranche on bbgm [...]

  2. [...] Bioscreencast « ProteomeCommons.org Tranche – A commons for proteomics data Blogroll update » [...]

  3. [...] Commons Tranche is one of the cooler resources on the web. Ever since I met Jayson Falkner, I have liked their approach to open data, and their early support for CC0. Looks like Tranche has hit the big time with the [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

blog comments powered by Disqus
  • Archives

  • Disclaimer

    All opinions on this blog are my own and do not reflect those of my employers, past or present