it’s just striking how some very basic kinds of data friction keep getting in the way of ever-more-amazing possibilities for analysis and insight. – Jon Udell
I don’t think I need to add anything to this line (at the end of a typically great post by Jon)
I wonder what kind of data friction life scientists have to fight through. Back in the day, it was all the non-standard terms and slight changes that used to show up in the PDB. In recent times, I have heard of datasets (same kind of data) from different groups that had one column of different with no directions on what the different column meant. Try designing a schema for that.
Of course, the need to come up with yet another data format is the favorite pastime of most life science data creators.
Data friction
I don’t think I need to add anything to this line (at the end of a typically great post by Jon)
I wonder what kind of data friction life scientists have to fight through. Back in the day, it was all the non-standard terms and slight changes that used to show up in the PDB. In recent times, I have heard of datasets (same kind of data) from different groups that had one column of different with no directions on what the different column meant. Try designing a schema for that.
Of course, the need to come up with yet another data format is the favorite pastime of most life science data creators.
Related articles by Zemanta