Fork me on GitHub

Data friction

it’s just striking how some very basic kinds of data friction keep getting in the way of ever-more-amazing possibilities for analysis and insight. Jon Udell

I don’t think I need to add anything to this line (at the end of a typically great post by Jon)

I wonder what kind of data friction life scientists have to fight through. Back in the day, it was all the non-standard terms and slight changes that used to show up in the PDB. In recent times, I have heard of datasets (same kind of data) from different groups that had one column of different with no directions on what the different column meant. Try designing a schema for that.

Of course, the need to come up with yet another data format is the favorite pastime of most life science data creators.

Reblog this post [with Zemanta]

This entry was posted in Informatics, Infotech, Open Science and tagged . Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

blog comments powered by Disqus
  • Archives

  • Disclaimer

    All opinions on this blog are my own and do not reflect those of my employers, past or present