Fork me on GitHub

Data driven science revisited

Chris Anderson once infamously wrote The data deluge makes the scientific method obsolete, an opinion that I do not share. Eric Drexler on the other hand comes at this new age of data driven science with the right mindset. In a post on data explosion and the scientific method, Eric writes

Tradition demands that science always be hypothesis-driven: First, try to guess the truth, and only afterward collect experimental data to test whether the guess predicts the results. Indeed, this has been termed “The Scientific Method”. The new data-driven approach suggests that we collect data first, then see what it tells us. This becomes practical when experimental methods can amass enormous amounts of data, enough data to test more hypotheses than any mortal scientist could conceivably imagine.

The thing that is important here is that testing hypotheses doesn’t go away. In fact, as Eric points out, in some ways this is no different from what we’ve done in some sciences for a long time; Eric mentions astronomers and microsocopists. Darwin’s work also falls into this category. In other words, science has long been based on observations, with those observations leading to hypotheses. The difference now is the sheer speed and volume at which data is collected, overwhelming our old pattern recognition tools, our eyes and mind. To help us make sense of those observations we need machines, but to do good science, we still need to develop models and theorems and then test them. As Eric writes in talking about data driven biology

For these methods to work, we must know enough about patterns (repetition, correlation, difference, functional correspondence…) that we can recognize some of them and separate the real patterns from the statistical illusions. This too is a hypothesis, but there is no pretense of vast insight.

We should not forget that

Reblog this post [with Zemanta]

This entry was posted in Genes. Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.

2 Trackbacks

  1. [...] Data driven science revisited (mndoci.com) [...]

  2. [...] Data driven science revisited (mndoci.com) [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

blog comments powered by Disqus
  • Archives

  • Disclaimer

    All opinions on this blog are my own and do not reflect those of my employers, past or present