Fork me on GitHub

Academic software and infrastructure – AKA more ranting

Perhaps no one will listen, but my one man movement against scientific web resources is going to continue, both for the access models as well as the funding models. Our two latest examples are the Robetta server and the SDSC protein structure homology modeling server. I found both as I looked for resources to solve a question posed by Jean-Claude. Here is what I ran into.

With Robetta

At this time, Robetta is only available for use by the academic community and other not-for-profit entities.

I suppose that hobby scientists can be considered not-for-profit entities, but my previous two attempts to get access to software with similar language were not successful. We continue to be shackled by artificial boundaries and the artifacts of a system that needs to change, and fast. Admittedly, there is no incentive for anyone to do so, and under the current model, it’s a disincentive, but science is going to suffer if we continue in this vein. The models I prefer are ones where the underlying software is open source, and value add services either open to all, or paid by all. This crazy intermediate model has not worked for the companies selling the software and has really resulted in a poor open source model for a lot of scientific software. The tools are (usually) a means to an end and not the end in itself and the latter is where the value lies.

With SDSC

This server has been shut down, due to insufficiency of the resources to maintain it. Upon resources becoming available the server may start function again.

Note the small print

This service was developed by I.N. Shindyalov and P.E. Bourne and provided by San Diego Supercomputer Center (SDSC). This work was supported through grants DBI 9630339 and DBI 9808706 from the National Science Foundation and “National Biomedical Computational Resource” grant from the National Institute of Health, NRCC.

The names on that list are not exactly unknown. Now this was set up a long time ago, so the entire part about it being available may be moot, but that’s not the point. The fact remains that at some point in time the server was shut down due to a lack of resources. In the commercial world that would be a market decision, but I am not so sure that this is the case here. Perhaps when it comes to funding infrastructure resources, the NIH treat them as a market, i.e. depends on traffic and other criteria. The current infrastructure funding model just doesn’t provide sustainability. On the flip side those running a resource available to all should make the source available on Github or similar resource. That way, others can fork it and host it on AWS or similar service.

Reblog this post [with Zemanta]

This entry was posted in Business, Bytes, Innovation, Software & Internet. Bookmark the permalink. Post a comment or leave a trackback: Trackback URL.
  • I recently tried to talk a friend out of going to grad school in bioinformatics (computer side), arguing that

    * since he had no interest in further academic work -- only in the knowledge gained and the signaling mechanism thereto -- grad school is vastly expensive for that purpose
    * interesting parts of the field are so immature that huge chunks are accessible to the motivated amateur: eg hadoop, command of the baseline machine learning toolkit
    * armed with those tools, he can get a job to fill out his toolbox at a bioinformaticist's salary rather than at a grad student's salary
    * much of the interesting research done on the computer-engineering end of bioinformatics is done in a commercial and not academic setting

    It's frustrating to be reminded of the many petty obstacles to amateur science -- not only academic-only software with no carve-out for the amateur scientist, but lack of access to journals and all the rest.

    Imagine if we had to prove our bonafides to contribute to Linux or view its mailing lists?
  • harijay
    One way out is for companies and individuals to fund science that they get value from. If an academic site that had some value to the general community goes down, then the community should rally to write to the authors/maintainers and port it to a suitable platform like google app-engine or an AWS account backed by a loosely constructed paypal donation backed foundation. In these cash strapped times, I would not blame any government entity that shuts down a service that costs money to keep running.

    Also along these lines , I dont think it is practical or right given the funding situation that academics be expected to close all avenues for commercialization by giving away their developed algorithms . The markets involved are too small for academics to sit and wait for value to accrue from the application of any open-source model. The cash rich entities in the equation a.k.a most big Pharma companies should be expected to pay for software /algorithms developed with public funds. That helps academics stay afloat as government funding gets more and more scarce.
  • Open source does not close avenues for commercialization. Most of the current models (and I've worked in those for a long time) do not really work. Well perhaps for a few individuals, but not for the quality of the code and the improvement of science.

    The reason is that almost none of the software by itself is worth that much in the first place, i.e. unique enough to be absolute must. Bioinformatics has done fine with very little closed source software (in fact, closed source has lost) and the places where money is paid is in areas such as data management, not algorithms. Pharma etc will pay for custom development and would rather be contributors to the open source world. Can you imagine if R were not open source? Would all of you be even talking that much about it? Would it be half as successful and it's actually broadly usable and has a lot of value.
  • I was just saying that for Robetta or whatever to restrict useage to academia is perfectly acceptable. You get your grants rejected ! , your funding dries up..what do you do . Reign in the give it all away ideals, and start getting pragmatic!. Allow academics to use your software free , and charge everyone else to use it!

    That said open source works and works very well, there is no denying it. Numpy/Python, R , Ruby on Rails etc succeed because they are open source and quite universally applicable. Its a different matter to open source a niche application that which is used by a small group of people. I dont think its fair to expect the same model to translate to other platforms or software.

    Although I can see your point that setting things free guarantees their long term survival , helps keep quality up etc etc , it rarely pays the bills! Accordingly, I think the biggest way to encourage open source is to start contributing to projects , and hit those paypal buttons everytime an open source projects makes your work easier. I would rather do that than expect academics or government funded projects to give things away everytime.
  • Disagree pretty violently with this. It is not the responsibility of researchers to determine funding mechanisms to ensure the effective continuity of research outputs, it is the responsibility of funders to ensure that their (public) money is effectively spent. And if that includes funding continuity for important software projects (or data projects, or specialist materials) then we have to have that discussion.

    Okay realistically this is not going to happen but given that the interest of the researchers is in having the software used, and in finding a way of keeping it supported surely it would be preferable to allow it to be available for any non-profit use? The more people using it the more likely it is for them to come up with something that might turn a profit - limiting use to people who have neither the time nor the inclination to actually make it useful seems farcical.

    And of course, open sourcing it would be be even better in this regard :-)
  • harijay
    I agree with you Cameron, a researcher needs to do research, not worry about funding mechanisms and such. It will be great when every researcher realizes that opening things up builds community in away that almost guarantees continuity and maybe eventually profitability.

    It is also an unfortunate by-product of closed to non academics cases that "hobbyists" and non-profit users get caught in the crossfire , there is nothing worse than alienating the motivated.

    But my point still is that , to demand or even expect academics to adopt open source because of funds coming from the public is unrealistic and impractical.

    From my own experience , the best crystallography software I use happens to be split down the middle, 50% of them are closed source and non-free for non academics and the rest of them are free and open source. The going is equally good for both of them and I cannot live without either. I would never expect the non-free software to adopt the model of the free, because what they have going just works! If I have to stop using the non-free version because I move to a non-academic setting , then so be it! , I still have the free version to fall back on.

    I think the best way to persuade academics to adopt open source is to adopt a "Gandhian" attitude to persuasion. Contribute selflessly and hope they see the point. If you are a hobbyist, I would email the author and request him to allow you to use it. If the email works great, if not just sit back and hope the software gets liberated sometime soon, or better yet fund an open source alternative.
  • Sort of adding on to this discussion

    1. Open Source does not preclude commercialization. You need to choose the appropriate license. The reason companies buy software is to get support, custom patches, more input into the dev cycle etc. There is an entire, well established model on how you can make this work.

    2. The one problem with niche software is that it requires you to have licensing fees, the kind no one is willing to pay which is why scientific software companies do not usually make money for most algorithmic software (you can make money on platforms, data management solutions, etc). This makes open source even more attractive really as it broadens out development and results in better code. Think of CHARMM. I think one reason the software has not evolved in quality (it has had oodles of algorithms thrown in) is because it's not open source. IMO the benevolent dictator model works really well in these cases
  • I'd like to add that where there is value, companies will and do fund software development, either directly, or through consortia and that is fine.

    This half-way model is the one I have always had problems with and more now than ever. It also results in academics not appreciating open source.
blog comments powered by Disqus
  • Archives

  • Disclaimer

    All opinions on this blog are my own and do not reflect those of my employers, past or present