This web page represents a bundle for the contents and additional material of the paper accepted at the PLOS One Journal. The purpose of this web page is to provide a summary of the paper, support links and short descriptions of the contents used as input and generated as output of the described work. The full paper is available online here.
How easy is it to reproduce the results found in a typical computational biology paper? Either through experience or intuition the reader will already know that the answer is with difficulty or not at all. Here we attempt to quantify this difficulty using a previously published paper [Kinnings et al. 2010] for different classes of user and suggest ways in which the situation might be improved. Quantification is achieved by estimating what is required to take the procedures described in the paper and include them in a formalized workflow that can reproduce the original results. We conclude with a brief discussion of the complexities of requiring reproducibility in terms of cost versus benefit. This has implications not only in reproducing the work of others from published papers, but reproducing work from your own laboratory.
Initially, some of the inputs defined in the Drugome's web page (accessible here) were considered. However those materials were mostly curated results to highlight the outputs of the experiment, so instead we took the inputs files and scripts from the original team of scientists. In particular, these inputs were:
Additionally, several tools were used to compute some of the steps of the experiment:
|Daniel Garijo is a PhD student in the Ontology Engineering Group at the Artificial Intelligence Department of the Computer Science Faculty of Universidad Politécnica de Madrid. His research activities focus on e-Science and the Semantic web, specifically on how to increase the understandability of scientific workflows using provenance, metadata, intermediate results and Linked Data.|
|Sarah Kinnings is a Bioinformatics Scientist at Sequenom. She is a PhD in Bioinformatics by the University of Leeds and worked as a post doctoral researcher at the Skaggs School of Pharmacy and Parmaceutical Sciences, UCSD.|
|(Photo not available)||Li Xie is a Senior scientist at the Skaggs School of Pharmacy and Parmaceutical Sciences, UCSD.|
|Lei Xie PhD is an Associate Professor at the Hunter College, CUNY. Previously he worked as a Research scientist at the Skaggs School of Pharmacy and Parmaceutical Sciences, UCSD. His current research focus is to develop and apply computational techniques to study the structure, function, dynamic, and evolution of molecular interactions on multiple scales, from atomic details to biological networks.|
|Yinliang Zhang is a PhD student at University of Science and Technology of China, and a Visiting Grad student at the UC San Diego.|
|Philip E. Bourne PhD is Associate Vice Chancellor for Innovation and Industrial Alliances, a Professor in the Department of Pharmacology and Skaggs School of Pharmacy and Pharmaceutical Sciences at the University of California San Diego, Associate Director of the RCSB Protein Data Bank and an Adjunct Professor at the Sanford Burnham Institute. Bourne's professional interests focus on relevant biological and educational outcomes derived from computation and scholarly communication. This implies algorithms, text mining, machine learning, metalanguages, biological databases, and visualization applied to problems in systems pharmacology, evolution, cell signaling, apoptosis, immunology and scientific dissemination. He has published over 300 papers and 5 books, one of which sold over 150,000 copies.|
|Yolanda Gil Yolanda Gil is Director of Knowledge Technologies and at the Information Sciences Institute of the University of Southern California, and Research Professor in the Computer Science Department. Her research interests include intelligent user interfaces, social knowledge collection, provenance and assessment of trust, and knowledge management in science. Her most recent work focuses on intelligent workflow systems to support collaborative data analytics at scale.|