HPA Activities
Projects
Current:
Teragrid Basic Support. We support several Teragrid users to get started on various resources. Also see: Local TG webpages and Knowledge Base articles related to this.
Teragrid Advanced Support. We are working with two Geologists and one Physicist from IU to get their codes to run on Teragrid resources -- activities include installation of libraries, porting code to different architectures, data storage strategies, etc.
Hidden Markov rates in fastDNAml.
HPA staff are implementing a new mutation rate generator in
fastDNAml.
Nuclear Pasta. We are collaborating with a theoretical physicist in the Physics Department/IU Nuclear Theory Center to study a phase of nuclear matter (neutrons and protons) known as nuclear pasta, which may occur in neutron star crusts and in type II supernovae. We are using a classical molecular dynamics model, and our program can use the MDGRAPE-2 board, to do the O(N^2) force calculations for each MD timestep in the N-body problem. The code is parallelized with MPI, and can run on either a conventional parallel computer or a parallel MDGRAPE-2 computer.
Optimizing the MILC code The MILC code is widely used to study Lattice Quantum Chromodynamics (Lattice QCD). We are optimizing its core linear algebra routines to run on the Intel Itanium2 processor.
Seismic Analysis. Collaboration with Department of Geological Sciences for USArray/EarthScope project. Designed and Implemented a X11 Motif based GUI front end to display seismic time series data and accept seismic analyst's input to perform backend series ensemble correlation to realign the time series data, will enable effecient automation of seismic data analysis process.
Completed:
MEDLINE text mining.
We
are working with School of Informatics to create a system to provide a
web search engine for MEDLINE documents. Several thesauri are
incorporated into the system to better navigate the search process.
Penelope. Radiation Oncology. The
program Penelope, which simulates the transfer of radiation through
various media, is being converted and optimized for the simulation of
radiation therapy with the Gamma Knife. Supercomputer versions of this
code will make feasible the clinical use of much more accurate methods
for targeting radiation therapy, thus eliminating one of the critical
sources of failure of such therapy today (insufficiently accurate
targeting due to oversimplified models of the human body).
SimWalk
II.
HPA staff are implementing a parallel version of the SimWalk program used by
the Department of Medical Genetics to study the genetic component of
diseases.
Prodigies. HPA staff are working
with the IUB Chemistry Department to migrate the Prodigies package onto
AVIDD. This package does protein analysis, based upon new instruments
developed locally. The migration will make it to handle larger genome
and increase the throughput.
Global
analysis of arthropod evolution.
This project addressed the question, whether arthropods with six legs
(insects and their relatives) constitute a single evolutionary family. A
team led by University Information Technology Services, the Indiana
University Center for Genomics and Bioinformatics, and the High
Performance Computing Center of the University of Stuttgart won a
prestigious High Performance Computing Grand Challenge award SC2003, the
premiere annual international supercomputing conference. This project
took top honors in the category of "Most Geographically Distributed
Application."
Merlin. HPA staff have expanded
Merlin, a sparse tree implementation of gene flow in pedigree analysis,
from a 32-bit package to a 64-bit package to accommodate larger trees.
MCNP5 and MCNPX. HPA staff installed, tested and assisted in the use of MCNP5 and MCNPX, Monte Carlo particle transport code for use in shielding designs by the Midwest Proton Therapy Center.
Simwalk. Genetics and pedigrees HPA has optimized the serial (single processor) version of the Simwalk program, used by the Department of Medical Genetics to study the genetic components of diseases (such as alcoholism). This decreased the time required to complete an analysis by 70%. HPA staff then increased the maximum number of affected descended from the pedigree founders in non-parametric linkage analysis from 20 to 30.
CRIMap
HPA worked with the
Medical Genetics Department to extend the capabilities of a
supercomputer version of CRIMap.
SHELX This program, used by the Department of Molecular Biology to compute protein structure, has been ported to the Research SP and AVIDD cluster.
RegRegion
HPA staff have
implemented a parallel version of a code used by the IUB Department of
Biology to simulate the evolution of the modular structure of a
two-function regulatory region of a locus, before and after duplication.
INGEN IT Core room
MSB 223 has been renovated for use by INGEN researchers.
GeneIndex. HPA has worked with
the Center for Genomics and Bioinformatics to create a new software
application. GeneIndex indexes and matches patterns in DNA sequences,
and is used in inference of amino acid sequences.
NBPack.
implemented and optimized a serial version of a code based upon an
advanced data structure, being used by the Center for Cell and Virus
Theory to study processes occurring over many different scales of space
and time.
fastDNAml. HPA staff have
implemented a parallel version of this code, used by the Department of
Biology and the School of Informatics to compute most likely
phylogenies, and has ported it to multiple environments.
PerfectMatch
Working with the Department
of Medicine's Biostatistics Division, the IT core is working to improve
detection of weak signals in microarray data.
Extracting
genetic information from MEDLINE.
Based on trie data structure, we are working with the Biochemistry
Department and the Computer Science Department to extract gene
information in MEDLINE database. This part will eventually be
incorporated into the BioMap application.
Publications by HPA Staff
Pavlis, G. L., P. Wang, F.L. Vernon, "Array Processing of Large Aperture Seismic Arrays with Generalized Cross-Correlation" (Invited talk to special session on signal processing of the meeting of the American Geophysical Union), Dec 2005
Ellisman, M., M. Brady, D. Hart, F.P. Lin, M. Muller, L. Smarr, "The Emerging Role of Biogrids", Communications of the ACM, 47:11, 2004.
Horowitz, C. J., M. A. Perez-Garcia, D. K. Berry, J. Piekarewicz, "Dynamical response of the nuclear pasta in neutron star crusts", Physical Review C 72, 035801 (2005)
Horowitz, C. J., M. A. Perez-Garcia, J. Carriere, D. K. Berry, J. Piekarewicz, "Nonuniform neutron-rich matter and coherent neutrino scattering", Physical Review C 70, 065806 (2004)
Stewart, C.A. R. Keller, R. Repasky, M. Hess, D. Hart, M. Mueller, R. Sheppard, U. Woessner, M. Aumueller, Huian Li, D.K. Berry, J. Colbourne, "A global grid for analysis of arthropod evolution", in: R. Buyya (ed.) Proceedings: Fifth IEEE/ACM International Workshop on Grid Computing, pp. 328-337, Pittsburgh, PA 2004.
Stewart, C.A., D. Hart, R.W. Sheppard, H. Li, R. Cruise, V. Moskvin, L. Papiez, Parallel computing in biomedical research and the search for peta-scale biomedical applications, pp. 719-716 In: G.R. Joubert, W.E. Nagel, F.J. Peters, and W.V. Walter (eds). Parallel computing: software technology, algorithms, architectures & applicaitons. Advances in Parallel Computing Vol. 13. Elsevier, Oxford. 2004.
Zina Ben Miled, Jin Liu, Omran Bukhres, Huian Li, J. Martin, C. Balagopalakrishna, Robert Oppelt, Use and Maintenance of Histograms for Large Scientific Database Access Planning: A Case Study of a Pharmaceutical Data Repository, Intelligent Information Systems, Vol 23-2, pp. 145-178, September, 2004.
Wang P., Turner G., Simms S., Hart D., Papakhian M., and Stewart A. C., "1 TFLOPS Achieved with a Geographically Distributed Linux Cluster", Part 4, Cluster and Grid Computing, High Performance Computing: Paradigm and Infrastructure, John Wiley & Sons, 2004
Wang P., Turner G., Lauer A. D., Allen M., Simms S., Hart D., Papakhian M., and Stewart A. C., "LINPACK Performance on a Geographically Distributed Linux Cluster", Proceedings of the 18th International Parallel and Distributed Processing Symposium, IPDPS 2004, Santa Fe, New Mexico, April 26-30, 2004
Kamal Kumar, Mathew J. Palakal, Snehasis Mukhopadhyay, Mathew J. Stephens, Huian Li, "BioMap: Toward the Development of a Knowledge Base of Biomedical Literature," 19th Annual ACM Symposium on Applied Computing, Cyprus, March 14 -17, 2004
Stewart, C.A., R. Repasky, J. Colbourne, D. Hart, D. K. Berry, R. Sheppard, E. Wernert, M. Papakhian, J. N. Huffman. "Global analysis of arthropod evolution", Proceedings of I-Light Applications Workshop, 9 March 2004, Indianapolis, IN.
Wernert, E. A., D. K. Berry, J. N. Huffman, C. A. Stewart, "Tree3D - A System for Temporal and Comparative Analysis of Phylogenetic Trees." Proceedings of IEEE Information Visualization 2003, Seattle, WA, October 2003.
Stewart, C.A., Repasky, R., Hart, D., Papakhian, M., Shankar, A., Wernert,E., Arenson, A., and G. Bernbom, "Advanced Information Technology Support For Life Sciences Research." Proceedings of SIGUCCS 2003, San Antonio, TX.
Stewart, C.A., Repasky, R., Hart, D., Papakhian, M., Shankar, A., Wernert,E., Arenson, A., and G. Bernbom. 2003, "Advanced Information Technology Support For Life Sciences Research. " Presented at SIGUCCS 2003, Sept. 21-24, San Antonio, TX.
Hart, D., Grover, D., Liggett, M., Repasky, R., Shields, C., Simms, S., Sweeny, A., and Wang, P, "Distributed Parallel Computing Using Windows Desktop Systems." Proceedings of CLADE 2003, June 20-27, 2003, Seattle, WA.
Pickett, B. K., A. C. Mejia, R. H. Durisen, P. M. Cassen, D. K. Berry, R. P. Link, "The thermal regulation of gravitational instabilities in protoplanetary disks", The Astrophysical Journal, 590:1060-1080, June 20, 2003
Cruise, R.B., Papiez, L.S, "Integral Equation Formulation of a Mixed Diffusion-Jump Model of Elastic Scattering." Nuclear Mathematical and Computational Sciences: A Century in Review, A Century Anew, Gatlinburg, Tennessee, April 6-11, 2003, on CD-ROM, American Nuclear Society, LaGrange Park, IL.
Cruise, R.B., Sheppard, R.W, Moskvin, V.P., "Parallelization of the Penelope Monte Carlo Particle Transport Simulation Package." Nuclear Mathematical and Computational Sciences: A Century in Review, A Century Anew, Gatlinburg, Tennessee, April 6-11, 2003, on CD-ROM, American Nuclear Society, LaGrange Park, IL.
Stewart, C., Hart, D., Berry, D.K., Olsen, G.J., Wernert, E., Fischer, W., "Parallel implementation and performance of fastDNAml - a program for maximum likelihood phylogenetic inference." Proceedings of SC2001. Denver, Colorado, November 2001.
Stewart C., Peebles C.S., Papakhian, M., Samuel, J., Hart, D., Simms, S., "High Performance Computing: Delivering Valuable and Valued Services at Colleges and Universities." Proceedings of SIGUCCS 2001. Portland, Oregon, October 2001.
Presentations by HPA Staff
Gopu A., Grover D., Hart. D, Repasky R., Rinkovsky J., Simms S., Sweeny A., Wang P., " HYDRA: Using Windows Desktop Systems in Distributed Parallel Computing", Booth presentation, Super Computing '05 Conference, Seattle, WA, Nov 18 2005. (Download latest version of presentation)
Gopu A., Grover D., Hart. D, Repasky R., Rinkovsky J., Simms S., Sweeny A., Wang P., " HYDRA: Using Windows Desktop Systems in Distributed Parallel Computing", Workshop presentation, GGF15-CampusGrids, Boston, MA, Oct 2 2005.
Pavlis, G. L., P. Wang, F.L.Vernon, Array Processing of Large Aperture Seismic Arrays with Generalized Cross-Correlation (Invited talk to Special Session on Signal Processing of the American Geophysical Union, Dec. 2005.
Horowitz, C. J., M. A. Perez-Garcia, J. Carriere, D. K. Berry, J. Piekarewicz, "Computer simulation of neutron rich matter using the MDGRAPE-2", presentation at the IU booth at SC2005, Seattle, WA, Nov 14-18, 2005
Berry D. K., "Computer simulation of neutron rich matter using the MDGRAPE-2", NSF TeraGrid Site Visit meeting, May 23, 2005.
Hart, D., High Performance Computing Support at IU, Center for Genomics and Bioinformatics Roundtable, February 3, 2005.
Stewart, C.A., R. Repasky, J. Colbourne, D. Hart, D.K. Berry, R. Sheppard, E. Wernert, M. Papakhian, J.N. Huffman, Global analysis of arthropod evolution. 2nd I-light Symposium, 9 March 2004.
Stewart C., Turner G., Wang P., Hart D., Simms S., Lauer D., Papakhian M., Allen M., Squyres J., and Lumstaine A., "1 Tflops Achieved with Distributed Linux Cluster", poster, SC2003, Phoenix, Arizona, Nov 15-21, 2003
Robert Cruise, "Adaptive Monte Carlo Methods," Department of Radiation Oncology Colloquium, IU School of Medicine, June 14, 2002.
Olsen, G. J., R. Repasky, W. Fischer, D. C. Hart, D. K. Berry, C. A. Stewart, "Parallel implementation of fastDNAml, a program for maximum likelihood estimation of phylogenetic trees", poster, First SIAM Conference on the Life Sciences, Boston, MA, March 6-8, 2002




