<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="6.x">Drupal-Biblio</source-app><ref-type>10</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Judy Qiu</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Generalizing MapReduce as a Unified Cloud and HPC Runtime</style></title><secondary-title><style face="normal" font="default" size="100%">Petascale Data Analytics: Challenges, and Opportunities Workshop(PDAC-11) at The International Conference for High Performance Computing, Networking, Storage and Analysis (SC11)</style></secondary-title><tertiary-title><style face="normal" font="default" size="100%">Proceedings of the Petascale Data Analytics: Challenges, and Opportunities </style></tertiary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Cloud</style></keyword><keyword><style  face="normal" font="default" size="100%">data analysis</style></keyword><keyword><style  face="normal" font="default" size="100%">fault tolerance</style></keyword><keyword><style  face="normal" font="default" size="100%">HPC</style></keyword><keyword><style  face="normal" font="default" size="100%">Interoperability</style></keyword><keyword><style  face="normal" font="default" size="100%">Iterative MapReduce</style></keyword><keyword><style  face="normal" font="default" size="100%">Map Collective</style></keyword><keyword><style  face="normal" font="default" size="100%">scaling performance</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2011</style></year><pub-dates><date><style  face="normal" font="default" size="100%">11/14/2011</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://grids.ucs.indiana.edu/ptliupages/publications/pdac22j-qiu.pdf</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Seattle, Washington, USA</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Computational simulation and analysis were one of the keys to the future in data-intensive science as a “fourth paradigm” of scientific discovery but facing a major challenge as handling the incredible increases in dataset sizes. This requires attractive powerful programming models that address issues of portability with scaling performance and fault tolerance. Further, one must meet these challenges for both computation and storage. We build on the success of our research on Iterative MapReduce with successful prototypes Twister (on HPC) and Twister4Azure (on clouds). We have designed a novel Map Collective runtime which generalizes previous work in both HPC and MapReduce communities, which we hypothesize can be used as the runtime for data analysis (mining) interoperably between clouds and clusters.</style></abstract><work-type><style face="normal" font="default" size="100%">Invited talk</style></work-type></record></records></xml>