Lecture Notes in Computer Science 7484
Commenced Publication in 1973 Founding and Former Series Editors:
Gerhard Goos, Juris Hartmanis, and Jan van Leeuwen
Editorial Board
David Hutchison
Lancaster University, UK Takeo Kanade
Carnegie Mellon University, Pittsburgh, PA, USA Josef Kittler
University of Surrey, Guildford, UK Jon M. Kleinberg
Cornell University, Ithaca, NY, USA Alfred Kobsa
University of California, Irvine, CA, USA Friedemann Mattern
ETH Zurich, Switzerland John C. Mitchell
Stanford University, CA, USA Moni Naor
Weizmann Institute of Science, Rehovot, Israel Oscar Nierstrasz
University of Bern, Switzerland C. Pandu Rangan
Indian Institute of Technology, Madras, India Bernhard Steffen
TU Dortmund University, Germany Madhu Sudan
Microsoft Research, Cambridge, MA, USA Demetri Terzopoulos
University of California, Los Angeles, CA, USA Doug Tygar
University of California, Berkeley, CA, USA Gerhard Weikum
Max Planck Institute for Informatics, Saarbruecken, Germany
Christos Kaklamanis
Theodore Papatheodorou Paul G. Spirakis (Eds.)
Euro-Par 2012
Parallel Processing
18th International Conference, Euro-Par 2012 Rhodes Island, Greece, August 27-31, 2012 Proceedings
1 3
Volume Editors Christos Kaklamanis University of Patras
Computer Technology Institute and Press "Diophantus"
N. Kazantzaki 26504 Rio, Greece
E-mail: kakl@ceid.upatras.gr Theodore Papatheodorou University of Patras University Building B 26504 Rio, Greece
E-mail: tsp@hpclab.ceid.upatras.gr Paul G. Spirakis
University of Patras
Computer Technology Institute and Press "Diophantus"
N. Kazantzaki 26504 Rio, Greece E-mail: spirakis@cti.gr
ISSN 0302-9743 e-ISSN 1611-3349
ISBN 978-3-642-32819-0 e-ISBN 978-3-642-32820-6 DOI 10.1007/978-3-642-32820-6
Springer Heidelberg Dordrecht London New York Library of Congress Control Number: 2012944429
CR Subject Classification (1998): D.1.3, D.3.3-4, C.1.4, D.4, C.4, C.2, G.1.0, C.3, H.3, I.6, I.2.6, F.1.2, H.2.8
LNCS Sublibrary: SL 1 – Theoretical Computer Science and General Issues
© Springer-Verlag Berlin Heidelberg 2012
This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, re-use of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer. Violations are liable to prosecution under the German Copyright Law.
The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use.
Typesetting: Camera-ready by author, data conversion by Scientific Publishing Services, Chennai, India Printed on acid-free paper
Springer is part of Springer Science+Business Media (www.springer.com)
Preface
Euro-Par is an annual series of international conferences dedicated to the promotion and advancement of all aspects of parallel and distributed computing.
Euro-Par covers a wide spectrum of topics from algorithms and theory to software technology and hardware-related issues, with application areas ranging from scientific to mobile and cloud computing.
Euro-Par provides a forum for the introduction, presentation, and discussion of the latest scientific and technical advances, extending the frontier of both the state of the art and the state of the practice.
The main audience of Euro-Par are researchers in academic institutions, gov- ernment laboratories, and industrial organizations. Euro-Par’s objective is to be the primary choice of such professionals for the presentation of new results in their specific areas. As a wide-spectrum conference, Euro-Par fosters the synergy of different topics in parallel and distributed computing. Of special interest are applications that demonstrate the effectiveness of the main Euro-Par topics.
In addition, Euro-Par conferences provide a platform for a number of ac- companying, technical workshops. Thus, smaller and emerging communities can meet and develop more focussed topics or as-yet less established topics.
Euro-Par 2012 was the 18th conference in the Euro-Par series, and was orga- nized by CTI (Computer Technology Institute and Press “Diophantus”). Previ- ous Euro-Par conferences took place in Stockholm, Lyon, Passau, Southampton, Toulouse, Munich, Manchester, Padderborn, Klagenfurt, Pisa, Lisbon, Dresden, Rennes, Las Palmas, Delft, Ischia, and Bordeaux. Next year the conference will take place in Aachen, Germany. More information on the Euro-Par conference series and organization is available on the websitehttp://www.europar.org
The conference was organized in 16 topics. The paper review process for each topic was managed and supervised by a committee of at least four persons: a Global Chair, a Local Chair, and two members. Some specific topics with a high number of submissions were managed by a larger committee with more members.
The final decisions on the acceptance or rejection of the submitted papers were made in a meeting of the Conference Co-chairs and Local Chairs of the topics.
The call for papers attracted a total of 228 submissions, representing 44 countries (based on the corresponding authors’ countries). A total of 873 review reports were collected, which makes an average of 3.83 review reports per paper.
In total 75 papers were selected as regular papers to be presented at the confer- ence and included in the conference proceedings, representing 29 countries from all continents, and yielding an acceptance rate of 32.9%. Three papers were se- lected as distinguished papers. These papers, which were presented in a separate session, are:
1. Ricardo J. Dias, Tiago M. Vale, and Jo˜ao M. S. Louren¸co “Efficient Support for In-Place Metadata in Transactional Memory”
VI Preface
2. Wesley Bland, Peng Du, Aurelien Bouteiller, Thomas Herault, George Bosilca, and Jack Dongarra “A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI”
3. Konstantinos Christodoulopoulos, Marco Ruffini, Donal O’Mahony, and Kostas Katrinis “Topology Configuration in Hybrid EPS/OCS Interconnects”
Euro-Par 2012 was very happy to present three invited speakers of high inter- national reputation, who discussed important developments in very interesting areas of parallel and distributed computing:
1. Ewa Deelman (Information Sciences Institute, University of Southern Cali- fornia, USA)
2. Burkhard Monien (University of Paderborn, Germany) 3. Thomas Schulthess (CSCS, ETH Zurich, Switzerland)
In this edition, 11 workshops were held in conjunction with the main track of the conference. These workshops were:
1. 10th International Workshop on Algorithms, Models and Tools for Parallel Computing on Heterogeneous Platforms (Heteropar)
2. 7th Workshop on Virtualization in High-Performance Cloud Computing (VHPC)
3. 5th Workshop on Unconventional High-Performance Computing (UCHPC) 4. 5th Workshop on Productivity and Performance (PROPER)
5. Third Workshop on High-Performance Bioinformatics and Biomedicine (HiBB)
6. Workshop on Resiliency in High-Performance Computing (Resilience) 7. CoreGRID/ERCIM Workshop on Grids, Clouds, and P2P Computing (CGWS) 8. First Workshop on Big Data Management in Clouds (BDMC)
9. Workshop on Architecture and Systems Software for Data Intensive Super- computing
10. First Workshop on On-chip Memory Hierarchies and Interconnects: Organi- zation, Management and Implementation (OMHI)
11. Paraphrase Workshop
The 18th Euro-Par conference in Rhodes was made possible thanks to the support of many individuals and organizations. Special thanks are due to the authors of all the submitted papers, the members of the Topic Committees, and all the reviewers in all topics, for their contributions to the success of the conference. We also thank the members of the Organizing Committee. We are grateful to the members of the Euro-Par Steering Committee for their support.
We acknowledge the help we had from Emmanuel Jeannot of the organization of Euro-Par 2011. It was our pleasure and honor to organize and host Euro-Par 2012 in Rhodes. We hope all the participants enjoyed the technical program and the social events organized during the conference.
August 2012 Christos Kaklamanis
Theodore Papatheodorou Paul Spirakis
Organization
Euro-Par Steering Committee
Chair
Chris Lengauer University of Passau, Germany
Vice-Chair
Luc Boug´e ENS Cachan, France
European Representatives
Jos´e Cunha New University of Lisbon, Portugal Marco Danelutto University of Pisa, Italy
Emmanuel Jeannot LaBRI-INRIA, France
Christos Kaklamanis Computer Technology Institute and Press “Diophantus”, Greece Paul Kelly Imperial College, UK
Thomas Ludwig University of Hamburg, Germany
Emilio Luque Autonomous University of Barcelona, Spain Tom`as Margalef Autonomous University of Barcelona, Spain Wolfgang Nagel Dresden University of Technology, Germany Rizos Sakellariou University of Manchester, UK
Henk Sips Delft University of Technology, The Netherlands Domenico Talia University of Calabria, Italy
Honorary Members
Ron Perrott Queen’s University Belfast, UK
Karl Dieter Reinartz University of Erlangen-Nuremberg, Germany
Observer
Felix Wolf RWTH Aachen, Germany
Euro-Par 2012 Organization
Conference Co-chairs
Christos Kaklamanis CTI and University of Patras, Greece Theodore Papatheodorou University of Patras, Greece
Paul Spirakis CTI and University of Patras, Greece
VIII Organization
Workshop Co-chairs
Luc Boug´e ENS Cachan, France
Ioannis Caragiannis CTI and University of Patras, Greece
Local Organizing Committee
Katerina Antonopoulou CTI, Greece
Stavros Athanassopoulos CTI and University of Patras, Greece Rozina Efstathiadou CTI, Greece
Lena Gourdoupi CTI, Greece
Panagiotis Kanellopoulos CTI and University of Patras, Greece Evi Papaioannou CTI and University of Patras, Greece
Euro-Par 2012 Program Committee
Topic 1: Support Tools and Environments Global Chair
Omer Rana Cardiff University, UK
Local Chair
Marios Dikaiakos University of Cyprus, Cyprus Members
Daniel Katz University of Chicago, USA Christine Morin INRIA, France
Topic 2: Performance Prediction and Evaluation Global Chair
Allen Malony University of Oregon, USA
Local Chair
Helen Karatza Aristotle University of Thessaloniki, Greece Members
William Knottenbelt Imperial College London, UK
Sally McKee Chalmers University of Technology, Sweden
Topic 3: Scheduling and Load Balancing Global Chair
Denis Trystram Grenoble Institute of Technology, France
Organization IX
Local Chair
Ioannis Milis Athens University of Economics and Business, Greece
Members
Zhihui Du Tsinghua University, China Uwe Schwiegelshohn TU Dortmund, Germany
Topic 4: High-Performance Architecture and Compilers Global Chair
Alex Veidenbaum University of California, USA
Local Chair
Nektarios Koziris National Technical University of Athens, Greece
Members
Avi Mendelson Microsoft, Israel
Toshinori Sato Kyushu University, Japan
Topic 5: Parallel and Distributed Data Management Global Chair
Domenico Talia University of Calabria, Italy
Local Chair
Alex Delis University of Athens, Greece
Members
Haimonti Dutta Columbia University, USA
Arkady Zaslavsky Lulea University of Technology, Sweden and CSIRO, Australia
Topic 6: Grid, Cluster and Cloud Computing Global Chair
Erik Elmroth Umea University, Sweden
Local Chair
Paraskevi Fragopoulou FORTH, Greece
X Organization
Members
Artur Andrzejak Heidelberg University, Germany
Ivona Brandic Vienna University of Technology, Austria Karim Djemame University of Leeds, UK
Paolo Romano INESC-ID, Portugal
Topic 7: Peer-to-Peer Computing Global Chair
Alberto Montresor University of Trento, Italy
Local Chair
Evaggelia Pitoura University of Ioannina, Greece Members
Anwitaman Datta Nanyang Technological University, Singapore Spyros Voulgaris Vrije Universiteit Amsterdam, The Netherlands
Topic 8: Distributed Systems and Algorithms Global Chair
Andrzej Goscinski Deakin University, Australia
Local Chair
Marios Mavronicolas University of Cyprus, Cyprus Members
Weisong Shi Wayne State University, USA
Teo Yong Meng National University of Singapore, Singapore
Topic 9: Parallel and Distributed Programming Global Chair
Sergei Gorlatch University of M¨unster, Germany
Local Chair
Rizos Sakellariou University of Manchester, UK Members
Marco Danelutto University of Pisa, Italy
Thilo Kielmann Vrije Universiteit Amsterdam, The Netherlands
Organization XI
Topic 10: Parallel Numerical Algorithms Global Chair
Iain Duff Rutherford Appleton Laboratory, UK
Local Chair
Efstratios Gallopoulos University of Patras, Greece Members
Daniela di Serafino Second University of Naples, Italy
Bora Ucar ENS, France
Topic 11: Multicore and Manycore Programming Global Chair
Eduard Ayguade Technical University of Catalonia, Spain Local Chair
Dionisios Pnevmatikatos Technical University of Crete and FORTH, Greece
Members
Rudolf Eigenmann Purdue University, USA Mikel Luj´an University of Manchester, UK Sabri Pllana University of Vienna, Austria
Topic 12: Theory and Algorithms for Parallel Computation Global Chair
Geppino Pucci University of Padova, Italy
Local Chair
Christos Zaroliagis CTI and University of Patras, Greece Members
Kieran Herley University College Cork, Ireland
Henning Meyerhenke Karlsruhe Institute of Technology, Germany
Topic 13: High-Performance Network and Communication Global Chair
Chris Develder Ghent University, Belgium
XII Organization
Local Chair
Emmanouel Varvarigos CTI and University of Patras, Greece
Members
Admela Jukan Technical University of Braunschweig, Germany Dimitra Simeonidou University of Essex, UK
Topic 14: Mobile and Ubiquitous Computing Global Chair
Paolo Santi IIT-CNR, Italy
Local Chair
Sotiris Nikoletseas CTI and University of Patras, Greece
Members
Cecilia Mascolo University of Cambridge, UK
Thiemo Voigt SICS, Sweden
Topic 15: High-Performance and Scientific Applications Global Chair
Thomas Ludwig University of Hamburg, Germany
Local Chair
Costas Bekas IBM Zurich, Switzerland
Members
Alice Koniges Lawrence Berkeley National Laboratory, USA Kengo Nakajima University of Tokyo, Japan
Topic 16: GPU and Accelerators Computing Global Chair
Alex Ramirez Technical University of Catalonia, Spain
Local Chair
Dimitris Nikolopoulos University of Crete and FORTH, Greece
Organization XIII
Members
David Kaeli Northeastern University, USA Satoshi Matsuoka Tokyo Institute of Technology, Japan
Euro-Par 2012 Referees
Luca Abeni Shoaib Akram Jay Alameda Susanne Albers Marco Aldinucci Ahmed Ali-ElDin Srinivas Aluru
Ganesh Ananthanarayanan Nikos Anastopoulos Eric Angel
Constantinos Marios Angelopoulos Ashiq Anjum
Mohammad Ansari Alexandros Antoniadis Christos Antonopoulos Filipe Araujo
Django Armstrong Cedric Augonnet Win Than Aung Aurangeb Aurangzeb Scott Baden
Rosa M. Badia Hansang Bae Enes Bajrovic Henri Bal
Harish Balasubramanian Michael Bane
Leonardo Bautista Gomez Ewnetu Bayuh Lakew Tom Beach
Vicen¸c Beltran Julien Bernard Carlo Bertolli Rob Bisseling Luiz Bittencourt Filip Blagojevic Fran¸cois Bodin Erik Boman Sara Bouchenak
Steven Brandt Ivan Breskovic Patrick Bridges Christopher Bun Daniele Buono Kevin Burrage Alfredo Buttari Javier Cabezas Rosario Cammarota Sonia Campa Edouard Canot Paul Carpenter Daniel Cederman Eugenio Cesario Nicholas Chaimov Kang Chen Zhongliang Chen Mosharaf Chowdhury Chunbo Chu
Murray Cole Carmela Comito Guojing Cong Fernando Costa Maria Couceiro Adrian Cristal Ruben Cuevas Rumin Yong Cui
Pasqua D’Ambra Gabriele D’Angelo Patrizio Dazzi Usman Dastgeer Ewa Deelman Karen Devine Diego Didona Ngoc Dinh
Julio Dondo Gazzano Nam Duong
Alejandro Duran Ismail El Helw
XIV Organization Kaoutar El Maghraoui Robert Elsaesser Vincent Emeakaroha Toshio Endo
Daniel Espling Montse Farreras Renato Figueiredo Salvatore Filippone Wan Fokkink Alexander F¨olling Agostino Forestiero Edgar Gabriel Giulia Galbiati Saurabh Garg Michael Garland Rong Ge
Bugra Gedik Isaac Gelado Michael Gerndt Aristotelis Giannakos Lee Gillam
Luc Giraud
Sarunas Girdzijauskas Harald Gjermundrod Alfredo Goldman Zeus G´omez Marmolejo Georgios Goumas Anastasios Gounaris Vincent Gramoli Clemens Grelck Christian Grimme Alessio Guerrieri Ajay Gulati
Panagiotis Hadjidoukas Eyad Hailat
Tim Harris Piyush Harsh Masae Hayashi Jiahua He Yuxiong He Bruce Hendrickson Pieter Hijma Torsten Hoefler Matthias Hofmann Christian Hoge
Jonathan Hogg Theus Hossmann Nathanael H¨ubbe Kevin Huck Sascha Hunold Felix Hupfeld Nikolas Ioannou Thomas Jahns Klaus Jansen Aubin Jarry Bahman Javadi Yvon Jegou Ming Jiang
Hideyuki Jitsumoto Fahed Jubair Vana Kalogeraki Yoshikazu Kamoshida Takahiro Katagiri Randy Katz Richard Kavanagh Stamatis Kavvadias Kamer Kaya Gabor Kecskemeti Safia Kedad-Sidhoum Ian Kelley
Paul Kelly
Markus Kemmerling Rajkumar Kettimuthu Le Duy Khanh Peter Kilpatrick Taesu Kim Mariam Kiran Nicolaj Kirchhof Thomas Kirkham Luc Knockaert Takeshi Kodaka Panagiotis Kokkinos
Charalampos Konstantopoulos Ulrich K¨orner
Christos Kotselidis Nektarios Kranitis Michael Kuhn Manaschai Kunaseth Julian Martin Kunkel Krzysztof Kurowski
Organization XV Okwan Kwon
Felix Langner Francis Lau Adrien L`ebre Chee Wai Lee Pierre Lemarinier Hermann Lenhart Ilias Leontiadis Dimitrios Letsios Dong Li
Wubin Li Youhuizi Li John Linford Luong Ba Linh Nicholas Loulloudes Jo˜ao Louren¸co Hatem Ltaief Dajun Lu Drazen Lucanin Giorgio Lucarelli Ewing Lusk Spyros Lyberis Maciej Machowiak Sandya Mannarswamy Osni Marques
Maxime Martinasso Xavier Martorell Naoya Maruyama Lukasz Masko Toni Mastelic Carlo Mastroianni Michael Maurer Gabriele Mencagli Massimiliano Meneghin Mohand Mezmaz George Michael Milan Mihajlovic Timo Minartz Perhaad Mistry Bernd Mohr
Matteo Mordacchini Benjamin Moseley Dheya Mustafa Hironori Nakajo Franco Maria Nardini
Rammohan Narendula Sarfraz Nawaz
Zsolt Nemeth Tung Nguyen Bogdan Nicolae Vincenzo Nicosia Konstantinos Nikas Andy Nisbet Akihiro Nomura Akira Nukada Richard O’Keefe Satoshi Ohshima Stephen Olivier Salvatore Orlando Per-Olov Ostberg Linda Pagli George Pallis Roberto Palmieri Costas Panagiotakis Harris Papadakis Andreas Papadopoulos Manish Parashar Nikos Parlavantzas Jean-Louis Pazat Sebastiano Peluso Raffaele Perego Miquel Pericas Dennis Pfisterer Bernard Philippe Vinicius Pinheiro Timothy Pinkston Polyvios Pratikakis Jan Prins
Bart Puype Nikola Puzovic Thanh Quach Jean-Noel Quintin Yann Radenac M. Mustafa Rafique Kees Reeuwijk Laurent R´eveill`ere Olivier Richard Thomas Ropars Mathis Rosenhauer Barry Rountree
XVI Organization Mema Roussopoulos Krzysztof Rzadca Amit Sabne P. Sadayappan Putt Sakdhnagool Friman S´anchez
Carlos Alberto Alonso Sanches Martin Sandrieser
Vijay Saraswat Hitoshi Sato Kento Sato
Thomas Sauerwald Philip Schill Elad Schiller Scott Schneider Mina Sedaghat Kenshu Seto Aamir Shafi Jawwad Shamsi Rajesh Sharma Sameer Shende Jinsong Shi Jun Shirako Yogesh Simmhan Fabrizio Sivestri Martin Skutella Georgios Smaragdakis Ismael Solis Moreno Michael Spear Jochen Speck Ivor Spence Cliff Stein Mark Stillwell John Stone John Stratton Petter Sv¨ard David Swanson Guangming Tan Yoshio Tanaka Osamu Tatebe Marc Tchiboukdjian Samuel Thibault Alex Tiskin Rub´en Titos Hiroyuki Tomiyama
Massimo Torquati Raul Torres Pedro Trancoso Paolo Trunfio Hong-Linh Truong Konstantinos Tsakalozos Tomoaki Tsumura Bogdan Marius Tudor Rafael Ubal
Yash Ukidave Osman Unsal
Philipp Unterbrunner Jacopo Urbani Marian Vajtersic Rob van Nieuwpoort Ben van Werkhoven Hans Vandierendonck Xavier Vasseur Lu´ıs Veiga Ioannis Venetis Salvatore Venticinque Vassilis Verroios Kees Verstoep Llu´ıs Vilanova Frederic Vivien David Walker Edward Walker John Walters
Lu´ıs Wanderley G´oes Jun Wang
Xinqi Wang Ian Watson Marc Wiedemann Tong Wieqin Adam Wierzbicki Martin Wimmer Justin Wozniak Di Wu
Yong Xia Wei Xing Lei Xu
Gagarine Yaikhom Ayse Yilmazer Yitong Yin Ossama Younis
Organization XVII Matei Zaharia
Mohamed Zahran Jidong Zhai Guoxing Zhan Haibo Zhang Sen Zhang Yunquan Zhang
Zhao Zhang Aqun Zhao Dali Zhao Gengbin Zheng Wei Zheng Eugenio Zimeo Michaela Zimmer
Table of Contents
Invited Talk
Selfish Distributed Optimization. . . . 1 Burkhard Monien and Christian Scheideler
Topic 1: Support Tools and Environments
Introduction. . . . 3 Omer Rana, Marios Dikaiakos, Daniel S. Katz, and Christine Morin Tulipse: A Visualization Framework for User-Guided Parallelization . . . . 4
Yi Wen Wong, Tomasz Dubrownik, Wai Teng Tang,
Wen Jun Tan, Rubing Duan, Rick Siow Mong Goh, Shyh-hao Kuo, Stephen John Turner, and Weng-Fai Wong
Enabling Cloud Interoperability with COMPSs . . . . 16 Fabrizio Marozzo, Francesc Lordan, Roger Rafanell, Daniele Lezzi,
Domenico Talia, and Rosa M. Badia
Pattern-Independent Detection of Manual Collectives in MPI
Programs. . . . 28 Alexandru Calotoiu, Christian Siebert, and Felix Wolf
A Type-Based Approach to Separating Protocol from Application
Logic: A Case Study in Hybrid Computer Programming . . . . 40 Geoffrey C. Hulette, Matthew J. Sottile, and Allen D. Malony
Topic 2: Performance Prediction and Evaluation
Introduction. . . . 52 Allen D. Malony, Helen Karatza, William Knottenbelt, and
Sally McKee
Energy Consumption Modeling for Hybrid Computing. . . . 54 Ami Marowka
HPC File Systems in Wide Area Networks: Understanding the
Performance of Lustre over WAN . . . . 65 Alvaro Aguilera, Michael Kluge, Thomas William, and
Wolfgang E. Nagel
XX Table of Contents
Understanding I/O Performance Using I/O Skeletal Applications . . . . 77 Jeremy Logan, Scott Klasky, Hasan Abbasi, Qing Liu,
George Ostrouchov, Manish Parashar, Norbert Podhorszki, Yuan Tian, and Matthew Wolf
ASK: Adaptive Sampling Kit for Performance Characterization. . . . 89 Pablo de Oliveira Castro, Eric Petit, Jean Christophe Beyler, and
William Jalby
CRAW/P: A Workload Partition Method for the Efficient Parallel
Simulation of Manycores. . . . 102 Shuai Jiao, Paolo Ienne, Xiaochun Ye, Da Wang,
Dongrui Fan, and Ninghui Sun
Topic 3: Scheduling and Load Balancing
Introduction. . . . 115 Denis Trystram, Ioannis Milis, Zhihui Du, and Uwe Schwiegelshohn
Job Scheduling Using Successive Linear Programming Approximations
of a Sparse Model. . . . 116 Stephane Chretien, Jean-Marc Nicod, Laurent Philippe,
Veronika Rehn-Sonigo, and Lamiel Toch
Speed Scaling on Parallel Processors with Migration. . . . 128 Eric Angel, Evripidis Bampis, Fadi Kacem, and Dimitrios Letsios
Dynamic Distributed Scheduling Algorithm for State Space Search. . . . . 141 Ankur Narang, Abhinav Srivastava, Ramnik Jain, and
R.K. Shyamasundar
Using Load Information in Work-Stealing on Distributed Systems with
Non-uniform Communication Latencies . . . . 155 Vladimir Janjic and Kevin Hammond
Energy Efficient Frequency Scaling and Scheduling for Malleable
Tasks. . . . 167 Peter Sanders and Jochen Speck
Scheduling MapReduce Jobs in HPC Clusters . . . . 179 Marcelo Veiga Neves, Tiago Ferreto, and C´esar De Rose
A Job Scheduling Approach for Multi-core Clusters Based on Virtual
Malleability. . . . 191 Gladys Utrera, Siham Tabik, Julita Corbalan, and Jes´us Labarta
Table of Contents XXI
Topic 4: High-Performance Architecture and Compilers
Introduction. . . . 204 Alex Veidenbaum, Nectarios Koziris, Toshinori Sato, and
Avi Mendelson
Dynamic Last-Level Cache Allocation to Reduce Area and Power
Overhead in Directory Coherence Protocols . . . . 206 Mario Lodde, Jose Flich, and Manuel E. Acacio
A Practical Approach to DOACROSS Parallelization. . . . 219 Priya Unnikrishnan, Jun Shirako, Kit Barton, Sanjay Chatterjee,
Raul Silvera, and Vivek Sarkar
Exploiting Semantics of Virtual Memory to Improve the Efficiency of
the On-Chip Memory System. . . . 232 Bin Li, Zhen Fang, Li Zhao, Xiaowei Jiang, Lin Li,
Andrew Herdrich, Ravishankar Iyer, and Srihari Makineni
From Serial Loops to Parallel Execution on Distributed Systems . . . . 246 George Bosilca, Aurelien Bouteiller, Anthony Danalis,
Thomas Herault, and Jack Dongarra
Topic 5: Parallel and Distributed Data Management
Introduction. . . . 258 Domenico Talia, Alex Delis, Haimonti Dutta, and Arkady Zaslavsky
DS-Means: Distributed Data Stream Clustering. . . . 260 Alessio Guerrieri and Alberto Montresor
3D Inverted Index with Cache Sharing for Web Search Engines . . . . 272 Esteban Feuerstein, Veronica Gil-Costa, Mauricio Marin,
Gabriel Tolosa, and Ricardo Baeza-Yates
Quality-of-Service for Consistency of Data Geo-replication in Cloud
Computing. . . . 285 S´ergio Esteves, Jo˜ao Silva, and Lu´ıs Veiga
A Fault-Tolerant Cache Service for Web Search Engines: RADIC
Evaluation. . . . 298 Carlos G´omez-Pantoja, Dolores Rexachs, Mauricio Marin, and
Emilio Luque
XXII Table of Contents
Topic 6: Grid, Cluster and Cloud Computing
Introduction. . . . 311 Erik Elmroth, Paraskevi Fragopoulou, Artur Andrzejak,
Ivona Brandic, Karim Djemame, and Paolo Romano Scalable Reed-Solomon-Based Reliable Local Storage for HPC
Applications on IaaS Clouds. . . . 313 Leonardo Bautista Gomez, Bogdan Nicolae, Naoya Maruyama,
Franck Cappello, and Satoshi Matsuoka
Caching VM Instances for Fast VM Provisioning: A Comparative
Evaluation. . . . 325 Pradipta De, Manish Gupta, Manoj Soni, and Aditya Thatte
Improving Scheduling Performance Using a Q-Learning-Based Leasing
Policy for Clouds . . . . 337 Alexander F¨olling and Matthias Hofmann
Impact of Variable Priced Cloud Resources on Scientific Workflow
Scheduling . . . . 350 Simon Ostermann and Radu Prodan
Topic 7: Peer to Peer Computing
Introduction. . . . 363 Alberto Montresor, Evaggelia Pitoura, Anwitaman Datta, and
Spyros Voulgaris
ID-Replication for Structured Peer-to-Peer Systems . . . . 364 Tallat M. Shafaat, Bilal Ahmad, and Seif Haridi
Changing the Unchoking Policy for an Enhanced Bittorrent . . . . 377 Vaggelis Atlidakis, Mema Roussopoulos, and Alex Delis
Peer-to-Peer Multi-class Boosting. . . . 389 Istv´an Heged˝us, R´obert Busa-Fekete, R´obert Orm´andi,
M´ark Jelasity, and Bal´azs K´egl
Topic 8: Distributed Systems and Algorithms
Introduction. . . . 401 Andrzej Goscinski, Marios Mavronicolas, Weisong Shi, and
Teo Yong Meng
Towards Load Balanced Distributed Transactional Memory . . . . 403 Gokarna Sharma and Costas Busch
Table of Contents XXIII CUDA-For-Clusters: A System for Efficient Execution of CUDA
Kernels on Multi-core Clusters. . . . 415 Raghu Prabhakar, R. Govindarajan, and Matthew J. Thazhuthaveetil From a Store-Collect Object and Ω to Efficient Asynchronous
Consensus . . . . 427 Michel Raynal and Julien Stainer
An Investigation into the Performance of Reduction Algorithms under
Load Imbalance. . . . 439 Petar Marendi´c, Jan Lemeire, Tom Haber, Dean Vuˇcini´c, and
Peter Schelkens
Achieving Reliability in Master-Worker Computing via Evolutionary
Dynamics. . . . 451 Evgenia Christoforou, Antonio Fern´andez Anta, Chryssis Georgiou,
Miguel A. Mosteiro, and Angel (Anxo) S´anchez
Topic 9: Parallel and Distributed Programming
Introduction. . . . 464 Sergei Gorlatch, Rizos Sakellariou, Marco Danelutto, and
Thilo Kielmann
Dynamic Thread Mapping Based on Machine Learning for Transactional
Memory Applications. . . . 465 M´arcio Castro, Lu´ıs Fabr´ıcio Wanderley G´oes,
Luiz Gustavo Fernandes, and Jean-Fran¸cois M´ehaut
A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in
Standard MPI. . . . 477 Wesley Bland, Peng Du, Aurelien Bouteiller, Thomas Herault,
George Bosilca, and Jack Dongarra
Hierarchical Partitioning Algorithm for Scientific Computing on Highly
Heterogeneous CPU + GPU Clusters. . . . 489 David Clarke, Aleksandar Ilic, Alexey Lastovetsky, and Leonel Sousa Encapsulated Synchronization and Load-Balance in Heterogeneous
Programming. . . . 502 Yuri Torres, Arturo Gonzalez-Escribano, and Diego Llanos
Transactional Access to Shared Memory in StarSs, a Task Based
Programming Model . . . . 514 Rahulkumar Gayatri, Rosa M. Badia, Eduard Ayguade,
Mikel Luj´an, and Ian Watson
On-the-Fly Task Execution for Speeding Up Pipelined MapReduce. . . . . 526 Diana Moise, Gabriel Antoniu, and Luc Boug´e
XXIV Table of Contents
Assessing the Performance and Scalability of a Novel Multilevel
K-Nomial Allgather on CORE-Direct Systems . . . . 538 Joshua S. Ladd, Manjunath Gorentla Venkata,
Richard Graham, and Pavel Shamis
Topic 10: Parallel Numerical Algorithms
Introduction. . . . 550 Iain Duff, Efstratios Gallopoulos, Daniela di Serafino, and Bora Ucar Avoiding Communication through a Multilevel LU Factorization. . . . 551
Simplice Donfack, Laura Grigori, and Amal Khabou
Locality Improvement of Data-Parallel Adams–Bashforth Methods
through Block-Based Pipelining of Time Steps. . . . 563 Matthias Korch
Parallel SOR for Solving the Convection Diffusion Equation Using
GPUs with CUDA . . . . 575 Yiannis Cotronis, Elias Konstantinidis, Maria A. Louka, and
Nikolaos M. Missirlis
Topic 11: Multicore and Manycore Programming
Introduction. . . . 587 Eduard Ayguade, Dionisios Pnevmatikatos, Rudolf Eigenmann,
Mikel Luj´an, and Sabri Pllana
Efficient Support for In-Place Metadata in Transactional Memory. . . . 589 Ricardo J. Dias, Tiago M. Vale, and Jo˜ao M. Louren¸co
Folding of Tagged Single Assignment Values for Memory-Efficient
Parallelism. . . . 601 Drago¸s Sbˆırlea, Kathleen Knobe, and Vivek Sarkar
High-Level Support for Pipeline Parallelism on Many-Core
Architectures. . . . 614 Siegfried Benkner, Enes Bajrovic, Erich Marth, Martin Sandrieser,
Raymond Namyst, and Samuel Thibault
Node.Scala: Implicit Parallel Programming for High-Performance Web
Services. . . . 626 Daniele Bonetta, Danilo Ansaloni, Achille Peternier,
Cesare Pautasso, and Walter Binder
Task-Parallel Programming on NUMA Architectures . . . . 638 Christian Terboven, Dirk Schmidl, Tim Cramer, and Dieter an Mey
Table of Contents XXV Speeding Up OpenMP Tasking . . . . 650
Spiros N. Agathos, Nikolaos D. Kallimanis, and Vassilios V. Dimakopoulos
An Efficient Unbounded Lock-Free Queue for Multi-core Systems . . . . 662 Marco Aldinucci, Marco Danelutto, Peter Kilpatrick,
Massimiliano Meneghin, and Massimo Torquati
Topic 12: Theory and Algorithms for Parallel Computation
Introduction. . . . 674 Geppino Pucci, Christos Zaroliagis, Kieran T. Herley, and
Henning Meyerhenke
A Lower Bound Technique for Communication on BSP with Application
to the FFT. . . . 676 Gianfranco Bilardi, Michele Scquizzato, and Francesco Silvestri
A Fast Parallel Algorithm for Minimum-Cost Small Integral Flows . . . . . 688 Andrzej Lingas and Mia Persson
Topic 13: High Performance Network and Communication
Introduction. . . . 700 Chris Develder, Emmanouel Varvarigos, Admela Jukan, and
Dimitra Simeonidou
Topology Configuration in Hybrid EPS/OCS Interconnects. . . . 701 Konstantinos Christodoulopoulos, Marco Ruffini,
Donal O’Mahony, and Kostas Katrinis
Towards an Efficient Fat–Tree Like Topology. . . . 716 D. Berm´udez Garz´on, C. G´omez, M.E. G´omez, P. L´opez, and
J. Duato
An Adaptive, Scalable, and Portable Technique for Speeding Up
MPI-Based Applications . . . . 729 Rosa Filgueira, Malcolm Atkinson, Alberto Nu˜nez, and
Javier Fern´andez
Cost-Effective Contention Avoidance in a CMP with Shared Memory
Controllers. . . . 741 Samuel Rodrigo, Frank Olaf Sem-Jacobsen, Herv´e Tatenguem,
Tor Skeie, and Davide Bertozzi
XXVI Table of Contents
Topic 14: Mobile and Ubiquitous Computing
Introduction. . . . 753 Paolo Santi, Sotiris Nikoletseas, Cecilia Mascolo, and Thiemo Voigt
Watershed-Based Clustering for Energy Efficient Data Gathering in
Wireless Sensor Networks with Mobile Collector . . . . 754 Charalampos Konstantopoulos, Basilis Mamalis,
Grammati Pantziou, and Vasileios Thanasias
Distribution of Liveness Property Connectivity Interval in Selected
Mobility Models of Wireless Ad Hoc Networks. . . . 767 Jerzy Brzezi´nski, Michal Kalewski, Marcin Kosiba, and Marek Libuda
Topic 15: High Performance and Scientific Applications
Introduction. . . . 779 Thomas Ludwig, Costas Bekas, Alice Koniges, and Kengo Nakajima
Memory-Access Optimization of Parallel Molecular Dynamics
Simulation via Dynamic Data Reordering . . . . 781 Manaschai Kunaseth, Ken-ichi Nomura, Hikmet Dursun,
Rajiv K. Kalia, Aiichiro Nakano, and Priya Vashishta
On Analyzing Quality of Data Influences on Performance of Finite
Elements Driven Computational Simulations . . . . 793 Michael Reiter, Hong-Linh Truong, Schahram Dustdar,
Dimka Karastoyanova, Robert Krause, Frank Leymann, and Dieter Pahr
Performance Evaluation and Optimization of Nested High Resolution
Weather Simulations . . . . 805 Preeti Malakar, Vaibhav Saxena, Thomas George, Rashmi Mittal,
Sameer Kumar, Abdul Ghani Naim, and Saiful Azmi bin Hj Husain Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations
on Complex Geometries. . . . 818 Jonas Fietz, Mathias J. Krause, Christian Schulz,
Peter Sanders, and Vincent Heuveline
Topology-Aware Mappings for Large-Scale Eigenvalue Problems . . . . 830 Hasan Metin Aktulga, Chao Yang, Esmond G. Ng,
Pieter Maris, and James P. Vary
Fast and Effective Lossy Compression Algorithms for Scientific
Datasets. . . . 843 Jeremy Iverson, Chandrika Kamath, and George Karypis
Table of Contents XXVII
Topic 16: GPU and Accelerators Computing
Introduction. . . . 857 Alex Ramirez, Dimitrios S. Nikolopoulos, David Kaeli, and
Satoshi Matsuoka
OpenACC — First Experiences with Real-World Applications. . . . 859 Sandra Wienke, Paul Springer, Christian Terboven, and
Dieter an Mey
accULL: An OpenACC Implementation with CUDA and OpenCL
Support . . . . 871 Ruym´an Reyes, Iv´an L´opez-Rodr´ıguez, Juan J. Fumero, and
Francisco de Sande
Understanding the Performance of Concurrent Data Structures on
Graphics Processors. . . . 883 Daniel Cederman, Bapi Chatterjee, and Philippas Tsigas
A New Programming Paradigm for GPGPU. . . . 895 Julio Toss and Thierry Gautier
GPU-Accelerated Asynchronous Error Correction for Mixed Precision
Iterative Refinement. . . . 908 Hartwig Anzt, Piotr Luszczek, Jack Dongarra, and Vincent Heuveline GPURoofline: A Model for Guiding Performance Optimizations on
GPUs . . . . 920 Haipeng Jia, Yunquan Zhang, Guoping Long, Jianliang Xu,
Shengen Yan, and Yan Li
Building a Collision for 75-Round Reduced SHA-1 Using GPU
Clusters . . . . 933 Andrew V. Adinetz and Evgeny A. Grechnikov
GPU-Vote: A Framework for Accelerating Voting Algorithms on
GPU. . . . 945 Gert-Jan van den Braak, Cedric Nugteren, Bart Mesman, and
Henk Corporaal
Author Index. . . . 957