Professur Rechnerarchitektur

Papers/Publications/Presentations

Copyright Notice

The documents distributed by this server have been provided by the contributing authors as a means to ensure timely dissemination of scholarly and technical work on a noncommercial basis. Copyright and all rights therein are maintained by the authors or by other copyright holders, notwithstanding that they have offered their works here electronically. It is understood that all persons copying this information will adhere to the terms and constraints invoked by each author's copyright. These works may not be reposted without the explicit permission of the copyright holder.

This page contains the publications of the years 2005 to 2012. Older publications can be found below. For a list dedicated to Computer Architecture Technical Reports, see below.


2012

  • Hoefler, Torsten; Schneider, Timo; : Communication-centric optimizations by dynamically detecting collective operations. In: Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming: pp. 305-306. New Orleans, Louisiana, USA. ACM - ISBN: 978-1-4503-1160-1. New York, NY, USA. 2012.
    Online resource available. BibTex PDF

2011

  • Schneider, T.; Eckelmann, S.; Hoefler, T.; Rehm, W.; : Kernel-Based Offload of Collective Operations - Implementation, Evaluation and Lessons Learned. In: Proceedings of the 17th international conference on Parallel processing - Volume Part II: pp. 264-275. Bordeaux, France. Springer-Verlag - ISBN: 978-3-642-23396-8. Aug. 2011.
    BibTex

2010

  • Strunk, J.; Hiltscher, J.; Rehm, W.; Schick, H.; : Communication Architectures for Run-Time Reconfigurable Modules in a 2-D Mesh on FPGAs. In: Proceedings of the International Conference on ReConFigurable Computing and FPGAs (ReConFig'10). IEEE Computer Society - ISBN: 978-0-7695-4314-7. 2010.
    BibTex
  • Strunk, J.; Heinig, A.; Volkmer, T.; Rehm, W.; Schick, H.; : ACCFS - Virtual File System Support for Host Coupled Run-Time Reconfigurable FPGAs. In: Advances in Parallel Computing, Volume 19, Parallel Computing: From Multicores and GPU's to Petascale, 2010, from Parallel Computing with FPGAs (ParaFPGA) held in conjunction with International Conference on Parallel Computing (ParCo 2009). IOS Press - ISBN: 978-1-60750-529-7. 2010.
    Online resource available. BibTex

2009

  • Strunk, J.; Volkmer, T.; Rehm, W.; Schick, H.; : Design and Performance of a Grid of Asynchronously Clocked Run-Time Reconfigurable Modules on a FPGA. In: Accepted for publication in the Proceedings of the International Conference on ReConFigurable Computing and FPGAs (ReConFig'09). IEEE Computer Society - ISBN: 978-0-7695-3917-1. Los Alamitos, CA, USA. 2009.
    Online resource available. BibTex
  • Rinke, S.; Mehlan, T.; Rehm, W.; : Evaluation of Task Mapping Strategies for Regular Network Topologies. In: International Conference on Parallel Computing, ParCo 2009. Lyon. - ISBN: 978-1-60750-529-7. September 2009.
    Online resource available. BibTex
  • Strunk, J.; Volkmer, T.; Rehm, W.; Schick, H.; : An on Chip Network inside a FPGA for Run-Time Reconfigurable Low Latency Grid Communication. In: Proceedings of Euromicro Conference on Digital System Design (DSD). IEEE CS - ISBN: 978-0-7695-3782-5. 2009.
    BibTex
  • Strunk, J.; Volkmer, T.; Stephan, K.; Rehm, W.; Schick, H.; : Impact of Run-Time Reconfiguration on Design and Speed - A Case Study Based on a Grid of Run-Time Reconfigurable Modules inside a FPGA. In: Proceedings of Reconfigurable Architectures Workshop in conjunction with IPDPS. RAW / IPDPS - ISBN: 978-1-4244-3750-4 - ISSN: 1530-2075. 2009.
    BibTex
  • Heinig, A.; Strunk, J.; Rehm, W.; Schick, H.; : ACCFS - Operating System Integration of Computational Accelerators Using a VFS Approach. In: Proceedings of Applied Reconfigurable Computing (ARC). LNCS - ISBN: 978-3-642-00640-1 - ISSN: 0302-9743. 2009.
    BibTex
  • Strunk, J.; Heinig, A.; Volkmer, T.; Rehm, W.; Schick, H.; : Run-Time Reconfiguration for HyperTransport coupled FPGAs using ACCFS. In: Proceedings of First International Workshop on HyperTransport Research and Applications. WHTRA - ISBN: 978-3-00-027249-3. 2009.
    BibTex

2008

  • Hoefler, T.; Schneider, T.; Lumsdaine, A.; : Multistage Switches are not Crossbars: Effects of Static Routing in High-Performance Networks. In: Proceedings of the 2008 IEEE International Conference on Cluster Computing (Cluster 2008). IEEE. 2008.
    BibTex
  • Mietke, F.; : Experiences with the Chemnitzer High Performance Linux Cluster (CHiC). In: JSC NIC seminar series, presented in Jülich. June 2008.
    BibTex PDF
  • Schneider, T.; Volkmer, T.; Rehm, W.; Hoefler, T.; : Code optimization for Cell BE - Opportunities for ABINIT. In: Submitted to Scientific Programming Special Issue on HIGH PERFORMANCE COMPUTING ON CELL B.E. PROCESSORS. IOS Press. 2008.
    Online resource available. BibTex
  • Hoefler, T.; Schneider, T.; Lumsdaine, A.; : Accurately Measuring Collective Operations at Massive Scale. In: Proceedings of the 7th International Workshop on Performance Modeling, Evaluation, and Optimization of Ubiquitous Computing and Networked Systems (PMEO-UCNS 2008) at the IPDPS08. 2008.
    BibTex, PMEO-UCNS 2008, IPDPS08
  • Schneider, T.; Hoefler, T.; Wunderlich, S.; Mehlan, T.; Rehm, W.; : An optimized ZGEMM implementation for the Cell BE. In: Proceedings of the 9th Workshop on Parallel Systems and Algorithms (PASA) 2008, February 26, 2008 in Conjunction with 21st International Conference on Architecture of Computing Systems (ARCS). Dresden, Germany. Köllen - ISBN: 978-3-88579-218-5 - ISSN: 1617-5468. February 25th to 28th, 2008.
    BibTex PDF
  • Hoefler, T.; Lumsdaine, A.; Rehm, W.; : Implementation and performance analysis of non-blocking collective operations for MPI. In: SC '07: Proceedings of the 2007 ACM/IEEE conference on Supercomputing: pp. 1-10. Reno, Nevada. ACM - ISBN: 978-1-59593-764-3. New York, NY, USA. 2008.
    Online resource available. BibTex PDF

2007

Torsten Hoefler, Marek Mosch, Torsten Mehlan, Wolfgang Rehm:

  • CollGM - A Myrinet/GM optimized collective component for Open MPI
    In Proceedings of the 3rd Workshop KiCC (Kommunikation in Clustern und Clusterverbundsystemen), 2007, Aachen, Germany, URN: urn:nbn:de:hbz:82-opus-21137
    PDF Document (1)

Frank Mietke, Torsten Mehlan, Torsten Hoefler, Wolfgang Rehm:

  • Design and Evaluation of a 2048 Core Cluster System
    In Proceedings of the 3rd Workshop KiCC (Kommunikation in Clustern und Clusterverbundsystemen), 2007, Aachen, Germany, URN: urn:nbn:de:hbz:82-opus-21137
    PDF Document (0)

Torsten Hoefler and Prabhanjan Kambadur and Richard L. Graham and Galan Shipman and Andrew Lumsdaine:

  • A Case for Standard Non-Blocking Collective Operations
    In Proceedings of EuroPVM/MPI 2007, Paris, France, October 2007, Springer, ISSN: 0302-9743, ISBN: 978-3-540-75415-2
    PDF Document (0)

Timo Schneider and Simon Wunderlich and Wolfgang Rehm and Torsten Hoefler and Heiko Schick:

  • Code optimization for Cell/B.E. - Opportunities for ABINIT - a software package for physicists
    Poster at the IBM CAS Software and Systems Engineering Symposium
    October, 2007, Dublin, Ireland
    (PDF Document)

Andreas Heinig and Jochen Strunk and Wolfgang Rehm and Heiko Schick:

  • Heterogeneous Multiprocessing - On a tightly coupled Opteron Cell evaluation platform
    Poster at the IBM CAS Software and Systems Engineering Symposium
    October, 2007, Dublin, Ireland
    (PDF Document)

Frank Mietke:

  • Erfahrungen mit parallelen Dateisystemen
    Presented in Chemnitz, Germany, September 2007, Megware HPC Users Meeting
    PDF Document (0)

Torsten Hoefler and Torsten Mehlan and Andrew Lumsdaine and Wolfgang Rehm:

Frank Mietke:

  • Erfahrungsberichte - HPC im akademischen Umfeld CHiC - TU Chemnitz
    Presented in Böblingen, Germany, June 2007, IBM HPC Clusterworkshop
    PDF Document (0)

Torsten Hoefler and Peter Gottschling and Andrew Lumsdaine and Wolfgang Rehm:

  • Optimizing a Conjugate Gradient Solver with Non-Blocking Collective Operations
    In the Elsevier Journal of Parallel Computing (PARCO), Vol 33, September 2007, ISSN: 0167-8191
    PDF Document (0)

Frank Mietke, Torsten Hoefler, Torsten Mehlan and Wolfgang Rehm:

  • Diskless Cluster und Lustre - Erfahrungsbericht zum CHiC
    Presented in Chemnitz, Germany, April 2007, UNIX Stammtisch 24.04.2007
    PDF Document (0)

Frank Mietke, Torsten Mehlan, Torsten Hoefler and Wolfgang Rehm:

  • Stand HPC Cluster CHiC
    Presented in Leipzig, Germany, April 2007, ZKI Arbeitskreis Supercomputing 19.04.2007
    PDF Document (0)

Frank Mietke:

  • Diskless Cluster und Lustre - Erfahrungsbericht zum CHiC
    Presented in Chemnitz, Germany, March 2007, Chemnitzer Linux-Tage 2007
    PDF Document (0)

Torsten Hoefler, Andre Lichei and Wolfgang Rehm:

  • Low-Overhead LogGP Parameter Assessment for Modern Interconnection Networks
    In Proceedings of the 21st IEEE International Parallel & Distributed Processing Symposium IPDPS 2007, Long Beach, CA, USA, Mar. 2007, ISBN: 1-4244-0909-8
    PDF Document (0)

Torsten Hoefler, Christian Siebert and Wolfgang Rehm:

  • A practically constant-time MPI Broadcast Algorithm for large-scale InfiniBand Clusters with Multicast
    In Proceedings of the 21st IEEE International Parallel & Distributed Processing Symposium IPDPS 2007, Long Beach, CA, USA, Mar. 2007, ISBN: 1-4244-0909-8
    PDF Document (0)

Frank Mietke, Dirk Dunger, Torsten Mehlan, Torsten Hoefler and Wolfgang Rehm:

  • A native InfiniBand Transporter for MySQL Cluster
    In Proceedings of the 2nd Workshop „Kommunikation in Clusterrechnern und Clusterverbundsystemen“ (KiCC'07), Chemnitz, Germany, February 8th, 2007 ISSN 0947-5125
    PDF Document (0)

2006

Torsten Hoefler, Rebecca Janisch and Wolfgang Rehm:

  • Parallel scaling of Teter's minimization for Ab Initio calculations
    In Proceedings, HPCnano Workshop 2006 held in conjunction with the SC06 International Conference for High Performance Computing, Networking, Storage and Analysis, Tampa, USA, November 2006
    PDF Document (0)

Torsten Hoefler, Jeffrey Squyres, Wolfgang Rehm and Andrew Lumsdaine:

  • A Case for Non-Blocking Collective Operations
    In Proceedings, Frontier on High Performance Computing and Networking (FHPCN-06), associated with the ISPA-06, Sorrento, Italy, December 2006, ISBN: 978-3-540-49860-5
    PDF Document (0)

Wolfgang Rehm:

Torsten Hoefler, Jeffrey Squyres, Graham Fagg, George Bosilca, Wolfgang Rehm and Andrew Lumsdaine:

  • A New Approach to MPI Collective Communication Implementations
    In Proceedings, Distributed and Parallel Systems - From Cluster to Grid Computing, presented in Innsbruck, Austria, pages 45-54, Springer, ISBN: 978-0-387-69857-1, Sep. 2006
    PDF Document (0)

Torsten Hoefler, Peter Gottschling, Wolfgang Rehm and Andrew Lumsdaine:

  • Optimizing a Conjugate Gradient Solver with Non Blocking Collective Operations
    In Proceedings, EuroPVM/MPI 2006, special session ParSim 2006, Bonn, September 2006, ISSN: 0302-9743, ISBN: 3-540-39110-X
    PDF Document (0)

Torsten Mehlan, Jochen Strunk, Torsten Hoefler, Frank Mietke and Wolfgang Rehm:

  • IRS - A portable Interface for Reconfigurable Systems
    In Proceedings, 5th International Symposium on Parallel Computing in Electrical Engineering, Bialystok, September 2006, ISBN: 0-7695-2554-7
    PDF Document (0)

Torsten Hoefler, Carsten Viertel, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:

  • Assessing Single-Message and Multi-Node Communication Performance of InfiniBand
    In Proceedings, 5th International Symposium on Parallel Computing in Electrical Engineering, Bialystok, September 2006, ISBN: 0-7695-2554-7
    PDF Document (0)

Torsten Hoefler, Mirko Reinhardt, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:

  • Low Overhead Ethernet Communication for Open MPI on Linux Clusters
    CSR-06-06, July, 2006, Chemnitz, ISSN 0947-5125
    PDF Document (0)

Frank Mietke, Robert Rex, Robert Baumgartl, Torsten Hoefler, Torsten Mehlan and Wolfgang Rehm:

  • Analysis of the Memory Registration Process in the Mellanox InfiniBand Software Stack
    In Proceedings, European Conference on Parallel Computing (Euro-Par 2006), Dresden, September 2006, ISBN: 3-540-37783-2
    PDF Document (0)

Robert Rex, Frank Mietke, Christoph Raisch, Hoang-Nam Nguyen and Wolfgang Rehm:

  • Improving Communication Performance on InfiniBand by Using Efficient Data Placement Strategies
    In Proceedings, International Conference on Cluster Computing (Cluster 2006), Barcelona, September 2006, ISBN: 1-4244-0328-6
    PDF Document (0)

Torsten Hoefler, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:

  • LogfP - A Model for small Messages in InfiniBand
    In Proceedings, 20th International Parallel and Distributed Processing Symposium IPDPS 2006 (PMEO-PDS 06), Greece, April 2006, ISBN: 1-4244-0054-6
    PDF Document (0)

Torsten Hoefler, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:

  • Fast Barrier Synchronization for InfiniBand
    In Proceedings of the Workshop on Communication Architecture for Clusters (CAC06), held in conjunction with IEEE International Parallel and Distributed Processing Symposium (IPDPS), Rhodes Island, Greece, April 2006, ISBN: 1-4244-0054-6
    PDF Document (0)

Torsten Hoefler, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:

  • Adding Low-Cost Hardware Barrier Support to Small Commodity Clusters
    In Proceedings of 19th International Conference on Architecture and Computing Systems - ARCS'06, Frankfurt, Germany, March 2006, ISBN: 3-88579-175-7
    PDF Document (0)

2005

Torsten Hoefler, Rebecca Janisch and Wolfgang Rehm:

  • A Performance Analysis of ABINIT on a Cluster System
    Published in Lecture Notes in Computational Science and Engineering, Springer, ISBN 3-540-33539-0, 2005
    PDF Document (0)

Wolfgang Rehm (Ed.):

  • Kommunikation in Clusterrechnern und Clusterverbundsystemen
    Proceedings of the 1st Workshop Kommunikation in Clusterrechnern und Clusterverbundsystemen (KiCC 2005), CSR-05-03, Chemnitz, Germany, November 29th, 2005, ISSN 0947-5125

Torsten Hoefler, Torsten Mehlan, Frank Mietke, Wolfgang Rehm:

  • A practical Approach to the Rating of Barrier Algorithms using the LogP Model and Open MPI
    In Proceedings of the International Workshop on Performance Evaluation of Networks for Parallel, Cluster and Grid Computing Systems (PEN-PCGCS'05) in conjunction with the 2005 International Conference on Parallel Processing (ICPP-05), Univ. of Oslo, Norway, June 2005, ISBN: 0-7695-2381-1
    PDF Document (0)

Torsten Mehlan, Torsten Hoefler, Frank Mietke and Wolfgang Rehm:

  • Integration of the SISCI Shared Memory Interface into Open MPI
    In Proceedings of the 1st Workshop Kommunikation in Clusterrechnern und Clusterverbundsystemen (KiCC'05), CSR-05-03, Chemnitz, Germany, November 29th, 2005 ISSN 0947-5125
    PDF Document (0)

Frank Mietke, Robert Rex, Torsten Hoefler, Torsten Mehlan and Wolfgang Rehm:

  • Reducing the Impact of Memory Registration in InfiniBand™
    In Proceedings of the 1st Workshop Kommunikation in Clusterrechnern und Clusterverbundsystemen (KiCC'05), CSR-05-03, Chemnitz, Germany, November 29th, 2005 ISSN 0947-5125
    PDF Document (0)

Torsten Hoefler, Jeffrey M. Squyres, Torsten Mehlan, Frank Mietke and Wolfgang Rehm:

  • Implementing a Hardware-Based Barrier in Open MPI
    In Proceedings of the 1st Workshop Kommunikation in Clusterrechnern und Clusterverbundsystemen (KiCC'05), CSR-05-03, Chemnitz, Germany, November 29th, 2005 ISSN 0947-5125
    PDF Document (0)

Torsten Hoefler and Wolfgang Rehm:

  • A Communication Model for Small Messages with InfiniBand™
    In Proceedings of the Parallel-Algorithmen, -Rechnerstrukturen und -Systemsoftware (PARS) Workshop 2005, Lübeck, June 2005, ISSN 0177-0454
    PDF Document (0)

Frank Mietke, Marco Steiger, Torsten Mehlan, Torsten Hoefler and Wolfgang Rehm:

  • SHIBA - Shared Memory Support for InfiniBand™ MPICH2 Device
    In Proceedings of the Parallel-Algorithmen, -Rechnerstrukturen und -Systemsoftware (PARS) Workshop 2005, Lübeck, June 2005, ISSN 0177-0454
    PDF Document (0)

Other Stuff and Archive