UNIVERSITY OF TEXAS AT AUSTIN
Modern networks (such as InfiniBand and 10GigE) have the capability to provide topology, routing, and network status information at run time. This leads to the following broad challenge: Can next-generation petascale systems provide topology-aware MPI communication, mapping, and scheduling that improve performance and scalability for a range of applications? This challenge leads to the following research questions: 1) What are the topology-aware communication and scheduling requirements of petascale applications? 2) How should a network topology and state management framework be designed with static and dynamic network information? 3) How should topology-aware point-to-point and collective communication schemes (such as broadcast, all-to-all, and all-reduce) be designed in an MPI library? 4) How should topology-aware task mapping and scheduling schemes be designed? and 5) How should a flexible topology information interface be defined and designed? A synergistic and comprehensive research plan, involving computer scientists from The Ohio State University (OSU) and computational scientists from the Texas Advanced Computing Center (TACC) and the San Diego Supercomputer Center (SDSC) at the University of California, San Diego, is proposed to address these challenges. The research will be driven by a set of applications (PSDNS, UCSDH3D, AWM-Olsen, and MPCUGLES) from established NSF computational science researchers running large-scale simulations on the Ranger system and other NSF HEC systems. The transformative impact of the proposed research is the development of topology-aware MPI software and a framework for integrating derived topology information into scheduling in order to maximize petascale application performance. Because the proposed research is a collaborative activity between computer scientists and computational scientists, it will have significant impact in deriving guidelines for designing, deploying, and using next-generation petascale systems.
The proposed research directions and their solutions will be used in the curriculum of the investigators to train graduate and undergraduate students. The established national-scale training and outreach programs at TACC and SDSC will be used to disseminate the results of this research to HEC users and developers. Research results will also be disseminated to the multiple collaborating organizations of the investigators (national laboratories and industry) to enable impact on their software products and applications. The modified MVAPICH2 library (currently used by more than 840 organizations) and the SGE scheduler plug-in will be made available to the HEC community as open source. Case studies from this research will be presented at the MPI Forum (OSU is a member of this forum) to influence the design of the upcoming MPI-3 standard and other MPI libraries.
| AWARD OVERVIEW |
| Award Number |
0926574 |
| Funding Agency |
National Science Foundation |
| Total Award Amount |
$459,940 |
| Project Location - City |
Austin |
| Award Date |
08/13/2009 |
| Project Location - State |
TX |
| Project Status |
More than 50% Completed |
| Project Location - Zip |
78758-4445 |
| Jobs Reported |
0.62 |
| Congressional District |
10 |
| Project Location - Country |
US |
Recipient Information (Grants)
| Recipient Name |
UNIVERSITY OF TEXAS AT AUSTIN |
| Recipient DUNS Number |
170230239 |
| Recipient Address |
101 EAST 27TH STREET STE 4308 |
| Recipient City |
AUSTIN |
| Recipient State |
Texas |
| Recipient Zip |
78712-1500 |
| Recipient Congressional District |
25 |
| Recipient Country |
USA |
| Required to Report Top 5 Highly Compensated Officials |
No |
Projects and Jobs Information
| Project Title |
Collaborative Research: Topology-Aware MPI Communication and Scheduling for Petascale Systems |
| Project Status |
More than 50% Completed |
| Final Project Report Submitted |
No |
| Project Activities Description |
Colleges, Universities, and Professional Schools |
| Quarterly Activities/Project Description |
The overall purpose of the project is to develop a software framework for increasing the message-passing performance of a leading open-source MPI library for use on high-performance Linux supercomputers. To fulfill this goal, the project plans to augment the current MVAPICH2 library to support real-time queries of a high-speed InfiniBand interconnect, so that distributed processes can be scheduled more efficiently to match the underlying switch topology (e.g., fat-tree routing). In addition to extending the functionality of the MPI library itself, a second main thrust of the project is to develop a topology-aware scheduling plug-in within the popular SGE open-source batch system software to maximize overall MPI performance for variable job distribution workloads. During this quarter, the team continued to hold bi-weekly telecons. Based on testing during the previous quarter, the developed topology service was put into production on TACC's newest supercomputer, Stampede. Some results from the effort using Stampede were presented at the HPC Advisory Council conference. A modification to the tree-pruning algorithm was also incorporated for mesh-based topologies. Significant deliverables at the conclusion of this project will be an updated MVAPICH2 library with topology-aware collectives and a novel topology-aware scheduling plug-in for SGE, along with performance impact studies made using these developments. |
| Jobs Created |
0.62 |
| Description of Jobs Created |
Retained RESEARCH SCIENTIST (0.10 FTE) Retained RESEARCH SCIENTIST (0.27 FTE) Created RESEARCH ASSOCIATE (0.25 FTE) |
Purchaser Information (Grants)
| Contracting Office ID |
Not Reported |
| Contracting Office Name |
Not Available |
| Contracting Office Region |
Not Available |
| TAS Major Program |
49-0101 |
| Award Information |
| Award Date |
08/13/2009 |
| Award Number |
0926574 |
| Order Number |
|
| Award Type |
Grants |
| Funding Agency ID |
49 |
| Funding Agency Name |
National Science Foundation |
| Funding Office Name |
Not Available |
| Awarding Agency ID |
49 |
| Awarding Agency Name |
National Science Foundation |
| Amount of Award |
$459,940 |
| Funds Invoiced/Received |
$335,163 |
| Expenditure Amount |
$343,428 |
| Infrastructure Expenditure Amount |
$0 |
| Infrastructure Purpose and Rationale |
Not Reported |
| Infrastructure Point of Contact Name |
Not Reported |
| Infrastructure Point of Contact Email |
Not Reported |
| Infrastructure Point of Contact Phone |
Not Reported |
| Infrastructure Point of Contact Address |
Not Reported |
| Infrastructure Point of Contact City |
Not Reported |
| Infrastructure Point of Contact State |
Not Reported |
| Infrastructure Point of Contact Zip |
Not Reported |
Product or Service Information (Grants)
| Primary Activity Code |
611310 |
| Activity Description |
Colleges, Universities, and Professional Schools |
| Sub-Awards Information |
| Sub-awards to Organizations |
0 |
| Sub-award Amounts to Organizations |
$0 |
| Sub-Awards to Individuals |
0 |
| Sub-Award Amounts to Individuals |
$0 |
| Number of Sub-awards less than $25,000/award |
0 |
| Amount of Sub-awards less than $25,000/award |
$0 |
| Number of payments to vendors greater than $25,000 |
0 |
| Total Amount of payments to vendors greater than $25,000/award |
$0 |
| Number of payments to vendors less than $25,000/award |
2 |
| Total Amount of payments to vendors less than $25,000/award |
$6,239 |
| Location Information |
| Latitude, Longitude |
30° 23' 7", -97° 43' 25" |
| Congressional District |
10 |
| Address 1 |
10100 Burnet Road |
| Address 2 |
|
| City |
Austin |
| County |
Travis |
| State |
TX |
| Zip |
78758-4445 |