Session
HPC Systems Professionals Workshop (HPCSYSPROS19)
Session Chairs
Event TypeWorkshop
W
Datacenter
SIGHPC Workshop
State of the Practice
System Administration
System Maintenance
System Reliability
TimeFriday, 22 November 20198:30am - 12pm
Location405-406-407
DescriptionThe complexity of High Performance Computing (HPC) systems necessitates advanced techniques in system administration, configuration, and engineering and (by proxy) staff who are well-versed on the best practices in this field. HPC Systems Professionals include system engineers, system administrators, network administrators, storage administrators, and operations staff who face problems that are unique to high performance computing systems. While many separate conferences exist for the HPC field and for the systems administration field, none exist that focus specifically on the needs of HPC systems professionals. As such, it can be difficult to find support resources who are able to help with the issues encountered in this specialized field. The ACM SIGHPC SYSPROS Virtual Chapter, the sponsor for this workshop, has been established to provide opportunities to develop and grow relationships among HPC systems administration practitioners and to act as a support resource for them.
This workshop is designed to share best practices for common HPC system deployment and maintenance, to provide a platform to discuss upcoming technologies, and to present state of the practice techniques that increase performance and reliability of systems, and in turn increase researcher and analyst productivity.
http://sighpc-syspros.org/workshops/2019/
This workshop is designed to share best practices for common HPC system deployment and maintenance, to provide a platform to discuss upcoming technologies, and to present state of the practice techniques that increase performance and reliability of systems, and in turn increase researcher and analyst productivity.
http://sighpc-syspros.org/workshops/2019/
Presentations
8:30am - 8:45am | HPC Systems Professionals Workshop (HPCSYSPROS19) | |
8:45am - 9:30am | Chameleon: How to Build a Cloud++ Presenter | |
9:30am - 9:45am | Decoupling OpenHPC Critical Services | |
9:45am - 10:00am | Implementing a Common HPC Environment in a Multi-User Spack Instance | |
10:00am - 10:30am | HPCSYSPROS19 Morning Break | |
10:30am - 10:37am | Arbiter: Dynamically Limiting Resource Consumption on Login Nodes | |
10:37am - 10:44am | Using GUFI in Data Management | |
10:44am - 10:59am | Monitoring HPC Services with CheckMK | |
10:59am - 11:14am | The Road to Devops HPC Cluster Management | |
11:14am - 11:29am | What Deploying MFA Taught Us about Changing Infrastructure | |
11:29am - 11:44am | A Better Way of Scheduling Jobs on HPC Systems: Simultaneous Fair-Share | |
11:44am - 12:00pm | Closing Remarks and Open Discussion |