Agenda is subject to change. Times listed below are in Pacific.
Lesson Materials: https://github.com/sdsc/sdsc-summer-institute-2024
Tuesday, July 30
Preparation day (virtual)
Pacific time |
Session |
9:00 AM – 11:00 AM |
1.0 Preparation Day - Welcome & Orientation Accounts, Login, Environment, Running Jobs and Logging into Expanse User Portal Q&A wrap up |
Monday, August 5
Pacific time |
Main Room Session |
8:00 AM – 8:30 AM |
Check-in & Registration |
8:30 AM - 9:30 AM | Welcome |
9:30 AM - 10:15 AM |
2.1 Parallel Computing Concepts Robert Sinkovits, Director of Education and Training |
10:15 AM – 11:00 AM |
2.2 Hardware Overview All users of advanced CI can benefit from a basic understanding of hardware, to determine which factors affect application performance. Here we give an overview starting from CPUs (processors, cores, hyperthreading, instruction sets), the anatomy of a compute node (sockets, memory, attached devices, accelerators), to an overview of cluster architecture (login and compute nodes, interconnects). We also cover how to obtain hardware information using Linux tools, pseudo-filesystems and commonly used hardware utilization monitoring tools. |
11:00 AM - 11:15 AM |
Break |
11:15 AM – 12:15PM |
2.3 Intermediate Linux Effective use of Linux based compute resources via the command line interface (CLI) can significantly increase researcher productivity. Assuming basic familiarity with the Linux CLI we cover some more advanced concepts with focus on the Bash shell. Among others this includes the filesystem hierarchy, file permissions, symbolic and hard links, wildcards and file globbing, finding commands and files, environment variables and modules, configuration files, aliases, history and tips for effective Bash shell scripting. |
12:15 PM - 1:45 PM | Lunch |
1:45 PM – 2:45 PM |
2.4 Batch Computing |
2:45 PM – 3:00 PM |
Break |
3:00 PM – 4:00 PM |
2.5 Interactive Computing |
4:00 PM - 4:30 PM | Q&A + Wrap-up |
4:45 PM - 7:00 PM |
Evening Reception (Off-Campus) Transportation will be provided. |
Tuesday, August 6
Pacific time |
Main Room Session |
8:00 AM – 8:30 AM |
Check-in & Light Breakfast |
8:30 AM – 9:00 AM |
3.1 Getting Help |
9:00 AM – 10:00 AM |
3.2 Data Management Proper data management is essential for the effective use of advanced CI. This session will cover an overview of file systems, data compression, archives (tar files), checksums and MD5 digests, downloading data using wget and curl, data transfer and long-term storage solutions. |
10:00 AM – 10:15 AM |
Break |
10:15 AM – 11:00 AM |
3.3 Security |
11:00 AM – 12:00 PM |
3.4 Code Migration |
12:00 PM - 1:30 PM |
Lunch |
1:30 PM – 2:45 PM |
3.5 High Throughput Computing |
2:45 PM - 3:00 PM | Break |
3:00 PM - 4:30 PM |
3.6 Linux Tools for File Processing |
4:30 PM - 4:45 PM | Q&A + Wrap-up |
Wednesday, August 7
Pacific time |
Main Room Session |
Breakout Room Session |
8:00 AM – 8:30 AM |
Check-in & Light Breakfast |
|
8:30 AM – 10:00 AM |
4.1a Intro to Git & GitHub |
4.1b Advanced Git & GitHub In today's fast-paced software development world, mastering GitHub and Git is a game-changer. This session will enhance your understanding beyond the basics, introducing advanced techniques to streamline workflows, manage complex projects, and automate tasks. You'll discover how to maximize productivity without compromising quality. By the end of this talk, you'll have a deeper grasp of GitHub's potential and a curiosity to explore tools like GitHub Actions, documentation, and automation features. Join us to elevate your development experience and skills. |
10:00 AM – 10:15 AM |
Break |
|
10:15 AM – 12:30 PM |
4.2a Python for HPC In this session we will introduce four key technologies in the Python ecosystem that provide significant benefits for scientific applications run in supercomputing environments. Previous Python experience is recommended but not required.
|
4.2b Information Visualization Concepts
|
12:30 PM - 2:00 PM | Group Photo Lunch |
|
2:00 PM – 4:30 PM |
4.3a Conducting Scientific Visualization with VTK and Unreal Engine 5 |
4.3b Scalable Machine Learning This session introduces approaches that can be used to perform machine learning at scale. Tools and procedures for executing machine learning techniques on HPC will be presented. Spark will also be covered for scalable data analytics and machine learning. Please note: Knowledge of fundamental machine learning algorithms and techniques is required.
|
4:30 PM - 4:45 PM | Q&A + Wrap-up |
Thursday, August 8
Pacific time |
Main Room Session |
Breakout Room Session |
8:00 AM – 8:30 AM |
Check-in & Light Breakfast |
|
8:30 AM - 9:30 AM |
5.1 Scaling up Interactive Data Analysis in Jupyter Lab: From Laptop to HPC |
|
9:30 AM - 9:45 AM | Break | |
9:45 AM – 12:15 PM |
5.2a Performance Tuning |
5.2b Deep Learning - Part 1 seen tremendous growth and success in the past few years. Deep learning approaches have achieved state-of-the-art performance across many domains, including image classification, speech recognition, and biomedical applications. This session provides an introduction to neural networks and deep learning concepts and approaches. Examples utilizing deep learning will be presented, and hands-on exercises will be covered using Keras. Please note: Knowledge of fundamental machine learning concepts and techniques is required. |
12:15 PM - 1:45 PM | Lunch | |
1:45 PM – 4:30 PM |
5.3a GPU Computing and Programming This session introduces massively parallel computing with graphics processing units (GPUs). The use of GPUs is popular across all scientific domains since GPUs can significantly accelerate time to solution for many computational tasks. Participants will be introduced to essential background of the GPU chip architecture and will learn how to program GPUs via the use of libraries, OpenACC compiler directives, and CUDA programming. The session will incorporate hands-on exercises for participants to acquire the basic skills to use and develop GPU aware applications.
|
5.3b Deep Learning – Part 2 Part 1 by going into more advanced examples. Concepts regarding architecture, layers, and applications will be presented. Additionally, more advanced tutorials and hands-on exercises with larger deep convolutional networks and transfer learning will be executed on GPUs. There will also be a chance to learn Keras more in depth and become familiar with building more flexible models. |
4:30 PM - 4:45 PM |
Q&A + Wrap-up |
|
5:00 PM - 7:00 PM | Dinner at the 15th Floor |
Friday, August 9
Pacific time |
Main Room Session |
|
8:00 AM – 8:30 AM |
Check-in & Light Breakfast |
|
8:30 AM – 11:30 AM |
6.1a Parallel Computing using MPI & Open MP |
6.1b Knowledge Management and Knowledge Graph In this session, we have three connected sections. The first section will help participants understand knowledge management and how to implement it, specifically within the scientific community. It will also highlight the fundamental shift in the machine learning paradigm and how to incorporate knowledge management into daily processes. This section will cover the basic concepts of knowledge management, from ontology development to document management. In the next part, we will cover two fundamental knowledge management techniques that can help users design their knowledge pipelines and improve their daily processes. The first technique focuses on how to use large language models (LLMs) beyond traditional engineering methods and other related technologies.
|
11:30 AM – 11:45 AM |
6.2 Overview of Voyager |
|
11:45 AM - 12:00 PM |
6.3 Overview of COSMOS Mahidhar Tatineni, Director of User Services |
|
12:15 PM – 12:30 PM |
Closing Remarks Robert Sinkovits, Director of Education and Training |