Diploma in HPC System Administration

0 Reviews

Course Description

Computer Architecture

Basic concepts of computer organization, Classes of computer architecture, Processor vs. System architecture, Elements of computer systems , Goals of computer architecture, Memory Hierarchy, CISC vs. RISC architectures. Multi and Many core Processor architecture, Memories and Caches, Cache coherency, Standard IO interfaces GPU elements, overview of the latest Intel processor.


Linux Operating System and Perl

Linux: Processes  Scheduling & synchronization, Memory management, File Systems, Case Study with Linux System Programming, Process, Signals ,Systems Concepts, Startup Files, Linux boot process, Installation of Linux, Disk partition, Controlling and managing  Services, Basic Linux commands, User administration of Linux, Network Configuring, System Configuration Files, Perform System Management, Maintenance and  troubleshooting ,Basic Service Security, Log Management, Network Authentication

Shell Scripting: Introduction to BASH Command Line Interface (CLI) Error Handling, Debugging & Redirection of scripts Control Structure, Loop, Variable & String, Conditional Statement Regular Expressions, Automate Task Using Bash Script, Security patches, Logging & Monitoring using script.

Perl: Control structure and loops, Useful/necessary functions to memorize, Array Functions, Hash Functions, Array and hash manipulation, Inbuilt special variables Regular Expressions basics, File Handling ,Introduction to Modules and Packages, Database Connectivity.


HPC System Administration and Management

Basics of Data Center Design Management

Data center overview, Real life issues on design, Cabinets, Power, cooling, Cable Management, Safety, efficient design and planning a strategy, Collecting the heat, Heat rejection or reuse, Energy use systems, Data Centre Metrics, Best Practices, Fire Protection and Security Systems

Grid computing:

Grid Architecture, Overview of Resource Managers, Overview of Grid Systems.

Design of HPC Cluster – Ecosystem

Requirement Analysis, Building blocks of HPC, Hardware and software selection process, Design of HPC Cluster, Cluster Planning, Architecture and Cluster software, Cluster building tools,  Multicore-architecture, Accelerator cards, Latest trends and technologies in HPC.

HPC System Management and Monitoring

IPMI, HMC, Node resources, processor usage, memory usage, network usage, statistics, network monitoring, Gangila, Collecli, Graphite

Benchmarking, theoretical peak performance, Linpack, HPL bench mark, Tuning HPL,  problem size, block size, process grid PxQ, Micro benchmarking, macro benching. Application benchmarking and check the scalability of the applications.


Python Programming

Introduction to Python, Python basics, Data Types and variables Operators, Looping & Control Structure List, Modules Dictionaries, string Regular Expressions, Functions and Functional Programming, Object Oriented Linux Scripting Environment, Classes, Objects and OOPS concepts, File and Directory Access Permissions, Libraries and Functionality Programming,Writing plugins in Python, data analysis Automation Process, Debugging basics, Task Automation with Python.


Fundamental of Computer Network and Management

Introduction to communication system, issues in Computer Networking, OSI Layers, TCP/IP Models, Discussion of Networking Protocols, IP Addressing and Routing, Hub, Switch, Router, Interconnect networks, Types of Interconnect networks, Gigabit Ethernet, Infiniband, Omni Path Architecture(OPA), types of protocol supported, Communication subnet, Interconnect networks subsystem: HCA, FC ports and other supported accessories, Network monitoring


Storage and Backup Management

Types of Storage, and Protocols, Components of a disk drive, physical disk and factors affecting disk drive performance. RAID level performance and availability considerations, Components and benefits of an intelligent storage system, (DAS) architecture, (SAN) attributes, components, topologies, connectivity options and zoning. FC protocol stack, addressing, flow control, and classes of service. Networked Attached Storage (NAS) components, protocols, IP Storage Area Network (IP SAN) iSCSI, FCIP and FCoE architecture, storage Replication

Parallel File Systems

Introduction to Parallel File Systems, types of Parallel File Systems, PVFS2, Lustre, GPSF, Components, Installation and configuration, benchmarking, comparison of Parallel File Systems, Optimization

Backup management

Backup, Backup tools, Types of backup, backup policies, Archive, retrieve, backup optimization, restore, Hierarchical Storage Management (HSM), Backup media (LTO), Tape library,


Resource Management and Accounting

Resource manager, Batch systems, Scheduler, various open source schedulers in HPC torque, Slurm, SGE, Components of resource manager, installation and configuration of Slurm and Torque, submitting and managing jobs, Writing the batch script , Application level check pointing, Managing nodes, setting server scheduling policies, scheduler integration, Maui, Moab, MPI support, Accounting  records, Gold


Security and Traffic Management

Security Fundamentals, Firewalls, Types of Firewalls, Limitations of firewall, Intrusion Detection And Prevention, Intrusion risks, Security policy, Monitoring  and reporting of  traffics, Traffic shaping, Investigating and verifying detected intrusions, Recovering from, reporting and documenting intrusions, Define the Types of intrusion Prevention Systems, Intrusion prevention system basics, Limitations of  Intrusion Prevention System, Spoof Prevention, Dos, Qos Policy, Web Application Firewall, Packet Signature and Analysis, Virtual Private Networks, Deploy


Hadoop Administration

Hadoop Framework: .What is Hadoop, Why Hadoop , History of Hadoop ,Use Cases of  Hadoop, Hadoop eco system , HDFS, Hadoop Distributed File System, HDFS Architecture , Name Nodes, Data Nodes, Secondary Name Node,  Command Line Interface, Reading and Writing Date, Hadoop on YARN

Map Reduce: Map Operation, Map Reduce Anatomy, Job Submissions, Job Initialization, Task Assignment, Job Completion, Job Scheduling, Job Failures, Shuffle and sort, Word Count Problem, Word Count Flow and Solution, Word Count Flow and Solution.

Hadoop Environment: Setting up a Hadoop Cluster, Cluster specification, Cluster Setup and Installation, Hadoop Configuration, Security in Hadoop (Security System Concepts used in Hadoop, Hadoop Cluster With LDAP), Administering Hadoop, HDFS – Monitoring & Maintenance (Data transfer Between Clusters, Adding and Removing Nodes, Cluster Rebalancing), Hadoop benchmarks


Cloud Services & Security

Cloud Computing: Definition, Characteristics, Components, Cloud provider, SAAS, PAAS, IAAS and other Organizational scenarios of clouds, Administering & Monitoring cloud services, benefits and limitations, Deploy application over cloud. Comparison among SAAS, PAAS, IAAS, Cloud computing platforms: Infrastructure as service: Amazon EC2, Platform as Service: Google App Engine, Microsoft Azure Utility Computing, Elastic Computing, SLA, clusters, cloud analytics, challenges of cloud environment, HPC and Hadoop in the cloud,

Cloud Technologies:  Virtualization, Virtual machine provisioning, virtualization applications in enterprises, Pitfalls of virtualization, Multitenant software: Multi-entity support, Multi-schema approach, Multi-tenance using cloud data stores, Data access control for enterprise applications.

Security in Cloud: Cloud security fundamentals, Vulnerability assessment tool for cloud, Privacy and Security in cloud, Cloud computing security architecture: Architectural Considerations- General Issues, Trusted Cloud computing, Secure Execution Environments and Communications, Micro-architectures; Identity Management and Access control-Identity management, Access control, Autonomic Security. Cloud computing security challenges: Virtualization security management- virtual threats, VM Security Recommendations, VM-Specific Security techniques, Secure Execution Environments and Communications in cloud.


Aptitude & General English

Effective Communication



About Instructor

  • admin

    Teaching is a very noble profession that shapes the character, caliber, and future of an individual. If people remember me as a good teacher, that will be the biggest honour for me.


0 Ratings
stars 5
stars 4
stars 3
stars 2
stars 1

There are no reviews yet.

Leave a Review

Be the first to review “Diploma in HPC System Administration”