Emerging Technologies for HPC Storage - Dr. Wolfgang Mertz CTO EMEA Unstructured Data Solutions June 2018 - Dell EMC HPC Community
The very definition of HPC is expanding
Blazing Fast Speed. Accessibility and flexibility.
Traditional High Performance Computing: computationally-intensive modeling and simulation
• Computer-aided engineering
• Weather forecasting
• Oil exploration
High Performance Data Analytics: complex or time-critical ‘big data’ analytics workloads
• Genomics
• Financial analytics
• Business intelligence
Artificial Intelligence: machine and deep learning applications
• Fraud / anomaly detection
• Predictive maintenance
• Personalized medicine
© Copyright 2018 Dell EMC
Value of data over time…
[Figure: Value of Data ($) plotted against Time (µs, ms, s, hour, day, month, year, yr+), with regions labeled “Big Data” and “Fast Data”.]
HPC Storage Challenges
• Traditional ‘/scratch’ (critical challenge: Performance)
– Performance: bare metal, parallel access
– High Availability, Backup, Data Protection: uptime important, but no special backup requirements
– Data Sharing and Accessibility: not the critical feature
– Management & Integration: important, but not the critical feature
• Traditional ‘/home’ (critical challenge: Persistence)
– Performance: scalable performance, tunable for workload
– High Availability, Backup, Data Protection: able to fulfill compliance requirements, protect important data
– Data Sharing and Accessibility: pre and post processing, analytics, desktop access
– Management & Integration: management functionality and support; connections with other tools
[Figure: storage tiers Scratch, Project and Archive arranged along Performance and Capacity axes. Labeled components: NVMe-oF; HPC Fast Storage (Lustre, BeeGFS, GPFS); Isilon HPC NFS Storage; Elastic Cloud Storage (ECS); Virtustream; Dell EMC & 3rd Party Job Scheduler.]
NVMe Usage Scenarios: Local Dedicated Devices
Each host has one or more dedicated NVMe devices.
[Figure: four hosts, each labeled with the following.]
• Up to 24 U.2 NVMe (R740XD)
• Targets: Up to 4 Intel AIC NVMe P3700 2TB
NVMe Usage Scenarios: Sharing NVMe over Fabrics
Even hosts with no space or support for NVMe can use NVMeF devices.
Each host can mount one or more NVMe Targets.
[Figure: NVMeF Host Clients connected over a 100 Gb/s fabric to the targets.]
• Targets: Up to 24 U.2 NVMe (R740XD)
• Targets: Up to 4 Intel AIC NVMe P3700 2TB
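The slides do not show how a host attaches a remote target. As a minimal sketch, assuming the standard nvme-cli tool and an RDMA transport, the snippet below builds one `nvme connect` command per target. The NQN names are hypothetical, and the address and port are placeholders rather than settings taken from the deck.

```python
import shlex

def nvme_connect_cmd(nqn, traddr, trsvcid="4420", transport="rdma"):
    """Build an nvme-cli 'connect' command line for one NVMe-oF target."""
    return [
        "nvme", "connect",
        f"--transport={transport}",  # assumption: RDMA-capable fabric
        f"--traddr={traddr}",        # placeholder target address
        f"--trsvcid={trsvcid}",      # 4420 is the registered NVMe-oF port
        f"--nqn={nqn}",              # hypothetical subsystem name
    ]

# Hypothetical subsystem names, one per exported NVMe device.
TARGET_NQNS = [
    "nqn.2018-06.example:p3700-1",
    "nqn.2018-06.example:p3700-2",
]

if __name__ == "__main__":
    # Print the commands instead of running them, so nothing is attached by accident.
    for nqn in TARGET_NQNS:
        print(shlex.join(nvme_connect_cmd(nqn, "172.20.1.1")))
```

A host with no local NVMe slots would run one such command per target it wants to use; each connected namespace then appears as a local block device.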
NVMe-oF Test System 1
• R730 NVMeF Targets Server: E5-2690 v4 @ 2.60 GHz, 256 GiB DDR4 2133 MHz
– Targets: 4 Intel AIC NVMe P3700 2TB
• R730 NVMeF Host Client 1 and Host Client 2: 2x Xeon E5-2690 v3 @ 2.60 GHz, 256 GiB DDR4 2133 MHz
• Interconnect: Mellanox EDR ConnectX-5, 100 Gb/s
• Each host can mount one or more NVMe Targets.
• RHEL 7.4 x86_64 (GA level), kernel version 3.10.0-693, Native Drivers
NVMe-oF Test System 2
• R730 NVMeF Targets Server: E5-2690 v4 @ 2.60 GHz, 256 GiB DDR4 2133 MHz
– Targets: 4 Intel AIC NVMe P3700 2TB
• R730 NVMeF Host Client 1 and Host Client 2: 2x Xeon E5-2690 v3 @ 2.60 GHz, 256 GiB DDR4 2133 MHz
• Interconnect: OmniPath, 100 Gb/s
• Each host can mount one or more NVMe Targets.
• RHEL 7.4 x86_64 (GA level), kernel version 3.10.0-693, Native Drivers
Configuration Details of NVMe-oF
• 3 Dell PowerEdge R730, one used as (target) server connected to two clients (hosts).
– Clients: Dual Intel Xeon E5-2690 v3 @ 2.60 GHz.
– Server: Dual Intel Xeon E5-2690 v4 @ 2.60 GHz.
– 256 GiB of DDR4 @ 2133 MHz.
• Omni-Path adapters installed in slot 4 (PCIe x16), connected to a switch.
• RHEL 7.4 x86_64 (GA level), kernel version 3.10.0-693.
• 4 Intel P3700 2TB AIC adapters in server slots 1, 2, 3 & 5 (all PCIe x8).
• FIO 2.99 compiled on each machine with libaio support.
• Direct I/O was used, with no buffered I/O, to keep the RAM cache from affecting results.
• Ramp-up time was one hour, and test time was limited to 300 seconds for each data point.
• Each write test with a different block size was followed by a consistent read test.
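The test procedure above can be sketched as a block-size sweep in which every write run is followed by a read run. This is an illustration under stated assumptions, not the authors' actual job files: the device path, block sizes, queue depth and the sequential access pattern are all hypothetical, while libaio, direct I/O, the one-hour ramp and the 300-second runtime come from the slide.

```python
def fio_cmd(rw, block_size, device, ramp_s=3600, runtime_s=300, iodepth=32):
    """Build one fio command line for a single data point."""
    return [
        "fio",
        f"--name={rw}-{block_size}",
        f"--filename={device}",     # hypothetical device path
        f"--rw={rw}",               # assumption: sequential write/read
        f"--bs={block_size}",
        "--ioengine=libaio",        # from the slide: libaio support
        "--direct=1",               # from the slide: no buffered I/O
        f"--iodepth={iodepth}",     # assumption: queue depth not given
        f"--ramp_time={ramp_s}",    # from the slide: one hour ramp-up
        f"--runtime={runtime_s}",   # from the slide: 300 s per data point
        "--time_based",
    ]

def sweep(block_sizes, device):
    """Return the run plan: for each block size, a write test then a read test."""
    plan = []
    for bs in block_sizes:
        plan.append(fio_cmd("write", bs, device))
        plan.append(fio_cmd("read", bs, device))
    return plan

if __name__ == "__main__":
    # Hypothetical block sizes and device name, for illustration only.
    for cmd in sweep(["4k", "64k", "1m"], "/dev/nvme0n1"):
        print(" ".join(cmd))
```

Running the write test first means each read test reads back data written at the same block size.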
Bandwidth Baseline
Command: ib_write_bw -F -R -a 172.20.1.1
[Figure: BW Average in MB/sec (axis from 0 to 12000) versus Package Size (2 to 8M, doubling at each step), comparing Infiniband EDR (100 Gb/s) with OmniPath (100 Gb/s).]
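To put the chart's 12000 MB/sec axis limit in context, the short calculation below converts the nominal 100 Gb/s link rate into bytes per second and lists the message sizes on the x-axis. It is a back-of-the-envelope sketch that ignores encoding and protocol overhead. Whether the tool's "MB" means 10^6 or 2^20 bytes is not stated on the slide, so both conversions are shown.

```python
LINK_GBPS = 100  # nominal rate of both fabrics in the chart

def line_rate_bytes_per_s(gbps):
    """Nominal link rate in bytes per second (8 bits per byte, no overhead)."""
    return gbps * 1e9 / 8

def message_sizes(max_exp=23):
    """Powers of two from 2 to 2**max_exp; 2**23 is the 8M point on the axis."""
    return [2 ** k for k in range(1, max_exp + 1)]

if __name__ == "__main__":
    rate = line_rate_bytes_per_s(LINK_GBPS)
    print(f"decimal MB/s: {rate / 1e6:.0f}")
    print(f"binary MiB/s: {rate / 2**20:.0f}")
    print(f"data points:  {len(message_sizes())}")
```

Either way, the nominal ceiling sits close to the top of the plotted range, so a curve that flattens near the top of the chart is near line rate.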
Going to Next Level with Hardware – ME4
• RAID Array: 2u12, 2u24, 5u84
• Expansion: 2u12, 2u24, 5u84 (DAE)
• Backend Interface: 12G SAS
• FE Interface:
– 16G FC, 4 ports per controller
– 10G iSCSI, 4 ports per controller (SFP+ or BaseT)
– 12G SAS, 4 ports per controller
• Read IOPS: 320K – 4x over MD3
• Seq. Reads: 7000 MB/s
• Seq. Writes: 5500 MB/s – 2.6x over MD3
• Total system drive count: 336 – 1.75x over MD3
• Raw Capacity: 4PB
• Single or Dual Controller
Models:
• ME4012: 12-drive RBOD
• ME4024: 24-drive RBOD
• ME4084: 84-drive RBOD
• ME412: 12-drive Expansion
• ME424: 24-drive Expansion
• ME484: 84-drive Expansion
Note: ME Expansion Units (DAE) cannot be connected to a server directly (not a server-attached JBOD)
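The "x over MD3" multipliers can be turned around to see what MD3 baseline they imply, and the raw capacity can be divided by the drive count to see the average drive size it assumes. The sketch below does only that arithmetic. The resulting MD3 and per-drive numbers are derived from the slide's figures, not stated on it, and assume decimal units (1 PB = 1000 TB).

```python
# Figures as stated on the slide.
ME4 = {"read_iops": 320_000, "seq_write_mb_s": 5500, "drives": 336}
OVER_MD3 = {"read_iops": 4.0, "seq_write_mb_s": 2.6, "drives": 1.75}
RAW_CAPACITY_PB = 4

def implied_md3(me4, multipliers):
    """Divide each ME4 figure by its stated multiplier over MD3."""
    return {key: me4[key] / multipliers[key] for key in multipliers}

def avg_drive_tb(raw_pb, drives):
    """Average raw capacity per drive in TB, assuming 1 PB = 1000 TB."""
    return raw_pb * 1000 / drives

if __name__ == "__main__":
    for key, value in implied_md3(ME4, OVER_MD3).items():
        print(f"implied MD3 {key}: {value:.0f}")
    print(f"average drive size: {avg_drive_tb(RAW_CAPACITY_PB, ME4['drives']):.1f} TB")
```

The per-drive result of roughly 12 TB suggests the 4PB figure assumes a fully populated system with large-capacity drives.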
Parallel File System – Lustre w/ ME4
[Figure: Lustre configuration built from the following arrays.]
• Dell PowerVault ME4024
• Dell PowerVault ME4024 (optional for DNE)
• Dell PowerVault ME4084
Dell EMC Isilon Scale-out NAS
High Performance to Archive
• Few TBs to 100 PB in a single file system
– Up to 1.5 TB/sec Aggregate Read
• Policy-based Automatic Tiering
– Flash, SAS, SATA
• Native Multi Protocol Access
– NFS, CIFS, HDFS, Swift
• Enterprise Features for Data Management, Long Term Archive and Compliance
[Figure: files tiered toward reduced cost/TB: F-Series (Tier 1), H-Series (Tier 2), A-Series (Tier 3), CloudPools.]
ECS Scale-Out Object Store
• Modern archive: Universal archive for existing primary storage. Replaces tape. No changes to applications or operations. Archive always online for analytics workflows.
• Cloud native: Enable new healthcare business operations. Cloud economics and ease of use on-premise. Lower TCO compared to public cloud providers.
• Accelerate cloud native applications: Future healthcare IoT applications on private infrastructure.
• Scalability: Deployable in clusters for petabyte and exabyte scalability.
• Data protection: Provides geo-distributed data protection with no single point of failure. Globally accessible. One namespace. Multi-tenant architecture.
• Operational flexibility: Multi-protocol support for legacy & modern applications.
Emerging Technologies for Persistent Storage
• Higher Scale
– 100s PB for File
– Exabyte for Object
• High Performance Object
• Removing Protocol Overheads
• Gen 7