Disclaimer: I recently attended Dell Technologies World 2018. My flights, accommodation and conference pass were paid for by Dell Technologies via the Press, Analysts and Influencers program. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event. Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.
Here are my rough notes from the storage.38 session. This was presented by Stephen Wright and covered Dell EMC Unity: Performance Best Practices. Firstly, though, you should read Dell EMC’s Unity Best Practices Guide.
Dell EMC Midrange Family
SC Series and Dell EMC Unity
Common Tools
- PowerPath
- ViPR
- VPLEX
- RecoverPoint
- Avamar and NetWorker
- Data Domain
Industry’s #1 Midrange Portfolio
- Intelligent Efficiency
- Post-process data reduction
- Federated
- Data mobility across multiple systems
- Best economics
- Low entry price
- Lowest $/GB
- Inline efficiency
- Inline data reduction
- Unified
- Unified file and block data
- Integrated hybrid cloud
- Unified cloud tiering
Agenda
Background
- What is Performance? What are “Best Practices”?
- Evolution of storage best practices
Hardware
- Unity All-Flash considerations
- Unity Hybrid considerations
Features
- Data reduction, snapshots, replication
Background
What is Performance?
“The ability to do the requested work in the required period of time”
- IOPs (small transactions), MB/s (bulk data)
- Latency and Response Time (Individual transactions)
- Window, Job (Batch transactions)
What are Best Practices?
Configuration Guidance
- Recommendations for options
- Advice based o experience
- Responsive to your application
- Best behaviour for your needs
Evolution of Storage Best Practices
Unity Simplicity
- Removes the need for detailed tweaking
- Let the system do the right thing for you
Unified File and Block – same recommendations apply
- One set of common guidance
Flash – changes the game for storage performance
- Stress on other components
Quantum Leap of Flash
Recommended maximum IOPS per drive – don’t use these for sizing – these numbers are speed limits and are generally based on small-block random workloads.
- NL-SAS – 150 IOPS
- SAS 10K RPM – 250 IOPS
- SAS 15K RPM – 350 IOPS
- Flash – 20000 IOPS
The Flash Effect, and CPU utilisation
- Flash is fast, and Dell EMC Unity can support hundreds of drives
- Driving a lot of Flash can take a lot of CPU power
- Provide best practices around CPU utilisation
Average CPU Utilisation | Below 50% | 50% to 70% | 70% to 90% | Above 90% |
Latency | Yes | Yes | Yes | Caution |
High Availability | Yes | Yes | Caution | No |
Approaching Best Practices: AFA or Hybrid?
Hardware Considerations
- All-Flash
- Drives are most likely not a bottleneck
- Focus on maximising other hardware resources
- Hybrid
- HDD performance can be determining factor
- A little Flash can add a lot of capability
Features
- Data reduction
- Snapshots
- Replication
- Both block and file
CPU Power and Flash Considerations
With All-Flash, CPU becomes the driving factor
- CPU power has largest impact on achievable performance
- Memory has largest impact on scalability
As of 4.3 online data-in-place conversions now available
Balanced Access – Back-end SAS
At least use the two onboard
- Maybe you also want the SAS expansion? (Up to 6 buses)
Largest impact is on bandwidth. Dell EMC advertise 5GB/s of bandwidth through the SAS bus.
Flash drives per bus recommendation? Take how ever many you have, and spread them across the buses you have
Balanced access – FC Ports
- For HA, zone 1 initiator to 1 port from SPA, 1 port from SPB
- For HA + load balancing, zone 2 ports per SP
- Cable and use as many front-end ports as possible
- We recommend at least 4 ports per SP in U3x0 and U4x0
- At least 6 ports per SP in U5x0 and U6x0
Balanced Access – Unity File
- Balance resource utilisation with file
- Means multiple NAS Servers
- Using multiple Ethernet ports (can leverage LACP)
- Failsafe Networking (FSN)
Front-end port considerations
- Speed is good – use faster ports when available
- Understand port limits – consult best practices guide
- Use more ports – better distribution across cores
Hybrid Considerations
- All previous considerations
- Size for HDD constraints
- Leverage Flash Tier, FAST VP
- Configure FAST Cache
FAST VP is at the pool level, FAST Cache is a global resource
Feature Considerations
Features Overview
In Dell EMC Unity, all system resources are always available
- Architectural philosophy
- CPUs are note reserved for any particular process
Features requires resources
- Use additional CPU and may add drive IOPS
- CPU cycles can shift as defined workload changes
e.g. RAID 6 may take a little more CPU than RAID 1/0. Same goes for Snapshots, data reduction and replication.
Decision Tree for Enabling Features
- Understand that enabling a feature may increase CPU utilisation
- This chart represents average CPU before implementing feature
Average CPU Utilisation | Below 50% | 50% to 70% | 70% to 90% | Above 90% |
Snapshots / Replication | Yes | Yes | Caution | No |
Data Reduction | Yes | Caution | Caution | No |
Data Reduction
Prior to 4.3, offered compression
As of OE v4.3, deduplication has been added, and together these provide data reduction
Data reduction
- Block and file objects
- All-Flash pools
- Enabled together
- Automatically licensed
How does it work?
- Data acknowledged in write cache
- Check for patterns
- Compress data if needed
Improved Performance
- Reduced overhead when pattern is found
- Code optimisations
Considerations
- Latency impacts
- CPU resource consumption
- Refer to decision tree
Snapshots
- Use less aggressive snapshot schedules (number of objects increases – decrease the snapshot schedule frequency)
- Stagger snapshot schedules
Asynchronous replication
- Leverages snapshots
- Similar considerations
- RPO = snapshot schedule
- Longer RPOs with lots of replicated objects
- Consider port capabilities
- Multiple links per SP
- Higher speed ports
Synchronous Replication
- Real-time replication over FC link
- Latency is key
- Zone so that clients are not on the replication link
Summary
Appropriate Model
- Choose the right model, based on CPU power
- Online Data-in-place conversions to move to more powerful model
- Consider differences between All-Flash and Hybrid
Enough Hardware
- Have enough drives / enclosures / ports?
- Utilise all ports
Flash Acceleration
- Start with Flash
- All-Flash
- Flash tier (Hybrid)
Plan for features
- Consider CPU consumption
- Snapshot schedules
- Replication RPOs
Useful session. 4 stars.