Dell Technologies World 2018 – storage.38 – Dell EMC Unity: Performance Best Practices Notes

Disclaimer: I recently attended Dell Technologies World 2018.  My flights, accommodation and conference pass were paid for by Dell Technologies via the Press, Analysts and Influencers program. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

Here are my rough notes from the storage.38 session. This was presented by Stephen Wright and covered Dell EMC Unity: Performance Best Practices. Firstly, though, you should read Dell EMC’s Unity Best Practices Guide.

Dell EMC Midrange Family

SC Series and Dell EMC Unity

Common Tools

  • PowerPath
  • ViPR
  • VPLEX
  • RecoverPoint
  • Avamar and NetWorker
  • Data Domain

Industry’s #1 Midrange Portfolio

SC Series

  • Intelligent Efficiency
    • Post-process data reduction
  • Federated
    • Data mobility across multiple systems
  • Best economics
    • Low entry price
    • Lowest $/GB

Dell EMC Unity

  • Inline efficiency
    • Inline data reduction
  • Unified
    • Unified file and block data
  • Integrated hybrid cloud
    • Unified cloud tiering

Agenda

Background

  • What is Performance? What are “Best Practices”?
  • Evolution of storage best practices

Hardware

  • Unity All-Flash considerations
  • Unity Hybrid considerations

Features

  • Data reduction, snapshots, replication

 

Background

What is Performance?

“The ability to do the requested work in the required period of time”

  • IOPs (small transactions), MB/s (bulk data)
  • Latency and Response Time (Individual transactions)
  • Window, Job (Batch transactions)

What are Best Practices?

Configuration Guidance

  • Recommendations for options
  • Advice based o experience
  • Responsive to your application
  • Best behaviour for your needs

Evolution of Storage Best Practices

Unity Simplicity

  • Removes the need for detailed tweaking
  • Let the system do the right thing for you

Unified File and Block – same recommendations apply

  • One set of common guidance

Flash – changes the game for storage performance

  • Stress on other components

Quantum Leap of Flash

Recommended maximum IOPS per drive – don’t use these for sizing – these numbers are speed limits and are generally based on small-block random workloads.

  • NL-SAS – 150 IOPS
  • SAS 10K RPM – 250 IOPS
  • SAS 15K RPM – 350 IOPS
  • Flash – 20000 IOPS

The Flash Effect, and CPU utilisation

  • Flash is fast, and Dell EMC Unity can support hundreds of drives
  • Driving a lot of Flash can take a lot of CPU power
  • Provide best practices around CPU utilisation
Average CPU Utilisation Below 50% 50% to 70% 70% to 90% Above 90%
Latency Yes Yes Yes Caution
High Availability Yes Yes Caution No

Approaching Best Practices: AFA or Hybrid?

Hardware Considerations

  • All-Flash
    • Drives are most likely not a bottleneck
    • Focus on maximising other hardware resources
  • Hybrid
    • HDD performance can be determining factor
    • A little Flash can add a lot of capability

Features

  • Data reduction
  • Snapshots
  • Replication
  • Both block and file

CPU Power and Flash Considerations

With All-Flash, CPU becomes the driving factor

  • CPU power has largest impact on achievable performance
  • Memory has largest impact on scalability

As of 4.3 online data-in-place conversions now available

Balanced Access – Back-end SAS

At least use the two onboard

  • Maybe you also want the SAS expansion? (Up to 6 buses)

Largest impact is on bandwidth. Dell EMC advertise 5GB/s of bandwidth through the SAS bus.

Flash drives per bus recommendation? Take how ever many you have, and spread them across the buses you have

Balanced access – FC Ports

  • For HA, zone 1 initiator to 1 port from SPA, 1 port from SPB
  • For HA + load balancing, zone 2 ports per SP
  • Cable and use as many front-end ports as possible
  • We recommend at least 4 ports per SP in U3x0 and U4x0
  • At least 6 ports per SP in U5x0 and U6x0

Balanced Access – Unity File

  • Balance resource utilisation with file
    • Means multiple NAS Servers
    • Using multiple Ethernet ports (can leverage LACP)
    • Failsafe Networking (FSN)

Front-end port considerations

  • Speed is good – use faster ports when available
  • Understand port limits – consult best practices guide
  • Use more ports – better distribution across cores

Hybrid Considerations

  1. All previous considerations
  2. Size for HDD constraints
  3. Leverage Flash Tier, FAST VP
  4. Configure FAST Cache

FAST VP is at the pool level, FAST Cache is a global resource

 

Feature Considerations

Features Overview

In Dell EMC Unity, all system resources are always available

  • Architectural philosophy
  • CPUs are note reserved for any particular process

Features requires resources

  • Use additional CPU and may add drive IOPS
  • CPU cycles can shift as defined workload changes

e.g. RAID 6 may take a little more CPU than RAID 1/0. Same goes for Snapshots, data reduction and replication.

Decision Tree for Enabling Features

  • Understand that enabling a feature may increase CPU utilisation
  • This chart represents average CPU before implementing feature
Average CPU Utilisation Below 50% 50% to 70% 70% to 90% Above 90%
Snapshots / Replication Yes Yes Caution No
Data Reduction Yes Caution Caution No

 

Data Reduction

Prior to 4.3, offered compression

As of OE v4.3, deduplication has been added, and together these provide data reduction

Data reduction

  • Block and file objects
  • All-Flash pools
  • Enabled together
  • Automatically licensed

How does it work?

  • Data acknowledged in write cache
  • Check for patterns
  • Compress data if needed

Improved Performance

  • Reduced overhead when pattern is found
  • Code optimisations

Considerations

  • Latency impacts
  • CPU resource consumption
  • Refer to decision tree

Snapshots

  • Use less aggressive snapshot schedules (number of objects increases – decrease the snapshot schedule frequency)
  • Stagger snapshot schedules

Asynchronous replication

  • Leverages snapshots
    • Similar considerations
    • RPO = snapshot schedule
    • Longer RPOs with lots of replicated objects
  • Consider port capabilities
    • Multiple links per SP
    • Higher speed ports

Synchronous Replication

  • Real-time replication over FC link
  • Latency is key
  • Zone so that clients are not on the replication link

 

Summary

Appropriate Model

  • Choose the right model, based on CPU power
  • Online Data-in-place conversions to move to more powerful model
  • Consider differences between All-Flash and Hybrid

Enough Hardware

  • Have enough drives / enclosures / ports?
  • Utilise all ports

Flash Acceleration

  • Start with Flash
  • All-Flash
  • Flash tier (Hybrid)

Plan for features

  • Consider CPU consumption
  • Snapshot schedules
  • Replication RPOs

 

Useful session. 4 stars.