EMC – HeatMap Analyzer – Basics – Part 3 – Charts

This is part 3 of a series where I will go into a little more detail on how you use HeatMap Analyzer. In this article I will look at the different sort of charts that can be drawn using the Analyzer component.

 

Line Chart

HMA_Basics_Charts_001

This is a standard Line chart. Line style charts (Line, Stacked and Line by Day) are all comprised of several sections – the main chart (upper) and the control section (lower). The control section allows you to zoom and pan the chart. If, for example, you are viewing several months worth of performance data (as above) then analysing specific points in time becomes quite difficult. Dragging the sliders in the control section closer together will allow you to view the data in greater detail.

 

HMA_Basics_Charts_002

Hovering over a data point will display a tool-tip with the details of that point.

 

Line by Day Chart

The Line by Day chart breaks down a standard Line chart and overlays each day. This can be useful in finding time related trends in data sets. For example, looking at a particular SP’s utilization over the past 3 weeks might look something like this:

 

HMA_Basics_Charts_003

Trying to work out time based trends in this data is visually difficult, however if we chart it using a Line by Day chart it allows you to analyze the data more intuitively. You could make the following conclusions:

1. Utilization over the weekend is generally lower.

2. There is consistently a peak in utilization at 5 am.

3.  The highest peak during business hours is at 9 am.

4. Utilization of the array begins to increase from around 6 pm.

 

HMA_Basics_Charts_004

 

Bear in mind that charting multiple objects in the one chart will potentially make analysis more difficult, and while tool tips are shown when you hover over a data point, the time will be correct however the date is all converted back to 1st January 2000.

 

Stacked Chart

 

HMA_Basics_Charts_005

A stacked chart aggregates the data points for the selected objects, and shows the total of the attribute charted for a particular time. Only certain types of Attributes can be charted in a Stacked chart (and Pie charts). For example it doesn’t make a lot of sense to draw a Stacked chart of SP utilization. However, where the unit of measurement is “aggregatable” like Total Bandwidth (measured in MB/s) or Total Throughput ( measured in IOPS ), it can be charted this way.

 

If you really want to be able to chart these other unit types using Stacked charts, you can modify the units aggregatable flag in the Configuration tab under the “Attribute Units”  section, removing the unit and re-adding it  with the aggregatable flag ticked. This should enable this chart type for those attributes.

 

Pie Chart

 

HMA_Basics_Charts_006

When selecting a Pie Chart, you are unable to select which objects are to be drawn (all objects are included in the chart calculations). Specifying Max Slices will limit the chart to display n-1 objects, the objects that are shown will be ordered largest to smallest, and all other objects will be combined under the “Other” slice.

 

Distribution Chart

 

HMA_Basics_Charts_007

A distribution chart allows you to determine the percentage of data sets that fall within particular bands.  The above chart shows that 46% of SP A’s response times are in the 0-3 ms band, whereas 45% of SP B’s response times are in the 6-9ms band.  When drawing a Distribution chart leaving the Minimum and Maximum blank will leave the script to determine the  ranges that will contain the entire data set. Using the Minimum and Maximum you can change the range that is charted, and changing the interval will change the number of bands displayed (by default 10).

 

HMA_Basics_Charts_008

Changing the Minimum to 0, Maximum to 20 would result in the above chart, breaking the distribution down into smaller bands

Table

HMA_Basics_Charts_009

This option displays a text table of the Attribute and Objects that you have selected, you can select all the data and paste it into you favorite analytic tool

 

Connectivity Graphic

HMA_Basics_Charts_010

Connectivity charts show the hierarchical connectivity of the array (currently this chart is only available for CLARiiON arrays)

Front End Ports → Storage Processor →LUN → MetaLUN Component → RAID Group / Pool  → Private RAID Group → Disk

Modifying the depth field will change the depth to which the chart is drawn. Modifying the Attribute field will change the Attribute that is represented on the chart (Connectivity is a special attribute that weights each link the same). Link size is relative to the average for the selected attribute for the particular object, where the object does not expose that attribute (i.e. Ports don’t expose Utilization) will show a thin link.

NOTE: Arrays with more complex configurations and larger number of disks don’t necessarily display very clearly in this format.

 

HMA_Basics_Charts_011

 

In Part 4 we will look at the Configuration tab and how to Automate NAR file collection.

EMC – HeatMap Analyzer – Basics – Part 2 – Filtering

This is part 2 of a series where I will go into a little more detail on how you use HeatMap Analyzer.

 

HMA_Basics_Filtering_001

 

Filtering

Being able to filter and order objects can be useful, especially when there are a lot of them and you are trying to find where a problem may be originating.

Expanding the Filter option will display the following window.

 

HMA_Basics_Filtering_002

 

This can be broken into four main sections. Each of these sections is effectively AND together when the filter is executed.

Filter by Attribute:  If this section is enabled it allows you to retrieve the top|bottom “n” objects ordered by avg|min|max attribute.

Object Type: For objects that expose a “Type” attribute (currently this is only LUNs and Disks). This option allows you to show only specific Object Types. For example if you only wanted to view Public RAID Group LUNs and Public Pool LUNs  then selecting these options will filter out all other types of LUNs. NOTE only object types are are present in the array will be available in the list, and the label that particular Object Types are given may vary depending on the version of Flare code.

Include: This option allows you to include objects based on whether the extended data for that object contain the text entered in this field, this may include Name, Owner, Type, Host (depending on what extended data is available for that object). This can be a comma separated list and is case insensitive. NOTE leading or trailing spaces will be included in the search.

Exclude: This option allows you to exclude objects based on whether the extended data for that object contains the text entered in this field, this may include Name, Owner, Type, Host (depending on what extended data is available for that object). This can be a comma separated list and is case insensitive. NOTE leading or trailing spaces will be included in the search.

 

So the following filter selections would display the Top 4 by average Queue Length Public Pool LUNs where the extended information includes SP A and excludes Hypervisor.

 

HMA_Basics_Filtering_003

 

Once you have filtered the object list you are able to chart the objects against any of the attributes that are exposed by that object type.

 

In Part 3 we will look at the different chart types and what they may be used for.

EMC – HeatMap Analyzer – Basics – Part 1 – Processing a NAR file

Mat has agreed to do some posts on the basics of using the EMC HMA. In this episode, he’s looking at manually loading NAR files and other cool stuff. Enjoy.

 

This is part 1 of a series where I will go into a little more detail on how you use HeatMap Analyzer. In this article we go through the process of manually loading a NAR file, basic charting and HeatMaps.

At this point you should have the HeatMap Analyzer appliance configured and you may be wondering what you can begin to do with it. I wrote the original HeatMap tool so I could visualize what was occurring over an entire array or parts thereof at any one point in time. This evolution of that script is HMA. It  now allows you to look closer at individual components of the array and while this is something that can be done with the current EMC Analyzer tool set, it is limited by the amount of data that you can load at any one time.

 

Manually Load NAR File

Point your browser at the appliance http://1.1.1.1/HMA.html and go to the Configuration tab.

 

HMA_Basics_Processing_001

 

Expand the “Manual Load NAR Files” section and select the “Choose Files” button. Here you can select one or more NAR files from a single or multiple arrays.  Depending on the number of “objects” in the NAR file (disks, LUNs, Pools, Ports, etc), processing of each NAR file may take a reasonable amount of time. As a rough guide for every 100 objects in the NAR file it will take about 30 seconds to process (assuming the standard 300 data points for each object).  Some of my CX4-960 arrays that have 3000 odd objects take 10-15 minutes to process each file. NOTE: if you are processing large NAR files or lots of smaller NAR files then the browser may currently timeout waiting for the processing to finish. If this is the case you can monitor the process through the Server Status tab or from the command line of the appliance (see below).

 

HMA_Basics_Processing_002

Note: you should let all processing complete before you begin to draw HeatMaps or charts under the Analyzer tab, if you don’t you may get some unexpected results due to the non-granular locking that SQLite uses while updating the database. 

Monitoring the NAR file load process

If you want to monitor the processes that are working on your NAR file(s) there are some tools available on the Server Status tab.

For basic process monitoring expand the “Process Monitoring” section, and tick the “Auto Update” option. If you are currently processing a NAR file you will see the ProcessNAR.pl script running.

 

HMA_Basics_Processing_003

 

For more detailed monitoring you can view the logs created by each script. On the Server Status tab expand the “Logs Viewer” section, select the log file that you want to monitor and select “Auto Update” and “Auto Scroll”. You may want to combine this with the basic process monitoring to get a better picture of what is occurring.

 

HMA_Basics_Processing_004

Note: it’s worthwhile turning off Auto Update after you have completed using these tools as leaving it running will place unnecessary load on the appliance

HeatMaps

To draw HeatMaps for a particular array go to the HeatMaps tab.

 

HMA_Basics_Processing_005

 

Array: If you have multiple arrays you can select which array that you want to draw a HeatMap for.

Array Type: This gives a description of the array type (currently only CLARiiON and Data Domain types are supported).

Start/End: This is the Start and End date of the HeatMap. NOTE: Selecting a long duration for the HeatMap may take a while to process, and it is not recommended that you draw a HeatMap that spans more than several days without increasing the Average Interval.

Avg Interval: The number of seconds between each chart point.

Object List: This is a list of objects that you can select attributes from to chart and may include.

All LUNs / Storage Processors / Disks / RAID Group / Port / Thin Pool / CPU / Asynchronous Mirror / Snap Session.

The object that are shown will depend on what you have configured on your array (if you don’t have any Asynchronous Mirrors defined then you wont see this option).

Selecting one of these drop down lists will show a list of attributes that can be charted for this object.

 

HMA_Basics_Processing_006

 

Select an attribute and it will be added to the list of object/attributes that will be drawn in the HeatMap. Select which object / attribute combinations that you want to display in the HeatMap and then press the “Draw HeatMap” button. The HeatMap configuration pane can be hidden by clicking on the top arrow in the top left corner.

 

HMA_Basics_Processing_007

 

The way the HeatMap works is by defining a minimum, a maximum and a skew for each object and attribute. These can be modified in the Configuration Tab under the HeatMap Attribute Normalization.

Analyzing the Data

For more detailed analysis of individual objects in the array go to the Analyzer tab. On this tab you will initially see three panes. The first left hand pane allows you to configure and draw charts for various objects, the second upper right pane gives global details for the array and the third lower right pane gives you details for the selected object.

 

HMA_Basics_Processing_008

 

Array: If you have multiple arrays you can select which array that you want to draw a HeatMap for.

Array Type: This gives a description of the array type (currently only CLARiiON and Data Domain types are supported).

Object Type: The type of object to chart (ie Storage Processor, LUN, Disk, etc)

Start/End: This is the Start and End date of the chart. .

Filter: Filter options that allow you to include / exclude objects based on set criteria (I’ll go into more detail about this later).

Object: The object(s) that you want to chart.

Attribute: The metric that you want to chart for the object, each object type exposes a different set of attributes.

Chart dependent options:

Line Charts, Stacked Charts and Tables

Avg Interval: The number of seconds to average the data points out to, leaving as zero or blank will process the data as is.

Pie Charts

Max Slices: The Maximum number of slices to draw in a Pie chart. If there are more than “n-1” objects  then extra objects will be grouped under “Other”.

Distribution Charts

Minimum: The minimum number to chart.

Maximum: The maximum number to chart.

Intervals: The number of intervals to chart.

Chart Type:

Line: A standard line chart, multiple object can be selected and charted against one another.

Line by Day: A standard line chart where each day is stacked against each other, while you can select multiple object it’s not recommended as it makes the chart difficult to interpret.

Stacked: A stacked line chart, this option is available where the attribute being graphed is able to be aggregated (ie MB, KB or GB rather than % Utilization), again multiple objects can be selected.

Pie: A pie chart, this will draw a pie chart for all of the objects for the selected “Object Type”.

Distribution: A bar chart showing the % of values that fall between certain ranges.

Table: A text table of the data.

Connectivity: A Connectivity / SanKey chart. This show how each of the objects in the array are connected. It is only avaliable for CLARiiON devices.

Once you have selected the Object Type, Object(s) and Attribute that you want to chart, click the “Graph” button and chart will be drawn for that selection.

HMA_Basics_Processing_009

 

Depending on the type of chart drawn you may have various controls that allow you to explore the data more closely. Line style charts have the slider control at the bottom which allow you to focus on a subset of the data. Tables allow you to select all the data so it can be copied. Connectivity charts allow you to control the depth of the chart. And the attribute that controls the weight of each of the connections, all chart types except Tables allow you to save the chart as a JPEG.

 

In the next article I will go into more detail regarding Filtering and the different Chart Types

 

EMC – HeatMap Analyzer v0.1 Now Available

It’s alive. Mat has been coding like crazy and enhancing the HeatMap script and turning it into like, an appliance kind of thing. You can grab it from the Utilities page and it comes in two parts – the core code and third-party scripts package. While the combined package size is small, it saves redistributing stuff that hasn’t changed. In any case, download it, give it a spin and let us know your thoughts. Obviously, it’s still a bit ugly, and still a bit version 0.1, but that’s what you get for free. Tell your friends.

EMC – VNX / CX4 LUN Allocation Owner and Default Owner

Mat’s been doing some useful scripting again. This time it’s a small PERL script that identifies the allocation owner and default owner of a pool LUN on a CX4 or VNX and lets you know whether the LUN is “non-optimal” or not. For those of you playing along at home, I found the following information on this (but can’t remember where I found it). “Allocation owner of a pool LUN is the SP that owns and maintains the metadata for that LUN. It is not advised to trespass the LUNs to an SP that is not the allocation owner. This introduces lag. The SP that provides the best performance for the pool LUN. The allocation owner SP is set by the system to match the default SP owner when you create the LUN. You cannot change the allocation owner after the LUN is created. If you change the default owner for the LUN, the software will display a warning that a performance penalty will occur if you continue.”

There’s a useful article by Jithin Nadukandathil on the ECN site, as well as a most excellent writeup by fellow EMC Elect member Jon Klaus here. In short, if you identify NonOptimal LUN ownership, your best option is to create a new LUN and migrate the data to that LUN via the LUN Migration tool. You can download a copy of the script here. Feel free to look at the other scripts that are on offer as well. Here’s what the output looks like.

 output1

 

 

EMC – Some new scripts

Mat has come up with a few new scripts – FlipFASTTiering and ReplicationCapacity. They’re PERL scripts that you can use to list / modify FAST Tiering scheduling and report on MirrorView replication data respectively. Hopefully you’ll find them of some use. Further information can be found on the Utilities page.

Get cracking with Stormons 1.3.2 GA

I covered the release of Stormons 1.3.2 GA here and mentioned a few of the highlights of the new version. In between doing some vCD stuff and the general faffing about that seems to happen at this time of year, I’ve had a chance to install it on my lab PC and monitor our two CX4-120s. I’m not going to regurgitate the installation manual (which can be downloaded from here). I thought it might be more useful if I covered off the bits that weren’t obvious to me when I first installed the product. Note that if you’re used to deploying web apps and familiar with Apache configuration files, you won’t have as much of an issue with getting this working as I did. As it happens, I’m more idiot than savant when it comes to these things so it took me a little longer than it should have to get going.

Windows Server 2003 is listed as the supported OS. I used 32-bit Windows 7, as that was the OS on my laptop at the time. I used Apache 2.2 as the web server. There’s a bunch of supported models listed in the installation guide, but I’ve only used this product with EMC CLARiiON CX4 arrays. I can’t speak for its usefulness with NetApp FAS, EMC Symmetrix, EMC Celerra or Brocade FC devices.

I copied the extracted (and renamed) installation files to C:\tools\ and set the System Variable accordingly. Here’re the changes I made to my httpd.conf file to get things working. If I’ve done things really badly, feel free to chime in.

The first bit is to add a Directory for the location of the SM1.3 installation.

# STORMONS 
<Directory "C:/tools/SM1.3/http">
SetEnv STORMONS_HOME "C:/tools/SM1.3"
AllowOverride None
Options Indexes FollowSymLinks ExecCGI
Order allow,deny
Allow from all
</Directory>

I then added in Aliases for the /SM and other directories.

# STORMONS
Alias /SM "C:/tools/SM1.3/http"
Alias /capa_spk "C:/tools/SM1.3/repository/capacity/spk"
Alias /perf_spk "C:/tools/SM1.3/repository/performance/spk"

I also uncommented the .cgi handler under

AddHandler cgi-script .cgi

And that was it from the Apache side of things. After that I ran through the Stormons configuration as per the installation guide, setting paths to various binaries, etc. Note that if you’ve just copied RRDtool from the internet it’s worth running it first. I was missing the msvcr100.dll and it was crapping out without my knowledge (re-installing the Visual Studio 2008 redistributable fixed that). When you’re installing the Stormons polling service, make sure that you’re running a command prompt with sufficient privileges to install services, or you won’t get any useful results. I’m still having problems with RRDtool drawing pictures, but I’m working on that.

I’m hoping to do a more thorough run-through on some of the stuff you can do with Stormons in the next week or two.

Stormons 1.3.2 GA Released

Didier from Stormons recently got in contact (I knew a degree in French would eventually come in handy for something) to let me know that Stormons 1.3.2 GA has now been released. I’ve been meaning to give this tool a run in the lab for a while now, and I’m hopeful that I can do something in the next few weeks when work gets a bit quieter. You can download pre-compiled Windows (32 and 64-bit) binaries or Linux source code here. I won’t go into all of the features now, but here’re a few highlights:

In terms of the code, it is comprised of 43000 lines of Perl 5.14, 70 HTML reports, a multi-threaded engine and a scheduler.

Some reports offer the ability to compare configurations between two dates (helpful for trending and capacity planning).

Detailed alerts can be created.

“Storage Classes” can be created and used as a basis for Chargeback reports.

There’s a bunch of other features, including support for Brocade fabric traffic monitoring and some Nagios integration. And, as mentioned previously, there’s 70 HTML reports covering RAID Groups, CX Storage Pools, MirrorView, NetApp Volumes, Celerra LUNs, and so on. The program has been written for storage admins by a storage admin. I’m looking forward to testing it out on our CX4s, and Mat might even have a little less work to do as a result.