Primary Data – Seeing the Future

It’s that time of year when public relations companies send out a heap of “What’s going to happen in 2018” type press releases for us blogger types to take advantage of. I’m normally reluctant to do these “futures” based posts, as I’m notoriously bad at seeing the future (as are most people). These types of articles also invariably push the narrative in a certain direction based on whatever the vendor being represented is selling. That said I have a bit of a soft spot for Lance Smith and the team at Primary Data, so I thought I’d entertain the suggestion that I at least look at what’s on his mind. Unfortunately, scheduling difficulties meant that we couldn’t talk in person about what he’d sent through, so this article is based entirely on the paragraphs I was sent, and Lance hasn’t had the opportunity to explain himself :)

 

SDS, What Else?

Here’s what Lance had to say about software-defined storage (SDS). “Few IT professionals admit to a love of buzzwords, and one of the biggest offenders in the last few years is the term, “software-defined storage.” With marketers borrowing from the successes of “software-defined-networking”, the use of “SDS” attempts all kinds of claims. Yet the term does little to help most of us to understand what a specific SDS product can do. Despite the well-earned dislike of the phrase, true software-defined storage solutions will continue to gain traction because they try to bridge the gap between legacy infrastructure and modern storage needs. In fact, even as hardware sales declines, IDC forecasts that the SDS market will grow at a rate of 13.5% from 2017 – 2021, growing to a $16.2B market by the end of the forecast period.”

I think Lance raises an interesting point here. There’re a lot of companies claiming to deliver software-defined storage solutions in the marketplace. Some of these, however, are still heavily tied to particular hardware solutions. This isn’t always because they need the hardware to deliver functionality, but rather because the company selling the solution also sells hardware. This is fine as far as it goes, but I find myself increasingly wary of SDS solutions that are tied to a particular vendor’s interpretation of what off the shelf hardware is.

The killer feature of SDS is the idea that you can do policy-based provisioning and management of data storage in a programmatic fashion, and do this independently of the underlying hardware. Arguably, with everything offering some kind of RESTful API capability, this is the case. But I think it’s the vendors who are thinking beyond simply dishing up NFS mount points or S3-compliant buckets that will ultimately come out on top. People want to be able to run this stuff anywhere – on crappy whitebox servers and in the public cloud – and feel comfortable knowing that they’ll be able to manage their storage based on a set of business-focused rules, not a series of constraints set out by a hardware vendor. I think we’re close to seeing that with a number of solutions, but I think there’s still some way to go.

 

HCI As Silo. Discuss.

His thoughts on HCI were, in my opinion, a little more controversial. “Hyperconverged infrastructure (HCI) aims to meet data’s changing needs through automatic tiering and centralized management. HCI systems have plenty of appeal as a fast fix to pay as you grow, but in the long run, these systems represent just another larger silo for enterprises to manage. In addition, since hyperconverged systems frequently require proprietary or dedicated hardware, customer choice is limited when more compute or storage is needed. Most environments don’t require both compute and storage in equal measure, so their budget is wasted when only more CPU or more capacity is really what applications need. Most HCI architecture rely on layers of caches to ensure good storage performance.  Unfortunately, performance is not guaranteed when a set of applications running in a compute node overruns a caches capacity.  As IT begins to custom-tailor storage capabilities to real data needs with metadata management software, enterprises will begin to move away from bulk deployments of hyperconverged infrastructure and instead embrace a more strategic data management role that leverages precise storage capabilities on premises and into the cloud.”

There’re are a few nuggets in this one that I’d like to look at further. Firstly, the idea that HCI becomes just another silo to manage is an interesting one. It’s true that HCI as a technology is a bit different to the traditional compute / storage / network paradigm that we’ve been managing for the last few decades. I’m not convinced, however, that it introduces another silo of management. Or maybe, what I’m thinking is that you don’t need to let it become another silo to manage. Rather, I’ve been encouraging enterprises to look at their platform management at a higher level, focusing on the layer above the compute / storage / network to deliver automation, orchestration and management. If you build that capability into your environment, then whether you consume compute via rackmount servers, blade or HCI becomes less and less relevant. It’s easier said than done, of course, as it takes a lot of time and effort to get that layer working well. But the sweat investment is worth it.

Secondly, the notion that “[m]ost environments don’t require both compute and storage in equal measure, so their budget is wasted when only more CPU or more capacity is really what applications need” is accurate, but most HCI vendors are offering a way to expand storage or compute now without necessarily growing the other components (think Nutanix with their storage-only nodes and NetApp’s approach to HCI). I’d posit that architectures have changed enough with the HCI market leaders to the point that this is no longer a real issue.

Finally, I’m not convinced that “performance is not guaranteed when a set of applications running in a compute node overruns a caches capacity” is as much of a problem as it was a few years ago. Modern hypervisors have a lot of smarts built into them in terms of service quality and the modelling for capacity and performance sizing has improved significantly.

 

Conclusion

I like Lance, and I like what Primary Data bring to the table with their policy-based SDS solution. I don’t necessarily agree with him on some of these points (particularly as I think HCI solutions have matured a bunch in the last few years) but I do enjoy the opportunity to think about some of these ideas when I otherwise wouldn’t. So what will 2018 bring in my opinion? No idea, but it’s going to be interesting, that’s for sure.

Primary Data Announces Survey Results (And A Very Useful Tool)

Survey Says …

Primary Data conducted a “Storage Census” at VMworld this year, and I had the opportunity to talk about the results with Lance Smith (CEO, Primary Data). You can read the press release here.

 

The Enterprise Responds

The majority of the 313 respondents worked at large enterprises with over 1,000 to 5,000 and more employees, and their companies were mostly over ten years old.

[image courtesy of Primary Data]

 

Performance and Cold Data

The survey also found that:

  • 38 percent of respondents selected performance as their biggest challenge for IT, making it the most pressing IT issue named in the survey;
  • Data migrations were the second most common headache for 36 percent of those surveyed;
  • Budget challenges (35 percent) and cloud adoption (27 percent) closely followed among the top concerns for respondents;
  • The majority of organizations estimate that at least 60 percent of their data is cold; and
  • 27 percent of organizations manage twenty or more storage systems, 44 percent manage ten or more storage systems.

Smith said that “[p]erformance remains a noisy problem, and the lack of insight into data forces IT to overprovision and overspend even though the professionals surveyed know that the majority of their data is actually sitting idle”. This leads me to the next part of Primary Data’s announcement: Data Profiler.

 

Data Profiler Could Be A Very Useful Tool

The team then ran me through a demo of a new tool available via DataSphere called Data Profiler. The Data Profiler provides a number of data points that change according to the chosen objectives, including

  • Cost per storage tier,
  • Number of files per tier,
  • Capacity per tier; and
  • Total cost of the analyzed global storage ecosystem.

Data tiers can be easily added to the Data Profiler to evaluate policies using the resources available in each customer’s unique environment.

[image courtesy of Primary Data]

The cool thing about this tool is you can do all your own modelling by plugging in various tiers of storage, cost per MB/GB/TB, etc and have it give you a clear view of potential cost savings, assuming that your data is, indeed, cold. You can then build these policies (Objectives) in DataSphere and have it move everything around to align with the policy you want to use. You want your hot data to live in the performance-oriented tiers, and you want the cold stuff to live in the cheap and deep parts. This is a simple thing to achieve when you have one array. But when you have a range of arrays from different vendors it becomes a little more challenging.

 

Conclusion

I wasn’t entirely surprised by the results of the survey, as a number of my enterprise customers have reported the same concerns. Enterprise storage can be hard to get right, and it’s clear that a lot of people don’t really understand the composition of their storage environment from an active data perspective.

I’ve been a fan of Pd’s approach to data virtualisation for some time, having observed the development of the company over the course of a number of Storage Field Day events. Any tool that you can use to get a better idea of how to best deploy your storage, particularly across heterogeneous environments, is a good thing in my book. I was advised by the team that this is built into DataSphere at the moment, although it’s easy enough to run a small script to gather the data to send back to their SEs for analysis. I believe there may also be an option to install the tool locally, but I’d recommend engaging with Primary Data’s team to find out more about that. As with most things in IT, the solution isn’t for everyone. If you’re running a single, small array, you may have a good idea of what it costs to use various tiers of storage, how old your data is and how frequently it’s being accessed. But there are plenty of enterprises out there with a painful variety of storage solutions deployed for any number of (usually) good reasons. These places could potentially benefit greatly from some improved insights into their environments.

I’ve said it before and I’ll keep saying it – Enterprise IT can be hard to do right. Any tools you can use to make things easier and potentially more cost effective should be on your list of things to investigate.

Primary Data Attacks Application Ignorance

Disclaimer: I recently attended Storage Field Day 13.  My flights, accommodation and other expenses were paid for by Tech Field Day and Pure Storage. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

 

Primary Data recently presented at Storage Field Day 13. You can see videos of their presentation here, and download my rough notes from here.

 

My Applications Are Ignorant

I had the good fortune of being in a Primary Data presentation at Storage Field Day. I’ve written about them a few times (here, here and here) so I won’t go into the basic overview again. What I would like to talk about is the idea raised by Primary Data that “applications are unaware”. Unaware of what exactly? Usually it’s the underlying infrastructure. When you deploy applications they generally want to run as fast as they can, or as fast as they need to. When you think about it though, it’s unusual that they can determine the platform they run on and only run as fast as the supporting infrastructure allows. This is different to phone applications, for example, that are normally written to operate within the constraints of the hardware.

An application’s ignorance has an impact. According to Primary Data, this impact can be in terms of performance, cost, protection (or all three). The cost of unawareness can have the following impact on your environment:

  • Bottlenecks hinder performance
  • Cold data hogs hot capacity
  • Over provisioning creates significant overspending
  • Migration headaches keep data stuck until retirements
  • Vendor lock-in limits agility and adds more cost

As well as this, the following trends are being observed in the data centre:

  • Cost: Budgets are getting smaller;
  • Time: We never have enough; and
  • Resources: We have limited resources and infrastructure to run all of this stuff.

Put this all together and you’ve got a problem on your hands.

 

Primary Data In The Mix

Primary Data tells us that DataSphere is solving the main pain points through:

They do this with DataSphere, “a metadata engine that automates the flow of data across the enterprise infrastructure and the cloud to meet evolving application demands”. It:

  • Is storage and vendor agnostic;
  • Virtualises the view of data;
  • Automates the flow of data; and
  • Solves the inefficiency of traditional storage and compute architectures.

 

Can We Be More Efficient?

Probably. But the traditional approach of architecting infrastructure for various workloads isn’t really working as well as we’d like. I like the way Primary Data are solving the problem of application ignorance. But I think it’s treating a symptom, rather than providing a cure. I’m not suggesting that I think what Pd are doing is wrong by any stretch, but rather that my applications will still remain ignorant. They’re still not going to have an appreciation of the infrastructure they’re running on, and they’re still going to run at the speed they want to run at. That said, with the approach that Primary Data takes to managing data, I have a better chance of having applications running with access to the storage resources they need.

Application awareness means different things to different people. For some people, it’s about understanding how the application is going to behave based on the constraints it was designed within, and what resources they think it will need to run as expected. For other people, it’s about learning the behavior of the application based on past experiences of how the application has run and providing infrastructure that can accommodate that behaviour. And some people want their infrastructure to react to the needs of the application in real time. I think this is probably the nirvana of infrastructure and application interaction.

Ultimately, I think Primary Data provides a really cool way of managing various bits of heterogeneous storage in a way that aligns with some interesting notions of how applications should behave. I think the way they pick up on the various behaviours of applications within the infrastructure and move data around accordingly is also pretty neat. I think we’re still some ways away from running the kind of infrastructure that interacts intelligently with applications at the right level, but Primary Data’s solution certainly helps with some of the pain of running ignorant applications.

You can read more about DataSphere Extended Services (DSX) here, and the DataSphere metadata engine here.

Storage Field Day 10 – Wrap-up and Link-o-rama

Disclaimer: I recently attended Storage Field Day 10.  My flights, accommodation and other expenses were paid for by Tech Field Day. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

SFD-Logo2-150x150

This is a quick post to say thanks once again to Stephen, Tom, Megan and the presenters at Storage Field Day 10. I had an enjoyable and educational time. For easy reference, here’s a list of the posts I did covering the event (they may not match the order of the presentations).

Storage Field Day – I’ll Be At SFD10

Storage Field Day 10 – Day 0

Storage Field Day 10 – (Fairly) Full Disclosure

Kaminario are doing some stuff we’ve seen before, but that’s okay

Pure Storage really aren’t a one-trick pony

Tintri Keep Doing What They Do, And Well

Nimble Storage are Relentless in Their Pursuit of Support Excellence

Cloudian Does Object Smart and at Scale

Exablox Isn’t Just Pretty Hardware

It’s Hedvig, not Hedwig

The Cool Thing About Datera Is Intent

Data Virtualisation is More Than Just Migration for Primary Data

 

Also, here’s a number of links to posts by my fellow delegates (and Tom!). They’re all really quite smart, and you should check out their stuff, particularly if you haven’t before. I’ll try keep this updated as more posts are published. But if it gets stale, the SFD10 landing page has updated links.

 

Chris M Evans (@ChrisMEvans)

Storage Field Day 10 Preview: Hedvig

Storage Field Day 10 Preview: Primary Data

Storage Field Day 10 Preview: Exablox

Storage Field Day 10 Preview: Nimble Storage

Storage Field Day 10 Preview: Datera

Storage Field Day 10 Preview: Tintri

Storage Field Day 10 Preview: Pure Storage

Storage Field Day 10 Preview: Kaminario

Storage Field Day 10 Preview: Cloudian

Object Storage: Validating S3 Compatibility

 

Ray Lucchesi (@RayLucchesi)

Surprises in flash storage IO distributions from 1 month of Nimble Storage customer base

Has triple parity Raid time come?

Pure Storage FlashBlade well positioned for next generation storage

Exablox, bring your own disk storage

Hedvig storage system, Docker support & data protection that spans data centers

 

Jon Klaus (@JonKlaus)

I will be flying out to Storage Field Day 10!

Ready for Storage Field Day 10!

Simplicity with Kaminario Healthshield & QoS

Breaking down storage silos with Primary Data DataSphere

Cloudian Hyperstore: manage more PBs with less FTE

FlashBlade: custom hardware still makes sense

Squashing assumptions with Data Science

Bringing hyperscale operations to the masses with Datera

Making life a whole lot easier with Tintri VM-aware storage

 

Enrico Signoretti (@ESignoretti)

VM-aware storage, is it still a thing?

Scale-out, flash, files and objects. How cool is Pure’s FlashBlade?

 

Josh De Jong (@EuroBrew)

 

Max Mortillaro (@DarkkAvenger)

Follow us live at Storage Field Day 10

Primary Data: a true Software-defined Storage platform?

If you’re going to SFD10 be sure to wear microdrives in your hair

Hedvig Deep Dive – Is software-defined the future of storage?

Pure Storage’s FlashBlade – Against The Grain

Pure Storage Flashblade is now available!

 

Gabe Maentz (@GMaentz)

Heading to Tech Field Day

 

Arjan Timmerman (@ArjanTim)

We’re almost live…

Datera: Elastic Data Fabric

 

Francesco Bonetti (@FBonez)

EXABLOX – A different and smart approach to NAS for SMB

 

Marco Broeken (@MBroeken)

 

Rick Schlander (@VMRick)

Storage Field Day 10 Next Week

Hedvig Overview

 

Tom Hollingsworth (@networkingnerd)

Flash Needs a Highway

 

Finally, thanks again to Stephen, Tom, Megan (and Claire in absentia). It was an educational and enjoyable few days and I really valued the opportunity I was given to attend.

SFD10_GroupPhoto

Data Virtualisation is More Than Just Migration for Primary Data

Disclaimer: I recently attended Storage Field Day 10.  My flights, accommodation and other expenses were paid for by Tech Field Day. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

logo-Primary_Data-400px

Before I get started, you can find a link to my raw notes on Primary Data’s presentation here. You can also see videos of the presentation here. I’ve seen Primary Data present at SFD7 and SFD8, and I’ve typically been impressed with their approach to Software-Defined Storage (SDS) and data virtualisation generally. And I’m also quite a fan of David Flynn‘s whiteboarding chops.

SFD10_Pd_DavidFlynn

 

Data Virtualisation is More Than Just Migration

Primary Data spent  some time during their presentation at SFD10 talking about Data Migration vs Data Mobility.

SFD10_Pd_DataMigrationvsMobility

[image courtesy of Primary Data]

Data migration can be a real pain to manage. It’s quite often a manual process and is invariably tied to the capabilities of the underlying storage platform hosting the data. The cool thing about Primary Data’s solution is that it offers dynamic data mobility, aligning “data’s needs (objectives) with storage capabilities (service levels) through automated mobility, arbitrated by economic value and reported as compliance”. Sounds like a mouthful, but it’s a nice way of defining pretty much what everyone’s been trying to achieve with storage virtualisation solutions for the last decade or longer.

What I like about this approach is that it’s a data-centric, rather than employing a storage platform focused approach. Primary Data supports “anything that can be presented to Linux as a block device”, so the options to deploy this stuff are fairly broad. Once you’ve presented your data to DSX, there’s some smart service level objectives (SLOs) that can be applied to the data. These can be broken down into the categories of protection, performance, and price/penalty:

Protection

  • Durability
  • Availability
  • Recoverability – Security
  • Priority
  • Sovereignty

Performance

  • IOPS / Bandwidth / Latency – Read / Write
  • Sustained / Burst

Price / Penalty

  • Per File
  • Per Byte
  • Per Operation

Access Control can also be applied to your data. With Primary Data, “[e]very storage container is a landlord with floorspace to lease and utilities available (capacity and performance)”.

 

Further Reading and Final Thoughts

I like the approach to data virtualisation that Primary Data have taken. There are a number of tools on the market that claim to fully virtualise storage and offer mobility across platforms. Some of them do it well, and some focus more on the benefits provided around ease of migration from one platform to another.

That said, there’s certainly some disagreement in the market place on whether Primary Data could be considered a fully-fledged SDS solution. Be that as it may, I really like the focus on data, rather than silos of storage. I’m also a big fan of applying SLOs to data, particularly when it can be automated to improve the overall performance of the solution and make the data more accessible and, ultimately, more valuable.

Primary Data has a bunch of use cases that extend beyond data mobility as well, including deployment options ranging from Hyperconverged, software-defined NAS and clustering across existing storage platforms. Primary Data want to “do for storage what VMware did for compute”. I think the approach they’ve taken has certainly gotten them on the right track, and the platform has matured greatly in the last few years.

If you’re after some alternative (and better thought out) posts on Primary Data, you can read Jon‘s post here. Max also did a good write-up here, while Chris M.Evans did a nice preview post on Primary Data that you can find here.

Storage Field Day 10 – (Fairly) Full Disclosure

Disclaimer: I recently attended Storage Field Day 10.  My flights, accommodation and other expenses were paid for by Tech Field Day. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

SFD-Logo2-150x150

Here are my notes on gifts, etc, that I received as a delegate at Storage Field Day 10. I’d like to point out that I’m not trying to play companies off against each other. I don’t have feelings one way or another about receiving gifts at these events (although I generally prefer small things I can fit in my suitcase). Rather, I’m just trying to make it clear what I received during this event to ensure that we’re all on the same page as far as what I’m being influenced by. Some presenters didn’t provide any gifts as part of their session – which is totally fine. I’m going to do this in chronological order, as that was the easiest way for me to take notes during the week. While every delegate’s situation is different, I’d also like to clarify that I took 5 days of training / work time to be at this event (thanks to my employer for being on board).

 

Saturday

I paid for my taxi to BNE airport. I had a burger at Benny Burger in SYD airport. It was quite good. I flew Qantas economy class to SFO. The flights were paid for by Tech Field Day. Plane food was consumed on the flight. It was a generally good experience.

 

Tuesday

When I arrived at the hotel I was given a bag of snacks by Tom. The iced coffee and granola bars came in handy. We had dinner at Il Fornaio at the Westin Hotel. I had some antipasti, pizza fradiavola and 2 Hefeweizen beers (not sure of the brewery).

 

Wednesday

We had breakfast in the hotel. I had bacon, eggs, sausage, fruit and coffee. We also did the Yankee Gift Swap at that time and I scored a very nice stovetop Italian espresso coffee maker (thanks Enrico!). We also had lunch at the hotel, it was something Italian. Cloudian gave each delegate a green pen, bottle opener, 1GB USB stick, and a few Cloudian stickers. We had dinner at Gordon Biersch in San Jose. I had some sliders (hamburgers for small people) and about 5 Golden Export beers.

 

Thursday

Pure Storage gave each delegate a Tile, a pen, some mints, and an 8GB USB stick. Datera gave each delegate a Datera-branded “vortex 16oz double wall 18/8 stainless steel copper vacuum insulated thermal pilsner” (a cup) with our twitter handles on them. Tintri provided us with a Tintri / Nike golf polo shirt, a notepad, a pen, an 8GB USB stick, and a 2600mAh USB charger. We then had happy hour at Tintri. I had a Pt. Bonita Pilsner beer and a couple of fistfuls of prawns. For dinner we went to Taplands. I had a turkey sandwich and 2 Fieldwork Brewing Company Pilsners.

 

Friday

We had breakfast on Friday at Nimble Storage. I had some bacon, sausage and eggs for breakfast with an orange juice. I don’t know why my US comrades struggle so much with the concept of tomato sauce (ketchup) with bacon. But there you go. Nimble gave us each a custom baseball jersey with our name on the back and the Nimble logo. They also gave us each a white lab coat with the Nimble logo on it. My daughters love the coat. Hedvig provided us with a Hedvig sticker and a Hedvig-branded Rogue bluetooth speaker. We had lunch at Hedvig, which was a sandwich, some water, and a really delicious choc-chip cookie. Exablox gave each of us an Exablox-branded aluminium water bottle. We then had happy hour at Exablox. I had two Anchor Brewing Liberty Ale beers (“tastes like freedom”) and some really nice cheese. To finish off we had dinner at Mexicali in Santa Clara. I had a prawn burrito. I didn’t eat anything on the flight home.

 

Conclusion

I’d like to extend my thanks once again to the Tech Field Day organisers and the companies presenting at the event. I had a super enjoyable and educational time. Here’s a photo.

SFD10_disclosure1

 

Storage Field Day 10 – Day 0

Disclaimer: I recently attended Storage Field Day 10.  My flights, accommodation and other expenses were paid for by Tech Field Day. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

SFD-Logo2-150x150

This is just a quick post to share some thoughts on day zero at Storage Field Day 10. I can do crappy tourist snaps as well if not better than the next guy. Here’s the obligatory wing shot. No wait here’s two – one leaving SYD and the other coming in to SFO. Bet you can’t guess which is which.

SFD10_plane1     SFD10_plane2

We all got together for dinner on Tuesday night in the hotel. I had the pizza. It was great.

SFD10_Food

But enough with the holiday snaps and underwhelming travel journal. Thanks again Stephen, Tom, Claire and Megan for having me back, making sure everything is running according to plan and for just being really very decent people. I’ve really enjoyed catching up with the people I’ve met before and meeting the new delegates. Look out for some posts related to the Tech Field Day sessions in the next few weeks. And if you’re in a useful timezone, check out the live streams from the event here, or the recordings afterwards.

Here’s the rough schedule (all times are ‘Merican Pacific).

Wednesday, May 25 9:30 – 11:30 Kaminario Presents at Storage Field Day 10
Wednesday, May 25 12:30 – 14:30 Primary Data Presents at Storage Field Day 10
Wednesday, May 25 15:00 – 17:00 Cloudian Presents at Storage Field Day 10
Thursday, May 26 9:30 – 11:30 Pure Storage Presents at Storage Field Day 10
Thursday, May 26 13:00 – 15:00 Datera Presents at Storage Field Day 10
Thursday, May 26 16:00 – 18:00 Tintri Presents at Storage Field Day 10
Friday, May 27 8:00 – 10:00 Nimble Storage Presents at Storage Field Day 10
Friday, May 27 10:30 – 12:30 Hedvig Presents at Storage Field Day 10
Friday, May 27 13:30 – 15:30 Exablox Presents at Storage Field Day 10

You can also follow along with the live stream here.


Storage Field Day 8 – Wrap-up and Link-o-rama

Disclaimer: I recently attended Storage Field Day 8.  My flights, accommodation and other expenses were paid for by Tech Field Day. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

This is a quick post to say thanks once again to Stephen, Claire and the presenters at Storage Field Day 8. I had a fantastic time and learnt a lot. For easy reference, here’s a list of the posts I did covering the event (not necessarily in chronological order).

Storage Field Day – I’ll be at SFD8

Storage Field Day 8 – Day 0

Storage Field Day 8 – (Fairly) Full Disclosure

Cohesity – There’s more to this than just “Secondary Storage”

Violin Memory – Sounds a lot better than it used to

Pure Storage – Orange is the new black, now what?

INFINIDAT – What exactly is a “Moshe v3.0”?

Nimble Storage – InfoSight is still awesome

Primary Data – Because we all want our storage to do well

NexGen Storage – Storage QoS and other good things

Coho Data – Numerology, but not as we know it

Intel – They really are “Inside”

Qumulo – Storage for people who care about their data

 

Also, here’s a number of links to posts by my fellow delegates. They’re all switched-on people, and you’d do well to check out what they’re writing about. I’ll try and update this list as more posts are published. But if it gets stale, the SFD8 landing page has updated links.

 

Ray Lucchesi (@RayLucchesi)

Coho Data, the packet processing squeeze and working set exploits

Primary data’s path to better data storage presented at SFD8

PB are the new TB, GreyBeards talk with Brian Carmody, CTO Inifinidat

 

Mark May (@CincyStorage)

Can Violin Step back From the Brink?

Storage software can change enterprise workflow

Redefining Secondary Storage

 

Scott D. Lowe (@OtherScottLowe)

IT as a Change Agent: It’s Time to Look Inward, Starting with Storage

Overcoming “New Vendor Risk”: Pure Storage’s Techniques

So, What is Secondary Storage Cohesity-Style?

Data Awareness Is Increasingly Popular in the Storage Biz

 

Jon Klaus (@JonKlaus)

Storage Field Day – I will be attending SFD8!

Wow it’s early – Traveling to Storage Field Day 8

Coho Data: storage transformation without disruption

Pure Storage: Non Disruptive Everything

Cohesity is changing the definition of secondary storage

Qumulo: data-aware scale-out NAS

Nimble Storage – InfoSight VMVision

NexGen Storage: All-Flash Arrays can be hybrids too!

Infinidat: Enterprise reliability and performance

 

Alex Galbraith (@AlexGalbraith)

Looking Forward to Storage Field Day 8

Without good Analytics you dont have a competitive storage product

How often do you upgrade your storage array software?

Where and why is my data growing?…

Why are storage snapshots so painful?

 

Jarett Kulm (@JK47TheWeapon)

Storage Field Day 8 – Here I come!

 

Enrico Signoretti (@ESignoretti)

#SFD8, it’s storage prime time!

Analytics, the key to (storage) happiness

We are entering the Data-aware infrastructure era

Has the next generation of monolithic storage arrived?

Juku.beats 25: Qumulo, data-aware scale-out NAS

Infinidat: awesome tech, great execution

Juku.beats 27: NexGen Storage, QoS made easy.

Software defined? No no no, it’s poorly defined storage (and why Primary Data is different)

Juku.beats 28 – Infinidat storage: multiple nine resiliency, high performance, $1/GB

Are you going All-Flash? Nah, the future is hybrid

 

Vipin V.K. (@VipinVK111)

Tech Field Day Calling…! – #SFD8

Infinibox – Enterprise storage solution from Infinidat

Understanding NVMe (Non-Volatile Memory Express)

All-Flash symphony from Violin Memory

Cohesity – Secondary storage consolidation

With FLASH, things are changing ‘in a flash’ !?

 

Josh De Jong (@EuroBrew)

Storage Field Day Here I Come!

Thoughts in the Airport

NexGen Storage – The Future is Hybrid

Pure Storage – Enterprise Ready, Pure and Simple

 

Finally, thanks again to Stephen, Claire (and Tom in absentia). It was an educational and enjoyable few days and I really valued the opportunity I was given to attend.

SFD8_Group

 

Primary Data – Because we all want our storage to do well

Disclaimer: I recently attended Storage Field Day 8.  My flights, accommodation and other expenses were paid for by Tech Field Day. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

For each of the presentations I attended at SFD8, there are a few things I want to include in the post. Firstly, you can see video footage of the Primary Data presentation here. You can also download my raw notes from the presentation here. Finally, here’s a link to the Primary Data website that covers some of what they presented.

 

Primary Data presented at Storage Field Day 7 – and I did a bit of a write-up here. At the time they had no shipping product, but now they do. Primary Data were big on putting data in the right place, and they’ve continued that theme with DataSphere.

 

Because we all want our storage to do well

I talk to my customers a lot about the concept of service catalogues for their infrastructure. Everyone is talking about X as a Service and the somewhat weird concept of applying consumer behaviour to the enterprise. But for a long time this approach has been painful, at least with mid-range storage products, because coming up with classifications of performance and availability for these environments is a non-trivial task. In larger environments, it’s also likely you won’t have consistent storage types across applications, with buckets of data being stored all over the place, and accessible via a bunch of different protocols. The following image demonstrates nicely the different kinds of performance levels you might apply to your environment, as not all applications were created equal. Neither are storage arrays, come to think of it.

PD_menu

[image courtesy of Primary Data]

Primary Data say that “every single capability and characteristic of the storage system can be thought of in terms of whether the data needs it or not”. As I said before, you need to look at your application requirements in terms of both:

  • Performance (reads vs writes, IOPS, bandwidth, latency); and
  • Protection (data durability, availability, security).

Of course, this isn’t simple when you then attempt to apply compute requirements and network requirements to the applications as well, but let’s just stick with storage requirements for the time being. Once you understand the client, and the value of the objectives being met on the data, you can start to apply Objectives and “Smart Objectives” (Objectives applied to particular types of data) to the data. With this approach, you can begin to understand the cost of specific performance and protection levels. Everyone wants Platinum, until they start having to pay for it. These costs can then be translated and presented as Service Level Agreements in your organisation’s service catalogue for consumption by various applications.

 

Closing Thoughts and Further Reading

Primary Data has a lot of smart people involved in the product, and they always put on one hell of  whiteboard session when I see them present. To my mind, the key thing to understand with DataSphere isn’t that it will automatically transform your storage environment into a fully optimised, service catalogue enabled, application performance nirvana. Rather, it’s simply providing the smarts to leverage the insights that you provide. If you don’t have a service catalogue, or a feeling in your organisation that this might be a good thing to have, then you’re not going to get the full value out of Primary Data’s offering.

And while you’re at it, check out Cormac’s post for a typically thorough overview of the technology involved.

 

 

Storage Field Day 8 – (Fairly) Full Disclosure

Disclaimer: I recently attended Storage Field Day 8.  My flights, accommodation and other expenses were paid for by Tech Field Day. There is no requirement for me to blog about any of the content presented and I am not compensated in any way for my time at the event.  Some materials presented were discussed under NDA and don’t form part of my blog posts, but could influence future discussions.

My full disclosure post will never reach the heady heights of Justin’s, all I can do is try and get somewhere near that level. Here are my notes on gifts, etc, that I received as a delegate at Storage Field Day 8. I’d like to point out that I’m not trying to play presenters / companies off against each other. I don’t have feelings one way or another about receiving gifts at these events (although I generally prefer small things I can fit in my suitcase). Rather, I just trying to make it clear what I received during this event to ensure that we’re all on the same page as far as what I’m being influenced by. Some presenters didn’t provide any gifts as part of their session – which is totally fine. I’m going to do this in chronological order, as that was the easiest way for me to take notes during the week. While every delegate’s situation is different, I’d also like to clarify that I took 2 days of vacation time and 3 days of training / work time to be at this event.

 

Saturday

I left my house Saturday morning at 4am and travelled BNE -> SYD -> SFO. I paid for my own cab to the airport in Brisbane, with the transfer in Sydney covered by my ticket. A period of time passed and I was given “dinner” and “breakfast”. This was included in the price of the ticket (paid for by Tech Field Day). Alcoholic beverages were not included, but I stuck with water. United wanted to sell me snacks at some stage too, but I politely declined. I had a spare seat next to me and the opportunity to watch “Office Space” for the umpteenth time on the tiny screen, so it wasn’t all bad.

 

Tuesday

On Tuesday night we had the delegate dinner at Antonella’s Ristorante in San Jose. It’s a nice Italian place. I had a bit of everything. As part of the gift exchange I received a pair of socks and some very nice beef jerky. I also had two Modelos and a Montejo at the hotel bar after dinner. Claire gave us all a bag of snacks when we got to the hotel. I particularly enjoyed the cashews and Reese’s things.

SW_BAG

 

Wednesday

We had breakfast at Mikayla’s. I had a breakfast roll. Coho Data provided us with doughnuts with bacon on top. While you can put bacon on all kinds of things, sometimes it’s best if you don’t. They also gave each of us a Coho-flavoured Raspberry Pi 2. We had lunch at Pure Storage provided by DishDash. They also gave us some brightly-coloured socks and a 8GB USB stick. Cohesity  provided us with a Hawaiian-themed food and beverages, including Mai Tais. This may or may not have been a mistake given the feistiness of some of the delegates during the ensuing session. They also gave us customised Amazon Echo devices. I gave mine to Claire as it wouldn’t work in Australia. We also received a Cohesity sleeveless fleece top – arguably these don’t work in Australia either, as it doesn’t get that cold where I live :). We had a social event at the Hotel Valencia on Santana Row. I had some sliders and a few Heinekens. I then went to the bar at the hotel and had a Fat Tire before retiring for the night.

 

Thursday

We had a buffet breakfast at the hotel. I had some bacon and eggs and fruit. Qumulo provided us with a Qumulo-branded notepad. We had a Californian BBQ-style lunch at the hotel. After a few more sessions, we then had dinner at Picasso’s in downtown San Jose. It was a tapas place, so I had a bit of everything. And two glasses of Sangria. I then had a Fat Tire at the hotel bar upon our return.

 

Friday

On Friday morning we had breakfast at Violin Memory. I had a breakfast burrito and coffee (I loves me some Mexican in the morning). They also gave us a T-shirt, stubbie holder, bottle opener, mints and toothpicks. Intel provided us with a nice buffet lunch at the Intel HQ Executive Briefing Centre. Nimble Storage gave us each a small tote bag, a webcam privacy screen, some headphones and a travel set. This was followed by “Happy Hour” at Nimble. I had a sparkling water. To finish off we had dinner at Mexicali in Santa Clara. I had a prawn burrito. I didn’t eat anything on the flight home.

 

Conclusion

I’d like to extend my thanks once again to the Storage Field Day organisers and the companies presenting at the event. I had an extremely enjoyable and educational time. Here’s a photo.

FINAL