Technical Support Services Blog
Discover the latest trends in technology, and the technical issues customers are overcoming with the aid of HP Technology Services.

Is your File System right-sized? Is it too BIG? Can it even be -- too BIG? Are there signs?

In the past few years, IT shops have grown accustomed to letting their file systems grow to staggering capacities.  File systems are advancing at similar rates to extend their capacity limits and design thresholds, especially around metadata management.  However, this increased capacity lays the groundwork for extremely volatile situations. Computer-room.png

 

Is your data safe?  Will you be able to access it when you need it?  How is your Storage solution architected?  When was the last backup? If you even have a backup, how long will it take you to restore it? Questions all business leaders and CIOs should be asking.

 

Is large, single name space file systems right for you?

 

 

Recently, I was engaged on a case that involved data migration at the block level of a customer’s business data.  After the migration was completed –successfully I might add –all systems were released to full production load.  Sometime later (weeks/months) the System administrator ran a simple script to change file and directory permissions –for which he felt would be a harmless action---affected the entire business unit by taking the file system offline and impacting thousands of users.  The “hot seat” takes all new meaning when the IT guy had to explain to executives that a benign task caused a single, extraordinarily large file systems to go offline.

This particular case revolved around a NTFS file system of 10 TB capacity containing 10’s of millions of files of varying in size from a few KB to multiple gigabytes.  At the particular time of the outage the customer had no idea that his file system was horrifically fragmented at both the NTFS file system layer and the MFT.  In this particular case the outage was caused by a system administrator applying ACL changes to millions of files via a batch job.  These particular actions caused extremely long wait times at the system layer in the file system driver stack.  As a result, the system became very lethargic leading network pauses, clients being disconnected from the share, and applications using those shares crashing.

 

It was not that the customer just had 10 TB NTFS file system, it was also the millions upon millions of files varying in size from a few KB to multiple gigabytes which lead to the climatic event.  In this particular case, the MFT had grown to be greater than 12 gigabytes, fragmentation was rife throughout the file system and MFT.   In this case, defrag, and contig of just MFT (metadata) ran for days trying to realign data on contiguous blocks but was never able to complete.  If in fact contig had completed, one pass would not have been enough.  It would have required several more executions to realign additional blocks not aligned on the initial pass due to locked address space as the file system was still online –not to mention the many other factors that existed.  In any case, a new design was required to fix this clients problem, they decided to go with multiple smaller NTFS file systems.  There’s far more to this story; for which, I am not including here, as it will just muddy the waters.  Suffice it to say, the customer had no choice but to keep his basic disk windows approach due to their style of clustering, so we had to reduce the size of the file system to manage the number of files and fragmentation.

Was this strictly a problem of NTFS?  Personally, as the debugger, coder, author, and Unix guru, you might expect me to say – yes.  However, I truly feel that sometimes good products get a bad rap for being utilized in areas they were never intended or in areas that were never conceived.  I’m not saying you cannot have a 10 TB or even a 200 TB NTFS file system.  Both of these are fully supported, and we do have customer’s that use them to these extremes.  At the end of the day it is extraordinary important to think about what is supported, but also think about the use model—would you take “any” automobile with four wheels and a high powered engine to the racetrack?

 

In this particular case, there are better solutions to accommodate extraordinary large file systems which contain extraordinarily high numbers of files of varying size and access patterns along with thousands of user accesses (IBRIX, Luster, etc…) If you want to know more, please comment. 

 

Until next time.

 

Takeaways:
This issue is not going away – In fact, similar issues are cropping up more and more as customers/companies continue to grow their legacy file systems bigger, and bigger, and –well, you get the picture. Before you put your business into a difficult corner, consider the ramifications of going to a single filesystem petabytes in size – Can you back it up, can you restore it before the company is out of business?  You will have to have a DR Strategy with a single file system of that size – but remember synchronous or asynchronous hardware replication only protects hardware failure, not filesystem issues, viruses, or human mistakes such as deleting or formatting the wrong disk…

 

Follow Greg & Chris Tinker (TinkerTwinsatHP) on Twitter, Facebook, or this Blog

 

 

Learn about game-changing innovations in software, hardware, services, and networking. HP Discover2012 http://bit.ly/LOt1qp

 

Discover2012

Leave a Comment

We encourage you to share your comments on this post. Comments are moderated and will be reviewed
and posted as promptly as possible during regular business hours

To ensure your comment is published, be sure to follow the community guidelines.

Be sure to enter a unique name. You can't reuse a name that's already in use.
Be sure to enter a unique email address. You can't reuse an email address that's already in use.
Type the characters you see in the picture above.Type the words you hear.
Search
About the Author
  • More than 30 years in Sales and Marketing in IT services business. Currently managing global campaigns for Datacenter Care.
  • I graduated in Software Engineering. Joined HP family five years ago, I deliver Insight Remote Support technical consulting for HP customers, in North America, Canada and Latin America. Assist setting up, installing and configuring the solution in customers' IT environments.
  • I am an identical twin. My brother’s name is Greg Tinker and we have been extremely fortunate working similar careers within HP, known to our HP colleagues and many of our customers as "The Tinkers". Our job is to be the technical lead on major business operational outages with millions of Dollars/Euros hanging in the balance. We both have a complete background in architectural, Infrastructure and application environments from both the proactive and reactive side of HP Enterprise Service (HP ES), and HP Enterprise Business (HP EB).
  • I am an identical twin. My brother’s name is Chris Tinker and we have been extremely fortunate working similar careers within HP, known to our HP colleagues and many of our customers as "The Tinkers". Our job is to be the technical lead on major business operational outages with millions of Dollars/Euros hanging in the balance. We both have a complete background in architectural, Infrastructure and application environments from both the proactive and reactive side of HP Enterprise Service (HP ES), and HP Enterprise Business (HP EB). We have always attended the same schools, studied the same material (big surprise, as we are identical twins), and have always worked as a close team and strive to demonstrate our teaming ability’s to others. We each have more than 11 years experience supporting mission-critical enterprise customers on a broad range of technologies. We’ve both won the HP MVP award multiple times as well as coauthored books, programs, and whitepapers in our spare time.
  • More than 25 years in the IT industry, managing ITSM, service development and delivery projects in Technology Services. Specialized in end2end support for ISV based business solutions. Certified ITIL and project management expert.
  • Eduardo Zepeda, WW TS Social Media Program Manager & Internal Communications for WW Technology Services Blogging on behalf of HP Technology Services (TS_Guest)
  • I have been with HP for 13 years, always in Services - first as a Services Channel Sales rep, then a Channel Services Segment Manager, and now, in WW Technology Services Marketing. These may be my formal job titles, but I'm really a Cheerleader for HP Services! I feel that HP has great services, exceptional Technical Experts and Delivery teams, and so many cool things are going on at HP Services. So, stay tuned...
  • I have 27 years of system, storage, and networking experience including detailed work with Data Protector (formerly Omniback II) for the past 14 years. My expertise includes StoreOnce deduplication technology, D2D appliances, performance tuning, complex remediation, and online backup integration with applications like Oracle and infrastructure like VMware. Traveling across the United States and Canada as a Sr. Technical Consultant, I deliver specialized consulting for a broad variety of HP customers.
  • MrCollaboration (aka Jim Evans) is an HP Global Services Alliance Manager. He has worked in the IT industry for more than 30 years, 22 of which were spent with Digital Equipment Corporation, Compaq and HP. He works with many third party vendors and partners to develop processes to facilitate excellent support and service for mutual customers. Jim is also HP’s representative to the Technical Support Alliance Network (TSANet).
  • I've been working in Customer Service for over 20 years. During my career I've provided support services for Languages, Programming Libraries and Operating Systems. During the last 10 years I've provide support for Linux and more recently VMware. My current role is as a Technical Account Manager working in the HP Custom Mission Critical Services Industry Standard Operating Systems team. I provide both reactive and proactive operating system support for proLiant servers and blades. Our services in the Custom teams are built on statement of work contracts for large HP customers who need a customized mission critical support offering.
  • I've been working in HP since 2007 like IT agent, developer, Web designer and then like Web Project Manager
  • I like to listen as much as I like to talk. Why? My 25+ years in the technology industry has taught me that the key to delivering value to customers is to understand what they value in the first place! I developed this passion for customers and consultative selling during my 12 years with Accenture, and I have continued to approach customers in a consultative way during my 12+ year tenure with HP. I also have a passion for HP given my knowledge of our Product and Service Portfolio and the differentiators we possess that position us as a leader in the areas our customers are telling us they want to go. Converged Infrastructure, Converged Cloud, Big Data – and the associated Service and Support implications – all such exciting technology trends where our success will hinge upon our ability to differentiate ourselves versus others in the areas that matter most to our customers. Right up my alley, and I am proud to be part of the great HP team where I know we have the best solutions in the industry!
  • Tom Clement has over 30 years experience in the areas of adult learning, secondary education, and leadership development. During this time Tom has been a consistent champion of “non-traditional” training delivery methods, including blended learning, virtual delivery (self paced and instructor led), the use of training games and simulations, and experiential learning. Tom has spent the past 25 years of his career at Hewlett Packard, focused most recently on HP’s global Virtualization, Cloud, and Converged Infrastructure customer training programs. Tom manages the strategic direction and overall performance of these training programs, ensuring these worldwide programs help HP’s customers capitalize on the business opportunities made available by IT advancements in each of these subject areas. Tom and his global teammates utilize best in class instructors, course content and supporting equipment infrastructure to deliver these training programs to HP’s customers. The team prides itself on providing the Virtualization, Cloud, and Converged Infrastructure content customers need when and where they need it, anywhere in the world. Tom is based in the Washington, DC suburbs and can be reached at tom.clement@hp.com.
Follow Us