Data floods into businesses, but trying to use it in an unstructured form is like trying to drink from a fire hose. What good is that rich stream of data if it can’t quench an organization’s thirst for useful information?
Data needs to be captured with intelligence, and the key to doing that is metadata. Metadata is like tagging, marking or stamping data with a label to make it recognizable during a search or reporting function. It’s like color-coding a filing system or using a card catalog in a library, and it saves time and resources.
“There is a data explosion, and we have to make sure that it is contained in a way where we can optimize, manage and present it,” said Bala Kuchibhotla, general manager of databases at Nutanix.
Kuchibhotla created Nutanix Era, picked by CRN as one of the Top Products of 2018 for its ability to provide "platform-layer services that bring one-click simplicity to database operations.. He said data about data gives software developers to more control to leverage data and build automation into data systems.
“Random data is all over," he said. "Metadata helps creates a structure around data.”
“There is a data explosion, and we have to make sure that it is contained in a way where we can optimize, manage and present it,” said Bala Kuchibhotla, general manager of databases at Nutanix. “Random data is all over. Metadata helps creates a structure around data.”
The ancient Greeks used the word “meta” to describe something that comes after or beyond. In casual context, it means having awareness from within or from a higher level. It’s like thinking about thinking or reading a book about reading a book. Add “meta” to “data” and it’s data about data. Because of its importance, the metadata of today will become meta-metadata of tomorrow or the metadata about metadata.
While search engines can find data, they can’t gauge its relevance or usefulness. Metadata, on the other hand, defines the data’s form or structure, its quality, and what it means to the end user. A good metadata system focuses on what information matters most to the organization.
Metadata is, simply, “data in context.” It answers the “who, what, why, where and how” questions about the data. For example, a digital image may include metadata that describes its size, resolution, content and when it was taken. A text document may contain information about the author, as well as how long the piece is and the point it makes.
“With metadata you can do more,” says Kuchibhotla. “You can analyze the data. You can look into more interesting use cases. Using metadata is a very key component in every aspect of major applications. It’s how you consume data.”
Yet while 80% of organizations believe that metadata is more important than it was 10 years ago, less than 14% have a clearly defined metadata strategy, according to a 2016 Dataversity study.
That’s a big problem. Today, we’re living in a rising ocean of data, said Anjana Susarla, associate professor of accounting and information systems at Michigan State University’s Broad College of Business.
“Each of us is leaving a digital trace captured on multiple sources,” she said. “We generate 2.5 quintillion bytes of data every day, and that number will increase with the Internet of Things. It’s hard to grasp how big the numbers will be. Gathering data isn’t enough. It has to be organized well to be useful.”
Susarla likens the world of big data to “The Library of Babel,” a short story written in 1941 by Argentine author Jorge Luis Borges. The piece is about a library whose books contain every possible combination of letters and numbers. However, the order and content of the books are completely meaningless.
“We are living in that world today,” said Susarla. “If you think about all of the connected devices and websites; we’re stuck in an infinite loop of data.”
Smart companies must understand the nuances of digital cataloguing in order to leverage metadata, said Susarla, “and that’s the challenge. You need an infrastructure in place where the data that people are making is stored in a way that makes it valuable.”
Companies that do it well include Netflix, Uber and Amazon, Susarla said. Each is “a data-driven operation based on metadata. These companies are data hungry.” For example, Netflix uses data to decide which of its own projects to fund, and Uber uses data to recommend locations to drivers who want the best fares.
Metadata Reduces Waste
In addition to making data useful, metadata helps reduce storage space, which can cut costs. Unstructured data can include meaningless fields or configurations, requiring maintenance, patching and updates. This can be wasteful if you’re paying for space that you don’t need.
Volume also hinders use. For example, banks collect terabytes of data. Because of the space problem, they're just not able to think about all of the use cases they want to do with it, said Kuchibhotla. Instead, metadata can create snapshots, reducing the terabytes from 110 to five.
“You can enjoy the same functionality and get space pressure relief,” said Kuchibhotla.
Due to data’s growing complexity, metadata organizes it and increases the relevance of datasets to bring more value to companies. Metadata protects the future accessibility of digital information by safeguarding its lineage. And it assists in the number of use cases that can be created.
“We have managed hundreds of databases,” said Kuchibhotla. “We manage metadata of these databases, what they're made of, who created it, what the access is and so on and so forth. It's all about metadata. It’s a key component in every aspect of major applications. If you're good at metadata, you can do wonders.”
Stephanie Vozza is a contributing writer who specializes in business and productivity. She is a columnist for FastCompany.com, and her byline has appeared in Inc., Entrepreneur and Success magazines. Find her on Twitter @StephanieVozza.
© 2019 Nutanix, Inc. All rights reserved. For additional legal information, please go here.