Videos

Mining Unstructured Data to Fuel Enterprise AI

An estimated 80% of all existing digital data isn't neatly organized in rows and columns, but that unstructured data can unlock significant value and fuel enterprise AI applications, explains data solutions specialist Kailin Hart.
  • Key Play:Enterprise AI
  • Nutanix-Newsroom:Article, Video

May 29, 2026

The pace of artificial intelligence (AI) innovation is unprecedented, and its voracious appetite for data has many organizations racing to wrangle, clean and label all of their digital data. Information left largely ignored piles up and more than ever requires corralling so new AI applications can put it to good use. While many find that their structured data is neatly organized and accessible, the bulk of data is messy and unstructured. 

But therein lies a massive amount of potential value, according to Kaitlin Hart, former director of strategic partnerships at Pryon, an enterprise software company that helps organizations connect AI models directly to their internal, private data sources.

"When you think about 80% of all data is unstructured, it's a huge opportunity for us to move the needle and deliver more value," Hart said in a video interview with The Forecast, recorded in April 2026 at the Nutanix .NEXT event in Chicago.

This realization is driving enterprises to ratchet up their data management now and going forward. 

Unstructured data includes everything from PDFs and documents to websites, legal briefs, and even multimodal content like video and audio. They represent a massive, untapped reservoir of knowledge. However, unlike its structured counterpart, unstructured data varies widely in size and format, Hart explained.

“Unstructured data, because it varies so much…makes it very complex to transform it at scale,” she said.

The challenge is significant. Industry analysts have told The Forecast that legacy systems were not built to handle the massive, unstructured datasets that can feed generative AI. On the other hand, the rate of data generated keeps growing exponentially, forcing enterprises to decide which data are essential to their business and how to leverage it for competitive advantage.

The Complexity of Transformation

Transforming unstructured data requires a fundamentally different approach than traditional extract, transform and load (ETL) processes used for structured data, Hart explained.

"There's a series of different types of models, machine learning-based models, that we've developed ourselves that handle the vectorization, the semantic understanding, and query routing to question mapping," Hart said. 

RELATED Building Software and IT Agility in the AI Era
Twenty-five years after the publication of the Agile Manifesto, co-author Jon Kern discusses how the rise of AI, cloud native technologies, and scalable IT resources has increased the pace and rigor of software development.
  • Article:Technology
  • Nutanix-Newsroom:Article
  • Products:Nutanix Kubernetes Platform (NKP)
  • Use Cases:AI ML, Cloud Native

May 20, 2026

This complexity is compounded by the need for security and compliance. For many organizations, particularly those in highly regulated industries, the idea of feeding sensitive, unstructured data into public large language models (LLMs) is a non-starter. Data strategies must protect intellectual property and ensure that customer data is not used for training without explicit consent.

Unlocking New Possibilities

When organizations successfully navigate these challenges and begin treating unstructured content as actionable data, the results can be transformative.

"When people are actually able to use it like the data that it is, it opens up a whole new world of possibilities,” she said. “With a strong data foundation, anything is possible."

RELATED Unknowns and Risk Intensify with AI
Analyst Scott Sinclair, practice director at Omdia, describes why the pace of AI innovation is unlike anything the IT industry has seen before, making data security a top priority for CIOs.
  • Key Play:Enterprise AI
  • Nutanix-Newsroom:Article, Video
  • Use Cases:AI ML

May 21, 2026

As the AI innovation evolves at a breakneck pace, it’s proving the value of having a solid data foundation. That enables organizations to securely and efficiently transform unstructured data into that fuel powering the future of intelligent applications.

Video transcript:

Kaitlin Hart: I'm Kaitlin Hart, director of strategic partnerships at Pryon. I help build together ecosystems and technologies with our partner to build full value driven solutions for our customers.

Ken Kaplan: Now tell me how you got into this.

Kaitlin Hart: Well, I'm a data nerd. I've been a data nerd in all kinds of different capacities of the market for almost 20 years. I have spent a lot of time in machine learning before it was popular. And then most recently I've moved into unstructured data transformation at Pryo.

Ken Kaplan: And what pulled you to the other side?

Kaitlin Hart: To the other side of data? Well, so I spent so much time in structured data trying to help customers solve really critical problems there, but it's only 20% of all data is actually structured. And so the types of problems that they would often talk to me about, data discoverability and being able to build more full applications, we actually couldn't serve in structured data. And so when you think about 80% of all data is unstructured, it's a huge opportunity for us to drive the needle, deliver more value. And that's really what we're doing here at Prime.

Ken Kaplan: And what is some of the unstructured data examples that people wanted to tap into and weren't able to?

Kaitlin Hart: It's PDFs, it's Word docs, it's websites. It could be research documents. It could be legal briefs. It could be really any ... There's about ... I'm trying to think about dozens of different types of unstructured data. Even getting out to multimodal type of content like video data, audio. It's a very large mass of data. And the difference between unstructured and structured data, structure is very organized, right? So it's columns and rows and it's very lightweight and size. Unstructured data because it varies so much in the type of content it actually is. So then does the size of that data. So it makes it very complex to actually transform it at scale.

Ken Kaplan: How are people coming to Priyan and what do they want and how do you help them?

Kaitlin Hart: Yeah, people who are focused on security, they're really driven to us because we take that as a day one priority. We've been engineered from day one not to train on customer data to make sure that their IP is protected at scale. And that's how we have customers like Nvidia and Leidos and some of the market leaders who take their data super seriously.

Ken Kaplan: And take us under the hood a little bit of ... Describe how it works.

Kaitlin Hart: Yeah. So well, I'm a data nerd, so I don't want to go too deep or I might lose everybody. But when you think about transformation, ETL, structured data, it's a similar process. It's a lot more complex because again, that data's a lot more complex. So there's a series of different types of models, machine learning based models that we've developed ourselves that handle the vectorization, the semantic understanding, query routing to question mapping. All that stuff is actually really complex, especially again at scale and to do that out of the box without training on a customer data. So we do all of that and at the very end we have the LLM, which does the smoothing and consolidation of the answer and we're agnostic to what LM folks might use.

Ken Kaplan: And so tell me what a customer the first time that they try and they get something done that they want done, what's their reaction and what's the next thing they want to do?

Kaitlin Hart: Well, this is ... Think about unstructured data. It's been called content, right? As a market, we've separated so far out that we don't even call it data. So when people are actually able to use it like the data that it is, it opens up in a whole new world of possibilities because with a strong data foundation, anything is possible. So they get really excited about all the different things that they could potentially build and all the ways that they might be able to evolve in the future with this. What's next in AI and data changes every single day. We're always on the cutting edge of something new it feels like. But I do think that data is a central really fundamental piece of this working long term. It has to be based on real data. It has to be factual. And then when you go to the next step of that at scale it has to be valuable and you can't sacrifice too much security. So as you kind of think about building your own solutions internally, those are the types of fundamentals you should be looking at.

Related articles:

Ken Kaplan is Editor in Chief for The Forecast by Nutanix. Find him on X @kenekaplan and LinkedIn.

© 2026 Nutanix, Inc. All rights reserved. For additional information and important legal disclaimers, please go here.

Related Articles