Trust the Machine: Making AI Automation Reliable in Master Data Management

Jesper Grode | April 4, 2025 | 5 minute read

We're powered by AI

Learn more

Trust the Machine: Making AI Automation Reliable in Master Data Management

Master Data Management Blog by Stibo Systems logo
| 5 minute read
April 04 2025
Trust the Machine: Making AI Automation Reliable in MDM
11:16

At any large organization, if you’re involved with master data management (MDM), you’re constantly wrestling with supplier data flows, regulatory compliance, sustainability metrics, marketplace sharing... while you need to keep costs down and satisfaction up.

With machine learning (ML), you can automate repetitive, time-consuming tasks at incredible speeds, but trusting AI predictions gives you a real challenge.

When your ML model suggests incorrect categories for products or misidentifies relationships, those errors create downstream problems that can erase your efficiency gains.

After all, if you need to manually verify every AI decision, are you really saving resources?

At Stibo Systems, we’re combining the speed of ML with mathematical verification, enabling you to achieve true automation with the level of accuracy your business needs.

And in this blog post, I’ll share how we arrived there and how it works. You’ll get the context AND the takeaways.

blog-making-ai-automation-reliable-in-mdm

 

Why do we need machines to help with MDM?

Your data management workload probably feels like it's expanding faster than your ability to handle it. You're not alone. Today's data managers navigate a complex landscape where accuracy, speed and volume all compete for priority.

  • Onboarding massive product assortments from suppliers with inconsistent data formats and quality
  • Maintaining governance processes that keep your data trustworthy across systems
  • Staying compliant with constantly evolving regulatory requirements
  • Managing an increasing array of sustainability and ESG data points
  • Distributing clean, consistent data to various channels and marketplaces

Traditional approaches to these challenges often rely heavily on manual processes, business rules and human verification.

This can work on a smaller scale, but when your data volumes grow – you get bottlenecks. A product manager who could once verify 50 new items per day now faces batches of thousands, making manual review impractical.

And it’s not just about saving time – it's about maintaining competitive advantage. When your competitors can onboard new products faster, update information more quickly and distribute data more effectively, they gain crucial market advantages. Manual processes simply can't scale to meet these demands without significant resource investments.

What makes these challenges particularly suited for technological intervention is their repetitive, pattern-based nature. It’s often about recognizing similarities, applying consistent rules and making evidence-based decisions.

Exactly the kind of work where ML excels.

 

How ML has changed the game completely

ML aligns perfectly with data management's most labor-intensive tasks.

At its core, ML excels at pattern recognition, categorization and prediction – precisely what you need when managing large volumes of complex data. Take these, for example:

  • Product categorization

    Automatically assigning new products to the correct categories in your taxonomy.
  • Data matching and deduplication

    Identifying when different records represent the same real-world entity.
  • Attribute mapping

    Connecting supplier-specific attributes to your standardized data model.
  • Data quality scoring

    Predicting completeness and accuracy levels without manual review.
  • Anomaly detection

    Flagging unusual data patterns that might indicate errors or opportunities.

 

Infographic-machine-learning-pattern-recognition

The efficiency gains can be remarkable. Tasks that once took days, can be done in minutes. And it’s not just speed:

ML brings consistency

Human categorization naturally varies between individuals and even by the same person at different times. ML applies the same logic consistently (statistically). You limit the variability that causes data issues downstream.

ML adapts as your business evolves

With traditional, rule-based systems, you need explicit reprogramming when your business conditions change. ML models, on the other hand, can spot shifting patterns in your data and adjust accordingly. This adaptability means your data management processes stay current with minimal intervention.

But these benefits come with an important caveat: The predictions are only as good as the model's accuracy. And that's where many data management teams hit a roadblock on their automation journey.

 

If you can’t trust your ML, it all falls apart

ML predictions come with an inherent uncertainty. While ML models are good at recognizing patterns, they don't give you the certainty of mathematical proof. This creates a trust challenge that can significantly limit automation potential.

The accuracy dilemma

Every ML model produces some level of inaccuracy, typically manifesting in two forms:

  1. False negatives, where valid matches aren't recognized
  2. False positives, where incorrect matches are made

For data management, false positives create the bigger problem. When a model incorrectly categorizes a product or incorrectly matches two different customers, these errors propagate through your systems, giving you data quality issues that can affect business operations.

Partial automation isn't enough

Many organizations respond to this challenge with a hybrid approach: using ML for initial processing, then manually reviewing the results. While it still beats fully manual processes, this approach:

  • Creates review bottlenecks during high-volume periods
  • Still needs significant human resources
  • Limits the scalability advantages you’re looking for with ML
  • Delays time-sensitive processes

The verification gap

The core issue isn't that ML makes mistakes – it's the lack of a reliable mechanism to verify which predictions you can trust.

Without knowing which predictions are certainly correct, you're forced to verify everything or accept a level of error in your master data.

This verification gap represents the critical barrier between assisted processing and true automation. And to close the gap, you need a fundamentally different approach that combines ML's pattern recognition capabilities with methods that can provide mathematical certainty about prediction accuracy.

After all: You don't need to know that your model is 95% accurate overall. You need to know which 95% of predictions you can trust.

This is indeed a difficult problem to tackle – one that varies in nature across use cases. But at Stibo Systems we’re relentless in solving such challenges, and to tackle this one, we’re starting with verification of AI Assistance Classification Recommendations – in research also known as Ontology (classification) Mapping.

 

Webinar

All In or All Out with AI?

Navigating the Risks of AI Adoption
Watch the Webinar
in-or-all-out
 

 

How to build mathematical verification mechanisms into your ML model

Our research with the Technical University of Denmark (DTU) – one of the leading technical universities in Europe, also named best technical university in Denmark – has led to a breakthrough approach.

It combines the speed of ML with the certainty of mathematical verification.

The verification challenges we’re solving

When using AI to map between classification systems – for example, matching "Hand tools" in one product taxonomy to "Handheld tools" in another – ML models can make impressive predictions.

But those predictions always carry statistical uncertainty that can undermine trust. And trust, in this context, is kind of binary: Either you can trust your results, or you can’t (and have to double-check).

The key issue is consistency. How can you verify that the mapping relationships make logical sense across entire classification structures?

Mathematical certainty meets practical application

In our solution, we apply formal mathematical methods to validate that recommended mappings are logically consistent. Mathematical proof confirms which predictions are correct, not just statistically likely.

For this, we use specifically propositional logic and “Horn Clauses.”

What are Horn Clauses?

They might sound complex, but they're simply logical statements that follow an "if-then" pattern. For example, "if a product is a hammer AND hammers belong to hand tools, THEN the product belongs to hand tools." By applying these logical rules across classification systems, we can mathematically verify whether mappings make sense.

But you might wonder: If formal methods provide the certainty we need, why use ML at all?

The answer lies in computational efficiency. Applying formal methods alone to large classification structures would be too computationally expensive for practical use.

By combining approaches, we get the best of both worlds:

  • Fast ML predictions narrow down the possible mappings.
  • Mathematical verification confirms which predictions are definitively correct.
  • Only uncertain predictions need human review.

The hybrid approach dramatically reduces verification workload while maintaining accuracy, making true automation possible.

Understand all this at a deeper level: This work has been published in “The Practice of Formal Methods,” an essay in the honor of Cliff Jones.

 

At Stibo Systems, we’re applying this in real life

The mathematical foundations are rock-solid – formal logic is either correct or incorrect, with no middle ground. The exciting part involves applying these principles to real-world MDM challenges.

Our implementation currently focuses on high-volume scenarios where verification creates significant bottlenecks.

For example, for retailers onboarding thousands of supplier items regularly, the ability to automatically validate AI-suggested categorizations creates tremendous value throughout the process.

But we work with companies in many industries, with potential use cases everywhere:

  • Financial services firms matching transaction categories across different systems
  • Healthcare organizations aligning medical coding taxonomies
  • Manufacturing companies standardizing part classifications from multiple suppliers
  • Public sector agencies harmonizing service categories across departments

Infographic-industries-machine-learning

Any situation involving taxonomy alignment, classification matching or standardization across systems could benefit from verified predictions.

Current status on our implementation

We're currently testing prototypes in real environments.

While still early in implementation, the mathematical verification works exactly as expected – it correctly identifies which predictions can be trusted. The next phase involves refining the user experience and expanding the verification capabilities to more data management scenarios.

 

Let’s sum up

Trust remains the biggest hurdle to true automation in data management. Our combination of ML with mathematical verification gives us a practical path forward that doesn't force you to choose between speed and accuracy.

The human role will always matter in data management, but we can now direct that expertise where it adds the most value – not on routine tasks that verified AI can handle reliably.

When you can trust your ML predictions, you can finally automate with confidence.

Master Data Management Powered by AI

Unlock the power of AI in your MDM processes to navigate complex supply chain dynamics with greater agility and foresight.

Discover how today!
Powered-by-AI_Hero

Master Data Management Blog by Stibo Systems logo

Decades of experience within master data management, technologies, people and processes has led Jesper into his current role, heading Stibo Systems' innovation efforts. He has a particular focus on multidomain MDM, augmented MDM and technology adoption. Being responsible for company-wide strategic initiatives on product innovations, he is constantly seeking to increase the value of product offerings to customers and partners. Jesper comes from prior roles as Product Strategy Director, Section Head R&D, Director Professional Services, and Associate Professor at a Danish university.

Discover Blogs by Topic

  • MDM strategy
  • Data governance
  • Customer and party data
  • See more
  • Retail and distribution
  • Data quality
  • AI and machine learning
  • Manufacturing
  • Product data and PIM
  • Supplier data
  • Financial services
  • CPG
  • Sustainability
  • GDPR
  • Customer Experience
  • Location data
  • Product Experience Data Cloud
  • Customer Story
  • PDX Syndication
  • Auto Classification
  • Business Partner Data Cloud
  • Cloud
  • Compliance
  • Data Cleanup
  • Data-Driven Decision Making
  • Employee Data
  • Enterprise Data Strategy
  • Location Data Cloud
  • Microsoft Azure
  • Product
  • Product Onboarding
  • Supplier Data Cloud
  • Sustainability Data

Trust the Machine: Making AI Automation Reliable in Master Data Management

4/4/25

How Agentic Workflows Are Changing Master Data Management at the Core

4/2/25

MDM and AI: Real-World Use Cases and Learnings From OfficeMax and Motion Industries

3/7/25

Reyes Holdings' MDM Journey to Better Data

2/27/25

AI Adoption: A High-Stakes Gamble for Business Leaders

1/28/25

How Kramp Optimizes Internal Efficiency with Data Strategy

1/27/25

From Patchwork to Precision: Moving Beyond Outdated and Layered ERP Systems

1/27/25

Thriving Beyond NRF 2025 with Trustworthy Product Data

1/24/25

Building the Future of Construction with AI and MDM

1/23/25

Why Addressing Data Complexity in Pharmaceutical Manufacturing Is Critical

1/17/25

How URBN Leverages Data Management to Support Its Sustainability Information  

1/17/25

How to Avoid Bad Retail Customer Data

1/6/25

Gen Z: Seeking Excitement Beyond Amazon

12/11/24

A Modern Guide to Data Quality Monitoring: Best Practices

12/10/24

CDP and MDM: Complementary Forces for Enhancing Customer Experiences

12/10/24

Using Machine Learning and MDM CBAM for Sustainability Compliance

12/3/24

How to Implement Master Data Management: Steps and Challenges

11/26/24

AAPEX and SEMA: The Automotive Aftermarket Industry’s Mega-Showcase

11/25/24

5 Key Trends in Product Experience Management

11/20/24

Solving Retail Data Fragmentation: The Key to Consistent Customer Journeys

11/11/24

Live Shopping: How to Leverage Product Information for Maximum Impact

10/22/24

Why Data Accuracy Matters for CPG Brands

10/16/24

Why Choose a Cloud-Based Data Solution: On-Premise vs. Cloud

10/15/24

How to Use Customer Data Modeling

10/10/24

How Master Data Management Can Enhance Your ERP Solution

9/23/24

Navigating Change: Engaging Business Users in Successful Change Management

9/20/24

What is Digital Asset Management?

9/11/24

How to Improve Your Data Management

9/3/24

The Future of Master Data Management: Trends in 2025

9/1/24

Digital Transformation in the CPG Industry

8/30/24

5 CPG Industry Trends and Opportunities for 2025

8/29/24

What is the difference between CPG and FMCG?

8/27/24

Responsible AI Relies on Data Governance

8/27/24

Making Master Data Accessible: What is Data as a Service (DaaS)?

8/19/24

6 Features of an Effective Master Data Management Solution

8/15/24

Great Data Minds: The Unsung Heros Behind Effective Data Management

8/13/24

A Data Monetization Strategy - Get More Value from Your Master Data

8/6/24

Introducing the Master Data Management Maturity Model

8/4/24

What is Augmented Data Management? (ADM)

7/31/24

Data Migration to SAP S/4HANA ERP: The Fast and Safe Approach with MDM

7/30/24

GDPR Data Governance and Data Protection, a Match Made in Heaven?

7/17/24

The 5 Biggest Retail Trends in 2025

6/10/24

The Difference Between Master Data and Metadata

5/26/24

Master Data Management Roles and Responsibilities

5/20/24

8 Best Practices for Customer Master Data Management

5/16/24

What Is Master Data Governance – And Why Do You Need It?

5/12/24

Guide: Deliver flawless rich content experiences with master data governance

4/11/24

Risks of Using LLMs in Your Business – What Does OWASP Have to Say?

4/10/24

Guide: How to comply with industry standards using master data governance

4/9/24

Digital Product Passports - A Data Management Challenge

4/8/24

Guide: Get enterprise data enrichment right with master data governance

4/2/24

Guide: Getting enterprise data modelling right with master data governance

4/2/24

Guide: Improving your data quality with master data governance

4/2/24

5 Tips for Driving a Centralized Data Management Strategy

3/18/24

What is Application Data Management and How Does It Differ From MDM?

3/18/24

5 Key Manufacturing Challenges in 2025

2/20/24

How to Enable a Single Source of Truth with Master Data Management

2/20/24

What is Data Quality and Why It's Important

2/12/24

Data Governance Trends 2025

2/7/24

What is Data Compliance? An Introductory Guide

2/6/24

How to Build a Master Data Management Strategy

1/18/24

The Best Data Governance Tools You Need to Know About

1/16/24

How to Choose the Right Master Data Management Solution

1/15/24

Building Supply Chain Resilience: Strategies & Examples

12/19/23

Shedding Light on Climate Accountability and Traceability in Retail

11/29/23

What is Party Data? All You Need to Know About Party Data Management

11/20/23

Location Analytics – All You Need to Know

11/13/23

Understanding the Role of a Chief Data Officer

10/16/23

What is Smart Manufacturing and Why Does it Matter?

10/11/23

5 Common Reasons Why Manufacturers Fail at Digital Transformation

10/5/23

How to Digitally Transform a Restaurant Chain

9/29/23

Three Benefits of Moving to Headless Commerce and the Role of a Modern PIM

9/14/23

12 Steps to a Successful Omnichannel and Unified Commerce

7/6/23

Navigating the Current Challenges of Supply Chain Management

6/28/23

Product Data Management during Mergers and Acquisitions

4/6/23

A Complete Master Data Management Glossary

3/14/23

Asset Data Governance is Central for Asset Management

3/1/23

4 Common Master Data Management Implementation Styles

2/21/23

How to Leverage Internet of Things with Master Data Management

2/14/23

Manufacturing Trends and Insights in 2025

2/14/23

Sustainability in Retail Needs Governed Data

2/13/23

A Quick Guide to Golden Customer Records in Master Data Management

1/9/23

Innovation in Retail

1/4/23

Life Cycle Assessment Scoring for Food Products

11/21/22

Retail of the Future

11/14/22

Omnichannel Strategies for Retail

11/7/22

Hyper-Personalized Customer Experiences Need Multidomain MDM

11/5/22

What is Omnichannel Retailing and What is the Role of Data Management?

10/25/22

Most Common ISO Standards in the Manufacturing Industry

10/18/22

How to Get Started with Master Data Management: 5 Steps to Consider

10/17/22

What is Supply Chain Analytics and Why It's Important

10/12/22

An Introductory Guide: What is Data Intelligence?

10/1/22

Revolutionizing Manufacturing: 5 Must-Have SaaS Systems for Success

9/15/22

An Introductory Guide to Supplier Compliance

9/7/22

Digital Transformation in the Manufacturing Industry

8/25/22

Master Data Management Framework: Get Set for Success

8/17/22

Discover the Value of Your Data: Master Data Management KPIs & Metrics

8/15/22

Supplier Self-Service: Everything You Need to Know

6/15/22

Omnichannel vs. Multichannel: What’s the Difference?

6/14/22

Create a Culture of Data Transparency - Begin with a Solid Foundation

6/10/22

What is Location Intelligence?

5/31/22

Omnichannel Customer Experience: The Ultimate Guide

5/30/22

Omnichannel Commerce: Creating a Seamless Shopping Experience

5/24/22

Top 4 Data Management Trends in the Insurance Industry

5/11/22

What is Supply Chain Visibility and Why It's Important

5/1/22

The Ultimate Guide to Data Transparency

4/21/22

How Manufacturers Can Shift to Product as a Service Offerings

4/20/22

How to Check Your Enterprise Data Foundation

4/16/22

An Introductory Guide to Manufacturing Compliance

4/14/22

Multidomain MDM vs. Multiple Domain MDM

3/31/22

How to Build a Successful Data Governance Strategy

3/23/22

What is Unified Commerce? Key Advantages & Best Practices

3/22/22

How to Choose the Right Data Quality Tool?

3/22/22

What is a Data Domain? Meaning & Examples

3/21/22

6 Best Practices for Data Governance

3/17/22

5 Advantages of a Master Data Management System

3/16/22

A Unified Customer View: What Is It and Why You Need It

3/9/22

Supply Chain Challenges in the CPG Industry

2/24/22

Top 5 Most Common Data Quality Issues

2/14/22

What Is Synthetic Data and Why It Needs Master Data Management

2/10/22

What is Cloud Master Data Management?

2/8/22

How to Implement Data Governance

2/7/22

Build vs. Buy Master Data Management Software

1/28/22

Why is Data Governance Important?

1/27/22

Five Reasons Your Data Governance Initiative Could Fail

1/24/22

How to Turn Your Data Silos Into Zones of Insight

1/21/22

How to Improve Supplier Experience Management

1/16/22

​​How to Improve Supplier Onboarding

1/16/22

What is a Data Quality Framework?

1/11/22

How to Measure the ROI of Master Data Management

1/11/22

What is Manufacturing-as-a-Service (MaaS)?

1/7/22

The Ultimate Guide to Building a Data Governance Framework

1/4/22

Master Data Management Tools - and Why You Need Them

12/20/21

The Dynamic Duo of Data Security and Data Governance

12/20/21

How to Choose the Right Supplier Management Solution

12/20/21

How Data Transparency Enables Sustainable Retailing

12/6/21

What is Supplier Performance Management?

12/1/21

How to Create a Marketing Center of Excellence

11/14/21

The Complete Guide: How to Get a 360° Customer View

11/7/21

How Location Data Adds Value to Master Data Projects

10/29/21

What is Supplier Lifecycle Management?

10/19/21

What is a Data Mesh? A Simple Introduction

10/15/21

10 Signs You Need a Master Data Management Platform

9/2/21

What Vendor Data Is and Why It Matters to Manufacturers

8/31/21

3 Reasons High-Quality Supplier Data Can Benefit Any Organization

8/25/21

4 Trends in the Automotive Industry

8/11/21

What is Reference Data and Reference Data Management?

8/9/21

GDPR as a Catalyst for Effective Data Governance

7/25/21

All You Need to Know About Supplier Information Management

7/21/21

How to Become a Customer-Obsessed Brand

5/12/21

How to Create a Master Data Management Roadmap in Five Steps

4/27/21

What is a Data Catalog? Definition and Benefits

4/13/21

How to Improve the Retail Customer Experience with Data Management

4/8/21

Business Intelligence and Analytics: What's the Difference?

3/25/21

What is a Data Lake? Everything You Need to Know

3/21/21

How to Extract More Value from Your Data

3/17/21

Are you making decisions based on bad HCO/HCP information?

2/24/21

CRM 2.0 – It All Starts With Master Data Management

12/19/20

5 Trends in Telecom that Rely on Transparency of Master Data

12/15/20

10 Data Management Trends in Financial Services

11/19/20

Seasonal Marketing Campaigns: What Is It and Why Is It Important?

11/8/20

What Is a Data Fabric and Why Do You Need It?

10/29/20

Transparent Product Information in Pharmaceutical Manufacturing

10/14/20

How to Improve Back-End Systems Using Master Data Management

9/19/20

How Retailers Can Increase Online Sales in 2025

8/23/20

Master Data Management (MDM) & Big Data

8/14/20

Key Benefits of Knowing Your Customers

8/9/20

Customer Data in Corporate Banking Reveal New Opportunities

7/21/20

How to Analyze Customer Data With Customer Experience Data Cloud

7/21/20

4 Ways Product Information Management (PIM) Improves the Customer Experience

7/18/20

How to Estimate the ROI of Your Customer Data

7/1/20

Women in Master Data: Rebecca Chamberlain, M&S

6/24/20

How to Personalise Insurance Solutions with MDM

6/17/20

How to Get Buy-In for a Master Data Management Solution

5/25/20

Marketing Data Quality: Why Is It Important and How to Get Started

3/26/20

Get More Value From Your CRM With Customer Master Data Management

2/17/20

Women in Master Data: Nagashree Devadas, Stibo Systems

2/4/20

How to Create Direct-to-Consumer (D2C) Success for CPG Brands

1/3/20

Women in Master Data: Anna Schéle, Ahlsell

10/25/19

How to Improve Your Product's Time to Market With PDX Syndication

7/18/19

8 Tips For Pricing Automation In The Aftermarket

6/1/19

How to Drive Innovation With Master Data Management

3/15/19

Discover PDX Syndication to Launch New Products with Speed

2/27/19

How to Benefit from Product Data Management

2/20/19

What is a Product Backlog and How to Avoid It

2/13/19

How to Get Rid of Customer Duplicates

2/7/19

4 Types of IT Systems That Should Be Sunsetted

1/3/19

How to Reduce Time-to-Market with Master Data Management

10/28/18

How to Start Taking Advantage of Your Data

9/12/18

GDPR: The DOs and DON’Ts of Personal Data

6/13/18

How Master Data Management Supports Data Security

6/7/18

Frequently Asked Questions (FAQ) About the GDPR

5/30/18

3 Steps: How to Plan, Execute and Evaluate Any IoT Initiative

2/20/18

How to Benefit From Customer-Centric Data Management

9/7/17

Product Information Management Trends to Consider

5/25/17

4 Major GDPR Challenges and How to Solve Them

5/12/17

How to Prepare for GDPR in Five Steps

2/21/17

How Data Can Help Fight Counterfeit Pharmaceuticals

1/24/17

Create the Best Customer Experience with a Customer Data Platform

1/11/17