Research - Entropy Data

Research

At Entropy Data, we're committed to advancing the field of data governance through research and innovation.

Automating Data Governance with Generative AI

This research examines how large language models can support data governance by generating warnings about data access decisions in decentralized systems. The study introduces Governance AI, an LLM-powered tool that evaluates whether data access requests comply with data contracts, company policies, and regulations like GDPR.

Rather than making final decisions, the system provides "structured warnings and suggestions for correction to guide human experts."

This approach ensures that AI augments human decision-making rather than replacing it, maintaining the necessary human oversight for contextual and legal accuracy in data governance decisions.

Publication Details

Authors:

Linus W. Dietz (King's College London)
Arif Wider (HTW Berlin)
Simon Harrer (Entropy Data)

Conference: AAAI/ACM Conference on Artificial Intelligence, Ethics, and Society 2025

Key Findings

Governance AI issued 3.6 times more warnings than human experts while catching all compliance concerns
80% of AI-generated warnings were judged correct after secondary review
LLM-generated synthetic test cases effectively simulated real-world governance scenarios
Human oversight remains essential for contextual and legal accuracy

Learn More

Data Product MCP: Chat with your Enterprise Data

This research introduces a Model Context Protocol (MCP) server that enables conversational interaction with enterprise data products. The solution addresses the challenge of integrating diverse data sources into large language model workflows while maintaining data governance standards.

The framework provides a practical pathway for organizations to leverage LLMs in data discovery and analysis without compromising data protection standards.

By bridging enterprise data products and AI systems through a standardized interface, organizations can democratize data access while maintaining security and governance controls.

Publication Details

Authors:

Marco Tonnarelli
Filippo Scaramuzza
Simon Harrer (Entropy Data)
Linus W. Dietz (King's College London)

Status: Under peer review (arXiv preprint, January 2025)

Key Contributions

MCP Server Implementation that bridges enterprise data products and AI systems, allowing natural language queries against structured datasets
Governance-aware design that maintains compliance with enterprise data governance requirements
Data product integration leveraging existing standards to create a standardized interface for LLM access
Open architecture that is extensible and allows organizations to adapt it to their specific data infrastructures

Learn More

View on arXiv Download PDF

Impact on Our Products

The findings from this research inform the development of our products, including the AI-powered governance features in Entropy Data. By exploring how AI can support and enhance human decision-making in data governance, we aim to make data more accessible, secure, and compliant across organizations.