Pragmatic Works Nerd News

What Are Fabric Lakehouse Schemas?

Written by Zane Goodman | Jun 02, 2026

     In this tutorial, Zane Goodman from Pragmatic Works explains Microsoft Fabric's new preview feature: Lakehouse Schemas. This feature provides an efficient way to organize, manage, and query data within a Lakehouse, making data workflows simpler and more collaborative. Let's dive into how Lakehouse Schemas work and why they should be part of your data management strategy.

 

What Are Lakehouse Schemas?

Lakehouse schemas are a powerful new feature within the Microsoft Fabric ecosystem. They act as logical structures that allow users to organize, manage, and query data in a Lakehouse using SQL-like capabilities. Essentially, they bridge the gap between structured and unstructured data, enabling teams to define structured views over raw or semi-structured data stored in a Lakehouse. This makes it easier for analysts, engineers, and business users to query and interact with the data.

Why Use Lakehouse Schemas?

  • Simplify Data Access: By using schemas, users can create logical views that make data more accessible without unnecessary duplication or transformation.
  • Improve Collaboration: Teams can share schemas and query the same dataset with consistent structure and rules, fostering better alignment across departments.
  • Enhance Performance & Scalability: The structured layer over raw data enables the Lakehouse to scale efficiently while maintaining flexibility, unlike traditional databases.

How to Enable Lakehouse Schemas

To enable Lakehouse schemas, users need to create a Lakehouse in Microsoft Fabric. When setting up a new Lakehouse, they can select the option to enable schemas. Once enabled, a default schema (dbo) will be assigned to all tables unless a specific schema is defined. Users can then add additional schemas (e.g., bronze, silver, gold) for better data organization.

Practical Example of Lakehouse Schemas

Zane demonstrates how to use the Medallion Architecture to organize data into Bronze, Silver, and Gold layers:

  • Bronze Layer: This layer contains raw, untransformed data.
  • Silver Layer: Transformed data that is cleaned and ready for analysis.
  • Gold Layer: The "single source of truth" where fully cleaned and validated data is stored.

By uploading data into a specific schema (like the Bronze layer) and then moving it through the Silver layer after transformation, teams can organize data effectively while ensuring a smooth flow for analytics and reporting.

Using Data Flows and Transformations

Once data is loaded into the appropriate schema, users can create data flows to transform and move the data between schemas. Zane walks through the process of creating a data flow to move the "Holiday" table from the Bronze schema to the Silver schema after cleaning up the data. This allows for efficient data processing and ensures that data is always available in the desired format for reporting or analysis.

Key Features and Benefits of Lakehouse Schemas

  • Shortcuts: Users can create schema shortcuts to bring over data from other Lakehouses, making it easier to work with external datasets.
  • Integration with Dataflows and Pipelines: Lakehouse schemas can be integrated into dataflows and pipelines to automate data updates and maintain data consistency.
  • Flexibility: Unlike warehouses, Lakehouse schemas allow for a hybrid approach that handles semi-structured and unstructured data without forcing users to move data into a rigid warehouse structure.

Drawbacks and Considerations

While Lakehouse schemas offer many benefits, there are some limitations to be aware of. Currently, table maintenance features are not available for schema-enabled Lakehouses, and users cannot migrate non-schema Lakehouses to schema-based Lakehouses. Additionally, some features like Spark views are not supported. These drawbacks are expected to be addressed as the feature moves beyond its preview phase.

Conclusion

Microsoft Fabric's Lakehouse schemas offer an innovative solution for organizing and querying data efficiently. By leveraging schemas, teams can simplify their data workflows, enhance collaboration, and improve scalability while retaining the flexibility of a Lakehouse. Although the feature is still in preview, it presents a significant opportunity for teams to optimize their data management practices and improve overall performance.

Don't forget to check out the Pragmatic Works' on-demand learning platform for more insightful content and training sessions on Microsoft Fabric and other Microsoft applications. Be sure to subscribe to the Pragmatic Works YouTube channel to stay up-to-date on the latest tips and tricks.