Newsletter
Join our blog
Join other Azure, Power Platform and SQL Server pros by subscribing to our blog.
Start with the FREE community plan and get your lifetime access to 20+ courses. Get Instant Access Now!
Need help? Talk to an expert: (904) 638-5743
Private Training
Customized training to master new skills and grow your business.
On-Demand Learning
Beginner to advanced classes taught by Microsoft MVPs and Authors.
Bootcamps
In-depth boot camps take you from a novice to mastery in less than a week.
Season Learning Pass
Get access to our very best training offerings for successful up-skilling.
Stream Pro Plus
Combine On-Demand Learning platform with face-to-face Virtual Mentoring.
Certification Training
Prepare and ace your next certification with CertXP.
Private Training
Cheat Sheets
Quick references for when you need a little guidance.
Nerd Guides
Summaries developed in conjunction with our Learn with the Nerds sessions.
Downloads
Digital goodies - code samples, student files, and other must have files.
Blog
Stay up-to-date on all things Power BI, Power Apps, Microsoft 365 and Azure.
Community Discord Server
Start here for technology questions to get answers from the community.
Career Guides
Breaking into the field? Let these guides help get you started with a plan.
Affiliate Program
Earn money by driving sales through the Pragmatic Works' Training Affiliate Program.
Reseller Partner
It's time to address your client's training needs.
Foundation
Learn how to get into IT with free training and mentorship.
Management Team
Discover the faces behind our success: Meet our dedicated team
Contact Us
How can we help? Connect with Our Team Today!
FAQs
Find all the information you’re looking for. We’re happy to help.
In today’s post, I want to talk about designing a data flow for a warehousing situation. I’ll refer to the reference cloud-based data warehouse data flow architecture that Microsoft has published which you’ll see in the image below.
I want to talk about the idea of taking data from our source systems and storing it in file storage. The diagram indicates we’d do this in blob storage. We sometimes prefer using Azure Date Lake, but regardless, the data is still being extracted from a source and put into files which are then loaded into the data warehouse.
I’m frequently asked, why would I store my data in files? For instance, if my source data is coming from a relational database, maybe pulling from an ERP system, why would I take data from that system, store it in files and then load it to my data warehouse?
Let me point out some reasons why:
So, these are some key reasons/benefits to help you understand why you may want to store data in files prior to loading it into the data warehouse. As I’m asked about this a lot, I hope you found this useful. If you have questions about data warehousing in Azure, cloud data platforms or anything Azure related, you’re in the right place. Click the link below or contact us—we’re here to help.
ABOUT THE AUTHOR
Free Trial
private training
Newsletter
Join other Azure, Power Platform and SQL Server pros by subscribing to our blog.
Leave a comment