Azure Data Lake is Microsoft’s Platform as a Service (PaaS) big data solution running on Azure. It lets you handle large volumes of data, including unstructured data such as CSV, flat or log files, all of which can be processed through the Azure Data Lake service.
Azure Data Lake consists of two different resources within Azure: Azure Data Lake Store, which holds the data, and Azure Data Lake Analytics, which processes it.
One benefit of running Azure Data Lake Analytics over some of the other big data platforms is that it uses U-SQL, a language proprietary to Microsoft. U-SQL is based on T-SQL (I call it a mash-up of T-SQL and C#): we use many of the functions and much of the syntax we know from C#, but in the context of a T-SQL-style statement.
The benefit lies in the fact that we don’t have to learn the languages and tools common to open source data platforms, such as Pig, Hive, Spark or Python. We can take advantage of big data capabilities using skill sets we already have in-house.
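To make the T-SQL and C# mash-up a little more concrete, here is a minimal U-SQL sketch. The input and output paths, the column names and the "error" filter are all hypothetical, made up purely for illustration; a real script would depend on your own files and schema.

```
// Hypothetical input path and schema, for illustration only.
@log =
    EXTRACT EventTime DateTime,
            Level string,
            Message string
    FROM "/input/weblog.csv"
    USING Extractors.Csv();

// The SELECT ... FROM ... WHERE shape will look familiar from T-SQL;
// the expressions inside it (==, ToUpper(), Length) are C#.
@errors =
    SELECT EventTime,
           Level.ToUpper() AS Level,
           Message.Length AS MessageLength
    FROM @log
    WHERE Level == "error";

// Hypothetical output path.
OUTPUT @errors
    TO "/output/errors.csv"
    USING Outputters.Csv();
```

Someone comfortable with T-SQL can read the overall statement, and someone comfortable with C# can read the expressions, which is the point about reusing in-house skills.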
We help many of our over 7,000 customers by teaching them how to integrate Azure Data Lake into their overall data architectures and figuring out where big data may fit into their data strategy. If you’d like to learn more about integrating this in your business, or if you have questions about anything Azure-related, we are the people to talk to. Click the link below or contact us; we’d love to help.