Get trial

English

AG Automated Azure Big Data Analysis using Data Vault 2.0 by Exxeta

This webinar shows how Exxeta used AnalyticsCreator to implement a Data Vault-based Azure analytics architecture for the New York Taxi dataset. The session covers Azure Blob Storage, Azure Data Factory, Azure SQL Database, Power BI, Data Vault modelling, generated ETL pipelines, tabular model creation, and dashboard visualization. 

Duration: 47:54 Updated: Jun 2022 Level: intermediate Platform: Microsoft Azure, Azure Blob Storage, Azure Data Factory, Azure SQL Database, Power BI For: Data Engineers, BI Developers, Data Architects, Microsoft Azure Analytics TeamsAC Video, HubSpot Video Library

Questions

  • How can AnalyticsCreator automate Data Vault modelling on Azure?
  • What Azure services are used in the demo architecture?
  • How does Data Vault support large analytical datasets?
  • Can AnalyticsCreator generate Azure Data Factory pipelines automatically?
  • How is a Power BI tabular model created from AnalyticsCreator?
  • How is a Power BI tabular model created from AnalyticsCreator?
Platform shown AnalyticsCreator
Related tooling Data Vault, Microsoft Azure, Azure Blob Storage, Azure SQL Database, Azure Data Factory, Power BI

Key Takeaways

  • AnalyticsCreator supports automated Data Vault modelling.
  • The demo uses Azure Blob Storage as the source layer.
  • Source files include CSV and Parquet files.
  • Azure Data Factory pipelines are generated automatically.
  • Azure SQL Database stores the generated data warehouse structures.
  • Power BI Services hosts the tabular model and dashboard.
  • Data Vault enables scalable and parallel loading for larger datasets.
  • The raw vault stores business keys, relationships, and descriptive attributes.
  • Data marts and star schemas are still used for BI and reporting.
  • AnalyticsCreator generates import tables, views, hash keys, dimensions, facts, and tabular model structures.
  • Manual adjustments include defining keys, hiding technical columns, and creating measures.
  • Measures can use predefined aggregations or custom DAX statements.
  • The final dashboard visualizes New York taxi trips, revenue, trip count, pickup locations, and revenue per distance.
  • Collaboration and version management can be supported through repository export, version control, locking, and partial repository export/import.

Transcript

Hello everyone. I will start with a brief introduction to AnalyticsCreator so you have a clear idea of what it does.

AnalyticsCreator is a metadata-driven design application for data warehouse automation. It is designed for experts, but it can also be used by people without deep expert knowledge. Instead of programming everything manually, AnalyticsCreator generates source code from the model.

It supports the full lifecycle of a data warehouse, data lake, and data mart, including design, development, change management, and deployment. It can replace traditional ETL development approaches and help teams deliver results faster and at lower cost.