I will present our two-year journey designing and implementing a Data Vault integrating 200 entities (and growing) in a regulated pharmaceutical industry environment. We followed the 2.0 insert-only architecture patterns, our design has been recently positively reviewed by Kent Graziano.
We combined MarkLogic Data Hubs together with a Vault on Teradata and REST data services exposing the data. The implementation process is model-driven – the Vault and the ETL processes loading it are generated using erwin Data Intelligence Suite (formerly Analytix DS).
Testing is heavily automated with Python scripts leveraging erwin metadata. Last year we started the process of migration of the whole solution into the cloud.