Microsoft operates the Azure SQL Database (ASD) cloud service, one of
the dominant relational cloud database services in the market today. To
aid the academic community in their research on designing and
efficiently operating cloud database services, Microsoft is introducing
the release of production-level telemetry traces from the ASD service.
This telemetry data set provides, over a wide set of important hardware
resources and counters, the consumption level of each customer database
replica. The first release will be a multi-month time-series data set
that includes the full cluster traces from two different ASD global
We are providing a sample dataset for public viewing. The sample spans 50 databases for one day.
Raw sample dataset available here
Sample machines dimension table available here
Access to the full dataset is can be had by contacting email@example.com. Please include your name, institutional affiliation, and research interests if possible. Please attach a signed copy of the AzureDB Telemetry Dataset License Agreement.
This data release is currently for US academic institutions only. It will be available at a later date to international institutions pending approval by the Microsoft legal department.
- David DeWitt
- Willis Lang
- Nigel Ellis
- Frank Bertsch
Willis Lang, Frank Bertsch, David Dewitt, Nigel Ellis: Microsoft Azure SQL Database Telemetry. Socc 2015.