Data quality is a huge issue, especially now that more and more data is being created every day. According to Micro Focus, we were creating 44 exabytes of data per day back in 2016 and are expected to produce 463 exabytes per day by 2025. That’s a LOT of data.
With all this data, it’s more important than ever to profile your data BEFORE your warehouse or ETL projects start. No one wants to be several months into a project only to realize the design has to be reworked due to data quality issues. So I’ve created a session called Profiling Your Data that covers data profiling: what it is, why you should do it, and how you can do it with the tools already included in the SQL Server BI stack.
If you’re interested in having me present this for your group or event, just let me know. I’d be happy to.
2 thoughts on “New Session: Profiling Your Data”
You really should use the correct scale. 44 ExaBytes and 463Exabytes.
Thanks Garland. I’ve updated the scale.