Cataloging Your Data Lake

Analyst Report on Data Cataloging

There's a data deluge happening. The demand for data is increasing and the number of data sources is exploding. So how do we get a comprehensive view of our current data sources so we can bring order to this rapidly moving digital world?

This paper examines the problem and discusses how information catalogs enable us to organize and rapidly discover new data, track what data and insights are being produced, and publish these as services so that they are easy for others to find and consume.

The paper is authored by Mike Ferguson, an analyst and consultant specializing in business intelligence, analytics, data management, big data and enterprise architecture. 

In this in-depth report, you will learn best practices for:

  • Data profiling at scale
  • Discovery of data lineage and partitioned data sets
  • Schema generation on Hadoop data
  • Advanced, scalable, and fault-tolerant search
  • The modern analytics ecosystem

For immediate access, simply fill out the form.

Register for the Report