Modern data pipelines are more complex than traditional data warehousing pipelines. They must support a multiplicity of data sources, data types, and use cases; they must be designed and deployed in an agile manner; they must support batch, mini-batch, and streaming data flows; and they must meet service level agreements for data quality, availability, and consistency.
Many organizations are turning to data catalogs to support modern data pipelines. Data catalogs can automate the tagging and organization of ingested content and derived data sets, harmonize data flowing through multiple new and existing pipelines, support data curation processes, and enable business users to explore content for analysis.
Join the webinar, hosted by Bloor Research and Collibra, to explore the role of data catalogs in managing modern data pipelines along with supplemental services.
Mon Oct 22, 16:30 - 16:45
Webinar: Engineering Machine Learning Data Pipelines Series: Big Data Quality - Cleansing Data at Scale
Mon Oct 29, 16:30 - 16:45
Webinar: Engineering Machine Learning Data Pipelines Series: Finding and Matching Duplicates to Resolve Entities
Mon Nov 05, 16:30 - 16:45
Webinar: Engineering Machine Learning Data Pipelines Series: Tracking Data Lineage from the Source
Wed Nov 07, 08:00 - 17:00
ITWeb GDPR Update
This week I am publishing the presentation that was delivered at the Customer Analytics conference last month. In this presentation I talked about the challenges of delivering customer experience, including having to source new, unstructured data sets. Data[…]