Let me start by saying that this is not an article about big data. Although big data does originate outside your organization, it is a topic of its own. Many of the concepts and approaches discussed here will apply to your big data initiatives, but they are not the focus of this article.
The objective of most MDM Hub projects is to establish a trusted source of master data. In addition to the right vendor, an appropriate partner, and an efficient implementation plan, it is very important to come up with the right integration strategy. An organization's existing ecosystem typically consists of many different source systems, and integrating all of them with the new MDM system becomes a huge task in itself. Even a small misstep here can lead to delays, cost overruns, and substandard or missing data. In turn, sponsors will lose confidence in the MDM hub as a trusted source to be integrated with downstream applications. Net result: the ROI will seem a lot less attractive.
This blog post is the second part of the Data Warehouse Migration to Amazon Redshift series. The first part, Data Warehouse Migration to Amazon Redshift – Part 1, explains how Amazon Redshift can significantly lower the cost and operational overhead of a data warehouse.
Deterministic Matching versus Probabilistic Matching