<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Value Talk &#187; deduplication</title>
	<atom:link href="http://datavaluetalk.com/tag/deduplication/feed/" rel="self" type="application/rss+xml" />
	<link>http://datavaluetalk.com</link>
	<description>Customer data is a valuable asset. Why not treat it that way?</description>
	<lastBuildDate>Thu, 10 May 2012 14:49:53 +0000</lastBuildDate>
	<language>nl</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Your name is too &#8220;common&#8221;&#8230;.</title>
		<link>http://datavaluetalk.com/data-governance/your-name-is-too-common/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=your-name-is-too-common</link>
		<comments>http://datavaluetalk.com/data-governance/your-name-is-too-common/#comments</comments>
		<pubDate>Mon, 07 Sep 2009 13:14:24 +0000</pubDate>
		<dc:creator>Holger Wandt</dc:creator>
				<category><![CDATA[Data Governance]]></category>
		<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[Banks]]></category>
		<category><![CDATA[Chinese characters]]></category>
		<category><![CDATA[customer view]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[interpretation]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[single customer view]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1207</guid>
		<description><![CDATA[A major bank in Dongguan (China) refused a potential customer because his name is Li Jun. Apparently, there were already over 300 bank accounts assigned to the name Li Jun. Not that this particular Li Jun was responsible for opening all these accounts, there were just too many men with exactly the same name. The [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-thumbnail wp-image-1209" title="chinese-characters" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/09/chinese-characters-150x150.jpg" alt="chinese-characters" width="150" height="150" /></p>
<p>A major bank in Dongguan (China) refused a potential customer because his name is Li Jun. Apparently, there were already over 300 bank accounts assigned to the name Li Jun. Not that this particular Li Jun was responsible for opening all these accounts, there were just too many men with exactly the same name. The bank states that the refusal is nothing personal, since nobody with the name Li Jun will be accepted as customer in the near future&#8230;.. In the meanttime, Li Jun is taking legal action against the bank.<span id="more-1207"></span></p>
<p>When I read this news article this morning, my first thoughts were that it was perhaps a hoax. It turns out , however, that the news fact is true. From a data quality point of view this strikes me as really strange. How does this particular bank manage its customer data? Are there no additional identifiers (address, date of birth, etc.) to determine that you are actually dealing with the customer you think you are dealing with? Imagine that every John Smith would have a hard time to open a bank account, to apply for a job or to buy a product via the web. Or Jenny Jones? Bob Johnson? When is a name too &#8220;common&#8221;? It is common misbelief that the complexity of ideographic characacters such as Mandarin Chinese makes it harder to identify. At Human Inference we carried out some pretty serious dedups of Chinese files and-taking into account that Mandarin Chinese is a tonal language and other priciples of fault-tolearnce apply- the duplicate identification was rather accurate.</p>
<p>It is all a matter of using an intelligent <a title="data matching" href="http://www.humaninference.com/products/data-matching" target="_blank">data matching</a> method and knowing what kind of data one is working on. Every name can be identified; even &#8220;common&#8221; names.</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-governance/your-name-is-too-common/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>How-to create the Golden Record</title>
		<link>http://datavaluetalk.com/mdm/how-to-create-the-golden-record/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=how-to-create-the-golden-record</link>
		<comments>http://datavaluetalk.com/mdm/how-to-create-the-golden-record/#comments</comments>
		<pubDate>Fri, 21 Aug 2009 08:54:30 +0000</pubDate>
		<dc:creator>Ramon de Noronha</dc:creator>
				<category><![CDATA[MDM for customer data]]></category>
		<category><![CDATA[ACCU]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[first name]]></category>
		<category><![CDATA[golden record]]></category>
		<category><![CDATA[matching methods]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1166</guid>
		<description><![CDATA[The term Golden Record is closely related to Customer Data Integration or MDM for Customer data. It refers to the &#8220;single truth&#8221; which has been created or calculated from all those duplicate customer records from different systems. This post is not about finding or tagging all those duplicate records. There all kinds of ways to [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-thumbnail wp-image-1190" title="puzzle" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/08/puzzle-150x150.jpg" alt="puzzle" width="150" height="150" /></p>
<p>The term Golden Record is closely related to Customer Data Integration or MDM for Customer data. It refers to the &#8220;single truth&#8221; which has been created or calculated from all those duplicate customer records from different systems. This post is not about finding or tagging all those duplicate records. There all kinds of ways to find them using advanced statistical methods, fuzzy matching etc.</p>
<p>But what do you once you have found the duplicates. How do you create the best possible customer data out of all gathered elements?<span id="more-1166"></span></p>
<p>First of all we have to define what is meant by the Golden Record. We at Human Inference use the acronym ACCU, short for Actual, Correct, Complete and Unique. Ofbviously, we want one unique record. That&#8217;s why we use matching or identity resolution software. But Actual, Correct and Complete are less absolute, they can be interpreted in a subjective manner. You can have never-ending discussions about it, build the most complex business-rules ever etc. But I prefer to start with simply determining the superlative of Actual, Correct and Complete. In other words the most actual, the most correct and the most complete data-element or attribute &#8220;wins&#8221; and makes it to the Golden Record. Let&#8217;s take the following example, two almost identical records are gathered from two different systems (A &amp; B).</p>
<table border="1" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td width="296">
<p align="center"><strong>Record 1 from System A</strong></p>
</td>
<td width="296">
<p align="center"><strong>Record 2 from System B</strong></p>
</td>
</tr>
<tr>
<td width="296" valign="top">J. (John) Miller</td>
<td width="296" valign="top">J.F. Miller</td>
</tr>
<tr>
<td width="296" valign="top">26 Spring Gdns</td>
<td width="296" valign="top">26 Spring Gardens</td>
</tr>
<tr>
<td width="296" valign="top">Manchester, Lancashire, M2 1BB</td>
<td width="296" valign="top">Manchester, Lancashire, M2 1BA</td>
</tr>
<tr>
<td width="296" valign="top">United Kingdom</td>
<td width="296" valign="top">United Kingdom</td>
</tr>
</tbody>
</table>
<p>The basic rule is that only Correct data will make it into the Golden Record. So, if you can validate data please do so. For instance you can check social security, bank account and credit card numbers using algorithms. You can validate email addresses. Using postal reference data, it is also possible to verify the correctness of addresses. The most difficult is to validate names. Extensive knowledge is needed to check whether names of persons and organizations are valid.</p>
<p>In my own experience and opinion you should always discard incorrect data, or let it be corrected by a data steward. In the end nobody should be in doubt whether a Golden Record has been established using doubtful data.</p>
<p>The next step is to examine attribute (field) by attribute. So using the example from above.</p>
<table border="1" cellspacing="0" cellpadding="0" width="601">
<tbody>
<tr>
<td width="132" valign="top">Initials</td>
<td width="415" valign="top">J.F. “wins” from  “J.”, because it consists of more characters (simply use the LEN function).</td>
</tr>
<tr>
<td width="132" valign="top">First Name</td>
<td width="415" valign="top">John wins from the non-existent first name in Record 2. You can also deduct this person is a male.</td>
</tr>
<tr>
<td width="132" valign="top">Street</td>
<td width="415" valign="top">&#8220;26 Spring Gardens&#8221; wins from &#8220;26 Spring Gdns&#8221;. Full length is preferred above abbreviated.</td>
</tr>
<tr>
<td width="132" valign="top">Housenumber</td>
<td width="415" valign="top">26/II wins, once again it consists of more characters (more complete).</td>
</tr>
<tr>
<td width="132" valign="top">Postcode</td>
<td width="415" valign="top">M2 1BB wins. This is the correct postal code for the even housenumbers.</td>
</tr>
<tr>
<td width="132" valign="top">City &amp; Country</td>
<td width="415" valign="top">It doesn&#8217;t matter, both records contain the same data.</td>
</tr>
</tbody>
</table>
<p>So using validation techniques to distinguish the correct data from incorrect data and determining the length of each attribute in the provided records will result in the following Golden Record:</p>
<p><strong>Mister J.F. (John) Miller</strong></p>
<p><strong>26 Spring Gardens</strong></p>
<p><strong>Manchester, Lancashire, M2 1BB</strong></p>
<p><strong>United Kingdom</strong></p>
<p>Even if you have a lot more of attributes in your Golden Record, this method still works. Determine the correct data and use only correct data. And using the function Length (LEN) to determine the &#8220;most complete&#8221; data. Most complete simply refers to consisting of the most characters. If the source systems also provide dates for &#8220;date entered&#8221; and &#8220;date last changed&#8221; you can use this to determine what the most recent data is. The most recent data is determined by formulas like MIN (&#8220;CurrentDate&#8221; minus &#8220;&#8221;Last Changed Date&#8221;).</p>
<p>I believe this method will lead to a very usable Golden Record in 90 to 95% of all cases. Only when you have to deal with complicated data, for instance father and son living on the same address and having the same initials it becomes much more complex. I am curious which rules-of-thumb and methods you use when calculating the Golden Record. Please put your ideas in the comments.</p>
<div style="left: -10000px; overflow: hidden; width: 1px; position: absolute; top: 644px; height: 1px;">&lt;!&#8211;[if gte mso 9]&gt; Normal 0 21 false false false NL X-NONE X-NONE MicrosoftInternetExplorer4 &lt;![endif]&#8211;&gt;&lt;!&#8211;[if gte mso 9]&gt; &lt;![endif]&#8211;&gt;<!--  /* Font Definitions */  @font-face 	{font-family:"Cambria Math"; 	panose-1:2 4 5 3 5 4 6 3 2 4; 	mso-font-charset:1; 	mso-generic-font-family:roman; 	mso-font-format:other; 	mso-font-pitch:variable; 	mso-font-signature:0 0 0 0 0 0;} @font-face 	{font-family:Calibri; 	panose-1:2 15 5 2 2 2 4 3 2 4; 	mso-font-charset:0; 	mso-generic-font-family:swiss; 	mso-font-pitch:variable; 	mso-font-signature:-520092929 1073786111 9 0 415 0;}  /* Style Definitions */  p.MsoNormal, li.MsoNormal, div.MsoNormal 	{mso-style-unhide:no; 	mso-style-qformat:yes; 	mso-style-parent:""; 	margin-top:0cm; 	margin-right:0cm; 	margin-bottom:10.0pt; 	margin-left:0cm; 	line-height:115%; 	mso-pagination:widow-orphan; 	font-size:11.0pt; 	font-family:"Calibri","sans-serif"; 	mso-ascii-font-family:Calibri; 	mso-ascii-theme-font:minor-latin; 	mso-fareast-font-family:Calibri; 	mso-fareast-theme-font:minor-latin; 	mso-hansi-font-family:Calibri; 	mso-hansi-theme-font:minor-latin; 	mso-bidi-font-family:"Times New Roman"; 	mso-bidi-theme-font:minor-bidi; 	mso-fareast-language:EN-US;} .MsoChpDefault 	{mso-style-type:export-only; 	mso-default-props:yes; 	mso-ascii-font-family:Calibri; 	mso-ascii-theme-font:minor-latin; 	mso-fareast-font-family:Calibri; 	mso-fareast-theme-font:minor-latin; 	mso-hansi-font-family:Calibri; 	mso-hansi-theme-font:minor-latin; 	mso-bidi-font-family:"Times New Roman"; 	mso-bidi-theme-font:minor-bidi; 	mso-fareast-language:EN-US;} .MsoPapDefault 	{mso-style-type:export-only; 	margin-bottom:10.0pt; 	line-height:115%;} @page Section1 	{size:612.0pt 792.0pt; 	margin:70.85pt 70.85pt 70.85pt 70.85pt; 	mso-header-margin:35.4pt; 	mso-footer-margin:35.4pt; 	mso-paper-source:0;} div.Section1 	{page:Section1;} --><!--[if gte mso 10]&gt; &lt;!   /* Style Definitions */  table.MsoNormalTable 	{mso-style-name:Standaardtabel; 	mso-tstyle-rowband-size:0; 	mso-tstyle-colband-size:0; 	mso-style-noshow:yes; 	mso-style-priority:99; 	mso-style-qformat:yes; 	mso-style-parent:&quot;&quot;; 	mso-padding-alt:0cm 5.4pt 0cm 5.4pt; 	mso-para-margin-top:0cm; 	mso-para-margin-right:0cm; 	mso-para-margin-bottom:10.0pt; 	mso-para-margin-left:0cm; 	line-height:115%; 	mso-pagination:widow-orphan; 	font-size:11.0pt; 	font-family:&quot;Calibri&quot;,&quot;sans-serif&quot;; 	mso-ascii-font-family:Calibri; 	mso-ascii-theme-font:minor-latin; 	mso-fareast-font-family:&quot;Times New Roman&quot;; 	mso-fareast-theme-font:minor-fareast; 	mso-hansi-font-family:Calibri; 	mso-hansi-theme-font:minor-latin; 	mso-bidi-font-family:&quot;Times New Roman&quot;; 	mso-bidi-theme-font:minor-bidi;} --> &lt;!&#8211;[endif]&#8211;&gt;<span style="font-size: 12pt; line-height: 115%; font-family: &quot;Times New Roman&quot;,&quot;serif&quot;;">J. (John) Miller</span></div>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/mdm/how-to-create-the-golden-record/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Deduplication, first time wrong?</title>
		<link>http://datavaluetalk.com/data-quality/deduplication-first-time-wrong/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=deduplication-first-time-wrong</link>
		<comments>http://datavaluetalk.com/data-quality/deduplication-first-time-wrong/#comments</comments>
		<pubDate>Tue, 31 Mar 2009 13:25:28 +0000</pubDate>
		<dc:creator>Paul Tours</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[duplicate records]]></category>
		<category><![CDATA[duplicates]]></category>
		<category><![CDATA[match records]]></category>
		<category><![CDATA[matching]]></category>
		<category><![CDATA[merge records]]></category>
		<category><![CDATA[SAP]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=856</guid>
		<description><![CDATA[One of my current projects has been to take an intelligent approach to the removal of duplicates already on an existing system (SAP). The client has already successfully used our software in their IT environment to effectively stop all new duplicates being entered into SAP. They now want to use the same technology to remove [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-863" title="twins" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/03/twins.gif" alt="twins" width="248" height="260" /></p>
<p>One of my current projects has been to take an intelligent approach to the removal of duplicates already on an existing system (SAP).</p>
<p>The client has already successfully used our software in their IT environment to effectively stop all new duplicates being entered into SAP. They now want to use the same technology to remove all existing duplicates. Their idea is so simple I am amazed that I have not heard of it being done elsewhere before.</p>
<p>Every evening the whole clients SAP database will be searched for duplicates in their Companies and Contacts (&gt; 3 million records deduplicated in less than an hour!) The results are stored in a master result table that SAP has been given access to. Now depending on the likelihood of the match, the duplicates can fall into one of three categories: automatic merging, manual merging or no merge. If the score for the whole duplicate group is above the threshold for automatic merging then the automatic merging process is started. <span id="more-856"></span></p>
<p>This merge process has been created by an external SAP consultancy group that does a lot of clever stuff in giving each record a score depending on its&#8217; financial relevance. E.g. open payments, current order status, payment reminders etc. (Hey, it&#8217;s SAP and in the world according to SAP only financial dealings have a value!) In the end the one record with the highest score is set to be the lead duplicate. All information from the other records in the duplicate group is placed onto the leading record to create a unique (&#8216;Golden&#8217;) record. All duplicate records with the exception of the lead duplicate are then removed from the system, in the case of SAP, these records are given a &#8216;set for deletion&#8217; flag and subsequently archived.</p>
<p>The &#8216;Non merges&#8217;, i.e. where the match score is below the accepted threshold level, are discarded and all remaining records are sent to a separate SAP mask for manual inspection for the following day. All that is required is to identify if the records shown belong in a duplicate group or not. After this decision has been made each duplicate group goes to the &#8216;merging&#8217; process. Just the same as the automatic merge process.</p>
<p>At the end of the day the whole process starts again. Wash, rinse, repeat! Simple! The first thing to happen is that over a short period of time all the secure duplicates disappear as they are merged automatically. This is highly visible, no more multiple identical records that pop up whenever a new record has been entered. The impact on the quality on the surrounding systems is just as direct. No sending out bills or marketing mails x times to the same person (having worked in Marketing before, I know the problem and it always leaves such a professional impression with the customer!) So it&#8217;s already something easy to sell to your managers and so far you have not had to lift a finger. Great!</p>
<p>The brilliance of the <a title="SAP data quality" href="http://www.humaninference.com/solutions/first-time-right/data-quality-for-sap" target="_blank">SAP data quality</a> solution though lies elsewhere. The simple fact is that it really does not matter whether the rest of the results are worked through in 1 day, 1 month or a year &#8211; as they are always captured, every day anew. The net result is that the total level of duplicates is constantly decreasing. Where the merge process has taken place, the duplicates will disappear. Only a change on the record will force it to be rechecked in the next round of deduplication. This means that apart for the costs of enhancement of the current system the client has an effective DQ firewall that now not only protects them from duplicate data being entered onto their IT systems, but will now over time cleanse the system from within. Even if it means putting an employee to sporadically make a decision on the manual matches. It is something that the company/department can concentrate on where they have time/resources available. (That should be easy after showing what success you have had with it already!)</p>
<p>How about if it the process could be easily and readily monitored? Say by using Excel or a similar product. Bar graphs and pie charts always tell way more than actual figures! Then the impact on what is happening is all the more visible and easy to sell (a good budget retainer!)</p>
<p>Good luck in dealing with your duplicates.</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-quality/deduplication-first-time-wrong/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Data Quality – who needs it!</title>
		<link>http://datavaluetalk.com/data-quality/data-quality-%e2%80%93-who-needs-it/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=data-quality-%25e2%2580%2593-who-needs-it</link>
		<comments>http://datavaluetalk.com/data-quality/data-quality-%e2%80%93-who-needs-it/#comments</comments>
		<pubDate>Wed, 18 Mar 2009 12:45:32 +0000</pubDate>
		<dc:creator>Paul Tours</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[costs]]></category>
		<category><![CDATA[data quality project]]></category>
		<category><![CDATA[data quality tools]]></category>
		<category><![CDATA[deduplication]]></category>
		<category><![CDATA[dq tools]]></category>
		<category><![CDATA[efficiency]]></category>
		<category><![CDATA[efficient]]></category>
		<category><![CDATA[expensive]]></category>
		<category><![CDATA[project]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=836</guid>
		<description><![CDATA[Okay, so the theme Data Quality (DQ) has been around for more than a couple of years now. If you are reading this, chances are that you are obviously already informed on what’s available. I came from a large logistics company, where DQ was preached heavily and seen as a way of reducing costs. The [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-medium wp-image-848" title="escher_gezichtsbedrog2" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/03/escher_gezichtsbedrog2-276x300.jpg" alt="escher_gezichtsbedrog2" width="276" height="300" />Okay, so the theme <a title="data quality" href="http://www.humaninference.com" target="_blank">Data Quality</a> (DQ) has been around for more than a couple of years now. If you are reading this, chances are that you are obviously already informed on what’s available.</p>
<p>I came from a large logistics company, where DQ was preached heavily and seen as a way of reducing costs. The further though we went into what DQ could actually mean &#8211; the more vague and indirect the costs and effects seemed to be. The one thing we knew we really suffered from it was that we had a whole lot of duplicates in the system. This was always visible and the effects from it very tangible. They effectively helped screw up a perfectly good CRM tool. The solution was simple. Buy a deduplication tool and identify the duplicates!</p>
<p><span id="more-836"></span>Now there’s a good few deduplication tools out on the market. All of them will tell you how good they are at finding duplicates using mathematical-, probabilistic- and fuzzy matching. The list can go on and on. Where all providers though seem to stop short, is what they do with the duplicates found. Now all vendors have ways of identifying the duplicate pairs or even duplicate groups and most vendors will offer clever and fancy ways of bringing the duplicates together, but in almost all of these cases this happens OUTSIDE the clients’ current IT systems. Of course bulk loading old/new data back into the IT system is always so simple! So much fun! IT/ICS departments are always so understanding and helpful!!</p>
<p>This is what we had to find out for ourselves in a painful fashion. We quickly came to realise that finding the duplicates is barely even the tip of the iceberg. Actually trying to group the information together in such a way as to create a unique (golden) record was going to cost the company a lot of money. It would need an involvement of a Systems Integrator &#8211; because the problems are never just related to only one system, right? Multiple man years spent in project time (God! Consultants just prey for those types of projects.) Naturally, project costs rapidly rising into million of euros.</p>
<p>Where’s the saving then? I mean, the company has lived with the same problem for years right? So it can’t be that bad, can it?!? (Ever wondered exactly how the banks got into the current credit crisis?)</p>
<p>Unfortunately for the logistics company &#8211; the project never really got off the ground. Although the benefits of non duplicates were visible: reduced overhead – a more streamlined sales force, increased effectiveness, the risks about project timelines and overall cost killed it.</p>
<p>All is not lost. Good DQ providers will also offer software that effectively stops duplicates being allowed on to the clients system. Of course the safest way of ensuring a high level of DQ is to capture all the mistakes by their entry. Sounds wonderful, but like the Irish saying goes “If I was you &#8211; I wouldn’t start from here.” The DQ providers are only asked for a solution because the need has already been identified by the prospective client. The client has no other choice but to start from where they are. Just stopping the problem coming in through the front doors does not make the waste that’s already on the system magically disappear.</p>
<p>So <a title="data quality tools" href="http://www.humaninference.com/solutions/data-quality-tools" target="_blank">data quality tools</a> will of course offer solutions &#8211; but far too often they won’t go far enough! DQ can never really be about just installing a CD to solve the problem. Good expertise and guidance is necessary. It will always require a deeper understanding of the origin of the problem, a sharp focus on the companies’ pain points and a tight integration into the IT landscape. Suddenly a simple CD solution doesn’t fit anywhere; it quickly becomes an exponential cost, sucking up time and resources simply to make it work. At a time where companies are looking to become more efficient, cut costs and unfortunately overhead too, is that really the best way?</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-quality/data-quality-%e2%80%93-who-needs-it/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>

