<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Value Talk &#187; identification</title>
	<atom:link href="http://datavaluetalk.com/tag/identification/feed/" rel="self" type="application/rss+xml" />
	<link>http://datavaluetalk.com</link>
	<description>Customer data is a valuable asset. Why not treat it that way?</description>
	<lastBuildDate>Thu, 10 May 2012 14:49:53 +0000</lastBuildDate>
	<language>nl</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Know your customer to trust your data</title>
		<link>http://datavaluetalk.com/data-quality/know-your-customer-to-trust-your-data/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=know-your-customer-to-trust-your-data</link>
		<comments>http://datavaluetalk.com/data-quality/know-your-customer-to-trust-your-data/#comments</comments>
		<pubDate>Tue, 07 Dec 2010 13:53:36 +0000</pubDate>
		<dc:creator>Holger Wandt</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[customer lifetime value]]></category>
		<category><![CDATA[data quality strategy]]></category>
		<category><![CDATA[data trust]]></category>
		<category><![CDATA[identification]]></category>
		<category><![CDATA[interpretation]]></category>
		<category><![CDATA[know your customer]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1590</guid>
		<description><![CDATA[The success of many business processes is linked directly to the quality of customer data. This is not only an obvious fact, but a recurring conclusion of many field studies: Incorrect, incomplete and inaccurate data will have a direct impact on your business success rate. The symptoms show up in inefficient marketing [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-thumbnail wp-image-1596" title="blinddoek" src="http://datavaluetalk.com/cms/wp-content/uploads/2010/12/blinddoek-150x150.jpg" alt="blinddoek" width="150" height="150" /></p>
<p>The success of many business processes is linked directly to the quality of customer data. This is not only an obvious fact, but a recurring conclusion of many field studies: Incorrect, incomplete and inaccurate data will have a direct impact on your business success rate. The symptoms show up in inefficient marketing and sales processes, customer dissatisfaction, difficult cross- and upselling, unreliable analyses and many other disturbances in the day-to-day business of almost every organization dealing with customer, supplier and/or partner data.</p>
<p>In essence, it all comes down to knowing your data, in order to be able to trust your data. If you trust your data, you are definitely doing something right. So, how do you establish that trust? For this, you first have to answer a short, yet rather complex question: What is what in my database(s)? In other words: You have to identify and interpret the data you are working with.</p>
<p>A robust customer data identification solution intelligently interprets the details of both natural and legal persons. That process has to take account of the significance of words in a specific context, usage of company names, abbreviations, synonyms, acronyms, spelling mistakes, notation methods, standards and phonetic similarity of words. All in all, this is not a simple task; it more or less mimics the capabilities that humans show when interpreting data &#8230;<span id="more-1590"></span></p>
<p>It is, however, the first step in a solid data quality strategy. This strategy should entail some sort of methodical, iterative approach. This makes sense, since data cleansing is basically a process of recurring steps. Initial cleansing should, for example, be combined with methods to prevent future pollution. In other words: Do not only fight the symptoms of bad quality, but eliminate the root causes and make sure your clean data will stay clean. Below you will find an illustration of such an approach:</p>
<p><img class="alignleft size-full wp-image-1591" title="DQ strategy" src="http://datavaluetalk.com/cms/wp-content/uploads/2010/12/DQ-strategy.jpg" alt="DQ strategy" width="444" height="256" /></p>
<p>Effective customer service, targeted cross- and upselling, cost reduction and the creation of customer lifetime value are but a few goals that will be achieved by defining and deploying the right data quality strategy. So start to <a title="Banken - Know your customer" href="http://www.humaninference.nl/branches/banken" target="_blank">know your customer</a> and learn to trust your data&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-quality/know-your-customer-to-trust-your-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Why there is a maximum number of (fe)males in a country</title>
		<link>http://datavaluetalk.com/data-quality/why-there-are-maximum-of-females-in-a-country/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=why-there-are-maximum-of-females-in-a-country</link>
		<comments>http://datavaluetalk.com/data-quality/why-there-are-maximum-of-females-in-a-country/#comments</comments>
		<pubDate>Tue, 19 Jan 2010 13:38:24 +0000</pubDate>
		<dc:creator>Winfried van Holland</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[identification]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[privacy-sensitive]]></category>
		<category><![CDATA[social security number]]></category>
		<category><![CDATA[unique identification]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1288</guid>
		<description><![CDATA[Within Europe there is no such thing as a European Social Security Number or European Identification Number. A lot of countries have their own system, and other countries are struggling to get a system into place. The struggle of some countries has to do with historical reasons and with privacy aspects. Unique identification is not always [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignnone" src="http://4.bp.blogspot.com/_jQS2yW8CbuY/Sb43ZcL28WI/AAAAAAAAADg/cNBvLb2bq6o/s320/CartaoCidadao_f.jpg" alt="" width="320" height="207" />Within Europe there is no such thing as a European Social Security Number or European Identification Number. A lot of countries have their own system, and other countries are struggling to get a system into place.</p>
<p>The struggle of some countries has to do with historical reasons and with privacy aspects. Unique identification is not always used in favour of the community. And some of the identification systems in use contain privacy-sensitive information, such as date of birth, gender and/or place of birth, while older systems might even contain religious or other privacy-sensitive information.</p>
<p>A wide range of countries use the combination of date of birth, gender identification and the political region where you were born. In such a mechanism it is most common that part of the identification number is a 2-digit or 3-digit serial number to identify the unique male or female born on a specific date (or born in a specific month). Some countries provide odd serial numbers for males, and even ones for females. Bulgaria is the only one that wants &#8220;odd&#8221; females. Some countries like to divide by range (0-499 male, 500-999 female). And some countries, like Norway, make nice combinations to include the century of birth or period of birth in the serial number.<span id="more-1288"></span></p>
<p>This way of generating &#8216;numbers&#8217; means that pretty soon you will hit the maximum number of citizens that the system can handle for a specific day. Some systems run out of numbers if more than 500 males or females are born on one day. The Danish system ran into that situation in 2007, when due to immigration the number of people registered with the birth date January 1st 1965 exceeded what the system could handle! The Danish system (CPR-nummer) has a 3-digit serial number where one of the digits is also the control digit (reducing the possible numbers from 500 to fewer than 50).</p>
<p>It is remarkable to see what some countries do to solve the &#8216;century&#8217; issue, where people born in the 19th, 20th or 21st century would otherwise get the same ID: they add 20 or 40 to the month. The same goes for foreigner identification, e.g. Sweden, which adds 60 to the day of birth. Sweden also adds 20 to the month to distinguish persons from organisations.</p>
<p>If you want to see the details of these systems, have a look at <a href="http://prezi.com/csnv3cynv4ai/">http://prezi.com/csnv3cynv4ai/</a> or <a href="http://en.wikipedia.org/wiki/National_identification_number">http://en.wikipedia.org/wiki/National_identification_number</a>. Be prepared: it definitely took some PhDs to invent these systems.</p>
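<p>The offset tricks described above can be sketched in a few lines of Python. This is only an illustration under assumptions: the function names are made up for this example, no real national format is implemented, and what each offset means (which century, foreigner or organisation) differs per country, so the sketch only reports the offset it found.</p>

```python
def decode_month(encoded_month):
    """Split an encoded month field into (real month, offset).

    Assumes the offsets mentioned in the text: 20 or 40 added to the month.
    """
    for offset in (40, 20, 0):
        month = encoded_month - offset
        if 1 <= month <= 12:
            return month, offset
    raise ValueError(f"not a valid encoded month: {encoded_month}")


def decode_day(encoded_day, marker=60):
    """Split an encoded day field into (real day, marker present?).

    Assumes the scheme mentioned in the text: 60 added to the day of birth.
    """
    if 1 <= encoded_day <= 31:
        return encoded_day, False
    if 1 <= encoded_day - marker <= 31:
        return encoded_day - marker, True
    raise ValueError(f"not a valid encoded day: {encoded_day}")


def gender_from_serial(serial, odd_is_male=True):
    """Odd/even scheme; pass odd_is_male=False for the reversed convention."""
    is_odd = serial % 2 == 1
    return "male" if is_odd == odd_is_male else "female"


def gender_from_range(serial, boundary=500):
    """Range scheme from the text: 0-499 male, 500-999 female."""
    return "male" if serial < boundary else "female"


print(decode_month(45))
print(decode_day(75))
print(gender_from_serial(7), gender_from_range(731))
```

<p>Note that a range scheme leaves at most 500 serial numbers per gender per day, which is exactly the capacity limit discussed above.</p>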
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-quality/why-there-are-maximum-of-females-in-a-country/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>The &#8220;miracle&#8221; of customer data integration</title>
		<link>http://datavaluetalk.com/mdm/the-miracle-of-customer-data-integration/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-miracle-of-customer-data-integration</link>
		<comments>http://datavaluetalk.com/mdm/the-miracle-of-customer-data-integration/#comments</comments>
		<pubDate>Mon, 24 Aug 2009 13:43:37 +0000</pubDate>
		<dc:creator>Holger Wandt</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[MDM for customer data]]></category>
		<category><![CDATA[cdi]]></category>
		<category><![CDATA[customer view]]></category>
		<category><![CDATA[data processes]]></category>
		<category><![CDATA[identification]]></category>
		<category><![CDATA[intelligent matching]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1193</guid>
		<description><![CDATA[The more a company knows about its customers’ wishes, needs and habits and the more that company is able to tailor its proposition accordingly, the greater the value it will eventually provide for its customers. We all know that there are countless examples where defective, fragmented, or just plain poor customer data cause unnecessary costs, [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-thumbnail wp-image-1196" title="multiple view" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/08/mulitple-view-150x150.jpg" alt="multiple view" width="150" height="150" /></p>
<p>The more a company knows about its customers’ wishes, needs and habits and the more that company is able to tailor its proposition accordingly, the greater the value it will eventually provide for its customers. We all know that there are countless examples where defective, fragmented, or just plain poor customer data cause unnecessary costs, decrease in revenue, employee dissatisfaction or frustration, damage to the corporate image and many other undesirable or painful consequences.</p>
<p>Customer data quality and integration problems impact every area of the value chain of organisations. Far too often companies have multiple views of their customers. Customer Data Integration (or MDM for Customer Data) is the key to providing companies with a single view of their customer. <span id="more-1193"></span>According to Gartner, Customer Data Integration (CDI) is <em>a combination of technology, services and processes to deliver an accurate, timely and complete view of the customer across multiple channels, lines of business, departments and divisions drawing customer data from multiple sources and systems.</em></p>
<p>I think that the real &#8220;miracle&#8221; of CDI lies in the automated, intelligent matching of customer records. Mind you, I&#8217;m not questioning the importance of the various CDI processes (for example, I think that <a href="http://datavaluetalk.com/2009/08/21/how-to-create-the-golden-record/" target="_blank"><span style="color: #ff0000;">the post of my colleague Ramon de Noronha on the creation of &#8220;golden&#8221; records </span></a>is very important). I&#8217;m just saying that, whenever the integration of customer data is an issue, intelligent, automated matching is the key prerequisite for success.</p>
<p><span style="color: #000000;"><em>Your customer data integration solution is only as good as your matching engine.</em></span> If this statement intrigues you, I strongly advise you to read the white paper <a href="http://www.humaninference.com/en/Our%20Solutions/Propositions/~/media/BD99FF359FF9413AAD6CA237E0176C1A.ashx" target="_blank"><span style="color: #ff0000;">&#8220;High Precision Matching at the heart of Customer Data Integration</span>&#8221;. </a>Enjoy!</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/mdm/the-miracle-of-customer-data-integration/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Any close encounters with the FBI terrorist watchlist?</title>
		<link>http://datavaluetalk.com/data-governance/any-close-encounters-with-the-fbi-terrorist-watchlist/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=any-close-encounters-with-the-fbi-terrorist-watchlist</link>
		<comments>http://datavaluetalk.com/data-governance/any-close-encounters-with-the-fbi-terrorist-watchlist/#comments</comments>
		<pubDate>Mon, 17 Aug 2009 09:14:34 +0000</pubDate>
		<dc:creator>Ramon de Noronha</dc:creator>
				<category><![CDATA[Data Governance]]></category>
		<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[compliance]]></category>
		<category><![CDATA[identification]]></category>
		<category><![CDATA[identity]]></category>
		<category><![CDATA[interpretation]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[persistent identification]]></category>
		<category><![CDATA[processes]]></category>
		<category><![CDATA[suspect list matching]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1125</guid>
		<description><![CDATA[Just before this summer the U.S. Department of Justice filed a report about the FBI Terrorist Watchlist. This watchlist serves as a critical tool for screening and law enforcement personnel, alerting them when they come across a known or suspected terrorist. It is used by personnel at airports, harbours and borders. Also when [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-1127" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/08/tsc080105a.jpg" alt="tsc080105a" width="160" height="152" />Just before this summer the U.S. Department of Justice filed a report about the FBI Terrorist Watchlist. This watchlist serves as a critical tool for screening and law enforcement personnel, alerting them when they come across a known or suspected terrorist. It is used by personnel at airports, harbours and borders. Also when you apply for a visa you are matched against this watchlist. The Terrorist Screening Center, a division of the FBI, is responsible for maintaining the watchlist.</p>
<p>This watchlist was created in 2004 from several other lists, and at that time it consisted of about 68,000 entries. I use the word entries because in the years that followed it became unclear whether one record corresponds to one individual. By the end of 2008 the list had grown to over 1.1 million entries. In 2008, after the American Civil Liberties Union (ACLU) mentioned that the list had <a title="Numbers don't add up" href="http://www.aclu.org/privacy/gen/36064res20080721.html" target="_blank">passed the 1 million</a> mark, the government came up with an explanation: <em>Although we have recorded over 1 million entries in the database, the net result is that these records correspond to about 400,000 individuals. </em>Terrorists often use different and thus multiple identities, use several (falsified) passports, etc. But adding entries with only the first initials and last name, while an entry with the full first names and last name already exists, will result in unwanted side effects.<span id="more-1125"></span></p>
<p>As people interested in data quality and identity resolution, we all know that J. Robinson will result in many more matches (hits) than James Robinson. Indeed, the number of matches found will skyrocket, and they all have to be evaluated manually. Might this be the reason that we see more and more security personnel at airports?</p>
<p>In the <a href="http://www.usdoj.gov/oig/reports/FBI/a0925/final.pdf" target="_blank">latest audit report</a> of the U.S. Department of Justice about this watchlist one other problem was analyzed. While extensive procedures were made for nominating and adding suspects to the watchlist, there is no procedure for removing people from the list. Based on a sample of almost 70,000 entries and an investigation of the individuals behind them, an astounding 35% turned out to be omissions: people who had died were still on the list, as were people who were no longer under investigation and cases that had been closed. So this watchlist keeps <a href="http://www.aclu.org/privacy/spying/watchlistcounter.html" target="_blank">growing and growing</a>. The result is that screening personnel ensnare many innocent travelers as suspected terrorists, which wastes their time and diverts their energy from looking for true terrorists. It seems to me that the FBI and TSC could benefit from better Data Governance. What do you think?</p>
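<p>A small Python sketch makes the effect of initial-only entries visible. The traveler names and the matching rule are made up for this illustration; a real matching engine is far more sophisticated than this exact comparison.</p>

```python
def matches(entry_first, entry_last, person_first, person_last):
    """Return True when a watchlist entry matches a person.

    Last names must agree exactly. A one-letter first name such as "J."
    is treated as an initial and agrees with every first name that
    starts with that letter.
    """
    if entry_last.lower() != person_last.lower():
        return False
    entry = entry_first.rstrip(".").lower()
    person = person_first.lower()
    if len(entry) == 1:
        return person.startswith(entry)
    return entry == person


# Hypothetical travelers, for illustration only.
travelers = [
    ("James", "Robinson"),
    ("John", "Robinson"),
    ("Julia", "Robinson"),
    ("Mary", "Robinson"),
    ("James", "Roberts"),
]


def count_hits(entry_first, entry_last):
    """Count how many travelers a single watchlist entry would flag."""
    return sum(matches(entry_first, entry_last, f, l) for f, l in travelers)


print(count_hits("J.", "Robinson"))
print(count_hits("James", "Robinson"))
```

<p>In this toy list the initial-only entry flags three travelers where the full name flags one, and every extra hit has to be evaluated by hand.</p>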
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-governance/any-close-encounters-with-the-fbi-terrorist-watchlist/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What&#8217;s the value of unique identifiers</title>
		<link>http://datavaluetalk.com/mdm/whats-the-value-of-unique-identifiers/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=whats-the-value-of-unique-identifiers</link>
		<comments>http://datavaluetalk.com/mdm/whats-the-value-of-unique-identifiers/#comments</comments>
		<pubDate>Thu, 16 Oct 2008 09:57:41 +0000</pubDate>
		<dc:creator>Ron Mulderij</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[MDM for customer data]]></category>
		<category><![CDATA[identification]]></category>
		<category><![CDATA[name-number-check]]></category>
		<category><![CDATA[unique identifier]]></category>

		<guid isPermaLink="false">http://datavaluetalk.wordpress.com/?p=91</guid>
		<description><![CDATA[In a survey by Human Inference, processes turned out to be the most challenging data quality aspect. Since processes often use unique identifiers to identify customers, accounts or data in general, the reliability of these identifiers influences the quality of your process. To avoid incorrect results and decisions a check [...]]]></description>
			<content:encoded><![CDATA[<p>In a survey by Human Inference, processes turned out to be the most challenging <a title="data quality" href="http://www.humaninference.com/" target="_blank">data quality</a> aspect.</p>
<div class="mceTemp">
<dl class="wp-caption alignnone">
<dt class="wp-caption-dt"><a href="http://datavaluetalk.files.wordpress.com/2008/10/survey-challenge-dq.jpg"><img class="size-medium wp-image-75" title="survey-challenge-dq" src="http://datavaluetalk.files.wordpress.com/2008/10/survey-challenge-dq.jpg?w=300" alt="HI Survey Results" width="300" height="188" /></a></dt>
<dd class="wp-caption-dd">HI Survey Results</dd>
</dl>
<p>Since processes often use unique identifiers to identify customers, accounts or data in general, the reliability of these identifiers influences the quality of your process. To avoid incorrect results and decisions, a check of the data related to the identifiers is required. The illegal use of such an identifier leads to polluted data. A name-number-check should be implemented for, e.g.:</p>
</div>
<ul>
<li>
<div class="mceTemp">bank account numbers</div>
</li>
<li>
<div class="mceTemp">social security numbers</div>
</li>
<li>
<div class="mceTemp">chamber of commerce numbers</div>
</li>
</ul>
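<p>What a name-number-check could look like is sketched below in Python. Everything in it is an assumption made for illustration: the registry is a plain dictionary with invented numbers and names, and the comparison is a simple normalised string match rather than the intelligent name matching a production check would need.</p>

```python
def normalize(name):
    """Lower-case a name, drop dots and collapse whitespace."""
    return " ".join(name.lower().replace(".", " ").split())


def name_number_check(number, name, registry):
    """Compare the supplied name with the name registered for a number.

    registry is a hypothetical mapping of identifier to registered name.
    Returns "match", "mismatch" or "unknown number".
    """
    registered = registry.get(number)
    if registered is None:
        return "unknown number"
    if normalize(registered) == normalize(name):
        return "match"
    return "mismatch"


# Invented sample data, for illustration only.
registry = {
    "123456789": "J. Robinson",
    "987654321": "Example Trading B.V.",
}

print(name_number_check("123456789", "j robinson", registry))
print(name_number_check("123456789", "M. Jansen", registry))
print(name_number_check("555555555", "J. Robinson", registry))
```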
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/mdm/whats-the-value-of-unique-identifiers/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

