<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Value Talk &#187; persistent identification</title>
	<atom:link href="http://datavaluetalk.com/tag/persistent-identification/feed/" rel="self" type="application/rss+xml" />
	<link>http://datavaluetalk.com</link>
	<description>Customer data is a valuable asset. Why not treat it that way?</description>
	<lastBuildDate>Mon, 09 Jan 2012 11:38:42 +0000</lastBuildDate>
	<language>nl</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Any close encounters with the FBI terrorist watchlist?</title>
		<link>http://datavaluetalk.com/data-governance/any-close-encounters-with-the-fbi-terrorist-watchlist/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=any-close-encounters-with-the-fbi-terrorist-watchlist</link>
		<comments>http://datavaluetalk.com/data-governance/any-close-encounters-with-the-fbi-terrorist-watchlist/#comments</comments>
		<pubDate>Mon, 17 Aug 2009 09:14:34 +0000</pubDate>
		<dc:creator>Ramon de Noronha</dc:creator>
				<category><![CDATA[Data Governance]]></category>
		<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[compliance]]></category>
		<category><![CDATA[identification]]></category>
		<category><![CDATA[identity]]></category>
		<category><![CDATA[interpretation]]></category>
		<category><![CDATA[knowledge]]></category>
		<category><![CDATA[persistent identification]]></category>
		<category><![CDATA[processes]]></category>
		<category><![CDATA[suspect list matching]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1125</guid>
		<description><![CDATA[Just before this summer the U.S. Department of Justice filed a report about the FBI Terrorist Watchlist. This watchtlist serves as a critical tool for screening and law enforcement personnel for alerting them when they come across a known or suspected terrorist. It is used by personnel at airports, harbours and the borderline. Also when [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-1127" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/08/tsc080105a.jpg" alt="tsc080105a" width="160" height="152" />Just before this summer the U.S. Department of Justice filed a report about the FBI Terrorist Watchlist. This watchtlist serves as a critical tool for screening and  law enforcement personnel for alerting them when they come across a known or suspected terrorist. It is used by personnel at airports, harbours and the borderline. Also when you apply for a visum you are matched against this watchlist. The Terrorist Screening Center, a subsidiary of the FBI, is responsible for maintaining the watchlist.</p>
<p>This watchlist was created in 2004 from several other lists and at that time it consisted of about 68.000 entries. I use the word entries, because in the years after it became fuzzy if one record is the same as one individual. By the end of 2008 the list had grown to over 1,1 million entries. In 2008 after the American Civil Liberties Union (ACLU) mentioned that the list had <a title="Numbers don't add up" href="http://www.aclu.org/privacy/gen/36064res20080721.html" target="_blank">passed the 1 million</a>, the government came with an explanation. <em>Although we have recorded over 1 million entries in the database, the net result is that these records correspond to about 400.000 individuals. </em>Terrorist often use different and thus multiple identities, use several (falsified) passports etc. But adding entries with only the first initials and last name, while an entry of the full first names and last name already exists will result in unwanted side-effects.<span id="more-1125"></span></p>
<p>We all know, as being interested in data quality and identity resolution, that J. Robinson will result into much more matches (hits) than James Robinson. Indeed the number of found matches will sky-rocket and have to be evaluated manually. Might this be the reason, that we see more and more security personnel on airports?</p>
<p>In the<a href="http://www.usdoj.gov/oig/reports/FBI/a0925/final.pdf" target="_blank"> latest audit report</a> of the U.S. Department of Justice about this watchlist one other problem was analyzed. While extensive procedures were made for nominating and adding suspects to the watchlist, there is no procedure for removing people from the list. Based on a sample of almost 70.000 entries and investigation of the individuals an astounding number of 35% omissions was found. People who had died were still on the list, people who were no longer investigated upon, cases which had been closed etc. So this watchlist is <a href="http://www.aclu.org/privacy/spying/watchlistcounter.html" target="_blank">growing and growing</a>. Resulting in screening personnel who ensnare many innocent travelers as suspected terrorists. And wasting their time and divert their energies from looking for true terrorists. It seems to me that FBI and TSC can benefit from better Data Governance, what do you think?</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-governance/any-close-encounters-with-the-fbi-terrorist-watchlist/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Bi-lingual streetnames in Amsterdam, do we really need it?</title>
		<link>http://datavaluetalk.com/data-quality/bi-lingual-streetnames-in-amsterdam-do-we-really-need-it/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=bi-lingual-streetnames-in-amsterdam-do-we-really-need-it</link>
		<comments>http://datavaluetalk.com/data-quality/bi-lingual-streetnames-in-amsterdam-do-we-really-need-it/#comments</comments>
		<pubDate>Mon, 17 Aug 2009 08:10:47 +0000</pubDate>
		<dc:creator>Ramon de Noronha</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[address standardization]]></category>
		<category><![CDATA[interpretation]]></category>
		<category><![CDATA[persistent identification]]></category>
		<category><![CDATA[standardization]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1110</guid>
		<description><![CDATA[So once in a while I visit Amsterdam and have a drink or two in the centre. Afterwards I use the tram to get back to the hotel. This weekend I was quite surprised to find out that all the streetnames are announced in English, at each stop. The easy and obvious one is of [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-1111" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/08/Straatnaambord.jpg" alt="Straatnaambord" width="305" height="137" />So once in a while I visit Amsterdam and have a drink or two in the centre. Afterwards I use the tram to get back to the hotel. This weekend I was quite surprised to find out that all the streetnames are announced in English, at each stop. The easy and obvious one is of course Centraal Station, which was translated to Central Station. I also can see how they came up with Rembrandt Square instead of Rembrandtsplein. But translating &#8220;Spui&#8221; to &#8220;Courtyard with a chapel&#8221; doesn&#8217;t help any tourists to find their destination.<span id="more-1110"></span></p>
<p>In Holland we already have three officially approved manners of naming streets an addresses. Nowadays we have the TNT Post standard, based on the very first publication of the postal code book, and afterwards corrected several times. This naming convention was the basis for the NEN 5825 standard (NEN is the dutch variant of ISO). But the true source of streetnames comes from the municipality and is called &#8220;Raadsbesluit&#8221;.Due to the different versions this can easily result in five different ways of spelling of the same street, as the example below shows:</p>
<table border="0" cellspacing="1" cellpadding="2" width="85%" align="center">
<tbody>
<tr>
<td><span style="font-size: x-small;">Original Postal Code Book (1978):</span></td>
<td><span style="font-size: x-small;">s en schepenenstr</span></td>
</tr>
<tr>
<td><span style="font-size: x-small;">Corrected TNT Post &#8211; standard<br />
</span></td>
<td><span style="font-size: x-small;">schout en s str</span></td>
</tr>
<tr>
<td><span style="font-size: x-small;">NEN- 5825 standard, version 1991: </span></td>
<td><span style="font-size: x-small;">Schout en Schepenenstr</span></td>
</tr>
<tr>
<td><span style="font-size: x-small;">NEN- 5825 standard, version 2002: </span></td>
<td><span style="font-size: x-small;">Sch en Schepenenstraat</span></td>
</tr>
<tr>
<td><span style="font-size: x-small;">Raadsbesluit:<br />
</span></td>
<td><span style="font-size: x-small;">Schout en Schepenenstraat</span></td>
</tr>
</tbody>
</table>
<p>What do you think, should we add a new &#8220;English&#8221; standard to existing standards. What are the pro&#8217;s and con&#8217;s for having English labels of the streetnames, please add your opinion in the comments. Should we also replace all signs and add the English label for the streetnames?  For more information and history of Dutch street names I recommend the following site <em><a title="Alles over straatnaam" href="http://www.allesoverstraatnamen.nl/" target="_blank">&#8220;alles over straatnamen&#8221;</a></em>.</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-quality/bi-lingual-streetnames-in-amsterdam-do-we-really-need-it/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The importance of persistent identification</title>
		<link>http://datavaluetalk.com/mdm/the-importance-of-persistent-identification/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=the-importance-of-persistent-identification</link>
		<comments>http://datavaluetalk.com/mdm/the-importance-of-persistent-identification/#comments</comments>
		<pubDate>Thu, 16 Oct 2008 09:42:55 +0000</pubDate>
		<dc:creator>Ramon de Noronha</dc:creator>
				<category><![CDATA[Data Services]]></category>
		<category><![CDATA[MDM for customer data]]></category>
		<category><![CDATA[movers index]]></category>
		<category><![CDATA[persistent identification]]></category>
		<category><![CDATA[verhuisberichten]]></category>

		<guid isPermaLink="false">http://datavaluetalk.wordpress.com/?p=82</guid>
		<description><![CDATA[A much overlooked issue in Customer Data Integration projects is &#8220;Persistent Identification&#8221;. Persons and companies are very often identified using their address data. But, what do you do if a person has moved from address A to address B. One, thing you really don&#8217;t want is that the person is added to database as a [...]]]></description>
			<content:encoded><![CDATA[<p>A much overlooked issue in Customer Data Integration projects is &#8220;Persistent Identification&#8221;.</p>
<p>Persons and companies are very often identified using their address data. But, what do you do if a person has moved from address A to address B. One, thing you really don&#8217;t want is that the person is added to database as a new person (INSERT). From that moment a duplicate person or company resides in your system. This should be prevented, by creating searching indexes which include the current and the previous address of the persons and companies in your database. </p>
<p><img src="http://verhuismanager.com/wp-content/uploads/verhuizen-1.jpg" alt="" /></p>
<p><span id="more-13"></span>A common way to solve this is to store all addresses which the person is known (historical addresses). For each address the start-date (FROM &#8230;) and end-date (TILL &#8230;) should be stored. Especially techniques which are used in Data Warehousing, like dimensional modelling, come into place.</p>
<p>Of course, you also need other identifying attributes to recognize the moved person. Focus on other customer touchpoints like the e-mail address of persons. Nowadays an e-mail is less subject to change then the physical address where somebody lives. A social security number or banking account number are even more precise, but unfortunately in many cases not available.</p>
<p>And, last but not least, if possible make uses of external sources. For instance in the Netherlands the movers index is maintained by TPG (TPG Verhuisberichten). In other countries similar services are available.</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/mdm/the-importance-of-persistent-identification/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

