<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Value Talk &#187; migration</title>
	<atom:link href="http://datavaluetalk.com/tag/migration/feed/" rel="self" type="application/rss+xml" />
	<link>http://datavaluetalk.com</link>
	<description>Customer data is a valuable asset. Why not treat it that way?</description>
	<lastBuildDate>Mon, 09 Jan 2012 11:38:42 +0000</lastBuildDate>
	<language>nl</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Data Quality Summit 2010 &#8211; a must-attend-event for every data quality professional</title>
		<link>http://datavaluetalk.com/data-quality/data-quality-summit-2010-a-must-attend-for-every-data-quality-professional/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=data-quality-summit-2010-a-must-attend-for-every-data-quality-professional</link>
		<comments>http://datavaluetalk.com/data-quality/data-quality-summit-2010-a-must-attend-for-every-data-quality-professional/#comments</comments>
		<pubDate>Wed, 21 Oct 2009 10:03:54 +0000</pubDate>
		<dc:creator>Holger Wandt</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[data quality maturity]]></category>
		<category><![CDATA[Data Quality Summit]]></category>
		<category><![CDATA[DQS 2010]]></category>
		<category><![CDATA[gartner]]></category>
		<category><![CDATA[Human Inference]]></category>
		<category><![CDATA[integration]]></category>
		<category><![CDATA[migration]]></category>
		<category><![CDATA[Sabine Palinckx]]></category>
		<category><![CDATA[Ted Friedman]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=1224</guid>
		<description><![CDATA[On 28 january 2010 the next Human Inference Data Quality Summit will be held in the Evoluon in Eindhoven (NL). The theme &#8211; Value your data, value your future- is inspired by the idea that investments in data quality have become part of standard business and that vision, strategy and solutions are being synchronized with [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-full wp-image-1225" title="evoluon" src="http://datavaluetalk.com/cms/wp-content/uploads/2009/10/evoluon.jpg" alt="evoluon" width="125" height="75" /></p>
<p>On  <strong><em>28 january 2010</em> </strong>the next  Human Inference <a href="http://www.dataqualitysummit.com/" target="_blank">Data Quality Summit </a>will be held in the Evoluon in Eindhoven (NL). The theme &#8211; <strong><em>Value your data, value your future</em></strong>- is inspired by the idea that investments in data quality have become part of standard business and that vision, strategy and solutions are being synchronized with these investments. As data quality has reached a certain level of maturity, it is  time to have an in-depth look at the (near) future of Data Quality.</p>
<p>The program is challenging, comprehensive and entertaining. Keynote speakers include <strong><em>Ted Friedman</em></strong> (vice president Gartner Research), <strong><em>Mathias K</em>l<em>ier</em></strong> (professor at the University of Innsbruck) and <strong><em>Sabine Palinckx</em></strong> (CEO Human Inference).  Additionally, in the break-out-session a wide variety of theme-related topics will be addressed: maximising the buisnes value of information, guiding a dq-project through migration, data quality maturity, marketing effectiveness and many more&#8230;.. In short, the Data Quality Summit is not to be missed!</p>
<p>Save the date and register by <a href="http://www.customerview.nl/HumanInference/reg01.aspx" target="_blank">clicking this link</a>!</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-quality/data-quality-summit-2010-a-must-attend-for-every-data-quality-professional/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Migrating data; migrating meaning</title>
		<link>http://datavaluetalk.com/data-quality/migrating-meaning/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=migrating-meaning</link>
		<comments>http://datavaluetalk.com/data-quality/migrating-meaning/#comments</comments>
		<pubDate>Thu, 13 Nov 2008 19:34:52 +0000</pubDate>
		<dc:creator>hiqualityhenk</dc:creator>
				<category><![CDATA[Data Quality]]></category>
		<category><![CDATA[Add new tag]]></category>
		<category><![CDATA[HIquality Inspect]]></category>
		<category><![CDATA[migration]]></category>

		<guid isPermaLink="false">http://datavaluetalk.com/?p=109</guid>
		<description><![CDATA[At first glance, migrating data from one database to another looks trivial: you can easily discover the schema or datamodel of each database, then simply map the elements of one to the other. Any ETL tool worth its acronym will let you do that. In practice, migration is a nightmare&#8230; A database belongs to an [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://datavaluetalk.com/2008/11/13/migrating-meaning/"><img class="alignleft" title="Migration" src="http://farm4.static.flickr.com/3256/2428106795_c4ccaf0df5_t.jpg" alt="Migration " width="67" height="100" /></a>At first glance, migrating data from one database to another looks trivial: you can easily discover the schema or datamodel of each database, then simply map the elements of one to the other. Any ETL tool worth its acronym will let you do that.</p>
<p>In practice, migration is a nightmare&#8230;</p>
<p><span id="more-109"></span>A database belongs to an application and it is the application that gives meaning to the data. So to map from one to another, you need to understand each application. And you&#8217;ll soon find that one application has concepts that do not exist, in quite the same form, in the other.</p>
<p>For example, when migrating grants-management data for a charitable foundation, I found that the source package had broad indicators of the status of each grant application: Pending, Approved, Rejected, Closed. The target package had a similar top-level division, but allowed a much more fine‑grained specification of the lifecycle below that. My client was keen to exploit this, as this finer grain enabled better coordination of the workflow, more automation of letter generation, and better management and reporting of workload and progress. To populate this fine grained status, we had to scour the source database for additional information, and apply some 20 tests to a dozen additional fields.</p>
<p>OK, so this was a bit elaborate, but still entirely tractable. The data was there and could be interpreted unambiguously.</p>
<p>Things become much harder where it isn&#8217;t the application that gives meaning to the data, but the <em>users</em>. Users often find that their application does not quite allow them to express the distinctions they want to make, or to record all the facts they want to record. This is especially so in older applications that lack easy customisation &#8211; but it is still true of modern applications, as customisation involves time, effort and cost. Luckily, any application tends to contain a couple of spare fields, included by the designers because someone somewhere asked for them, but that are not actually any use in this particular organisation. These spare fields then become the receptacle for the information that won&#8217;t fit elsewhere. Experience and lore tell users the true meaning of these fields, ignoring whatever label they may have in the user interface.</p>
<p>And it gets worse. From time to time, users change their minds as to what is important. Fields fall into disuse and, after some time, are repurposed. In the grants-management migration, I found fields that had changed meaning three times, within the life of the application, plus a few more times in an older application from which the data had been converted, previously.</p>
<p>At the time, all I had available as a tool were my eyeballs. Now, of course, I would instantly reach for <a href="http://www.humaninference.com/products/hiquality-inspect">HIquality Inspect</a>.</p>
<p>The ready-made <em>Frequency info</em> check in Inspect is a very handy tool for quickly listing the different values that occur in a given field. On the basis of this, you can then build regular expressions to retrieve individual records based on particular strings or substrings. Regular expressions are very flexible and allow a degree of fuzzy matching. Assuming the records also contain audit attributes, like Date Modified, you can then see the date ranges when these various strings were most popular. All you need to do then is to find a user who remembers what these strings actually <em>meant</em>, between August 1999 and April 2003. Which may be easier than finding a manager who is prepared to say this data can be safely discarded&#8230;</p>
<p>Things get even hairier when users don&#8217;t have spare fields and resort to placing various remarks alongside &#8211; or embedded in &#8211; the expected content of well-used fields. But that is one for another day.</p>
]]></content:encoded>
			<wfw:commentRss>http://datavaluetalk.com/data-quality/migrating-meaning/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

