<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for The Data Quality Chronicle</title>
	<atom:link href="http://thedataqualitychronicle.org/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://thedataqualitychronicle.org</link>
	<description>Stuff I have learned, read, or think about ...</description>
	<lastBuildDate>Sat, 31 Mar 2012 14:17:08 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>Comment on Justin Bieber abuses Twitter but proves how similar phone numbers are by William Sharp</title>
		<link>http://thedataqualitychronicle.org/justin-bieber-abuses-twitter-but-proves-how-similar-phone-numbers-are/#comment-2130</link>
		<dc:creator>William Sharp</dc:creator>
		<pubDate>Sat, 31 Mar 2012 14:17:08 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=2027#comment-2130</guid>
		<description>They can be used as a confirmation of accuracy; however, because multiple customers can share the same phone number, this may not be as accurate as you&#039;d think. In business situations, different people can share a phone number or have a very common phone number. In retail-type situations, multiple-customer households complicate this in the same way.
I have used phone number as confirmation of an accurate match; however, that can be misleading.</description>
		<content:encoded><![CDATA[<p>They can be used as a confirmation of accuracy; however, because multiple customers can share the same phone number, this may not be as accurate as you&#8217;d think. In business situations, different people can share a phone number or have a very common phone number. In retail-type situations, multiple-customer households complicate this in the same way.<br />
I have used phone number as confirmation of an accurate match; however, that can be misleading.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Justin Bieber abuses Twitter but proves how similar phone numbers are by Prashanta C</title>
		<link>http://thedataqualitychronicle.org/justin-bieber-abuses-twitter-but-proves-how-similar-phone-numbers-are/#comment-2128</link>
		<dc:creator>Prashanta C</dc:creator>
		<pubDate>Sat, 31 Mar 2012 12:15:43 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=2027#comment-2128</guid>
		<description>Good one William,

Using phone number(s) in matching may not be a great idea, as you mentioned, but I still tend to treat it as an affirmative criterion when most of the other critical elements end up being good matches. Say you have a name, address, DOB, gender and identifier match; a matching phone number then more or less confirms a good match. Usually we apply edit distance to the phone number. In the above scenario, we might give a good boost to the overall score if it&#039;s an exact phone number match, and a lower (significant) boost if the match is 1 or more edits away.

My 2 cents

-Prashant</description>
		<content:encoded><![CDATA[<p>Good one William,</p>
<p>Using phone number(s) in matching may not be a great idea, as you mentioned, but I still tend to treat it as an affirmative criterion when most of the other critical elements end up being good matches. Say you have a name, address, DOB, gender and identifier match; a matching phone number then more or less confirms a good match. Usually we apply edit distance to the phone number. In the above scenario, we might give a good boost to the overall score if it&#8217;s an exact phone number match, and a lower (significant) boost if the match is 1 or more edits away.</p>
<p>My 2 cents</p>
<p>-Prashant</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on How To Integrate Social Media Into Your WordPress Site by Prash Chan (@MDMGeek)</title>
		<link>http://thedataqualitychronicle.org/how-to-integrate-social-media-into-your-wordpress-site/#comment-1632</link>
		<dc:creator>Prash Chan (@MDMGeek)</dc:creator>
		<pubDate>Tue, 24 Jan 2012 21:59:25 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=1911#comment-1632</guid>
		<description>Can&#039; wait 2 x&#039;plore RT @dqchronicle How To Integrate Social Media Into Your Wordpress Site &#124; &#124; The Data Quality Chronicle bit.ly/wKj81m #li </description>
		<content:encoded><![CDATA[<p>Can&#8217; wait 2 x&#8217;plore RT @dqchronicle How To Integrate Social Media Into Your WordPress Site | | The Data Quality Chronicle bit.ly/wKj81m #li </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Can Big Data and Social Media help HR in efficient/quality hiring? by Andy Wenzel (@AndyWenzel)</title>
		<link>http://thedataqualitychronicle.org/can-big-data-and-social-media-help-hr-in-efficientquality-hiring/#comment-1623</link>
		<dc:creator>Andy Wenzel (@AndyWenzel)</dc:creator>
		<pubDate>Mon, 23 Jan 2012 02:05:48 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=1905#comment-1623</guid>
		<description>Can Big Data help with HR in hiring?  &lt;a href=&quot;http://t.co/eZyyfUNc&quot; rel=&quot;nofollow&quot;&gt;http://t.co/eZyyfUNc&lt;/a&gt; </description>
		<content:encoded><![CDATA[<p>Can Big Data help with HR in hiring?  <a href="http://t.co/eZyyfUNc" rel="nofollow">http://t.co/eZyyfUNc</a> </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Too much push, not enough pull by Loraine Lawson (@LoraineLawson)</title>
		<link>http://thedataqualitychronicle.org/too-much-push-not-enough-pull/#comment-1619</link>
		<dc:creator>Loraine Lawson (@LoraineLawson)</dc:creator>
		<pubDate>Sat, 21 Jan 2012 15:16:37 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=1712#comment-1619</guid>
		<description>Poll on data quality. &lt;a href=&quot;http://t.co/Re9PNkz3&quot; rel=&quot;nofollow&quot;&gt;http://t.co/Re9PNkz3&lt;/a&gt; by @dqchronicle </description>
		<content:encoded><![CDATA[<p>Poll on data quality. <a href="http://t.co/Re9PNkz3" rel="nofollow">http://t.co/Re9PNkz3</a> by @dqchronicle </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Big Data &#8230; Little Data Quality by DATAVOTE (@contactdata)</title>
		<link>http://thedataqualitychronicle.org/big-data-little-data-quality/#comment-1592</link>
		<dc:creator>DATAVOTE (@contactdata)</dc:creator>
		<pubDate>Wed, 18 Jan 2012 06:23:58 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=1362#comment-1592</guid>
		<description>Interesting post &lt;a href=&quot;http://t.co/jgUCgSN6&quot; rel=&quot;nofollow&quot;&gt;http://t.co/jgUCgSN6&lt;/a&gt; by @dqchronicle </description>
		<content:encoded><![CDATA[<p>Interesting post <a href="http://t.co/jgUCgSN6" rel="nofollow">http://t.co/jgUCgSN6</a> by @dqchronicle </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on The Seven Habits of Highly Effective Data Quality by David Pratt (@DataMgmtWonk)</title>
		<link>http://thedataqualitychronicle.org/the-seven-habits-of-highly-effective-data-quality/#comment-1587</link>
		<dc:creator>David Pratt (@DataMgmtWonk)</dc:creator>
		<pubDate>Wed, 18 Jan 2012 02:28:17 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=1092#comment-1587</guid>
		<description>Just read &quot;The Seven Habits of Highly Effective Data Quality&quot; via @dqchronicle &lt;a href=&quot;http://t.co/EfC5vs7N&quot; rel=&quot;nofollow&quot;&gt;http://t.co/EfC5vs7N&lt;/a&gt; #dataquality #7Habits #covey </description>
		<content:encoded><![CDATA[<p>Just read &#8220;The Seven Habits of Highly Effective Data Quality&#8221; via @dqchronicle <a href="http://t.co/EfC5vs7N" rel="nofollow">http://t.co/EfC5vs7N</a> #dataquality #7Habits #covey </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on When using data quality tools, what do you include? by (@AxelTroike) (@AxelTroike)</title>
		<link>http://thedataqualitychronicle.org/when-using-data-quality-tools-what-do-you-include/#comment-1621</link>
		<dc:creator>(@AxelTroike) (@AxelTroike)</dc:creator>
		<pubDate>Tue, 17 Jan 2012 13:09:13 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=1727#comment-1621</guid>
		<description>My comment on post by W. Sharp ( @dqchronicle ): When using data quality tools, what do you include? &lt;a href=&quot;http://t.co/FMoZGgsI&quot; rel=&quot;nofollow&quot;&gt;http://t.co/FMoZGgsI&lt;/a&gt; #DataQuality </description>
		<content:encoded><![CDATA[<p>My comment on post by W. Sharp ( @dqchronicle ): When using data quality tools, what do you include? <a href="http://t.co/FMoZGgsI" rel="nofollow">http://t.co/FMoZGgsI</a> #DataQuality </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on When using data quality tools, what do you include? by Axel Troike (@AxelTroike)</title>
		<link>http://thedataqualitychronicle.org/when-using-data-quality-tools-what-do-you-include/#comment-1573</link>
		<dc:creator>Axel Troike (@AxelTroike)</dc:creator>
		<pubDate>Mon, 16 Jan 2012 19:46:19 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=1727#comment-1573</guid>
		<description>Commented post by W. Sharp ( @dqchronicle ): When using data quality tools, what do you include? &lt;a href=&quot;http://t.co/FMoZGgsI&quot; rel=&quot;nofollow&quot;&gt;http://t.co/FMoZGgsI&lt;/a&gt; #DataQuality </description>
		<content:encoded><![CDATA[<p>Commented post by W. Sharp ( @dqchronicle ): When using data quality tools, what do you include? <a href="http://t.co/FMoZGgsI" rel="nofollow">http://t.co/FMoZGgsI</a> #DataQuality </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on When using data quality tools, what do you include? by Axel Troike</title>
		<link>http://thedataqualitychronicle.org/when-using-data-quality-tools-what-do-you-include/#comment-1572</link>
		<dc:creator>Axel Troike</dc:creator>
		<pubDate>Mon, 16 Jan 2012 19:39:21 +0000</pubDate>
		<guid isPermaLink="false">http://thedataqualitychronicle.org/?p=1727#comment-1572</guid>
		<description>First and foremost, people in any organization need a documented, common understanding of which data objects are of interest to their business and which attributes describe them. To create this common point of reference,
a data modeling tool is required.
 
In a second (better: parallel) step, the processes that create, update or delete those data objects need to be identified. Each process that manipulates data can influence data quality, for better or for worse.
 
To document the identified processes and their specifications, a process modeling tool is required. (I recommend using a process modeling tool that is integrated with the data modeling tool so that processes can be linked to the data and vice versa.)
 
With such a foundation, you can examine process after process to eliminate possible quality risks: by adding integrity checks to your transactions, by training your staff to double-check data at every point of entry, by running additional batch tools that correct or clean up data on a regular basis, etc.</description>
		<content:encoded><![CDATA[<p>First and foremost, people in any organization need a documented, common understanding of which data objects are of interest to their business and which attributes describe them. To create this common point of reference,<br />
a data modeling tool is required.</p>
<p>In a second (better: parallel) step, the processes that create, update or delete those data objects need to be identified. Each process that manipulates data can influence data quality, for better or for worse.</p>
<p>To document the identified processes and their specifications, a process modeling tool is required. (I recommend using a process modeling tool that is integrated with the data modeling tool so that processes can be linked to the data and vice versa.)</p>
<p>With such a foundation, you can examine process after process to eliminate possible quality risks: by adding integrity checks to your transactions, by training your staff to double-check data at every point of entry, by running additional batch tools that correct or clean up data on a regular basis, etc.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

