CiviCRM Community Forums (archive)

*

News:

Have a question about CiviCRM?
Get it answered quickly at the new
CiviCRM Stack Exchange Q+A site

This forum was archived on 25 November 2017. Learn more.
How to get involved.
What to do if you think you've found a bug.



  • CiviCRM Community Forums (archive) »
  • Old sections (read-only, deprecated) »
  • Developer Discussion (Moderator: Donald Lobo) »
  • Need a sub forum on data quality!!
Pages: [1]

Author Topic: Need a sub forum on data quality!!  (Read 1021 times)

Erich Schulz

  • I post frequently
  • ***
  • Posts: 142
  • Karma: 5
    • When no-one understands what you are going on about its time to start a blog
  • CiviCRM version: 4.4
  • CMS version: Drupal 7
  • MySQL version: 5.somthing
  • PHP version: 5.3.3
Need a sub forum on data quality!!
August 17, 2011, 08:14:39 pm
Both the installs i'm involved with have significant data quality conerns... which is an impediment to effective system use... biggest issue seems to be excessive duplication of

- contacts
- contact details (addresses and telephone especially)

to locate and clean dupes requires standard formatting tho... just starting to think about addresses so ran this query to look at st/Street Rd/Road etc:
Code: [Select]
SELECT SUBSTRING_INDEX(street_address,' ',-1) as LastWord,
  count(*) AS n
FROM civicrm_address
WHERE street_address IS NOT NULL
 AND SUBSTRING_INDEX(street_address,' ',-1) < 1 /* filter out numbers! */
GROUP BY LastWord
ORDER BY n DESC

have a look at your own databases!

i'll post my fixes as they emerge

boonclocks

  • I’m new here
  • *
  • Posts: 1
  • Karma: 0
  • CiviCRM version: 3.3
  • CMS version: Joomla 1.5.25
  • MySQL version: 5.1.61
  • PHP version: 5.3.3
Re: Need a sub forum on data quality!!
November 13, 2012, 09:15:09 am
Hi, any progress on this? Emails and Phone numbers are constantly duplicating but I haven't the knowledge/patience to fix it...

Erich Schulz

  • I post frequently
  • ***
  • Posts: 142
  • Karma: 5
    • When no-one understands what you are going on about its time to start a blog
  • CiviCRM version: 4.4
  • CMS version: Drupal 7
  • MySQL version: 5.somthing
  • PHP version: 5.3.3
Re: Need a sub forum on data quality!!
November 17, 2012, 04:33:31 pm
Hi Boonclocks - there is a lot of functionality built in for basic deduping - you may need to adjust your dupe rules

petednz

  • Forum Godess / God
  • I’m (like) Lobo ;)
  • *****
  • Posts: 4899
  • Karma: 193
    • Fuzion
  • CiviCRM version: 3.x - 4.x
  • CMS version: Drupal 6 and 7
Re: Need a sub forum on data quality!!
November 17, 2012, 04:46:04 pm
May also help if you can explain a bit more about how you think your duplicate emails are coming in. I.e is the same contact ending up with the same email for eg Work and Home and Billing
or are the getting multiple 'home' emails with same email address
or ...
And, are these coming in from Profiles being filled in - and if so are those profiles all set to use the same Location type. Etc.
Sign up to StackExchange and get free expert advice: https://civicrm.org/blogs/colemanw/get-exclusive-access-free-expert-help

pete davis : www.fuzion.co.nz : connect + campaign + communicate

Pages: [1]
  • CiviCRM Community Forums (archive) »
  • Old sections (read-only, deprecated) »
  • Developer Discussion (Moderator: Donald Lobo) »
  • Need a sub forum on data quality!!

This forum was archived on 2017-11-26.