CiviCRM Community Forums (archive)

*

News:

Have a question about CiviCRM?
Get it answered quickly at the new
CiviCRM Stack Exchange Q+A site

This forum was archived on 25 November 2017. Learn more.
How to get involved.
What to do if you think you've found a bug.



  • CiviCRM Community Forums (archive) »
  • Old sections (read-only, deprecated) »
  • Support »
  • Using CiviCRM »
  • Using Core CiviCRM Functions (Moderator: Yashodha Chaku) »
  • Import Problems, Deduping
Pages: [1]

Author Topic: Import Problems, Deduping  (Read 2139 times)

lcdweb

  • Forum Godess / God
  • I live on this forum
  • *****
  • Posts: 1620
  • Karma: 116
    • www.lcdservices.biz
  • CiviCRM version: many versions...
  • CMS version: Joomla/Drupal
  • MySQL version: 5.1+
  • PHP version: 5.2+
Import Problems, Deduping
September 03, 2007, 09:59:31 pm
Using current CiviCRM 1.8 on Joomla 1.0.13 (stable)

I've been working on importing my data into CiviCRM. All was going well for a while, but I've run into some recent problems.

I first imported all the organization records into CiviCRM, being careful not to have dups. Great, no prob. Then I imported my individuals, and used the relationship option to say the individual was an employee of the organization name field. Based on the documentation, I think this should have imported the records in such a way so as create the parent-child relationships. Unfortunately, it didn't---at least not consistently. In a few cases it seemed to have created that, but for the most part it just created duplicate org records.

Now I have a database full of duplicate orgs. I hoped the deduping mechanism in CiviCRM would let me hunt down the dups and combine them or something, but when I run the dedupe tool from the admin menu, the page breaks on me:

Code: [Select]
500 Server Error
A misconfiguration on the server caused a hiccup. Check the server logs, fix the problem, then try again.
URL: http://www.....org/administrator/index2.php?option=com_civicrm&task=civicrm/admin/dedupefind&reset=1&action=update&rgid=2

I've tried deduping both indiv and org contacts, and I get the same error msg.

Any suggestions on what is the best way to backtrack and fix things?

-Brian
support CiviCRM through 'make it happen' initiatives!
http://civicrm.org/mih

sewen

  • Guest
Re: Import Problems, Deduping
September 13, 2007, 06:56:13 am
I am doing the same thing Brian is doing and have gotten the same results.  I have imported my members /businesses as Organizations and am now trying to import the employees as Individuals, with one of the fields having the business name (using employer-employee relationship).  I have made sure the company names match, but civiCRM still creates new business records.

Any Solution??

Scott E.

lcdweb

  • Forum Godess / God
  • I live on this forum
  • *****
  • Posts: 1620
  • Karma: 116
    • www.lcdservices.biz
  • CiviCRM version: many versions...
  • CMS version: Joomla/Drupal
  • MySQL version: 5.1+
  • PHP version: 5.2+
Re: Import Problems, Deduping
September 13, 2007, 07:06:24 am
Scott,
I figured out a couple problems in my process that cleared things up --

1) When creating your .csv file for import, be sure to use UTF-8 as the export standard. I was exporting from MS Access to .csv, and it defaults to a Windows code format that can create problems when importing into CiviCRM.

2) I broke my import into chunks of about 200-300 (originally 2,000). That's a bit of a pain, but is worth it. I ran into no problems, saw the full list for each file successfully import, and when I did the individuals (after the orgs), the relationships were built perfectly.

I still have issues with the dedupe functions (they crash on me). But I handled most of my deduping before I imported, so it's not an immediate need.

-Brian
support CiviCRM through 'make it happen' initiatives!
http://civicrm.org/mih

sewen

  • Guest
Re: Import Problems, Deduping
September 13, 2007, 01:17:49 pm
Thanks Brian, I looked at Excel and it looks like you can use encoding: Unicode(UTF-8).  So I will try it out.

Piotr Szotkowski

  • I live on this forum
  • *****
  • Posts: 1497
  • Karma: 57
Re: Import Problems, Deduping
September 14, 2007, 03:16:14 am
Quote from: lcdweb on September 13, 2007, 07:06:24 am
1) When creating your .csv file for import, be sure to use UTF-8 as the export standard. I was exporting from MS Access to .csv, and it defaults to a Windows code format that can create problems when importing into CiviCRM.

You can setup CiviCRM to use a non-UTF encoding on imports: Administer CiviCRM → Global Settings → Localization → Legacy Encoding, but it’s better to have a UTF-8 file in the first place.

Quote from: lcdweb on September 13, 2007, 07:06:24 am
I still have issues with the dedupe functions (they crash on me). But I handled most of my deduping before I imported, so it's not an immediate need.

Is there a chance you could send me (shot@civicrm.org) your database? I won’t disclose it to anyone, of course, and will delete it as soon as I’m done with the forensics. You can encrypt the email with my GPG key, D128F14A.
If you found the above helpful, please consider helping us in return – you can even steer CiviCRM’s future and help us extend CiviCRM in ways useful to you.

Piotr Szotkowski

  • I live on this forum
  • *****
  • Posts: 1497
  • Karma: 57
Re: Import Problems, Deduping
September 14, 2007, 03:17:24 am
Alternatively, are you sure your database is running in UTF-8 mode? It’s a bit hard to debug without the actual error (do you have access to your server’s logs?).
If you found the above helpful, please consider helping us in return – you can even steer CiviCRM’s future and help us extend CiviCRM in ways useful to you.

lcdweb

  • Forum Godess / God
  • I live on this forum
  • *****
  • Posts: 1620
  • Karma: 116
    • www.lcdservices.biz
  • CiviCRM version: many versions...
  • CMS version: Joomla/Drupal
  • MySQL version: 5.1+
  • PHP version: 5.2+
Re: Import Problems, Deduping
September 14, 2007, 09:08:14 am
Yes, it's running utf8_unicode_ci

And sure, I'll send you a copy of the db dump to take a look at a little later today.
-Brian
support CiviCRM through 'make it happen' initiatives!
http://civicrm.org/mih

sewen

  • Guest
Re: Import Problems, Deduping
September 15, 2007, 11:44:45 am
Hi Brian

I have spent hours trying to import Individuals as employees of Businesses (Organizations) and it seems haphazard.  Sometimes it work (small sample), then when I go to import a larger sample I get duplicate Businesses without any addresses (or other information).  I've played around with UTF-8 encoding and i still can't find a trend.  Right now I think I'm going to import individuals without using a relationship and create the relationship later.

Scott E.

lcdweb

  • Forum Godess / God
  • I live on this forum
  • *****
  • Posts: 1620
  • Karma: 116
    • www.lcdservices.biz
  • CiviCRM version: many versions...
  • CMS version: Joomla/Drupal
  • MySQL version: 5.1+
  • PHP version: 5.2+
Re: Import Problems, Deduping
September 15, 2007, 12:19:50 pm
Before importing either orgs and indivs, I was rigorous about making sure the names matched perfectly (obviously necessary since org name is essentially the key index field). As for quantity, I imported in chunks of 200. I'm no expert on the code for this function -- just sharing what worked for me.
-Brian
support CiviCRM through 'make it happen' initiatives!
http://civicrm.org/mih

sewen

  • Guest
Re: Import Problems, Deduping
September 15, 2007, 02:35:05 pm
Hi Brian,

I just successfully imported 1056 Individual Contacts, using the relationship of Employer. 

Before, I was using "No DuplicateChecking" for Duplicate Contacts.  This time I used "Skip".  I don't know why it would matter, but it seemed to have worked.

On to the next challenge.

Scott E.


Pages: [1]
  • CiviCRM Community Forums (archive) »
  • Old sections (read-only, deprecated) »
  • Support »
  • Using CiviCRM »
  • Using Core CiviCRM Functions (Moderator: Yashodha Chaku) »
  • Import Problems, Deduping

This forum was archived on 2017-11-26.