I have a tab separated text file in the format id | field 1 | field 2 ... I want to insert this into a mysql database with id as the primary key but the text file may contain duplicate id's . How to make sure that there's just one entry corresponding to each id. How to make a choice between two lines having the same id (Yes, they might not be consistent, but it's okay to choose one over other like the first or the last occurrence )

Reputation: 484

making a database in mysql without duplicate values

I have a tab separated text file in the format

id | field 1 | field 2 ...

I want to insert this into a mysql database with id as the primary key but the text file may contain duplicate id's .

How to make sure that there's just one entry corresponding to each id.
How to make a choice between two lines having the same id (Yes, they might not be consistent, but it's okay to choose one over other like the first or the last occurrence )

Upvotes: 0

Answers (3)

Cyberfox

Reputation: 1145

Presuming a Unix shell, I'd do this:

awk '!x[$1]++' inputfile.tsv > uniqfile.tsv

then do your import off of the uniqfile.

edit: to be clear, that script uniq's the input file based on the first field by only outputting rows that do not already have a non-zero value in a hash keyed off of the first field.

Upvotes: 0

KV Prajapati

Reputation: 94653

Read line by line from text file, parse that line and use INSERT ... ON DUPLICATE KEY UPDATE Syntax.

Upvotes: 2

Nobita

Reputation: 23713

I would do a SELECT before INSERT and count the number of rows returned by the SELECT. Something like this:

SELECT * FROM yourTable WHERE yourTable.id = :id

If that returns any row, don't insert and go to next. Otherwise insert it.

Edit: This would be a post strategy. It would be good if you could add a Unique Constraint to guarantee uniqueness. Something like:

ALTER TABLE yourTable ADD CONSTRAINT ukID UNIQUE (id)

Upvotes: 1

making a database in mysql without duplicate values

Answers (3)

Related Questions