Use the same approach as you described, but `DELETE` (or group, or modify ...) duplicate `PK` in the temp table before loading to the main table.

Something like:

    CREATE TEMP TABLE tmp_table 
    ON COMMIT DROP
    AS
    SELECT * 
    FROM main_table
    WITH NO DATA;
    
    COPY tmp_table FROM &#39;full/file/name/here&#39;;
    
    INSERT INTO main_table
    SELECT DISTINCT ON (PK_field) *
    FROM tmp_table
    ORDER BY (some_fields)

Details: [`CREATE TABLE AS`][1], [`COPY`][2], [`DISTINCT ON`][3]


  [1]: http://www.postgresql.org/docs/current/static/sql-createtableas.html
  [2]: http://www.postgresql.org/docs/current/static/sql-copy.html
  [3]: http://www.postgresql.org/docs/current/static/sql-select.html

PostgreSQL 9.5 now has [upsert functionality][1]. You can follow Igor&#39;s instructions, except that final INSERT includes the clause ON CONFLICT DO NOTHING.

    INSERT INTO main_table
    SELECT *
    FROM tmp_table
    ON CONFLICT DO NOTHING


  [1]: https://www.postgresql.org/docs/current/static/sql-insert.html#SQL-ON-CONFLICT

Igor’s answer helped me a lot, but I also ran into the problem Nate mentioned in his comment. Then I had the problem—maybe in addition to the question here—that the new data did not only contain duplicates internally but also duplicates with the existing data. What worked for me was the following.

    CREATE TEMP TABLE tmp_table AS SELECT * FROM newsletter_subscribers;
    COPY tmp_table (name, email) FROM stdin DELIMITER &#39; &#39; CSV;
    SELECT count(*) FROM tmp_table;  -- Just to be sure
    TRUNCATE newsletter_subscribers;
    INSERT INTO newsletter_subscribers
        SELECT DISTINCT ON (email) * FROM tmp_table
        ORDER BY email, subscription_status;
    SELECT count(*) FROM newsletter_subscribers;  -- Paranoid again

Both internal and external duplicates become the same in the `tmp_table` and then the `DISTINCT ON (email)` part removes them. The `ORDER BY` makes sure that the desired row comes first in the result set and `DISTINCT` then discards all further rows.

Insert into a temp table grouped by the key so you get rid of the duplicates

and then insert if not exists

I&#39;m trying to add all of the calorie contents in my javascript like this:

    $(function() {
        var data = [];
		
        $( &quot;#draggable1&quot; ).draggable();
        $( &quot;#draggable2&quot; ).draggable();
        $( &quot;#draggable3&quot; ).draggable();

		$(&quot;#droppable_box&quot;).droppable({
			drop: function(event, ui) {
				var currentId = $(ui.draggable).attr(&#39;id&#39;);
				var total = 0;
				data.push($(ui.draggable).attr(&#39;id&#39;));
		
				if(currentId == &quot;draggable1&quot;){
				var myInt1 = parseFloat($(&#39;#MealplanCalsPerServing1&#39;).val());
				}
				if(currentId == &quot;draggable2&quot;){
				var myInt2 = parseFloat($(&#39;#MealplanCalsPerServing2&#39;).val());
				}
				if(currentId == &quot;draggable3&quot;){
				var myInt3 = parseFloat($(&#39;#MealplanCalsPerServing3&#39;).val());
				}
			if ( typeof myInt1 === &#39;undefined&#39; || !myInt1 ) {
			myInt1 = parseInt(0);
			}
			if ( typeof myInt2 === &#39;undefined&#39; || !myInt2){
			myInt2 = parseInt(0);
			}
			if ( typeof myInt3 === &#39;undefined&#39; || !myInt3){
			myInt3 = parseInt(0);
			}
			total = parseFloat(myInt1 + myInt2 + myInt3);
			$(&#39;#response&#39;).append(total);
			}
		});
		$(&#39;#myId&#39;).click(function(event) {
			$.post(&quot;process.php&quot;, ({ id: data }), function(return_data, status) {
				alert(data);
				//alert(total);
			});
		});
	});


Instead of adding the variables they get concatenated. I&#39;ve tried using parseInt, parseFloat, and Number but I still just get concatenation and not addition. Please look at the view source at http://maureenmoore.com/momp_112412/121912_800.html

How to force addition instead of concatenation in javascript

I&#39;ll start with an example. Here&#39;s an equivalent of `List.fill` for tuples as a macro in Scala 2.10:

    import scala.language.experimental.macros
    import scala.reflect.macros.Context
    
    object TupleExample {
      def fill[A](arity: Int)(a: A): Product = macro fill_impl[A]
    
      def fill_impl[A](c: Context)(arity: c.Expr[Int])(a: c.Expr[A]) = {
        import c.universe._
    
        arity.tree match {
          case Literal(Constant(n: Int)) if n &lt; 23 =&gt; c.Expr(
            Apply(
              Select(Ident(&quot;Tuple&quot; + n.toString), &quot;apply&quot;),
              List.fill(n)(a.tree)
            )
          )
          case _ =&gt; c.abort(
            c.enclosingPosition,
            &quot;Desired arity must be a compile-time constant less than 23!&quot;
          )
        }
      }
    }

We can use this method as follows:

    scala&gt; TupleExample.fill(3)(&quot;hello&quot;)
    res0: (String, String, String) = (hello,hello,hello)

This guy is a weird bird in a couple of respects. First, the `arity` argument must be a literal integer, since we need to use it at compile time. In previous versions of Scala there was no way (as far as I know) for a method even to tell whether one of its arguments was a compile-time literal or not.

Second, the `Product` return type [is a lie][1]—the static return type will include the specific arity and element type determined by the arguments, as shown above.

So how would I document this thing? I&#39;m not expecting Scaladoc support at this point, but I&#39;d like to have a sense of conventions or best practices (beyond just making sure the compile-time error messages are clear) that would make running into a macro method—with its potentially bizarre demands—less surprising for users of a Scala 2.10 library.

The most mature demonstrations of the new macro system (e.g., [ScalaMock][2], [Slick][3], the others listed [here][4]) are still relatively undocumented at the method level. Any examples or pointers would be appreciated, including ones from other languages with similar macro systems.


  [1]: https://stackoverflow.com/a/13673950/334519
  [2]: https://github.com/paulbutcher/ScalaMock
  [3]: https://github.com/slick/slick
  [4]: http://scalamacros.org/news/2012/11/05/status-update.html

Documenting Scala 2.10 macros

I have to dump large amount of data from file to a table PostgreSQL. I know it does not support &#39;Ignore&#39; &#39;replace&#39; etc as done in MySql. Almost all posts regarding this in the web suggested the same thing like dumping the data to a temp table and then do a &#39;insert ... select ... where not exists...&#39;.

This will not help in one case, where the file data itself contained duplicate primary keys.
Any body have an idea on how to handle this in PostgreSQL?

P.S. I am doing this from a java program, if it helps

To ignore duplicate keys during &#39;copy from&#39; in postgresql

I have to dump large amount of data from file to a table PostgreSQL. I know it does not support 'Ignore' 'replace' etc as done in MySql. Almost all posts regarding this in the web suggested the same thing like dumping the data to a temp table and then do a 'insert ... select ... where not exists...'.
This will not help in one case, where the file data itself contained duplicate primary keys.
Any body have an idea on how to handle this in PostgreSQL?
P.S. I am doing this from a java program, if it helps

So here&#39;s what I want to do on my _MySQL_ database.

I would like to do:

    SELECT *
        FROM itemsOrdered
        WHERE purchaseOrder_ID = &#39;@purchaseOrdered_ID&#39;
            AND status = &#39;PENDING&#39;

If that would not return any rows, which is possible through `if(dr.HasRows == false)`, I would now create an `UPDATE` in the `purchaseOrder` database:

    UPDATE purchaseOrder
        SET purchaseOrder_status = &#39;COMPLETED&#39;
        WHERE purchaseOrder_ID = &#39;@purchaseOrder_ID&#39;

How would I be able to make this process a little shorter?


How do I put an &#39;if clause&#39; in an SQL string?

Title says it all, why can&#39;t I use a windowed function in a where clause in SQL Server?

This query makes perfect sense:

    select id, sales_person_id, product_type, product_id, sale_amount
    from Sales_Log
    where 1 = row_number() over(partition by sales_person_id, product_type, product_id order by sale_amount desc)

But it doesn&#39;t work. Is there a better way than a CTE/Subquery?

**EDIT**

For what its worth this is the query with a CTE:

    with Best_Sales as (
        select id, sales_person_id, product_type, product_id, sale_amount, row_number() over (partition by sales_person_id, product_type, product_id order by sales_amount desc) rank
        from Sales_log
    )
    select id, sales_person_id, product_type, product_id, sale_amount
    from Best_Sales
    where rank = 1

**EDIT**

+1 for the answers showing with a subquery, but really I&#39;m looking for the reasoning behind not being able to use windowing functions in where clauses.

Why no windowed functions in where clauses?

I&#39;ve had trouble understanding joins in sql and came upon this image which I think might help me. The problem is that I don&#39;t fully understand it.  For example, the join in the top right corner of the image, which colors the full B circle red and but only the overlap from A. The image makes it seem like circle B is the primary focus of the sql statement, but the sql statement itself, by starting with A (select from A, join B), conveys the opposite impression to me, namely that A would be the focus of the sql statement. 

Similarly, the image below that only includes data from the B circle, so why is A included at all in the join statement?

Question: Working clockwise from the top right and finishing in the center, can someone provide more information about the representation of each sql image, explaining 

a) why a join would be necessary in each case (for example, especially in situations where no data&#39;s taken from A or B i.e. where only A or B but not both is colored)

b) and any other detail that would clarify why the image is a good representation of the sql

![sql join diagram][1]

  [1]: http://i.stack.imgur.com/UI25E.jpg

sql joins as venn diagram

I got an error - 

&gt; Column &#39;Employee.EmpID&#39; is invalid in the select list because it is
&gt; not contained in either an aggregate function or the GROUP BY clause.

-----------------------------------------------------------------------------------------

    select loc.LocationID, emp.EmpID
    from Employee as emp full join Location as loc 
    on emp.LocationID = loc.LocationID
    group by loc.LocationID 


This situation fits into the answer given by Bill Karwin.

correction for above, fits into answer by ExactaBox - 

    select loc.LocationID, count(emp.EmpID) -- not count(*), don&#39;t want to count nulls
    from Employee as emp full join Location as loc 
    on emp.LocationID = loc.LocationID
    group by loc.LocationID 

-------------------------------------------------------------------------------------------

**ORIGINAL QUESTION -**

For the SQL query -
    
    select *
    from Employee as emp full join Location as loc 
    on emp.LocationID = loc.LocationID
    group by (loc.LocationID)


I don&#39;t understand why I get this error. All I want to do is join the tables and then group all the employees in a particular location together. 

**I think I have a partial explanation for my own question. Tell me if its ok -** 

To group all employees that work in the same location we have to first mention the LocationID. 

Then, we cannot/do not mention each employee ID next to it. Rather, we mention the total number of employees in that location, ie we should SUM() the employees working in that location. Why do we do it the latter way, i am not sure. 
So, this explains the &quot;it is not contained in either an aggregate function&quot; part of the error.

What is the explanation for the **`GROUP BY`** clause part of the error ?



Reason for Column is invalid in the select list because it is not contained in either an aggregate function or the GROUP BY clause

&gt; **Possible Duplicate:**  
&gt; [Mysql Like Case Sensitive](https://stackoverflow.com/questions/8083455/mysql-like-case-sensitive)  

&lt;!-- End of automatically inserted text --&gt;

Mysql ignores case for its LIKE comparisons.

How can you force it to perform case-sensitive LIKE comparisons?

How do you force mysql LIKE to be case sensitive?

I have a table in `PostgreSQL 8.3` with 2 `timestamp` columns. I would like to get the difference between these `timestamps` in seconds. Could you please help me how to get this done?

    TableA
    (
      timestamp_A timestamp,
      timestamp_B timestamp
    )

I need to get something like `(timestamo_B - timestamp_A)` in seconds *(not just the difference between seconds, it should include hours, minutes etc)*.

Find difference between timestamps in seconds in PostgreSQL

I am trying to configure ssl certificate for PostgreSQL server. I have created a certificate file (server.crt) and key (server.key) in data directory and update the parameter SSL to &quot;on&quot; to enable secure connection. 

I just want only the server to be authenticated with server certificates on the client side and don&#39;t require the authenticity of client at server side. I am using psql as a client to connect and execute the commands.

I am using PostgreSQL 8.4 and Linux. I tried with the below command to connect to server with SSL enabled

           psql &quot;postgresql://localhost:2345/postgres?sslmode=require&quot;

but I am getting 

           psql: invalid connection option &quot;postgresql://localhost:2345/postgres?sslmode&quot;

What am doing wrong here? Is the way I am trying to connect to server with SSL mode enabled is correct? Is it fine to authenticate only server and not the client ?  


Using psql to connect to PostgreSQL in SSL mode

Hi I am having trouble with postgres. I don&#39;t remember my postgres password and don&#39;t know how to change the password. I&#39;m guessing I should change the md5 password settings I set a month ago, but I don&#39;t know how to find the file and open it using my terminal. Can someone help?

Postgresql: How to find pg_hba.conf file using Mac OS X

I have a query like this that nicely generates a series of dates between 2 given dates:

    select date &#39;2004-03-07&#39; + j - i as AllDate 
    from generate_series(0, extract(doy from date &#39;2004-03-07&#39;)::int - 1) as i,
         generate_series(0, extract(doy from date &#39;2004-08-16&#39;)::int - 1) as j

It generates 162 dates between `2004-03-07` and `2004-08-16` and this what I want. The problem with this code is that it wouldn&#39;t give the right answer when the two dates are from different years, for example when I try `2007-02-01` and `2008-04-01`.

Is there a better solution?


Generating time series between two dates in PostgreSQL

I have two tables like here:

    DROP   TABLE  IF EXISTS schemas.book;
    DROP   TABLE  IF EXISTS schemas.category;
    DROP   SCHEMA IF EXISTS schemas;
    CREATE SCHEMA schemas;
    
    CREATE TABLE schemas.category (
      id          BIGSERIAL PRIMARY KEY,
      name        VARCHAR   NOT NULL,
      UNIQUE(name)
    );
    
    CREATE TABLE schemas.book (
      id          BIGSERIAL PRIMARY KEY,
      published   DATE      NOT NULL,
      category_id BIGINT    NOT NULL REFERENCES schemas.category
                                ON DELETE CASCADE 
                                ON UPDATE CASCADE,
      author      VARCHAR   NOT NULL,
      name        VARCHAR   NOT NULL,
      UNIQUE(published, author, name),
      FOREIGN KEY(category_id) REFERENCES schemas.category (id)
    );

So the logic is simple, after user removes all book under category x, x gets removed from cats, i tried method above but doesn&#39;t work, after i clean table book, table category still populated, what&#39;s wrong?

Content Type	Original Author	Original Content on Stackoverflow
Question	Kam	View Question on Stackoverflow
Solution 1 - Sql	Ihor Romanchenko	View Answer on Stackoverflow
Solution 2 - Sql	Barrel Roll	View Answer on Stackoverflow
Solution 3 - Sql	Denis Drescher	View Answer on Stackoverflow
Solution 4 - Sql	Jester	View Answer on Stackoverflow

To ignore duplicate keys during 'copy from' in postgresql

Sql Problem Overview

Sql Solutions

Solution 1 - Sql

Solution 2 - Sql

Solution 3 - Sql

Solution 4 - Sql

Documenting Scala 2.10 macros

How to force addition instead of concatenation in javascript

Attributions