PostgreSQL: SQL script to get a list of all tables that have a particular column as foreign key

SqlDatabasePostgresqlForeign Keys

Sql Problem Overview


I'm using PostgreSQL and I'm trying to list all the tables that have a particular column from a table as a foreign-key/reference. Can this be done? I'm sure this information is stored somewhere in information_schema but I have no idea how to start querying it.

Sql Solutions


Solution 1 - Sql

SELECT
    r.table_name
FROM information_schema.constraint_column_usage       u
INNER JOIN information_schema.referential_constraints fk
           ON u.constraint_catalog = fk.unique_constraint_catalog
               AND u.constraint_schema = fk.unique_constraint_schema
               AND u.constraint_name = fk.unique_constraint_name
INNER JOIN information_schema.key_column_usage        r
           ON r.constraint_catalog = fk.constraint_catalog
               AND r.constraint_schema = fk.constraint_schema
               AND r.constraint_name = fk.constraint_name
WHERE
    u.column_name = 'id' AND
    u.table_catalog = 'db_name' AND
    u.table_schema = 'public' AND
    u.table_name = 'table_a'

This uses the full catalog/schema/name triplet to identify a db table from all 3 information_schema views. You can drop one or two as required.

The query lists all tables that have a foreign key constraint against the column 'a' in table 'd'

Solution 2 - Sql

The other solutions are not guaranteed to work in postgresql, as the constraint_name is not guaranteed to be unique; thus you will get false positives. PostgreSQL used to name constraints silly things like '$1', and if you've got an old database you've been maintaining through upgrades, you likely still have some of those around.

Since this question was targeted AT PostgreSQL and that is what you are using, then you can query the internal postgres tables pg_class and pg_attribute to get a more accurate result.

NOTE: FKs can be on multiple columns, thus the referencing column (attnum of pg_attribute) is an ARRAY, which is the reason for using array_agg in the answer.

The only thing you need plug in is the TARGET_TABLE_NAME:

select 
  (select r.relname from pg_class r where r.oid = c.conrelid) as table, 
  (select array_agg(attname) from pg_attribute 
   where attrelid = c.conrelid and ARRAY[attnum] <@ c.conkey) as col, 
  (select r.relname from pg_class r where r.oid = c.confrelid) as ftable 
from pg_constraint c 
where c.confrelid = (select oid from pg_class where relname = 'TARGET_TABLE_NAME');

If you want to go the other way (list all of the things a specific table refers to), then just change the last line to:

where c.conrelid = (select oid from pg_class where relname = 'TARGET_TABLE_NAME');

Oh, and since the actual question was to target a specific column, you can specify the column name with this one:

select (select r.relname from pg_class r where r.oid = c.conrelid) as table, 
       (select array_agg(attname) from pg_attribute 
        where attrelid = c.conrelid and ARRAY[attnum] <@ c.conkey) as col, 
       (select r.relname from pg_class r where r.oid = c.confrelid) as ftable 
from pg_constraint c 
where c.confrelid = (select oid from pg_class where relname = 'TARGET_TABLE_NAME') and 
      c.confkey @> (select array_agg(attnum) from pg_attribute 
                    where attname = 'TARGET_COLUMN_NAME' and attrelid = c.confrelid);

Solution 3 - Sql

This query requires only the referenced table name and column name, and produces a result set containing both sides of the foreign key.

select confrelid::regclass, af.attname as fcol,
       conrelid::regclass, a.attname as col
from pg_attribute af, pg_attribute a,
  (select conrelid,confrelid,conkey[i] as conkey, confkey[i] as confkey
   from (select conrelid,confrelid,conkey,confkey,
                generate_series(1,array_upper(conkey,1)) as i
         from pg_constraint where contype = 'f') ss) ss2
where af.attnum = confkey and af.attrelid = confrelid and
      a.attnum = conkey and a.attrelid = conrelid 
  AND confrelid::regclass = 'my_table'::regclass AND af.attname = 'my_referenced_column';

Example result set:

confrelid |         fcol         |   conrelid    |     col
----------+----------------------+---------------+-------------
 my_table | my_referenced_column | some_relation | source_type
 my_table | my_referenced_column | some_feature  | source_type

All credit to Lane and Krogh at the PostgreSQL forum.

Solution 4 - Sql

Personally, I prefer to query based on the referenced unique constraint rather than the column. That would look something like this:

SELECT rc.constraint_catalog,
       rc.constraint_schema||'.'||tc.table_name AS table_name,
       kcu.column_name,
       match_option,
       update_rule,
       delete_rule
FROM information_schema.referential_constraints AS rc 
    JOIN information_schema.table_constraints AS tc USING(constraint_catalog,constraint_schema,constraint_name)
    JOIN information_schema.key_column_usage AS kcu USING(constraint_catalog,constraint_schema,constraint_name)
WHERE unique_constraint_catalog='catalog'
    AND unique_constraint_schema='schema'
    AND unique_constraint_name='constraint name';

Here is a version that allows querying by column name:

SELECT rc.constraint_catalog,
       rc.constraint_schema||'.'||tc.table_name AS table_name,
       kcu.column_name,
       match_option,
       update_rule,
       delete_rule
FROM information_schema.referential_constraints AS rc
    JOIN information_schema.table_constraints AS tc USING(constraint_catalog,constraint_schema,constraint_name)
    JOIN information_schema.key_column_usage AS kcu USING(constraint_catalog,constraint_schema,constraint_name)
    JOIN information_schema.key_column_usage AS ccu ON(ccu.constraint_catalog=rc.unique_constraint_catalog AND ccu.constraint_schema=rc.unique_constraint_schema AND ccu.constraint_name=rc.unique_constraint_name)
WHERE ccu.table_catalog='catalog'
    AND ccu.table_schema='schema'
    AND ccu.table_name='name'
    AND ccu.column_name='column';

Solution 5 - Sql

A simple request for recovered the names of foreign key as well as the names of the tables:

SELECT CONSTRAINT_NAME, table_name
FROM
   information_schema.table_constraints 
WHERE table_schema='public' and constraint_type='FOREIGN KEY'

Solution 6 - Sql

SELECT
  main_table.table_name            AS main_table_table_name,
  main_table.column_name           AS main_table_column_name,
  main_table.constraint_name       AS main_table_constraint_name,
  info_other_table.table_name      AS info_other_table_table_name,
  info_other_table.constraint_name AS info_other_table_constraint_name,
  info_other_table.column_name     AS info_other_table_column_name
FROM INFORMATION_SCHEMA.CONSTRAINT_COLUMN_USAGE main_table
  INNER JOIN INFORMATION_SCHEMA.REFERENTIAL_CONSTRAINTS other_table
    ON other_table.unique_constraint_name = main_table.constraint_name
  INNER JOIN INFORMATION_SCHEMA.KEY_COLUMN_USAGE info_other_table
    ON info_other_table.constraint_name = other_table.constraint_name
WHERE main_table.table_name = 'MAIN_TABLE_NAME';

Solution 7 - Sql

If you use the psql client, you can simply issue the \d table_name command to see which tables reference the given table. From the linked documentation page:

> \d[S+] [ pattern ] > > For each relation (table, view, materialized view, index, sequence, or foreign table) or composite type matching the pattern, > show all columns, their types, the tablespace (if not the default) and > any special attributes such as NOT NULL or defaults. Associated > indexes, constraints, rules, and triggers are also shown. For foreign > tables, the associated foreign server is shown as well.

Solution 8 - Sql

Table constraints can include multiple columns. The trick to getting this right is to join each column by their constraint ordinal positions. If you don't join correctly your script will blow up with duplicate rows  whenever a table has multiple columns in a unique constraint.

Query

Lists all foreign key columns and their references.

select
       -- unique reference info
       ref.table_catalog    as ref_database,
       ref.table_schema     as ref_schema,
       ref.table_name       as ref_table,
       ref.column_name      as ref_column,
       refd.constraint_type as ref_type, -- e.g. UNIQUE or PRIMARY KEY

       -- foreign key info
       fk.table_catalog as fk_database,
       fk.table_schema  as fk_schema,
       fk.table_name    as fk_table,
       fk.column_name   as fk_column,
       map.update_rule  as fk_on_update,
       map.delete_rule  as fk_on_delete

-- lists fk constraints and maps them to pk constraints
from information_schema.referential_constraints as map

-- join unique constraints (e.g. PKs constraints) to ref columns info
inner join information_schema.key_column_usage as ref
    on  ref.constraint_catalog = map.unique_constraint_catalog
    and ref.constraint_schema = map.unique_constraint_schema
    and ref.constraint_name = map.unique_constraint_name

-- optional: to include reference constraint type
left join information_schema.table_constraints as refd
    on  refd.constraint_catalog = ref.constraint_catalog
    and refd.constraint_schema = ref.constraint_schema
    and refd.constraint_name = ref.constraint_name

-- join fk columns to the correct ref columns using ordinal positions
inner join information_schema.key_column_usage as fk
    on  fk.constraint_catalog = map.constraint_catalog
    and fk.constraint_schema = map.constraint_schema
    and fk.constraint_name = map.constraint_name
    and fk.position_in_unique_constraint = ref.ordinal_position --IMPORTANT!

Explanation

consider the relationship between these to tables.

create table foo (
    a int,
    b int,
    primary key (a,b)
);

create table bar (
    c int,
    d int,
    foreign key (c,d) references foo (b,a) -- i flipped a,b to make a point later.
);

get table constraint names

select * from information_schema.table_constraints where table_name in ('foo','bar');
| constraint_name | table_name | constraint_type |
| --------------- | ---------- | --------------- |
| foo_pkey        | foo        | PRIMARY KEY     |
| bar_c_d_fkey    | bar        | FOREIGN KEY     |

constraint references

select * from information_schema.referential_constraints where constraint_name in ('bar_c_d_fkey');
| constraint_name | unique_constraint_name |
| --------------- | ---------------------- |
| bar_c_d_fkey    | foo_pkey               |

constraint ordinal_position of column.

select * from information_schema.key_column_usage where table_name in ('foo','bar');
| constraint_name | table_name | column_name | ordinal_position | position_in_unique_constraint |
| --------------- | ---------- | ----------- | ---------------- | ----------------------------- |
| foo_pkey        | foo        | a           | 1                | null                          |
| foo_pkey        | foo        | b           | 2                | null                          |
| bar_c_d_fkey    | bar        | c           | 1                | 2                             |
| bar_c_d_fkey    | bar        | d           | 2                | 1                             |

Now all that's left is to join them together. The main query above is one way you could do so.

Solution 9 - Sql

I turned @Tony K's answer into a reusable function that takes in a schema/table/column tuple and returns all tables that have a foreign key relationship: https://gist.github.com/colophonemes/53b08d26bdd219e6fc11677709e8fc6c

I needed something like this in order to implement a script that merged two records into a single record.

Function:

CREATE SCHEMA utils;

-- Return type for the utils.get_referenced_tables function
CREATE TYPE utils.referenced_table_t AS (
  constraint_name name,
  schema_name name,
  table_name name,
  column_name name[],
  foreign_schema_name name,
  foreign_table_name name
);
/*
 A function to get all downstream tables that are referenced to a table via a foreign key relationship
 The function looks at all constraints that contain a reference to the provided schema-qualified table column
 It then generates a list of the schema/table/column tuples that are the target of these references
 Idea based on https://stackoverflow.com/a/21125640/7114675
 Postgres built-in reference:
 - pg_namespace  => schemas
 - pg_class      => tables
 - pg_attribute  => table columns
 - pg_constraint => constraints
*/
CREATE FUNCTION utils.get_referenced_tables (schema_name name, table_name name, column_name name)
RETURNS SETOF utils.referenced_table_t AS $$
  -- Wrap the internal query in a select so that we can order it more easily
  SELECT * FROM (
    -- Get human-readable names for table properties by mapping the OID's stored on the pg_constraint
    -- table to the underlying value on their relevant table.
    SELECT
      -- constraint name - we get this directly from the constraints table
      pg_constraint.conname AS constraint_name,
      -- schema_name
      (
        SELECT pg_namespace.nspname FROM pg_namespace
        WHERE pg_namespace.oid = pg_constraint.connamespace
      ) as schema_name,
      -- table_name
      (
        SELECT pg_class.relname FROM pg_class
        WHERE pg_class.oid = pg_constraint.conrelid
      ) as table_name,
      -- column_name
      (
        SELECT array_agg(attname) FROM pg_attribute
        WHERE attrelid = pg_constraint.conrelid
          AND ARRAY[attnum] <@ pg_constraint.conkey
      ) AS column_name,
      -- foreign_schema_name
      (
        SELECT pg_namespace.nspname FROM pg_namespace
        WHERE pg_namespace.oid = (
          SELECT pg_class.relnamespace FROM pg_class
          WHERE pg_class.oid = pg_constraint.confrelid
        )
      ) AS foreign_schema_name,
      -- foreign_table_name
      (
        SELECT pg_class.relname FROM pg_class
        WHERE pg_class.oid = pg_constraint.confrelid
      ) AS foreign_table_name
    FROM pg_constraint
    -- confrelid = constraint foreign relation id = target schema + table
    WHERE confrelid IN (
        SELECT oid FROM pg_class
        -- relname = target table name
        WHERE relname = get_referenced_tables.table_name
        -- relnamespace = target schema
          AND relnamespace = (
            SELECT oid FROM pg_namespace
            WHERE nspname = get_referenced_tables.schema_name
          )
    )
    -- confkey = constraint foreign key = the column on the foreign table linked to the target column
    AND confkey @> (
      SELECT array_agg(attnum) FROM pg_attribute
      WHERE attname = get_referenced_tables.column_name
      AND attrelid = pg_constraint.confrelid
    )
  ) a
  ORDER BY
    schema_name,
    table_name,
    column_name,
    foreign_table_name,
    foreign_schema_name
 ;
$$ LANGUAGE SQL STABLE;

Example usage:

/*
  Function to merge two people into a single person
  The primary person (referenced by primary_person_id) will be retained, the secondary person
  will have all their records re-referenced to the primary person, and then the secondary person
  will be deleted
  Note that this function may be destructive! For most tables, the records will simply be merged,
  but in cases where merging would violate a UNIQUE or EXCLUSION constraint, the secondary person's
  respective records will be dropped. For example, people cannot have overlapping pledges (on the
  pledges.pledge table). If the secondary person has a pledge that overlaps with a pledge that is
  on record for the primary person, the secondary person's pledge will just be deleted.
*/
CREATE FUNCTION utils.merge_person (primary_person_id BIGINT, secondary_person_id BIGINT)
RETURNS people.person AS $$
DECLARE
  _referenced_table utils.referenced_table_t;
  _col name;
  _exec TEXT;
  _primary_person people.person;
BEGIN
  -- defer all deferrable constraints
  SET CONSTRAINTS ALL DEFERRED;
  -- This loop updates / deletes all referenced tables, setting the person_id (or equivalent)
  -- From secondary_person_id => primary_person_id
  FOR _referenced_table IN (SELECT * FROM utils.get_referenced_tables('people', 'person', 'id')) LOOP
    -- the column_names are stored as an array, so we need to loop through these too
    FOREACH _col IN ARRAY _referenced_table.column_name LOOP
      RAISE NOTICE 'Merging %.%(%)', _referenced_table.schema_name, _referenced_table.table_name, _col;

      -- FORMAT allows us to safely build a dynamic SQL string
      _exec = FORMAT(
        $sql$ UPDATE %s.%s SET %s = $1 WHERE %s = $2 $sql$,
        _referenced_table.schema_name,
        _referenced_table.table_name,
        _col,
        _col
      );

      RAISE NOTICE 'SQL:  %', _exec;

      -- wrap the execution in a block so that we can handle uniqueness violations
      BEGIN
        EXECUTE _exec USING primary_person_id, secondary_person_id;
        RAISE NOTICE 'Merged %.%(%) OK!', _referenced_table.schema_name, _referenced_table.table_name, _col;
      EXCEPTION
        -- Error codes are Postgres built-ins, see https://www.postgresql.org/docs/9.6/errcodes-appendix.html
        WHEN unique_violation OR exclusion_violation THEN
          RAISE NOTICE 'Cannot merge record with % = % on table %.%, falling back to deletion!', _col, secondary_person_id, _referenced_table.schema_name, _referenced_table.table_name;
          _exec = FORMAT(
            $sql$ DELETE FROM %s.%s WHERE %s = $1 $sql$,
            _referenced_table.schema_name,
            _referenced_table.table_name,
            _col
          );
          RAISE NOTICE 'SQL:  %', _exec;
          EXECUTE _exec USING secondary_person_id;
          RAISE WARNING 'Deleted record with % = % on table %.%', _col, secondary_person_id, _referenced_table.schema_name, _referenced_table.table_name;
      END;

    END LOOP;
  END LOOP;

  -- Once we've updated all the tables, we can safely delete the secondary person
  RAISE WARNING 'Deleted person with id = %', secondary_person_id;

  -- Get our primary person so that we can return them
  SELECT * FROM people.person WHERE id = primary_person_id INTO _primary_person;

  RETURN _primary_person;

END
$$ LANGUAGE plpgsql VOLATILE;

Note the use of SET CONSTRAINTS ALL DEFERRED; in the function, which ensures that foreign key relationships are checked at the end of the merge. You may need to update your constraints to be DEFERRABLE INITIALLY DEFERRED:

ALTER TABLE settings.contact_preference
  DROP CONSTRAINT contact_preference_person_id_fkey,
  DROP CONSTRAINT person_id_current_address_id_fkey,
  ADD CONSTRAINT contact_preference_person_id_fkey
    FOREIGN KEY (person_id)
    REFERENCES people.person(id)
    ON UPDATE CASCADE ON DELETE CASCADE
    DEFERRABLE INITIALLY IMMEDIATE,
  ADD CONSTRAINT person_id_current_address_id_fkey
    FOREIGN KEY (person_id, current_address_id)
    REFERENCES people.address(person_id, id)
    DEFERRABLE INITIALLY IMMEDIATE
;

Attributions

All content for this solution is sourced from the original question on Stackoverflow.

The content on this page is licensed under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.

Content TypeOriginal AuthorOriginal Content on Stackoverflow
QuestionGaurav DadhaniaView Question on Stackoverflow
Solution 1 - SqlRichardTheKiwiView Answer on Stackoverflow
Solution 2 - SqlTony K.View Answer on Stackoverflow
Solution 3 - SqlvallismortisView Answer on Stackoverflow
Solution 4 - SqlAnomieView Answer on Stackoverflow
Solution 5 - SqlFadidView Answer on Stackoverflow
Solution 6 - SqlLucasView Answer on Stackoverflow
Solution 7 - SqlEugene YarmashView Answer on Stackoverflow
Solution 8 - SqlJakobiiView Answer on Stackoverflow
Solution 9 - SqlcolophonemesView Answer on Stackoverflow