How to get a count of number of documents in a collection with Cloud Firestore

DatabaseFirebaseGoogle Cloud-Firestore

Database Problem Overview


In Firestore, how can I get the total number of documents in a collection?

For instance if I have

/people
    /123456
        /name - 'John'
    /456789
        /name - 'Jane'

I want to query how many people I have and get 2.

I could do a query on /people and then get the length of the returned results, but that seems a waste, especially because I will be doing this on larger datasets.

Database Solutions


Solution 1 - Database

You currently have 3 options:

Option 1: Client side

This is basically the approach you mentioned. Select all from collection and count on the client side. This works well enough for small datasets but obviously doesn't work if the dataset is larger.

Option 2: Write-time best-effort

With this approach, you can use Cloud Functions to update a counter for each addition and deletion from the collection.

This works well for any dataset size, as long as additions/deletions only occur at the rate less than or equal to 1 per second. This gives you a single document to read to give you the almost current count immediately.

If need need to exceed 1 per second, you need to implement distributed counters per our documentation.

Option 3: Write-time exact

Rather than using Cloud Functions, in your client you can update the counter at the same time as you add or delete a document. This means the counter will also be current, but you'll need to make sure to include this logic anywhere you add or delete documents.

Like option 2, you'll need to implement distributed counters if you want to exceed per second

Solution 2 - Database

Aggregations are the way to go (firebase functions looks like the recommended way to update these aggregations as client side exposes info to the user you may not want exposed) https://firebase.google.com/docs/firestore/solutions/aggregation

Another way (NOT recommended) which is not good for large lists and involves downloading the whole list: res.size like this example:

   db.collection("logs")
      .get()
      .then((res) => console.log(res.size));

Solution 3 - Database

If you use AngulareFire2, you can do (assuming private afs: AngularFirestore is injected in your constructor):

this.afs.collection(myCollection).valueChanges().subscribe( values => console.log(values.length));

Here, values is an array of all items in myCollection. You don't need metadata so you can use valueChanges() method directly.

Solution 4 - Database

Be careful counting number of documents for large collections with a cloud function. It is a little bit complex with firestore database if you want to have a precalculated counter for every collection.

Code like this doesn't work in this case:

export const customerCounterListener = 
    functions.firestore.document('customers/{customerId}')
    .onWrite((change, context) => {

    // on create
    if (!change.before.exists && change.after.exists) {
        return firestore
                 .collection('metadatas')
                 .doc('customers')
                 .get()
                 .then(docSnap =>
                     docSnap.ref.set({
                         count: docSnap.data().count + 1
                     }))
    // on delete
    } else if (change.before.exists && !change.after.exists) {
        return firestore
                 .collection('metadatas')
                 .doc('customers')
                 .get()
                 .then(docSnap =>
                     docSnap.ref.set({
                         count: docSnap.data().count - 1
                     }))
    }

    return null;
});

The reason is because every cloud firestore trigger has to be idempotent, as firestore documentation say: https://firebase.google.com/docs/functions/firestore-events#limitations_and_guarantees

Solution

So, in order to prevent multiple executions of your code, you need to manage with events and transactions. This is my particular way to handle large collection counters:

const executeOnce = (change, context, task) => {
    const eventRef = firestore.collection('events').doc(context.eventId);

    return firestore.runTransaction(t =>
	    t
		 .get(eventRef)
		 .then(docSnap => (docSnap.exists ? null : task(t)))
		 .then(() => t.set(eventRef, { processed: true }))
    );
};

const documentCounter = collectionName => (change, context) =>
    executeOnce(change, context, t => {
        // on create
	    if (!change.before.exists && change.after.exists) {
		    return t
                    .get(firestore.collection('metadatas')
                    .doc(collectionName))
                    .then(docSnap =>
			            t.set(docSnap.ref, {
				            count: ((docSnap.data() && docSnap.data().count) || 0) + 1
			            }));
        // on delete
	    } else if (change.before.exists && !change.after.exists) {
		    return t
                     .get(firestore.collection('metadatas')
                     .doc(collectionName))
                     .then(docSnap =>
			            t.set(docSnap.ref, {
                            count: docSnap.data().count - 1
			            }));
	    }

	    return null;
    });

Use cases here:

/**
 * Count documents in articles collection.
 */
exports.articlesCounter = functions.firestore
    .document('articles/{id}')
    .onWrite(documentCounter('articles'));

/**
 * Count documents in customers collection.
 */
exports.customersCounter = functions.firestore
	.document('customers/{id}')
	.onWrite(documentCounter('customers'));

As you can see, the key to prevent multiple execution is the property called eventId in the context object. If the function has been handled many times for the same event, the event id will be the same in all cases. Unfortunately, you must have "events" collection in your database.

Solution 5 - Database

Please check below answer I found on another thread. Your count should be atomic. Its required to use FieldValue.increment() function in such case.

https://stackoverflow.com/a/49407570/3337028

Solution 6 - Database

firebase-admin offers select(fields) which allows you to only fetch specific fields for documents within your collection. Using select is more performant than fetching all fields. However, it is only available for firebase-admin and firebase-admin is typically only used server side.

select can be used as follows:

select('age', 'name') // fetch the age and name fields
select() // select no fields, which is perfect if you just want a count

select is available for Node.js servers but I am not sure about other languages:

https://googleapis.dev/nodejs/firestore/latest/Query.html#select https://googleapis.dev/nodejs/firestore/latest/CollectionReference.html#select

Here's a server side cloud function written in Node.js which uses select to count a filtered collection and to get the IDs of all resulting documents. Its written in TS but easily converted to JS.

import admin from 'firebase-admin'

// https://stackoverflow.com/questions/46554091/cloud-firestore-collection-count

// we need to use admin SDK here as select() is only available for admin
export const videoIds = async (req: any): Promise<any> => {

  const id: string = req.query.id || null
  const group: string = req.query.group || null
  let processed: boolean = null
  if (req.query.processed === 'true') processed = true
  if (req.query.processed === 'false') processed = false

  let q: admin.firestore.Query<admin.firestore.DocumentData> = admin.firestore().collection('videos')
  if (group != null) q = q.where('group', '==', group)
  if (processed != null) q = q.where('flowPlayerProcessed', '==', processed)
  // select restricts returned fields such as ... select('id', 'name')
  const query: admin.firestore.QuerySnapshot<admin.firestore.DocumentData> = await q.orderBy('timeCreated').select().get()

  const ids: string[] = query.docs.map((doc: admin.firestore.QueryDocumentSnapshot<admin.firestore.DocumentData>) => doc.id) // ({ id: doc.id, ...doc.data() })

  return {
	id,
	group,
	processed,
	idx: id == null ? null : ids.indexOf(id),
	count: ids.length,
	ids
  }
}

The cloud function HTTP request completes within 1 second for a collection of 500 docs where each doc contains a lot of data. Not amazingly performant but much better than not using select. Performance could be improved by introducing client side caching (or even server side caching).

The cloud function entry point looks like this:

exports.videoIds = functions.https.onRequest(async (req, res) => {
  const response: any = await videoIds(req)
  res.json(response)
})

The HTTP request URL would be:

https://SERVER/videoIds?group=my-group&processed=true

Firebase functions detail where the server is located on deployment.

Solution 7 - Database

Following Dan Answer: You can have a separated counter in your database and use Cloud Functions to maintain it. (Write-time best-effort)

// Example of performing an increment when item is added
module.exports.incrementIncomesCounter = collectionRef.onCreate(event => {
  const counterRef = event.data.ref.firestore.doc('counters/incomes')

  counterRef.get()
  .then(documentSnapshot => {
    const currentCount = documentSnapshot.exists ? documentSnapshot.data().count : 0

    counterRef.set({
      count: Number(currentCount) + 1
    })
    .then(() => {
      console.log('counter has increased!')
    })
  })
})

This code shows you the complete example of how to do it: <https://gist.github.com/saintplay/3f965e0aea933a1129cc2c9a823e74d7>

Solution 8 - Database

I created an NPM package to handle all counters:

First install the module in your functions directory:

npm i adv-firestore-functions

then use it like so:

import { eventExists, colCounter } from 'adv-firestore-functions';

functions.firestore
    .document('posts/{docId}')
    .onWrite(async (change: any, context: any) => {

    // don't run if repeated function
    if (await eventExists(context)) {
      return null;
    }

    await colCounter(change, context);
}

It handles events, and everything else.

If you want to make it a universal counter for all functions:

import { eventExists, colCounter } from 'adv-firestore-functions';

functions.firestore
    .document('{colId}/{docId}')
    .onWrite(async (change: any, context: any) => {

    const colId = context.params.colId;

    // don't run if repeated function
    if (await eventExists(context) || colId.startsWith('_')) {
      return null;
    }

    await colCounter(change, context);
}

And don't forget your rules:

match /_counters/{document} {
  allow read;
  allow write: if false;
}

And of course access it this way:

const collectionPath = 'path/to/collection';
const colSnap = await db.doc('_counters/' + collectionPath).get();
const count = colSnap.get('count');

Read more: https://fireblog.io/post/Zebl6sSbaLdrnSFKbCJx/firestore-counters
GitHub: https://github.com/jdgamble555/adv-firestore-functions

Solution 9 - Database

Get a new write batch

WriteBatch batch = db.batch();

Add a New Value to Collection "NYC"

DocumentReference nycRef = db.collection("cities").document();
batch.set(nycRef, new City());

Maintain a Document with Id as Count and initial Value as total=0

During Add Operation perform like below

DocumentReference countRef= db.collection("cities").document("count");
batch.update(countRef, "total", FieldValue.increment(1));

During Delete Operation perform like below

DocumentReference countRef= db.collection("cities").document("count");
batch.update(countRef, "total", FieldValue.increment(-1));

Always get Document count from

DocumentReference nycRef = db.collection("cities").document("count");

Solution 10 - Database

Use Transaction to update the count inside the success listener of your database write.

FirebaseFirestore.getInstance().runTransaction(new Transaction.Function<Long>() {
                @Nullable
                @Override
                public Long apply(@NonNull Transaction transaction) throws FirebaseFirestoreException {
                    DocumentSnapshot snapshot = transaction
                            .get(pRefs.postRef(forumHelper.getPost_id()));
                    long newCount;
                    if (b) {
                        newCount = snapshot.getLong(kMap.like_count) + 1;
                    } else {
                        newCount = snapshot.getLong(kMap.like_count) - 1;
                    }
    
                    transaction.update(pRefs.postRef(forumHelper.getPost_id()),
                            kMap.like_count, newCount);
    
                    return newCount;
                }
            });

Attributions

All content for this solution is sourced from the original question on Stackoverflow.

The content on this page is licensed under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.

Content TypeOriginal AuthorOriginal Content on Stackoverflow
Questionjustinbc820View Question on Stackoverflow
Solution 1 - DatabaseDan McGrathView Answer on Stackoverflow
Solution 2 - DatabaseBen CochraneView Answer on Stackoverflow
Solution 3 - DatabaseJohan ChouquetView Answer on Stackoverflow
Solution 4 - DatabaseFerran VerdésView Answer on Stackoverflow
Solution 5 - DatabaseNikhilView Answer on Stackoverflow
Solution 6 - Databasedanday74View Answer on Stackoverflow
Solution 7 - DatabaseSaint PlayView Answer on Stackoverflow
Solution 8 - DatabaseJonathanView Answer on Stackoverflow
Solution 9 - DatabaseMithun SView Answer on Stackoverflow
Solution 10 - DatabaseVikash SharmaView Answer on Stackoverflow