Reputation: 1737
Let's say I have a collection with tens of thousands of documents, and I'm storing the entire collection in the cache on the client to avoid making tons of unnecessary reads each time the app is opened. To get the latest content, the client just needs to query for all documents with a last_modified time after the previous fetch. This works great when creating or updating documents in the collection, but what happens when you want to delete something? If one client removes a document from the database, the other clients will never realize it's missing, and if you add a deleted field instead, the document ties up storage space forever, both on the server and on every client that's ever fetched it.
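For concreteness, a minimal sketch of the delta fetch described above (Firebase web SDK, modular API). The activities collection, the last_modified field, the deleted tombstone flag, and the in-memory cache are all illustrative assumptions, not my actual code:

```typescript
// Sketch of the last_modified delta sync, with a "deleted" flag as the tombstone.
// Assumes initializeApp(...) has already been called elsewhere.
import {
  collection,
  getDocs,
  getFirestore,
  query,
  where,
  Timestamp,
} from "firebase/firestore";

const db = getFirestore();

// Stand-in for whatever client-side store holds the previously fetched documents.
const localCache = new Map<string, Record<string, unknown>>();

async function syncSince(lastFetch: Timestamp): Promise<Timestamp> {
  const q = query(
    collection(db, "activities"),
    where("last_modified", ">", lastFetch)
  );
  const snapshot = await getDocs(q);

  let newest = lastFetch;
  snapshot.forEach((docSnap) => {
    const data = docSnap.data();
    const modified = data.last_modified as Timestamp;
    if (modified.toMillis() > newest.toMillis()) {
      newest = modified;
    }
    if (data.deleted) {
      // Tombstone: drop it locally. The tombstone document itself still lives
      // on the server forever, which is exactly the problem described above.
      localCache.delete(docSnap.id);
    } else {
      localCache.set(docSnap.id, data);
    }
  });

  // Persist this as the cursor for the next sync.
  return newest;
}
```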
What techniques are there to deal with this kind of thing? I'm open to making major changes to the architecture I've described if it's necessary to solve this problem, so I'd prefer an answer of "start over and do it like this" to "it's impossible". I've considered reusing deleted documents when new content is added instead of creating new ones, but that still means the database can never shrink.
In summary, the requirements for my app are:
Edit: The method I'm using for improving query performance is from the docs, at the bottom of this page. I'm making a time tracker, where each activity is a very lightweight document (<1kb) and a user might log up to 10K activities per year. Each user only sees their own activities, but they might want to log and view them on multiple devices. Keeping the entire history on each device makes it very easy to calculate statistics on the fly without having to worry about read costs, and the data is small enough that I don't need to worry about storage space. I've considered optimizing by combining activities into larger documents or by pre-calculating the aggregations ahead of time, but that feels a lot more like premature optimization than this caching strategy does.
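As an example of what I mean by calculating statistics on the fly, something like this works entirely against the local cache (the collection path and the duration_minutes field are just placeholders for my actual schema):

```typescript
// Sketch: computing a statistic from the locally cached history only.
// Assumes the SDK's persistent cache is enabled and that each activity
// document carries a numeric "duration_minutes" field (illustrative names).
import { collection, getDocsFromCache, getFirestore } from "firebase/firestore";

async function totalMinutesOnThisDevice(): Promise<number> {
  const db = getFirestore();
  // getDocsFromCache never touches the server, so this costs zero reads.
  const snapshot = await getDocsFromCache(collection(db, "activities"));
  let total = 0;
  snapshot.forEach((docSnap) => {
    const minutes = docSnap.data().duration_minutes;
    total += typeof minutes === "number" ? minutes : 0;
  });
  return total;
}
```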
Upvotes: 1
Views: 172
Reputation: 599601
Micro-optimizing the API calls for document reads is typically not the best approach to using Firestore.
To get the latest content, the client just needs to query for all documents with a last_modified time after the previous fetch.
If all documents are already in the local cache, you gain nothing by doing a query like this.
Say you have your 1,000 documents already cached, and one was added or updated. Doing your query results in one document being read from the server, but so does simply reading the entire collection.
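This relies on the SDK's local persistence: with the persistent cache enabled and a listener on the collection, unchanged documents are served from the on-device cache and docChanges() reports only the delta. A rough sketch, assuming the newer web SDK and an activities collection (the name is illustrative):

```typescript
// Sketch: letting the SDK's persistent cache do the delta work instead of a
// hand-rolled last_modified query.
import { initializeApp } from "firebase/app";
import {
  collection,
  initializeFirestore,
  onSnapshot,
  persistentLocalCache,
} from "firebase/firestore";

const app = initializeApp({ /* your config */ });
const db = initializeFirestore(app, { localCache: persistentLocalCache() });

// Listen to the whole collection; unchanged documents come from the local
// cache, and docChanges() only reports documents that were added, modified,
// or removed since the previous snapshot.
const unsubscribe = onSnapshot(collection(db, "activities"), (snapshot) => {
  snapshot.docChanges().forEach((change) => {
    console.log(change.type, change.doc.id);
  });
});
```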
A much better question here is why you are reading and caching 1,000 documents to begin with. Is the average user of your app really going to see all of those 1,000 documents? It seems like an awful lot of information to show on a single screen.
It might be better to only load the first screenful of information, and then load the rest on-demand.
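Loading one screenful and paging in the rest on demand could use query cursors, for example (ordering field, page size, and collection name are assumptions):

```typescript
// Sketch: load one page, then fetch the next page on demand with a cursor.
import {
  collection,
  getDocs,
  getFirestore,
  limit,
  orderBy,
  query,
  startAfter,
  QueryDocumentSnapshot,
} from "firebase/firestore";

const db = getFirestore();
const pageSize = 25;

async function loadPage(after?: QueryDocumentSnapshot) {
  const base = query(
    collection(db, "activities"),
    orderBy("last_modified", "desc"),
    limit(pageSize)
  );
  const q = after ? query(base, startAfter(after)) : base;
  const snapshot = await getDocs(q);
  // Keep the last document around as the cursor for the next page.
  const cursor = snapshot.docs[snapshot.docs.length - 1];
  return { docs: snapshot.docs, cursor };
}
```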
You could also consider whether you really need the entire documents. For example: if you only show the headline of each document, could you combine the 1,000 headlines into a single document (say called recent_headlines) and load that instead?
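That could be as simple as maintaining one summary document that readers fetch in a single call; the aggregates/recent_headlines path and field layout here are assumptions:

```typescript
// Sketch: one combined document instead of 1,000 individual reads.
import { doc, getDoc, getFirestore, setDoc } from "firebase/firestore";

interface Headline {
  id: string;
  title: string;
}

const db = getFirestore();
const summaryRef = doc(db, "aggregates", "recent_headlines");

// Whoever writes the source documents (a client or a Cloud Function) also
// rewrites the summary document with the latest headlines.
async function writeSummary(headlines: Headline[]) {
  await setDoc(summaryRef, { headlines });
}

// Readers pay for a single document read instead of one read per headline.
async function readSummary(): Promise<Headline[]> {
  const snapshot = await getDoc(summaryRef);
  return (snapshot.data()?.headlines as Headline[]) ?? [];
}
```

Keep in mind that a single document is capped at 1 MiB, so this only works while the combined payload stays small.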
Finally, consider using data bundles to bundle updates together, serve them from a CDN and reduce the number of document reads you're charged for.
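On the client, consuming a bundle served from a CDN might look like this (the bundle URL and the named query are assumptions; the bundle itself would be built server-side and cached on the CDN):

```typescript
// Sketch: loading a pre-built Firestore data bundle from a CDN.
import {
  getDocsFromCache,
  getFirestore,
  loadBundle,
  namedQuery,
} from "firebase/firestore";

async function loadFromBundle() {
  const db = getFirestore();

  const response = await fetch("https://cdn.example.com/bundles/latest");
  // loadBundle populates the local cache; the documents were already read
  // (and billed) once when the bundle was built, not per client.
  await loadBundle(db, await response.arrayBuffer());

  // The named query is defined when the bundle is built server-side.
  const q = await namedQuery(db, "latest-activities");
  if (q) {
    const snapshot = await getDocsFromCache(q);
    snapshot.forEach((docSnap) => console.log(docSnap.id));
  }
}
```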
Upvotes: 2