Updates to reduce firestore calls to try and stay in free tier

### Firestore read reductions

**1. `doc_get_cached()` in `firestore.py` — new 5-min TTL cache**
One place, benefits everything. System and node config documents almost never change during a monitoring session.

**2. System doc: 4 reads → 1 per call**
| Before | After |
|---|---|
| `upload.py` — `doc_get("systems")` for ai_flags | `doc_get_cached` |
| `transcription.py` — `get_vocabulary()` → `doc_get("systems")` | cache hit |
| `intelligence.py` — `get_vocabulary()` → `doc_get("systems")` | cache hit |
| `intelligence.py` — `doc_get("systems")` again for ten_codes | eliminated (reads same cached doc) |

**3. Node doc: cached in `_on_call_start` and `intelligence.py`**
The node is read every call event to get `assigned_system_id` and lat/lon for geocoding. Both now use the cache — node assignments and positions essentially never change at runtime.

**4. Node sweeper: 30s → 90s interval**
The sweeper was doing a full node collection scan 3× more often than necessary — the offline threshold is already 90s. Cuts sweeper reads by 66%.

**5. Vocabulary induction: scans all-time calls → last 7 days**
Previously fetched every ended call for a system (could be thousands). Now scoped to the last 7 days.

> **Note:** The vocabulary induction query `(system_id == X, ended_at >= cutoff)` needs a Firestore
> composite index on `(system_id ASC, ended_at ASC)`. When the induction loop first fires it will log
> an error with a Firebase Console link to create it in one click.

This commit is contained in:

Logan

2026-05-04 02:05:00 -04:00

parent 97f4286810

commit 7e1b01a275

6 changed files with 43 additions and 15 deletions

									
										drb-c2-core/app/internal/node_sweeper.py
									
		+1
		-1
	
												View File
												
				@@ -4,7 +4,7 @@ from app.config import settings

				from app.internal.logger import logger

				from app.internal import firestore as fstore

				SWEEP_INTERVAL = 30  # seconds

				SWEEP_INTERVAL = 90  # seconds — matches node_offline_threshold; no gain in checking faster

				async def sweeper_loop():