Reduce backfill throttle to 0.1s to fit within gunicorn timeout

0.5s × 136 tools = 68s of sleep alone, which combined with network
time exceeded gunicorn's 120s worker timeout.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
This commit is contained in:
rob 2026-02-02 08:17:56 -04:00
parent 5159450feb
commit 33b807d545
1 changed files with 1 additions and 1 deletions

View File

@ -204,7 +204,7 @@ def backfill_all_embeddings(
ollama_url: Optional[str] = None, ollama_url: Optional[str] = None,
model: Optional[str] = None, model: Optional[str] = None,
batch_size: int = 20, batch_size: int = 20,
throttle: float = 0.5, throttle: float = 0.1,
) -> Dict[str, int]: ) -> Dict[str, int]:
"""Generate embeddings for all public/approved tools that don't have one. """Generate embeddings for all public/approved tools that don't have one.