
Running Stache on AWS with Stache Serverless

Jonathan Penny

I’ve released Stache Serverless, a production-ready way to run Stache on AWS infrastructure.

Architecture

The stack uses:

S3 Vectors in Production

I’ve been running S3 Vectors since it hit GA. Here’s my honest assessment:

What works well:

Limitations:

Performance

Design Decisions

The system uses the provider pattern for its swappable components: you can replace the vector store, LLM, or embedding provider without changing the core code. Embeddings are auto-split to stay within token limits, and middleware plugins provide extensibility.
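To make the provider idea concrete, here is a minimal sketch of how swappable providers and auto-splitting can fit together. It is not Stache's actual code: the names `EmbeddingProvider`, `VectorStore`, `FakeEmbedder`, `InMemoryStore`, `split_by_tokens`, and `ingest` are all hypothetical, and whitespace-separated words stand in for real tokens.

```python
import hashlib
from typing import Protocol


class EmbeddingProvider(Protocol):
    """Hypothetical interface: anything that turns text into a vector."""
    max_tokens: int

    def embed(self, text: str) -> list[float]: ...


class VectorStore(Protocol):
    """Hypothetical interface: anything that can store a keyed vector."""

    def upsert(self, key: str, vector: list[float], text: str) -> None: ...


class FakeEmbedder:
    """Stand-in provider; a real one would call an embedding API."""
    max_tokens = 8

    def embed(self, text: str) -> list[float]:
        # Derive a small deterministic vector from a hash of the text.
        digest = hashlib.sha256(text.encode()).digest()
        return [b / 255 for b in digest[:4]]


class InMemoryStore:
    """Stand-in store; a real one might wrap S3 Vectors or another backend."""

    def __init__(self) -> None:
        self.rows: dict[str, tuple[list[float], str]] = {}

    def upsert(self, key: str, vector: list[float], text: str) -> None:
        self.rows[key] = (vector, text)


def split_by_tokens(text: str, max_tokens: int) -> list[str]:
    """Split text into chunks of at most max_tokens words.

    Assumption: words approximate tokens; real tokenizers count differently.
    """
    words = text.split()
    return [" ".join(words[i:i + max_tokens])
            for i in range(0, len(words), max_tokens)]


def ingest(doc_id: str, text: str,
           embedder: EmbeddingProvider, store: VectorStore) -> int:
    """Core logic depends only on the two interfaces, not on concrete classes."""
    chunks = split_by_tokens(text, embedder.max_tokens)
    for n, chunk in enumerate(chunks):
        store.upsert(f"{doc_id}#{n}", embedder.embed(chunk), chunk)
    return len(chunks)
```

Because `ingest` only sees the two protocols, swapping in a different embedder or store means passing a different object; the function itself does not change. The chunk size comes from the provider, so each embedding model can declare its own limit.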

Deployment

Everything deploys via a SAM template. You can also run local development against real AWS services, with no mocking required.

Check out stache-serverless on GitHub and the full discussion for details.