Software Engineer – AI Solutions @ GEICO
- Architected & deployed production LLM backends with Python + FastAPI and vLLM on Azure, enabling scalable AI features.
- Improved claims-model accuracy by ~2.5% and cut latency by ~50% via prompt tuning and LLaMA evaluation, which also reduced costs.
- Built a Generative AI platform for document processing (PDF/CSV/DOCX/TXT) with role‑based access & dynamic UI.
- Created an LLM eval pipeline (30% faster via parallelization) and a Snowflake dashboard for model monitoring.
- Implemented robust token counting across 8 models with automated guardrails and error handling.
- Designed an event‑driven logging service (Kafka → Postgres) for auditing and real‑time monitoring.
- Shipped an MCP tools marketplace to register, discover, and test Model Context Protocol tools.