Location: Maryland, USA
Remote: Yes
Willing to relocate: No
Technologies:
AI Platform & LLM Systems: RAG architectures, LLM orchestration, control planes, routing & fallback logic, evaluation frameworks (Langfuse, deepeval), trace-level observability, prompt/version management, context engineering, agentic workflows
Distributed Systems & Infrastructure: High-availability design, autoscaling concepts, failure modeling, SLO/SLA enforcement, load balancing (HAProxy/Nginx), Kubernetes, Docker, cloud-native architectures(AWS/GCP/Azure), Rackspace, Terraform, Ansible, Chef, Linux, Solaris, ZFS
Systems Design & Leadership: Architecture Reviews, Design RFCs, Failure Modeling, Incident Postmortems, Cross-team platform enablement
Languages: Python, SQL, Bash
Databases: PostgreSQL (expert: performance tuning, query optimization, replication, partitioning, migrations, security), MySQL (strong: performance tuning, optimization), CockroachDB, Cassandra, MS SQL Server, Oracle
Résumé/CV: https://paste.centos.org/view/1edc49afEmail: psinghpayal@outlook.com
Summary: Over a decade of experience scaling distributed systems in production, designing reliable systems around critical processess, and optimizing for performance, security, robustness, and efficiency. If you want someone who can come in and build production-ready AI systems/agents, I'm the one you're looking for :)