
Senior Ml Engineer
Apply production-grade LLM integration and MLOps patterns when designing, shipping, and operating ML systems that must scale and stay observable.
Install
npx skills add https://github.com/davila7/claude-code-templates --skill senior-ml-engineerWhat is this skill?
- Production-first design: scalability, 99.9% uptime target, maintainability, and observability
- Three advanced patterns: distributed processing, real-time low-latency systems, and ML at scale with monitoring
- LLM integration principles: input validation, encryption, access control, and audit logging
- MLOps production patterns for fault-tolerant enterprise-scale and high-throughput workloads
- Best-practice checklists for testing, profiling, caching, retries, and circuit breakers
Adoption & trust: 669 installs on skills.sh; 27.8k GitHub stars; 3/3 security scanners passed (skills.sh audits).
Recommended Skills
Journey fit
Canonical shelf is Build/backend because the skill centers on implementing LLM integration and ML systems—not one-off marketing or launch tasks. Backend subphase fits API-serving models, batch and real-time inference pipelines, and service-level reliability targets called out in the guide.
Common Questions / FAQ
Is Senior Ml Engineer safe to install?
skills.sh reports 3 of 3 security scanners passed. Review the Security Audits panel on this page before installing in production.
SKILL.md
READMESKILL.md - Senior Ml Engineer
# Llm Integration Guide ## Overview World-class llm integration guide for senior ml/ai engineer. ## Core Principles ### Production-First Design Always design with production in mind: - Scalability: Handle 10x current load - Reliability: 99.9% uptime target - Maintainability: Clear, documented code - Observability: Monitor everything ### Performance by Design Optimize from the start: - Efficient algorithms - Resource awareness - Strategic caching - Batch processing ### Security & Privacy Build security in: - Input validation - Data encryption - Access control - Audit logging ## Advanced Patterns ### Pattern 1: Distributed Processing Enterprise-scale data processing with fault tolerance. ### Pattern 2: Real-Time Systems Low-latency, high-throughput systems. ### Pattern 3: ML at Scale Production ML with monitoring and automation. ## Best Practices ### Code Quality - Comprehensive testing - Clear documentation - Code reviews - Type hints ### Performance - Profile before optimizing - Monitor continuously - Cache strategically - Batch operations ### Reliability - Design for failure - Implement retries - Use circuit breakers - Monitor health ## Tools & Technologies Essential tools for this domain: - Development frameworks - Testing libraries - Deployment platforms - Monitoring solutions ## Further Reading - Research papers - Industry blogs - Conference talks - Open source projects # Mlops Production Patterns ## Overview World-class mlops production patterns for senior ml/ai engineer. ## Core Principles ### Production-First Design Always design with production in mind: - Scalability: Handle 10x current load - Reliability: 99.9% uptime target - Maintainability: Clear, documented code - Observability: Monitor everything ### Performance by Design Optimize from the start: - Efficient algorithms - Resource awareness - Strategic caching - Batch processing ### Security & Privacy Build security in: - Input validation - Data encryption - Access control - Audit logging ## Advanced Patterns ### Pattern 1: Distributed Processing Enterprise-scale data processing with fault tolerance. ### Pattern 2: Real-Time Systems Low-latency, high-throughput systems. ### Pattern 3: ML at Scale Production ML with monitoring and automation. ## Best Practices ### Code Quality - Comprehensive testing - Clear documentation - Code reviews - Type hints ### Performance - Profile before optimizing - Monitor continuously - Cache strategically - Batch operations ### Reliability - Design for failure - Implement retries - Use circuit breakers - Monitor health ## Tools & Technologies Essential tools for this domain: - Development frameworks - Testing libraries - Deployment platforms - Monitoring solutions ## Further Reading - Research papers - Industry blogs - Conference talks - Open source projects # Rag System Architecture ## Overview World-class rag system architecture for senior ml/ai engineer. ## Core Principles ### Production-First Design Always design with production in mind: - Scalability: Handle 10x current load - Reliability: 99.9% uptime target - Maintainability: Clear, documented code - Observability: Monitor everything ### Performance by Design Optimize from the start: - Efficient algorithms - Resource awareness - Strategic caching - Batch processing ### Security & Privacy Build security in: - Input validation - Data encryption - Access control - Audit logging ## Advanced Patterns ### Pattern 1: Distributed Processing Enterprise-scale data processing with fault tolerance. ### Pattern 2: Real-Time Systems Low-latency, high-throughput systems. ### Pattern 3: ML at Scale Production ML with monitoring and automation. ## Best Practices ### Code Quality - Comprehensive testing - Clear documentation - Code reviews - Type hints ### Performance - Profile before optimizing - Monitor continuously - Cache strategically - Batch operations ### Reliability - Design for failure - Implement retries - Use cir