Senior Software Engineer – DevOps / SRE (GCP)
Posted 2026-05-06
Remote, USA
Full-time
Immediate Start
Job Title: Senior Software Engineer - DevOps / SRE (GCP) Location: 100% Remote Employment Type: Contract (C2C Allowed) Experience Level: Senior (8+ Years Preferred)
Job Summary:
Seeking a highly skilled Senior Software Engineer with DevOps / Site Reliability Engineering (SRE) expertise and strong Google Cloud Platform (GCP) experience. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of distributed applications while driving automation and observability across systems.
This role involves close collaboration with engineering teams to support modern application environments and enhance operational excellence.
- Key Responsibilities:
- Support and maintain .NET, Node.js, and Angular applications in distributed environments
- Manage and optimize applications running on Windows Server and GCP infrastructure
- Improve system reliability, scalability, and performance through SRE best practices
- Design and implement automation solutions for operational processes
- Build, manage, and optimize CI/CD pipelines
- Enhance monitoring, logging, and observability frameworks
- Perform incident management, root cause analysis, and implement preventive measures
- Collaborate with cross-functional teams to ensure high system availability and stability
- Required Skills & Qualifications:
- Strong experience with .NET, Node.js, and Angular ecosystems
- Hands-on expertise in Google Cloud Platform (GCP) including:
- GKE (Google Kubernetes Engine)
- Cloud SQL / Spanner
- Networking, IAM, and Monitoring tools
- Experience with Windows Server and distributed application hosting
- Solid DevOps/SRE background with:
- CI/CD pipeline implementation
- Automation and scripting (PowerShell, Python, or Node.js)
- Strong SQL skills, including performance tuning and troubleshooting
- Experience with monitoring & observability tools:
- Prometheus
- Grafana
- OpenTelemetry
- Proven experience in incident response and reliability engineering (SLIs/SLOs)
- Knowledge of Infrastructure as Code (Terraform)
- Understanding of cloud security and networking fundamentals
- Preferred Qualifications:
- Experience working in large-scale distributed systems
- Strong problem-solving and troubleshooting mindset
- Excellent communication and collaboration skills