Job Description
Guidewire is seeking an experienced Cloud Operations Engineer to support the design, implementation, and on-going maintenance of our Cloud Solutions. Your job will be to help ensure that our customer development applications, services and websites are available, performing well and highly automated. The successful candidate will have a good mix of deep technical knowledge and a demonstrated background managing Cloud Applications.
Responsibilities
- Monitor and Support customer CI/CD environment ensuring that the infrastructure is available and running
- Investigate, troubleshoot and resolve any issues that impact the customer ability to successfully execute CI/CD functions
- Hands-on build and manage customer development and performance-testing environments in the cloud
- Work with stakeholders to drive scope definition, specifications, and architecture of Deployment Tools & Processes for AWS Cloud
- Work closely with engineers, test engineers, product owners and management to understand requirements, define and implement best practices and standards around DevOps and Service Resilience
- Measurement, optimization, and tuning of system performance and ensuring that systems will run reliably and are highly available in a 24/7 production environment
- Define, implement and document operational processes and procedures, with periodic review for efficiency and improvement
- Manage complex projects from inception to completion
- Participate in 24x7 on-call rotation monitoring systems for meeting defined SLAs
Required Skills and Experience
- 3+ years hands-on experience with AWS (EC2, S3, RDS, VPC, Route 53, IAM, etc.)
- 3+ years of experience using CI/CD tools (git, Jenkins, Bitbucket, AWS Code Pipeline, Nexus, Artifactory, Terraform, CloudFormation)
- 3 years of experience with modern automation, DevOps, and Agile development
- Networking fundamentals including VPNs, DNS, subnetting, route tables, firewalls, etc
- Strong understanding of AWS security and monitoring and experience implementing best practices
- Strong experience with Monitoring and Alerting Tools: CloudWatch, Prometheus/Grafana, Zabbix, PagerDuty, Site24x7, and Sumo Logic
- Experience with configuration management (Chef, Ansible)
- Experience administering infrastructure for traditional monolithic web-based business applications
- Highly proficient in Linux administration
- Experience with coding / scripting languages (Bash, Python, Ruby, Powershell)
- Excellent communication, troubleshooting, and analytical skills
- Kubernetes experiences a plus