DevOps Engineer (Hybrid 3x Onsite in Plano, TX | AWS | Terraform)
Optomi, in partnership with a leading client of ours in the cloud technologies space, is looking for a DevOps Engineer to join our dynamic and fast-paced team. The ideal candidate will have extensive experience in managing large-scale microservice based systems, ensuring high availability, and implementing best practices in reliability engineering. You will work closely with development and operations teams to enhance our infrastructure and improve system performance while being mindful of cost-effectiveness.
What the right candidate will enjoy:
- Annual Patent-a-thons and Hackathons to embrace your superior creative thinking ability!
- Free catered lunches when working onsite!
- Two fully stocked Nespresso bars for your coffee and tea fixes!
- No dress code! Come as you are!
- Free covered garage parking to shield your vehicle from those sneaky storms and the Texas sun!
- Being part of a team that values forward thought and innovation!
- The flexibility of a hybrid work schedule!
Responsibilities:
- Lead initiatives including defining project objectives, developing project schedules, preparing cost estimates, allocating/forecasting resources, and ensuring timely delivery under direction of Manager/Product Manager.
- Independently write and maintain complex automation scripts, refining for readability, maintainability, and efficiency.
- Perform advanced troubleshooting in scripts with multiple integration points.
- Lead or participate in code reviews to improve overall script quality starting to see constructive feedback.
- Write clean, reusable, and high-quality IaC to solve novel or complex problems and lead advanced IaC reviews, ensuring adherence to best practices and providing guidance during the merge review process.
- Design patterns, practices, and platforms that enable developers to integrate and deliver software components efficiently.
- Build and improve the CI/CD process, ensuring a fully functional software ecosystem.
- Independently manage virtual servers, containers, and orchestration platforms.
- Identify and develop local and cross team / org automation capabilities for products that impact key business SLAs and KPIs.
- Implement security measures to protect data integrity and confidentiality across pipelines.
- Lead the advanced troubleshooting and resolution of complex or novel system issues, including analyzing logs, performance data, and system behavior.
- Define and enforce consistent branching strategies to facilitate parallel development, code isolation, and seamless integration across team(s).
- Regularly review and analyze system performance metrics, proactively identifying and resolving bottlenecks, and implementing solutions to improve speed and responsiveness.
- Handle multi region or multi cloud network management, including zero trust networks and virtual private networks.
- Assist in canary deployments.
- Perform chaos engineering with networking to test resilience.
- Develop and implement strategies for network redundancy and disaster recovery.
- Ensure that network infrastructure can withstand and recover from failures/disasters with minimal impact.
- Lead and facilitate technical discussions, providing insights and recommendations based on DevOps principles.
- Coach more junior DevOps Engineers through targeted feedback and guidance to improve technical skills, overcome challenges, and achieve specific goals.
Required Qualifications:
- 3+ years of production experience working as a DevOps Engineer or Software Engineer
- DevOps expertise to efficiently provide valuable insights and solve challenging problems
- Expertise in Terraform to create, modify, and manage infrastructure configurations or IaC templates
- Expertise in containerization technologies (Docker & Kubernetes) to build, package, and deploy optimized container images
- Proficiency in designing, implementing, and maintaining complex CI/CD pipelines that span with increasing complexity and integrating across multiple environments
- Knowledge in cloud platforms (AWS) to optimize cloud resource utilization and costs throughout product lifecycle
- Experience in cloud architecture and design patterns to create scalable, resilient, and cost-effective cloud architectures and apply best practices for cloud deployments
- Expertise with a scripting language to develop custom tools and scripts for DevOps workflows or integrate with APIs
- Expertise in version control systems to perform branching, merging, and resolving merge conflicts
- Versed within InfoSec policies and procedures to adhere to security standards/regulations and identify gaps in security architecture
- Experience in monitoring and analytics platforms to set up monitors, alerts, and diagnostic tools for proactive issue detection, root cause analysis, and performance optimization across distributed systems
- Understanding of cloud billing and cost management tools to recognize total costs
- Ability to learn and apply new technologies, programming practices, patterns, and methods
- Demonstrates advanced problem-solving, troubleshooting, decision making skills
- Organized and detail-oriented
- Ability to develop healthy working relationships and collaborate with peers and leaders
- Exhibits integrity and high standards in work quality
- Excellent verbal and written communication skills
- Values diversity and differences amongst individuals in interactions