What if you could build a career where ambition meets innovation?
At LPL’s Global Capability Center, you'll find a collaborative culture where your voice matters, integrity guides every decision, and technology fuels progress. Your skills, talents, and ideas will redefine what's possible. LPL's success reflects its exceptional employees, who together pursue one noble purpose: empowering financial advisors to deliver personalized advice for all who need it. We’re proud to be expanding and reaching new heights in Hyderabad.
Join us as we create something extraordinary together.
We are seeking a dynamic, motivated and experienced Recovery Manager for our Production Services. This role is an individual contributor. In this role, you will lead recovery for critical incidents after major disruptions and help drive and oversee strategies for effective recovery management. This role drives Post-Mortem reviews, Incident Recovery for major and critical incidents, structuring SOPs and runbooks, and implementing proactive measures to prevent future failures, by adopting SRE principles.
This is an exciting opportunity to drive meaningful change and enhancing the advisor and investor experience. If you are passionate about production operations, stability, SRE and Observability and have a track record of success, we invite you to apply and be part of our journey toward greater resilience and efficiency.
Key Responsibilities
- Major Incident Support: Drive cross-functional teams to resolve critical incidents and attend post-mortem/post-incident reviews.
- Root Cause Analysis (RCA): Investigate underlying causes of major incidents, utilizing techniques like 5-Why, Fishbone, Blameless RCA and other techniques
- Recovery Strategy & Planning: Develop and test incident recovery plans, establish SOPs, knowledge base and mock drills
- Self Sufficiency: Develop playbooks by coordinating with domain owners and ensure more self-sufficiency and diagnosis accuracy
- Process Improvement: Identify opportunities to improve IT service reliability and reduce operational risks related to people, process and technology
- Feedback Loop: Provide continuous feedback to Observability, Automation, Resiliency and Domain teams on improving observability posture, automation, single points of failures, architectural and design gaps
- Training and Development: Mentor and develop other team members, providing training. Stay current with industry best practices and technologies, fostering a culture of continuous learning and professional growth.
Technical Competencies
- Progressive and proven experience and expertise in Production Services, Recovery and Problem Management, SRE, DevOps, or related fields with development background preferred
- Strong Understanding of foundational technology components across infra, cloud and app to be able to diagnose, ask right questions and effectively lead recovery of a critical incident
- Hands on experience with observability tools, logs and diagnostics to be able to troubleshoot and coach people
Core Competencies
- Excellent communication and interpersonal skills, with a focus on collaboration and relationship-building.
- Able to communicate effectively with CXOs and convey complex technical details into business terms
- Ability to influence and drive change across the organization.
- Analytical mindset with the ability to translate data into actionable insights.
- Experience in analyzing incident trends and implementing process improvements to enhance operational efficiency.
Goals and Objectives
- Achieve a 30% reduction in MTTD and MTTR within the first year of operation
- Able to identify offending service and root cause for at least 70% of incidents within 15-20 mins through effective triaging and engage right stakeholders to achieve MTTR targets
- Foster a culture of continuous improvement, with regular training sessions, mock drills and knowledge-sharing initiatives that empower team members.
- Develop and maintain strong relationships with key stakeholders, to ensure alignment and support for the team’s objectives, enhancing overall organizational resilience.
LPL Global Business Services, LLP - PRIVACY POLICY